Skip to content

Commit

Permalink
Fix case for homomers resulting in errors (#183)
Browse files Browse the repository at this point in the history
  • Loading branch information
wukevin authored Nov 27, 2024
1 parent 13c7b00 commit a6fd246
Showing 1 changed file with 4 additions and 2 deletions.
6 changes: 4 additions & 2 deletions chai_lab/data/dataset/msas/colabfold.py
Original file line number Diff line number Diff line change
Expand Up @@ -408,5 +408,7 @@ def generate_colabfold_msas(protein_seqs: list[str], msa_dir: Path):
),
)
msa_path = msa_dir / expected_basename(protein_seq)
assert not msa_path.exists()
aligned_df.to_parquet(msa_path)
if not msa_path.exists():
# If we have a homomer, we might see the same chain multiple
# times. The MSAs should be identical for each.
aligned_df.to_parquet(msa_path)

0 comments on commit a6fd246

Please sign in to comment.