AbLinuga: Antibody Language Model with Linguistic Tokenization

Main models you should use

Model	Dataset	Description
AbLingua-300M	OAS-Unpaired-300M	A small language model that is easy to fine-tune
AbLingua-600M	OAS-Unpaired-300M	Medium-sized model that can support a variety of downstream tasks and performs well in antibody structure prediction

Install

As a prerequisite, you must have PyTorch and Transformers installed to use this repository. PyTorch and Transformers only require the latest version to be installed.

# Python >= 3.10
pip install torch
pip install transformers

Download language model weight

# weight should in weight floder
cd AbLingua/weight/
# download model json
wget -c 'https://huggingface.co/IDEA-XtalPi/AbLingua/resolve/main/config.json'
# download model weight
wget -c 'https://huggingface.co/IDEA-XtalPi/AbLingua/resolve/main/pytorch_model.bin'

Usage

Antibody sequence embedding

Representations from AbLinuga may be useful as features for deep learning models.

from AbLingua.embed import get_collator, get_model

collator = get_collator()
model = get_model()

seq = ['QVTLRESGPAL', 
       'VKPTQTLTLTC']
seq_input = collator(seq)

tokens_embedding = model(**seq_input).hidden_states[-1]

# tokens_embedding.shape
# [2, 256, 1280]

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
AbLingua		AbLingua
LICENSE		LICENSE
README.md		README.md
example.ipynb		example.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AbLinuga: Antibody Language Model with Linguistic Tokenization

Main models you should use

Install

Download language model weight

Usage

Antibody sequence embedding

Citation

About

Releases

Packages

Languages

License

IDEA-XL/AbLingua

Folders and files

Latest commit

History

Repository files navigation

AbLinuga: Antibody Language Model with Linguistic Tokenization

Main models you should use

Install

Download language model weight

Usage

Antibody sequence embedding

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages