Load ESM2 in bf16 if supported #235

brianloyal · 2024-12-09T23:09:43Z

Description

Updated ESM2 model load to use bfloat16 (if supported).

Motivation

Slightly improved VRAM usage for greater support on A10 gpus

Test plan

Run on test data using g5.2xlarge instances on AWS

arogozhnikov · 2024-12-09T23:35:27Z

we offload ESM, so this unlikely to bring any memory benefits
if model works well in fp16/bf16 (this needs checking by looking at difference in predictions vs fp32 version), we'd better just use one of them, not both.

arogozhnikov · 2024-12-09T23:35:53Z

chai_lab/data/dataset/embeddings/esm.py

+                model_name,
+                cache_dir=esm_cache_folder,
+                torch_dtype=(
+                    torch.float16 if is_torch_bf16_gpu_available() else torch.bfloat16


you want opposite

🤦 sigh - sorry about that

Load ESM2 in bf16 if supported

b6530a6

wukevin requested a review from arogozhnikov December 9, 2024 23:31

arogozhnikov reviewed Dec 9, 2024

View reviewed changes

typo

508a037

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load ESM2 in bf16 if supported #235

Load ESM2 in bf16 if supported #235

brianloyal commented Dec 9, 2024

arogozhnikov commented Dec 9, 2024

arogozhnikov Dec 9, 2024 •

edited

Loading

brianloyal Dec 10, 2024

Load ESM2 in bf16 if supported #235

Are you sure you want to change the base?

Load ESM2 in bf16 if supported #235

Conversation

brianloyal commented Dec 9, 2024

Description

Motivation

Test plan

arogozhnikov commented Dec 9, 2024

arogozhnikov Dec 9, 2024 • edited Loading

Choose a reason for hiding this comment

brianloyal Dec 10, 2024

Choose a reason for hiding this comment

arogozhnikov Dec 9, 2024 •

edited

Loading