HPC-Coder-v2

The HPC-Coder-v2-6.7b model is an HPC code LLM fine-tuned on an instruction dataset catered to common HPC topics such as parallelism, optimization, accelerator porting, etc. This version is a fine-tuning of the Deepseek Coder 6.7b model. It is fine-tuned on the hpc-instruct, oss-instruct, and evol-instruct datasets. We utilized the distributed training library AxoNN to fine-tune in parallel across many GPUs.

HPC-Coder-v2-6.7b is the best performing LLM under 30b parameters on the ParEval parallel code generation benchmark in terms of correctness and performance. It scores similarly to 34B and commercial models like Phind-V2 and GPT-4 on parallel code generation.

Using HPC-Coder-v2

The model is provided as a standard huggingface model with safetensor weights. The weights are available on huggingface. It can be used with transformers pipelines, vllm, or any other standard model inference framework. HPC-Coder-v2 is an instruct model and prompts need to be formatted as instructions for best results. It was trained with the following instruct template:

Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{instruction}

### Response:

Quantized Models

4 and 8 bit quantized weights are available in the GGUF format for use with llama.cpp. The 4 bit model requires ~3.8 GB memory and can be found here. The 8 bit model requires ~7.1 GB memory and can be found here. Further information on how to use them with llama.cpp can be found in its documentation.

Evaluation

We evaluated the model on the ParEval benchmark for parallel code generation. It scores a pass@1 of 31.17 on parallel code generation tasks including OpenMP, MPI, MPI+OpenMP, CUDA, HIP, and Kokkos. This is the best performing open-source model on ParEval under 30B parameters. Furthermore, it performs similarly to the 34B parameter model Phind-V2-34B (pass@1 = 32.12) and GPT-4 (pass@1 = 37.75). Check out ParEval for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
data-generation		data-generation
fine-tuning		fine-tuning
v1		v1
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HPC-Coder-v2

Using HPC-Coder-v2

Quantized Models

Evaluation

About

Releases

Packages

Contributors 3

Languages

License

parallelcodefoundry/HPC-Coder

Folders and files

Latest commit

History

Repository files navigation

HPC-Coder-v2

Using HPC-Coder-v2

Quantized Models

Evaluation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages