htqin
🦊
Focusing

Pinned

  1. Efficient-ML/Awesome-Model-Quantization Public

    A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. Welcome to PR the works (p…

    1.9k 209

  2. Macaronlin/LLaMA3-Quantization Public

    A repository dedicated to evaluating the performance of quantized LLaMA3 using various quantization methods.

    Python 172 8

  3. Efficient-ML/Awesome-Efficient-LLM-Diffusion Public

    A list of papers, docs, and code about efficient AIGC. This repo aims to provide information for efficient AIGC research, covering both language and vision, and we are continuously improving the project. Wel…

    164 11

  4. BiBERT Public

    This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.

    Python 84 7

  5. IR-QLoRA Public

    [ICML 2024 Oral] This project is the official implementation of our paper Accurate LoRA-Finetuning Quantization of LLMs via Information Retention.

    Python 60 5

  6. BiBench Public

    [ICML 2023] This project is the official implementation of our paper BiBench: Benchmarking and Analyzing Network Binarization.

    Python 54 4