tinyinfer

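Introduction

An inference framework that aims for the most minimal technical implementation. It is still under development.
Repository 1: https://github.com/lyffly/tinyinfer
Repository 2: https://gitee.com/yunfeiliu/tinyinfer
The long-term plan is outlined below. The project is still at an early stage, and I only have time to update it on evenings and weekends.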

Build

CUDA version: 12.4, cuDNN version: 9.1

git clone https://github.com/lyffly/tinyinfer
# or https://gitee.com/yunfeiliu/tinyinfer
cd tinyinfer
git submodule update --init --recursive

# build wheel
python3 setup.py bdist_wheel
# install
pip install dist/*.whl

Testing with ResNet18

python3 test_resnet18_fp32.py
python3 test_resnet18_fp16.py
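
The script names suggest that the same ResNet18 model is run once in FP32 and once in FP16. What the scripts actually check is not shown here, so the following is only a minimal sketch of one common sanity check for such a pair: compare the two outputs under a tolerance and confirm the top-1 class agrees. The logits are made-up values, the FP16 run is emulated by rounding through the standard library's half-precision format, and the helper names (`to_fp16`, `max_abs_diff`, `top1`) are hypothetical and not part of tinyinfer.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two sequences."""
    return max(abs(x - y) for x, y in zip(a, b))


def top1(logits):
    """Index of the largest logit (the predicted class)."""
    return max(range(len(logits)), key=lambda i: logits[i])


# Made-up logits standing in for an FP32 output (assumption, not real data).
fp32_logits = [0.1234567, 2.7182818, -1.4142135, 0.5772156]
# Emulate the precision loss of an FP16 run by rounding every value.
fp16_logits = [to_fp16(v) for v in fp32_logits]

diff = max_abs_diff(fp32_logits, fp16_logits)
same_class = top1(fp32_logits) == top1(fp16_logits)
print(f"max abs diff: {diff:.6f}, same top-1: {same_class}")
```

In a real comparison the rounding error accumulates across layers, so the tolerance would likely need to be looser than the per-element rounding error shown here.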

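Vision

The goal is not raw speed or cutting-edge techniques, but accumulating technical know-how and bringing together everything an inference engine needs.
The CUDA kernels are implemented in their most naive form for now and will be optimized later.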
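
To illustrate what a "naive first" kernel can look like, here is a small pure-Python reference for matrix multiplication. It is a sketch, not code from this repository: the function name `naive_matmul` and the flat row-major layout are assumptions. The two outer loops play the role that a thread's row and column indices would play in a simple CUDA kernel where each thread computes one output element, with no tiling or shared memory.

```python
def naive_matmul(a, b, m, n, k):
    """Compute C = A @ B for row-major flat lists.

    A has shape (m, k), B has shape (k, n), and the result has shape (m, n).
    """
    c = [0.0] * (m * n)
    for row in range(m):      # would map to a thread's y index in a naive kernel
        for col in range(n):  # would map to a thread's x index in a naive kernel
            acc = 0.0
            for i in range(k):
                acc += a[row * k + i] * b[i * n + col]
            c[row * n + col] = acc
    return c


# A is 2x3 and B is 3x2, both stored as flat row-major lists.
a = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
b = [7.0, 8.0, 9.0, 10.0, 11.0, 12.0]
result = naive_matmul(a, b, m=2, n=2, k=3)
print(result)
```

A slow but obviously correct reference like this is also handy later on, since the output of an optimized kernel can be checked against it.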

Repositories referenced, read, or likely to be used in the future

1. https://github.com/OpenPPL/ppl.kernel.cuda
2. https://github.com/OpenPPL/ppl.llm.kernel.cuda
3. https://github.com/alibaba/MNN
4. https://github.com/Tencent/TNN
5. https://github.com/ggerganov/llama.cpp
6. https://github.com/karpathy/llm.c
7. https://github.com/triton-lang/triton
8. https://github.com/NVIDIA/cutlass
9. https://github.com/NVIDIA/TensorRT
10. https://github.com/OpenPPL/ppq
11. https://github.com/vllm-project/vllm

