MADTP++ is a novel approach that integrates tailored token and weight pruning processes into a unified framework, achieving superior compression in both parameter counts and computational costs
-
Notifications
You must be signed in to change notification settings - Fork 0
MADTP++ is a novel approach that integrates tailored token and weight pruning processes into a unified framework, achieving superior compression in both parameter counts and computational costs
double125/MADTP-plus
About
MADTP++ is a novel approach that integrates tailored token and weight pruning processes into a unified framework, achieving superior compression in both parameter counts and computational costs
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published