Hugging Face Transformers is vulnerable to ReDoS through its MarianTokenizer
Moderate severity
GitHub Reviewed
Published
Sep 12, 2025
to the GitHub Advisory Database
•
Updated Sep 15, 2025
Description
Published by the National Vulnerability Database
Sep 12, 2025
Published to the GitHub Advisory Database
Sep 12, 2025
Reviewed
Sep 15, 2025
Last updated
Sep 15, 2025
A Regular Expression Denial of Service (ReDoS) vulnerability was discovered in the Hugging Face Transformers library, specifically affecting the MarianTokenizer's
remove_language_code()
method. This vulnerability is present in version 4.52.4 and has been fixed in version 4.53.0. The issue arises from inefficient regex processing, which can be exploited by crafted input strings containing malformed language code patterns, leading to excessive CPU consumption and potential denial of service.References