Skip to content

CounterVectorizer does not support strip_accents=unicode #1144

@thang-le-klaviyo

Description

@thang-le-klaviyo

Hi!
For one of our models, we are using TfidVectorizer with one of our params being strip_accents='unicode'. However, when trying to convert our pipeline using convert_sklearn, we got the error that only strip_accents=None is supported. I was wondering if we can support strip_accents unicode. Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions