Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is there any support for text preprocessing mainly for transformer models #86

Open
sachinsharma9780 opened this issue Jul 15, 2021 · 1 comment
Labels
question Further information is requested

Comments

@sachinsharma9780
Copy link

Hi,

I have seen that dali already have prebuilt functions for image preprocessing like dali.fn.resize(images, resize_x=299, resize_y=299).
But does it provide any preprocessing functions for text as well like performing tokenization of text on the Triton server-side, typically for transformer models?

Thanks!

@szalpal
Copy link
Member

szalpal commented Jul 21, 2021

Hello @sachinsharma9780 !

Unfortunately, at the moment DALI doesn't support text preprocessing. Is there any particular tokenizer you are referring to? As far as I know, transferring tokenization on the GPU wouldn't make much gain, since this algorithm is not easily parallelizeable.

@szalpal szalpal added the question Further information is requested label Jul 21, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Development

No branches or pull requests

2 participants