Is there any support for text preprocessing mainly for transformer models #86

sachinsharma9780 · 2021-07-15T21:18:53Z

Hi,

I have seen that dali already have prebuilt functions for image preprocessing like dali.fn.resize(images, resize_x=299, resize_y=299).
But does it provide any preprocessing functions for text as well like performing tokenization of text on the Triton server-side, typically for transformer models?

Thanks!

szalpal · 2021-07-21T07:13:31Z

Hello @sachinsharma9780 !

Unfortunately, at the moment DALI doesn't support text preprocessing. Is there any particular tokenizer you are referring to? As far as I know, transferring tokenization on the GPU wouldn't make much gain, since this algorithm is not easily parallelizeable.

szalpal added the question Further information is requested label Jul 21, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is there any support for text preprocessing mainly for transformer models #86

Is there any support for text preprocessing mainly for transformer models #86

sachinsharma9780 commented Jul 15, 2021

szalpal commented Jul 21, 2021

Is there any support for text preprocessing mainly for transformer models #86

Is there any support for text preprocessing mainly for transformer models #86

Comments

sachinsharma9780 commented Jul 15, 2021

szalpal commented Jul 21, 2021