This repository has been archived by the owner on Jan 3, 2023. It is now read-only.
Hi, I have a question: I know that nGraph supports dynamic input shapes, but is there any performance improvement for such models (especially NLP models like BERT)?
I think of dynamic shapes as being like C++ vectors: there is allocated space, and something separate that indicates how much of that space you are actually using. You might cache particular compiled combinations of maximum input sizes.

If you are running a server, dynamic batch size can help with latency, since you only need to compute for as many samples as you have ready. You could also imagine kernels that use the actual sample lengths to reduce the computation in those transformer GEMMs (though some hardware skips zero arithmetic, and some hardware does a full tile of a GEMM in the same time as a partial tile, so it wouldn't matter there).

Whatever Intel did with nGraph along those lines would be in the OpenVINO repo.
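To make the "cache compiled combinations of max sizes" idea concrete, here is a minimal sketch in Python. Everything in it is hypothetical: `BUCKETS`, `bucket_for`, and `compiled_for` are illustrative names, and the string returned by `compiled_for` stands in for whatever shape-specialized executable a real compiler (nGraph, OpenVINO, or any other) would produce.

```python
# Hypothetical sketch: round each dynamic sequence length up to a fixed
# "bucket" size, compile once per bucket, and cache the result -- the
# capacity-vs-size idea from the C++ vector analogy above.

from functools import lru_cache

BUCKETS = [32, 64, 128, 256, 512]  # allowed maximum sequence lengths

def bucket_for(length):
    """Round a dynamic sequence length up to the nearest compiled bucket."""
    for b in BUCKETS:
        if length <= b:
            return b
    raise ValueError(f"sequence length {length} exceeds largest bucket")

@lru_cache(maxsize=None)
def compiled_for(max_len):
    # Placeholder for an expensive shape-specialized compilation step;
    # lru_cache ensures each bucket is compiled only once.
    return f"executable<max_len={max_len}>"

def run(tokens):
    """Pad the input to its bucket size and dispatch to the cached executable."""
    max_len = bucket_for(len(tokens))
    exe = compiled_for(max_len)            # cache hit after the first call
    padded = tokens + [0] * (max_len - len(tokens))
    return exe, padded
```

The trade-off is the usual one: fewer buckets mean fewer compilations but more wasted padding compute, while more buckets mean less padding but more compilation time and cached executables.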