Unusually high time for ex.extract() (lines 345-350 in inference.py), YOLOv8 #6

Open
manish33scss opened this issue May 3, 2023 · 3 comments

Comments

@manish33scss

    begin = perf_counter()
    ret1, mat_out1 = ex.extract(output_names[0])  # stride 8
    assert not ret1, f'extract {output_names[0]} with something wrong!'
    ret2, mat_out2 = ex.extract(output_names[1])  # stride 16
    assert not ret2, f'extract {output_names[1]} with something wrong!'
    ret3, mat_out3 = ex.extract(output_names[2])  # stride 32
    assert not ret3, f'extract {output_names[2]} with something wrong!'

    end = perf_counter()

These lines take around 400-500 ms per frame when running on a Jetson Nano.
What could be the problem?
Is there any workaround?
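
One thing worth checking, as far as I understand ncnn: the extractor evaluates lazily, so the layers needed for a blob run when extract() is called for it. The timed block above would then be measuring the forward pass itself rather than a copy of finished results. A minimal sketch for timing each call separately follows; `FakeExtractor` and its per-output costs are hypothetical stand-ins so the snippet runs without ncnn, and `time_extracts` is an illustrative helper, not part of the repo.

```python
from time import perf_counter, sleep


class FakeExtractor:
    """Hypothetical stand-in for an ncnn extractor (assumption: lazy evaluation)."""

    def __init__(self, costs):
        self._costs = costs      # simulated seconds of compute per output name
        self._done = set()       # outputs that have already been computed

    def extract(self, name):
        if name not in self._done:
            sleep(self._costs[name])   # simulate running the layers for this blob
            self._done.add(name)
        return 0, f"blob:{name}"       # same (ret, mat) shape as in the issue


def time_extracts(ex, output_names):
    """Return the seconds spent in each extract() call, keyed by output name."""
    timings = {}
    for name in output_names:
        begin = perf_counter()
        ret, _ = ex.extract(name)
        assert not ret, f"extract {name} failed"
        timings[name] = perf_counter() - begin
    return timings


if __name__ == "__main__":
    costs = {"out8": 0.05, "out16": 0.01, "out32": 0.01}  # made-up numbers
    ex = FakeExtractor(costs)
    for name, seconds in time_extracts(ex, list(costs)).items():
        print(f"{name}: {seconds * 1000:.1f} ms")
```

With the real extractor, pass `ex` and `output_names` from inference.py instead of the stub. If the first call dominates, the cost is the network itself, which points at model size, input resolution or backend rather than at extract().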

@triple-Mu
Owner

How about adding:

net.opt.use_vulkan_compute = True
net.opt.num_threads = 4
net.opt.use_bf16_storage = True

Or convert your ncnn model with int8 quantization.
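
A small sketch of how those options might be applied. As far as I know, ncnn reads `net.opt` when the model is loaded, so they should be set before `load_param` / `load_model`. `configure_net` is a hypothetical helper and the file names in the comment are placeholders; the stub object only exists so the snippet runs without ncnn installed.

```python
from types import SimpleNamespace


def configure_net(net, use_vulkan=True, num_threads=4, use_bf16=True):
    """Set the options suggested above on an object exposing `net.opt`."""
    net.opt.use_vulkan_compute = use_vulkan   # GPU path, needs a Vulkan-enabled ncnn build
    net.opt.num_threads = num_threads         # the Jetson Nano has 4 CPU cores
    net.opt.use_bf16_storage = use_bf16       # bf16 storage option from the comment above
    return net


# With ncnn installed this would look roughly like (placeholder file names):
#   import ncnn
#   net = configure_net(ncnn.Net())
#   net.load_param("model.param")
#   net.load_model("model.bin")

if __name__ == "__main__":
    stub = SimpleNamespace(opt=SimpleNamespace())   # stand-in for ncnn.Net()
    configure_net(stub)
    print(vars(stub.opt))
```

Keeping the settings in one function makes it easy to toggle a single option and re-measure, which helps tell whether Vulkan or bf16 has any effect on the Nano.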

@manish33scss
Author

Still the same results; even the C++ version gives only 1 fps on the Jetson Nano!

@triple-Mu
Owner

> still same results, even the c++ file gives 1 fps on jetson nano !

Have you tried int8 quantization?
Or you can try my yolov8-tensorrt.
