@hidara2000 commented Apr 15, 2025

- Fixed an error where newer versions of the `transforms` package were missing definitions (should work with both old and new versions)
- Added `supervision` for varied annotations and tracking
- Added optimised ONNX inference with the relevant operations moved to the GPU (CuPy); IO binding improves performance and a `GPUMemoryPool` better manages GPU memory
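The supervision-based annotation and tracking flow might look roughly like the sketch below. This is an illustration only: it assumes the `supervision` package is installed, and all function and variable names here are hypothetical, not the PR's actual code. The import is deferred inside the function so the file still loads without the dependency.

```python
def annotate_and_track(frame, xyxy, confidence, class_id):
    """Illustrative sketch: wrap raw boxes in sv.Detections, track, annotate.

    `frame` is a BGR numpy image; `xyxy`, `confidence`, `class_id` are the
    raw detector outputs. Names are assumptions, not the PR's real API.
    """
    import numpy as np
    import supervision as sv  # deferred: requires the supervision package

    detections = sv.Detections(
        xyxy=np.asarray(xyxy, dtype=float),
        confidence=np.asarray(confidence, dtype=float),
        class_id=np.asarray(class_id, dtype=int),
    )
    # ByteTrack assigns a stable tracker_id to each object across frames.
    tracker = sv.ByteTrack()
    detections = tracker.update_with_detections(detections)
    # Draw the (tracked) boxes onto a copy of the frame.
    annotated = sv.BoxAnnotator().annotate(scene=frame.copy(),
                                           detections=detections)
    return annotated, detections
```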

# IO Binding Benefits for Multiprocessing

- Reduces contention for CPU-GPU data-transfer pathways when multiple processes share GPU resources
- Enables more efficient process-per-GPU distribution by minimizing transfer overhead
- Improves scalability across multiple GPUs by optimizing each process's GPU communication
- Supports pipeline parallelism by keeping intermediate data on the GPU between processing stages
- Allows better load balancing across processes by reducing data-movement bottlenecks
- Enables higher GPU utilization when distributing work across multiple processes
- Minimizes inter-process communication (IPC) overhead for inference workloads
- Helps maintain consistent performance when scaling to multiple workers
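As a rough sketch of what IO binding with CuPy buffers looks like in ONNX Runtime: the input and output tensors are bound directly to device memory, so `run_with_iobinding` never stages data through host RAM. This assumes `onnxruntime-gpu` and `cupy` with a CUDA device; the tensor names and shapes are illustrative, not the PR's actual ones, and the imports are deferred so the file loads without a GPU.

```python
def run_with_io_binding(session, input_name, output_name, frames_gpu, out_shape):
    """Run one batch via ONNX Runtime IO binding, keeping all data on the GPU.

    `session` is an onnxruntime.InferenceSession created with the
    CUDAExecutionProvider; `frames_gpu` is a preprocessed float32 CuPy array.
    All names/shapes here are assumptions for illustration.
    """
    import cupy as cp  # deferred: requires a CUDA device

    out_gpu = cp.empty(out_shape, dtype=cp.float32)  # preallocated output
    binding = session.io_binding()
    # Bind input and output to raw device pointers so onnxruntime reads and
    # writes GPU memory directly instead of copying through host RAM.
    binding.bind_input(
        name=input_name, device_type="cuda", device_id=0,
        element_type=cp.float32, shape=frames_gpu.shape,
        buffer_ptr=frames_gpu.data.ptr,
    )
    binding.bind_output(
        name=output_name, device_type="cuda", device_id=0,
        element_type=cp.float32, shape=out_shape,
        buffer_ptr=out_gpu.data.ptr,
    )
    session.run_with_iobinding(binding)
    return out_gpu  # still on the GPU; postprocess with CuPy
```

Because the result stays on the device, downstream CuPy postprocessing (NMS, scaling) can run without any intermediate host transfer, which is what makes the process-per-GPU distribution in the list above cheap.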

I couldn't test on a multi-GPU setup.

Results on the same 30s video, testing one video at a time:

| Script | Time | Method |
|---|---|---|
| `torch_inf.py` (original) | 13.20s | cv2 |
| `torch_inf_super.py` | 5.76s | supervision + batch |
| `onnx_inf_super.py` | 5.36s | supervision + batch |
| `onnx_inf_super_io.py` | 4.27s | supervision + batch + io_binding + GPU manager |
| `onnx_inf_super_io.py` | 1.77s each, `mp.Pool(8)` | supervision + batch + io_binding + GPU manager |

NOTE: `onnx_inf_super_io.py` was created because the script was being used in a multiprocessing environment. Originally this caused GPU memory to be overwritten; the memory management mitigated that and sped up inference. The last row above was measured on 24 30s videos processed in a pool of 8 workers.
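The core idea behind the memory management can be sketched as a per-process buffer pool that reuses allocations keyed by size, so each worker recycles its own buffers instead of racing other processes for fresh ones. This is a minimal illustration, not the PR's `GPUMemoryPool`: in the real script the allocator would be a CuPy/CUDA allocation, while here it defaults to `bytearray` so the idea is runnable without a GPU.

```python
class GPUMemoryPool:
    """Minimal sketch of a reusing buffer pool (name and API hypothetical).

    Each worker process creates its own pool, so buffers are never shared
    (or overwritten) across processes. Released buffers are kept in
    size-keyed free lists and handed back on the next acquire of that size,
    avoiding repeated allocation during per-frame inference.
    """

    def __init__(self, allocator=bytearray):
        # In the real script `allocator` would be a CuPy/CUDA allocation.
        self._allocator = allocator
        self._free = {}  # size -> list of released buffers

    def acquire(self, nbytes):
        # Reuse a released buffer of the same size instead of reallocating.
        bucket = self._free.get(nbytes)
        if bucket:
            return bucket.pop()
        return self._allocator(nbytes)

    def release(self, buf):
        self._free.setdefault(len(buf), []).append(buf)


# Usage sketch: one pool per worker process.
pool = GPUMemoryPool()
a = pool.acquire(1024)
pool.release(a)
b = pool.acquire(1024)  # the released buffer is handed back, no new alloc
assert a is b
```

Constructing the pool (and the ONNX Runtime session) inside each worker, e.g. via `mp.Pool`'s `initializer`, is what keeps workers from stepping on each other's device memory.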
