12.02.2024, 22:11
Hello Selur,
I solved the problem by building "dlib" with CUDA enabled.
I attached the new dlib wheel, feel free to try it.
With this version the encoding speed increased from 0.05fps to 2.5fps -> 50x faster!
I encoded the dlib by setting the CUDA capabilities to 8.0 (good for RTX 30 and above).
I suspect that the few dlib versions with CUDA support available, were compiled with CUDA capabilities equal to 5.0 (because this is the default in cmake).
For compatibility reason I compiled "dlib" against CUDA SDK v11.4 (good for RTX 30 and above).
In the case the pip installer refuse to install the wheel because is not compatible.
You have to rename the weel extension from ".whl" to ".zip" so that you can edit the file __init__.py in dlib folder.
Change the following code:
to match your SDK installation. I not tried it, I hope that it will work
Dan
I solved the problem by building "dlib" with CUDA enabled.
I attached the new dlib wheel, feel free to try it.
With this version the encoding speed increased from 0.05fps to 2.5fps -> 50x faster!
I encoded the dlib by setting the CUDA capabilities to 8.0 (good for RTX 30 and above).
I suspect that the few dlib versions with CUDA support available, were compiled with CUDA capabilities equal to 5.0 (because this is the default in cmake).
For compatibility reason I compiled "dlib" against CUDA SDK v11.4 (good for RTX 30 and above).
In the case the pip installer refuse to install the wheel because is not compatible.
You have to rename the weel extension from ".whl" to ".zip" so that you can edit the file __init__.py in dlib folder.
Change the following code:
if 'ON' == 'ON':
add_lib_to_dll_path('C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.4/lib/x64/cudnn.lib')
add_lib_to_dll_path('C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.4/lib/x64/cudart.lib')
Dan