20.08.2025, 00:55
In my case it never worked, I just get a black screen after the .engine finishes compiling (3090).
I'm using 2025.07.27.1 and it didn't work with the May version either (same behavior).
The mlrt version works, but for some reason is way slower. Plus I don't need tiling there cause FP16 or BF16 are enough to save memory.
Question: is there a way to use the vsSCUnet models quantized?
I tried converting them to bf16 and fp16 with a python script, they load up but the memory usage remains the same, I think they are expanded back to fp32.
It would be nice to have parity between the vs and mlrt in terms of options to see which one is faster.
I'm using 2025.07.27.1 and it didn't work with the May version either (same behavior).
The mlrt version works, but for some reason is way slower. Plus I don't need tiling there cause FP16 or BF16 are enough to save memory.
Question: is there a way to use the vsSCUnet models quantized?
I tried converting them to bf16 and fp16 with a python script, they load up but the memory usage remains the same, I think they are expanded back to fp32.
It would be nice to have parity between the vs and mlrt in terms of options to see which one is faster.