SCUNet call itself looks fine to me:
The decoder call:
(assuming the source can be hardware decoded) could be tweaked.
Personally I would use DGDecNV, but alternatively just enabling hardware decoding for LWLibavSource might be a good idea.
(see under "Filtering->Vapoursynth->Misc->Source")
That said, that speed seems kind of normal for SCUNet.
I get around 3-4 fps with my Geforce RTX 4080 with those settings.
Disabling CUDAGraph has no impact on the encoding for me.
Depending on the VRAM you have tiling can help. (I got 16GB VRAM)
1 tile: 3.77 fps
2 tiles: 3.84 fps (so no real change, that much can be due to general system fluctuation on a small clip)
4 tiles: 5.21 fps
8 tiles: 3.02 fps (slower than 1)
At least on my system 4 tiles seems to be the sweet spot for HD and 16GB VRAM.
As a side note: on SD sources I usually use streams 3, with tiles 1.
I also tried 4 tiles + streams 3, which gave me 5.55 fps.
=> you might want to do some testing regarding tiles&streams on your setup too
Cu Selur
clip = vsmlrt.SCUNet(clip, model=3, backend=Backend.TRT_RTX(fp16=True,device_id=0,verbose=True,use_cuda_graph=True,num_streams=1,builder_optimization_level=3,engine_folder="D:/APPD/HybridTemp"))clip = core.lsmas.LWLibavSource(source="C:/Users/mmddffkk/Desktop/tmp/TestSrc.mkv", format="YUV420P8", stream_index=0, cache=0, prefer_hw=0)Personally I would use DGDecNV, but alternatively just enabling hardware decoding for LWLibavSource might be a good idea.
(see under "Filtering->Vapoursynth->Misc->Source")
That said, that speed seems kind of normal for SCUNet.
I get around 3-4 fps with my Geforce RTX 4080 with those settings.
Disabling CUDAGraph has no impact on the encoding for me.
Depending on the VRAM you have tiling can help. (I got 16GB VRAM)
1 tile: 3.77 fps
2 tiles: 3.84 fps (so no real change, that much can be due to general system fluctuation on a small clip)
4 tiles: 5.21 fps
8 tiles: 3.02 fps (slower than 1)
At least on my system 4 tiles seems to be the sweet spot for HD and 16GB VRAM.
As a side note: on SD sources I usually use streams 3, with tiles 1.
I also tried 4 tiles + streams 3, which gave me 5.55 fps.
=> you might want to do some testing regarding tiles&streams on your setup too

Cu Selur
----
Dev versions are in the 'experimental'-folder of my GoogleDrive, which is linked on the download page.
Dev versions are in the 'experimental'-folder of my GoogleDrive, which is linked on the download page.

