This forum uses cookies
This forum makes use of cookies to store your login information if you are registered, and your last visit if you are not. Cookies are small text documents stored on your computer; the cookies set by this forum can only be used on this website and pose no security risk. Cookies on this forum also track the specific topics you have read and when you last read them. Please confirm whether you accept or reject these cookies being set.

A cookie will be stored in your browser regardless of choice to prevent you being asked this question again. You will be able to change your cookie settings at any time using the link in the footer.

ColormnetV2 Project
#1
Hello Dan & Selur ,

I am working on a custom video colorization pipeline heavily inspired by ColorMNet, but I completely overhauled the core architecture to make it state-of-the-art:

1. Backbone Upgrade: Replaced DINOv2 with DINOv3 for denser and richer semantic feature extraction.
2. Memory Upgrade: Upgraded the tracking engine to the XMem++ architecture (incorporating Permanent Memory).

The Progress:
I successfully trained the model from scratch up to 145,000 iterations (DAVIS AND REDS AND 16MM FILM)
The temporal stability and object tracking are mind-blowing. If I provide a reference frame with a red car, the car stays perfectly red throughout the whole video, even through severe occlusions.

The Problem:
While the tracking is perfect, I am experiencing a spatial issue: Color Bleeding / Spilling ( specifically spilling over the ground/road and the sky )



Call for Collaboration:
I am reaching out to see if we can team up to stabilize this model. Once we fix this spatial bleeding, I truly believe this will be the ultimate upgrade to ColorMNet.

To get things started, I have attached all the files to this post:

    The complete training and inference source code.

    The test scripts.

    The trained model weights (at 145k iterations).

    The visual results along with the reference images.

Let's build something great together. Any advice or pull requests are welcome!

Best

NASS

Script and model: https://drive.google.com/file/d/1JV7V2pp...sp=sharing

Resultat: https://drive.google.com/file/d/1aKtCB5Q...sp=sharing

For Test: python nass.py --input 0000.mp4 --ref_path REF --model saves/color_v3_3090_145000.pth
Reply


Messages In This Thread
ColormnetV2 Project - by NASS - 10.04.2026, 00:27
RE: Deoldify Vapoursynth filter - by Dan64 - 10.04.2026, 09:51
RE: ColormnetV2 Project - by Selur - 10.04.2026, 10:32
RE: ColormnetV2 Project - by NASS - 10.04.2026, 12:06
RE: ColormnetV2 Project - by Dan64 - 10.04.2026, 16:58
RE: ColormnetV2 Project - by NASS - 10.04.2026, 18:48
RE: ColormnetV2 Project - by Dan64 - 11.04.2026, 10:15
RE: ColormnetV2 Project - by Dan64 - 11.04.2026, 12:14
RE: ColormnetV2 Project - by NASS - 11.04.2026, 12:39
RE: ColormnetV2 Project - by Dan64 - 11.04.2026, 16:09
RE: ColormnetV2 Project - by NASS - 11.04.2026, 16:40
RE: ColormnetV2 Project - by Dan64 - 11.04.2026, 17:31
RE: ColormnetV2 Project - by NASS - 11.04.2026, 18:44
RE: ColormnetV2 Project - by Dan64 - 15.04.2026, 11:03
RE: ColormnetV2 Project - by NASS - 16.04.2026, 19:34
RE: ColormnetV2 Project - by Dan64 - 16.04.2026, 21:52
RE: ColormnetV2 Project - by NASS - 16.04.2026, 22:21
RE: ColormnetV2 Project - by Dan64 - 19.04.2026, 15:16

Forum Jump:


Users browsing this thread: 1 Guest(s)