ColormnetV2 Project
#14
Hi NASS,

  Good news! I extended ColorMNet with Xmem2.
  I named the project CMNET2, you can find it at the following link: https://github.com/dan64/cmnet2 
  The key features implemented are:
  • Reference-based colorization
  • Permanent memory (XMem++ style)
  • Preloading API
  • Sliding window memory management
  • Adaptive VRAM management
  • DINOv2 + ResNet50 fusion backbone

I also tried to add DinoV3, but a full implementation requires a complete retraining, which takes a lot of time to run and implement. Since my time for this project is limited, I decided to skip this extension (my attempt to train only the last 7m nodes was unsuccessful).

The pipeline in which this model will be used works as follows: a certain number of reference images are extracted from a B&W video and colored with Qwen-Image-Edit, then passed to CMNET2, which colors the whole video. In this context there are two main problems: 1) some frames may have no reference image; in that case the colors produced by the model are faded, and people's faces appear gray; 2) the same object may appear in multiple reference frames with different colors; in that case the model often produces an intermediate color between the two.
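
To make problem 1 concrete, here is a minimal sketch of a pre-check that lists the frames with no reference image nearby. It is not part of CMNET2: the function name `uncovered_frames` and the `max_gap` threshold are assumptions made up for illustration, and the right threshold would depend on the clip.

```python
from bisect import bisect_left

def uncovered_frames(num_frames, ref_frames, max_gap):
    """Return frame indices whose nearest reference frame is more than
    max_gap frames away (hypothetical helper, not a CMNET2 API)."""
    refs = sorted(ref_frames)
    missing = []
    for f in range(num_frames):
        i = bisect_left(refs, f)
        dists = []
        if i < len(refs):          # closest reference at or after f
            dists.append(refs[i] - f)
        if i > 0:                  # closest reference before f
            dists.append(f - refs[i - 1])
        if not dists or min(dists) > max_gap:
            missing.append(f)
    return missing

# References only on the first and last frame of a 10-frame clip.
print(uncovered_frames(10, [0, 9], max_gap=2))
```

Frames reported by such a check are candidates for an extra reference image before running the colorization.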

DinoV3 doesn't solve either of the two problems:
Problem 1 (frame without reference → faded colors): this is a temporal memory coverage problem, not a feature quality problem. DinoV3 extracts better features, but if there's no reference close in time, the result will still be faded.
Problem 2 (same object with different colors between references): this is a semantic inconsistency problem between references, caused by Qwen. DinoV3 doesn't know that two references show the same object with different colors, so it would calculate the same average as DinoV2.
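
The averaging in problem 2 can be illustrated with a toy example. This is a simplified sketch, not the CMNET2 code: it assumes a softmax-weighted memory readout, and `readout_color`, the feature vectors and the RGB values are all made up for illustration.

```python
import math

def readout_color(query, memory_keys, memory_colors, temperature=1.0):
    """Toy memory readout: softmax over dot-product similarities,
    then a weighted average of the stored colors (illustrative only)."""
    sims = [sum(q * k for q, k in zip(query, key)) / temperature
            for key in memory_keys]
    m = max(sims)                              # subtract max for stability
    weights = [math.exp(s - m) for s in sims]
    total = sum(weights)
    weights = [w / total for w in weights]
    return tuple(sum(w * c[ch] for w, c in zip(weights, memory_colors))
                 for ch in range(3))

# The same object (identical features) stored twice with different colors.
keys = [(1.0, 0.0), (1.0, 0.0)]
colors = [(255, 0, 0), (0, 0, 255)]            # red in one reference, blue in the other
print(readout_color((1.0, 0.0), keys, colors))
```

Because both keys match the query equally well, the two colors get equal weight and the readout lands halfway between them. Better features do not change this: identical objects still produce identical similarities.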

Instead, I'm working on including SAM3 in the pipeline. I hope I can further improve the coloring process this way, we'll see...

Dan