MangaNinja: Line Art Colorization with Precise
Reference Following

Zhiheng Liu1,3* Ka Leong Cheng2,4* Xi Chen1,3 Jie Xiao3 Hao Ouyang4 Kai Zhu3 Yu Liu3
Yujun Shen4 Qifeng Chen2 Ping Luo1

1HKU   2HKUST   3Tongyi Lab   4Ant Group

MangaNinja is a reference-based line art colorization method that enables precise matching and fine-grained interactive control.

[Paper]     [Code]     [Youtube]     [BibTeX]

Abstract

Derived from diffusion models, MangaNinja specializes in the task of reference-guided line art colorization. We incorporate two thoughtful designs to ensure precise character detail transcription, including a patch shuffling module to facilitate correspondence learning between the reference color image and the target line art, and a point-driven control scheme to enable fine-grained color matching. Experiments on a self-collected benchmark demonstrate the superiority of our model over current solutions in terms of precise colorization. We further showcase the potential of the proposed interactive point control in handling challenging cases (e.g., extreme poses and shadows), cross-character colorization, multi-reference harmonization, etc., beyond the reach of existing algorithms.

Approach

We randomly select two frames from video data, using one frame as a reference image and extracting the line art from the other. Both frames are input into the Reference U-Net and the Denoising U-Net, respectively. To enhance the model's automatic matching and fine-grained control capabilities, we propose a series of training strategies, including progressive patch shuffling. Additionally, we employ an off-the-shelf model to extract matching points from the two frames, and these point maps are fed into the main branch through PointNet.

Results

We compare our method with the state-of-the-art non-generative colorization method BasicPBC, the consistency generation method IP-Adapter, and AnyDoor. The results demonstrate that our method significantly outperforms them in terms of colorization accuracy and generated image quality. Notably, our method does not use points for guidance in the generated results.

Challenging Cases

Visualization of varying poses or missing details. With point guidance, MangaNinja can tackle many challenging cases, especially when there are significant variations between the reference image and line art. When dealing with multiple objects, point guidance can also effectively prevent color confusion.

Visualization of multi-reference colorization. MangaNinja enables users to select specific areas from multiple reference images through points, providing guidance for all elements in the line art. Additionally, it effectively resolves conflicts between similar visual elements across the reference images.

Visualization of colorization with discrepant reference. MangaNinja uses points as guidance to achieve semantic color matching with fine control. We believe this interactive colorization with discrepant references can effectively assist users in their colorization attempts and inspire new ideas.

BibTex

@article{liu2024manganinja,
  author  = {Zhiheng Liu and Ka Leong Cheng and Xi Chen and Jie Xiao and Hao Ouyang and Kai Zhu and Yu Liu and Yujun Shen
             and Qifeng Chen and Ping Luo},
  title   = {MangaNinja: Line Art Colorization with Precise Reference Following},
  journal = {CoRR},
  volume  = {abs/xxxx.xxxxx},
  year    = {2024}
}