DiffBMP Differentiable Rendering with Bitmap Primitives

Seongmin Hong1,* Junghun James Kim2,* Daehyeop Kim3 Insoo Chung3 Se Young Chun1,2,3,†

1 INMC, 2 IPAI, 3 Dept. of ECE, Seoul National University, Republic of Korea

{smhongok, jonghean12, 2012abcd, insoo_chung, sychun}@snu.ac.kr

* Co-first authors    Corresponding author

Accepted to CVPR 2026

News

(03/18/2026) pip install pydiffbmp is now available! See the PyPI.
(02/22/2026) Seongmin Hong conducted a hands-on workshop at Seoul Science Center, where participants created their own artworks using DiffBMP.
Workshop Photo 1
Workshop Photo 2
Workshop Photo 3
Workshop Photo 4

Sample Gallery

Video

Primitives
Primitives
Primitives
ID_cards.png flags.png flowers.png foot_finger_hand_prints.png Person Names 1000 items
Primitives

HTML (GIF)

demo_gif_IPA.gif
Primitives
Circle
demo_gif_daehyeop.gif
Primitives
D A E H Y E O P
demo_gif_kyungryreol.gif
Primitives

Image

demo_image_clip.jpg
Primitives
Maxwell_equation.png maxwell_eq1.png maxwell_eq2.png maxwell_eq3.png maxwell_eq4.png Prompt : Galaxy
demo_image_beethoven.jpg
Primitives

Supplementary Video

Abstract

We introduce DiffBMP, a scalable and efficient differentiable rendering engine for a collection of bitmap images. Our work addresses a limitation that traditional differentiable renderers are constrained to vector graphics, given that most images in the world are bitmaps. Our core contribution is a highly parallelized rendering pipeline, featuring a custom CUDA implementation for calculating gradients. This system can, for example, optimize the position, rotation, scale, color, and opacity of thousands of bitmap primitives all in under 1 minute using a consumer GPU. We employ and validate several techniques to facilitate the optimization: soft rasterization via Gaussian blur, structure-aware initialization, noisy canvas, and specialized losses/heuristics for videos or spatially constrained images. We demonstrate DiffBMP is not just an isolated tool, but a practical one designed to integrate into creative workflows. It supports exporting compositions to a native, layered file format, and the entire framework is publicly accessible via an easy-to-hack Python package.

As discussed in the Introduction, many real-world tasks can be formulated as optimization problems, and first-order methods remain a practical standard for large-scale settings. DiffBMP brings that optimization perspective directly into bitmap-based visual composition, making large primitive sets trainable and controllable while preserving flexibility for practical image/video creation pipelines.