AI Background Remover v2
Remove backgrounds from any image — people, products, pets, objects — directly in your browser. Powered by BiRefNet lite, a general-purpose dichotomous image segmentation model that is a substantial step up from portrait-only background tools.
Why an AI Tool That Runs in Your Browser
Most browser background removers are portrait-focused. They handle headshots well and struggle with everything else — a coffee mug picks up edges of the table, a cat keeps the chair behind it, a sneaker comes out with a halo where the floor meets the sole. The existing UDT MediaPipe-based tool is in that bucket: fast and reliable for selfies, frustrating for anything else.

This tool replaces it for general use with BiRefNet lite, a bilateral reference network from Peng Zheng and collaborators that handles dichotomous image segmentation — the catch-all category covering general subject extraction, salient object detection, and camouflaged object detection. The lite variant trades a small amount of accuracy for a much smaller download: 339 MB at full precision, 85 MB quantized, small enough to load in a usable amount of time over a normal home connection. The full BiRefNet model is around a gigabyte and not currently practical for the browser.

BiRefNet is released under the MIT license, which permits commercial use, modification, and redistribution. The upstream model card is published by ZhengPeng7 on Hugging Face, and the ONNX weights are mirrored by onnx-community.
How AI Background Remover v2 Works
Click Load model on first visit. The browser downloads the quantized ONNX weights (about 85 MB) from the Hugging Face CDN and caches them in IndexedDB. Drop or pick an image from your device. The tool runs BiRefNet lite via the transformers.js background-removal pipeline, which produces a single-channel alpha matte at the same resolution as the input (up to a 1024-pixel internal working size). The matte is composited against the original image to produce a transparent-background PNG, which appears below the input with a download button. The output is a 32-bit PNG with the alpha channel preserved, ready to drop into any compositing tool.

For complex edges — long hair, fur, leaves, semi-transparent fabric — BiRefNet is markedly better than the older MediaPipe-based tool. For high-volume portrait batch processing where speed matters more than edge quality, the original tool is still faster. Both stay live; this one is the new default for general work.
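For the curious, the whole flow maps onto a few lines of transformers.js. The sketch below is illustrative rather than the tool's actual source: the onnx-community/BiRefNet_lite model id matches the mirror named above, but the device and dtype option values, and the exact shape of the returned result, are assumptions based on the transformers.js v3 API.

```ts
// Minimal sketch of the load-and-run flow with transformers.js v3.
import { pipeline } from "@huggingface/transformers";

// First call downloads the quantized weights; later visits hit the cache.
const remover = await pipeline(
  "background-removal",
  "onnx-community/BiRefNet_lite",
  { device: "webgpu", dtype: "q8" } // assumed option values
);

// The pipeline handles preprocessing, inference, and matte compositing;
// we treat the first returned image as the cut-out subject (RawImage).
const [cutout] = await remover("photo.jpg");

// Export a 32-bit PNG with the alpha channel preserved.
const canvas = cutout.toCanvas() as HTMLCanvasElement;
canvas.toBlob((blob) => {
  if (!blob) return;
  const link = document.createElement("a");
  link.href = URL.createObjectURL(blob);
  link.download = "cutout.png";
  link.click();
}, "image/png");
```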
Frequently Asked Questions
What size is the BiRefNet lite model?
BiRefNet lite is approximately 85 MB quantized, downloaded once from the Hugging Face CDN. The full-precision variant is closer to 339 MB; the quantized weights produce nearly the same masks at a fraction of the size. The browser caches the model in IndexedDB so later visits load in a few seconds.
Does my image get uploaded for background removal?
No. After the model finishes downloading on first use, every background removal runs entirely in your browser. The image you pick stays on your device and is never sent to our servers or to a third-party API.
How does this compare to the existing UDT background remover?
The existing tool is based on MediaPipe Selfie Segmentation, which is portrait-tuned. It is excellent on headshots and struggles on objects, pets, and products. BiRefNet lite is a general-purpose dichotomous image segmentation model and produces cleaner edges across all those categories at the cost of a larger model download and slower per-image inference.
What license does BiRefNet use?
BiRefNet is released by Peng Zheng under the MIT license. MIT permits commercial use, modification, and redistribution. The ONNX weights are mirrored by the onnx-community organization on Hugging Face under the same license.
What kinds of subjects does it work well on?
People, products, animals, plants, vehicles, furniture, food, and most everyday objects against a wide variety of backgrounds. It also handles complex edges like hair, fur, leaves, and translucent fabric noticeably better than portrait-only tools.
Are there input image size limits?
The model resizes inputs to 1024 by 1024 internally before inference, then upscales the resulting matte to match the original image dimensions. Inputs of any reasonable size work; very large inputs (over about 4000 pixels on the longest side) use more memory and may run slowly on phones.
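The compositing step itself is plain canvas work. The sketch below (all names hypothetical, not the tool's code) shows the idea: upscale the model's 1024 by 1024 matte to the source dimensions, then write its intensity into the alpha channel of the full-resolution original.

```ts
// Hypothetical sketch of matte upscaling and alpha compositing.
function applyMatte(
  source: HTMLImageElement,
  matte: HTMLCanvasElement // 1024x1024 grayscale mask from the model
): HTMLCanvasElement {
  const out = document.createElement("canvas");
  out.width = source.naturalWidth;
  out.height = source.naturalHeight;
  const ctx = out.getContext("2d")!;
  ctx.drawImage(source, 0, 0); // original image at full resolution

  // Upscale the matte; the browser's bilinear filtering smooths edges.
  const scaled = document.createElement("canvas");
  scaled.width = out.width;
  scaled.height = out.height;
  const sctx = scaled.getContext("2d")!;
  sctx.drawImage(matte, 0, 0, scaled.width, scaled.height);

  // Copy the matte's intensity into the output's alpha channel.
  const img = ctx.getImageData(0, 0, out.width, out.height);
  const mask = sctx.getImageData(0, 0, out.width, out.height);
  for (let i = 0; i < img.data.length; i += 4) {
    img.data[i + 3] = mask.data[i]; // red channel of the grayscale matte
  }
  ctx.putImageData(img, 0, 0);
  return out;
}
```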
Why does the first removal take longer than later ones?
The first run includes the model download (about 85 MB) and a warm-up pass through the network. Subsequent runs reuse the cached model and warmed-up runtime, so they only spend time on inference. On a modern laptop with WebGPU a warm removal takes 2 to 6 seconds; the WebAssembly fallback takes roughly 3x longer.
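Backend selection is a one-line feature check. This is an illustrative sketch assuming transformers.js v3 option names; the warm-up pass shown is a common trick, not necessarily what this tool does.

```ts
import { pipeline } from "@huggingface/transformers";

// Prefer WebGPU when the browser exposes it; otherwise fall back to WASM.
const device = "gpu" in navigator ? "webgpu" : "wasm";
const remover = await pipeline(
  "background-removal",
  "onnx-community/BiRefNet_lite",
  { device }
);

// Optional warm-up: one throwaway pass on a tiny blank image compiles
// shaders and warms the runtime so the first real removal is faster.
const warmup = document.createElement("canvas");
warmup.width = warmup.height = 64;
await remover(warmup.toDataURL("image/png"));
```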
Can I batch-process multiple images?
Not in the current UI — drag and drop processes one image at a time. The model itself supports batched inference, and adding a queue is on the v3 roadmap. For now, removing the background from a few dozen images means running them through one at a time.
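If you need a stopgap today, a short script can loop a pipeline over a file list. This is a hypothetical sketch, not the planned queue; removeAll and its parameters are made up for illustration.

```ts
// Hypothetical sequential queue: one image at a time keeps peak memory bounded.
async function removeAll(
  files: File[],
  remover: (src: string) => Promise<unknown>
): Promise<unknown[]> {
  const results: unknown[] = [];
  for (const file of files) {
    const url = URL.createObjectURL(file);
    try {
      results.push(await remover(url));
    } finally {
      URL.revokeObjectURL(url); // release the blob URL promptly
    }
  }
  return results;
}
```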
Built by Derek Giordano · Part of Ultimate Design Tools