Question 1

How is Smart Cutout different from a background remover?

Accepted Answer

A background remover makes one automatic decision: keep the whole foreground, drop everything else. Smart Cutout is interactive and object-level, so you click the specific thing you want. That lets you pull one person out of a group, lift a single product off a busy shelf, or isolate an object the automatic tools merge into the background. Add more clicks to extend the selection, or switch to Remove mode to subtract anything the model included by mistake.

Question 2

How big is the download and is it cached?

Accepted Answer

The model used here, SlimSAM (a compressed version of the Segment Anything Model), is a compact download of roughly 40 MB, split across a vision encoder and a lightweight mask decoder. It downloads once from the Hugging Face CDN and is cached in your browser through IndexedDB, so later visits typically load it in a second or two.

Question 3

Are my images uploaded to a server?

Accepted Answer

No. Once the model has downloaded on first use, both the image encoding and every mask you generate run entirely in your browser. The image never leaves your device and is never sent to our servers or to any third-party API. You can disconnect from the internet after the model loads and the tool keeps working.

Question 4

Which model powers this and what is its license?

Accepted Answer

It uses SlimSAM, a compressed version of Meta AI's Segment Anything Model (SAM), with ONNX weights mirrored by the Xenova organization on Hugging Face. SAM is released under the Apache 2.0 license, which permits commercial use. That permissive license is the reason this tool can be offered free on an ad-supported site.

Question 5

How do I select exactly the object I want?

Accepted Answer

Click once on the object to get an initial selection. If the model grabs too little, click another spot on the same object to extend it. If it grabs too much, switch the click mode to Remove (or right-click on desktop) and click the area you want excluded. Each click refines the mask in place, and Clear points starts over.
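The add/remove click workflow above can be sketched as a small list of labeled points. This is an illustrative sketch, not the tool's actual code: the names `PromptState`, `addClick`, and `toDecoderInput` are hypothetical, and the only convention borrowed from SAM-style point prompts is that label 1 marks a point to include and label 0 marks a point to exclude.

```typescript
// SAM-style point prompt labels: 1 = include this spot, 0 = exclude it.
type PointLabel = 0 | 1;

interface ClickPoint {
  x: number;
  y: number;
  label: PointLabel;
}

// Hypothetical container for the clicks made on one image.
class PromptState {
  private points: ClickPoint[] = [];

  // Record a click. "add" extends the selection, "remove" subtracts from it.
  addClick(x: number, y: number, mode: "add" | "remove"): void {
    this.points.push({ x, y, label: mode === "add" ? 1 : 0 });
  }

  // Equivalent of "Clear points": drop every click and start over.
  clear(): void {
    this.points = [];
  }

  // Flatten to parallel arrays, the shape a point-prompted decoder usually expects.
  toDecoderInput(): { coords: number[][]; labels: number[] } {
    return {
      coords: this.points.map((p) => [p.x, p.y]),
      labels: this.points.map((p) => p.label),
    };
  }
}
```

Each new click is appended rather than replacing earlier ones, which is why every click refines the same mask instead of starting a new selection.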

Question 6

Why is the first click on a new image slow?

Accepted Answer

The heavy step is encoding the image, which the vision encoder does once per image right after you load it. After that, each click only runs the small mask decoder, which is near-instant. So the first selection waits on encoding, and every refinement afterward is fast.
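The encode-once, decode-per-click split can be sketched as follows. This is a minimal sketch under stated assumptions: `CutoutSession`, `loadImage`, and `select` are hypothetical names, the encoder and decoder are passed in as plain functions rather than real model calls, and everything is synchronous for brevity, whereas real in-browser inference would be asynchronous.

```typescript
type Embedding = Float32Array;
type Mask = Uint8Array;

interface Point {
  x: number;
  y: number;
  label: 0 | 1; // 1 = include, 0 = exclude
}

// Hypothetical session object: one expensive encode per image, many cheap decodes.
class CutoutSession<Image> {
  private encode: (img: Image) => Embedding;
  private decode: (emb: Embedding, points: Point[]) => Mask;
  private embedding: Embedding | null = null;

  constructor(
    encode: (img: Image) => Embedding,
    decode: (emb: Embedding, points: Point[]) => Mask,
  ) {
    this.encode = encode;
    this.decode = decode;
  }

  // Slow step: runs the vision encoder once and keeps the result.
  loadImage(img: Image): void {
    this.embedding = this.encode(img);
  }

  // Fast step: reuses the stored embedding, so only the decoder runs per click.
  select(points: Point[]): Mask {
    if (this.embedding === null) {
      throw new Error("load an image before selecting");
    }
    return this.decode(this.embedding, points);
  }
}
```

Loading a different image replaces the stored embedding, so the slow step is paid again only when the image changes.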

Question 7

What image formats and sizes work?

Accepted Answer

PNG, JPEG, WebP, and GIF (first frame) all work. There is no server limit since processing is local, but very large images take longer to encode and use more memory, especially on the WebAssembly fallback path. Browsers with WebGPU encode noticeably faster.
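As a rough illustration of the two points above, the sketch below picks a backend and bounds the size of a very large image before encoding. It rests on assumptions: `pickBackend` and `fitWithin` are hypothetical helpers, the presence of `navigator.gpu` is only a first-pass signal (a complete check would also request an adapter), and downscaling is one possible way to limit encode time and memory, not necessarily what this tool does.

```typescript
type Backend = "webgpu" | "wasm";

// Only the part of `navigator` this sketch reads.
interface NavigatorLike {
  gpu?: unknown;
}

// Prefer WebGPU when the browser exposes it, otherwise fall back to WebAssembly.
function pickBackend(nav: NavigatorLike): Backend {
  return nav.gpu !== undefined && nav.gpu !== null ? "webgpu" : "wasm";
}

// Shrink dimensions so the longest side is at most maxSide, keeping the aspect ratio.
// Images that already fit are returned unchanged.
function fitWithin(
  width: number,
  height: number,
  maxSide: number,
): { width: number; height: number } {
  const longest = Math.max(width, height);
  if (longest <= maxSide) {
    return { width, height };
  }
  const scale = maxSide / longest;
  return {
    width: Math.max(1, Math.round(width * scale)),
    height: Math.max(1, Math.round(height * scale)),
  };
}
```

In a browser, `pickBackend(navigator)` would be the typical call, and the `maxSide` value would be tuned to the device rather than fixed.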

Question 8

Can I use the cutouts commercially?

Accepted Answer

Yes. The model license (Apache 2.0) permits commercial use, and the cutout you export is just your own image with a transparent background, so its rights are whatever they already were for your original image. Nothing about this tool adds a watermark or a usage restriction.
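To make "your own image with a transparent background" concrete, here is a minimal sketch of how a binary mask could be applied to RGBA pixel data. The function name `applyMask` and the mask convention (one byte per pixel, 0 = outside the selection) are assumptions for illustration, not a description of the tool's internals.

```typescript
// rgba: 4 bytes per pixel (red, green, blue, alpha), as in canvas ImageData.
// mask: 1 byte per pixel, where 0 means the pixel is outside the selection.
function applyMask(rgba: Uint8ClampedArray, mask: Uint8Array): Uint8ClampedArray {
  if (rgba.length !== mask.length * 4) {
    throw new Error("mask size does not match image size");
  }
  // Copy so the original pixels stay untouched.
  const out = new Uint8ClampedArray(rgba);
  for (let i = 0; i < mask.length; i++) {
    if (mask[i] === 0) {
      out[i * 4 + 3] = 0; // alpha channel: fully transparent
    }
  }
  return out;
}
```

Only the alpha channel changes. The color values of the kept pixels are the original ones, which is why the cutout carries no watermark or other alteration.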

AI Smart Cutout

Why Object-Level Selection Beats One-Click Background Removers

How AI Smart Cutout Works

The Model, the License, and Your Privacy

Frequently Asked Questions
