
AI Summarizer

Paste a long article, get a tight summary. Runs the distilled BART model entirely in your browser — nothing uploaded, no API key.


Why a Summarizer That Runs in Your Browser

Server-based summarizers see every paragraph you paste. For an internal memo, a draft contract, a leaked transcript, or anything else you would not paste into a public chatbot, that is a problem worth designing around. This tool loads a distilled BART model — about 155 MB, downloaded once and cached — and runs every summary directly in your browser. The text you paste never travels anywhere. No request log. No retention policy. No "we may use your prompts to improve our models."

If your browser supports WebGPU, inference runs on your graphics card, which on a modern laptop is fast enough to summarize a 2,000-word article in a handful of seconds. Browsers without WebGPU fall back to WebAssembly and run perfectly well, just a little slower.

The model is distilbart-cnn-6-6 from Sam Shleifer at Hugging Face, trained on the CNN/DailyMail summarization dataset and released under the Apache 2.0 license, which allows commercial use without strings.
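The WebGPU-or-WebAssembly decision described above can be sketched as a small helper. This is a hypothetical sketch, not the tool's actual source: it assumes a Transformers.js-style pipeline API and the Xenova ONNX conversion of the model, and the function and option names here are illustrative.

```javascript
// Minimal sketch: choose an inference backend based on WebGPU availability.
// Supporting browsers expose WebGPU as navigator.gpu; everywhere else we
// fall back to the WebAssembly backend.
function pickDevice(nav) {
  return nav && nav.gpu ? 'webgpu' : 'wasm';
}

// In the browser this choice might feed the pipeline load, roughly
// (hypothetical, assuming Transformers.js and the Xenova conversion):
//
//   import { pipeline } from '@huggingface/transformers';
//   const summarizer = await pipeline(
//     'summarization',
//     'Xenova/distilbart-cnn-6-6',
//     { device: pickDevice(navigator) }
//   );
```

Keeping the detection in a tiny pure function makes the fallback path easy to exercise outside a browser.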

How the Summarizer Works

Click Load model the first time you use the tool. The browser downloads the model files — about 155 MB total, served from the Hugging Face CDN — and stores them in IndexedDB. Subsequent visits skip the download and load the cached files, which takes a few seconds rather than a few minutes.

Paste your article in the text area. The model handles up to roughly 1,024 input tokens per pass (about 800 words of typical English prose); longer inputs are automatically chunked, each chunk summarized, and the chunk summaries concatenated. Two length controls let you bias the output toward shorter or longer summaries — minimum and maximum new tokens — and a beam-search toggle trades speed for slightly more polished phrasing. Output appears in the result panel with a copy button.

The model was trained on news articles, so it does best on factual, structured prose. Marketing copy, transcript fragments, and code-heavy text are out-of-distribution and produce weaker summaries — for those, paraphrasing or grammar-correcting a draft you wrote yourself is usually a better fit.
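The chunk-summarize-concatenate strategy for long inputs can be sketched as follows. This is an illustrative sketch, not the tool's actual code: `summarize` stands in for the model call, and the 800-word budget mirrors the roughly 1,024-token window mentioned above (a word-count split is only an approximation of token-level chunking).

```javascript
// Split text into chunks of at most maxWords words each.
function chunkWords(text, maxWords = 800) {
  const words = text.split(/\s+/).filter(Boolean);
  const chunks = [];
  for (let i = 0; i < words.length; i += maxWords) {
    chunks.push(words.slice(i, i + maxWords).join(' '));
  }
  return chunks;
}

// Summarize each chunk in turn and join the partial summaries.
// `summarize` is a placeholder for the per-chunk model call.
async function summarizeLong(text, summarize, maxWords = 800) {
  const parts = [];
  for (const chunk of chunkWords(text, maxWords)) {
    parts.push(await summarize(chunk));
  }
  return parts.join(' ');
}
```

Summarizing chunks sequentially rather than in parallel keeps peak memory use flat, which matters when the model is already occupying a few hundred megabytes of browser memory.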

Frequently Asked Questions

How big is the model download?
The summarizer model is approximately 155 MB, served from the Hugging Face CDN. The download happens once on first use — after that the model is cached in your browser and loads in a few seconds for subsequent sessions.
Is the article I paste in sent anywhere for summarization?
No. After the model files finish downloading on first use, every summary runs entirely in your browser. The text you paste never leaves your machine and is not sent to any server, including our own.
What model and license does the summarizer use?
The summarizer uses distilbart-cnn-6-6 from Sam Shleifer at Hugging Face, a distilled version of BART fine-tuned on the CNN/DailyMail dataset. It is released under the Apache 2.0 license, which permits commercial use.
What input length does the summarizer accept?
The model handles about 1,024 tokens per pass — roughly 800 words of typical English prose. For longer inputs the tool chunks the text, summarizes each chunk, and concatenates the chunk summaries. Very long inputs may take a minute or two on slower devices.
Why is the first summary slow but later ones fast?
The first run includes the model download (about 155 MB) and a warm-up pass. Subsequent runs reuse the cached model and warmed-up runtime, so they only spend time on inference. On a modern laptop with WebGPU a cold start can take 30-60 seconds and a warm summary runs in 3-8 seconds.
Does this work on phones?
Yes, on iPhones running iOS 17+ and on modern Android phones, though performance is slower than on a laptop. WebGPU support on mobile is still uneven — Safari on iOS uses WebGPU on iPhone 15 Pro and later, and Chrome on Android uses it on most flagships from 2023 onward. The WebAssembly fallback works everywhere else.
Can I summarize PDFs or Word documents directly?
Not directly — paste the extracted text into the input area. For PDFs use a PDF-to-text tool first; for Word documents, copy-paste from the document. Adding native file parsing is on the v34 roadmap.
How do the summaries compare to ChatGPT or Claude?
The hosted models from OpenAI and Anthropic produce noticeably better summaries — they are 10-100x larger and trained on much more diverse data. The trade-off is that this tool sends nothing to a server, requires no API key, has no rate limits, and works offline after the first model download. For sensitive content or high-volume use the privacy and cost advantages often outweigh the quality gap.

Built by Derek Giordano · Part of Ultimate Design Tools

Privacy Policy · Terms of Service