Question 1

Does the output preserve table structure as real cells?

Accepted Answer

Yes. The tool detects rows by y-coordinate and columns by x-coordinate alignment, then writes the result to a real .xlsx file with one cell per detected cell. The output opens in Excel, Google Sheets, Numbers, and LibreOffice Calc as a proper spreadsheet, not a single column of text.

Question 2

How are multi-page tables handled?

Accepted Answer

Each PDF page becomes one Excel sheet, named Page 1, Page 2, and so on. When the column structure matches across pages, the same column positions are used for every sheet so you can copy data between them or stack them into a single sheet manually. The page-range input lets you limit conversion to specific pages.

Question 3

What about merged cells or rotated text?

Accepted Answer

Merged cells are detected when the row contains fewer text runs than the dominant row width — the merged value is placed in the first column it spans and the trailing columns are left blank. Rotated text (90 or 270 degrees) is read as a separate column because the y-coordinate dominates over the visual flow; if the rotation is in headers only, the headers may need a manual transpose.

Question 4

Will my file get uploaded anywhere?

Accepted Answer

No. The PDF is parsed locally by pdf.js, the rows and columns are detected locally, and the .xlsx is written locally by SheetJS. Nothing leaves your browser. This is the key reason to use a browser-based tool for bank statements, payroll, and other sensitive financial documents.

Question 5

What if the PDF has prose mixed with tables?

Accepted Answer

The detection works best when the page is mostly tabular. Mixed pages tend to bring some surrounding paragraphs into the spreadsheet as long single-cell rows. The cleanest workaround is to enter a page range like "3-5" before conversion to limit the output to pages that contain tables you actually want.

Question 6

Can I convert specific pages only?

Accepted Answer

Yes. The page range input accepts notation like "1-5,10,15-20" and limits conversion to those pages. This is useful for grabbing only the financial-statements pages from a long quarterly report or only the data appendix from an academic paper.

Question 7

Does it work with scanned bank statements?

Accepted Answer

Not directly. A scanned PDF is an image of text and has no positioning data to work with. The fix is to run the PDF through OCR first to add a text layer; the OCR text layer then has x and y coordinates the tool can read. Some bank statement PDFs from older systems are scanned; most modern ones include a text layer and convert cleanly.

Question 8

Is there a row or page limit?

Accepted Answer

There is no hard limit. The work happens in your browser, so the practical ceiling is your device memory. A 50-page bank statement with one table per page converts in a few seconds on modern hardware. If a conversion stalls on a very large file, narrow the page range and run it in chunks.

PDF to Excel Converter

PDF to Excel (.xlsx) Converter

Why Browser-Based PDF to Excel Beats Upload Services

How the Table Detection Works

Use Cases and Limitations

How We Compare to Paid PDF-Table Tools

Frequently Asked Questions