Page Weight & Asset Size Checker
Measure your HTML and every asset on the page against Googlebot's 2 MB per-file crawl limit and Bing's HTML size guidance.
Why Page Weight Matters for Crawling
In February 2026, Google updated its crawler documentation with a number that had been folklore for years: Googlebot crawls only the first 2 MB of a supported file type (64 MB for PDFs). The limit applies per resource — your HTML document, each CSS file, and each JavaScript file are fetched separately, and every one of them is cut off at 2 MB of uncompressed data. Anything past that point is never fetched: content is invisible to Search, and a JavaScript bundle truncated mid-file can make Google misread the entire page.
Bing approaches the document itself with soft guidance rather than a hard cutoff. Bing Webmaster Tools shows a low-priority notice for HTML above roughly 125 KB — "risks not being fully cached" — and current Bing documentation references a 1 MB soft limit. Pages over those thresholds are routinely indexed; the notice is an early signal, not a penalty. For perspective, the median HTML page on the web weighs about 30–33 KB — most sites are nowhere near any of these numbers. The risk group is JavaScript-heavy single-page apps and "inline everything" builds: hydration state dumped into the document, critical CSS inlined wholesale, SVG sprites embedded in markup, and multi-megabyte vendor bundles.
Because the limits apply to uncompressed bytes, gzip or Brotli compression does not rescue an oversized file. A 5 MB bundle that travels as 1 MB on the wire is still truncated at 2 MB once decoded. This tool reports both numbers for every file so you can see exactly where you stand.
What Gets Measured
- The HTML document — transferred and uncompressed size, checked against Google's hard 2 MB limit and Bing's soft 125 KB / 1 MB thresholds
- JavaScript —
script[src]tags and modulepreload hints - CSS — stylesheets and preloaded styles
- Images — img tags, every unique srcset variant, picture sources, favicons, and url(...) references in inline styles
- Fonts, media, iframes — preloaded fonts, video/audio sources and posters, embedded frames and objects
- Inline data: URIs — counted toward page weight without being fetched
How to Use the Results
Fix red warnings first — any file over Google's hard limit is being truncated by Googlebot today. Amber notices are softer signals worth addressing on your own schedule, not emergencies. For oversized HTML, externalize inlined scripts, styles, and state into separate files (each gets its own 2 MB budget). For oversized JavaScript, code-split the bundle into smaller chunks. For images and media, compress and serve modern formats. Remember that the limits are about uncompressed size: minification and code removal shrink the real number, gzip alone does not.
Heavy JavaScript is also the root cause of most rendering and indexation problems. If your content only appears after a multi-megabyte bundle executes, pre-rendering serves bots fully rendered HTML so indexing never depends on the bundle being fetched and executed at all. Pair this tool with the Quick SEO Audit for a broader technical check, Core Web Vitals for the user-facing performance picture, and the Crawler Checker to see what each bot actually receives from your server.
The canonical reference is Google's Googlebot documentation, which describes the per-file crawl limits and truncation behavior.