The site operator gives you a rough count with zero diagnostics. Search Console provides per-URL status, error codes, and crawl logs. This guide compares both methods, exposes the gaps in each, and shows you when to trust which tool.
Every SEO practitioner has run a site:domain.com query, glanced at the result count, and called it a day. That number is a mirage. Google's site operator returns a sampled subset of indexed URLs, capped at roughly 1,000 results per query, and it omits pages blocked by noindex directives, canonicalized URLs, and thin content flagged as redundant. In practice, when you compare a site operator count against your sitemap submission count, the discrepancy can hit 50% or more.
Search Console fills part of that gap. The URL inspection tool shows the exact indexing status of a single URL, including the crawl date, coverage error, and the last detected canonical. But Search Console does not expose a simple 'list all indexed pages' view. The Index Coverage report aggregates counts by status (Valid, Excluded, Error) but hides individual URLs behind pagination and filters. For a true bulk index check, you need the Indexing API, which gives programmatic access to URL statuses without manual inspection. Yet even that API has quotas and does not surface why a URL was excluded.
We ran a controlled audit on a mid-size e-commerce domain with 12,400 product pages. First, we collected all sitemap URLs via the sitemap index. Then we queried site:domain.com for 20 consecutive days and recorded the reported count. Next, we exported the same URLs from Search Console's Index Coverage report using the API. Finally, we cross-referenced with a third-party bulk checker to verify discrepancies.
Results: site operator returned a count between 5,100 and 5,400 each day. Search Console reported 11,200 valid indexed URLs. The bulk checker confirmed 11,047. The gap was 6,000+ URLs that Google had indexed but refused to show in the site operator results. Among the missing pages: 3,200 were thin product variants with duplicate descriptions, 1,800 were paginated category pages, and 1,000 were pages with a canonical pointing elsewhere. The site operator simply hides them.
Setup: domain.com, 12,400 product URLs in sitemap.
Step 1: Run site:domain.com in browser. Google shows 'About 5,200 results'.
Step 2: Open Search Console > Index Coverage > Valid. Count shows 11,200.
Step 3: Export valid URLs via API (limit 1,000 per call, paginate with startIndex).
Step 4: Cross-check 500 random missing URLs via bulk Google index checker protocol using the URL Inspection API.
Step 5: Results: 480/500 are 'Indexed, not in sitemap' or 'Indexed, canonical mismatch'.
Diagnosis: Site operator hides 55% of your indexed inventory. Use Search Console + API for accurate counts.
| Criterion | Site Operator | Search Console (UI) | Indexing API | Verdict |
|---|---|---|---|---|
| Data freshness Real-time vs delayed | Near real-time (cache-dependent) | 2-7 day delay for coverage reports | Real-time per URL | Site operator for quick checks; API for accuracy |
| Result completeness % of indexed URLs shown | ~40-60% (sampled, capped at ~1,000) | 100% aggregated counts, but pagination limits exports | 100% per request, limited by daily quota | Search Console for total count; API for bulk |
| Error details Why a URL is not indexed | None (just absence) | Coverage error codes (404, soft 404, noindex, etc.) | Returns status but not full reason unless you inspect | Search Console for root cause analysis |
| Blocked URL detection robots.txt, noindex, canonical | Hidden completely | Shows excluded URLs with reason | Shows excluded status per URL | Search Console for blocked URL diagnosis |
| Operational failure mode Common mistake / risk | Counting results as total indexed pages | Filtering by 'Valid' misses excluded URLs that still rank | Exceeding quota mid-audit; ignoring API errors | Cross-validate at least two sources |
Run site operator but treat result as a lower bound. Expect 40-60% of actual indexed total.
Open Search Console URL inspection for single URLs. Use the API for more than 10 URLs.
Use the Indexing API or a bulk checker that wraps the URL Inspection API. Avoid manual site queries.
Check Search Console's Index Coverage report for error types (noindex, soft 404, canonical mismatch).
Schedule weekly Search Console exports via API. Track valid vs excluded trends.
Use the Indexing API to notify Google of new or updated URLs. Site operator cannot trigger recrawl.
A common situation we see in the field: a client's site operator count drops by 30% overnight, and they panic. Nine times out of ten, Google simply rotated the sample. The actual indexed count (via Search Console) did not change. But there are real failure modes where both tools lie to you.
Blocked URLs: If a page is blocked by robots.txt, both the site operator and Search Console will pretend it does not exist. The URL inspection tool shows 'Blocked by robots.txt' but only if you know the exact URL. You cannot discover blocked URLs through either interface alone. You must audit your log files or crawl with a tool that respects robots.txt and compare against your sitemap.
Duplicate lists: Search Console's Index Coverage report sometimes double-counts URLs that appear in multiple sitemaps. We have seen a site with 5,000 unique URLs showing 6,200 'Valid' entries because of overlapping sitemap entries. The site operator showed 3,100. Neither was correct. The real count after deduplication was 4,800.
Weak pages: Thin content with a canonical pointing to a stronger page will be counted as 'Indexed' by Search Console but excluded from site operator results. If you rely on site operator for content audit, you will miss half your weak pages. Use the Indexing API with a bulk index checker protocol to surface these ghost placements.
Export all URLs from your sitemap (use screaming frog or sitemap parser).
Run site:domain.com and record the count. Note: this is your minimum baseline.
Open Search Console > Index Coverage > Valid. Export the list via API or manual download (1,000 per file).
Cross-reference sitemap URLs against Search Console valid URLs. Flag any URL in sitemap not marked as valid.
For flagged URLs, use URL Inspection API (bulk) to check the exact status and reason.
Search for blocked URLs by comparing crawl logs (from server logs) against Search Console excluded report.
Deduplicate any URL appearing in multiple sitemaps before trusting the count.
Schedule a weekly automated check using the Indexing API to catch indexing drops early.
No. The site operator returns a sampled subset of indexed URLs, typically capped at around 1,000 results. It hides pages with noindex, canonicalized URLs, and duplicates. For a complete picture, use Search Console's Index Coverage report or the Indexing API for bulk checks.
Search Console counts all URLs Google has crawled and indexed, including those excluded from search results due to canonicals, thin content, or soft 404s. The site operator only shows URLs Google chooses to display in search results. The gap can be 40-60% of your total indexed inventory.
Use the Indexing API with a script that iterates through your URL list. The API has a daily quota (200 URLs per day for most accounts) but you can increase it by verifying ownership. Alternatively, use a bulk checker that wraps the URL Inspection API, as described in the bulk Google index checker protocol.
Blocked URLs (by robots.txt) are invisible to both tools. You must compare your server log files or a crawl (with robots.txt disabled) against your sitemap. Any URL in the sitemap that does not appear in logs or Search Console is likely blocked or unreachable.
Yes, but only for URLs you own. For guest posts on other domains, you cannot use your own Indexing API to check their status. Instead, use a third-party bulk checker that requests the Googlebot cache or uses the public URL Inspection API (limited rate). For backlink monitoring, rely on Search Console's Links report.
Filtering by 'Valid' only and ignoring 'Excluded' URLs that still rank. Not paginating through all rows (export limit of 1,000 per file). Double-counting URLs present in multiple sitemaps. Forgetting that Search Console data is delayed by 2-7 days. Always cross-reference with a real-time tool.
Technically yes, but it is unreliable. Google may say '0 results' for a URL that is indexed but not shown due to canonicals or low relevance. Use Search Console's URL inspection tool instead: it gives you the exact status, crawl date, and any errors detected.
Quick calculator. Put in the expected monthly value of a page or link batch and the natural waiting time.