460-Question Benchmark Overview
How Lewsearch synthetic panels are evaluated against independently fielded survey instruments from Texas, California, and national tri-metro sources.
Swarmgram Notes
Short memos on how we evaluate Lewsearch: aggregate benchmarks, protocols, and failure modes. Not a substitute for a full academic paper. Customer-facing tables live on lewsearch.com/methodology.
Why every Lewsearch accuracy number is out-of-sample, and how we prevent calibration leakage across the benchmark pool.
The strictest generalization test: questions sourced after training freeze, scored under pre-specified exclusion rules.
Building credibility without publishing the full internal roadmap. Public validation boundaries for Lewsearch.
Frozen predictions on Pew AI Trends and Pew Top Problems benchmarks before ground truth is published.