Coming soon for Gradio and Hugging Face apps

Finding the True Happy Path

Your AI app works in the demo. The trouble starts when real people show up angry, confused, typing in another language, or trying to break it on purpose. AppTestSite puts your Gradio and Hugging Face apps through those moments before your users do, then shows you exactly where they hold up and where they fall apart.

Three ways to break it before launch

Bring your own model and search keys. AppTestSite handles the hosting, the testing, the tracking, and the dashboards, so you stay in control of which models run and what they cost.

Synthetic stress tests

Simulate the users who give support bots trouble: the angry customer, the confused newcomer, the non-native English speaker, the deliberate breaker, and the one with oddly worded questions. You pick the model that plays them and the model that judges the results.

Fails 18% with non-native speakers

Human A/B tests

Wrap a Space with traffic splitting and put two versions in front of real people. Collect thumbs, head to head choices, and rubric scores, record the sessions, and feed the winning signal straight back into your evaluation set.

Variant B preferred 2 to 1

Factuality checks

Tell us what your app is about. AppTestSite proposes a fact checking plan, you approve it, and every answer gets checked against live web search. Catch the confident, plausible, and wrong responses before your users quote them back to you.

3 unsupported claims found

You keep the keys

Use your own frontier model and search keys. We never charge you for inference, and your keys are encrypted and never written to logs.

Export and share

Pull results into PDF, Word, CSV, or JSON, and share a single result set with a teammate through a link.

Test, fix, retest

Every report suggests changes to try. Apply one and rerun the same suite to see whether it actually moved the numbers.

Be first through the door

We are building AppTestSite now and looking for developers who ship public facing AI apps. Join the list to get early access and help shape what we build.