Your Tests Pass.
Your UX Breaks.
CI is green. Deploy shipped. But users can't checkout because a promo banner covers the button on mobile.
Capture real rendered UI state โ with salience scoring โ so LLMs catch regressions that E2E tests miss.
E2E Tests Say
- โ Button exists
- โ Button clickable
- โ Form submits
All pass. Ship it.
Semantic Check Says
- โ "Pay" button: salience 95%
- โ Promo banner: salience 70%
- โ Occlusion detected: 60%
CTA blocked on mobile. Regression.
"We don't ask the model what matters โ we tell it."
SiFR pre-weights elements by salience. LLM interprets, not discovers. That's why CSS noise is ignored, but hidden CTAs trigger alerts.
What Tests Miss, This Catches
Layout Regressions
- โข Banner covers checkout button
- โข Products pushed below the fold
- โข CMS update breaks grid
- โข A/B variant hides CTA
Responsive Breaks
- โข Desktop OK, mobile broken
- โข Tablet layout overlap
- โข Third-party widget covers form
- โข Chat widget blocks action
Security Issues
- โข Defacement detection
- โข Phishing overlay on login
- โข Content injection in critical areas
- โข Unexpected DOM changes
How It Compares
| Issue | E2E Tests | Visual Diff | E2LLM |
|---|---|---|---|
| Button covered by banner | โ Pass | โ Noise | โ Alert |
| Content pushed off-screen | โ Pass | โ Pass | โ Alert |
| Mobile-only break | โ Miss | โ Maybe | โ Alert |
| CSS font change | โ Ignore | โ 500 alerts | โ Ignore |
How It Works
Capture
SiFR extracts DOM with salience scores. Not raw HTML โ semantic structure.
Interpret
LLM describes functional state. What users can do, not what code says.
Compare
Detect meaning changes, not pixel changes. Functional regressions only.
The Loop
No screenshots. No guessing. Real state โ Real answers.
100% Local
Nothing leaves your browser. Ever. No tracking, no cloud.
Token Efficient
Compact JSON. 2KB instead of 10MB HTML. Built for LLM context.
Playwright Ready
Programmatic API for CI/CD integration. Post-deploy checks.
LLM Agnostic
Works with Claude, GPT, Grok, Llama. Your choice.
Your tests check if code works.
This checks if users can use it.
Free. Open source. Install in seconds.
Prompts & Examples ยท Curated prompts for QA, accessibility, security checks.
Your tests check if code works.
This checks if users can use it.