Stop testing knowledge. Start testing judgment.
WorkProbe measures how candidates actually think with AI — not whether they can pass a test AI could ace for them.
How it works
Candidate
AI Assistant
Judgment Score
The Problem
The problem isn't that candidates use AI. It's that nobody is measuring how well they use it.
64%
of employees put less effort into work, knowing AI will help.
KPMG/University of Melbourne 2025
58%
rely on AI output without verifying it.
KPMG/University of Melbourne 2025
The next hire
is statistically likely to accept AI output uncritically — unless you test for it.
Research-backed insight
Your current hiring process probably measures whether candidates can perform tasks. WorkProbe measures whether they can perform tasks with AI.
How WorkProbe Works
Three steps to measure real judgment.
Sarah Chen
VP of Operations
I need you to analyze Q3 revenue and find the issue. The board meets at 3pm.
Use any tools you need. Let me know what you find.
AI Assistant
Ready to help
I can help analyze the Q3 data. Would you like me to start with the revenue_q3.csv file?
Based on my analysis, revenue dropped 12% due to seasonal trends...
[May contain inaccuracies]
Data Files
4 files available
Click to preview • Drag to AI
The three-panel work environment: Boss, AI Assistant, and Data Files
Candidate enters a simulated work scenario
They get a realistic task, data files, an AI assistant, and a boss who's waiting for answers.
AI is available — and deliberately imperfect
The AI assistant helps, but includes subtle inaccuracies. The system observes whether candidates catch them.
We measure judgment, not knowledge
Five behavioral dimensions: prompt quality, critical filtering, adaptation speed, AI awareness, and calibrated trust.
The Five Dimensions We Measure
Prompt Quality
How precisely does the candidate formulate questions to the AI? Vague prompts signal vague thinking.
Critical Filtering
Does the candidate accept AI output at face value, or do they challenge, verify, and filter?
Adaptation Speed
When the AI gives an unexpected or unhelpful response, how quickly does the candidate adjust their approach?
AI Awareness
Does the candidate understand what the AI can and can't do, and use it accordingly?
Calibrated Trust
Does the candidate trust AI the right amount — neither blindly accepting nor reflexively rejecting?
Why This Matters
WorkProbe's methodology is grounded in peer-reviewed research from MIT, Harvard, Nature Human Behaviour, and CHI — the leading conferences in human-computer interaction. These aren't decorative citations. They're the reason the product works.
Human–AI teams underperform on decisions unless humans retain independent judgment
Vaccaro et al., Nature Human Behaviour, 2024
Overconfident professionals are the highest-risk group for AI overreliance
He et al., CHI 2023
The only proven intervention is cognitive forcing — requiring independent reasoning before AI exposure
Buçinca et al., Harvard, 2021
Even a mathematically ideal reasoner is vulnerable to AI sycophancy
Chandra et al., MIT, 2026
Assessment methodology grounded in 9 peer-reviewed studies spanning 106+ experiments and 48,000+ survey respondents.
The hiring process is broken.
You can't measure what you don't test for. WorkProbe fills the gap. It tests for judgment in the AI era — the one thing your current process probably doesn't.
Who It's For
If you hire people, WorkProbe is for you.
For HR Leaders
Replace gut-feel interviews with behavioral data on AI collaboration skills.
For Hiring Managers
See how candidates actually work before you commit to a hire.
For CTOs
Ensure your team can think with AI, not just use it.
Not just for technical roles.
Any role where judgment, decision-making, and critical thinking matter. In the AI era, that's every role.
Be among the first to test WorkProbe.
We're currently onboarding pilot companies. You'll hear from us within 48 hours.