01
Easy
Billing Receipt Request
Classify as Billing, optionally verify policy via search_kb, then resolve clearly.
02
Medium
Dashboard Error 404
Classify as Technical, pull KB guidance, and provide the correct fix path.
03
Hard
Digital Item Refund Demand
For non-refundable digital downloads, policy-safe handling usually means escalation.
04
Hard
VIP Outage
Tests whether the agent correctly classifies a high-urgency technical issue without being distracted by account-status keywords.
05
Medium
Vague Complaint
The user says "It is broken." Ask one clarifying question, then classify/search/resolve efficiently.
06
Hard
Subscription Plan Refund Difference
Reason over a refund-policy edge case and complete with a policy-consistent action.
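
The action orderings implied by the task descriptions can be written down as a small checker. This is only an illustrative sketch: `search_kb` is the one action name taken from the descriptions, while `classify`, `ask_clarification`, `resolve`, `escalate`, the `REQUIRED_STEPS` table, and `follows_expected_order` are hypothetical names chosen for the example, not the environment's actual API or grader. Optional steps (such as the KB lookup in task 01) are left out of the required sequence, and task 06 has no required steps because its description does not name a specific final action.

```python
# Hypothetical required action order per task, read off the descriptions.
# Only "search_kb" appears in the source; the other action names are assumed.
REQUIRED_STEPS = {
    "01": ("classify", "resolve"),               # search_kb is optional here
    "02": ("classify", "search_kb", "resolve"),
    "03": ("escalate",),                         # escalation is the usual safe path
    "04": ("classify",),                         # the test targets classification
    "05": ("ask_clarification", "classify", "search_kb", "resolve"),
    "06": (),                                    # final action not specified
}


def follows_expected_order(task_id, actions):
    """Return True if the required steps occur in `actions` in order.

    Extra actions between required steps are allowed (subsequence match).
    """
    remaining = iter(actions)
    # `step in remaining` advances the iterator until it finds `step`,
    # so each later step must appear after the earlier ones.
    return all(step in remaining for step in REQUIRED_STEPS[task_id])


if __name__ == "__main__":
    print(follows_expected_order("02", ["classify", "search_kb", "resolve"]))
    print(follows_expected_order("05", ["classify", "ask_clarification", "resolve"]))
```

A subsequence match is used rather than an exact match so that an optional extra step, such as a KB lookup in task 01, does not count as a failure.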