Confusion Matrix

Top-15 most frequent misclassifications on 468 valid predictions. 83 hallucination/parse failures excluded.

Actual DCAPredicted DCACount
121
RIGHT THROUGH
->110
CROSS TRAFFIC(INTERSECTIONS ONLY)
42
130
REAR END(VEHICLES IN SAME LANE)
->110
CROSS TRAFFIC(INTERSECTIONS ONLY)
30
113
RIGHT NEAR (INTERSECTIONS ONLY)
->110
CROSS TRAFFIC(INTERSECTIONS ONLY)
21
120
HEAD ON (NOT OVERTAKING)
->130
REAR END(VEHICLES IN SAME LANE)
11
160
VEHICLE COLLIDES WITH VEHICLE PARKED ON LEFT OF ROAD
->130
REAR END(VEHICLES IN SAME LANE)
11
147
VEHICLE STRIKES ANOTHER VEH WHILE EMERGING FROM DRIVEWAY
->130
REAR END(VEHICLES IN SAME LANE)
9
132
RIGHT REAR.
->110
CROSS TRAFFIC(INTERSECTIONS ONLY)
8
130
REAR END(VEHICLES IN SAME LANE)
->120
HEAD ON (NOT OVERTAKING)
6
131
LEFT REAR
->110
CROSS TRAFFIC(INTERSECTIONS ONLY)
5
171
LEFT OFF CARRIAGEWAY INTO OBJECT/PARKED VEHICLE
->175
OFF END OF ROAD/T-INTERSECTION.
5
133
LANE SIDE SWIPE (VEHICLES IN PARALLEL LANES)
->130
REAR END(VEHICLES IN SAME LANE)
5
173
RIGHT OFF CARRIAGEWAY INTO OBJECT/PARKED VEHICLE
->166
STRUCK OBJECT ON CARRIAGEWAY
5
133
LANE SIDE SWIPE (VEHICLES IN PARALLEL LANES)
->110
CROSS TRAFFIC(INTERSECTIONS ONLY)
4
184
OUT OF CONTROL ON CARRIAGEWAY (ON BEND)
->174
OUT OF CONTROL ON CARRIAGEWAY (ON STRAIGHT)
4
180
OFF CARRIAGEWAY ON RIGHT BEND
->174
OUT OF CONTROL ON CARRIAGEWAY (ON STRAIGHT)
4
Key finding: 121 ->110 is the single largest confusion pair (42 cases). The model shows strong bias toward DCA 110 (cross-traffic / intersection) -- the most frequent category in the training distribution. This is a calibration failure, not just a coverage gap.