Blind DCA Classifier -- Results
Narrative -> DCA code | MiniMax-M2.7 | temp=0 | 551 records | 81-code taxonomy
F1 FINDING 83 of 551 records (15.1% hallucination or parse failures) are excluded from valid accuracy. Valid denominator = 468.
22.5%
Top-1 Accuracy
124/551 all records
26.5%
Top-1 (valid)
124/468 valid predictions only
42.6%
Top-3 (all)
235/551 all records
50.2%
Top-3 (valid)
235/468 valid predictions only
83
Failures
off-taxonomy or parse failure
427
Disagreements
LLM vs coder, arg-case supported
Per-decade accuracy (all 551 records)
Top confusion pairs (valid predictions only)
| Actual -> Predicted | Count | Predicted description |
|---|---|---|
121 -> 110 | 42 | CROSS TRAFFIC(INTERSECTIONS ONLY) |
130 -> 110 | 30 | CROSS TRAFFIC(INTERSECTIONS ONLY) |
113 -> 110 | 21 | CROSS TRAFFIC(INTERSECTIONS ONLY) |
120 -> 130 | 11 | REAR END(VEHICLES IN SAME LANE) |
160 -> 130 | 11 | REAR END(VEHICLES IN SAME LANE) |
147 -> 130 | 9 | REAR END(VEHICLES IN SAME LANE) |
132 -> 110 | 8 | CROSS TRAFFIC(INTERSECTIONS ONLY) |
130 -> 120 | 6 | HEAD ON (NOT OVERTAKING) |
131 -> 110 | 5 | CROSS TRAFFIC(INTERSECTIONS ONLY) |
171 -> 175 | 5 | OFF END OF ROAD/T-INTERSECTION. |