| Modeling judgment | Chooses models based on constraints and fit to the objective, with a clear rationale. | Defaults to popular models without context-based justification. |
| Production readiness | Covers serving architecture, observability, rollback, and retraining strategy. | Focuses only on offline accuracy and ignores deployment reliability. |
| Evaluation rigor | Explains metric choice, bias risks, and validation methodology. | Reports a single metric without explaining the methodology behind it. |
| Communication clarity | Translates complex ML tradeoffs into concrete product impact. | Uses technical jargon with unclear business relevance. |