StepFun Says Step 3.7 Flash Matches 97% of Claude Opus 4.6's Coding Performance at One-Ninth the Cost
SMRTR summary
StepFun's Step 3.7 Flash delivers 97% of Claude Opus 4.6's coding performance at one-ninth the cost — $0.19 versus $1.76 per task on SWE-Bench Verified. This is made possible by "Advisor Mode," which keeps the cheaper model running most tasks while only calling a more powerful model at critical decision points. The model also leads several benchmarks in tool orchestration and visual understanding, though it trails frontier models on terminal tasks and complex professional workflows.
SMRTR provides this summary for quick context. The original article belongs to Reddit.
Read the original article