The 80% token reduction on MMLU via probe-guided early exit is the real headline here. If reasoning models are confident in their answers well before the chain of thought finishes, there are massive efficiency gains on the table. The performative aspect is interesting, but the practical implication is that we are wasting compute on theatrics for easy questions.
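For anyone wondering what probe-guided early exit looks like mechanically: the usual recipe is a lightweight classifier trained on intermediate hidden states that, during CoT generation, signals when the answer is already determined so you can stop emitting reasoning tokens. A minimal sketch below, assuming a Hugging Face-style causal LM; the probe architecture, threshold, and hook points are illustrative assumptions, not the paper's actual implementation:

```python
import torch
import torch.nn as nn

class ConfidenceProbe(nn.Module):
    """Linear probe over a hidden state; outputs P(answer already determined)."""
    def __init__(self, hidden_dim: int):
        super().__init__()
        self.linear = nn.Linear(hidden_dim, 1)

    def forward(self, hidden_state: torch.Tensor) -> torch.Tensor:
        return torch.sigmoid(self.linear(hidden_state))

@torch.no_grad()
def generate_with_early_exit(model, tokenizer, prompt, probe,
                             threshold=0.9, max_new_tokens=512):
    """Greedy CoT generation that stops once the probe is confident.

    Assumes `model(input_ids, output_hidden_states=True)` exposes hidden
    states (true of Hugging Face causal LMs); the 0.9 threshold is an
    illustrative choice, not a value from the paper.
    """
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids
    for _ in range(max_new_tokens):
        out = model(input_ids, output_hidden_states=True)
        # Final layer, last token position: the state the probe was trained on.
        last_hidden = out.hidden_states[-1][:, -1, :]
        if probe(last_hidden).item() > threshold:
            break  # answer is effectively decided; skip the remaining CoT tokens
        next_token = out.logits[:, -1, :].argmax(dim=-1, keepdim=True)
        input_ids = torch.cat([input_ids, next_token], dim=-1)
        if next_token.item() == tokenizer.eos_token_id:
            break
    return tokenizer.decode(input_ids[0], skip_special_tokens=True)
```

The per-token probe call is a single linear layer, so its overhead is negligible next to the transformer forward pass, which is why the savings from truncated CoT dominate.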