some early work towards reproducing Language Models don't say what they think (Turpin et al.) in reasoning models like r1-distill-qwen-7b on a small subset of BBH (Big Bench Hard).
MarmikChaudhari/faithful-reasoning
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|