├── .env.example
├── .gitignore
├── .python-version
├── LICENSE
├── README.md
├── data
    └── puzzles.json
├── lib
    ├── __init__.py
    ├── chat_completions.py
    ├── grpo.py
    ├── inference_early_stop.py
    ├── models.py
    ├── pack.py
    ├── recipe.py
    ├── stream.py
    ├── tasks.py
    ├── temporal_clue.py
    ├── tokenize.py
    ├── tqdm.py
    ├── tune.py
    ├── types.py
    ├── utils.py
    └── vllm.py
├── pyproject.toml
├── train.ipynb
├── train.py
└── uv.lock

/.env.example:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/.env.example
--------------------------------------------------------------------------------

/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/.gitignore
--------------------------------------------------------------------------------

/.python-version:
--------------------------------------------------------------------------------
3.12
--------------------------------------------------------------------------------

/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/LICENSE
--------------------------------------------------------------------------------

/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/README.md
--------------------------------------------------------------------------------

/data/puzzles.json:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/data/puzzles.json
--------------------------------------------------------------------------------

/lib/__init__.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/__init__.py
--------------------------------------------------------------------------------

/lib/chat_completions.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/chat_completions.py
--------------------------------------------------------------------------------

/lib/grpo.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/grpo.py
--------------------------------------------------------------------------------

/lib/inference_early_stop.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/inference_early_stop.py
--------------------------------------------------------------------------------

/lib/models.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/models.py
--------------------------------------------------------------------------------

/lib/pack.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/pack.py
--------------------------------------------------------------------------------

/lib/recipe.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/recipe.py
--------------------------------------------------------------------------------

/lib/stream.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/stream.py
--------------------------------------------------------------------------------

/lib/tasks.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/tasks.py
--------------------------------------------------------------------------------

/lib/temporal_clue.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/temporal_clue.py
--------------------------------------------------------------------------------

/lib/tokenize.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/tokenize.py
--------------------------------------------------------------------------------

/lib/tqdm.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/tqdm.py
--------------------------------------------------------------------------------

/lib/tune.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/tune.py
--------------------------------------------------------------------------------

/lib/types.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/types.py
--------------------------------------------------------------------------------

/lib/utils.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/utils.py
--------------------------------------------------------------------------------

/lib/vllm.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/lib/vllm.py
--------------------------------------------------------------------------------

/pyproject.toml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/pyproject.toml
--------------------------------------------------------------------------------

/train.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/train.ipynb
--------------------------------------------------------------------------------

/train.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/train.py
--------------------------------------------------------------------------------

/uv.lock:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/OpenPipe/deductive-reasoning/HEAD/uv.lock
--------------------------------------------------------------------------------