├── .gitignore ├── 1.train_gen.ipynb ├── 2.train_dpo.ipynb ├── 3.test.ipynb ├── README.md ├── common.ipynb ├── dpo.model └── gen.model /.gitignore: -------------------------------------------------------------------------------- 1 | **/.ipynb_checkpoints 2 | **/__pycache__ -------------------------------------------------------------------------------- /1.train_gen.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lansinuote/Simple_LLM_DPO/HEAD/1.train_gen.ipynb -------------------------------------------------------------------------------- /2.train_dpo.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lansinuote/Simple_LLM_DPO/HEAD/2.train_dpo.ipynb -------------------------------------------------------------------------------- /3.test.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lansinuote/Simple_LLM_DPO/HEAD/3.test.ipynb -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lansinuote/Simple_LLM_DPO/HEAD/README.md -------------------------------------------------------------------------------- /common.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lansinuote/Simple_LLM_DPO/HEAD/common.ipynb -------------------------------------------------------------------------------- /dpo.model: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lansinuote/Simple_LLM_DPO/HEAD/dpo.model -------------------------------------------------------------------------------- /gen.model: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lansinuote/Simple_LLM_DPO/HEAD/gen.model --------------------------------------------------------------------------------