├── LICENSE.txt ├── README.md ├── config.py ├── data └── train_data.json ├── dpo.py ├── inference.py ├── main.py ├── model ├── model.py └── reference_model.py ├── requirements.txt └── utils └── data_load.py /LICENSE.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/LICENSE.txt -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/README.md -------------------------------------------------------------------------------- /config.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/config.py -------------------------------------------------------------------------------- /data/train_data.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/data/train_data.json -------------------------------------------------------------------------------- /dpo.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/dpo.py -------------------------------------------------------------------------------- /inference.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/inference.py -------------------------------------------------------------------------------- /main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/main.py -------------------------------------------------------------------------------- /model/model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/model/model.py -------------------------------------------------------------------------------- /model/reference_model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/model/reference_model.py -------------------------------------------------------------------------------- /requirements.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/requirements.txt -------------------------------------------------------------------------------- /utils/data_load.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/OctopusMind/DPO/HEAD/utils/data_load.py --------------------------------------------------------------------------------