├── CODE_OF_CONDUCT.md
├── CONTRIBUTING.md
├── LICENSE
├── README.md
├── alpaca-farm-modifications.patch
├── scripts
    ├── consolidate_preference_data.py
    ├── format_context_distillation_training_data.py
    ├── format_input_prompts.py
    ├── gpt4_compare.py
    ├── postprocess.py
    └── simulate_preference_data.py
└── simulated_data.zip

/CODE_OF_CONDUCT.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/CODE_OF_CONDUCT.md
--------------------------------------------------------------------------------
/CONTRIBUTING.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/CONTRIBUTING.md
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/LICENSE
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/README.md
--------------------------------------------------------------------------------
/alpaca-farm-modifications.patch:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/alpaca-farm-modifications.patch
--------------------------------------------------------------------------------
/scripts/consolidate_preference_data.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/scripts/consolidate_preference_data.py
--------------------------------------------------------------------------------
/scripts/format_context_distillation_training_data.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/scripts/format_context_distillation_training_data.py
--------------------------------------------------------------------------------
/scripts/format_input_prompts.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/scripts/format_input_prompts.py
--------------------------------------------------------------------------------
/scripts/gpt4_compare.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/scripts/gpt4_compare.py
--------------------------------------------------------------------------------
/scripts/postprocess.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/scripts/postprocess.py
--------------------------------------------------------------------------------
/scripts/simulate_preference_data.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/scripts/simulate_preference_data.py
--------------------------------------------------------------------------------
/simulated_data.zip:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/facebookresearch/RLCD/HEAD/simulated_data.zip
--------------------------------------------------------------------------------