├── .gitignore ├── LICENSE ├── README.md ├── datasets └── DeepConsult │ ├── queries.csv │ └── responses_OpenAI-DeepResearch_vs_ARI_2025-05-15.csv ├── evals ├── deep_research_pairwise_evals.py ├── metrics │ └── deep_research_pairwise_metric.py └── utils.py └── requirements.txt /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/youdotcom-oss/ydc-deep-research-evals/HEAD/.gitignore -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/youdotcom-oss/ydc-deep-research-evals/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/youdotcom-oss/ydc-deep-research-evals/HEAD/README.md -------------------------------------------------------------------------------- /datasets/DeepConsult/queries.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/youdotcom-oss/ydc-deep-research-evals/HEAD/datasets/DeepConsult/queries.csv -------------------------------------------------------------------------------- /datasets/DeepConsult/responses_OpenAI-DeepResearch_vs_ARI_2025-05-15.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/youdotcom-oss/ydc-deep-research-evals/HEAD/datasets/DeepConsult/responses_OpenAI-DeepResearch_vs_ARI_2025-05-15.csv -------------------------------------------------------------------------------- /evals/deep_research_pairwise_evals.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/youdotcom-oss/ydc-deep-research-evals/HEAD/evals/deep_research_pairwise_evals.py -------------------------------------------------------------------------------- /evals/metrics/deep_research_pairwise_metric.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/youdotcom-oss/ydc-deep-research-evals/HEAD/evals/metrics/deep_research_pairwise_metric.py -------------------------------------------------------------------------------- /evals/utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/youdotcom-oss/ydc-deep-research-evals/HEAD/evals/utils.py -------------------------------------------------------------------------------- /requirements.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/youdotcom-oss/ydc-deep-research-evals/HEAD/requirements.txt --------------------------------------------------------------------------------