First, to evaluate Graph-R1, we should use evaluation as the working directory.
cd evaluationThen, we need to set openai api key in openai_api_key.txt file.
python get_remote_score.py --dir ../expr_results/Qwen2.5-3B-Instruct_2WikiMultiHopQA_grpo
# python get_remote_score.py --dir ../expr_results/Qwen2.5-3B-Instruct_HotpotQA_grpo
# python get_remote_score.py --dir ../expr_results/Qwen2.5-3B-Instruct_Musique_grpo
# python get_remote_score.py --dir ../expr_results/Qwen2.5-3B-Instruct_NQ_grpo
# python get_remote_score.py --dir ../expr_results/Qwen2.5-3B-Instruct_PopQA_grpo
# python get_remote_score.py --dir ../expr_results/Qwen2.5-3B-Instruct_TriviaQA_grpo