This is the collection of papers related to bias and fairness in IR with LLMs. These papers are organized according to our survey paper Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM Era [PDF].
[2024/10/25] Our updated version tutorial proposal has been accepted by WSDM 2025, see you in Hannover, Germany!
[2024/08/25] We provide a Lecture-Style Tutorial at KDD 2024 about "Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM Era"! For more details, please check it here. This survey is published with this tutorial as part of KDD 2024 proceedings.
Please feel free to contact us if you have any questions or suggestions!
If you find our work useful for your research, please cite our work:
@inproceedings{dai2024bias,
title={Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM Era},
author={Dai, Sunhao and Xu, Chen and Xu, Shicheng and Pang, Liang and Dong, Zhenhua and Xu, Jun},
booktitle={Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining},
pages={6437--6447},
year={2024}
}
In this survey, we provide a comprehensive review of emerging and pressing issues related to bias and unfairness in three key stages of the integration of LLMs into IR systems.
We introduce a unified framework to understand these issues as distribution mismatch problems and systematically categorize mitigation strategies into data sampling and distribution reconstruction approaches.
- LLMs may Dominate Information Access: Neural Retrievers are Biased Towards LLM-Generated Texts, Preprint 2023. [Paper]
- AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval, Preprint 2023. [Paper]
- Blinded by Generated Contexts: How Language Models Merge Generated and Retrieved Contexts for Open-Domain QA?, Preprint 2024. [Paper]
- Textbooks Are All You Need , Preprint 2023. [Paper]
- Measuring and Narrowing the Compositionality Gap in Language Models, Findings of EMNLP 2023 [Paper]
- In-Context Retrieval-Augmented Language Models, TACL 2023 [Paper]
- Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks, WWW 2024 [Paper]
- List-aware Reranking-Truncation Joint Model for Search and Retrieval-augmented Generation, WWW 2024 [Paper]
- Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation, Preprint 2024 [Paper]
- Improving Language Models via Plug-and-Play Retrieval Feedback, Preprint 2024 [Paper]
- Llama 2: Open Foundation and Fine-Tuned Chat Models, Preprint 2023 [Paper]
- Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization, ICLR 2023 [Paper]
- Recitation-Augmented Language Models, ICLR 2023 [Paper]
- Self-Consistency Improves Chain of Thought Reasoning in Language Models, ICLR 2023 [Paper]
- Large Language Models are Zero-Shot Rankers for Recommender Systems, ECIR 2024. [Paper]
- Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models, Preprint 2023. [Paper]
- RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation, Preprint 2023. [Paper]
- Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting, Preprint 2023. [Paper]
- Exploring Large Language Model for Graph Data Understanding in Online Job Recommendations, Preprint 2023. [Paper]
- Large Language Models are Not Stable Recommender Systems, Preprint 2023. [Paper]
- A Bi-Step Grounding Paradigm for Large Language Models in Recommendation Systems, Preprint 2023. [Paper]
- Large Language Models as Zero-Shot Conversational Recommenders, CIKM 2023. [Paper]
- Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation, EMNLP 2023. [Paper]
- Understanding Biases in ChatGPT-based Recommender Systems: Provider Fairness, Temporal Stability, and Recency, Preprint 2024. [Paper]
- ChatGPT for Conversational Recommendation: Refining Recommendations by Reprompting with Feedback, Preprint 2024. [Paper]
- Cross-Task Generalization via Natural Language Crowdsourcing Instructions, ACL 2022 [Paper]
- Multitask Prompted Training Enables Zero-Shot Task Generalization, ICLR 2022 [Paper]
- Self-Instruct: Aligning Language Models with Self-Generated Instructions, ACL 2023 [Paper]
- Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation, TACL 2023 [Paper]
- Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following, AAAI 2024 [Paper]
- LongAlign: A Recipe for Long Context Alignment of Large Language Models, Preprint 2024. [Paper]
- Data Engineering for Scaling Language Models to 128K Context, Preprint 2024. [Paper]
- Large Language Models Are Not Robust Multiple Choice Selectors, ICLR 2024. [Paper]
- Humans or LLMs as the Judge? A Study on Judgement Biases, Preprint 2024. [Paper]
- Benchmarking Cognitive Biases in Large Language Models as Evaluators, Preprint 2023. [Paper]
- Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions, Preprint 2023. [Paper]
- Large Language Models are not Fair Evaluators, Preprint 2023. [Paper]
- Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena, NeurIPS 2023. [Paper]
- Can Large Language Models be Trusted for Evaluation? Scalable Meta-Evaluation of LLMs as Evaluators via Agent Debate, Preprint 2024. [Paper]
- EvalLM: Interactive Evaluation of Large Language Model Prompts on User-Defined Criteria, CHI 2024. [Paper]
- LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores, Preprint 2023. [Paper]
- Verbosity Bias in Preference Labeling by Large Language Models, Preprint 2023. [Paper]
- Style Over Substance: Evaluation Biases for Large Language Models, Preprint 2023. [Paper]
- An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Models are Task-specific Classifiers, Preprint 2024. [Paper]
- G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment, Preprint 2023. [Paper]
- PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations, Preprint 2023. [Paper]
- ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning, Preprint 2023. [Paper]
- Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models, Preprint 2024. [Paper]
- PRE: A Peer Review Based Large Language Model Evaluator, Preprint 2024. [Paper]
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators, Preprint 2024. [Paper]
- LLM Evaluators Recognize and Favor Their Own Generations, Preprint 2024. [Paper]
- Measuring and Mitigating Unintended Bias in Text Classification, AIES 2018. [Paper]
- Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models, ACL 2023. [Paper]
- Gender Bias in Neural Natural Language Processing, Preprint 2019. [Paper]
- MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions, ACL 2023. [Paper]
- SaFeRDialogues: Taking Feedback Gracefully after Conversational Safety Failures, ACL 2022. [Paper]
- Do LLMs Implicitly Exhibit User Discrimination in Recommendation? An Empirical Study, Preprint 2023. [Paper]
- Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation, Recsys 2023. [Paper]
- Mitigating harm in language models with conditional-likelihood filtration, Preprint 2021. [Paper]
- Exploring the limits of transfer learning with a unified text-to-text transformer, JMLR 2020. [Paper]
- CFaiRLLM: Consumer Fairness Evaluation in Large-Language Model Recommender System, Preprint 2024. [Paper]
- BLIND: Bias Removal With No Demographics, ACL 2023. [Paper]
- Identifying and Reducing Gender Bias in Word-Level Language Models, NAACL 2019. [Paper]
- Reducing Sentiment Bias in Language Models via Counterfactual Evaluation, Findings-EMNLP' 20. [Paper]
- Reducing Gender Bias in Word-Level Language Models with a Gender-Equalizing Loss Function, ACL-workshop 2019. [Paper]
- Bias of AI-Generated Content: An Examination of News Produced by Large Language Models, Preprint 2023. [Paper]
- Educational Multi-Question Generation for Reading Comprehension, BEA-workshop 2022 [Paper]
- Pseudo-Discrimination Parameters from Language Embeddings, Preprint 2024 [Paper]
- Item-side Fairness of Large Language Model-based Recommendation System, WWW 2024 [Paper]
- Bias of AI-generated content: an examination of news produced by large language models, Scientific Reports [Paper]
- Generating Better Items for Cognitive Assessments Using Large Language Models, BEA-workshop 2023 [Paper]
- Dynamically disentangling social bias from task-oriented representations with adversarial attack, NAACL 2021 [Paper]
- Using In-Context Learning to Improve Dialogue Safety, EMNLP-findings 2023 [Paper]
- Large pre-trained language models contain human-like biases of what is right and wrong to do, NML 2023 [Paper]
- BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage, Preprint 2022 [Paper]
- Balancing out Bias: Achieving Fairness Through Balanced Training, EMNLP 2022 [Paper]
- Should We Attend More or Less? Modulating Attention for Fairness, Preprint 2023 [Paper]
- Constitutional AI: Harmlessness from AI Feedback, reprint 2022 [Paper]
- He is very intelligent, she is very beautiful? On Mitigating Social Biases in Language Modelling and Generation, ACL-findings 2021 [Paper]
- Does Gender Matter? Towards Fairness in Dialogue Systems, COLING 2020 [Paper]
- Training language models to follow instructions with human feedback, NeurIPS 2022 [Paper]
- Never Too Late to Learn: Regularizing Gender Bias in Coreference Resolution, WSDM 2023 [Paper]
- CFaiRLLM: Consumer Fairness Evaluation in Large-Language Model Recommender System, Preprint 2024 [Paper]
- UP5: Unbiased Foundation Model for Fairness-aware Recommendation, EACL 2024 [Paper]
- ADEPT: A DEbiasing PrompT Framework, AAAI 2023 [Paper]
- Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation, Recsys 2023. [Paper]
- Automatic Generation of Distractors for Fill-in-the-Blank Exercises with Round-Trip Neural Machine Translation, ACL-workshop2023. [Paper]
- Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions, ACL 2023 [Paper]
- Critic-Guided Decoding for Controlled Text Generation, ACL-finding 2023 [Paper]
- Item-side Fairness of Large Language Model-based Recommendation System, WWW 2024 [Paper]
- Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness, Preprint 2023 [Paper]
- Understanding Biases in ChatGPT-based Recommender Systems: Provider Fairness, Temporal Stability, and Recency, Preprint 2024 [Paper]
- A Preliminary Study of ChatGPT on News Recommendation: Personalization, Provider Fairness, Fake News, Preprint 2023 [Paper]
- Estimating the Personality of White-Box Language Models, Preprint 2022 [Paper]
- Tailoring Personality Traits in Large Language Models via Unsupervisedly-Built Personalized Lexicons, Preprint 2022 [Paper]
- FairMonitor: A Four-Stage Automatic Framework for Detecting Stereotypes and Biases in Large Language Models, Preprint 2023 [Paper]
- Evaluating and Inducing Personality in Pre-trained Language Models, NeurIPS 2023 [Paper]
- Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models, Preprint 2023 [Paper]
- Studying Large Language Model Generalization with Influence Functions, Preprint 2023 [Paper]
- Towards Tracing Knowledge in Language Models Back to the Training Data, EMNLP findings 2023 [Paper]
- Detecting Pretraining Data from Large Language Models, Preprint 2023 [Paper]
- Watermarking Makes Language Models Radioactive, Preprint 2024 [Paper]
- WASA: WAtermark-based Source Attribution for Large Language Model-Generated Data, Preprint 2023 [Paper]
- User Behavior Simulation with Large Language Model based Agents, Preprint 2023 [Paper]
- On Generative Agents in Recommendation, Preprint 2023 [Paper]
🎉👍 Please feel free to open an issue or make a pull request! 🎉👍