Key Takeaways
1. Large language models struggle to provide accurate information to vulnerable groups, particularly those with lower education and English skills.
2. Research shows significant disparities in response rates, with advanced chatbots refusing to answer queries from less-educated, non-native English speakers more often than from others.
3. Some AI models respond to lower-educated users with patronizing or mocking language, contributing to a negative user experience.
4. Certain factual topics are withheld from less-educated users, creating unequal access to information based on background.
5. The personalization of AI systems may exacerbate existing biases, leading to misinformation spreading among those least equipped to challenge it.
Large language models are often praised as game-changing tools that can make information accessible to everyone around the world. Yet, recent findings from the Massachusetts Institute of Technology Center for Constructive Communication show that these AI systems do not perform well for the vulnerable groups that could gain the most from them.
Research Findings
This study was shared at the AAAI Conference on Artificial Intelligence and looked into advanced chatbots, including OpenAI’s GPT-4, Anthropic’s Claude 3 Opus, and Meta’s Llama 3. The researchers assessed these models using the TruthfulQA and SciQ datasets to evaluate their factual accuracy and truthfulness. They included user backgrounds that varied by education level, English skills, and nationality. The findings revealed a noticeable decline in accuracy for users who had less formal education or lower English skills. The negative impacts were even worse for users who fell into both of these categories.
Disparities in Query Handling
Additionally, the study pointed out concerning differences in how these models responded to requests. For example, Claude 3 Opus declined to answer almost 11% of queries from users with lower education and who were not native English speakers, while only 3.6% of queries were refused from control users. A lot of these refusals were met with patronizing or mocking responses, sometimes imitating broken English. The models also chose not to provide factual information on subjects like nuclear energy and historical topics to less-educated users from countries like Iran or Russia, even though they answered the same questions correctly for users from other backgrounds.
Warning Signs Ahead
The researchers caution that as personalization becomes more common in these systems, the built-in sociocognitive biases might worsen existing gaps in information access. This could lead to the spread of harmful behaviors and false information to those who are the least likely to notice or challenge it.
MIT News’
Source:
Link









