Key Takeaways
1. Mistral AI’s Le Chat is the top model for user privacy, gathering minimal data during training and use.
2. OpenAI’s ChatGPT ranks second, offering transparency and user options to control data usage.
3. Grok from xAI is third, noted for its clear data protection measures.
4. Meta AI, Google’s Gemini, and Microsoft’s Copilot score poorly due to extensive data collection and lack of opt-out options.
5. Only four out of nine evaluated models allow users to opt out of data use for training, highlighting a need for better privacy practices in AI.
The growing presence of artificial intelligence (AI) in our daily routines is accompanied by an increasing need for data privacy. Analysts at Incogni have recently assessed nine prominent Large Language Models (LLMs) to see which ones prioritize user privacy the most. Their evaluation is based on eleven criteria, grouped into three primary categories: data collection for training, transparency, and how data is used and shared.
Leading the Pack
Mistral AI’s Le Chat tops the list, according to Incogni. This model gathers relatively minimal user data, providing a highly efficient method for handling data during both training and active use. Close behind is OpenAI’s ChatGPT, which earns high marks for its transparency. Users have the option to indicate in their accounts whether their conversations can contribute to the model’s further development, and if they choose to deactivate history, their data remains unrecorded. In third place is Grok from xAI, also notable for its clear data protection measures.
At the Bottom
In contrast, Meta AI, Google’s Gemini, and Microsoft’s Copilot rank at the bottom. Incogni reports that these platforms gather large amounts of data while lacking clear options for users to opt out. The Chinese model DeepSeek also scored poorly, as these providers do not present straightforward ways for users to exclude their data from being used in training.
Evaluation Criteria
The analysis conducted by Incogni looked at platforms based on eleven criteria divided into three categories: data collection for training, transparency, and data use and sharing. The “data use and sharing” category made up 50% of the total score, with “transparency” accounting for 30% and “training data” 20%. Notably, out of the nine tools examined, only four allow users to actively opt out of having their data used for training purposes.
The findings suggest that data protection is not universally applied across the AI landscape. Therefore, it is crucial for users to understand the privacy policies of different providers, especially when dealing with sensitive information, confidential client details, or personal data. For those who prioritize data security, options like Le Chat, ChatGPT, and Grok offer reasonable solutions. However, truly data-efficient LLMs are likely to remain rare even in 2025.
Source:
Link



