Tag: DeepSeek

  • Anthropic Claims China’s DeepSeek Plagiarized Claude AI for Censorship

    Anthropic Claims China’s DeepSeek Plagiarized Claude AI for Censorship

    Key Takeaways

    1. Anthropic is concerned about Chinese firms like DeepSeek copying its Claude LLM and lacking protective measures against harmful usage.

    2. DeepSeek has removed free speech protections from Claude and conforms to Chinese government censorship.

    3. Anthropic engaged with the Pentagon, clarifying that Claude is not used for military control or surveillance, despite the Pentagon’s desire for more flexible AI use.

    4. Anthropic highlights alarming AI plagiarism practices by Chinese firms, particularly MiniMax, which has significantly copied Claude’s capabilities.

    5. Anthropic calls for industry collaboration with major players like OpenAI and Google to strengthen AI export controls and combat AI model distillation.


    As AI technologies from Chinese firms become more advanced, Anthropic has disclosed a significant attempt by companies like DeepSeek to copy the core elements of its acclaimed Claude LLM and present it as their proprietary development.

    Concerns Over AI Safeguards

    Anthropic is apprehensive that these AI agents lack the protective measures against harmful usage that Claude possesses. The open-source nature of DeepSeek might enable powerful AI resources to fall into the hands of both state and non-state entities that are less concerned about how they are used.

    DeepSeek, which has made waves in the industry for its ability to maximize efficiency in AI computing, has removed Claude’s free speech protections. Moreover, its agent avoids discussions on topics that the Chinese government’s censorship apparatus seeks to suppress.

    Pentagon Encounter and AI Ethics

    Anthropic, known for creating some of the most cutting-edge AI tools available, recently had an engagement with the Pentagon about using Claude in military contexts. The company maintains that Claude is not employed for controlling unmanned weapon systems or surveilling American citizens. However, the Pentagon desires greater leeway in utilizing its AI without such stringent restrictions.

    When DeepSeek and similar companies use Claude’s decision-making algorithms through a vast distributed proxy network that processes millions of queries via fake accounts—an act referred to as “distillation”—the resultant agents could be employed by a state military without any limitations.

    Alarm Over AI Plagiarism

    This is why Anthropic is raising concerns about this practice, which it has linked to three Chinese AI firms: DeepSeek, MiniMax, and Moonshot. Anthropic is also working on ways to counter these issues. In particular, the extent of AI model copying by MiniMax is astonishing. Anthropic identified over 13 million exchanges with Claude that frequently enhanced the MiniMax model in real time while significant updates to Claude were being implemented.

    Anthropic has now established measures to recognize millions of seemingly harmless queries that are utilized for distillation. For example, inquiries about how Claude would approach a “goal to deliver data-driven insights—not summaries or visualizations—grounded in real data and supported by complete and transparent reasoning” are analyzed to refine Chinese LLMs.

    Call for Industry Collaboration

    Anthropic cautions that it cannot combat this challenge independently and is seeking to enlist other major players in the industry, such as OpenAI and Google. The aim is to persuade policymakers to bolster AI export control laws with anti-distillation provisions instead of merely prohibiting Nvidia Blackwell GPUs and similar hardware.

    Source:
    Link


     

  • DeepSeek OCR AI: Process 200,000 Pages Daily with Nvidia A100

    DeepSeek OCR AI: Process 200,000 Pages Daily with Nvidia A100

    Key Takeaways

    1. DeepSeek stands out for its efficiency and cost-effectiveness compared to other AI models like ChatGPT and Gemini due to its open-source nature.
    2. The DeepSeek-OCR model achieves 97% recognition accuracy while compressing documents into images, with a compression ratio of under 10x.
    3. DeepSeek-OCR can process up to 200,000 pages daily using just one Nvidia A100 GPU, significantly outperforming other solutions in speed and scale.
    4. The model employs advanced algorithms that maintain accuracy across various document sizes and types, including complex documents with graphs and diagrams.
    5. Extensive training on 30 million PDF pages in multiple languages has improved accuracy, but the impact on reasoning abilities in language models remains uncertain.


    With the rise of AI data centers and the related costs of processing, the focus has shifted towards the effectiveness of algorithms. Among all, DeepSeek stands out for its efficiency. Its models are available as open source, making their training considerably cheaper than that of OpenAI’s ChatGPT or Google’s Gemini.

    A Breakthrough in Learning Efficiency

    The recently introduced DeepSeek-OCR model demonstrates remarkable learning efficiency. It utilizes optical mapping to significantly compress lengthy documents by transforming them into images, achieving an impressive 97% recognition accuracy with a compression ratio of under 10x.

    By employing advanced encoder and decoder techniques, the model can turn over nine tokens of document text into just a single visual token, which greatly reduces the computational resources needed for processing. Even at a 20x compression ratio, the DeepSeek-OCR system can still maintain a 60% optical recognition accuracy, which is quite an extraordinary achievement.

    Speed and Scale of Processing

    Thanks to innovative AI compression algorithms, DeepSeek-OCR can process scientific or historical texts at an astonishing rate of 200,000 pages each day using just one Nvidia A100 data center GPU. This means that a 20-node A100 cluster can handle about 33 million document pages daily, marking a significant advancement in the learning of text-heavy LLMs. Based on the OmniDocBench rankings, DeepSeek-OCR far surpasses other well-known solutions like GOT-OCR2.0 and MinerU2.0 in terms of the number of vision tokens utilized per page.

    The new DeepEncoder algorithms are capable of managing various document sizes and resolutions without losing speed or accuracy. Meanwhile, the DeepSeek3B-MoE-A570M decoder uses a mixture-of-experts architecture that shares knowledge among specialized models tailored for each OCR task. This enables DeepSeek-OCR to effectively process intricate documents that include graphs, scientific formulas, diagrams, or even images, regardless of the languages used.

    Comprehensive Training for Accuracy

    To reach such a high level of scale and precision, DeepSeek processed 30 million pages in Portable Document Format (PDF) across nearly 100 different languages. This extensive training included diverse categories, from newspapers and scientific handwriting to textbooks and PhD dissertations. However, while the rapid and efficient visual tokenization provided by the new DeepSeek-OCR system is impressive, it remains uncertain whether this will translate into improved performance in language models, particularly in reasoning abilities when compared to the existing text-based token systems.

  • Shanghai AI Conference Begins with 800 Exhibitors Amid US Restrictions

    Shanghai AI Conference Begins with 800 Exhibitors Amid US Restrictions

    Key Takeaways

    1. The World Artificial Intelligence Conference (WAIC) will feature over 800 companies showcasing more than 3,000 AI products, including 40 large language models and 60 intelligent robots.

    2. Premier Li Qiang will emphasize the need for a global governance framework for AI to prevent fragmented regulations that could favor a few powerful states or corporations.

    3. The conference occurs amidst US restrictions on advanced chips, yet local companies like DeepSeek are achieving significant innovations that rival top US systems.

    4. Major Chinese firms like Huawei and Alibaba will dominate the exhibition, alongside Western companies such as Tesla and Amazon, highlighting a mix of established and start-up players.

    5. The WAIC serves as both a technology exhibition and a diplomatic forum, with potential implications for future global AI regulations and competition.


    Shanghai’s World Artificial Intelligence Conference (WAIC) kicks off this weekend, boasting the largest exhibitor lineup in its history. The organizers are anticipating participation from over 800 companies, presenting more than 3,000 AI products. This includes 40 large language models, 50 AI devices, and 60 intelligent robots.

    Keynote Address

    Premier Li Qiang is set to give the keynote speech, where he will advocate for a global governance framework that balances technological advancement with security concerns. He points out that fragmented regulations could make AI an “exclusive game” for a small number of states or corporations. To address this, he has suggested the creation of a new organization to coordinate standards, especially for countries that are still developing.

    Event Context

    The conference takes place amid ongoing US restrictions on advanced chips and manufacturing tools, which began during the Trump era and have since become stricter. Despite these supply issues, local innovations have emerged: DeepSeek, based in Hangzhou, revealed a cost-effective model this year that competes with top US systems. Nvidia’s CEO Jensen Huang has recently commended AI models from DeepSeek, Alibaba, and Tencent, labeling them as “world-class.”

    Show Highlights

    The show floor will be dominated by major Chinese players like Huawei and Alibaba, while Western companies such as Tesla, Alphabet, and Amazon also have a presence. Start-ups add diversity to the event; for example, Unitree, a humanoid-robot company, will showcase its newest bipedal designs.

    As national AI strategies continue to diverge, the three-day WAIC serves as both a technology exhibition and a diplomatic meeting point. The impact of Li’s appeal for unified regulations could significantly influence the future of the global AI competition.

    Source:
    Link

  • German Authorities Urge Google and Apple to Remove Deepseek App

    German Authorities Urge Google and Apple to Remove Deepseek App

    Key Takeaways

    1. Deepseek has been prohibited in Italy, and German officials are taking action to remove it from Google and Apple’s platforms.
    2. Allegations include violations of EU data protection laws, particularly regarding user data transfer to China without adequate safeguards.
    3. The app collects sensitive user information, raising concerns about potential access by Chinese authorities.
    4. The Berlin data protection authority may impose fines of up to 4% of Deepseek’s global revenue, but enforcement against a foreign entity is challenging.
    5. The request to block Deepseek follows a previous warning to halt data transfers, and while it may be removed from app stores, it will still be accessible via web browsers.


    Deepseek has been prohibited in Italy, and now German data protection officials are taking action against the widely-used AI application from China. According to Der Spiegel, the Berlin Commissioner for Data Protection and Freedom of Information has filed a complaint with both Google and Apple, formally asking them to remove the Deepseek app from their platforms, making it unavailable to users in Germany.

    Allegations of Data Violations

    The basis for this request is purported violations of data protection laws, particularly concerning the transfer of user data from Europe to China. The company has not presented adequate proof that user data is safeguarded in China in a similar way as it is in Europe. According to the EU’s GDPR (General Data Protection Regulation), protecting user data is a fundamental requirement for its transfer to nations outside the EU. However, this does not ensure that other Chinese firms or the Chinese government cannot access data from European users.

    Concerns Over User Data

    This situation is particularly alarming because the chatbot app gathers a wide variety of potentially sensitive information about its users, such as text inputs, chat histories, uploaded files, location details, and device data. Chinese authorities may potentially gain access to all this information, which is already in the possession of the state for all domestic businesses.

    Possible Penalties and Future Actions

    The Berlin data protection authority has the option to impose a fine that could reach up to 4% of the company’s worldwide revenue. Nonetheless, as officials have indicated, enforcing this against a foreign entity would be a challenging task. It is worth noting that this action did not come without prior warning; in May, Berlin’s data protection officials had already set a deadline for the company to halt data transfers to China. Since the Deepseek developers failed to meet this deadline, the request for blocking has been made under the Digital Services Act. Apple and Google are now required to make a decision regarding the blocking very soon. However, the model will still remain accessible through web browsers in the future.

    Source:
    Link

  • DeepSeek Accused of Evading U.S. Export Controls on Nvidia Chips

    DeepSeek Accused of Evading U.S. Export Controls on Nvidia Chips

    Key Takeaways

    1. DeepSeek’s Military Ties: The U.S. State Department accuses DeepSeek of knowingly assisting China’s military and intelligence operations through various defense projects.

    2. Export Control Evasion: DeepSeek allegedly attempted to circumvent U.S. export regulations to acquire Nvidia H100 chips using shell companies and foreign data centers.

    3. Data Sharing Concerns: DeepSeek is claimed to be supplying user data and analytics to Chinese state-controlled surveillance systems under local laws.

    4. Discrepancies in Chip Acquisition: While DeepSeek is believed to have some Nvidia H100 chips, it reportedly owns far fewer than previously suggested, with Nvidia asserting compliance with export regulations.

    5. Skepticism Over Performance Claims: Independent analysts question DeepSeek’s claims of matching U.S. technology, raising doubts about its rapid growth reliant on U.S. tech and resources.


    A senior official from the U.S. State Department has stated that the AI company DeepSeek is “knowingly” aiding China’s military and intelligence operations while attempting to evade American export regulations to obtain the latest Nvidia H100 accelerators. This assessment, initially revealed by Reuters, represents a significant public condemnation from Washington towards the Hangzhou-based startup.

    DeepSeek’s Involvement with the PLA

    Records from procurement indicate that DeepSeek has engaged in over 150 projects for the People’s Liberation Army and various defense-related organizations. Under Chinese law, companies are required to provide data when requested by authorities. However, the official claims that DeepSeek is actively delivering user data and usage analytics to state-controlled surveillance systems.

    Alleged Attempts to Bypass Export Controls

    The same source claims that DeepSeek attempted to acquire H100 chips through shell companies located in Southeast Asia and sought to utilize foreign data centers to gain remote access to U.S. hardware—actions aimed at circumventing restrictions set in 2022. Three sources in the industry informed Reuters that while the startup does possess some H100 chips, it is likely to have far fewer than the 50,000 units speculated earlier this year. Nvidia disputes this claim, asserting that internal investigations reveal DeepSeek acquired only the export-compliant H800 model.

    Skepticism Surrounding DeepSeek’s Claims

    DeepSeek gained attention in January when it announced that its DeepSeek-V3 and DeepSeek-R1 models either matched or surpassed leading U.S. technologies “at a much lower price.” However, independent analysts contest this assertion, suggesting the actual training costs probably exceeded $5.6 million. Washington’s recent concerns contribute to growing doubts about the company’s rapid growth, which appears to significantly rely on U.S. technology and resources.

    DeepSeek has not responded to inquiries regarding its privacy policies, chip purchases, or alleged military collaborations. Nvidia has stated it does not endorse any parties that violate export regulations and emphasizes that current controls have effectively barred it from the Chinese data center market. Meanwhile, Malaysia’s trade ministry is investigating whether an unnamed Chinese company is using Nvidia-powered servers in the country for training large language models, highlighting regional caution against similar circumventions.

    Source:
    Link

  • Deepseek Supports China’s Military Operations, Says US Official

    Deepseek Supports China’s Military Operations, Says US Official

    Key Takeaways

    1. DeepSeek’s Initial Claims: DeepSeek claimed its tools were equal or superior to American competitors upon its debut in January 2025.

    2. Allegations of Military Support: A State Department official revealed that DeepSeek allegedly provides aid to China’s military and intelligence efforts, beyond just open-source access.

    3. Data Sharing Issues: Concerns have arisen about DeepSeek sharing user data with China’s surveillance system amid ongoing U.S.-China trade tensions.

    4. Links to Chinese Military: DeepSeek is reportedly linked to the Chinese People’s Liberation Army and has been accused of using tactics to bypass U.S. export controls for advanced chips.

    5. Nvidia Access and Investigations: DeepSeek has significant access to Nvidia’s information, raising concerns about potential remote access to U.S. chips by China, prompting further investigations.


    When DeepSeek made its debut in January 2025, it created quite a stir by asserting that its tools were on par with or even better than those of its American rivals. However, the company is now facing a significant scandal after a senior official from the State Department revealed shocking information to Reuters.

    Serious Allegations Unfold

    In a confidential interview, the official stated, “We are aware that DeepSeek has willingly provided and will probably continue to provide aid to China’s military and intelligence efforts.” This official, who requested to remain unnamed, elaborated, “This action goes far beyond mere open-source access to DeepSeek’s AI models.”

    Data Sharing Concerns

    Amid the ongoing trade tensions between the U.S. and China, the official claimed that DeepSeek has shared user data and analytics through the surveillance system employed by Beijing. Moreover, since companies like Amazon, Alphabet, and Microsoft offer DeepSeek’s services to their clients, there are serious worries regarding potential data breaches.

    Infrastructure and Military Links

    U.S. lawmakers have pointed out that DeepSeek sends American data to China via backend systems linked to China Mobile, a state-owned telecom giant. Additionally, the company appears in procurement documents over 150 times related to the Chinese People’s Liberation Army and other defense-related organizations in China. The official alleged that DeepSeek has employed various tactics to bypass export controls to obtain advanced chips made in the U.S.

    Nvidia Data Access

    It’s important to recognize that DeepSeek has considerable access to Nvidia’s information. Nonetheless, due to the current restrictions, there are suspicions that China might have remote access to U.S. chips. Furthermore, U.S. officials have yet to blacklist DeepSeek and have not asserted that Nvidia was aware of any collaboration between DeepSeek and the Chinese military. As a result, additional investigations are set to occur in the upcoming days, potentially leading to new findings.

    Source:
    Link

  • DeepSeek Launches Enhanced V3 AI Model Just 3 Months After Release

    DeepSeek Launches Enhanced V3 AI Model Just 3 Months After Release

    Key Takeaways

    1. DeepSeek introduced the DeepSeek-V3-0324, an updated version of its V3 AI large-language model with improved performance and reduced hardware requirements.
    2. The V3 model is a non-reasoning AI designed for quick answers, featuring 685 billion parameters, making it one of the largest publicly available LLMs.
    3. The updated model scored between 5.3% and 19.8% higher on AI benchmarks compared to its predecessor, competing well with other AIs like GPT-4.5 and Claude Sonnet 3.7.
    4. Enhanced capabilities include improved web page design, user interface creation for online games, and better skills in Chinese searching, writing, and translating.
    5. To run the full DeepSeek-V3-0324 model, users need 700 GB of storage and Nvidia A100/H100 GPUs, while smaller versions can operate on a single Nvidia 3090 GPU.


    DeepSeek, a Chinese artificial intelligence firm, has introduced the DeepSeek-V3-0324, an updated iteration of its V3 AI large-language model (LLM), which was first unveiled in December 2024. This V3 model astonished many due to its significantly reduced hardware needs for training, quicker training durations, and lowered API expenses, while still delivering high performance compared to rival AIs like OpenAI’s GPT models.

    Key Features of the V3 Model

    The revised V3 model is classified as a non-reasoning AI, which means it focuses on providing quick answers without taking extra time to ponder complex issues, in contrast to DeepSeek’s R1 model. With a massive size of 685 billion parameters, it ranks among the largest publicly accessible LLMs. Users can leverage the latest model under the MIT License.

    DeepSeek-V3-0324 has shown improvements, scoring between 5.3% and 19.8% higher on AI benchmarks compared to the previous V3 version. Its performance stands alongside other leading AIs, including GPT-4.5 and Claude Sonnet 3.7.

    Enhanced Capabilities

    The updated model brings several enhancements. One notable upgrade is its enhanced ability to design attractive web pages and user interfaces for online games. Additionally, its skills in searching, writing, and translating in Chinese have seen significant improvements.

    For those interested in testing the complete 685B DeepSeek-V3-0324, it is necessary to have at least 700 GB of available storage and several Nvidia A100/H100 GPUs. However, smaller and distilled versions of the model can be operated on a single GPU, like the Nvidia 3090 (available for purchase on Amazon).

    Source:
    Link


  • China’s PLA Boosts Support Ops with DeepSeek AI Technology

    China’s PLA Boosts Support Ops with DeepSeek AI Technology

    Key Takeaways

    1. DeepSeek’s AI systems are currently used in healthcare and logistics within the People’s Liberation Army (PLA), with potential future applications in military intelligence.
    2. The PLA employs DeepSeek’s open-source large language models (LLMs) in various military hospitals and support units, ensuring patient data remains secure.
    3. DeepSeek’s technology assists the People’s Armed Police in physical training and psychological support, allowing for initial testing before sensitive military applications.
    4. DeepSeek plays a significant role in military intelligence, enabling real-time analysis of battlefield information to improve situational awareness and strategic planning.
    5. Experts predict that continued integration of AI in the PLA could lead to direct combat applications, highlighting the importance of effective AI incorporation in military operations.


    DeepSeek’s AI systems are currently operational in hospitals and support units of the People’s Liberation Army, with experts predicting a future shift towards battlefield intelligence functions.

    Current Usage in Healthcare

    Reports from Chinese media indicate that the PLA is utilizing DeepSeek’s AI technology for support roles that do not involve combat. At this time, the focus is primarily on healthcare and logistics. However, defense analysts anticipate that it won’t take long before AI will be deployed in military intelligence and crucial decision-making processes.

    Implementation in Defense

    DeepSeek’s open-source large language models (LLMs) are being utilized in PLA hospitals, the People’s Armed Police, and various defense mobilization departments. The Central Theatre Command general hospital has recently mentioned its “embedded deployment” of the R1-70B LLM, which provides treatment suggestions while ensuring patient data remains on local servers. Additionally, Beijing’s renowned “301 Hospital,” which caters to high-ranking officials, has collaborated with Huawei to integrate the DeepSeek-R1 model into its operations.

    Training and Counseling Applications

    The People’s Armed Police also employs DeepSeek’s technology for physical training and psychological support initiatives. Sam Bresnick, a research fellow at Georgetown University’s Centre for Security and Emerging Technology, emphasizes that these limited applications allow the PLA to test the technology before applying it to more sensitive missions.

    A state-run outlet, Guangming Daily, underscored DeepSeek’s “increasingly vital role in military intelligentization,” highlighting its ability to quickly analyze real-time battlefield information for improved situational awareness during conflicts. The publication discussed how AI could be applied in amphibious landings, where it might adjust strategic plans in response to changing battlefield conditions.

    Outlook on Future Integration

    Chinese military expert Fu Qianshao stated that DeepSeek illustrates “the PLA’s dedication to staying current and fully leveraging AI technology to enhance overall combat capabilities.” He noted that the ongoing integration of AI in command systems might eventually lead to direct combat applications.

    Recently, the Nanjing National Defence Mobilisation Office published a user manual for DeepSeek, detailing its applications for emergency evacuation strategies, defense education, and resource evaluations. While many experts still consider the US to be ahead in AI advancements, Bresnick points out that the true advantage will go to whichever nation effectively incorporates AI into its military operations.

    Source:
    Link

  • South Korea Halts DeepSeek AI Downloads Over Privacy Concerns

    South Korea Halts DeepSeek AI Downloads Over Privacy Concerns

    Key Takeaways

    1. South Korea has halted new downloads of the Chinese AI chatbot DeepSeek due to data privacy concerns, effective February 15, 2025.
    2. The Personal Information Protection Commission (PIPC) found weaknesses in DeepSeek’s communication features and management of personal information with third parties.
    3. Users are advised to refrain from entering personal information into the chatbot until issues are resolved and compliance with South Korean laws is ensured.
    4. Other countries, including the U.S., Italy, and Australia, have also imposed restrictions on DeepSeek due to security threats and excessive data collection.
    5. DeepSeek faces significant challenges in rebuilding trust and complying with regulations as global scrutiny of its data handling practices increases.


    South Korea has put a stop to new downloads of the Chinese AI chatbot DeepSeek, citing worries about data privacy breaches. This decision was made public by the Personal Information Protection Commission (PIPC) and became effective on February 15, 2025, at 6:00 p.m. local time. Although the app is no longer available for download, users can still access the web version while the company works on fulfilling the necessary regulatory conditions.

    South Korea Takes Action Against DeepSeek AI

    The PIPC stated that its investigation into DeepSeek, which began shortly after the chatbot’s introduction, revealed weaknesses in its communication features and how it manages personal information with third-party providers. The commission made it clear that until these problems are fixed in accordance with South Korea’s Personal Information Protection Act, new downloads of the app will not be permitted. Current users have been told to avoid entering any personal information into the chatbot’s prompts until further notice.

    DeepSeek has admitted that it did not adequately consider South Korean data protection laws prior to its launch and has since appointed a local representative to help ensure compliance with regulations. The government intends to use this situation as a model to enhance guidance and oversight, aiming to prevent similar issues from happening again in the future.

    Increased Scrutiny and Security Concerns

    The suspension comes after increased scrutiny from South Korea’s National Intelligence Service (NIS), which had previously alerted that DeepSeek was collecting user data excessively and might be using it to train its AI models. Furthermore, security analysts found that both the Android and iOS versions of the app were sending certain user data to its servers without encryption, raising even more red flags.

    This situation is part of a larger trend, as global concerns about DeepSeek’s data handling practices have surged. Countries like the United States, Italy, Australia, and Taiwan have already placed restrictions on the AI service for government use, citing security threats. NASA has blocked DeepSeek from its systems, and the U.S. Navy has cautioned its personnel against using the app due to potential data risks.

    Challenges Ahead for DeepSeek

    In the midst of these escalating worries, Beijing claims that it allows international internet firms to operate in China while adhering to local laws and asserts it does not force companies to unlawfully collect or store data. However, as regulatory scrutiny grows around the globe, DeepSeek now faces significant challenges in rebuilding trust and adhering to regulations in various regions.

    Source:
    Link

  • DeepSeek May Soon Be Banned in the US, Says ARM CEO

    DeepSeek May Soon Be Banned in the US, Says ARM CEO

    Key Takeaways

    1. DeepSeek, a Chinese AI firm, claims high efficiency with its V3 model costing only $5.576 million to train, outperforming competitors like OpenAI’s o1 model.
    2. Concerns over national security have arisen, leading to actions such as NASA restricting access to DeepSeek and the U.S. Navy advising against its use.
    3. DeepSeek’s R1 model is reported to be 50-75% more effective than rivals while being free, open source, and capable of running on consumer devices.
    4. The rise of DeepSeek has caused a significant decline in AI-related stocks, with Nvidia losing $600 billion in market value due to fears of cheaper AI models reducing demand for costly chips.
    5. Major U.S. companies, including Microsoft and Perplexity, are adopting DeepSeek’s services, while Chinese smartphone manufacturers are collaborating to create lighter versions of its R1 model for on-device AI.


    DeepSeek is a Chinese AI firm that has really made waves in the industry due to its claims of high efficiency. They say that training their V3 model, which came before the now-popular R1 model, costs only $5.576 million.

    Rising Concerns

    Since the R1 model’s launch, DeepSeek has gained massive recognition globally. However, the CEO of Arm warns that the U.S. government might soon impose a ban on this Chinese AI company. What’s the concern about national security? Let’s dig into the possible reasons behind this.

    A prior report noted that NASA has already restricted DeepSeek from its systems, and the U.S. Navy has advised their personnel against utilizing this AI service.

    Efficiency and Performance

    DeepSeek is said to be 50-75% more effective than its rivals. It manages to perform better than OpenAI’s o1 model in a variety of tests while using less power and computing resources. Moreover, it’s offered for free and is open source, allowing anyone to create their own AI applications based on it.

    Worried about privacy? Users can download the model and run it on their own machines. The models with 7b or 8b parameters (both under 5GB in size) can be set up and used on regular consumer devices.

    Market Impact

    One possible explanation for the CEO’s prediction is that DeepSeek’s global adoption (including in the U.S.) has caused a significant decline in AI-related stocks, with Nvidia losing $600 billion in market value. This decline stems from concerns that cheaper AI models might lessen the demand for costly AI chips.

    DeepSeek has not only topped the Apple app store but has also persuaded U.S.-based giants like Microsoft to use its services, which were previously loyal to OpenAI. Perplexity has also integrated DeepSeek R1 into its system for enhanced reasoning and smarter responses, which they are now utilizing on their servers.

    As we wait to see if the prediction holds true, it will be interesting to watch how U.S. companies already using DeepSeek navigate these developments.

    Additionally, many Chinese smartphone manufacturers, such as Huawei, Honor, Oppo, and ZTE, are collaborating with DeepSeek to create lighter versions of the R1 model tailored for their on-device AI requirements.