Category: Artificial intelligence

  • Google Unveils Gemini 2.0 Models for New Agentic Era

    Google Unveils Gemini 2.0 Models for New Agentic Era

    Nine months after the introduction of Gemini 1.5, Google has unveiled the next significant update for its Large Language Model (LLM), named Gemini 2.0. The first model in this new lineup, Gemini 2.0 Flash, is now available as an experimental option in Google AI Studio and Vertex AI.

    Improved Speed and Functionality

    Gemini 2.0 Flash boasts "enhanced performance at similarly fast response times" and is said to be "twice as fast" compared to 1.5 Flash. The upgraded LLM supports various modes of input, including images, text, video, and audio. Additionally, it can handle mixed media, combining pictures with text, as well as multilingual text-to-speech audio.

    New Features and APIs

    This new version allows direct access to Google Search and accommodates third-party code execution along with predefined functions. Google is also launching a Multimodal Live API for developers to utilize. A version of 2.0 Flash optimized for chat will soon be accessible on both desktop and mobile browsers, with plans for a release on the Gemini mobile app in the near future.

    Advanced Prototypes

    Google’s Project Astra, a research prototype, has been upgraded with Gemini 2.0, showing improvements in dialogue, reasoning abilities, and native integration with tools such as Google Search, Lens, and Maps. This prototype can maintain up to 10 minutes of memory during a session.

    Another research effort, Project Mariner, utilizes 2.0 to comprehend complex instructions and retrieve data from a browser screen. This includes analyzing "pixels and web elements like text, code, images and forms," and employing an experimental Chrome extension to assist users in completing tasks.

    AI Code Assistant

    The third prototype, an experimental AI code assistant named Jules, can be seamlessly integrated into GitHub workflows. It possesses reasoning and logic skills to address coding challenges and formulate solutions under the supervision of developers.

    Google also revealed that it has created AI agents "using Gemini 2.0" capable of assisting users in navigating the virtual realms of video games. These agents can analyze game dynamics based solely on on-screen actions and provide real-time suggestions on what to do next in a conversational manner.

    Source: Link

  • FBI Alerts Public to Rise in AI Fraud and Voice Cloning Scams

    FBI Alerts Public to Rise in AI Fraud and Voice Cloning Scams

    The FBI has raised an alert about the increasing use of generative AI in criminal acts, especially in fraud. They indicate that these AI tools allow criminals to execute scams more effectively and with greater detail.

    AI in Fraudulent Activities

    The recent warning showcases how AI assists fraudsters in creating fake social media accounts, crafting persuasive phishing emails, and establishing fraudulent cryptocurrency investment websites. Although producing fake content itself isn’t against the law, its application in fraudulent activities is a growing issue for law enforcement agencies.

    Advanced Techniques Used by Criminals

    Criminals are also leveraging AI to produce lifelike profile images, forge identification documents, and fabricate false celebrity endorsements for imitation products. Scams involving market manipulation are also being powered by AI. Some groups have even resorted to voice cloning to deceive individuals during emergency scams, impersonating relatives in distress. Additionally, AI can generate realistic video footage of executives or authority figures, enhancing the believability of these schemes.

    Recognizing AI-Generated Content

    The FBI’s warning emphasizes the increasing difficulty in distinguishing AI-generated content from real material. Be cautious of indicators like distorted hands or faces, unusual shadows, or unnatural movements in images or video clips.

    To safeguard yourself, the FBI advises establishing secret phrases with family members and verifying any financial solicitations through reliable channels. They underline the importance of not disclosing sensitive information or sending money to individuals you only know through the internet.

    This alert arrives as AI tools continue to advance and become more readily available, complicating the efforts of law enforcement to keep pace.

    Source: Link

  • Microsoft AI Chief Predicts Conversational AI Will Replace Browsers by 2029

    Microsoft AI Chief Predicts Conversational AI Will Replace Browsers by 2029

    Microsoft AI CEO Mustafa Suleyman forecasts that within the next three to five years, conversational AI will take over as the main way for users to interact with online content, replacing traditional web browsers.

    Changing the Landscape of Online Interaction

    In a recent chat with The Verge, Suleyman expressed his views on how interfaces powered by AI could transform the way we search and explore the internet. He criticized classic search engines, highlighting that their dependence on structured searches and the outdated "10 blue links" approach is no longer effective.

    Progress in AI Development

    Suleyman is in charge of Microsoft’s consumer products like Bing, Edge, MSN, and Copilot. His team is making strides in minimizing AI hallucinations, a significant hurdle in creating more dependable AI interactions. The long-term collaboration between Microsoft and OpenAI is driving these enhancements forward.

    A Cautious Perspective on AGI

    Regarding artificial general intelligence (AGI), Suleyman takes a more measured view compared to some of his peers in the industry. While OpenAI’s Sam Altman believes AGI could be imminent with current technologies, Suleyman predicts it will take between two to ten years. He characterizes AGI not as superintelligence but rather as a system capable of performing most knowledge-based tasks efficiently.

    Distinct Personalities in AI Companions

    Microsoft’s approach focuses on crafting AI companions that possess unique personalities and emotional intelligence. Suleyman is confident that these attributes will differentiate their products in a competitive landscape. Apple stands as a formidable rival, leveraging its dominance over iOS distribution channels to maintain a strong position.

    The Shift Towards AI Interfaces

    These advancements indicate a definitive shift towards AI-driven interfaces, with companies like Google and OpenAI also exploring similar initiatives, such as Google’s Jarvis and possible ChatGPT-boosted browsers.

    Source: Link

  • New Memory Device Functions at Temperatures Over 1,100°F

    New Memory Device Functions at Temperatures Over 1,100°F

    Engineers from the University of Michigan have created a new form of solid-state memory that can store and rewrite information at temperatures exceeding 1,100°F (600°C), which is hotter than the surface of Venus.

    A Shift from Traditional Memory

    This innovative device is not like standard silicon-based memory, which is limited to functioning under 300°F (150°C). Instead, it operates by moving oxygen ions to transfer data, rather than depending on electron movement. This advancement could pave the way for electronics designed for extreme environments such as fusion reactors, jet engines, and geothermal wells.

    Insights from Researchers

    Yiyang Li, an assistant professor in materials science and engineering, and the lead author of the study, mentioned, "It could enable electronic devices for high-temperature applications that didn’t exist before." To write data, the device requires temperatures of at least 500°F (250°C). However, researchers believe that a heater could assist in cooler settings. At present, it can only store one bit of data, but the team is optimistic that it could eventually store much larger amounts, such as megabytes or gigabytes, with additional development.

    Implications for AI Technology

    This advancement could prove particularly beneficial for artificial intelligence in harsh environments. Alec Talin, a senior scientist at Sandia National Laboratories, noted, "There’s a lot of interest in using AI to improve monitoring in these extreme settings, but they require beefy processor chips that run on a lot of power, and a lot of these extreme settings also have strict power budgets."

    By allowing in-memory computing, this new technology could process data prior to sending it to AI processors, leading to energy savings in challenging conditions.

    Source: Link

  • OpenAI CFO Calls Donald Trump President of the AI Generation

    OpenAI CFO Calls Donald Trump President of the AI Generation

    During a discussion at the Reuters NEXT conference held in New York, OpenAI’s CFO Sarah Frair responded to inquiries about the potential influence of President-elect Donald Trump, suggesting he might become the "president of this AI generation."

    Trump’s Timing with AI Advancements

    Frair expressed that Trump will assume office just as essential infrastructure is established for a significant advancement in AI, specifically Artificial General Intelligence (AGI). "He’ll be right there at the onset, perhaps even as we approach things like AGI," remarked Frair.

    AGI represents a theoretical AI capable of human-like reasoning and adaptable enough to tackle various tasks across multiple fields. Leading tech companies, including Google, Microsoft, Apple, and Amazon, are already heavily invested in AGI research.

    OpenAI’s Sora and Its Popularity

    OpenAI’s Sora video generator has garnered considerable interest. Initially introduced in February of this year, Sora has been launched in a limited form, with the company halting new account sign-ups due to high web traffic.

    Frair explained that the restricted access is a result of "capacity, but a lot is also about wanting to be cautious…it’s available only to a very small group of users at this time because we aim to listen and learn."

    Prioritizing Safety in AI Development

    "There are instances where we will proceed a bit more slowly to ensure we are continually prioritizing safety," Frair added.

    Discussing Elon Musk, who has voiced his disapproval of OpenAI’s for-profit model, Frair noted that they trust Musk as a competitor, stating that he "will prioritize the national interest and engage in fair competition."

    The Future of AI Agents

    Frair also predicts an increase in AI agents being deployed soon. These agents are independent bots designed to perform specific tasks autonomously, without needing human oversight.

    "I believe we will witness significant activity surrounding agents next year, and I think many will be astonished at how quickly this technology arrives," Frair shared with Reuters.

    Source: Link

  • Google Urges FTC to Block Microsoft’s OpenAI Cloud Partnership

    Google Urges FTC to Block Microsoft’s OpenAI Cloud Partnership

    Google has requested the US Federal Trade Commission (FTC) to look into Microsoft’s exclusive cloud service deal with OpenAI. The Information, as reported by Reuters, indicates that this discussion took place while the FTC was probing Google about Microsoft’s business practices as part of a wider investigation.

    Microsoft and OpenAI’s Growing Partnership

    The alliance between Microsoft and OpenAI started back in 2019 when Microsoft made an initial investment of one billion dollars, which has now surged to $13 billion. In return for this investment, Microsoft gained exclusive rights to provide hosting for OpenAI’s services on its cloud platform. Notably, Microsoft intervened to prevent the ousting of Sam Altman last year.

    Shift from Non-Profit to For-Profit

    OpenAI was founded in 2015 as a non-profit research organization, but things took a turn with the establishment of OpenAI Global in 2019, which operates as a for-profit branch.

    According to a report from The Financial Times, OpenAI may be thinking about dropping a clause related to Artificial General Intelligence (AGI) in their agreement, which would have limited Microsoft’s access to more advanced models in exchange for additional investments. Recently, OpenAI rolled out a subscription plan priced at $200, named ChatGPT Pro, aimed at researchers and engineers.

    Impact on Competitors

    Competing companies in the cloud market, such as Google and Amazon, find themselves having to rent Microsoft’s servers, even if they are mainly focused on utilizing OpenAI’s technology. Microsoft’s rivals argue that this could lead to higher costs for consumers.

    Source: Link

  • Grok Unveils Advanced Image Generation Model with Text and Face Features

    Grok Unveils Advanced Image Generation Model with Text and Face Features

    xAI has recently introduced an image generation feature to Grok, marking a significant enhancement for the platform. Currently, this feature is accessible to X users in select countries, with a worldwide launch anticipated in approximately one week.

    Advanced Image Creation

    The image generator, which was originally named Aurora, is now integrated into the Grok family. It employs a sophisticated autoregressive mixture-of-experts system that has been trained on billions of examples sourced from the internet. In simple terms, it can foresee the next pieces of information by merging text and visuals, enabling it to produce far more lifelike images than before.

    Enhanced Functionality

    However, the capabilities extend beyond just generating images from nothing. This system can also modify existing images, allowing users to adjust them or draw inspiration for new designs. According to xAI, the model excels particularly in areas where other generators tend to falter, such as accurately rendering text, logos, and human faces.

    Continuous Improvement

    This update follows the launch of Grok 1.5V in April, which provided the platform with its initial experience in visual processing. xAI has plans for ongoing improvement—currently, they are enhancing their Colossus supercomputer located in Memphis, which already boasts 100,000 Nvidia H100 and H200 GPUs, with intentions to soon double that capacity.

    The timing of this release is noteworthy, especially since OpenAI has just unveiled its own video generation model, Sora. This development highlights the intensifying competition in generative AI among major industry players.

    Source: Link

  • Amazon Launches New AI-Agent R&D Lab in San Francisco

    Amazon Launches New AI-Agent R&D Lab in San Francisco

    Amazon has launched a new research and development lab in San Francisco aimed at establishing "foundational" abilities for AI agents. This initiative will be headed by David Luan, who co-founded the AI startup Adept and previously served as its CEO.

    Leadership Background

    David Luan has an impressive background, having worked as the vice president of Engineering at OpenAI and spent a year in a leadership role at Google Research. He started Adept in 2022 and then transitioned to Amazon, where he now leads the Artificial General Intelligence (AGI) lab in San Francisco.

    Strategic Hiring

    In June, Amazon brought Luan on board along with his co-founders Augustus Odena, Maxwell Nye, Erich Elsen, and Kelsey Szot. This move was part of a larger agreement that allows Amazon to use certain technology licenses from Adept. The startup had recently secured $350 million in a Series B funding round in March 2023, reaching a valuation of $1 billion.

    The AGI SF team is set to collaborate closely with Amazon’s extensive research group to develop AI agents capable of "taking actions in both digital and physical environments." Their primary goal is to create AI agents that can "carry out real-world tasks, learn from feedback provided by humans, self-correct, and understand our objectives."

    Source: Link

  • Microsoft Launches Copilot Vision Beta for Select Pro Subscribers

    Microsoft Launches Copilot Vision Beta for Select Pro Subscribers

    Microsoft Copilot Labs has launched beta testing for Copilot Vision, which is exclusive for some invited Copilot Pro subscribers. This new Vision AI monitors what users are doing in the Microsoft Edge browser to offer help, information, and tips in real-time.

    Integration with Microsoft Products

    The Copilot AI is built into the newest editions of Microsoft Windows, Edge, and Office. It responds to user prompts through text input, providing answers and support. With the addition of Copilot Vision, users no longer need to describe visual elements like objects and maps in text, as the AI can recognize everything happening within Microsoft Edge.

    Enhancing the Gaming Experience

    Gamers can benefit from the advice and insights Vision provides during gameplay, although it currently can’t control games directly. While users browse the web, the Vision AI identifies objects, assisting them in comparing items for purchases such as hotels, toys, or other goods. It can also provide specific product details, like washing instructions for clothing. For those who are unsure about what to buy or how to spend their day, they can ask the AI for recommendations, making life easier for busy individuals.

    Limited Availability and Data Management

    At the moment, Copilot Vision is restricted to a small number of websites during its beta phase, but this selection will grow in the future. The visual information and user interactions that Copilot Vision gathers during a session are erased once the session concludes, but Microsoft retains all the responses generated by the AI.

    People bogged down by endless meetings might find it helpful to get a Plaud AI voice recorder (available on Amazon) that can automatically transcribe and summarize what they missed.

    For more information, visit Microsoft Copilot Labs, check out the Microsoft Copilot blog, or watch Microsoft Copilot on YouTube, and don’t forget to review the Microsoft Privacy Statement.

  • X Unveils New Image Generator for Limited Time

    X Unveils New Image Generator for Limited Time

    xAI, the AI startup started by Elon Musk, launched a new image generator called Aurora over the weekend, but then quickly took it down again. The company shared news about this generator, and Musk himself said that it was in beta.

    Quick Removal

    Just a few hours after Aurora was made available, the model was pulled offline. The option to choose it in Grok’s model picker was removed. TechCrunch had the chance to try out the model and noted that it did not have any restrictions regarding public figures or celebrities.

    Creative Outputs

    Some users who were able to access the generator shared some fun images. Among these were pictures of Adam Sandler and Ray Romano on a sitcom set, Sam Altman riding a giraffe, and a boxing match between Mickey Mouse and Luigi.

    Specifications and Future Improvements

    Details about the model’s specifications are not clear, but Musk mentioned that it was an internal model in beta that would “improve very fast.” Recently, the social media platform owned by Musk made Grok free for all users, but with certain limitations.

    TechCrunch’s coverage highlights the excitement and mystery surrounding the sudden launch and removal of the Aurora image generator.