Tag: Gemini AI

  • How Gemini AI Enhances Samsung Galaxy S25 Series Performance

    How Gemini AI Enhances Samsung Galaxy S25 Series Performance

    Samsung has introduced the Galaxy S25 series of flagship smartphones at the Galaxy Unpacked 2025 event. This new lineup is packed with cutting-edge technology, including a custom Snapdragon 8 Elite For Galaxy SoC. Beyond its impressive specifications, the Galaxy S25 series also features advanced AI capabilities with its Gemini system. Here’s a look at what it brings to the table and how it operates.

    Galaxy S25 Series and Its Gemini AI Capabilities

    The Galaxy S25 series utilizes Qualcomm’s Hexagon NPU, enabling enhanced on-device AI functions. This means that for certain AI-driven features and tools, an internet connection isn’t necessary. With the sophisticated Gemini AI, users can access improved accessibility tools that are smoothly integrated into these devices.

    Quick Access to Gemini AI

    The Galaxy S25 series allows Quick Access to Gemini. By long-pressing the side button, users can activate Gemini instantly, which then assists with various tasks or provides relevant information based on what’s currently displayed on the screen.

    Gemini Extensions and AI Tasks

    Gemini Extensions are now integrated into numerous Google and Samsung applications, such as Google Maps, YouTube, Gmail, and Samsung Reminder. This means that Gemini is capable of executing tasks that span multiple applications.

    Samsung has also introduced real-time transcription through Gemini Live. However, it doesn’t stop there; the AI can smartly record, analyze, and simplify crucial conversations. It even has the capability to incorporate images or YouTube videos into these recordings.

    Upcoming Features with Project Astra

    Project Astra is a forthcoming set of features that will debut with the Galaxy S25 series on the Gemini mobile app and will later be available on other Android devices. These features will include screen sharing and live video sharing capabilities.

    Samsung and Google are collaborating closely on these AI enhancements, with additional tools expected to be released in the future. This will include TalkBack, which will provide detailed descriptions of images powered by Gemini, specifically designed to assist users who are visually impaired or have low vision.


  • Google Docs Introduces AI for Easy Formatted Document Creation

    Google Docs Introduces AI for Easy Formatted Document Creation

    Google has rolled out an exciting new tool for Docs that utilizes the Gemini AI model to help users create formatted documents.

    Ease of Document Creation

    As posted on the company’s support pages (via Gadgets360), this feature allows you to request Docs to produce a variety of documents such as proposals, project trackers, document ideas, blog posts, press releases, campaign briefs, dinner party menus, newsletters, itineraries, and even more.

    To get started, users can simply click on "help me create" and provide a brief description of what they need. It is important to include at least one existing document by typing "@filename" for Gemini to generate content effectively.

    Availability and Limitations

    Currently, this feature is exclusively available in Google Workspace Alpha and the initial testing phase known as Google Workspace Labs. Google has indicated that they are gradually making this feature available, but it’s presently limited to desktop users and can only be used in new documents.

    However, there are certain restrictions. Google has pointed out that it cannot "incorporate web search results or content from your Workspace files," nor can it "generate cover or inline images of people." Additionally, it is restricted to "content extraction" from files and is unable to replicate the "structure or style" of those documents.

  • Google Integrates Gemini AI into Maps, Earth, and Waze

    Google Integrates Gemini AI into Maps, Earth, and Waze

    Google is making big changes to its mapping services by introducing significant generative AI updates. The company is integrating Gemini AI into Google Maps, Earth, and Waze, aiming to transform how users interact with geospatial information and solve location-based challenges.

    Advanced Tools for Developers

    The Maps Platform, utilized by over 10 million websites and apps, has introduced a new feature called "grounding with Google Maps." This allows developers to access real-time location data for creating AI-enhanced experiences. By leveraging Maps’ extensive database of 250 million locations, large language models can provide more precise information.

    Rivian’s Exciting Integration

    Rivian is also getting involved with this cutting-edge technology. Starting next month, they will incorporate Gemini-powered Places API features into their vehicle infotainment systems. This means while you’re on the road, you’ll receive AI-generated summaries of nearby attractions like restaurants, stores, and grocery outlets, improving your driving experience.

    A New Era for Urban Planning

    In addition, Google Earth is collaborating with Google Research and X to integrate Gemini. This partnership will bring advanced analytical tools for urban development. The system can address complex spatial inquiries and create custom visuals, reducing analysis time from several days to mere minutes. For transportation planners, this tool will help determine optimal locations for new electric vehicle charging stations based on actual demand.

    For U.S. Google Maps users on Android and iOS, Gemini AI is enhancing searches with more relevant and contextual results. According to Chris Phillips, VP and General Manager of Geo at Google, it cross-references information from Maps’ database and user reviews to boost accuracy.

    New Features and Expansions

    They’ve also rolled out some cool new features, like improved route exploration with landmark suggestions, information about parking availability at your destination, and enhanced walking navigation details. Moreover, the immersive view has expanded to over 150 cities globally, with better lane information expected next month.

    Waze Joins the AI Revolution

    Waze is also getting in on the action. They are incorporating AI-driven natural language processing for reporting road incidents, allowing users to simply speak about road conditions instead of selecting icons. Later this year, the app will begin providing alerts for school zones for both iOS and Android users.

  • Google Developing AI Agent to Control Web Browsers

    Google Developing AI Agent to Control Web Browsers

    According to a report from The Information, Google is developing an AI tool that can manage web browsers to make boring tasks easier, like filling out forms or reserving flights.

    Project Jarvis Unveiled

    This AI agent, known as Project Jarvis, is set to launch alongside the upcoming Gemini AI model, which might be released in December of this year. The name "Jarvis" stands for "Just Another Very Intelligent System," inspired by a fictional AI helper in the Marvel films who assists Tony Stark.

    Features of the AI Agent

    Google plans to restrict the agent’s functionality to browsers like Chrome. It will assist users with activities such as booking cinema tickets or buying goods online. People will have the ability to interact with the agent directly and give commands for various tasks.

    If this sounds a bit like something you’ve heard before, it’s because it bears a resemblance to Anthropic’s recent Claude 3.5 Sonnet, which enables app developers to "guide Claude to operate computers like humans do". OpenAI is also believed to be creating similar solutions.

    The Information, Anthropic, Reuters, Image Source.

  • Google TV Streamer 4K Unveiled: Replaces Chromecast Officially

    Google TV Streamer 4K Unveiled: Replaces Chromecast Officially

    Google has revealed its new Google TV Streamer 4K, which will serve as the official replacement for the Chromecast. While the Chromecast has reached its end, the Google TV Streamer 4K will carry its legacy forward with enhanced capabilities and integrated AI features.

    Google TV Streamer 4K

    The latest offering from the tech giant features a compact set-top box design, making it suitable to place in front of your TV instead of plugging it in at the back, as was the case with the Chromecast. In addition to performance enhancements, what distinguishes the Google TV Streamer from the Chromecast is the inclusion of Gemini AI. This generative AI developed by Google can assist users in various tasks, such as finding answers and providing comprehensive summaries, reviews, and more about the content they are about to watch.

    Google TV Streamer 4K in Hazel

    This new model comes with an upgraded processor that boasts a 22 percent increase in speed over its predecessor, along with 4GB of RAM (double the previous generation) and 32GB of internal storage. Google claims that the TV Streamer offers faster app load times and overall performance upgrades. As indicated by its name, it supports 4K HDR, and it also includes Dolby Vision and Dolby Audio for a more immersive viewing experience. Connectivity features include a USB Type-C port, an HDMI 2.1 port, an Ethernet port, Bluetooth 5.1, and dual-band WiFi. Additionally, it can function as a smart home hub, supporting Google Home Panel, which allows users to control their smart home devices.

    Pricing and Availability

    The Google TV Streamer 4K is available in two matte color options: Hazel and Porcelain. It has been launched in the US with a price tag of 99.99 US Dollars. Pre-orders are currently open, with the first sale scheduled to begin on 24th September 2024. The device can be purchased through the Google Store or offline retailers.

  • Pixel 9 Pro Fold, Pixel 9 Pro Launching August 14 in India

    Pixel 9 Pro Fold, Pixel 9 Pro Launching August 14 in India

    Google has recently announced a global launch event on August 13 to unveil the Pixel 9 series and the Pixel Watch 3. This marks a departure from their usual schedule, which typically sees Pixel phones being released in the first week of October. Additionally, the Indian division of Google has confirmed that the Pixel 9 Pro and Pixel 9 Pro Fold will be launched in India at 10:30pm IST on August 14. Here’s a closer look at what these new devices will offer.

    The Pixel 9 Pro’s official teaser features a raised, horizontal camera visor with rounded corners. This visor includes three cameras, an LED flash, and a temperature sensor. While the Pixel 9 Pro has been teased for the Indian market, it remains uncertain whether the Pixel 9 and the rumored Pixel 9 Pro XL will also be available in India.

    Pixel 9 Pro Fold Details

    The Google Pixel 9 Pro Fold will be the brand’s second-generation foldable phone. According to the official teaser, its camera island is located in the upper-left corner and seems to house three cameras, an LED flash, and a microphone. The teaser video also reveals the inner design of the Pixel 9 Pro Fold. Although the teaser doesn’t show an internal camera, leaks have confirmed its presence. Unlike the original Pixel Fold, which had its internal camera on the bezel, the Pixel 9 Pro Fold is expected to feature a screen cutout for the internal camera in the top-left corner.

    Integration of Gemini AI

    Google has also announced the integration of Gemini AI into the upcoming Pixel phones. The foldable model will be available in Porcelain and Obsidian color options. While Google India has only teased the Porcelain variant of the Pixel 9 Pro, it is expected that more color options will be available.


    Pixel 9 Pro Fold, Pixel 9 Pro Launching August 14 in India
  • Google Confirms Alteration of Gemini AI Demo in Recent Report

    Google Confirms Alteration of Gemini AI Demo in Recent Report

    Google’s Gemini AI Model Faces Skepticism Over Demo Video

    Google recently introduced its new Gemini AI model, positioning it as a competitor to OpenAI’s GPT-4. However, the authenticity of Gemini’s demo video has come under scrutiny following a report by Bloomberg.

    Gemini Outperforms GPT-4

    According to Google, the Gemini model surpasses GPT-4 in terms of performance. It achieved a score of 90.04% on the MMLU benchmark, while GPT-4 scored 87.29%.

    Questionable Demo Video

    Google also released a demo video showcasing Gemini’s impressive capabilities. In the video, Gemini interacted with humans in real-time, demonstrating its ability to understand and respond to complex visuals and prompts seamlessly.

    Edited Video Raises Doubts

    However, Bloomberg’s investigation revealed that the video was edited and did not accurately represent real-time interactions. The “live” demonstration relied on still image frames and pre-written prompts, rather than actual responses in real-time.

    Google’s Response

    In response to the controversy, Google co-lead Oriol Vinyals defended the video. He claimed that the purpose of the video was to inspire developers by showcasing the potential of Gemini-powered user experiences. However, many believe that his statement failed to address the discrepancy between the video’s presentation and the actual capabilities of the technology.

    Lack of Transparency

    Even Google’s official disclaimer for the video on YouTube mentions “reduced latency” and “shortened outputs.” However, it does not fully disclose the extent of the editing that Bloomberg uncovered. This discrepancy has diminished the appeal of Gemini AI for many.

    Past Criticisms

    This is not the first time Google has faced criticism for such practices. Earlier in 2023, a rushed demo of its Bard AI resulted in significant errors, damaging the company’s reputation.

    Communication Challenges

    Despite being a leader in machine learning research, Google seems to struggle with effectively communicating the capabilities of its AI tools. Only time will tell whether Gemini can live up to the initial hype.