Tag: GPT-4o

  • Apple Critiques AI Photos: iPhone Should Capture Reality, Not Fantasy

    Apple’s software chief, Craig Federighi, recently spoke with the Wall Street Journal about Apple Intelligence and the AI features launching next week for US users with the release of iOS 18.1. European users will get these features later. Initially, Apple Intelligence will offer just a handful of features, some of which rely on GPT-4o, the same AI model that powers ChatGPT.

    Limited Features in iOS 18.1

    This cautious approach to AI, particularly in image processing, appears to be a deliberate choice. In the iOS 18.1 update, Apple introduces just one AI capability in the Photos app, called "Clean Up." This feature lets users erase unwanted items from their photos with a simple tap, much as Google’s Magic Eraser has done for some time. Federighi said there were extensive internal debates at Apple about whether "Clean Up" might go too far, since removing objects could mean that a photo no longer accurately represents reality.

    Comparison with Competitors

    In contrast, Google and Samsung are pushing the boundaries of AI in image editing much more aggressively. Google’s Magic Editor can not only remove objects but also insert new elements, zoom in on subjects, rearrange them, or even replace the sky to alter the image’s atmosphere. Federighi voiced his concern that such capabilities may lead people to see pictures less as truthful representations and more as imaginative creations. As a result, distinguishing authentic photography from AI-generated images could become increasingly difficult.

    Addressing Authenticity in Photography

    Adobe has proposed a potential answer with its Content Credentials, a system designed to confirm the authenticity of photos and track their editing history. The limitation is that only images taken with compatible cameras can be verified. These include the Leica M11-P, the Sony A1, A7S III, and A9, and the Nikon Z6 III. Some of these models will gain support only after a future firmware upgrade.

  • Solos AirGo Vision: First GPT-4o Smart Glasses

    Solos, a company known for its innovative smart glasses, has introduced the AirGo Vision. These smart glasses are the first to feature OpenAI’s latest large language model, GPT-4o.

    Solos AirGo Vision Details

    The AirGo Vision includes a built-in camera, so users can capture their surroundings and have GPT-4o identify objects and answer related questions. This setup allows for hands-free interaction and easy access to information, similar to what Meta Ray-Ban smart glasses offer.

    The AirGo Vision is also compatible with other leading AI models, such as Google’s Gemini and Anthropic’s Claude. With voice commands, users can ask for directions, get a summary of a shopping trip, or look up recipes.

    Additional Features and Design

    The smart glasses feature LED notification lights within the frame, which alert users to incoming messages and act as a flash when taking photos with the built-in camera. However, unlike the Meta Ray-Ban smart glasses, the AirGo Vision does not support video recording at this time.

    A distinctive feature of the AirGo Vision is its detachable camera, located on the arm rather than embedded in the frame. This design choice provides users with more flexibility and can offer a more traditional look when the camera is removed. Users can also buy additional frames, and the glasses maintain their AI functionalities through audio input even without the camera.

    Availability and Pricing

    Solos has not yet announced an exact release date, but the AirGo Vision is expected to launch later this year. Pricing for the camera-equipped glasses is still unknown. However, the company will offer LED-only frames in three styles on its website next month, each priced at $249.99.

  • OpenAI Releases GPT-4o: Enjoy GPT-4 Premium Features for Free

    OpenAI has introduced a new model, GPT-4o, which will become available to the public over the coming weeks. This new model incorporates premium features of GPT-4 and includes an updated web user interface. During the launch event, OpenAI’s CTO Mira Murati showcased several capabilities of this advanced model. Let's explore them in detail.

    GPT-4o Announcement

    GPT-4o is designed to be more efficient, with enhanced abilities to process both auditory and visual inputs. OpenAI describes this as "a step towards much more natural human-computer interaction." The model can now handle text, images, and audio input, offering seamless assistance to its users. The voice mode has been significantly improved, providing quicker responses and better comprehension.

    Previously, the voice mode chained three separate models for transcription, intelligence, and text-to-speech, which often resulted in delays. In contrast, GPT-4o handles these functions natively in a single model, enabling smoother performance. Using your phone's camera, you can share what you see with the model and ask questions about it in voice mode. The new model can respond to voice input in as little as 232 milliseconds, close to human response times in conversation. It also offers responses in various tones to suit user preferences and understands non-English languages better and faster than GPT-4 Turbo. Additionally, GPT-4o can function as an interpreter.

    API and Premium Features

    GPT-4o will also be accessible via API, allowing developers to build and enhance AI applications using its capabilities. While the new model's features are available for free, paying users get up to five times the usage limits of free users.

    OpenAI has also released a ChatGPT desktop app for macOS. The app integrates more deeply with macOS, aiming to simplify user workflows. With a keyboard shortcut (Option + Space), users can quickly open the tool's conversation window.

    In summary, GPT-4o brings several improvements and new features, enhancing the efficiency and versatility of human-computer interactions. The new model's capabilities, combined with the new app for macOS, aim to offer a more integrated and seamless user experience.