Tag: GPT-4o

August 13, 2025
GPT-5 Launch Issues Prompt Quick Comeback of GPT-4
Key Takeaways
1. OpenAI launched GPT-5 as their most powerful model but faced issues with its rollout, leading to the reintroduction of GPT-4o for ChatGPT Plus users.
2. GPT-5 uses a real-time routing system to select the best engine for prompts, but some technical problems caused user dissatisfaction with its performance.
3. Users expressed concerns about GPT-5’s “personality,” preferring the warmer tone of GPT-4o and feeling that GPT-5 was more robotic.
4. A significant error in benchmark data during the launch was acknowledged by CEO Sam Altman, leading to an update with corrected information.
5. OpenAI plans to monitor user feedback and may allow Plus users to continue using GPT-4o while assessing the trade-offs between the two models.
OpenAI has rolled out GPT-5, calling it their new leading model and claiming it’s “the most powerful model yet.” Just a day later, however, CEO Sam Altman admitted the launch didn’t go smoothly. He announced that GPT-4o would be available again for ChatGPT Plus users while they assess how the new model is being used. This swift change came after many users complained that the update took away a familiar experience without notice.
Features of GPT-5
GPT-5 merges previous models using a real-time routing system designed to select the best engine for different prompts. Sometimes it provides answers right away, but other times it takes longer to “think” for more complex questions. Altman mentioned that some outages affecting the autoswitching feature made users feel that GPT-5 wasn’t as capable, and he promised to tweak the decision-making process and offer more clarity on which model is responding to user requests.
User Sentiments
Aside from worries about performance, the launch stirred up discussions about AI “personality.” Many loyal users felt that GPT-4o had a warmer, friendlier tone, while they described GPT-5 as feeling more distant and robotic, even though it might be technically proficient. Various discussions on Reddit and posts on X displayed users’ frustrations over the lack of model choice, with some even threatening to cancel their subscriptions if GPT-4o didn’t come back. OpenAI indicated that the future availability of GPT-4o will hinge on actual user demand.
Launch Issues
The launch also faced a hiccup due to a charting mistake. A bar chart related to SWE-bench Verified seemed to inaccurately show benchmark results, with some bars appearing disproportionately tall for lower scores. Altman later referred to this as a “mega chart screwup.” OpenAI updated their blog with corrected data, and additional documentation from August 7, 2025, now reflects the accurate figures discussed.
In a recent Reddit AMA, Altman answered numerous questions, reiterating that GPT-5 would “seem smarter” as the routing system improves and promising to “continue listening to feedback.” He also mentioned that the company is “considering allowing Plus users to stick with 4o” while they collect more data on the trade-offs between the models. For the time being, OpenAI intends to watch how users interact with the models before making decisions about how long to keep GPT-4o alongside GPT-5.
Source:
Link

Tags: GPT-4o, GPT-5, OpenAI
March 28, 2025
GPT-4o Can Generate Images with Near-Perfect Text
Key Takeaways
1. GPT-4o now features image generation that allows for precise text rendering, improving the quality of visuals created from textual prompts.
2. The model uses an interactive approach for dynamic image creation, enabling users to modify images step-by-step based on their ideas.
3. Users can merge elements from different images and produce clear text, marking a significant advancement over previous AI image generation technologies.
4. GPT-4o can handle 10-20 elements in a scene, surpassing competitors that typically manage only 5-8, making it easier to visualize complex ideas.
5. Despite its advancements, limitations like bottom cropping, hallucinations, and challenges with non-Latin scripts still exist.
OpenAI’s GPT-4o, which was introduced nearly a year ago, just received a significant update: it now includes image generation with incredibly precise text rendering. This new capability allows users to create intricate, high-quality visuals from textual prompts and engage in conversation to adjust these images until they align with their ideas—no more nonsensical signs or strange letters that earlier AI models produced.
Dynamic Image Creation
Unlike traditional methods of generating images by simply refining a single prompt, GPT-4o employs a more interactive technique. You begin with a straightforward request—like a cat—and then discuss modifications to capture your vision: perhaps adding a detective hat, a monocle, or any other detail you desire.
Step-by-Step Modifications
OpenAI provides examples that illustrate this process: users can construct and alter scenes incrementally, merging elements from various images into a single, unified result. The model excels at producing clear text on signs or items, a significant improvement over the distorted outputs of past AI image generation technologies.
Impressive Capabilities
Importantly, OpenAI acknowledges some selective showcasing—many images are labeled as “best of 2” or “best of 8″—but the outcomes remain impressive, particularly given the very user-friendly interface. GPT-4o can even start with your own photo and apply changes, managing 10-20 elements in a scene while competitors struggle with just 5-8. Just last week, I tried to recreate a scene from The Count of Monte Cristo, and it was quite challenging. Now, with GPT-4o’s image generation, not only will the images produced feature readable text, but it will also be significantly easier to turn your imagination into reality.
Some Limitations
However, it’s not without its flaws. OpenAI points out issues like bottom cropping, persistent hallucinations, difficulties with non-Latin scripts, and problems when exceeding 20 objects. Still, the capacity to create intricate, text-filled images using simple English distinguishes GPT-4o from its predecessors. If you’re working on a poster design, this tool offers the accuracy and flexibility that older models could only wish for.
Tags: GPT-4o, Image Generation, OpenAI
October 24, 2024
Apple Critiques AI Photos: iPhone Should Capture Reality, Not Fantasy
Apple’s software chief, Craig Federighi, recently shared insights with the Wall Street Journal about Apple Intelligence and the upcoming AI features that will be launched next week for users in the US with the release of iOS 18.1. European users can expect to see these features at a later time. Initially, Apple Intelligence will provide just a handful of features, utilizing the GPT-4o AI model, the same one that powers ChatGPT, for some functionalities.
Limited Features in iOS 18.1
This cautious approach towards AI, particularly in image processing, appears to be a deliberate choice. In the iOS 18.1 update, Apple introduces just one AI capability in the Photos app called "Clean Up." As demonstrated in the video below, this feature lets users easily erase unwanted items from their photos with a simple tap, much like Google’s Magic Eraser has offered for some time. Federighi mentioned that there were extensive internal debates at Apple about whether the "Clean Up" feature might go too far, as removing objects could mean that a photo no longer accurately represents reality.
Comparison with Competitors
In contrast, Google and Samsung are pushing the boundaries of AI in image editing much more aggressively. Google’s Magic Editor not only has the ability to eliminate objects but can also insert new elements, zoom in on subjects, rearrange them, or even replace the sky to alter the image’s atmosphere. Federighi voiced his worries that such capabilities may lead people to see pictures less as truthful representations and more as imaginative creations. As a result, differentiating between authentic photography and AI-generated images could become increasingly challenging in the future.
Addressing Authenticity in Photography
Adobe has proposed a potential answer with its Content Credentials, a system designed to confirm the authenticity of photos and track their editing history. However, the limitation is that only images taken with cameras compatible with this platform are eligible for verification, including models like the Leica M11-P, Sony A1, A7S III, and A9, as well as the Nikon Z6 III. Some of these camera models will receive support only after a future firmware upgrade.
Tags: Apple, Google Magic Eraser, GPT-4o
July 1, 2024
Solos AirGo Vision: First GPT-4o Smart Glasses
Solos, a company known for its innovative smart glasses, has introduced the AirGo Vision. These smart glasses are the first to feature OpenAI’s latest large language model, GPT-4o.
Solos AirGo Vision Details
The AirGo Vision includes a built-in camera, enabling users to capture their environment and utilize GPT-4o to identify objects and answer related questions. This setup allows for hands-free interaction and easy access to information, similar to what Meta Ray-Ban smart glasses offer.
The AirGo Vision is also compatible with other leading AI models like Google’s Gemini and Anthropic’s Claude. This compatibility expands the glasses’ functionalities, such as asking for directions, summarizing shopping experiences, or obtaining recipes through voice commands.
Additional Features and Design
The smart glasses feature LED notification lights within the frame, which alert users to incoming messages and act as a flash when taking photos with the built-in camera. However, unlike the Meta Ray-Ban smart glasses, the AirGo Vision does not support video recording at this time.
A distinctive feature of the AirGo Vision is its detachable camera, located on the arm rather than embedded in the frame. This design choice provides users with more flexibility and can offer a more traditional look when the camera is removed. Users can also buy additional frames, and the glasses maintain their AI functionalities through audio input even without the camera.
Availability and Pricing
Solos has not yet announced an exact release date, but the AirGo Vision is expected to launch later this year. Pricing details for the camera-equipped glasses are still unknown. However, the company will offer LED-only frames in three styles on their website next month, each priced at $249.99.
Tags: GPT-4o, Solos AirGo Vision
May 14, 2024
OpenAI Releases GPT-4o: Enjoy GPT-4 Premium Features for Free
OpenAI has introduced a new model, GPT-4o, which will become available to the public over the coming weeks. This new model incorporates premium features of GPT-4 and includes an updated web user interface. During the launch event, OpenAI’s CTO Mira Murati showcased several capabilities of this advanced model. Let's explore them in detail.
GPT-4o Announcement
GPT-4o is designed to be more efficient, with enhanced abilities to process both auditory and visual inputs. OpenAI describes this as "a step towards much more natural human-computer interaction." The model can now handle text, images, and audio input, offering seamless assistance to its users. The voice mode has been significantly improved, providing quicker responses and better comprehension.
Previously, the voice mode required three separate models for transcription, intelligence, and text-to-speech functions, which often resulted in delays. In contrast, GPT-4o integrates these functions natively, enabling smoother performance. Using your phone's camera, you can easily share information with the model and ask questions using the voice mode. The new model can respond to voice inputs in just 232 milliseconds, closely matching human response times. It also offers responses in various tones to suit user preferences and has better and faster comprehension of non-English languages compared to GPT-4 Turbo. Additionally, GPT-4o can function as an interpreter.
API and Premium Features
GPT-4o will also be accessible via API, allowing developers to build and enhance AI applications using its advanced capabilities. While the new model's features are available for free, premium users will have access to five times the resources compared to the standard offering.
OpenAI has also released a ChatGPT app for macOS-based desktops. This app provides deeper integration into the macOS platform, aiming to simplify user workflows. With a keyboard shortcut (Option + Space), users can quickly access the tool's conversation page.
In summary, GPT-4o brings several improvements and new features, enhancing the efficiency and versatility of human-computer interactions. The new model's capabilities, combined with the new app for macOS, aim to offer a more integrated and seamless user experience.
Tags: GPT-4o, Voice and Vision Processing

Tag: GPT-4o

GPT-5 Launch Issues Prompt Quick Comeback of GPT-4

Key Takeaways

Features of GPT-5

User Sentiments

Launch Issues

GPT-4o Can Generate Images with Near-Perfect Text

Key Takeaways

Dynamic Image Creation

Step-by-Step Modifications

Impressive Capabilities

Some Limitations

Apple Critiques AI Photos: iPhone Should Capture Reality, Not Fantasy

Limited Features in iOS 18.1

Comparison with Competitors

Addressing Authenticity in Photography

Solos AirGo Vision: First GPT-4o Smart Glasses

Solos AirGo Vision Details

Additional Features and Design

Availability and Pricing

OpenAI Releases GPT-4o: Enjoy GPT-4 Premium Features for Free

GPT-4o Announcement

API and Premium Features