Tag: OpenAI

  • AI Models and Poetry: Why They Struggle with Poetic Prompts

    AI Models and Poetry: Why They Struggle with Poetic Prompts

    Key Takeaways

    1. Safety systems in AI models are intended to prevent harmful or unethical content, but a study shows these protections can be easily bypassed.
    2. Researchers found that hand-crafted poetic prompts evaded safety protocols about 62% of the time, while automatically generated poems succeeded around 43% of the time.
    3. The vulnerability in language models arises because safety filters are primarily trained on direct, factual language, struggling with metaphorical and creative expressions.
    4. The study highlights a stylistic flaw in large language models, revealing a new aspect of AI safety.
    5. The findings have sparked widespread discussion online, with mixed reactions about the implications for AI safety.


    OpenAI and other similar firms dedicate a lot of effort and resources into creating safety systems to stop their AI models from producing harmful or unethical content. However, a study released on November 19, 2025, indicates that these protections can be easily evaded. The research reveals that just a few cleverly crafted poetic prompts can bypass these defenses.

    Research Insights

    Researchers from DEXAI, Sapienza University of Rome, and the Sant’Anna School of Advanced Studies examined 25 language models from nine different providers, utilizing both hand-crafted and automatically generated poetry. On average, the hand-crafted poems with harmful directives managed to circumvent safety protocols about 62% of the time, while poems created automatically had a success rate of around 43%. In some instances, the models’ defenses were compromised more than 90% of the time.

    Understanding the Vulnerability

    The researchers noted that this vulnerability arises because safety filters in language models are mainly trained on direct, factual language. When faced with poetic inputs, which are full of metaphor, rhythm, and rhyme, the models often perceive them as creative expressions instead of potential threats. The Adversarial Poetry study uncovers a new aspect of AI safety, pointing out a stylistic flaw in large language models. This topic has also been discussed extensively on Reddit, where many users find it “pretty interesting” or “cool,” while others share genuine worries about what this means for AI safety.

     

    Source:
    Link


     

  • Sora 2: OpenAI’s Unpredictable AI Video Creation Tool

    Sora 2: OpenAI’s Unpredictable AI Video Creation Tool

    Key Takeaways

    1. Environmental Realism: Sora 2 excels in creating realistic environments, with accurate lighting, reflections, and interactions, making scenes feel immersive and cinematic.

    2. Inconsistent Performance: While Sora 2 can produce stunning visuals, it often struggles with complex human interactions and choreography, leading to moments of surreal distortion.

    3. Democratization of Video Creation: The technology allows anyone to create visually engaging content easily, raising questions about creative ownership and the nature of originality in AI-generated art.

    4. Sound Quality Issues: Despite impressive visuals, Sora 2’s audio capabilities remain weak, often resulting in poor sound quality that detracts from the overall experience.

    5. Cultural and Artistic Implications: Sora 2 represents a shift in filmmaking, emphasizing collaboration between AI and humans, while highlighting the challenges of balancing creativity, control, and copyright concerns.


    Where many AI video systems have difficulty maintaining continuity, Sora 2 excels in understanding how environments interact. Shadows change in realistic ways. Water surfaces ripple when objects approach. Reflections can be seen in glass and puddles, avoiding random angles.

    OpenAI claims Sora 2 can create “complex scenes with multiple characters, specific movements, and detailed backgrounds that stay consistent over time.” While this is accurate, “consistent” is subjective in this context.

    Ambitious Testing

    The first test was bold:

    “A tranquil day in Central Park, New York. Tourists wander under gentle sunlight. Suddenly, a figure in flowing robes activates a glowing staff and urges everyone to find safety. Another dark-robed figure emerges from the trees, wielding a red weapon. The crowd disperses as the two engage in a duel—elegant, dance-like motions, glowing arcs of light reflecting off the nearby water. The camera circles continuously as autumn leaves swirl around.”

    What resulted was visually captivating—soft golden hues, smooth camera movement, and realistic reflections. But then chaos ensued. The glowing staffs flickered, twisted, and turned into bizarre shapes. The fighters lost their balance, limbs passing through one another like ghostly polygons from an old video game. The choreography shifted into a surreal dance rather than a cinematic battle.

    Recognized Limitations

    This issue isn’t isolated to my test. OpenAI has noted that Sora 2 “struggles with sustained, believable humanoid combat or weapon choreography,” a limitation that becomes apparent as soon as the action starts.

    Yet, the environment looked real enough that it was almost forgivable. The scene had a cinematic feel, even if the action did not.

    Sora 2 did respond well to the next prompt:

    “A calm afternoon in New York’s Central Park. Tourists stroll and snap photos under soft golden sunlight. Suddenly, a figure in light robes activates a glowing staff, urgently calling for safety. Another dark-robed figure appears from the trees, holding a red energy weapon. The crowd scatters as the two engage in a precise, cinematic duel—elegant, dance-like movements, glowing arcs of light reflecting on the water nearby. The camera circles them in one continuous shot as autumn leaves swirl around. No contact or injury is shown—only skillful choreography and dynamic motion.”

    This time, it actually succeeded. The movements appeared grounded, fluid, and surprisingly human, with a final strike that flowed so smoothly you might forget you’re watching something produced by AI.

    Wildlife Scene Success

    For a more tranquil test, I turned to nature with the prompt:

    “Create a wildlife scene depicting birds and lions naturally drinking from a watering hole. Every animal should move realistically, and the scene must be hyper-realistic.”

    In this scenario, Sora 2 was in its element. The birds flitted and dipped with believable wing movements. Sunlight glittered on the water’s surface, and reflections naturally changed as the animals shifted. The textures—fur, feathers, and muddy ripples—approached the quality of a nature documentary.

    However, some quirks remained. One lion awkwardly appeared at the muddy edge, seemingly trying to drink from the dirt instead of the water. It’s a small but revealing detail: Sora 2 captures the overall scene but sometimes struggles with how objects interact with surfaces.

    Regardless, this was by far the most convincing result—and it hinted at where Sora 2 truly shines: environmental realism.

    Futuristic Urban Challenge

    For the final test, I pushed for maximum motion, lighting, and atmosphere:

    “A vast futuristic city at dawn. A cyberpunk courier in a dark jacket races across rooftops, clutching a glowing data case. A chasing drone weaves between antennas and vents, its thrusters glowing blue.

    The camera follows from behind and to the right, handheld style, as the runner leaps across a gap. The drone overtakes mid-air; lens flare and motion blur enhance the scene.”

    The output was breathtaking—and surprisingly cinematic.

    The lighting was stunning: the sunrise reflecting off glass buildings, glimmers on metallic surfaces, and subtle lens flare during the jump. Motion felt fluid and heavy. The drone chase added real tension, while the shallow depth of field mimicked the feel of a 35 mm film camera.

    This was Sora 2 performing at its peak—complex, cohesive, and believable. If this had been shown as a teaser for a game or film, most viewers wouldn’t suspect AI involvement immediately.

    Across these tests, one key point emerged: Sora 2 is consistently inconsistent. At times, it gets everything right. Other times, it produces something so strange that it’s hard not to laugh. This unpredictability has become part of its appeal.

    Community Reactions

    On Reddit, one user captured this perfectly:

    “Sora 2 either gives you Pixar or fever-dream energy, no in-between.”
    — r/AIGuild

    The model sometimes distorts objects, alters colors mid-scene, or misreads prompts in delightfully absurd ways. A prompt for “two plush animals walking side by side” once resulted in creatures that looked like a mix of teddy bear and inflatable balloon. It’s not accurate—but it’s unforgettable.

    This chaos can be a drawback for professionals seeking reliability, but for creative experiments, it’s invaluable. Sora 2 often behaves more like a collaborator with a vivid imagination than a predictable tool.

    When Sora 2 gets it right, it feels magical. However, it’s not magic—it’s a machine interpreting language through probability and visual pattern recognition. It thrives on context, not accuracy. Provide it with a strong sense of place—lighting, camera angles, tone—and it convincingly fills in the gaps. But ask for precision in choreography or delicate interactions, and it may falter.

    This aligns with observations from users in Reddit and Discord testing groups. Creators note that simpler human actions (walking, looking, gesturing) are believable, but more complex sequences (fighting, hugging, playing instruments) tend to break down. It seems OpenAI’s physics modeling still views the human body as separate moving parts instead of a cohesive unit. It’s realistic for a brief time—then chaos ensues.

    Cultural Implications

    What makes Sora 2 intriguing isn’t just its technical advancements… It’s the cultural shift it signifies.

    For years, creating videos was costly, time-consuming, and restricted by access to equipment or talent. Now, anyone with a good idea and a short paragraph can create something visually engaging.

    This democratization is powerful but also complicates creative ownership. When a model “imagines” part of a scene incorrectly, is that a failure or just a different take?

    In my Central Park duel, the warped weapons shattered realism yet introduced a dreamlike quality. It wasn’t what I sought, but it hinted at a surreal, artistic vibe—more akin to experimental film than mainstream action.

    Perhaps the deeper understanding is this: AI filmmaking isn’t about control but about negotiation.

    OpenAI recognizes the excitement and unease surrounding this technology. The company continues to limit access, citing concerns over misuse and the necessity for content safeguards.

    Recently, Japan’s government urged OpenAI to ensure Sora 2 respects anime and manga copyrights—a reminder that AI’s creative potential carries real-world legal implications. There’s also the question of originality. Sora 2’s outputs may appear new, but they’re built on vast amounts of existing media. The more realistic it gets, the harder it is to differentiate between homage and imitation. That’s why, despite its marvels, Sora 2 stands at a crossroads between artistry and automation.

    Most Sora 2 outputs still struggle with audio. While the visuals often impress, the sound quality can be poor, flat, muffled, or unnatural. Sora 2 attempts to generate ambient noise, sound effects, and dialogue simultaneously, but it can’t isolate them like a real sound designer. The result is overcompressed, metallic, or strangely balanced audio that seldom matches the cinematic quality of the visuals. For now, the best method to enhance Sora 2 videos’ sound is to add real-world ambience, effects, and music during post-production to breathe life into the scene.

    After several days of testing, Sora 2 left me feeling impressed, entertained, and a little uneasy. It’s both an engineering marvel and a reminder that human movement, emotion, and storytelling are uniquely intricate.

    Summary of Pros and Cons

    Strengths: Environments, lighting, atmosphere, dynamic motion, and short sequences of realism.

    Weaknesses: Human interactions, weapon physics, and occasional surreal distortions.

    Sora 2 marks a significant step towards AI-assisted filmmaking… Just don’t expect perfect realism just yet. Moreover, true filmmakers remain the essential artists we need… As Sora 2 still grapples with the natural movement of humans and animals.

    As one Redditor put it:

    “Sora 2 feels like watching the future learn how to dream.”
    — r/OpenAI

    It’s an apt description. Sora 2 doesn’t just replicate reality… It plays with it. And for those observing this evolution, it’s both exciting and profoundly human… Just with very poor audio quality.

     

    Source:
    Link


     

  • Japanese Entertainment Giants Accuse OpenAI’s Sora 2 of Copyright Violation

    Japanese Entertainment Giants Accuse OpenAI’s Sora 2 of Copyright Violation

    Key Takeaways

    1. CODA, formed in 2002, aims to combat piracy and promote Japanese entertainment globally, including major companies like Square Enix and Studio Ghibli.

    2. CODA has requested OpenAI to stop using copyrighted material for training its AI video tool, Sora 2, due to concerns over content similarities with existing Japanese works.

    3. Sora 2’s outputs may infringe copyright as they closely replicate specific copyrighted pieces, raising legal concerns regarding the use of Japanese materials in AI training.

    4. CODA argues that OpenAI’s opt-out mechanism for copyright owners does not comply with Japanese copyright laws, which typically require prior consent for the use of copyrighted works.

    5. CODA demands OpenAI to protect its members’ intellectual property and respond to claims regarding potential copyright violations related to Sora 2’s outputs.


    Japan’s Overseas Distribution Association (CODA) has made a formal request for OpenAI to stop using copyrighted material in the training of its AI video creation tool, Sora 2.

    CODA’s Formation and Purpose

    Established in 2002, CODA aims to combat piracy while enhancing the global presence of Japanese entertainment. The association includes major players like Square Enix, Bandai Namco, Kadokawa (the parent company of FromSoftware), Studio Ghibli, and Cygames. They reached out to OpenAI, led by Sam Altman, through an official letter expressing their concerns.

    Sora 2’s Launch and Concerns

    Sora 2 was announced on September 30, with an exclusive iOS release on October 1. In their letter, CODA made it clear that a significant amount of content generated by Sora 2 closely mirrors existing Japanese works. They have concluded that this similarity stems from utilizing Japanese materials as training data for the AI.

    CODA expressed that when Sora 2 reproduces specific copyrighted pieces or creates similar outputs, it could be seen as copyright infringement due to the replication that occurs in the machine learning process.

    Legal Standpoint of CODA

    Furthermore, CODA pointed out that OpenAI’s opt-out mechanism, which requires creators to indicate their intellectual property to prevent its use as training data, does not align with Japanese copyright laws. They explained that:

    Reports indicate that Sora 2 operates via an opt-out system responding to requests from copyright owners. However, under Japan’s copyright framework, prior consent is typically necessary for utilizing copyrighted works, and there is no system permitting one to escape liability for infringement through later objections.

    The catalyst for this situation was Sora 2’s remarkable capability to create AI videos that closely imitate Japanese media. One viral video featured Sam Altman playing in a field with Pokémon characters, humorously captioned, “I hope Nintendo doesn’t sue us.” It’s worth noting that Nintendo is not affiliated with CODA.

    CODA’s Demands

    CODA’s requests to OpenAI are quite clear. OpenAI must ensure that the intellectual property of CODA members is not harvested without their consent. Failing to do so could lead to legal complications for Sam Altman.

    CODA insists that OpenAI “addresses claims and inquiries from CODA member companies related to copyright violations concerning Sora 2’s outputs.”

    These incidents have surfaced alongside a recent Japanese survey revealing that more than half of the surveyed Japanese companies, including Capcom and Level-5, are exploring AI for game development.

    Source:
    Link


     

  • PayPal and OpenAI Team Up for Instant Payments with ChatGPT

    PayPal and OpenAI Team Up for Instant Payments with ChatGPT

    Key Takeaways

    1. PayPal and OpenAI have partnered to introduce Instant Checkout payments within ChatGPT using the Agentic Commerce Protocol (ACP).
    2. The ACP is an open-source standard that allows for seamless shopping experiences directly in ChatGPT sessions.
    3. Etsy was the first eCommerce platform to implement the ACP, with Shopify and Walmart also planning to adopt it.
    4. PayPal will enable merchants to display products in ChatGPT searches, making it easier for users to make purchases without leaving the chat.
    5. This feature could significantly change eCommerce by allowing direct purchases from ChatGPT, benefiting over 800 million ChatGPT users and 400 million PayPal customers.


    PayPal has revealed today, October 28, 2025, that it has teamed up with OpenAI to introduce Instant Checkout payments to ChatGPT using the ACP (Agentic Commerce Protocol). This new feature will allow users of ChatGPT to buy products and services directly within their ChatGPT sessions.

    Seamless Shopping Experience

    Last month, OpenAI released the Agentic Commerce Protocol as an open-source standard for eCommerce platforms, enabling a direct connection to ChatGPT for an effortless shopping experience. Etsy was the first platform to utilize the ACP. In addition to PayPal, Shopify is also planning to implement this protocol soon. Earlier this month, Walmart and OpenAI disclosed a similar collaboration, allowing Walmart’s customers to make purchases using Instant Checkout.

    Future Benefits for Users

    In the upcoming year, PayPal aims to enhance the experience for ChatGPT users by allowing PayPal merchants to showcase their products and services in ChatGPT searches, all without needing specific vendor integrations.

    What does this all mean for online shoppers? Soon, ChatGPT searches like “What are the best gaming laptops under $1000?” will enable users to make direct purchases without having to navigate to the vendor’s website. With over 800 million global ChatGPT users and more than 400 million PayPal customers, the broader adoption of Instant Checkout could represent a significant change for eCommerce as a whole.

    Source:
    Link


     

  • AI Crawlers Often Outperform Google Search in Coverage

    AI Crawlers Often Outperform Google Search in Coverage

    Key Takeaways

    1. Google has long been the leading web crawler, but now faces competition from AI firms like OpenAI, which have developed their own bots to gather internet data.

    2. A study by Hostinger found that OpenAI’s GPT bot accessed 4.4 million websites, surpassing Google’s bot, which accessed 3.9 million.

    3. Other bots, including those from Ahrefs, Anthropic, Meta, TikTok, Bing, and Apple, collectively make about 1.4 billion daily requests for information across the 5 million sites studied.

    4. Different bots target various sections of the web, allowing them to create a comprehensive map of the internet over time.

    5. About 80% of web queries originate from U.S. tech firms, indicating that a few major companies control the indexing process and influence the content and answers provided by AI systems.


    For many years, Google has been the go-to source for gathering information online and is often seen as the best example of a web crawler. These automated tools explore the web and record what they find, helping search engines make sites easy to find. However, Google Search is now up against rivals, as AI technologies also need internet data. This has led AI firms like OpenAI to create their own bots to scour the web for information.

    Study on Website Crawlers

    In late August 2025, Hostinger, a web hosting company, carried out research on how accessible 5 million websites are to crawlers. Notably, OpenAI’s GPT bot managed to reach 4.4 million of these websites, surpassing Google’s bot, which only accessed 3.9 million. Other lesser-known bots, including Ahrefs’ SEO crawler and Anthropic’s Claude bot, along with crawlers from Meta, TikTok, Bing, and Apple, were also quite active, collectively making roughly 1.4 billion daily requests for the 5 million sites.

    Coverage of Internet by Bots

    The fact that some bots cover fewer websites than others doesn’t mean they overlook certain areas of the web. Instead, these programs alternate their targets, which allows them to build a nearly complete map of the internet over time, usually in just a few weeks.

    The research also found that about 80% of queries come from tech firms in the United States, while around 10% come from China, with other countries contributing only a tiny fraction. This indicates that the process of indexing the internet is largely controlled by U.S. providers, particularly a few major tech companies. Consequently, a small group of platforms significantly shapes what content is accessible and what answers AI systems provide.

    Hostinger via Presseportal

    Source:
    Link


     

  • Walmart Teams with OpenAI for ChatGPT Shopping Experience

    Walmart Teams with OpenAI for ChatGPT Shopping Experience

    Key Takeaways

    1. Walmart is partnering with OpenAI to enable conversational shopping through ChatGPT, allowing customers to shop directly within the chat interface.
    2. The new feature aims to simplify and enhance the shopping experience by using a “buy” button and connecting to Walmart or Sam’s Club accounts.
    3. The launch is scheduled for fall, but it will initially exclude fresh foods due to typical shopping habits.
    4. The partnership includes developing an AI certification program to train Walmart staff in using generative AI tools for various operational tasks.
    5. This collaboration highlights the growing trend of using generative AI to improve customer experiences and may change how customers interact with online retailers.


    Walmart is making a significant move into conversational shopping by teaming up with OpenAI, allowing customers to shop right through ChatGPT. This integration will enable users to explore Walmart’s online inventory, which includes clothes, entertainment, and packaged products, and make purchases without needing to exit the chat.

    Enhancing Shopping Experience

    As stated by Daniel Danker, Walmart’s executive vice president of AI, product and design, this new feature aims to streamline the shopping process, bringing simplicity and efficiency by “meeting customers where they are.” By using a “buy” button within ChatGPT, customers will have the ability to connect their Walmart or Sam’s Club accounts and directly order items from the retailer’s stock or third-party vendors.

    Launch Timeline

    Set to debut in the fall, this feature will not include fresh foods at first, as Walmart notes that most shoppers tend to buy those through regular weekly orders. OpenAI has already established similar shopping functionalities with platforms like Etsy and Shopify, indicating that conversational AI is rapidly evolving into a new commercial space.

    Beyond Shopping

    Walmart’s partnership with OpenAI extends past just sales transactions. The two companies are also developing an AI certification program aimed at training Walmart staff in utilizing generative AI tools for tasks such as inventory management, scheduling, and coordination with suppliers. AI technology is already integrated throughout Walmart’s operations, helping to automate orders and speed up the fashion design process.

    In conclusion, this collaboration showcases the importance that large retailers place on enhancing customer experiences through generative AI. The integration will also serve as a trial for the effectiveness of AI-based shopping interfaces, potentially transforming how customers engage with online brands.

    Source:
    Link


     

  • OpenAI’s First Device by Jony Ive Delayed Due to Technical Issues

    OpenAI’s First Device by Jony Ive Delayed Due to Technical Issues

    Key Takeaways

    1. Technical Challenges: OpenAI’s partnership with designer Jony Ive faces technical problems that may delay the product launch.

    2. AI Voice and Character: The design team is struggling to create a friendly yet non-human-like AI presence, aiming for a balance that feels like a “buddy” rather than an awkward AI.

    3. Privacy Issues: The device’s need for constant listening raises significant privacy concerns, prompting careful discussions about managing sensitive user data.

    4. Budget Constraints: High computing power requirements for real-time operation could increase production costs and complicate pricing for mass-market appeal.

    5. User Experience Focus: OpenAI aims to create a seamless, personal user experience without relying heavily on screens, learning from past failures of similar AI devices.


    OpenAI’s big move into hardware may face challenges before it even gets to shoppers. A new report from the Financial Times reveals that the partnership with famed designer Jony Ive, who is famous for creating some of Apple’s most memorable products, has encountered “technical problems” that could push back the launch of the device.

    Design Dilemmas

    Insiders familiar with the project told FT that OpenAI and Ive’s design firm, LoveFrom, are still trying to figure out how to establish the AI’s “voice” and character. The team reportedly wants the assistant to come across as friendly but not too human-like; one source described the aim as “a buddy who’s a computer but not your awkward AI girlfriend.” However, achieving that delicate balance has turned out to be more challenging than they initially thought.

    Privacy Concerns

    Another significant obstacle is privacy. The anticipated device is expected to depend on constant environmental awareness, meaning it would always be listening. This feature has sparked internal discussions about how to manage sensitive user information, especially in a time when consumer confidence in AI is already shaky. According to reports, OpenAI’s leaders are cautious about igniting new privacy issues as they venture into hardware.

    Budgeting Issues

    Financial planning could also pose a challenge. The FT report mentions that the device will likely require substantial computing power to operate in real-time, especially if it aims to run advanced AI models locally or with a minimal connection to the cloud. This level of hardware capability would increase both production and operating expenses, potentially complicating pricing for mass-market appeal.

    Despite these difficulties, specifics about what the OpenAI-Ive device actually is remain limited. OpenAI CEO Sam Altman has suggested that it could be compact, context-aware, and without a screen; envision a physical AI companion rather than a conventional gadget.

    User Experience

    The company is reportedly looking into ways to make the experience seamless and personal without being intrusive or overly dependent on screens. If that idea sounds recognizable, it’s because other AI-driven devices have attempted (and not succeeded) to make it work.

    For instance, the Humane AI Pin was recently discontinued due to disappointing sales and mediocre reviews. Nevertheless, OpenAI and Ive seem set on evading a similar outcome by taking their time to enhance both the technology and the user experience.

    At this point, the project seems to be in a waiting mode as both teams work through these core challenges. Whether this means the device will miss its planned 2026 launch remains uncertain, but it’s evident that OpenAI aims for its first hardware venture to be more than just a gimmick, striving instead for a product that feels as well-crafted as it is smart.

    Source:
    Link


     

  • OpenAI Unveils Sora 2: Real Physics in AI Videos with User Cameos

    OpenAI Unveils Sora 2: Real Physics in AI Videos with User Cameos

    Key Takeaways

    1. Launch of Sora 2: OpenAI’s new flagship model, Sora 2, enhances video and audio generation with realistic physics and synchronized sound effects, launching on September 30, 2025, in the U.S. and Canada.

    2. Advancement in World Simulation: Sora 2 significantly improves world simulation, depicting plausible scenarios and complex actions accurately, moving beyond previous models’ limitations.

    3. User Integration Features: Users can upload their own videos and audio to create personalized scenes, incorporating elements like friends and pets with high accuracy.

    4. Focus on Creation: The Sora app prioritizes user creation over engagement, with features like customizable feeds, daily limits for teen accounts, and enhanced human moderation to prevent abuse.

    5. Future Plans: OpenAI aims to develop general-purpose world simulators and advanced robotics, with Sora 2 serving as a foundation for better understanding and interacting with the physical world.


    OpenAI has revealed its new flagship model, Sora 2, aimed at enhancing video and audio generation, marking a significant advancement in AI-driven content creation. This updated model features more realistic physics, improved controllability, and introduces synchronized dialogue along with environmental sound effects, providing a deeper level of immersion for AI-generated media. Sora 2 is being launched together with a new social application called Sora, which seeks to change the way individuals engage with AI-generated content. The launch starts today, September 30, 2025, in both the United States and Canada.

    Evolution of Generative Video

    The original Sora model, which was released in early 2024, is seen by OpenAI as a crucial milestone for generative video, akin to the influence of GPT-1 on natural language processing. In comparison, Sora 2 is described as achieving a “GPT-3.5 moment” for video, establishing a new standard for AI’s grasp of physical reality and world simulation.

    Enhanced World Simulation

    OpenAI claims that Sora 2 signifies a significant leap in “world simulation” abilities. While earlier models often distorted reality to comply with a prompt, Sora 2 aims to depict plausible scenarios, such as misses and rebounds in basketball, instead of just moving objects around to fit the storyline. The model is now capable of rendering complex scenarios that previously challenged older systems, such as Olympic-level gymnastics, precise water backflips that consider buoyancy and rigidity, or even a triple axel with a cat hanging on. These improvements extend to the audio realm as well, with Sora 2 creating background soundscapes, synchronized dialogue, and sound effects that perfectly align with the visual action, adding a new layer of authenticity.

    Unique Features for Users

    One standout feature allows users to “upload yourself.” By recording a short video and audio clip, Sora 2 can integrate these elements (along with friends, pets, and real-world objects) into generated scenes with high accuracy in visuals and voice.

    To showcase the capabilities of Sora 2, OpenAI is introducing a new iOS application, Sora. This app lets users create and remix short videos, explore community-generated content in a customizable feed, and participate in AI-generated scenes using the “cameos” feature. Cameos require a quick verification process for ensuring user control and consent, with options for privacy and content removal.

    Focus on Creation

    OpenAI emphasizes that the new Sora app is designed with a focus on creation rather than maximizing engagement. The default feed highlights users you follow and content that might inspire your own creations; the company mentions it is not optimizing for time spent. Accounts for teens come with daily feed-view limits and stricter cameo permissions, while parental controls (accessible through ChatGPT) provide options to disable algorithmic personalization or limit infinite scrolling. Additionally, OpenAI is enhancing human moderation to combat bullying and other abuse concerns.

    Sora 2 is launching today in the U.S. and Canada, with intentions to expand to more regions. The app will initially be invite-only, promoting collaborative use among friends. The model is free to use at first, with “generous limits,” subject to computational constraints. ChatGPT Pro users can access the high-quality Sora 2 Pro model in advance via ChatGPT, with support in the Sora app coming soon. OpenAI also plans to release Sora 2 via API for wider integration. Users will continue to have access to content generated with Sora 1 Turbo in their libraries.

    OpenAI sees Sora 2 as a crucial step toward developing general-purpose world simulators and, ultimately, advanced robotics. The company views enhancements in video-native AI models as a pathway not only for creative tools but also for systems capable of better understanding, interacting with, and eventually existing within the physical world.

    Source:
    Link


     

  • OpenAI to Launch AI-Driven TikTok Clone Soon

    OpenAI to Launch AI-Driven TikTok Clone Soon

    Key Takeaways

    1. OpenAI is entering the short video market, competing with TikTok, Instagram Reels, and YouTube Shorts, focusing on AI-generated content.
    2. The platform will use OpenAI’s Sora 2 video generation model, allowing video lengths of up to ten seconds and featuring user-uploaded images.
    3. The app will include features like Remix, enabling users to modify content created by others.
    4. Copyright issues may arise as the app can generate copyrighted material, requiring owners to opt out to prevent their work from being used.
    5. Future developments will enhance Sora’s capabilities, and OpenAI may adopt an invite-only strategy to manage access and create interest.


    Wired has discovered that OpenAI is looking to dive into the short video scene, going head-to-head with platforms like TikTok, Instagram Reels, and YouTube Shorts. But there’s a big twist with OpenAI’s approach: it will focus solely on AI-generated content, which takes away much of the work involved in creating videos. This could be a great opportunity for prompt engineers.

    Initial Features

    At the start, the new platform will utilize OpenAI’s Sora 2 video generation model, keeping video lengths to a maximum of ten seconds. Users will be able to upload their own image to the app through a verification tool. Key features found in TikTok and Reels, like Remix, will also be included, allowing users to alter content made by others.

    Copyright Issues

    Interestingly, the app will have the ability to create copyrighted material. For copyright owners, this means they will have to actively opt out to prevent their work from appearing on the platform. This could cause a lot of trouble for creators and companies at first, as there may be significant uncertainty regarding what content is allowed.

    Future Developments

    By focusing exclusively on AI-generated videos, OpenAI can enhance future versions of Sora. The content will stay within the app, potentially reducing the need for popular tools like CapCut from ByteDance. It’s still unclear if this will truly affect the crowded short video landscape.

    OpenAI hasn’t made any official statements yet, but now that the news is out, it won’t be long before they do. Considering the massive computing power needed for AI video creation, OpenAI might adopt an invite-only strategy to create buzz and limit access to a few markets.

    Source:
    Link


     

  • OpenAI, Jony Ive, and Luxshare to Launch AI Hardware by 2027

    OpenAI, Jony Ive, and Luxshare to Launch AI Hardware by 2027

    Key Takeaways

    1. OpenAI is developing its first consumer device, a smart speaker without a screen, in collaboration with Jony Ive and Luxshare, aiming for a late 2026 or early 2027 launch.

    2. Tang Tan, a former Apple design executive, is now the Chief Hardware Officer at OpenAI, following the company’s $6.5 billion acquisition of io Products, co-founded by Tan and Jony Ive.

    3. OpenAI has hired around 24 former Apple employees with expertise in design, hardware, and manufacturing to accelerate its hardware development.

    4. The company is leveraging Apple’s supply chain and collaborating with partners like Luxshare and Goertek to source components, indicating a shift towards consumer electronics.

    5. There are potential conflicts between OpenAI and Apple due to their overlapping interests, as OpenAI develops hardware while Apple uses OpenAI’s models in its products.


    OpenAI is stepping into the hardware arena by teaming up with Jony Ive and choosing Luxshare to create its first consumer device. Suppliers have described this upcoming product as a smart-speaker-style device that does not have a screen. OpenAI is also working on other products like glasses, a wearable pin, and a digital voice recorder, aiming for a launch in late 2026 or early 2027.

    Leadership and Acquisitions

    Tang Tan, who previously worked as a design executive at Apple, is now the Chief Hardware Officer at OpenAI and reports to Sam Altman. In May, OpenAI purchased io Products, a hardware studio that Tan co-founded with Jony Ive, for a whopping $6.5 billion, with funding from SoftBank. Legal documents related to a trademark issue reveal the intention to create a lineup of “AI-native” products under the io brand.

    Hiring Strategies

    To speed up its hardware plans, OpenAI has brought on board about two dozen former Apple workers who specialize in design, hardware, UI, audio, wearables, and manufacturing. Key additions include manufacturing design specialist Matt Theobald and Siri interface designer Cyrus Daniel Irani, both now part of Tan’s team.

    Supply Chain Connections

    OpenAI is taking advantage of Apple’s supply chain while looking for new talent. Besides collaborating with Luxshare, they have contacted Goertek, which provides components for AirPods, HomePods, and Apple Watches, to supply speaker modules. OpenAI is also engaging with other Chinese partners used by Apple for tooling and audio. These actions suggest that OpenAI is aiming to expand like a consumer electronics company, instead of merely functioning as a research lab.

    Future Device Ideas

    There are reports that OpenAI is looking into more device concepts, with timelines that coincide with the smart-speaker-style product. The goal is to merge AI-driven features with the high-quality design that Jony Ive’s teams are known for. To bring in experienced professionals, OpenAI is providing attractive salaries, fresh job opportunities, and various incentives.

    Potential Conflicts

    A possible conflict has arisen as Apple has incorporated OpenAI’s models into Siri and iOS since 2024, while OpenAI is developing hardware alongside former Apple executives and suppliers. This overlap may lead to tension if OpenAI’s hardware plans clash with Apple’s interests, which could have repercussions for their ongoing partnership as new products are created.

    Source:
    Link