Tag: AI video generation

  • New PixVerse R1 Model: Live Input and Subject Priority

    Key Takeaway

    – PixVerse focuses on maintaining subject consistency across clips to improve narrative storytelling.
    – It calculates token cost before generation to reduce user risk.
    – The R1 model enables real-time, collaborative video generation from live prompts.
    – AI video excels at dangerous or costly scenes but struggles with realistic physics.
    – Aggressive pricing allows low-risk trial for both novices and professionals.


    A New Approach to AI Video

    Undoubtedly, text-to-video generation has made huge leaps in capabilities over the past few years, and while this has improved quality and video fidelity, the core narrative function of what video represents has largely been overlooked. This is where PixVerse takes a new approach. In addition to generating higher-quality AI videos, the platform emphasizes keeping the video’s subject as consistent as possible across generated clips, allowing users to focus on directing character movements and establishing narrative concepts. Also, to address the risk involved whenever a user commits to generating a video, the PixVerse platform calculates the number of tokens required to generate that specific video. With features like these, PixVerse aims to cater to both novices and professionals.

    Struggles with Real-World Physics

    While this is ambitious, there are still a few pitfalls to address in practice, as well as the purpose of AI-generated video. One of the examples used in the presentation was to generate a video of a car accident where a taxi was going underwater, an event that would be too dangerous for a person to film. This type of scene is ideal for generating effects to avoid the high costs of filming, but it faces the perpetual struggle of replicating real-world physics. Additionally, because the platform offers three tiers of capabilities, users will still need to navigate the model selection process. However, among these models, the R1 is by far their most ambitious offering, featuring real-time video generation from live user prompts.

    By gathering information from multiple users simultaneously, the R1 model uses context from each second of video to maintain PixVerses’ character focus while incorporating user requests. In the demo, the results were disjointed, but the potential for true collaboration is there if a team is all on the same page.

    Aggressive Pricing and Availability

    Lastly, PixVerse is aggressive with its pricing strategy, so at the very least, users can try out the platform without too much investment. Curious users can find more information in the resources below. The platform offers a free tier for basic generation, with paid plans starting at $15 per month for 5000 tokens, while the Pro plan at $49 per month gives 25000 tokens and access to the R1 model. The R1 model processes video at 720p resolution, generating up to 15 seconds per clip, with token costs varying based on complexity—typically 60 tokens for a short scene. Special effects and longer durations require additional tokens, but the precise calculations are handled automatically during generation.

    Sources
  • OpenAI Closes Sora: Anti-AI Filmmakers Celebrate Video Platform Shutdown

    OpenAI Closes Sora: Anti-AI Filmmakers Celebrate Video Platform Shutdown

    Key Takeaways

    1. OpenAI is discontinuing its video generation service, Sora, just six months after its launch due to a shift in focus to new models.
    2. Sora initially gained popularity with a million downloads quickly, but faced backlash over unauthorized celebrity deepfakes, leading to reduced user engagement.
    3. OpenAI implemented strict safety protocols requiring celebrity consent for likeness use, which diminished the app’s appeal.
    4. This closure marks OpenAI’s first significant service shutdown, highlighting challenges in meeting market expectations and regulatory demands.
    5. Filmmakers may welcome Sora’s cancellation, while alternatives like Google Veo 3.1, Runway Gen-4, and Kling AI remain available.


    In an unexpected development, OpenAI, the creator of ChatGPT, has revealed that it will be ending its video generation service, Sora, just six months after its launch in September 2025. Although Sora had a fantastic start thanks to extensive publicity from YouTube creators and filmmakers, OpenAI has decided to discontinue it, possibly to focus on new models.

    Farewell to Sora

    We’re bidding farewell to the Sora app. To all who have created, shared, and fostered a community with Sora: we appreciate you. Your contributions with Sora were significant, and we understand this news is a letdown. More information will be shared soon, including timelines for the app and API, along with details on how to keep your work. – The Sora Team

    Mixed Reactions

    When Sora was launched, it received a blend of excitement and concern from the creator community. The app enabled users to create high-resolution videos featuring “cameos” of themselves and achieved a million downloads more quickly than ChatGPT – quite remarkable.

    Nevertheless, it appears that the initial excitement faded swiftly as the platform faced backlash over unauthorized celebrity deepfakes. In reaction, OpenAI put in place stringent safety protocols that mandated consent from celebrities before their likenesses could be used in AI-generated videos. These new rules reportedly diminished the app’s viral charm, causing a drop in active users.

    Industry Implications

    It’s important to point out that this is the first significant service closure for OpenAI, indicating that even the leading companies in the industry can struggle when dealing with market expectations and regulations.

    What’s for sure is that many filmmakers will likely rejoice at Sora’s cancellation, while others will turn to alternative platforms. With rivals like Google Veo 3.1, Runway Gen-4, and Kling AI still available, it’s doubtful that this will be the last word from OpenAI on AI video creation.

    Source:
    Link


     

  • Google Gemini Update: Enhanced AI Videos with Multiple Images

    Google Gemini Update: Enhanced AI Videos with Multiple Images

    Key Takeaways

    1. OpenAI’s Sora 2 is not available in Europe, while Google is improving its Veo model.
    2. Veo 3.1 has been added to the Gemini app for mobile and desktop platforms.
    3. Users can now upload multiple reference images with text prompts for better precision in AI-generated videos.
    4. This feature allows users to combine different elements, like characters, backgrounds, and visual styles, in their prompts.
    5. The “Ingredients to Video” functionality has expanded from Google’s specialized tools to the standard Gemini app.


    While OpenAI’s Sora 2 isn’t yet accessible in Europe, Google is enhancing its rival model, Veo. Following the release of Veo 3.1 in the middle of October, this model has now been incorporated into the Gemini app for both mobile and desktop platforms. With the recent update, Google aims to tackle one of the main obstacles and offers users greater precision in their control. As per Google’s announcement on their X account for the Gemini app, users now have the ability to upload several reference images alongside their text prompts.

    Enhanced User Experience

    For instance, one image might serve as a reference for a character, another could depict the background, and a third might illustrate the overall visual style. The accompanying text prompt can then provide clearer directions on how these components should merge in the AI-generated video.

    New Features Available

    This functionality isn’t entirely fresh, considering the “Ingredients to Video” feature was integrated into Google’s AI filmmaking tool, Flow, back in October. With the recent update, this feature has expanded beyond Google’s specialized developer tools like Flow or Vertex AI, and is now directly integrated into the standard Gemini app.

    According to Google, the introduction of this new feature has already begun to roll out.

    Source:
    Link