OpenAI, the company behind ChatGPT, has expanded its artificial intelligence capabilities with the launch of Sora, its text-to-video AI model.
OpenAI explained that Sora can generate videos lasting up to a minute while maintaining visual quality and adhering to the user’s prompt. It can also create complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background.
While announcing the launch of Sora on X, Sam Altman, OpenAI’s Chief Executive Officer, told his followers, “We’d like to show you what Sora can do. Please reply with captions for videos you’d like to see, and we’ll start making some.”
In a blog post, OpenAI noted that the model is available to red teamers, who will assess critical areas for harm or risk, and to several visual artists, designers, and filmmakers, who will provide feedback on how to advance the model.
The firm acknowledged that the model still has limitations. “The current model has weaknesses. It may struggle with accurately simulating the physics of a complex scene and may not understand specific instances of cause and effect. For example, a person might take a bite out of a cookie, but afterwards, the cookie may not have a bite mark,” it said.
Sora is not yet available in OpenAI’s products, as the firm is still working with domain experts in areas like misinformation, hateful content, and bias.
Since OpenAI debuted its still-image generator DALL-E in 2021 and its generative AI chatbot ChatGPT in November 2022, it has amassed over 100 million users, helped primarily by ChatGPT’s success. The launch of Sora follows in the footsteps of Google and Meta, which are also working on generative video tools.
Aside from Sora, the firm disclosed on Wednesday that it is experimenting with giving ChatGPT a deeper memory so that it can remember more of its users’ chats.