OpenAI launches DALL-E 3 API, new text-to-speech models | TechCrunch thumbnail
OpenAI launches DALL-E 3 API, new text-to-speech models | TechCrunch
techcrunch.com
DALL-E 3, OpenAI’s text-to-image model, is now available via an API after first coming to ChatGPT and Bing Chat. Similar to the previous version of DALL-E (e.g. DALL-E 2), the API incorporates built-in moderation to help protect against misuse, OpenAI says. The DALL-E 3 API offers different format
1 Users
0 Comments
1 Highlights
0 Notes

Summary

OpenAI has launched the DALL-E 3 API, allowing users to access its text-to-image model. The API includes built-in moderation to prevent misuse and offers different format and quality options. However, it is currently limited compared to the DALL-E 2 API, as it cannot create edited versions or variations of existing images. OpenAI has also introduced a text-to-speech API called Audio API, which provides six preset voices and two generative AI model variants. The company does not offer control over the emotional affect of the generated audio. OpenAI has also released an updated version of its open source automatic speech recognition model, Whisper large-v3.

Top Highlights

  • DALL-E 3, OpenAI’s text-to-image model, is now available via an API after first coming to ChatGPT and Bing Chat. Similar to the previous version of DALL-E (e.g. DALL-E 2), the API incorporates built-in moderation to help protect against misuse, OpenAI says. The DALL-E 3 API offers different format and quality options and resolutions ranging from 1...

Ready to highlight and find good content?

Glasp is a social web highlighter that people can highlight and organize quotes and thoughts from the web, and access other like-minded people’s learning.