DALL·E mini, an open source alternative to DALL·E
DALL·E mini, an open source alternative to DALL·E

DALL·E mini
DALL·E mini
DALL·E mini
By Sofía Sánchez González
DALL·E 2 has been around for a while now, but if you’re looking for a free and open-source alternative, we’ve got you covered. In this post we offer you a way to try it: DALL·E mini, an open source alternative to DALL·E.
What are image-generating models?
But first things first. DALL·E was the first creative artificial intelligence model capable of generating images from texts. In other words, it combines both an understanding of natural language with the generating of realistic images.
But DALL·E’s artistic talents don’t end with a simple snail drawing. If you wanted, it could generate multiple harp-shaped snail illustrations, or something more bizarre than you could even imagine. Just provide a description and it will generate a number of alternatives.
It can generate images of anything you can think of. And when we say anything, we mean absolutely anything! This model can:
-
- Transform existing images
- Create anthropomorphic animals and objects
- Combine concepts which are seemingly unrelated
- Represent text
Here are some figures on the DALL·E model:
- 12 million parameters
- 1,280 tokens (256 for text and 1024 for images)
In short, the image-generating models are diffusion models to synthesize image + large language models for the text.
What’s new with DALL·E 2?
Well, the truth is, not much. DALL·E was already using diffusion models, now they have simply scaled it up with larger datasets and parameters in order to allow for a larger capacity.
The process has been the same as with GPT-3 and GPT-4. At a structural level it is the same model, but during the training the new one has been entered with a much larger text. DALL.E 2 works on a 3.5 billion parameter model while using another 1.5 billion parameter model to improve the resolution of the digitally produced images.
Impressive numbers, but the problem is that the model comes with limitations based on free or paid plans. That is why we offer you an alternative.
DALL·E mini, an open source alternative to DALL·E
This image generator is available on the Hugging Face profile because it is an open source alternative to DALL·E. As the name suggests, it is a ‘mini’ version of DALL·E, while still boasting some incredible results.
This is what its creator, Boris Dayma, explains to us on the project blog:
The model is trained by looking at millions of images from the internet with their associated captions. Over time, it learns how to draw an image from a text prompt.
Some of the concepts are learned from memory as it may have seen similar images. However, it can also learn how to create unique images that don’t exist such as “the Eiffel tower is landing on the moon” by combining multiple concepts together.
Fortunately, the goal of Hugging Face is to democratize artificial intelligence. So all Internet users can try it!
DALL·E Mega
And we have good news, because the creator of DALL·E mini trained DALL·E Mega, an even more powerful imager. Even the training could be viewed for free via this link! Anyone could see the learning curve and even see the parameters that changed.
Dayma promises that it has higher quality. It will also be available on the Hugging Face platform and will have a demo like the one for DALL·E mini.
Imagen from Google
Another image builder model option is Google Imagen. According to various market analyses, it is the generator that offers the best results. But the downside is that it’s not available to everyone. On a technical level, it has a huuuuge LLM.
About Narrativa
Narrativa® is the global leader in generative AI content automation. Through the no-code Narrativa® Navigator platform and the collaborative writing assistant, Narrativa® Sidekick, organizations large and small are empowered to accelerate content creation at scale with greater speed, accuracy, and efficiency.
For companies in the life sciences industry, Narrativa® Navigator provides secure and specialized AI-powered automation features. It includes complementary user-friendly tools such as CSR Atlas, Narrative Pathway, TLF Voyager, and Redaction Scout, which operate cohesively to transform clinical data into submission-ready regulatory documents. From database to delivery, pharmaceutical sponsors, biotech firms, and contract research organizations (CROs) rely on Narrativa® to streamline workflows, decrease costs, and reduce time-to-market across the clinical lifecycle and, more broadly, throughout their entire businesses.
The dynamic Narrativa® Navigator platform also supports non-clinical industries such as finance, marketing, and media. It helps teams drive measurable impact by creating high-quality, scalable content on any topic. Available as a self-serve SaaS solution or a fully managed service, built-in AI agents enable the production, refinement, and iteration of large volumes of SEO-optimized news articles, engaging blog posts, insightful thought leadership pieces, in-depth financial reports, dynamic social media posts, compelling white papers, and much more.
Explore www.narrativa.com and follow on LinkedIn, Facebook, Instagram, and X. Accelerate the potential with Narrativa®.