NewPowered by Baidu's open-source ERNIE-Image 8B DiT model

Generate Images with ERNIE-Image

ERNIE-Image is Baidu's open-source bilingual AI image generator — an 8B Diffusion Transformer model that leads the field in text-in-image accuracy (LongTextBench 0.9733), structured poster and comic layouts, and native Chinese/English prompt support. Whether you need an AI poster generator, a bilingual infographic, or a comic panel with precise speech bubbles, ERNIE-Image produces professional results in seconds.

🎁 Free daily generation — 1 image per day, no sign-up required
0 / 500
✦ Standard3 Credits
Higher quality · ~60s · Basic/Pro
Example AI-generated image

What is ERNIE-Image?

ERNIE-Image is Baidu's open-source 8B single-stream Diffusion Transformer (DiT) model, released in April 2025. It leads all open-source AI image generators in text rendering accuracy, bilingual Chinese/English support, and structured layout generation — making it the go-to tool for designers, marketers, and developers who need precise text inside images. Licensed under Apache 2.0, ERNIE-Image is freely available for commercial and research use.

Open-Source 8B DiT Model

Fully open-source under Apache 2.0. Built on an 8B single-stream Diffusion Transformer architecture — accessible, powerful, and community-driven. Unlike closed AI image generators such as DALL-E or Midjourney, ERNIE-Image can be self-hosted, fine-tuned, and integrated into your own pipelines without licensing restrictions.

Precise Text in Images

LongTextBench score of 0.9733 — the best-performing open-source AI image generator for rendering accurate text inside images, including comic speech bubbles, poster headlines, infographic labels, and UI mockup copy. If your use case requires legible, correctly-spelled text baked into the image, ERNIE-Image is the clear leader.

Bilingual Chinese/English

Supports Chinese and English prompts with equal quality. ERNIE-Image leads the OneIG bilingual benchmark — ideal for Asian markets, dual-language content creation, and any workflow that requires ERNIE-Image-quality output in both scripts. Write your prompt in Chinese, English, or a mix of both and get professional results every time.

State-of-the-Art Benchmarks

GENEval 0.8856 — competitive with FLUX.1 and Qwen-Image on general image quality, while surpassing both on structured poster and comic generation tasks. ERNIE-Image's dual strength in text rendering and compositional accuracy makes it the most complete open-source AI image generator available today.

Why Choose ERNIE-Image

ERNIE-Image delivers capabilities that other AI image generators simply cannot match — especially for text-heavy, bilingual, and structured-layout visuals. Here is what makes it the best choice for professional creators and developers.

Generate posters, comic panels, and infographics where the text inside the image is exactly what you wrote — no garbled characters, no hallucinations, no spelling errors. ERNIE-Image achieves a LongTextBench score of 0.9733, the highest of any open-source AI image generator. This makes it uniquely suited for AI poster generation, comic creation, event flyers, and any design where readability of embedded text is non-negotiable.

How It Works

How ERNIE-Image Works

ERNIE-Image uses a single-stream Diffusion Transformer (DiT) architecture with 8 billion parameters. Unlike traditional UNet-based diffusion models, the DiT processes the entire image as a sequence of patches in a unified transformer, enabling better global coherence, more accurate text placement, and superior structured-layout generation. Here is how to go from prompt to finished image in under a minute.

1

Write Your Prompt

Describe the image you want in Chinese or English. Be specific about content, style, layout, and any text you want rendered inside the image. For example: 'A vibrant event poster for a music festival, headline text SUMMER BEATS 2025, bold sans-serif font, neon colors on dark background.' The more detail you include about text placement and composition, the better ERNIE-Image performs relative to other AI image generators.

2

Choose Mode and Aspect Ratio

Select Turbo mode (8 steps, ~15 seconds, 1 credit) for fast ideation or Standard mode (50 steps, ~60 seconds, 3 credits) for final-quality output. Then pick your aspect ratio: square 1:1 for social media, portrait 3:4 for phone wallpapers, landscape 4:3 for presentations, vertical 9:16 for TikTok and Instagram Stories, or widescreen 16:9 for banners and YouTube thumbnails. ERNIE-Image maintains bilingual text accuracy across all aspect ratios.

3

Generate and Download

Click Generate and ERNIE-Image runs inference via the fal.ai serverless GPU infrastructure. Turbo images arrive in roughly 15 seconds; Standard images in about 60 seconds. Once generated, you can download the full-resolution PNG — watermark-free on paid plans — share a link, or regenerate with a refined prompt. Paid subscribers also have generation history so you can revisit and download any image you have ever created.

Key Features

Everything you need to generate, download, and share AI images — from free daily credits to pro-tier unlimited generation. ERNIE-Image combines state-of-the-art bilingual AI image generation with a practical credit system designed for creators at every level.

Turbo Mode (1 Credit)

8-step generation in ~15 seconds. Perfect for rapid ideation and testing your prompts before committing to high-quality renders. Turbo mode is available on all plans including the free tier.

Standard Mode (3 Credits)

50-step generation in ~60 seconds for maximum quality. ERNIE-Image's Standard mode produces sharper details, more accurate text rendering, and better prompt adherence than Turbo. Available to Basic and Pro subscribers for final-quality outputs.

5 Aspect Ratios

Square (1024×1024), portrait 3:4, landscape 4:3, vertical 9:16, and widescreen 16:9. Pick the right canvas for every use case — from Instagram posts to YouTube banners. ERNIE-Image maintains bilingual text accuracy across all sizes.

Free Daily Generation

1 free image per day — no account required. Try ERNIE-Image's best-in-class text rendering and bilingual AI image generation capabilities before subscribing. The free tier uses Turbo mode at 1024×1024 with a watermark.

Credits System

Basic ($9/mo, 300 credits) and Pro ($19/mo, 1000 credits) subscriptions, plus one-time credit packs that never expire. Turbo uses 1 credit; Standard uses 3. Unused subscription credits reset each billing cycle.

Commercial License

Basic and Pro subscribers get full commercial rights to use generated images in products, marketing, and client work. Combine with ERNIE-Image's Apache 2.0 model license for a fully open, commercially unrestricted AI image generation workflow.

What Can You Create with ERNIE-Image?

ERNIE-Image is the only AI image generator purpose-built for text-heavy and bilingual visuals. From marketing teams generating hundreds of social media assets to indie developers building AI-powered design tools, here are the most popular ways people use ERNIE-Image — the ERNIE-Image powered platform.

AI Poster Generator

Create event posters, promotional banners, and movie-style one-sheets with accurate headlines, body copy, and credits — all rendered inside the image without any post-production text overlay. Describe the layout, color scheme, and exact wording, and ERNIE-Image produces a print-ready poster. Marketing teams use ERNIE-Image to generate dozens of poster variations in the time it takes to brief a designer, dramatically cutting content production costs.

Comic and Manga Creation

ERNIE-Image renders speech bubbles, thought bubbles, and sound effects directly in the artwork — something that has historically required manual editing after AI generation. Comic creators use ERNIE-Image to draft panel layouts with placeholder dialogue, iterate on character designs, and produce web comics at scale. The bilingual AI image capability also means you can create Chinese-language manga and English-language comics from the same workflow.

Infographic and Data Visualization

Generate visually engaging infographics where labels, numbers, and callouts are accurate and legible. ERNIE-Image understands structured data presentation, making it ideal for creating explainer graphics, comparison charts, timeline visuals, and educational content. Teams producing bilingual reports can generate Chinese and English versions of the same infographic in under a minute each.

Bilingual Social Media Content

Generate Instagram posts, Twitter/X headers, LinkedIn banners, WeChat Moments cards, and Xiaohongshu (RED) visuals — with native Chinese text support that most AI image generators cannot provide. Social media managers use ERNIE-Image to produce a full week of branded visual content in an afternoon. The bilingual AI image generation capability is particularly valuable for brands running parallel Chinese and English social channels.

Marketing and Advertising Materials

Produce product ads, email header images, landing page hero visuals, and display advertising creatives. ERNIE-Image's structured layout generation ensures that headlines, sub-copy, and CTAs appear where you specify them in the prompt. Agencies use ERNIE-Image to mock up campaign concepts for client approval before investing in photography or professional design, reducing the creative development cycle from weeks to hours.

Developer API Integration

ERNIE-Image is available via the fal.ai API, making it straightforward to integrate bilingual AI image generation into your own applications. Build AI-powered design tools, content automation pipelines, e-commerce product image generators, or white-label creative platforms. The Apache 2.0 model license means there are no legal barriers to commercial integration. Combine fal.ai's serverless GPU infrastructure with ERNIE-Image's best-in-class text rendering for a production-ready AI image generator API.

ERNIE-Image by the Numbers

ERNIE-Image's benchmark scores reflect a genuine leap in text-in-image AI and bilingual generation quality. These are reproducible results on standardized evaluation sets that position ERNIE-Image as the leading open-source AI image generator for text rendering and bilingual support.

0.9733

LongTextBench

Best open-source score for text rendering inside images — ahead of FLUX and Stable Diffusion

0.8856

GENEval Score

General image generation quality — competitive with FLUX.1 Dev and Qwen-Image

8B

Model Size

Single-stream Diffusion Transformer architecture, open-source and self-hostable

Apache 2.0

Open License

Apache 2.0 — free for personal and commercial use, no royalties or restrictions

Simple, Transparent Pricing

Start free, upgrade when you need more. All paid plans include commercial usage rights and no watermarks. ERNIE-Image is the most affordable way to access best-in-class bilingual AI image generation with precise text-in-image rendering.

Free

$0 / month
Try ERNIE Image Generator at no cost.

Includes

  • 1 free image per day
  • Turbo mode only
  • 1024×1024 max resolution
  • Watermark on output
  • No generation history

Basic

$9
$7.20 / month
Perfect for regular creators and small projects.
~$0.024 per image · billed $86.40/yr

Includes

  • 300 credits per month
  • No watermark
  • Turbo & Standard modes
  • 1264×848 max resolution
  • Commercial license
  • 30-day generation history
Most Popular

Pro

$19
$15.20 / month
For power users and professional creators.
~$0.015 per image · billed $182.40/yr

Includes

  • 1000 credits per month
  • No watermark
  • Turbo & Standard modes
  • 1376×768 max resolution
  • Commercial license
  • Permanent generation history
  • Priority queue

Frequently Asked Questions

Everything you need to know about ERNIE-Image — the bilingual AI image generator with best-in-class text-in-image accuracy. Have another question? Contact us at support@ernie-image.com













Start Generating Images with ERNIE-Image Today

Free to try — 1 image per day, no sign-up required. Upgrade for unlimited access, commercial rights, and the full power of the world's best bilingual AI image generator with precise text-in-image rendering.