managarten/apps/picture/docs/models/qwen-image.md
Wuesteon d36b321d9d style: auto-format codebase with Prettier
Applied formatting to 1487+ files using pnpm format:write
  - TypeScript/JavaScript files
  - Svelte components
  - Astro pages
  - JSON configs
  - Markdown docs

  13 files still need manual review (Astro JSX comments)
2025-11-27 18:33:16 +01:00

2.4 KiB

Qwen Image

Overview

Qwen Image is Alibaba's advanced image generation model that combines strong multilingual understanding with high-quality image generation capabilities. It's particularly notable for its excellent handling of Asian languages and cultural contexts.

Model Details

  • Provider: Qwen (Alibaba)
  • Replicate ID: qwen/qwen-image
  • Version: 9bc5cb891bfe948b11c7bb9e63ccb1c7e03c4cf53e89b963a99e673f84c5d8ef

Key Features

  • Multilingual Excellence: Superior understanding of Chinese, Japanese, Korean, and other languages
  • Cultural Awareness: Strong understanding of diverse cultural contexts
  • Balanced Quality: Good balance of speed and image quality
  • Versatile Styles: Handles both Eastern and Western artistic styles

Default Parameters

  • Resolution: 1024x1024
  • Steps: 30
  • Guidance Scale: 7.5
  • Supports Negative Prompts: Yes
  • Supports Seed: Yes

Supported Aspect Ratios

7 aspect ratios:

  • Square: 1:1
  • Landscape: 4:3, 3:2, 16:9
  • Portrait: 3:4, 2:3, 9:16

Supported Resolutions

  • Custom Range: 512x512 to 2048x2048
  • Quality Modes:
    • "optimize_for_quality" (higher resolution)
    • "optimize_for_speed" (lower resolution)
  • Custom width/height override available

Best Use Cases

  • Multilingual content creation
  • Asian market visuals
  • Cultural and traditional artwork
  • E-commerce product images
  • Educational illustrations

Example Prompts

  1. "Traditional Chinese garden with pavilion, koi pond, and cherry blossoms in spring"
  2. "Modern Tokyo street fashion, young person in Harajuku style clothing"
  3. "Korean traditional hanbok in modern minimalist style illustration"

Tips for Best Results

  • Can handle prompts in multiple languages effectively
  • Excellent for culture-specific imagery
  • Good at combining traditional and modern elements
  • Specify regional artistic styles when needed
  • Works well with detailed scene descriptions

Strengths

  • Best-in-class for Asian language prompts
  • Excellent cultural representation
  • Good at traditional art styles
  • Reliable and consistent output

Limitations

  • May require more specific prompting for Western styles
  • Generation time moderate (10 seconds)

Special Features

  • Accepts prompts in Chinese, Japanese, Korean, and English
  • Understands cultural nuances and symbols
  • Good at generating text in Asian languages

Cost

Estimated at $0.03 per generation