Introduction

Professional AI image editing platform powered by Qwen large model

What is Qwen Image Edit?

Qwen Image Edit is a professional AI image editing platform powered by Alibaba Cloud's Qwen large model. Built upon the 20B Qwen-Image model, it successfully extends Qwen-Image's unique text rendering capabilities to image editing tasks, enabling precise text editing and advanced image manipulation.

Key Features

Semantic and Appearance Editing

Qwen-Image-Edit supports both:

  • Low-level visual appearance editing: Adding, removing, or modifying elements while keeping all other regions completely unchanged
  • High-level visual semantic editing: IP creation, object rotation, and style transfer with overall pixel changes while maintaining semantic consistency

Precise Text Editing

Supports bilingual (Chinese and English) text editing, allowing direct addition, deletion, and modification of text in images while preserving the original font, size, and style.

Advanced Image Operations

From photorealistic scenes to impressionist paintings, from anime aesthetics to minimalist design, the model adapts fluidly to creative prompts. Advanced operations include:

  • Style transfer
  • Object insertion or removal
  • Detail enhancement
  • Text editing within images
  • Human pose manipulation

How It Works

Qwen-Image-Edit simultaneously feeds the input image into:

  1. Qwen2.5-VL for visual semantic control
  2. VAE Encoder for visual appearance control

This dual approach achieves capabilities in both semantic and appearance editing, bringing professional-grade editing within reach of everyday users.

References