Instruction-Based Image Editing

What is ICEdit?

ICEdit (In-Context Edit) is an innovative framework for instruction-based image editing that transforms how we modify images using natural language commands. This efficient solution allows users to edit images by simply describing what changes they want in plain English.

Edit images with natural language instructions

No technical skills required

Fast, efficient, and user-friendly

ICEdit demo showcasing instruction-based image editing
NATURAL LANGUAGE
Powerful Capabilities

Key Features

Discover how ICEdit transforms video editing with these powerful features designed for creators at every skill level

Efficient Image Editing

Achieves state-of-the-art results with only 0.5% of the training data and 1% of the parameters required by prior methods.

Instruction-Based Editing

Enables robust image modification via natural language prompts, ensuring high precision and efficiency.

In-Context Generation

Leverages large-scale Diffusion Transformer for enhanced generation capacity and native contextual awareness.

Hybrid Tuning Strategy

Combines LoRA and MoE for efficient adaptation and dynamic expert routing, ensuring flexibility without extensive retraining.

Early Filter Inference-Time Scaling

Uses vision-language models to select better initial noise early, improving edit quality.

Open-Source and Cost-Effective

Offers a more open-source approach with lower costs, faster processing speed (9 seconds per image), and powerful performance.

Simple & Powerful

How ICEdit Works

How ICEdit Works?

ICEdit leverages large-scale Diffusion Transformer (DiT) technology with three key innovations:

In-Context Editing Framework

Uses in-context prompting to understand and apply edits without requiring structural changes to the model.

LoRA-MoE Hybrid Tuning

Combines LoRA (Low-Rank Adaptation) with Mixture-of-Experts routing to enable flexible and efficient adaptation.

Inference-Time Scaling

Employs vision-language models to select better initial noise patterns, significantly improving edit quality.

ICEdit: Technical Highlights

In-Context Diffusion Transformer

ICEdit leverages a large-scale Diffusion Transformer (DiT) backbone, enabling context-aware image editing that preserves the integrity and style of the original image.

LoRA-MoE Hybrid Tuning

Combines Low-Rank Adaptation (LoRA) with Mixture-of-Experts (MoE) routing for efficient, flexible adaptation to diverse editing instructions—without extensive retraining.

Inference-Time Scaling

Utilizes vision-language models to select optimal initial noise, improving edit quality and instruction following, while maintaining fast processing (about 9 seconds per image).

100% Open Source  |  Try the live demo on Hugging Face  |  Read the Paper
Visual Showcase

ICEdit Demo Gallery

Explore real-world examples of how ICEdit transforms video backgrounds across various industries and content types

ICEdit demo image
ICEdit demo image
ICEdit demo image
ICEdit demo image
ICEdit demo image
ICEdit demo image
ICEdit demo image
ICEdit demo image
Powerful Capabilities

How to Use ICEdit on Huggingface?

ICEdit makes image editing as simple as writing a sentence. With this powerful AI tool, you can transform your photos in seconds—no design skills required. Here’s how to get started:

  1. 1

    Visit the ICEdit Space

    Go to IC-Edit space: huggingface.co/spaces/RiverZ/ICEdit. The interface is clean and beginner-friendly.

  2. 2

    Upload Your Image

    Click the "Upload image for editing" button on the left. Choose any photo from your device to modify.

  3. 3

    Describe Your Edit

    In the text box at the bottom, type what you want to change. Be specific but simple! For example:

    • Change the sunglasses to a Christmas hat
    • Make the background blue
    • Add flowers to the scene
  4. 4

    Click "Run"

    Hit the "Run" button and wait a few seconds. ICEdit processes images in about 9 seconds—much faster than most AI editors.

  5. 5

    View Your Result

    Your edited image appears on the right. If you’re happy, download it using the button in the top right corner.

Got Questions?

Frequently Asked Questions

Find answers to common questions about ICEdit's features, requirements, and capabilities

ICEdit is described as the future of AI-powered image editing. It is an advanced AI tool that allows users to transform images using simple text instructions. It focuses on in-context image editing, meaning it understands the context of your image.

You use ICEdit by simply telling it what you want to change using natural language instructions. There is an intuitive text-to-image editing interface where you describe the desired change, and ICEdit handles the technical details. To use the demo, you would upload your image or choose one from examples, type your instruction, and click "Generate".

In-context image editing means that ICEdit understands the context of your image. It analyzes your entire image to understand its visual context before performing any edits. This approach ensures that edits respect the original composition, lighting, and style, resulting in natural-looking transformations that maintain the image's integrity and preserve context. Unlike traditional editors, it doesn't treat each pixel independently.

ICEdit is powered by advanced AI and utilizes an efficient image editing engine. Key features include seamless in-context editing & preservation, an innovative Hybrid Tuning Strategy called LoRA-MoE, and optimized performance for faster editing through inference scaling. It also has an intuitive text-to-image editing interface.

LoRA-MoE stands for Low-Rank Adaptation - Mixture of Experts. It is ICEdit's proprietary/innovative Hybrid Tuning Strategy. This approach combines the efficiency of low-rank adaptations with the versatility of specialized expert models. This strategy allows ICEdit to handle diverse image editing tasks with exceptional precision and adaptability, while also minimizing computational overhead.

Compared to traditional editors, ICEdit uses simple text instructions instead of complex tools and menus. It has a minimal learning curve, only requiring you to describe what you want, whereas traditional editors require technical knowledge. ICEdit can achieve complex edits with simple instructions, while complex edits in traditional editors require multiple steps. ICEdit also has context awareness and automatically preserves image coherence, unlike traditional tools that treat pixels independently and may require manual adjustments for natural results.

When compared to commercial models like Gemini and GPT-4O, ICEdit offers comparable or even superior results in character ID preservation and instruction following. It processes each image relatively quickly, in about 9 seconds. It is also highlighted as being fully open-source and having lower costs.

Yes, ICEdit is an open-source AI project that invites collaboration. You can experience ICEdit firsthand by trying the live demo hosted on Hugging Face. More technical details and the research behind it can be found in the Research Paper on arxiv.org, and the source code is available on the GitHub Repository maintained by River Zhang