What is ICEdit?
ICEdit (In-Context Edit) is an innovative framework for instruction-based image editing that transforms how we modify images using natural language commands. This efficient solution allows users to edit images by simply describing what changes they want in plain English.
Edit images with natural language instructions
No technical skills required
Fast, efficient, and user-friendly

Key Features
Discover how ICEdit transforms image editing with these powerful features designed for creators at every skill level
Efficient Image Editing
Achieves state-of-the-art results with only 0.5% of the training data and 1% of the parameters required by prior methods.
Instruction-Based Editing
Enables robust image modification via natural language prompts, ensuring high precision and efficiency.
In-Context Generation
Leverages a large-scale Diffusion Transformer (DiT) for enhanced generation capacity and native contextual awareness.
Hybrid Tuning Strategy
Combines LoRA and MoE for efficient adaptation and dynamic expert routing, ensuring flexibility without extensive retraining.
Early Filter Inference-Time Scaling
Uses vision-language models to select better initial noise early, improving edit quality.
Open-Source and Cost-Effective
Fully open source, with lower costs, fast processing (about 9 seconds per image), and strong performance.
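The hybrid tuning strategy listed above can be illustrated with a toy layer. This is a minimal sketch and not ICEdit's actual implementation: the class name `LoRAMoELinear`, the softmax router, the top-k selection, and the zero-initialised up-projections are all assumptions chosen to show the general idea of a frozen weight plus routed low-rank updates.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(logits):
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class LoRAMoELinear:
    """Hypothetical layer: frozen linear weight plus a routed mixture of LoRA experts."""

    def __init__(self, d_in, d_out, rank=2, num_experts=4, top_k=1, seed=0):
        rng = random.Random(seed)
        self.top_k = top_k
        self.base = rand_matrix(d_out, d_in, rng)          # frozen pretrained weight
        self.router = rand_matrix(num_experts, d_in, rng)  # trainable gating weights
        # Each expert is a low-rank pair: down (rank x d_in) and up (d_out x rank).
        self.down = [rand_matrix(rank, d_in, rng) for _ in range(num_experts)]
        # Up-projections start at zero, so the layer initially matches the base output.
        self.up = [[[0.0] * rank for _ in range(d_out)] for _ in range(num_experts)]

    def gates(self, x):
        """Return {expert_index: weight} for the top-k experts, renormalised to sum to 1."""
        probs = softmax(matvec(self.router, x))
        ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
        keep = ranked[: self.top_k]
        norm = sum(probs[i] for i in keep)
        return {i: probs[i] / norm for i in keep}

    def forward(self, x):
        out = matvec(self.base, x)
        for i, gate in self.gates(x).items():
            delta = matvec(self.up[i], matvec(self.down[i], x))  # low-rank update
            out = [o + gate * d for o, d in zip(out, delta)]
        return out
```

For example, `LoRAMoELinear(d_in=4, d_out=3, top_k=2).forward([1.0, 0.5, -0.5, 2.0])` returns a 3-element vector. Only the router and the small expert matrices would be trained in such a design, which is why the adapted parameter count stays a small fraction of the base model.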
How ICEdit Works
ICEdit leverages large-scale Diffusion Transformer (DiT) technology with three key innovations:
In-Context Editing Framework
Uses in-context prompting to understand and apply edits without requiring structural changes to the model.
LoRA-MoE Hybrid Tuning
Combines LoRA (Low-Rank Adaptation) with Mixture-of-Experts routing to enable flexible and efficient adaptation.
Inference-Time Scaling
Employs vision-language models to select better initial noise patterns, significantly improving edit quality.
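The in-context prompting idea above can be sketched in a few lines. This is an illustration under stated assumptions rather than ICEdit's published code: the diptych-style template wording, the helper names `build_in_context_prompt` and `build_diptych`, and the use of a right-half mask for a fill-style model are all hypothetical. The point is that the edit is posed as "regenerate the right panel of a side-by-side image", so the model architecture itself stays unchanged.

```python
# Assumed template wording; treat it as a placeholder, not the official prompt.
DIPTYCH_TEMPLATE = (
    "A diptych with two side-by-side images of the same scene. "
    "On the right, the scene is exactly the same as on the left but {instruction}."
)


def build_in_context_prompt(instruction):
    """Embed an edit instruction in a diptych-style prompt."""
    cleaned = instruction.strip().rstrip(".")
    if not cleaned:
        raise ValueError("instruction must not be empty")
    # Lower-case the first letter so the instruction reads as a clause.
    return DIPTYCH_TEMPLATE.format(instruction=cleaned[0].lower() + cleaned[1:])


def build_diptych(image):
    """Return (canvas, mask) for an image given as a list of pixel rows.

    The canvas repeats the source image side by side; the mask marks the
    right half (value 1) as the region a fill-style model would regenerate.
    """
    canvas = [row + row for row in image]
    mask = [[0] * len(row) + [1] * len(row) for row in image]
    return canvas, mask
```

A quick usage example with a 2x2 toy image:

```python
prompt = build_in_context_prompt("Make the background blue")
canvas, mask = build_diptych([[10, 20], [30, 40]])
```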
ICEdit: Technical Highlights
In-Context Diffusion Transformer
ICEdit leverages a large-scale Diffusion Transformer (DiT) backbone, enabling context-aware image editing that preserves the integrity and style of the original image.
LoRA-MoE Hybrid Tuning
Combines Low-Rank Adaptation (LoRA) with Mixture-of-Experts (MoE) routing for efficient, flexible adaptation to diverse editing instructions—without extensive retraining.
Inference-Time Scaling
Utilizes vision-language models to select optimal initial noise, improving edit quality and instruction following, while maintaining fast processing (about 9 seconds per image).
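The inference-time scaling step above can be sketched as a simple selection loop. This is a minimal sketch under assumptions, not the project's actual procedure: `early_filter_select`, `toy_preview`, and `toy_judge` are hypothetical names, and scoring each candidate independently with a single number is an assumed simplification of how a vision-language model might rank early previews.

```python
import random
from typing import Callable, List, Sequence, Tuple


def early_filter_select(
    seeds: Sequence[int],
    preview: Callable[[int], object],
    judge: Callable[[object], float],
) -> Tuple[int, List[Tuple[int, float]]]:
    """Score a cheap few-step preview per seed; return the best seed and all scores."""
    if not seeds:
        raise ValueError("at least one seed is required")
    scored = [(seed, judge(preview(seed))) for seed in seeds]
    best_seed = max(scored, key=lambda pair: pair[1])[0]
    return best_seed, scored


# Toy stand-ins so the sketch runs without a diffusion model or a VLM.
def toy_preview(seed: int) -> float:
    """Pretend a few denoising steps from this seed yield one 'quality' number."""
    return random.Random(seed).random()


def toy_judge(preview_value: float) -> float:
    """Pretend a vision-language model rates how well the preview follows the instruction."""
    return preview_value
```

In use, only the winning seed would go on to the full-length denoising run:

```python
best, scores = early_filter_select(range(6), toy_preview, toy_judge)
```

The design choice is to spend a little extra compute on short previews so that the expensive full run starts from promising initial noise.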
ICEdit Demo Gallery
Explore real-world examples of how ICEdit transforms images across a range of content types
How to Use ICEdit on Hugging Face
ICEdit makes image editing as simple as writing a sentence. With this powerful AI tool, you can transform your photos in seconds—no design skills required. Here’s how to get started:
1. Visit the ICEdit Space
Go to the ICEdit Space at huggingface.co/spaces/RiverZ/ICEdit. The interface is clean and beginner-friendly.
2. Upload Your Image
Click the "Upload image for editing" button on the left and choose any photo from your device.
3. Describe Your Edit
In the text box at the bottom, type what you want to change. Be specific but simple. For example:
- Change the sunglasses to a Christmas hat
- Make the background blue
- Add flowers to the scene
4. Click "Run"
Hit the "Run" button and wait a few seconds. ICEdit processes an image in about 9 seconds.
5. View Your Result
Your edited image appears on the right. If you’re happy with it, download it using the button in the top right corner.
Frequently Asked Questions
Find answers to common questions about ICEdit's features, requirements, and capabilities
What is ICEdit?
ICEdit is an AI tool that lets users transform images with simple text instructions. It focuses on in-context image editing, meaning it takes the context of your image into account when applying an edit.
How do I use ICEdit?
Tell ICEdit what you want to change in natural language. You describe the desired change in the text-to-image editing interface, and ICEdit handles the technical details. To use the demo, upload your image or choose one from the examples, type your instruction, and click "Generate".
What is in-context image editing?
In-context image editing means that ICEdit analyzes your entire image to understand its visual context before performing any edits. Edits therefore respect the original composition, lighting, and style, producing natural-looking results that preserve the image's integrity. Unlike traditional editors, ICEdit doesn't treat each pixel independently.
What powers ICEdit?
ICEdit is built on an efficient AI image editing engine. Key features include seamless in-context editing and preservation, a hybrid tuning strategy called LoRA-MoE, and faster editing through inference-time scaling. It also has an intuitive text-to-image editing interface.
What is LoRA-MoE?
LoRA-MoE stands for Low-Rank Adaptation with Mixture of Experts. It is ICEdit's hybrid tuning strategy, which combines the efficiency of low-rank adaptations with the versatility of specialized expert models. This lets ICEdit handle diverse image editing tasks with precision and adaptability while keeping computational overhead low.
How does ICEdit compare to traditional editors?
ICEdit uses simple text instructions instead of complex tools and menus. Its learning curve is minimal: you only describe what you want, whereas traditional editors require technical knowledge. Complex edits that take multiple steps in a traditional editor can be achieved with a single instruction. ICEdit is also context-aware and automatically preserves image coherence, unlike traditional tools that treat pixels independently and may need manual adjustment to look natural.
How does ICEdit compare to commercial models?
Compared to commercial models such as Gemini and GPT-4o, ICEdit offers comparable or even superior results in character ID preservation and instruction following. It processes each image in about 9 seconds. It is also fully open source and costs less to run.
Is ICEdit open source?
Yes, ICEdit is an open-source AI project that invites collaboration. You can try the live demo hosted on Hugging Face. More technical details and the underlying research can be found in the research paper on arxiv.org, and the source code is available in the GitHub repository maintained by River Zhang.