
Instruction-based image editing improves the controllability and flexibility of image manipulation via natural commands without elaborate descriptions or regional masks. However, human instructions are sometimes too brief for current methods to capture and follow. Multimodal large language models (MLLMs) show promising capabilities in cross-modal understanding and visual-aware response generation via LMs. We investigate how MLLMs facilitate edit instructions and present MLLM-Guided Image Editing (MGIE). MGIE learns to derive expressive instructions and provides explicit guidance. The editing model jointly captures this visual imagination and performs manipulation through end-to-end training. We evaluate various aspects of Photoshop-style modification, global photo optimization, and local editing. Extensive experimental results demonstrate that expressive instructions are crucial to instruction-based image editing, and our MGIE can lead to a notable improvement in automatic metrics and human evaluation while maintaining competitive inference efficiency.
👇 press the tab for different datasets
数据统计
数据评估
关于[ICLR’24] MGIE特别声明
本站鸟瑞导航提供的[ICLR’24] MGIE数据都来源于网络,不保证外部链接的准确性和完整性,同时,对于该外部链接的指向,不由鸟瑞导航实际控制,在2025年9月10日 下午7:04收录时,该网页上的内容,都属于合法合规,后期网页的内容如出现违规,请联系本站网站管理员进行举报,我们将进行删除,鸟瑞导航不承担任何责任。
相关导航

Rapidly integrate authentication and authorization for web, mobile, and legacy applications so you can focus on your core business.

Plask Motion: AI
Plask offers AI motion capture from video, transforming your videos into stunning animations. Dive into our step-by-step guide and learn how to use our motion capture camera for the best results.

喵呜提示词助手
可对复杂的Midjourney提示词进行可视化编辑和二次修改,自带翻译中文,保存工作区,已帮助众多设计师和AI绘画艺术家提高效率,期待您的使用。

Civitai社区
Explore thousands of high-quality Stable Diffusion & Flux models, share your AI-generated art, and engage with a vibrant community of creators

Hyper3d.
AI-3D生成--Create professional 3...

字加AI
AI文像是一个AI创意设计平台,集成文生图,智能推荐字体等AI创作能力,将设计流程简化为 “输入需求 + 一键生成”,高效完成创意输出。

Raphael AI
Raphael AI, the world's first completely free and unlimited AI Image Generator powered by FLUX.1-Dev model. No registration required, superior image quality.

巨日禄AI漫画
一站式一键生成AI漫画推文神器,免费体验,免费小说推文授权平台;AI绘画文生图、AI视频文生视频、文本转视频、AI漫画创作平台;自媒体、漫剪、小说漫画推文工具教程
暂无评论...