
Instruction-based image editing improves the controllability and flexibility of image manipulation via natural commands without elaborate descriptions or regional masks. However, human instructions are sometimes too brief for current methods to capture and follow. Multimodal large language models (MLLMs) show promising capabilities in cross-modal understanding and visual-aware response generation via LMs. We investigate how MLLMs facilitate edit instructions and present MLLM-Guided Image Editing (MGIE). MGIE learns to derive expressive instructions and provides explicit guidance. The editing model jointly captures this visual imagination and performs manipulation through end-to-end training. We evaluate various aspects of Photoshop-style modification, global photo optimization, and local editing. Extensive experimental results demonstrate that expressive instructions are crucial to instruction-based image editing, and our MGIE can lead to a notable improvement in automatic metrics and human evaluation while maintaining competitive inference efficiency.
👇 press the tab for different datasets
数据统计
数据评估
关于[ICLR’24] MGIE特别声明
本站鸟瑞导航提供的[ICLR’24] MGIE数据都来源于网络,不保证外部链接的准确性和完整性,同时,对于该外部链接的指向,不由鸟瑞导航实际控制,在2025年9月10日 下午7:04收录时,该网页上的内容,都属于合法合规,后期网页的内容如出现违规,请联系本站网站管理员进行举报,我们将进行删除,鸟瑞导航不承担任何责任。
相关导航

绘蛙-是一款功能强大,简洁好用的智能图片、文案创作平台,并且拥有海量虚拟模特可选择。在绘蛙,你可训练自己的商品模型和模特模型,可通过AI生成商拍图和种草文案,可以创作小红书图片,电商商品主图,跨境电商主图,小红书种草文案,穿搭文案,视频口播文案,可在线一键美图,输入口令修改图片内容,一键换装,一键去水印,一键智能消除,一键换脸,一键高清修复图片。

OPS/OpenPromptStudio
在 Moonvy 月维上在线管理并交付你的设计资源,强大的设计标注与代码生成,支持海量文件格式。无论使用 Sketch、Figma、即时设计、Photoshop 等各种设计工具都有完美的支持

AI 3D Model Generator
Discover our AI 3D Model Generator, perfect for quick, creative 3D designs from text. Ideal for game devs, designers, and 3D print fans. Try now!

即梦Dreamina
即梦AI一站式智能创作平台,即刻造梦。提供AI绘画和AIGC视频创作体验,拥有激发无限创作灵感的社区。让即梦AI开启您的智能创作之旅,探索梦境实现的无限可能!

绘AI
志设网(www.zs9.com) 是优秀设计师分享设计作品的下载和AI创作平台,独创作品每日收益和收益盲盒领取模式,让会员之间有更多的互动更多的交流。

喵呜提示词助手
可对复杂的Midjourney提示词进行可视化编辑和二次修改,自带翻译中文,保存工作区,已帮助众多设计师和AI绘画艺术家提高效率,期待您的使用。

Plask Motion: AI
Plask offers AI motion capture from video, transforming your videos into stunning animations. Dive into our step-by-step guide and learn how to use our motion capture camera for the best results.

PromptFolder
The ultimate AI prompt manager. Build, save, and discover innovative prompts for use in ChatGPT, Midjourney, and other artificial intelligence powered tools.
暂无评论...





