
Instruction-based image editing improves the controllability and flexibility of image manipulation via natural commands without elaborate descriptions or regional masks. However, human instructions are sometimes too brief for current methods to capture and follow. Multimodal large language models (MLLMs) show promising capabilities in cross-modal understanding and visual-aware response generation via LMs. We investigate how MLLMs facilitate edit instructions and present MLLM-Guided Image Editing (MGIE). MGIE learns to derive expressive instructions and provides explicit guidance. The editing model jointly captures this visual imagination and performs manipulation through end-to-end training. We evaluate various aspects of Photoshop-style modification, global photo optimization, and local editing. Extensive experimental results demonstrate that expressive instructions are crucial to instruction-based image editing, and our MGIE can lead to a notable improvement in automatic metrics and human evaluation while maintaining competitive inference efficiency.
👇 press the tab for different datasets
数据统计
数据评估
关于[ICLR’24] MGIE特别声明
本站鸟瑞导航提供的[ICLR’24] MGIE数据都来源于网络,不保证外部链接的准确性和完整性,同时,对于该外部链接的指向,不由鸟瑞导航实际控制,在2025年9月10日 下午7:04收录时,该网页上的内容,都属于合法合规,后期网页的内容如出现违规,请联系本站网站管理员进行举报,我们将进行删除,鸟瑞导航不承担任何责任。
相关导航

志设网(www.zs9.com) 是优秀设计师分享设计作品的下载和AI创作平台,独创作品每日收益和收益盲盒领取模式,让会员之间有更多的互动更多的交流。

Leap AI
Create content, generate leads, and run campaigns at scale—just like the big companies do, but with the agility of a small team.

可图AI
Create professional videos and images with Kling AI's state-of-the-art generative AI platform. Our tools support video generation, image creation, and advanced editing capabilities for content creators.

吐司AI绘画
可免费在线生图的 AI 模型分享社区,支持 Stable Diffusion Model & LoRA, ComfyUI Workflow, Tencent Hunyuan-DiT

aigccafe.net
aigccafe.net

巨日禄AI漫画
一站式一键生成AI漫画推文神器,免费体验,免费小说推文授权平台;AI绘画文生图、AI视频文生视频、文本转视频、AI漫画创作平台;自媒体、漫剪、小说漫画推文工具教程
艾绘
艾绘是一家专注于使用AI技术创作儿童绘本创作的平台,结合人工智能技术的绘本创作平台,提供文生图、文生视频、图生图、背景生成和涂鸦绘画等创新工具,让孩子们的想象力得以无限扩展,创作出独特的个性化绘本,提供多样化的故事类型,包括魔法冒险、动物友谊、科普知识、历史传说等,旨在通过寓教于乐的方式,激发孩子们的想象力、创造力和学习兴趣,让孩子们在阅读中学习和成长。

Clip Interrogator
Run open-source machine learning models with a cloud API
暂无评论...




