
Instruction-based image editing improves the controllability and flexibility of image manipulation via natural commands without elaborate descriptions or regional masks. However, human instructions are sometimes too brief for current methods to capture and follow. Multimodal large language models (MLLMs) show promising capabilities in cross-modal understanding and visual-aware response generation via LMs. We investigate how MLLMs facilitate edit instructions and present MLLM-Guided Image Editing (MGIE). MGIE learns to derive expressive instructions and provides explicit guidance. The editing model jointly captures this visual imagination and performs manipulation through end-to-end training. We evaluate various aspects of Photoshop-style modification, global photo optimization, and local editing. Extensive experimental results demonstrate that expressive instructions are crucial to instruction-based image editing, and our MGIE can lead to a notable improvement in automatic metrics and human evaluation while maintaining competitive inference efficiency.
👇 press the tab for different datasets
数据统计
数据评估
关于[ICLR’24] MGIE特别声明
本站鸟瑞导航提供的[ICLR’24] MGIE数据都来源于网络,不保证外部链接的准确性和完整性,同时,对于该外部链接的指向,不由鸟瑞导航实际控制,在2025年9月10日 下午7:04收录时,该网页上的内容,都属于合法合规,后期网页的内容如出现违规,请联系本站网站管理员进行举报,我们将进行删除,鸟瑞导航不承担任何责任。
相关导航

ice.js 3 lite scaffold

堆友AI绘画 – 免费
堆友是Alibaba Design打造的设计师全成长周期服务平台,围绕品质、效率、技能、成就、收入五大用户价值布局平台能力,全力服务设计师,旨在成为设计师的好朋友。
堆友历经大厂设计师团队多轮打磨雕刻,集海量高品质3D素材、实时在线渲染、多元场景功能应用、轻便好学易上手等多重优势于一身的设计神器,更自带免费可商用属性,为专业设计师、运营工友、学生小白、社交达人提供了一个零成本的在线设计站点和资源库。

a1
As a free online AI image generator, a1 allows you to easily build and discover image filters, creating your own stunning AI art with just a click. Start free now!

Nvidia·GET3D
NVIDIA 发明了 GPU,并推动了 AI、HPC、游戏、创意设计、自动驾驶汽车和机器人开发领域的进步。

CSM — The fastest way to create 3D with AI
Common Sense Machines builds industry-leading 3D generative-AI models that transform images, text, and sketches into game-ready 3D assets and worlds. Trusted by world leading game studios, product designers and industrial designers.

Hyper3d.
AI-3D生成--Create professional 3...

Manga Translator
The best Manga Translator extension! Scan manga/comic/manhua/manhwa translator online,suport 135 languages manga translate use chatgpt,MangaMTL's Perfect Replacement!

腾讯混元3D
AI-3D生成--腾讯混元3D AI创作引擎基于腾讯混元3D...
暂无评论...


