
Depth Anything
This work presents Depth Anything, a highly practical solution for robust monocular depth estimation. Without pursuing novel technical modules, we aim to build a simple yet powerful foundation model dealing with any images under any circumstances. To this end, we scale up the dataset by designing a data engine to collect and automatically annotate large-scale unlabeled data (~62M), which significantly enlarges the data coverage and thus is able to reduce the generalization error. We investigate two simple yet effective strategies that make data scaling-up promising. First, a more challenging optimization target is created by leveraging data augmentation tools. It compels the model to actively seek extra visual knowledge and acquire robust representations. Second, an auxiliary supervision is developed to enforce the model to inherit rich semantic priors from pre-trained encoders. We evaluate its zero-shot capabilities extensively, including six public datasets and randomly captured photos. It demonstrates impressive generalization ability. Further, through fine-tuning it with metric depth information from NYUv2 and KITTI, new SOTAs are set. Our better depth model also results in a much better depth-conditioned ControlNet. All models have been released.
We thank the MagicEdit team for providing some video examples for video depth estimation, and Tiancheng Shen for evaluating the depth maps with MagicEdit. The middle video is generated by MiDaS-based ControlNet, while the last video is generated by Depth Anything-based ControlNet.
数据统计
数据评估
关于Depth Anything特别声明
本站鸟瑞导航提供的Depth Anything数据都来源于网络,不保证外部链接的准确性和完整性,同时,对于该外部链接的指向,不由鸟瑞导航实际控制,在2025年9月10日 下午7:03收录时,该网页上的内容,都属于合法合规,后期网页的内容如出现违规,请联系本站网站管理员进行举报,我们将进行删除,鸟瑞导航不承担任何责任。
相关导航

AI时代必备的全能AI工具,为用户提供全方位AI服务,如AI绘图、AI写作、AI在线图片处理。提供海量图片制作设计模板:电商图,logo设计,证件照,智能抠图,图片高清修复,一键去水印,一键换背景等各种应用场景。新手小白也能轻松玩转AI。

SceneXplain
SceneXplain - Leading AI Solution for Image Captions and Video Summaries

Artefacts
Artefacts is a 3D AI toolkit that enables users to effortlessly transform text or 2D images into 3D assets. Unleash your creativity with Artefacts - the future of 3D content creation.

51建模网
51建模网是深圳积木易搭科技技术有限公司旗下3D数据服务平台,包含3D建模业务对接与制作分发,3D模型数据云存储与调用展示,提供真正的一站式整体解决方案,加快推动各地区各行各业的3D数字化技术应用.

喵呜提示词助手
可对复杂的Midjourney提示词进行可视化编辑和二次修改,自带翻译中文,保存工作区,已帮助众多设计师和AI绘画艺术家提高效率,期待您的使用。

Arthub ai
Arthub.ai is a creative community for showcasing, discovering and creating AI generated art.

Holopix AI
AI一键生成游戏角色/场景/三视图,3分钟完成3D建模转换!游戏美术创作效率提升70%!注册立享欧美卡通、二次元、Q版、国风等10000+独家游戏风格模型,10万+游戏团队选用的低门槛、高可控AI设计解决方案!

StartAI画图软件官网
StartAI绘画软件是由StartAI推出的一款Photoshop的AI画图和图像处理PS插件,StartAI绘画软件提供一系列强大的AI画图和绘画功能,该PS插件能帮助设计师轻松完成图像处理实现高效。
暂无评论...





