# AdaBins: Depth Estimation Using Adaptive Bins

## Reference

> [AdaBins: Depth Estimation Using Adaptive Bins](https://arxiv.org/abs/2011.14141)

## Introduction

<a href="https://github.com/shariqfarooq123/AdaBins">Official Repo</a>

<a href="https://github.com/open-mmlab/mmsegmentation/tree/dev-1.x/projects/Adabins">Code Snippet</a>

## <img src="https://user-images.githubusercontent.com/34859558/190043857-bfbdaf8b-d2dc-4fff-81c7-e0aac50851f9.png" width="25"/> Abstract

We address the problem of estimating a high-quality dense depth map from a single RGB input image. We start out with a baseline encoder-decoder convolutional neural network architecture and pose the question of how the global processing of information can help improve overall depth estimation. To this end, we propose a transformer-based architecture block that divides the depth range into bins whose center value is estimated adaptively per image. The final depth values are estimated as linear combinations of the bin centers. We call our new building block AdaBins. Our results show a decisive improvement over the state-of-the-art on several popular depth datasets across all metrics. We also validate the effectiveness of the proposed block with an ablation study and provide the code and corresponding pre-trained weights of the new state-of-the-art model.

Our main contributions are the following:

- We propose an architecture building block that performs global processing of the scene’s information. We propose to divide the predicted depth range into bins where the bin widths change per image. The final depth estimation is a linear combination of the bin center values.
- We show a decisive improvement for supervised single image depth estimation across all metrics for the two most popular datasets, NYU and KITTI.
- We analyze our findings and investigate different modifications on the proposed AdaBins block and study their effect on the accuracy of the depth estimation.
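
The bin-based depth computation described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the code from the official repository or from this project: the function name `adaptive_bin_depth`, the argument names, the small `eps` constant, and the default depth range are assumptions chosen for the example. It normalizes per-image bin widths, turns them into bin centers inside `[d_min, d_max]`, and forms each pixel's depth as a probability-weighted sum of those centers.

```python
import numpy as np


def adaptive_bin_depth(bin_logits, pixel_logits, d_min=1e-3, d_max=10.0, eps=0.1):
    """Sketch of adaptive-bin depth prediction (hypothetical helper).

    bin_logits:   shape (N,), raw per-image bin-width scores (assumed input).
    pixel_logits: shape (N, H, W), per-pixel scores over the N bins (assumed input).
    Returns (depth of shape (H, W), bin centers of shape (N,)).
    """
    bin_logits = np.asarray(bin_logits, dtype=np.float64)
    pixel_logits = np.asarray(pixel_logits, dtype=np.float64)

    # Non-negative widths, normalized so they sum to 1 for this image.
    widths = np.maximum(bin_logits, 0.0) + eps
    widths = widths / widths.sum()

    # Center of bin i = left edge of bin i + half of its width,
    # rescaled from [0, 1] to [d_min, d_max].
    right_edges = np.cumsum(widths)
    centers = d_min + (d_max - d_min) * (right_edges - widths / 2.0)

    # Softmax over the bin axis gives per-pixel probabilities for each bin.
    shifted = pixel_logits - pixel_logits.max(axis=0, keepdims=True)
    probs = np.exp(shifted)
    probs = probs / probs.sum(axis=0, keepdims=True)

    # Final depth: linear combination of bin centers, weighted per pixel.
    depth = np.tensordot(centers, probs, axes=(0, 0))
    return depth, centers
```

Because the weights are a softmax, every predicted depth is a convex combination of the bin centers and therefore stays inside `[d_min, d_max]`.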

<div align="center">
<img src="https://github.com/open-mmlab/mmsegmentation/assets/15952744/915bcd5a-9dc2-4602-a6e7-055ff5d4889f"  width = "1000" />
</div>

## <img src="https://user-images.githubusercontent.com/34859558/190044217-8f6befc2-7f20-473d-b356-148e06265205.png" width="25"/> Performance

### NYU and KITTI

| Model         | Encoder         | Training epochs | Batch size | Train resolution | δ1    | δ2    | δ3    | REL   | RMS   | RMS log | Params (M) | Links                                                                                                                   |
| ------------- | --------------- | --------------- | ---------- | ---------------- | ----- | ----- | ----- | ----- | ----- | ------- | ---------- | ----------------------------------------------------------------------------------------------------------------------- |
| AdaBins_nyu   | EfficientNet-B5 | 25              | 16         | 416x544          | 0.903 | 0.984 | 0.997 | 0.103 | 0.364 | 0.044   | 78         | [model](https://download.openmmlab.com/mmsegmentation/v0.5/adabins/adabins_efficient_b5_nyu_third-party-f68d6bd3.pth)   |
| AdaBins_kitti | EfficientNet-B5 | 25              | 16         | 352x764          | 0.964 | 0.995 | 0.999 | 0.058 | 2.360 | 0.088   | 78         | [model](https://download.openmmlab.com/mmsegmentation/v0.5/adabins/adabins_efficient-b5_kitty_third-party-a1aa6f36.pth) |
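
For readers unfamiliar with the column names, the sketch below computes these metrics using the definitions commonly used in monocular depth estimation. This README does not define them, so the formulas are an assumption, as is the helper name `depth_metrics`. In particular, the "RMS log" column is computed here as root-mean-square error in natural-log space; some papers report a mean absolute log10 error instead, and the table does not say which convention each row uses.

```python
import numpy as np


def depth_metrics(pred, gt):
    """Common depth-estimation metrics (hypothetical helper, assumed definitions).

    pred, gt: arrays of the same shape holding positive depths.
    Pixels with gt <= 0 are treated as invalid and ignored.
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    valid = gt > 0
    pred, gt = pred[valid], gt[valid]

    # Threshold accuracy: share of pixels with max(pred/gt, gt/pred) < 1.25^k.
    ratio = np.maximum(pred / gt, gt / pred)

    return {
        "delta1": float(np.mean(ratio < 1.25)),
        "delta2": float(np.mean(ratio < 1.25 ** 2)),
        "delta3": float(np.mean(ratio < 1.25 ** 3)),
        # Mean absolute relative error.
        "rel": float(np.mean(np.abs(pred - gt) / gt)),
        # Root-mean-square error, in the same units as the depth maps.
        "rms": float(np.sqrt(np.mean((pred - gt) ** 2))),
        # Root-mean-square error between log depths (natural log assumed).
        "rms_log": float(np.sqrt(np.mean((np.log(pred) - np.log(gt)) ** 2))),
    }
```

Higher is better for the δ columns; lower is better for REL, RMS, and RMS log.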

## Citation

```bibtex
@article{10.1109/cvpr46437.2021.00400,
    author = {Bhat, S. F. and Alhashim, I. and Wonka, P.},
    title = {{AdaBins}: Depth Estimation Using Adaptive Bins},
    journal = {2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    year = {2021},
    doi = {10.1109/cvpr46437.2021.00400}
}
```

Committed on 2023-09-13 17:06: [CodeCamp2023-608] Add Adabins model (#3257)
