The present Multi-view stereo (MVS) methods with supervised learning-based networks have an impressive performance comparing with traditional MVS methods. However, the ground-truth depth maps for training are hard to be obtained and are within limited kinds of scenarios. In this paper, we propose a novel unsupervised multi-metric MVS network, named M3VSNet, for dense point cloud reconstruction without any supervision. To improve the robustness and completeness of point cloud reconstruction, we propose a novel multi-metric loss function that combines pixel-wise and feature-wise loss function to learn the inherent constraints from different perspectives of matching correspondences. Besides, we also incorporate the normal-depth consistency in the 3D point cloud format to improve the accuracy and continuity of the estimated depth maps. Experimental results show that M3VSNet establishes the state-of-the-arts unsupervised method and achieves comparable performance with previous supervised MVSNet on the DTU dataset and demonstrates the powerful generalization ability on the Tanks and Temples benchmark with effective improvement.
Please cite:
@inproceedings{huang2021m3vsnet,
title={M3VSNet: Unsupervised multi-metric multi-view stereo network},
author={Huang, Baichuan and Yi, Hongwei and Huang, Can and He, Yijia and Liu, Jingbin and Liu, Xiao},
booktitle={2021 IEEE International Conference on Image Processing (ICIP)},
pages={3163--3167},
year={2021},
organization={IEEE}
}
- python 3.6.9
- pytorch 1.0.1
- CUDA 10.1 cudnn 7.5.0
The conda environment is listed in requirements.txt
- Download the preprocessed DTU training data (Fixed training cameras, from Original MVSNet,or the Baiduyun link, the password is mo8w ), and upzip it as the
MVS_TRANING
folder - in
train.sh
, setMVS_TRAINING
as your training data path - create a logdir called
checkpoints
- Train MVSNet:
./train.sh
- Download the preprocessed test data DTU testing data (from Original MVSNet, or the Baiduyun link, the password is mo8w ) and unzip it as the
DTU_TESTING
folder, which should contain onecams
folder, oneimages
folder and onepair.txt
file. - in
test.sh
, setDTU_TESTING
as your testing data path andCKPT_FILE
as your checkpoint file. You can find some models in the /checkpoint/. You can use the trained models to test your image. - Test MVSNet:
./test.sh
Acc. | Comp. | Overall. | |
---|---|---|---|
MVSNet(D=196) | 0.444 | 0.741 | 0.592 |
Unsup_MVS | 0.881 | 1.073 | 0.977 |
MVS2 | 0.760 | 0.515 | 0.637 |
M3VSNet(D=192) | 0.636 | 0.531 | 0.583 |
The best unsupervised MVS network until April 17, 2020. See the leaderboard .
Thanks for the funding from Megvii Technology Limited. We acknowledge the following repositories MVSNet and MVSNet_pytorch.
Happy to be acknowledgemented by the AAAI 2020 paper.