Lidé
Ing. et Ing. Václav Vávra
Všechny publikace
Fundamental matrix estimation using relative depths
- Autoři: Yaqing Ding, Ph.D., Ing. et Ing. Václav Vávra, Bhayani, S., Wu, Q., Yang, J., RNDr. Zuzana Kúkelová, Ph.D.,
- Publikace: Computer Vision – ECCV 2024, Part LXXI. Springer, Cham, 2025. p. 142-159. LNCS. vol. 15129. ISSN 0302-9743. ISBN 978-3-031-73208-9.
- Rok: 2025
- DOI: 10.1007/978-3-031-73209-6_9
- Odkaz: https://doi.org/10.1007/978-3-031-73209-6_9
- Pracoviště: Skupina vizuálního rozpoznávání
-
Anotace:
We propose a novel approach to estimate the fundamental matrix from point correspondences and their relative depths. Relative depths can be approximated from the scales of local features, which are commonly available or can be obtained from non-metric monocular depth estimates provided by popular deep learning-based methods. This makes the considered problem very relevant. To derive efficient solutions, we explore new geometric constraints on the fundamental matrix with known relative depths and present new algebraic constraints between the fundamental matrix and the translation vector. Using point correspondences and their relative depths, we derive novel efficient minimal solvers for two fully uncalibrated cameras, two cameras with different unknown focal lengths, and two cameras with equal unknown focal lengths, respectively. We propose different variants of these solvers based on the source of the relative depth information. We present detailed analyses and comparisons with state-of-the-art solvers, including results with 86, 306 image pairs from three large-scale datasets
RePoseD: Efficient Relative Pose Estimation With Known Depth Information
- Autoři: Yaqing Ding, Ph.D., Kocur, V., Ing. et Ing. Václav Vávra, Berger Haladova, Z., Yan, J., Sattler, T., RNDr. Zuzana Kúkelová, Ph.D.,
- Publikace: ICCV2025: Proceedings of the International Conference on Computer Vision. Anchorage: IEEE Communications Society, 2025. p. 14876-14886. ISSN 1550-5499.
- Rok: 2025
- Pracoviště: Katedra kybernetiky, Skupina vizuálního rozpoznávání
-
Anotace:
Recent advances in monocular depth estimation methods (MDEs) and their improved accuracy open new possibilities for their applications. In this paper, we investigate how monocular depth estimates can be used for relative pose estimation. In particular, we are interested in answering the question whether using MDEs improves results over traditional point-based methods. We propose a novel framework for estimating the relative pose of two cameras from point correspondences with associated monocular depths. Since depth predictions are typically defined up to an unknown scale or even both unknown scale and shift parameters, our solvers jointly estimate the scale or both the scale and shift parameters along with the relative pose. We derive efficient solvers considering different types of depths for three camera configurations: (1) two calibrated cameras, (2) two cameras with an unknown shared focal length, and (3) two cameras with unknown different focal lengths. Our new solvers outperform stateof-the-art depth-aware solvers in terms of speed and accuracy. In extensive real experiments on multiple datasets and with various MDEs, we discuss which depth-aware solvers are preferable in which situation. The code is available at https://github.com/kocurvik/mdrp.
Camera Pose Estimation from Bounding Boxes
- Autoři: Ing. et Ing. Václav Vávra, Sattler, T., RNDr. Zuzana Kúkelová, Ph.D.,
- Publikace: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024). Piscataway: IEEE, 2024. p. 5535-5542. ISSN 2153-0866. ISBN 979-8-3503-7770-5.
- Rok: 2024
- DOI: 10.1109/IROS58592.2024.10801546
- Odkaz: https://doi.org/10.1109/IROS58592.2024.10801546
- Pracoviště: Skupina vizuálního rozpoznávání
-
Anotace:
Visual localization is an important part of many interesting applications, including robotics. The dominant localization strategy is to estimate the camera pose from 2D-3D matches between 2D pixel positions and 3D points. Yet, such approaches can be quite memory intensive and can lead to privacy risks. An interesting alternative to point-based matches is to use higher-level primitives for pose estimation. Consequently, this work investigates using correspondences between 2D and 3D bounding boxes for camera pose estimation. The resulting scene representation is compact and poses fewer privacy risks. In this setting, there are typically orders of magnitude fewer matches available compared to classical feature-based methods. In addition, the available correspondences are significantly more noisy. We investigate multiple strategies based on converting bounding box correspondences to point correspondences and propose a novel and simple 2-point camera absolute pose solver (DP2P) that exploits the fact that the depths of the objects can be approximated from the sizes of their bounding boxes.
DoG Accuracy Via Equivariance: Get The Interpolation Right
- Autoři: Ing. et Ing. Václav Vávra, Mgr. Dmytro Mishkin, Ph.D., prof. Ing. Jiří Matas, Ph.D.,
- Publikace: 2023 IEEE International Conference on Image Processing (ICIP). New York: Institute of Electrical and Electronics Engineers, 2023. p. 136-140. ISBN 978-1-7281-9835-4.
- Rok: 2023
- DOI: 10.1109/ICIP49359.2023.10222153
- Odkaz: https://doi.org/10.1109/ICIP49359.2023.10222153
- Pracoviště: Skupina vizuálního rozpoznávání
-
Anotace:
We study the influence of image interpolation algorithms on local feature detectors operating on a scale pyramid, focusing on the Difference-of-Gaussian, as used in SIFT. We show that commonly used implementations, such as in OpenCV and Kornia, are neither rotational nor scale equivariant. We present a simple solution and demonstrate its positive influence on the downstream image matching tasks. The implementation of the method has been accepted in standard libraries OpenCV and Kornia.