PDF(4786 KB)
Overview of binocular stereo vision processor for robot navigation
CHEN Zhuoyu, AN Fengwei
Integrated Circuits and Embedded Systems ›› 2024, Vol. 24 ›› Issue (11) : 15-28.
PDF(4786 KB)
PDF(4786 KB)
Overview of binocular stereo vision processor for robot navigation
With the rapid development of the robotics industry, robotic technology has emerged as a new driving force for enhancing productivity, particularly highlighting the importance of technologies such as 3D reconstruction and obstacle avoidance navigation. However, active 3D imaging technologies based on Time of Flight (ToF) and structured light suffer from limitations such as low resolution, lack of original color information, and and susceptibility to ambient light interference, leading to suboptimal performance. Therefore, passive binocular stereo vision sensors, which can output dense depth and color information (RGB-D) in real-time, have been widely applied in fields such as autonomous robots, automobiles, and drones. Nonetheless, binocular stereo vision technology, which calculates disparity by mimicking human binocular vision for depth information, is computationally intensive and reliant on general-purpose computing platforms. This results in high energy consumption and latency for binocular stereo vision processors, limiting the technology's application in high-speed scenarios, small robots and edge computing. In recent years, binocular stereo vision processors integrated with hardware accelerators for stereo vision algorithms have gained significant attention in both academia and industry. This article systematically explains the theoretical foundation of binocular 3D stereo vision and its application examples in robotic stereo vision in the first section. It then introduces the structural components of binocular stereo vision processors, including core parts such as image acquisition, camera calibration and correction, and stereo matching. For the convenience of stereo vision hardware developers, this paper reviews the basic concepts, research status, challenges, and future trends based on the core components of the binocular stereo vision system, with a special focus on comparing new hardware computing architectures.
robots / stereo vision / visual obstacle navigation / image signal processor / hardware architecture / hardware acceleration
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
DA SILVEIRA T L T,
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
章毓晋. 图像工程(下册)——图像理解[M]. 4版. 北京: 清华大学出版社, 2018.
|
| [20] |
章毓晋. 3D计算机视觉:原理、算法及应用[M]. 北京: 电子工业出版社, 2021.
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
The assumption that epipolar lines are parallel to image scan lines is made in many algorithms for stereo analysis. If valid, it enables the search for corresponding image features to be confined to one dimension and, hence, simplified. An algorithm that generates a vertically aligned stereo pair by warped resampling is described. The method uses grey scale image matching between the components of the stereo pair but confined to feature points.
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
|
| [57] |
|
| [58] |
|
| [59] |
|
| [60] |
|
| [61] |
|
| [62] |
|
/
| 〈 |
|
〉 |