Abstract: |
The intelligent monitoring of complex scenes usually requires the adoption of different sensors depending
on the type of application (i.e. radar, sonar, chemical, etc.). From the past few years, monitoring is mainly
represented by visual-surveillance. In this field, the research has proposed great innovation improving the
surveillance from the standard CCTV to modern systems now able to infer behaviors in limited contexts.
Though, when environments allow the creation of complex scenes (i.e. crowds, clutter, etc.) robust solutions
are still far to be available. In particular, one of the major problems is represented by the occlusions that often
limit the performance of the algorithms. As matter of fact, the majority of the proposed visual surveillance
solutions processes the data flow generated by a single camera. These methods fail to correctly localize an
occluded object in the real environment. Stereo vision can be introduced to solve such a limit but the number
of needed sensors would double. Thus, to obtain the benefits of the stereo vision discharging some of its
drawbacks, a novel framework in stereo vision is proposed by adopting the sensors available in common
visual-surveillance networks. In particular, we will focus on the analysis of a stereo vision system which
is build from a pairs of heterogeneous sensors, i.e., static and PTZ cameras with a task to locate objects
accurately. |