News
Hybrid Augmented Model for Detecting Pedestrian Attention
Abstract
Pedestrian safety on the road largely depends on how accurately a pedestrian perceives the traffic situation and how reliably a vehicle can assess the pedestrian’s level of situational awareness. Estimating pedestrian gaze and attention in real-world street environments is therefore an important but challenging problem due to small face resolution, motion blur, occlusions, and extreme illumination variations. In previous studies, face-based gaze estimation and pupil tracking were shown to provide fine-grained attention cues under favorable visual conditions. However, these approaches remain fragile in unconstrained street scenes, where eye visibility and facial landmark detection frequently fail, leading to unreliable or missing outputs in safety-critical situations. In this paper, we present a robust hybrid framework that extends our earlier work by introducing a pipeline of pedestrian attention estimation methods with explicit error control at each stage to enable continuous gaze direction estimation under complex urban conditions. In addition to a learning-based gaze estimator, we incorporate geometric methods and a lightweight head pose estimation network to compensate for the breakdown of individual methods under noisy input data. The proposed framework integrates face detection, gaze estimation, geometric head pose computation, and learning-based head pose prediction, including the prediction of head position and orientation at the next time step. Unlike prior gaze-centric systems, neural networks in this design are not only used to improve accuracy but are elevated to ensure continuous video processing under real-world visual degradation associated with complex traffic scenarios. Extensive experiments on open datasets collected in real street environments demonstrate that the proposed method substantially enhances the success rate and accuracy of long-distance gaze recognition, thereby improving the feasibility of end-to-end assessment of pedestrian situational awareness while maintaining accuracy comparable to gaze estimation under favorable conditions.
Keywords
Edition
Proceedings of the Institute for System Programming, vol. 38, issue 3, part 3, 2026, pp. 115-124
ISSN 2220-6426 (Online), ISSN 2079-8156 (Print).
DOI: 10.15514/ISPRAS-2026-38(3)-39
For citation
Full text of the paper in pdf
Back to the contents of the volume