KIT | KIT-Bibliothek | Impressum | Datenschutz

TriBand-BEV: Real-Time LiDAR-Only 3D Pedestrian Detection via Height-Aware BEV and High-Resolution Feature Fusion

Khoshkdahan, Mohammad 1; Vinel, Alexey 1
1 Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB), Karlsruher Institut für Technologie (KIT)

Abstract:

Safe autonomous agents and mobile robots need fast real time 3D perception, especially for vulnerable road users (VRUs) such as pedestrians. We introduce a new bird's eye view (BEV) encoding, which maps the full 3D LiDAR point cloud into a light-weight 2D BEV tensor with three height bands. We explicitly reformulate 3D detection as a 2D detection problem and then reconstruct 3D boxes from the BEV outputs. A single network detects cars, pedestrians, and cyclists in one pass. The backbone uses area attention at deep stages, a hierarchical bidirectional neck over P1 to P4 fuses context and detail, and the head predicts oriented boxes with distribution focal learning for side offsets and a rotated IoU loss. Training applies a small vertical re bin and a mild reflectance jitter in channel space to resist memorization. We use an interquartile range (IQR) filter to remove noisy and outlier LiDAR points during 3D reconstruction.
On KITTI dataset, TriBand-BEV attains 58.7/52.6/47.2 pedestrian BEV AP(%) for easy, moderate, and hard at 49 FPS on a single consumer GPU, surpassing Complex-YOLO, with gains of +12.6%, +7.5%, and +3.1%. Qualitative scenes show stable detection under occlusion. ... mehr


Verlagsausgabe §
DOI: 10.5445/IR/1000194449
Veröffentlicht am 23.06.2026
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2026
Sprache Englisch
Identifikator ISBN: 979-8-4007-2317-9
KITopen-ID: 1000194449
Erschienen in Proceedings of the 25th International Conference on Autonomous Agents and Multiagent Systems
Veranstaltung 25th International Conference on Autonomous Agents and Multiagent Systems (2026), Paphos, Zypern, 25.05.2026 – 29.05.2026
Verlag International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Seiten 1294 - 1303
Vorab online veröffentlicht am 24.05.2026
Externe Relationen Siehe auch
Schlagwörter LiDAR-only 3D Object Detection, Bird’s Eye View Representation,Real-Time Perception, Pedestrian Detection, Autonomous Robotics
Nachgewiesen in Scopus
OpenAlex
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page