| Zugehörige Institution(en) am KIT | Institut für Fördertechnik und Logistiksysteme (IFL) |
| Publikationstyp | Forschungsdaten |
| Publikationsdatum | 23.06.2026 |
| Erstellungsdatum | 31.03.2026 |
| Identifikator | DOI: 10.35097/qx7b62vnbercxzj9 KITopen-ID: 1000194520 |
| Lizenz | Creative Commons Namensnennung 4.0 International |
| Projektinformation | SeI_MoR (WM_BaWü, BW8_1222) |
| Schlagwörter | intralogistics; mobile robots; semantic mapping; instance segmentation; vision-language models; RGB-LiDAR fusion; open-vocabulary; SLAM; robot perception |
| Liesmich | Recording setup. Data was recorded with a mobile robot equipped with two 2D laser scanners (360° range) and a forward-facing RGB camera. Robot poses and the geometric map were obtained with GMapping (2D SLAM). RGB and laser observations are temporally synchronized, establishing a point-to-pixel correspondence between geometric and visual data. The full exploration run was subsampled by motion thresholds (30 cm / 15°) to 74 frames in a single controlled environment. Images: PNG, 768 × 480, RGB, lens-undistorted. Pixel origin top-left; u = column, v = row. Software / reuse. All annotations are plain JSON and images are standard PNG; no proprietary software is required. Files can be parsed with any JSON library (e.g. Python json) and inspected with standard image tools or NumPy/OpenCV/Matplotlib. Pixel coordinates index directly into the corresponding undistorted image, and laser (u, v) values map laser returns into the same image plane. The dataset supports tasks such as instance segmentation, multi-view object association, geometric/semantic mapping, and benchmarking vision-language models for intralogistics perception. A detailed README.md (including the two evaluated VLM prompts) is included in the dataset. |
| Art der Forschungsdaten | Dataset |