KIT | KIT-Bibliothek | Impressum | Datenschutz

A Novel Correspondence Model for Linking Objects and Texts in Construction Plans

Hong, Shuwei 1; Landgraf, Steven ORCID iD icon 1; Hillemann, Markus ORCID iD icon 1; Ulrich, Markus ORCID iD icon 1
1 Institut für Photogrammetrie und Fernerkundung (IPF), Karlsruher Institut für Technologie (KIT)

Abstract:

Construction plans integrate visual and textual information that is essential for construction projects. However, the huge diversity of formats of these plans poses challenges for automated analysis. This paper presents a novel correspondence model that links objects and texts in construction plans, providing a unified approach to interpreting various formats, such as scanned blueprints, CAD drawings, and digital construction documents. Leveraging deep-learning-based object detection and text recognition techniques, our model establishes semantic correspondences between visual and textual elements. We integrate CLIP-based models with ViT-based encoders as part of our approach to enhance feature extraction and correspondence learning. By employing a threshold-based determination, our model effectively resolves cases where a single text passage may describe multiple objects or where a single object is referenced by multiple pieces of text. This capability enables the model to establish robust correspondences between objects and texts, laying a strong foundation for subsequent semantic understanding and information extraction. We evaluate its effectiveness on labeled datasets and demonstrate that our model achieves high precision, recall, F1-score, and accuracy. ... mehr


Verlagsausgabe §
DOI: 10.5445/IR/1000183575
Veröffentlicht am 29.07.2025
Originalveröffentlichung
DOI: 10.5194/isprs-archives-XLVIII-G-2025-597-2025
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Photogrammetrie und Fernerkundung (IPF)
Publikationstyp Zeitschriftenaufsatz
Publikationsjahr 2025
Sprache Englisch
Identifikator ISSN: 2194-9034
KITopen-ID: 1000183575
Erschienen in The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Verlag Copernicus Publications
Band XLVIII-G-2025
Seiten 597–604
Vorab online veröffentlicht am 28.07.2025
Schlagwörter Object Detection, Text Recognition, Multi-modal Analysis, Construction Plans, Correspondence Model
Nachgewiesen in Dimensions
OpenAlex
Scopus
Globale Ziele für nachhaltige Entwicklung Ziel 11 – Nachhaltige Städte und Gemeinden
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page