KIT | KIT-Bibliothek | Impressum | Datenschutz

HOGraspFlow: Exploring Vision-based Generative Grasp Synthesis with Hand-Object Priors and Taxonomy Awareness

Shi, Yitian; Guo, Zicheng; Wolf, Rosa 1; Welte, Edgar; Rayyes, Rania
1 Institut für Fördertechnik und Logistiksysteme (IFL), Karlsruher Institut für Technologie (KIT)

Abstract (englisch):

We propose Hand-Object (HO)GraspFlow, an affordance-centric approach that retargets a single RGB with hand-object interaction (HOI) into multi-modal executable parallel jaw grasps without explicit geometric priors on target objects. Building on foundation models for hand reconstruction and vision, we synthesize SE(3) grasp poses with denoising flow matching (FM), conditioned on the following three complementary cues: RGB foundation features as visual semantics, HOI contact reconstruction, and taxonomy-aware prior on grasp types. Our approach demonstrates high fidelity in grasp synthesis without explicit HOI contact input or object geometry, while maintaining strong contact and taxonomy recognition. Another controlled comparison shows that HOGraspFlow consistently outperforms diffusion-based variants (HOGraspDiff), achieving high distributional fidelity and more stable optimization in SE(3). We demonstrate a reliable, object-agnostic grasp synthesis from human demonstrations in real-world experiments, where an average success rate of over 83% is achieved.


Volltext §
DOI: 10.5445/IR/1000189663
Veröffentlicht am 15.01.2026
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Fördertechnik und Logistiksysteme (IFL)
Publikationstyp Forschungsbericht/Preprint
Publikationsdatum 21.09.2025
Sprache Englisch
Identifikator KITopen-ID: 1000189663
Verlag arxiv
Umfang 9 S.
Schlagwörter Robotics (cs.RO)
Nachgewiesen in OpenAlex
arXiv
Dimensions
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page