Abstract: This paper proposes a model-level fusion-based multi-modal object detection and recognition method. This method employs various modalities to process images, speech, videos, etc., and fuses ...
Abstract: Zero-shot 6D object pose estimation involves the detection of novel objects with their 6D poses in cluttered scenes, presenting significant challenges for model generalizability. Fortunately ...