Abstract: Visible-Infrared person Re-IDentification (VI-ReID) is a challenging cross-modality image retrieval task that aims to match pedestrians’ images across visible and infrared cameras. To solve ...
Abstract: Synthesis of unavailable imaging modalities from available ones can generate modality-specific complementary information and enable multi-modality based medical images diagnosis or treatment ...
🏡 Project Page | 📄 Paper | UniME(Phi3.5-V-4.2B) 🤗/🤖 | UniME(LLaVA-v1.6-7B)🤗/🤖 | UniME(LLaVA-OneVision-7B)🤗/🤖 UniME achieves the top ranking on the MMEB leaderboard training using only 336×336 ...
This repository is the official Pytorch implementation for the paper Rethinking Multi-modal Object Detection from the Perspective of Mono-Modality Feature Learning. If you have any questions, please ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results