Abstract: Existing RGB-Thermal trackers usually treat intra-modal feature extraction and inter-modal feature fusion as two separate processes, therefore the mutual promotion of extraction and fusion ...
Abstract: Remote sensing cross-modality text-image retrieval aims to retrieve a specific object from a large image gallery based on a natural language description, and vice versa. Existing methods ...