Cross-view geo-localization(CVGL) is beset with numerous difficulties and challenges, mainly due to the significant discrepancies in viewpoint, the intricacy of localization scenarios, and global localization needs. Given these challenges, we present a novel cross-view image geo-localization framework. The experimental results demonstrate that the proposed framework outperforms existing methods on multiple public datasets and self-built datasets. To improve the cross-view geo-localization performance of the framework on a global scale, we have built a novel global cross-view geo-localization dataset, CV-Cities. This dataset encompassing a diverse range of intricate scenarios. It serves as a challenging benchmark for cross-view geo-localization.
We collected 223,736 ground images and 223,736 satellite images with high-precision GPS coordinates of 16 typical cities in five continents. To download this dataset, you can click: 🤗CV-Cities or 🤗CV-Cities (mirror).
City scene | Nature scene | ||
Water scene | Occlusion |
London, UK | Rio, Brazil | Seattle, USA |
Singapore | Sydney, Australia | Taipei, China |
🚧 Under Construction
python train/train_cvcities.py
This code is based on the amazing work of:
@article{huangCVCities2024,
author={Huang, Gaoshuang and Zhou, Yang and Zhao, Luying and Gan, Wenjian},
journal={IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing},
title={CV-Cities: Advancing Cross-View Geo-Localization in Global Cities},
year={2024},
volume={},
number={},
pages={1-15},
keywords={Cross-view geo-localization;dataset;global cities;image retrieval;visual place recognition},
doi={10.1109/JSTARS.2024.3502160}}