GitHub - Vincentqyw/cv-arxiv-daily: 🎓Automatically Update CV Papers Daily using Github Actions

Updated on 2025.03.06

Usage instructions: here

Table of Contents

SLAM
SFM
Visual Localization
Keypoint Detection
Image Matching
NeRF

SLAM

Publish Date	Title	Authors	PDF	Code
2025-03-04	Introspective Loop Closure for SLAM with 4D Imaging Radar	Maximilian Hilger et.al.	2503.02383	null
2025-03-04	DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting	Haoyuan Li et.al.	2503.02223	null
2025-03-03	Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM	Marco Giberna et.al.	2503.02050	null
2025-03-03	vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding	Ali Tourani et.al.	2503.01783	null
2025-03-03	MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Yohann Cabon et.al.	2503.01661	null
2025-03-03	OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding	Dianyi Yang et.al.	2503.01646	null
2025-03-03	MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features	Chao Ye et.al.	2503.01571	null
2025-03-03	AI-Driven Relocation Tracking in Dynamic Kitchen Environments	Arash Nasr Esfahani et.al.	2503.01547	link
2025-03-03	Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning	Xintao Chao et.al.	2503.01543	null
2025-03-03	RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation	Shu Pan et.al.	2503.01434	null
2025-02-27	BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground	Yufei Wei et.al.	2502.20078	null
2025-02-26	Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects	Petri Mäkinen et.al.	2502.19169	null
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932	null
2025-02-25	S-Graphs 2.0 -- A Hierarchical-Semantic Optimization and Loop Closure for SLAM	Hriday Bavle et.al.	2502.18044	link
2025-02-25	MegaLoc: One Retrieval to Place Them All	Gabriele Berton et.al.	2502.17237	link
2025-02-24	SLABIM: A SLAM-BIM Coupled Dataset in HKUST Main Building	Haoming Huang et.al.	2502.16856	link
2025-02-27	Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM	Yao Zhang et.al.	2502.16495	null
2025-02-19	Slamming: Training a Speech Language Model on One GPU in a Day	Gallil Maimon et.al.	2502.15814	link
2025-02-21	RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes	Sicheng Yu et.al.	2502.15633	null
2025-02-20	Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2502.14931	null
2025-02-19	3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments	Vincent Ress et.al.	2502.13803	null
2025-02-19	Active Illumination for Visual Ego-Motion Estimation in the Dark	Francesco Crocetti et.al.	2502.13708	null
2025-02-17	From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations	Matteo Scucchia et.al.	2502.12303	null
2025-02-19	pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM	Luigi Freda et.al.	2502.11955	link
2025-02-17	Anti-Degeneracy Scheme for Lidar SLAM based on Particle Filter in Geometry Feature-Less Environments	Yanbin Li et.al.	2502.11486	null
2025-02-16	GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting	Zelin Zhou et.al.	2502.10975	null
2025-02-19	MonoForce: Learnable Image-conditioned Physics Engine	Ruslan Agishev et.al.	2502.10156	link
2025-02-13	Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions	Dario Pisanti et.al.	2502.09795	null
2025-02-13	DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior	Mingrui Li et.al.	2502.09111	null
2025-02-12	LIR-LIVO: A Lightweight,Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features	Shujie Zhou et.al.	2502.08676	link
2025-02-10	Occupancy-SLAM: An Efficient and Robust Algorithm for Simultaneously Optimizing Robot Poses and Occupancy Map	Yingyu Wang et.al.	2502.06292	link
2025-02-09	PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map	Yue Pan et.al.	2502.05752	link
2025-02-07	Joint State and Noise Covariance Estimation	Kasra Khosoussi et.al.	2502.04584	null
2025-02-05	GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM	Mingrui Li et.al.	2502.03228	null
2025-02-04	SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification	Yifu Tao et.al.	2502.02657	null
2025-02-04	HeRCULES: Heterogeneous Radar Dataset in Complex Urban Environment for Multi-session Radar SLAM	Hanjun Kim et.al.	2502.01946	null
2025-02-03	Statistical enhance learning for modeling and prediction tennis matches at Grand Slam tournaments	Nourah Buhamra et.al.	2502.01613	null
2025-02-03	Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter	Dabin Kim et.al.	2502.01092	null
2025-02-01	FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps	Maximilian Leitenstern et.al.	2502.00395	link
2025-01-31	LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks	Liudi Yang et.al.	2501.19382	link
2025-01-31	Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping	Yiming Huang et.al.	2501.19319	link
2025-01-31	GO: The Great Outdoors Multimodal Dataset	Peng Jiang et.al.	2501.19274	null
2025-01-30	Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems	Liudi Yang et.al.	2501.18110	null
2025-01-28	SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios	Yinqi Chen et.al.	2501.16754	null
2025-01-27	Visual-Lidar Map Alignment for Infrastructure Inspections	Jake McLaughlin et.al.	2501.14486	link
2025-01-24	Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video	Xiaohao Xu et.al.	2501.14319	link
2025-01-24	HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting	Javier Yu et.al.	2501.14147	null
2025-01-23	FAST-LIVO2 on Resource-Constrained Platforms: LiDAR-Inertial-Visual Odometry with Efficient Memory and Computation	Bingyang Zhou et.al.	2501.13876	null
2025-01-23	VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM	Gyuhyeon Pak et.al.	2501.13402	null
2025-01-22	Grid-based Submap Joining: An Efficient Algorithm for Simultaneously Optimizing Global Occupancy Map and Local Submap Frames	Yingyu Wang et.al.	2501.12764	null
2025-01-21	DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM	Jesse Morris et.al.	2501.11893	link
2025-01-21	Survey on Monocular Metric Depth Estimation	Jiuling Zhang et.al.	2501.11841	null
2025-01-19	OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors	Dominik Kulmer et.al.	2501.11111	null
2025-01-19	Factor Graph-Based Active SLAM for Spacecraft Proximity Operations	Lorenzo Ticozzi et.al.	2501.10950	null
2025-01-23	Mesh2SLAM in VR: A Fast Geometry-Based SLAM Framework for Rapid Prototyping in Virtual Reality Applications	Carlos Augusto Pinheiro de Sousa et.al.	2501.09600	null
2025-01-16	Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment	Maksim Filipenko et.al.	2501.09490	null
2025-01-15	Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures	Pengru Deng et.al.	2501.09203	null
2025-01-15	AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning	Assaf Lahiany et.al.	2501.09160	null
2025-01-15	SLC $^2$ -SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM	Yuhang Ming et.al.	2501.08880	null
2025-01-15	GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping	Sheng Hong et.al.	2501.08672	null
2025-01-16	BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module	Dongzhihan Wang et.al.	2501.08659	null
2025-01-15	Self-Organizing Edge Computing Distribution Framework for Visual SLAM	Jussi Kalliola et.al.	2501.08629	null
2025-01-14	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286	null
2025-01-13	Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps	Saurabh Gupta et.al.	2501.07399	null
2025-01-13	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015	null
2025-01-12	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927	link
2025-01-11	SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors	Zhen Hong et.al.	2501.06469	null
2025-01-09	Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping	Wen Tianci et.al.	2501.05242	null
2025-01-07	SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment	Yuchun Fan et.al.	2501.03681	link
2025-01-06	HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos	Jinglei Zhang et.al.	2501.02973	null
2025-01-09	LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments	Haosong Yue et.al.	2501.02580	link
2025-01-04	ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle	Yinchuan Wang et.al.	2501.02166	link
2024-12-31	PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM	Runnan Chen et.al.	2501.00352	null
2024-12-30	Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields	Evgenii Kruzhkov et.al.	2412.20976	null
2024-12-28	MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing	Shuo Wang et.al.	2412.20082	null
2024-12-27	DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction	Kai Xu et.al.	2412.19584	null
2024-12-26	MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo	Byeonggwon Lee et.al.	2412.19130	null
2024-12-23	End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework	Fuhua Jia et.al.	2412.17343	null
2024-12-23	LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation	Riku Uemura et.al.	2412.17282	null
2024-12-23	Selective Kalman Filter: When and How to Fuse Multi-Sensor Information to Overcome Degeneracy in SLAM	Jie Xu et.al.	2412.17235	null
2025-01-03	Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry	Zhaoxing Zhang et.al.	2412.16923	null
2024-12-21	Query Quantized Neural SLAM	Sijia Jiang et.al.	2412.16476	link
2024-12-20	SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training	Wenxi Chen et.al.	2412.15649	link
2024-12-18	Energy-Efficient SLAM via Joint Design of Sensing, Communication, and Exploration Speed	Zidong Han et.al.	2412.13912	null
2024-12-18	Immersive Human-in-the-Loop Control: Real-Time 3D Surface Meshing and Physics Simulation	Sait Akturk et.al.	2412.13752	null
2024-12-18	4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching	Fernando Amodeo et.al.	2412.13639	link
2024-12-17	NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment	Andrea Dunn Beltran et.al.	2412.13176	null
2024-12-18	Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera	Zhengdi Yu et.al.	2412.12861	null
2024-12-16	Global SLAM in Visual-Inertial Systems with 5G Time-of-Arrival Integration	Meisam Kabiri et.al.	2412.12406	null
2024-12-16	MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors	Riku Murai et.al.	2412.12392	null
2024-12-16	Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges	Martin Aubard et.al.	2412.11840	null
2024-12-19	RoMeO: Robust Metric Visual Odometry	Junda Cheng et.al.	2412.11530	null
2024-12-14	Affine EKF: Exploring and Utilizing Sufficient and Necessary Conditions for Observability Maintenance to Improve EKF Consistency	Yang Song et.al.	2412.10809	link
2024-12-13	RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting	Lizhi Bai et.al.	2412.09868	null
2024-12-12	SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos	Yuzheng Liu et.al.	2412.09401	link
2024-12-12	eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction	Jad Mansour et.al.	2412.09209	link
2024-12-12	Drift-free Visual SLAM using Digital Twins	Roxane Merat et.al.	2412.08496	null
2024-12-10	A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM	Zongbo Liao et.al.	2412.07513	null
2024-12-08	DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments	Juwon Kim et.al.	2412.05839	null
2024-12-06	MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Zhengqi Li et.al.	2412.04463	null
2024-12-05	Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset	Fuzhang Han et.al.	2412.04287	link
2024-12-10	MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application	Hyesu Jang et.al.	2412.03887	null
2024-12-04	Large-Scale Dense 3D Mapping Using Submaps Derived From Orthogonal Imaging Sonars	John McConnell et.al.	2412.03760	null
2024-12-04	BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement	Miguel Arturo Vega Torres et.al.	2412.03434	link
2024-12-04	NeRF and Gaussian Splatting SLAM in the Wild	Fabian Schmidt et.al.	2412.03263	link
2024-12-04	MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras	Huai Yu et.al.	2412.03146	link
2024-12-04	An indoor DSO-based ceiling-vision odometry system for indoor industrial environments	Abdelhak Bougouffa et.al.	2412.02950	null
2024-12-03	ROVER: A Multi-Season Dataset for Visual SLAM	Fabian Schmidt et.al.	2412.02506	link
2024-12-04	RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting	Zhenzhong Cao et.al.	2412.01217	link
2024-11-28	Visual SLAMMOT Considering Multiple Motion Models	Peilin Tian et.al.	2411.19134	null
2024-11-27	ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching	Yangrui Dong et.al.	2411.18174	null
2024-11-27	HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction	Wei Zhang et.al.	2411.17982	null
2024-11-26	MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework	Xiangcheng Hu et.al.	2411.17928	link
2024-11-29	DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting	Christian Homeyer et.al.	2411.17660	link
2024-11-25	MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM	Vladimir Yugay et.al.	2411.16785	null
2024-11-24	Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors	Soumava Paul et.al.	2411.15966	null
2024-11-24	Near-Range Environmental Perception for Inland Waterway Vessels: A Comparative Study of LiDAR and Automotive FMCW RADAR Sensors	R. Herrmann et.al.	2411.15901	null
2024-11-24	PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments	Haoang Li et.al.	2411.15800	null
2024-11-23	Gassidy: Gaussian Splatting SLAM in Dynamic Environments	Long Wen et.al.	2411.15476	null
2024-11-22	OVO-SLAM: Open-Vocabulary Online Simultaneous Localization and Mapping	Tomas Berriel Martins et.al.	2411.15043	null
2024-11-22	A Benchmark Dataset for Collaborative SLAM in Service Environments	Harin Park et.al.	2411.14775	link
2024-11-21	InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation	Marziyeh Bamdad et.al.	2411.14358	link
2024-11-20	Robust Monocular Visual Odometry using Curriculum Learning	Assaf Lahiany et.al.	2411.13438	null
2024-11-20	Moving Horizon Estimation for Simultaneous Localization and Mapping with Robust Estimation Error Bounds	Jelena Trisovic et.al.	2411.13310	null
2024-11-19	3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality	Hanbeom Chang et.al.	2411.12514	null
2024-11-19	LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2411.12185	null
2024-11-18	Exploring Emerging Trends and Research Opportunities in Visual Place Recognition	Antonios Gasteratos et.al.	2411.11481	null
2024-11-18	The Blue Horizontal-Branch Stars From the LAMOST Survey: Atmospheric Parameters	Jie Ju et.al.	2411.11250	null
2024-11-17	A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality	Wei-Hsiang Lien et.al.	2411.10940	null
2024-11-16	DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment	Mangyu Kong et.al.	2411.10722	link
2024-11-15	The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods	Yifu Tao et.al.	2411.10546	null
2024-11-15	BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation	Yufei Wei et.al.	2411.10195	null
2024-11-13	DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization	Yueming Xu et.al.	2411.08373	null
2024-11-13	MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation	Peng Wang et.al.	2411.08279	link
2024-11-12	Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments	Ankit Shaw et.al.	2411.08231	null
2024-11-12	NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN	Sonia Raychaudhuri et.al.	2411.07848	null
2024-11-11	Lost in Tracking Translation: A Comprehensive Analysis of Visual SLAM in Human-Centered XR and IoT Ecosystems	Yasra Chandio et.al.	2411.07146	null
2024-11-11	Learning from Feedback: Semantic Enhancement for Object SLAM Using Foundation Models	Jungseok Hong et.al.	2411.06752	null
2024-11-11	HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation	Xiaolong Wang et.al.	2411.06700	null
2024-11-08	Development of an indoor localization and navigation system based on monocular SLAM for mobile robots	Thanh Nguyen Canh et.al.	2411.05337	null
2024-11-07	Development of a Service Robot for Hospital Environments in Rehabilitation Medicine with LiDAR Based Simultaneous Localization and Mapping	Sayat Ibrayev et.al.	2411.04797	null
2024-11-07	MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation	Sayan Paul et.al.	2411.04796	null
2024-11-09	DEIO: Deep Event Inertial Odometry	Weipeng Guan et.al.	2411.03928	link
2024-11-06	Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward	Shashi Kumar et.al.	2411.03866	null
2024-11-06	LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior	Jiahui Wang et.al.	2411.03610	link
2024-11-05	LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting	Huibin Zhao et.al.	2411.02703	null
2024-11-04	Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing	Xinran Zhang et.al.	2411.02553	null
2024-11-04	Semantic Masking and Visual Feature Matching for Robust Localization	Luisa Mao et.al.	2411.01804	null
2024-10-31	XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM	Xiaomeng Wang et.al.	2410.23690	link
2024-10-30	LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM	Yucheng Huang et.al.	2410.23231	link
2024-10-30	ISAC Prototype System for Multi-Domain Cooperative Communication Networks	Jie Yang et.al.	2410.22956	null
2024-10-30	SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark	HyunJun Jung et.al.	2410.22715	link
2024-10-29	LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues	Hanqing Jiang et.al.	2410.22213	null
2024-10-29	EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments	Linus Nwankwo et.al.	2410.22200	null
2024-10-28	NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments	Taiyi Pan et.al.	2410.21615	link
2024-10-28	coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM	Emiliano Höss et.al.	2410.21149	link
2024-11-01	RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior	Mingjiang Liang et.al.	2410.20358	null
2024-10-25	Context-Based Visual-Language Place Recognition	Soojin Woo et.al.	2410.19341	link
2024-10-22	AG-SLAM: Active Gaussian Splatting SLAM	Wen Jiang et.al.	2410.17422	null
2024-10-22	Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study	J. Jorge et.al.	2410.17171	null
2024-10-19	EndoMetric: Near-light metric scale monocular SLAM	Raúl Iranzo et.al.	2410.15065	null
2024-10-17	Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot	Dongkun Han et.al.	2410.13612	null
2024-10-17	TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal	Yanpeng Jia et.al.	2410.13240	null
2024-10-16	QueensCAMP: an RGB-D dataset for robust Visual SLAM	Hudson M. S. Bruno et.al.	2410.12520	link
2024-10-18	PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM	Guanghao Li et.al.	2410.12324	null
2024-10-16	Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem	Yichen Sha et.al.	2410.12169	null
2024-10-15	V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting	Tuan Dang et.al.	2410.12068	link
2024-10-15	GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information	Wancai Zheng et.al.	2410.11356	null
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187	link
2024-10-14	MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator	Taozhe Li et.al.	2410.10669	null
2024-10-13	Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph	Benoit Casseau et.al.	2410.09896	null
2024-10-12	SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs	Wenxi Chen et.al.	2410.09503	link
2024-10-12	An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation	Wei Liang et.al.	2410.09443	null
2024-10-12	ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras	Junkai Niu et.al.	2410.09374	link
2024-10-11	Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System	Zheng Liu et.al.	2410.08935	link
2024-10-11	Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints	Yicheng He et.al.	2410.08780	null
2024-10-10	ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization	Mason B. Peterson et.al.	2410.08262	link
2024-10-10	IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera	Jian Huang et.al.	2410.08107	link
2024-10-08	Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching	Gongxin Yao et.al.	2410.06285	null
2024-10-08	Submodular Optimization for Keyframe Selection & Usage in SLAM	David Thorne et.al.	2410.05576	null
2024-10-07	SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones	Denis Davletshin et.al.	2410.05405	null
2024-10-07	Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection	Ang He et.al.	2410.05017	null
2024-10-05	A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems	Nikola Radulov et.al.	2410.04242	link
2024-10-05	High-Speed Stereo Visual SLAM for Low-Powered Computing Devices	Ashish Kumar et.al.	2410.04090	link
2024-10-04	EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM	Shi Chen et.al.	2410.03812	null
2024-10-04	Estimating Body and Hand Motion in an Ego-sensed World	Brent Yi et.al.	2410.03665	null
2024-10-03	LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features	Zihao Dong et.al.	2410.02961	null
2024-10-02	ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space	Hogyun Kim et.al.	2410.01325	null
2024-10-01	Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency	William Dubois et.al.	2410.00758	null
2024-10-02	CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM	Dapeng Feng et.al.	2410.00486	link
2024-09-30	Additively Manufactured Open-Source Quadruped Robots for Multi-Robot SLAM Applications	Zachary Fuge et.al.	2410.00122	null
2024-09-30	Direct Multipath-Based SLAM	Mingchao Liang et.al.	2409.20552	null
2024-09-30	Robust Gaussian Splatting SLAM by Leveraging Loop Closure	Zunjie Zhu et.al.	2409.20111	null
2024-09-30	DynORecon: Dynamic Object Reconstruction for Navigation	Yiduo Wang et.al.	2409.19928	null
2024-09-29	CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation	Yifan Duan et.al.	2409.19597	null
2024-09-29	CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought	Yexing Du et.al.	2409.19510	link
2024-09-29	Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface	Ziniu Wu et.al.	2409.19499	null
2024-09-27	Royal Reveals: LiDAR Mapping of Kronborg Castle, Echoes of Hamlet's Halls	Leon Davies et.al.	2409.18752	null
2024-09-26	BlinkTrack: Feature Tracking over 100 FPS via Events and Images	Yichen Shen et.al.	2409.17981	null
2024-09-26	Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry	Qi Zhang et.al.	2409.17729	null
2024-09-26	Event-based Stereo Depth Estimation: A Survey	Suman Ghosh et.al.	2409.17680	null
2024-09-25	Efficient Submap-based Autonomous MAV Exploration using Visual-Inertial SLAM Configurable for LiDARs or Depth Cameras	Sotiris Papatheodorou et.al.	2409.16972	null
2024-09-25	Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM	Phu Pham et.al.	2409.16944	null
2024-09-25	Inline Photometrically Calibrated Hybrid Visual SLAM	Nicolas Abboud et.al.	2409.16810	link
2024-09-25	Topological SLAM in colonoscopies leveraging deep features and topological priors	Javier Morlana et.al.	2409.16806	link
2024-09-25	Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots	Masoud Dayani Najafabadi et.al.	2409.16595	link
2024-09-25	Task-driven SLAM Benchmarking	Yanwei Du et.al.	2409.16573	null
2024-09-24	SoMaSLAM: 2D Graph SLAM for Sparse Range Sensing with Soft Manhattan World Constraints	Jeahn Han et.al.	2409.15736	null
2024-09-23	Spectral Graph Theoretic Methods for Enhancing Network Robustness in Robot Localization	Neelkamal Somisetty et.al.	2409.15506	null
2024-09-22	SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms	Niraj Pudasaini et.al.	2409.14515	null
2024-09-21	Point Cloud Structural Similarity-based Underwater Sonar Loop Detection	Donghwi Jung et.al.	2409.14020	link
2024-09-20	HMD $^2$ : Environment-aware Motion Generation from Single Egocentric Head-Mounted Device	Vladimir Guzov et.al.	2409.13426	null
2024-09-20	Learning Visual Information Utility with PIXER	Yash Turkar et.al.	2409.13151	null
2024-09-19	MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting	Yan Song Hu et.al.	2409.13055	null
2024-09-19	Hi-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2409.12518	null
2024-09-18	Bundle Adjustment in the Eager Mode	Zitong Zhan et.al.	2409.12190	null
2024-09-23	Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping	Jaehyung Jung et.al.	2409.12051	null
2024-09-18	Metric-Semantic Factor Graph Generation based on Graph Neural Networks	Jose Andres Millan-Romera et.al.	2409.11972	null
2024-09-18	Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments	Lei Cheng et.al.	2409.11854	null
2024-09-18	ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation	Yanlin Jin et.al.	2409.11692	null
2024-09-18	SLAM assisted 3D tracking system for laparoscopic surgery	Jingwei Song et.al.	2409.11688	null
2024-09-17	GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure	Ziheng Xu et.al.	2409.10982	null
2024-09-17	Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells	Ankit Butola et.al.	2409.10971	null
2024-09-17	Evaluating and Improving the Robustness of LiDAR-based Localization and Mapping	Bo Yang et.al.	2409.10824	link
2024-09-16	P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty	Yufan Zhang et.al.	2409.10143	link
2024-09-16	SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning	Amogh Joshi et.al.	2409.09990	null
2024-09-16	Enhancing Visual Inertial SLAM with Magnetic Measurements	Bharat Joshi et.al.	2409.09904	null
2024-09-15	Marginalizing and Conditioning Gaussians onto Linear Approximations of Smooth Manifolds with Applications in Robotics	Zi Cong Guo et.al.	2409.09871	null
2024-09-15	Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping	Yi Liu et.al.	2409.09763	null
2024-09-15	High Definition Map Mapping and Update: A General Overview and Future Directions	Benny Wijaya et.al.	2409.09726	null
2024-09-14	MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry	Yuheng Qiu et.al.	2409.09479	null
2024-09-14	Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM	Haoying Li et.al.	2409.09410	null
2024-09-14	GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians	Dasong Gao et.al.	2409.09295	link
2024-09-14	Panoramic Direct LiDAR-assisted Visual Odometry	Zikang Yuan et.al.	2409.09287	link
2024-09-11	Object Depth and Size Estimation using Stereo-vision and Integration with SLAM	Layth Hamad et.al.	2409.07623	null
2024-09-11	Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry	Anbo Tao et.al.	2409.06948	null
2024-09-10	Technical Report of Mobile Manipulator Robot for Industrial Environments	Erfan Amoozad Khalili et.al.	2409.06693	null
2024-09-10	Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios	Zhiqiang Chen et.al.	2409.04961	link
2024-09-08	FLAF: Focal Line and Feature-constrained Active View Planning for Visual Teach and Repeat	Changfei Fu et.al.	2409.03457	null
2024-09-03	Integration of Augmented Reality and Mobile Robot Indoor SLAM for Enhanced Spatial Awareness	Michael D. Friske et.al.	2409.01915	null
2024-09-03	Explicit Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric	Tingchen Ma et.al.	2409.01856	null
2024-09-02	Saying goodbyes to rotating your phone: Magnetometer calibration during SLAM	Ilari Vallivaara et.al.	2409.01242	null
2024-09-02	Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection	Manon Kok et.al.	2409.01091	null
2024-09-02	Robust Vehicle Localization and Tracking in Rain using Street Maps	Yu Xiang Tan et.al.	2409.01038	link
2024-08-31	UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM	Mostafa Mansour et.al.	2409.00362	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373	null
2024-08-30	Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning	Shuyang Zhang et.al.	2408.17005	link
2024-08-29	Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry	Michael Adlerstein et.al.	2408.16472	null
2024-08-28	Single-Photon 3D Imaging with Equi-Depth Photon Histograms	Kaustubh Sadekar et.al.	2408.16150	null
2024-08-28	BIM-SLAM: Integrating BIM Models in Multi-session SLAM for Lifelong Mapping using 3D LiDAR	Miguel Arturo Vega Torres et.al.	2408.15870	link
2024-08-30	Addressing the challenges of loop detection in agricultural environments	Nicolás Soncini et.al.	2408.15761	link
2024-08-28	ES-PTAM: Event-based Stereo Parallel Tracking and Mapping	Suman Ghosh et.al.	2408.15605	link
2024-08-28	PointEMRay: A Novel Efficient SBR Framework on Point Based Geometry	Kaiqiao Yang et.al.	2408.15583	null
2024-09-02	Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration	Rongge Zhang et.al.	2408.14726	link
2024-08-26	A Survey on Reinforcement Learning Applications in SLAM	Mohammad Dehghani Tezerjani et.al.	2408.14518	null
2024-08-28	FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry	Chunran Zheng et.al.	2408.14035	link
2024-08-21	Informed, Constrained, Aligned: A Field Analysis on Degeneracy-aware Point Cloud Registration in the Wild	Turcan Tuna et.al.	2408.11809	null
2024-08-21	LiFCal: Online Light Field Camera Calibration via Bundle Adjustment	Aymeric Fleith et.al.	2408.11682	null
2024-08-21	Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars	Zhihao Lin et.al.	2408.11582	null
2024-08-21	RaNDT SLAM: Radar SLAM Based on Intensity-Augmented Normal Distributions Transform	Maximilian Hilger et.al.	2408.11576	link
2024-08-21	Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models	Kento Kawaharazuka et.al.	2408.11380	null
2024-08-20	LoopSplat: Loop Closure by Registering 3D Gaussian Splats	Liyuan Zhu et.al.	2408.10154	link
2024-08-19	Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM	Sanghyun Hahn et.al.	2408.09727	link
2024-08-17	GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System	Shuo Wang et.al.	2408.09191	null
2024-08-15	GOReloc: Graph-based Object-Level Relocalization for Visual SLAM	Yutong Wang et.al.	2408.07917	link
2024-08-14	Inverse k-visibility for RSSI-based Indoor Geometric Mapping	Junseo Kim et.al.	2408.07757	null
2024-08-14	Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition	Hogyun Kim et.al.	2408.07330	link
2024-08-12	CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments	Yanpeng Jia et.al.	2408.05981	null
2024-08-21	Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis	Zhongche Qu et.al.	2408.05635	null
2024-08-10	TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping	Seoyeon Jang et.al.	2408.05453	null
2024-08-08	Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods	Yiming Zhou et.al.	2408.04268	null
2024-08-07	Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM	Yan Song Hu et.al.	2408.03825	null
2024-08-07	AirSLAM: An Efficient and Illumination-Robust Point-Line Visual SLAM System	Kuan Xu et.al.	2408.03520	link
2024-08-06	BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications	G. Manni et.al.	2408.03078	link
2024-08-04	SLAMS-Propelled Electron Acceleration at High-Mach Number Astrophysical Shocks	Vladimir Zeković et.al.	2408.02084	null
2024-08-03	Visual-Inertial SLAM for Agricultural Robotics: Benchmarking the Benefits and Computational Costs of Loop Closing	Fabian Schmidt et.al.	2408.01716	null
2024-08-03	Deep Patch Visual SLAM	Lahav Lipson et.al.	2408.01654	link
2024-08-02	Momentum Capture and Prediction System Based on Wimbledon Open2023 Tournament Data	Chang Liu et.al.	2408.01544	null
2024-08-07	IG-SLAM: Instant Gaussian SLAM	F. Aykut Sarikamis et.al.	2408.01126	null
2024-08-01	Collecting Larg-Scale Robotic Datasets on a High-Speed Mobile Platform	Yuxin Lin et.al.	2408.00545	null
2024-08-01	High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets	Jian Li et.al.	2408.00538	link
2024-07-31	SuperVINS: A visual-inertial SLAM framework integrated deep learning features	Hongkun Luo et.al.	2407.21348	link
2024-07-30	NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding	Hongjia Zhai et.al.	2407.20853	null
2024-07-29	A flexible framework for accurate LiDAR odometry, map manipulation, and localization	José Luis Blanco-Claraco et.al.	2407.20465	link
2024-07-28	Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data	Azmyin Md. Kamal et.al.	2407.19518	null
2024-07-26	Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation	Aditya Penumarti et.al.	2407.19046	null
2024-07-26	HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM	Zhe Xin et.al.	2407.18813	null
2024-07-25	CodedVO: Coded Visual Odometry	Sachin Shah et.al.	2407.18240	null
2024-07-28	HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation	Zhenzhi Wang et.al.	2407.17438	link
2024-07-22	Memory Management for Real-Time Appearance-Based Loop Closure Detection	Mathieu Labbé et.al.	2407.15890	null
2024-07-22	Reinforcement Learning Meets Visual Odometry	Nico Messikommer et.al.	2407.15626	link
2024-07-22	Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM	Mathieu Labbe et.al.	2407.15305	null
2024-07-21	Semi-Supervised Pipe Video Temporal Defect Interval Localization	Zhu Huang et.al.	2407.15170	null
2024-07-21	VoxDepth: Rectification of Depth Images on Edge Devices	Yashashwee Chakrabarty et.al.	2407.15067	null
2024-07-20	From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM	Lorenzo Montano-Oliván et.al.	2407.14797	null
2024-07-19	MSSP : A Versatile Multi-Scenario Adaptable Intelligent Robot Simulation Platform Based on LIDAR-Inertial Fusion	Qiyan Li et.al.	2407.14102	null
2024-07-18	A New Tightly-Coupled Dual-VIO for a Mobile Manipulator With Dynamic Locomotion	Jianxiang Xu et.al.	2407.13878	link
2024-07-18	Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM	Baicheng Li et.al.	2407.13338	null
2024-07-18	Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain	Bach Nguyen Gia et.al.	2407.13159	link
2024-07-17	Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge	Andrea Albanese et.al.	2407.12663	null
2024-07-17	Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM	Markus Weißflog et.al.	2407.12408	null
2024-07-19	Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion	Sangjun Lee et.al.	2407.12405	link
2024-07-17	Fusion LiDAR-Inertial-Encoder data for High-Accuracy SLAM	Manh Do Duc et.al.	2407.11870	null
2024-07-17	GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection	Jingwen Yu et.al.	2407.11736	link
2024-07-16	Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems	Jianzhu Huai et.al.	2407.11705	null
2024-07-16	Batch SLAM with PMBM Data Association Sampling and Graph-Based Optimization	Yu Ge et.al.	2407.11643	null
2024-07-16	I $^2$ -SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM	Gwangtak Bae et.al.	2407.11347	null
2024-07-16	FR-SLAM: A SLAM Improvement Method Based on Floor Plan Registration	Jiantao Feng et.al.	2407.11299	null
2024-07-15	Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method	Adam Korycki et.al.	2407.11238	null
2024-07-12	An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural Networks	Seyed Alireza Rahimi Azghadi et.al.	2407.09242	null
2024-07-11	SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM	Neng Wang et.al.	2407.08106	link
2024-07-09	Hyperion -- A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM	David Hug et.al.	2407.07074	link
2024-07-15	A Neurosymbolic Approach to Adaptive Feature Extraction in SLAM	Yasra Chandio et.al.	2407.06889	null
2024-07-08	Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots	Siva Krishna Ravipati et.al.	2407.06077	link
2024-07-10	Co-RaL: Complementary Radar-Leg Odometry with 4-DoF Optimization and Rolling Contact	Sangwoo Jung et.al.	2407.05820	null
2024-07-07	Active Collaborative Visual SLAM exploiting ORB Features	Muhammad Farhan Ahmed et.al.	2407.05453	null
2024-07-06	VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking	Xuefeng Jiang et.al.	2407.05017	null
2024-07-06	Symmetric Linear Arc Monadic Datalog and Gadget Reductions	Manuel Bodirsky et.al.	2407.04924	null
2024-07-03	Ultra-Lightweight Collaborative Mapping for Robot Swarms	Vlad Niculescu et.al.	2407.03136	null
2024-07-01	RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields	Haochen Jiang et.al.	2407.01303	link
2024-07-01	Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation	Lianjie Guo et.al.	2407.01292	link
2024-07-01	Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization	Ruofei Bai et.al.	2407.01013	link
2024-06-30	Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation	Adnan Abdullah et.al.	2407.00848	null
2024-06-30	OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration	Fengyuan Yang et.al.	2407.00574	null
2024-06-24	Compressing Search with Language Models	Thomas Mulc et.al.	2407.00085	null
2024-06-28	CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services	DongKi Noh et.al.	2406.19634	null
2024-06-25	Benchmarking SLAM Algorithms in the Cloud: The SLAM Hive System	Xinzhe Liu et.al.	2406.17586	null
2024-07-02	SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation	Xu Liu et.al.	2406.17249	link
2024-06-24	From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking	Xiaohao Xu et.al.	2406.16850	link
2024-06-23	Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy	Chen Wang et.al.	2406.16087	null
2024-06-19	Simultaneous Map and Object Reconstruction	Nathaniel Chodosh et.al.	2406.13896	null
2024-06-14	Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization	Wonho Song et.al.	2406.11599	null
2024-06-16	Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry	Boris Chidlovskii et.al.	2406.11019	null
2024-06-15	Detection and Utilization of Reflections in LiDAR Scans Through Plane Optimization and Plane SLAM	Yinjie Li et.al.	2406.10494	link
2024-06-12	From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers	Swaminathan Gurumurthy et.al.	2406.07785	link
2024-06-27	Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF)	Gyubeom Im et.al.	2406.06427	null
2024-06-10	Notes on Various Errors and Jacobian Derivations for SLAM	Gyubeom Im et.al.	2406.06422	null
2024-06-23	Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation	Shenghao Li et.al.	2406.06374	link
2024-06-15	Visual-Inertial SLAM as Simple as A, B, VINS	Nathaniel Merrill et.al.	2406.05969	null
2024-06-09	MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps	Jianhao Zheng et.al.	2406.05849	null
2024-06-06	Open Problem: Active Representation Learning	Nikola Milosevic et.al.	2406.03845	null
2024-06-04	ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization	Chen Mao et.al.	2406.01906	link
2024-06-03	The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry	Paolo Cudrano et.al.	2406.01797	null
2024-06-03	Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Takayuki Kanai et.al.	2406.00929	null
2024-06-02	Visual place recognition for aerial imagery: A survey	Ivan Moskalenko et.al.	2406.00885	link
2024-05-30	Structure Gaussian SLAM with Manhattan World Hypothesis	Shuhong Liu et.al.	2405.20031	null
2024-05-30	Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar	Wouter Jansen et.al.	2405.19869	null
2024-05-30	SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization	Jiang Wang et.al.	2405.19813	link
2024-05-30	TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM	Peifeng Jiang et.al.	2405.19614	null
2024-05-27	CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy	Richard Elvira et.al.	2405.16932	null
2024-05-26	Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians	Erik Sandström et.al.	2405.16544	link
2024-05-24	NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes	Lizhi Bai et.al.	2405.15151	null
2024-05-23	ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization	Han Song et.al.	2405.15082	null
2024-05-23	Synergistic Global-space Camera and Human Reconstruction from Videos	Yizhou Zhao et.al.	2405.14855	null
2024-05-23	CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments	Yang Zhou et.al.	2405.14731	link
2024-05-23	Efficient Robot Learning for Perception and Mapping	Niclas Vödisch et.al.	2405.14688	null
2024-05-22	Monocular Gaussian SLAM with Language Extended Loop Closure	Tian Lan et.al.	2405.13748	null
2024-05-26	NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments	Dongha Chung et.al.	2405.12563	link
2024-05-20	EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving	Boyi Liu et.al.	2405.12120	null
2024-05-24	Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation	Hyungtae Lim et.al.	2405.11176	null
2024-05-18	MotionGS : Compact Gaussian Splatting SLAM by Motion Filter	Xinli Guo et.al.	2405.11129	link
2024-05-17	CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion	Gang Wang et.al.	2405.10793	null
2024-05-17	Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map	Liang Zhao et.al.	2405.10743	null
2024-05-10	MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization	Pengcheng Zhu et.al.	2405.06241	null
2024-05-07	Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map	Yuxuan Xia et.al.	2405.04290	null
2024-05-07	IMU-Aided Event-based Stereo Visual Odometry	Junkai Niu et.al.	2405.04071	link
2024-04-27	An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation	Olivier Brochu Dufour et.al.	2404.17745	null
2024-04-26	Camera Motion Estimation from RGB-D-Inertial Scene Flow	Samuel Cerezo et.al.	2404.17251	null
2024-04-23	Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization	Lahav Lipson et.al.	2404.15263	link
2024-04-18	SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints	Spencer Carmichael et.al.	2404.12339	null
2024-04-17	VBR: A Vision Benchmark in Rome	Leonardo Brizi et.al.	2404.11322	link
2024-04-14	Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration	Yanhao Zhang et.al.	2404.09169	link
2024-04-06	Salient Sparse Visual Odometry With Pose-Only Supervision	Siyu Chen et.al.	2404.04677	null
2024-03-25	A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments	Gianluca D'Amico et.al.	2403.17084	null
2024-03-19	On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine	Jagatpreet Singh Nir et.al.	2403.13170	null
2024-03-18	The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions	Margaret Hansen et.al.	2403.12194	null
2024-03-18	An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation	Zewen Xu et.al.	2403.11639	null
2024-03-16	Efficient Domain Adaptation for Endoscopic Visual Odometry	Junyang Wu et.al.	2403.10860	null
2024-03-14	Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO)	Matthew Lisondra et.al.	2403.09882	null
2024-03-02	Grid-based Fast and Structural Visual Odometry	Zhang Zhihe et.al.	2403.01110	null
2024-02-25	VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Xudong Cai et.al.	2402.15961	link
2024-02-22	Secure Navigation using Landmark-based Localization in a GPS-denied Environment	Ganesh Sapkota et.al.	2402.14280	null
2024-02-19	Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment	Ganesh Sapkota et.al.	2402.12551	null
2024-02-07	Online and Certifiably Correct Visual Odometry and Mapping	Devansh R Agrawal et.al.	2402.05254	null
2024-02-06	YOLOPoint Joint Keypoint and Object Detection	Anton Backhaus et.al.	2402.03989	link
2024-01-19	Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning	André O. Françani et.al.	2401.10857	null
2024-01-17	Event-Based Visual Odometry on Non-Holonomic Ground Vehicles	Wanting Xu et.al.	2401.09331	link
2024-01-11	On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering	Feng Zhu et.al.	2401.05836	null
2023-12-19	Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry	Olaya Álvarez-Tuñón et.al.	2401.05396	link
2024-01-07	Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people	Ali Samadzadeh et.al.	2401.03604	link
2024-01-03	LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry	Weirong Chen et.al.	2401.01887	null
2023-12-28	SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction	Zikang Yuan et.al.	2312.16800	link
2023-12-20	NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields	Jens Naumann et.al.	2312.13471	null
2023-12-22	Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM	Junru Lin et.al.	2312.13332	null
2023-12-20	Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach	Habib Boloorchi Tabrizi et.al.	2312.13162	link
2023-12-20	Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera	Abdulkadhem A. Abdulkadhem et.al.	2312.12680	null
2023-12-15	Deep Event Visual Odometry	Simon Klenk et.al.	2312.09800	link
2023-12-10	SuperPrimitive: Scene Reconstruction at a Primitive Level	Kirill Mazur et.al.	2312.05889	null
2023-12-04	iMatching: Imperative Correspondence Learning	Zitong Zhan et.al.	2312.02141	link
2023-11-30	Event-based Visual Inertial Velometer	Xiuyuan Lu et.al.	2311.18189	null
2023-11-21	CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems	Young-Hee Lee et.al.	2311.12580	null
2023-11-10	Dense Visual Odometry Using Genetic Algorithm	Slimane Djema et.al.	2311.06149	null
2023-11-07	Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM	Seongwook Yoon et.al.	2311.03722	null
2023-10-23	Converting Depth Images and Point Clouds for Feature-based Pose Estimation	Robert Lösch et.al.	2310.14924	link
2023-10-17	Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms	Yanyan Li et.al.	2310.10931	link
2023-10-12	Jointly Optimized Global-Local Visual Localization of UAVs	Haoling Li et.al.	2310.08082	null
2023-10-10	l-dyno: framework to learn consistent visual features using robot's motion	Kartikeya Singh et.al.	2310.06249	link
2023-10-08	XVO: Generalized Visual Odometry via Cross-Modal Self-Training	Lei Lai et.al.	2309.16772	null
2023-10-22	ObVi-SLAM: Long-Term Object-Visual SLAM	Amanda Adkins et.al.	2309.15268	link
2023-09-23	Tag-based Visual Odometry Estimation for Indoor UAVs Localization	Massimiliano Bertoni et.al.	2309.13311	null
2023-09-22	Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms	Olivier Gamache et.al.	2309.13139	link
2023-09-20	Conformalized Multimodal Uncertainty Regression and Reasoning	Domenico Parente et.al.	2309.11018	null
2023-09-20	OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving	Heng Li et.al.	2309.11011	link
2023-09-19	LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation	Haizhou Zhang et.al.	2309.10436	link
2023-09-21	Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration	Hongbo Zhao et.al.	2309.10314	null
2023-09-18	End-to-End Learned Event- and Image-based Visual Odometry	Roberto Pellerito et.al.	2309.09947	link
2023-09-14	An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments	Yehao Liu et.al.	2309.07408	null
2023-09-11	Evaluating Visual Odometry Methods for Autonomous Driving in Rain	Yu Xiang Tan et.al.	2309.05249	null
2023-09-08	Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry	Akankshya Kar et.al.	2309.04147	null
2023-09-04	EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity	Zijie Jiang et.al.	2309.01296	null
2023-08-27	Deep Learning for Visual Localization and Mapping: A Survey	Changhao Chen et.al.	2308.14039	null
2023-08-19	Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters	Xiao Liu et.al.	2308.09870	link
2023-08-12	4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion	Guirong Zhuo et.al.	2308.06573	null
2023-08-10	Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU	U. V. B. L. Udugama et.al.	2308.05515	null
2023-08-02	A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry	Cora A. Dimmig et.al.	2308.01398	null
2023-08-02	Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network	Shenbagaraj Kannapiran et.al.	2308.01125	null
2023-08-02	Preliminary Design of the Dragonfly Navigation Filter	Ben Schilling et.al.	2307.13513	null
2023-07-19	Optimizing the extended Fourier Mellin Transformation Algorithm	Wenqing Jiang et.al.	2307.10015	link
2023-07-15	Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents	Ke Cao et.al.	2307.07763	null
2023-07-26	Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression	Jianeng Wang et.al.	2306.01188	null
2023-07-06	OSPC: Online Sequential Photometric Calibration	Jawad Haidar et.al.	2305.17673	null
2023-05-15	Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface	Shifan Zhu et.al.	2305.08962	null
2023-05-10	Transformer-based model for monocular visual odometry: a video understanding approach	André O. Françani et.al.	2305.06121	link
2023-04-29	Modality-invariant Visual Odometry for Embodied Vision	Marius Memmel et.al.	2305.00348	link
2023-04-21	FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving	Yuxuan Liu et.al.	2304.10719	null
2023-07-08	Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Visual Bootstrapping	Hanyu Cai et.al.	2304.08978	null
2023-04-12	SiLK -- Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194	link
2023-04-11	ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster	Yifei Dong et.al.	2304.04943	null
2023-03-21	Learning a Depth Covariance Function	Eric Dexheimer et.al.	2303.12157	null
2023-03-21	Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network	Alessandro Navone et.al.	2303.11725	null
2023-03-20	VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors	Thien Hoang Nguyen et.al.	2303.10903	null
2023-03-17	CoVIO: Online Continual Learning for Visual-Inertial Odometry	Niclas Vödisch et.al.	2303.10149	link
2023-03-15	UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry	Chaoyang Jiang et.al.	2303.08550	null
2023-03-13	Discovering Multiple Algorithm Configurations	Leonid Keselman et.al.	2303.07434	null
2023-03-09	Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation	Masahiro Hirano et.al.	2303.05192	null
2023-03-16	Stereo Event-based Visual-Inertial Odometry	Kunfeng Wang et.al.	2303.05086	link
2023-03-07	Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor	Eduardo Gallo et.al.	2303.03804	null
2023-03-03	Lightweight, Uncertainty-Aware Conformalized Visual Odometry	Alex C. Stutts et.al.	2303.02207	null
2023-02-24	FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets	Yelena Randall et.al.	2302.12772	null
2023-02-27	CP+: Camera Poses Augmentation with Large-scale LiDAR Maps	Jiadi Cui et.al.	2302.12198	null
2023-02-19	EdgeVO: An Efficient and Accurate Edge-based Visual Odometry	Hui Zhao et.al.	2302.09493	null
2023-01-27	HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera	Mostafa Ahmadi et.al.	2301.11823	null
2023-01-26	Distributed Optimization Methods for Multi-Robot Systems: Part I -- A Tutorial	Ola Shorinwa et.al.	2301.11313	null
2023-01-24	Generalized Object Search	Kaiyu Zheng et.al.	2301.10121	null
2023-01-22	Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories	Hanlin Chen et.al.	2301.09194	null
2023-01-21	Dense RGB SLAM with Neural Implicit Maps	Heng Li et.al.	2301.08930	null
2023-01-18	Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information	Junshi Chen et.al.	2301.07560	null
2023-01-17	COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM	Manthan Patel et.al.	2301.07147	link
2023-01-31	Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems	Pierre-Yves Lajoie et.al.	2301.06230	link
2023-01-13	A LiDAR-Inertial-Visual SLAM System with Loop Detection	Kangcheng Liu et.al.	2301.05604	null
2023-01-11	AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization	Ying Chen et.al.	2301.04620	link
2023-01-12	TBV Radar SLAM -- trust but verify loop candidates	Daniel Adolfsson et.al.	2301.04397	link
2022-12-31	Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges	Maxwell McManus et.al.	2301.03359	null
2023-01-09	Motion Addition and Motion Optimization	Liqun Qi et.al.	2301.03174	null
2023-01-08	Towards Open World NeRF-Based SLAM	Daniil Lisus et.al.	2301.03102	null
2023-01-06	CyberLoc: Towards Accurate Long-term Visual Localization	Liu Liu et.al.	2301.02403	null
2023-01-03	LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation	Shreyansh Daftry et.al.	2301.01350	null
2022-12-31	4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions	Patrick Wenzel et.al.	2301.01147	null
2023-01-03	BS3D: Building-scale 3D Reconstruction from RGB-D Images	Janne Mustaniemi et.al.	2301.01057	null
2023-01-10	An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping	Masoud Dayani Najafabadi et.al.	2301.00618	link
2022-12-25	A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion	Nadia Figueroa et.al.	2212.14772	null
2022-12-29	An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping	Kangcheng Liu et.al.	2212.14209	link
2022-12-27	Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands	Felipe Gómez-Cuba et.al.	2212.13477	link
2022-12-26	ESVIO: Event-based Stereo Visual Inertial Odometry	Peiyu Chen et.al.	2212.13184	link
2022-12-24	A Comprehensive Review on Autonomous Navigation	Saeid Nahavandi et.al.	2212.12808	null
2022-12-23	Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation	Marina Lotti et.al.	2212.12388	null
2022-12-23	Implementation of a Blind navigation method in outdoors/indoors areas	Mohammad Javadian Farzaneh et.al.	2212.12185	null
2022-12-22	S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations	Hriday Bavle et.al.	2212.11770	link
2022-12-22	Active SLAM: A Review On Last Decade	Muhammad Farhan Ahmed et.al.	2212.11654	null
2022-12-27	Motion, Unit Dual Quaternion and Motion Optimization	Liqun Qi et.al.	2212.11593	null
2022-12-22	Vision-Based Environmental Perception for Autonomous Driving	Fei Liu et.al.	2212.11453	null
2022-12-19	Mu $^{2}$ SLAM: Multitask, Multilingual Speech and Language Models	Yong Cheng et.al.	2212.09553	null
2022-12-16	Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments	Lasitha Weerakoon et.al.	2212.08633	null
2022-12-16	rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments	Bo Wei et.al.	2212.08418	null
2023-03-02	AirVO: An Illumination-Robust Point-Line Visual Odometry	Kuan Xu et.al.	2212.07595	link
2022-12-14	Autonomous Vehicle Navigation with LIDAR using Path Planning	Rahul M K et.al.	2212.07155	null
2022-12-14	RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping	Hyowon Kim et.al.	2212.07141	null
2022-12-13	Know What You Don't Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version)	Daniil Lisus et.al.	2212.06923	null
2022-12-13	SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance	Chenyangguang Zhang et.al.	2212.06524	null
2022-12-13	Localization and Navigation System for Indoor Mobile Robot	Yanbaihui Liu et.al.	2212.06391	null
2022-12-12	Evaluation of RGB-D SLAM in Large Indoor Environments	Kirill Muravyev et.al.	2212.05980	null
2022-12-19	A Light-Weight LiDAR-Inertial SLAM System with Loop Closing	Kangcheng Liu et.al.	2212.05743	link
2022-12-12	An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds	Kangcheng Liu et.al.	2212.05705	link
2022-12-09	SLAM for Visually Impaired People: A Survey	Marziyeh Bamdad et.al.	2212.04745	null
2022-12-09	Ego-Body Pose Estimation via Ego-Head Pose Estimation	Jiaman Li et.al.	2212.04636	null
2022-12-06	Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles	Sushant Veer et.al.	2212.03323	link
2022-12-06	PRISM: Probabilistic Real-Time Inference in Spatial World Models	Atanas Mirchev et.al.	2212.02988	null
2022-12-06	RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps	Florian Sauerbeck et.al.	2212.02085	link
2022-12-05	DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization	Xuebo Tian et.al.	2212.02077	null
2022-12-05	ObjectMatch: Robust Registration using Canonical Object Correspondences	Can Gümeli et.al.	2212.01985	null
2022-12-02	Sparse SPN: Depth Completion from Sparse Keypoints	Yuqun Wu et.al.	2212.00987	null
2022-12-01	maplab 2.0 -- A Modular and Multi-Modal Mapping Framework	Andrei Cramariuc et.al.	2212.00654	link
2022-12-01	AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body -- Theory and Experiments	Mehregan Dor et.al.	2212.00350	null
2022-11-30	MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves	Pranjali Pathre et.al.	2211.16882	null
2022-11-29	PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images	Hartmut Surmann et.al.	2211.16266	link
2022-11-29	MmWave Mapping and SLAM for 5G and Beyond	Yu Ge et.al.	2211.16024	null
2022-11-28	Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map	Xi Zheng et.al.	2211.15127	null
2022-11-29	BALF: Simple and Efficient Blur Aware Local Feature Detector	Zhenjun Zhao et.al.	2211.14731	null
2022-11-27	Development of a Modular Real-time Shared-control System for a Smart Wheelchair	Vaishanth Ramaraj et.al.	2211.14711	null
2022-11-26	A1 SLAM: Quadruped SLAM using the A1's Onboard Sensors	Jerred Chen et.al.	2211.14432	link
2022-11-23	ActiveRMAP: Radiance Field for Active Mapping And Planning	Huangying Zhan et.al.	2211.12656	null
2022-11-22	Vision-based localization methods under GPS-denied conditions	Zihao Lu et.al.	2211.11988	null
2022-11-21	Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques	David Ramirez et.al.	2211.11836	null
2022-11-21	ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields	Mohammad Mahdi Johari et.al.	2211.11704	null
2022-11-24	Data Fusion for Multipath-Based SLAM: Combing Information from Multiple Propagation Paths	Erik Leitinger et.al.	2211.09241	null
2022-11-16	Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery	Hao Qu et.al.	2211.08904	null
2022-11-20	Detecting Line Segments in Motion-blurred Images with Events	Huai Yu et.al.	2211.07365	link
2022-11-13	Automatic Eye-in-Hand Calibration using EKF	Aditya Ramakrishnan et.al.	2211.06881	null
2022-11-12	Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling	Zhihao Wang et.al.	2211.06557	link
2022-11-11	Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications	Jie Yang et.al.	2211.05982	null
2022-11-10	Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time	Ignacio Torroba et.al.	2211.05601	link
2022-11-07	When Geometry is not Enough: Using Reflector Markers in Lidar SLAM	Gerhard Kurz et.al.	2211.03484	null
2022-11-07	Detecting Invalid Map Merges in Lifelong SLAM	Matthias Holoch et.al.	2211.03423	null
2022-11-06	Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU	Yibin Wu et.al.	2211.03174	link
2022-11-07	Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments	Daniel Adolfsson et.al.	2211.02445	link
2022-11-03	DyOb-SLAM : Dynamic Object Tracking SLAM System	Rushmian Annoy Wadud et.al.	2211.01941	null
2022-11-03	Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM	Yang Chen et.al.	2211.01749	null
2022-11-04	$D^2$ SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm	Hao Xu et.al.	2211.01538	link
2022-11-02	Semantic SuperPoint: A Deep Semantic Descriptor	Gabriel S. Gama et.al.	2211.01098	link
2022-11-02	Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation	Myung-Hwan Jeon et.al.	2211.00960	link
2022-10-31	Mapping Extended Landmarks for Radar SLAM	Shuai Sun et.al.	2210.17207	null
2022-10-25	MAROAM: Map-based Radar SLAM through Two-step Feature Selection	Dequan Wang et.al.	2210.13797	null
2022-10-25	S3E: A Large-scale Multimodal Dataset for Collaborative SLAM	Dapeng Feng et.al.	2210.13723	link
2022-10-24	NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields	Antoni Rosinol et.al.	2210.13641	link
2022-10-24	Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging	Geng Wang et.al.	2210.13556	null
2022-10-28	VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points	Andreas Georgis et.al.	2210.12756	null
2022-10-22	SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation	Junliang Chen et.al.	2210.12417	null
2022-10-21	DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm	Shipeng Zhong et.al.	2210.11978	link
2022-10-21	Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments	Shubham Kedia et.al.	2210.11652	null
2022-10-22	Visual SLAM: What are the Current Trends and What to Expect?	Ali Tourani et.al.	2210.10491	null
2022-10-18	Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM	Geon Choi et.al.	2210.09636	null
2022-10-16	D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments	Ayman Beghdadi et.al.	2210.08647	null
2022-10-16	Indoor Smartphone SLAM with Learned Echoic Location Features	Wenjie Luo et.al.	2210.08493	null
2022-10-15	Self-Improving SLAM in Dynamic Environments: Learning When to Mask	Adrian Bojko et.al.	2210.08350	link
2022-10-13	Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems	Pushyami Kaveti et.al.	2210.07315	link
2022-10-12	RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map	Xuecheng Xu et.al.	2210.05984	link
2022-10-11	Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization	Yuanzheng He et.al.	2210.05600	null
2022-10-11	Autonomous Asteroid Characterization Through Nanosatellite Swarming	Kaitlin Dennison et.al.	2210.05518	null
2022-10-11	DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion	Yuxi Xiao et.al.	2210.05517	null
2022-10-11	Multi-Object Navigation with dynamically learned neural implicit representations	Pierre Marza et.al.	2210.05129	link
2022-10-12	Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation	Yulun Tian et.al.	2210.05020	null
2022-10-10	Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios	Xingyu Chen et.al.	2210.04562	null
2022-10-09	Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning	Ali Safa et.al.	2210.04236	null
2022-10-06	SCORE: A Second-Order Conic Initialization for Range-Aided SLAM	Alan Papalia et.al.	2210.03177	link
2022-10-06	Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding	Kirill Mazur et.al.	2210.03043	null
2022-10-06	Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence	Osian Morgan et.al.	2210.02642	null
2022-10-05	MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation	Hanwei Zhang et.al.	2210.02038	null
2022-10-04	O2S: Open-source open shuttle	Nwankwo Linus et.al.	2210.01627	null
2022-10-04	Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing	Weiying Wang et.al.	2210.01320	null
2022-10-03	Probabilistic Volumetric Fusion for Dense Monocular SLAM	Antoni Rosinol et.al.	2210.01276	null
2022-10-03	DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams	John McConnell et.al.	2210.00867	link
2022-10-03	A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments	Ha Sier et.al.	2210.00812	link
2022-10-01	Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2	Ali Eslamian et.al.	2210.00278	null
2022-09-30	PyPose: A Library for Robot Learning with Physics-based Optimization	Chen Wang et.al.	2209.15428	link
2022-09-29	DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment	Mariia Gladkova et.al.	2209.14965	null
2022-09-28	Robust Incremental Smoothing and Mapping (riSAM)	Daniel McGann et.al.	2209.14359	null
2022-09-27	Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping	Chi-Ming Chung et.al.	2209.13274	link
2022-09-24	Graph Neural Networks for Multi-Robot Active Information Acquisition	Mariliza Tzes et.al.	2209.12091	null
2022-09-24	Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes	Jonathan J. Y. Kim et.al.	2209.11894	null
2022-09-23	involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs	Gilad Rotman et.al.	2209.11591	null
2022-09-23	Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot	David Balaban et.al.	2209.11432	null
2022-09-22	SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation	Xiao Han et.al.	2209.10817	null
2022-09-22	Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio	Wenhao Qiu et.al.	2209.10726	null
2022-09-21	Visual Localization and Mapping in Dynamic and Changing Environments	João Carlos Virgolino Soares et.al.	2209.10710	null
2022-09-20	Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM	Sabir Hossain et.al.	2209.10047	null
2022-09-20	WGICP: Differentiable Weighted GICP-Based Lidar Odometry	Sanghyun Son et.al.	2209.09777	null
2022-09-20	PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention	José Arce et.al.	2209.09699	link
2022-09-19	MeSLAM: Memory Efficient SLAM based on Neural Fields	Evgenii Kruzhkov et.al.	2209.09357	null
2022-09-19	LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM	Letian Zhang et.al.	2209.08810	null
2022-09-18	HGI-SLAM: Loop Closure With Human and Geometric Importance Features	Shuhul Mujoo et.al.	2209.08608	null
2022-09-18	Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM	Jiarui Tan et.al.	2209.08578	link
2022-09-17	DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments	Shihao Shen et.al.	2209.08430	link
2022-09-17	OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM	Matthieu Zins et.al.	2209.08338	null
2022-09-17	PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments	Adam Dai et.al.	2209.08248	link
2022-09-16	ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM	Aditya Arun et.al.	2209.08091	null
2022-09-16	iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking	Yuhang Ming et.al.	2209.07919	null
2022-09-16	TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM	Mathieu Gonzalez et.al.	2209.07888	null
2022-09-15	Landmark Management in the Application of Radar SLAM	Shuai Sun et.al.	2209.07199	link
2022-09-15	PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization	Xianwei Meng et.al.	2209.07061	null
2022-09-14	Semantic Visual Simultaneous Localization and Mapping: A Survey	Kaiqi Chen et.al.	2209.06428	null
2022-09-13	Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets	Islam Ali et.al.	2209.06316	null
2022-09-12	A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding	Tin Lai et.al.	2209.05222	null
2022-09-12	Attitude-Guided Loop Closure for Cameras with Negative Plane	Ze Wang et.al.	2209.05167	link
2022-09-09	General Place Recognition Survey: Towards the Real-world Autonomy Age	Peng Yin et.al.	2209.04497	link
2022-09-08	ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology	Julio A. Placed et.al.	2209.03693	link
2022-09-08	R $^3$ LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator	Jiarong Lin et.al.	2209.03666	link
2022-09-06	Group- $k$ Consistent Measurement Set Maximization for Robust Outlier Detection	Brendon Forsgren et.al.	2209.02658	link
2022-09-05	Neuromorphic Visual Odometry with Resonator Networks	Alpha Renner et.al.	2209.02000	null
2022-09-05	MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM	Pavel Karpyshev et.al.	2209.01936	null
2022-09-05	ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics	Boyi Liu et.al.	2209.01774	null
2022-09-04	CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud	Evgeny Yudin et.al.	2209.01605	null
2022-08-31	PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM	Yifan Duan et.al.	2208.14848	null
2022-08-30	BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition	Peng Yin et.al.	2208.14543	null
2022-08-27	Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes	Ali Safa et.al.	2208.12997	null
2022-08-25	FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms	Jianhao Jiao et.al.	2208.11865	null
2022-08-25	Lidar SLAM for Autonomous Driving Vehicles	Farhad Aghili et.al.	2208.11855	null
2022-08-24	DynaVINS: A Visual-Inertial SLAM for Dynamic Environments	Seungwon Song et.al.	2208.11500	link
2022-08-22	Doppler Exploitation in Bistatic mmWave Radio SLAM	Yu Ge et.al.	2208.10204	null
2022-08-21	Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping	Lintong Zhang et.al.	2208.09825	link
2022-08-26	JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario	Longrui Dong et.al.	2208.09777	null
2022-08-15	BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM	Yunge Cui et.al.	2208.07473	link
2022-08-12	Handling Constrained Optimization in Factor Graphs for Autonomous Navigation	Barbara Bazzana et.al.	2208.06325	null
2022-08-11	RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild	Jason Y. Zhang et.al.	2208.05963	null
2022-08-08	Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation	Yifei Ren et.al.	2208.04274	link
2022-08-08	SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty	Shuai Zhang et.al.	2208.03945	link
2022-08-05	A Survey on Visual Map Localization Using LiDARs and Cameras	Elhousni Mahdi et.al.	2208.03376	null
2022-08-04	SROS2: Usable Cyber Security Tools for ROS 2	Victor Mayoral Vilches et.al.	2208.02615	link
2022-08-03	Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms	Bharath Garigipati et.al.	2208.02063	null
2022-08-02	Present and Future of SLAM in Extreme Underground Environments	Kamak Ebadi et.al.	2208.01787	null
2022-08-01	Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion	Simon Boche et.al.	2208.00709	null
2022-07-29	Neural Density-Distance Fields	Itsuki Ueda et.al.	2207.14455	link
2022-07-25	DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions	Tristan Laidlow et.al.	2207.12244	null
2022-07-25	Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration	Kenji Koide et.al.	2207.11942	null
2022-07-22	NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction	Yunlong Ran et.al.	2207.10985	null
2022-07-22	Dense RGB-D-Inertial SLAM with Map Deformations	Tristan Laidlow et.al.	2207.10940	null
2022-07-22	PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes	BaoSheng Zhang et.al.	2207.10916	null
2022-07-21	Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion	Suman Ghosh et.al.	2207.10494	link
2022-07-21	Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions	Quentin Serdel et.al.	2207.10489	link
2022-07-21	On applicability of von Karman's momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity	Yujin Lu et.al.	2207.10413	null
2022-07-19	Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM	Tuvy Lemberg et.al.	2207.09103	null
2022-07-18	DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM	Weicai Ye et.al.	2207.08794	link
2022-07-18	Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction	Marco Orsingher et.al.	2207.08439	null
2022-07-18	ORB-based SLAM accelerator on SoC FPGA	Vibhakar Vemulapati et.al.	2207.08405	null
2022-07-14	Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset	Riccardo Giubilato et.al.	2207.06815	null
2022-07-14	Semi-supervised Vector-Quantization in Visual SLAM using HGCN	Amir Zarringhalam et.al.	2207.06738	null
2022-07-14	Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders	Amir Zarringhalam et.al.	2207.06732	null
2022-07-13	SLAM: SLO-Aware Memory Optimization for Serverless Applications	Gor Safaryan et.al.	2207.06183	null
2022-07-19	Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras	Fangwen Shu et.al.	2207.06058	link
2022-07-12	Accelerating Certifiable Estimation with Preconditioned Eigensolvers	David M. Rosen et.al.	2207.05257	null
2022-07-12	Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features	Meiyu Zhi et.al.	2207.05244	null
2022-07-14	SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial	Chih-Yuan Chiu et.al.	2207.05043	null
2022-07-08	BlindSpotNet: Seeing Where We Cannot See	Taichi Fukuda et.al.	2207.03870	null
2022-07-08	Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints	Philipp Glira et.al.	2207.03785	null
2022-07-08	Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements	Ran Liu et.al.	2207.03700	null
2022-07-07	RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments	Qihao Peng et.al.	2207.03539	null
2022-07-06	VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization	Marius Laska et.al.	2207.02668	null
2022-07-06	A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models	Axel Garcia-Vega et.al.	2207.02396	null
2022-07-04	VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM	Ling Gao et.al.	2207.01404	null
2022-07-04	VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM	Danpeng Chen et.al.	2207.01158	null
2022-07-03	Wireless Channel Prediction in Partially Observed Environments	Mingsheng Yin et.al.	2207.00934	null
2022-07-01	A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers	Julio A. Placed et.al.	2207.00254	null
2022-07-01	Keeping Less is More: Point Sparsification for Visual SLAM	Yeonsoo Park et.al.	2207.00225	null
2022-06-30	Controlled and impulsive compression of an entrapped air bubble during impact	Utkarsh Jain et.al.	2206.15297	null
2022-06-30	Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery	Yuehao Wang et.al.	2206.15255	link
2022-06-27	IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments	Abanob Soliman et.al.	2206.13455	link
2022-06-26	An Efficient Global Optimality Certificate for Landmark-Based SLAM	Connor Holmes et.al.	2206.12961	link
2022-06-21	Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping	Davide Tateo et.al.	2206.10263	link
2022-06-20	Data Fusion for Radio Frequency SLAM with Robust Sampling	Erik Leitinger et.al.	2206.09746	null
2022-06-19	RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments	Chenglong Qian et.al.	2206.09463	null
2022-06-17	Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments	Khairuldanial Ismail et.al.	2206.08733	null
2022-06-17	An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions	Yijun Yuan et.al.	2206.08712	link
2022-06-13	ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy	Hao Bai et.al.	2206.06435	null
2022-06-10	Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming	Javier Cremona et.al.	2206.05066	link
2022-06-09	SparseFormer: Attention-based Depth Completion Network	Frederik Warburg et.al.	2206.04557	null
2022-06-07	Robot Self-Calibration Using Actuated 3D Sensors	Arne Peters et.al.	2206.03430	null
2022-06-07	Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map	Haodong Yuan et.al.	2206.03062	null
2022-06-05	DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions	Alena Savinykh et.al.	2206.02199	null
2022-06-04	C $^3$ Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy	Erez Posner et.al.	2206.01961	null
2022-06-01	PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry	Dong-Uk Seo et.al.	2206.00266	link
2022-05-27	A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching	Arno Solin et.al.	2205.13821	null
2022-05-31	LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments	Yun Chang et.al.	2205.13135	link
2022-05-25	Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM	Milad Ramezani et.al.	2205.12595	null
2022-05-24	Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM	Christopher E. Denniston et.al.	2205.12402	link
2022-05-22	ALITA: A Large-scale Incremental Dataset for Long-term Autonomy	Peng Yin et.al.	2205.10737	link
2022-05-19	FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2	Jeffrey Ichnowski et.al.	2205.09778	link
2022-05-17	Global Data Association for SLAM with 3D Grassmannian Manifold Objects	Parker C. Lusk et.al.	2205.08556	null
2022-05-19	Cluster on Wheels	Yuanyuan Yang et.al.	2205.08151	null
2022-05-12	Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry	Shihao Shen et.al.	2205.05916	link
2022-05-12	S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization	Ran Cheng et.al.	2205.05861	null
2022-05-14	Multi-modal Semantic SLAM for Complex Dynamic Environments	Han Wang et.al.	2205.04300	link
2022-05-06	OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations	Carmen Delgado et.al.	2205.03256	null
2022-05-05	CNN-Augmented Visual-Inertial SLAM with Planar Constraints	Pan Ji et.al.	2205.02940	null
2022-05-05	PMBM-based SLAM Filters in 5G mmWave Vehicular Networks	Hyowon Kim et.al.	2205.02502	null
2022-05-04	BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking	Dorian Henning et.al.	2205.02301	null
2022-05-04	A Global Asymptotic Convergent Observer for SLAM	Seyed Hamed Hashemi et.al.	2205.01953	null
2022-05-04	Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation	Nathaniel Merrill et.al.	2205.01823	link
2022-05-03	GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping	Pan Ji et.al.	2205.01656	null
2022-04-29	Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM	Jinwoo Jeon et.al.	2204.13877	link
2022-04-27	The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection	Konstantinos A. Tsintotas et.al.	2204.12831	null
2022-04-27	Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment	Wenyu Li et.al.	2204.12769	null
2022-04-29	MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment	Tingchen Ma et.al.	2204.11621	null
2022-04-23	Indoor simultaneous localization and mapping based on fringe projection profilometry	Yang Zhao et.al.	2204.11020	null
2022-04-22	Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria	Julio A. Placed et.al.	2204.10631	null
2022-04-22	Fast Autonomous Robotic Exploration Using the Underlying Graph Structure	Julio A. Placed et.al.	2204.10610	null
2022-04-22	Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions	Yutong Hu et.al.	2204.10552	null
2022-04-22	Implicit Object Mapping With Noisy Data	Jad Abou-Chakra et.al.	2204.10516	link
2022-04-19	Photometric single-view dense 3D reconstruction in endoscopy	Victor M. Batlle et.al.	2204.09083	null
2022-04-18	Pulsar skips: Understanding variations in the regular periods of rotating neutron stars	Clayton Miller et.al.	2204.08449	null
2022-04-18	Tracking monocular camera pose and deformation for SLAM inside the human body	Juan J. Gomez Rodriguez et.al.	2204.08309	null
2022-04-18	Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker	Hanjing Ye et.al.	2204.08163	null
2022-04-14	ViViD++: Vision for Visibility Dataset	Alex Junho Lee et.al.	2204.06183	null
2022-04-12	HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud	Zhixing Hou et.al.	2204.05481	null
2022-04-12	RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room	Cong Gao et.al.	2204.05467	null
2022-04-11	Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context	Lizhou Liao et.al.	2204.04932	link
2022-04-04	Monitoring social distancing with single image depth estimation	Alessio Mingozzi et.al.	2204.01693	null
2022-04-01	Bi-directional Loop Closure for Visual SLAM	Ihtisham Ali et.al.	2204.01524	null
2022-04-04	IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers	Lei Sun et.al.	2204.01324	link
2022-04-03	Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor	Wenyan Ou et.al.	2204.01154	null
2022-04-02	UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps	Ayyappa Swamy Thatavarthy et.al.	2204.00865	link
2022-03-31	Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects	Yujie Lu et.al.	2204.00035	null
2022-03-30	GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios	Chih-Yuan Chiu et.al.	2203.16690	null
2022-03-29	Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field	Mostafa Osman et.al.	2203.15866	null
2022-03-29	Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform	Mingjun Li et.al.	2203.15439	null
2022-03-29	Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots	Pranay Mathur et.al.	2203.15272	null
2022-03-28	Are High-Resolution Event Cameras Really Needed?	Daniel Gehrig et.al.	2203.14672	null
2022-03-25	Spectral Measurement Sparsification for Pose-Graph SLAM	Kevin J. Doherty et.al.	2203.13897	link
2022-03-25	FD-SLAM: 3-D Reconstruction Using Features and Dense Matching	Xingrui Yang et.al.	2203.13861	null
2022-03-25	Gravity-constrained point cloud registration	Vladimír Kubelka et.al.	2203.13799	null
2022-03-24	MD-SLAM: Multi-cue Direct SLAM	Luca Di Giammarino et.al.	2203.13237	link
2022-03-24	Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video	Shun Taguchi et.al.	2203.12804	null
2022-03-19	Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems	Jie Yang et.al.	2203.10267	null
2022-03-16	Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR	Ian D. Miller et.al.	2203.08925	link
2022-03-15	Neural RF SLAM for unsupervised positioning and mapping with channel state information	Shreya Kadambi et.al.	2203.08264	null
2022-03-15	Simultaneous Localisation and Mapping with Quadric Surfaces	Tristan Laidlow et.al.	2203.08040	null
2022-03-14	Drift Reduced Navigation with Deep Explainable Features	Mohd Omama et.al.	2203.06897	link
2022-03-11	An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs	Keisuke Sugiura et.al.	2203.05763	null
2022-03-10	High Definition, Inexpensive, Underwater Mapping	Bharat Joshi et.al.	2203.05640	link
2022-03-10	SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning	Jaehoon Choi et.al.	2203.05332	null
2022-03-08	Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM	Pierre-Yves Lajoie et.al.	2203.04446	link
2022-03-08	SLAM-Supported Self-Training for 6D Object Pose Estimation	Ziqi Lu et.al.	2203.04424	link
2022-03-08	An Online Semantic Mapping System for Extending and Enhancing Visual SLAM	Thorsten Hempel et.al.	2203.03944	null
2022-03-07	Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms	Qingqing Li et.al.	2203.03454	link
2022-03-07	OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition	Junyi Ma et.al.	2203.03397	link
2022-03-06	Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM	Kazushi Aiba et.al.	2203.02887	null
2022-03-06	RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects	Ran Long et.al.	2203.02882	null
2022-03-03	STUN: Self-Teaching Uncertainty Estimation for Place Recognition	Kaiwen Cai et.al.	2203.01851	link
2022-03-03	Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning	Niclas Vödisch et.al.	2203.01578	link
2022-03-02	FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry	Chunran Zheng et.al.	2203.00893	link
2022-03-02	Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation	Yulun Tian et.al.	2203.00851	null
2022-03-01	Descriptellation: Deep Learned Constellation Descriptors for SLAM	Chunwei Xing et.al.	2203.00567	null
2022-03-01	Collaborative Robot Mapping using Spectral Graph Analysis	Lukas Bernreiter et.al.	2203.00308	null
2022-02-26	RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization	Nikolaos Kourtzanidis et.al.	2202.13221	link
2022-02-25	Probabilistic Data Association for Semantic SLAM at Scale	Elad Michael et.al.	2202.12802	link
2022-02-24	TwistSLAM: Constrained SLAM in Dynamic Environment	Mathieu Gonzalez et.al.	2202.12384	null
2022-02-24	Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion	Hyeonsoo Jang et.al.	2202.12108	null
2022-02-23	MITI: SLAM Benchmark for Laparoscopic Surgery	Regine Hartwig et.al.	2202.11496	null
2022-02-23	DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization	Xuebo Tian et.al.	2202.11431	null
2022-02-23	Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets	Islam Ali et.al.	2202.11312	null
2022-02-22	SAGE: SLAM with Appearance and Geometry Prior for Endoscopy	Xingtong Liu et.al.	2202.09487	link
2022-02-18	OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure	Stefan Leutenegger et.al.	2202.09199	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146	link
2022-02-18	An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems	Qiang Liu et.al.	2202.08952	null
2022-02-17	Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study	Giovanni Cioffi et.al.	2202.08894	link
2022-02-17	LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building	Jiashi Zhang et.al.	2202.08487	null
2022-02-16	Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments	Jinkun Wang et.al.	2202.08359	null
2022-02-11	Overhead Image Factors for Underwater Sonar-based SLAM	John McConnell et.al.	2202.05811	null
2022-02-10	Scale Estimation with Dual Quadrics for Monocular Object SLAM	Shuangfu Song et.al.	2202.04816	null
2022-02-08	A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition	Nie Jiwei et.al.	2202.03677	null
2022-01-25	Autonomous Vehicles: Open-Source Technologies, Considerations, and Development	Oussama Saoudi et.al.	2202.03148	null
2022-02-07	Temporal Point Cloud Completion with Pose Disturbance	Jieqi Shi et.al.	2202.03084	null
2022-02-04	DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments	Xinggang Hu et.al.	2202.01938	null
2022-02-01	A Model for Multi-View Residual Covariances based on Perspective Deformation	Alejandro Fontan et.al.	2202.00765	null
2022-01-30	Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM	Xinghe Chu et.al.	2201.12726	null
2022-01-28	RGB-D SLAM Using Attention Guided Frame Association	Ali Caglayan et.al.	2201.12047	null
2022-02-04	Learning to Act with Affordance-Aware Multimodal Neural SLAM	Zhiwei Jia et.al.	2201.09862	link
2022-01-22	Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems	Xi Zheng et.al.	2201.09048	link
2022-01-17	SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System	Giseop Kim et.al.	2201.06423	null
2022-01-14	SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions	Ali Samadzadeh et.al.	2201.05386	link
2022-01-19	Multi-Hypothesis Scan Matching through Clustering	Giorgio Iavicoli et.al.	2201.03814	null
2022-01-11	Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM	Kevin J. Doherty et.al.	2201.03773	null
2022-01-10	High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM	Brian M. Hopkinson et.al.	2201.03364	link
2022-01-10	Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition	M. Usman Maqbool Bhutta et.al.	2201.03212	link
2022-01-04	Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds	Xueliang Wen et.al.	2201.00959	null
2021-12-29	Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic	Khen Elimelech et.al.	2112.14428	null
2021-12-19	M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots	Jie Yin et.al.	2112.13659	link
2021-12-27	UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping	Hyunjun Lim et.al.	2112.13515	link
2021-12-25	Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs	Yusheng Wang et.al.	2112.13224	null
2021-12-25	Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping	Peng Huang et.al.	2112.13222	null
2021-12-24	3D Point Cloud Reconstruction and SLAM as an Input	Ziyu Li et.al.	2112.12907	null
2021-12-22	NICE-SLAM: Neural Implicit Scalable Encoding for SLAM	Zihan Zhu et.al.	2112.12130	link
2021-12-18	Fast and Robust Registration of Partially Overlapping Point Clouds	Eduardo Arnold et.al.	2112.09922	link
2021-12-17	Symmetry-aware Neural Architecture for Embodied Visual Navigation	Shuang Liu et.al.	2112.09515	null
2021-12-27	Homography Decomposition Networks for Planar Object Tracking	Xinrui Zhan et.al.	2112.07909	link
2021-12-14	Autonomous Navigation System from Simultaneous Localization and Mapping	Micheal Caracciolo et.al.	2112.07723	link
2021-12-12	360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation	Bolivar Solarte et.al.	2112.06180	link
2021-12-11	Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization	Amay Saxena et.al.	2112.05921	null
2021-12-07	Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems	Gideon Billings et.al.	2112.03826	link
2021-12-05	Iterated Posterior Linearization PMB Filter for 5G SLAM	Yu Ge et.al.	2112.02575	null
2021-12-03	Fast Direct Stereo Visual SLAM	Jiawei Mo et.al.	2112.01890	link
2021-12-02	MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment	Jie Ren et.al.	2112.01349	link
2021-12-01	Research on Event Accumulator Settings for Event-Based SLAM	Kun Xiao et.al.	2112.00427	link
2021-11-29	An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments	Assem Sadek et.al.	2111.14666	null
2021-11-29	Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report	Hartmut Surmann et.al.	2111.14542	null
2021-11-24	Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment	V. Ayala-Alfaro et.al.	2111.12690	null
2021-11-24	Autonomous bot with ML-based reactive navigation for indoor environment	Yash Srivastava et.al.	2111.12542	null
2021-11-22	A General Framework for Lifelong Localization and Mapping in Changing Environment	Min Zhao et.al.	2111.10946	link
2021-11-17	Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network	Xiaoming Zhao et.al.	2111.09006	null
2021-11-10	Comparing dominance of tennis' big three via multiple-output Bayesian quantile regression models	Bruno Santos et.al.	2111.05631	null
2021-11-10	TomoSLAM: factor graph optimization for rotation angle refinement in microtomography	Mark Griguletskii et.al.	2111.05562	null
2021-11-07	Hierarchical Segment-based Optimization for SLAM	Yuxin Tian et.al.	2111.04101	null
2021-11-07	Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM	Shing Yan Loo et.al.	2111.04096	null
2021-11-05	MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry	Joan P. Company-Corcoles et.al.	2111.03408	null
2021-10-31	Loop closure detection using local 3D deep descriptors	Youjie Zhou et.al.	2111.00440	link
2021-10-27	Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification	Mingsheng Yin et.al.	2110.14789	link
2021-10-27	Efficient Placard Discovery for Semantic Mapping During Frontier Exploration	David Balaban et.al.	2110.14742	null
2021-10-26	Robust Multi-view Registration of Point Sets with Laplacian Mixture Model	Jin Zhang et.al.	2110.13744	null
2021-10-25	WOLF: A modular estimation framework for robotics based on factor graphs	Joan Sola et.al.	2110.12919	null
2021-10-21	Real-Time Ground-Plane Refined LiDAR SLAM	Fan Yang et.al.	2110.11517	null
2021-10-21	SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words	Jonathan J. Y. Kim et.al.	2110.11491	null
2021-10-21	InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion	Zhenkun Zhu et.al.	2110.11040	null
2021-10-20	SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training	Ankur Bapna et.al.	2110.10329	null
2021-10-18	Enhancing exploration algorithms for navigation with visual SLAM	Kirill Muravyev et.al.	2110.09156	null
2021-10-18	Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment	Rui Tian et.al.	2110.08977	null
2021-10-16	Partial Hierarchical Pose Graph Optimization for SLAM	Alexander Korovko et.al.	2110.08639	null
2021-10-14	Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach	Shumon Koga et.al.	2110.07546	null
2021-10-13	Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity	Ran Liu et.al.	2110.06541	null
2021-10-12	Learning Efficient Multi-Agent Cooperative Visual Exploration	Chao Yu et.al.	2110.05734	null
2021-10-07	Self-Supervised Depth Completion for Active Stereo	Frederik Warburg et.al.	2110.03234	null
2021-10-06	InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes	Zhenkun Zhu et.al.	2110.02593	null
2021-10-03	AEROS: Adaptive RObust least-Squares for Graph-Based SLAM	Milad Ramezani et.al.	2110.02018	null
2021-10-04	Fast Uncertainty Quantification for Active Graph SLAM	Julio A. Placed et.al.	2110.01289	link
2021-10-04	Geometry-based Graph Pruning for Lifelong SLAM	Gerhard Kurz et.al.	2110.01286	null
2021-10-03	Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration	Marcus Greiff et.al.	2110.01099	null
2021-10-02	Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows	Qiangqiang Huang et.al.	2110.00876	link

(back to top)

SFM

Publish Date	Title	Authors	PDF	Code
2025-03-03	MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Yohann Cabon et.al.	2503.01661	null
2025-03-03	ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization	Anas Abdelkarim et.al.	2503.01311	null
2025-03-04	A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping	Jialei He et.al.	2503.01202	null
2025-03-02	MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain	Rui Yi Yong et.al.	2503.00853	null
2025-03-02	PSRGS:Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery	BoCheng Li et.al.	2503.00848	null
2025-03-02	Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration	Jinjiang You et.al.	2503.00737	link
2025-02-28	The THESAN-ZOOM project: Burst, quench, repeat -- unveiling the evolution of high-redshift galaxies along the star-forming main sequence	William McClymont et.al.	2503.00106	null
2025-02-27	Best Foot Forward: Robust Foot Reconstruction in-the-wild	Kyle Fogarty et.al.	2502.20511	null
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932	null
2025-03-04	Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model	Yaxuan Huang et.al.	2502.16779	null
2025-02-20	CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting	Qilin Zhang et.al.	2502.14684	link
2025-02-19	Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections	Seong Jong Yoo et.al.	2502.13986	null
2025-02-19	IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360 $^\circ$ Cameras	Dongki Jung et.al.	2502.12545	null
2025-02-12	Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors	Vishwanath Pratap Singh et.al.	2502.08587	null
2025-02-10	FOCUS -- Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences	Oliver Boyne et.al.	2502.06367	link
2025-02-09	Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models	Jing-Xuan Zhang et.al.	2502.05766	link
2025-02-10	Building Rome with Convex Optimization	Haoyu Han et.al.	2502.04640	null
2025-02-04	SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification	Yifu Tao et.al.	2502.02657	null
2025-02-05	GP-GS: Gaussian Processes for Enhanced Gaussian Splatting	Zhihao Guo et.al.	2502.02283	link
2025-02-03	XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications	Shangjin Zhai et.al.	2502.01297	null
2025-01-29	Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment	Zixue Zeng et.al.	2501.17690	null
2025-01-28	Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction	Tim Flückiger et.al.	2501.16221	null
2025-01-25	Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos	Zhen-Hui Dong et.al.	2501.15096	null
2025-01-24	MATCHA:Towards Matching Anything	Fei Xue et.al.	2501.14945	null
2025-01-24	Light3R-SfM: Towards Feed-forward Structure-from-Motion	Sven Elflein et.al.	2501.14914	null
2025-01-24	Dense-SfM: Structure from Motion with Dense Consistent Matching	JongMin Lee et.al.	2501.14277	null
2025-01-21	Theory of quantum-geometric charge and spin Josephson diode effects in strongly spin-polarized hybrid structures with noncoplanar spin textures	Niklas L. Schulz et.al.	2501.12232	null
2025-01-14	Selective Attention Merging for low resource tasks: A case study of Child ASR	Natarajan Balaji Shankar et.al.	2501.08468	link
2025-01-14	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015	null
2025-02-02	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927	link
2025-01-11	Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis	Aditya Rauniyar et.al.	2501.06431	null
2025-01-09	Existence of dynamical fluctuation in AMPT generated data for Au+Au collisions at 10 AGeV	Somen Gope et.al.	2501.05175	null
2025-01-06	Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation	Yuezhang Lv et.al.	2501.02821	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409	null
2025-01-02	EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy	Ao Gao et.al.	2501.01003	null
2024-12-30	KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences	Keng-Wei Chang et.al.	2412.20767	null
2024-12-27	Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images	Xudong Cai et.al.	2412.19518	null
2024-12-25	Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition	Shujie Hu et.al.	2412.18832	null
2024-12-23	Reconstructing People, Places, and Cameras	Lea Müller et.al.	2412.17806	null
2024-12-18	Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation	Rémi Marsal et.al.	2412.14103	null
2024-12-16	Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection	Beomseok Lee et.al.	2412.11978	null
2024-12-18	SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video	Jongmin Park et.al.	2412.09982	null
2024-12-12	CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework	Yushan Han et.al.	2412.08344	null
2024-12-10	Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling	Hui Deng et.al.	2412.07230	null
2024-12-08	Unveiling True Talent: The Soccer Factor Model for Skill Evaluation	Alexandre Andorra et.al.	2412.05911	null
2024-12-08	Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features	Yuanbo Xiangli et.al.	2412.05826	null
2024-12-06	MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Zhengqi Li et.al.	2412.04463	null
2024-12-03	ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification	Pan Zhang et.al.	2412.02044	link
2024-12-02	SfM-Free 3D Gaussian Splatting via Hierarchical Training	Bo Ji et.al.	2412.01553	link
2024-12-02	MVImgNet2.0: A Larger-scale Dataset of Multi-view Images	Xiaoguang Han et.al.	2412.01430	null
2024-12-02	TAS-TsC: A Data-Driven Framework for Estimating Time of Arrival Using Temporal-Attribute-Spatial Tri-space Coordination of Truck Trajectories	Mengran Li et.al.	2412.01122	null
2024-12-02	Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM	Alejandro Fontan et.al.	2412.01116	null
2024-11-27	RoMo: Robust Motion Segmentation Improves Structure from Motion	Lily Goli et.al.	2411.18650	null
2024-11-26	The MAGPI Survey: radial trends in star formation across different cosmological simulations in comparison with observations at $z \sim$ 0.3	Marcie Mun et.al.	2411.17882	null
2024-11-25	Characterizing Stellar and Gas Properties in NGC 628: Spatial Distributions, Radial Gradients, and Resolved Scaling Relations	Peng Wei et.al.	2411.16150	null
2024-11-24	ZeroGS: Training 3D Gaussian Splatting from Unposed Images	Yu Chen et.al.	2411.15779	null
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291	null
2024-11-15	SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction	Yutao Tang et.al.	2411.12592	link
2024-11-15	The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods	Yifu Tao et.al.	2411.10546	null
2024-11-13	4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization	Mijeong Kim et.al.	2411.08879	null
2024-11-13	Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model	Yutao Shen et.al.	2411.08453	null
2024-11-08	From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$ -NeuS	Haoran Zhang et.al.	2411.05362	link
2024-10-29	A Cascade Approach for APT Campaign Attribution in System Event Logs: Technique Hunting and Subgraph Matching	Yi-Ting Huang et.al.	2410.22602	null
2024-10-29	LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues	Hanqing Jiang et.al.	2410.22213	null
2024-10-17	Stochastic Flow Matching for Resolving Small-Scale Physics	Stathi Fotiadis et.al.	2410.19814	null
2024-10-25	A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint	Changshi Mu et.al.	2410.19473	link
2024-10-30	Large Spatial Model: End-to-end Unposed Images to Semantic 3D	Zhiwen Fan et.al.	2410.18956	link
2024-10-23	CO-CAVITY project: Molecular gas and star formation in void galaxies	M. I. Rodríguez et.al.	2410.18078	null
2024-10-23	PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting	Yu Wang et.al.	2410.17505	null
2024-10-20	Neural Active Structure-from-Motion in Dark and Textureless Environment	Kazuto Ichimaru et.al.	2410.15378	null
2024-10-17	SemSim: Revisiting Weak-to-Strong Consistency from a Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation	Shiao Xie et.al.	2410.13486	null
2024-10-16	Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks	Orchid Chetia Phukan et.al.	2410.12947	null
2024-10-16	Gravity-aligned Rotation Averaging with Circular Regression	Linfei Pan et.al.	2410.12763	link
2024-10-16	Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals	Orchid Chetia Phukan et.al.	2410.12645	null
2024-10-15	SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection	Yizhe Liu et.al.	2410.12080	link
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505	null
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187	link
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-10-09	Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models	Ange Lou et.al.	2410.07434	null
2024-10-09	Deep HI Mapping of M 106 Group with FAST	Yao Liu et.al.	2410.07038	null
2024-10-09	MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data	Mingu Kang et.al.	2410.06442	null
2024-10-08	Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation?	Charalambos Tzamos et.al.	2410.05984	link
2024-10-04	Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering	Laura Fink et.al.	2410.03861	null
2024-10-01	MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages	Marco Gaido et.al.	2410.01036	link
2024-10-01	Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance	Hongchao Shu et.al.	2410.00386	null
2024-09-29	Robust Incremental Structure-from-Motion with Hybrid Features	Shaohui Liu et.al.	2409.19811	null
2024-09-27	MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion	Bardienus Duisterhof et.al.	2409.19152	null
2024-09-27	Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras	Yipeng Lu et.al.	2409.18673	null
2024-09-26	BlinkTrack: Feature Tracking over 100 FPS via Events and Images	Yichen Shen et.al.	2409.17981	null
2024-09-25	How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not	Francesco Verdini et.al.	2409.17044	null
2024-09-24	Frequency-based View Selection in Gaussian Splatting Reconstruction	Monica M. Q. Li et.al.	2409.16470	null
2024-10-07	Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion	Juan-Diego Florez et.al.	2409.16465	null
2024-09-24	Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research	Vandita Shukla et.al.	2409.15914	null
2024-09-23	Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments	Francisco Roza de Moraes et.al.	2409.15602	null
2024-09-23	Evaluating Robot Influence on Pedestrian Behavior Models for Crowd Simulation and Benchmarking	Subham Agrawal et.al.	2409.14844	null
2024-09-21	Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models	Orchid Chetia Phukan et.al.	2409.14131	null
2024-09-17	GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module	Yichen Zhang et.al.	2409.11307	null
2024-09-13	Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints	Shan Chen et.al.	2409.08613	null
2024-09-09	KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction	Davide Di Nucci et.al.	2409.05407	null
2024-09-06	The Arizona Molecular ISM Survey with the SMT: Variations in the CO(2-1)/CO(1-0) Line Ratio Across the Galaxy Population	Ryan P. Keenan et.al.	2409.03963	null
2024-09-05	Active Galactic Nuclei in the Green Valley at z $\sim$ 0.7	Charity Woodrum et.al.	2409.03197	null
2024-09-04	Object Gaussian for Monocular 6D Pose Estimation from Sparse Views	Luqing Luo et.al.	2409.02581	null
2024-09-11	Geometry-aware Feature Matching for Large-Scale Structure from Motion	Gonglin Chen et.al.	2409.02310	null
2024-09-04	The study of strongly intensive observables for $π^{\pm,0}$ in $pp$ collisions at LHC energy in the framework of PYTHIA model	Tumpa Biswas et.al.	2409.00525	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373	null
2024-09-15	Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks	Sierra Bonilla et.al.	2408.16445	link
2024-08-21	Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations	Lintong Zhang et.al.	2408.11966	null
2024-08-20	TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks	Jinjie Mai et.al.	2408.10739	null
2024-08-16	Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS	Wei Sun et.al.	2408.08723	null
2024-08-15	CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning	Wei Zhu et.al.	2408.08134	link
2024-08-13	A Miniature Vision-Based Localization System for Indoor Blimps	Shicong Ma et.al.	2408.06648	null
2024-08-07	Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM	Yan Song Hu et.al.	2408.03825	null
2024-08-05	Context-aware Mamba-based Reinforcement Learning for social robot navigation	Syed Muhammad Mustafa et.al.	2408.02661	null
2024-08-04	Birational geometry of critical loci in Algebraic Vision	Marina Bertolini et.al.	2408.02067	null
2024-08-04	PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone	Xin Yang et.al.	2408.02053	null
2024-08-02	Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris	Kentaro Uno et.al.	2408.01035	null
2024-08-01	LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting	Zhenyu Bao et.al.	2408.00254	null
2024-07-29	Global Structure-from-Motion Revisited	Linfei Pan et.al.	2407.20219	link
2024-08-06	Revisit Self-supervised Depth Estimation with Local Structure-from-Motion	Shengjie Zhu et.al.	2407.19166	null
2024-07-23	The Hidden Variables: Harnessing Half-Shell Potentials for Enhanced Precision in Nuclear Reaction Calculations	Hao Liu et.al.	2407.16452	null
2024-07-22	Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures	Ruizhe Wang et.al.	2407.15435	null
2024-07-16	NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models	Francesco Milano et.al.	2407.12207	link
2024-07-15	LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning	Zhuozhu Jian et.al.	2407.10782	null
2024-07-15	Towards Scale-Aware Full Surround Monodepth with Transformers	Yuchen Yang et.al.	2407.10406	null
2024-07-14	3DEgo: 3D Editing on the Go!	Umar Khalid et.al.	2407.10102	null
2024-07-10	Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization	Jinjie Mai et.al.	2407.08023	link
2024-07-10	Euclid preparation. Forecasting the recovery of galaxy physical properties and their relations with template-fitting and machine-learning methods	Euclid Collaboration et.al.	2407.07940	null
2024-07-10	Controlling Space and Time with Diffusion Models	Daniel Watson et.al.	2407.07860	null
2024-07-09	Computer vision tasks for intelligent aerospace missions: An overview	Huilin Chen et.al.	2407.06513	null
2024-07-08	Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views	Jiawei Guo et.al.	2407.05666	null
2024-07-05	Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization	Shaohan Li et.al.	2407.04260	null
2024-07-15	SfM on-the-fly: Get better 3D from What You Capture	Zongqian Zhan et.al.	2407.03939	null
2024-07-03	Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction	Jiaxin Guo et.al.	2407.02918	link
2024-07-02	Indoor 3D Reconstruction with an Unknown Camera-Projector Pair	Zhaoshuai Qi et.al.	2407.01945	null
2024-06-27	SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas	John Lambert et.al.	2406.19390	link
2024-06-27	STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning	Yanan Zhang et.al.	2406.19362	null
2024-06-26	VDG: Vision-Only Dynamic Gaussian for Driving Simulation	Hao Li et.al.	2406.18198	null
2024-06-25	Consensus Learning with Deep Sets for Essential Matrix Estimation	Dror Moran et.al.	2406.17414	link
2024-06-24	Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction	Tong Qin et.al.	2406.16289	null
2024-06-21	The importance of stochasticity in determining galaxy emissivities and UV LFs during cosmic dawn and reionization	Ivan Nikolić et.al.	2406.15237	link
2024-06-19	MVSBoost: An Efficient Point Cloud-based 3D Reconstruction	Umair Haroon et.al.	2406.13515	null
2024-06-17	MegaScenes: Scene-Level View Synthesis at Scale	Joseph Tung et.al.	2406.11819	link
2024-06-15	Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models	Ruchao Fan et.al.	2406.10507	link
2024-06-14	On the Evaluation of Speech Foundation Models for Spoken Language Understanding	Siddhant Arora et.al.	2406.10083	null
2024-06-12	Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement	Maxime Pietrantoni et.al.	2406.08463	null
2024-06-12	SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models	Chun Yin et.al.	2406.08445	null
2024-06-10	Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis	Xin Jin et.al.	2406.06216	link
2024-06-07	The Star-Forming Main Sequence in JADES and CEERS at $z>1.4$ : Investigating the Burstiness of Star Formation	Leonardo Clarke et.al.	2406.05178	null
2024-06-13	Gaussian Splatting with Localized Points Management	Haosen Yang et.al.	2406.04251	null
2024-06-05	L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration	Yibo Liu et.al.	2406.03298	link
2024-06-04	CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation	Dejia Xu et.al.	2406.02509	null
2024-05-29	Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy	Zijie Jiang et.al.	2405.18863	null
2024-05-29	3D Reconstruction with Fast Dipole Sums	Hanyu Chen et.al.	2405.16788	null
2024-05-26	MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups	Yusen Xie et.al.	2405.16599	null
2024-05-26	Categorical Flow Matching on Statistical Manifolds	Chaoran Cheng et.al.	2405.16441	link
2024-05-22	Exploring Galaxy Properties of eCALIFA with Contrastive Learning	G. Martínez-Solaeche et.al.	2405.13471	null
2024-05-23	Switched Flow Matching: Eliminating Singularities via Switching ODEs	Qunxi Zhu et.al.	2405.11605	null
2024-05-28	NeRO: Neural Road Surface Reconstruction	Ruibo Wang et.al.	2405.10554	link
2024-05-15	Three Dimensional Spatial Cognition: Bees and Bats	Robert Worden et.al.	2405.09413	null
2024-05-09	Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media	Zhizhen Zhang et.al.	2405.05760	null
2024-05-09	Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment	Simon Weber et.al.	2405.05079	link
2024-05-07	Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications	Markus Hillemann et.al.	2405.04345	null
2024-05-07	Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling	Jiawei Shi et.al.	2405.04309	null
2024-05-06	Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion	Yunfeng Li et.al.	2405.03177	link
2024-05-03	HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2	Miriam Jäger et.al.	2405.02005	null
2024-04-25	The MAGPI Survey: Evolution of radial trends in star formation activity across cosmic time	Marcie Mun et.al.	2404.16319	null
2024-04-22	Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer	Eric Brachmann et.al.	2404.14351	null
2024-04-22	RESFM: Robust Equivariant Multiview Structure from Motion	Fadi Khatib et.al.	2404.14280	null
2024-04-22	Does Gaussian Splatting need SFM Initialization?	Yalda Foroutan et.al.	2404.12547	null
2024-05-07	A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion	Feng Yu et.al.	2404.11590	link
2024-04-18	DeblurGS: Gaussian Splatting for Camera Motion Blur	Jeongtaek Oh et.al.	2404.11358	null
2024-05-21	LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives	Jiadi Cui et.al.	2404.09748	null
2024-04-12	MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance	Yuqun Wu et.al.	2404.08252	null
2024-04-11	Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation	Keonhee Han et.al.	2404.07933	null
2024-04-07	NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization	Peng Tu et.al.	2404.04875	null
2024-04-04	GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis	Emmanouil Nikolakakis et.al.	2404.03126	null
2024-03-29	InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds	Zhiwen Fan et.al.	2403.20309	link
2024-03-29	HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes	Zhuopeng Li et.al.	2403.20032	null
2024-03-26	NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation	Jiahao Chen et.al.	2403.17537	null
2024-03-25	INPC: Implicit Neural Point Clouds for Radiance Field Rendering	Florian Hahlbohm et.al.	2403.16862	null
2024-03-18	An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation	Zewen Xu et.al.	2403.11639	null
2024-03-14	Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting	Jaewoo Jung et.al.	2403.09413	link
2024-03-13	Refractive COLMAP: Refractive Structure-from-Motion Revisited	Mengkun She et.al.	2403.08640	null
2024-03-13	NeRF-Supervised Feature Point Detection and Description	Ali Youssef et.al.	2403.08156	link
2024-03-11	SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection	Yifu Tao et.al.	2403.06877	null
2024-03-24	BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling	Cheng Peng et.al.	2403.04926	link
2024-02-22	GaussianPro: 3D Gaussian Splatting with Progressive Propagation	Kai Cheng et.al.	2402.14650	null
2024-02-25	A Robust Error-Resistant View Selection Method for 3D Reconstruction	Shaojie Zhang et.al.	2402.11431	null
2024-02-17	Dense Matchers for Dense Tracking	Tomáš Jelínek et.al.	2402.11287	null
2024-03-11	Local Feature Matching Using Deep Learning: A Survey	Shibiao Xu et.al.	2401.17592	link
2024-01-22	HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs	Zelin Gao et.al.	2401.11711	null
2024-01-19	SCENES: Subpixel Correspondence Estimation With Epipolar Supervision	Dominik A. Kloepfer et.al.	2401.10886	null
2024-01-15	3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data	Mathilde Letard et.al.	2401.09481	link
2024-01-17	3D Scene Geometry Estimation from 360 $^\circ$ Imagery: A Survey	Thiago Lopes Trugillo da Silveira et.al.	2401.09252	null
2024-01-17	ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization	Weiyao Wang et.al.	2401.08937	null
2024-01-16	Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions	Yi-Fan Zuo et.al.	2401.08043	link
2024-01-10	Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects	Tianhang Cheng et.al.	2401.05236	link
2024-01-07	A Classification of Critical Configurations for any Number of Projective Views	Martin Bråtelund et.al.	2401.03450	link
2023-12-24	Residual Learning for Image Point Descriptors	Rashik Shrestha et.al.	2312.15471	null
2023-12-16	Transformers in Unsupervised Structure-from-Motion	Hemang Chawla et.al.	2312.10529	link
2023-12-14	HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video	Xueying Wang et.al.	2312.08863	null
2023-12-14	CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning	Qingsong Yan et.al.	2312.08760	null
2023-12-11	Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach	Travis Driver et.al.	2312.06865	link
2023-12-11	Gaussian Splatting SLAM	Hidenobu Matsuki et.al.	2312.06741	null
2023-12-10	SuperPrimitive: Scene Reconstruction at a Primitive Level	Kirill Mazur et.al.	2312.05889	null
2023-12-07	Visual Geometry Grounded Deep Structure From Motion	Jianyuan Wang et.al.	2312.04563	null
2023-11-30	Distributed Global Structure-from-Motion with a Deep Front-End	Ayush Baid et.al.	2311.18801	link
2023-11-21	Robot Hand-Eye Calibration using Structure-from-Motion	Nicolas Andreff et.al.	2311.11808	null
2023-11-18	LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation	Sébastien Henry et.al.	2311.11171	null
2023-11-10	MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty	Rémi Marsal et.al.	2311.06137	link
2023-11-08	VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering	Linus Franke et.al.	2311.04634	link
2023-10-22	A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video	Jan Emily Mangulabnan et.al.	2310.14364	null
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605	null
2023-10-09	Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration	Chunge Bai et.al.	2310.05504	link
2023-10-08	LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization	Artem Nenashev et.al.	2310.05134	null
2023-11-29	Pose-Free Generalizable Rendering Transformer	Zhiwen Fan et.al.	2310.03704	link
2023-10-02	Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images	Georg Bökman et.al.	2310.01092	null
2023-10-01	Propagating Semantic Labels in Video Data	David Balaban et.al.	2310.00783	null
2023-09-22	Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning	Jonathan Sauder et.al.	2309.12804	null
2023-09-21	On-the-Fly SfM: What you capture is What you get	Zongqian Zhan et.al.	2309.11883	link
2023-09-19	Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water	Jayesh Tripathi et.al.	2309.10269	null
2023-09-16	DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF	Mert Asim Karaoglu et.al.	2309.08927	link
2023-09-08	Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry	Akankshya Kar et.al.	2309.04147	null
2023-09-01	SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation	Youhong Wang et.al.	2309.00526	null
2023-09-01	Dense Voxel 3D Reconstruction Using a Monocular Event Camera	Haodong Chen et.al.	2309.00385	null
2023-08-30	Learning Structure-from-Motion with Graph Attention Networks	Lucas Brynte et.al.	2308.15984	link
2023-08-26	Disjoint Pose and Shape for 3D Face Reconstruction	Raja Kumar et.al.	2308.13903	null
2023-08-30	CamP: Camera Preconditioning for Neural Radiance Fields	Keunhong Park et.al.	2308.10902	null
2023-08-18	Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling	Haorui Ji et.al.	2308.10705	null
2023-08-14	Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation	Tao Liu et.al.	2308.07231	link
2023-08-11	Efficient Large-scale AUV-based Visual Seafloor Mapping	Mengkun She et.al.	2308.06147	null
2023-08-04	EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems	Weihan Wang et.al.	2308.02670	null
2023-08-15	Tirtha -- An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites	Jyotirmaya Shivottam et.al.	2308.01246	link
2023-08-02	Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network	Shenbagaraj Kannapiran et.al.	2308.01125	null
2023-07-27	PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking	Yang Zheng et.al.	2307.15055	link
2023-07-28	SACReg: Scene-Agnostic Coordinate Regression for Visual Localization	Jerome Revaud et.al.	2307.11702	null
2023-07-19	Lazy Visual Localization via Motion Averaging	Siyan Dong et.al.	2307.09981	null
2023-07-10	Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor	San Jiang et.al.	2307.04520	null
2023-07-07	RGB-D Mapping and Tracking in a Plenoxel Radiance Field	Andreas L. Teigen et.al.	2307.03404	link
2023-06-29	The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes	David Recasens et.al.	2306.16917	link
2023-06-27	Detector-Free Structure from Motion	Xingyi He et.al.	2306.15669	link
2023-06-28	PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment	Jianyuan Wang et.al.	2306.15667	null
2023-06-24	3D Reconstruction of Spherical Images based on Incremental Structure from Motion	San Jiang et.al.	2306.12770	link
2023-06-15	NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations	Varun Jampani et.al.	2306.09109	link
2023-06-15	Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization	Dror Aiger et.al.	2306.09012	link
2023-06-10	3D reconstruction using Structure for Motion	Kshitij Karnawat et.al.	2306.06360	link
2023-06-02	Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images	Marcela Mera-Trujillo et.al.	2306.01938	null
2023-05-31	FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow	Cameron Smith et.al.	2306.00180	null
2023-05-19	SIDAR: Synthetic Image Dataset for Alignment & Restoration	Monika Kwiatkowski et.al.	2305.12036	link
2023-05-09	Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization	Clémentin Boittiaux et.al.	2305.05301	link
2023-05-09	Rotation Synchronization via Deep Matrix Factorization	Gk Tejus et.al.	2305.05268	link
2023-04-20	A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion	Miriam Jäger et.al.	2304.10664	null
2023-04-14	Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments	Felix Ott et.al.	2304.07250	null
2023-04-12	Visual Localization using Imperfect 3D Models from the Internet	Vojtech Panek et.al.	2304.05947	link
2023-04-08	Photometric Correction for Infrared Sensors	Jincheng Zhang et.al.	2304.03930	null
2023-04-07	DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium	Antyanta Bangunharcana et.al.	2304.03560	link
2023-04-05	Semantic Validation in Structure from Motion	Joseph Rowell et.al.	2304.02420	link
2023-03-31	Learning Internal Representations of 3D Transformations from 2D Projected Inputs	Marissa Connor et.al.	2303.17776	null
2023-03-30	3D Line Mapping Revisited	Shaohui Liu et.al.	2303.17504	link
2023-03-27	TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering	Jaehoon Choi et.al.	2303.15060	null
2023-03-26	On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks	HyunJun Jung et.al.	2303.14840	link
2023-03-24	Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container	Jinguang Tong et.al.	2303.13805	link
2023-03-24	Progressively Optimized Local Radiance Fields for Robust View Synthesis	Andreas Meuleman et.al.	2303.13791	null
2023-03-15	RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters	Shuja Khalid et.al.	2303.08695	null
2023-03-09	Revisiting Rotation Averaging: Uncertainties and Robust Losses	Ganlin Zhang et.al.	2303.05195	link
2023-02-28	Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images	Zhongli Fan et.al.	2302.14239	link
2023-03-25	BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling	Sameera Ramasinghe et.al.	2302.13543	null
2023-02-21	EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images	Zhichao Ye et.al.	2302.10544	link
2023-02-18	Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering	Tatsuro Yamane et.al.	2302.09208	null
2023-02-12	Uncertainty-Driven Dense Two-View Structure from Motion	Weirong Chen et.al.	2302.00523	null
2023-01-28	AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion	Yu Chen et.al.	2301.12135	null
2023-01-20	A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles	Zhefan Xu et.al.	2301.08422	link
2023-03-21	Robust Dynamic Radiance Fields	Yu-Lun Liu et.al.	2301.02239	link
2022-12-24	Polarimetric Multi-View Inverse Rendering	Jinyu Zhao et.al.	2212.12721	null
2022-12-13	Accidental Turntables: Learning 3D Pose by Watching Objects Turn	Zezhou Cheng et.al.	2212.06300	null
2022-12-04	3D Object Aided Self-Supervised Monocular Depth Estimation	Songlin Wei et.al.	2212.01768	null
2022-12-02	High-Res Facial Appearance Capture from Polarized Smartphone Images	Dejan Azinović et.al.	2212.01160	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069	link
2022-11-24	JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models	Sepidehsadat Hosseini et.al.	2211.13785	null
2022-11-24	SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks	Sergio Izquierdo et.al.	2211.13551	link
2022-11-22	Level-S $^2$ fM: Structure from Motion on Neural Level Set of Implicit Surfaces	Yuxi Xiao et.al.	2211.12018	link
2022-11-21	Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques	David Ramirez et.al.	2211.11836	null
2022-11-14	Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion	René Haas et.al.	2211.07195	null
2022-10-13	Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach	Zhiang Chen et.al.	2210.07349	null
2022-10-11	DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion	Yuxi Xiao et.al.	2210.05517	null
2022-10-07	Leveraging Structure from Motion to Localize Inaccessible Bus Stops	Indu Panigrahi et.al.	2210.03646	link
2022-10-01	Structure-Aware NeRF without Posed Camera via Epipolar Constraint	Shu Chen et.al.	2210.00183	link
2022-10-05	FAST-LIO, Then Bayesian ICP, Then GTSFM	Jerred Chen et.al.	2210.00146	null
2022-09-20	BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction	Ahalya Ravendran et.al.	2209.09470	null
2022-09-19	A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion	Gerry Chen et.al.	2209.08690	null
2022-09-14	End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes	Qiao Chen et.al.	2209.06926	null
2022-09-07	Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021	Hartmut Surmann et.al.	2209.03084	null
2022-08-27	Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data	Thomas A. Ciarfuglia et.al.	2208.13001	null
2022-08-12	Handling Constrained Optimization in Factor Graphs for Autonomous Navigation	Barbara Bazzana et.al.	2208.06325	null
2022-08-04	Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training	Yao-Chih Lee et.al.	2208.02709	link
2022-07-31	One Object at a Time: Accurate and Robust Structure From Motion for Robots	Aravind Battaje et.al.	2208.00487	null
2022-07-23	Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks	Daniel Posada et.al.	2207.11413	null
2022-07-25	MeshLoc: Mesh-Based Visual Localization	Vojtech Panek et.al.	2207.10762	link
2022-07-19	ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild	Wang Zhao et.al.	2207.09137	link
2022-07-16	Organic Priors in Non-Rigid Structure from Motion	Suryansh Kumar et.al.	2207.06262	null
2022-07-06	A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models	Axel Garcia-Vega et.al.	2207.02396	null
2022-06-24	Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set	San Jiang et.al.	2206.11499	null
2022-06-13	TC-SfM: Robust Track-Community-Based Structure-from-Motion	Lei Wang et.al.	2206.05866	null
2022-06-10	EigenFairing: 3D Model Fairing using Image Coherence	Pragyana Mishra et.al.	2206.05309	null
2022-06-01	Semantic Room Wireframe Detection from a Single View	David Gillsjö et.al.	2206.00491	link
2022-05-31	Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction	Qiancheng Fu et.al.	2205.15848	null
2022-05-09	Is my Depth Ground-Truth Good Enough? HAMMER -- Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression	HyunJun Jung et.al.	2205.04565	null
2022-05-07	Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs	Pedro F. Proença et.al.	2205.03522	null
2022-05-06	EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms	Levi Burner et.al.	2205.03467	null
2022-04-20	Learned Monocular Depth Priors in Visual-Inertial Initialization	Yunwen Zhou et.al.	2204.09171	null
2022-04-10	Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective	Hui Deng et.al.	2204.04730	null
2022-04-08	Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems	Debao Huang et.al.	2204.04145	null
2022-04-07	SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation	Yi Wei et.al.	2204.03636	link
2022-04-06	Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion	Lukas Bommes et.al.	2204.02733	link
2022-04-05	Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows	Sheng Liu et.al.	2204.02509	link
2022-03-31	Fast, Accurate and Memory-Efficient Partial Permutation Synchronization	Shaohan Li et.al.	2203.16505	null
2022-03-28	Visual Odometry for RGB-D Cameras	Afonso Fontes et.al.	2203.15119	null
2022-03-28	Optimizing Elimination Templates by Greedy Parameter Search	Evgeniy Martyushev et.al.	2203.14901	link
2022-03-23	Event-Based Dense Reconstruction Pipeline	Kun Xiao et.al.	2203.12270	null
2022-03-21	DiffPoseNet: Direct Differentiable Camera Pose Estimation	Chethan M. Parameshwara et.al.	2203.11174	null
2022-03-02	Asynchronous Optimisation for Event-based Visual Odometry	Daqi Liu et.al.	2203.01037	null
2022-03-02	Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation	Yulun Tian et.al.	2203.00851	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146	link
2022-01-20	GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry	Yunhan Zhao et.al.	2201.08131	null
2022-01-13	Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching	Yunpeng Shi et.al.	2201.04797	link
2022-01-10	High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM	Brian M. Hopkinson et.al.	2201.03364	link
2022-01-06	De-rendering 3D Objects in the Wild	Felix Wimbauer et.al.	2201.02279	link
2021-12-29	On the Instability of Relative Pose Estimation and RANSAC's Role	Hongyi Fan et.al.	2112.14651	null
2021-12-16	Road-aware Monocular Structure from Motion and Homography Estimation	Wei Sui et.al.	2112.08635	null
2021-12-10	Critical configurations for three projective views	Martin Bråtelund et.al.	2112.05478	null
2021-12-09	Critical configurations for two projective views, a new approach	Martin Bråtelund et.al.	2112.05074	null
2021-12-06	Dense Depth Priors for Neural Radiance Fields from Sparse Input Views	Barbara Roessle et.al.	2112.03288	link
2021-12-10	MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment	Jie Ren et.al.	2112.01349	link
2021-11-11	Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft	Pascal Schoppmann et.al.	2111.06271	null
2021-11-10	Damage Estimation and Localization from Sparse Aerial Imagery	Rene Garcia Franceschini et.al.	2111.03708	null
2021-11-03	Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems	Swarnabja Bhaumik et.al.	2111.02064	null
2021-10-14	Modeling dynamic target deformation in camera calibration	Annika Hagemann et.al.	2110.07322	null
2021-10-13	Hyperspectral 3D Mapping of Underwater Environments	Maxime Ferrera et.al.	2110.06571	null
2021-09-24	Automatic Map Update Using Dashcam Videos	Aziza Zhanabatyrova et.al.	2109.12131	null
2021-09-16	Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs	Gabriel Moreira et.al.	2109.08046	link
2021-09-06	Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications	Tejas Mane et.al.	2109.02740	null
2021-09-02	Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency	Beatrix-Emőke Fülöp-Balogh et.al.	2109.01018	null
2021-09-01	On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation	Eric Brachmann et.al.	2109.00524	link
2021-08-31	DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension	Roman Shapovalov et.al.	2109.00033	null
2021-08-29	Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration	Seyed-Mahdi Nasiri et.al.	2108.12876	null
2021-08-23	Burst Imaging for Light-Constrained Structure-From-Motion	Ahalya Ravendran et.al.	2108.09895	null

(back to top)

Visual Localization

Publish Date	Title	Authors	PDF	Code
2025-03-04	TeTRA-VPR: A Ternary Transformer Approach for Compact Visual Place Recognition	Oliver Grainge et.al.	2503.02511	null
2025-03-04	Introspective Loop Closure for SLAM with 4D Imaging Radar	Maximilian Hilger et.al.	2503.02383	null
2025-03-04	Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models	Kenta Tsukahara et.al.	2503.02256	null
2025-03-03	Composed Multi-modal Retrieval: A Survey of Approaches and Applications	Kun Zhang et.al.	2503.01334	link
2025-03-03	AirRoom: Objects Matter in Room Reidentification	Runmao Yao et.al.	2503.01130	null
2025-03-02	Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching	Jinyu Miao et.al.	2503.00862	null
2025-03-01	Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning	Songlin Dong et.al.	2503.00515	null
2025-02-28	EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration	Kuangyi Chen et.al.	2503.00167	null
2025-02-28	CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval	Zelong Sun et.al.	2502.20826	null
2025-02-28	SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition	Shanshan Wan et.al.	2502.20676	null
2025-02-27	A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization	Yejun Zhang et.al.	2502.20036	null
2025-02-27	On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation	Ruben T. Lucassen et.al.	2502.19285	null
2025-02-26	BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure	Haoxin Cai et.al.	2502.19242	null
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932	null
2025-02-25	MegaLoc: One Retrieval to Place Them All	Gabriele Berton et.al.	2502.17237	link
2025-02-23	Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries	Yin Wu et.al.	2502.16636	link
2025-02-23	SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition	Feng Lu et.al.	2502.16601	link
2025-02-21	ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval	Guanqi Zhan et.al.	2502.15682	null
2025-02-20	Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition	Tianyi Shang et.al.	2502.14195	link
2025-02-19	3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments	Vincent Ress et.al.	2502.13803	null
2025-02-18	Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization	Shuo Xing et.al.	2502.13146	link
2025-02-19	IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360 $^\circ$ Cameras	Dongki Jung et.al.	2502.12545	null
2025-02-17	From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations	Matteo Scucchia et.al.	2502.12303	null
2025-02-17	Descriminative-Generative Custom Tokens for Vision-Language Models	Pramuditha Perera et.al.	2502.12095	null
2025-02-17	ILIAS: Instance-Level Image retrieval At Scale	Giorgos Kordopatis-Zilos et.al.	2502.11748	null
2025-02-17	Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition	Jianyi Peng et.al.	2502.11742	null
2025-02-17	Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics	Francesco Croce et.al.	2502.11725	link
2025-02-17	Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization	Yuanze Xu et.al.	2502.11408	null
2025-02-13	ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation	Rotem Shalev-Arkushin et.al.	2502.09411	null
2025-02-12	SpeechCompass: Enhancing Mobile Captioning with Diarization and Directional Guidance via Multi-Microphone Localization	Artem Dementyev et.al.	2502.08848	null
2025-02-12	Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions	Prajwal Gatti et.al.	2502.08438	null
2025-02-11	Captured by Captions: On Memorization and its Mitigation in CLIP Models	Wenhao Wang et.al.	2502.07830	null
2025-02-11	Ultrafast 4D scanning transmission electron microscopy for imaging of localized optical fields	Petr Koutenský et.al.	2502.07338	null
2025-02-11	Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos	Haowen Gao et.al.	2502.07327	null
2025-02-11	PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval	Osman Tursun et.al.	2502.07215	null
2025-02-10	AstroLoc: Robust Space to Ground Image Localizer	Gabriele Berton et.al.	2502.07003	null
2025-02-09	Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education	Yanhao Jia et.al.	2502.05863	null
2025-02-07	Learning Street View Representations with Spatiotemporal Contrast	Yong Li et.al.	2502.04638	null
2025-02-06	Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion	Marco Mistretta et.al.	2502.04263	link
2025-02-05	Human-Aligned Image Models Improve Visual Decoding from the Brain	Nona Rajabi et.al.	2502.03081	null
2025-02-03	ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies	Costin F. Ciusdel et.al.	2502.01335	null
2025-01-31	LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks	Liudi Yang et.al.	2501.19382	link
2025-01-27	Freestyle Sketch-in-the-Loop Image Segmentation	Subhadeep Koley et.al.	2501.16022	null
2025-01-26	Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations	Zijun Long et.al.	2501.15379	null
2025-01-24	Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection	Viktor Kozák et.al.	2501.14587	null
2025-01-23	Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models	Jakob Krogh Petersen et.al.	2501.14051	link
2025-01-22	Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation	Kenta Uesugi et.al.	2501.13968	null
2025-01-19	Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection	Zhipeng Yu et.al.	2501.11063	link
2025-01-18	A Resource-Efficient Training Framework for Remote Sensing Text--Image Retrieval	Weihang Zhang et.al.	2501.10638	null
2025-01-17	FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis	Zhe Chen et.al.	2501.09887	null
2025-01-15	Vision Foundation Models for Computed Tomography	Suraj Pai et.al.	2501.09001	link
2025-01-12	SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval	Bhavin Jawade et.al.	2501.08347	null
2025-01-14	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286	null
2025-01-13	Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps	Saurabh Gupta et.al.	2501.07399	null
2025-01-12	Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation	Zhenyang Feng et.al.	2501.06749	null
2025-01-06	Integrating Language-Image Prior into EEG Decoding for Cross-Task Zero-Calibration RSVP-BCI	Xujin Li et.al.	2501.02841	null
2025-01-03	A Minimal Subset Approach for Efficient and Scalable Loop Closure	Nikolaos Stathoulopoulos et.al.	2501.01791	link
2025-01-03	iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings	Shuhei Tomoshige et.al.	2501.01642	null
2025-01-02	R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization	Xudong Jiang et.al.	2501.01421	null
2025-01-02	Training Medical Large Vision-Language Models with Abnormal-Aware Feedback	Yucheng Zhou et.al.	2501.01377	null
2025-01-02	Domain-invariant feature learning in brain MR imaging for content-based image retrieval	Shuya Tobari et.al.	2501.01326	null
2024-12-28	GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting	Atticus J. Zeller et.al.	2412.20056	link
2024-12-25	FOR: Finetuning for Object Level Open Vocabulary Image Retrieval	Hila Levi et.al.	2412.18806	null
2024-12-24	ERVD: An Efficient and Robust ViT-Based Distillation Framework for Remote Sensing Image Retrieval	Le Dong et.al.	2412.18136	link
2024-12-22	Where am I? Cross-View Geo-localization with Natural Language Descriptions	Junyan Ye et.al.	2412.17007	null
2024-12-22	Large-Scale UWB Anchor Calibration and One-Shot Localization Using Gaussian Process	Shenghai Yuan et.al.	2412.16880	null
2024-12-24	Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling	Daichi Yashima et.al.	2412.16576	link
2024-12-20	A New Method to Capturing Compositional Knowledge in Linguistic Space	Jiahe Wan et.al.	2412.15632	null
2024-12-20	Stabilizing Laplacian Inversion in Fokker-Planck Image Retrieval using the Transport-of-Intensity Equation	Samantha J Alloo et.al.	2412.15513	null
2024-12-19	Learning Visual Composition through Improved Semantic Guidance	Austin Stone et.al.	2412.15396	null
2024-12-19	MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval	Junjie Zhou et.al.	2412.14475	null
2024-12-18	Adversarial Hubness in Multi-Modal Retrieval	Tingwei Zhang et.al.	2412.14113	link
2024-12-18	Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval	Giacomo Pacini et.al.	2412.13834	null
2024-12-18	ConDo: Continual Domain Expansion for Absolute Pose Regression	Zijun Li et.al.	2412.13452	link
2024-12-17	Three Things to Know about Deep Metric Learning	Yash Patel et.al.	2412.12432	null
2024-12-15	Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval	Zelong Sun et.al.	2412.11087	null
2024-12-18	Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval	Yuanmin Tang et.al.	2412.11077	null
2024-12-13	MVC-VPR: Mutual Learning of Viewpoint Classification and Visual Place Recognition	Qiwen Gu et.al.	2412.09199	null
2024-12-12	A Flexible Plug-and-Play Module for Generating Variable-Length	Liyang He et.al.	2412.08922	link
2024-12-11	Image Retrieval Methods in the Dissimilarity Space	Madhu Kiran et.al.	2412.08618	null
2024-12-11	Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization	Siyan Dong et.al.	2412.08376	link
2024-12-11	Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin	Benjamin D. Killeen et.al.	2412.08020	null
2024-12-10	On Motion Blur and Deblurring in Visual Place Recognition	Timur Ismagilov et.al.	2412.07751	null
2024-12-10	Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance	Wanwen Chen et.al.	2412.07741	null
2024-12-09	An Efficient Scene Coordinate Encoding and Relocalization Method	Kuan Xu et.al.	2412.06488	link
2024-12-09	A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition	Connor Malone et.al.	2412.06153	null
2024-12-07	Compositional Image Retrieval via Instruction-Aware Contrastive Learning	Wenliang Zhong et.al.	2412.05756	link
2024-12-06	DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification	Ying Jin et.al.	2412.04828	null
2024-12-04	Distillation of Diffusion Features for Semantic Correspondence	Frank Fundel et.al.	2412.03512	null
2024-12-04	Composed Image Retrieval for Training-Free Domain Conversion	Nikos Efthymiadis et.al.	2412.03297	link
2024-12-03	A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration	Thulio Amorim et.al.	2412.02881	null
2024-12-03	Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval	Leah Bar et.al.	2412.02310	link
2024-12-02	Mutli-View 3D Reconstruction using Knowledge Distillation	Aditya Dutt et.al.	2412.02039	link
2024-12-02	Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features	MD Shaikh Rahman et.al.	2412.01555	null
2024-12-02	Neuron Abandoning Attention Flow: Visual Explanation of Dynamics inside CNN Models	Yi Liao et.al.	2412.01202	null
2024-12-01	EDTformer: An Efficient Decoder Transformer for Visual Place Recognition	Tong Jin et.al.	2412.00784	null
2024-11-28	EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval	Muhammad Huzaifa et.al.	2412.00139	null
2024-11-29	A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications	Liqiang Zhang Ye Tian Dongyan Wei et.al.	2411.19845	null
2024-11-27	Optimizing Image Retrieval with an Extended b-Metric Space	Abdelkader Belhenniche et.al.	2411.18800	null
2024-11-26	Learning Visual Hierarchies with Hyperbolic Embeddings	Ziwei Wang et.al.	2411.17490	null
2024-11-24	Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy	You Li et.al.	2411.16752	null
2024-11-24	AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks	You Li et.al.	2411.16749	null
2024-11-25	Image Generation Diversity Issues and How to Tame Them	Mischa Dombrowski et.al.	2411.16171	link
2024-11-24	PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments	Haoang Li et.al.	2411.15800	null
2024-11-22	Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval	Zengbao Sun et.al.	2411.14704	null
2024-11-20	Globally Correlation-Aware Hard Negative Generation	Wenjie Peng et.al.	2411.13145	link
2024-11-18	Exploring Emerging Trends and Research Opportunities in Visual Place Recognition	Antonios Gasteratos et.al.	2411.11481	null
2024-11-13	OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances	Youqi Liao et.al.	2411.08665	link
2024-11-13	Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval	Saul Santos et.al.	2411.08590	link
2024-11-22	Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments	Ashkan Nejad et.al.	2411.08567	link
2024-11-13	MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation	Peng Wang et.al.	2411.08279	link
2024-11-05	From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing	Xintian Sun et.al.	2411.05826	null
2024-11-04	TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives	Maitreya Patel et.al.	2411.02545	null
2024-11-11	INQUIRE: A Natural World Text-to-Image Retrieval Benchmark	Edward Vendrow et.al.	2411.02537	link
2024-11-20	Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models	Sharat Agarwal et.al.	2411.01925	null
2024-11-04	Semantic Masking and Visual Feature Matching for Robust Localization	Luisa Mao et.al.	2411.01804	null
2024-11-03	Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification	MD Shaikh Rahman et.al.	2411.01473	null
2024-11-01	Identifying Implicit Social Biases in Vision-Language Models	Kimia Hamidieh et.al.	2411.00997	null
2024-10-31	Nearest Neighbor Normalization Improves Multimodal Retrieval	Neil Chowdhury et.al.	2410.24114	link
2024-10-31	MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval	Haiwen Li et.al.	2410.23736	null
2024-10-30	Decoupling Semantic Similarity from Spatial Alignment for Neural Networks	Tassilo Wald et.al.	2410.23107	link
2024-10-29	Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications	Monica Riedler et.al.	2410.21943	link
2024-10-28	NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments	Taiyi Pan et.al.	2410.21615	link
2024-10-25	Context-Based Visual-Language Place Recognition	Soojin Woo et.al.	2410.19341	link
2024-10-24	ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval	Zijia Zhao et.al.	2410.18715	link
2024-10-25	On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features	Tomáš Pivoňka et.al.	2410.18573	null
2024-10-22	Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval	Yuanmin Tang et.al.	2410.17393	null
2024-10-20	GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning	Haiwen Diao et.al.	2410.15266	link
2024-10-19	Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book Collection	Marie Roald et.al.	2410.14969	link
2024-10-16	Development of Image Collection Method Using YOLO and Siamese Network	Chan Young Shin et.al.	2410.12561	null
2024-10-16	LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment	Juelin Zhu et.al.	2410.12269	link
2024-10-16	Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization	Nanda Febri Istighfarin et.al.	2410.12240	null
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505	null
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187	link
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-10-11	Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System	Zheng Liu et.al.	2410.08935	link
2024-10-16	Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP	Eunji Kim et.al.	2410.08469	null
2024-10-11	A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification	Eugene P. W. Ang et.al.	2410.08456	null
2024-10-10	A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks	Hoin Jung et.al.	2410.07593	link
2024-10-09	Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval	Mohammad Omama et.al.	2410.07022	null
2024-10-09	Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers	Stephen Hausler et.al.	2410.06614	link
2024-10-09	MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging	Noel C. F. Codella et.al.	2410.06542	null
2024-10-08	Temporal Image Caption Retrieval Competition -- Description and Results	Jakub Pokrywka et.al.	2410.06314	null
2024-10-08	Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching	Gongxin Yao et.al.	2410.06285	null
2024-10-08	GSLoc: Visual Localization with 3D Gaussian Splatting	Kazii Botashev et.al.	2410.06165	null
2024-10-08	Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning	Ayush Singh et.al.	2410.05928	null
2024-10-08	RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps	Minsoo Kim et.al.	2410.05621	null
2024-10-11	LoTLIP: Improving Language-Image Pre-training for Long Text Understanding	Wei Wu et.al.	2410.05249	null
2024-10-06	LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation	Jianhao Jiao et.al.	2410.04419	null
2024-10-02	Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension	Zaiquan Yang et.al.	2410.01544	null
2024-10-03	EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections	Francesc Net et.al.	2410.01536	link
2024-10-04	CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment	Safouane El Ghazouali et.al.	2410.01411	link
2024-09-30	Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation	Aleyna Kütük et.al.	2410.00266	null
2024-09-29	CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation	Yifan Duan et.al.	2409.19597	null
2024-09-28	VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition	Ahmad Khaliq et.al.	2409.19293	link
2024-09-27	MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion	Bardienus Duisterhof et.al.	2409.19152	null
2024-09-26	Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval	Mankeerat Sidhu et.al.	2409.18733	null
2024-09-26	Revisit Anything: Visual Place Recognition via Image Segment Retrieval	Kartik Garg et.al.	2409.18049	link
2024-09-24	GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization	Gennady Sidorov et.al.	2409.16502	link
2024-09-23	CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis	Xiang Zhang et.al.	2409.15169	null
2024-09-21	Combining Absolute and Semi-Generalized Relative Poses for Visual Localization	Vojtech Panek et.al.	2409.14269	null
2024-09-21	SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality	Hongjia Zhai et.al.	2409.14067	null
2024-09-20	Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval	Morris Florek et.al.	2409.13513	link
2024-09-18	Towards Global Localization using Multi-Modal Object-Instance Re-Identification	Aneesh Chavan et.al.	2409.12002	link
2024-09-17	Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching	Kurran Singh et.al.	2409.11555	null
2024-09-17	Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information	Kunal Chelani et.al.	2409.11536	null
2024-09-17	Improving the Efficiency of Visually Augmented Language Models	Paula Ontalvilla et.al.	2409.11148	link
2024-09-21	HGSLoc: 3DGS-based Heuristic Camera Pose Refinement	Zhongyan Niu et.al.	2409.10925	null
2024-09-16	SOLVR: Submap Oriented LiDAR-Visual Re-Localisation	Joshua Knights et.al.	2409.10247	null
2024-09-16	Garment Attribute Manipulation with Multi-level Attention	Vittorio Casula et.al.	2409.10206	null
2024-09-14	Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval	Amirreza Mahbod et.al.	2409.09430	link
2024-09-12	Structured Pruning for Efficient Visual Place Recognition	Oliver Grainge et.al.	2409.07834	null
2024-09-10	GeoCalib: Learning Single-image Calibration with Geometric Optimization	Alexander Veicht et.al.	2409.06704	link
2024-09-10	Weakly-supervised Camera Localization by Ground-to-satellite Image Registration	Yujiao Shi et.al.	2409.06471	link
2024-09-10	A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions	Zhicong Wu et.al.	2409.06381	null
2024-09-09	Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding	Bram Willemsen et.al.	2409.05721	link
2024-09-09	Open-World Dynamic Prompt and Continual Visual Representation Learning	Youngeun Kim et.al.	2409.05312	null
2024-09-12	Training-free ZS-CIR via Weighted Modality Fusion and Similarity	Ren-Di Wu et.al.	2409.04918	link
2024-09-12	Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models	Saghir Alfasly et.al.	2409.04631	null
2024-09-06	Reprojection Errors as Prompts for Efficient Scene Coordinate Regression	Ting-Ru Liu et.al.	2409.04178	null
2024-09-06	Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments	Therese Joseph et.al.	2409.03998	null
2024-09-04	Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications	Abby Stylianou et.al.	2409.03012	null
2024-09-04	NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval	Sepanta Zeighami et.al.	2409.02343	link
2024-09-03	Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment	Konstantin Schall et.al.	2409.01936	link
2024-09-02	A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches	Kim Jinwoo et.al.	2409.01219	null
2024-09-02	Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection	Manon Kok et.al.	2409.01091	null
2024-09-02	Evidential Transformers for Improved Image Retrieval	Danilo Dordevic et.al.	2409.01082	null
2024-09-05	EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System	Bonan Liu et.al.	2409.00343	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373	null
2024-09-02	RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance	Avideep Mukherjee et.al.	2408.17095	null
2024-08-29	A compact neuromorphic system for ultra energy-efficient, on-device robot localization	Adam D. Hines et.al.	2408.16754	link
2024-08-29	Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models	Kengo Nakata et.al.	2408.16296	null
2024-08-28	Temporal Attention for Cross-View Sequential Image Localization	Dong Yuan et.al.	2408.15569	link
2024-08-27	Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild	Tianqi Wei et.al.	2408.14723	null
2024-08-25	LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task	Ali Asgarov et.al.	2408.13909	link
2024-08-15	Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval	Lifeng Zhou et.al.	2408.13705	null
2024-08-15	Coarse-to-fine Alignment Makes Better Speech-image Retrieval	Lifeng Zhou et.al.	2408.13119	null
2024-08-21	FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization	Son Tung Nguyen et.al.	2408.12037	link
2024-08-21	Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations	Lintong Zhang et.al.	2408.11966	null
2024-08-21	UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation	Xiangyu Zhao et.al.	2408.11305	link
2024-08-20	GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting	Changkun Liu et.al.	2408.11085	link
2024-08-19	BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval	Zhenyu Lu et.al.	2408.10383	null
2024-08-23	Fashion Image-to-Image Translation for Complementary Item Retrieval	Matteo Attimonelli et.al.	2408.09847	link
2024-08-20	MambaLoc: Efficient Camera Localisation via State Space Model	Jialu Wang et.al.	2408.09680	null
2024-08-15	DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions	Ryosuke Korekata et.al.	2408.07910	null
2024-08-13	A Miniature Vision-Based Localization System for Indoor Blimps	Shicong Ma et.al.	2408.06648	null
2024-08-10	Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network	Junyan Ye et.al.	2408.05475	link
2024-08-09	Spherical World-Locking for Audio-Visual Localization in Egocentric Videos	Heeseung Yun et.al.	2408.05364	null
2024-08-06	AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval	Pavel Suma et.al.	2408.03282	link
2024-08-05	CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration	Gongxin Yao et.al.	2408.02394	null
2024-08-09	BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles	Lun Luo et.al.	2408.01841	link
2024-08-02	On Validation of Search & Retrieval of Tissue Images in Digital Pathology	H. R. Tizhoosh et.al.	2408.01570	null
2024-07-31	VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning	Yuhang Ming et.al.	2407.21416	null
2024-07-31	SuperVINS: A visual-inertial SLAM framework integrated deep learning features	Hongkun Luo et.al.	2407.21348	link
2024-07-30	Re-localization acceleration with Medoid Silhouette Clustering	Hongyi Zhang et.al.	2407.20749	null
2024-07-29	A flexible framework for accurate LiDAR odometry, map manipulation, and localization	José Luis Blanco-Claraco et.al.	2407.20465	link
2024-07-26	From 2D to 3D: AISG-SLA Visual Localization Challenge	Jialin Gao et.al.	2407.18590	null
2024-07-24	Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation	Yongqi Li et.al.	2407.17274	null
2024-07-24	Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments	Wei Gao et.al.	2407.17078	null
2024-07-24	Pose Estimation from Camera Images for Underwater Inspection	Luyuan Peng et.al.	2407.16961	null
2024-07-22	Memory Management for Real-Time Appearance-Based Loop Closure Detection	Mathieu Labbé et.al.	2407.15890	null
2024-07-22	RADA: Robust and Accurate Feature Learning with Domain Adaptation	Jingtai He et.al.	2407.15791	null
2024-07-22	Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM	Mathieu Labbe et.al.	2407.15305	null
2024-07-22	Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation	Mathieu Labbé et.al.	2407.15304	null
2024-07-19	Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization	Yuehua Ding et.al.	2407.14643	null
2024-07-18	Visual Haystacks: Answering Harder Questions About Sets of Images	Tsung-Han Wu et.al.	2407.13766	link
2024-07-17	Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM	Markus Weißflog et.al.	2407.12408	null
2024-07-17	GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection	Jingwen Yu et.al.	2407.11736	link
2024-07-16	EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis	Ruijie Yang et.al.	2407.11401	null
2024-07-15	No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations	Walter Simoncini et.al.	2407.10964	link
2024-07-15	DINO Pre-training for Vision-based End-to-end Autonomous Driving	Shubham Juneja et.al.	2407.10803	null
2024-07-15	Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval	Youngsun Lim et.al.	2407.10683	null
2024-07-15	An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots	J. J. Cabrera et.al.	2407.10596	link
2024-07-15	An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments	J. J. Cabrera et.al.	2407.10536	null
2024-07-12	Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval	Vaibhav Balloli et.al.	2407.08908	link
2024-07-11	Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates	Owen Claxton et.al.	2407.08162	link
2024-07-12	Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal	Xinyu Zhu et.al.	2407.08153	link
2024-07-11	SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM	Neng Wang et.al.	2407.08106	link
2024-07-09	LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition	Teng Wang et.al.	2407.06730	null
2024-07-09	CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding	Wenhao Xu et.al.	2407.06611	null
2024-07-08	Pseudo-triplet Guided Few-shot Composed Image Retrieval	Bohan Hou et.al.	2407.06001	null
2024-07-09	HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels	Yingying Jiang et.al.	2407.05795	null
2024-07-05	Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning	Mainak Singha et.al.	2407.04207	link
2024-07-04	Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models	Chang-Sheng Kao et.al.	2407.03615	link
2024-07-03	Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach	Pronay Debnath et.al.	2407.03486	null
2024-07-02	Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition	Sergio Izquierdo et.al.	2407.02422	link
2024-07-01	Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval	Aneeshan Sain et.al.	2407.01810	null
2024-07-01	Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval	Hanwen Su et.al.	2407.00979	null
2024-07-01	Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios	Connor Malone et.al.	2407.00863	null
2024-06-27	PathAlign: A vision-language model for whole slide images in histopathology	Faruk Ahmed et.al.	2406.19578	null
2024-07-05	360 in the Wild: Dataset for Depth Prediction and View Synthesis	Kibaek Park et.al.	2406.18898	null
2024-06-27	Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs	Huaying Zhang et.al.	2406.18836	null
2024-06-26	WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images	Yannik Glaser et.al.	2406.18765	null
2024-06-26	View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis	Subin Varghese et.al.	2406.18012	null
2024-06-25	Tell Me Where You Are: Multimodal LLMs Meet Place Recognition	Zonglin Lyu et.al.	2406.17520	null
2024-06-25	SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation	Xu Liu et.al.	2406.17249	link
2024-06-23	Breaking the Frame: Image Retrieval by Visual Overlap Prediction	Tong Wei et.al.	2406.16204	link
2024-06-19	Towards a multimodal framework for remote sensing image change retrieval and captioning	Roger Ferrod et.al.	2406.13424	link
2024-06-19	CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval	Christian Lülf et.al.	2406.13322	link
2024-06-17	Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization	Huaiji Zhou et.al.	2406.11766	null
2024-06-22	Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment	Jianan Jiang et.al.	2406.11551	link
2024-06-17	They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias	Salma Abdel Magid et.al.	2406.11331	null
2024-06-17	Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion	Guoyuan An et.al.	2406.11242	null
2024-06-14	Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval	Genc Hoxha et.al.	2406.10107	null
2024-06-14	BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval	Imanol Miranda et.al.	2406.09952	link
2024-06-13	Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases	Meng Wang et.al.	2406.09317	link
2024-06-13	Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval	Jaeseok Byun et.al.	2406.09188	null
2024-06-13	DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification	Zhengrui Xu et.al.	2406.08773	link
2024-06-12	Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement	Maxime Pietrantoni et.al.	2406.08463	null
2024-06-12	ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery	Kam Woh Ng et.al.	2406.08457	link
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502	link
2024-06-11	Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Shuvendu Roy et.al.	2406.07450	link
2024-06-11	Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval	Adrià Molina et.al.	2406.07315	null
2024-06-10	Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation	Shenghao Li et.al.	2406.06374	link
2024-06-09	Unified Text-to-Image Generation and Retrieval	Leigang Qu et.al.	2406.05814	null
2024-06-07	The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better	Scott Geng et.al.	2406.05184	link
2024-06-07	PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction	Eduard Poesina et.al.	2406.04746	link
2024-06-06	GLACE: Global Local Accelerated Coordinate Encoding	Fangjinhua Wang et.al.	2406.04340	link
2024-06-06	Monocular Localization with Semantics Map for Autonomous Vehicles	Jixiang Wan et.al.	2406.03835	null
2024-06-05	Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach	Saehyung Lee et.al.	2406.03411	link
2024-06-04	MeshVPR: Citywide Visual Place Recognition Using 3D Meshes	Gabriele Berton et.al.	2406.02776	null
2024-06-04	Can CLIP help CLIP in learning 3D?	Cristian Sbrolli et.al.	2406.02202	null
2024-06-03	Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP	Sriram Balasubramanian et.al.	2406.01583	link
2024-06-03	Scale-Free Image Keypoints Using Differentiable Persistent Homology	Giovanni Barbarani et.al.	2406.01315	link
2024-06-02	Visual place recognition for aerial imagery: A survey	Ivan Moskalenko et.al.	2406.00885	link
2024-06-01	NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization	Wugang Meng et.al.	2406.00312	null
2024-05-31	DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models	Linli Yao et.al.	2405.20985	link
2024-05-29	Multi-Modal Generative Embedding Model	Feipeng Ma et.al.	2405.19333	null
2024-05-29	ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions	Honglin Lin et.al.	2405.19226	null
2024-05-30	CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval	Xintong Jiang et.al.	2405.19149	link
2024-05-29	SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation	Zhenbei Wu et.al.	2405.18801	null
2024-05-29	Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs	Jialiang Xu et.al.	2405.18740	link
2024-05-28	EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition	Issar Tzachor et.al.	2405.18065	null
2024-05-28	AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval	Sihe Zhang et.al.	2405.17718	null
2024-05-26	MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups	Yusen Xie et.al.	2405.16599	null
2024-05-29	Composed Image Retrieval for Remote Sensing	Bill Psomas et.al.	2405.15587	link
2024-05-24	Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval	Yiming Wu et.al.	2405.15451	null
2024-05-20	UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization	Wenjia Xu et.al.	2405.11936	link
2024-05-19	Register assisted aggregation for Visual Place Recognition	Xuan Yu et.al.	2405.11526	null
2024-05-26	CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion	Gang Wang et.al.	2405.10793	null
2024-05-16	FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models	Adrian Bulat et.al.	2405.10286	null
2024-05-15	Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study	Farnaz Khun Jush et.al.	2405.09334	null
2024-05-14	BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment	Lihong Jin et.al.	2405.09001	null
2024-05-14	TP3M: Transformer-based Pseudo 3D Image Matching with Reference	Liming Han et.al.	2405.08434	null
2024-05-13	OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition	Qiuchi Xiang et.al.	2405.07966	link
2024-05-14	HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval	Chao He et.al.	2405.07524	link
2024-05-13	JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation	Xubo Luo et.al.	2405.07429	link
2024-05-12	BoQ: A Place is Worth a Bag of Learnable Queries	Amar Ali-bey et.al.	2405.07364	link
2024-05-07	Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction	Nematollah Saeidi et.al.	2405.04211	null
2024-05-06	A New Robust Partial $p$ -Wasserstein-Based Metric for Comparing Distributions	Sharath Raghvendra et.al.	2405.03664	null
2024-05-06	Knowledge-aware Text-Image Retrieval for Remote Sensing Images	Li Mi et.al.	2405.03373	null
2024-05-06	Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval	Jiacheng Cheng et.al.	2405.03190	null
2024-05-05	iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval	Lorenzo Agnolucci et.al.	2405.02951	link
2024-05-01	Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval	Young Kyun Jang et.al.	2405.00571	null
2024-04-30	Large Language Model Informed Patent Image Retrieval	Hao-Cheng Lo et.al.	2404.19360	null
2024-04-30	XFeat: Accelerated Features for Lightweight Image Matching	Guilherme Potje et.al.	2404.19174	null
2024-04-29	Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models	Hongyi Zhu et.al.	2404.18746	null
2024-04-29	Dual-Modal Prompting for Sketch-Based Image Retrieval	Liying Gao et.al.	2404.18695	null
2024-05-01	Semantic Line Combination Detector	Jinwon Ko et.al.	2404.18399	link
2024-04-26	Learning text-to-video retrieval from image captioning	Lucas Ventura et.al.	2404.17498	null
2024-04-25	CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching	Samia Shafique et.al.	2404.16972	link
2024-04-29	Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval	Ryoya Nara et.al.	2404.16398	null
2024-04-24	Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval	Haokun Wen et.al.	2404.15875	link
2024-04-24	DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines	Xin Jiang et.al.	2404.15771	null
2024-04-23	Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval	Young Kyun Jang et.al.	2404.15516	null
2024-04-22	EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models	Mathias Thorsager et.al.	2404.14236	null
2024-04-22	Hierarchical localization with panoramic views and triplet loss functions	Marcos Alfaro et.al.	2404.14117	link
2024-04-20	High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces	Baoru Huang et.al.	2404.13437	null
2024-04-20	Collaborative Visual Place Recognition through Federated Learning	Mattia Dutto et.al.	2404.13324	null
2024-04-18	SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints	Spencer Carmichael et.al.	2404.12339	null
2024-04-17	Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives	Zhangchi Feng et.al.	2404.11317	link
2024-04-17	Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing	Sanggeon Yun et.al.	2404.11025	null
2024-04-16	SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments	Niklas Gard et.al.	2404.10527	link
2024-04-20	CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning	Haojian Huang et.al.	2404.09640	link
2024-04-11	PRAM: Place Recognition Anywhere Model for Efficient Visual Localization	Fei Xue et.al.	2404.07785	null
2024-04-16	2DLIW-SLAM:2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure	Bin Zhang et.al.	2404.07644	link
2024-04-11	Semantically-correlated memories in a dense associative model	Thomas F Burns et.al.	2404.07123	link
2024-04-09	Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation	Luca Barsellotti et.al.	2404.06542	null
2024-04-09	Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping	Anas Gouda et.al.	2404.06277	link
2024-04-07	Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval	Jinpeng Wang et.al.	2404.04998	link
2024-04-06	Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning	Juncheng Yang et.al.	2404.04538	link
2024-04-05	Towards introspective loop closure in 4D radar SLAM	Maximilian Hilger et.al.	2404.03940	null
2024-04-02	TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation	Yehui Shen et.al.	2404.01587	link
2024-04-01	On Train-Test Class Overlap and Detection for Image Retrieval	Chull Hwan Song et.al.	2404.01524	link
2024-04-01	NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification	Juyeop Han et.al.	2404.01400	null
2024-03-31	On the Estimation of Image-matching Uncertainty in Visual Place Recognition	Mubariz Zaffar et.al.	2404.00546	null
2024-03-31	NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation	Diwei Sheng et.al.	2404.00504	null
2024-03-30	SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs	Yang Miao et.al.	2404.00469	null
2024-03-30	Do Vision-Language Models Understand Compound Nouns?	Sonal Kumar et.al.	2404.00419	link
2024-04-05	FairRAG: Fair Human Generation via Fair Retrieval Augmentation	Robik Shrestha et.al.	2403.19964	null
2024-03-28	JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition	Gabriele Berton et.al.	2403.19787	link
2024-03-28	MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions	Kai Zhang et.al.	2403.19651	link
2024-03-27	AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation	Changkun Liu et.al.	2403.18281	null
2024-03-26	Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge	Dongjin Kim et.al.	2403.17420	link
2024-03-25	Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras	Gokul B. Nair et.al.	2403.16425	link
2024-03-24	Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval	Yucheng Suo et.al.	2403.16005	link
2024-03-24	BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval	Yinda Chen et.al.	2403.15992	null
2024-03-22	Long-CLIP: Unlocking the Long-Text Capability of CLIP	Beichen Zhang et.al.	2403.15378	link
2024-03-22	A Multimodal Approach for Cross-Domain Image Retrieval	Lucas Iijima et.al.	2403.15152	null
2024-03-22	Piecewise-Linear Manifolds for Deep Metric Learning	Shubhang Bhatnagar et.al.	2403.14977	null
2024-03-21	Enhancing Historical Image Retrieval with Compositional Cues	Tingyu Lin et.al.	2403.14287	link
2024-03-20	Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval	Aymene Berriche et.al.	2403.13747	null
2024-03-20	Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval	Haoyu Liu et.al.	2403.13317	null
2024-03-19	Learning Neural Volumetric Pose Features for Camera Localization	Jingyu Lin et.al.	2403.12800	null
2024-03-19	Quantixar: High-performance Vector Data Management System	Gulshan Yadav et.al.	2403.12583	null
2024-03-17	3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization	Peng Jiang et.al.	2403.11367	null
2024-03-17	MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data	Paul S. Scotti et.al.	2403.11207	link
2024-03-16	Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval	Shunsuke Tsubaki et.al.	2403.10756	null
2024-03-16	Vector search with small radiuses	Gergely Szilvasy et.al.	2403.10746	null
2024-03-13	Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer	Kenta Tsukahara et.al.	2403.10552	null
2024-03-20	Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression	Huy-Hoang Bui et.al.	2403.10297	link
2024-03-15	Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline	Fangming Yuan et.al.	2403.10283	null
2024-03-14	The NeRFect Match: Exploring NeRF Features for Visual Localization	Qunjie Zhou et.al.	2403.09577	null
2024-03-14	VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition	Benjamin Ramtoula et.al.	2403.09025	null
2024-03-13	PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models	Siddharth Mishra-Sharma et.al.	2403.08851	link
2024-03-13	NeRF-Supervised Feature Point Detection and Description	Ali Youssef et.al.	2403.08156	link
2024-03-12	It's All About Your Sketch: Democratising Sketch Control in Diffusion Models	Subhadeep Koley et.al.	2403.07234	link
2024-03-12	You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval	Subhadeep Koley et.al.	2403.07222	null
2024-03-12	Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers	Subhadeep Koley et.al.	2403.07214	null
2024-03-11	How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?	Subhadeep Koley et.al.	2403.07203	null
2024-03-11	EarthLoc: Astronaut Photography Localization by Indexing Earth from Space	Gabriele Berton et.al.	2403.06758	link
2024-03-11	BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues	Fudong Ge et.al.	2403.06600	link
2024-03-11	Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology	Stefan Denner et.al.	2403.06567	link
2024-03-10	RTAB-Map as an Open-Source Lidar and Visual SLAM Library for Large-Scale and Long-Term Online Operation	Mathieu Labbé et.al.	2403.06341	null
2024-03-10	Texture image retrieval using a classification and contourlet-based features	Asal Rouhafzay et.al.	2403.06048	null
2024-03-11	LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map	Xinrui Wu et.al.	2403.05002	link
2024-03-11	Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed	Yifan Wang et.al.	2403.04765	null
2024-03-07	mmPlace: Robust Place Recognition with Intermediate Frequency Signal of Low-cost Single-chip Millimeter Wave Radar	Chengzhen Meng et.al.	2403.04703	null
2024-03-06	Self-supervised Photographic Image Layout Representation Learning	Zhaoran Zhao et.al.	2403.03740	link
2024-03-04	Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models	Benedikt Blumenstiel et.al.	2403.02059	link
2024-03-03	Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval	Yongchao Du et.al.	2403.01431	null
2024-03-01	Asymmetric Feature Fusion for Image Retrieval	Hui Wu et.al.	2403.00671	null
2024-03-01	Structure Similarity Preservation Learning for Asymmetric Image Retrieval	Hui Wu et.al.	2403.00648	link
2024-02-29	CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition	Feng Lu et.al.	2402.19231	link
2024-02-28	Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport	Bin Li et.al.	2402.18411	link
2024-02-28	Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning	Hanyao Wang et.al.	2402.18400	null
2024-02-28	Representing 3D sparse map points and lines for camera relocalization	Bach-Thuan Bui et.al.	2402.18011	link
2024-02-27	Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control	Thong Nguyen et.al.	2402.17535	link
2024-02-29	Active propulsion noise shaping for multi-rotor aircraft localization	Gabriele Serussi et.al.	2402.17289	link
2024-02-27	NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer	Bingxi Liu et.al.	2402.17159	link
2024-02-25	Deep Homography Estimation for Visual Place Recognition	Feng Lu et.al.	2402.16086	link
2024-02-25	VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Xudong Cai et.al.	2402.15961	link
2024-02-28	Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries	Zijun Long et.al.	2402.15276	null
2024-02-23	Fine-tuning CLIP Text Encoders with Two-step Paraphrasing	Hyunjae Kim et.al.	2402.15120	null
2024-02-22	Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition	Feng Lu et.al.	2402.14505	link
2024-02-16	Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition	Chenming Hu et.al.	2402.10476	null
2024-02-15	Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task	Mirko Nava et.al.	2402.09886	link
2024-02-14	Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency	Yannis Kalantidis et.al.	2402.09237	null
2024-02-13	Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast	Xiangming Gu et.al.	2402.08567	link
2024-02-13	Learning to Produce Semi-dense Correspondences for Visual Localization	Khang Truong Giang et.al.	2402.08359	link
2024-02-10	Semantic Object-level Modeling for Robust Visual Camera Relocalization	Yifan Zhu et.al.	2402.06951	null
2024-02-09	Large Language Models for Captioning and Retrieving Remote Sensing Images	João Daniel Silva et.al.	2402.06475	null
2024-02-09	PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes	Xinggang Hu et.al.	2402.06131	null
2024-02-21	MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction	Heng Zhou et.al.	2402.03762	null
2024-02-04	Region-Based Representations Revisited	Michal Shlapentokh-Rothman et.al.	2402.02352	link
2024-02-03	Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization	Bo Yang et.al.	2402.02141	link
2024-02-01	BrainSLAM: SLAM on Neural Population Activity Data	Kipp Freud et.al.	2402.00588	null
2024-02-01	Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering	Tianxiao Gao et.al.	2402.00330	link
2024-01-31	Improved Scene Landmark Detection for Camera Localization	Tien Do et.al.	2401.18083	link
2024-01-31	Local Feature Matching Using Deep Learning: A Survey	Shibiao Xu et.al.	2401.17592	link
2024-01-29	Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors	Shiyin Dong et.al.	2401.16459	null
2024-01-29	Cross-Modal Coordination Across a Diverse Set of Input Modalities	Jorge Sánchez et.al.	2401.16347	null
2024-01-29	Regressing Transformers for Data-efficient Visual Place Recognition	María Leyva-Vallina et.al.	2401.16304	null
2024-01-27	Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval	Ayush Dubey et.al.	2401.15362	null
2024-01-24	Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode	Naresh Kumar Lahajal et.al.	2401.13613	null
2024-01-23	PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion	Shyam Sundar Kannan et.al.	2401.13082	null
2024-01-23	SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization	Mingyang Li et.al.	2401.13076	link
2024-01-25	CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios	Xiangshuo Qiao et.al.	2401.10475	link
2024-01-19	PhotoScout: Synthesis-Powered Multi-Modal Image Search	Celeste Barnaby et.al.	2401.10464	null
2024-01-19	Cross-Modality Perturbation Synergy Attack for Person Re-identification	Yunpeng Gong et.al.	2401.10090	null
2024-01-16	Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging	Zahra Tabatabaei et.al.	2401.08272	null
2024-01-16	Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments	Bruno Arcanjo et.al.	2401.08263	null
2024-01-15	Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing	Jakob Hackstein et.al.	2401.07782	link
2024-01-14	HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval	Zexuan Qiu et.al.	2401.07212	link
2024-01-11	UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization	Rouwan Wu et.al.	2401.05971	link
2024-01-10	Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval	Eunyi Lyou et.al.	2401.04860	link
2024-01-05	Benchmarking PathCLIP for Pathology Image Analysis	Sunyi Zheng et.al.	2401.02651	null
2024-01-03	DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM with Joint Semantic Encoding	Mingrui Li et.al.	2401.01545	null
2024-01-02	BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving	Dafeng Wei et.al.	2401.01065	null
2023-12-31	Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval	Liang Wang et.al.	2401.00371	link
2023-12-29	Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering	Long-Kun Du et.al.	2401.00032	null
2023-12-27	LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization	Sai Shubodh Puligilla et.al.	2312.16648	null
2023-12-26	Recursive Distillation for Open-Set Distributed Robot Localization	Kenta Tsukahara et.al.	2312.15897	null
2023-12-24	Residual Learning for Image Point Descriptors	Rashik Shrestha et.al.	2312.15471	null
2023-12-23	CaLDiff: Camera Localization in NeRF via Pose Diffusion	Rashik Shrestha et.al.	2312.15242	null
2023-12-20	Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition	Bruno Arcanjo et.al.	2312.12995	null
2023-12-19	VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering	Chun-Mei Feng et.al.	2312.12273	link
2023-12-18	Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback	Boaz Lerner et.al.	2312.11078	link
2023-12-17	PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields	Boming Zhao et.al.	2312.10649	null
2023-12-17	DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition	Sijie Wang et.al.	2312.10616	link
2023-12-16	Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval	Decheng Liu et.al.	2312.10320	link
2023-12-15	Data-Efficient Multimodal Fusion on a Single GPU	Noël Vouitsis et.al.	2312.10144	link
2023-12-13	Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques	Hamed Qazanfari et.al.	2312.10089	null
2023-12-15	Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval	Zhe Ma et.al.	2312.09716	link
2023-12-14	Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition	Oliver Grainge et.al.	2312.09028	null
2023-12-14	Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking	Shitong Sun et.al.	2312.08924	null
2023-12-13	C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation	Florian Fervers et.al.	2312.08060	null
2023-12-12	Contextually Affinitive Neighborhood Refinery for Deep Clustering	Chunlin Yu et.al.	2312.07806	link
2023-12-12	Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval	Qiwei Tian et.al.	2312.07364	link
2023-12-12	Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection	Jonathan J. Y. Kim et.al.	2312.06991	null
2023-12-11	Dynamic Weighted Combiner for Mixed-Modal Image Retrieval	Fuxiang Huang et.al.	2312.06179	link
2023-12-06	Lite-Mind: Towards Efficient and Versatile Brain Representation Network	Zixuan Gong et.al.	2312.03781	link
2023-12-08	FreestyleRet: Retrieving Images from Style-Diversified Queries	Hao Li et.al.	2312.02428	link
2023-12-04	Implicit Learning of Scene Geometry from Poses for Global Localization	Mohammad Altillawi et.al.	2312.02029	null
2023-12-04	Language-only Efficient Training of Zero-shot Composed Image Retrieval	Geonmo Gu et.al.	2312.01998	link
2023-12-03	G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training	Che Liu et.al.	2312.01522	link
2023-12-01	Improve Supervised Representation Learning with Masked Image Modeling	Kaifeng Chen et.al.	2312.00950	null
2023-12-05	Grounding Everything: Emerging Localization Properties in Vision-Language Transformers	Walid Bousselham et.al.	2312.00878	link
2023-12-01	Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras	Mohammad Altillawi et.al.	2312.00500	null
2023-11-30	HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance	Zhuohao Yin et.al.	2311.18273	link
2023-11-30	Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models	Raviteja Vemulapalli et.al.	2311.18237	link
2023-11-29	Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce	Chang Liu et.al.	2311.17954	null
2023-11-28	Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames	Chao Chen et.al.	2311.17940	null
2023-11-29	360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries	Huajian Huang et.al.	2311.17389	link
2023-11-27	Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation	Samuele Poppi et.al.	2311.16254	link
2023-11-27	Optimal Transport Aggregation for Visual Place Recognition	Sergio Izquierdo et.al.	2311.15937	link
2023-11-27	AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval	Shicheng Xu et.al.	2311.14084	link
2023-11-23	3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology	Asma Ben Abacha et.al.	2311.13752	link
2023-11-22	Medical Image Retrieval Using Pretrained Embeddings	Farnaz Khun Jush et.al.	2311.13547	null
2023-11-22	Applications of Spiking Neural Networks in Visual Place Recognition	Somayeh Hussaini et.al.	2311.13186	link
2023-11-21	Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval	Xiu-Shen Wei et.al.	2311.12894	null
2023-11-21	Towards Accurate Loop Closure Detection in Semantic SLAM with 3D Semantic Covisibility Graphs	Zhentian Qian et.al.	2311.12245	null
2023-11-19	From Categories to Classifier: Name-Only Continual Learning by Exploring the Web	Ameya Prabhu et.al.	2311.11293	null
2023-11-18	Lesion Search with Self-supervised Learning	Kristin Qi et.al.	2311.11014	null
2023-11-15	Flow reconstruction and particle characterization from inertial Lagrangian tracks	Ke Zhou et.al.	2311.09076	null
2023-11-15	Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval	Junyang Chen et.al.	2311.07622	null
2023-11-13	VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search	Shuting He et.al.	2311.07514	null
2023-11-10	Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval	Xin Lu et.al.	2311.06067	null
2023-11-08	Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model	Junya Shiraishi et.al.	2311.04788	null
2023-11-08	Training CLIP models on Data from Scientific Papers	Calvin Metzger et.al.	2311.04711	link
2023-11-07	DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding	Kehinde Ajayi et.al.	2311.04098	link
2023-11-06	Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences	Zador Pataki et.al.	2311.03345	null
2023-11-06	FocusTune: Tuning Visual Localization through Focus-Guided Sampling	Son Tung Nguyen et.al.	2311.02872	link
2023-11-01	DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing	Gaoshuang Huang et.al.	2311.00230	link
2023-10-29	Identifiable Contrastive Learning with Automatic Feature Importance Discovery	Qi Zhang et.al.	2310.18904	link
2023-10-27	LipSim: A Provably Robust Perceptual Similarity Metric	Sara Ghazanfari et.al.	2310.18274	link
2023-10-27	Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation	Susu Fang et.al.	2310.17879	null
2023-10-25	FoundLoc: Vision-based Onboard Aerial Localization in the Wild	Yao He et.al.	2310.16299	null
2023-10-24	Cross-view Self-localization from Synthesized Scene-graphs	Ryogo Yamamoto et.al.	2310.15504	null
2023-10-23	Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval	Xu Yuan et.al.	2310.14637	link
2023-10-21	Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation	Anastasia Kritharoula et.al.	2310.14025	link
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605	null
2023-10-20	CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants	Shaoan Wang et.al.	2310.13320	link
2023-10-27	Representation Learning via Consistent Assignment of Views over Random Partitions	Thalles Silva et.al.	2310.12692	link
2023-10-18	Evaluating the Fairness of Discriminative Foundation Models in Computer Vision	Junaid Ali et.al.	2310.11867	null
2023-10-17	Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification	Shuanglin Yan et.al.	2310.11210	null
2023-10-16	Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People	Dharmateja Adapa et.al.	2310.10290	null
2023-10-16	EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge	Tom Bryan et.al.	2310.10050	null
2023-10-15	CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes	Yulei Qin et.al.	2310.09761	link
2023-10-13	Pairwise Similarity Learning is SimPLE	Yandong Wen et.al.	2310.09449	link
2023-10-13	Vision-by-Language for Training-Free Compositional Image Retrieval	Shyamgopal Karthik et.al.	2310.09291	link
2023-10-12	Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning	Shiyang Yan et.al.	2310.08390	null
2023-10-12	Jointly Optimized Global-Local Visual Localization of UAVs	Haoling Li et.al.	2310.08082	null
2023-10-10	Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization	Le Chen et.al.	2310.06984	null
2023-10-10	Distillation Improves Visual Place Recognition for Low-Quality Queries	Anbang Yang et.al.	2310.06906	link
2023-10-10	Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets	Jiajun Zhang et.al.	2310.06566	null
2023-10-10	Topological RANSAC for instance verification and retrieval without fine-tuning	Guoyuan An et.al.	2310.06486	null
2023-10-10	3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments	Ghanta Sai Krishna et.al.	2310.06385	null
2023-10-09	Collaborative Visual Place Recognition	Yiming Li et.al.	2310.05541	null
2023-10-09	Sentence-level Prompts Benefit Composed Image Retrieval	Yang Bai et.al.	2310.05473	link
2023-10-08	AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition	Feng Lu et.al.	2310.05184	link
2023-10-08	LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization	Artem Nenashev et.al.	2310.05134	null
2023-10-12	ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer	Yifan Xu et.al.	2310.04099	null
2023-10-06	Sub-token ViT Embedding via Stochastic Resonance Transformers	Dong Lao et.al.	2310.03967	link
2023-10-04	Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach	Matthew Hanlon et.al.	2310.02650	null
2023-10-02	NEUCORE: Neural Concept Reasoning for Composed Image Retrieval	Shu Zhao et.al.	2310.01358	null
2023-10-02	Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images	Georg Bökman et.al.	2310.01092	null
2023-10-05	PlaceNav: Topological Navigation through Place Recognition	Lauri Suomela et.al.	2309.17260	null
2023-09-29	Segment Anything Model is a Good Teacher for Local Feature Learning	Jingqian Wu et.al.	2309.16992	link
2023-09-28	Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning	Albert Mohwald et.al.	2309.16351	link
2023-09-28	FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding	Pengxiang Wu et.al.	2309.16249	link
2023-09-28	Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval	Yuanmin Tang et.al.	2309.16137	link
2023-09-27	GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization	Vicente Vivanco Cepeda et.al.	2309.16020	link
2023-09-27	Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization	Zhenbo Song et.al.	2309.15556	null
2023-09-26	Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features	Hila Levi et.al.	2309.14999	null
2023-09-23	Resolving References in Visually-Grounded Dialogue via Text Generation	Bram Willemsen et.al.	2309.13430	link
2023-09-21	Face Identity-Aware Disentanglement in StyleGAN	Adrian Suwała et.al.	2309.12033	null
2023-09-21	On-the-Fly SfM: What you capture is What you get	Zongqian Zhan et.al.	2309.11883	link
2023-09-20	2D-3D Pose Tracking with Multi-View Constraints	Huai Yu et.al.	2309.11335	null
2023-09-19	VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition	Adam D. Hines et.al.	2309.10225	link
2023-09-18	DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach	Chenghao Xu et.al.	2309.09879	null
2023-09-18	Decompose Semantic Shifts for Composed Image Retrieval	Xingyu Yang et.al.	2309.09531	null
2023-09-16	Efficient Object Rearrangement via Multi-view Fusion	Dehao Huang et.al.	2309.08994	null
2023-09-16	DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF	Mert Asim Karaoglu et.al.	2309.08927	link
2023-09-16	Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning	Pengyu Yin et.al.	2309.08914	link
2023-09-15	Active Learning for Fine-Grained Sketch-Based Image Retrieval	Himanshu Thakur et.al.	2309.08743	null
2023-09-15	Optimization of Rank Losses for Image Retrieval	Elias Ramzi et.al.	2309.08250	link
2023-09-18	Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer	Yaoting Wang et.al.	2309.07929	link
2023-09-14	EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization	Minjung Kim et.al.	2309.07471	link
2023-09-13	RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline	Mirko Usuelli et.al.	2309.07094	null
2023-09-11	Towards Content-based Pixel Retrieval in Revisited Oxford and Paris	Guoyuan An et.al.	2309.05438	link
2023-09-08	Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning	Hiroki Nakamura et.al.	2309.04148	null
2023-09-05	Magnetic Navigation using Attitude-Invariant Magnetic Field Information for Loop Closure Detection	Natalia Pavlasek et.al.	2309.02394	null
2023-09-05	Dual Relation Alignment for Composed Image Retrieval	Xintong Jiang et.al.	2309.02169	null
2023-09-04	NLLB-CLIP -- train performant multilingual image retrieval model on a budget	Alexander Visheratin et.al.	2309.01859	null
2023-09-04	Target-Guided Composed Image Retrieval	Haokun Wen et.al.	2309.01366	null
2023-09-02	Deep supervised hashing for fast retrieval of radio image cubes	Steven Ndung'u et.al.	2309.00932	null
2023-08-31	Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval	Prateksha Udhayanan et.al.	2308.16649	null
2023-08-28	Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics	Nils Böhne et.al.	2308.14786	null
2023-08-28	CoVR: Learning Composed Video Retrieval from Web Video Captions	Lucas Ventura et.al.	2308.14746	link
2023-08-27	Deep Learning for Visual Localization and Mapping: A Survey	Changhao Chen et.al.	2308.14039	null
2023-08-26	Learning Efficient Representations for Image-Based Patent Retrieval	Hongsong Wang et.al.	2308.13749	null
2023-08-25	Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers	Mohammad Javad Rajabi et.al.	2308.13671	null
2023-08-24	Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities	Jinze Bai et.al.	2308.12966	link
2023-08-23	Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval	Huafeng Li et.al.	2308.11994	null
2023-08-23	OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes	Tao Xie et.al.	2308.11928	link
2023-08-22	Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features	Alberto Baldrati et.al.	2308.11485	link
2023-08-22	GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training	Xinchi Deng et.al.	2308.11331	null
2023-08-22	LDP-Feat: Image Features with Local Differential Privacy	Francesco Pittaluga et.al.	2308.11223	null
2023-08-21	EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition	Gabriele Berton et.al.	2308.10832	link
2023-08-20	FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory	Anwesan Pal et.al.	2308.10170	null
2023-08-18	3D Model-free Visual localization System from Essential Matrix under Local Planar Motion	Yanmei Jiao et.al.	2308.09566	null
2023-08-17	FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings	Yulin Su et.al.	2308.09012	link
2023-08-16	Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval	Aishwarya Venkataramanan et.al.	2308.08431	link
2023-08-16	Ranking-aware Uncertainty for Text-guided Image Retrieval	Junyang Chen et.al.	2308.08131	null
2023-08-19	Global Features are All You Need for Image Retrieval and Reranking	Shihao Shao et.al.	2308.06954	link
2023-08-14	MixBCT: Towards Self-Adapting Backward-Compatible Training	Yu Liang et.al.	2308.06948	link
2023-08-10	KS-APR: Keyframe Selection for Robust Absolute Pose Regression	Changkun Liu et.al.	2308.05459	null
2023-08-09	AspectMMKG: A Multi-modal Knowledge Graph with Aspect-aware Entities	Jingdan Zhang et.al.	2308.04992	link
2023-08-08	Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval	Yi Bin et.al.	2308.04343	link
2023-08-08	Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval	Yunquan Zhu et.al.	2308.04008	link
2023-08-05	A Comprehensive Analysis of Real-World Image Captioning and Scene Identification	Sai Suprabhanu Nallapaneni et.al.	2308.02833	null
2023-08-03	Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies	Eunsuk Seo et.al.	2308.01871	null
2023-08-01	AnyLoc: Towards Universal Visual Place Recognition	Nikhil Keetha et.al.	2308.00688	link
2023-07-31	Guiding Image Captioning Models Toward More Specific Captions	Simon Kornblith et.al.	2307.16686	null
2023-07-31	Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks	Kousik Rajesh et.al.	2307.16395	null
2023-07-28	D2S: Representing local descriptors and global scene coordinates for camera relocalization	Bach-Thuan Bui et.al.	2307.15250	link
2023-07-26	Neural-based Cross-modal Search and Retrieval of Artwork	Yan Gong et.al.	2307.14244	null
2023-07-26	Boon: A Neural Search Engine for Cross-Modal Information Retrieval	Yan Gong et.al.	2307.14240	null
2023-07-25	Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network	Chull Hwan Song et.al.	2307.13254	null
2023-07-28	SACReg: Scene-Agnostic Coordinate Regression for Visual Localization	Jerome Revaud et.al.	2307.11702	null
2023-07-19	Lazy Visual Localization via Motion Averaging	Siyan Dong et.al.	2307.09981	null
2023-07-19	Quantum Optics based Algorithm for Measuring the Similarity between Images	Vivek Mehta et.al.	2307.09789	null
2023-07-18	Jean-Luc Picard at Touché 2023: Comparing Image Generation, Stance Detection and Feature Matching for Image Retrieval for Arguments	Max Moebius et.al.	2307.09172	null
2023-07-18	3D-SeqMOS: A Novel Sequential 3D Moving Object Segmentation in Autonomous Driving	Qipeng Li et.al.	2307.09044	null
2023-07-19	Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation	Rundong Luo et.al.	2307.08779	null
2023-07-17	Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition	Gabriele Trivigno et.al.	2307.08417	link
2023-07-17	Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification	Tengfei Liang et.al.	2307.08316	link
2023-07-17	NDT-Map-Code: A 3D global descriptor for real-time loop closure detection in lidar SLAM	Lizhou Liao et.al.	2307.08221	link
2023-07-20	Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer	Yujiao Shi et.al.	2307.08015	link
2023-07-10	Phoneme-retrieval; voice recognition; vowels recognition	Brunello Tirozzi et.al.	2307.07407	null
2023-07-14	Risk Controlled Image Retrieval	Kaiwen Cai et.al.	2307.07336	link
2023-07-11	ResMatch: Residual Attention Learning for Local Feature Matching	Yuxin Deng et.al.	2307.05180	link
2023-07-11	Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification	Yi Liao et.al.	2307.05017	null
2023-07-10	Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor	San Jiang et.al.	2307.04520	null
2023-07-10	RaPlace: Place Recognition for Imaging Radar using Radon Transform and Mutable Threshold	Hyesu Jang et.al.	2307.04321	link
2023-07-08	Calibration-Aware Margin Loss: Pushing the Accuracy-Calibration Consistency Pareto Frontier for Deep Metric Learning	Qin Zhang et.al.	2307.04047	null
2023-07-04	Unsupervised Quality Prediction for Improved Single-Frame and Weighted Sequential Visual Place Recognition	Helen Carson et.al.	2307.01464	null
2023-07-04	Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network	Zizhuo Li et.al.	2307.01447	null
2023-07-03	Cross-modal Place Recognition in Image Databases using Event-based Sensors	Xiang Ji et.al.	2307.01047	null
2023-06-30	DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place Recognition under Adverse Conditions	Stephen Hausler et.al.	2306.17536	null
2023-06-30	Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization	Stephen Hausler et.al.	2306.17529	null
2023-06-27	Dental CLAIRES: Contrastive LAnguage Image REtrieval Search for Dental Research	Tanjida Kabir et.al.	2306.15651	null
2023-06-27	Mean Field Theory in Deep Metric Learning	Takuya Furusawa et.al.	2306.15368	null
2023-06-26	Hierarchical Matching and Reasoning for Multi-Query Image Retrieval	Zhong Ji et.al.	2306.14460	link
2023-06-25	Enhancing Dynamic Image Advertising with Vision-Language Pre-training	Zhoufutu Wen et.al.	2306.14112	null
2023-06-23	Catching Image Retrieval Generalization	Maksim Zhdanov et.al.	2306.13357	null
2023-06-22	Deep Metric Learning with Soft Orthogonal Proxies	Farshad Saberi-Movahed et.al.	2306.13055	null
2023-06-22	What to Learn: Features, Image Transformations, or Both?	Yuxuan Chen et.al.	2306.13040	null
2023-06-22	Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval	Katrin Glinka et.al.	2306.12843	null
2023-06-26	Annotation Cost Efficient Active Learning for Content Based Image Retrieval	Julia Henkel et.al.	2306.11605	null
2023-06-19	Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning	Shivaen Ramshetty et.al.	2306.11065	link
2023-06-18	LiDAR-Based Place Recognition For Autonomous Driving: A Survey	Pengcheng Shi et.al.	2306.10561	link
2023-06-15	Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization	Dror Aiger et.al.	2306.09012	link
2023-06-15	Prompt Performance Prediction for Generative IR	Nicolas Bizzozzero et.al.	2306.08915	null
2023-06-15	Graph Convolution Based Efficient Re-Ranking for Visual Retrieval	Yuqi Zhang et.al.	2306.08792	link
2023-06-13	GeneCIS: A Benchmark for General Conditional Image Similarity	Sagar Vaze et.al.	2306.07969	null
2023-06-13	MOFI: Learning Image Representations from Noisy Entity Annotated Images	Wentao Wu et.al.	2306.07952	link
2023-06-12	Zero-shot Composed Text-Image Retrieval	Yikun Liu et.al.	2306.07272	link
2023-06-12	Sticker820K: Empowering Interactive Retrieval with Stickers	Sijie Zhao et.al.	2306.06870	null
2023-06-11	Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models	Yuguang Yang et.al.	2306.06691	null
2023-06-03	Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval	Xu Zhang et.al.	2306.02092	null
2023-06-03	Class Anchor Margin Loss for Content-Based Image Retrieval	Alexandru Ghita et.al.	2306.00630	null
2023-05-31	Chatting Makes Perfect -- Chat-based Image Retrieval	Matan Levy et.al.	2305.20062	link
2023-05-31	Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization	Junan Chen et.al.	2305.20044	null
2023-05-30	A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation	Omar Seddati et.al.	2305.18988	null
2023-05-29	Synfeal: A Data-Driven Simulator for End-to-End Camera Localization	Daniel Coelho et.al.	2305.18260	link
2023-05-29	Nanoscale visualization of the thermally-driven evolution of antiferromagnetic domains in FeTe thin films	Shrinkhala Sharma et.al.	2305.18197	null
2023-05-29	TReR: A Lightweight Transformer Re-Ranking Approach for 3D LiDAR Place Recognition	Tiago Barros et.al.	2305.18013	null
2023-05-28	ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval	Jiapeng Wang et.al.	2305.17652	null
2023-06-01	FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing	Zhuang Li et.al.	2305.17497	link
2023-05-27	Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation	Yueh-Cheng Huang et.al.	2305.17463	null
2023-05-26	Generating Images with Multimodal Language Models	Jing Yu Koh et.al.	2305.17216	link
2023-05-25	Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder	Zheyuan Liu et.al.	2305.16304	link
2023-05-23	Leveraging BEV Representation for 360-degree Visual Place Recognition	Xuecheng Xu et.al.	2305.13814	link
2023-05-23	EDIS: Entity-Driven Image Search over Multimodal Web Content	Siqi Liu et.al.	2305.13631	link
2023-05-20	DAC: Detector-Agnostic Spatial Covariances for Deep Local Features	Javier Tirado-Garín et.al.	2305.12250	link
2023-05-19	Towards More Transparent and Accurate Cancer Diagnosis with an Unsupervised CAE Approach	Zahra Tabatabaei et.al.	2305.11728	null
2023-05-19	Learning Sequence Descriptor based on Spatiotemporal Attention for Visual Place Recognition	Fenglin Zhang et.al.	2305.11467	link
2023-05-12	IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images	Varuna Krishna et.al.	2305.10438	null
2023-05-17	Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval	Haokun Wen et.al.	2305.09979	null
2023-05-13	Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance	Xinyu Lin et.al.	2305.07943	link
2023-05-11	Foundations of Spatial Perception for Robotics: Hierarchical Representations and Real-time Systems	Nathan Hughes et.al.	2305.07154	link
2023-05-09	Visual Place Recognition with Low-Resolution Images	Mihnea-Alexandru Tomita et.al.	2305.05776	null
2023-05-09	Vision-Language Models in Remote Sensing: Current Progress and Future Trends	Congcong Wen et.al.	2305.05726	null
2023-05-09	An Evaluation and Ranking of Different Voting Schemes for Improved Visual Place Recognition	Maria Waheed et.al.	2305.05705	null
2023-05-09	Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query	Ho Hin Lee et.al.	2305.05598	null
2023-05-09	ColonMapper: topological mapping and localization for colonoscopy	Javier Morlana et.al.	2305.05546	null
2023-05-09	Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization	Clémentin Boittiaux et.al.	2305.05301	link
2023-05-09	Patch-DrosoNet: Classifying Image Partitions With Fly-Inspired Models For Lightweight Visual Place Recognition	Bruno Arcanjo et.al.	2305.05256	null
2023-05-09	Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval	Shiyin Dong et.al.	2305.05144	null
2023-05-08	Hierarchical Visual Localization Based on Sparse Feature Pyramid for Adaptive Reduction of Keypoint Map Size	Andrei Potapov et.al.	2305.04856	null
2023-05-08	Privacy-Preserving Representations are not Enough -- Recovering Scene Content from Camera Poses	Kunal Chelani et.al.	2305.04603	link
2023-05-06	Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer	Minyi Zhao et.al.	2305.04072	null
2023-05-06	Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing	Swagatika Dash et.al.	2305.03881	link
2023-05-05	COLA: How to adapt vision-language models to Compose Objects Localized with Attributes?	Arijit Ray et.al.	2305.03689	link
2023-05-05	HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer	Shuzhe Wang et.al.	2305.03595	null
2023-05-05	WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval	Zahra Tabatabaei et.al.	2305.03383	null
2023-05-04	Boundary-aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval	Tan Pan et.al.	2305.02610	link
2023-05-03	Learning-based Relational Object Matching Across Views	Cathrin Elich et.al.	2305.02398	null
2023-05-05	A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text	Yunxin Li et.al.	2305.02265	link
2023-05-03	AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation	Shentong Mo et.al.	2305.01836	null
2023-04-30	Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection	Jie Ren et.al.	2305.00435	null
2023-04-28	SFD2: Semantic-guided Feature Detection and Description	Fei Xue et.al.	2304.14845	link
2023-04-28	Quantum enhanced non-interferometric quantitative phase imaging	Giuseppe Ortolano et.al.	2304.14727	null
2023-04-26	Hydra-Multi: Collaborative Online Construction of 3D Scene Graphs with Multi-Robot Teams	Yun Chang et.al.	2304.13487	null
2023-04-27	STIR: Siamese Transformer for Image Retrieval Postprocessing	Aleksei Shabanov et.al.	2304.13393	null
2023-04-25	DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design	Jiahao Weng et.al.	2304.12506	null
2023-04-24	Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning	Lucas Pascotti Valem et.al.	2304.12448	link
2023-04-23	IDLL: Inverse Depth Line based Visual Localization in Challenging Environments	Wanting Li et.al.	2304.11748	null
2023-04-23	Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval	Mehdi Rafiei et.al.	2304.11734	null
2023-04-17	Features-over-the-Air: Contrastive Learning Enabled Cooperative Edge Inference	Haotian Wu et.al.	2304.08221	null
2023-04-17	NeRF-Loc: Visual Localization with Conditional Neural Radiance Field	Jianlin Liu et.al.	2304.07979	link
2023-04-16	Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification	Luca Piano et.al.	2304.07883	null
2023-04-16	Language Guided Local Infiltration for Interactive Image Retrieval	Fuxiang Huang et.al.	2304.07747	null
2023-04-16	Long-term Visual Localization with Mobile Sensors	Shen Yan et.al.	2304.07691	null
2023-04-16	Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging	Jielin Qiu et.al.	2304.07675	null
2023-04-14	CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression	Mubariz Zaffar et.al.	2304.07426	null
2023-04-14	FM-Loc: Using Foundation Models for Improved Vision-based Localization	Reihaneh Mirjalili et.al.	2304.07058	null
2023-04-17	Toward Real-Time Image Annotation Using Marginalized Coupled Dictionary Learning	Seyed Mahdi Roostaiyan et.al.	2304.06907	link
2023-04-17	You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization problem and dataset	Matteo Toso et.al.	2304.06373	link
2023-04-12	Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation	Yifeng Shi et.al.	2304.06051	link
2023-04-12	Visual Localization using Imperfect 3D Models from the Internet	Vojtech Panek et.al.	2304.05947	link
2023-04-12	Are Local Features All You Need for Cross-Domain Visual Place Recognition?	Giovanni Barbarani et.al.	2304.05887	link
2023-04-12	Unicom: Universal and Compact Representation Learning for Image Retrieval	Xiang An et.al.	2304.05884	link
2023-04-12	SGL: Structure Guidance Learning for Camera Localization	Xudong Zhang et.al.	2304.05571	null
2023-04-14	Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency	Xingwu Ji et.al.	2304.05146	link
2023-04-10	CAVL: Learning Contrastive and Adaptive Representations of Vision and Language	Shentong Mo et.al.	2304.04399	null
2023-04-09	Unsupervised Multi-Criteria Adversarial Detection in Deep Image Retrieval	Yanru Xiao et.al.	2304.04228	null
2023-04-08	SGIDN-LCD: An Appearance-based Loop Closure Detection Algorithm using Superpixel Grids and Incremental Dynamic Nodes	Baosheng Zhang et.al.	2304.03872	null
2023-04-06	$R^{2}$Former: Unified $R$etrieval and $R$ eranking Transformer for Place Recognition	Sijie Zhu et.al.	2304.03410	null
2023-04-06	Distributed formation-enforcing control for UAVs robust to observation noise in relative pose measurements	Viktor Walter et.al.	2304.03057	link
2023-04-05	Efficient OCR for Building a Diverse Digital History	Jacob Carlson et.al.	2304.02737	link
2023-04-05	LogoNet: a fine-grained network for instance-level logo sketch retrieval	Binbin Feng et.al.	2304.02214	link
2023-04-04	OrienterNet: Visual Localization in 2D Public Maps with Neural Matching	Paul-Edouard Sarlin et.al.	2304.02009	link
2023-04-04	Cross-Domain Image Captioning with Discriminative Finetuning	Roberto Dessì et.al.	2304.01662	link
2023-04-02	Learning Similarity between Scene Graphs and Images with Transformers	Yuren Cong et.al.	2304.00590	link
2023-04-01	NPR: Nocturnal Place Recognition in Street	Bingxi Liu et.al.	2304.00276	null
2023-03-31	Unsupervised crack detection on complex stone masonry surfaces	Panagiotis Agrafiotis et.al.	2303.17989	null
2023-03-30	If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval	Finlay G. C. Hudson et.al.	2303.17703	null
2023-03-30	Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime	Rhydian Windsor et.al.	2303.17644	null
2023-03-30	3D Line Mapping Revisited	Shaohui Liu et.al.	2303.17504	link
2023-03-30	Methods and advancement of content-based fashion image retrieval: A Review	Amin Muhammad Shoib et.al.	2303.17371	null
2023-03-30	Adaptive Cross Batch Normalization for Metric Learning	Thalaiyasingam Ajanthan et.al.	2303.17127	null
2023-03-30	MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks	Weicheng Kuo et.al.	2303.16839	null
2023-03-29	Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval	Leo Sampaio Ferraz Ribeiro et.al.	2303.16769	null
2023-03-29	Bi-directional Training for Composed Image Retrieval via Text Prompt Learning	Zheyuan Liu et.al.	2303.16604	link
2023-03-27	Model Cascades for Efficient Image Search	Robert Hönig et.al.	2303.15595	null
2023-03-27	Zero-Shot Composed Image Retrieval with Textual Inversion	Alberto Baldrati et.al.	2303.15247	link
2023-03-27	What Can Human Sketches Do for Object Detection?	Pinaki Nath Chowdhury et.al.	2303.15149	null
2023-03-25	Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style	Fengyin Lin et.al.	2303.14348	link
2023-03-24	A-MuSIC: An Adaptive Ensemble System For Visual Place Recognition In Changing Environments	Bruno Arcanjo et.al.	2303.14247	null
2023-03-24	PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View	Ze Shi et.al.	2303.14095	link
2023-03-24	Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR	Aneeshan Sain et.al.	2303.13779	null
2023-03-28	CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not	Aneeshan Sain et.al.	2303.13440	null
2023-03-22	Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval	Xunguang Wang et.al.	2303.12658	null
2023-03-21	CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion	Geonmo Gu et.al.	2303.11916	link
2023-03-21	LIMITR: Leveraging Local Information for Medical Image-Text Representation	Gefen Dawidowicz et.al.	2303.11755	null
2023-03-25	Data-efficient Large Scale Place Recognition with Graded Similarity Supervision	Maria Leyva-Vallina et.al.	2303.11739	link
2023-03-20	Picture that Sketch: Photorealistic Image Generation from Abstract Sketches	Subhadeep Koley et.al.	2303.11162	null
2023-03-19	Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths	Ming Xu et.al.	2303.10778	link
2023-03-17	MRIS: A Multi-modal Retrieval Approach for Image Synthesis on Diverse Modalities	Boqi Chen et.al.	2303.10249	null
2023-03-17	IRGen: Generative Modeling for Image Retrieval	Yidan Zhang et.al.	2303.10126	link
2023-03-16	Data Roaming and Early Fusion for Composed Image Retrieval	Matan Levy et.al.	2303.09429	link
2023-03-16	Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval	Yi Xie et.al.	2303.09230	null
2023-03-16	Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space	Yuhang He et.al.	2303.09192	null
2023-03-16	Unsupervised Facial Expression Representation Learning with Contrastive Local Warping	Fanglei Xue et.al.	2303.09034	null
2023-03-15	A Triplet-loss Dilated Residual Network for High-Resolution Representation Learning in Image Retrieval	Saeideh Yousefzadeh et.al.	2303.08398	null
2023-03-14	Data-Free Sketch-Based Image Retrieval	Abhra Chaudhuri et.al.	2303.07775	link
2023-03-14	PATS: Patch Area Transportation with Subdivision for Local Feature Matching	Junjie Ni et.al.	2303.07700	null
2023-03-10	Robotic Applications of Pre-Trained Vision-Language Models to Various Recognition Behaviors	Kento Kawaharazuka et.al.	2303.05674	null
2023-03-09	Dominating Set Database Selection for Visual Place Recognition	Anastasiia Kornilova et.al.	2303.05123	null
2023-03-07	Graph Neural Networks in Vision-Language Image Understanding: A Survey	Henry Senior et.al.	2303.03761	null
2023-03-07	Sketch-based Medical Image Retrieval	Kazuma Kobayashi et.al.	2303.03633	link
2023-03-06	Visual Place Recognition: A Tutorial	Stefan Schubert et.al.	2303.03281	link
2023-03-06	MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval	Rohit Agarwal et.al.	2303.03050	link
2023-03-06	Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints	Chenjie Cao et.al.	2303.02885	link
2023-03-05	Composing Mood Board with User Feedback in Concept Space	Shin Sano et.al.	2303.02547	null
2023-03-04	FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks	Xiao Han et.al.	2303.02483	link
2023-03-09	Self-Supervised Learning for Place Representation Generalization across Appearance Changes	Mohamed Adel Musallam et.al.	2303.02370	null
2023-03-03	MixVPR: Feature Mixing for Visual Place Recognition	Amar Ali-bey et.al.	2303.02190	link
2023-03-01	A Complementarity-Based Switch-Fuse System for Improved Visual Place Recognition	Maria Waheed et.al.	2303.00714	null
2023-03-01	ORCHNet: A Robust Global Feature Aggregation approach for 3D LiDAR-based Place recognition in Orchards	T. Barros et.al.	2303.00477	link
2023-03-03	Renderable Neural Radiance Map for Visual Navigation	Obin Kwon et.al.	2303.00304	null
2023-03-01	Region Prediction for Efficient Robot Localization on Large Maps	Matteo Scucchia et.al.	2303.00295	link
2023-02-28	OEKG: The Open Event Knowledge Graph	Simon Gottschalk et.al.	2302.14688	null
2023-02-28	Global Proxy-based Hard Mining for Visual Place Recognition	Amar Ali-bey et.al.	2302.14217	link
2023-02-27	Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation	Yue Xiang et.al.	2302.13929	link
2023-02-26	Data-Efficient Sequence-Based Visual Place Recognition with Highly Compressed JPEG Images	Mihnea-Alexandru Tomita et.al.	2302.13314	null
2023-02-26	Learning cross space mapping via DNN using large scale click-through logs	Wei Yu et.al.	2302.13275	null
2023-02-25	DeepBrainPrint: A Novel Contrastive Framework for Brain MRI Re-Identification	Lemuel Puglisi et.al.	2302.13057	null
2023-02-23	Teaching CLIP to Count to Ten	Roni Paiss et.al.	2302.12066	null
2023-02-22	Steerable Equivariant Representation Learning	Sangnie Bhardwaj et.al.	2302.11349	null
2023-02-21	iQPP: A Benchmark for Image Query Performance Prediction	Eduard Poesina et.al.	2302.10126	link
2023-02-20	Ontology-aware Network for Zero-shot Sketch-based Image Retrieval	Haoxiang Zhang et.al.	2302.10040	null
2023-02-20	TBPos: Dataset for Large-Scale Precision Visual Localization	Masud Fahim et.al.	2302.09825	link
2023-02-17	Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts	Zhihong Chen et.al.	2302.08958	link
2023-02-22	Fashion Image Retrieval with Multi-Granular Alignment	Jinkuan Zhu et.al.	2302.08902	null
2023-02-15	Unsupervised Hashing via Similarity Distribution Calibration	Kam Woh Ng et.al.	2302.07669	link
2023-02-13	Render-and-Compare: Cross-View 6 DoF Localization from Noisy Prior	Shen Yan et.al.	2302.06287	link
2023-02-13	Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation	Binqian Jiang et.al.	2302.06149	link
2023-02-13	Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval	Xu Wang et.al.	2302.06081	link
2023-02-11	Sketch Less Face Image Retrieval: A New Challenge	Dawei Dai et.al.	2302.05576	link
2023-02-10	Is multi-modal vision supervision beneficial to language?	Avinash Madasu et.al.	2302.05016	link
2023-02-06	Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval	Kuniaki Saito et.al.	2302.03084	link
2023-02-06	Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs	Michael Kirchhof et.al.	2302.02865	link
2023-02-03	Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization	Yingying Zhu et.al.	2302.01572	link
2023-02-04	Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval	Frederik Warburg et.al.	2302.01332	link
2023-01-31	Grounding Language Models to Images for Multimodal Generation	Jing Yu Koh et.al.	2301.13823	link
2023-01-31	UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers	Dachuan Shi et.al.	2301.13741	link
2023-01-23	Lexi: Self-Supervised Learning of the UI Language	Pratyay Banerjee et.al.	2301.10165	link
2023-01-17	Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval	Yuchen Wu et.al.	2301.06685	null
2023-01-19	High-bandwidth Close-Range Information Transport through Light Pipes	Joowon Lim et.al.	2301.06496	null
2023-01-13	A LiDAR-Inertial-Visual SLAM System with Loop Detection	Kangcheng Liu et.al.	2301.05604	null
2023-01-12	GH-Feat: Learning Versatile Generative Hierarchical Features from GANs	Yinghao Xu et.al.	2301.05315	null
2023-01-10	Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images	Xindi Wu et.al.	2301.04224	null
2023-01-10	Collaborative Semantic Communication at the Edge	Wing Fei Lo et.al.	2301.03996	null
2023-01-10	Online Backfilling with No Regret for Large-Scale Image Retrieval	Seonguk Seo et.al.	2301.03767	null
2023-01-06	CyberLoc: Towards Accurate Long-term Visual Localization	Liu Liu et.al.	2301.02403	null
2023-01-05	A Probabilistic Framework for Visual Localization in Ambiguous Scenes	Fereidoon Zangeneh et.al.	2301.02086	link
2022-12-31	4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions	Patrick Wenzel et.al.	2301.01147	null
2022-12-30	HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images	Dmitry Yudin et.al.	2212.14649	link
2022-12-27	Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning	Wooyoung Kang et.al.	2212.13563	link
2022-12-23	SuperGF: Unifying Local and Global Features for Visual Localization	Wenzheng Song et.al.	2212.13105	null
2022-12-24	GraffMatch: Global Matching of 3D Lines and Planes for Wide Baseline LiDAR Registration	Parker C. Lusk et.al.	2212.12745	null
2022-12-19	From a Bird's Eye View to See: Joint Camera and Subject Registration without the Camera Calibration	Zekun Qian et.al.	2212.09298	link
2022-12-14	The Infinite Index: Information Retrieval on Generative Text-To-Image Models	Niklas Deckers et.al.	2212.07476	null
2022-12-14	Shared Coupling-bridge for Weakly Supervised Local Feature Learning	Jiayuan Sun et.al.	2212.07047	link
2022-12-08	Group Generalized Mean Pooling for Vision Transformer	Byungsoo Ko et.al.	2212.04114	null
2022-12-12	Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models	Gowthami Somepalli et.al.	2212.03860	null
2022-12-07	LSVL: Large-scale season-invariant visual localization for UAVs	Jouko Kinnari et.al.	2212.03581	null
2022-12-06	ADIR: Adaptive Diffusion for Image Reconstruction	Shady Abu-Hussein et.al.	2212.03221	null
2022-12-08	Privacy-Preserving Visual Localization with Event Cameras	Junho Kim et.al.	2212.03177	link
2022-12-06	Semantic Communication for Internet of Vehicles: A Multi-User Cooperative Approach	Wenjun Xu et.al.	2212.03037	null
2022-12-06	Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds	Zhipeng Zhao et.al.	2212.02757	null
2022-12-04	Fast and Lightweight Scene Regressor for Camera Relocalization	Thuan B. Bui et.al.	2212.01830	link
2022-12-02	Information Retrieval from the Digitized Books	Riya Gupta et.al.	2212.00999	null
2022-12-09	StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition	Yanqing Shen et.al.	2212.00937	null
2022-11-30	Self-Supervised Feature Learning for Long-Term Metric Visual Localization	Yuxuan Chen et.al.	2212.00122	null
2022-11-30	SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation	Tianyu Zhang et.al.	2211.16697	link
2022-11-28	SLAN: Self-Locator Aided Network for Cross-Modal Understanding	Jiang-Tian Zhai et.al.	2211.16208	null
2022-11-29	RankDNN: Learning to Rank for Few-shot Learning	Qianyu Guo et.al.	2211.15320	link
2022-11-28	Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map	Xi Zheng et.al.	2211.15127	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069	link
2022-11-27	BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images	Zhihuang Zhang et.al.	2211.14927	null
2022-11-27	A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition	Rui Huang et.al.	2211.14864	null
2022-11-26	Visual Place Recognition	Bailu Guo et.al.	2211.14533	null
2022-11-26	Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval	Fan Yang et.al.	2211.14515	link
2022-11-30	Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark	Floriana Ciaglia et.al.	2211.13523	link
2022-11-23	InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images	Konstantin Kobs et.al.	2211.12760	link
2022-11-29	Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments	Joshua Knights et.al.	2211.12732	link
2022-11-23	FE-Fusion-VPR: Attention-based Multi-Scale Network Architecture for Visual Place Recognition by Fusing Frames and Events	Kuanxu Hou et.al.	2211.12244	null
2022-11-22	Multimorbidity Content-Based Medical Image Retrieval Using Proxies	Yunyan Xing et.al.	2211.12185	null
2022-11-22	Vision-based localization methods under GPS-denied conditions	Zihao Lu et.al.	2211.11988	null
2022-11-21	ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields	Mohammad Mahdi Johari et.al.	2211.11704	null
2022-11-21	LISA: Localized Image Stylization with Audio via Implicit Neural Representation	Seung Hyun Lee et.al.	2211.11381	null
2022-11-21	NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization	Shitao Tang et.al.	2211.11177	link
2022-11-16	Improving Feature-based Visual Localization by Geometry-Aided Matching	Hailin Yu et.al.	2211.08712	link
2022-11-15	LiePoseNet: Heterogeneous Loss Function Based on Lie Group for Significant Speed-up of PoseNet Training Process	Mikhail Kurenkov et.al.	2211.08480	null
2022-11-14	Degeneracy removal of spin bands in antiferromagnets with non-interconvertible spin motif pair	Lin-Ding Yuan et.al.	2211.07803	null
2022-11-14	Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition	Farid Alijani et.al.	2211.07696	null
2022-11-14	Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization	Yiyang Chen et.al.	2211.07394	link
2022-11-14	Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment	Junyang Wang et.al.	2211.07275	null
2022-11-14	ContextCLIP: Contextual Alignment of Image-Text pairs on CLIP visual representations	Chanda Grover et.al.	2211.07122	null
2022-11-14	Few-shot Metric Learning: Online Adaptation of Embedding for Retrieval	Deunsol Jung et.al.	2211.07116	null
2022-11-12	Partial Visual-Semantic Embedding: Fashion Intelligence System with Sensitive Part-by-Part Learning	Ryotaro Shimizu et.al.	2211.06688	null
2022-11-09	Visual Named Entity Linking: A New Dataset and A Baseline	Wenxiang Sun et.al.	2211.04872	link
2022-11-07	Ultrafast Image Retrieval from a Holographic Memory Disc for High-Speed Operation of a Shift, Scale, and Rotation Invariant Target Recognition System	Julian Gamboa et.al.	2211.03881	null
2022-11-06	A Geometrically Constrained Point Matching based on View-invariant Cross-ratios, and Homography	Yueh-Cheng Huang et.al.	2211.03007	null
2022-11-02	Optimizing Fiducial Marker Placement for Improved Visual Localization	Qiangqiang Huang et.al.	2211.01513	link
2022-11-02	A comparison of uncertainty estimation approaches for DNN-based camera localization	Matteo Vaghi et.al.	2211.01234	null
2022-11-02	M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval	Layne Berry et.al.	2211.01180	null
2022-11-11	Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality	Anuj Diwan et.al.	2211.00768	link
2022-11-07	Fashion-Specific Attributes Interpretation via Dual Gaussian Visual-Semantic Embedding	Ryotaro Shimizu et.al.	2210.17417	null
2022-10-27	Structuring User-Generated Content on Social Media with Multimodal Aspect-Based Sentiment Analysis	Miriam Anschütz et.al.	2210.15377	link
2022-10-27	Leveraging Computer Vision Application in Visual Arts: A Case Study on the Use of Residual Neural Network to Classify and Analyze Baroque Paintings	Daniel Kvak et.al.	2210.15300	null
2022-10-27	Towards Practicality of Sketch-Based Visual Understanding	Ayan Kumar Bhunia et.al.	2210.15146	null
2022-10-27	MMFL-Net: Multi-scale and Multi-granularity Feature Learning for Cross-domain Fashion Retrieval	Chen Bao et.al.	2210.15128	null
2022-10-26	FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning	Suvir Mirchandani et.al.	2210.15028	null
2022-10-26	FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization	Junyang Wang et.al.	2210.14562	null
2022-11-02	A Framework for Collaborative Multi-Robot Mapping using Spectral Graph Wavelets	Lukas Bernreiter et.al.	2210.13856	null
2022-10-27	Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision	Tzu-Jui Julius Wang et.al.	2210.13591	null
2022-10-24	Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval	Zhaopeng Dou et.al.	2210.13440	link
2022-10-23	Neural Eigenfunctions Are Structured Representation Learners	Zhijie Deng et.al.	2210.12637	link
2022-10-21	Boosting vision transformers for image retrieval	Chull Hwan Song et.al.	2210.11909	link
2022-10-20	Communication breakdown: On the low mutual intelligibility between human and neural captioning	Roberto Dessì et.al.	2210.11512	link
2022-10-19	Image Semantic Relation Generation	Mingzhe Du et.al.	2210.11253	null
2022-10-20	General Image Descriptors for Open World Image Retrieval using ViT CLIP	Marcos V. Conde et.al.	2210.11141	link
2022-10-20	DeepRING: Learning Roto-translation Invariant Representation for LiDAR based Place Recognition	Sha Lu et.al.	2210.11029	null
2022-10-19	Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval	Abhra Chaudhuri et.al.	2210.10486	link
2022-10-19	GSV-Cities: Toward Appropriate Supervised Visual Place Recognition	Amar Ali-bey et.al.	2210.10239	link
2022-10-18	A Real-Time Fusion Framework for Long-term Visual Localization	Yuchen Yang et.al.	2210.09757	null
2022-10-17	Bridging the Gap between Local Semantic Concepts and Bag of Visual Words for Natural Scene Image Retrieval	Yousef Alqasrawi et.al.	2210.08875	null
2022-10-17	SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation	Woo Suk Choi et.al.	2210.08675	null
2022-10-16	Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers	Tao Tang et.al.	2210.08458	link
2022-10-14	Cross-Scale Context Extracted Hashing for Fine-Grained Image Binary Encoding	Xuetong Xue et.al.	2210.07572	link
2022-10-14	Boosting Performance of a Baseline Visual Place Recognition Technique by Predicting the Maximally Complementary Technique	Connor Malone et.al.	2210.07509	null
2022-10-11	Large-to-small Image Resolution Asymmetry in Deep Metric Learning	Pavel Suma et.al.	2210.05463	link
2022-10-09	Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning	Ali Safa et.al.	2210.04236	null
2022-10-05	Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features	Deepak Gupta et.al.	2210.02401	link
2022-10-05	Granularity-aware Adaptation for Image Retrieval over Multiple Tasks	Jon Almazán et.al.	2210.02254	null
2022-10-05	Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective	Zijian Zhang et.al.	2210.02206	link
2022-10-04	Supervised Metric Learning for Retrieval via Contextual Similarity Optimization	Christopher Liao et.al.	2210.01908	link
2022-10-04	Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing	Weiying Wang et.al.	2210.01320	null
2022-10-03	Merging Classification Predictions with Sequential Information for Lightweight Visual Place Recognition in Changing Environments	Bruno Arcanjo et.al.	2210.00834	null
2022-10-02	Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval	Kei Nishimaki et.al.	2210.00506	null
2022-09-29	Guided Unsupervised Learning by Subaperture Decomposition for Ocean SAR Image Retrieval	Nicolae-Cătălin Ristea et.al.	2209.15034	null
2022-09-28	TVLT: Textless Vision-Language Transformer	Zineng Tang et.al.	2209.14156	link
2022-09-28	SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval	Yang Shen et.al.	2209.13833	link
2022-09-28	Learning Deep Representations via Contrastive Learning for Instance Retrieval	Tao Wu et.al.	2209.13832	null
2022-09-28	Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text	Cheng-An Hsieh et.al.	2209.13764	link
2022-09-27	Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors	Hao Dong et.al.	2209.13586	link
2022-09-27	Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability	Peisong Wen et.al.	2209.13262	link
2022-09-26	NDD: A 3D Point Cloud Descriptor Based on Normal Distribution for Loop Closure Detection	Ruihao Zhou et.al.	2209.12513	link
2022-09-25	Personalized Saliency in Task-Oriented Semantic Communications: Image Transmission and Performance Analysis	Jiawen Kang et.al.	2209.12274	link
2022-09-24	Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes	Jonathan J. Y. Kim et.al.	2209.11894	null
2022-09-23	Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs	Youya Xia et.al.	2209.11673	null
2022-09-23	Query-based Hard-Image Retrieval for Object Detection at Test Time	Edward Ayers et.al.	2209.11559	link
2022-09-23	Unsupervised Hashing with Semantic Concept Mining	Rong-Cheng Tu et.al.	2209.11475	link
2022-09-22	UNav: An Infrastructure-Independent Vision-Based Navigation System for People with Blindness and Low vision	Anbang Yang et.al.	2209.11336	null
2022-09-21	Visual Localization and Mapping in Dynamic and Changing Environments	João Carlos Virgolino Soares et.al.	2209.10710	null
2022-09-20	PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention	José Arce et.al.	2209.09699	link
2022-09-19	Deep Metric Learning with Chance Constraints	Yeti Z. Gurbuz et.al.	2209.09060	link
2022-09-18	HGI-SLAM: Loop Closure With Human and Geometric Importance Features	Shuhul Mujoo et.al.	2209.08608	null
2022-09-18	Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM	Jiarui Tan et.al.	2209.08578	link
2022-09-17	Data Efficient Visual Place Recognition Using Extremely JPEG-Compressed Images	Mihnea-Alexandru Tomita et.al.	2209.08343	null
2022-09-15	Efficient Planar Pose Estimation via UWB Measurements	Haodong Jiang et.al.	2209.06779	link
2022-09-14	Transformers and CNNs both Beat Humans on SBIR	Omar Seddati et.al.	2209.06629	null
2022-09-14	Tac2Structure: Object Surface Reconstruction Only through Multi Times Touch	J. Lu et.al.	2209.06545	link
2022-09-14	iSimLoc: Visual Global Localization for Previously Unseen Environments with Simulated Images	Peng Yin et.al.	2209.06376	null
2022-09-09	General Place Recognition Survey: Towards the Real-world Autonomy Age	Peng Yin et.al.	2209.04497	link
2022-09-09	Retinal Image Restoration and Vessel Segmentation using Modified Cycle-CBAM and CBAM-UNet	Alnur Alimanov et.al.	2209.04234	link
2022-09-13	Segment Augmentation and Differentiable Ranking for Logo Retrieval	Feyza Yavuz et.al.	2209.02482	null
2022-09-12	ScaleFace: Uncertainty-aware Deep Metric Learning	Roman Kail et.al.	2209.01880	link
2022-09-04	CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud	Evgeny Yudin et.al.	2209.01605	null
2022-08-31	EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing	Qihua Feng et.al.	2208.14657	link
2022-08-25	A Deep Perceptual Measure for Lens and Camera Calibration	Yannick Hold-Geoffroy et.al.	2208.12300	null
2022-08-25	A Privacy-Preserving and End-to-End-Based Encrypted Image Retrieval Scheme	Zhixun Lu et.al.	2208.11876	null
2022-08-23	Satellite Image Search in AgoraEO	Ahmet Kerem Aksoy et.al.	2208.10830	null
2022-08-20	Fuse and Attend: Generalized Embedding Learning for Art and Sketches	Ujjal Kr Dutta et.al.	2208.09698	null
2022-08-19	Self-Supervised Visual Place Recognition by Mining Temporal and Feature Neighborhoods	Chao Chen et.al.	2208.09315	link
2022-08-19	TTT-UCDR: Test-time Training for Universal Cross-Domain Retrieval	Soumava Paul et.al.	2208.09198	link
2022-08-17	Visual Cross-View Metric Localization with Dense Uncertainty Estimates	Zimin Xia et.al.	2208.08519	link
2022-08-17	Understanding Attention for Vision-and-Language Tasks	Feiqi Cao et.al.	2208.08104	link
2022-08-14	Visual Localization via Few-Shot Scene Region Classification	Siyan Dong et.al.	2208.06933	link
2022-08-14	HyP $^2$ Loss: Beyond Hypersphere Metric Space for Multi-label Image Retrieval	Chengyin Xu et.al.	2208.06866	link
2022-08-13	Finding Point with Image: An End-to-End Benchmark for Vision-based UAV Localization	Ming Dai et.al.	2208.06561	link
2022-08-16	Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion Augmentation	Georgios Kouros et.al.	2208.06195	link
2022-08-12	Instance Image Retrieval by Learning Purely From Within the Dataset	Zhongyan Zhang et.al.	2208.06119	null
2022-08-07	CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization	Yujiao Shi et.al.	2208.03660	null
2022-08-05	A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch	Patsorn Sangkloy et.al.	2208.03354	null
2022-08-05	ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding	Bingning Wang et.al.	2208.03030	link
2022-08-04	Pattern Spotting and Image Retrieval in Historical Documents using Deep Hashing	Caio da S. Dias et.al.	2208.02397	null
2022-07-27	On the robustness of self-supervised representations for multi-view object classification	David Torpey et.al.	2208.00787	null
2022-07-26	Multimodal Neural Machine Translation with Search Engine Based Image Retrieval	ZhenHao Tang et.al.	2208.00767	null
2022-07-30	Towards Privacy-Preserving, Real-Time and Lossless Feature Matching	Qiang Meng et.al.	2208.00214	link
2022-07-30	DAS: Densely-Anchored Sampling for Deep Metric Learning	Lizhao Liu et.al.	2208.00119	link
2022-07-29	Curriculum Learning for Data-Efficient Vision-Language Alignment	Tejas Srinivasan et.al.	2207.14525	null
2022-07-29	Neural Density-Distance Fields	Itsuki Ueda et.al.	2207.14455	link
2022-07-27	Abstracting Sketches through Simple Primitives	Stephan Alaniz et.al.	2207.13543	link
2022-07-27	Satellite Image Based Cross-view Localization for Autonomous Vehicle	Shan Wang et.al.	2207.13506	null
2022-07-26	RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments	Jiahui Zhang et.al.	2207.12579	null
2022-07-25	A hybrid-qudit representation of digital RGB images	Sreetama Das et.al.	2207.12550	null
2022-07-19	ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization	Ivan Cisneros et.al.	2207.12317	link
2022-07-22	PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes	BaoSheng Zhang et.al.	2207.10916	null
2022-07-25	MeshLoc: Mesh-Based Visual Localization	Vojtech Panek et.al.	2207.10762	link
2022-07-20	Revisiting Hotels-50K and Hotel-ID	Aarash Feizi et.al.	2207.10200	link
2022-07-20	Feature Representation Learning for Unsupervised Cross-domain Image Retrieval	Conghui Hu et.al.	2207.09721	link
2022-07-19	SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany	Dominik Koßmann et.al.	2207.09507	null
2022-07-19	Context Unaware Knowledge Distillation for Image Retrieval	Bytasandram Yaswanth Reddy et.al.	2207.09070	link
2022-07-17	FashionViL: Fashion-Focused Vision-and-Language Representation Learning	Xiao Han et.al.	2207.08150	link
2022-07-14	AutoMerge: A Framework for Map Assembling and Smoothing in City-scale Environments	Peng Yin et.al.	2207.06965	null
2022-07-14	Semi-supervised Vector-Quantization in Visual SLAM using HGCN	Amir Zarringhalam et.al.	2207.06738	null
2022-07-14	Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders	Amir Zarringhalam et.al.	2207.06732	null
2022-07-19	Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras	Fangwen Shu et.al.	2207.06058	link
2022-07-12	CPO: Change Robust Panorama to Point Cloud Localization	Junho Kim et.al.	2207.05317	link
2022-07-05	Hierarchical Average Precision Training for Pertinent Image Retrieval	Elias Ramzi et.al.	2207.04873	link
2022-07-11	A clinically motivated self-supervised approach for content-based image retrieval of CT liver images	Kristoffer Knutsen Wickstrøm et.al.	2207.04812	link
2022-07-09	BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval	Wenqiao Zhang et.al.	2207.04211	null
2022-07-08	Learning Sequential Descriptors for Sequence-based Visual Place Recognition	Riccardo Mereu et.al.	2207.03868	link
2022-07-08	GEMS: Scene Expansion using Generative Models of Graphs	Rishi Agarwal et.al.	2207.03729	null
2022-07-05	Object-Level Targeted Selection via Deep Template Matching	Suraj Kothawade et.al.	2207.01778	null
2022-07-06	Adaptive Fine-Grained Sketch-Based Image Retrieval	Ayan Kumar Bhunia et.al.	2207.01723	link
2022-07-04	Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets	Paul Albert et.al.	2207.01573	link
2022-07-08	Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval	Keyu Wen et.al.	2207.00733	null
2022-07-01	DALG: Deep Attentive Local and Global Modeling for Image Retrieval	Yuxin Song et.al.	2207.00287	null
2022-07-04	BadHash: Invisible Backdoor Attacks against Deep Hashing with Clean Label	Shengshan Hu et.al.	2207.00278	link
2022-06-28	Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems	Stephen Hausler et.al.	2206.13883	null
2022-07-08	How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels	Tobias Fischer et.al.	2206.13673	link
2022-06-25	FreSCo: Frequency-Domain Scan Context for LiDAR-based Place Recognition with Translation and Rotation Invariance	Yongzhi Fan et.al.	2206.12628	link
2022-06-25	Inverted Semantic-Index for Image Retrieval	Ying Wang et.al.	2206.12623	null
2022-06-17	RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval	Yihan Wu et.al.	2206.11225	null
2022-06-22	ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas	Prathmesh Madhu et.al.	2206.11115	null
2022-06-20	Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval	Guile Wu et.al.	2206.09806	null
2022-06-18	Attention-based Dynamic Subspace Learners for Medical Image Analysis	Sukesh Adiga V et.al.	2206.09068	null
2022-06-17	Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments	Khairuldanial Ismail et.al.	2206.08733	null
2022-06-06	Learning Treatment Plan Representations for Content Based Image Retrieval	Charles Huang et.al.	2206.02912	null
2022-06-19	NORPPA: NOvel Ringed seal re-identification by Pelage Pattern Aggregation	Ekaterina Nepovinnykh et.al.	2206.02498	link
2022-06-05	Autoregressive Model for Multi-Pass SAR Change Detection Based on Image Stacks	B. G. Palm et.al.	2206.02278	null
2022-05-28	FaIRCoP: Facial Image Retrieval using Contrastive Personalization	Devansh Gupta et.al.	2205.15870	null
2022-05-31	Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark	Martin Humenberger et.al.	2205.15761	link
2022-05-27	Improving Road Segmentation in Challenging Domains Using Similar Place Priors	Connor Malone et.al.	2205.14112	null
2022-05-31	LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments	Yun Chang et.al.	2205.13135	link
2022-05-26	Fine-grained Image Captioning with CLIP Reward	Jaemin Cho et.al.	2205.13115	link
2022-05-25	Deep Dense Local Feature Matching and Vehicle Removal for Indoor Visual Localization	Kyung Ho Park et.al.	2205.12544	null
2022-05-24	OnePose: One-Shot Object Pose Estimation without CAD Models	Jiaming Sun et.al.	2205.12257	link
2022-05-23	VPAIR -- Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments	Michael Schleiss et.al.	2205.11567	link
2022-05-23	VQA-GNN: Reasoning with Multimodal Semantic Graph for Visual Question Answering	Yanan Wang et.al.	2205.11501	null
2022-05-23	Deep Image Retrieval is not Robust to Label Noise	Stanislav Dereka et.al.	2205.11195	null
2022-05-22	Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval	Zelong Zeng et.al.	2205.10878	link
2022-05-20	Visually-Augmented Language Modeling	Weizhi Wang et.al.	2205.10178	link
2022-05-18	Deep Features for CBIR with Scarce Data using Hebbian Learning	Gabriele Lagani et.al.	2205.08935	null
2022-05-19	Text Detection & Recognition in the Wild for Robot Localization	Zobeir Raisi et.al.	2205.08565	null
2022-05-12	One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code	Yong Dai et.al.	2205.06126	null
2022-05-11	Review on Panoramic Imaging and Its Applications in Scene Understanding	Shaohua Gao et.al.	2205.05570	null
2022-05-18	Identical Image Retrieval using Deep Learning	Sayan Nath et.al.	2205.04883	link
2022-05-09	Introspective Deep Metric Learning	Chengkun Wang et.al.	2205.04449	link
2022-05-11	Improved Evaluation and Generation of Grid Layouts using Distance Preservation Quality and Linear Assignment Sorting	Kai Uwe Barthel et.al.	2205.04255	link
2022-05-08	Adversarial Learning of Hard Positives for Place Recognition	Wenxuan Fang et.al.	2205.03871	null
2022-05-10	AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching	Khanh Nguyen et.al.	2205.02849	link
2022-04-29	Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval	Shupeng Su et.al.	2204.13919	null
2022-04-29	Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval	Siyu Ren et.al.	2204.13913	link
2022-04-28	Spatio-Temporal Graph Localization Networks for Image-based Navigation	Takahiro Niwa et.al.	2204.13237	null
2022-04-27	The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection	Konstantinos A. Tsintotas et.al.	2204.12831	null
2022-04-25	SceneTrilogy: On Scene Sketches and its Relationship with Text and Photo	Pinaki Nath Chowdhury et.al.	2204.11964	null
2022-04-23	On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning	Muhammad Umer Anwaar et.al.	2204.11848	null
2022-04-24	Progressive Learning for Image Retrieval with Hybrid-Modality Queries	Yida Zhao et.al.	2204.11212	null
2022-04-23	Training and challenging models for text-guided fashion image retrieval	Eric Dodds et.al.	2204.11004	link
2022-04-18	Centralized Adversarial Learning for Robust Deep Hashing	Xunguang Wang et.al.	2204.10779	link
2022-04-22	Transferring ConvNet Features from Passive to Active Robot Self-Localization: The Use of Ego-Centric and World-Centric Views	Kanya Kurauchi et.al.	2204.10497	null
2022-04-21	Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval	Zhiqiang Yuan et.al.	2204.09868	link
2022-04-21	Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information	Zhiqiang Yuan et.al.	2204.09860	link
2022-04-20	Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations	Leila Pishdad et.al.	2204.09268	null
2022-04-19	Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing	Georgii Mikriukov et.al.	2204.08707	null
2022-04-18	Multiple-environment Self-adaptive Network for Aerial-view Geo-localization	Tingyu Wang et.al.	2204.08381	link
2022-04-15	Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder	Hanjing Ye et.al.	2204.07350	link
2022-04-14	Composite Code Sparse Autoencoders for first stage retrieval	Carlos Lassance et.al.	2204.07023	null
2022-04-13	Reuse your features: unifying retrieval and feature-metric alignment	Javier Morlana et.al.	2204.06292	link
2022-04-12	Probabilistic Compositional Embeddings for Multimodal Image Retrieval	Andrei Neculai et.al.	2204.05845	link
2022-04-12	Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval	Yu-Wei Zhan et.al.	2204.05666	null
2022-04-12	HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud	Zhixing Hou et.al.	2204.05481	null
2022-04-11	Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context	Lizhou Liao et.al.	2204.04932	link
2022-04-10	Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image	Yujiao Shi et.al.	2204.04752	link
2022-04-08	A Generic Image Retrieval Method for Date Estimation of Historical Document Collections	Adrià Molina et.al.	2204.04028	null
2022-04-08	SnapMode: An Intelligent and Distributed Large-Scale Fashion Image Retrieval Platform Based On Big Data and Deep Generative Adversarial Network Technologies	Narges Norouzi et.al.	2204.03998	null
2022-04-05	Leveraging Equivariant Features for Absolute Pose Regression	Mohamed Adel Musallam et.al.	2204.02163	null
2022-04-04	"This is my unicorn, Fluffy": Personalizing frozen vision-language representations	Niv Cohen et.al.	2204.01694	link
2022-04-01	Bi-directional Loop Closure for Visual SLAM	Ihtisham Ali et.al.	2204.01524	null
2022-04-01	LASER: LAtent SpacE Rendering for 2D Visual Localization	Zhixiang Min et.al.	2204.00157	link
2022-03-31	Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning	Semih Orhan et.al.	2203.16945	null
2022-03-30	AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift	Burak Yildiz et.al.	2203.16291	link
2022-03-29	Long-term Visual Map Sparsification with Heterogeneous GNN	Ming-Fang Chang et.al.	2203.15182	null
2022-04-01	A Simulation Benchmark for Vision-based Autonomous Navigation	Lauri Suomela et.al.	2203.13048	link
2022-03-24	Is Geometry Enough for Matching in Visual Localization?	Qunjie Zhou et.al.	2203.12979	link
2022-03-21	MatchFormer: Interleaving Attention in Transformers for Feature Matching	Qing Wang et.al.	2203.09645	link
2022-03-10	ReF -- Rotation Equivariant Features for Local Feature Matching	Abhishek Peri et.al.	2203.05206	null
2022-03-09	Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction	Matthieu Zins et.al.	2203.04613	null
2022-03-08	Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM	Pierre-Yves Lajoie et.al.	2203.04446	link
2022-03-07	ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization	Simon Maurer et.al.	2203.03610	link
2022-03-07	Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms	Qingqing Li et.al.	2203.03454	link
2022-03-01	SwitchHit: A Probabilistic, Complementarity-Based Switching System for Improved Visual Place Recognition in Changing Environments	Maria Waheed et.al.	2203.00591	null
2022-02-28	Deep Camera Pose Regression Using Pseudo-LiDAR	Ali Raza et.al.	2203.00080	null
2022-02-25	RELMOBNET: A Robust Two-Stage End-To-End Training Approach For MOBILENETV3 Based Relative Camera Pose Estimation	Praveen Kumar Rajendran et.al.	2202.12838	null
2022-02-24	Highly-Efficient Binary Neural Networks for Visual Place Recognition	Bruno Ferrarini et.al.	2202.12375	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146	link
2022-02-14	Tightly Coupled Learning Strategy for Weakly Supervised Hierarchical Place Recognition	Y. Shen et.al.	2202.06470	null
2022-02-11	Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition	Yingfeng Cai et.al.	2202.05738	null
2022-02-09	Object-Guided Day-Night Visual Localization in Urban Scenes	Assia Benbihi et.al.	2202.04445	null
2022-02-08	A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition	Nie Jiwei et.al.	2202.03677	null
2022-02-25	CFP-SLAM: A Real-time Visual SLAM Based on Coarse-to-Fine Probability in Dynamic Environments	Xinggang Hu et.al.	2202.01938	null
2022-02-03	Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization	Andrea Vallone et.al.	2202.01821	null
2022-02-02	Training Semantic Descriptors for Image-Based Localization	Ibrahim Cinaroglu et.al.	2202.01212	null
2022-01-31	Hydra: A Real-time Spatial Perception Engine for 3D Scene Graph Construction and Optimization	Nathan Hughes et.al.	2201.13360	null
2022-01-31	Rigidity Preserving Image Transformations and Equivariance in Perspective	Lucas Brynte et.al.	2201.13065	null
2022-01-25	Learning Semantics for Visual Place Recognition through Multi-Scale Attention	Valerio Paolicelli et.al.	2201.09701	link
2022-01-22	Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems	Xi Zheng et.al.	2201.09048	link
2022-01-15	A Critical Analysis of Image-based Camera Pose Estimation Techniques	Meng Xu et.al.	2201.05816	null
2022-01-14	SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions	Ali Samadzadeh et.al.	2201.05386	link
2021-12-23	NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning	Tony Ng et.al.	2112.12785	null
2021-12-16	CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data	Qi Yan et.al.	2112.09081	link
2021-12-05	RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather	Jialu Wang et.al.	2112.02469	null
2021-11-25	MegLoc: A Robust and Accurate Visual Localization Pipeline	Shuxue Peng et.al.	2111.13063	null
2021-10-08	Semantic Image Alignment for Vehicle Localization	Markus Herb et.al.	2110.04162	null
2021-10-05	Season-invariant GNSS-denied visual localization for UAVs	Jouko Kinnari et.al.	2110.01967	link
2021-09-30	Forming a sparse representation for visual place recognition using a neurorobotic approach	Sylvain Colomer et.al.	2109.14916	null
2021-09-22	Audio-Visual Grounding Referring Expression for Robotic Manipulation	Yefei Wang et.al.	2109.10571	null
2021-09-20	Efficient shape mapping through dense touch and vision	Sudharshan Suresh et.al.	2109.09884	link
2021-09-15	S3LAM: Structured Scene SLAM	Mathieu Gonzalez et.al.	2109.07339	null
2021-09-13	Monocular Camera Localization for Automated Vehicles Using Image Retrieval	Eunhyek Joa et.al.	2109.06296	null
2021-09-10	Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization	Sungho Yoon et.al.	2109.04753	link
2021-09-09	CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization	Ara Jafarzadeh et.al.	2109.04527	null
2021-09-09	Keeping an Eye on Things: Deep Learned Features for Long-Term Visual Localization	Mona Gridseth et.al.	2109.04041	link

(back to top)

Keypoint Detection

Publish Date	Title	Authors	PDF	Code
2025-03-04	A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection	Junyi Wang et.al.	2503.02481	null
2025-03-01	Autonomous Dissection in Robotic Cholecystectomy	Ki-Hwan Oh et.al.	2503.00666	null
2025-02-28	CNSv2: Probabilistic Correspondence Encoded Neural Image Servo	Anzhe Chen et.al.	2503.00132	null
2025-02-27	Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets	Jisoo Lee et.al.	2502.19766	null
2025-02-23	Rewards-based image analysis in microscopy	Kamyar Barakati et.al.	2502.18522	null
2025-02-19	2.5D U-Net with Depth Reduction for 3D CryoET Object Identification	Yusuke Uchida et.al.	2502.13484	link
2025-01-30	Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images	Wei-Lun Chen et.al.	2501.18453	null
2025-01-30	Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models	Bhargav Ghanekar et.al.	2501.18361	null
2025-01-30	Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems	Liudi Yang et.al.	2501.18110	null
2025-01-21	Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction	Mengyuan Li et.al.	2501.11844	null
2025-01-20	MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching	Yepeng Liu et.al.	2501.11299	null
2025-01-19	Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation	Shibang Liu et.al.	2501.11069	null
2025-01-13	Empirical Comparison of Four Stereoscopic Depth Sensing Cameras for Robotics Applications	Lukas Rustler et.al.	2501.07421	null
2025-01-13	Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps	Saurabh Gupta et.al.	2501.07399	null
2024-12-24	GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network	Xianfeng Song et.al.	2412.18221	link
2024-12-21	A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection	Shahid Ansari et.al.	2412.16755	null
2024-12-19	Corn Ear Detection and Orientation Estimation Using Deep Learning	Nathan Sprague et.al.	2412.14954	null
2024-12-12	Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models	Faith Johnson et.al.	2412.09739	null
2024-12-09	An Efficient Scene Coordinate Encoding and Relocalization Method	Kuan Xu et.al.	2412.06488	link
2024-12-09	ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models	Bingchen Gong et.al.	2412.06292	null
2024-12-07	Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures	Muhammad Umar Farooq et.al.	2412.05487	null
2024-12-04	Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything	Yongkyu Lee et.al.	2412.03472	link
2024-12-02	MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection	Yonghao Dang et.al.	2412.01422	null
2024-11-23	OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs	Chen Xin et.al.	2411.15653	link
2024-11-19	IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose	Fei Ren et.al.	2411.12676	null
2024-11-04	Silver medal Solution for Image Matching Challenge 2024	Yian Wang et.al.	2411.01851	null
2024-11-04	KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension	Jie Yang et.al.	2411.01846	null
2024-10-31	From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots	Vasileios Tzouras et.al.	2410.23906	null
2024-10-04	Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation	Aman Anand et.al.	2410.14700	null
2024-11-27	Sim2real Cattle Joint Estimation in 3D point clouds	Mohammad Okour et.al.	2410.14419	null
2024-10-16	PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network	Asish Bera et.al.	2410.12742	null
2024-10-16	RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition	Asish Bera et.al.	2410.12718	null
2024-10-01	A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference	Yuan Li et.al.	2410.11848	null
2024-10-11	Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image	Marta Veganzones Rodriguez et.al.	2410.09155	null
2024-10-08	Unsupervised Model Diagnosis	Yinong Oliver Wang et.al.	2410.06243	null
2024-10-08	Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration	Xueyang Kang et.al.	2410.05729	link
2024-10-16	Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features	Chengkai Hou et.al.	2410.02237	null
2024-10-02	Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection	Hongru Yan et.al.	2410.01404	null
2024-09-30	OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection	Changsheng Lu et.al.	2409.19899	link
2024-10-07	SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation	Xin Li et.al.	2409.18082	null
2024-09-24	GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization	Gennady Sidorov et.al.	2409.16502	link
2024-09-20	Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators	Niloufar Amiri et.al.	2409.13668	null
2024-09-25	Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding	Rania Hossam et.al.	2409.08695	link
2024-09-06	D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection	Kentaro Hirahara et.al.	2409.04060	null
2024-10-01	Towards Practical Human Motion Prediction with LiDAR Point Clouds	Xiao Han et.al.	2408.08202	null
2024-07-31	Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods	Xusheng Luo et.al.	2408.00117	null
2024-07-26	SHIC: Shape-Image Correspondences with no Keypoint Supervision	Aleksandar Shtedritski et.al.	2407.18907	null
2024-07-25	LION: Linear Group RNN for 3D Object Detection in Point Clouds	Zhe Liu et.al.	2407.18232	link
2024-07-22	RADA: Robust and Accurate Feature Learning with Domain Adaptation	Jingtai He et.al.	2407.15791	null
2024-07-09	LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition	Teng Wang et.al.	2407.06730	null
2024-07-04	PFGS: High Fidelity Point Cloud Rendering via Feature Splatting	Jiaxu Wang et.al.	2407.03857	link
2024-07-03	A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes	Li Fang et.al.	2407.02830	link
2024-07-02	Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning	Chengchao Shen et.al.	2407.02014	link
2024-06-28	Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics	Chengrui Gao et.al.	2406.19672	null
2024-07-23	A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking	Lorenzo Shaikewitz et.al.	2406.16837	link
2024-06-03	Scale-Free Image Keypoints Using Differentiable Persistent Homology	Giovanni Barbarani et.al.	2406.01315	link
2024-06-23	W-Net: A Facial Feature-Guided Face Super-Resolution Network	Hao Liu et.al.	2406.00676	null
2024-05-25	Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration	Junjie Gao et.al.	2405.16085	null
2024-06-01	Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection -- Towards Precise Fish Morphological Assessment in Aquaculture Breeding	Weizhen Liu et.al.	2405.12476	link
2024-05-14	TP3M: Transformer-based Pseudo 3D Image Matching with Reference	Liming Han et.al.	2405.08434	null
2024-05-15	Vector-Symbolic Architecture for Event-Based Optical Flow	Hongzhi You et.al.	2405.08300	null
2024-05-13	RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration	Congjia Chen et.al.	2405.07594	null
2024-05-08	Unsupervised Skin Feature Tracking with Deep Neural Networks	Jose Chang et.al.	2405.04943	null
2024-05-07	A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images	László Kopácsi et.al.	2405.04650	null
2024-04-30	A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images	Wang Zhang et.al.	2404.19311	null
2024-04-25	Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach	Tahmim Hossain et.al.	2404.14560	null
2024-04-19	SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers	Vandad Davoodnia et.al.	2404.12625	null
2024-04-17	Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images	Junbiao Pang et.al.	2404.10985	null
2024-03-28	Towards Long Term SLAM on Thermal Imagery	Colin Keil et.al.	2403.19885	link
2024-03-28	Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation	Xiao Lin et.al.	2403.19527	link
2024-03-27	RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation	Yang Tian et.al.	2403.18259	null
2024-03-18	FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events	Xiangyuan Wang et.al.	2403.11662	link
2024-03-05	Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion	Meng Zheng et.al.	2403.03217	null
2024-02-22	A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets	Chengzhang Yu et.al.	2402.14241	null
2024-02-25	A Feature Matching Method Based on Multi-Level Refinement Strategy	Shaojie Zhang et.al.	2402.13488	null
2024-03-05	3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data	Zhi-Yi Lin et.al.	2402.13172	null
2024-02-25	Region Feature Descriptor Adapted to High Affine Transformations	Shaojie Zhang et.al.	2402.09724	null
2024-01-29	Reconstructing Close Human Interactions from Multiple Views	Qing Shuai et.al.	2401.16173	link
2024-01-17	To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection	Luyi Han et.al.	2401.09336	link
2024-01-08	Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach	Huanyu Liu et.al.	2401.03742	link
2024-03-22	6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation	Li Xu et.al.	2401.00029	null
2023-12-27	Bezier-based Regression Feature Descriptor for Deformable Linear Objects	Fangqing Chen et.al.	2312.16502	null
2023-12-24	Residual Learning for Image Point Descriptors	Rashik Shrestha et.al.	2312.15471	null
2023-12-22	BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions	Elias Marks et.al.	2312.14706	null
2023-12-19	Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation	Jiaming Liu et.al.	2312.12480	null
2023-12-19	An effective image copy-move forgery detection using entropy image	Zhaowei Lu et.al.	2312.11793	link
2023-12-11	VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data	Jian Shi et.al.	2312.08871	link
2023-12-11	Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach	Travis Driver et.al.	2312.06865	link
2023-12-01	Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version)	Emma Cramer et.al.	2312.00592	link
2023-11-30	Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications	Sahar Almahfouz Nasser et.al.	2311.18281	null
2023-11-29	Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features	Thomas Wimmer et.al.	2311.18113	link
2023-11-28	Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features	Niladri Shekhar Dutt et.al.	2311.17024	link
2023-11-28	Riemannian Self-Attention Mechanism for SPD Networks	Rui Wang et.al.	2311.16738	null
2023-11-27	A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor	Jialin Liu et.al.	2311.15609	null
2023-11-21	Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers	Bo Sun et.al.	2311.12291	null
2023-11-20	CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement	Boni Hu et.al.	2311.11604	link
2023-11-17	Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration	Paul J. Claasen et.al.	2311.10361	link
2023-11-13	Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning	Tomáš Kunzo et.al.	2311.07398	null
2023-11-11	CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer	Haoyu Ma et.al.	2311.06443	link
2023-11-08	3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud	Jianchao Ci et.al.	2311.04699	null
2023-11-06	TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains	Alexander Naumann et.al.	2311.03124	link
2023-11-06	An invariant feature extraction for multi-modal images matching	Chenzhong Gao et.al.	2311.02842	null
2023-10-20	Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification	Mateus Roder et.al.	2310.13490	null
2023-10-12	UniPose: Detecting Any Keypoints	Jie Yang et.al.	2310.08530	link
2023-10-10	l-dyno: framework to learn consistent visual features using robot's motion	Kartikeya Singh et.al.	2310.06249	link
2023-10-10	Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face	Hao Zhang et.al.	2310.05056	link
2023-10-13	H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation	Yanjie Ze et.al.	2310.01404	link
2023-10-04	Self-supervised Learning of Contextualized Local Visual Embeddings	Thalles Santos Silva et.al.	2310.00527	link
2023-10-22	ObVi-SLAM: Long-Term Object-Visual SLAM	Amanda Adkins et.al.	2309.15268	link
2023-09-19	LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation	Haizhou Zhang et.al.	2309.10436	link
2023-09-18	RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy	Mert Asim Karaoglu et.al.	2309.09563	null
2023-09-17	CryoAlign: feature-based method for global and local 3D alignment of EM density maps	Bintao He et.al.	2309.09217	null
2023-09-14	EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization	Minjung Kim et.al.	2309.07471	link
2023-09-09	Mirror-Aware Neural Humans	Daniel Ajisafe et.al.	2309.04750	link
2023-09-07	InstructDiffusion: A Generalist Modeling Interface for Vision Tasks	Zigang Geng et.al.	2309.03895	null
2023-09-04	SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras	Himanshu Pahadia et.al.	2309.01324	null
2023-09-12	Improving the matching of deformable objects by learning to detect keypoints	Felipe Cadar et.al.	2309.00434	link
2023-08-31	SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation	Jiaben Chen et.al.	2308.16876	null
2023-08-30	Learning Structure-from-Motion with Graph Attention Networks	Lucas Brynte et.al.	2308.15984	link
2023-08-29	A lightweight 3D dense facial landmark estimation model from position map data	Shubhajit Basak et.al.	2308.15170	link
2023-08-27	Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors	Francesco Pirotti et.al.	2308.14047	null
2023-08-24	VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition	Gengxuan Tian et.al.	2308.12870	null
2023-08-22	LDP-Feat: Image Features with Local Differential Privacy	Francesco Pittaluga et.al.	2308.11223	null
2023-08-20	Neural Interactive Keypoint Detection	Jie Yang et.al.	2308.10174	link
2023-08-19	ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment	Bingyang Zhou et.al.	2308.09987	null
2023-09-03	DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching	Johan Edstedt et.al.	2308.08479	link
2023-08-15	CoDeF: Content Deformation Fields for Temporally Consistent Video Processing	Hao Ouyang et.al.	2308.07926	link
2023-08-15	ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition	Wenyuan Xue et.al.	2308.07743	null
2023-08-14	DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport	Sk Aziz Ali et.al.	2308.07153	null
2023-08-14	2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds	Minhao Li et.al.	2308.05667	link
2023-08-02	Automated Hit-frame Detection for Badminton Match Analysis	Yu-Hang Chien et.al.	2307.16000	link
2023-07-25	Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception	Chuanyu Luo et.al.	2307.13300	null
2023-07-21	Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data	Sahar Almahfouz Nasser et.al.	2307.10698	link
2023-07-19	SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid	Zi Li et.al.	2307.09727	link
2023-07-01	SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation	Fabian Duffhauss et.al.	2307.00306	link
2023-06-27	Detector-Free Structure from Motion	Xingyi He et.al.	2306.15669	link
2023-06-26	CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild	Li Ding et.al.	2306.15073	null
2023-06-28	Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset	Ziqiao Weng et.al.	2306.07089	link
2023-06-07	Learning Probabilistic Coordinate Fields for Robust Correspondences	Weiyue Zhao et.al.	2306.04231	null
2023-06-03	LDEB -- Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues	Amitabha Dey et.al.	2306.02193	null
2023-06-02	Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images	Marcela Mera-Trujillo et.al.	2306.01938	null
2023-06-01	A Probabilistic Relaxation of the Two-Stage Object Pose Estimation Paradigm	Onur Beker et.al.	2306.00892	null
2023-05-30	Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection	Supeng Wang et.al.	2305.18714	link
2023-05-23	Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence	Grace Luo et.al.	2305.14334	null
2023-05-15	Non-Separable Multi-Dimensional Network Flows for Visual Computing	Viktoria Ehm et.al.	2305.08628	null
2023-05-13	Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance	Xinyu Lin et.al.	2305.07943	link
2023-05-05	HD2Reg: Hierarchical Descriptors and Detectors for Point Cloud Registration	Canhui Tang et.al.	2305.03487	link
2023-04-17	Human Pose Estimation in Monocular Omnidirectional Top-View Images	Jingrui Yu et.al.	2304.08186	null
2023-04-14	CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression	Mubariz Zaffar et.al.	2304.07426	null
2023-04-12	SiLK -- Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194	link
2023-04-06	From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection	Changsheng Lu et.al.	2304.03140	null
2023-03-29	NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud	Xiangyu Zhu et.al.	2303.16465	link
2023-03-24	PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View	Ze Shi et.al.	2303.14095	link
2023-03-23	Semantic Image Attack for Visual Model Diagnosis	Jinqi Luo et.al.	2303.13010	null
2023-03-22	Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation	Heng Yang et.al.	2303.12246	link
2023-03-21	RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network	Sangmin Yoo et.al.	2303.10770	null
2023-03-17	ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty	Vanessa Wirth et.al.	2303.10042	null
2023-03-15	Descriptor Distillation for Efficient Multi-Robot SLAM	Xiyue Guo et.al.	2303.08420	null
2023-03-15	From Local Binary Patterns to Pixel Difference Networks for Efficient Visual Representation Learning	Zhuo Su et.al.	2303.08414	null
2023-03-16	KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF Grasp Synthesis on RGB-D input	Yiye Chen et.al.	2303.05617	link
2023-03-07	External Camera-based Mobile Robot Pose Estimation for Collaborative Perception with Smart Edge Sensors	Simon Bultmann et.al.	2303.03797	null
2023-02-26	PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints Detection	Shenwei Xie et.al.	2302.13263	null
2023-02-24	Hybrid machine-learned homogenization: Bayesian data mining and convolutional neural networks	Julian Lißner et.al.	2302.12545	null
2023-02-21	Deep Reinforcement Learning Based on Local GNN for Goal-conditioned Deformable Object Rearranging	Yuhong Deng et.al.	2302.10446	null
2023-02-12	A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training	Jingnan Shi et.al.	2302.06019	null
2023-02-11	Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing	Zitong Yu et.al.	2302.05744	null
2023-02-09	MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection	Yuhe Ding et.al.	2302.04589	link
2023-02-03	Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation	Jie Yang et.al.	2302.01593	link
2023-02-03	Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization	Yingying Zhu et.al.	2302.01572	link
2023-01-21	Vision Aided Environment Semantics Extraction and Its Application in mmWave Beam Selection	Feiyang Wen et.al.	2301.08973	null
2023-01-18	OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models	Xingyi He et.al.	2301.07673	null
2023-01-12	Towards High Performance One-Stage Human Pose Estimation	Ling Li et.al.	2301.04842	null
2022-12-31	Rethinking Rotation Invariance with Point Cloud Registration	Jianhui Yu et.al.	2301.00149	null
2023-02-06	Fruit Ripeness Classification: a Survey	Matteo Rizzo et.al.	2212.14441	null
2022-12-28	NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action	Kuan-Chieh Wang et.al.	2212.13660	link
2022-12-24	HandsOff: Labeled Dataset Generation With No Additional Human Annotations	Austin Xu et.al.	2212.12645	null
2022-12-13	Learning to Detect Good Keypoints to Match Non-Rigid Objects in RGB Images	Welerson Melo et.al.	2212.09589	link
2022-12-15	Learning Markerless Robot-Depth Camera Calibration and End-Effector Pose Estimation	Bugra C. Sefercik et.al.	2212.07567	null
2023-02-01	DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization	Xiangyu Xu et.al.	2212.04575	null
2022-12-07	ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation	Yufei Xu et.al.	2212.04246	link
2022-12-15	Designing Feature Vector Representations: A case study from Chemistry	Signe Sidwall Thygesen et.al.	2212.03731	null
2022-12-09	DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model	Jeongjun Choi et.al.	2212.02796	link
2022-12-05	Images Speak in Images: A Generalist Painter for In-Context Visual Learning	Xinlong Wang et.al.	2212.02499	link
2022-12-06	R2FD2: Fast and Robust Matching of Multimodal Remote Sensing Image via Repeatable Feature Detector and Rotation-invariant Feature Descriptor	Bai Zhu et.al.	2212.02277	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069	link
2022-11-29	BALF: Simple and Efficient Blur Aware Local Feature Detector	Zhenjun Zhao et.al.	2211.14731	null
2022-11-21	Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching	Paul Roetzer et.al.	2211.11589	link
2022-11-07	Learning Feature Descriptors for Pre- and Intra-operative Point Cloud Matching for Laparoscopic Liver Registration	Zixin Yang et.al.	2211.03688	null
2022-10-31	Tree Detection and Diameter Estimation Based on Deep Learning	Vincent Grondin et.al.	2210.17424	link
2022-10-26	Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds	Zhiyuan Zhang et.al.	2210.14899	null
2022-10-23	Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders	Ömer Sümer et.al.	2210.12705	null
2022-10-21	Real-time Detection of 2D Tool Landmarks with Synthetic Training Data	Bram Vanherle et.al.	2210.11991	null
2022-10-09	Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning	Ali Safa et.al.	2210.04236	null
2022-10-04	Centroid Distance Keypoint Detector for Colored Point Clouds	Hanzhe Teng et.al.	2210.01298	link
2022-09-28	Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences	Jun-Jee Chao et.al.	2209.14419	null
2022-09-28	USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation	Zhengrong Xue et.al.	2209.13864	null
2022-10-16	Suture Thread Spline Reconstruction from Endoscopic Images for Robotic Surgery with Reliability-driven Keypoint Detection	Neelay Joglekar et.al.	2209.13657	link
2022-09-27	Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors	Hao Dong et.al.	2209.13586	link
2022-09-26	Performance Evaluation of 3D Keypoint Detectors and Descriptors on Coloured Point Clouds in Subsea Environments	Kyungmin Jung et.al.	2209.12881	null
2022-10-07	Long-Lived Accurate Keypoints in Event Streams	Philippe Chiberre et.al.	2209.10385	null
2022-09-20	Integrative Feature and Cost Aggregation with Transformers for Dense Correspondence	Sunghwan Hong et.al.	2209.08742	null
2022-09-15	Online Marker-free Extrinsic Camera Calibration using Person Keypoint Detections	Bastian Pätzold et.al.	2209.07393	link
2022-09-07	Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip	Yang Li et.al.	2209.03440	null
2022-08-27	Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes	Ali Safa et.al.	2208.12997	null
2022-08-24	Self-Supervised Endoscopic Image Key-Points Matching	Manel Farhat et.al.	2208.11424	link
2022-08-19	Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture	Muhammad Muzammel et.al.	2208.08224	null
2022-08-08	MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware Ambidextrous Bin Picking via Physics-based Metaverse Synthesis	Maximilian Gilles et.al.	2208.03963	null
2022-08-07	CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization	Yujiao Shi et.al.	2208.03660	null
2022-07-29	Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation	Qihao Liu et.al.	2208.00090	null
2022-07-25	Translating a Visual LEGO Manual to a Machine-Executable Plan	Ruocheng Wang et.al.	2207.12572	null
2022-07-21	Multi-modal Retinal Image Registration Using a Keypoint-Based Vessel Structure Aligning Network	Aline Sindel et.al.	2207.10506	null
2022-07-15	Human keypoint detection for close proximity human-robot interaction	Jan Docekal et.al.	2207.07742	null
2022-07-15	Adversarial Focal Loss: Asking Your Discriminator for Hard Examples	Chen Liu et.al.	2207.07739	null
2022-07-13	Rapid Person Re-Identification via Sub-space Consistency Regularization	Qingze Yin et.al.	2207.05933	null
2022-07-07	RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments	Qihao Peng et.al.	2207.03539	null
2022-08-15	Semi-supervised Human Pose Estimation in Art-historical Images	Matthias Springstein et.al.	2207.02976	link
2022-07-01	Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling	Jiamin Liang et.al.	2207.00474	null
2022-06-24	Motion Estimation for Large Displacements and Deformations	Qiao Chen et.al.	2206.12464	null
2022-06-24	Deep embedded clustering algorithm for clustering PACS repositories	Teo Manojlović et.al.	2206.12417	null
2022-06-21	KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences	Xuanhan Wang et.al.	2206.10090	link
2022-06-20	Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval	Guile Wu et.al.	2206.09806	null
2022-06-15	A Unified Sequence Interface for Vision Tasks	Ting Chen et.al.	2206.07669	link
2022-06-09	Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields	Mingtong Zhang et.al.	2206.04669	null
2022-06-03	SNAKE: Shape-aware Neural 3D Keypoint Field	Chengliang Zhong et.al.	2206.01724	link
2022-05-17	MulT: An End-to-End Multitask Learning Transformer	Deblina Bhattacharjee et.al.	2205.08303	null
2022-05-10	ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions In-the-Wild	Chirag Raman et.al.	2205.05177	link
2022-04-28	Polarimetric imaging for the detection of synthetic models of SARS-CoV-2: a proof of concept	Emilio Gomez-Gonzalez et.al.	2204.14050	null
2022-05-02	GRIT: General Robust Image Task Benchmark	Tanmay Gupta et.al.	2204.13653	link
2022-05-24	ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation	Yufei Xu et.al.	2204.12484	link
2022-04-26	Unified GCNs: Towards Connecting GCNs with CNNs	Ziyan Zhang et.al.	2204.12300	null
2022-04-19	Self-Supervised Equivariant Learning for Oriented Keypoint Detection	Jongmin Lee et.al.	2204.08613	link
2022-04-17	The Z-axis, X-axis, Weight and Disambiguation Methods for Constructing Local Reference Frame in 3D Registration: An Evaluation	Bao Zhao et.al.	2204.08024	null
2022-04-15	2D Human Pose Estimation: A Survey	Haoming Chen et.al.	2204.07370	null
2022-04-11	Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification	Haojie Liu et.al.	2204.04842	null
2022-04-07	Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification	Yanan Wang et.al.	2204.02611	link
2022-04-02	SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning	Nilaksh Das et.al.	2204.00734	link
2022-04-01	MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration	Chenzhong Gao et.al.	2204.00260	null
2022-03-29	Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning	David Howard et.al.	2203.15172	null
2022-03-28	REGTR: End-to-end Point Cloud Correspondences with Transformers	Zi Jian Yew et.al.	2203.14517	link
2022-03-27	UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection	Ye Liu et.al.	2203.12745	link
2022-03-21	MatchFormer: Interleaving Attention in Transformers for Feature Matching	Qing Wang et.al.	2203.09645	link
2022-03-16	PosePipe: Open-Source Human Pose Estimation Pipeline for Clinical Research	R. James Cotton et.al.	2203.08792	link
2022-03-11	DRTAM: Dual Rank-1 Tensor Attention Module	Hanxing Chi et.al.	2203.05893	null
2022-03-07	Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation	Meng Tian et.al.	2203.03498	null
2022-02-10	Motion-Aware Transformer For Occluded Person Re-identification	Mi Zhou et.al.	2202.04243	null
2022-02-03	Sim2Real Object-Centric Keypoint Detection and Description	Chengliang Zhong et.al.	2202.00448	null
2022-01-16	Cross-Centroid Ripple Pattern for Facial Expression Recognition	Monu Verma et.al.	2201.05958	null
2022-01-14	Reproducing BowNet: Learning Representations by Predicting Bags of Visual Words	Harry Nguyen et.al.	2201.03556	link
2022-01-10	TFS Recognition: Investigating MPH]{Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials	Jinnavat Sanalohit et.al.	2201.03170	null
2022-01-06	A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration	Aline Sindel et.al.	2201.02242	null
2021-12-28	Skin feature point tracking using deep feature encodings	Jose Ramon Chang et.al.	2112.14159	null
2021-12-23	Data-efficient learning for 3D mirror symmetry detection	Yancong Lin et.al.	2112.12579	null
2021-12-22	Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations -- combining input rotations and a kinematic model	Michael Zwölfer et.al.	2112.12193	null
2021-12-22	Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction	Henrique Siqueira et.al.	2112.12002	link
2021-12-19	Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection	Renjie Li et.al.	2112.10275	null
2021-12-19	GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor	Jean-Baptiste Carluer et.al.	2112.10258	link
2021-12-16	Masked Feature Prediction for Self-Supervised Visual Pre-Training	Chen Wei et.al.	2112.09133	link
2021-12-13	DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points	Zhengfei Kuang et.al.	2112.06910	null
2021-12-12	Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species	Changsheng Lu et.al.	2112.06183	link
2021-12-13	Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings	Mel Vecerik et.al.	2112.04910	null
2021-12-06	ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction	Xiaoming Zhao et.al.	2112.02906	link
2021-11-25	Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association	Sen Yang et.al.	2111.12892	link
2021-11-08	Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object Images	Jianfei Guo et.al.	2111.04237	null
2021-11-04	Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image	Feng Liu et.al.	2111.03098	null
2021-11-01	Learning Event-based Spatio-Temporal Feature Descriptors via Local Synaptic Plasticity: A Biologically-realistic Perspective of Computer Vision	Ali Safa et.al.	2111.00791	null
2021-10-30	Geometry-Aware Hierarchical Bayesian Learning on Manifolds	Yonghui Fan et.al.	2111.00184	null
2021-10-26	CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration	Hao Yu et.al.	2110.14076	link
2021-10-23	HWTool: Fully Automatic Mapping of an Extensible C++ Image Processing Language to Hardware	James Hegarty et.al.	2110.12106	null
2021-10-18	Keypoint-Based Bimanual Shaping of Deformable Linear Objects under Environmental Constraints using Hierarchical Action Planning	Shengzeng Huo et.al.	2110.08962	null
2021-10-11	High-order Tensor Pooling with Attention for Action Recognition	Piotr Koniusz et.al.	2110.05216	null
2021-10-10	Digging Into Self-Supervised Learning of Feature Descriptors	Iaroslav Melekhov et.al.	2110.04773	null
2021-10-04	BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion	Zhaoqun Li et.al.	2110.01179	link
2021-10-01	Machine learning aided noise filtration and signal classification for CREDO experiment	Łukasz Bibrzycki et.al.	2110.00297	null
2021-09-28	PDC-Net+: Enhanced Probabilistic Dense Correspondence Network	Prune Truong et.al.	2109.13912	link
2021-09-27	HarrisZ $^+$ : Harris Corner Selection for Next-Gen Image Matching Pipelines	Fabio Bellavia et.al.	2109.12925	null
2021-09-24	Catadioptric Stereo on a Smartphone	Kristijan Bartol et.al.	2109.11872	null
2021-09-20	Semi-supervised Dense Keypointsusing Unlabeled Multiview Images	Zhixuan Yu et.al.	2109.09299	null
2021-08-31	A Novel Dataset for Keypoint Detection of quadruped Animals from Images	Prianka Banik et.al.	2108.13958	link
2021-08-27	A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images	Xiaoteng Zhou et.al.	2108.12151	null

(back to top)

Image Matching

Publish Date	Title	Authors	PDF	Code
2025-02-28	CNSv2: Probabilistic Correspondence Encoded Neural Image Servo	Anzhe Chen et.al.	2503.00132	null
2025-02-27	A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization	Yejun Zhang et.al.	2502.20036	link
2025-02-27	RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges	Thibaut Loiseau et.al.	2502.19955	null
2025-02-26	BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure	Haoxin Cai et.al.	2502.19242	link
2025-02-25	PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching	Han Nie et.al.	2502.18104	link
2025-02-25	Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking	Xin Tong et.al.	2502.17766	null
2025-03-04	Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model	Yaxuan Huang et.al.	2502.16779	null
2025-02-16	FeaKM: Robust Collaborative Perception under Noisy Pose Conditions	Jiuwu Hao et.al.	2502.11003	link
2025-02-24	Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation	Emanuele Mule et.al.	2502.06288	link
2025-02-04	Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications	William O'Donnell et.al.	2502.02624	null
2025-02-01	MambaGlue: Fast and Robust Local Feature Matching With Mamba	Kihwan Ryoo et.al.	2502.00462	link
2025-01-24	Dense-SfM: Structure from Motion with Dense Consistent Matching	JongMin Lee et.al.	2501.14277	null
2025-01-20	MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching	Yepeng Liu et.al.	2501.11299	null
2025-01-13	MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training	Xingyi He et.al.	2501.07556	null
2025-01-13	Matching Free Depth Recovery from Structured Light	Zhuohang Yu et.al.	2501.07113	null
2025-01-02	Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views	Yulun Wu et.al.	2501.01196	null
2024-12-31	Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights	Bharath Kumar Agnur et.al.	2412.20210	null
2024-12-27	MINIMA: Modality Invariant Image Matching	Xingyu Jiang et.al.	2412.19412	link
2024-12-24	GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network	Xianfeng Song et.al.	2412.18221	link
2024-12-17	Bringing Multimodality to Amazon Visual Search System	Xinliang Zhu et.al.	2412.13364	null
2024-12-04	Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis	Siyoon Jin et.al.	2412.03150	null
2024-11-20	DT-LSD: Deformable Transformer-based Line Segment Detection	Sebastian Janampa et.al.	2411.13005	link
2024-11-15	Image Matching Filtering and Refinement by Planes and Beyond	Fabio Bellavia et.al.	2411.09484	link
2024-11-11	XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration	Ismail Can Yagmur et.al.	2411.07430	link
2024-11-07	The Impact of Semi-Supervised Learning on Line Segment Detection	Johanna Engman et.al.	2411.04596	link
2024-11-04	Silver medal Solution for Image Matching Challenge 2024	Yian Wang et.al.	2411.01851	null
2024-10-30	Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants	Azadeh Sharafi et.al.	2410.23329	null
2024-11-05	RelationBooth: Towards Relation-Aware Customized Object Generation	Qingyu Shi et.al.	2410.23280	null
2024-10-31	ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses	Junjie Ni et.al.	2410.22733	null
2024-10-30	LoFLAT: Local Feature Matching using Focused Linear Attention Transformer	Naijian Cao et.al.	2410.22710	null
2024-10-26	Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification	Yue Su et.al.	2410.20097	null
2024-10-01	A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference	Yuan Li et.al.	2410.11848	null
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505	null
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-09-27	Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras	Yipeng Lu et.al.	2409.18673	null
2024-09-25	Game4Loc: A UAV Geo-Localization Benchmark from Game Data	Yuxiang Ji et.al.	2409.16925	link
2024-09-24	Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge	Marek Wodzinski et.al.	2409.15931	null
2024-09-10	Weakly-supervised Camera Localization by Ground-to-satellite Image Registration	Yujiao Shi et.al.	2409.06471	link
2024-09-05	Enabling Practical and Privacy-Preserving Image Processing	Chao Wang et.al.	2409.03568	null
2024-09-20	A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering	Shuang Song et.al.	2409.03032	link
2024-08-29	Super-Resolution works for coastal simulations	Zhi-Song Liu et.al.	2408.16553	null
2024-09-15	Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks	Sierra Bonilla et.al.	2408.16445	link
2024-08-26	Affine steerers for structured keypoint description	Georg Bökman et.al.	2408.14186	link
2024-08-25	TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers	Chuanrui Zhang et.al.	2408.13770	null
2024-09-11	Coarse-to-fine Alignment Makes Better Speech-image Retrieval	Lifeng Zhou et.al.	2408.13119	null
2024-08-19	BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval	Zhenyu Lu et.al.	2408.10383	null
2024-08-14	RSD-DOG : A New Image Descriptor based on Second Order Derivatives	Darshan Venkatrayappa et.al.	2408.07687	null
2024-08-09	One Shot is Enough for Sequential Infrared Small Target Segmentation	Bingbing Dan et.al.	2408.04823	link
2024-08-07	PRISM: PRogressive dependency maxImization for Scale-invariant image Matching	Xudong Cai et.al.	2408.03598	null
2024-08-05	ConDL: Detector-Free Dense Image Matching	Monika Kwiatkowski et.al.	2408.02766	null
2024-08-04	Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image	Xinlin Ren et.al.	2408.02079	link
2024-07-29	Image-text matching for large-scale book collections	Artemis Llabrés et.al.	2407.19812	link
2024-07-26	PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis	Sohyeong Kim et.al.	2407.18695	null
2024-07-22	RADA: Robust and Accurate Feature Learning with Domain Adaptation	Jingtai He et.al.	2407.15791	null
2024-07-17	GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection	Jingwen Yu et.al.	2407.11736	link
2024-07-16	REMM:Rotation-Equivariant Framework for End-to-End Multimodal Image Matching	Han Nie et.al.	2407.11637	link
2024-07-16	A Self-Correcting Strategy of the Digital Volume Correlation Displacement Field Based on Image Matching: Application to Poor Speckles Quality and Complex-Large Deformation	Chengsheng Li et.al.	2407.11287	null
2024-07-14	Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching	Xiaoyong Lu et.al.	2407.07789	null
2024-07-10	Mutual Information calculation on different appearances	Jiecheng Liao et.al.	2407.07410	null
2024-07-15	SfM on-the-fly: Get better 3D from What You Capture	Zongqian Zhan et.al.	2407.03939	null
2024-07-03	IMC 2024 Methods & Solutions Review	Shyam Gupta et.al.	2407.03172	null
2024-06-21	High Resolution Surface Reconstruction of Cultural Heritage Objects Using Shape from Polarization Method	F. S. Mortazavi et.al.	2406.15121	null
2024-06-16	Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models	Yikai Zhang et.al.	2406.10902	link
2024-06-14	Grounding Image Matching in 3D with MASt3R	Vincent Leroy et.al.	2406.09756	link
2024-06-05	A Self-Supervised Denoising Strategy for Underwater Acoustic Camera Imageries	Xiaoteng Zhou et.al.	2406.02914	null
2024-05-22	Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching	Hongkai Chen et.al.	2405.13874	null
2024-05-21	OmniGlue: Generalizable Feature Matching with Foundation Model Guidance	Hanwen Jiang et.al.	2405.12979	link
2024-07-09	Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation	Rezkellah Noureddine Khiati et.al.	2405.08556	link
2024-05-14	TP3M: Transformer-based Pseudo 3D Image Matching with Reference	Liming Han et.al.	2405.08434	null
2024-05-13	Authentic Hand Avatar from a Phone Scan via Universal Hand Model	Gyeongsik Moon et.al.	2405.07933	null
2024-04-30	A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images	Wang Zhang et.al.	2404.19311	null
2024-04-30	XFeat: Accelerated Features for Lightweight Image Matching	Guilherme Potje et.al.	2404.19174	null
2024-06-10	MinBackProp -- Backpropagating through Minimal Solvers	Diana Sungatullina et.al.	2404.17993	link
2024-04-25	Transformer-Based Local Feature Matching for Multimodal Image Registration	Remi Delaunay et.al.	2404.16802	null
2024-04-23	FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction	Hang Hua et.al.	2404.14715	null
2024-04-22	Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer	Eric Brachmann et.al.	2404.14351	null
2024-04-17	A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching	Francesco Pro et.al.	2404.11302	link
2024-04-16	Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction	John Francis et.al.	2404.10626	null
2024-04-15	XoFTR: Cross-modal Feature Matching Transformer	Önder Tuzcuoğlu et.al.	2404.09692	link
2024-04-13	DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector	Johan Edstedt et.al.	2404.08928	link
2024-04-09	Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences	Axel Barroso-Laguna et.al.	2404.06337	link
2024-04-01	Marrying NeRF with Feature Matching for One-step Pose Estimation	Ronghan Chen et.al.	2404.00891	null
2024-04-01	3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching	Yibin Ye et.al.	2404.00838	null
2024-03-31	On the Estimation of Image-matching Uncertainty in Visual Place Recognition	Mubariz Zaffar et.al.	2404.00546	null
2024-03-30	Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation	Yuan Wang et.al.	2404.00262	null
2024-03-26	Staircase Localization for Autonomous Exploration in Urban Environments	Jinrae Kim et.al.	2403.17330	null
2024-03-23	MatchSeg: Towards Better Segmentation via Reference Image Matching	Ruiqiang Xiao et.al.	2403.15901	link
2024-03-20	Unifying Local and Global Multimodal Features for Place Recognition in Aliased and Low-Texture Environments	Alberto García-Hernández et.al.	2403.13395	link
2024-03-19	HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free Matching	Ying Chen et.al.	2403.12543	null
2024-03-16	Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval	Shunsuke Tsubaki et.al.	2403.10756	null
2024-03-16	Vector search with small radiuses	Gergely Szilvasy et.al.	2403.10746	null
2024-03-15	Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline	Fangming Yuan et.al.	2403.10283	null
2024-03-15	Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning	Meixuan Li et.al.	2403.10252	null
2024-03-14	Virtual birefringence imaging and histological staining of amyloid deposits in label-free tissue using autofluorescence microscopy and deep learning	Xilin Yang et.al.	2403.09100	null
2024-03-18	Matching Non-Identical Objects	Yusuke Marumo et.al.	2403.08227	null
2024-03-11	Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed	Yifan Wang et.al.	2403.04765	null
2024-03-07	Scene Depth Estimation from Traditional Oriental Landscape Paintings	Sungho Kang et.al.	2403.03408	null
2024-02-21	Visual Style Prompting with Swapping Self-Attention	Jaeseok Jeong et.al.	2402.12974	link
2024-02-16	GIM: Learning Generalizable Image Matcher From Internet Videos	Xuelun Shen et.al.	2402.11095	link
2024-02-13	Are Semi-Dense Detector-Free Methods Good at Matching Local Features?	Matthieu Vilain et.al.	2402.08671	null
2024-02-13	Learning to Produce Semi-dense Correspondences for Visual Localization	Khang Truong Giang et.al.	2402.08359	link
2024-01-31	Improved Scene Landmark Detection for Camera Localization	Tien Do et.al.	2401.18083	link
2024-03-11	Local Feature Matching Using Deep Learning: A Survey	Shibiao Xu et.al.	2401.17592	link
2024-01-24	Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry	Qi Cai et.al.	2401.13357	null
2024-01-19	SCENES: Subpixel Correspondence Estimation With Epipolar Supervision	Dominik A. Kloepfer et.al.	2401.10886	null
2024-01-18	Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation	Songhe Deng et.al.	2401.09883	link
2024-01-26	RomniStereo: Recurrent Omnidirectional Stereo Matching	Hualie Jiang et.al.	2401.04345	link
2024-01-05	CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs	Daoan Zhang et.al.	2401.02582	null
2024-01-03	Local Adaptive Clustering Based Image Matching for Automatic Visual Identification	Zhizhen Wang et.al.	2401.01720	null
2024-01-03	A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization	Shishen Li et.al.	2401.01574	null
2023-12-23	BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation	Tavis Shore et.al.	2312.15363	link
2023-12-22	Harnessing Diffusion Models for Visual Perception with Meta Prompts	Qiang Wan et.al.	2312.14733	link
2024-01-05	MatchDet: A Collaborative Framework for Image Matching and Object Detection	Jinxiang Lai et.al.	2312.10983	null
2023-12-07	Visual Geometry Grounded Deep Structure From Motion	Jianyuan Wang et.al.	2312.04563	null
2023-12-04	Steerers: A framework for rotation equivariant keypoint descriptors	Georg Bökman et.al.	2312.02152	link
2023-11-30	DSeg: Direct Line Segments Detection	Berger Cyrille et.al.	2311.18344	null
2023-11-30	Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications	Sahar Almahfouz Nasser et.al.	2311.18281	null
2023-11-29	LGFCTR: Local and Global Feature Convolutional Transformer for Image Matching	Wenhao Zhong et.al.	2311.17571	link
2023-11-08	Zero-shot Translation of Attention Patterns in VQA Models to Natural Language	Leonard Salewski et.al.	2311.05043	link
2023-11-06	An invariant feature extraction for multi-modal images matching	Chenzhong Gao et.al.	2311.02842	null
2023-10-23	RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments	Jinyu Li et.al.	2310.15072	link
2023-10-23	Player Re-Identification Using Body Part Appearences	Mahesh Bhosale et.al.	2310.14469	null
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605	null
2023-11-14	RGM: A Robust Generalist Matching Model	Songyan Zhang et.al.	2310.11755	link
2023-10-07	UFD-PRiME: Unsupervised Joint Learning of Optical Flow and Stereo Depth through Pixel-Level Rigid Motion Estimation	Shuai Yuan et.al.	2310.04712	null
2023-10-02	Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images	Georg Bökman et.al.	2310.01092	null
2023-09-29	Segment Anything Model is a Good Teacher for Local Feature Learning	Jingqian Wu et.al.	2309.16992	link
2023-09-27	KDD-LOAM: Jointly Learned Keypoint Detector and Descriptors Assisted LiDAR Odometry and Mapping	Renlang Huang et.al.	2309.15394	null
2023-10-13	A Critical Analysis of Internal Reliability for Uncertainty Quantification of Dense Image Matching in Multi-view Stereo	Debao Huang et.al.	2309.09379	null
2023-09-11	Towards Content-based Pixel Retrieval in Revisited Oxford and Paris	Guoyuan An et.al.	2309.05438	link
2023-09-09	Neural Semantic Surface Maps	Luca Morreale et.al.	2309.04836	null
2023-09-05	Doppelgangers: Learning to Disambiguate Images of Similar Structures	Ruojin Cai et.al.	2309.02420	link
2023-08-14	Occ $^2$ Net: Robust Image Matching Based on 3D Occupancy Estimation for Occluded Regions	Miao Fan et.al.	2308.16160	null
2023-08-29	TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching	Yun Liao et.al.	2308.15144	null
2023-08-27	LDL: Line Distance Functions for Panoramic Localization	Junho Kim et.al.	2308.13989	link
2023-08-22	Scene-Aware Feature Matching	Xiaoyong Lu et.al.	2308.09949	null
2023-09-03	DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching	Johan Edstedt et.al.	2308.08479	link
2023-08-19	Global Features are All You Need for Image Retrieval and Reranking	Shihao Shao et.al.	2308.06954	link
2023-08-02	ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation	Bo Zhang et.al.	2308.00400	link
2023-07-28	Cross-Modal Concept Learning and Inference for Vision-Language Models	Yi Zhang et.al.	2307.15460	null
2023-07-22	CryptoMask : Privacy-preserving Face Recognition	Jianli Bai et.al.	2307.12010	null
2023-07-22	A Stronger Stitching Algorithm for Fisheye Images based on Deblurring and Registration	Jing Hao et.al.	2307.11997	null
2023-07-21	Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data	Sahar Almahfouz Nasser et.al.	2307.10698	link
2023-08-08	Balancing Privacy and Progress in Artificial Intelligence: Anonymization in Histopathology for Biomedical Research and Education	Neel Kanwal et.al.	2307.09426	null
2023-08-01	Unsupervised Deep Graph Matching Based on Cycle Consistency	Siddharth Tourani et.al.	2307.08930	link
2023-07-15	Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents	Ke Cao et.al.	2307.07763	null
2023-07-09	Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion	Jie S. Li et.al.	2307.05564	null
2023-07-11	ResMatch: Residual Attention Learning for Local Feature Matching	Yuxin Deng et.al.	2307.05180	link
2023-07-11	TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation	Paul Grimal et.al.	2307.05134	link
2023-07-02	TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching	Khang Truong Giang et.al.	2307.00485	link
2023-06-27	Detector-Free Structure from Motion	Xingyi He et.al.	2306.15669	link
2023-06-28	PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment	Jianyuan Wang et.al.	2306.15667	null
2023-06-25	Enhancing Dynamic Image Advertising with Vision-Language Pre-training	Zhoufutu Wen et.al.	2306.14112	null
2023-06-23	LightGlue: Local Feature Matching at Light Speed	Philipp Lindenberger et.al.	2306.13643	link
2023-06-19	Graph Self-Supervised Learning for Endoscopic Image Matching	Manel Farhat et.al.	2306.11141	link
2023-06-09	Leaving the Lines Behind: Vision-Based Crop Row Exit for Agricultural Robot Navigation	Rajitha de Silva et.al.	2306.05869	null
2023-06-07	A2B: Anchor to Barycentric Coordinate for Robust Correspondence	Weiyue Zhao et.al.	2306.02760	null
2023-05-27	Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation	Yueh-Cheng Huang et.al.	2305.17463	null
2023-05-19	SIDAR: Synthetic Image Dataset for Alignment & Restoration	Monika Kwiatkowski et.al.	2305.12036	link
2023-05-18	LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation	Yujie Lu et.al.	2305.11116	link
2023-05-16	A Method for Training-free Person Image Picture Generation	Tianyu Chen et.al.	2305.09817	null
2023-05-15	Image Matching by Bare Homography	Fabio Bellavia et.al.	2305.08946	null
2023-05-12	CLIP-Count: Towards Text-Guided Zero-Shot Object Counting	Ruixiang Jiang et.al.	2305.07304	link
2023-05-10	SENDD: Sparse Efficient Neural Depth and Deformation for Tissue Tracking	Adam Schmidt et.al.	2305.06477	null
2023-05-10	Level-line Guided Edge Drawing for Robust Line Segment Detection	Xinyu Lin et.al.	2305.05883	link
2023-05-09	ColonMapper: topological mapping and localization for colonoscopy	Javier Morlana et.al.	2305.05546	null
2023-04-29	A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges	Xinyu Lin et.al.	2305.00264	link
2023-04-28	SFD2: Semantic-guided Feature Detection and Description	Fei Xue et.al.	2304.14845	link
2023-04-17	DeepSim-Nets: Deep Similarity Networks for Stereo Image Matching	Mohamed Ali Chebbi et.al.	2304.08056	link
2023-04-16	Long-term Visual Localization with Mobile Sensors	Shen Yan et.al.	2304.07691	null
2023-04-12	SiLK -- Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194	link
2023-04-16	ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation	Xiaoming Zhao et.al.	2304.03608	link
2023-04-04	GlueStick: Robust Image Matching by Sticking Points and Lines Together	Rémi Pautrat et.al.	2304.02008	link
2023-04-03	PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching	Pedro Castro et.al.	2304.01382	null
2023-04-02	Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints	Guilherme Potje et.al.	2304.00583	link
2023-04-13	Structured Epipolar Matcher for Local Feature Matching	Jiahao Chang et.al.	2303.16646	null
2023-03-29	Adaptive Spot-Guided Transformer for Consistent Local Feature Matching	Jiahuan Yu et.al.	2303.16624	null
2023-03-28	ASIC: Aligning Sparse in-the-wild Image Collections	Kamal Gupta et.al.	2303.16201	null
2023-03-25	Learning Rotation-Equivariant Features for Visual Correspondence	Jongmin Lee et.al.	2303.15472	null
2023-03-27	Learnable Graph Matching: A Practical Paradigm for Data Association	Jiawei He et.al.	2303.15414	link
2023-03-24	Efficient and Accurate Co-Visible Region Localization with Matching Key-Points Crop (MKPC): A Two-Stage Pipeline for Enhancing Image Matching Performance	Hongjian Song et.al.	2303.13794	null
2023-03-15	Rethinking Optical Flow from Geometric Matching Consistent Perspective	Qiaole Dong et.al.	2303.08384	link
2023-04-04	PATS: Patch Area Transportation with Subdivision for Local Feature Matching	Junjie Ni et.al.	2303.07700	null
2023-03-07	Parsing Line Segments of Floor Plan Images Using Graph Neural Networks	Mingxiang Chen et.al.	2303.03851	null
2023-03-06	Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints	Chenjie Cao et.al.	2303.02885	link
2023-03-10	ParaFormer: Parallel Attention Transformer for Efficient Feature Matching	Xiaoyong Lu et.al.	2303.00941	null
2023-03-01	RIFT2: Speeding-up RIFT with A New Rotation-Invariance Technique	Jiayuan Li et.al.	2303.00319	link
2023-02-28	Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images	Zhongli Fan et.al.	2302.14239	link
2023-02-25	BrainCLIP: Bridging Brain and Visual-Linguistic Representation via CLIP for Generic Natural Visual Stimulus Decoding from fMRI	Yulong Liu et.al.	2302.12971	**[link](https://github.com/Yulon

Name		Name	Last commit message	Last commit date
Latest commit History 2,218 Commits
.github		.github
assets		assets
docs		docs
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
daily_arxiv.py		daily_arxiv.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Updated on 2025.03.06

SLAM

SFM

Visual Localization

Keypoint Detection

Image Matching

About

Releases

Packages

Contributors 2

Languages

License

Vincentqyw/cv-arxiv-daily

Folders and files

Latest commit

History

Repository files navigation

Updated on 2025.03.06

SLAM

SFM

Visual Localization

Keypoint Detection

Image Matching

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages