CV Arxiv Daily

Updated on 2025.08.20

Usage instructions: here

SLAM

Publish Date	Title	Authors	PDF	Code
2025-07-23	Physics-based Human Pose Estimation from a Single Moving RGB Camera	Ayce Idil Aytekin et.al.	2507.17406	null
2025-07-23	CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance	Peiqi Chen et.al.	2507.17312	null
2025-07-21	DiffPF: Differentiable Particle Filtering with Generative Sampling via Conditional Diffusion Models	Ziyu Wan et.al.	2507.15716	null
2025-07-21	Dense-depth map guided deep Lidar-Visual Odometry with Sparse Point Clouds and Images	JunYing Huang et.al.	2507.15496	null
2025-07-21	All-UWB SLAM Using UWB Radar and UWB AOA	Charith Premachandra et.al.	2507.15474	null
2025-07-21	BenchDepth: Are We on the Right Way to Evaluate Depth Foundation Models?	Zhenyu Li et.al.	2507.15321	null
2025-07-20	LoopNet: A Multitasking Few-Shot Learning Approach for Loop Closure in Large Scale SLAM	Mohammad-Maher Nakshbandi et.al.	2507.15109	null
2025-07-19	Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey	Jiahui Zhang et.al.	2507.14501	null
2025-07-17	DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model	Maulana Bisyir Azhari et.al.	2507.13145	null
2025-07-17	MoCap2GT: A High-Precision Ground Truth Estimator for SLAM Benchmarking Based on Motion Capture and IMU Fusion	Zichao Shu et.al.	2507.12920	null
2025-07-17	Next-Gen Museum Guides: Autonomous Navigation and Visitor Interaction with an Agentic Robot	Luca Garello et.al.	2507.12273	null
2025-07-16	Tree-SLAM: semantic object SLAM for efficient mapping of individual trees in orchards	David Rapado-Rincon et.al.	2507.12093	null
2025-07-11	Towards Robust Sensor-Fusion Ground SLAM: A Comprehensive Benchmark and A Resilient Framework	Deteng Zhang et.al.	2507.08364	null
2025-07-10	Hardware-Aware Feature Extraction Quantisation for Real-Time Visual Odometry on FPGA Platforms	Mateusz Wasala et.al.	2507.07903	null
2025-07-10	IRAF-SLAM: An Illumination-Robust and Adaptive Feature-Culling Front-End for Visual SLAM in Challenging Environments	Thanh Nguyen Canh et.al.	2507.07752	null
2025-07-09	g2o vs. Ceres: Optimizing Scan Matching in Cartographer SLAM	Quanjie Qiu et.al.	2507.07142	null
2025-07-08	Mapping the Catacombs: An Underwater Cave Segment of the Devil’s Eye System	Michalis Chatzispyrou et.al.	2507.06397	null
2025-07-08	Cooperative Mapping, Localization, and Beam Management via Multi-Modal SLAM in ISAC Systems	Hang Que et.al.	2507.05718	null
2025-07-07	Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR	Tao Du et.al.	2507.04662	null
2025-07-06	Lidar Variability: A Novel Dataset and Comparative Study of Solid-State and Spinning Lidars	Doumegna Mawuto Koudjo Felix et.al.	2507.04321	null
2025-07-09	Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM	Xiaolei Lang et.al.	2507.04004	null
2025-07-04	Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps	Chong Cheng et.al.	2507.03737	null
2025-07-01	RaGNNarok: A Light-Weight Graph Neural Network for Enhancing Radar Point Clouds on Unmanned Ground Vehicles	David Hunt et.al.	2507.00937	null
2025-07-01	Generation of Indoor Open Street Maps for Robot Navigation from CAD Files	Jiajie Zhang et.al.	2507.00552	null
2025-06-30	VOCAL: Visual Odometry via ContrAstive Learning	Chi-Yao Huang et.al.	2507.00243	null
2025-06-29	TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints	Zhen Tan et.al.	2506.23207	null
2025-06-29	Event-based Stereo Visual-Inertial Odometry with Voxel Map	Zhaoxing Zhang et.al.	2506.23078	null
2025-06-26	Adaptive Multipath-Based SLAM for Distributed MIMO Systems	Xuhong Li et.al.	2506.21798	null
2025-06-24	Ark: An Open-source Python-based Framework for Robot Learning	Magnus Dierking et.al.	2506.21628	null
2025-06-26	EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting	Taoyu Wu et.al.	2506.21420	null
2025-06-26	CURL-SLAM: Continuous and Compact LiDAR Mapping	Kaicheng Zhang et.al.	2506.21077	null
2025-06-25	SPARK: Graph-Based Online Semantic Integration System for Robot Task Planning	Mimo Shirasaka et.al.	2506.20394	null
2025-06-25	Real-Time Obstacle Avoidance Algorithms for Unmanned Aerial and Ground Vehicles	Jingwen Wei et.al.	2506.20311	null
2025-06-24	Posterior Cramér-Rao Bounds on Localization and Mapping Errors in Distributed MIMO SLAM	Benjamin J. B. Deutschmann et.al.	2506.19957	null
2025-06-23	GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM	Annika Thomas et.al.	2506.18885	null
2025-06-23	MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation	Tianchen Deng et.al.	2506.18678	null
2025-06-24	Multimodal Fusion SLAM with Fourier Attention	Youjie Zhou et.al.	2506.18204	null
2025-06-22	ADA-DPM: A Neural Descriptors-based Adaptive Noise Point Filtering Strategy for SLAM	Yongxin Shao et.al.	2506.18016	null
2025-06-21	Optimizing Exploration with a New Uncertainty Framework for Active SLAM Systems	Sebastian Sansoni et.al.	2506.17775	null
2025-06-18	MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System	Miaoxin Pan et.al.	2506.15402	null
2025-06-24	RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories	Qingsong Yan et.al.	2506.15242	null
2025-06-18	SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization	Hanjun Kim et.al.	2506.15175	null
2025-06-18	VIMS: A Visual-Inertial-Magnetic-Sonar SLAM System in Underwater Environments	Bingbing Zhang et.al.	2506.15126	null
2025-06-16	Slanted light-sheet array microscopy for large volume imaging at rates exceeding 100 Hz	Kai Long et.al.	2506.13664	null
2025-06-16	Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots	Jaehong Oh et.al.	2506.13149	null
2025-06-16	A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method	Zhanhua Xin et.al.	2506.13100	null
2025-06-16	SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure	Shahram Najam Syed et.al.	2506.13089	link
2025-06-12	LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System	Hongbeen Park et.al.	2506.10567	null
2025-06-11	VAULT: A Mobile Mapping System for ROS 2-based Autonomous Robots	Miguel Á. González-Santamarta et.al.	2506.09583	null
2025-06-10	UFM: A Simple Path towards Unified Dense Correspondence with Flow	Yuchen Zhang et.al.	2506.09278	null
2025-06-10	Princeton365: A Diverse Dataset with Accurate Camera Pose	Karhan Kayan et.al.	2506.09035	null
2025-06-10	Planar Collisionless Shock Simulations with Semi-Implicit Particle-in-Cell Model FLEKS	Hongyang Zhou et.al.	2506.08384	null
2025-06-09	ZeroVO: Visual Odometry with Minimal Assumptions	Lei Lai et.al.	2506.08005	null
2025-06-08	Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs	Qiong Chang et.al.	2506.07164	null
2025-06-08	UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment	Wentao Zhao et.al.	2506.07013	null
2025-06-06	GS4: Generalizable Sparse Splatting Semantic SLAM	Mingqi Jiang et.al.	2506.06517	null
2025-06-06	Enhancing Situational Awareness in Underwater Robotics with Multi-modal Spatial Perception	Pushyami Kaveti et.al.	2506.06476	null
2025-06-06	Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments	Mingrui Li et.al.	2506.05965	null
2025-06-06	Analysis of points outcome in ATP Grand Slam Tennis using big data and machine learning	Martin Illum et.al.	2506.05866	null
2025-06-05	On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images	Andreas Meuleman et.al.	2506.05558	null
2025-06-05	Deep Learning Reforms Image Matching: A Survey and Outlook	Shihua Zhang et.al.	2506.04619	null
2025-06-04	cuVSLAM: CUDA accelerated visual odometry	Alexander Korovko et.al.	2506.04359	link
2025-06-04	Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset	Zirui Wang et.al.	2506.04224	null
2025-06-03	LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM	Roman Titkov et.al.	2506.03073	null
2025-06-03	Online Performance Assessment of Multi-Source-Localization for Autonomous Driving Systems Using Subjective Logic	Stefan Orf et.al.	2506.02932	null
2025-06-03	VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians	Pengchong Hu et.al.	2506.02741	null
2025-06-03	GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal	Shufan Qing et.al.	2506.02736	link
2025-06-03	Olfactory Inertial Odometry: Methodology for Effective Robot Navigation by Scent	Kordel K. France et.al.	2506.02373	null
2025-06-01	Globally Consistent RGB-D SLAM with 2D Gaussian Splatting	Xingguang Zhong et.al.	2506.00970	link
2025-05-30	Black-box Adversarial Attacks on CNN-based SLAM Algorithms	Maria Rafaela Gkeka et.al.	2505.24654	null
2025-05-28	Semantic Exploration and Dense Mapping of Complex Environments using Ground Robots Equipped with LiDAR and Panoramic Camera	Xiaoyang Zhan et.al.	2505.22880	null
2025-05-28	4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians	Hidenobu Matsuki et.al.	2505.22859	null
2025-05-28	UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments	Wancai Zheng et.al.	2505.22335	null
2025-05-27	HS-SLAM: A Fast and Hybrid Strategy-Based SLAM Approach for Low-Speed Autonomous Driving	Bingxiang Kang et.al.	2505.20906	null
2025-05-27	ProBA: Probabilistic Bundle Adjustment with the Bhattacharyya Coefficient	Jason Chui et.al.	2505.20858	null
2025-05-26	ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting	Wenhua Wu et.al.	2505.19420	null
2025-05-25	VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes	Tianchen Deng et.al.	2505.18992	link
2025-05-23	CU-Multi: A Dataset for Multi-Robot Data Association	Doncey Albin et.al.	2505.17576	null
2025-05-22	TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition	Oliver Grainge et.al.	2505.16447	null
2025-05-20	A Methodological Framework for Measuring Spatial Labeling Similarity	Yihang Du et.al.	2505.14128	link
2025-05-22	Place Recognition: A Comprehensive Review, Current Challenges and Future Directions	Zhenyu Li et.al.	2505.14068	link
2025-05-19	eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks	Jad Mansour et.al.	2505.13309	null
2025-05-23	VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold	Dominic Maggio et.al.	2505.12549	null
2025-05-18	Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey	Calvin Galagain et.al.	2505.12384	null
2025-05-18	Structureless VIO	Junlin Song et.al.	2505.12337	null
2025-05-16	EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video	Ryan Hoque et.al.	2505.11709	null
2025-05-16	Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization	Aaron Wilhelm et.al.	2505.11620	null
2025-05-16	Robust 2D lidar-based SLAM in arboreal environments without IMU/GNSS	Paola Nazate-Burgos et.al.	2505.10847	null
2025-05-15	TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation	Manthan Patel et.al.	2505.10696	null
2025-05-15	A hybrid SLAM-Payne framework for atmospheric parameter and abundance determination of early-type Stars from LAMOST DR9 low-resolution Spectra	Weijia Sun et.al.	2505.10310	null
2025-05-15	Large-Scale Gaussian Splatting SLAM	Zhe Xin et.al.	2505.09915	null
2025-05-13	Automated Meta Prompt Engineering for Alignment with the Theory of Mind	Aaron Baughman et.al.	2505.09024	null
2025-05-13	MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM	Saqi Hussain Kalan et.al.	2505.08388	null
2025-05-13	SKiD-SLAM: Robust, Lightweight, and Distributed Multi-Robot LiDAR SLAM in Resource-Constrained Field Environments	Hogyun Kim et.al.	2505.08230	null
2025-05-12	RDD: Robust Feature Detector and Descriptor using Deformable Transformer	Gonglin Chen et.al.	2505.08013	null
2025-05-12	Ranking-aware Continual Learning for LiDAR Place Recognition	Xufei Wang et.al.	2505.07198	null
2025-05-07	Scalable Aerial GNSS Localization for Marine Robots	Shuo Wen et.al.	2505.04095	link
2025-05-06	Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions	Lukas Schichler et.al.	2505.03565	null
2025-05-06	AquaticVision: Benchmarking Visual SLAM in Underwater Environment with Events and Frames	Yifan Peng et.al.	2505.03448	null
2025-05-06	LiftFeat: 3D Geometry-Aware Local Feature Matching	Yepeng Liu et.al.	2505.03422	link
2025-05-05	LiDAR-Inertial SLAM-Based Navigation and Safety-Oriented AI-Driven Control System for Skid-Steer Robots	Mehdi Heydari Shahna et.al.	2505.02598	null
2025-05-04	Robust Localization, Mapping, and Navigation for Quadruped Robots	Dyuman Aditya et.al.	2505.02272	null
2025-05-04	SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment	Ganesh Sapkota et.al.	2505.01956	null
2025-05-03	GauS-SLAM: Dense RGB-D SLAM with Gaussian Surfels	Yongxin Su et.al.	2505.01934	null
2025-05-02	Tightly Coupled Range Inertial Odometry and Mapping with Exact Point Cloud Downsampling	Kenji Koide et.al.	2505.01017	null
2025-04-30	An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation	Yaming Ou et.al.	2504.21826	null
2025-04-30	eNCApsulate: NCA for Precision Diagnosis on Capsule Endoscopes	Henry John Krumb et.al.	2504.21562	null
2025-04-29	Large-scale visual SLAM for in-the-wild videos	Shuo Sun et.al.	2504.20496	null
2025-04-28	Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM	Leon Davies et.al.	2504.19654	null
2025-04-28	GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM	Leon Davies et.al.	2504.19653	null
2025-04-28	GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field	Zuxing Lu et.al.	2504.19409	null
2025-04-27	Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users	Apurv Varshney et.al.	2504.19345	null
2025-04-27	NANO-SLAM : Natural Gradient Gaussian Approximation for Vehicle SLAM	Tianyi Zhang et.al.	2504.19195	null
2025-04-27	MISO: Multiresolution Submap Optimization for Efficient Globally Consistent Neural Implicit Reconstruction	Yulun Tian et.al.	2504.19104	null
2025-04-25	Certifiably-Correct Mapping for Safe Navigation Despite Odometry Drift	Devansh R. Agrawal et.al.	2504.18713	null
2025-04-25	Range-based 6-DoF Monte Carlo SLAM with Gradient-guided Particle Filter on GPU	Takumi Nakao et.al.	2504.18056	null
2025-04-24	Autonomous Navigation Of Quadrupeds Using Coverage Path Planning	Alexander James Becoy et.al.	2504.17880	null
2025-04-22	SmallGS: Gaussian Splatting-based Camera Pose Estimation for Small-Baseline Videos	Yuxin Yao et.al.	2504.17810	null
2025-04-24	BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring	Asier Bikandi et.al.	2504.17693	null
2025-04-24	Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images	Zebo Huang et.al.	2504.17582	null
2025-04-24	Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization	Guangyang Zeng et.al.	2504.17410	null
2025-04-24	EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy	Haodi Yao et.al.	2504.17280	null
2025-04-23	ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration	Andrea Conti et.al.	2504.16545	null
2025-04-22	DERD-Net: Learning Depth from Event-based Ray Densities	Diego de Oliveira Hitzges et.al.	2504.15863	null
2025-04-23	SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems	Abhishek Tyagi et.al.	2504.15305	null
2025-04-20	Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction	Weirong Chen et.al.	2504.14516	null
2025-04-20	SG-Reg: Generalizable and Efficient Scene Graph Registration	Chuhao Liu et.al.	2504.14440	link
2025-04-19	Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering	Jonathan Embley-Riches et.al.	2504.14135	null
2025-04-21	SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM	Samuel Cerezo et.al.	2504.13713	link
2025-04-16	An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World	Xingwu Ji et.al.	2504.11698	link
2025-04-18	Doppler-SLAM: Doppler-Aided Radar-Inertial and LiDAR-Inertial Simultaneous Localization and Mapping	Dong Wang et.al.	2504.11634	link
2025-04-14	Region Based SLAM-Aware Exploration: Efficient and Robust Autonomous Mapping Strategy That Can Scale	Megha Maheshwari et.al.	2504.10416	null
2025-04-14	RoboCup Rescue 2025 Team Description Paper UruBots	Kevin Farias et.al.	2504.09778	null
2025-04-11	FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment	Sebastián Barbas Laina et.al.	2504.08603	null
2025-04-11	PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection	Xiong Li et.al.	2504.08280	null
2025-04-11	II-NVM: Enhancing Map Accuracy and Consistency with Normal Vector-Assisted Mapping	Chengwei Zhao et.al.	2504.08204	link
2025-04-10	UWB Anchor Based Localization of a Planetary Rover	Andreas Nüchter et.al.	2504.07658	null
2025-04-10	Event Signal Filtering via Probability Flux Estimation	Jinze Chen et.al.	2504.07503	null
2025-04-07	Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM	Zhicong Sun et.al.	2504.04844	link
2025-04-06	SELC: Self-Supervised Efficient Local Correspondence Learning for Low Quality Images	Yuqing Wang et.al.	2504.04497	null
2025-04-06	VSLAM-LAB: A Comprehensive Framework for Visual SLAM Methods and Datasets	Alejandro Fontan et.al.	2504.04457	link
2025-04-05	Nonlinear Observer Design for Landmark-Inertial Simultaneous Localization and Mapping	Mouaad Boughellaba et.al.	2504.04239	null
2025-04-04	WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments	Jianhao Zheng et.al.	2504.03886	null
2025-04-03	SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections	Prashant Kumar et.al.	2504.03089	null
2025-04-03	Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision	Xiaofeng Han et.al.	2504.02477	null
2025-04-03	MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM	Renwu Li et.al.	2504.02437	null
2025-04-02	A Chefs KISS – Utilizing semantic information in both ICP and SLAM framework	Sven Ochs et.al.	2504.02086	null
2025-04-01	Semantic SLAM with Rolling-Shutter Cameras and Low-Precision INS in Outdoor Environments	Yuchen Zhang et.al.	2504.01997	null
2025-04-02	Strengthening Multi-Robot Systems for SAR: Co-Designing Robotics and Communication Towards 6G	Juan Bravo-Arrabal et.al.	2504.01940	null
2025-04-02	Dynamic Initialization for LiDAR-inertial SLAM	Jie Xu et.al.	2504.01451	link
2025-04-02	ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue	Thomas Pritchard et.al.	2504.01261	link
2025-03-31	SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection	Yannick Burkhardt et.al.	2504.00139	null
2025-03-30	A Visual-Inertial Motion Prior SLAM for Dynamic Environments	Weilong Sun et.al.	2503.23429	null
2025-03-30	AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos	Felix Wimbauer et.al.	2503.23282	link
2025-03-29	Incorporating GNSS Information with LIDAR-Inertial Odometry for Accurate Land-Vehicle Localization	Jintao Cheng et.al.	2503.23199	null
2025-03-29	Towards Mobile Sensing with Event Cameras on High-mobility Resource-constrained Devices: A Survey	Haoyang Wang et.al.	2503.22943	null
2025-03-27	HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM	Ziren Gong et.al.	2503.21778	null
2025-03-27	STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM	Yongxu Wang et.al.	2503.21425	null
2025-03-25	Scene-agnostic Pose Regression for Visual Localization	Junwei Zheng et.al.	2503.19543	null
2025-03-25	First Results on UAV-aided User Localization Using ToA and OpenAirInterface in 5G NR	Omid Esrafilian et.al.	2503.19529	null
2025-03-25	MM-LINS: a Multi-Map LiDAR-Inertial System for Over-Degenerate Environments	Yongxin Ma et.al.	2503.19506	link
2025-03-24	Cooperative Control of Multi-Quadrotors for Transporting Cable-Suspended Payloads: Obstacle-Aware Planning and Event-Based Nonlinear Model Predictive Control	Tohid Kargar Tasooji et.al.	2503.19135	null
2025-03-24	GI-SLAM: Gaussian-Inertial SLAM	Xulang Liu et.al.	2503.18275	null
2025-03-22	LightLoc: Learning Outdoor LiDAR Localization at Light Speed	Wen Li et.al.	2503.17814	link
2025-03-21	Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions	Muhua Zhang et.al.	2503.17005	null
2025-03-20	4D Gaussian Splatting SLAM	Yanyan Li et.al.	2503.16710	null
2025-03-20	Speeding up design and making to reduce time-to-project and time-to-market: an AI-Enhanced approach in engineering education	Giovanni Adorni et.al.	2503.16307	null
2025-03-20	Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors	Tian Yi Lim et.al.	2503.16275	null
2025-03-19	A Sigma Point-based Low Complexity Algorithm for Multipath-based SLAM in MIMO Systems	Anna Masiero et.al.	2503.15286	null
2025-03-19	ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents	Hao Liang et.al.	2503.14948	null
2025-03-18	3D Densification for Multi-Map Monocular VSLAM in Endoscopy	X. Anadón et.al.	2503.14346	null
2025-03-18	GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics	Tingyang Xiao et.al.	2503.14247	link
2025-03-18	A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios	Huy-Hoang Bui et.al.	2503.13982	link
2025-03-17	Digital Beamforming Enhanced Radar Odometry	Jingqi Jiang et.al.	2503.13252	link
2025-03-17	Dynamic-Dark SLAM: RGB-Thermal Cooperative Robot Vision Strategy for Multi-Person Tracking in Both Well-Lit and Low-Light Scenes	Tatsuro Sakai et.al.	2503.12768	null
2025-03-16	KISS-SLAM: A Simple, Robust, and Accurate 3D LiDAR SLAM System With Enhanced Generalization Capabilities	Tiziano Guadagnino et.al.	2503.12660	null
2025-03-16	Deblur Gaussian Splatting SLAM	Francesco Girlanda et.al.	2503.12572	null
2025-03-16	M2UD: A Multi-model, Multi-scenario, Uneven-terrain Dataset for Ground Robot with Localization and Mapping Evaluation	Yanpeng Jia et.al.	2503.12387	null
2025-03-15	DynaGSLAM: Real-Time Gaussian-Splatting SLAM for Online Rendering, Tracking, Motion Predictions of Moving Objects in Dynamic Scenes	Runfa Blark Li et.al.	2503.11979	null
2025-03-14	AQUA-SLAM: Tightly-Coupled Underwater Acoustic-Visual-Inertial SLAM with Sensor Calibration	Shida Xu et.al.	2503.11420	link
2025-03-14	NF-SLAM: Effective, Normalizing Flow-supported Neural Field representations for object-level visual SLAM in automotive applications	Li Cui et.al.	2503.11199	null
2025-03-14	Leveraging Semantic Graphs for Efficient and Robust LiDAR SLAM	Neng Wang et.al.	2503.11145	link
2025-03-13	Rapidly Converging Time-Discounted Ergodicity on Graphs for Active Inspection of Confined Spaces	Benjamin Wong et.al.	2503.10853	null
2025-03-13	OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions	Maxim Popov et.al.	2503.10331	null
2025-03-12	Online Language Splatting	Saimouli Katragadda et.al.	2503.09447	null
2025-03-12	MonoSLAM: Robust Monocular SLAM with Global Structure Optimization	Bingzheng Jiang et.al.	2503.09296	null
2025-03-11	Keypoint Detection and Description for Raw Bayer Images	Jiakai Lin et.al.	2503.08673	null
2025-03-11	GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats	Kai Deng et.al.	2503.08071	link
2025-03-10	POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality	Joey Wilson et.al.	2503.07819	null
2025-03-08	HIPPO-MAT: Decentralized Task Allocation Using GraphSAGE and Multi-Agent Deep Reinforcement Learning	Lavanya Ratnabala et.al.	2503.07662	null
2025-03-10	AirSwarm: Enabling Cost-Effective Multi-UAV Research with COTS drones	Xiaowei Li et.al.	2503.06890	link
2025-03-08	InfoFusion Controller: Informed TRRT Star with Mutual Information based on Fusion of Pure Pursuit and MPC for Enhanced Path Planning	Seongjun Choi et.al.	2503.06010	link
2025-03-07	THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks	Chaoran Xiong et.al.	2503.05112	null
2025-03-07	Adaptive-LIO: Enhancing Robustness and Precision through Environmental Adaptation in LiDAR Inertial Odometry	Chengwei Zhao et.al.	2503.05077	link
2025-03-06	MarsLGPR: Mars Rover Localization with Ground Penetrating Radar	Anja Sheppard et.al.	2503.04944	null
2025-03-06	On the Connection Between Magnetic-Field Odometry Aided Inertial Navigation and Magnetic-Field SLAM	Isaac Skog et.al.	2503.04286	null
2025-03-06	Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes	Hui Zhang et.al.	2503.04235	null
2025-03-06	DVM-SLAM: Decentralized Visual Monocular Simultaneous Localization and Mapping for Multi-Agent Systems	Joshua Bird et.al.	2503.04126	null
2025-03-05	Equivariant Filter Design for Range-only SLAM	Yixiao Ge et.al.	2503.03973	null
2025-03-05	Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments	Jie Deng et.al.	2503.03373	link
2025-03-05	OpenGV 2.0: Motion prior-assisted calibration and SLAM with vehicle-mounted surround-view systems	Kun Huang et.al.	2503.03230	null
2025-03-05	Distributed Certifiably Correct Range-Aided SLAM	Alexander Thoms et.al.	2503.03192	link
2025-03-04	Monocular visual simultaneous localization and mapping: (r)evolution from geometry to deep learning-based pipelines	Olaya Alvarez-Tunon et.al.	2503.02955	link
2025-03-04	Introspective Loop Closure for SLAM with 4D Imaging Radar	Maximilian Hilger et.al.	2503.02383	null
2025-03-04	DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting	Haoyuan Li et.al.	2503.02223	link
2025-03-03	Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM	Marco Giberna et.al.	2503.02050	null
2025-03-03	vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding	Ali Tourani et.al.	2503.01783	null
2025-03-03	MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Yohann Cabon et.al.	2503.01661	link
2025-03-03	OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding	Dianyi Yang et.al.	2503.01646	null
2025-03-03	MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features	Chao Ye et.al.	2503.01571	link
2025-03-03	AI-Driven Relocation Tracking in Dynamic Kitchen Environments	Arash Nasr Esfahani et.al.	2503.01547	link
2025-03-03	Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning	Xintao Chao et.al.	2503.01543	null
2025-03-03	RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation	Shu Pan et.al.	2503.01434	null
2025-02-28	A2DO: Adaptive Anti-Degradation Odometry with Deep Multi-Sensor Fusion for Autonomous Navigation	Hui Lai et.al.	2502.20767	null
2025-02-27	BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground	Yufei Wei et.al.	2502.20078	null
2025-02-26	Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects	Petri Mäkinen et.al.	2502.19169	null
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932	null
2025-02-28	S-Graphs 2.0 – A Hierarchical-Semantic Optimization and Loop Closure for SLAM	Hriday Bavle et.al.	2502.18044	link
2025-02-25	MegaLoc: One Retrieval to Place Them All	Gabriele Berton et.al.	2502.17237	link
2025-02-24	SLABIM: A SLAM-BIM Coupled Dataset in HKUST Main Building	Haoming Huang et.al.	2502.16856	link
2025-02-27	Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM	Yao Zhang et.al.	2502.16495	null
2025-02-19	Slamming: Training a Speech Language Model on One GPU in a Day	Gallil Maimon et.al.	2502.15814	link
2025-02-21	RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes	Sicheng Yu et.al.	2502.15633	null
2025-02-20	Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2502.14931	null
2025-02-19	3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments	Vincent Ress et.al.	2502.13803	null
2025-02-19	Active Illumination for Visual Ego-Motion Estimation in the Dark	Francesco Crocetti et.al.	2502.13708	null
2025-02-17	From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations	Matteo Scucchia et.al.	2502.12303	null
2025-02-19	pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM	Luigi Freda et.al.	2502.11955	link
2025-02-17	Anti-Degeneracy Scheme for Lidar SLAM based on Particle Filter in Geometry Feature-Less Environments	Yanbin Li et.al.	2502.11486	null
2025-02-16	GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting	Zelin Zhou et.al.	2502.10975	null
2025-02-19	MonoForce: Learnable Image-conditioned Physics Engine	Ruslan Agishev et.al.	2502.10156	link
2025-02-13	Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions	Dario Pisanti et.al.	2502.09795	null
2025-02-13	DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior	Mingrui Li et.al.	2502.09111	null
2025-02-12	LIR-LIVO: A Lightweight,Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features	Shujie Zhou et.al.	2502.08676	link
2025-02-14	Occupancy-SLAM: An Efficient and Robust Algorithm for Simultaneously Optimizing Robot Poses and Occupancy Map	Yingyu Wang et.al.	2502.06292	link
2025-02-09	PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map	Yue Pan et.al.	2502.05752	link
2025-02-07	Joint State and Noise Covariance Estimation	Kasra Khosoussi et.al.	2502.04584	null
2025-02-05	GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM	Mingrui Li et.al.	2502.03228	null
2025-02-04	SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification	Yifu Tao et.al.	2502.02657	null
2025-02-04	HeRCULES: Heterogeneous Radar Dataset in Complex Urban Environment for Multi-session Radar SLAM	Hanjun Kim et.al.	2502.01946	null
2025-02-03	Statistical enhance learning for modeling and prediction tennis matches at Grand Slam tournaments	Nourah Buhamra et.al.	2502.01613	null
2025-02-03	Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter	Dabin Kim et.al.	2502.01092	null
2025-02-01	FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps	Maximilian Leitenstern et.al.	2502.00395	link
2025-01-31	LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks	Liudi Yang et.al.	2501.19382	link
2025-01-31	Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping	Yiming Huang et.al.	2501.19319	link
2025-01-31	GO: The Great Outdoors Multimodal Dataset	Peng Jiang et.al.	2501.19274	null
2025-01-30	Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems	Liudi Yang et.al.	2501.18110	null
2025-01-28	SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios	Yinqi Chen et.al.	2501.16754	null
2025-01-27	Visual-Lidar Map Alignment for Infrastructure Inspections	Jake McLaughlin et.al.	2501.14486	link
2025-01-24	Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video	Xiaohao Xu et.al.	2501.14319	link
2025-01-24	HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting	Javier Yu et.al.	2501.14147	null
2025-01-23	FAST-LIVO2 on Resource-Constrained Platforms: LiDAR-Inertial-Visual Odometry with Efficient Memory and Computation	Bingyang Zhou et.al.	2501.13876	null
2025-01-23	VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM	Gyuhyeon Pak et.al.	2501.13402	null
2025-01-22	Grid-based Submap Joining: An Efficient Algorithm for Simultaneously Optimizing Global Occupancy Map and Local Submap Frames	Yingyu Wang et.al.	2501.12764	null
2025-01-21	DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM	Jesse Morris et.al.	2501.11893	link
2025-01-21	Survey on Monocular Metric Depth Estimation	Jiuling Zhang et.al.	2501.11841	null
2025-01-19	OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors	Dominik Kulmer et.al.	2501.11111	link
2025-01-19	Factor Graph-Based Active SLAM for Spacecraft Proximity Operations	Lorenzo Ticozzi et.al.	2501.10950	null
2025-01-23	Mesh2SLAM in VR: A Fast Geometry-Based SLAM Framework for Rapid Prototyping in Virtual Reality Applications	Carlos Augusto Pinheiro de Sousa et.al.	2501.09600	null
2025-01-16	Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment	Maksim Filipenko et.al.	2501.09490	null
2025-01-15	Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures	Pengru Deng et.al.	2501.09203	null
2025-01-15	AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning	Assaf Lahiany et.al.	2501.09160	null
2025-01-15	SLC $^2$ -SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM	Yuhang Ming et.al.	2501.08880	null
2025-01-15	GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping	Sheng Hong et.al.	2501.08672	null
2025-01-16	BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module	Dongzhihan Wang et.al.	2501.08659	null
2025-01-15	Self-Organizing Edge Computing Distribution Framework for Visual SLAM	Jussi Kalliola et.al.	2501.08629	null
2025-01-14	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286	null
2025-01-13	Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps	Saurabh Gupta et.al.	2501.07399	null
2025-01-14	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015	null
2025-01-12	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927	link
2025-01-11	SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors	Zhen Hong et.al.	2501.06469	null
2025-01-09	Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping	Wen Tianci et.al.	2501.05242	null
2025-01-07	SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment	Yuchun Fan et.al.	2501.03681	link
2025-01-06	HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos	Jinglei Zhang et.al.	2501.02973	null
2025-01-09	LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments	Haosong Yue et.al.	2501.02580	link
2025-01-04	ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle	Yinchuan Wang et.al.	2501.02166	link
2024-12-31	PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM	Runnan Chen et.al.	2501.00352	null
2024-12-30	Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields	Evgenii Kruzhkov et.al.	2412.20976	null
2024-12-28	MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing	Shuo Wang et.al.	2412.20082	null
2024-12-27	DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction	Kai Xu et.al.	2412.19584	null
2024-12-26	MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo	Byeonggwon Lee et.al.	2412.19130	null
2024-12-23	End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework	Fuhua Jia et.al.	2412.17343	null
2024-12-23	LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation	Riku Uemura et.al.	2412.17282	null
2024-12-23	Selective Kalman Filter: When and How to Fuse Multi-Sensor Information to Overcome Degeneracy in SLAM	Jie Xu et.al.	2412.17235	null
2025-01-03	Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry	Zhaoxing Zhang et.al.	2412.16923	link
2024-12-21	Query Quantized Neural SLAM	Sijia Jiang et.al.	2412.16476	link
2024-12-20	SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training	Wenxi Chen et.al.	2412.15649	link
2024-12-18	Energy-Efficient SLAM via Joint Design of Sensing, Communication, and Exploration Speed	Zidong Han et.al.	2412.13912	null
2024-12-18	Immersive Human-in-the-Loop Control: Real-Time 3D Surface Meshing and Physics Simulation	Sait Akturk et.al.	2412.13752	null
2024-12-18	4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching	Fernando Amodeo et.al.	2412.13639	link
2024-12-17	NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment	Andrea Dunn Beltran et.al.	2412.13176	null
2024-12-18	Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera	Zhengdi Yu et.al.	2412.12861	null
2024-12-16	Global SLAM in Visual-Inertial Systems with 5G Time-of-Arrival Integration	Meisam Kabiri et.al.	2412.12406	null
2024-12-16	MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors	Riku Murai et.al.	2412.12392	null
2024-12-16	Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges	Martin Aubard et.al.	2412.11840	null
2024-12-19	RoMeO: Robust Metric Visual Odometry	Junda Cheng et.al.	2412.11530	null
2024-12-14	Affine EKF: Exploring and Utilizing Sufficient and Necessary Conditions for Observability Maintenance to Improve EKF Consistency	Yang Song et.al.	2412.10809	link
2024-12-13	RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting	Lizhi Bai et.al.	2412.09868	null
2024-12-12	SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos	Yuzheng Liu et.al.	2412.09401	link
2024-12-12	eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction	Jad Mansour et.al.	2412.09209	link
2024-12-12	Drift-free Visual SLAM using Digital Twins	Roxane Merat et.al.	2412.08496	null
2024-12-10	A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM	Zongbo Liao et.al.	2412.07513	null
2024-12-08	DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments	Juwon Kim et.al.	2412.05839	null
2024-12-06	MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Zhengqi Li et.al.	2412.04463	null
2024-12-05	Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset	Fuzhang Han et.al.	2412.04287	link
2024-12-10	MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application	Hyesu Jang et.al.	2412.03887	null
2024-12-04	Large-Scale Dense 3D Mapping Using Submaps Derived From Orthogonal Imaging Sonars	John McConnell et.al.	2412.03760	null
2024-12-04	BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement	Miguel Arturo Vega Torres et.al.	2412.03434	link
2024-12-04	NeRF and Gaussian Splatting SLAM in the Wild	Fabian Schmidt et.al.	2412.03263	link
2024-12-04	MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras	Huai Yu et.al.	2412.03146	link
2024-12-04	An indoor DSO-based ceiling-vision odometry system for indoor industrial environments	Abdelhak Bougouffa et.al.	2412.02950	null
2024-12-03	ROVER: A Multi-Season Dataset for Visual SLAM	Fabian Schmidt et.al.	2412.02506	link
2024-12-04	RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting	Zhenzhong Cao et.al.	2412.01217	link
2024-12-02	Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM	Alejandro Fontan et.al.	2412.01116	null
2024-12-02	LiDAR SLAMMOT based on Confidence-guided Data Association	Susu Fang et.al.	2412.01041	null
2024-12-01	FlashSLAM: Accelerated RGB-D SLAM for Real-Time 3D Scene Reconstruction with Gaussian Splatting	Phu Pham et.al.	2412.00682	null
2024-11-29	Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction	Shaoxiang Wang et.al.	2412.00242	null
2024-11-28	Visual SLAMMOT Considering Multiple Motion Models	Peilin Tian et.al.	2411.19134	null
2024-11-27	ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching	Yangrui Dong et.al.	2411.18174	null
2024-11-27	HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction	Wei Zhang et.al.	2411.17982	link
2024-11-26	MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework	Xiangcheng Hu et.al.	2411.17928	link
2024-11-29	DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting	Christian Homeyer et.al.	2411.17660	link
2024-11-25	MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM	Vladimir Yugay et.al.	2411.16785	null
2024-11-24	Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors	Soumava Paul et.al.	2411.15966	null
2024-11-24	Near-Range Environmental Perception for Inland Waterway Vessels: A Comparative Study of LiDAR and Automotive FMCW RADAR Sensors	R. Herrmann et.al.	2411.15901	null
2024-11-24	PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments	Haoang Li et.al.	2411.15800	null
2024-11-23	Gassidy: Gaussian Splatting SLAM in Dynamic Environments	Long Wen et.al.	2411.15476	null
2024-11-22	OVO-SLAM: Open-Vocabulary Online Simultaneous Localization and Mapping	Tomas Berriel Martins et.al.	2411.15043	link
2024-11-22	A Benchmark Dataset for Collaborative SLAM in Service Environments	Harin Park et.al.	2411.14775	link
2024-11-21	InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation	Marziyeh Bamdad et.al.	2411.14358	link
2024-11-20	Robust Monocular Visual Odometry using Curriculum Learning	Assaf Lahiany et.al.	2411.13438	null
2024-11-20	Moving Horizon Estimation for Simultaneous Localization and Mapping with Robust Estimation Error Bounds	Jelena Trisovic et.al.	2411.13310	null
2024-11-19	3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality	Hanbeom Chang et.al.	2411.12514	null
2024-11-19	LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2411.12185	null
2024-11-18	Exploring Emerging Trends and Research Opportunities in Visual Place Recognition	Antonios Gasteratos et.al.	2411.11481	null
2024-11-18	The Blue Horizontal-Branch Stars From the LAMOST Survey: Atmospheric Parameters	Jie Ju et.al.	2411.11250	null
2024-11-17	A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality	Wei-Hsiang Lien et.al.	2411.10940	null
2024-11-16	DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment	Mangyu Kong et.al.	2411.10722	link
2024-11-15	The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods	Yifu Tao et.al.	2411.10546	null
2024-11-15	BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation	Yufei Wei et.al.	2411.10195	null
2024-11-13	DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization	Yueming Xu et.al.	2411.08373	null
2024-11-13	MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation	Peng Wang et.al.	2411.08279	link
2024-11-12	Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments	Ankit Shaw et.al.	2411.08231	null
2024-11-12	NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN	Sonia Raychaudhuri et.al.	2411.07848	null
2024-11-11	Lost in Tracking Translation: A Comprehensive Analysis of Visual SLAM in Human-Centered XR and IoT Ecosystems	Yasra Chandio et.al.	2411.07146	null
2024-11-11	Learning from Feedback: Semantic Enhancement for Object SLAM Using Foundation Models	Jungseok Hong et.al.	2411.06752	null
2024-11-11	HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation	Xiaolong Wang et.al.	2411.06700	null
2024-11-08	Development of an indoor localization and navigation system based on monocular SLAM for mobile robots	Thanh Nguyen Canh et.al.	2411.05337	null
2024-11-07	Development of a Service Robot for Hospital Environments in Rehabilitation Medicine with LiDAR Based Simultaneous Localization and Mapping	Sayat Ibrayev et.al.	2411.04797	null
2024-11-07	MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation	Sayan Paul et.al.	2411.04796	null
2024-11-09	DEIO: Deep Event Inertial Odometry	Weipeng Guan et.al.	2411.03928	link
2024-11-06	Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward	Shashi Kumar et.al.	2411.03866	null
2024-11-06	LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior	Jiahui Wang et.al.	2411.03610	link
2024-11-05	LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting	Huibin Zhao et.al.	2411.02703	null
2024-11-04	Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing	Xinran Zhang et.al.	2411.02553	null
2024-11-04	Semantic Masking and Visual Feature Matching for Robust Localization	Luisa Mao et.al.	2411.01804	null
2024-10-31	XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM	Xiaomeng Wang et.al.	2410.23690	link
2024-10-30	LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM	Yucheng Huang et.al.	2410.23231	link
2024-10-30	ISAC Prototype System for Multi-Domain Cooperative Communication Networks	Jie Yang et.al.	2410.22956	null
2024-10-30	SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark	HyunJun Jung et.al.	2410.22715	link
2024-10-29	LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues	Hanqing Jiang et.al.	2410.22213	null
2024-10-29	EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments	Linus Nwankwo et.al.	2410.22200	null
2024-10-28	NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments	Taiyi Pan et.al.	2410.21615	link
2024-10-28	coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM	Emiliano Höss et.al.	2410.21149	link
2024-11-01	RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior	Mingjiang Liang et.al.	2410.20358	null
2024-10-25	Context-Based Visual-Language Place Recognition	Soojin Woo et.al.	2410.19341	link
2024-10-22	AG-SLAM: Active Gaussian Splatting SLAM	Wen Jiang et.al.	2410.17422	null
2024-10-22	Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study	J. Jorge et.al.	2410.17171	null
2024-10-19	EndoMetric: Near-light metric scale monocular SLAM	Raúl Iranzo et.al.	2410.15065	null
2024-10-17	Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot	Dongkun Han et.al.	2410.13612	null
2024-10-17	TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal	Yanpeng Jia et.al.	2410.13240	null
2024-10-16	QueensCAMP: an RGB-D dataset for robust Visual SLAM	Hudson M. S. Bruno et.al.	2410.12520	link
2024-10-18	PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM	Guanghao Li et.al.	2410.12324	null
2024-10-16	Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem	Yichen Sha et.al.	2410.12169	null
2024-10-15	V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting	Tuan Dang et.al.	2410.12068	link
2024-10-15	GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information	Wancai Zheng et.al.	2410.11356	null
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187	link
2024-10-14	MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator	Taozhe Li et.al.	2410.10669	null
2024-10-13	Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph	Benoit Casseau et.al.	2410.09896	null
2024-10-12	SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs	Wenxi Chen et.al.	2410.09503	link
2024-10-12	An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation	Wei Liang et.al.	2410.09443	null
2024-10-12	ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras	Junkai Niu et.al.	2410.09374	link
2024-10-11	Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System	Zheng Liu et.al.	2410.08935	link
2024-10-11	Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints	Yicheng He et.al.	2410.08780	null
2024-10-10	ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization	Mason B. Peterson et.al.	2410.08262	link
2024-10-10	IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera	Jian Huang et.al.	2410.08107	link
2024-10-08	Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching	Gongxin Yao et.al.	2410.06285	null
2024-10-08	Submodular Optimization for Keyframe Selection & Usage in SLAM	David Thorne et.al.	2410.05576	null
2024-10-07	SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones	Denis Davletshin et.al.	2410.05405	null
2024-10-07	Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection	Ang He et.al.	2410.05017	null
2024-10-05	A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems	Nikola Radulov et.al.	2410.04242	link
2024-10-05	High-Speed Stereo Visual SLAM for Low-Powered Computing Devices	Ashish Kumar et.al.	2410.04090	link
2024-10-04	EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM	Shi Chen et.al.	2410.03812	null
2024-10-04	Estimating Body and Hand Motion in an Ego-sensed World	Brent Yi et.al.	2410.03665	null
2024-10-03	LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features	Zihao Dong et.al.	2410.02961	null
2024-10-02	ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space	Hogyun Kim et.al.	2410.01325	null
2024-10-01	Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency	William Dubois et.al.	2410.00758	null
2024-10-02	CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM	Dapeng Feng et.al.	2410.00486	link
2024-09-30	Additively Manufactured Open-Source Quadruped Robots for Multi-Robot SLAM Applications	Zachary Fuge et.al.	2410.00122	null
2024-09-30	Direct Multipath-Based SLAM	Mingchao Liang et.al.	2409.20552	null
2024-09-30	Robust Gaussian Splatting SLAM by Leveraging Loop Closure	Zunjie Zhu et.al.	2409.20111	null
2024-09-30	DynORecon: Dynamic Object Reconstruction for Navigation	Yiduo Wang et.al.	2409.19928	null
2024-09-29	CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation	Yifan Duan et.al.	2409.19597	null
2024-09-29	CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought	Yexing Du et.al.	2409.19510	link
2024-09-29	Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface	Ziniu Wu et.al.	2409.19499	null
2024-09-27	Royal Reveals: LiDAR Mapping of Kronborg Castle, Echoes of Hamlet’s Halls	Leon Davies et.al.	2409.18752	null
2024-09-26	BlinkTrack: Feature Tracking over 100 FPS via Events and Images	Yichen Shen et.al.	2409.17981	null
2024-09-26	Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry	Qi Zhang et.al.	2409.17729	null
2024-09-26	Event-based Stereo Depth Estimation: A Survey	Suman Ghosh et.al.	2409.17680	null
2024-09-25	Efficient Submap-based Autonomous MAV Exploration using Visual-Inertial SLAM Configurable for LiDARs or Depth Cameras	Sotiris Papatheodorou et.al.	2409.16972	null
2024-09-25	Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM	Phu Pham et.al.	2409.16944	null
2024-09-25	Inline Photometrically Calibrated Hybrid Visual SLAM	Nicolas Abboud et.al.	2409.16810	link
2024-09-25	Topological SLAM in colonoscopies leveraging deep features and topological priors	Javier Morlana et.al.	2409.16806	link
2024-09-25	Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots	Masoud Dayani Najafabadi et.al.	2409.16595	link
2024-09-25	Task-driven SLAM Benchmarking	Yanwei Du et.al.	2409.16573	link
2024-09-24	SoMaSLAM: 2D Graph SLAM for Sparse Range Sensing with Soft Manhattan World Constraints	Jeahn Han et.al.	2409.15736	null
2024-09-23	Spectral Graph Theoretic Methods for Enhancing Network Robustness in Robot Localization	Neelkamal Somisetty et.al.	2409.15506	null
2024-09-22	SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms	Niraj Pudasaini et.al.	2409.14515	null
2024-09-21	Point Cloud Structural Similarity-based Underwater Sonar Loop Detection	Donghwi Jung et.al.	2409.14020	link
2024-09-20	HMD $^2$ : Environment-aware Motion Generation from Single Egocentric Head-Mounted Device	Vladimir Guzov et.al.	2409.13426	null
2024-09-20	Learning Visual Information Utility with PIXER	Yash Turkar et.al.	2409.13151	null
2024-09-19	MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting	Yan Song Hu et.al.	2409.13055	null
2024-09-19	Hi-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2409.12518	link
2024-09-18	Bundle Adjustment in the Eager Mode	Zitong Zhan et.al.	2409.12190	null
2024-09-23	Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping	Jaehyung Jung et.al.	2409.12051	null
2024-09-18	Metric-Semantic Factor Graph Generation based on Graph Neural Networks	Jose Andres Millan-Romera et.al.	2409.11972	null
2024-09-18	Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments	Lei Cheng et.al.	2409.11854	null
2024-09-18	ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation	Yanlin Jin et.al.	2409.11692	null
2024-09-18	SLAM assisted 3D tracking system for laparoscopic surgery	Jingwei Song et.al.	2409.11688	null
2024-09-17	GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure	Ziheng Xu et.al.	2409.10982	null
2024-09-17	Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells	Ankit Butola et.al.	2409.10971	null
2024-09-17	Evaluating and Improving the Robustness of LiDAR-based Localization and Mapping	Bo Yang et.al.	2409.10824	link
2024-09-16	P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty	Yufan Zhang et.al.	2409.10143	link
2024-09-16	SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning	Amogh Joshi et.al.	2409.09990	null
2024-09-16	Enhancing Visual Inertial SLAM with Magnetic Measurements	Bharat Joshi et.al.	2409.09904	null
2024-09-15	Marginalizing and Conditioning Gaussians onto Linear Approximations of Smooth Manifolds with Applications in Robotics	Zi Cong Guo et.al.	2409.09871	link
2024-09-15	Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping	Yi Liu et.al.	2409.09763	null
2024-09-15	High Definition Map Mapping and Update: A General Overview and Future Directions	Benny Wijaya et.al.	2409.09726	null
2024-09-14	MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry	Yuheng Qiu et.al.	2409.09479	null
2024-09-14	Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM	Haoying Li et.al.	2409.09410	null
2024-09-14	GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians	Dasong Gao et.al.	2409.09295	link
2024-09-14	Panoramic Direct LiDAR-assisted Visual Odometry	Zikang Yuan et.al.	2409.09287	link
2024-09-11	Object Depth and Size Estimation using Stereo-vision and Integration with SLAM	Layth Hamad et.al.	2409.07623	null
2024-09-11	Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry	Anbo Tao et.al.	2409.06948	null
2024-09-10	Technical Report of Mobile Manipulator Robot for Industrial Environments	Erfan Amoozad Khalili et.al.	2409.06693	null
2024-09-10	Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios	Zhiqiang Chen et.al.	2409.04961	link
2024-09-08	FLAF: Focal Line and Feature-constrained Active View Planning for Visual Teach and Repeat	Changfei Fu et.al.	2409.03457	null
2024-09-03	Integration of Augmented Reality and Mobile Robot Indoor SLAM for Enhanced Spatial Awareness	Michael D. Friske et.al.	2409.01915	null
2024-09-03	Explicit Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric	Tingchen Ma et.al.	2409.01856	null
2024-09-02	Saying goodbyes to rotating your phone: Magnetometer calibration during SLAM	Ilari Vallivaara et.al.	2409.01242	null
2024-09-02	Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection	Manon Kok et.al.	2409.01091	null
2024-09-02	Robust Vehicle Localization and Tracking in Rain using Street Maps	Yu Xiang Tan et.al.	2409.01038	link
2024-08-31	UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM	Mostafa Mansour et.al.	2409.00362	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373	null
2024-08-30	Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning	Shuyang Zhang et.al.	2408.17005	link
2024-08-29	Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry	Michael Adlerstein et.al.	2408.16472	null
2024-08-28	Single-Photon 3D Imaging with Equi-Depth Photon Histograms	Kaustubh Sadekar et.al.	2408.16150	null
2024-08-28	BIM-SLAM: Integrating BIM Models in Multi-session SLAM for Lifelong Mapping using 3D LiDAR	Miguel Arturo Vega Torres et.al.	2408.15870	link
2024-08-30	Addressing the challenges of loop detection in agricultural environments	Nicolás Soncini et.al.	2408.15761	link
2024-08-28	ES-PTAM: Event-based Stereo Parallel Tracking and Mapping	Suman Ghosh et.al.	2408.15605	link
2024-08-28	PointEMRay: A Novel Efficient SBR Framework on Point Based Geometry	Kaiqiao Yang et.al.	2408.15583	null
2024-09-02	Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration	Rongge Zhang et.al.	2408.14726	link
2024-08-26	A Survey on Reinforcement Learning Applications in SLAM	Mohammad Dehghani Tezerjani et.al.	2408.14518	null
2024-08-28	FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry	Chunran Zheng et.al.	2408.14035	link
2024-08-21	Informed, Constrained, Aligned: A Field Analysis on Degeneracy-aware Point Cloud Registration in the Wild	Turcan Tuna et.al.	2408.11809	null
2024-08-21	LiFCal: Online Light Field Camera Calibration via Bundle Adjustment	Aymeric Fleith et.al.	2408.11682	null
2024-08-21	Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars	Zhihao Lin et.al.	2408.11582	null
2024-08-21	RaNDT SLAM: Radar SLAM Based on Intensity-Augmented Normal Distributions Transform	Maximilian Hilger et.al.	2408.11576	link
2024-08-21	Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models	Kento Kawaharazuka et.al.	2408.11380	null
2024-08-20	LoopSplat: Loop Closure by Registering 3D Gaussian Splats	Liyuan Zhu et.al.	2408.10154	link
2024-08-19	Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM	Sanghyun Hahn et.al.	2408.09727	link
2024-08-17	GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System	Shuo Wang et.al.	2408.09191	null
2024-08-15	GOReloc: Graph-based Object-Level Relocalization for Visual SLAM	Yutong Wang et.al.	2408.07917	link
2024-08-14	Inverse k-visibility for RSSI-based Indoor Geometric Mapping	Junseo Kim et.al.	2408.07757	null
2024-08-14	Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition	Hogyun Kim et.al.	2408.07330	link
2024-08-12	CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments	Yanpeng Jia et.al.	2408.05981	null
2024-08-21	Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis	Zhongche Qu et.al.	2408.05635	null
2024-08-10	TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping	Seoyeon Jang et.al.	2408.05453	null
2024-08-08	Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods	Yiming Zhou et.al.	2408.04268	null
2024-08-07	Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM	Yan Song Hu et.al.	2408.03825	null
2024-08-07	AirSLAM: An Efficient and Illumination-Robust Point-Line Visual SLAM System	Kuan Xu et.al.	2408.03520	link
2024-08-06	BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications	G. Manni et.al.	2408.03078	link
2024-08-04	SLAMS-Propelled Electron Acceleration at High-Mach Number Astrophysical Shocks	Vladimir Zeković et.al.	2408.02084	null
2024-08-03	Visual-Inertial SLAM for Agricultural Robotics: Benchmarking the Benefits and Computational Costs of Loop Closing	Fabian Schmidt et.al.	2408.01716	link
2024-08-03	Deep Patch Visual SLAM	Lahav Lipson et.al.	2408.01654	link
2024-08-02	Momentum Capture and Prediction System Based on Wimbledon Open2023 Tournament Data	Chang Liu et.al.	2408.01544	null
2024-08-07	IG-SLAM: Instant Gaussian SLAM	F. Aykut Sarikamis et.al.	2408.01126	null
2024-08-01	Collecting Larg-Scale Robotic Datasets on a High-Speed Mobile Platform	Yuxin Lin et.al.	2408.00545	null
2024-08-01	High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets	Jian Li et.al.	2408.00538	link
2024-07-31	SuperVINS: A visual-inertial SLAM framework integrated deep learning features	Hongkun Luo et.al.	2407.21348	link
2024-07-30	NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding	Hongjia Zhai et.al.	2407.20853	null
2024-07-29	A flexible framework for accurate LiDAR odometry, map manipulation, and localization	José Luis Blanco-Claraco et.al.	2407.20465	link
2024-07-28	Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data	Azmyin Md. Kamal et.al.	2407.19518	null
2024-07-26	Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation	Aditya Penumarti et.al.	2407.19046	null
2024-07-26	HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM	Zhe Xin et.al.	2407.18813	null
2024-07-25	CodedVO: Coded Visual Odometry	Sachin Shah et.al.	2407.18240	null
2024-07-28	HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation	Zhenzhi Wang et.al.	2407.17438	link
2024-07-22	Memory Management for Real-Time Appearance-Based Loop Closure Detection	Mathieu Labbé et.al.	2407.15890	null
2024-07-22	Reinforcement Learning Meets Visual Odometry	Nico Messikommer et.al.	2407.15626	link
2024-07-22	Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM	Mathieu Labbe et.al.	2407.15305	null
2024-07-21	Semi-Supervised Pipe Video Temporal Defect Interval Localization	Zhu Huang et.al.	2407.15170	null
2024-07-21	VoxDepth: Rectification of Depth Images on Edge Devices	Yashashwee Chakrabarty et.al.	2407.15067	null
2024-07-20	From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM	Lorenzo Montano-Oliván et.al.	2407.14797	null
2024-07-19	MSSP : A Versatile Multi-Scenario Adaptable Intelligent Robot Simulation Platform Based on LIDAR-Inertial Fusion	Qiyan Li et.al.	2407.14102	null
2024-07-18	A New Tightly-Coupled Dual-VIO for a Mobile Manipulator With Dynamic Locomotion	Jianxiang Xu et.al.	2407.13878	link
2024-07-18	Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM	Baicheng Li et.al.	2407.13338	null
2024-07-18	Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain	Bach Nguyen Gia et.al.	2407.13159	link
2024-07-17	Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge	Andrea Albanese et.al.	2407.12663	null
2024-07-17	Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM	Markus Weißflog et.al.	2407.12408	null
2024-07-19	Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion	Sangjun Lee et.al.	2407.12405	link
2024-07-17	Fusion LiDAR-Inertial-Encoder data for High-Accuracy SLAM	Manh Do Duc et.al.	2407.11870	null
2024-07-17	GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection	Jingwen Yu et.al.	2407.11736	link
2024-07-16	Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems	Jianzhu Huai et.al.	2407.11705	null
2024-07-16	Batch SLAM with PMBM Data Association Sampling and Graph-Based Optimization	Yu Ge et.al.	2407.11643	null
2024-07-16	I $^2$ -SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM	Gwangtak Bae et.al.	2407.11347	null
2024-07-16	FR-SLAM: A SLAM Improvement Method Based on Floor Plan Registration	Jiantao Feng et.al.	2407.11299	null
2024-07-15	Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method	Adam Korycki et.al.	2407.11238	null
2024-07-12	An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural Networks	Seyed Alireza Rahimi Azghadi et.al.	2407.09242	null
2024-07-11	SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM	Neng Wang et.al.	2407.08106	link
2024-07-09	Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM	David Hug et.al.	2407.07074	link
2024-07-15	A Neurosymbolic Approach to Adaptive Feature Extraction in SLAM	Yasra Chandio et.al.	2407.06889	null
2024-07-08	Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots	Siva Krishna Ravipati et.al.	2407.06077	link
2024-07-10	Co-RaL: Complementary Radar-Leg Odometry with 4-DoF Optimization and Rolling Contact	Sangwoo Jung et.al.	2407.05820	null
2024-07-07	Active Collaborative Visual SLAM exploiting ORB Features	Muhammad Farhan Ahmed et.al.	2407.05453	null
2024-07-06	VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking	Xuefeng Jiang et.al.	2407.05017	null
2024-07-06	Symmetric Linear Arc Monadic Datalog and Gadget Reductions	Manuel Bodirsky et.al.	2407.04924	null
2024-07-03	Ultra-Lightweight Collaborative Mapping for Robot Swarms	Vlad Niculescu et.al.	2407.03136	null
2024-07-01	RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields	Haochen Jiang et.al.	2407.01303	link
2024-07-01	Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation	Lianjie Guo et.al.	2407.01292	link
2024-07-01	Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization	Ruofei Bai et.al.	2407.01013	link
2024-06-30	Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation	Adnan Abdullah et.al.	2407.00848	null
2024-06-30	OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration	Fengyuan Yang et.al.	2407.00574	null
2024-06-24	Compressing Search with Language Models	Thomas Mulc et.al.	2407.00085	null
2024-06-28	CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services	DongKi Noh et.al.	2406.19634	null
2024-06-25	Benchmarking SLAM Algorithms in the Cloud: The SLAM Hive System	Xinzhe Liu et.al.	2406.17586	null
2024-07-02	SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation	Xu Liu et.al.	2406.17249	link
2024-06-24	From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking	Xiaohao Xu et.al.	2406.16850	link
2024-06-23	Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy	Chen Wang et.al.	2406.16087	null
2024-06-19	Simultaneous Map and Object Reconstruction	Nathaniel Chodosh et.al.	2406.13896	null
2024-06-14	Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization	Wonho Song et.al.	2406.11599	null
2024-06-16	Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry	Boris Chidlovskii et.al.	2406.11019	null
2024-06-15	Detection and Utilization of Reflections in LiDAR Scans Through Plane Optimization and Plane SLAM	Yinjie Li et.al.	2406.10494	link
2024-06-12	From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers	Swaminathan Gurumurthy et.al.	2406.07785	link
2024-06-27	Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF)	Gyubeom Im et.al.	2406.06427	null
2024-06-10	Notes on Various Errors and Jacobian Derivations for SLAM	Gyubeom Im et.al.	2406.06422	null
2024-06-23	Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation	Shenghao Li et.al.	2406.06374	link
2024-06-15	Visual-Inertial SLAM as Simple as A, B, VINS	Nathaniel Merrill et.al.	2406.05969	null
2024-06-09	MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps	Jianhao Zheng et.al.	2406.05849	null
2024-06-06	Open Problem: Active Representation Learning	Nikola Milosevic et.al.	2406.03845	null
2024-06-04	ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization	Chen Mao et.al.	2406.01906	link
2024-06-03	The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry	Paolo Cudrano et.al.	2406.01797	null
2024-06-03	Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Takayuki Kanai et.al.	2406.00929	null
2024-06-02	Visual place recognition for aerial imagery: A survey	Ivan Moskalenko et.al.	2406.00885	link
2024-05-30	Structure Gaussian SLAM with Manhattan World Hypothesis	Shuhong Liu et.al.	2405.20031	null
2024-05-30	Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar	Wouter Jansen et.al.	2405.19869	null
2024-05-30	SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization	Jiang Wang et.al.	2405.19813	link
2024-05-30	TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM	Peifeng Jiang et.al.	2405.19614	null
2024-05-27	CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy	Richard Elvira et.al.	2405.16932	null
2024-05-26	Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians	Erik Sandström et.al.	2405.16544	link
2024-05-24	NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes	Lizhi Bai et.al.	2405.15151	null
2024-05-23	ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization	Han Song et.al.	2405.15082	null
2024-05-23	Synergistic Global-space Camera and Human Reconstruction from Videos	Yizhou Zhao et.al.	2405.14855	null
2024-05-23	CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments	Yang Zhou et.al.	2405.14731	link
2024-05-23	Efficient Robot Learning for Perception and Mapping	Niclas Vödisch et.al.	2405.14688	null
2024-05-22	Monocular Gaussian SLAM with Language Extended Loop Closure	Tian Lan et.al.	2405.13748	null
2024-05-26	NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments	Dongha Chung et.al.	2405.12563	link
2024-05-20	EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving	Boyi Liu et.al.	2405.12120	null
2024-05-24	Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation	Hyungtae Lim et.al.	2405.11176	null
2024-05-18	MotionGS : Compact Gaussian Splatting SLAM by Motion Filter	Xinli Guo et.al.	2405.11129	link
2024-05-17	CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion	Gang Wang et.al.	2405.10793	null
2024-05-17	Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map	Liang Zhao et.al.	2405.10743	null
2024-05-10	MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization	Pengcheng Zhu et.al.	2405.06241	null
2024-05-07	Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map	Yuxuan Xia et.al.	2405.04290	null
2024-05-07	IMU-Aided Event-based Stereo Visual Odometry	Junkai Niu et.al.	2405.04071	link
2024-04-27	An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation	Olivier Brochu Dufour et.al.	2404.17745	null
2024-04-26	Camera Motion Estimation from RGB-D-Inertial Scene Flow	Samuel Cerezo et.al.	2404.17251	link
2024-04-23	Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization	Lahav Lipson et.al.	2404.15263	link
2024-04-18	SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints	Spencer Carmichael et.al.	2404.12339	null
2024-04-17	VBR: A Vision Benchmark in Rome	Leonardo Brizi et.al.	2404.11322	link
2024-04-14	Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration	Yanhao Zhang et.al.	2404.09169	link
2024-04-06	Salient Sparse Visual Odometry With Pose-Only Supervision	Siyu Chen et.al.	2404.04677	null
2024-03-25	A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments	Gianluca D’Amico et.al.	2403.17084	null
2024-03-19	On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine	Jagatpreet Singh Nir et.al.	2403.13170	null
2024-03-18	The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions	Margaret Hansen et.al.	2403.12194	null
2024-03-18	An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation	Zewen Xu et.al.	2403.11639	null
2024-03-16	Efficient Domain Adaptation for Endoscopic Visual Odometry	Junyang Wu et.al.	2403.10860	null
2024-03-14	Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO)	Matthew Lisondra et.al.	2403.09882	null
2024-03-02	Grid-based Fast and Structural Visual Odometry	Zhang Zhihe et.al.	2403.01110	null
2024-02-25	VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Xudong Cai et.al.	2402.15961	link
2024-02-22	Secure Navigation using Landmark-based Localization in a GPS-denied Environment	Ganesh Sapkota et.al.	2402.14280	null
2024-02-19	Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment	Ganesh Sapkota et.al.	2402.12551	null
2024-02-07	Online and Certifiably Correct Visual Odometry and Mapping	Devansh R Agrawal et.al.	2402.05254	null
2024-02-06	YOLOPoint Joint Keypoint and Object Detection	Anton Backhaus et.al.	2402.03989	link
2024-01-19	Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning	André O. Françani et.al.	2401.10857	null
2024-01-17	Event-Based Visual Odometry on Non-Holonomic Ground Vehicles	Wanting Xu et.al.	2401.09331	link
2024-01-11	On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering	Feng Zhu et.al.	2401.05836	null
2023-12-19	Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry	Olaya Álvarez-Tuñón et.al.	2401.05396	link
2024-01-07	Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people	Ali Samadzadeh et.al.	2401.03604	link
2024-01-03	LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry	Weirong Chen et.al.	2401.01887	link
2023-12-28	SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction	Zikang Yuan et.al.	2312.16800	link
2023-12-20	NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields	Jens Naumann et.al.	2312.13471	null
2023-12-22	Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM	Junru Lin et.al.	2312.13332	null
2023-12-20	Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach	Habib Boloorchi Tabrizi et.al.	2312.13162	link
2023-12-20	Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera	Abdulkadhem A. Abdulkadhem et.al.	2312.12680	null
2023-12-15	Deep Event Visual Odometry	Simon Klenk et.al.	2312.09800	link
2023-12-10	SuperPrimitive: Scene Reconstruction at a Primitive Level	Kirill Mazur et.al.	2312.05889	null
2023-12-04	iMatching: Imperative Correspondence Learning	Zitong Zhan et.al.	2312.02141	link
2023-11-30	Event-based Visual Inertial Velometer	Xiuyuan Lu et.al.	2311.18189	null
2023-11-21	CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems	Young-Hee Lee et.al.	2311.12580	null
2023-11-10	Dense Visual Odometry Using Genetic Algorithm	Slimane Djema et.al.	2311.06149	null
2023-11-07	Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM	Seongwook Yoon et.al.	2311.03722	null
2023-10-23	Converting Depth Images and Point Clouds for Feature-based Pose Estimation	Robert Lösch et.al.	2310.14924	link
2023-10-17	Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms	Yanyan Li et.al.	2310.10931	link
2023-10-12	Jointly Optimized Global-Local Visual Localization of UAVs	Haoling Li et.al.	2310.08082	null
2023-10-10	l-dyno: framework to learn consistent visual features using robot’s motion	Kartikeya Singh et.al.	2310.06249	link
2023-10-08	XVO: Generalized Visual Odometry via Cross-Modal Self-Training	Lei Lai et.al.	2309.16772	null
2023-10-22	ObVi-SLAM: Long-Term Object-Visual SLAM	Amanda Adkins et.al.	2309.15268	link
2023-09-23	Tag-based Visual Odometry Estimation for Indoor UAVs Localization	Massimiliano Bertoni et.al.	2309.13311	null
2023-09-22	Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms	Olivier Gamache et.al.	2309.13139	link
2023-09-20	Conformalized Multimodal Uncertainty Regression and Reasoning	Domenico Parente et.al.	2309.11018	null
2023-09-20	OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving	Heng Li et.al.	2309.11011	link
2023-09-19	LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation	Haizhou Zhang et.al.	2309.10436	link
2023-09-21	Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration	Hongbo Zhao et.al.	2309.10314	null
2023-09-18	End-to-End Learned Event- and Image-based Visual Odometry	Roberto Pellerito et.al.	2309.09947	link
2023-09-14	An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments	Yehao Liu et.al.	2309.07408	null
2023-09-11	Evaluating Visual Odometry Methods for Autonomous Driving in Rain	Yu Xiang Tan et.al.	2309.05249	null
2023-09-08	Robot Localization and Mapping Final Report – Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry	Akankshya Kar et.al.	2309.04147	null
2023-09-04	EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity	Zijie Jiang et.al.	2309.01296	null
2023-08-27	Deep Learning for Visual Localization and Mapping: A Survey	Changhao Chen et.al.	2308.14039	null
2023-08-19	Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters	Xiao Liu et.al.	2308.09870	link
2023-08-12	4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion	Guirong Zhuo et.al.	2308.06573	null
2023-08-10	Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU	U. V. B. L. Udugama et.al.	2308.05515	null
2023-08-02	A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry	Cora A. Dimmig et.al.	2308.01398	null
2023-08-02	Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network	Shenbagaraj Kannapiran et.al.	2308.01125	null
2023-08-02	Preliminary Design of the Dragonfly Navigation Filter	Ben Schilling et.al.	2307.13513	null
2023-07-19	Optimizing the extended Fourier Mellin Transformation Algorithm	Wenqing Jiang et.al.	2307.10015	link
2023-07-15	Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents	Ke Cao et.al.	2307.07763	null
2023-07-26	Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression	Jianeng Wang et.al.	2306.01188	null
2023-07-06	OSPC: Online Sequential Photometric Calibration	Jawad Haidar et.al.	2305.17673	null
2023-05-15	Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface	Shifan Zhu et.al.	2305.08962	null
2023-05-10	Transformer-based model for monocular visual odometry: a video understanding approach	André O. Françani et.al.	2305.06121	link
2023-04-29	Modality-invariant Visual Odometry for Embodied Vision	Marius Memmel et.al.	2305.00348	link
2023-04-21	FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving	Yuxuan Liu et.al.	2304.10719	null
2023-07-08	Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Visual Bootstrapping	Hanyu Cai et.al.	2304.08978	null
2023-04-12	SiLK – Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194	link
2023-04-11	ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster	Yifei Dong et.al.	2304.04943	null
2023-03-21	Learning a Depth Covariance Function	Eric Dexheimer et.al.	2303.12157	null
2023-03-21	Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network	Alessandro Navone et.al.	2303.11725	null
2023-03-20	VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors	Thien Hoang Nguyen et.al.	2303.10903	null
2023-03-17	CoVIO: Online Continual Learning for Visual-Inertial Odometry	Niclas Vödisch et.al.	2303.10149	link
2023-03-15	UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry	Chaoyang Jiang et.al.	2303.08550	null
2023-03-13	Discovering Multiple Algorithm Configurations	Leonid Keselman et.al.	2303.07434	null
2023-03-09	Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation	Masahiro Hirano et.al.	2303.05192	null
2023-03-16	Stereo Event-based Visual-Inertial Odometry	Kunfeng Wang et.al.	2303.05086	link
2023-03-07	Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor	Eduardo Gallo et.al.	2303.03804	null
2023-03-03	Lightweight, Uncertainty-Aware Conformalized Visual Odometry	Alex C. Stutts et.al.	2303.02207	null
2023-02-24	FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets	Yelena Randall et.al.	2302.12772	null
2023-02-27	CP+: Camera Poses Augmentation with Large-scale LiDAR Maps	Jiadi Cui et.al.	2302.12198	null
2023-02-19	EdgeVO: An Efficient and Accurate Edge-based Visual Odometry	Hui Zhao et.al.	2302.09493	null
2023-01-27	HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera	Mostafa Ahmadi et.al.	2301.11823	null
2023-01-26	Distributed Optimization Methods for Multi-Robot Systems: Part I – A Tutorial	Ola Shorinwa et.al.	2301.11313	null
2023-01-24	Generalized Object Search	Kaiyu Zheng et.al.	2301.10121	null
2023-01-22	Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories	Hanlin Chen et.al.	2301.09194	null
2023-01-21	Dense RGB SLAM with Neural Implicit Maps	Heng Li et.al.	2301.08930	null
2023-01-18	Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information	Junshi Chen et.al.	2301.07560	null
2023-01-17	COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM	Manthan Patel et.al.	2301.07147	link
2023-01-31	Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems	Pierre-Yves Lajoie et.al.	2301.06230	link
2023-01-13	A LiDAR-Inertial-Visual SLAM System with Loop Detection	Kangcheng Liu et.al.	2301.05604	null
2023-01-11	AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization	Ying Chen et.al.	2301.04620	link
2023-01-12	TBV Radar SLAM – trust but verify loop candidates	Daniel Adolfsson et.al.	2301.04397	link
2022-12-31	Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges	Maxwell McManus et.al.	2301.03359	null
2023-01-09	Motion Addition and Motion Optimization	Liqun Qi et.al.	2301.03174	null
2023-01-08	Towards Open World NeRF-Based SLAM	Daniil Lisus et.al.	2301.03102	null
2023-01-06	CyberLoc: Towards Accurate Long-term Visual Localization	Liu Liu et.al.	2301.02403	null
2023-01-03	LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation	Shreyansh Daftry et.al.	2301.01350	null
2022-12-31	4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions	Patrick Wenzel et.al.	2301.01147	null
2023-01-03	BS3D: Building-scale 3D Reconstruction from RGB-D Images	Janne Mustaniemi et.al.	2301.01057	null
2023-01-10	An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping	Masoud Dayani Najafabadi et.al.	2301.00618	link
2022-12-25	A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion	Nadia Figueroa et.al.	2212.14772	null
2022-12-29	An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping	Kangcheng Liu et.al.	2212.14209	link
2022-12-27	Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands	Felipe Gómez-Cuba et.al.	2212.13477	link
2022-12-26	ESVIO: Event-based Stereo Visual Inertial Odometry	Peiyu Chen et.al.	2212.13184	link
2022-12-24	A Comprehensive Review on Autonomous Navigation	Saeid Nahavandi et.al.	2212.12808	null
2022-12-23	Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation	Marina Lotti et.al.	2212.12388	null
2022-12-23	Implementation of a Blind navigation method in outdoors/indoors areas	Mohammad Javadian Farzaneh et.al.	2212.12185	null
2022-12-22	S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations	Hriday Bavle et.al.	2212.11770	link
2022-12-22	Active SLAM: A Review On Last Decade	Muhammad Farhan Ahmed et.al.	2212.11654	null
2022-12-27	Motion, Unit Dual Quaternion and Motion Optimization	Liqun Qi et.al.	2212.11593	null
2022-12-22	Vision-Based Environmental Perception for Autonomous Driving	Fei Liu et.al.	2212.11453	null
2022-12-19	Mu $^{2}$ SLAM: Multitask, Multilingual Speech and Language Models	Yong Cheng et.al.	2212.09553	null
2022-12-16	Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments	Lasitha Weerakoon et.al.	2212.08633	null
2022-12-16	rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments	Bo Wei et.al.	2212.08418	null
2023-03-02	AirVO: An Illumination-Robust Point-Line Visual Odometry	Kuan Xu et.al.	2212.07595	link
2022-12-14	Autonomous Vehicle Navigation with LIDAR using Path Planning	Rahul M K et.al.	2212.07155	null
2022-12-14	RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping	Hyowon Kim et.al.	2212.07141	null
2022-12-13	Know What You Don’t Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version)	Daniil Lisus et.al.	2212.06923	null
2022-12-13	SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance	Chenyangguang Zhang et.al.	2212.06524	null
2022-12-13	Localization and Navigation System for Indoor Mobile Robot	Yanbaihui Liu et.al.	2212.06391	null
2022-12-12	Evaluation of RGB-D SLAM in Large Indoor Environments	Kirill Muravyev et.al.	2212.05980	null
2022-12-19	A Light-Weight LiDAR-Inertial SLAM System with Loop Closing	Kangcheng Liu et.al.	2212.05743	link
2022-12-12	An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds	Kangcheng Liu et.al.	2212.05705	link
2022-12-09	SLAM for Visually Impaired People: A Survey	Marziyeh Bamdad et.al.	2212.04745	null
2022-12-09	Ego-Body Pose Estimation via Ego-Head Pose Estimation	Jiaman Li et.al.	2212.04636	null
2022-12-06	Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles	Sushant Veer et.al.	2212.03323	link
2022-12-06	PRISM: Probabilistic Real-Time Inference in Spatial World Models	Atanas Mirchev et.al.	2212.02988	null
2022-12-06	RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps	Florian Sauerbeck et.al.	2212.02085	link
2022-12-05	DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization	Xuebo Tian et.al.	2212.02077	null
2022-12-05	ObjectMatch: Robust Registration using Canonical Object Correspondences	Can Gümeli et.al.	2212.01985	null
2022-12-02	Sparse SPN: Depth Completion from Sparse Keypoints	Yuqun Wu et.al.	2212.00987	null
2022-12-01	maplab 2.0 – A Modular and Multi-Modal Mapping Framework	Andrei Cramariuc et.al.	2212.00654	link
2022-12-01	AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body – Theory and Experiments	Mehregan Dor et.al.	2212.00350	null
2022-11-30	MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves	Pranjali Pathre et.al.	2211.16882	null
2022-11-29	PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images	Hartmut Surmann et.al.	2211.16266	link
2022-11-29	MmWave Mapping and SLAM for 5G and Beyond	Yu Ge et.al.	2211.16024	null
2022-11-28	Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map	Xi Zheng et.al.	2211.15127	null
2022-11-29	BALF: Simple and Efficient Blur Aware Local Feature Detector	Zhenjun Zhao et.al.	2211.14731	null
2022-11-27	Development of a Modular Real-time Shared-control System for a Smart Wheelchair	Vaishanth Ramaraj et.al.	2211.14711	null
2022-11-26	A1 SLAM: Quadruped SLAM using the A1’s Onboard Sensors	Jerred Chen et.al.	2211.14432	link
2022-11-23	ActiveRMAP: Radiance Field for Active Mapping And Planning	Huangying Zhan et.al.	2211.12656	null
2022-11-22	Vision-based localization methods under GPS-denied conditions	Zihao Lu et.al.	2211.11988	null
2022-11-21	Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques	David Ramirez et.al.	2211.11836	null
2022-11-21	ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields	Mohammad Mahdi Johari et.al.	2211.11704	null
2022-11-24	Data Fusion for Multipath-Based SLAM: Combing Information from Multiple Propagation Paths	Erik Leitinger et.al.	2211.09241	null
2022-11-16	Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery	Hao Qu et.al.	2211.08904	null
2022-11-20	Detecting Line Segments in Motion-blurred Images with Events	Huai Yu et.al.	2211.07365	link
2022-11-13	Automatic Eye-in-Hand Calibration using EKF	Aditya Ramakrishnan et.al.	2211.06881	null
2022-11-12	Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling	Zhihao Wang et.al.	2211.06557	link
2022-11-11	Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications	Jie Yang et.al.	2211.05982	null
2022-11-10	Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time	Ignacio Torroba et.al.	2211.05601	link
2022-11-07	When Geometry is not Enough: Using Reflector Markers in Lidar SLAM	Gerhard Kurz et.al.	2211.03484	null
2022-11-07	Detecting Invalid Map Merges in Lifelong SLAM	Matthias Holoch et.al.	2211.03423	null
2022-11-06	Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU	Yibin Wu et.al.	2211.03174	link
2022-11-07	Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments	Daniel Adolfsson et.al.	2211.02445	link
2022-11-03	DyOb-SLAM : Dynamic Object Tracking SLAM System	Rushmian Annoy Wadud et.al.	2211.01941	null
2022-11-03	Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM	Yang Chen et.al.	2211.01749	null
2022-11-04	$D^2$ SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm	Hao Xu et.al.	2211.01538	link
2022-11-02	Semantic SuperPoint: A Deep Semantic Descriptor	Gabriel S. Gama et.al.	2211.01098	link
2022-11-02	Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation	Myung-Hwan Jeon et.al.	2211.00960	link
2022-10-31	Mapping Extended Landmarks for Radar SLAM	Shuai Sun et.al.	2210.17207	null
2022-10-25	MAROAM: Map-based Radar SLAM through Two-step Feature Selection	Dequan Wang et.al.	2210.13797	null
2022-10-25	S3E: A Large-scale Multimodal Dataset for Collaborative SLAM	Dapeng Feng et.al.	2210.13723	link
2022-10-24	NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields	Antoni Rosinol et.al.	2210.13641	link
2022-10-24	Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging	Geng Wang et.al.	2210.13556	null
2022-10-28	VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points	Andreas Georgis et.al.	2210.12756	null
2022-10-22	SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation	Junliang Chen et.al.	2210.12417	null
2022-10-21	DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm	Shipeng Zhong et.al.	2210.11978	link
2022-10-21	Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments	Shubham Kedia et.al.	2210.11652	null
2022-10-22	Visual SLAM: What are the Current Trends and What to Expect?	Ali Tourani et.al.	2210.10491	null
2022-10-18	Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM	Geon Choi et.al.	2210.09636	null
2022-10-16	D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments	Ayman Beghdadi et.al.	2210.08647	null
2022-10-16	Indoor Smartphone SLAM with Learned Echoic Location Features	Wenjie Luo et.al.	2210.08493	null
2022-10-15	Self-Improving SLAM in Dynamic Environments: Learning When to Mask	Adrian Bojko et.al.	2210.08350	link
2022-10-13	Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems	Pushyami Kaveti et.al.	2210.07315	link
2022-10-12	RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map	Xuecheng Xu et.al.	2210.05984	link
2022-10-11	Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization	Yuanzheng He et.al.	2210.05600	null
2022-10-11	Autonomous Asteroid Characterization Through Nanosatellite Swarming	Kaitlin Dennison et.al.	2210.05518	null
2022-10-11	DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion	Yuxi Xiao et.al.	2210.05517	null
2022-10-11	Multi-Object Navigation with dynamically learned neural implicit representations	Pierre Marza et.al.	2210.05129	link
2022-10-12	Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation	Yulun Tian et.al.	2210.05020	null
2022-10-10	Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios	Xingyu Chen et.al.	2210.04562	null
2022-10-09	Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning	Ali Safa et.al.	2210.04236	null
2022-10-06	SCORE: A Second-Order Conic Initialization for Range-Aided SLAM	Alan Papalia et.al.	2210.03177	link
2022-10-06	Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding	Kirill Mazur et.al.	2210.03043	null
2022-10-06	Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence	Osian Morgan et.al.	2210.02642	null
2022-10-05	MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation	Hanwei Zhang et.al.	2210.02038	null
2022-10-04	O2S: Open-source open shuttle	Nwankwo Linus et.al.	2210.01627	null
2022-10-04	Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing	Weiying Wang et.al.	2210.01320	null
2022-10-03	Probabilistic Volumetric Fusion for Dense Monocular SLAM	Antoni Rosinol et.al.	2210.01276	null
2022-10-03	DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams	John McConnell et.al.	2210.00867	link
2022-10-03	A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments	Ha Sier et.al.	2210.00812	link
2022-10-01	Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2	Ali Eslamian et.al.	2210.00278	null
2022-09-30	PyPose: A Library for Robot Learning with Physics-based Optimization	Chen Wang et.al.	2209.15428	link
2022-09-29	DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment	Mariia Gladkova et.al.	2209.14965	null
2022-09-28	Robust Incremental Smoothing and Mapping (riSAM)	Daniel McGann et.al.	2209.14359	null
2022-09-27	Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping	Chi-Ming Chung et.al.	2209.13274	link
2022-09-24	Graph Neural Networks for Multi-Robot Active Information Acquisition	Mariliza Tzes et.al.	2209.12091	null
2022-09-24	Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes	Jonathan J. Y. Kim et.al.	2209.11894	null
2022-09-23	involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs	Gilad Rotman et.al.	2209.11591	null
2022-09-23	Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot	David Balaban et.al.	2209.11432	null
2022-09-22	SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation	Xiao Han et.al.	2209.10817	null
2022-09-22	Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio	Wenhao Qiu et.al.	2209.10726	null
2022-09-21	Visual Localization and Mapping in Dynamic and Changing Environments	João Carlos Virgolino Soares et.al.	2209.10710	null
2022-09-20	Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM	Sabir Hossain et.al.	2209.10047	null
2022-09-20	WGICP: Differentiable Weighted GICP-Based Lidar Odometry	Sanghyun Son et.al.	2209.09777	null
2022-09-20	PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention	José Arce et.al.	2209.09699	link
2022-09-19	MeSLAM: Memory Efficient SLAM based on Neural Fields	Evgenii Kruzhkov et.al.	2209.09357	null
2022-09-19	LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM	Letian Zhang et.al.	2209.08810	null
2022-09-18	HGI-SLAM: Loop Closure With Human and Geometric Importance Features	Shuhul Mujoo et.al.	2209.08608	null
2022-09-18	Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM	Jiarui Tan et.al.	2209.08578	link
2022-09-17	DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments	Shihao Shen et.al.	2209.08430	link
2022-09-17	OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM	Matthieu Zins et.al.	2209.08338	null
2022-09-17	PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments	Adam Dai et.al.	2209.08248	link
2022-09-16	ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM	Aditya Arun et.al.	2209.08091	null
2022-09-16	iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking	Yuhang Ming et.al.	2209.07919	null
2022-09-16	TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM	Mathieu Gonzalez et.al.	2209.07888	null
2022-09-15	Landmark Management in the Application of Radar SLAM	Shuai Sun et.al.	2209.07199	link
2022-09-15	PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization	Xianwei Meng et.al.	2209.07061	null
2022-09-14	Semantic Visual Simultaneous Localization and Mapping: A Survey	Kaiqi Chen et.al.	2209.06428	null
2022-09-13	Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets	Islam Ali et.al.	2209.06316	null
2022-09-12	A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding	Tin Lai et.al.	2209.05222	null
2022-09-12	Attitude-Guided Loop Closure for Cameras with Negative Plane	Ze Wang et.al.	2209.05167	link
2022-09-09	General Place Recognition Survey: Towards the Real-world Autonomy Age	Peng Yin et.al.	2209.04497	link
2022-09-08	ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology	Julio A. Placed et.al.	2209.03693	link
2022-09-08	R $^3$ LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator	Jiarong Lin et.al.	2209.03666	link
2022-09-06	Group- $k$ Consistent Measurement Set Maximization for Robust Outlier Detection	Brendon Forsgren et.al.	2209.02658	link
2022-09-05	Neuromorphic Visual Odometry with Resonator Networks	Alpha Renner et.al.	2209.02000	null
2022-09-05	MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM	Pavel Karpyshev et.al.	2209.01936	null
2022-09-05	ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics	Boyi Liu et.al.	2209.01774	null
2022-09-04	CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud	Evgeny Yudin et.al.	2209.01605	null
2022-08-31	PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM	Yifan Duan et.al.	2208.14848	null
2022-08-30	BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition	Peng Yin et.al.	2208.14543	null
2022-08-27	Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes	Ali Safa et.al.	2208.12997	null
2022-08-25	FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms	Jianhao Jiao et.al.	2208.11865	null
2022-08-25	Lidar SLAM for Autonomous Driving Vehicles	Farhad Aghili et.al.	2208.11855	null
2022-08-24	DynaVINS: A Visual-Inertial SLAM for Dynamic Environments	Seungwon Song et.al.	2208.11500	link
2022-08-22	Doppler Exploitation in Bistatic mmWave Radio SLAM	Yu Ge et.al.	2208.10204	null
2022-08-21	Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping	Lintong Zhang et.al.	2208.09825	link
2022-08-26	JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario	Longrui Dong et.al.	2208.09777	null
2022-08-15	BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM	Yunge Cui et.al.	2208.07473	link
2022-08-12	Handling Constrained Optimization in Factor Graphs for Autonomous Navigation	Barbara Bazzana et.al.	2208.06325	null
2022-08-11	RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild	Jason Y. Zhang et.al.	2208.05963	null
2022-08-08	Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation	Yifei Ren et.al.	2208.04274	link
2022-08-08	SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty	Shuai Zhang et.al.	2208.03945	link
2022-08-05	A Survey on Visual Map Localization Using LiDARs and Cameras	Elhousni Mahdi et.al.	2208.03376	null
2022-08-04	SROS2: Usable Cyber Security Tools for ROS 2	Victor Mayoral Vilches et.al.	2208.02615	link
2022-08-03	Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms	Bharath Garigipati et.al.	2208.02063	null
2022-08-02	Present and Future of SLAM in Extreme Underground Environments	Kamak Ebadi et.al.	2208.01787	null
2022-08-01	Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion	Simon Boche et.al.	2208.00709	null
2022-07-29	Neural Density-Distance Fields	Itsuki Ueda et.al.	2207.14455	link
2022-07-25	DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions	Tristan Laidlow et.al.	2207.12244	null
2022-07-25	Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration	Kenji Koide et.al.	2207.11942	null
2022-07-22	NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction	Yunlong Ran et.al.	2207.10985	null
2022-07-22	Dense RGB-D-Inertial SLAM with Map Deformations	Tristan Laidlow et.al.	2207.10940	null
2022-07-22	PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes	BaoSheng Zhang et.al.	2207.10916	null
2022-07-21	Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion	Suman Ghosh et.al.	2207.10494	link
2022-07-21	Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions	Quentin Serdel et.al.	2207.10489	link
2022-07-21	On applicability of von Karman’s momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity	Yujin Lu et.al.	2207.10413	null
2022-07-19	Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM	Tuvy Lemberg et.al.	2207.09103	null
2022-07-18	DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM	Weicai Ye et.al.	2207.08794	link
2022-07-18	Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction	Marco Orsingher et.al.	2207.08439	null
2022-07-18	ORB-based SLAM accelerator on SoC FPGA	Vibhakar Vemulapati et.al.	2207.08405	null
2022-07-14	Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset	Riccardo Giubilato et.al.	2207.06815	null
2022-07-14	Semi-supervised Vector-Quantization in Visual SLAM using HGCN	Amir Zarringhalam et.al.	2207.06738	null
2022-07-14	Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders	Amir Zarringhalam et.al.	2207.06732	null
2022-07-13	SLAM: SLO-Aware Memory Optimization for Serverless Applications	Gor Safaryan et.al.	2207.06183	null
2022-07-19	Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras	Fangwen Shu et.al.	2207.06058	link
2022-07-12	Accelerating Certifiable Estimation with Preconditioned Eigensolvers	David M. Rosen et.al.	2207.05257	null
2022-07-12	Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features	Meiyu Zhi et.al.	2207.05244	null
2022-07-14	SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial	Chih-Yuan Chiu et.al.	2207.05043	null
2022-07-08	BlindSpotNet: Seeing Where We Cannot See	Taichi Fukuda et.al.	2207.03870	null
2022-07-08	Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints	Philipp Glira et.al.	2207.03785	null
2022-07-08	Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements	Ran Liu et.al.	2207.03700	null
2022-07-07	RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments	Qihao Peng et.al.	2207.03539	null
2022-07-06	VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization	Marius Laska et.al.	2207.02668	null
2022-07-06	A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models	Axel Garcia-Vega et.al.	2207.02396	null
2022-07-04	VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM	Ling Gao et.al.	2207.01404	null
2022-07-04	VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM	Danpeng Chen et.al.	2207.01158	null
2022-07-03	Wireless Channel Prediction in Partially Observed Environments	Mingsheng Yin et.al.	2207.00934	null
2022-07-01	A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers	Julio A. Placed et.al.	2207.00254	null
2022-07-01	Keeping Less is More: Point Sparsification for Visual SLAM	Yeonsoo Park et.al.	2207.00225	null
2022-06-30	Controlled and impulsive compression of an entrapped air bubble during impact	Utkarsh Jain et.al.	2206.15297	null
2022-06-30	Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery	Yuehao Wang et.al.	2206.15255	link
2022-06-27	IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments	Abanob Soliman et.al.	2206.13455	link
2022-06-26	An Efficient Global Optimality Certificate for Landmark-Based SLAM	Connor Holmes et.al.	2206.12961	link
2022-06-21	Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping	Davide Tateo et.al.	2206.10263	link
2022-06-20	Data Fusion for Radio Frequency SLAM with Robust Sampling	Erik Leitinger et.al.	2206.09746	null
2022-06-19	RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments	Chenglong Qian et.al.	2206.09463	null
2022-06-17	Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments	Khairuldanial Ismail et.al.	2206.08733	null
2022-06-17	An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions	Yijun Yuan et.al.	2206.08712	link
2022-06-13	ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy	Hao Bai et.al.	2206.06435	null
2022-06-10	Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming	Javier Cremona et.al.	2206.05066	link
2022-06-09	SparseFormer: Attention-based Depth Completion Network	Frederik Warburg et.al.	2206.04557	null
2022-06-07	Robot Self-Calibration Using Actuated 3D Sensors	Arne Peters et.al.	2206.03430	null
2022-06-07	Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map	Haodong Yuan et.al.	2206.03062	null
2022-06-05	DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions	Alena Savinykh et.al.	2206.02199	null
2022-06-04	C $^3$ Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy	Erez Posner et.al.	2206.01961	null
2022-06-01	PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry	Dong-Uk Seo et.al.	2206.00266	link
2022-05-27	A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching	Arno Solin et.al.	2205.13821	null
2022-05-31	LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments	Yun Chang et.al.	2205.13135	link
2022-05-25	Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM	Milad Ramezani et.al.	2205.12595	null
2022-05-24	Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM	Christopher E. Denniston et.al.	2205.12402	link
2022-05-22	ALITA: A Large-scale Incremental Dataset for Long-term Autonomy	Peng Yin et.al.	2205.10737	link
2022-05-19	FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2	Jeffrey Ichnowski et.al.	2205.09778	link
2022-05-17	Global Data Association for SLAM with 3D Grassmannian Manifold Objects	Parker C. Lusk et.al.	2205.08556	null
2022-05-19	Cluster on Wheels	Yuanyuan Yang et.al.	2205.08151	null
2022-05-12	Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry	Shihao Shen et.al.	2205.05916	link
2022-05-12	S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization	Ran Cheng et.al.	2205.05861	null
2022-05-14	Multi-modal Semantic SLAM for Complex Dynamic Environments	Han Wang et.al.	2205.04300	link
2022-05-06	OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations	Carmen Delgado et.al.	2205.03256	null
2022-05-05	CNN-Augmented Visual-Inertial SLAM with Planar Constraints	Pan Ji et.al.	2205.02940	null
2022-05-05	PMBM-based SLAM Filters in 5G mmWave Vehicular Networks	Hyowon Kim et.al.	2205.02502	null
2022-05-04	BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking	Dorian Henning et.al.	2205.02301	null
2022-05-04	A Global Asymptotic Convergent Observer for SLAM	Seyed Hamed Hashemi et.al.	2205.01953	null
2022-05-04	Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation	Nathaniel Merrill et.al.	2205.01823	link
2022-05-03	GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping	Pan Ji et.al.	2205.01656	null
2022-04-29	Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM	Jinwoo Jeon et.al.	2204.13877	link
2022-04-27	The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection	Konstantinos A. Tsintotas et.al.	2204.12831	null
2022-04-27	Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment	Wenyu Li et.al.	2204.12769	null
2022-04-29	MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment	Tingchen Ma et.al.	2204.11621	null
2022-04-23	Indoor simultaneous localization and mapping based on fringe projection profilometry	Yang Zhao et.al.	2204.11020	null
2022-04-22	Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria	Julio A. Placed et.al.	2204.10631	null
2022-04-22	Fast Autonomous Robotic Exploration Using the Underlying Graph Structure	Julio A. Placed et.al.	2204.10610	null
2022-04-22	Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions	Yutong Hu et.al.	2204.10552	null
2022-04-22	Implicit Object Mapping With Noisy Data	Jad Abou-Chakra et.al.	2204.10516	link
2022-04-19	Photometric single-view dense 3D reconstruction in endoscopy	Victor M. Batlle et.al.	2204.09083	null
2022-04-18	Pulsar skips: Understanding variations in the regular periods of rotating neutron stars	Clayton Miller et.al.	2204.08449	null
2022-04-18	Tracking monocular camera pose and deformation for SLAM inside the human body	Juan J. Gomez Rodriguez et.al.	2204.08309	null
2022-04-18	Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker	Hanjing Ye et.al.	2204.08163	null
2022-04-14	ViViD++: Vision for Visibility Dataset	Alex Junho Lee et.al.	2204.06183	null
2022-04-12	HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud	Zhixing Hou et.al.	2204.05481	null
2022-04-12	RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room	Cong Gao et.al.	2204.05467	null
2022-04-11	Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context	Lizhou Liao et.al.	2204.04932	link
2022-04-04	Monitoring social distancing with single image depth estimation	Alessio Mingozzi et.al.	2204.01693	null
2022-04-01	Bi-directional Loop Closure for Visual SLAM	Ihtisham Ali et.al.	2204.01524	null
2022-04-04	IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers	Lei Sun et.al.	2204.01324	link
2022-04-03	Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor	Wenyan Ou et.al.	2204.01154	null
2022-04-02	UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps	Ayyappa Swamy Thatavarthy et.al.	2204.00865	link
2022-03-31	Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects	Yujie Lu et.al.	2204.00035	null
2022-03-30	GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios	Chih-Yuan Chiu et.al.	2203.16690	null
2022-03-29	Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field	Mostafa Osman et.al.	2203.15866	null
2022-03-29	Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform	Mingjun Li et.al.	2203.15439	null
2022-03-29	Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots	Pranay Mathur et.al.	2203.15272	null
2022-03-28	Are High-Resolution Event Cameras Really Needed?	Daniel Gehrig et.al.	2203.14672	null
2022-03-25	Spectral Measurement Sparsification for Pose-Graph SLAM	Kevin J. Doherty et.al.	2203.13897	link
2022-03-25	FD-SLAM: 3-D Reconstruction Using Features and Dense Matching	Xingrui Yang et.al.	2203.13861	null
2022-03-25	Gravity-constrained point cloud registration	Vladimír Kubelka et.al.	2203.13799	null
2022-03-24	MD-SLAM: Multi-cue Direct SLAM	Luca Di Giammarino et.al.	2203.13237	link
2022-03-24	Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video	Shun Taguchi et.al.	2203.12804	null
2022-03-19	Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems	Jie Yang et.al.	2203.10267	null
2022-03-16	Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR	Ian D. Miller et.al.	2203.08925	link
2022-03-15	Neural RF SLAM for unsupervised positioning and mapping with channel state information	Shreya Kadambi et.al.	2203.08264	null
2022-03-15	Simultaneous Localisation and Mapping with Quadric Surfaces	Tristan Laidlow et.al.	2203.08040	null
2022-03-14	Drift Reduced Navigation with Deep Explainable Features	Mohd Omama et.al.	2203.06897	link
2022-03-11	An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs	Keisuke Sugiura et.al.	2203.05763	null
2022-03-10	High Definition, Inexpensive, Underwater Mapping	Bharat Joshi et.al.	2203.05640	link
2022-03-10	SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning	Jaehoon Choi et.al.	2203.05332	null
2022-03-08	Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM	Pierre-Yves Lajoie et.al.	2203.04446	link
2022-03-08	SLAM-Supported Self-Training for 6D Object Pose Estimation	Ziqi Lu et.al.	2203.04424	link
2022-03-08	An Online Semantic Mapping System for Extending and Enhancing Visual SLAM	Thorsten Hempel et.al.	2203.03944	null
2022-03-07	Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms	Qingqing Li et.al.	2203.03454	link
2022-03-07	OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition	Junyi Ma et.al.	2203.03397	link
2022-03-06	Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM	Kazushi Aiba et.al.	2203.02887	null
2022-03-06	RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects	Ran Long et.al.	2203.02882	null
2022-03-03	STUN: Self-Teaching Uncertainty Estimation for Place Recognition	Kaiwen Cai et.al.	2203.01851	link
2022-03-03	Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning	Niclas Vödisch et.al.	2203.01578	link
2022-03-02	FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry	Chunran Zheng et.al.	2203.00893	link
2022-03-02	Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation	Yulun Tian et.al.	2203.00851	null
2022-03-01	Descriptellation: Deep Learned Constellation Descriptors for SLAM	Chunwei Xing et.al.	2203.00567	null
2022-03-01	Collaborative Robot Mapping using Spectral Graph Analysis	Lukas Bernreiter et.al.	2203.00308	null
2022-02-26	RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization	Nikolaos Kourtzanidis et.al.	2202.13221	link
2022-02-25	Probabilistic Data Association for Semantic SLAM at Scale	Elad Michael et.al.	2202.12802	link
2022-02-24	TwistSLAM: Constrained SLAM in Dynamic Environment	Mathieu Gonzalez et.al.	2202.12384	null
2022-02-24	Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion	Hyeonsoo Jang et.al.	2202.12108	null
2022-02-23	MITI: SLAM Benchmark for Laparoscopic Surgery	Regine Hartwig et.al.	2202.11496	null
2022-02-23	DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization	Xuebo Tian et.al.	2202.11431	null
2022-02-23	Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets	Islam Ali et.al.	2202.11312	null
2022-02-22	SAGE: SLAM with Appearance and Geometry Prior for Endoscopy	Xingtong Liu et.al.	2202.09487	link
2022-02-18	OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure	Stefan Leutenegger et.al.	2202.09199	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146	link
2022-02-18	An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems	Qiang Liu et.al.	2202.08952	null
2022-02-17	Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study	Giovanni Cioffi et.al.	2202.08894	link
2022-02-17	LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building	Jiashi Zhang et.al.	2202.08487	null
2022-02-16	Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments	Jinkun Wang et.al.	2202.08359	null
2022-02-11	Overhead Image Factors for Underwater Sonar-based SLAM	John McConnell et.al.	2202.05811	null
2022-02-10	Scale Estimation with Dual Quadrics for Monocular Object SLAM	Shuangfu Song et.al.	2202.04816	null
2022-02-08	A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition	Nie Jiwei et.al.	2202.03677	null
2022-01-25	Autonomous Vehicles: Open-Source Technologies, Considerations, and Development	Oussama Saoudi et.al.	2202.03148	null
2022-02-07	Temporal Point Cloud Completion with Pose Disturbance	Jieqi Shi et.al.	2202.03084	null
2022-02-04	DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments	Xinggang Hu et.al.	2202.01938	null
2022-02-01	A Model for Multi-View Residual Covariances based on Perspective Deformation	Alejandro Fontan et.al.	2202.00765	null
2022-01-30	Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM	Xinghe Chu et.al.	2201.12726	null
2022-01-28	RGB-D SLAM Using Attention Guided Frame Association	Ali Caglayan et.al.	2201.12047	null
2022-02-04	Learning to Act with Affordance-Aware Multimodal Neural SLAM	Zhiwei Jia et.al.	2201.09862	link
2022-01-22	Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems	Xi Zheng et.al.	2201.09048	link
2022-01-17	SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System	Giseop Kim et.al.	2201.06423	null
2022-01-14	SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions	Ali Samadzadeh et.al.	2201.05386	link
2022-01-19	Multi-Hypothesis Scan Matching through Clustering	Giorgio Iavicoli et.al.	2201.03814	null
2022-01-11	Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM	Kevin J. Doherty et.al.	2201.03773	null
2022-01-10	High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM	Brian M. Hopkinson et.al.	2201.03364	link
2022-01-10	Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition	M. Usman Maqbool Bhutta et.al.	2201.03212	link
2022-01-04	Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds	Xueliang Wen et.al.	2201.00959	null
2021-12-29	Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic	Khen Elimelech et.al.	2112.14428	null
2021-12-19	M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots	Jie Yin et.al.	2112.13659	link
2021-12-27	UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping	Hyunjun Lim et.al.	2112.13515	link
2021-12-25	Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs	Yusheng Wang et.al.	2112.13224	null
2021-12-25	Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping	Peng Huang et.al.	2112.13222	null
2021-12-24	3D Point Cloud Reconstruction and SLAM as an Input	Ziyu Li et.al.	2112.12907	null
2021-12-22	NICE-SLAM: Neural Implicit Scalable Encoding for SLAM	Zihan Zhu et.al.	2112.12130	link
2021-12-18	Fast and Robust Registration of Partially Overlapping Point Clouds	Eduardo Arnold et.al.	2112.09922	link
2021-12-17	Symmetry-aware Neural Architecture for Embodied Visual Navigation	Shuang Liu et.al.	2112.09515	null
2021-12-27	Homography Decomposition Networks for Planar Object Tracking	Xinrui Zhan et.al.	2112.07909	link
2021-12-14	Autonomous Navigation System from Simultaneous Localization and Mapping	Micheal Caracciolo et.al.	2112.07723	link
2021-12-12	360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation	Bolivar Solarte et.al.	2112.06180	link
2021-12-11	Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization	Amay Saxena et.al.	2112.05921	null
2021-12-07	Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems	Gideon Billings et.al.	2112.03826	link
2021-12-05	Iterated Posterior Linearization PMB Filter for 5G SLAM	Yu Ge et.al.	2112.02575	null
2021-12-03	Fast Direct Stereo Visual SLAM	Jiawei Mo et.al.	2112.01890	link
2021-12-02	MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment	Jie Ren et.al.	2112.01349	link
2021-12-01	Research on Event Accumulator Settings for Event-Based SLAM	Kun Xiao et.al.	2112.00427	link
2021-11-29	An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments	Assem Sadek et.al.	2111.14666	null
2021-11-29	Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report	Hartmut Surmann et.al.	2111.14542	null
2021-11-24	Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment	V. Ayala-Alfaro et.al.	2111.12690	null
2021-11-24	Autonomous bot with ML-based reactive navigation for indoor environment	Yash Srivastava et.al.	2111.12542	null
2021-11-22	A General Framework for Lifelong Localization and Mapping in Changing Environment	Min Zhao et.al.	2111.10946	link
2021-11-17	Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network	Xiaoming Zhao et.al.	2111.09006	null
2021-11-10	Comparing dominance of tennis’ big three via multiple-output Bayesian quantile regression models	Bruno Santos et.al.	2111.05631	null
2021-11-10	TomoSLAM: factor graph optimization for rotation angle refinement in microtomography	Mark Griguletskii et.al.	2111.05562	null
2021-11-07	Hierarchical Segment-based Optimization for SLAM	Yuxin Tian et.al.	2111.04101	null
2021-11-07	Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM	Shing Yan Loo et.al.	2111.04096	null
2021-11-05	MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry	Joan P. Company-Corcoles et.al.	2111.03408	null
2021-10-31	Loop closure detection using local 3D deep descriptors	Youjie Zhou et.al.	2111.00440	link
2021-10-27	Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification	Mingsheng Yin et.al.	2110.14789	link
2021-10-27	Efficient Placard Discovery for Semantic Mapping During Frontier Exploration	David Balaban et.al.	2110.14742	null
2021-10-26	Robust Multi-view Registration of Point Sets with Laplacian Mixture Model	Jin Zhang et.al.	2110.13744	null
2021-10-25	WOLF: A modular estimation framework for robotics based on factor graphs	Joan Sola et.al.	2110.12919	null
2021-10-21	Real-Time Ground-Plane Refined LiDAR SLAM	Fan Yang et.al.	2110.11517	null
2021-10-21	SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words	Jonathan J. Y. Kim et.al.	2110.11491	null
2021-10-21	InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion	Zhenkun Zhu et.al.	2110.11040	null
2021-10-20	SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training	Ankur Bapna et.al.	2110.10329	null
2021-10-18	Enhancing exploration algorithms for navigation with visual SLAM	Kirill Muravyev et.al.	2110.09156	null
2021-10-18	Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment	Rui Tian et.al.	2110.08977	null
2021-10-16	Partial Hierarchical Pose Graph Optimization for SLAM	Alexander Korovko et.al.	2110.08639	null
2021-10-14	Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach	Shumon Koga et.al.	2110.07546	null
2021-10-13	Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity	Ran Liu et.al.	2110.06541	null
2021-10-12	Learning Efficient Multi-Agent Cooperative Visual Exploration	Chao Yu et.al.	2110.05734	null
2021-10-07	Self-Supervised Depth Completion for Active Stereo	Frederik Warburg et.al.	2110.03234	null
2021-10-06	InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes	Zhenkun Zhu et.al.	2110.02593	null
2021-10-03	AEROS: Adaptive RObust least-Squares for Graph-Based SLAM	Milad Ramezani et.al.	2110.02018	null
2021-10-04	Fast Uncertainty Quantification for Active Graph SLAM	Julio A. Placed et.al.	2110.01289	link
2021-10-04	Geometry-based Graph Pruning for Lifelong SLAM	Gerhard Kurz et.al.	2110.01286	null
2021-10-03	Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration	Marcus Greiff et.al.	2110.01099	null
2021-10-02	Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows	Qiangqiang Huang et.al.	2110.00876	link

SFM

Publish Date	Title	Authors	PDF	Code
2025-07-22	Sparse-View 3D Reconstruction: Recent Advances and Open Challenges	Tanveer Younis et.al.	2507.16406	null
2025-07-21	Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing	Boni Hu et.al.	2507.15683	null
2025-07-21	Few-Shot Object Detection via Spatial-Channel State Space Model	Zhimeng Xin et.al.	2507.15308	null
2025-07-20	An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks	Xinyi Wu et.al.	2507.14798	null
2025-07-17	Uncertainty Quantification Framework for Aerial and UAV Photogrammetry through Error Propagation	Debao Huang et.al.	2507.13486	null
2025-07-16	Enhancing In-Domain and Out-Domain EmoFake Detection via Cooperative Multilingual Speech Foundation Models	Orchid Chetia Phukan et.al.	2507.12595	null
2025-07-16	BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images	Davide Di Nucci et.al.	2507.12095	null
2025-07-23	Spatial Frequency Modulation for Semantic Segmentation	Linwei Chen et.al.	2507.11893	null
2025-07-20	Supporting SENCOTEN Language Documentation Efforts with Automatic Speech Recognition	Mengzhe Geng et.al.	2507.10827	null
2025-07-11	Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT	Wei Zhang et.al.	2507.08448	null
2025-07-04	MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion	Peilin Tao et.al.	2507.03306	null
2025-06-30	Towards Initialization-free Calibrated Bundle Adjustment	Carl Olsson et.al.	2506.23808	null
2025-06-30	AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention	Ziao Liu et.al.	2506.23611	null
2025-06-27	Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras	Petr Hruby et.al.	2506.22069	null
2025-06-24	ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes	Chenhao Zhang et.al.	2506.21629	null
2025-07-08	Wild refitting for black box prediction	Martin J. Wainwright et.al.	2506.21460	null
2025-06-24	Experimental Assessment of Neural 3D Reconstruction for Small UAV-based Applications	Genís Castillo Gómez-Raya et.al.	2506.19491	null
2025-06-23	ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs	Michal Nazarczuk et.al.	2506.18792	null
2025-06-23	Room temperature spin injection into commercial VCSELs at non-resonant wavelengths	Timur Almabetov et.al.	2506.18376	null
2025-06-11	OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary	Yui Sudo et.al.	2506.09448	null
2025-06-06	SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction	Yuchao Zheng et.al.	2506.05935	null
2025-06-05	On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images	Andreas Meuleman et.al.	2506.05558	null
2025-06-05	SupeRANSAC: One RANSAC to Rule Them All	Daniel Barath et.al.	2506.04803	link
2025-06-04	Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation	Tianyu Huang et.al.	2506.04225	null
2025-06-04	Accelerating SfM-based Pose Estimation with Dominating Set	Joji Joseph et.al.	2506.03667	null
2025-06-03	Nearby dwarf galaxies with extreme star formation rates: a window into dwarf-galaxy evolution in the early Universe	S. Kaviraj et.al.	2506.03265	null
2025-06-02	Fast and Robust Rotation Averaging with Anisotropic Coordinate Descent	Yaroslava Lochman et.al.	2506.01940	null
2025-06-03	Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC	Qingzheng Wang et.al.	2505.24200	null
2025-05-29	Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping	Justin Lazarow et.al.	2505.23756	null
2025-05-30	FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian	Sara Papi et.al.	2505.22759	link
2025-05-28	UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images	Junhuan Liu et.al.	2505.22098	null
2025-05-28	Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule	San Jiang et.al.	2505.22089	null
2025-05-30	Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations	Whenty Ariyanti et.al.	2505.21356	null
2025-05-27	Intern-GS: Vision Model Guided Sparse-View 3D Gaussian Splatting	Xiangyu Sun et.al.	2505.20729	null
2025-05-26	Robust fine-tuning of speech recognition models via model merging: application to disordered speech	Alexandre Ducorroy et.al.	2505.20477	null
2025-05-29	Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud	Natsuki Takama et.al.	2505.19854	null
2025-05-25	Improving Novel view synthesis of 360 $^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images	Guangan Chen et.al.	2505.19264	link
2025-05-24	Token-Level Logits Matter: A Closer Look at Speech Foundation Models for Ambiguous Emotion Recognition	Jule Valendo Halim et.al.	2505.18484	null
2025-05-23	To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models	Simone Gaisbauer et.al.	2505.17973	link
2025-05-23	Corporate Needs You to Find the Difference: Revisiting Submodular and Supermodular Ratio Optimization Problems	Elfarouk Harb et.al.	2505.17443	link
2025-05-23	Tracking the Flight: Exploring a Computational Framework for Analyzing Escape Responses in Plains Zebra (Equus quagga)	Isla Duporge et.al.	2505.16882	link
2025-05-21	A Taxonomy of Structure from Motion Methods	Federica Arrigoni et.al.	2505.15814	null
2025-05-18	Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis	Dong Yang et.al.	2505.12226	null
2025-05-15	Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis	Francisco Raverta Capua et.al.	2505.10751	link
2025-05-13	Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People	Haoshuai Zhou et.al.	2505.08215	null
2025-05-12	RDD: Robust Feature Detector and Descriptor using Deformable Transformer	Gonglin Chen et.al.	2505.08013	null
2025-05-12	Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild	Lintao Xiang et.al.	2505.07373	null
2025-05-11	Symmetry in Fundamental Parameters of Galaxies on the Star-forming Main Sequence	Zhicheng He et.al.	2505.06868	null
2025-05-10	TPK: Trustworthy Trajectory Prediction Integrating Prior Knowledge For Interpretability and Kinematic Feasibility	Marius Baden et.al.	2505.06743	null
2025-05-08	DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion	Qitao Zhao et.al.	2505.05473	null
2025-05-20	FastMap: Revisiting Dense and Scalable Structure from Motion	Jiahao Li et.al.	2505.04612	link
2025-05-15	Estimating the Diameter at Breast Height of Trees in a Forest With a Single 360 Camera	Siming He et.al.	2505.03093	null
2025-05-03	AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting	Junhao Shi et.al.	2505.01799	null
2025-05-03	PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth	Bu Jin et.al.	2505.01729	null
2025-05-01	Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation?	Viktor Kocur et.al.	2505.00866	link
2025-04-29	Large-scale visual SLAM for in-the-wild videos	Shuo Sun et.al.	2504.20496	null
2025-04-29	Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views	Jiang Wu et.al.	2504.20378	link
2025-04-28	MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion	Zador Pataki et.al.	2504.20040	link
2025-04-24	Dynamic Camera Poses and Where to Find Them	Chris Rockwell et.al.	2504.17788	null
2025-04-24	EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy	Haodi Yao et.al.	2504.17280	null
2025-04-23	A Low-Cost Photogrammetry System for 3D Plant Modeling and Phenotyping	Joe Hrzich et.al.	2504.16840	null
2025-04-23	PRaDA: Projective Radial Distortion Averaging	Daniil Sinitsyn et.al.	2504.16499	null
2025-04-21	Traversing the Star-Forming Main Sequence with Molecular Gas Stacks of z~1.6 Cluster Galaxies	Alex Pigarelli et.al.	2504.15381	null
2025-04-21	Towards Understanding Camera Motions in Any Video	Zhiqiu Lin et.al.	2504.15376	null
2025-04-21	StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models	Yeona Hong et.al.	2504.14915	null
2025-04-17	Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering	Landon Dyken et.al.	2504.13339	null
2025-04-15	EDGS: Eliminating Densification for Efficient Convergence of 3DGS	Dmytro Kotovenko et.al.	2504.13204	null
2025-04-15	Deep Learning-based Bathymetry Retrieval without In-situ Depths using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps	Panagiotis Agrafiotis et.al.	2504.11416	link
2025-04-12	A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds	Jizong Peng et.al.	2504.09129	null
2025-04-11	Stereophotoclinometry Revisited	Travis Driver et.al.	2504.08252	null
2025-04-08	Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring	José A. Pilartes-Congo et.al.	2504.06464	null
2025-04-07	Decoding the variability in the star-formation histories of z ~ 0.8 galaxies	Jenny T. Wan et.al.	2504.05281	null
2025-04-05	3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS	Zhisheng Huang et.al.	2504.04294	null
2025-04-04	An Algebraic Geometry Approach to Viewing Graph Solvability	Federica Arrigoni et.al.	2504.03637	null
2025-04-04	Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video	Jiaxin Guo et.al.	2504.03198	null
2025-04-03	Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation	Feng Gao et.al.	2504.02647	link
2025-04-09	FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking	Ulas Gunes et.al.	2504.01732	null
2025-03-31	LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors	Han Zhou et.al.	2504.00219	null
2025-03-30	AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos	Felix Wimbauer et.al.	2503.23282	link
2025-03-24	Ground Penetrating Radar-Assisted Multimodal Robot Odometry Using Subsurface Feature Matrix	Haifeng Li et.al.	2503.18301	null
2025-03-22	3D Modeling: Camera Movement Estimation and path Correction for SFM Model using the Combination of Modified A-SIFT and Stereo System	Usha Kumari et.al.	2503.17668	null
2025-03-25	ProtoGS: Efficient and High-Quality Rendering with 3D Gaussian Prototypes	Zhengqing Gao et.al.	2503.17486	null
2025-03-21	ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration	Johan Edstedt et.al.	2503.17093	link
2025-03-20	From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction	Ayberk Acar et.al.	2503.16263	null
2025-03-22	Euclid Quick Data Release (Q1). A first view of the star-forming main sequence in the Euclid Deep Fields	Euclid Collaboration et.al.	2503.15314	null
2025-03-18	Multi-view Reconstruction via SfM-guided Monocular Depth Estimation	Haoyu Guo et.al.	2503.14483	null
2025-03-18	A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios	Huy-Hoang Bui et.al.	2503.13982	link
2025-03-17	Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios	Iryna Repinetska et.al.	2503.13710	null
2025-03-17	Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization	Yiwei Xu et.al.	2503.13086	null
2025-03-15	SFMNet: Sparse Focal Modulation for 3D Object Detection	Oren Shrout et.al.	2503.12093	null
2025-03-11	A Framework for Reducing the Complexity of Geometric Vision Problems and its Application to Two-View Triangulation with Approximation Bounds	Felix Rydell et.al.	2503.08142	null
2025-03-11	DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection	Johan Edstedt et.al.	2503.07347	link
2025-03-18	Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion	Mona Sheikh Zeinoddin et.al.	2503.07204	null
2025-03-10	VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation	Hanzhi Chen et.al.	2503.07135	null
2025-03-09	AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation	Yang Zou et.al.	2503.06660	null
2025-03-07	LiDAR-enhanced 3D Gaussian Splatting Mapping	Jian Shen et.al.	2503.05425	null
2025-03-06	PLMP – Point-Line Minimal Problems for Projective SfM	Kim Kiehn et.al.	2503.04351	null
2025-03-03	MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Yohann Cabon et.al.	2503.01661	link
2025-03-03	ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization	Anas Abdelkarim et.al.	2503.01311	link
2025-03-05	A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping	Jialei He et.al.	2503.01202	null
2025-03-02	MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain	Rui Yi Yong et.al.	2503.00853	null
2025-03-02	PSRGS:Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery	BoCheng Li et.al.	2503.00848	null
2025-03-02	Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration	Jinjiang You et.al.	2503.00737	link
2025-02-28	The THESAN-ZOOM project: Burst, quench, repeat – unveiling the evolution of high-redshift galaxies along the star-forming main sequence	William McClymont et.al.	2503.00106	null
2025-02-27	Best Foot Forward: Robust Foot Reconstruction in-the-wild	Kyle Fogarty et.al.	2502.20511	null
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932	null
2025-03-04	Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model	Yaxuan Huang et.al.	2502.16779	null
2025-02-20	CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting	Qilin Zhang et.al.	2502.14684	link
2025-02-19	Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections	Seong Jong Yoo et.al.	2502.13986	null
2025-02-19	IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360 $^\circ$ Cameras	Dongki Jung et.al.	2502.12545	null
2025-02-12	Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors	Vishwanath Pratap Singh et.al.	2502.08587	null
2025-02-10	FOCUS – Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences	Oliver Boyne et.al.	2502.06367	link
2025-02-09	Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models	Jing-Xuan Zhang et.al.	2502.05766	link
2025-02-10	Building Rome with Convex Optimization	Haoyu Han et.al.	2502.04640	null
2025-02-04	SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification	Yifu Tao et.al.	2502.02657	null
2025-02-05	GP-GS: Gaussian Processes for Enhanced Gaussian Splatting	Zhihao Guo et.al.	2502.02283	link
2025-02-03	XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications	Shangjin Zhai et.al.	2502.01297	null
2025-01-29	Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment	Zixue Zeng et.al.	2501.17690	link
2025-01-28	Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction	Tim Flückiger et.al.	2501.16221	null
2025-01-25	Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos	Zhen-Hui Dong et.al.	2501.15096	null
2025-01-24	MATCHA:Towards Matching Anything	Fei Xue et.al.	2501.14945	null
2025-01-24	Light3R-SfM: Towards Feed-forward Structure-from-Motion	Sven Elflein et.al.	2501.14914	null
2025-01-24	Dense-SfM: Structure from Motion with Dense Consistent Matching	JongMin Lee et.al.	2501.14277	null
2025-01-21	Theory of quantum-geometric charge and spin Josephson diode effects in strongly spin-polarized hybrid structures with noncoplanar spin textures	Niklas L. Schulz et.al.	2501.12232	null
2025-01-14	Selective Attention Merging for low resource tasks: A case study of Child ASR	Natarajan Balaji Shankar et.al.	2501.08468	link
2025-01-14	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015	null
2025-02-02	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927	link
2025-01-11	Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis	Aditya Rauniyar et.al.	2501.06431	null
2025-01-09	Existence of dynamical fluctuation in AMPT generated data for Au+Au collisions at 10 AGeV	Somen Gope et.al.	2501.05175	null
2025-01-06	Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation	Yuezhang Lv et.al.	2501.02821	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409	null
2025-01-02	EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy	Ao Gao et.al.	2501.01003	null
2024-12-30	KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences	Keng-Wei Chang et.al.	2412.20767	null
2024-12-27	Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images	Xudong Cai et.al.	2412.19518	null
2024-12-25	Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition	Shujie Hu et.al.	2412.18832	null
2024-12-23	Reconstructing People, Places, and Cameras	Lea Müller et.al.	2412.17806	link
2024-12-18	Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation	Rémi Marsal et.al.	2412.14103	null
2024-12-16	Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection	Beomseok Lee et.al.	2412.11978	null
2024-12-18	SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video	Jongmin Park et.al.	2412.09982	null
2024-12-12	CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework	Yushan Han et.al.	2412.08344	null
2024-12-10	Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling	Hui Deng et.al.	2412.07230	null
2024-12-08	Unveiling True Talent: The Soccer Factor Model for Skill Evaluation	Alexandre Andorra et.al.	2412.05911	null
2024-12-08	Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features	Yuanbo Xiangli et.al.	2412.05826	null
2024-12-06	MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Zhengqi Li et.al.	2412.04463	null
2024-12-03	ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification	Pan Zhang et.al.	2412.02044	link
2024-12-02	SfM-Free 3D Gaussian Splatting via Hierarchical Training	Bo Ji et.al.	2412.01553	link
2024-12-02	MVImgNet2.0: A Larger-scale Dataset of Multi-view Images	Xiaoguang Han et.al.	2412.01430	null
2024-12-02	TAS-TsC: A Data-Driven Framework for Estimating Time of Arrival Using Temporal-Attribute-Spatial Tri-space Coordination of Truck Trajectories	Mengran Li et.al.	2412.01122	null
2024-12-02	Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM	Alejandro Fontan et.al.	2412.01116	null
2024-11-27	RoMo: Robust Motion Segmentation Improves Structure from Motion	Lily Goli et.al.	2411.18650	null
2024-11-26	The MAGPI Survey: radial trends in star formation across different cosmological simulations in comparison with observations at $z \sim$ 0.3	Marcie Mun et.al.	2411.17882	null
2024-11-25	Characterizing Stellar and Gas Properties in NGC 628: Spatial Distributions, Radial Gradients, and Resolved Scaling Relations	Peng Wei et.al.	2411.16150	null
2024-11-24	ZeroGS: Training 3D Gaussian Splatting from Unposed Images	Yu Chen et.al.	2411.15779	null
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291	null
2024-11-15	SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction	Yutao Tang et.al.	2411.12592	link
2024-11-15	The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods	Yifu Tao et.al.	2411.10546	null
2024-11-13	4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization	Mijeong Kim et.al.	2411.08879	null
2024-11-13	Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model	Yutao Shen et.al.	2411.08453	null
2024-11-08	From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$ -NeuS	Haoran Zhang et.al.	2411.05362	link
2024-10-29	A Cascade Approach for APT Campaign Attribution in System Event Logs: Technique Hunting and Subgraph Matching	Yi-Ting Huang et.al.	2410.22602	null
2024-10-29	LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues	Hanqing Jiang et.al.	2410.22213	null
2024-10-17	Stochastic Flow Matching for Resolving Small-Scale Physics	Stathi Fotiadis et.al.	2410.19814	null
2024-10-25	A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint	Changshi Mu et.al.	2410.19473	link
2024-10-30	Large Spatial Model: End-to-end Unposed Images to Semantic 3D	Zhiwen Fan et.al.	2410.18956	link
2024-10-23	CO-CAVITY project: Molecular gas and star formation in void galaxies	M. I. Rodríguez et.al.	2410.18078	null
2024-10-23	PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting	Yu Wang et.al.	2410.17505	null
2024-10-20	Neural Active Structure-from-Motion in Dark and Textureless Environment	Kazuto Ichimaru et.al.	2410.15378	null
2024-10-17	SemSim: Revisiting Weak-to-Strong Consistency from a Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation	Shiao Xie et.al.	2410.13486	null
2024-10-16	Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks	Orchid Chetia Phukan et.al.	2410.12947	null
2024-10-16	Gravity-aligned Rotation Averaging with Circular Regression	Linfei Pan et.al.	2410.12763	link
2024-10-16	Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals	Orchid Chetia Phukan et.al.	2410.12645	null
2024-10-15	SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection	Yizhe Liu et.al.	2410.12080	link
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505	null
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187	link
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-10-09	Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models	Ange Lou et.al.	2410.07434	null
2024-10-09	Deep HI Mapping of M 106 Group with FAST	Yao Liu et.al.	2410.07038	null
2024-10-09	MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data	Mingu Kang et.al.	2410.06442	null
2024-10-08	Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation?	Charalambos Tzamos et.al.	2410.05984	link
2024-10-04	Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering	Laura Fink et.al.	2410.03861	link
2024-10-01	MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages	Marco Gaido et.al.	2410.01036	link
2024-10-01	Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance	Hongchao Shu et.al.	2410.00386	null
2024-09-29	Robust Incremental Structure-from-Motion with Hybrid Features	Shaohui Liu et.al.	2409.19811	null
2024-09-27	MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion	Bardienus Duisterhof et.al.	2409.19152	null
2024-09-27	Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras	Yipeng Lu et.al.	2409.18673	null
2024-09-26	BlinkTrack: Feature Tracking over 100 FPS via Events and Images	Yichen Shen et.al.	2409.17981	null
2024-09-25	How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not	Francesco Verdini et.al.	2409.17044	null
2024-09-24	Frequency-based View Selection in Gaussian Splatting Reconstruction	Monica M. Q. Li et.al.	2409.16470	null
2024-10-07	Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion	Juan-Diego Florez et.al.	2409.16465	null
2024-09-24	Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research	Vandita Shukla et.al.	2409.15914	null
2024-09-23	Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments	Francisco Roza de Moraes et.al.	2409.15602	null
2024-09-23	Evaluating Robot Influence on Pedestrian Behavior Models for Crowd Simulation and Benchmarking	Subham Agrawal et.al.	2409.14844	null
2024-09-21	Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models	Orchid Chetia Phukan et.al.	2409.14131	null
2024-09-17	GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module	Yichen Zhang et.al.	2409.11307	null
2024-09-13	Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints	Shan Chen et.al.	2409.08613	null
2024-09-09	KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction	Davide Di Nucci et.al.	2409.05407	null
2024-09-06	The Arizona Molecular ISM Survey with the SMT: Variations in the CO(2-1)/CO(1-0) Line Ratio Across the Galaxy Population	Ryan P. Keenan et.al.	2409.03963	null
2024-09-05	Active Galactic Nuclei in the Green Valley at z $\sim$ 0.7	Charity Woodrum et.al.	2409.03197	null
2024-09-04	Object Gaussian for Monocular 6D Pose Estimation from Sparse Views	Luqing Luo et.al.	2409.02581	null
2024-09-11	Geometry-aware Feature Matching for Large-Scale Structure from Motion	Gonglin Chen et.al.	2409.02310	null
2024-09-04	The study of strongly intensive observables for $π^{\pm,0}$ in $pp$ collisions at LHC energy in the framework of PYTHIA model	Tumpa Biswas et.al.	2409.00525	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373	null
2024-09-15	Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks	Sierra Bonilla et.al.	2408.16445	link
2024-08-21	Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations	Lintong Zhang et.al.	2408.11966	null
2024-08-20	TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks	Jinjie Mai et.al.	2408.10739	null
2024-08-16	Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS	Wei Sun et.al.	2408.08723	null
2024-08-15	CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning	Wei Zhu et.al.	2408.08134	link
2024-08-13	A Miniature Vision-Based Localization System for Indoor Blimps	Shicong Ma et.al.	2408.06648	null
2024-08-07	Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM	Yan Song Hu et.al.	2408.03825	null
2024-08-05	Context-aware Mamba-based Reinforcement Learning for social robot navigation	Syed Muhammad Mustafa et.al.	2408.02661	null
2024-08-04	Birational geometry of critical loci in Algebraic Vision	Marina Bertolini et.al.	2408.02067	null
2024-08-04	PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone	Xin Yang et.al.	2408.02053	null
2024-08-02	Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris	Kentaro Uno et.al.	2408.01035	null
2024-08-01	LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting	Zhenyu Bao et.al.	2408.00254	null
2024-07-29	Global Structure-from-Motion Revisited	Linfei Pan et.al.	2407.20219	link
2024-08-06	Revisit Self-supervised Depth Estimation with Local Structure-from-Motion	Shengjie Zhu et.al.	2407.19166	null
2024-07-23	The Hidden Variables: Harnessing Half-Shell Potentials for Enhanced Precision in Nuclear Reaction Calculations	Hao Liu et.al.	2407.16452	null
2024-07-22	Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures	Ruizhe Wang et.al.	2407.15435	null
2024-07-16	NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models	Francesco Milano et.al.	2407.12207	link
2024-07-15	LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning	Zhuozhu Jian et.al.	2407.10782	null
2024-07-15	Towards Scale-Aware Full Surround Monodepth with Transformers	Yuchen Yang et.al.	2407.10406	null
2024-07-14	3DEgo: 3D Editing on the Go!	Umar Khalid et.al.	2407.10102	null
2024-07-10	Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization	Jinjie Mai et.al.	2407.08023	link
2024-07-10	Euclid preparation. Forecasting the recovery of galaxy physical properties and their relations with template-fitting and machine-learning methods	Euclid Collaboration et.al.	2407.07940	null
2024-07-10	Controlling Space and Time with Diffusion Models	Daniel Watson et.al.	2407.07860	null
2024-07-09	Computer vision tasks for intelligent aerospace missions: An overview	Huilin Chen et.al.	2407.06513	null
2024-07-08	Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views	Jiawei Guo et.al.	2407.05666	null
2024-07-05	Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization	Shaohan Li et.al.	2407.04260	null
2024-07-15	SfM on-the-fly: Get better 3D from What You Capture	Zongqian Zhan et.al.	2407.03939	null
2024-07-03	Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction	Jiaxin Guo et.al.	2407.02918	link
2024-07-02	Indoor 3D Reconstruction with an Unknown Camera-Projector Pair	Zhaoshuai Qi et.al.	2407.01945	null
2024-06-27	SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas	John Lambert et.al.	2406.19390	link
2024-06-27	STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning	Yanan Zhang et.al.	2406.19362	null
2024-06-26	VDG: Vision-Only Dynamic Gaussian for Driving Simulation	Hao Li et.al.	2406.18198	null
2024-06-25	Consensus Learning with Deep Sets for Essential Matrix Estimation	Dror Moran et.al.	2406.17414	link
2024-06-24	Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction	Tong Qin et.al.	2406.16289	null
2024-06-21	The importance of stochasticity in determining galaxy emissivities and UV LFs during cosmic dawn and reionization	Ivan Nikolić et.al.	2406.15237	link
2024-06-19	MVSBoost: An Efficient Point Cloud-based 3D Reconstruction	Umair Haroon et.al.	2406.13515	null
2024-06-17	MegaScenes: Scene-Level View Synthesis at Scale	Joseph Tung et.al.	2406.11819	link
2024-06-15	Benchmarking Children’s ASR with Supervised and Self-supervised Speech Foundation Models	Ruchao Fan et.al.	2406.10507	link
2024-06-14	On the Evaluation of Speech Foundation Models for Spoken Language Understanding	Siddhant Arora et.al.	2406.10083	null
2024-06-12	Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement	Maxime Pietrantoni et.al.	2406.08463	null
2024-06-12	SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models	Chun Yin et.al.	2406.08445	null
2024-06-10	Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis	Xin Jin et.al.	2406.06216	link
2024-06-07	The Star-Forming Main Sequence in JADES and CEERS at $z>1.4$ : Investigating the Burstiness of Star Formation	Leonardo Clarke et.al.	2406.05178	null
2024-06-13	Gaussian Splatting with Localized Points Management	Haosen Yang et.al.	2406.04251	null
2024-06-05	L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration	Yibo Liu et.al.	2406.03298	link
2024-06-04	CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation	Dejia Xu et.al.	2406.02509	null
2024-05-29	Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy	Zijie Jiang et.al.	2405.18863	null
2024-05-29	3D Reconstruction with Fast Dipole Sums	Hanyu Chen et.al.	2405.16788	null
2024-05-26	MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups	Yusen Xie et.al.	2405.16599	null
2024-05-26	Categorical Flow Matching on Statistical Manifolds	Chaoran Cheng et.al.	2405.16441	link
2024-05-22	Exploring Galaxy Properties of eCALIFA with Contrastive Learning	G. Martínez-Solaeche et.al.	2405.13471	null
2024-05-23	Switched Flow Matching: Eliminating Singularities via Switching ODEs	Qunxi Zhu et.al.	2405.11605	null
2024-05-28	NeRO: Neural Road Surface Reconstruction	Ruibo Wang et.al.	2405.10554	link
2024-05-15	Three Dimensional Spatial Cognition: Bees and Bats	Robert Worden et.al.	2405.09413	null
2024-05-09	Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media	Zhizhen Zhang et.al.	2405.05760	null
2024-05-09	Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment	Simon Weber et.al.	2405.05079	link
2024-05-07	Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications	Markus Hillemann et.al.	2405.04345	null
2024-05-07	Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling	Jiawei Shi et.al.	2405.04309	null
2024-05-06	Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion	Yunfeng Li et.al.	2405.03177	link
2024-05-03	HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2	Miriam Jäger et.al.	2405.02005	null
2024-04-25	The MAGPI Survey: Evolution of radial trends in star formation activity across cosmic time	Marcie Mun et.al.	2404.16319	null
2024-04-22	Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer	Eric Brachmann et.al.	2404.14351	null
2024-04-22	RESFM: Robust Equivariant Multiview Structure from Motion	Fadi Khatib et.al.	2404.14280	null
2024-04-22	Does Gaussian Splatting need SFM Initialization?	Yalda Foroutan et.al.	2404.12547	null
2024-05-07	A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion	Feng Yu et.al.	2404.11590	link
2024-04-18	DeblurGS: Gaussian Splatting for Camera Motion Blur	Jeongtaek Oh et.al.	2404.11358	null
2024-05-21	LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives	Jiadi Cui et.al.	2404.09748	null
2024-04-12	MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance	Yuqun Wu et.al.	2404.08252	null
2024-04-11	Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation	Keonhee Han et.al.	2404.07933	null
2024-04-07	NeRF2Points: Large-Scale Point Cloud Generation From Street Views’ Radiance Field Optimization	Peng Tu et.al.	2404.04875	null
2024-04-04	GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis	Emmanouil Nikolakakis et.al.	2404.03126	null
2024-03-29	InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds	Zhiwen Fan et.al.	2403.20309	link
2024-03-29	HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes	Zhuopeng Li et.al.	2403.20032	null
2024-03-26	NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation	Jiahao Chen et.al.	2403.17537	null
2024-03-25	INPC: Implicit Neural Point Clouds for Radiance Field Rendering	Florian Hahlbohm et.al.	2403.16862	null
2024-03-18	An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation	Zewen Xu et.al.	2403.11639	null
2024-03-14	Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting	Jaewoo Jung et.al.	2403.09413	link
2024-03-13	Refractive COLMAP: Refractive Structure-from-Motion Revisited	Mengkun She et.al.	2403.08640	null
2024-03-13	NeRF-Supervised Feature Point Detection and Description	Ali Youssef et.al.	2403.08156	link
2024-03-11	SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection	Yifu Tao et.al.	2403.06877	null
2024-03-24	BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling	Cheng Peng et.al.	2403.04926	link
2024-02-22	GaussianPro: 3D Gaussian Splatting with Progressive Propagation	Kai Cheng et.al.	2402.14650	null
2024-02-25	A Robust Error-Resistant View Selection Method for 3D Reconstruction	Shaojie Zhang et.al.	2402.11431	null
2024-02-17	Dense Matchers for Dense Tracking	Tomáš Jelínek et.al.	2402.11287	null
2024-03-11	Local Feature Matching Using Deep Learning: A Survey	Shibiao Xu et.al.	2401.17592	link
2024-01-22	HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs	Zelin Gao et.al.	2401.11711	null
2024-01-19	SCENES: Subpixel Correspondence Estimation With Epipolar Supervision	Dominik A. Kloepfer et.al.	2401.10886	null
2024-01-15	3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data	Mathilde Letard et.al.	2401.09481	link
2024-01-17	3D Scene Geometry Estimation from 360 $^\circ$ Imagery: A Survey	Thiago Lopes Trugillo da Silveira et.al.	2401.09252	null
2024-01-17	ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization	Weiyao Wang et.al.	2401.08937	null
2024-01-16	Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions	Yi-Fan Zuo et.al.	2401.08043	link
2024-01-10	Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects	Tianhang Cheng et.al.	2401.05236	link
2024-01-07	A Classification of Critical Configurations for any Number of Projective Views	Martin Bråtelund et.al.	2401.03450	link
2023-12-24	Residual Learning for Image Point Descriptors	Rashik Shrestha et.al.	2312.15471	null
2023-12-16	Transformers in Unsupervised Structure-from-Motion	Hemang Chawla et.al.	2312.10529	link
2023-12-14	HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video	Xueying Wang et.al.	2312.08863	null
2023-12-14	CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning	Qingsong Yan et.al.	2312.08760	null
2023-12-11	Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach	Travis Driver et.al.	2312.06865	link
2023-12-11	Gaussian Splatting SLAM	Hidenobu Matsuki et.al.	2312.06741	null
2023-12-10	SuperPrimitive: Scene Reconstruction at a Primitive Level	Kirill Mazur et.al.	2312.05889	null
2023-12-07	Visual Geometry Grounded Deep Structure From Motion	Jianyuan Wang et.al.	2312.04563	null
2023-11-30	Distributed Global Structure-from-Motion with a Deep Front-End	Ayush Baid et.al.	2311.18801	link
2023-11-21	Robot Hand-Eye Calibration using Structure-from-Motion	Nicolas Andreff et.al.	2311.11808	null
2023-11-18	LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation	Sébastien Henry et.al.	2311.11171	null
2023-11-10	MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty	Rémi Marsal et.al.	2311.06137	link
2023-11-08	VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering	Linus Franke et.al.	2311.04634	link
2023-10-22	A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video	Jan Emily Mangulabnan et.al.	2310.14364	null
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605	null
2023-10-09	Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration	Chunge Bai et.al.	2310.05504	link
2023-10-08	LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization	Artem Nenashev et.al.	2310.05134	null
2023-11-29	Pose-Free Generalizable Rendering Transformer	Zhiwen Fan et.al.	2310.03704	link
2023-10-02	Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images	Georg Bökman et.al.	2310.01092	null
2023-10-01	Propagating Semantic Labels in Video Data	David Balaban et.al.	2310.00783	null
2023-09-22	Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning	Jonathan Sauder et.al.	2309.12804	null
2023-09-21	On-the-Fly SfM: What you capture is What you get	Zongqian Zhan et.al.	2309.11883	link
2023-09-19	Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water	Jayesh Tripathi et.al.	2309.10269	null
2023-09-16	DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF	Mert Asim Karaoglu et.al.	2309.08927	link
2023-09-08	Robot Localization and Mapping Final Report – Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry	Akankshya Kar et.al.	2309.04147	null
2023-09-01	SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation	Youhong Wang et.al.	2309.00526	null
2023-09-01	Dense Voxel 3D Reconstruction Using a Monocular Event Camera	Haodong Chen et.al.	2309.00385	null
2023-08-30	Learning Structure-from-Motion with Graph Attention Networks	Lucas Brynte et.al.	2308.15984	link
2023-08-26	Disjoint Pose and Shape for 3D Face Reconstruction	Raja Kumar et.al.	2308.13903	null
2023-08-30	CamP: Camera Preconditioning for Neural Radiance Fields	Keunhong Park et.al.	2308.10902	null
2023-08-18	Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling	Haorui Ji et.al.	2308.10705	null
2023-08-14	Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation	Tao Liu et.al.	2308.07231	link
2023-08-11	Efficient Large-scale AUV-based Visual Seafloor Mapping	Mengkun She et.al.	2308.06147	null
2023-08-04	EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems	Weihan Wang et.al.	2308.02670	null
2023-08-15	Tirtha – An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites	Jyotirmaya Shivottam et.al.	2308.01246	link
2023-08-02	Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network	Shenbagaraj Kannapiran et.al.	2308.01125	null
2023-07-27	PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking	Yang Zheng et.al.	2307.15055	link
2023-07-28	SACReg: Scene-Agnostic Coordinate Regression for Visual Localization	Jerome Revaud et.al.	2307.11702	null
2023-07-19	Lazy Visual Localization via Motion Averaging	Siyan Dong et.al.	2307.09981	null
2023-07-10	Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor	San Jiang et.al.	2307.04520	null
2023-07-07	RGB-D Mapping and Tracking in a Plenoxel Radiance Field	Andreas L. Teigen et.al.	2307.03404	link
2023-06-29	The Drunkard’s Odometry: Estimating Camera Motion in Deforming Scenes	David Recasens et.al.	2306.16917	link
2023-06-27	Detector-Free Structure from Motion	Xingyi He et.al.	2306.15669	link
2023-06-28	PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment	Jianyuan Wang et.al.	2306.15667	null
2023-06-24	3D Reconstruction of Spherical Images based on Incremental Structure from Motion	San Jiang et.al.	2306.12770	link
2023-06-15	NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations	Varun Jampani et.al.	2306.09109	link
2023-06-15	Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization	Dror Aiger et.al.	2306.09012	link
2023-06-10	3D reconstruction using Structure for Motion	Kshitij Karnawat et.al.	2306.06360	link
2023-06-02	Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images	Marcela Mera-Trujillo et.al.	2306.01938	null
2023-05-31	FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow	Cameron Smith et.al.	2306.00180	null
2023-05-19	SIDAR: Synthetic Image Dataset for Alignment & Restoration	Monika Kwiatkowski et.al.	2305.12036	link
2023-05-09	Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization	Clémentin Boittiaux et.al.	2305.05301	link
2023-05-09	Rotation Synchronization via Deep Matrix Factorization	Gk Tejus et.al.	2305.05268	link
2023-04-20	A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion	Miriam Jäger et.al.	2304.10664	null
2023-04-14	Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments	Felix Ott et.al.	2304.07250	null
2023-04-12	Visual Localization using Imperfect 3D Models from the Internet	Vojtech Panek et.al.	2304.05947	link
2023-04-08	Photometric Correction for Infrared Sensors	Jincheng Zhang et.al.	2304.03930	null
2023-04-07	DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium	Antyanta Bangunharcana et.al.	2304.03560	link
2023-04-05	Semantic Validation in Structure from Motion	Joseph Rowell et.al.	2304.02420	link
2023-03-31	Learning Internal Representations of 3D Transformations from 2D Projected Inputs	Marissa Connor et.al.	2303.17776	null
2023-03-30	3D Line Mapping Revisited	Shaohui Liu et.al.	2303.17504	link
2023-03-27	TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering	Jaehoon Choi et.al.	2303.15060	null
2023-03-26	On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks	HyunJun Jung et.al.	2303.14840	link
2023-03-24	Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container	Jinguang Tong et.al.	2303.13805	link
2023-03-24	Progressively Optimized Local Radiance Fields for Robust View Synthesis	Andreas Meuleman et.al.	2303.13791	null
2023-03-15	RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters	Shuja Khalid et.al.	2303.08695	null
2023-03-09	Revisiting Rotation Averaging: Uncertainties and Robust Losses	Ganlin Zhang et.al.	2303.05195	link
2023-02-28	Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images	Zhongli Fan et.al.	2302.14239	link
2023-03-25	BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling	Sameera Ramasinghe et.al.	2302.13543	null
2023-02-21	EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images	Zhichao Ye et.al.	2302.10544	link
2023-02-18	Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering	Tatsuro Yamane et.al.	2302.09208	null
2023-02-12	Uncertainty-Driven Dense Two-View Structure from Motion	Weirong Chen et.al.	2302.00523	null
2023-01-28	AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion	Yu Chen et.al.	2301.12135	null
2023-01-20	A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles	Zhefan Xu et.al.	2301.08422	link
2023-03-21	Robust Dynamic Radiance Fields	Yu-Lun Liu et.al.	2301.02239	link
2022-12-24	Polarimetric Multi-View Inverse Rendering	Jinyu Zhao et.al.	2212.12721	null
2022-12-13	Accidental Turntables: Learning 3D Pose by Watching Objects Turn	Zezhou Cheng et.al.	2212.06300	null
2022-12-04	3D Object Aided Self-Supervised Monocular Depth Estimation	Songlin Wei et.al.	2212.01768	null
2022-12-02	High-Res Facial Appearance Capture from Polarized Smartphone Images	Dejan Azinović et.al.	2212.01160	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069	link
2022-11-24	JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models	Sepidehsadat Hosseini et.al.	2211.13785	null
2022-11-24	SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks	Sergio Izquierdo et.al.	2211.13551	link
2022-11-22	Level-S $^2$ fM: Structure from Motion on Neural Level Set of Implicit Surfaces	Yuxi Xiao et.al.	2211.12018	link
2022-11-21	Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques	David Ramirez et.al.	2211.11836	null
2022-11-14	Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion	René Haas et.al.	2211.07195	null
2022-10-13	Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach	Zhiang Chen et.al.	2210.07349	null
2022-10-11	DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion	Yuxi Xiao et.al.	2210.05517	null
2022-10-07	Leveraging Structure from Motion to Localize Inaccessible Bus Stops	Indu Panigrahi et.al.	2210.03646	link
2022-10-01	Structure-Aware NeRF without Posed Camera via Epipolar Constraint	Shu Chen et.al.	2210.00183	link
2022-10-05	FAST-LIO, Then Bayesian ICP, Then GTSFM	Jerred Chen et.al.	2210.00146	null
2022-09-20	BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction	Ahalya Ravendran et.al.	2209.09470	null
2022-09-19	A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion	Gerry Chen et.al.	2209.08690	null
2022-09-14	End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes	Qiao Chen et.al.	2209.06926	null
2022-09-07	Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021	Hartmut Surmann et.al.	2209.03084	null
2022-08-27	Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data	Thomas A. Ciarfuglia et.al.	2208.13001	null
2022-08-12	Handling Constrained Optimization in Factor Graphs for Autonomous Navigation	Barbara Bazzana et.al.	2208.06325	null
2022-08-04	Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training	Yao-Chih Lee et.al.	2208.02709	link
2022-07-31	One Object at a Time: Accurate and Robust Structure From Motion for Robots	Aravind Battaje et.al.	2208.00487	null
2022-07-23	Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks	Daniel Posada et.al.	2207.11413	null
2022-07-25	MeshLoc: Mesh-Based Visual Localization	Vojtech Panek et.al.	2207.10762	link
2022-07-19	ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild	Wang Zhao et.al.	2207.09137	link
2022-07-16	Organic Priors in Non-Rigid Structure from Motion	Suryansh Kumar et.al.	2207.06262	null
2022-07-06	A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models	Axel Garcia-Vega et.al.	2207.02396	null
2022-06-24	Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set	San Jiang et.al.	2206.11499	null
2022-06-13	TC-SfM: Robust Track-Community-Based Structure-from-Motion	Lei Wang et.al.	2206.05866	null
2022-06-10	EigenFairing: 3D Model Fairing using Image Coherence	Pragyana Mishra et.al.	2206.05309	null
2022-06-01	Semantic Room Wireframe Detection from a Single View	David Gillsjö et.al.	2206.00491	link
2022-05-31	Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction	Qiancheng Fu et.al.	2205.15848	null
2022-05-09	Is my Depth Ground-Truth Good Enough? HAMMER – Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression	HyunJun Jung et.al.	2205.04565	null
2022-05-07	Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs	Pedro F. Proença et.al.	2205.03522	null
2022-05-06	EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms	Levi Burner et.al.	2205.03467	null
2022-04-20	Learned Monocular Depth Priors in Visual-Inertial Initialization	Yunwen Zhou et.al.	2204.09171	null
2022-04-10	Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective	Hui Deng et.al.	2204.04730	null
2022-04-08	Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems	Debao Huang et.al.	2204.04145	null
2022-04-07	SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation	Yi Wei et.al.	2204.03636	link
2022-04-06	Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion	Lukas Bommes et.al.	2204.02733	link
2022-04-05	Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows	Sheng Liu et.al.	2204.02509	link
2022-03-31	Fast, Accurate and Memory-Efficient Partial Permutation Synchronization	Shaohan Li et.al.	2203.16505	null
2022-03-28	Visual Odometry for RGB-D Cameras	Afonso Fontes et.al.	2203.15119	null
2022-03-28	Optimizing Elimination Templates by Greedy Parameter Search	Evgeniy Martyushev et.al.	2203.14901	link
2022-03-23	Event-Based Dense Reconstruction Pipeline	Kun Xiao et.al.	2203.12270	null
2022-03-21	DiffPoseNet: Direct Differentiable Camera Pose Estimation	Chethan M. Parameshwara et.al.	2203.11174	null
2022-03-02	Asynchronous Optimisation for Event-based Visual Odometry	Daqi Liu et.al.	2203.01037	null
2022-03-02	Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation	Yulun Tian et.al.	2203.00851	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146	link
2022-01-20	GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry	Yunhan Zhao et.al.	2201.08131	null
2022-01-13	Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching	Yunpeng Shi et.al.	2201.04797	link
2022-01-10	High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM	Brian M. Hopkinson et.al.	2201.03364	link
2022-01-06	De-rendering 3D Objects in the Wild	Felix Wimbauer et.al.	2201.02279	link
2021-12-29	On the Instability of Relative Pose Estimation and RANSAC’s Role	Hongyi Fan et.al.	2112.14651	null
2021-12-16	Road-aware Monocular Structure from Motion and Homography Estimation	Wei Sui et.al.	2112.08635	null
2021-12-10	Critical configurations for three projective views	Martin Bråtelund et.al.	2112.05478	null
2021-12-09	Critical configurations for two projective views, a new approach	Martin Bråtelund et.al.	2112.05074	null
2021-12-06	Dense Depth Priors for Neural Radiance Fields from Sparse Input Views	Barbara Roessle et.al.	2112.03288	link
2021-12-10	MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment	Jie Ren et.al.	2112.01349	link
2021-11-11	Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft	Pascal Schoppmann et.al.	2111.06271	null
2021-11-10	Damage Estimation and Localization from Sparse Aerial Imagery	Rene Garcia Franceschini et.al.	2111.03708	null
2021-11-03	Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems	Swarnabja Bhaumik et.al.	2111.02064	null
2021-10-14	Modeling dynamic target deformation in camera calibration	Annika Hagemann et.al.	2110.07322	null
2021-10-13	Hyperspectral 3D Mapping of Underwater Environments	Maxime Ferrera et.al.	2110.06571	null
2021-09-24	Automatic Map Update Using Dashcam Videos	Aziza Zhanabatyrova et.al.	2109.12131	null
2021-09-16	Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs	Gabriel Moreira et.al.	2109.08046	link
2021-09-06	Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications	Tejas Mane et.al.	2109.02740	null
2021-09-02	Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency	Beatrix-Emőke Fülöp-Balogh et.al.	2109.01018	null
2021-09-01	On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation	Eric Brachmann et.al.	2109.00524	link
2021-08-31	DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension	Roman Shapovalov et.al.	2109.00033	null
2021-08-29	Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration	Seyed-Mahdi Nasiri et.al.	2108.12876	null
2021-08-23	Burst Imaging for Light-Constrained Structure-From-Motion	Ahalya Ravendran et.al.	2108.09895	null

Visual Localization

Publish Date	Title	Authors	PDF	Code
2025-07-23	VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization	Sania Waheed et.al.	2507.17455	null
2025-07-23	Content-based 3D Image Retrieval and a ColBERT-inspired Re-ranking for Tumor Flagging and Staging	Farnaz Khun Jush et.al.	2507.17412	null
2025-07-20	LoopNet: A Multitasking Few-Shot Learning Approach for Loop Closure in Large Scale SLAM	Mohammad-Maher Nakshbandi et.al.	2507.15109	null
2025-07-20	Visual Place Recognition for Large-Scale UAV Applications	Ioannis Tsampikos Papapetros et.al.	2507.15089	null
2025-07-20	U-MARVEL: Unveiling Key Factors for Universal Multimodal Retrieval via Embedding Learning with MLLMs	Xiaojie Li et.al.	2507.14902	null
2025-07-19	OptiCorNet: Optimizing Sequence-Based Context Correlation for Visual Place Recognition	Zhenyu Li et.al.	2507.14477	null
2025-07-16	Developing an AI-Guided Assistant Device for the Deaf and Hearing Impaired	Jiayu et.al.	2507.14215	null
2025-07-17	FAR-Net: Multi-Stage Fusion Network with Enhanced Semantic Alignment and Adaptive Reconciliation for Composed Image Retrieval	Jeong-Woo Park et.al.	2507.12823	null
2025-07-17	MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval	Jeong-Woo Park et.al.	2507.12819	null
2025-07-16	QuRe: Query-Relevant Retrieval through Hard Negative Sampling in Composed Image Retrieval	Jaehyun Kwak et.al.	2507.12416	null
2025-07-16	CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence Pruning	Peiwen Xia et.al.	2507.11834	null
2025-07-09	Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning	Konstantinos I. Roumeliotis et.al.	2507.10571	null
2025-07-14	GT-Loc: Unifying When and Where in Images Through a Joint Embedding Space	David G. Shatwell et.al.	2507.10473	null
2025-07-14	Text-to-Remote-Sensing-Image Retrieval beyond RGB Sources	Daniele Rege Cambrin et.al.	2507.10403	null
2025-07-14	Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures	Xinlong Ding et.al.	2507.10265	null
2025-07-11	RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features	Inye Na et.al.	2507.08546	null
2025-07-11	LiDAR, GNSS and IMU Sensor Alignment through Dynamic Time Warping to Construct 3D City Maps	Haitian Wang et.al.	2507.08420	null
2025-07-11	Deep Hashing with Semantic Hash Centers for Image Retrieval	Li Chen et.al.	2507.08404	null
2025-07-08	Unveiling Effective In-Context Configurations for Image Captioning: An External & Internal Analysis	Li Li et.al.	2507.08021	null
2025-07-10	SCREP: Scene Coordinate Regression and Evidential Learning-based Perception-Aware Trajectory Generation	Juyeop Han et.al.	2507.07467	null
2025-07-10	VP-SelDoA: Visual-prompted Selective DoA Estimation of Target Sound via Semantic-Spatial Matching	Yu Chen et.al.	2507.07384	null
2025-07-08	FACap: A Large-scale Fashion Dataset for Fine-grained Composed Image Retrieval	François Gardères et.al.	2507.07135	null
2025-07-09	Evaluating Attribute Confusion in Fashion Text-to-Image Generation	Ziyue Liu et.al.	2507.07079	null
2025-07-09	MS-DPPs: Multi-Source Determinantal Point Processes for Contextual Diversity Refinement of Composite Attributes in Text to Image Retrieval	Naoya Sogi et.al.	2507.06654	null
2025-07-08	Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval	Haiwen Li et.al.	2507.05970	null
2025-07-08	OFFSET: Segmentation-based Focus Shift Revision for Composed Image Retrieval	Zhiwei Chen et.al.	2507.05631	null
2025-07-07	Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model	Mengyao Xu et.al.	2507.05513	null
2025-07-07	An analysis of vision-language models for fabric retrieval	Francesco Giuliari et.al.	2507.04735	null
2025-07-08	What’s Making That Sound Right Now? Video-centric Audio-Visual Localization	Hahyeon Choi et.al.	2507.04667	null
2025-07-07	Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR	Tao Du et.al.	2507.04662	null
2025-07-06	U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration	Xiaofan Li et.al.	2507.04503	null
2025-07-04	Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition	Jiuhong Xiao et.al.	2507.03831	null
2025-07-01	LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment	Juelin Zhu et.al.	2507.00659	null
2025-06-28	Utilizing a Novel Deep Learning Method for Scene Categorization in Remote Sensing Data	Ghufran A. Omran et.al.	2506.22939	null
2025-06-28	Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval	Li-Cheng Shen et.al.	2506.22864	null
2025-06-27	MatChA: Cross-Algorithm Matching with Feature Augmentation	Paula Carbó Cubero et.al.	2506.22336	null
2025-06-26	OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography	Caoshuo Li et.al.	2506.21101	null
2025-06-25	Visualizing intercalation effects in 2D materials using AFM based techniques	Karmen Kapustić et.al.	2506.20467	null
2025-06-25	On the Burstiness of Faces in Set	Jiong Wang et.al.	2506.20312	null
2025-06-24	jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval	Michael Günther et.al.	2506.18902	null
2025-06-26	Referring Expression Instance Retrieval and A Strong End-to-End Baseline	Xiangzhao Hao et.al.	2506.18246	null
2025-06-20	Class Agnostic Instance-level Descriptor for Visual Instance Search	Qi-Ying Sun et.al.	2506.16745	null
2025-06-19	MambaHash: Visual State Space Deep Hashing Model for Large-Scale Image Retrieval	Chao He et.al.	2506.16353	link
2025-06-19	Fine-grained Image Retrieval via Dual-Vision Adaptation	Xin Jiang et.al.	2506.16273	null
2025-06-19	Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot Navigation	Connor Malone et.al.	2506.15988	link
2025-06-18	Semantic and Feature Guided Uncertainty Quantification of Visual Localization for Autonomous Vehicles	Qiyuan Wu et.al.	2506.15851	null
2025-06-18	ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections	Ziling Huang et.al.	2506.15180	null
2025-06-17	HARMONY: A Scalable Distributed Vector Database for High-Throughput Approximate Nearest Neighbor Search	Qian Xu et.al.	2506.14707	null
2025-06-17	TACS-Graphs: Traversability-Aware Consistent Scene Graphs for Ground Robot Indoor Localization and Mapping	Jeewon Kim et.al.	2506.14178	null
2025-06-16	A Semantically-Aware Relevance Measure for Content-Based Medical Image Retrieval Evaluation	Xiaoyang Wei et.al.	2506.13509	null
2025-06-19	Hierarchical Multi-Positive Contrastive Learning for Patent Image Retrieval	Kshitij Kavimandan et.al.	2506.13496	null
2025-06-16	EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition	Bingxi Liu et.al.	2506.13133	null
2025-06-16	SuperPlace: The Renaissance of Classical Feature Aggregation for Visual Place Recognition in the Era of Foundation Models	Bingxi Liu et.al.	2506.13073	null
2025-06-14	Feature Complementation Architecture for Visual Place Recognition	Weiwei Wang et.al.	2506.12401	null
2025-06-11	Towards a general-purpose foundation model for fMRI analysis	Cheng Wang et.al.	2506.11167	null
2025-06-11	Improving Personalized Search with Regularized Low-Rank Parameter Updates	Fiona Ryan et.al.	2506.10182	link
2025-06-10	Safeguarding Multimodal Knowledge Copyright in the RAG-as-a-Service Environment	Tianyu Chen et.al.	2506.10030	link
2025-06-11	Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints	Xiangkai Zhang et.al.	2506.09748	null
2025-06-10	Robust Visual Localization via Semantic-Guided Multi-Scale Transformer	Zhongtao Tian et.al.	2506.08526	null
2025-06-08	Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs	Yikun Ji et.al.	2506.07045	null
2025-06-07	Zero Shot Composed Image Retrieval	Santhosh Kakarla et.al.	2506.06602	null
2025-06-06	GenIR: Generative Visual Feedback for Mental Image Retrieval	Diji Yang et.al.	2506.06220	null
2025-06-06	Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning	Sheng Chen et.al.	2506.06205	null
2025-06-05	HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition	Suhan Woo et.al.	2506.04764	null
2025-06-05	Deep Learning Reforms Image Matching: A Survey and Outlook	Shihua Zhang et.al.	2506.04619	null
2025-06-02	Entity Image and Mixed-Modal Image Retrieval Datasets	Cristian-Ioan Blaga et.al.	2506.02291	null
2025-06-01	Quantization-based Bounds on the Wasserstein Metric	Jonathan Bobrutsky et.al.	2506.00976	null
2025-05-30	SORCE: Small Object Retrieval in Complex Environments	Chunxu Liu et.al.	2505.24441	link
2025-05-29	Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch	Aneeshan Sain et.al.	2505.23763	null
2025-05-28	4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians	Hidenobu Matsuki et.al.	2505.22859	null
2025-05-28	UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images	Junhuan Liu et.al.	2505.22098	null
2025-05-28	Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule	San Jiang et.al.	2505.22089	null
2025-05-27	Visual Loop Closure Detection Through Deep Graph Consensus	Martin Büchner et.al.	2505.21754	null
2025-05-27	QuARI: Query Adaptive Retrieval Improvement	Eric Xing et.al.	2505.21647	null
2025-05-27	ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval	Eric Xing et.al.	2505.20764	link
2025-05-26	Visualized Text-to-Image Retrieval	Di Wu et.al.	2505.20291	link
2025-05-26	Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval	Rong-Cheng Tu et.al.	2505.19952	null
2025-05-26	Can Visual Encoder Learn to See Arrows?	Naoyuki Terashita et.al.	2505.19944	null
2025-05-26	MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval	Rong-Cheng Tu et.al.	2505.19707	null
2025-05-24	Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU	Yicheng Lin et.al.	2505.18652	link
2025-05-24	TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP	Yuliang Cai et.al.	2505.18434	null
2025-05-23	ImLPR: Image-based LiDAR Place Recognition using Vision Foundation Models	Minwoo Jung et.al.	2505.18364	null
2025-05-23	DART $^3$ : Leveraging Distance for Test Time Adaptation in Person Re-Identification	Rajarshi Bhattacharya et.al.	2505.18337	null
2025-05-23	To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models	Simone Gaisbauer et.al.	2505.17973	link
2025-05-23	DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval	Yuxin Yang et.al.	2505.17796	null
2025-05-23	CU-Multi: A Dataset for Multi-Robot Data Association	Doncey Albin et.al.	2505.17576	null
2025-05-22	TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition	Oliver Grainge et.al.	2505.16447	null
2025-05-21	Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval	Siting Li et.al.	2505.15877	null
2025-05-21	SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph Retrieval	Nikolaos Chaidos et.al.	2505.15867	link
2025-05-20	Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models	Kiarash Naghavi Khanghah et.al.	2505.13828	null
2025-05-18	MMS-VPR: Multimodal Street-Level Visual Place Recognition Dataset and Benchmark	Yiwei Ou et.al.	2505.12254	null
2025-05-16	Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization	Aaron Wilhelm et.al.	2505.11620	null
2025-05-16	Redundancy-Aware Pretraining of Vision-Language Foundation Models in Remote Sensing	Mathis Jürgen Adler et.al.	2505.11121	null
2025-05-04	OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery	Chongsheng Zhang et.al.	2505.03836	link
2025-05-06	Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions	Lukas Schichler et.al.	2505.03565	null
2025-05-06	LiftFeat: 3D Geometry-Aware Local Feature Matching	Yepeng Liu et.al.	2505.03422	link
2025-05-06	Seeing the Abstract: Translating the Abstract Language for Vision Language Models	Davide Talon et.al.	2505.03242	link
2025-05-13	SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment	Ganesh Sapkota et.al.	2505.01956	null
2025-05-02	NeuroLoc: Encoding Navigation Cells for 6-DOF Camera Localization	Xun Li et.al.	2505.01113	null
2025-05-01	GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting	Jongwon Lee et.al.	2504.20379	null
2025-04-25	From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval	Yabing Wang et.al.	2504.17990	null
2025-04-24	A Guide to Structureless Visual Localization	Vojtech Panek et.al.	2504.17636	null
2025-04-23	Rethinking Vision Transformer for Large-Scale Fine-Grained Image Retrieval	Xin Jiang et.al.	2504.16691	null
2025-04-22	Media Content Atlas: A Pipeline to Explore and Investigate Multidimensional Media Space using Multimodal LLMs	Merve Cerit et.al.	2504.16323	link
2025-04-19	A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling	Kyle Buettner et.al.	2504.14359	null
2025-04-17	SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs	Haoxuan Li et.al.	2504.13172	null
2025-04-16	Generalized Visual Relation Detection with Diffusion Models	Kaifeng Gao et.al.	2504.12100	null
2025-04-15	Visual Re-Ranking with Non-Visual Side Information	Gustav Hanning et.al.	2504.11134	link
2025-04-15	TMCIR: Token Merge Benefits Composed Image Retrieval	Chaoyang Wang et.al.	2504.10995	null
2025-04-14	Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition	Changwei Wang et.al.	2504.09881	link
2025-04-12	Evolved Hierarchical Masking for Self-Supervised Learning	Zhanzhou Feng et.al.	2504.09155	null
2025-04-11	HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields	Asterios Reppas et.al.	2504.08901	null
2025-04-11	Hypergraph Vision Transformers: Images are More than Nodes, More than Edges	Joshua Fixelle et.al.	2504.08710	null
2025-04-11	FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations	Cheng-Yu Hsieh et.al.	2504.08368	null
2025-04-11	PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection	Xiong Li et.al.	2504.08280	null
2025-04-10	Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval	Zehong Ma et.al.	2504.07718	null
2025-04-09	A Pointcloud Registration Framework for Relocalization in Subterranean Environments	David Akhihiero et.al.	2504.07231	null
2025-04-09	Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception	Ruotian Peng et.al.	2504.06666	null
2025-04-08	To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition	Davide Sferrazza et.al.	2504.06116	link
2025-04-06	NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval	Peng Gao et.al.	2504.04339	null
2025-04-04	REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval	Shabnam Choudhury et.al.	2504.03169	null
2025-04-06	Re-thinking Temporal Search for Long-Form Video Understanding	Jinhui Ye et.al.	2504.02259	link
2025-04-02	A Chefs KISS – Utilizing semantic information in both ICP and SLAM framework	Sven Ochs et.al.	2504.02086	null
2025-04-02	Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval	Yuji Nozawa et.al.	2504.01348	null
2025-04-01	IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval	Bangwei Liu et.al.	2504.00954	null
2025-04-01	Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data	Yiqun Duan et.al.	2504.00812	null
2025-03-31	CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization	Yingrui Ji et.al.	2503.24182	null
2025-03-31	LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds	Masahiko Tsuji et.al.	2503.23664	null
2025-03-30	Multiview Image-Based Localization	Cameron Fiore et.al.	2503.23577	null
2025-03-27	LOCORE: Image Re-ranking with Long-Context Sequence Modeling	Zilin Xiao et.al.	2503.21772	link
2025-03-27	Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck	Adrian Bulat et.al.	2503.21757	null
2025-03-27	UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation	Yehui Shen et.al.	2503.21338	link
2025-03-27	FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval	Zixu Li et.al.	2503.21309	link
2025-03-27	Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing	Shuai Li et.al.	2503.21236	null
2025-03-25	CoLLM: A Large Language Model for Composed Image Retrieval	Chuong Huynh et.al.	2503.19910	link
2025-03-25	Scene-agnostic Pose Regression for Visual Localization	Junwei Zheng et.al.	2503.19543	null
2025-03-25	From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting	Zhiwei Huang et.al.	2503.19358	null
2025-03-25	Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval	Haoqiang Lin et.al.	2503.19296	link
2025-03-23	LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space	Zhangyu Wang et.al.	2503.18142	null
2025-03-23	Selecting and Pruning: A Differentiable Causal Sequentialized State-Space Model for Two-View Correspondence Learning	Xiang Fang et.al.	2503.17938	null
2025-03-23	What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images	Dongheng Lin et.al.	2503.17899	null
2025-03-22	good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval	Pranavi Kolouju et.al.	2503.17871	null
2025-03-21	Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval	Yuanmin Tang et.al.	2503.17109	link
2025-03-21	Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions	Muhua Zhang et.al.	2503.17005	null
2025-03-20	PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval	Qiang Zou et.al.	2503.16064	link
2025-03-20	Automating 3D Dataset Generation with Neural Radiance Fields	P. Schulz et.al.	2503.15997	link
2025-03-18	3D Densification for Multi-Map Monocular VSLAM in Endoscopy	X. Anadón et.al.	2503.14346	null
2025-03-18	A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios	Huy-Hoang Bui et.al.	2503.13982	link
2025-03-17	Scale Efficient Training for Large Datasets	Qing Zhou et.al.	2503.13385	link
2025-03-17	Multi-Platform Teach-and-Repeat Navigation by Visual Place Recognition Based on Deep-Learned Local Features	Václav Truhlařík et.al.	2503.13090	null
2025-03-17	All You Need to Know About Training Image Retrieval Models	Gabriele Berton et.al.	2503.13045	link
2025-03-12	Exploring the best way for UAV visual localization under Low-altitude Multi-view Observation Condition: a Benchmark	Yibin Ye et.al.	2503.10692	link
2025-03-13	ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning	Pengfei Luo et.al.	2503.10166	link
2025-03-12	Revisiting Medical Image Retrieval via Knowledge Consolidation	Yang Nan et.al.	2503.09370	null
2025-03-11	CQVPR: Landmark-aware Contextual Queries for Visual Place Recognition	Dongyue Li et.al.	2503.08170	null
2025-03-10	Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization	Michael Green et.al.	2503.07038	null
2025-03-10	Zero-Shot Hashing Based on Reconstruction With Part Alignment	Yan Jiang et.al.	2503.07037	null
2025-03-10	Improving Visual Place Recognition with Sequence-Matching Receptiveness Prediction	Somayeh Hussaini et.al.	2503.06840	null
2025-03-09	RoboDesign1M: A Large-scale Dataset for Robot Design Understanding	Tri Le et.al.	2503.06796	null
2025-03-09	StructVPR++: Distill Structural and Semantic Knowledge with Weighting Samples for Visual Place Recognition	Yanqing Shen et.al.	2503.06601	link
2025-03-09	TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification	Huaqi Tao et.al.	2503.06501	link
2025-03-08	NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features	Hongjia Zhai et.al.	2503.06117	null
2025-03-07	Data-Efficient Generalization for Zero-shot Composed Image Retrieval	Zining Chen et.al.	2503.05204	null
2025-03-06	RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining	Tengfei Zhang et.al.	2503.04653	null
2025-03-06	ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images	Yanqing Shen et.al.	2503.04475	link
2025-03-06	Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes	Hui Zhang et.al.	2503.04235	null
2025-03-06	Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior	Haitao Wu et.al.	2503.04207	link
2025-03-06	Image-Based Relocalization and Alignment for Long-Term Monitoring of Dynamic Underwater Environments	Beverley Gorry et.al.	2503.04096	link
2025-03-04	TeTRA-VPR: A Ternary Transformer Approach for Compact Visual Place Recognition	Oliver Grainge et.al.	2503.02511	null
2025-03-04	Introspective Loop Closure for SLAM with 4D Imaging Radar	Maximilian Hilger et.al.	2503.02383	null
2025-03-04	Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models	Kenta Tsukahara et.al.	2503.02256	null
2025-03-03	Composed Multi-modal Retrieval: A Survey of Approaches and Applications	Kun Zhang et.al.	2503.01334	link
2025-03-03	AirRoom: Objects Matter in Room Reidentification	Runmao Yao et.al.	2503.01130	null
2025-03-02	Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching	Jinyu Miao et.al.	2503.00862	null
2025-03-01	Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning	Songlin Dong et.al.	2503.00515	null
2025-02-28	EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration	Kuangyi Chen et.al.	2503.00167	link
2025-02-28	CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval	Zelong Sun et.al.	2502.20826	null
2025-02-28	SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition	Shanshan Wan et.al.	2502.20676	null
2025-02-27	A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization	Yejun Zhang et.al.	2502.20036	link
2025-02-27	On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation	Ruben T. Lucassen et.al.	2502.19285	null
2025-02-26	BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure	Haoxin Cai et.al.	2502.19242	link
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932	null
2025-02-19	A Comprehensive Survey on Composed Image Retrieval	Xuemeng Song et.al.	2502.18495	link
2025-02-25	MegaLoc: One Retrieval to Place Them All	Gabriele Berton et.al.	2502.17237	link
2025-02-23	Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries	Yin Wu et.al.	2502.16636	link
2025-02-23	SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition	Feng Lu et.al.	2502.16601	link
2025-02-21	ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval	Guanqi Zhan et.al.	2502.15682	null
2025-02-20	Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition	Tianyi Shang et.al.	2502.14195	link
2025-02-19	3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments	Vincent Ress et.al.	2502.13803	null
2025-02-18	Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization	Shuo Xing et.al.	2502.13146	link
2025-02-19	IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360 $^\circ$ Cameras	Dongki Jung et.al.	2502.12545	null
2025-02-17	From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations	Matteo Scucchia et.al.	2502.12303	null
2025-02-17	Descriminative-Generative Custom Tokens for Vision-Language Models	Pramuditha Perera et.al.	2502.12095	null
2025-02-17	ILIAS: Instance-Level Image retrieval At Scale	Giorgos Kordopatis-Zilos et.al.	2502.11748	null
2025-02-17	Range and Bird’s Eye View Fused Cross-Modal Visual Place Recognition	Jianyi Peng et.al.	2502.11742	link
2025-02-17	Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics	Francesco Croce et.al.	2502.11725	link
2025-02-17	Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization	Yuanze Xu et.al.	2502.11408	null
2025-02-12	E2LVLM:Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection	Junjie Wu et.al.	2502.10455	null
2025-02-11	Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning	Yuhang Dong et.al.	2502.09649	null
2025-02-13	ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation	Rotem Shalev-Arkushin et.al.	2502.09411	null
2025-02-12	SpeechCompass: Enhancing Mobile Captioning with Diarization and Directional Guidance via Multi-Microphone Localization	Artem Dementyev et.al.	2502.08848	null
2025-02-12	Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions	Prajwal Gatti et.al.	2502.08438	null
2025-02-11	Captured by Captions: On Memorization and its Mitigation in CLIP Models	Wenhao Wang et.al.	2502.07830	null
2025-02-11	Ultrafast 4D scanning transmission electron microscopy for imaging of localized optical fields	Petr Koutenský et.al.	2502.07338	null
2025-02-11	Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos	Haowen Gao et.al.	2502.07327	null
2025-02-11	PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval	Osman Tursun et.al.	2502.07215	null
2025-02-10	AstroLoc: Robust Space to Ground Image Localizer	Gabriele Berton et.al.	2502.07003	null
2025-02-09	Uni-Retrieval: A Multi-Style Retrieval Framework for STEM’s Education	Yanhao Jia et.al.	2502.05863	null
2025-02-07	Learning Street View Representations with Spatiotemporal Contrast	Yong Li et.al.	2502.04638	null
2025-02-06	Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion	Marco Mistretta et.al.	2502.04263	link
2025-02-05	Human-Aligned Image Models Improve Visual Decoding from the Brain	Nona Rajabi et.al.	2502.03081	null
2025-02-03	ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies	Costin F. Ciusdel et.al.	2502.01335	null
2025-01-31	LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks	Liudi Yang et.al.	2501.19382	link
2025-01-27	Freestyle Sketch-in-the-Loop Image Segmentation	Subhadeep Koley et.al.	2501.16022	null
2025-01-26	Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations	Zijun Long et.al.	2501.15379	null
2025-01-24	Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection	Viktor Kozák et.al.	2501.14587	null
2025-01-23	Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models	Jakob Krogh Petersen et.al.	2501.14051	link
2025-01-22	Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation	Kenta Uesugi et.al.	2501.13968	null
2025-01-19	Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection	Zhipeng Yu et.al.	2501.11063	link
2025-01-18	A Resource-Efficient Training Framework for Remote Sensing Text–Image Retrieval	Weihang Zhang et.al.	2501.10638	null
2025-01-17	FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis	Zhe Chen et.al.	2501.09887	null
2025-01-15	Vision Foundation Models for Computed Tomography	Suraj Pai et.al.	2501.09001	link
2025-01-12	SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval	Bhavin Jawade et.al.	2501.08347	null
2025-01-14	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286	null
2025-01-13	Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps	Saurabh Gupta et.al.	2501.07399	null
2025-01-12	Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation	Zhenyang Feng et.al.	2501.06749	null
2025-01-06	Integrating Language-Image Prior into EEG Decoding for Cross-Task Zero-Calibration RSVP-BCI	Xujin Li et.al.	2501.02841	null
2025-01-03	A Minimal Subset Approach for Efficient and Scalable Loop Closure	Nikolaos Stathoulopoulos et.al.	2501.01791	link
2025-01-03	iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings	Shuhei Tomoshige et.al.	2501.01642	null
2025-01-02	R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization	Xudong Jiang et.al.	2501.01421	link
2025-01-02	Training Medical Large Vision-Language Models with Abnormal-Aware Feedback	Yucheng Zhou et.al.	2501.01377	null
2025-01-02	Domain-invariant feature learning in brain MR imaging for content-based image retrieval	Shuya Tobari et.al.	2501.01326	null
2024-12-28	GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting	Atticus J. Zeller et.al.	2412.20056	link
2024-12-25	FOR: Finetuning for Object Level Open Vocabulary Image Retrieval	Hila Levi et.al.	2412.18806	null
2024-12-24	ERVD: An Efficient and Robust ViT-Based Distillation Framework for Remote Sensing Image Retrieval	Le Dong et.al.	2412.18136	link
2024-12-22	Where am I? Cross-View Geo-localization with Natural Language Descriptions	Junyan Ye et.al.	2412.17007	null
2024-12-22	Large-Scale UWB Anchor Calibration and One-Shot Localization Using Gaussian Process	Shenghai Yuan et.al.	2412.16880	null
2024-12-24	Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling	Daichi Yashima et.al.	2412.16576	link
2024-12-20	A New Method to Capturing Compositional Knowledge in Linguistic Space	Jiahe Wan et.al.	2412.15632	null
2024-12-20	Stabilizing Laplacian Inversion in Fokker-Planck Image Retrieval using the Transport-of-Intensity Equation	Samantha J Alloo et.al.	2412.15513	null
2024-12-19	Learning Visual Composition through Improved Semantic Guidance	Austin Stone et.al.	2412.15396	null
2024-12-19	MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval	Junjie Zhou et.al.	2412.14475	null
2024-12-18	Adversarial Hubness in Multi-Modal Retrieval	Tingwei Zhang et.al.	2412.14113	link
2024-12-18	Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval	Giacomo Pacini et.al.	2412.13834	null
2024-12-18	ConDo: Continual Domain Expansion for Absolute Pose Regression	Zijun Li et.al.	2412.13452	link
2024-12-17	Three Things to Know about Deep Metric Learning	Yash Patel et.al.	2412.12432	null
2024-12-15	Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval	Zelong Sun et.al.	2412.11087	null
2024-12-20	Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval	Yuanmin Tang et.al.	2412.11077	link
2024-12-13	MVC-VPR: Mutual Learning of Viewpoint Classification and Visual Place Recognition	Qiwen Gu et.al.	2412.09199	null
2024-12-12	A Flexible Plug-and-Play Module for Generating Variable-Length	Liyang He et.al.	2412.08922	link
2024-12-11	Image Retrieval Methods in the Dissimilarity Space	Madhu Kiran et.al.	2412.08618	null
2024-12-11	Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization	Siyan Dong et.al.	2412.08376	link
2024-12-11	Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin	Benjamin D. Killeen et.al.	2412.08020	null
2024-12-10	On Motion Blur and Deblurring in Visual Place Recognition	Timur Ismagilov et.al.	2412.07751	null
2024-12-10	Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance	Wanwen Chen et.al.	2412.07741	null
2024-12-09	An Efficient Scene Coordinate Encoding and Relocalization Method	Kuan Xu et.al.	2412.06488	link
2024-12-09	A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition	Connor Malone et.al.	2412.06153	null
2024-12-07	Compositional Image Retrieval via Instruction-Aware Contrastive Learning	Wenliang Zhong et.al.	2412.05756	link
2024-12-06	DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification	Ying Jin et.al.	2412.04828	null
2024-12-04	Distillation of Diffusion Features for Semantic Correspondence	Frank Fundel et.al.	2412.03512	null
2024-12-04	Composed Image Retrieval for Training-Free Domain Conversion	Nikos Efthymiadis et.al.	2412.03297	link
2024-12-03	A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration	Thulio Amorim et.al.	2412.02881	null
2024-12-03	Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval	Leah Bar et.al.	2412.02310	link
2024-12-02	Mutli-View 3D Reconstruction using Knowledge Distillation	Aditya Dutt et.al.	2412.02039	link
2024-12-02	Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features	MD Shaikh Rahman et.al.	2412.01555	null
2024-12-02	Neuron Abandoning Attention Flow: Visual Explanation of Dynamics inside CNN Models	Yi Liao et.al.	2412.01202	null
2024-12-01	EDTformer: An Efficient Decoder Transformer for Visual Place Recognition	Tong Jin et.al.	2412.00784	link
2024-11-28	EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval	Muhammad Huzaifa et.al.	2412.00139	null
2024-11-28	Unleashing the Power of Data Synthesis in Visual Localization	Sihang Li et.al.	2412.00138	null
2024-11-28	Relation-Aware Meta-Learning for Zero-shot Sketch-Based Image Retrieval	Yang Liu et.al.	2412.00120	null
2024-11-29	A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications	Liqiang Zhang Ye Tian Dongyan Wei et.al.	2411.19845	null
2024-11-27	Optimizing Image Retrieval with an Extended b-Metric Space	Abdelkader Belhenniche et.al.	2411.18800	null
2024-11-26	Learning Visual Hierarchies with Hyperbolic Embeddings	Ziwei Wang et.al.	2411.17490	null
2024-12-02	Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy	You Li et.al.	2411.16752	null
2024-12-02	AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks	You Li et.al.	2411.16749	null
2024-11-25	Image Generation Diversity Issues and How to Tame Them	Mischa Dombrowski et.al.	2411.16171	link
2024-11-24	PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments	Haoang Li et.al.	2411.15800	null
2024-11-22	Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval	Zengbao Sun et.al.	2411.14704	null
2024-11-20	Globally Correlation-Aware Hard Negative Generation	Wenjie Peng et.al.	2411.13145	link
2024-11-18	Exploring Emerging Trends and Research Opportunities in Visual Place Recognition	Antonios Gasteratos et.al.	2411.11481	null
2024-11-13	OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances	Youqi Liao et.al.	2411.08665	link
2024-11-13	Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval	Saul Santos et.al.	2411.08590	link
2024-11-22	Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments	Ashkan Nejad et.al.	2411.08567	link
2024-11-13	MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation	Peng Wang et.al.	2411.08279	link
2024-11-05	From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing	Xintian Sun et.al.	2411.05826	null
2024-11-04	TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives	Maitreya Patel et.al.	2411.02545	null
2024-11-11	INQUIRE: A Natural World Text-to-Image Retrieval Benchmark	Edward Vendrow et.al.	2411.02537	link
2024-11-20	Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models	Sharat Agarwal et.al.	2411.01925	null
2024-11-04	Semantic Masking and Visual Feature Matching for Robust Localization	Luisa Mao et.al.	2411.01804	null
2024-11-03	Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification	MD Shaikh Rahman et.al.	2411.01473	null
2024-11-01	Identifying Implicit Social Biases in Vision-Language Models	Kimia Hamidieh et.al.	2411.00997	null
2024-10-31	Nearest Neighbor Normalization Improves Multimodal Retrieval	Neil Chowdhury et.al.	2410.24114	link
2024-10-31	MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval	Haiwen Li et.al.	2410.23736	null
2024-10-30	Decoupling Semantic Similarity from Spatial Alignment for Neural Networks	Tassilo Wald et.al.	2410.23107	link
2024-10-29	Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications	Monica Riedler et.al.	2410.21943	link
2024-10-28	NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments	Taiyi Pan et.al.	2410.21615	link
2024-10-25	Context-Based Visual-Language Place Recognition	Soojin Woo et.al.	2410.19341	link
2024-10-24	ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval	Zijia Zhao et.al.	2410.18715	link
2024-10-25	On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features	Tomáš Pivoňka et.al.	2410.18573	null
2024-10-22	Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval	Yuanmin Tang et.al.	2410.17393	null
2024-10-20	GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning	Haiwen Diao et.al.	2410.15266	link
2024-10-19	Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway’s Digitised Book Collection	Marie Roald et.al.	2410.14969	link
2024-10-16	Development of Image Collection Method Using YOLO and Siamese Network	Chan Young Shin et.al.	2410.12561	null
2024-10-16	LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment	Juelin Zhu et.al.	2410.12269	link
2024-10-16	Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization	Nanda Febri Istighfarin et.al.	2410.12240	null
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505	null
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187	link
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-10-11	Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System	Zheng Liu et.al.	2410.08935	link
2024-10-16	Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP	Eunji Kim et.al.	2410.08469	null
2024-10-11	A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification	Eugene P. W. Ang et.al.	2410.08456	null
2024-10-10	A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks	Hoin Jung et.al.	2410.07593	link
2024-10-09	Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval	Mohammad Omama et.al.	2410.07022	null
2024-10-09	Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers	Stephen Hausler et.al.	2410.06614	link
2024-10-09	MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging	Noel C. F. Codella et.al.	2410.06542	null
2024-10-08	Temporal Image Caption Retrieval Competition – Description and Results	Jakub Pokrywka et.al.	2410.06314	null
2024-10-08	Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching	Gongxin Yao et.al.	2410.06285	null
2024-10-08	GSLoc: Visual Localization with 3D Gaussian Splatting	Kazii Botashev et.al.	2410.06165	null
2024-10-08	Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning	Ayush Singh et.al.	2410.05928	null
2024-10-08	RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps	Minsoo Kim et.al.	2410.05621	null
2024-10-09	LoTLIP: Improving Language-Image Pre-training for Long Text Understanding	Wei Wu et.al.	2410.05249	null
2024-10-06	LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation	Jianhao Jiao et.al.	2410.04419	null
2024-10-02	Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension	Zaiquan Yang et.al.	2410.01544	null
2024-10-03	EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections	Francesc Net et.al.	2410.01536	link
2024-10-04	CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment	Safouane El Ghazouali et.al.	2410.01411	link
2024-09-30	Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation	Aleyna Kütük et.al.	2410.00266	null
2024-09-29	CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation	Yifan Duan et.al.	2409.19597	null
2024-09-28	VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition	Ahmad Khaliq et.al.	2409.19293	link
2024-09-27	MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion	Bardienus Duisterhof et.al.	2409.19152	null
2024-09-26	Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval	Mankeerat Sidhu et.al.	2409.18733	null
2024-09-26	Revisit Anything: Visual Place Recognition via Image Segment Retrieval	Kartik Garg et.al.	2409.18049	link
2024-09-24	GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization	Gennady Sidorov et.al.	2409.16502	link
2024-09-23	CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis	Xiang Zhang et.al.	2409.15169	null
2024-09-21	Combining Absolute and Semi-Generalized Relative Poses for Visual Localization	Vojtech Panek et.al.	2409.14269	null
2024-09-21	SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality	Hongjia Zhai et.al.	2409.14067	null
2024-09-20	Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval	Morris Florek et.al.	2409.13513	link
2024-09-18	Towards Global Localization using Multi-Modal Object-Instance Re-Identification	Aneesh Chavan et.al.	2409.12002	link
2024-09-17	Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching	Kurran Singh et.al.	2409.11555	null
2024-09-17	Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information	Kunal Chelani et.al.	2409.11536	null
2024-09-17	Improving the Efficiency of Visually Augmented Language Models	Paula Ontalvilla et.al.	2409.11148	link
2024-09-21	HGSLoc: 3DGS-based Heuristic Camera Pose Refinement	Zhongyan Niu et.al.	2409.10925	null
2024-09-16	SOLVR: Submap Oriented LiDAR-Visual Re-Localisation	Joshua Knights et.al.	2409.10247	null
2024-09-16	Garment Attribute Manipulation with Multi-level Attention	Vittorio Casula et.al.	2409.10206	null
2024-09-14	Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval	Amirreza Mahbod et.al.	2409.09430	link
2024-09-12	Structured Pruning for Efficient Visual Place Recognition	Oliver Grainge et.al.	2409.07834	null
2024-09-10	GeoCalib: Learning Single-image Calibration with Geometric Optimization	Alexander Veicht et.al.	2409.06704	link
2024-09-10	Weakly-supervised Camera Localization by Ground-to-satellite Image Registration	Yujiao Shi et.al.	2409.06471	link
2024-09-10	A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions	Zhicong Wu et.al.	2409.06381	null
2024-09-09	Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding	Bram Willemsen et.al.	2409.05721	link
2024-09-09	Open-World Dynamic Prompt and Continual Visual Representation Learning	Youngeun Kim et.al.	2409.05312	null
2024-09-12	Training-free ZS-CIR via Weighted Modality Fusion and Similarity	Ren-Di Wu et.al.	2409.04918	link
2024-09-12	Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models	Saghir Alfasly et.al.	2409.04631	null
2024-09-06	Reprojection Errors as Prompts for Efficient Scene Coordinate Regression	Ting-Ru Liu et.al.	2409.04178	null
2024-09-06	Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments	Therese Joseph et.al.	2409.03998	null
2024-09-04	Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications	Abby Stylianou et.al.	2409.03012	null
2024-09-04	NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval	Sepanta Zeighami et.al.	2409.02343	link
2024-09-03	Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment	Konstantin Schall et.al.	2409.01936	link
2024-09-02	A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches	Kim Jinwoo et.al.	2409.01219	null
2024-09-02	Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection	Manon Kok et.al.	2409.01091	null
2024-09-02	Evidential Transformers for Improved Image Retrieval	Danilo Dordevic et.al.	2409.01082	null
2024-09-05	EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System	Bonan Liu et.al.	2409.00343	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373	null
2024-09-02	RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance	Avideep Mukherjee et.al.	2408.17095	null
2024-08-29	A compact neuromorphic system for ultra energy-efficient, on-device robot localization	Adam D. Hines et.al.	2408.16754	link
2024-08-29	Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models	Kengo Nakata et.al.	2408.16296	null
2024-08-28	Temporal Attention for Cross-View Sequential Image Localization	Dong Yuan et.al.	2408.15569	link
2024-08-27	Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild	Tianqi Wei et.al.	2408.14723	null
2024-08-25	LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task	Ali Asgarov et.al.	2408.13909	link
2024-08-15	Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval	Lifeng Zhou et.al.	2408.13705	null
2024-08-15	Coarse-to-fine Alignment Makes Better Speech-image Retrieval	Lifeng Zhou et.al.	2408.13119	null
2024-08-21	FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization	Son Tung Nguyen et.al.	2408.12037	link
2024-08-21	Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations	Lintong Zhang et.al.	2408.11966	null
2024-08-21	UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation	Xiangyu Zhao et.al.	2408.11305	link
2024-08-20	GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting	Changkun Liu et.al.	2408.11085	link
2024-08-19	BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval	Zhenyu Lu et.al.	2408.10383	null
2024-08-23	Fashion Image-to-Image Translation for Complementary Item Retrieval	Matteo Attimonelli et.al.	2408.09847	link
2024-08-20	MambaLoc: Efficient Camera Localisation via State Space Model	Jialu Wang et.al.	2408.09680	null
2024-08-15	DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions	Ryosuke Korekata et.al.	2408.07910	null
2024-08-13	A Miniature Vision-Based Localization System for Indoor Blimps	Shicong Ma et.al.	2408.06648	null
2024-08-10	Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network	Junyan Ye et.al.	2408.05475	link
2024-08-09	Spherical World-Locking for Audio-Visual Localization in Egocentric Videos	Heeseung Yun et.al.	2408.05364	null
2024-08-06	AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval	Pavel Suma et.al.	2408.03282	link
2024-08-05	CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration	Gongxin Yao et.al.	2408.02394	null
2024-08-09	BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles	Lun Luo et.al.	2408.01841	link
2024-08-02	On Validation of Search & Retrieval of Tissue Images in Digital Pathology	H. R. Tizhoosh et.al.	2408.01570	null
2024-07-31	VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning	Yuhang Ming et.al.	2407.21416	null
2024-07-31	SuperVINS: A visual-inertial SLAM framework integrated deep learning features	Hongkun Luo et.al.	2407.21348	link
2024-07-30	Re-localization acceleration with Medoid Silhouette Clustering	Hongyi Zhang et.al.	2407.20749	null
2024-07-29	A flexible framework for accurate LiDAR odometry, map manipulation, and localization	José Luis Blanco-Claraco et.al.	2407.20465	link
2024-07-26	From 2D to 3D: AISG-SLA Visual Localization Challenge	Jialin Gao et.al.	2407.18590	null
2024-07-24	Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation	Yongqi Li et.al.	2407.17274	null
2024-07-24	Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments	Wei Gao et.al.	2407.17078	null
2024-07-24	Pose Estimation from Camera Images for Underwater Inspection	Luyuan Peng et.al.	2407.16961	null
2024-07-22	Memory Management for Real-Time Appearance-Based Loop Closure Detection	Mathieu Labbé et.al.	2407.15890	null
2024-07-22	RADA: Robust and Accurate Feature Learning with Domain Adaptation	Jingtai He et.al.	2407.15791	null
2024-07-22	Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM	Mathieu Labbe et.al.	2407.15305	null
2024-07-22	Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation	Mathieu Labbé et.al.	2407.15304	null
2024-07-19	Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization	Yuehua Ding et.al.	2407.14643	null
2024-07-18	Visual Haystacks: Answering Harder Questions About Sets of Images	Tsung-Han Wu et.al.	2407.13766	link
2024-07-17	Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM	Markus Weißflog et.al.	2407.12408	null
2024-07-17	GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection	Jingwen Yu et.al.	2407.11736	link
2024-07-16	EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis	Ruijie Yang et.al.	2407.11401	null
2024-07-15	No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations	Walter Simoncini et.al.	2407.10964	link
2024-07-15	DINO Pre-training for Vision-based End-to-end Autonomous Driving	Shubham Juneja et.al.	2407.10803	null
2024-07-15	Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval	Youngsun Lim et.al.	2407.10683	null
2024-07-15	An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots	J. J. Cabrera et.al.	2407.10596	link
2024-07-15	An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments	J. J. Cabrera et.al.	2407.10536	null
2024-07-12	Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval	Vaibhav Balloli et.al.	2407.08908	link
2024-07-11	Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates	Owen Claxton et.al.	2407.08162	link
2024-07-12	Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal	Xinyu Zhu et.al.	2407.08153	link
2024-07-11	SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM	Neng Wang et.al.	2407.08106	link
2024-07-09	LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition	Teng Wang et.al.	2407.06730	null
2024-07-09	CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding	Wenhao Xu et.al.	2407.06611	null
2024-07-08	Pseudo-triplet Guided Few-shot Composed Image Retrieval	Bohan Hou et.al.	2407.06001	null
2024-07-09	HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels	Yingying Jiang et.al.	2407.05795	null
2024-07-05	Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning	Mainak Singha et.al.	2407.04207	link
2024-07-04	Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models	Chang-Sheng Kao et.al.	2407.03615	link
2024-07-03	Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach	Pronay Debnath et.al.	2407.03486	null
2024-07-02	Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition	Sergio Izquierdo et.al.	2407.02422	link
2024-07-01	Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval	Aneeshan Sain et.al.	2407.01810	null
2024-07-01	Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval	Hanwen Su et.al.	2407.00979	null
2024-07-01	Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios	Connor Malone et.al.	2407.00863	null
2024-06-27	PathAlign: A vision-language model for whole slide images in histopathology	Faruk Ahmed et.al.	2406.19578	null
2024-07-05	360 in the Wild: Dataset for Depth Prediction and View Synthesis	Kibaek Park et.al.	2406.18898	null
2024-06-27	Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs	Huaying Zhang et.al.	2406.18836	null
2024-06-26	WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images	Yannik Glaser et.al.	2406.18765	link
2024-06-26	View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis	Subin Varghese et.al.	2406.18012	null
2024-06-25	Tell Me Where You Are: Multimodal LLMs Meet Place Recognition	Zonglin Lyu et.al.	2406.17520	null
2024-06-25	SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation	Xu Liu et.al.	2406.17249	link
2024-06-23	Breaking the Frame: Image Retrieval by Visual Overlap Prediction	Tong Wei et.al.	2406.16204	link
2024-06-19	Towards a multimodal framework for remote sensing image change retrieval and captioning	Roger Ferrod et.al.	2406.13424	link
2024-06-19	CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval	Christian Lülf et.al.	2406.13322	link
2024-06-17	Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization	Huaiji Zhou et.al.	2406.11766	null
2024-06-22	Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment	Jianan Jiang et.al.	2406.11551	link
2024-06-17	They’re All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias	Salma Abdel Magid et.al.	2406.11331	null
2024-06-17	Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion	Guoyuan An et.al.	2406.11242	null
2024-06-14	Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval	Genc Hoxha et.al.	2406.10107	null
2024-06-14	BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval	Imanol Miranda et.al.	2406.09952	link
2024-06-13	Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases	Meng Wang et.al.	2406.09317	link
2024-06-13	Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval	Jaeseok Byun et.al.	2406.09188	null
2024-06-13	DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification	Zhengrui Xu et.al.	2406.08773	link
2024-06-12	Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement	Maxime Pietrantoni et.al.	2406.08463	null
2024-06-12	ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery	Kam Woh Ng et.al.	2406.08457	link
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502	link
2024-06-11	Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Shuvendu Roy et.al.	2406.07450	link
2024-06-11	Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval	Adrià Molina et.al.	2406.07315	null
2024-06-10	Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation	Shenghao Li et.al.	2406.06374	link
2024-06-09	Unified Text-to-Image Generation and Retrieval	Leigang Qu et.al.	2406.05814	null
2024-06-07	The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better	Scott Geng et.al.	2406.05184	link
2024-06-07	PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction	Eduard Poesina et.al.	2406.04746	link
2024-06-06	GLACE: Global Local Accelerated Coordinate Encoding	Fangjinhua Wang et.al.	2406.04340	link
2024-06-06	Monocular Localization with Semantics Map for Autonomous Vehicles	Jixiang Wan et.al.	2406.03835	null
2024-06-05	Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach	Saehyung Lee et.al.	2406.03411	link
2024-06-04	MeshVPR: Citywide Visual Place Recognition Using 3D Meshes	Gabriele Berton et.al.	2406.02776	null
2024-06-04	Can CLIP help CLIP in learning 3D?	Cristian Sbrolli et.al.	2406.02202	null
2024-06-03	Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP	Sriram Balasubramanian et.al.	2406.01583	link
2024-06-03	Scale-Free Image Keypoints Using Differentiable Persistent Homology	Giovanni Barbarani et.al.	2406.01315	link
2024-06-02	Visual place recognition for aerial imagery: A survey	Ivan Moskalenko et.al.	2406.00885	link
2024-06-01	NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization	Wugang Meng et.al.	2406.00312	null
2024-05-31	DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models	Linli Yao et.al.	2405.20985	link
2024-05-29	Multi-Modal Generative Embedding Model	Feipeng Ma et.al.	2405.19333	null
2024-05-29	ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions	Honglin Lin et.al.	2405.19226	null
2024-05-30	CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval	Xintong Jiang et.al.	2405.19149	link
2024-05-29	SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation	Zhenbei Wu et.al.	2405.18801	null
2024-05-29	Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs	Jialiang Xu et.al.	2405.18740	link
2024-05-28	EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition	Issar Tzachor et.al.	2405.18065	null
2024-05-28	AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval	Sihe Zhang et.al.	2405.17718	null
2024-05-26	MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups	Yusen Xie et.al.	2405.16599	null
2024-05-29	Composed Image Retrieval for Remote Sensing	Bill Psomas et.al.	2405.15587	link
2024-05-24	Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval	Yiming Wu et.al.	2405.15451	null
2024-05-20	UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization	Wenjia Xu et.al.	2405.11936	link
2024-05-19	Register assisted aggregation for Visual Place Recognition	Xuan Yu et.al.	2405.11526	null
2024-05-26	CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion	Gang Wang et.al.	2405.10793	null
2024-05-16	FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models	Adrian Bulat et.al.	2405.10286	null
2024-05-15	Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study	Farnaz Khun Jush et.al.	2405.09334	null
2024-05-14	BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment	Lihong Jin et.al.	2405.09001	null
2024-05-14	TP3M: Transformer-based Pseudo 3D Image Matching with Reference	Liming Han et.al.	2405.08434	null
2024-05-13	OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition	Qiuchi Xiang et.al.	2405.07966	link
2024-05-14	HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval	Chao He et.al.	2405.07524	link
2024-05-13	JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation	Xubo Luo et.al.	2405.07429	link
2024-05-12	BoQ: A Place is Worth a Bag of Learnable Queries	Amar Ali-bey et.al.	2405.07364	link
2024-05-07	Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction	Nematollah Saeidi et.al.	2405.04211	null
2024-05-06	A New Robust Partial $p$ -Wasserstein-Based Metric for Comparing Distributions	Sharath Raghvendra et.al.	2405.03664	null
2024-05-06	Knowledge-aware Text-Image Retrieval for Remote Sensing Images	Li Mi et.al.	2405.03373	null
2024-05-06	Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval	Jiacheng Cheng et.al.	2405.03190	null
2024-05-05	iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval	Lorenzo Agnolucci et.al.	2405.02951	link
2024-05-01	Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval	Young Kyun Jang et.al.	2405.00571	null
2024-04-30	Large Language Model Informed Patent Image Retrieval	Hao-Cheng Lo et.al.	2404.19360	null
2024-04-30	XFeat: Accelerated Features for Lightweight Image Matching	Guilherme Potje et.al.	2404.19174	null
2024-04-29	Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models	Hongyi Zhu et.al.	2404.18746	null
2024-04-29	Dual-Modal Prompting for Sketch-Based Image Retrieval	Liying Gao et.al.	2404.18695	null
2024-05-01	Semantic Line Combination Detector	Jinwon Ko et.al.	2404.18399	link
2024-04-26	Learning text-to-video retrieval from image captioning	Lucas Ventura et.al.	2404.17498	null
2024-04-25	CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching	Samia Shafique et.al.	2404.16972	link
2024-04-29	Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval	Ryoya Nara et.al.	2404.16398	null
2024-04-24	Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval	Haokun Wen et.al.	2404.15875	link
2024-04-24	DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines	Xin Jiang et.al.	2404.15771	null
2024-04-23	Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval	Young Kyun Jang et.al.	2404.15516	null
2024-04-22	EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models	Mathias Thorsager et.al.	2404.14236	null
2024-04-22	Hierarchical localization with panoramic views and triplet loss functions	Marcos Alfaro et.al.	2404.14117	link
2024-04-20	High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces	Baoru Huang et.al.	2404.13437	null
2024-04-20	Collaborative Visual Place Recognition through Federated Learning	Mattia Dutto et.al.	2404.13324	null
2024-04-18	SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints	Spencer Carmichael et.al.	2404.12339	null
2024-04-17	Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives	Zhangchi Feng et.al.	2404.11317	link
2024-04-17	Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing	Sanggeon Yun et.al.	2404.11025	null
2024-04-16	SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments	Niklas Gard et.al.	2404.10527	link
2024-04-20	CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning	Haojian Huang et.al.	2404.09640	link
2024-04-11	PRAM: Place Recognition Anywhere Model for Efficient Visual Localization	Fei Xue et.al.	2404.07785	null
2024-04-16	2DLIW-SLAM:2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure	Bin Zhang et.al.	2404.07644	link
2024-04-11	Semantically-correlated memories in a dense associative model	Thomas F Burns et.al.	2404.07123	link
2024-04-09	Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation	Luca Barsellotti et.al.	2404.06542	null
2024-04-09	Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping	Anas Gouda et.al.	2404.06277	link
2024-04-07	Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval	Jinpeng Wang et.al.	2404.04998	link
2024-04-06	Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning	Juncheng Yang et.al.	2404.04538	link
2024-04-05	Towards introspective loop closure in 4D radar SLAM	Maximilian Hilger et.al.	2404.03940	null
2024-04-02	TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation	Yehui Shen et.al.	2404.01587	link
2024-04-01	On Train-Test Class Overlap and Detection for Image Retrieval	Chull Hwan Song et.al.	2404.01524	link
2024-04-01	NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification	Juyeop Han et.al.	2404.01400	null
2024-03-31	On the Estimation of Image-matching Uncertainty in Visual Place Recognition	Mubariz Zaffar et.al.	2404.00546	null
2024-03-31	NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation	Diwei Sheng et.al.	2404.00504	null
2024-03-30	SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs	Yang Miao et.al.	2404.00469	null
2024-03-30	Do Vision-Language Models Understand Compound Nouns?	Sonal Kumar et.al.	2404.00419	link
2024-04-05	FairRAG: Fair Human Generation via Fair Retrieval Augmentation	Robik Shrestha et.al.	2403.19964	null
2024-03-28	JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition	Gabriele Berton et.al.	2403.19787	link
2024-03-28	MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions	Kai Zhang et.al.	2403.19651	link
2024-03-27	AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation	Changkun Liu et.al.	2403.18281	null
2024-03-26	Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge	Dongjin Kim et.al.	2403.17420	link
2024-03-25	Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras	Gokul B. Nair et.al.	2403.16425	link
2024-03-24	Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval	Yucheng Suo et.al.	2403.16005	link
2024-03-24	BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval	Yinda Chen et.al.	2403.15992	null
2024-03-22	Long-CLIP: Unlocking the Long-Text Capability of CLIP	Beichen Zhang et.al.	2403.15378	link
2024-03-22	A Multimodal Approach for Cross-Domain Image Retrieval	Lucas Iijima et.al.	2403.15152	null
2024-03-22	Piecewise-Linear Manifolds for Deep Metric Learning	Shubhang Bhatnagar et.al.	2403.14977	null
2024-03-21	Enhancing Historical Image Retrieval with Compositional Cues	Tingyu Lin et.al.	2403.14287	link
2024-03-20	Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval	Aymene Berriche et.al.	2403.13747	null
2024-03-20	Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval	Haoyu Liu et.al.	2403.13317	null
2024-03-19	Learning Neural Volumetric Pose Features for Camera Localization	Jingyu Lin et.al.	2403.12800	null
2024-03-19	Quantixar: High-performance Vector Data Management System	Gulshan Yadav et.al.	2403.12583	null
2024-03-17	3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization	Peng Jiang et.al.	2403.11367	null
2024-03-17	MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data	Paul S. Scotti et.al.	2403.11207	link
2024-03-16	Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval	Shunsuke Tsubaki et.al.	2403.10756	null
2024-03-16	Vector search with small radiuses	Gergely Szilvasy et.al.	2403.10746	null
2024-03-13	Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer	Kenta Tsukahara et.al.	2403.10552	null
2024-03-20	Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression	Huy-Hoang Bui et.al.	2403.10297	link
2024-03-15	Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline	Fangming Yuan et.al.	2403.10283	null
2024-03-14	The NeRFect Match: Exploring NeRF Features for Visual Localization	Qunjie Zhou et.al.	2403.09577	null
2024-03-14	VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition	Benjamin Ramtoula et.al.	2403.09025	null
2024-03-13	PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models	Siddharth Mishra-Sharma et.al.	2403.08851	link
2024-03-13	NeRF-Supervised Feature Point Detection and Description	Ali Youssef et.al.	2403.08156	link
2024-03-12	It’s All About Your Sketch: Democratising Sketch Control in Diffusion Models	Subhadeep Koley et.al.	2403.07234	link
2024-03-12	You’ll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval	Subhadeep Koley et.al.	2403.07222	null
2024-03-12	Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers	Subhadeep Koley et.al.	2403.07214	null
2024-03-11	How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?	Subhadeep Koley et.al.	2403.07203	null
2024-03-11	EarthLoc: Astronaut Photography Localization by Indexing Earth from Space	Gabriele Berton et.al.	2403.06758	link
2024-03-11	BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues	Fudong Ge et.al.	2403.06600	link
2024-03-11	Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology	Stefan Denner et.al.	2403.06567	link
2024-03-10	RTAB-Map as an Open-Source Lidar and Visual SLAM Library for Large-Scale and Long-Term Online Operation	Mathieu Labbé et.al.	2403.06341	null
2024-03-10	Texture image retrieval using a classification and contourlet-based features	Asal Rouhafzay et.al.	2403.06048	null
2024-03-11	LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map	Xinrui Wu et.al.	2403.05002	link
2024-03-11	Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed	Yifan Wang et.al.	2403.04765	null
2024-03-07	mmPlace: Robust Place Recognition with Intermediate Frequency Signal of Low-cost Single-chip Millimeter Wave Radar	Chengzhen Meng et.al.	2403.04703	null
2024-03-06	Self-supervised Photographic Image Layout Representation Learning	Zhaoran Zhao et.al.	2403.03740	link
2024-03-04	Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models	Benedikt Blumenstiel et.al.	2403.02059	link
2024-03-03	Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval	Yongchao Du et.al.	2403.01431	null
2024-03-01	Asymmetric Feature Fusion for Image Retrieval	Hui Wu et.al.	2403.00671	null
2024-03-01	Structure Similarity Preservation Learning for Asymmetric Image Retrieval	Hui Wu et.al.	2403.00648	link
2024-02-29	CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition	Feng Lu et.al.	2402.19231	link
2024-02-28	Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport	Bin Li et.al.	2402.18411	link
2024-02-28	Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning	Hanyao Wang et.al.	2402.18400	null
2024-02-28	Representing 3D sparse map points and lines for camera relocalization	Bach-Thuan Bui et.al.	2402.18011	link
2024-02-27	Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control	Thong Nguyen et.al.	2402.17535	link
2024-02-29	Active propulsion noise shaping for multi-rotor aircraft localization	Gabriele Serussi et.al.	2402.17289	link
2024-02-27	NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer	Bingxi Liu et.al.	2402.17159	link
2024-02-25	Deep Homography Estimation for Visual Place Recognition	Feng Lu et.al.	2402.16086	link
2024-02-25	VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Xudong Cai et.al.	2402.15961	link
2024-02-28	Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries	Zijun Long et.al.	2402.15276	null
2024-02-23	Fine-tuning CLIP Text Encoders with Two-step Paraphrasing	Hyunjae Kim et.al.	2402.15120	null
2024-02-22	Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition	Feng Lu et.al.	2402.14505	link
2024-02-16	Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition	Chenming Hu et.al.	2402.10476	null
2024-02-15	Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task	Mirko Nava et.al.	2402.09886	link
2024-02-14	Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency	Yannis Kalantidis et.al.	2402.09237	null
2024-02-13	Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast	Xiangming Gu et.al.	2402.08567	link
2024-02-13	Learning to Produce Semi-dense Correspondences for Visual Localization	Khang Truong Giang et.al.	2402.08359	link
2024-02-10	Semantic Object-level Modeling for Robust Visual Camera Relocalization	Yifan Zhu et.al.	2402.06951	null
2024-02-09	Large Language Models for Captioning and Retrieving Remote Sensing Images	João Daniel Silva et.al.	2402.06475	null
2024-02-09	PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes	Xinggang Hu et.al.	2402.06131	null
2024-02-21	MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction	Heng Zhou et.al.	2402.03762	null
2024-02-04	Region-Based Representations Revisited	Michal Shlapentokh-Rothman et.al.	2402.02352	link
2024-02-03	Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization	Bo Yang et.al.	2402.02141	link
2024-02-01	BrainSLAM: SLAM on Neural Population Activity Data	Kipp Freud et.al.	2402.00588	null
2024-02-01	Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering	Tianxiao Gao et.al.	2402.00330	link
2024-01-31	Improved Scene Landmark Detection for Camera Localization	Tien Do et.al.	2401.18083	link
2024-01-31	Local Feature Matching Using Deep Learning: A Survey	Shibiao Xu et.al.	2401.17592	link
2024-01-29	Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors	Shiyin Dong et.al.	2401.16459	null
2024-01-29	Cross-Modal Coordination Across a Diverse Set of Input Modalities	Jorge Sánchez et.al.	2401.16347	null
2024-01-29	Regressing Transformers for Data-efficient Visual Place Recognition	María Leyva-Vallina et.al.	2401.16304	null
2024-01-27	Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval	Ayush Dubey et.al.	2401.15362	null
2024-01-24	Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode	Naresh Kumar Lahajal et.al.	2401.13613	null
2024-01-23	PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion	Shyam Sundar Kannan et.al.	2401.13082	null
2024-01-23	SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization	Mingyang Li et.al.	2401.13076	link
2024-01-25	CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios	Xiangshuo Qiao et.al.	2401.10475	link
2024-01-19	PhotoScout: Synthesis-Powered Multi-Modal Image Search	Celeste Barnaby et.al.	2401.10464	null
2024-01-19	Cross-Modality Perturbation Synergy Attack for Person Re-identification	Yunpeng Gong et.al.	2401.10090	null
2024-01-16	Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging	Zahra Tabatabaei et.al.	2401.08272	null
2024-01-16	Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments	Bruno Arcanjo et.al.	2401.08263	null
2024-01-15	Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing	Jakob Hackstein et.al.	2401.07782	link
2024-01-14	HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval	Zexuan Qiu et.al.	2401.07212	link
2024-01-11	UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization	Rouwan Wu et.al.	2401.05971	link
2024-01-10	Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval	Eunyi Lyou et.al.	2401.04860	link
2024-01-05	Benchmarking PathCLIP for Pathology Image Analysis	Sunyi Zheng et.al.	2401.02651	null
2024-01-03	DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM with Joint Semantic Encoding	Mingrui Li et.al.	2401.01545	null
2024-01-02	BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving	Dafeng Wei et.al.	2401.01065	null
2023-12-31	Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval	Liang Wang et.al.	2401.00371	link
2023-12-29	Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering	Long-Kun Du et.al.	2401.00032	null
2023-12-27	LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization	Sai Shubodh Puligilla et.al.	2312.16648	null
2023-12-26	Recursive Distillation for Open-Set Distributed Robot Localization	Kenta Tsukahara et.al.	2312.15897	null
2023-12-24	Residual Learning for Image Point Descriptors	Rashik Shrestha et.al.	2312.15471	null
2023-12-23	CaLDiff: Camera Localization in NeRF via Pose Diffusion	Rashik Shrestha et.al.	2312.15242	null
2023-12-20	Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition	Bruno Arcanjo et.al.	2312.12995	null
2023-12-19	VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering	Chun-Mei Feng et.al.	2312.12273	link
2023-12-18	Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback	Boaz Lerner et.al.	2312.11078	link
2023-12-17	PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields	Boming Zhao et.al.	2312.10649	null
2023-12-17	DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition	Sijie Wang et.al.	2312.10616	link
2023-12-16	Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval	Decheng Liu et.al.	2312.10320	link
2023-12-15	Data-Efficient Multimodal Fusion on a Single GPU	Noël Vouitsis et.al.	2312.10144	link
2023-12-13	Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques	Hamed Qazanfari et.al.	2312.10089	null
2023-12-15	Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval	Zhe Ma et.al.	2312.09716	link
2023-12-14	Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition	Oliver Grainge et.al.	2312.09028	null
2023-12-14	Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking	Shitong Sun et.al.	2312.08924	null
2023-12-13	C-BEV: Contrastive Bird’s Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation	Florian Fervers et.al.	2312.08060	null
2023-12-12	Contextually Affinitive Neighborhood Refinery for Deep Clustering	Chunlin Yu et.al.	2312.07806	link
2023-12-12	Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval	Qiwei Tian et.al.	2312.07364	link
2023-12-12	Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection	Jonathan J. Y. Kim et.al.	2312.06991	null
2023-12-11	Dynamic Weighted Combiner for Mixed-Modal Image Retrieval	Fuxiang Huang et.al.	2312.06179	link
2023-12-06	Lite-Mind: Towards Efficient and Versatile Brain Representation Network	Zixuan Gong et.al.	2312.03781	link
2023-12-08	FreestyleRet: Retrieving Images from Style-Diversified Queries	Hao Li et.al.	2312.02428	link
2023-12-04	Implicit Learning of Scene Geometry from Poses for Global Localization	Mohammad Altillawi et.al.	2312.02029	null
2023-12-04	Language-only Efficient Training of Zero-shot Composed Image Retrieval	Geonmo Gu et.al.	2312.01998	link
2023-12-03	G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training	Che Liu et.al.	2312.01522	link
2023-12-01	Improve Supervised Representation Learning with Masked Image Modeling	Kaifeng Chen et.al.	2312.00950	null
2023-12-05	Grounding Everything: Emerging Localization Properties in Vision-Language Transformers	Walid Bousselham et.al.	2312.00878	link
2023-12-01	Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras	Mohammad Altillawi et.al.	2312.00500	null
2023-11-30	HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance	Zhuohao Yin et.al.	2311.18273	link
2023-11-30	Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models	Raviteja Vemulapalli et.al.	2311.18237	link
2023-11-29	Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce	Chang Liu et.al.	2311.17954	null
2023-11-28	Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames	Chao Chen et.al.	2311.17940	null
2023-11-29	360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries	Huajian Huang et.al.	2311.17389	link
2023-11-27	Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation	Samuele Poppi et.al.	2311.16254	link
2023-11-27	Optimal Transport Aggregation for Visual Place Recognition	Sergio Izquierdo et.al.	2311.15937	link
2023-11-27	AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval	Shicheng Xu et.al.	2311.14084	link
2023-11-23	3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology	Asma Ben Abacha et.al.	2311.13752	link
2023-11-22	Medical Image Retrieval Using Pretrained Embeddings	Farnaz Khun Jush et.al.	2311.13547	null
2023-11-22	Applications of Spiking Neural Networks in Visual Place Recognition	Somayeh Hussaini et.al.	2311.13186	link
2023-11-21	Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval	Xiu-Shen Wei et.al.	2311.12894	null
2023-11-21	Towards Accurate Loop Closure Detection in Semantic SLAM with 3D Semantic Covisibility Graphs	Zhentian Qian et.al.	2311.12245	null
2023-11-19	From Categories to Classifier: Name-Only Continual Learning by Exploring the Web	Ameya Prabhu et.al.	2311.11293	null
2023-11-18	Lesion Search with Self-supervised Learning	Kristin Qi et.al.	2311.11014	null
2023-11-15	Flow reconstruction and particle characterization from inertial Lagrangian tracks	Ke Zhou et.al.	2311.09076	null
2023-11-15	Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval	Junyang Chen et.al.	2311.07622	link
2023-11-13	VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search	Shuting He et.al.	2311.07514	null
2023-11-10	Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval	Xin Lu et.al.	2311.06067	null
2023-11-08	Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model	Junya Shiraishi et.al.	2311.04788	null
2023-11-08	Training CLIP models on Data from Scientific Papers	Calvin Metzger et.al.	2311.04711	link
2023-11-07	DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding	Kehinde Ajayi et.al.	2311.04098	link
2023-11-06	Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences	Zador Pataki et.al.	2311.03345	null
2023-11-06	FocusTune: Tuning Visual Localization through Focus-Guided Sampling	Son Tung Nguyen et.al.	2311.02872	link
2023-11-01	DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing	Gaoshuang Huang et.al.	2311.00230	link
2023-10-29	Identifiable Contrastive Learning with Automatic Feature Importance Discovery	Qi Zhang et.al.	2310.18904	link
2023-10-27	LipSim: A Provably Robust Perceptual Similarity Metric	Sara Ghazanfari et.al.	2310.18274	link
2023-10-27	Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation	Susu Fang et.al.	2310.17879	null
2023-10-25	FoundLoc: Vision-based Onboard Aerial Localization in the Wild	Yao He et.al.	2310.16299	null
2023-10-24	Cross-view Self-localization from Synthesized Scene-graphs	Ryogo Yamamoto et.al.	2310.15504	null
2023-10-23	Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval	Xu Yuan et.al.	2310.14637	link
2023-10-21	Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation	Anastasia Kritharoula et.al.	2310.14025	link
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605	null
2023-10-20	CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants	Shaoan Wang et.al.	2310.13320	link
2023-10-27	Representation Learning via Consistent Assignment of Views over Random Partitions	Thalles Silva et.al.	2310.12692	link
2023-10-18	Evaluating the Fairness of Discriminative Foundation Models in Computer Vision	Junaid Ali et.al.	2310.11867	link
2023-10-17	Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification	Shuanglin Yan et.al.	2310.11210	null
2023-10-16	Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People	Dharmateja Adapa et.al.	2310.10290	null
2023-10-16	EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge	Tom Bryan et.al.	2310.10050	null
2023-10-15	CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes	Yulei Qin et.al.	2310.09761	link
2023-10-13	Pairwise Similarity Learning is SimPLE	Yandong Wen et.al.	2310.09449	link
2023-10-13	Vision-by-Language for Training-Free Compositional Image Retrieval	Shyamgopal Karthik et.al.	2310.09291	link
2023-10-12	Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning	Shiyang Yan et.al.	2310.08390	null
2023-10-12	Jointly Optimized Global-Local Visual Localization of UAVs	Haoling Li et.al.	2310.08082	null
2023-10-10	Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization	Le Chen et.al.	2310.06984	null
2023-10-10	Distillation Improves Visual Place Recognition for Low-Quality Queries	Anbang Yang et.al.	2310.06906	link
2023-10-10	Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets	Jiajun Zhang et.al.	2310.06566	null
2023-10-10	Topological RANSAC for instance verification and retrieval without fine-tuning	Guoyuan An et.al.	2310.06486	null
2023-10-10	3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments	Ghanta Sai Krishna et.al.	2310.06385	null
2023-10-09	Collaborative Visual Place Recognition	Yiming Li et.al.	2310.05541	null
2023-10-09	Sentence-level Prompts Benefit Composed Image Retrieval	Yang Bai et.al.	2310.05473	link
2023-10-08	AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition	Feng Lu et.al.	2310.05184	link
2023-10-08	LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization	Artem Nenashev et.al.	2310.05134	null
2023-10-12	ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer	Yifan Xu et.al.	2310.04099	null
2023-10-06	Sub-token ViT Embedding via Stochastic Resonance Transformers	Dong Lao et.al.	2310.03967	link
2023-10-04	Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach	Matthew Hanlon et.al.	2310.02650	null
2023-10-02	NEUCORE: Neural Concept Reasoning for Composed Image Retrieval	Shu Zhao et.al.	2310.01358	null
2023-10-02	Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images	Georg Bökman et.al.	2310.01092	null
2023-10-05	PlaceNav: Topological Navigation through Place Recognition	Lauri Suomela et.al.	2309.17260	null
2023-09-29	Segment Anything Model is a Good Teacher for Local Feature Learning	Jingqian Wu et.al.	2309.16992	link
2023-09-28	Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning	Albert Mohwald et.al.	2309.16351	link
2023-09-28	FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding	Pengxiang Wu et.al.	2309.16249	link
2023-09-28	Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval	Yuanmin Tang et.al.	2309.16137	link
2023-09-27	GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization	Vicente Vivanco Cepeda et.al.	2309.16020	link
2023-09-27	Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization	Zhenbo Song et.al.	2309.15556	null
2023-09-26	Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features	Hila Levi et.al.	2309.14999	null
2023-09-23	Resolving References in Visually-Grounded Dialogue via Text Generation	Bram Willemsen et.al.	2309.13430	link
2023-09-21	Face Identity-Aware Disentanglement in StyleGAN	Adrian Suwała et.al.	2309.12033	null
2023-09-21	On-the-Fly SfM: What you capture is What you get	Zongqian Zhan et.al.	2309.11883	link
2023-09-20	2D-3D Pose Tracking with Multi-View Constraints	Huai Yu et.al.	2309.11335	null
2023-09-19	VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition	Adam D. Hines et.al.	2309.10225	link
2023-09-18	DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach	Chenghao Xu et.al.	2309.09879	null
2023-09-18	Decompose Semantic Shifts for Composed Image Retrieval	Xingyu Yang et.al.	2309.09531	null
2023-09-16	Efficient Object Rearrangement via Multi-view Fusion	Dehao Huang et.al.	2309.08994	null
2023-09-16	DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF	Mert Asim Karaoglu et.al.	2309.08927	link
2023-09-16	Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning	Pengyu Yin et.al.	2309.08914	link
2023-09-15	Active Learning for Fine-Grained Sketch-Based Image Retrieval	Himanshu Thakur et.al.	2309.08743	null
2023-09-15	Optimization of Rank Losses for Image Retrieval	Elias Ramzi et.al.	2309.08250	link
2023-09-18	Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer	Yaoting Wang et.al.	2309.07929	link
2023-09-14	EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization	Minjung Kim et.al.	2309.07471	link
2023-09-13	RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline	Mirko Usuelli et.al.	2309.07094	null
2023-09-11	Towards Content-based Pixel Retrieval in Revisited Oxford and Paris	Guoyuan An et.al.	2309.05438	link
2023-09-08	Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning	Hiroki Nakamura et.al.	2309.04148	null
2023-09-05	Magnetic Navigation using Attitude-Invariant Magnetic Field Information for Loop Closure Detection	Natalia Pavlasek et.al.	2309.02394	null
2023-09-05	Dual Relation Alignment for Composed Image Retrieval	Xintong Jiang et.al.	2309.02169	null
2023-09-04	NLLB-CLIP – train performant multilingual image retrieval model on a budget	Alexander Visheratin et.al.	2309.01859	null
2023-09-04	Target-Guided Composed Image Retrieval	Haokun Wen et.al.	2309.01366	null
2023-09-02	Deep supervised hashing for fast retrieval of radio image cubes	Steven Ndung’u et.al.	2309.00932	null
2023-08-31	Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval	Prateksha Udhayanan et.al.	2308.16649	null
2023-08-28	Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics	Nils Böhne et.al.	2308.14786	null
2023-08-28	CoVR: Learning Composed Video Retrieval from Web Video Captions	Lucas Ventura et.al.	2308.14746	link
2023-08-27	Deep Learning for Visual Localization and Mapping: A Survey	Changhao Chen et.al.	2308.14039	null
2023-08-26	Learning Efficient Representations for Image-Based Patent Retrieval	Hongsong Wang et.al.	2308.13749	null
2023-08-25	Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers	Mohammad Javad Rajabi et.al.	2308.13671	null
2023-08-24	Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities	Jinze Bai et.al.	2308.12966	link
2023-08-23	Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval	Huafeng Li et.al.	2308.11994	null
2023-08-23	OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes	Tao Xie et.al.	2308.11928	link
2023-08-22	Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features	Alberto Baldrati et.al.	2308.11485	link
2023-08-22	GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training	Xinchi Deng et.al.	2308.11331	null
2023-08-22	LDP-Feat: Image Features with Local Differential Privacy	Francesco Pittaluga et.al.	2308.11223	null
2023-08-21	EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition	Gabriele Berton et.al.	2308.10832	link
2023-08-20	FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory	Anwesan Pal et.al.	2308.10170	null
2023-08-18	3D Model-free Visual localization System from Essential Matrix under Local Planar Motion	Yanmei Jiao et.al.	2308.09566	null
2023-08-17	FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings	Yulin Su et.al.	2308.09012	link
2023-08-16	Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval	Aishwarya Venkataramanan et.al.	2308.08431	link
2023-08-16	Ranking-aware Uncertainty for Text-guided Image Retrieval	Junyang Chen et.al.	2308.08131	null
2023-08-19	Global Features are All You Need for Image Retrieval and Reranking	Shihao Shao et.al.	2308.06954	link
2023-08-14	MixBCT: Towards Self-Adapting Backward-Compatible Training	Yu Liang et.al.	2308.06948	link
2023-08-10	KS-APR: Keyframe Selection for Robust Absolute Pose Regression	Changkun Liu et.al.	2308.05459	null
2023-08-09	AspectMMKG: A Multi-modal Knowledge Graph with Aspect-aware Entities	Jingdan Zhang et.al.	2308.04992	link
2023-08-08	Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval	Yi Bin et.al.	2308.04343	link
2023-08-08	Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval	Yunquan Zhu et.al.	2308.04008	link
2023-08-05	A Comprehensive Analysis of Real-World Image Captioning and Scene Identification	Sai Suprabhanu Nallapaneni et.al.	2308.02833	null
2023-08-03	Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies	Eunsuk Seo et.al.	2308.01871	null
2023-08-01	AnyLoc: Towards Universal Visual Place Recognition	Nikhil Keetha et.al.	2308.00688	link
2023-07-31	Guiding Image Captioning Models Toward More Specific Captions	Simon Kornblith et.al.	2307.16686	null
2023-07-31	Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks	Kousik Rajesh et.al.	2307.16395	null
2023-07-28	D2S: Representing local descriptors and global scene coordinates for camera relocalization	Bach-Thuan Bui et.al.	2307.15250	link
2023-07-26	Neural-based Cross-modal Search and Retrieval of Artwork	Yan Gong et.al.	2307.14244	null
2023-07-26	Boon: A Neural Search Engine for Cross-Modal Information Retrieval	Yan Gong et.al.	2307.14240	null
2023-07-25	Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network	Chull Hwan Song et.al.	2307.13254	null
2023-07-28	SACReg: Scene-Agnostic Coordinate Regression for Visual Localization	Jerome Revaud et.al.	2307.11702	null
2023-07-19	Lazy Visual Localization via Motion Averaging	Siyan Dong et.al.	2307.09981	null
2023-07-19	Quantum Optics based Algorithm for Measuring the Similarity between Images	Vivek Mehta et.al.	2307.09789	null
2023-07-18	Jean-Luc Picard at Touché 2023: Comparing Image Generation, Stance Detection and Feature Matching for Image Retrieval for Arguments	Max Moebius et.al.	2307.09172	null
2023-07-18	3D-SeqMOS: A Novel Sequential 3D Moving Object Segmentation in Autonomous Driving	Qipeng Li et.al.	2307.09044	null
2023-07-19	Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation	Rundong Luo et.al.	2307.08779	null
2023-07-17	Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition	Gabriele Trivigno et.al.	2307.08417	link
2023-07-17	Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification	Tengfei Liang et.al.	2307.08316	link
2023-07-17	NDT-Map-Code: A 3D global descriptor for real-time loop closure detection in lidar SLAM	Lizhou Liao et.al.	2307.08221	link
2023-07-20	Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer	Yujiao Shi et.al.	2307.08015	link
2023-07-10	Phoneme-retrieval; voice recognition; vowels recognition	Brunello Tirozzi et.al.	2307.07407	null
2023-07-14	Risk Controlled Image Retrieval	Kaiwen Cai et.al.	2307.07336	link
2023-07-11	ResMatch: Residual Attention Learning for Local Feature Matching	Yuxin Deng et.al.	2307.05180	link
2023-07-11	Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification	Yi Liao et.al.	2307.05017	null
2023-07-10	Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor	San Jiang et.al.	2307.04520	null
2023-07-10	RaPlace: Place Recognition for Imaging Radar using Radon Transform and Mutable Threshold	Hyesu Jang et.al.	2307.04321	link
2023-07-08	Calibration-Aware Margin Loss: Pushing the Accuracy-Calibration Consistency Pareto Frontier for Deep Metric Learning	Qin Zhang et.al.	2307.04047	null
2023-07-04	Unsupervised Quality Prediction for Improved Single-Frame and Weighted Sequential Visual Place Recognition	Helen Carson et.al.	2307.01464	null
2023-07-04	Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network	Zizhuo Li et.al.	2307.01447	null
2023-07-03	Cross-modal Place Recognition in Image Databases using Event-based Sensors	Xiang Ji et.al.	2307.01047	null
2023-06-30	DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place Recognition under Adverse Conditions	Stephen Hausler et.al.	2306.17536	null
2023-06-30	Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization	Stephen Hausler et.al.	2306.17529	null
2023-06-27	Dental CLAIRES: Contrastive LAnguage Image REtrieval Search for Dental Research	Tanjida Kabir et.al.	2306.15651	null
2023-06-27	Mean Field Theory in Deep Metric Learning	Takuya Furusawa et.al.	2306.15368	null
2023-06-26	Hierarchical Matching and Reasoning for Multi-Query Image Retrieval	Zhong Ji et.al.	2306.14460	link
2023-06-25	Enhancing Dynamic Image Advertising with Vision-Language Pre-training	Zhoufutu Wen et.al.	2306.14112	null
2023-06-23	Catching Image Retrieval Generalization	Maksim Zhdanov et.al.	2306.13357	null
2023-06-22	Deep Metric Learning with Soft Orthogonal Proxies	Farshad Saberi-Movahed et.al.	2306.13055	null
2023-06-22	What to Learn: Features, Image Transformations, or Both?	Yuxuan Chen et.al.	2306.13040	null
2023-06-22	Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval	Katrin Glinka et.al.	2306.12843	null
2023-06-26	Annotation Cost Efficient Active Learning for Content Based Image Retrieval	Julia Henkel et.al.	2306.11605	null
2023-06-19	Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning	Shivaen Ramshetty et.al.	2306.11065	link
2023-06-18	LiDAR-Based Place Recognition For Autonomous Driving: A Survey	Pengcheng Shi et.al.	2306.10561	link
2023-06-15	Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization	Dror Aiger et.al.	2306.09012	link
2023-06-15	Prompt Performance Prediction for Generative IR	Nicolas Bizzozzero et.al.	2306.08915	null
2023-06-15	Graph Convolution Based Efficient Re-Ranking for Visual Retrieval	Yuqi Zhang et.al.	2306.08792	link
2023-06-13	GeneCIS: A Benchmark for General Conditional Image Similarity	Sagar Vaze et.al.	2306.07969	null
2023-06-13	MOFI: Learning Image Representations from Noisy Entity Annotated Images	Wentao Wu et.al.	2306.07952	link
2023-06-12	Zero-shot Composed Text-Image Retrieval	Yikun Liu et.al.	2306.07272	link
2023-06-12	Sticker820K: Empowering Interactive Retrieval with Stickers	Sijie Zhao et.al.	2306.06870	null
2023-06-11	Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models	Yuguang Yang et.al.	2306.06691	null
2023-06-03	Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval	Xu Zhang et.al.	2306.02092	null
2023-06-03	Class Anchor Margin Loss for Content-Based Image Retrieval	Alexandru Ghita et.al.	2306.00630	null
2023-05-31	Chatting Makes Perfect – Chat-based Image Retrieval	Matan Levy et.al.	2305.20062	link
2023-05-31	Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization	Junan Chen et.al.	2305.20044	null
2023-05-30	A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation	Omar Seddati et.al.	2305.18988	null
2023-05-29	Synfeal: A Data-Driven Simulator for End-to-End Camera Localization	Daniel Coelho et.al.	2305.18260	link
2023-05-29	Nanoscale visualization of the thermally-driven evolution of antiferromagnetic domains in FeTe thin films	Shrinkhala Sharma et.al.	2305.18197	null
2023-05-29	TReR: A Lightweight Transformer Re-Ranking Approach for 3D LiDAR Place Recognition	Tiago Barros et.al.	2305.18013	null
2023-05-28	ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval	Jiapeng Wang et.al.	2305.17652	null
2023-06-01	FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing	Zhuang Li et.al.	2305.17497	link
2023-05-27	Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation	Yueh-Cheng Huang et.al.	2305.17463	null
2023-05-26	Generating Images with Multimodal Language Models	Jing Yu Koh et.al.	2305.17216	link
2023-05-25	Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder	Zheyuan Liu et.al.	2305.16304	link
2023-05-23	Leveraging BEV Representation for 360-degree Visual Place Recognition	Xuecheng Xu et.al.	2305.13814	link
2023-05-23	EDIS: Entity-Driven Image Search over Multimodal Web Content	Siqi Liu et.al.	2305.13631	link
2023-05-20	DAC: Detector-Agnostic Spatial Covariances for Deep Local Features	Javier Tirado-Garín et.al.	2305.12250	link
2023-05-19	Towards More Transparent and Accurate Cancer Diagnosis with an Unsupervised CAE Approach	Zahra Tabatabaei et.al.	2305.11728	null
2023-05-19	Learning Sequence Descriptor based on Spatiotemporal Attention for Visual Place Recognition	Fenglin Zhang et.al.	2305.11467	link
2023-05-12	IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images	Varuna Krishna et.al.	2305.10438	link
2023-05-17	Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval	Haokun Wen et.al.	2305.09979	null
2023-05-13	Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance	Xinyu Lin et.al.	2305.07943	link
2023-05-11	Foundations of Spatial Perception for Robotics: Hierarchical Representations and Real-time Systems	Nathan Hughes et.al.	2305.07154	link
2023-05-09	Visual Place Recognition with Low-Resolution Images	Mihnea-Alexandru Tomita et.al.	2305.05776	null
2023-05-09	Vision-Language Models in Remote Sensing: Current Progress and Future Trends	Congcong Wen et.al.	2305.05726	null
2023-05-09	An Evaluation and Ranking of Different Voting Schemes for Improved Visual Place Recognition	Maria Waheed et.al.	2305.05705	null
2023-05-09	Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query	Ho Hin Lee et.al.	2305.05598	null
2023-05-09	ColonMapper: topological mapping and localization for colonoscopy	Javier Morlana et.al.	2305.05546	null
2023-05-09	Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization	Clémentin Boittiaux et.al.	2305.05301	link
2023-05-09	Patch-DrosoNet: Classifying Image Partitions With Fly-Inspired Models For Lightweight Visual Place Recognition	Bruno Arcanjo et.al.	2305.05256	null
2023-05-09	Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval	Shiyin Dong et.al.	2305.05144	null
2023-05-08	Hierarchical Visual Localization Based on Sparse Feature Pyramid for Adaptive Reduction of Keypoint Map Size	Andrei Potapov et.al.	2305.04856	null
2023-05-08	Privacy-Preserving Representations are not Enough – Recovering Scene Content from Camera Poses	Kunal Chelani et.al.	2305.04603	link
2023-05-06	Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer	Minyi Zhao et.al.	2305.04072	null
2023-05-06	Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing	Swagatika Dash et.al.	2305.03881	link
2023-05-05	COLA: How to adapt vision-language models to Compose Objects Localized with Attributes?	Arijit Ray et.al.	2305.03689	link
2023-05-05	HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer	Shuzhe Wang et.al.	2305.03595	null
2023-05-05	WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval	Zahra Tabatabaei et.al.	2305.03383	null
2023-05-04	Boundary-aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval	Tan Pan et.al.	2305.02610	link
2023-05-03	Learning-based Relational Object Matching Across Views	Cathrin Elich et.al.	2305.02398	null
2023-05-05	A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text	Yunxin Li et.al.	2305.02265	link
2023-05-03	AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation	Shentong Mo et.al.	2305.01836	null
2023-04-30	Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection	Jie Ren et.al.	2305.00435	null
2023-04-28	SFD2: Semantic-guided Feature Detection and Description	Fei Xue et.al.	2304.14845	link
2023-04-28	Quantum enhanced non-interferometric quantitative phase imaging	Giuseppe Ortolano et.al.	2304.14727	null
2023-04-26	Hydra-Multi: Collaborative Online Construction of 3D Scene Graphs with Multi-Robot Teams	Yun Chang et.al.	2304.13487	null
2023-04-27	STIR: Siamese Transformer for Image Retrieval Postprocessing	Aleksei Shabanov et.al.	2304.13393	null
2023-04-25	DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design	Jiahao Weng et.al.	2304.12506	null
2023-04-24	Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning	Lucas Pascotti Valem et.al.	2304.12448	link
2023-04-23	IDLL: Inverse Depth Line based Visual Localization in Challenging Environments	Wanting Li et.al.	2304.11748	null
2023-04-23	Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval	Mehdi Rafiei et.al.	2304.11734	null
2023-04-17	Features-over-the-Air: Contrastive Learning Enabled Cooperative Edge Inference	Haotian Wu et.al.	2304.08221	null
2023-04-17	NeRF-Loc: Visual Localization with Conditional Neural Radiance Field	Jianlin Liu et.al.	2304.07979	link
2023-04-16	Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification	Luca Piano et.al.	2304.07883	null
2023-04-16	Language Guided Local Infiltration for Interactive Image Retrieval	Fuxiang Huang et.al.	2304.07747	null
2023-04-16	Long-term Visual Localization with Mobile Sensors	Shen Yan et.al.	2304.07691	null
2023-04-16	Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging	Jielin Qiu et.al.	2304.07675	null
2023-04-14	CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression	Mubariz Zaffar et.al.	2304.07426	null
2023-04-14	FM-Loc: Using Foundation Models for Improved Vision-based Localization	Reihaneh Mirjalili et.al.	2304.07058	null
2023-04-17	Toward Real-Time Image Annotation Using Marginalized Coupled Dictionary Learning	Seyed Mahdi Roostaiyan et.al.	2304.06907	link
2023-04-17	You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization problem and dataset	Matteo Toso et.al.	2304.06373	link
2023-04-12	Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation	Yifeng Shi et.al.	2304.06051	link
2023-04-12	Visual Localization using Imperfect 3D Models from the Internet	Vojtech Panek et.al.	2304.05947	link
2023-04-12	Are Local Features All You Need for Cross-Domain Visual Place Recognition?	Giovanni Barbarani et.al.	2304.05887	link
2023-04-12	Unicom: Universal and Compact Representation Learning for Image Retrieval	Xiang An et.al.	2304.05884	link
2023-04-12	SGL: Structure Guidance Learning for Camera Localization	Xudong Zhang et.al.	2304.05571	null
2023-04-14	Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency	Xingwu Ji et.al.	2304.05146	link
2023-04-10	CAVL: Learning Contrastive and Adaptive Representations of Vision and Language	Shentong Mo et.al.	2304.04399	null
2023-04-09	Unsupervised Multi-Criteria Adversarial Detection in Deep Image Retrieval	Yanru Xiao et.al.	2304.04228	null
2023-04-08	SGIDN-LCD: An Appearance-based Loop Closure Detection Algorithm using Superpixel Grids and Incremental Dynamic Nodes	Baosheng Zhang et.al.	2304.03872	null
2023-04-06	$R^{2}$Former: Unified $R$etrieval and $R$ eranking Transformer for Place Recognition	Sijie Zhu et.al.	2304.03410	null
2023-04-06	Distributed formation-enforcing control for UAVs robust to observation noise in relative pose measurements	Viktor Walter et.al.	2304.03057	link
2023-04-05	Efficient OCR for Building a Diverse Digital History	Jacob Carlson et.al.	2304.02737	link
2023-04-05	LogoNet: a fine-grained network for instance-level logo sketch retrieval	Binbin Feng et.al.	2304.02214	link
2023-04-04	OrienterNet: Visual Localization in 2D Public Maps with Neural Matching	Paul-Edouard Sarlin et.al.	2304.02009	link
2023-04-04	Cross-Domain Image Captioning with Discriminative Finetuning	Roberto Dessì et.al.	2304.01662	link
2023-04-02	Learning Similarity between Scene Graphs and Images with Transformers	Yuren Cong et.al.	2304.00590	link
2023-04-01	NPR: Nocturnal Place Recognition in Street	Bingxi Liu et.al.	2304.00276	null
2023-03-31	Unsupervised crack detection on complex stone masonry surfaces	Panagiotis Agrafiotis et.al.	2303.17989	null
2023-03-30	If At First You Don’t Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval	Finlay G. C. Hudson et.al.	2303.17703	null
2023-03-30	Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime	Rhydian Windsor et.al.	2303.17644	null
2023-03-30	3D Line Mapping Revisited	Shaohui Liu et.al.	2303.17504	link
2023-03-30	Methods and advancement of content-based fashion image retrieval: A Review	Amin Muhammad Shoib et.al.	2303.17371	null
2023-03-30	Adaptive Cross Batch Normalization for Metric Learning	Thalaiyasingam Ajanthan et.al.	2303.17127	null
2023-03-30	MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks	Weicheng Kuo et.al.	2303.16839	null
2023-03-29	Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval	Leo Sampaio Ferraz Ribeiro et.al.	2303.16769	null
2023-03-29	Bi-directional Training for Composed Image Retrieval via Text Prompt Learning	Zheyuan Liu et.al.	2303.16604	link
2023-03-27	Model Cascades for Efficient Image Search	Robert Hönig et.al.	2303.15595	null
2023-03-27	Zero-Shot Composed Image Retrieval with Textual Inversion	Alberto Baldrati et.al.	2303.15247	link
2023-03-27	What Can Human Sketches Do for Object Detection?	Pinaki Nath Chowdhury et.al.	2303.15149	null
2023-03-25	Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style	Fengyin Lin et.al.	2303.14348	link
2023-03-24	A-MuSIC: An Adaptive Ensemble System For Visual Place Recognition In Changing Environments	Bruno Arcanjo et.al.	2303.14247	null
2023-03-24	PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View	Ze Shi et.al.	2303.14095	link
2023-03-24	Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR	Aneeshan Sain et.al.	2303.13779	null
2023-03-28	CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not	Aneeshan Sain et.al.	2303.13440	null
2023-03-22	Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval	Xunguang Wang et.al.	2303.12658	null
2023-03-21	CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion	Geonmo Gu et.al.	2303.11916	link
2023-03-21	LIMITR: Leveraging Local Information for Medical Image-Text Representation	Gefen Dawidowicz et.al.	2303.11755	null
2023-03-25	Data-efficient Large Scale Place Recognition with Graded Similarity Supervision	Maria Leyva-Vallina et.al.	2303.11739	link
2023-03-20	Picture that Sketch: Photorealistic Image Generation from Abstract Sketches	Subhadeep Koley et.al.	2303.11162	null
2023-03-19	Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths	Ming Xu et.al.	2303.10778	link
2023-03-17	MRIS: A Multi-modal Retrieval Approach for Image Synthesis on Diverse Modalities	Boqi Chen et.al.	2303.10249	null
2023-03-17	IRGen: Generative Modeling for Image Retrieval	Yidan Zhang et.al.	2303.10126	link
2023-03-16	Data Roaming and Early Fusion for Composed Image Retrieval	Matan Levy et.al.	2303.09429	link
2023-03-16	Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval	Yi Xie et.al.	2303.09230	null
2023-03-16	Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space	Yuhang He et.al.	2303.09192	null
2023-03-16	Unsupervised Facial Expression Representation Learning with Contrastive Local Warping	Fanglei Xue et.al.	2303.09034	null
2023-03-15	A Triplet-loss Dilated Residual Network for High-Resolution Representation Learning in Image Retrieval	Saeideh Yousefzadeh et.al.	2303.08398	null
2023-03-14	Data-Free Sketch-Based Image Retrieval	Abhra Chaudhuri et.al.	2303.07775	link
2023-03-14	PATS: Patch Area Transportation with Subdivision for Local Feature Matching	Junjie Ni et.al.	2303.07700	null
2023-03-10	Robotic Applications of Pre-Trained Vision-Language Models to Various Recognition Behaviors	Kento Kawaharazuka et.al.	2303.05674	null
2023-03-09	Dominating Set Database Selection for Visual Place Recognition	Anastasiia Kornilova et.al.	2303.05123	null
2023-03-07	Graph Neural Networks in Vision-Language Image Understanding: A Survey	Henry Senior et.al.	2303.03761	null
2023-03-07	Sketch-based Medical Image Retrieval	Kazuma Kobayashi et.al.	2303.03633	link
2023-03-06	Visual Place Recognition: A Tutorial	Stefan Schubert et.al.	2303.03281	link
2023-03-06	MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval	Rohit Agarwal et.al.	2303.03050	link
2023-03-06	Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints	Chenjie Cao et.al.	2303.02885	link
2023-03-05	Composing Mood Board with User Feedback in Concept Space	Shin Sano et.al.	2303.02547	null
2023-03-04	FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks	Xiao Han et.al.	2303.02483	link
2023-03-09	Self-Supervised Learning for Place Representation Generalization across Appearance Changes	Mohamed Adel Musallam et.al.	2303.02370	null
2023-03-03	MixVPR: Feature Mixing for Visual Place Recognition	Amar Ali-bey et.al.	2303.02190	link
2023-03-01	A Complementarity-Based Switch-Fuse System for Improved Visual Place Recognition	Maria Waheed et.al.	2303.00714	null
2023-03-01	ORCHNet: A Robust Global Feature Aggregation approach for 3D LiDAR-based Place recognition in Orchards	T. Barros et.al.	2303.00477	link
2023-03-03	Renderable Neural Radiance Map for Visual Navigation	Obin Kwon et.al.	2303.00304	null
2023-03-01	Region Prediction for Efficient Robot Localization on Large Maps	Matteo Scucchia et.al.	2303.00295	link
2023-02-28	OEKG: The Open Event Knowledge Graph	Simon Gottschalk et.al.	2302.14688	null
2023-02-28	Global Proxy-based Hard Mining for Visual Place Recognition	Amar Ali-bey et.al.	2302.14217	link
2023-02-27	Efficient Informed Proposals for Discrete Distributions via Newton’s Series Approximation	Yue Xiang et.al.	2302.13929	link
2023-02-26	Data-Efficient Sequence-Based Visual Place Recognition with Highly Compressed JPEG Images	Mihnea-Alexandru Tomita et.al.	2302.13314	null
2023-02-26	Learning cross space mapping via DNN using large scale click-through logs	Wei Yu et.al.	2302.13275	null
2023-02-25	DeepBrainPrint: A Novel Contrastive Framework for Brain MRI Re-Identification	Lemuel Puglisi et.al.	2302.13057	null
2023-02-23	Teaching CLIP to Count to Ten	Roni Paiss et.al.	2302.12066	null
2023-02-22	Steerable Equivariant Representation Learning	Sangnie Bhardwaj et.al.	2302.11349	null
2023-02-21	iQPP: A Benchmark for Image Query Performance Prediction	Eduard Poesina et.al.	2302.10126	link
2023-02-20	Ontology-aware Network for Zero-shot Sketch-based Image Retrieval	Haoxiang Zhang et.al.	2302.10040	null
2023-02-20	TBPos: Dataset for Large-Scale Precision Visual Localization	Masud Fahim et.al.	2302.09825	link
2023-02-17	Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts	Zhihong Chen et.al.	2302.08958	link
2023-02-22	Fashion Image Retrieval with Multi-Granular Alignment	Jinkuan Zhu et.al.	2302.08902	null
2023-02-15	Unsupervised Hashing via Similarity Distribution Calibration	Kam Woh Ng et.al.	2302.07669	link
2023-02-13	Render-and-Compare: Cross-View 6 DoF Localization from Noisy Prior	Shen Yan et.al.	2302.06287	link
2023-02-13	Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation	Binqian Jiang et.al.	2302.06149	link
2023-02-13	Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval	Xu Wang et.al.	2302.06081	link
2023-02-11	Sketch Less Face Image Retrieval: A New Challenge	Dawei Dai et.al.	2302.05576	link
2023-02-10	Is multi-modal vision supervision beneficial to language?	Avinash Madasu et.al.	2302.05016	link
2023-02-06	Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval	Kuniaki Saito et.al.	2302.03084	link
2023-02-06	Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs	Michael Kirchhof et.al.	2302.02865	link
2023-02-03	Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization	Yingying Zhu et.al.	2302.01572	link
2023-02-04	Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval	Frederik Warburg et.al.	2302.01332	link
2023-01-31	Grounding Language Models to Images for Multimodal Generation	Jing Yu Koh et.al.	2301.13823	link
2023-01-31	UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers	Dachuan Shi et.al.	2301.13741	link
2023-01-23	Lexi: Self-Supervised Learning of the UI Language	Pratyay Banerjee et.al.	2301.10165	link
2023-01-17	Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval	Yuchen Wu et.al.	2301.06685	null
2023-01-19	High-bandwidth Close-Range Information Transport through Light Pipes	Joowon Lim et.al.	2301.06496	null
2023-01-13	A LiDAR-Inertial-Visual SLAM System with Loop Detection	Kangcheng Liu et.al.	2301.05604	null
2023-01-12	GH-Feat: Learning Versatile Generative Hierarchical Features from GANs	Yinghao Xu et.al.	2301.05315	null
2023-01-10	Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images	Xindi Wu et.al.	2301.04224	null
2023-01-10	Collaborative Semantic Communication at the Edge	Wing Fei Lo et.al.	2301.03996	null
2023-01-10	Online Backfilling with No Regret for Large-Scale Image Retrieval	Seonguk Seo et.al.	2301.03767	null
2023-01-06	CyberLoc: Towards Accurate Long-term Visual Localization	Liu Liu et.al.	2301.02403	null
2023-01-05	A Probabilistic Framework for Visual Localization in Ambiguous Scenes	Fereidoon Zangeneh et.al.	2301.02086	link
2022-12-31	4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions	Patrick Wenzel et.al.	2301.01147	null
2022-12-30	HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images	Dmitry Yudin et.al.	2212.14649	link
2022-12-27	Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning	Wooyoung Kang et.al.	2212.13563	link
2022-12-23	SuperGF: Unifying Local and Global Features for Visual Localization	Wenzheng Song et.al.	2212.13105	null
2022-12-24	GraffMatch: Global Matching of 3D Lines and Planes for Wide Baseline LiDAR Registration	Parker C. Lusk et.al.	2212.12745	null
2022-12-19	From a Bird’s Eye View to See: Joint Camera and Subject Registration without the Camera Calibration	Zekun Qian et.al.	2212.09298	link
2022-12-14	The Infinite Index: Information Retrieval on Generative Text-To-Image Models	Niklas Deckers et.al.	2212.07476	null
2022-12-14	Shared Coupling-bridge for Weakly Supervised Local Feature Learning	Jiayuan Sun et.al.	2212.07047	link
2022-12-08	Group Generalized Mean Pooling for Vision Transformer	Byungsoo Ko et.al.	2212.04114	null
2022-12-12	Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models	Gowthami Somepalli et.al.	2212.03860	null
2022-12-07	LSVL: Large-scale season-invariant visual localization for UAVs	Jouko Kinnari et.al.	2212.03581	null
2022-12-06	ADIR: Adaptive Diffusion for Image Reconstruction	Shady Abu-Hussein et.al.	2212.03221	null
2022-12-08	Privacy-Preserving Visual Localization with Event Cameras	Junho Kim et.al.	2212.03177	link
2022-12-06	Semantic Communication for Internet of Vehicles: A Multi-User Cooperative Approach	Wenjun Xu et.al.	2212.03037	null
2022-12-06	Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds	Zhipeng Zhao et.al.	2212.02757	null
2022-12-04	Fast and Lightweight Scene Regressor for Camera Relocalization	Thuan B. Bui et.al.	2212.01830	link
2022-12-02	Information Retrieval from the Digitized Books	Riya Gupta et.al.	2212.00999	null
2022-12-09	StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition	Yanqing Shen et.al.	2212.00937	null
2022-11-30	Self-Supervised Feature Learning for Long-Term Metric Visual Localization	Yuxuan Chen et.al.	2212.00122	null
2022-11-30	SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation	Tianyu Zhang et.al.	2211.16697	link
2022-11-28	SLAN: Self-Locator Aided Network for Cross-Modal Understanding	Jiang-Tian Zhai et.al.	2211.16208	null
2022-11-29	RankDNN: Learning to Rank for Few-shot Learning	Qianyu Guo et.al.	2211.15320	link
2022-11-28	Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map	Xi Zheng et.al.	2211.15127	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069	link
2022-11-27	BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images	Zhihuang Zhang et.al.	2211.14927	null
2022-11-27	A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition	Rui Huang et.al.	2211.14864	null
2022-11-26	Visual Place Recognition	Bailu Guo et.al.	2211.14533	null
2022-11-26	Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval	Fan Yang et.al.	2211.14515	link
2022-11-30	Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark	Floriana Ciaglia et.al.	2211.13523	link
2022-11-23	InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images	Konstantin Kobs et.al.	2211.12760	link
2022-11-29	Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments	Joshua Knights et.al.	2211.12732	link
2022-11-23	FE-Fusion-VPR: Attention-based Multi-Scale Network Architecture for Visual Place Recognition by Fusing Frames and Events	Kuanxu Hou et.al.	2211.12244	null
2022-11-22	Multimorbidity Content-Based Medical Image Retrieval Using Proxies	Yunyan Xing et.al.	2211.12185	null
2022-11-22	Vision-based localization methods under GPS-denied conditions	Zihao Lu et.al.	2211.11988	null
2022-11-21	ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields	Mohammad Mahdi Johari et.al.	2211.11704	null
2022-11-21	LISA: Localized Image Stylization with Audio via Implicit Neural Representation	Seung Hyun Lee et.al.	2211.11381	null
2022-11-21	NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization	Shitao Tang et.al.	2211.11177	link
2022-11-16	Improving Feature-based Visual Localization by Geometry-Aided Matching	Hailin Yu et.al.	2211.08712	link
2022-11-15	LiePoseNet: Heterogeneous Loss Function Based on Lie Group for Significant Speed-up of PoseNet Training Process	Mikhail Kurenkov et.al.	2211.08480	null
2022-11-14	Degeneracy removal of spin bands in antiferromagnets with non-interconvertible spin motif pair	Lin-Ding Yuan et.al.	2211.07803	null
2022-11-14	Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition	Farid Alijani et.al.	2211.07696	null
2022-11-14	Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization	Yiyang Chen et.al.	2211.07394	link
2022-11-14	Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment	Junyang Wang et.al.	2211.07275	null
2022-11-14	ContextCLIP: Contextual Alignment of Image-Text pairs on CLIP visual representations	Chanda Grover et.al.	2211.07122	null
2022-11-14	Few-shot Metric Learning: Online Adaptation of Embedding for Retrieval	Deunsol Jung et.al.	2211.07116	null
2022-11-12	Partial Visual-Semantic Embedding: Fashion Intelligence System with Sensitive Part-by-Part Learning	Ryotaro Shimizu et.al.	2211.06688	null
2022-11-09	Visual Named Entity Linking: A New Dataset and A Baseline	Wenxiang Sun et.al.	2211.04872	link
2022-11-07	Ultrafast Image Retrieval from a Holographic Memory Disc for High-Speed Operation of a Shift, Scale, and Rotation Invariant Target Recognition System	Julian Gamboa et.al.	2211.03881	null
2022-11-06	A Geometrically Constrained Point Matching based on View-invariant Cross-ratios, and Homography	Yueh-Cheng Huang et.al.	2211.03007	null
2022-11-02	Optimizing Fiducial Marker Placement for Improved Visual Localization	Qiangqiang Huang et.al.	2211.01513	link
2022-11-02	A comparison of uncertainty estimation approaches for DNN-based camera localization	Matteo Vaghi et.al.	2211.01234	null
2022-11-02	M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval	Layne Berry et.al.	2211.01180	null
2022-11-11	Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality	Anuj Diwan et.al.	2211.00768	link
2022-11-07	Fashion-Specific Attributes Interpretation via Dual Gaussian Visual-Semantic Embedding	Ryotaro Shimizu et.al.	2210.17417	null
2022-10-27	Structuring User-Generated Content on Social Media with Multimodal Aspect-Based Sentiment Analysis	Miriam Anschütz et.al.	2210.15377	link
2022-10-27	Leveraging Computer Vision Application in Visual Arts: A Case Study on the Use of Residual Neural Network to Classify and Analyze Baroque Paintings	Daniel Kvak et.al.	2210.15300	null
2022-10-27	Towards Practicality of Sketch-Based Visual Understanding	Ayan Kumar Bhunia et.al.	2210.15146	null
2022-10-27	MMFL-Net: Multi-scale and Multi-granularity Feature Learning for Cross-domain Fashion Retrieval	Chen Bao et.al.	2210.15128	null
2022-10-26	FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning	Suvir Mirchandani et.al.	2210.15028	null
2022-10-26	FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization	Junyang Wang et.al.	2210.14562	null
2022-11-02	A Framework for Collaborative Multi-Robot Mapping using Spectral Graph Wavelets	Lukas Bernreiter et.al.	2210.13856	null
2022-10-27	Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision	Tzu-Jui Julius Wang et.al.	2210.13591	null
2022-10-24	Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval	Zhaopeng Dou et.al.	2210.13440	link
2022-10-23	Neural Eigenfunctions Are Structured Representation Learners	Zhijie Deng et.al.	2210.12637	link
2022-10-21	Boosting vision transformers for image retrieval	Chull Hwan Song et.al.	2210.11909	link
2022-10-20	Communication breakdown: On the low mutual intelligibility between human and neural captioning	Roberto Dessì et.al.	2210.11512	link
2022-10-19	Image Semantic Relation Generation	Mingzhe Du et.al.	2210.11253	null
2022-10-20	General Image Descriptors for Open World Image Retrieval using ViT CLIP	Marcos V. Conde et.al.	2210.11141	link
2022-10-20	DeepRING: Learning Roto-translation Invariant Representation for LiDAR based Place Recognition	Sha Lu et.al.	2210.11029	null
2022-10-19	Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval	Abhra Chaudhuri et.al.	2210.10486	link
2022-10-19	GSV-Cities: Toward Appropriate Supervised Visual Place Recognition	Amar Ali-bey et.al.	2210.10239	link
2022-10-18	A Real-Time Fusion Framework for Long-term Visual Localization	Yuchen Yang et.al.	2210.09757	null
2022-10-17	Bridging the Gap between Local Semantic Concepts and Bag of Visual Words for Natural Scene Image Retrieval	Yousef Alqasrawi et.al.	2210.08875	null
2022-10-17	SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation	Woo Suk Choi et.al.	2210.08675	null
2022-10-16	Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers	Tao Tang et.al.	2210.08458	link
2022-10-14	Cross-Scale Context Extracted Hashing for Fine-Grained Image Binary Encoding	Xuetong Xue et.al.	2210.07572	link
2022-10-14	Boosting Performance of a Baseline Visual Place Recognition Technique by Predicting the Maximally Complementary Technique	Connor Malone et.al.	2210.07509	null
2022-10-11	Large-to-small Image Resolution Asymmetry in Deep Metric Learning	Pavel Suma et.al.	2210.05463	link
2022-10-09	Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning	Ali Safa et.al.	2210.04236	null
2022-10-05	Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features	Deepak Gupta et.al.	2210.02401	link
2022-10-05	Granularity-aware Adaptation for Image Retrieval over Multiple Tasks	Jon Almazán et.al.	2210.02254	null
2022-10-05	Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective	Zijian Zhang et.al.	2210.02206	link
2022-10-04	Supervised Metric Learning for Retrieval via Contextual Similarity Optimization	Christopher Liao et.al.	2210.01908	link
2022-10-04	Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing	Weiying Wang et.al.	2210.01320	null
2022-10-03	Merging Classification Predictions with Sequential Information for Lightweight Visual Place Recognition in Changing Environments	Bruno Arcanjo et.al.	2210.00834	null
2022-10-02	Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval	Kei Nishimaki et.al.	2210.00506	null
2022-09-29	Guided Unsupervised Learning by Subaperture Decomposition for Ocean SAR Image Retrieval	Nicolae-Cătălin Ristea et.al.	2209.15034	null
2022-09-28	TVLT: Textless Vision-Language Transformer	Zineng Tang et.al.	2209.14156	link
2022-09-28	SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval	Yang Shen et.al.	2209.13833	link
2022-09-28	Learning Deep Representations via Contrastive Learning for Instance Retrieval	Tao Wu et.al.	2209.13832	null
2022-09-28	Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text	Cheng-An Hsieh et.al.	2209.13764	link
2022-09-27	Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors	Hao Dong et.al.	2209.13586	link
2022-09-27	Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability	Peisong Wen et.al.	2209.13262	link
2022-09-26	NDD: A 3D Point Cloud Descriptor Based on Normal Distribution for Loop Closure Detection	Ruihao Zhou et.al.	2209.12513	link
2022-09-25	Personalized Saliency in Task-Oriented Semantic Communications: Image Transmission and Performance Analysis	Jiawen Kang et.al.	2209.12274	link
2022-09-24	Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes	Jonathan J. Y. Kim et.al.	2209.11894	null
2022-09-23	Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs	Youya Xia et.al.	2209.11673	null
2022-09-23	Query-based Hard-Image Retrieval for Object Detection at Test Time	Edward Ayers et.al.	2209.11559	link
2022-09-23	Unsupervised Hashing with Semantic Concept Mining	Rong-Cheng Tu et.al.	2209.11475	link
2022-09-22	UNav: An Infrastructure-Independent Vision-Based Navigation System for People with Blindness and Low vision	Anbang Yang et.al.	2209.11336	null
2022-09-21	Visual Localization and Mapping in Dynamic and Changing Environments	João Carlos Virgolino Soares et.al.	2209.10710	null
2022-09-20	PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention	José Arce et.al.	2209.09699	link
2022-09-19	Deep Metric Learning with Chance Constraints	Yeti Z. Gurbuz et.al.	2209.09060	link
2022-09-18	HGI-SLAM: Loop Closure With Human and Geometric Importance Features	Shuhul Mujoo et.al.	2209.08608	null
2022-09-18	Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM	Jiarui Tan et.al.	2209.08578	link
2022-09-17	Data Efficient Visual Place Recognition Using Extremely JPEG-Compressed Images	Mihnea-Alexandru Tomita et.al.	2209.08343	null
2022-09-15	Efficient Planar Pose Estimation via UWB Measurements	Haodong Jiang et.al.	2209.06779	link
2022-09-14	Transformers and CNNs both Beat Humans on SBIR	Omar Seddati et.al.	2209.06629	null
2022-09-14	Tac2Structure: Object Surface Reconstruction Only through Multi Times Touch	J. Lu et.al.	2209.06545	link
2022-09-14	iSimLoc: Visual Global Localization for Previously Unseen Environments with Simulated Images	Peng Yin et.al.	2209.06376	null
2022-09-09	General Place Recognition Survey: Towards the Real-world Autonomy Age	Peng Yin et.al.	2209.04497	link
2022-09-09	Retinal Image Restoration and Vessel Segmentation using Modified Cycle-CBAM and CBAM-UNet	Alnur Alimanov et.al.	2209.04234	link
2022-09-13	Segment Augmentation and Differentiable Ranking for Logo Retrieval	Feyza Yavuz et.al.	2209.02482	null
2022-09-12	ScaleFace: Uncertainty-aware Deep Metric Learning	Roman Kail et.al.	2209.01880	link
2022-09-04	CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud	Evgeny Yudin et.al.	2209.01605	null
2022-08-31	EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing	Qihua Feng et.al.	2208.14657	link
2022-08-25	A Deep Perceptual Measure for Lens and Camera Calibration	Yannick Hold-Geoffroy et.al.	2208.12300	null
2022-08-25	A Privacy-Preserving and End-to-End-Based Encrypted Image Retrieval Scheme	Zhixun Lu et.al.	2208.11876	null
2022-08-23	Satellite Image Search in AgoraEO	Ahmet Kerem Aksoy et.al.	2208.10830	null
2022-08-20	Fuse and Attend: Generalized Embedding Learning for Art and Sketches	Ujjal Kr Dutta et.al.	2208.09698	null
2022-08-19	Self-Supervised Visual Place Recognition by Mining Temporal and Feature Neighborhoods	Chao Chen et.al.	2208.09315	link
2022-08-19	TTT-UCDR: Test-time Training for Universal Cross-Domain Retrieval	Soumava Paul et.al.	2208.09198	link
2022-08-17	Visual Cross-View Metric Localization with Dense Uncertainty Estimates	Zimin Xia et.al.	2208.08519	link
2022-08-17	Understanding Attention for Vision-and-Language Tasks	Feiqi Cao et.al.	2208.08104	link
2022-08-14	Visual Localization via Few-Shot Scene Region Classification	Siyan Dong et.al.	2208.06933	link
2022-08-14	HyP $^2$ Loss: Beyond Hypersphere Metric Space for Multi-label Image Retrieval	Chengyin Xu et.al.	2208.06866	link
2022-08-13	Finding Point with Image: An End-to-End Benchmark for Vision-based UAV Localization	Ming Dai et.al.	2208.06561	link
2022-08-16	Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion Augmentation	Georgios Kouros et.al.	2208.06195	link
2022-08-12	Instance Image Retrieval by Learning Purely From Within the Dataset	Zhongyan Zhang et.al.	2208.06119	null
2022-08-07	CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization	Yujiao Shi et.al.	2208.03660	null
2022-08-05	A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch	Patsorn Sangkloy et.al.	2208.03354	null
2022-08-05	ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding	Bingning Wang et.al.	2208.03030	link
2022-08-04	Pattern Spotting and Image Retrieval in Historical Documents using Deep Hashing	Caio da S. Dias et.al.	2208.02397	null
2022-07-27	On the robustness of self-supervised representations for multi-view object classification	David Torpey et.al.	2208.00787	null
2022-07-26	Multimodal Neural Machine Translation with Search Engine Based Image Retrieval	ZhenHao Tang et.al.	2208.00767	null
2022-07-30	Towards Privacy-Preserving, Real-Time and Lossless Feature Matching	Qiang Meng et.al.	2208.00214	link
2022-07-30	DAS: Densely-Anchored Sampling for Deep Metric Learning	Lizhao Liu et.al.	2208.00119	link
2022-07-29	Curriculum Learning for Data-Efficient Vision-Language Alignment	Tejas Srinivasan et.al.	2207.14525	null
2022-07-29	Neural Density-Distance Fields	Itsuki Ueda et.al.	2207.14455	link
2022-07-27	Abstracting Sketches through Simple Primitives	Stephan Alaniz et.al.	2207.13543	link
2022-07-27	Satellite Image Based Cross-view Localization for Autonomous Vehicle	Shan Wang et.al.	2207.13506	null
2022-07-26	RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments	Jiahui Zhang et.al.	2207.12579	null
2022-07-25	A hybrid-qudit representation of digital RGB images	Sreetama Das et.al.	2207.12550	null
2022-07-19	ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization	Ivan Cisneros et.al.	2207.12317	link
2022-07-22	PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes	BaoSheng Zhang et.al.	2207.10916	null
2022-07-25	MeshLoc: Mesh-Based Visual Localization	Vojtech Panek et.al.	2207.10762	link
2022-07-20	Revisiting Hotels-50K and Hotel-ID	Aarash Feizi et.al.	2207.10200	link
2022-07-20	Feature Representation Learning for Unsupervised Cross-domain Image Retrieval	Conghui Hu et.al.	2207.09721	link
2022-07-19	SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany	Dominik Koßmann et.al.	2207.09507	null
2022-07-19	Context Unaware Knowledge Distillation for Image Retrieval	Bytasandram Yaswanth Reddy et.al.	2207.09070	link
2022-07-17	FashionViL: Fashion-Focused Vision-and-Language Representation Learning	Xiao Han et.al.	2207.08150	link
2022-07-14	AutoMerge: A Framework for Map Assembling and Smoothing in City-scale Environments	Peng Yin et.al.	2207.06965	null
2022-07-14	Semi-supervised Vector-Quantization in Visual SLAM using HGCN	Amir Zarringhalam et.al.	2207.06738	null
2022-07-14	Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders	Amir Zarringhalam et.al.	2207.06732	null
2022-07-19	Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras	Fangwen Shu et.al.	2207.06058	link
2022-07-12	CPO: Change Robust Panorama to Point Cloud Localization	Junho Kim et.al.	2207.05317	link
2022-07-05	Hierarchical Average Precision Training for Pertinent Image Retrieval	Elias Ramzi et.al.	2207.04873	link
2022-07-11	A clinically motivated self-supervised approach for content-based image retrieval of CT liver images	Kristoffer Knutsen Wickstrøm et.al.	2207.04812	link
2022-07-09	BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval	Wenqiao Zhang et.al.	2207.04211	null
2022-07-08	Learning Sequential Descriptors for Sequence-based Visual Place Recognition	Riccardo Mereu et.al.	2207.03868	link
2022-07-08	GEMS: Scene Expansion using Generative Models of Graphs	Rishi Agarwal et.al.	2207.03729	null
2022-07-05	Object-Level Targeted Selection via Deep Template Matching	Suraj Kothawade et.al.	2207.01778	null
2022-07-06	Adaptive Fine-Grained Sketch-Based Image Retrieval	Ayan Kumar Bhunia et.al.	2207.01723	link
2022-07-04	Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets	Paul Albert et.al.	2207.01573	link
2022-07-08	Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval	Keyu Wen et.al.	2207.00733	null
2022-07-01	DALG: Deep Attentive Local and Global Modeling for Image Retrieval	Yuxin Song et.al.	2207.00287	null
2022-07-04	BadHash: Invisible Backdoor Attacks against Deep Hashing with Clean Label	Shengshan Hu et.al.	2207.00278	link
2022-06-28	Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems	Stephen Hausler et.al.	2206.13883	null
2022-07-08	How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels	Tobias Fischer et.al.	2206.13673	link
2022-06-25	FreSCo: Frequency-Domain Scan Context for LiDAR-based Place Recognition with Translation and Rotation Invariance	Yongzhi Fan et.al.	2206.12628	link
2022-06-25	Inverted Semantic-Index for Image Retrieval	Ying Wang et.al.	2206.12623	null
2022-06-17	RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval	Yihan Wu et.al.	2206.11225	null
2022-06-22	ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas	Prathmesh Madhu et.al.	2206.11115	null
2022-06-20	Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval	Guile Wu et.al.	2206.09806	null
2022-06-18	Attention-based Dynamic Subspace Learners for Medical Image Analysis	Sukesh Adiga V et.al.	2206.09068	null
2022-06-17	Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments	Khairuldanial Ismail et.al.	2206.08733	null
2022-06-06	Learning Treatment Plan Representations for Content Based Image Retrieval	Charles Huang et.al.	2206.02912	null
2022-06-19	NORPPA: NOvel Ringed seal re-identification by Pelage Pattern Aggregation	Ekaterina Nepovinnykh et.al.	2206.02498	link
2022-06-05	Autoregressive Model for Multi-Pass SAR Change Detection Based on Image Stacks	B. G. Palm et.al.	2206.02278	null
2022-05-28	FaIRCoP: Facial Image Retrieval using Contrastive Personalization	Devansh Gupta et.al.	2205.15870	null
2022-05-31	Investigating the Role of Image Retrieval for Visual Localization – An exhaustive benchmark	Martin Humenberger et.al.	2205.15761	link
2022-05-27	Improving Road Segmentation in Challenging Domains Using Similar Place Priors	Connor Malone et.al.	2205.14112	null
2022-05-31	LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments	Yun Chang et.al.	2205.13135	link
2022-05-26	Fine-grained Image Captioning with CLIP Reward	Jaemin Cho et.al.	2205.13115	link
2022-05-25	Deep Dense Local Feature Matching and Vehicle Removal for Indoor Visual Localization	Kyung Ho Park et.al.	2205.12544	null
2022-05-24	OnePose: One-Shot Object Pose Estimation without CAD Models	Jiaming Sun et.al.	2205.12257	link
2022-05-23	VPAIR – Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments	Michael Schleiss et.al.	2205.11567	link
2022-05-23	VQA-GNN: Reasoning with Multimodal Semantic Graph for Visual Question Answering	Yanan Wang et.al.	2205.11501	null
2022-05-23	Deep Image Retrieval is not Robust to Label Noise	Stanislav Dereka et.al.	2205.11195	null
2022-05-22	Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval	Zelong Zeng et.al.	2205.10878	link
2022-05-20	Visually-Augmented Language Modeling	Weizhi Wang et.al.	2205.10178	link
2022-05-18	Deep Features for CBIR with Scarce Data using Hebbian Learning	Gabriele Lagani et.al.	2205.08935	null
2022-05-19	Text Detection & Recognition in the Wild for Robot Localization	Zobeir Raisi et.al.	2205.08565	null
2022-05-12	One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code	Yong Dai et.al.	2205.06126	null
2022-05-11	Review on Panoramic Imaging and Its Applications in Scene Understanding	Shaohua Gao et.al.	2205.05570	null
2022-05-18	Identical Image Retrieval using Deep Learning	Sayan Nath et.al.	2205.04883	link
2022-05-09	Introspective Deep Metric Learning	Chengkun Wang et.al.	2205.04449	link
2022-05-11	Improved Evaluation and Generation of Grid Layouts using Distance Preservation Quality and Linear Assignment Sorting	Kai Uwe Barthel et.al.	2205.04255	link
2022-05-08	Adversarial Learning of Hard Positives for Place Recognition	Wenxuan Fang et.al.	2205.03871	null
2022-05-10	AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching	Khanh Nguyen et.al.	2205.02849	link
2022-04-29	Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval	Shupeng Su et.al.	2204.13919	null
2022-04-29	Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval	Siyu Ren et.al.	2204.13913	link
2022-04-28	Spatio-Temporal Graph Localization Networks for Image-based Navigation	Takahiro Niwa et.al.	2204.13237	null
2022-04-27	The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection	Konstantinos A. Tsintotas et.al.	2204.12831	null
2022-04-25	SceneTrilogy: On Scene Sketches and its Relationship with Text and Photo	Pinaki Nath Chowdhury et.al.	2204.11964	null
2022-04-23	On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning	Muhammad Umer Anwaar et.al.	2204.11848	null
2022-04-24	Progressive Learning for Image Retrieval with Hybrid-Modality Queries	Yida Zhao et.al.	2204.11212	null
2022-04-23	Training and challenging models for text-guided fashion image retrieval	Eric Dodds et.al.	2204.11004	link
2022-04-18	Centralized Adversarial Learning for Robust Deep Hashing	Xunguang Wang et.al.	2204.10779	link
2022-04-22	Transferring ConvNet Features from Passive to Active Robot Self-Localization: The Use of Ego-Centric and World-Centric Views	Kanya Kurauchi et.al.	2204.10497	null
2022-04-21	Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval	Zhiqiang Yuan et.al.	2204.09868	link
2022-04-21	Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information	Zhiqiang Yuan et.al.	2204.09860	link
2022-04-20	Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations	Leila Pishdad et.al.	2204.09268	null
2022-04-19	Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing	Georgii Mikriukov et.al.	2204.08707	null
2022-04-18	Multiple-environment Self-adaptive Network for Aerial-view Geo-localization	Tingyu Wang et.al.	2204.08381	link
2022-04-15	Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder	Hanjing Ye et.al.	2204.07350	link
2022-04-14	Composite Code Sparse Autoencoders for first stage retrieval	Carlos Lassance et.al.	2204.07023	null
2022-04-13	Reuse your features: unifying retrieval and feature-metric alignment	Javier Morlana et.al.	2204.06292	link
2022-04-12	Probabilistic Compositional Embeddings for Multimodal Image Retrieval	Andrei Neculai et.al.	2204.05845	link
2022-04-12	Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval	Yu-Wei Zhan et.al.	2204.05666	null
2022-04-12	HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud	Zhixing Hou et.al.	2204.05481	null
2022-04-11	Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context	Lizhou Liao et.al.	2204.04932	link
2022-04-10	Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image	Yujiao Shi et.al.	2204.04752	link
2022-04-08	A Generic Image Retrieval Method for Date Estimation of Historical Document Collections	Adrià Molina et.al.	2204.04028	null
2022-04-08	SnapMode: An Intelligent and Distributed Large-Scale Fashion Image Retrieval Platform Based On Big Data and Deep Generative Adversarial Network Technologies	Narges Norouzi et.al.	2204.03998	null
2022-04-05	Leveraging Equivariant Features for Absolute Pose Regression	Mohamed Adel Musallam et.al.	2204.02163	null
2022-04-04	“This is my unicorn, Fluffy”: Personalizing frozen vision-language representations	Niv Cohen et.al.	2204.01694	link
2022-04-01	Bi-directional Loop Closure for Visual SLAM	Ihtisham Ali et.al.	2204.01524	null
2022-04-01	LASER: LAtent SpacE Rendering for 2D Visual Localization	Zhixiang Min et.al.	2204.00157	link
2022-03-31	Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning	Semih Orhan et.al.	2203.16945	null
2022-03-30	AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift	Burak Yildiz et.al.	2203.16291	link
2022-03-29	Long-term Visual Map Sparsification with Heterogeneous GNN	Ming-Fang Chang et.al.	2203.15182	null
2022-04-01	A Simulation Benchmark for Vision-based Autonomous Navigation	Lauri Suomela et.al.	2203.13048	link
2022-03-24	Is Geometry Enough for Matching in Visual Localization?	Qunjie Zhou et.al.	2203.12979	link
2022-03-21	MatchFormer: Interleaving Attention in Transformers for Feature Matching	Qing Wang et.al.	2203.09645	link
2022-03-10	ReF – Rotation Equivariant Features for Local Feature Matching	Abhishek Peri et.al.	2203.05206	null
2022-03-09	Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction	Matthieu Zins et.al.	2203.04613	null
2022-03-08	Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM	Pierre-Yves Lajoie et.al.	2203.04446	link
2022-03-07	ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization	Simon Maurer et.al.	2203.03610	link
2022-03-07	Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms	Qingqing Li et.al.	2203.03454	link
2022-03-01	SwitchHit: A Probabilistic, Complementarity-Based Switching System for Improved Visual Place Recognition in Changing Environments	Maria Waheed et.al.	2203.00591	null
2022-02-28	Deep Camera Pose Regression Using Pseudo-LiDAR	Ali Raza et.al.	2203.00080	null
2022-02-25	RELMOBNET: A Robust Two-Stage End-To-End Training Approach For MOBILENETV3 Based Relative Camera Pose Estimation	Praveen Kumar Rajendran et.al.	2202.12838	null
2022-02-24	Highly-Efficient Binary Neural Networks for Visual Place Recognition	Bruno Ferrarini et.al.	2202.12375	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146	link
2022-02-14	Tightly Coupled Learning Strategy for Weakly Supervised Hierarchical Place Recognition	Y. Shen et.al.	2202.06470	null
2022-02-11	Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition	Yingfeng Cai et.al.	2202.05738	null
2022-02-09	Object-Guided Day-Night Visual Localization in Urban Scenes	Assia Benbihi et.al.	2202.04445	null
2022-02-08	A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition	Nie Jiwei et.al.	2202.03677	null
2022-02-25	CFP-SLAM: A Real-time Visual SLAM Based on Coarse-to-Fine Probability in Dynamic Environments	Xinggang Hu et.al.	2202.01938	null
2022-02-03	Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization	Andrea Vallone et.al.	2202.01821	null
2022-02-02	Training Semantic Descriptors for Image-Based Localization	Ibrahim Cinaroglu et.al.	2202.01212	null
2022-01-31	Hydra: A Real-time Spatial Perception Engine for 3D Scene Graph Construction and Optimization	Nathan Hughes et.al.	2201.13360	null
2022-01-31	Rigidity Preserving Image Transformations and Equivariance in Perspective	Lucas Brynte et.al.	2201.13065	null
2022-01-25	Learning Semantics for Visual Place Recognition through Multi-Scale Attention	Valerio Paolicelli et.al.	2201.09701	link
2022-01-22	Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems	Xi Zheng et.al.	2201.09048	link
2022-01-15	A Critical Analysis of Image-based Camera Pose Estimation Techniques	Meng Xu et.al.	2201.05816	null
2022-01-14	SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions	Ali Samadzadeh et.al.	2201.05386	link
2021-12-23	NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning	Tony Ng et.al.	2112.12785	null
2021-12-16	CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data	Qi Yan et.al.	2112.09081	link
2021-12-05	RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather	Jialu Wang et.al.	2112.02469	null
2021-11-25	MegLoc: A Robust and Accurate Visual Localization Pipeline	Shuxue Peng et.al.	2111.13063	null
2021-10-08	Semantic Image Alignment for Vehicle Localization	Markus Herb et.al.	2110.04162	null
2021-10-05	Season-invariant GNSS-denied visual localization for UAVs	Jouko Kinnari et.al.	2110.01967	link
2021-09-30	Forming a sparse representation for visual place recognition using a neurorobotic approach	Sylvain Colomer et.al.	2109.14916	null
2021-09-22	Audio-Visual Grounding Referring Expression for Robotic Manipulation	Yefei Wang et.al.	2109.10571	null
2021-09-20	Efficient shape mapping through dense touch and vision	Sudharshan Suresh et.al.	2109.09884	link
2021-09-15	S3LAM: Structured Scene SLAM	Mathieu Gonzalez et.al.	2109.07339	null
2021-09-13	Monocular Camera Localization for Automated Vehicles Using Image Retrieval	Eunhyek Joa et.al.	2109.06296	null
2021-09-10	Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization	Sungho Yoon et.al.	2109.04753	link
2021-09-09	CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization	Ara Jafarzadeh et.al.	2109.04527	null
2021-09-09	Keeping an Eye on Things: Deep Learned Features for Long-Term Visual Localization	Mona Gridseth et.al.	2109.04041	link

Keypoint Detection

Publish Date	Title	Authors	PDF	Code
2025-07-23	CartoonAlive: Towards Expressive Live2D Modeling from Single Portraits	Chao He et.al.	2507.17327	null
2025-07-21	Toward a Real-Time Framework for Accurate Monocular 3D Human Pose Estimation with Geometric Priors	Mohamed Adjel et.al.	2507.16850	null
2025-07-17	DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model	Maulana Bisyir Azhari et.al.	2507.13145	null
2025-07-15	KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model	Jie Yang et.al.	2507.11102	null
2025-07-15	GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft	Weizhao Ma et.al.	2507.11077	null
2025-07-14	FPC-Net: Revisiting SuperPoint with Descriptor-Free Keypoint Detection via Feature Pyramids and Consistency-Based Implicit Matching	Ionuţ Grigore et.al.	2507.10770	null
2025-07-11	Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection	Subhajit Maity et.al.	2507.07994	null
2025-07-09	Reading a Ruler in the Wild	Yimu Pan et.al.	2507.07077	null
2025-07-09	MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning	Yifan Yang et.al.	2507.06662	null
2025-06-27	MatChA: Cross-Algorithm Matching with Feature Augmentation	Paula Carbó Cubero et.al.	2506.22336	null
2025-06-27	SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images	Naftaly Wambugu et.al.	2506.21945	null
2025-05-29	TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning	Ron Shapira Weber et.al.	2505.23475	link
2025-05-24	Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU	Yicheng Lin et.al.	2505.18652	link
2025-05-18	SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving	Muleilan Pei et.al.	2505.12246	null
2025-05-17	Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation	Niaz Ahmad et.al.	2505.12130	null
2025-05-16	Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation	Massimiliano Cassia et.al.	2505.11110	null
2025-06-19	RDD: Robust Feature Detector and Descriptor using Deformable Transformer	Gonglin Chen et.al.	2505.08013	null
2025-05-12	Enabling Privacy-Aware AI-Based Ergonomic Analysis	Sander De Coninck et.al.	2505.07306	null
2025-05-09	My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing	Jingrui He et.al.	2505.06436	null
2025-05-05	Unsupervised training of keypoint-agnostic descriptors for flexible retinal image registration	David Rivas-Villar et.al.	2505.02787	null
2025-05-05	Unsupervised Deep Learning-based Keypoint Localization Estimating Descriptor Matching Performance	David Rivas-Villar et.al.	2505.02779	null
2025-05-04	Focus What Matters: Matchability-Based Reweighting for Local Feature Matching	Dongyue Li et.al.	2505.02161	null
2025-05-04	Enhancing Lidar Point Cloud Sampling via Colorization and Super-Resolution of Lidar Imagery	Sier Ha et.al.	2505.02049	null
2025-04-29	Emotion Recognition in Contemporary Dance Performances Using Laban Movement Analysis	Muhammad Turab et.al.	2504.21154	null
2025-04-29	Learning a General Model: Folding Clothing with Topological Dynamics	Yiming Liu et.al.	2504.20720	null
2025-04-26	VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation	Niaz Ahmad et.al.	2504.19032	null
2025-04-24	EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy	Haodi Yao et.al.	2504.17280	null
2025-04-15	UKDM: Underwater keypoint detection and matching using underwater image enhancement techniques	Pedro Diaz-Garcia et.al.	2504.11063	null
2025-04-15	Acquisition of high-quality images for camera calibration in robotics applications via speech prompts	Timm Linder et.al.	2504.11031	null
2025-04-11	Stereophotoclinometry Revisited	Travis Driver et.al.	2504.08252	null
2025-03-31	SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection	Yannick Burkhardt et.al.	2504.00139	null
2025-03-29	Deep Visual Servoing of an Aerial Robot Using Keypoint Feature Extraction	Shayan Sepahvand et.al.	2503.23171	null
2025-03-25	Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines	Junle Liu et.al.	2503.19278	null
2025-03-05	Periodontal Bone Loss Analysis via Keypoint Detection With Heuristic Post-Processing	Ryan Banks et.al.	2503.13477	null
2025-03-16	Histogram Transporter: Learning Rotation-Equivariant Orientation Histograms for High-Precision Robotic Kitting	Jiadong Zhou et.al.	2503.12541	null
2025-04-12	Keypoint Detection and Description for Raw Bayer Images	Jiakai Lin et.al.	2503.08673	null
2025-03-10	REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding	Yan Tai et.al.	2503.07413	link
2025-03-11	DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection	Johan Edstedt et.al.	2503.07347	link
2025-03-07	Automatic determination of quasicrystalline patterns from microscopy images	Tano Kim Kender et.al.	2503.05472	link
2025-03-07	Spatial regularisation for improved accuracy and interpretability in keypoint-based registration	Benjamin Billot et.al.	2503.04499	link
2025-03-04	A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection	Junyi Wang et.al.	2503.02481	null
2025-03-01	Autonomous Dissection in Robotic Cholecystectomy	Ki-Hwan Oh et.al.	2503.00666	null
2025-02-28	CNSv2: Probabilistic Correspondence Encoded Neural Image Servo	Anzhe Chen et.al.	2503.00132	null
2025-02-27	Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets	Jisoo Lee et.al.	2502.19766	null
2025-02-23	Rewards-based image analysis in microscopy	Kamyar Barakati et.al.	2502.18522	null
2025-02-19	2.5D U-Net with Depth Reduction for 3D CryoET Object Identification	Yusuke Uchida et.al.	2502.13484	link
2025-01-30	Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images	Wei-Lun Chen et.al.	2501.18453	null
2025-01-30	Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models	Bhargav Ghanekar et.al.	2501.18361	null
2025-01-30	Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems	Liudi Yang et.al.	2501.18110	null
2025-01-21	Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction	Mengyuan Li et.al.	2501.11844	null
2025-01-20	MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching	Yepeng Liu et.al.	2501.11299	null
2025-01-19	Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation	Shibang Liu et.al.	2501.11069	null
2025-01-13	Empirical Comparison of Four Stereoscopic Depth Sensing Cameras for Robotics Applications	Lukas Rustler et.al.	2501.07421	null
2025-01-13	Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps	Saurabh Gupta et.al.	2501.07399	null
2024-12-24	GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network	Xianfeng Song et.al.	2412.18221	link
2024-12-21	A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection	Shahid Ansari et.al.	2412.16755	null
2024-12-19	Corn Ear Detection and Orientation Estimation Using Deep Learning	Nathan Sprague et.al.	2412.14954	null
2024-12-12	Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models	Faith Johnson et.al.	2412.09739	null
2024-12-09	An Efficient Scene Coordinate Encoding and Relocalization Method	Kuan Xu et.al.	2412.06488	link
2024-12-09	ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models	Bingchen Gong et.al.	2412.06292	null
2024-12-07	Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures	Muhammad Umar Farooq et.al.	2412.05487	null
2024-12-04	Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything	Yongkyu Lee et.al.	2412.03472	link
2024-12-02	MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection	Yonghao Dang et.al.	2412.01422	null
2024-11-23	OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs	Chen Xin et.al.	2411.15653	link
2024-11-19	IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose	Fei Ren et.al.	2411.12676	null
2024-11-04	Silver medal Solution for Image Matching Challenge 2024	Yian Wang et.al.	2411.01851	null
2024-11-04	KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension	Jie Yang et.al.	2411.01846	null
2024-10-31	From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots	Vasileios Tzouras et.al.	2410.23906	null
2024-10-04	Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation	Aman Anand et.al.	2410.14700	null
2024-11-27	Sim2real Cattle Joint Estimation in 3D point clouds	Mohammad Okour et.al.	2410.14419	null
2024-10-16	PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network	Asish Bera et.al.	2410.12742	null
2024-10-16	RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition	Asish Bera et.al.	2410.12718	null
2024-10-01	A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference	Yuan Li et.al.	2410.11848	null
2024-10-11	Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image	Marta Veganzones Rodriguez et.al.	2410.09155	null
2024-10-08	Unsupervised Model Diagnosis	Yinong Oliver Wang et.al.	2410.06243	null
2024-10-08	Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration	Xueyang Kang et.al.	2410.05729	link
2024-10-16	Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features	Chengkai Hou et.al.	2410.02237	null
2024-10-02	Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection	Hongru Yan et.al.	2410.01404	null
2024-09-30	OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection	Changsheng Lu et.al.	2409.19899	link
2024-10-07	SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation	Xin Li et.al.	2409.18082	null
2024-09-24	GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization	Gennady Sidorov et.al.	2409.16502	link
2024-09-20	Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators	Niloufar Amiri et.al.	2409.13668	null
2024-09-25	Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding	Rania Hossam et.al.	2409.08695	link
2024-09-06	D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection	Kentaro Hirahara et.al.	2409.04060	null
2024-10-01	Towards Practical Human Motion Prediction with LiDAR Point Clouds	Xiao Han et.al.	2408.08202	null
2024-07-31	Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods	Xusheng Luo et.al.	2408.00117	null
2024-07-26	SHIC: Shape-Image Correspondences with no Keypoint Supervision	Aleksandar Shtedritski et.al.	2407.18907	null
2024-07-25	LION: Linear Group RNN for 3D Object Detection in Point Clouds	Zhe Liu et.al.	2407.18232	link
2024-07-22	RADA: Robust and Accurate Feature Learning with Domain Adaptation	Jingtai He et.al.	2407.15791	null
2024-07-09	LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition	Teng Wang et.al.	2407.06730	null
2024-07-04	PFGS: High Fidelity Point Cloud Rendering via Feature Splatting	Jiaxu Wang et.al.	2407.03857	link
2024-07-03	A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes	Li Fang et.al.	2407.02830	link
2024-07-02	Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning	Chengchao Shen et.al.	2407.02014	link
2024-06-28	Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics	Chengrui Gao et.al.	2406.19672	null
2024-07-23	A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking	Lorenzo Shaikewitz et.al.	2406.16837	link
2024-06-03	Scale-Free Image Keypoints Using Differentiable Persistent Homology	Giovanni Barbarani et.al.	2406.01315	link
2024-06-23	W-Net: A Facial Feature-Guided Face Super-Resolution Network	Hao Liu et.al.	2406.00676	null
2024-05-25	Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration	Junjie Gao et.al.	2405.16085	null
2024-06-01	Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection – Towards Precise Fish Morphological Assessment in Aquaculture Breeding	Weizhen Liu et.al.	2405.12476	link
2024-05-14	TP3M: Transformer-based Pseudo 3D Image Matching with Reference	Liming Han et.al.	2405.08434	null
2024-05-15	Vector-Symbolic Architecture for Event-Based Optical Flow	Hongzhi You et.al.	2405.08300	null
2024-05-13	RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration	Congjia Chen et.al.	2405.07594	null
2024-05-08	Unsupervised Skin Feature Tracking with Deep Neural Networks	Jose Chang et.al.	2405.04943	null
2024-05-07	A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images	László Kopácsi et.al.	2405.04650	null
2024-04-30	A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images	Wang Zhang et.al.	2404.19311	null
2024-04-25	Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach	Tahmim Hossain et.al.	2404.14560	null
2024-04-19	SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers	Vandad Davoodnia et.al.	2404.12625	null
2024-04-17	Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images	Junbiao Pang et.al.	2404.10985	null
2024-03-28	Towards Long Term SLAM on Thermal Imagery	Colin Keil et.al.	2403.19885	link
2024-03-28	Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation	Xiao Lin et.al.	2403.19527	link
2024-03-27	RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation	Yang Tian et.al.	2403.18259	null
2024-03-18	FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events	Xiangyuan Wang et.al.	2403.11662	link
2024-03-05	Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion	Meng Zheng et.al.	2403.03217	null
2024-02-22	A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets	Chengzhang Yu et.al.	2402.14241	null
2024-02-25	A Feature Matching Method Based on Multi-Level Refinement Strategy	Shaojie Zhang et.al.	2402.13488	null
2024-03-05	3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data	Zhi-Yi Lin et.al.	2402.13172	null
2024-02-25	Region Feature Descriptor Adapted to High Affine Transformations	Shaojie Zhang et.al.	2402.09724	null
2024-01-29	Reconstructing Close Human Interactions from Multiple Views	Qing Shuai et.al.	2401.16173	link
2024-01-17	To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection	Luyi Han et.al.	2401.09336	link
2024-01-08	Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach	Huanyu Liu et.al.	2401.03742	link
2024-03-22	6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation	Li Xu et.al.	2401.00029	null
2023-12-27	Bezier-based Regression Feature Descriptor for Deformable Linear Objects	Fangqing Chen et.al.	2312.16502	null
2023-12-24	Residual Learning for Image Point Descriptors	Rashik Shrestha et.al.	2312.15471	null
2023-12-22	BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions	Elias Marks et.al.	2312.14706	null
2023-12-19	Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation	Jiaming Liu et.al.	2312.12480	null
2023-12-19	An effective image copy-move forgery detection using entropy image	Zhaowei Lu et.al.	2312.11793	link
2023-12-11	VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data	Jian Shi et.al.	2312.08871	link
2023-12-11	Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach	Travis Driver et.al.	2312.06865	link
2023-12-01	Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version)	Emma Cramer et.al.	2312.00592	link
2023-11-30	Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications	Sahar Almahfouz Nasser et.al.	2311.18281	null
2023-11-29	Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features	Thomas Wimmer et.al.	2311.18113	link
2023-11-28	Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features	Niladri Shekhar Dutt et.al.	2311.17024	link
2023-11-28	Riemannian Self-Attention Mechanism for SPD Networks	Rui Wang et.al.	2311.16738	null
2023-11-27	A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor	Jialin Liu et.al.	2311.15609	null
2023-11-21	Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers	Bo Sun et.al.	2311.12291	null
2023-11-20	CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement	Boni Hu et.al.	2311.11604	link
2023-11-17	Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration	Paul J. Claasen et.al.	2311.10361	link
2023-11-13	Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning	Tomáš Kunzo et.al.	2311.07398	null
2023-11-11	CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer	Haoyu Ma et.al.	2311.06443	link
2023-11-08	3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud	Jianchao Ci et.al.	2311.04699	null
2023-11-06	TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains	Alexander Naumann et.al.	2311.03124	link
2023-11-06	An invariant feature extraction for multi-modal images matching	Chenzhong Gao et.al.	2311.02842	null
2023-10-20	Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification	Mateus Roder et.al.	2310.13490	null
2023-10-12	UniPose: Detecting Any Keypoints	Jie Yang et.al.	2310.08530	link
2023-10-10	l-dyno: framework to learn consistent visual features using robot’s motion	Kartikeya Singh et.al.	2310.06249	link
2023-10-10	Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face	Hao Zhang et.al.	2310.05056	link
2023-10-13	H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation	Yanjie Ze et.al.	2310.01404	link
2023-10-04	Self-supervised Learning of Contextualized Local Visual Embeddings	Thalles Santos Silva et.al.	2310.00527	link
2023-10-22	ObVi-SLAM: Long-Term Object-Visual SLAM	Amanda Adkins et.al.	2309.15268	link
2023-09-19	LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation	Haizhou Zhang et.al.	2309.10436	link
2023-09-18	RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy	Mert Asim Karaoglu et.al.	2309.09563	null
2023-09-17	CryoAlign: feature-based method for global and local 3D alignment of EM density maps	Bintao He et.al.	2309.09217	null
2023-09-14	EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization	Minjung Kim et.al.	2309.07471	link
2023-09-09	Mirror-Aware Neural Humans	Daniel Ajisafe et.al.	2309.04750	link
2023-09-07	InstructDiffusion: A Generalist Modeling Interface for Vision Tasks	Zigang Geng et.al.	2309.03895	null
2023-09-04	SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras	Himanshu Pahadia et.al.	2309.01324	null
2023-09-12	Improving the matching of deformable objects by learning to detect keypoints	Felipe Cadar et.al.	2309.00434	link
2023-08-31	SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation	Jiaben Chen et.al.	2308.16876	null
2023-08-30	Learning Structure-from-Motion with Graph Attention Networks	Lucas Brynte et.al.	2308.15984	link
2023-08-29	A lightweight 3D dense facial landmark estimation model from position map data	Shubhajit Basak et.al.	2308.15170	link
2023-08-27	Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors	Francesco Pirotti et.al.	2308.14047	null
2023-08-24	VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition	Gengxuan Tian et.al.	2308.12870	null
2023-08-22	LDP-Feat: Image Features with Local Differential Privacy	Francesco Pittaluga et.al.	2308.11223	null
2023-08-20	Neural Interactive Keypoint Detection	Jie Yang et.al.	2308.10174	link
2023-08-19	ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment	Bingyang Zhou et.al.	2308.09987	null
2023-09-03	DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching	Johan Edstedt et.al.	2308.08479	link
2023-08-15	CoDeF: Content Deformation Fields for Temporally Consistent Video Processing	Hao Ouyang et.al.	2308.07926	link
2023-08-15	ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition	Wenyuan Xue et.al.	2308.07743	null
2023-08-14	DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport	Sk Aziz Ali et.al.	2308.07153	null
2023-08-14	2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds	Minhao Li et.al.	2308.05667	link
2023-08-02	Automated Hit-frame Detection for Badminton Match Analysis	Yu-Hang Chien et.al.	2307.16000	link
2023-07-25	Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception	Chuanyu Luo et.al.	2307.13300	null
2023-07-21	Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data	Sahar Almahfouz Nasser et.al.	2307.10698	link
2023-07-19	SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid	Zi Li et.al.	2307.09727	link
2023-07-01	SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation	Fabian Duffhauss et.al.	2307.00306	link
2023-06-27	Detector-Free Structure from Motion	Xingyi He et.al.	2306.15669	link
2023-06-26	CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild	Li Ding et.al.	2306.15073	null
2023-06-28	Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset	Ziqiao Weng et.al.	2306.07089	link
2023-06-07	Learning Probabilistic Coordinate Fields for Robust Correspondences	Weiyue Zhao et.al.	2306.04231	null
2023-06-03	LDEB – Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues	Amitabha Dey et.al.	2306.02193	null
2023-06-02	Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images	Marcela Mera-Trujillo et.al.	2306.01938	null
2023-06-01	A Probabilistic Relaxation of the Two-Stage Object Pose Estimation Paradigm	Onur Beker et.al.	2306.00892	null
2023-05-30	Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection	Supeng Wang et.al.	2305.18714	link
2023-05-23	Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence	Grace Luo et.al.	2305.14334	null
2023-05-15	Non-Separable Multi-Dimensional Network Flows for Visual Computing	Viktoria Ehm et.al.	2305.08628	null
2023-05-13	Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance	Xinyu Lin et.al.	2305.07943	link
2023-05-05	HD2Reg: Hierarchical Descriptors and Detectors for Point Cloud Registration	Canhui Tang et.al.	2305.03487	link
2023-04-17	Human Pose Estimation in Monocular Omnidirectional Top-View Images	Jingrui Yu et.al.	2304.08186	null
2023-04-14	CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression	Mubariz Zaffar et.al.	2304.07426	null
2023-04-12	SiLK – Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194	link
2023-04-06	From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection	Changsheng Lu et.al.	2304.03140	null
2023-03-29	NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud	Xiangyu Zhu et.al.	2303.16465	link
2023-03-24	PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View	Ze Shi et.al.	2303.14095	link
2023-03-23	Semantic Image Attack for Visual Model Diagnosis	Jinqi Luo et.al.	2303.13010	null
2023-03-22	Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation	Heng Yang et.al.	2303.12246	link
2023-03-21	RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network	Sangmin Yoo et.al.	2303.10770	null
2023-03-17	ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty	Vanessa Wirth et.al.	2303.10042	null
2023-03-15	Descriptor Distillation for Efficient Multi-Robot SLAM	Xiyue Guo et.al.	2303.08420	null
2023-03-15	From Local Binary Patterns to Pixel Difference Networks for Efficient Visual Representation Learning	Zhuo Su et.al.	2303.08414	null
2023-03-16	KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF Grasp Synthesis on RGB-D input	Yiye Chen et.al.	2303.05617	link
2023-03-07	External Camera-based Mobile Robot Pose Estimation for Collaborative Perception with Smart Edge Sensors	Simon Bultmann et.al.	2303.03797	null
2023-02-26	PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints Detection	Shenwei Xie et.al.	2302.13263	null
2023-02-24	Hybrid machine-learned homogenization: Bayesian data mining and convolutional neural networks	Julian Lißner et.al.	2302.12545	null
2023-02-21	Deep Reinforcement Learning Based on Local GNN for Goal-conditioned Deformable Object Rearranging	Yuhong Deng et.al.	2302.10446	null
2023-02-12	A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training	Jingnan Shi et.al.	2302.06019	null
2023-02-11	Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing	Zitong Yu et.al.	2302.05744	null
2023-02-09	MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection	Yuhe Ding et.al.	2302.04589	link
2023-02-03	Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation	Jie Yang et.al.	2302.01593	link
2023-02-03	Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization	Yingying Zhu et.al.	2302.01572	link
2023-01-21	Vision Aided Environment Semantics Extraction and Its Application in mmWave Beam Selection	Feiyang Wen et.al.	2301.08973	null
2023-01-18	OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models	Xingyi He et.al.	2301.07673	null
2023-01-12	Towards High Performance One-Stage Human Pose Estimation	Ling Li et.al.	2301.04842	null
2022-12-31	Rethinking Rotation Invariance with Point Cloud Registration	Jianhui Yu et.al.	2301.00149	null
2023-02-06	Fruit Ripeness Classification: a Survey	Matteo Rizzo et.al.	2212.14441	null
2022-12-28	NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action	Kuan-Chieh Wang et.al.	2212.13660	link
2022-12-24	HandsOff: Labeled Dataset Generation With No Additional Human Annotations	Austin Xu et.al.	2212.12645	null
2022-12-13	Learning to Detect Good Keypoints to Match Non-Rigid Objects in RGB Images	Welerson Melo et.al.	2212.09589	link
2022-12-15	Learning Markerless Robot-Depth Camera Calibration and End-Effector Pose Estimation	Bugra C. Sefercik et.al.	2212.07567	null
2023-02-01	DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization	Xiangyu Xu et.al.	2212.04575	null
2022-12-07	ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation	Yufei Xu et.al.	2212.04246	link
2022-12-15	Designing Feature Vector Representations: A case study from Chemistry	Signe Sidwall Thygesen et.al.	2212.03731	null
2022-12-09	DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model	Jeongjun Choi et.al.	2212.02796	link
2022-12-05	Images Speak in Images: A Generalist Painter for In-Context Visual Learning	Xinlong Wang et.al.	2212.02499	link
2022-12-06	R2FD2: Fast and Robust Matching of Multimodal Remote Sensing Image via Repeatable Feature Detector and Rotation-invariant Feature Descriptor	Bai Zhu et.al.	2212.02277	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069	link
2022-11-29	BALF: Simple and Efficient Blur Aware Local Feature Detector	Zhenjun Zhao et.al.	2211.14731	null
2022-11-21	Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching	Paul Roetzer et.al.	2211.11589	link
2022-11-07	Learning Feature Descriptors for Pre- and Intra-operative Point Cloud Matching for Laparoscopic Liver Registration	Zixin Yang et.al.	2211.03688	null
2022-10-31	Tree Detection and Diameter Estimation Based on Deep Learning	Vincent Grondin et.al.	2210.17424	link
2022-10-26	Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds	Zhiyuan Zhang et.al.	2210.14899	null
2022-10-23	Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders	Ömer Sümer et.al.	2210.12705	null
2022-10-21	Real-time Detection of 2D Tool Landmarks with Synthetic Training Data	Bram Vanherle et.al.	2210.11991	null
2022-10-09	Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning	Ali Safa et.al.	2210.04236	null
2022-10-04	Centroid Distance Keypoint Detector for Colored Point Clouds	Hanzhe Teng et.al.	2210.01298	link
2022-09-28	Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences	Jun-Jee Chao et.al.	2209.14419	null
2022-09-28	USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation	Zhengrong Xue et.al.	2209.13864	null
2022-10-16	Suture Thread Spline Reconstruction from Endoscopic Images for Robotic Surgery with Reliability-driven Keypoint Detection	Neelay Joglekar et.al.	2209.13657	link
2022-09-27	Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors	Hao Dong et.al.	2209.13586	link
2022-09-26	Performance Evaluation of 3D Keypoint Detectors and Descriptors on Coloured Point Clouds in Subsea Environments	Kyungmin Jung et.al.	2209.12881	null
2022-10-07	Long-Lived Accurate Keypoints in Event Streams	Philippe Chiberre et.al.	2209.10385	null
2022-09-20	Integrative Feature and Cost Aggregation with Transformers for Dense Correspondence	Sunghwan Hong et.al.	2209.08742	null
2022-09-15	Online Marker-free Extrinsic Camera Calibration using Person Keypoint Detections	Bastian Pätzold et.al.	2209.07393	link
2022-09-07	Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip	Yang Li et.al.	2209.03440	null
2022-08-27	Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes	Ali Safa et.al.	2208.12997	null
2022-08-24	Self-Supervised Endoscopic Image Key-Points Matching	Manel Farhat et.al.	2208.11424	link
2022-08-19	Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture	Muhammad Muzammel et.al.	2208.08224	null
2022-08-08	MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware Ambidextrous Bin Picking via Physics-based Metaverse Synthesis	Maximilian Gilles et.al.	2208.03963	null
2022-08-07	CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization	Yujiao Shi et.al.	2208.03660	null
2022-07-29	Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation	Qihao Liu et.al.	2208.00090	null
2022-07-25	Translating a Visual LEGO Manual to a Machine-Executable Plan	Ruocheng Wang et.al.	2207.12572	null
2022-07-21	Multi-modal Retinal Image Registration Using a Keypoint-Based Vessel Structure Aligning Network	Aline Sindel et.al.	2207.10506	null
2022-07-15	Human keypoint detection for close proximity human-robot interaction	Jan Docekal et.al.	2207.07742	null
2022-07-15	Adversarial Focal Loss: Asking Your Discriminator for Hard Examples	Chen Liu et.al.	2207.07739	null
2022-07-13	Rapid Person Re-Identification via Sub-space Consistency Regularization	Qingze Yin et.al.	2207.05933	null
2022-07-07	RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments	Qihao Peng et.al.	2207.03539	null
2022-08-15	Semi-supervised Human Pose Estimation in Art-historical Images	Matthias Springstein et.al.	2207.02976	link
2022-07-01	Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling	Jiamin Liang et.al.	2207.00474	null
2022-06-24	Motion Estimation for Large Displacements and Deformations	Qiao Chen et.al.	2206.12464	null
2022-06-24	Deep embedded clustering algorithm for clustering PACS repositories	Teo Manojlović et.al.	2206.12417	null
2022-06-21	KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences	Xuanhan Wang et.al.	2206.10090	link
2022-06-20	Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval	Guile Wu et.al.	2206.09806	null
2022-06-15	A Unified Sequence Interface for Vision Tasks	Ting Chen et.al.	2206.07669	link
2022-06-09	Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields	Mingtong Zhang et.al.	2206.04669	null
2022-06-03	SNAKE: Shape-aware Neural 3D Keypoint Field	Chengliang Zhong et.al.	2206.01724	link
2022-05-17	MulT: An End-to-End Multitask Learning Transformer	Deblina Bhattacharjee et.al.	2205.08303	null
2022-05-10	ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions In-the-Wild	Chirag Raman et.al.	2205.05177	link
2022-04-28	Polarimetric imaging for the detection of synthetic models of SARS-CoV-2: a proof of concept	Emilio Gomez-Gonzalez et.al.	2204.14050	null
2022-05-02	GRIT: General Robust Image Task Benchmark	Tanmay Gupta et.al.	2204.13653	link
2022-05-24	ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation	Yufei Xu et.al.	2204.12484	link
2022-04-26	Unified GCNs: Towards Connecting GCNs with CNNs	Ziyan Zhang et.al.	2204.12300	null
2022-04-19	Self-Supervised Equivariant Learning for Oriented Keypoint Detection	Jongmin Lee et.al.	2204.08613	link
2022-04-17	The Z-axis, X-axis, Weight and Disambiguation Methods for Constructing Local Reference Frame in 3D Registration: An Evaluation	Bao Zhao et.al.	2204.08024	null
2022-04-15	2D Human Pose Estimation: A Survey	Haoming Chen et.al.	2204.07370	null
2022-04-11	Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification	Haojie Liu et.al.	2204.04842	null
2022-04-07	Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification	Yanan Wang et.al.	2204.02611	link
2022-04-02	SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning	Nilaksh Das et.al.	2204.00734	link
2022-04-01	MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration	Chenzhong Gao et.al.	2204.00260	null
2022-03-29	Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning	David Howard et.al.	2203.15172	null
2022-03-28	REGTR: End-to-end Point Cloud Correspondences with Transformers	Zi Jian Yew et.al.	2203.14517	link
2022-03-27	UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection	Ye Liu et.al.	2203.12745	link
2022-03-21	MatchFormer: Interleaving Attention in Transformers for Feature Matching	Qing Wang et.al.	2203.09645	link
2022-03-16	PosePipe: Open-Source Human Pose Estimation Pipeline for Clinical Research	R. James Cotton et.al.	2203.08792	link
2022-03-11	DRTAM: Dual Rank-1 Tensor Attention Module	Hanxing Chi et.al.	2203.05893	null
2022-03-07	Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation	Meng Tian et.al.	2203.03498	null
2022-02-10	Motion-Aware Transformer For Occluded Person Re-identification	Mi Zhou et.al.	2202.04243	null
2022-02-03	Sim2Real Object-Centric Keypoint Detection and Description	Chengliang Zhong et.al.	2202.00448	null
2022-01-16	Cross-Centroid Ripple Pattern for Facial Expression Recognition	Monu Verma et.al.	2201.05958	null
2022-01-14	Reproducing BowNet: Learning Representations by Predicting Bags of Visual Words	Harry Nguyen et.al.	2201.03556	link
2022-01-10	TFS Recognition: Investigating MPH]{Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials	Jinnavat Sanalohit et.al.	2201.03170	null
2022-01-06	A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration	Aline Sindel et.al.	2201.02242	null
2021-12-28	Skin feature point tracking using deep feature encodings	Jose Ramon Chang et.al.	2112.14159	null
2021-12-23	Data-efficient learning for 3D mirror symmetry detection	Yancong Lin et.al.	2112.12579	null
2021-12-22	Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations – combining input rotations and a kinematic model	Michael Zwölfer et.al.	2112.12193	null
2021-12-22	Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction	Henrique Siqueira et.al.	2112.12002	link
2021-12-19	Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection	Renjie Li et.al.	2112.10275	null
2021-12-19	GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor	Jean-Baptiste Carluer et.al.	2112.10258	link
2021-12-16	Masked Feature Prediction for Self-Supervised Visual Pre-Training	Chen Wei et.al.	2112.09133	link
2021-12-13	DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points	Zhengfei Kuang et.al.	2112.06910	null
2021-12-12	Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species	Changsheng Lu et.al.	2112.06183	link
2021-12-13	Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings	Mel Vecerik et.al.	2112.04910	null
2021-12-06	ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction	Xiaoming Zhao et.al.	2112.02906	link
2021-11-25	Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association	Sen Yang et.al.	2111.12892	link
2021-11-08	Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object Images	Jianfei Guo et.al.	2111.04237	null
2021-11-04	Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image	Feng Liu et.al.	2111.03098	null
2021-11-01	Learning Event-based Spatio-Temporal Feature Descriptors via Local Synaptic Plasticity: A Biologically-realistic Perspective of Computer Vision	Ali Safa et.al.	2111.00791	null
2021-10-30	Geometry-Aware Hierarchical Bayesian Learning on Manifolds	Yonghui Fan et.al.	2111.00184	null
2021-10-26	CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration	Hao Yu et.al.	2110.14076	link
2021-10-23	HWTool: Fully Automatic Mapping of an Extensible C++ Image Processing Language to Hardware	James Hegarty et.al.	2110.12106	null
2021-10-18	Keypoint-Based Bimanual Shaping of Deformable Linear Objects under Environmental Constraints using Hierarchical Action Planning	Shengzeng Huo et.al.	2110.08962	null
2021-10-11	High-order Tensor Pooling with Attention for Action Recognition	Piotr Koniusz et.al.	2110.05216	null
2021-10-10	Digging Into Self-Supervised Learning of Feature Descriptors	Iaroslav Melekhov et.al.	2110.04773	null
2021-10-04	BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion	Zhaoqun Li et.al.	2110.01179	link
2021-10-01	Machine learning aided noise filtration and signal classification for CREDO experiment	Łukasz Bibrzycki et.al.	2110.00297	null
2021-09-28	PDC-Net+: Enhanced Probabilistic Dense Correspondence Network	Prune Truong et.al.	2109.13912	link
2021-09-27	HarrisZ $^+$ : Harris Corner Selection for Next-Gen Image Matching Pipelines	Fabio Bellavia et.al.	2109.12925	null
2021-09-24	Catadioptric Stereo on a Smartphone	Kristijan Bartol et.al.	2109.11872	null
2021-09-20	Semi-supervised Dense Keypointsusing Unlabeled Multiview Images	Zhixuan Yu et.al.	2109.09299	null
2021-08-31	A Novel Dataset for Keypoint Detection of quadruped Animals from Images	Prianka Banik et.al.	2108.13958	link
2021-08-27	A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images	Xiaoteng Zhou et.al.	2108.12151	null

Image Matching

Publish Date	Title	Authors	PDF	Code
2025-07-22	A Single-step Accurate Fingerprint Registration Method Based on Local Feature Matching	Yuwei Jia et.al.	2507.16201	null
2025-07-09	Dual-Granularity Cross-Modal Identity Association for Weakly-Supervised Text-to-Person Image Matching	Yafei Zhang et.al.	2507.06744	null
2025-07-05	From Query to Explanation: Uni-RAG for Multi-Modal Retrieval-Augmented Learning in STEM	Xinyi Wu et.al.	2507.03868	null
2025-07-02	What does really matter in image goal navigation?	Gianluca Monaci et.al.	2507.01667	null
2025-06-30	Efficient and Accurate Image Provenance Analysis: A Scalable Pipeline for Large-scale Images	Jiewei Lai et.al.	2506.23707	null
2025-06-29	Dynamic Contrastive Learning for Hierarchical Retrieval: A Case Study of Distance-Aware Cross-View Geo-Localization	Suofei Zhang et.al.	2506.23077	null
2025-06-27	MatChA: Cross-Algorithm Matching with Feature Augmentation	Paula Carbó Cubero et.al.	2506.22336	null
2025-07-22	Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs	Shaojie Zhang et.al.	2506.22139	null
2025-06-27	ZeroReg3D: A Zero-shot Registration Pipeline for 3D Consecutive Histopathology Image Reconstruction	Juming Xiong et.al.	2506.21923	null
2025-06-25	Fast entropy-regularized SDP relaxations for permutation synchronization	Michael Lindsey et.al.	2506.20191	null
2025-06-18	ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections	Ziling Huang et.al.	2506.15180	null
2025-06-16	EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition	Bingxi Liu et.al.	2506.13133	null
2025-06-12	RealKeyMorph: Keypoints in Real-world Coordinates for Resolution-agnostic Image Registration	Mina C. Moghadam et.al.	2506.10344	null
2025-06-11	Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints	Xiangkai Zhang et.al.	2506.09748	null
2025-06-11	ScaleLSD: Scalable Deep Line Segment Detection Streamlined	Zeran Ke et.al.	2506.09369	link
2025-05-21	Anti-interrupted sampling repeater jamming via linear canonical Wigner distribution lightweight LFM detection	Jia-Mian Li et.al.	2506.06302	null
2025-06-05	Vanishing arcs for isolated plane curve singularities	Hanwool Bae et.al.	2506.04917	null
2025-06-05	Deep Learning Reforms Image Matching: A Survey and Outlook	Shihua Zhang et.al.	2506.04619	null
2025-06-20	SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping	Mingxu Zhang et.al.	2505.24305	null
2025-06-05	Universal Domain Adaptation for Semantic Segmentation	Seun-An Choe et.al.	2505.22458	null
2025-05-23	To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models	Simone Gaisbauer et.al.	2505.17973	link
2025-05-16	Multi-view dense image matching with similarity learning and geometry priors	Mohamed Ali Chebbi et.al.	2505.11264	null
2025-05-12	Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection	Yuqi Cheng et.al.	2505.07375	link
2025-05-04	OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery	Chongsheng Zhang et.al.	2505.03836	link
2025-05-06	LiftFeat: 3D Geometry-Aware Local Feature Matching	Yepeng Liu et.al.	2505.03422	link
2025-05-04	Focus What Matters: Matchability-Based Reweighting for Local Feature Matching	Dongyue Li et.al.	2505.02161	null
2025-05-15	Mitigating Modality Bias in Multi-modal Entity Alignment from a Causal Perspective	Taoyu Su et.al.	2504.19458	link
2025-04-28	Dynamic Arthroscopic Navigation System for Anterior Cruciate Ligament Reconstruction Based on Multi-level Memory Architecture	Shuo Wang et.al.	2504.19398	null
2025-04-23	Road Similarity-Based BEV-Satellite Image Matching for UGV Localization	Zhenping Sun et.al.	2504.16346	null
2025-04-18	Outlier-Robust Multi-Model Fitting on Quantum Annealers	Saurabh Pandey et.al.	2504.13836	null
2025-04-11	Geometric Consistency Refinement for Single Image Novel View Synthesis via Test-Time Adaptation of Diffusion Models	Josef Bengtson et.al.	2504.08348	null
2025-04-10	Image registration of 2D optical thin sections in a 3D porous medium: Application to a Berea sandstone digital rock image	Jaehong Chung et.al.	2504.06604	link
2025-04-22	To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition	Davide Sferrazza et.al.	2504.06116	link
2025-04-10	Learning Affine Correspondences by Integrating Geometric Constraints	Pengju Sun et.al.	2504.04834	link
2025-04-01	Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data	Yiqun Duan et.al.	2504.00812	null
2025-03-31	CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching	Zizhuo Li et.al.	2503.23925	null
2025-03-28	Pairwise Matching of Intermediate Representations for Fine-grained Explainability	Lauren Shrack et.al.	2503.22881	link
2025-03-26	Multimodal Image Matching based on Frequency-domain Information of Local Energy Response	Meng Yang et.al.	2503.20827	null
2025-03-22	Normalized Matching Transformer	Abtin Pourhadi et.al.	2503.17715	link
2025-03-20	Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors	Tian Yi Lim et.al.	2503.16275	null
2025-03-20	MapGlue: Multimodal Remote Sensing Image Matching	Peihao Wu et.al.	2503.16185	link
2025-03-19	PAPI-Reg: Patch-to-Pixel Solution for Efficient Cross-Modal Registration between LiDAR Point Cloud and Camera Image	Yuanchao Yue et.al.	2503.15285	null
2025-04-07	Less Biased Noise Scale Estimation for Threshold-Robust RANSAC	Johan Edstedt et.al.	2503.13433	null
2025-03-17	SatDepth: A Novel Dataset for Satellite Image Matching	Rahul Deshmukh et.al.	2503.12706	link
2025-03-14	Refining Image Edge Detection via Linear Canonical Riesz Transforms	Shuhui Yang et.al.	2503.11148	null
2025-03-13	Speedy MASt3R	Jingxing Li et.al.	2503.10017	null
2025-03-11	Keypoint Detection and Description for Raw Bayer Images	Jiakai Lin et.al.	2503.08673	null
2025-03-06	Learning 3D Medical Image Models From Brain Functional Connectivity Network Supervision For Mental Disorder Diagnosis	Xingcan Hu et.al.	2503.04205	null
2025-03-07	Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration	Qianliang Wu et.al.	2503.04127	null
2025-03-05	JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba	Xiaoyong Lu et.al.	2503.03437	null
2025-02-28	CNSv2: Probabilistic Correspondence Encoded Neural Image Servo	Anzhe Chen et.al.	2503.00132	null
2025-02-27	A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization	Yejun Zhang et.al.	2502.20036	link
2025-02-27	RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges	Thibaut Loiseau et.al.	2502.19955	null
2025-02-26	BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure	Haoxin Cai et.al.	2502.19242	link
2025-02-25	PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching	Han Nie et.al.	2502.18104	link
2025-02-25	Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking	Xin Tong et.al.	2502.17766	null
2025-03-04	Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model	Yaxuan Huang et.al.	2502.16779	null
2025-02-16	FeaKM: Robust Collaborative Perception under Noisy Pose Conditions	Jiuwu Hao et.al.	2502.11003	link
2025-02-24	Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation	Emanuele Mule et.al.	2502.06288	link
2025-02-04	Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications	William O’Donnell et.al.	2502.02624	null
2025-02-01	MambaGlue: Fast and Robust Local Feature Matching With Mamba	Kihwan Ryoo et.al.	2502.00462	link
2025-01-24	Dense-SfM: Structure from Motion with Dense Consistent Matching	JongMin Lee et.al.	2501.14277	null
2025-01-20	MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching	Yepeng Liu et.al.	2501.11299	null
2025-01-13	MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training	Xingyi He et.al.	2501.07556	null
2025-01-13	Matching Free Depth Recovery from Structured Light	Zhuohang Yu et.al.	2501.07113	null
2025-01-02	Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views	Yulun Wu et.al.	2501.01196	null
2024-12-31	Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights	Bharath Kumar Agnur et.al.	2412.20210	null
2024-12-27	MINIMA: Modality Invariant Image Matching	Xingyu Jiang et.al.	2412.19412	link
2024-12-24	GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network	Xianfeng Song et.al.	2412.18221	link
2024-12-17	Bringing Multimodality to Amazon Visual Search System	Xinliang Zhu et.al.	2412.13364	null
2024-12-04	Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis	Siyoon Jin et.al.	2412.03150	null
2024-11-20	DT-LSD: Deformable Transformer-based Line Segment Detection	Sebastian Janampa et.al.	2411.13005	link
2024-11-15	Image Matching Filtering and Refinement by Planes and Beyond	Fabio Bellavia et.al.	2411.09484	link
2024-11-11	XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration	Ismail Can Yagmur et.al.	2411.07430	link
2024-11-07	The Impact of Semi-Supervised Learning on Line Segment Detection	Johanna Engman et.al.	2411.04596	link
2024-11-04	Silver medal Solution for Image Matching Challenge 2024	Yian Wang et.al.	2411.01851	null
2024-10-30	Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants	Azadeh Sharafi et.al.	2410.23329	null
2024-11-05	RelationBooth: Towards Relation-Aware Customized Object Generation	Qingyu Shi et.al.	2410.23280	null
2024-10-31	ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses	Junjie Ni et.al.	2410.22733	null
2024-10-30	LoFLAT: Local Feature Matching using Focused Linear Attention Transformer	Naijian Cao et.al.	2410.22710	null
2024-10-26	Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification	Yue Su et.al.	2410.20097	null
2024-10-01	A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference	Yuan Li et.al.	2410.11848	null
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505	null
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-09-27	Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras	Yipeng Lu et.al.	2409.18673	null
2024-09-25	Game4Loc: A UAV Geo-Localization Benchmark from Game Data	Yuxiang Ji et.al.	2409.16925	link
2024-09-24	Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge	Marek Wodzinski et.al.	2409.15931	null
2024-09-10	Weakly-supervised Camera Localization by Ground-to-satellite Image Registration	Yujiao Shi et.al.	2409.06471	link
2024-09-05	Enabling Practical and Privacy-Preserving Image Processing	Chao Wang et.al.	2409.03568	null
2024-09-20	A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering	Shuang Song et.al.	2409.03032	link
2024-08-29	Super-Resolution works for coastal simulations	Zhi-Song Liu et.al.	2408.16553	null
2024-09-15	Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks	Sierra Bonilla et.al.	2408.16445	link
2024-08-26	Affine steerers for structured keypoint description	Georg Bökman et.al.	2408.14186	link
2024-08-25	TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers	Chuanrui Zhang et.al.	2408.13770	null
2024-09-11	Coarse-to-fine Alignment Makes Better Speech-image Retrieval	Lifeng Zhou et.al.	2408.13119	null
2024-08-19	BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval	Zhenyu Lu et.al.	2408.10383	null
2024-08-14	RSD-DOG : A New Image Descriptor based on Second Order Derivatives	Darshan Venkatrayappa et.al.	2408.07687	null
2024-08-09	One Shot is Enough for Sequential Infrared Small Target Segmentation	Bingbing Dan et.al.	2408.04823	link
2024-08-07	PRISM: PRogressive dependency maxImization for Scale-invariant image Matching	Xudong Cai et.al.	2408.03598	null
2024-08-05	ConDL: Detector-Free Dense Image Matching	Monika Kwiatkowski et.al.	2408.02766	null
2024-08-04	Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image	Xinlin Ren et.al.	2408.02079	link
2024-07-29	Image-text matching for large-scale book collections	Artemis Llabrés et.al.	2407.19812	link
2024-07-26	PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis	Sohyeong Kim et.al.	2407.18695	null
2024-07-22	RADA: Robust and Accurate Feature Learning with Domain Adaptation	Jingtai He et.al.	2407.15791	null
2024-07-17	GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection	Jingwen Yu et.al.	2407.11736	link
2024-07-16	REMM:Rotation-Equivariant Framework for End-to-End Multimodal Image Matching	Han Nie et.al.	2407.11637	link
2024-07-16	A Self-Correcting Strategy of the Digital Volume Correlation Displacement Field Based on Image Matching: Application to Poor Speckles Quality and Complex-Large Deformation	Chengsheng Li et.al.	2407.11287	null
2024-07-14	Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching	Xiaoyong Lu et.al.	2407.07789	null
2024-07-10	Mutual Information calculation on different appearances	Jiecheng Liao et.al.	2407.07410	null
2024-07-15	SfM on-the-fly: Get better 3D from What You Capture	Zongqian Zhan et.al.	2407.03939	null
2024-07-03	IMC 2024 Methods & Solutions Review	Shyam Gupta et.al.	2407.03172	null
2024-06-21	High Resolution Surface Reconstruction of Cultural Heritage Objects Using Shape from Polarization Method	F. S. Mortazavi et.al.	2406.15121	null
2024-06-16	Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models	Yikai Zhang et.al.	2406.10902	link
2024-06-14	Grounding Image Matching in 3D with MASt3R	Vincent Leroy et.al.	2406.09756	link
2024-06-05	A Self-Supervised Denoising Strategy for Underwater Acoustic Camera Imageries	Xiaoteng Zhou et.al.	2406.02914	null
2024-05-22	Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching	Hongkai Chen et.al.	2405.13874	null
2024-05-21	OmniGlue: Generalizable Feature Matching with Foundation Model Guidance	Hanwen Jiang et.al.	2405.12979	link
2024-07-09	Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation	Rezkellah Noureddine Khiati et.al.	2405.08556	link
2024-05-14	TP3M: Transformer-based Pseudo 3D Image Matching with Reference	Liming Han et.al.	2405.08434	null
2024-05-13	Authentic Hand Avatar from a Phone Scan via Universal Hand Model	Gyeongsik Moon et.al.	2405.07933	null
2024-04-30	A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images	Wang Zhang et.al.	2404.19311	null
2024-04-30	XFeat: Accelerated Features for Lightweight Image Matching	Guilherme Potje et.al.	2404.19174	null
2024-06-10	MinBackProp – Backpropagating through Minimal Solvers	Diana Sungatullina et.al.	2404.17993	link
2024-04-25	Transformer-Based Local Feature Matching for Multimodal Image Registration	Remi Delaunay et.al.	2404.16802	null
2024-04-23	FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction	Hang Hua et.al.	2404.14715	null
2024-04-22	Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer	Eric Brachmann et.al.	2404.14351	null
2024-04-17	A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching	Francesco Pro et.al.	2404.11302	link
2024-04-16	Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction	John Francis et.al.	2404.10626	null
2024-04-15	XoFTR: Cross-modal Feature Matching Transformer	Önder Tuzcuoğlu et.al.	2404.09692	link
2024-04-13	DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector	Johan Edstedt et.al.	2404.08928	link
2024-04-09	Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences	Axel Barroso-Laguna et.al.	2404.06337	link
2024-04-01	Marrying NeRF with Feature Matching for One-step Pose Estimation	Ronghan Chen et.al.	2404.00891	null
2024-04-01	3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching	Yibin Ye et.al.	2404.00838	null
2024-03-31	On the Estimation of Image-matching Uncertainty in Visual Place Recognition	Mubariz Zaffar et.al.	2404.00546	null
2024-03-30	Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation	Yuan Wang et.al.	2404.00262	null
2024-03-26	Staircase Localization for Autonomous Exploration in Urban Environments	Jinrae Kim et.al.	2403.17330	null
2024-03-23	MatchSeg: Towards Better Segmentation via Reference Image Matching	Ruiqiang Xiao et.al.	2403.15901	link
2024-03-20	Unifying Local and Global Multimodal Features for Place Recognition in Aliased and Low-Texture Environments	Alberto García-Hernández et.al.	2403.13395	link
2024-03-19	HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free Matching	Ying Chen et.al.	2403.12543	null
2024-03-16	Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval	Shunsuke Tsubaki et.al.	2403.10756	null
2024-03-16	Vector search with small radiuses	Gergely Szilvasy et.al.	2403.10746	null
2024-03-15	Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline	Fangming Yuan et.al.	2403.10283	null
2024-03-15	Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning	Meixuan Li et.al.	2403.10252	null
2024-03-14	Virtual birefringence imaging and histological staining of amyloid deposits in label-free tissue using autofluorescence microscopy and deep learning	Xilin Yang et.al.	2403.09100	null
2024-03-18	Matching Non-Identical Objects	Yusuke Marumo et.al.	2403.08227	null
2024-03-11	Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed	Yifan Wang et.al.	2403.04765	null
2024-03-07	Scene Depth Estimation from Traditional Oriental Landscape Paintings	Sungho Kang et.al.	2403.03408	null
2024-02-21	Visual Style Prompting with Swapping Self-Attention	Jaeseok Jeong et.al.	2402.12974	link
2024-02-16	GIM: Learning Generalizable Image Matcher From Internet Videos	Xuelun Shen et.al.	2402.11095	link
2024-02-13	Are Semi-Dense Detector-Free Methods Good at Matching Local Features?	Matthieu Vilain et.al.	2402.08671	null
2024-02-13	Learning to Produce Semi-dense Correspondences for Visual Localization	Khang Truong Giang et.al.	2402.08359	link
2024-01-31	Improved Scene Landmark Detection for Camera Localization	Tien Do et.al.	2401.18083	link
2024-03-11	Local Feature Matching Using Deep Learning: A Survey	Shibiao Xu et.al.	2401.17592	link
2024-01-24	Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry	Qi Cai et.al.	2401.13357	null
2024-01-19	SCENES: Subpixel Correspondence Estimation With Epipolar Supervision	Dominik A. Kloepfer et.al.	2401.10886	null
2024-01-18	Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation	Songhe Deng et.al.	2401.09883	link
2024-01-26	RomniStereo: Recurrent Omnidirectional Stereo Matching	Hualie Jiang et.al.	2401.04345	link
2024-01-05	CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs	Daoan Zhang et.al.	2401.02582	null
2024-01-03	Local Adaptive Clustering Based Image Matching for Automatic Visual Identification	Zhizhen Wang et.al.	2401.01720	null
2024-01-03	A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization	Shishen Li et.al.	2401.01574	null
2023-12-23	BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation	Tavis Shore et.al.	2312.15363	link
2023-12-22	Harnessing Diffusion Models for Visual Perception with Meta Prompts	Qiang Wan et.al.	2312.14733	link
2024-01-05	MatchDet: A Collaborative Framework for Image Matching and Object Detection	Jinxiang Lai et.al.	2312.10983	null
2023-12-07	Visual Geometry Grounded Deep Structure From Motion	Jianyuan Wang et.al.	2312.04563	null
2023-12-04	Steerers: A framework for rotation equivariant keypoint descriptors	Georg Bökman et.al.	2312.02152	link
2023-11-30	DSeg: Direct Line Segments Detection	Berger Cyrille et.al.	2311.18344	null
2023-11-30	Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications	Sahar Almahfouz Nasser et.al.	2311.18281	null
2023-11-29	LGFCTR: Local and Global Feature Convolutional Transformer for Image Matching	Wenhao Zhong et.al.	2311.17571	link
2023-11-08	Zero-shot Translation of Attention Patterns in VQA Models to Natural Language	Leonard Salewski et.al.	2311.05043	link
2023-11-06	An invariant feature extraction for multi-modal images matching	Chenzhong Gao et.al.	2311.02842	null
2023-10-23	RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments	Jinyu Li et.al.	2310.15072	link
2023-10-23	Player Re-Identification Using Body Part Appearences	Mahesh Bhosale et.al.	2310.14469	null
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605	null
2023-11-14	RGM: A Robust Generalist Matching Model	Songyan Zhang et.al.	2310.11755	link
2023-10-07	UFD-PRiME: Unsupervised Joint Learning of Optical Flow and Stereo Depth through Pixel-Level Rigid Motion Estimation	Shuai Yuan et.al.	2310.04712	null
2023-10-02	Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images	Georg Bökman et.al.	2310.01092	null
2023-09-29	Segment Anything Model is a Good Teacher for Local Feature Learning	Jingqian Wu et.al.	2309.16992	link
2023-09-27	KDD-LOAM: Jointly Learned Keypoint Detector and Descriptors Assisted LiDAR Odometry and Mapping	Renlang Huang et.al.	2309.15394	null
2023-10-13	A Critical Analysis of Internal Reliability for Uncertainty Quantification of Dense Image Matching in Multi-view Stereo	Debao Huang et.al.	2309.09379	null
2023-09-11	Towards Content-based Pixel Retrieval in Revisited Oxford and Paris	Guoyuan An et.al.	2309.05438	link
2023-09-09	Neural Semantic Surface Maps	Luca Morreale et.al.	2309.04836	null
2023-09-05	Doppelgangers: Learning to Disambiguate Images of Similar Structures	Ruojin Cai et.al.	2309.02420	link
2023-08-14	Occ $^2$ Net: Robust Image Matching Based on 3D Occupancy Estimation for Occluded Regions	Miao Fan et.al.	2308.16160	null
2023-08-29	TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching	Yun Liao et.al.	2308.15144	null
2023-08-27	LDL: Line Distance Functions for Panoramic Localization	Junho Kim et.al.	2308.13989	link
2023-08-22	Scene-Aware Feature Matching	Xiaoyong Lu et.al.	2308.09949	null
2023-09-03	DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching	Johan Edstedt et.al.	2308.08479	link
2023-08-19	Global Features are All You Need for Image Retrieval and Reranking	Shihao Shao et.al.	2308.06954	link
2023-08-02	ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation	Bo Zhang et.al.	2308.00400	link
2023-07-28	Cross-Modal Concept Learning and Inference for Vision-Language Models	Yi Zhang et.al.	2307.15460	null
2023-07-22	CryptoMask : Privacy-preserving Face Recognition	Jianli Bai et.al.	2307.12010	null
2023-07-22	A Stronger Stitching Algorithm for Fisheye Images based on Deblurring and Registration	Jing Hao et.al.	2307.11997	null
2023-07-21	Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data	Sahar Almahfouz Nasser et.al.	2307.10698	link
2023-08-08	Balancing Privacy and Progress in Artificial Intelligence: Anonymization in Histopathology for Biomedical Research and Education	Neel Kanwal et.al.	2307.09426	null
2023-08-01	Unsupervised Deep Graph Matching Based on Cycle Consistency	Siddharth Tourani et.al.	2307.08930	link
2023-07-15	Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents	Ke Cao et.al.	2307.07763	null
2023-07-09	Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion	Jie S. Li et.al.	2307.05564	null
2023-07-11	ResMatch: Residual Attention Learning for Local Feature Matching	Yuxin Deng et.al.	2307.05180	link
2023-07-11	TIAM – A Metric for Evaluating Alignment in Text-to-Image Generation	Paul Grimal et.al.	2307.05134	link
2023-07-02	TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching	Khang Truong Giang et.al.	2307.00485	link
2023-06-27	Detector-Free Structure from Motion	Xingyi He et.al.	2306.15669	link
2023-06-28	PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment	Jianyuan Wang et.al.	2306.15667	null
2023-06-25	Enhancing Dynamic Image Advertising with Vision-Language Pre-training	Zhoufutu Wen et.al.	2306.14112	null
2023-06-23	LightGlue: Local Feature Matching at Light Speed	Philipp Lindenberger et.al.	2306.13643	link
2023-06-19	Graph Self-Supervised Learning for Endoscopic Image Matching	Manel Farhat et.al.	2306.11141	link
2023-06-09	Leaving the Lines Behind: Vision-Based Crop Row Exit for Agricultural Robot Navigation	Rajitha de Silva et.al.	2306.05869	null
2023-06-07	A2B: Anchor to Barycentric Coordinate for Robust Correspondence	Weiyue Zhao et.al.	2306.02760	null
2023-05-27	Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation	Yueh-Cheng Huang et.al.	2305.17463	null
2023-05-19	SIDAR: Synthetic Image Dataset for Alignment & Restoration	Monika Kwiatkowski et.al.	2305.12036	link
2023-05-18	LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation	Yujie Lu et.al.	2305.11116	link
2023-05-16	A Method for Training-free Person Image Picture Generation	Tianyu Chen et.al.	2305.09817	null
2023-05-15	Image Matching by Bare Homography	Fabio Bellavia et.al.	2305.08946	null
2023-05-12	CLIP-Count: Towards Text-Guided Zero-Shot Object Counting	Ruixiang Jiang et.al.	2305.07304	link
2023-05-10	SENDD: Sparse Efficient Neural Depth and Deformation for Tissue Tracking	Adam Schmidt et.al.	2305.06477	null
2023-05-10	Level-line Guided Edge Drawing for Robust Line Segment Detection	Xinyu Lin et.al.	2305.05883	link
2023-05-09	ColonMapper: topological mapping and localization for colonoscopy	Javier Morlana et.al.	2305.05546	null
2023-04-29	A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges	Xinyu Lin et.al.	2305.00264	link
2023-04-28	SFD2: Semantic-guided Feature Detection and Description	Fei Xue et.al.	2304.14845	link
2023-04-17	DeepSim-Nets: Deep Similarity Networks for Stereo Image Matching	Mohamed Ali Chebbi et.al.	2304.08056	link
2023-04-16	Long-term Visual Localization with Mobile Sensors	Shen Yan et.al.	2304.07691	null
2023-04-12	SiLK – Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194	link
2023-04-16	ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation	Xiaoming Zhao et.al.	2304.03608	link
2023-04-04	GlueStick: Robust Image Matching by Sticking Points and Lines Together	Rémi Pautrat et.al.	2304.02008	link
2023-04-03	PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching	Pedro Castro et.al.	2304.01382	null
2023-04-02	Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints	Guilherme Potje et.al.	2304.00583	link
2023-04-13	Structured Epipolar Matcher for Local Feature Matching	Jiahao Chang et.al.	2303.16646	null
2023-03-29	Adaptive Spot-Guided Transformer for Consistent Local Feature Matching	Jiahuan Yu et.al.	2303.16624	null
2023-03-28	ASIC: Aligning Sparse in-the-wild Image Collections	Kamal Gupta et.al.	2303.16201	null
2023-03-25	Learning Rotation-Equivariant Features for Visual Correspondence	Jongmin Lee et.al.	2303.15472	null
2023-03-27	Learnable Graph Matching: A Practical Paradigm for Data Association	Jiawei He et.al.	2303.15414	link
2023-03-24	Efficient and Accurate Co-Visible Region Localization with Matching Key-Points Crop (MKPC): A Two-Stage Pipeline for Enhancing Image Matching Performance	Hongjian Song et.al.	2303.13794	null
2023-03-15	Rethinking Optical Flow from Geometric Matching Consistent Perspective	Qiaole Dong et.al.	2303.08384	link
2023-04-04	PATS: Patch Area Transportation with Subdivision for Local Feature Matching	Junjie Ni et.al.	2303.07700	null
2023-03-07	Parsing Line Segments of Floor Plan Images Using Graph Neural Networks	Mingxiang Chen et.al.	2303.03851	null
2023-03-06	Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints	Chenjie Cao et.al.	2303.02885	link
2023-03-10	ParaFormer: Parallel Attention Transformer for Efficient Feature Matching	Xiaoyong Lu et.al.	2303.00941	null
2023-03-01	RIFT2: Speeding-up RIFT with A New Rotation-Invariance Technique	Jiayuan Li et.al.	2303.00319	link
2023-02-28	Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images	Zhongli Fan et.al.	2302.14239	link
2023-02-25	BrainCLIP: Bridging Brain and Visual-Linguistic Representation via CLIP for Generic Natural Visual Stimulus Decoding from fMRI	Yulong Liu et.al.	2302.12971	link
2023-02-24	Classification of structural building damage grades from multi-temporal photogrammetric point clouds using a machine learning model trained on virtual laser scanning data	Vivien Zahs et.al.	2302.12591	null
2023-02-20	A Large Scale Homography Benchmark	Daniel Barath et.al.	2302.09997	link
2023-02-12	OAMatcher: An Overlapping Areas-based Network for Accurate Local Feature Matching	Kun Dai et.al.	2302.05846	link
2023-02-10	General, Single-shot, Target-less, and Automatic LiDAR-Camera Extrinsic Calibration Toolbox	Kenji Koide et.al.	2302.05094	link
2023-02-03	Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization	Yingying Zhu et.al.	2302.01572	link
2023-01-27	Harmonizing Flows: Unsupervised MR harmonization based on normalizing flows	Farzad Beizaee et.al.	2301.11551	link
2023-01-25	Local Feature Extraction from Salient Regions by Feature Map Transformation	Yerim Jung et.al.	2301.10413	null
2023-01-24	Feature-based Image Matching for Identifying Individual Kākā	Fintan O’Sullivan et.al.	2301.06678	null
2023-01-18	Instance Segmentation Based Graph Extraction for Handwritten Circuit Diagram Images	Johannes Bayer et.al.	2301.03155	null
2023-01-08	DeepMatcher: A Deep Transformer-based Network for Robust and Accurate Local Feature Matching	Tao Xie et.al.	2301.02993	link
2023-01-07	Deep Learning-Based UAV Aerial Triangulation without Image Control Points	Jiageng Zhong et.al.	2301.02869	null
2023-01-06	The UNCOVER Survey: A first-look HST+JWST catalog of 50,000 galaxies near Abell 2744 and beyond	John R. Weaver et.al.	2301.02671	link
2023-02-13	Translating Text Synopses to Video Storyboards	Xu Gu et.al.	2301.00135	link
2022-12-23	SuperGF: Unifying Local and Global Features for Visual Localization	Wenzheng Song et.al.	2212.13105	null
2022-12-26	Transformer and GAN Based Super-Resolution Reconstruction Network for Medical Images	Weizhi Du et.al.	2212.13068	null
2022-12-20	Seafloor-Invariant Caustics Removal from Underwater Imagery	Panagiotis Agrafiotis et.al.	2212.10167	null
2022-12-15	DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients	Rémi Pautrat et.al.	2212.07766	link
2022-12-14	Shared Coupling-bridge for Weakly Supervised Local Feature Learning	Jiayuan Sun et.al.	2212.07047	link
2022-12-05	Real Time Incremental Image Mosaicking Without Use of Any Camera Parameter	Suleyman Melih Portakal et.al.	2212.02302	null
2022-12-05	ObjectMatch: Robust Registration using Canonical Object Correspondences	Can Gümeli et.al.	2212.01985	null
2022-12-07	Universe Points Representation Learning for Partial Multi-Graph Matching	Zhakshylyk Nurlanov et.al.	2212.00780	null
2022-11-30	Self-Supervised Feature Learning for Long-Term Metric Visual Localization	Yuxuan Chen et.al.	2212.00122	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069	link
2022-11-19	Person Text-Image Matching via Text-Feature Interpretability Embedding and External Attack Node Implantation	Fan Li et.al.	2211.08657	link
2022-11-20	Detecting Line Segments in Motion-blurred Images with Events	Huai Yu et.al.	2211.07365	link
2022-11-15	Fast Key Points Detection and Matching for Tree-Structured Images	Hao Wang et.al.	2211.03242	null
2022-10-25	A Comparative Study on Deep-Learning Methods for Dense Image Matching of Multi-angle and Multi-date Remote Sensing Stereo Images	Hessah Albanwan et.al.	2210.14031	null
2022-10-11	DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion	Yuxi Xiao et.al.	2210.05517	null
2022-10-07	Mars Rover Localization Based on A2G Obstacle Distribution Pattern Matching	Lang Zhou et.al.	2210.03398	link
2022-09-27	Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors	Hao Dong et.al.	2209.13586	link
2022-09-25	ECO-TR: Efficient Correspondences Finding Via Coarse-to-Fine Refinement	Dongli Tan et.al.	2209.12213	null
2022-09-22	DRKF: Distilled Rotated Kernel Fusion for Efficiently Boosting Rotation Invariance in Image Matching	Chao Li et.al.	2209.10907	null
2022-11-15	Uncertainty-aware Efficient Subgraph Isomorphism using Graph Topology	Arpan Kusari et.al.	2209.09090	null
2022-09-16	SRFeat: Learning Locally Accurate and Globally Consistent Non-Rigid Shape Correspondence	Lei Li et.al.	2209.07806	link
2022-08-30	ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer	Hongkai Chen et.al.	2208.14201	link
2022-08-25	A Gis Aided Approach for Geolocalizing an Unmanned Aerial System Using Deep Learning	Jianli Wei et.al.	2208.12251	link
2022-08-25	UAS Navigation in the Real World Using Visual Observation	Yuci Han et.al.	2208.12125	null
2022-08-24	Self-Supervised Endoscopic Image Key-Points Matching	Manel Farhat et.al.	2208.11424	link
2022-08-22	Equivariant Hypergraph Neural Networks	Jinwoo Kim et.al.	2208.10428	link
2022-09-22	Understanding Attention for Vision-and-Language Tasks	Feiqi Cao et.al.	2208.08104	link
2022-08-16	Hierarchical Attention Network for Few-Shot Object Detection via Meta-Contrastive Learning	Dongwoo Park et.al.	2208.07039	link
2022-08-04	Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification	Xinyu Lin et.al.	2208.02450	link
2022-08-04	OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images	Weijia Li et.al.	2208.00928	null
2022-07-29	Testing Relational Understanding in Text-Guided Image Generation	Colin Conwell et.al.	2208.00005	null
2022-07-21	Pose for Everything: Towards Category-Agnostic Pose Estimation	Lumin Xu et.al.	2207.10387	link
2022-07-20	Explaining Deepfake Detection by Analysing Image Matching	Shichao Dong et.al.	2207.09679	link
2022-07-18	Adaptive Assignment for Geometry Aware Local Feature Matching	Dihe Huang et.al.	2207.08427	link
2022-07-16	Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching	Jiazhen Liu et.al.	2207.07932	link
2022-07-06	Virtual staining of defocused autofluorescence images of unlabeled tissue using deep neural networks	Yijie Zhang et.al.	2207.02946	null
2022-07-01	TopicFM: Robust and Interpretable Feature Matching with Topic-assisted	Khang Truong Giang et.al.	2207.00328	link
2022-06-16	Virtual Correspondence: Humans as a Cue for Extreme-View Geometry	Wei-Chiu Ma et.al.	2206.08365	null
2022-06-15	Self-Supervised Learning of Image Scale and Orientation	Jongmin Lee et.al.	2206.07259	link
2022-05-27	Image Keypoint Matching using Graph Neural Networks	Nancy Xu et.al.	2205.14275	null
2022-05-27	Fine-tuning deep learning models for stereo matching using results from semi-global matching	Hessah Albanwan et.al.	2205.14051	null
2022-05-23	TransforMatcher: Match-to-Match Attention for Semantic Correspondence	Seungwook Kim et.al.	2205.11634	link
2022-05-16	ReDFeat: Recoupling Detection and Description for Multimodal Feature Learning	Yuxin Deng et.al.	2205.07439	null
2022-05-06	BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo Surgical Image Matching	Jingwei Song et.al.	2205.03133	link
2022-05-10	AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching	Khanh Nguyen et.al.	2205.02849	link
2022-04-27	Gleo-Det: Deep Convolution Feature-Guided Detector with Local Entropy Optimization for Salient Points	Chao Li et.al.	2204.12884	null
2022-04-22	SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite	Runzhe Zhu et.al.	2204.10704	link
2022-04-20	Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations	Leila Pishdad et.al.	2204.09268	null
2022-04-19	OpenGlue: Open Source Graph Neural Net Based Pipeline for Image Matching	Ostap Viniavskyi et.al.	2204.08870	link
2022-04-19	Self-Supervised Equivariant Learning for Oriented Keypoint Detection	Jongmin Lee et.al.	2204.08613	link
2022-04-22	Efficient Linear Attention for Fast and Accurate Keypoint Matching	Suwichaya Suwanwimolkul et.al.	2204.07731	null
2022-04-08	Lightweight starshade position sensing with convolutional neural networks and simulation-based inference	Andrew Chen et.al.	2204.03853	link
2022-03-30	AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift	Burak Yildiz et.al.	2203.16291	link
2022-03-29	Photographic Visualization of Weather Forecasts with Generative Adversarial Networks	Christian Sigg et.al.	2203.15601	link
2022-03-29	Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots	Pranay Mathur et.al.	2203.15272	null
2022-03-28	Optimizing Elimination Templates by Greedy Parameter Search	Evgeniy Martyushev et.al.	2203.14901	link
2022-03-28	S2-Net: Self-supervision Guided Feature Representation Learning for Cross-Modality Images	Shasha Mei et.al.	2203.14581	null
2022-03-26	Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching	Yujiao Shi et.al.	2203.14148	link
2022-03-24	Keypoints Tracking via Transformer Networks	Oleksii Nasypanyi et.al.	2203.12848	link
2022-03-21	MatchFormer: Interleaving Attention in Transformers for Feature Matching	Qing Wang et.al.	2203.09645	link
2022-03-14	There’s no difference: Convolutional Neural Networks for transient detection without template subtraction	Tatiana Acero-Cuellar et.al.	2203.07390	link
2022-03-25	Cross Language Image Matching for Weakly Supervised Semantic Segmentation	Jinheng Xie et.al.	2203.02668	link
2022-03-01	CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP	Zihao Wang et.al.	2203.00386	null
2022-03-09	Time-resolved Imaging of Stochastic Cascade Reactions over a Submillisecond to Second Time Range at the Angstrom Level	Toshiki Shimizu et.al.	2202.13332	null
2022-02-16	Cross-view and Cross-domain Underwater Localization based on Optical Aerial and Acoustic Underwater Images	Matheus M. Dos Santos et.al.	2202.07817	null
2022-02-14	CATs++: Boosting Cost Aggregation with Convolutions and Transformers	Seokju Cho et.al.	2202.06817	link
2022-02-11	Improving Image-recognition Edge Caches with a Generative Adversarial Network	Guilherme B. Souza et.al.	2202.05929	null
2022-02-08	Learning Optical Flow with Adaptive Graph Reasoning	Ao Luo et.al.	2202.03857	link
2022-02-03	Sim2Real Object-Centric Keypoint Detection and Description	Chengliang Zhong et.al.	2202.00448	null
2022-01-27	Efficient divide-and-conquer registration of UAV and ground LiDAR point clouds through canopy shape context	Jie Shao et.al.	2201.11296	null
2021-12-24	Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation	Zhiwei Liu et.al.	2112.12917	null
2021-12-20	Scale-Net: Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching	Yujie Fu et.al.	2112.10485	null
2021-12-19	GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor	Jean-Baptiste Carluer et.al.	2112.10258	link
2021-12-14	More Control for Free! Image Synthesis with Semantic Diffusion Guidance	Xihui Liu et.al.	2112.05744	null
2021-12-08	Label-free virtual HER2 immunohistochemical staining of breast tissue using deep learning	Bijie Bai et.al.	2112.05240	null
2021-12-01	FaSS-MVS – Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery	Boitumelo Ruf et.al.	2112.00821	null
2021-12-01	CLIPstyler: Image Style Transfer with a Single Text Condition	Gihyun Kwon et.al.	2112.00374	link
2021-11-29	Nonlinear Intensity Underwater Sonar Image Matching Method Based on Phase Information and Deep Convolution Features	Xiaoteng Zhou et.al.	2111.15514	null
2021-11-29	Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic	Yoad Tewel et.al.	2111.14447	link
2021-11-29	Heterogeneous Visible-Thermal and Visible-Infrared Face Recognition using Unit-Class Loss and Cross-Modality Discriminator	Usman Cheema et.al.	2111.14339	null
2021-11-17	Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network	Xiaoming Zhao et.al.	2111.09006	null
2021-11-17	Nonlinear Intensity Sonar Image Matching based on Deep Convolution Features	Xiaoteng Zhou et.al.	2111.08994	null
2021-10-30	A Deep Search for Faint Chandra X-ray Sources, Radio Sources, and Optical Counterparts in NGC 6752	Haldan N. Cohn et.al.	2111.00357	null
2021-10-01	Robustly Removing Deep Sea Lighting Effects for Visual Mapping of Abyssal Plains	Kevin Köser et.al.	2110.00480	null
2021-09-29	Visually Grounded Concept Composition	Bowen Zhang et.al.	2109.14115	null
2021-09-27	HarrisZ $^+$ : Harris Corner Selection for Next-Gen Image Matching Pipelines	Fabio Bellavia et.al.	2109.12925	null
2021-09-20	Viewpoint Invariant Dense Matching for Visual Geolocalization	Gabriele Berton et.al.	2109.09827	link
2021-09-20	Image Subtraction in Fourier Space	Lei Hu et.al.	2109.09334	link
2021-09-10	Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization	Sungho Yoon et.al.	2109.04753	link
2021-09-08	Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes	Wenzheng Song et.al.	2109.03585	null
2021-08-27	A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images	Xiaoteng Zhou et.al.	2108.12151	null
2021-08-27	Matching Underwater Sonar Images by the Learned Descriptor Based on Style Transfer Method	Xiaoteng Zhou et.al.	2108.12072	null
2021-08-26	Efficient Joint Object Matching via Linear Programming	Antonio De Rosa et.al.	2108.11911	null

NeRF

Publish Date	Title	Authors	PDF	Code
2025-07-23	Exploring Active Learning for Label-Efficient Training of Semantic Neural Radiance Field	Yuzhe Zhu et.al.	2507.17351	null
2025-07-22	Sparse-View 3D Reconstruction: Recent Advances and Open Challenges	Tanveer Younis et.al.	2507.16406	null
2025-07-19	DiSCO-3D : Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF	Doriand Petit et.al.	2507.14596	null
2025-07-19	Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey	Jiahui Zhang et.al.	2507.14501	null
2025-07-18	TimeNeRF: Building Generalizable Neural Radiance Fields across Time from Few-Shot Input Views	Hsiang-Hui Hung et.al.	2507.13929	null
2025-07-18	EPSilon: Efficient Point Sampling for Lightening of Hybrid-based 3D Avatar Generation	Seungjun Moon et.al.	2507.13648	null
2025-07-16	DoRF: Doppler Radiance Fields for Robust Human Activity Recognition Using Wi-Fi	Navid Hasanzadeh et.al.	2507.12132	null
2025-07-16	HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing	Tielong Wang et.al.	2507.11971	null
2025-07-14	VoxelRF: Voxelized Radiance Field for Fast Wireless Channel Modeling	Zihang Zeng et.al.	2507.09987	null
2025-07-12	Stable Score Distillation	Haiming Zhu et.al.	2507.09168	null
2025-07-11	From images to properties: a NeRF-driven framework for granular material parameter inversion	Cheng-Hsi Hsiao et.al.	2507.09005	null
2025-07-10	MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation	Bangning Wei et.al.	2507.07519	null
2025-07-14	BayesSDF: Surface-Based Laplacian Uncertainty Estimation for 3D Geometry with Neural Signed Distance Fields	Rushil Desai et.al.	2507.06269	null
2025-07-08	Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering	Jiayi Song et.al.	2507.06103	null
2025-07-08	DreamArt: Generating Interactable Articulated Objects from a Single Image	Ruijie Lu et.al.	2507.05763	null
2025-07-06	A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields	Aoxiang Fan et.al.	2507.04408	null
2025-07-02	Tile and Slide : A New Framework for Scaling NeRF from Local to Global 3D Earth Observation	Camille Billouard et.al.	2507.01631	null
2025-07-01	Surgical Neural Radiance Fields from One Image	Alberto Neri et.al.	2507.00969	null
2025-07-01	PlantSegNeRF: A few-shot, cross-dataset method for plant 3D instance point cloud reconstruction via joint-channel NeRF with multi-view image instance matching	Xin Yang et.al.	2507.00371	null
2025-06-30	AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention	Ziao Liu et.al.	2506.23611	null
2025-06-29	Dynamic View Synthesis from Small Camera Motion Videos	Huiqiang Sun et.al.	2506.23153	null
2025-06-27	UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields	Fabian Perez et.al.	2506.21884	null
2025-06-24	ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes	Chenhao Zhang et.al.	2506.21629	null
2025-06-26	PanSt3R: Multi-view Consistent Panoptic Segmentation	Lojze Zust et.al.	2506.21348	null
2025-06-25	Joint attitude estimation and 3D neural reconstruction of non-cooperative space objects	Clément Forray et.al.	2506.20638	null
2025-06-24	NeRF-based CBCT Reconstruction needs Normalization and Initialization	Zhuowei Xu et.al.	2506.19742	null
2025-06-25	Self-Supervised Multimodal NeRF for Autonomous Driving	Gaurav Sharma et.al.	2506.19615	null
2025-06-24	HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis	Xiaoyuan Wang et.al.	2506.19291	null
2025-06-23	MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation	Tianchen Deng et.al.	2506.18678	null
2025-06-26	2D Triangle Splatting for Direct Differentiable Mesh Training	Kaifeng Sheng et.al.	2506.18575	link
2025-06-22	Limitations of NERF with pre-trained Vision Features for Few-Shot 3D Reconstruction	Ankit Sanjyal et.al.	2506.18208	null
2025-06-21	3D Gaussian Splatting for Fine-Detailed Surface Reconstruction in Large-Scale Scene	Shihan Chen et.al.	2506.17636	null
2025-06-23	R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision	Weeyoung Kwon et.al.	2506.16262	link
2025-06-24	RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories	Qingsong Yan et.al.	2506.15242	null
2025-06-17	Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction	Zhengquan Zhang et.al.	2506.14856	null
2025-06-18	Rasterizing Wireless Radiance Field via Deformable 2D Gaussian Splatting	Mufan Liu et.al.	2506.12787	null
2025-06-17	Efficient multi-view training for 3D Gaussian Splatting	Minhyuk Choi et.al.	2506.12727	null
2025-06-12	PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting	Lintao Xiang et.al.	2506.10335	null
2025-06-11	The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images without Any 3D Knowledge	Haoru Wang et.al.	2506.09885	null
2025-06-10	A Probability-guided Sampler for Neural Implicit Surface Rendering	Gonçalo Dias Pais et.al.	2506.08619	null
2025-06-09	Speedy Deformable 3D Gaussian Splatting: Fast Rendering and Compression of Dynamic Scenes	Allen Tu et.al.	2506.07917	link
2025-06-20	Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency	Xiangyu Guo et.al.	2506.07497	null
2025-06-07	SPC to 3D: Novel View Synthesis from Binary SPC via I2I translation	Sumit Sharma et.al.	2506.06890	null
2025-06-06	Splat and Replace: 3D Reconstruction with Repetitive Elements	Nicolás Violante et.al.	2506.06462	null
2025-06-06	NeurNCD: Novel Class Discovery via Implicit Neural Representation	Junming Wang et.al.	2506.06412	null
2025-06-06	Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments	Mingrui Li et.al.	2506.05965	null
2025-06-06	ProJo4D: Progressive Joint Optimization for Sparse-View Inverse Physics Estimation	Daniel Rho et.al.	2506.05317	null
2025-06-06	Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting	Nan Wang et.al.	2506.05280	link
2025-06-05	Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer	Filip Slezak et.al.	2506.04908	null
2025-05-30	Hi-Dyna Graph: Hierarchical Dynamic Scene Graph for Robotic Autonomy in Human-Centric Environments	Jiawei Hou et.al.	2506.00083	null
2025-05-29	PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views	Mohamed Rayan Barhdadi et.al.	2505.23481	link
2025-05-29	LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering	Jonas Kulhanek et.al.	2505.23158	null
2025-05-28	Can NeRFs See without Cameras?	Chaitanya Amballa et.al.	2505.22441	null
2025-05-28	Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss	Wenjun Lu et.al.	2505.22279	null
2025-05-28	Hyperspectral Gaussian Splatting	Sunil Kumar Narayanan et.al.	2505.21890	null
2025-05-27	Structure from Collision	Takuhiro Kaneko et.al.	2505.21335	null
2025-05-26	OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender	Shintaro Ito et.al.	2505.20126	link
2025-05-30	ErpGS: Equirectangular Image Rendering enhanced with 3D Gaussian Regularization	Shintaro Ito et.al.	2505.19883	null
2025-05-26	GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis	You Wang et.al.	2505.19813	link
2025-05-26	Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction	Li Fang et.al.	2505.19793	link
2025-05-26	ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting	Wenhua Wu et.al.	2505.19420	null
2025-05-25	Triangle Splatting for Real-Time Radiance Field Rendering	Jan Held et.al.	2505.19175	null
2025-05-22	UAV See, UGV Do: Aerial Imagery and Virtual Teach Enabling Zero-Shot Ground Vehicle Repeat	Desiree Fisker et.al.	2505.16912	null
2025-05-19	IPENS:Interactive Unsupervised Framework for Rapid Plant Phenotyping Extraction via NeRF-SAM2 Fusion	Wentao Song et.al.	2505.13633	null
2025-05-19	3D Gaussian Adaptive Reconstruction for Fourier Light-Field Microscopy	Chenyu Xu et.al.	2505.12875	null
2025-05-18	Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey	Calvin Galagain et.al.	2505.12384	null
2025-05-16	MutualNeRF: Improve the Performance of NeRF under Limited Samples with Mutual Information Theory	Zifan Wang et.al.	2505.11386	null
2025-05-16	EA-3DGS: Efficient and Adaptive 3D Gaussians with Highly Enhanced Quality for outdoor scenes	Jianlin Guo et.al.	2505.10787	link
2025-05-15	Large-Scale Gaussian Splatting SLAM	Zhe Xin et.al.	2505.09915	null
2025-05-14	Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians	Ma Changfeng et.al.	2505.09413	link
2025-05-14	FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling	Yue Wen et.al.	2505.09406	null
2025-05-12	TUGS: Physics-based Compact Representation of Underwater Scenes by Tensorized Gaussian	Shijie Lian et.al.	2505.08811	null
2025-05-13	FOCI: Trajectory Optimization on Gaussian Splats	Mario Gomez Andreu et.al.	2505.08510	null
2025-05-13	TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset	Olaf Wysocki et.al.	2505.07396	null
2025-05-12	Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild	Lintao Xiang et.al.	2505.07373	null
2025-05-11	NeuGen: Amplifying the ‘Neural’ in Neural Radiance Fields for Domain Generalization	Ahmed Qazi et.al.	2505.06894	null
2025-05-10	3D Characterization of Smoke Plume Dispersion Using Multi-View Drone Swarm	Nikil Krishnakumar et.al.	2505.06638	null
2025-05-10	FlexNeRFer: A Multi-Dataflow, Adaptive Sparsity-Aware Accelerator for On-Device NeRF Rendering	Seock-Hwan Noh et.al.	2505.06504	null
2025-05-08	3D Scene Generation: A Survey	Beichen Wen et.al.	2505.05474	link
2025-05-04	HandOcc: NeRF-based Hand Rendering with Occupancy Networks	Maksym Ivashechkin et.al.	2505.02079	null
2025-05-04	Learning Heterogeneous Mixture of Scene Experts for Large-scale Neural Radiance Fields	Zhenxing Mi et.al.	2505.02005	link
2025-05-03	AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting	Junhao Shi et.al.	2505.01799	null
2025-05-03	Unified Steganography via Implicit Neural Representation	Qi Song et.al.	2505.01749	null
2025-04-30	A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond	Jiajia Li et.al.	2505.00737	link
2025-05-01	Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation	Feng Xue et.al.	2505.00378	null
2025-04-29	GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction	Yuhan Xie et.al.	2504.21067	link
2025-04-29	Large-scale visual SLAM for in-the-wild videos	Shuo Sun et.al.	2504.20496	null
2025-05-01	GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting	Jongwon Lee et.al.	2504.20379	null
2025-04-29	Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views	Jiang Wu et.al.	2504.20378	link
2025-04-28	Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video	Hoang Chuong Nguyen et.al.	2504.19819	null
2025-04-27	Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users	Apurv Varshney et.al.	2504.19345	null
2025-04-29	IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos	Yuan Li et.al.	2504.19165	null
2025-04-28	RGS-DR: Reflective Gaussian Surfels with Deferred Rendering for Shiny Objects	Georgios Kouros et.al.	2504.18468	null
2025-04-23	Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning	Mingxuan Cui et.al.	2504.17815	link
2025-04-24	CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos	Shucheng Gong et.al.	2504.17728	link
2025-04-23	Dual-Camera All-in-Focus Neural Radiance Fields	Xianrui Luo et.al.	2504.16636	null
2025-04-23	Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks	Murat Bilgehan Ertan et.al.	2504.16557	null
2025-04-23	SaENeRF: Suppressing Artifacts in Event-based Neural Radiance Fields	Yuanjian Wang et.al.	2504.16389	link
2025-04-22	Pose Optimization for Autonomous Driving Datasets using Neural Rendering Models	Quentin Herau et.al.	2504.15776	null
2025-04-21	StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians	Cailin Zhuang et.al.	2504.15281	null
2025-04-18	Scaling LLaNA: Advancing NeRF-Language Understanding Through Large-Scale Training	Andrea Amaduzzi et.al.	2504.13995	null
2025-04-21	SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM	Samuel Cerezo et.al.	2504.13713	link
2025-04-16	BEV-GS: Feed-forward Gaussian Splatting in Bird’s-Eye-View for Road Reconstruction	Wenhua Wu et.al.	2504.13207	null
2025-04-17	GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration	Rendong Zhang et.al.	2504.12999	link
2025-04-16	R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors	Haoyang Wang et.al.	2504.11946	null
2025-04-19	LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis	Hao Sun et.al.	2504.10331	null
2025-04-14	MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling	Yunpeng Tan et.al.	2504.09878	null
2025-04-14	NeRF-Based Transparent Object Grasping Enhanced by Shape Priors	Yi Han et.al.	2504.09868	null
2025-04-11	HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields	Asterios Reppas et.al.	2504.08901	null
2025-04-09	Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting	Daiwei Zhang et.al.	2504.06978	null
2025-04-09	S-EO: A Large-Scale Dataset for Geometry-Aware Shadow Detection in Remote Sensing Applications	Masquil Elías et.al.	2504.06920	null
2025-04-09	SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering	Hanxiao Sun et.al.	2504.06815	link
2025-04-08	Meta-Continual Learning of Neural Fields	Seungyoon Woo et.al.	2504.05806	null
2025-04-08	SE4Lip: Speech-Lip Encoder for Talking Head Synthesis to Solve Phoneme-Viseme Alignment Ambiguity	Yihuan Huang et.al.	2504.05803	null
2025-04-08	InvNeRF-Seg: Fine-Tuning a Pre-Trained NeRF for 3D Object Segmentation	Jiangsan Zhao et.al.	2504.05751	null
2025-04-07	DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal	Wanzhou Liu et.al.	2504.04679	null
2025-04-06	Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models	Etienne Chassaing et.al.	2504.04448	null
2025-04-04	NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices	Zhe Wang et.al.	2504.03415	null
2025-04-03	MultiNeRF: Multiple Watermark Embedding for Neural Radiance Fields	Yash Kulthe et.al.	2504.02517	null
2025-04-03	LPA3D: 3D Room-Level Scene Generation from In-the-Wild Images	Ming-Jia Yang et.al.	2504.02337	null
2025-04-01	OccludeNeRF: Geometric-aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF	Jingyu Shi et.al.	2504.02007	null
2025-04-02	Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis	Niluthpol Chowdhury Mithun et.al.	2504.01960	null
2025-04-02	BOGausS: Better Optimized Gaussian Splatting	Stéphane Pateux et.al.	2504.01844	null
2025-04-02	FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking	Ulas Gunes et.al.	2504.01732	null
2025-04-02	RealityAvatar: Towards Realistic Loose Clothing Modeling in Animatable 3D Gaussian Avatars	Yahui Li et.al.	2504.01559	null
2025-04-02	Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment	Ziteng Cui et.al.	2504.01503	link
2025-04-01	Neural Pruning for 3D Scene Reconstruction: Efficient NeRF Acceleration	Tianqi Ding et.al.	2504.00950	null
2025-04-01	NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds	Mahan Rafidashti et.al.	2504.00859	null
2025-03-31	NeRF-Based defect detection	Tianqi et.al.	2504.00270	null
2025-03-31	LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors	Han Zhou et.al.	2504.00219	null
2025-03-31	ERUPT: Efficient Rendering with Unposed Patch Transformer	Maxim V. Shugaev et.al.	2503.24374	null
2025-03-29	NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations	Zhenyu Tang et.al.	2503.23162	null
2025-03-28	ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting	Wenjie Liu et.al.	2503.22218	null
2025-03-27	NeRF-based Point Cloud Reconstruction using a Stationary Camera for Agricultural Applications	Kibon Ku et.al.	2503.21958	null
2025-03-27	Refined Geometry-guided Head Avatar Reconstruction from Monocular RGB Video	Pilseo Park et.al.	2503.21886	null
2025-03-27	HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM	Ziren Gong et.al.	2503.21778	null
2025-04-01	RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting	Qiyu Dai et.al.	2503.21442	null
2025-03-28	LandMarkSystem Technical Report	Zhenxiang Ma et.al.	2503.21364	link
2025-03-27	UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation	Yehui Shen et.al.	2503.21338	link
2025-03-25	CoMapGS: Covisibility Map-based Gaussian Splatting for Sparse Novel View Synthesis	Youngkyoon Jang et.al.	2503.20998	null
2025-03-26	AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports	Xiangwen Zhang et.al.	2503.20654	null
2025-03-26	EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis	Sheng Miao et.al.	2503.20168	null
2025-03-25	Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals	Zhirui Dai et.al.	2503.20066	null
2025-03-25	MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities	Federico Lincetto et.al.	2503.19673	null
2025-03-24	NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting	Yulong Zheng et.al.	2503.18794	null
2025-03-25	LookCloser: Frequency-aware Radiance Field for Tiny-Detail Scene	Xiaoyu Zhang et.al.	2503.18513	null
2025-03-24	NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction	Wenyuan Zhang et.al.	2503.18361	null
2025-03-23	End-to-End Implicit Neural Representations for Classification	Alexander Gielisse et.al.	2503.18123	link
2025-03-23	Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving	Junhao Ge et.al.	2503.18108	link
2025-03-23	PanopticSplatting: End-to-End Panoptic Gaussian Splatting	Yuxuan Xie et.al.	2503.18073	null
2025-03-21	Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping	Emanuele Giacomini et.al.	2503.17491	link
2025-03-21	FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields	Kwan Yun et.al.	2503.17095	link
2025-03-21	DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery	Jiadong Tang et.al.	2503.16964	null
2025-03-20	Digitally Prototype Your Eye Tracker: Simulating Hardware Performance using 3D Synthetic Data	Esther Y. H. Lin et.al.	2503.16742	null
2025-03-20	Enhancing Close-up Novel View Synthesis via Pseudo-labeling	Jiatong Xia et.al.	2503.15908	link
2025-03-19	SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints	Weiwen Hu et.al.	2503.15712	null
2025-03-19	DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis	Yuming Gu et.al.	2503.15667	link
2025-03-19	GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector	Zechuan Li et.al.	2503.15211	null
2025-03-19	MultiBARF: Integrating Imagery of Different Wavelength Regions by Using Neural Radiance Fields	Kana Kurata et.al.	2503.15070	null
2025-03-19	3D Engine-ready Photorealistic Avatars via Dynamic Textures	Yifan Wang et.al.	2503.14943	null
2025-03-19	ClimateGS: Real-Time Climate Simulation with 3D Gaussian Style Transfer	Yuezhen Xie et.al.	2503.14845	null
2025-03-18	Segmentation-Guided Neural Radiance Fields for Novel Street View Synthesis	Yizhou Li et.al.	2503.14219	null
2025-03-17	Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios	Iryna Repinetska et.al.	2503.13710	null
2025-03-17	TriDF: Triplane-Accelerated Density Fields for Few-Shot Remote Sensing Novel View Synthesis	Jiaming Kang et.al.	2503.13347	null
2025-03-17	DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction	Rui Wang et.al.	2503.13176	null
2025-03-17	DivCon-NeRF: Generating Augmented Rays with Diversity and Consistency for Few-shot View Synthesis	Ingyun Lee et.al.	2503.12947	null
2025-03-15	FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields	Rui Qian et.al.	2503.12086	null
2025-03-14	Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation	Xianming Zeng et.al.	2503.11731	null
2025-03-13	Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations	Xunzhi Zheng et.al.	2503.10464	null
2025-03-13	AI-assisted 3D Preservation and Reconstruction of Temple Arts	Naai-Jung Shih et.al.	2503.10031	null
2025-03-12	Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation	Máté Tóth et.al.	2503.09464	null
2025-03-11	GAS-NeRF: Geometry-Aware Stylization of Dynamic Radiance Fields	Nhat Phuong Anh Vu et.al.	2503.08483	null
2025-03-17	Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios	Zikang Yuan et.al.	2503.08317	null
2025-03-11	GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats	Kai Deng et.al.	2503.08071	link
2025-03-11	NeRF-VIO: Map-Based Visual-Inertial Odometry with Initialization Leveraging Neural Radiance Fields	Yanyu Zhang et.al.	2503.07952	null
2025-03-10	Neural Radiance and Gaze Fields for Visual Attention Modeling in 3D Environments	Andrei Chubarau et.al.	2503.07828	null
2025-03-10	CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving	Ziliang Xiong et.al.	2503.07425	null
2025-03-08	Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction	Kai Li et.al.	2503.06161	null
2025-03-08	SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography	Xuanyu Zhang et.al.	2503.06118	null
2025-03-08	NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features	Hongjia Zhai et.al.	2503.06117	null
2025-03-06	Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering	Idris O. Sunmola et.al.	2503.04079	null
2025-03-05	LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation	Qian Feng et.al.	2503.03890	null
2025-03-04	Tracking-Aware Deformation Field Estimation for Non-rigid 3D Reconstruction in Robotic Surgeries	Zeqing Wang et.al.	2503.02558	null
2025-03-04	2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting	Qipeng Yan et.al.	2503.02452	null
2025-03-04	Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views	Yingji Zhong et.al.	2503.02230	null
2025-03-04	Zero-Shot Sim-to-Real Visual Quadrotor Control with Hard Constraints	Yan Miao et.al.	2503.02198	null
2025-03-03	Data Augmentation for NeRFs in the Low Data Limit	Ayush Gaggar et.al.	2503.02092	null
2025-03-03	Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models	Jay Zhangjie Wu et.al.	2503.01774	null
2025-03-05	Category-level Meta-learned NeRF Priors for Efficient Object Mapping	Saad Ejaz et.al.	2503.01582	null
2025-03-03	LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training	Kaimin Liao et.al.	2503.01199	link
2025-03-02	DreamPrinting: Volumetric Printing Primitives for High-Fidelity 3D Printing	Youjia Wang et.al.	2503.00887	null
2025-03-01	Scalable Real2Sim: Physics-Aware Asset Generation Via Robotic Pick-and-Place Setups	Nicholas Pfaff et.al.	2503.00370	link
2025-02-27	Identity-preserving Distillation Sampling by Fixed-Point Iterator	SeonHwa Kim et.al.	2502.19930	null
2025-02-27	NeRFCom: Feature Transform Coding Meets Neural Radiance Field for Free-View 3D Scene Semantic Transmission	Weijie Yue et.al.	2502.19873	null
2025-02-26	Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions	Muhammad Salman Ali et.al.	2502.19457	null
2025-02-26	Does 3D Gaussian Splatting Need Accurate Volumetric Rendering?	Adam Celarek et.al.	2502.19318	link
2025-02-26	The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields	Ziyuan Luo et.al.	2502.19125	null
2025-02-24	Semantic Neural Radiance Fields for Multi-Date Satellite Data	Valentin Wagner et.al.	2502.16992	link
2025-02-22	AquaNeRF: Neural Radiance Fields in Underwater Media with Distractor Removal	Luca Gough et.al.	2502.16351	null
2025-02-22	DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation	Yuxuan Xiong et.al.	2502.16302	null
2025-02-24	Para-Lane: Multi-Lane Dataset Registering Parallel Scans for Benchmarking Novel View Synthesis	Ziqian Ni et.al.	2502.15635	null
2025-02-20	Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2502.14931	null
2025-02-20	NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis	Xiaoxing Liu et.al.	2502.14178	null
2025-02-19	GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian	Bang Du et.al.	2502.14129	null
2025-02-18	Geometry-Aware Diffusion Models for Multiview Scene Inpainting	Ahmad Salimi et.al.	2502.13335	null
2025-02-18	GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis	Pedro Martin et.al.	2502.13196	null
2025-02-18	ROI-NeRFs: Hi-Fi Visualization of Objects of Interest within a Scene by NeRFs Composition	Quoc-Anh Bui et.al.	2502.12673	null
2025-02-21	HumanGif: Single-View Human Diffusion with Generative Prior	Shoukang Hu et.al.	2502.12080	link
2025-02-17	3D Gaussian Inpainting with Depth-Guided Cross-View Consistency	Sheng-Yu Huang et.al.	2502.11801	null
2025-02-13	Embed Any NeRF: Graph Meta-Networks for Neural Tasks on Arbitrary NeRF Architectures	Francesco Ballerini et.al.	2502.09623	null
2025-02-13	DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior	Mingrui Li et.al.	2502.09111	null
2025-02-12	Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision	Tianle Liu et.al.	2502.08352	null
2025-02-10	PrismAvatar: Real-time animated 3D neural head avatars on edge devices	Prashant Raina et.al.	2502.07030	null
2025-02-10	Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC	Siwei Meng et.al.	2502.07007	null
2025-02-08	GWRF: A Generalizable Wireless Radiance Field for Wireless Signal Propagation Modeling	Kang Yang et.al.	2502.05708	null
2025-02-05	VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning	Jayram Palamadai et.al.	2502.05222	null
2025-02-11	PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression	Feifei Li et.al.	2502.04843	null
2025-02-04	SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification	Yifu Tao et.al.	2502.02657	null
2025-02-04	MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning	Shengbo Gu et.al.	2502.02372	null
2025-02-03	FourieRF: Few-Shot NeRFs via Progressive Fourier Frequency Control	Diego Gomez et.al.	2502.01405	null
2025-01-31	VoD-3DGS: View-opacity-Dependent 3D Gaussian Splatting	Mateusz Nowak et.al.	2501.17978	null
2025-01-28	LinPrim: Linear Primitives for Differentiable Volumetric Rendering	Nicolas von Lützow et.al.	2501.16312	null
2025-01-24	SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation	Yujian Liu et.al.	2501.14646	null
2025-02-05	GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting	Junzhe Jiang et.al.	2501.13971	link
2025-01-23	VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM	Gyuhyeon Pak et.al.	2501.13402	null
2025-01-22	Neural Radiance Fields for the Real World: A Survey	Wenhui Xiao et.al.	2501.13104	null
2025-02-02	DWTNeRF: Boosting Few-shot Neural Radiance Fields via Discrete Wavelet Transform	Hung Nguyen et.al.	2501.12637	null
2025-01-21	DNRSelect: Active Best View Selection for Deferred Neural Rendering	Dongli Wu et.al.	2501.12150	null
2025-01-21	Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging	Shuyi Hu et.al.	2501.11884	null
2025-01-16	Poxel: Voxel Reconstruction for 3D Printing	Ruixiang Cao et.al.	2501.10474	null
2025-01-17	Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation	Xiaoyun Zheng et.al.	2501.09947	link
2025-01-16	Normal-NeRF: Ambiguity-Robust Normal Estimation for Highly Reflective Scenes	Ji Shi et.al.	2501.09460	link
2025-01-15	SLC $^2$ -SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM	Yuhang Ming et.al.	2501.08880	null
2025-01-14	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286	null
2025-01-13	Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes	Yuhang Zhang et.al.	2501.08072	null
2025-01-14	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015	null
2025-01-12	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927	link
2025-01-12	ActiveGAMER: Active GAussian Mapping through Efficient Rendering	Liyan Chen et.al.	2501.06897	null
2025-01-17	SuperNeRF-GAN: A Universal 3D-Consistent Super-Resolution Framework for Efficient and Enhanced 3D-Aware Image Synthesis	Peng Zheng et.al.	2501.06770	null
2025-01-11	NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes without References	Qiang Qu et.al.	2501.06488	link
2025-01-10	UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping	Yanjie Li et.al.	2501.05783	null
2025-01-13	Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes	Ludwic Leonard et.al.	2501.05226	link
2025-01-07	NeRFs are Mirror Detectors: Using Structural Similarity for Multi-View Mirror Scene Reconstruction with 3D Surface Primitives	Leif Van Holland et.al.	2501.04074	link
2025-01-07	NeuralSVG: An Implicit Representation for Text-to-Vector Generation	Sagi Polaczek et.al.	2501.03992	null
2025-01-07	DehazeGS: Seeing Through Fog with 3D Gaussian Splatting	Jinze Yu et.al.	2501.03659	null
2025-01-07	ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting	Yifeng Yang et.al.	2501.03605	link
2025-01-07	AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scene	Chaoran Feng et.al.	2501.02807	null
2024-12-29	Bringing Objects to Life: 4D generation from 3D objects	Ohad Rahamim et.al.	2412.20422	null
2024-12-27	Learning Radiance Fields from a Single Snapshot Compressive Image	Yunhao Li et.al.	2412.19483	null
2025-01-05	BeSplat: Gaussian Splatting from a Single Blurry Image and Event Stream	Gopi Raju Matta et.al.	2412.19370	link
2024-12-26	Generating Editable Head Avatars with 3D Gaussian GANs	Guohao Li et.al.	2412.19149	link
2024-12-26	MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo	Byeonggwon Lee et.al.	2412.19130	null
2024-12-26	Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos	Changwoon Choi et.al.	2412.19089	null
2024-12-23	Editing Implicit and Explicit Representations of Radiance Fields: A Survey	Arthur Hubert et.al.	2412.17628	null
2024-12-23	Exploring Dynamic Novel View Synthesis Technologies for Cinematography	Adrian Azzarelli et.al.	2412.17532	null
2024-12-21	LUCES-MV: A Multi-View Dataset for Near-Field Point Light Source Photometric Stereo	Fotios Logothetis et.al.	2412.16737	null
2024-12-20	NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems	Laura Weihl et.al.	2412.16141	null
2024-12-20	NeuroPump: Simultaneous Geometric and Color Rectification for Underwater Images	Yue Guo et.al.	2412.15890	null
2024-12-19	LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction	Pou-Chun Kung et.al.	2412.15447	null
2024-12-18	DreaMark: Rooting Watermark in Score Distillation Sampling Generated Neural Radiance Fields	Xingyu Zhu et.al.	2412.15278	null
2024-12-19	GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting	Qianpu Sun et.al.	2412.14579	null
2024-12-19	Bright-NeRF:Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images	Min Wang et.al.	2412.14547	null
2024-12-18	GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians	Xiaobao Wei et.al.	2412.13983	link
2024-12-17	EOGS: Gaussian Splatting for Earth Observation	Luca Savant Aira et.al.	2412.13047	null
2024-12-18	Optimize the Unseen – Fast NeRF Cleanup with Free Space Prior	Leo Segre et.al.	2412.12772	null
2024-12-17	Towards a Training Free Approach for 3D Scene Editing	Vivek Madhavaram et.al.	2412.12766	null
2024-12-16	GS-ProCams: Gaussian Splatting-based Projector-Camera Systems	Qingyue Deng et.al.	2412.11762	null
2024-12-18	Sequence Matters: Harnessing Video Models in 3D Super-Resolution	Hyun-kyu Ko et.al.	2412.11525	null
2024-12-16	VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression	Qiang Hu et.al.	2412.11362	null
2024-12-13	NeRF-Texture: Synthesizing Neural Radiance Field Textures	Yi-Hua Huang et.al.	2412.10004	null
2024-12-13	Sharpening Your Density Fields: Spiking Neuron Aided Fast Geometry Learning	Yi Gu et.al.	2412.09881	null
2024-12-12	PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields	Sean Wu et.al.	2412.09680	link
2024-12-11	GN-FR:Generalizable Neural Radiance Fields for Flare Removal	Gopi Raju Matta et.al.	2412.08200	null
2024-12-11	NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods	Qiang Qu et.al.	2412.08029	link
2024-12-10	EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering	Toshiya Yura et.al.	2412.07293	null
2024-12-09	Diffusing Differentiable Representations	Yash Savani et.al.	2412.06981	null
2024-12-09	Dynamic EventNeRF: Reconstructing General Dynamic Scenes from Multi-view Event Cameras	Viktor Rudnev et.al.	2412.06770	null
2024-12-09	Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video	Renlong Wu et.al.	2412.06424	link
2024-12-09	Splatter-360: Generalizable 360 $^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images	Zheng Chen et.al.	2412.06250	link
2024-12-07	WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking	Yuqi Tan et.al.	2412.05695	null
2024-12-06	Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories	Susung Hong et.al.	2412.05279	null
2024-12-11	MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting	Peng Chen et.al.	2412.04955	link
2024-12-04	NeRF and Gaussian Splatting SLAM in the Wild	Fabian Schmidt et.al.	2412.03263	link
2024-12-01	SAGA: Surface-Aligned Gaussian Avatar	Ronghan Chen et.al.	2412.00845	null
2024-12-01	CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images	Jian Liu et.al.	2412.00754	null
2024-11-30	Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives	Alex Hanson et.al.	2412.00578	link
2024-11-30	Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects	Amir Barda et.al.	2412.00518	null
2024-11-29	$C^{3}$ -NeRF: Modeling Multiple Scenes via Conditional-cum-Continual Neural Radiance Fields	Prajwal Singh et.al.	2411.19903	null
2024-11-29	Gaussian Splashing: Direct Volumetric Rendering Underwater	Nir Mualem et.al.	2411.19588	null
2024-11-29	ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration	Chaojun Ni et.al.	2411.19548	null
2024-11-29	LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis	Tianqi Li et.al.	2411.19525	null
2024-11-28	SAMa: Material-aware 3D Selection and Segmentation	Michael Fischer et.al.	2411.19322	null
2024-11-27	Surf-NeRF: Surface Regularised Neural Radiance Fields	Jack Naylor et.al.	2411.18652	null
2024-11-26	MLI-NeRF: Multi-Light Intrinsic-Aware Neural Radiance Fields	Yixiong Yang et.al.	2411.17235	link
2024-11-25	The Radiance of Neural Fields: Democratizing Photorealistic and Dynamic Robotic Simulation	Georgina Nuthall et.al.	2411.16940	null
2024-11-27	SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving	Georg Hess et.al.	2411.16816	link
2024-11-25	Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction	Ziyu Zhang et.al.	2411.16392	null
2024-11-25	U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields	Vinayak Gupta et.al.	2411.16172	null
2024-11-24	ZeroGS: Training 3D Gaussian Splatting from Unposed Images	Yu Chen et.al.	2411.15779	null
2024-11-24	GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision	Xu Baixin et.al.	2411.15723	link
2024-11-23	NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation	Menglin Zhang et.al.	2411.15551	null
2024-11-23	SplatSDF: Boosting Neural Implicit SDF via Gaussian Splatting Fusion	Runfa Blark Li et.al.	2411.15468	null
2024-11-20	Sparse Input View Synthesis: 3D Representations and Reliable Priors	Nagabhushan Somraj et.al.	2411.13631	null
2024-11-20	Robust SG-NeRF: Robust Scene Graph Aided Neural Surface Reconstruction	Yi Gu et.al.	2411.13620	null
2024-11-20	GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting	Xiaobao Wei et.al.	2411.12981	null
2024-11-25	SCIGS: 3D Gaussians Splatting from a Snapshot Compressive Image	Zixu Wang et.al.	2411.12471	null
2024-11-19	GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving	Shaoqing Xu et.al.	2411.12452	link
2024-11-18	Towards Degradation-Robust Reconstruction in Generalizable NeRF	Chan Ho Park et.al.	2411.11691	null
2024-11-18	LeC $^2$ O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes	Zhenxing Mi et.al.	2411.11374	null
2024-11-15	The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods	Yifu Tao et.al.	2411.10546	null
2024-11-15	USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting	Kang Chen et.al.	2411.10504	link
2024-11-15	GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization	Yanhao Sun et.al.	2411.10033	null
2024-11-22	BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis	David Svitov et.al.	2411.08508	link
2024-11-13	Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model	Yutao Shen et.al.	2411.08453	null
2024-11-13	MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation	Peng Wang et.al.	2411.08279	link
2024-11-12	TomoGRAF: A Robust and Generalizable Reconstruction Network for Single-View Computed Tomography	Di Xu et.al.	2411.08158	null
2024-11-12	Material Transforms from Disentangled NeRF Representations	Ivan Lopes et.al.	2411.08037	link
2024-11-11	LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes	Zefan Qu et.al.	2411.06757	link
2024-11-10	Through the Curved Cover: Synthesizing Cover Aberrated Scenes with Refractive Field	Liuyue Xie et.al.	2411.06365	null
2024-11-09	AI-Driven Stylization of 3D Environments	Yuanbo Chen et.al.	2411.06067	null
2024-11-08	A Nerf-Based Color Consistency Method for Remote Sensing Images	Zongcheng Zuo et.al.	2411.05557	null
2024-11-08	Rate-aware Compression for NeRF-based Volumetric Video	Zhiyu Zhang et.al.	2411.05322	null
2024-11-07	Planar Reflection-Aware Neural Radiance Fields	Chen Gao et.al.	2411.04984	null
2024-11-07	GANESH: Generalizable NeRF for Lensless Imaging	Rakesh Raj Madavan et.al.	2411.04810	null
2024-11-08	SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation	Xun Tu et.al.	2411.04386	null
2024-11-06	Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis	Rui Peng et.al.	2411.03637	link
2024-11-05	Enhancing Exploratory Capability of Visual Navigation Using Uncertainty of Implicit Scene Representation	Yichen Wang et.al.	2411.03487	link
2024-11-05	CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval	Xin Wen et.al.	2411.02979	null
2024-11-05	Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery	Liv Kåreborn et.al.	2411.02972	null
2024-11-05	Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation	Xavier Timoneda et.al.	2411.02969	null
2024-11-04	NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields	Eric Zhu et.al.	2411.02482	null
2024-11-05	FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training	Ruihong Yin et.al.	2411.02229	null
2024-11-06	GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes	Gaochao Song et.al.	2411.01853	null
2024-11-04	A Probabilistic Formulation of LiDAR Mapping with Neural Radiance Fields	Matthew McDermott et.al.	2411.01725	link
2024-11-01	ZIM: Zero-Shot Image Matting for Anything	Beomyoung Kim et.al.	2411.00626	link
2024-10-31	Scaled Inverse Graphics: Efficiently Learning Large Sets of 3D Scenes	Karim Kassab et.al.	2410.23742	null
2024-10-31	Get a Grip: Multi-Finger Grasp Evaluation at Scale Enables Robust Sim-to-Real Transfer	Tyler Ga Wei Lum et.al.	2410.23701	null
2024-10-31	XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM	Xiaomeng Wang et.al.	2410.23690	link
2024-10-30	Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder	Antoine Schnepf et.al.	2410.22936	null
2024-10-28	MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps	Yating Xu et.al.	2410.21566	link
2024-10-29	EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior	Xin Xiang et.al.	2410.20981	null
2024-10-28	ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings	Suyoung Lee et.al.	2410.20686	link
2024-10-27	GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields	Yusuke Sekikawa et.al.	2410.20306	null
2024-10-25	Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization	Weihang Liu et.al.	2410.19483	link
2024-10-25	Evaluation of strategies for efficient rate-distortion NeRF streaming	Pedro Martin et.al.	2410.19459	null
2024-10-27	Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis	Liang Han et.al.	2410.18822	null
2024-10-24	Real-time 3D-aware Portrait Video Relighting	Ziqi Cai et.al.	2410.18355	link
2024-10-22	Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies	Shrey Vishen et.al.	2410.18137	link
2024-10-23	VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points	Linus Franke et.al.	2410.17932	null
2024-10-23	Few-shot NeRF by Adaptive Rendering Loss Regularization	Qingshan Xu et.al.	2410.17839	null
2024-10-23	Efficient Neural Implicit Representation for 3D Human Reconstruction	Zexu Huang et.al.	2410.17741	link
2024-10-23	PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting	Yu Wang et.al.	2410.17505	null
2024-10-22	LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias	Haian Jin et.al.	2410.17242	null
2024-10-18	GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting	Yusen Xie et.al.	2410.17084	null
2024-10-22	E-3DGS: Gaussian Splatting with Exposure and Motion Events	Xiaoting Yin et.al.	2410.16995	link
2024-10-21	Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions	Malte Prinzler et.al.	2410.16395	null
2024-10-21	FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors	Chin-Yang Lin et.al.	2410.16271	null
2024-10-22	EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting	Bohao Liao et.al.	2410.15392	null
2024-10-19	Neural Radiance Field Image Refinement through End-to-End Sampling Point Optimization	Kazuhiro Ohta et.al.	2410.14958	null
2024-10-18	Learning autonomous driving from aerial imagery	Varun Murali et.al.	2410.14177	null
2024-10-18	DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction	Ange Lou et.al.	2410.14169	null
2024-10-17	DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering	Jiahao Lu et.al.	2410.13607	link
2024-10-21	DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation	Guosheng Zhao et.al.	2410.13571	null
2024-10-17	Object Pose Estimation Using Implicit Representation For Transparent Objects	Varun Burde et.al.	2410.13465	null
2024-10-17	GlossyGS: Inverse Rendering of Glossy Objects with 3D Gaussian Splatting	Shuichang Lai et.al.	2410.13349	null
2024-10-16	3D Gaussian Splatting in Robotics: A Survey	Siting Zhu et.al.	2410.12262	link
2024-10-16	EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior for Sparse View	Zhaorong Wang et.al.	2410.12242	null
2024-10-14	3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications	Eduardo R. Corral-Soto et.al.	2410.10782	null
2024-10-14	NeRF-enabled Analysis-Through-Synthesis for ISAR Imaging of Small Everyday Objects with Sparse and Noisy UWB Radar Data	Md Farhan Tasnim Oshim et.al.	2410.10085	null
2024-10-13	Magnituder Layers for Implicit Neural Representations in 3D	Sang Min Kim et.al.	2410.09771	null
2024-10-12	Improving 3D Finger Traits Recognition via Generalizable Neural Rendering	Hongbin Xu et.al.	2410.09582	null
2024-10-11	SceneCraft: Layout-Guided 3D Scene Generation	Xiuyu Yang et.al.	2410.09049	link
2024-10-11	MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering	Jaehoon Choi et.al.	2410.08941	null
2024-10-11	Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints	Yicheng He et.al.	2410.08780	null
2024-10-10	RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image	Xiaoxue Chen et.al.	2410.08181	null
2024-10-10	IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera	Jian Huang et.al.	2410.08107	link
2024-10-11	NeRF-Accelerated Ecological Monitoring in Mixed-Evergreen Redwood Forest	Adam Korycki et.al.	2410.07418	link
2024-10-09	DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation	Zhiqi Li et.al.	2410.06756	null
2024-10-09	MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes	Zhenhui Ye et.al.	2410.06734	null
2024-10-09	3D Representation Methods: A Survey	Zhengren Wang et.al.	2410.06475	null
2024-10-08	Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters	Guoji Tian et.al.	2410.05772	null
2024-10-07	Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors	Ziwei Liao et.al.	2410.05514	link
2024-10-07	PH-Dropout: Prctical Epistemic Uncertainty Quantification for View Synthesis	Chuanhao Sun et.al.	2410.05468	link
2024-10-07	LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting	Qifeng Chen et.al.	2410.05111	null
2024-10-07	6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering	Zhongpai Gao et.al.	2410.04974	null
2024-10-07	TeX-NeRF: Neural Radiance Fields from Pseudo-TeX Vision	Chonghao Zhong et.al.	2410.04873	null
2024-10-06	Deformable NeRF using Recursively Subdivided Tetrahedra	Zherui Qiu et.al.	2410.04402	null
2024-10-05	Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy	Pengcheng Chen et.al.	2410.04041	null
2024-10-02	MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis	Xiaobiao Du et.al.	2410.02103	link
2024-10-03	EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis	Alexander Mai et.al.	2410.01804	null
2024-10-02	3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection	Yang Cao et.al.	2410.01647	link
2024-10-02	Gaussian Splatting in Mirrors: Reflection-Aware Rendering via Virtual Camera Optimization	Zihan Wang et.al.	2410.01614	link
2024-10-02	Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection	Hongru Yan et.al.	2410.01404	null
2024-10-01	GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer	Youngho Yoon et.al.	2410.00672	link
2024-09-30	Distributed NeRF Learning for Collaborative Multi-Robot Perception	Hongrui Zhao et.al.	2409.20289	null
2024-09-30	Active Neural Mapping at Scale	Zijia Kuang et.al.	2409.20276	null
2024-09-30	OPONeRF: One-Point-One NeRF for Robust Neural Rendering	Yu Zheng et.al.	2409.20043	link
2024-09-28	G3R: Gradient Guided Generalizable Reconstruction	Yun Chen et.al.	2409.19405	null
2024-09-26	LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field	Huan Wang et.al.	2409.18057	link
2024-09-26	Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions	Weng Fei Low et.al.	2409.17988	null
2024-09-26	Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry	Qi Zhang et.al.	2409.17729	null
2024-09-26	TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene	Sandika Biswas et.al.	2409.17459	link
2024-09-25	SeaSplat: Representing Underwater Scenes with 3D Gaussian Splatting and a Physically Grounded Image Formation Model	Daniel Yang et.al.	2409.17345	null
2024-09-25	TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans	Aggelina Chatziagapi et.al.	2409.16666	null
2024-09-26	Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities	Peizhi Yan et.al.	2409.16147	link
2024-09-24	Disentangled Generation and Aggregation for Robust Radiance Fields	Shihe Shen et.al.	2409.15715	null
2024-09-24	Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB	Jae Yong Lee et.al.	2409.15689	null
2024-09-23	AgriNeRF: Neural Radiance Fields for Agriculture in Challenging Lighting Conditions	Samarth Chopra et.al.	2409.15487	null
2024-09-22	MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views	Wangze Xu et.al.	2409.14316	null
2024-09-21	MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors	Zhenhua Du et.al.	2409.14019	null
2024-09-19	CrossRT: A cross platform programming technology for hardware-accelerated ray tracing in CG and CV applications	Vladimir Frolov et.al.	2409.12617	null
2024-09-18	JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation	Sai Tanmay Reddy Chakkera et.al.	2409.12156	null
2024-09-25	BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling	Lulin Zhang et.al.	2409.12014	link
2024-09-17	RenderWorld: World Model with Self-Supervised 3D Label	Ziyang Yan et.al.	2409.11356	null
2024-09-21	HGSLoc: 3DGS-based Heuristic Camera Pose Refinement	Zhongyan Niu et.al.	2409.10925	null
2024-09-16	Baking Relightable NeRF for Real-time Direct/Indirect Illumination Rendering	Euntae Choi et.al.	2409.10327	null
2024-09-16	DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments	Mahmud A. Mohamad et.al.	2409.10041	link
2024-09-15	NARF24: Estimating Articulated Object Structure for Implicit Rendering	Stanley Lewis et.al.	2409.09829	null
2024-09-12	DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors	Thomas Hanwen Zhu et.al.	2409.08278	null
2024-09-11	DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation	Haibo Yang et.al.	2409.07454	null
2024-09-11	ThermalGaussian: Thermal 3D Gaussian Splatting	Rongfeng Lu et.al.	2409.07200	link
2024-09-10	LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation	Archana Swaminathan et.al.	2409.06703	null
2024-09-10	Sources of Uncertainty in 3D Scene Reconstruction	Marcus Klasson et.al.	2409.06407	link
2024-09-09	LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo	Wei Zhi Tang et.al.	2409.06104	link
2024-09-09	G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis	Lutao Jiang et.al.	2409.05617	null
2024-09-09	From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models	Tessa Pulli et.al.	2409.05413	null
2024-09-09	KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction	Davide Di Nucci et.al.	2409.05407	null
2024-09-09	Lagrangian Hashing for Compressed Neural Field Representations	Shrisudhan Govindarajan et.al.	2409.05334	null
2024-09-09	Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems	Jianheng Liu et.al.	2409.05310	null
2024-09-06	SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields	Yuze Wang et.al.	2409.04482	null
2024-09-05	Weight Conditioning for Smooth Optimization of Neural Networks	Hemanth Saratchandran et.al.	2409.03424	null
2024-09-05	Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction	Shen Chen et.al.	2409.03213	null
2024-09-04	UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views	Jiaxin Guo et.al.	2409.02917	link
2024-09-03	GraspSplats: Efficient Manipulation with 3D Feature Splatting	Mazeyu Ji et.al.	2409.02084	null
2024-09-03	$S^2$ NeRF: Privacy-preserving Training Framework for NeRF	Bokang Zhang et.al.	2409.01661	link
2024-08-30	ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images	Xiaoshuai Zhang et.al.	2408.17027	null
2024-08-29	GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content	Lebin Zhou et.al.	2408.16866	null
2024-09-01	Generic Objects as Pose Probes for Few-Shot View Synthesis	Zhirui Gao et.al.	2408.16690	null
2024-08-29	Spurfies: Sparse Surface Reconstruction using Local Geometry Priors	Kevin Raj et.al.	2408.16544	null
2024-08-29	NeRF-CA: Dynamic Reconstruction of X-ray Coronary Angiography with Extremely Sparse-views	Kirsten W. H. Maas et.al.	2408.16355	link
2024-08-28	Towards Realistic Example-based Modeling via 3D Gaussian Stitching	Xinyu Gao et.al.	2408.15708	null
2024-08-27	Learning-based Multi-View Stereo: A Survey	Fangjinhua Wang et.al.	2408.15235	null
2024-08-27	GeoTransfer : Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning	Shubhendu Jena et.al.	2408.14724	null
2024-08-28	FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry	Chunran Zheng et.al.	2408.14035	link
2024-08-25	TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers	Chuanrui Zhang et.al.	2408.13770	null
2024-08-24	G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles	Adil Meric et.al.	2408.13508	null
2024-08-23	SIn-NeRF2NeRF: Editing 3D Scenes with Instructions through Segmentation and Inpainting	Jiseung Hong et.al.	2408.13285	link
2024-08-21	Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations	Lintong Zhang et.al.	2408.11966	null
2024-08-21	Irregularity Inspection using Neural Radiance Field	Tianqi Ding et.al.	2408.11251	null
2024-08-20	GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting	Changkun Liu et.al.	2408.11085	link
2024-08-20	Learning Part-aware 3D Representations by Fusing 2D Gaussians and Superquadrics	Zhirui Gao et.al.	2408.10789	null
2024-08-20	TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks	Jinjie Mai et.al.	2408.10739	null
2024-08-19	$R^2$ -Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement	Haoyang Wang et.al.	2408.10135	null
2024-08-19	DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery	Corentin Dumery et.al.	2408.09928	null
2024-08-20	CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning	Haoyu Zhao et.al.	2408.09663	null
2024-08-18	S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis	Dongze Li et.al.	2408.09347	null
2024-08-17	SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation	Xiao Cao et.al.	2408.09144	null
2024-08-17	HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction	Xiao Zhao et.al.	2408.09104	null
2024-08-16	VF-NeRF: Learning Neural Vector Fields for Indoor Scene Reconstruction	Albert Gassol Puigjaner et.al.	2408.08766	link
2024-08-15	WaterSplatting: Fast Underwater 3D Scene Reconstruction Using Gaussian Splatting	Huapeng Li et.al.	2408.08206	null
2024-08-18	Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space	Hyunjee Lee et.al.	2408.07416	null
2024-08-13	Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture	Yu Feng et.al.	2408.06608	null
2024-08-13	ActiveNeRF: Learning Accurate 3D Geometry by Active Pattern Projection	Jianyu Tao et.al.	2408.06592	link
2024-08-13	HDRGS: High Dynamic Range Gaussian Splatting	Jiahao Wu et.al.	2408.06543	link
2024-08-12	Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering	Jiameng Li et.al.	2408.06286	link
2024-08-12	3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs)	Jaydeep Rade et.al.	2408.06244	null
2024-08-10	Radiance Field Learners As UAV First-Person Viewers	Liqi Yan et.al.	2408.05533	null
2024-08-09	DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow	Hangyu Li et.al.	2408.05008	null
2024-08-09	FewShotNeRF: Meta-Learning-based Novel View Synthesis for Rapid Scene-Specific Adaptation	Piraveen Sivakumar et.al.	2408.04803	null
2024-08-06	LumiGauss: High-Fidelity Outdoor Relighting with 2D Gaussian Splatting	Joanna Kaleta et.al.	2408.04474	link
2024-08-08	A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery	Mengya Xu et.al.	2408.04426	link
2024-08-08	Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods	Yiming Zhou et.al.	2408.04268	null
2024-08-07	Goal-oriented Semantic Communication for the Metaverse Application	Zhe Wang et.al.	2408.03646	null
2024-08-06	RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis	Hugo Blanc et.al.	2408.03356	null
2024-08-06	Efficient NeRF Optimization – Not All Samples Remain Equally Hard	Juuso Korhonen et.al.	2408.03193	null
2024-08-06	MGFs: Masked Gaussian Fields for Meshing Building based on Multi-View Images	Tengfei Wang et.al.	2408.03060	null
2024-08-04	PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone	Xin Yang et.al.	2408.02053	null
2024-08-03	FBINeRF: Feature-Based Integrated Recurrent Network for Pinhole and Fisheye Neural Radiance Fields	Yifan Wu et.al.	2408.01878	null
2024-08-03	E $^3$ NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images	Yunshan Qi et.al.	2408.01840	null
2024-08-02	NeRFoot: Robot-Footprint Estimation for Image-Based Visual Servoing	Daoxin Zhong et.al.	2408.01251	null
2024-08-05	UlRe-NeRF: 3D Ultrasound Imaging through Neural Rendering with Ultrasound Reflection Direction Parameterization	Ziwen Guo et.al.	2408.00860	null
2024-07-31	StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization	Kaiyuan Tang et.al.	2408.00150	null
2024-07-22	PAV: Personalized Head Avatar from Unstructured Video Collection	Akin Caliskan et.al.	2407.21047	null
2024-07-30	Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering	Yanpeng Zhao et.al.	2407.20908	link
2024-07-29	Radiance Fields for Robotic Teleoperation	Maximum Wilder-Smith et.al.	2407.20194	link
2024-07-29	Garment Animation NeRF with Color Editing	Renke Wang et.al.	2407.19774	link
2024-07-27	Revisit Self-supervised Depth Estimation with Local Structure-from-Motion	Shengjie Zhu et.al.	2407.19166	null
2024-07-26	IOVS4NeRF:Incremental Optimal View Selection for Large-Scale NeRFs	Jingpeng Xie et.al.	2407.18611	null
2024-07-24	SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency	Yiming Xie et.al.	2407.17470	null
2024-07-23	HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images	Shreyas Singh et.al.	2407.16503	link
2024-07-23	DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors	Zizheng Yan et.al.	2407.16260	null
2024-07-22	BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes	Chih-Hai Su et.al.	2407.15848	null
2024-07-22	Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures	Ruizhe Wang et.al.	2407.15435	null
2024-07-19	HOTS3D: Hyper-Spherical Optimal Transport for Semantic Alignment of Text-to-3D Generation	Zezeng Li et.al.	2407.14419	null
2024-07-19	DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays	Zongyuan Yang et.al.	2407.14053	null
2024-07-19	Semantic Communications for 3D Human Face Transmission with Neural Radiance Fields	Guanlin Wu et.al.	2407.13992	null
2024-07-18	EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting	Yuchen Weng et.al.	2407.13520	null
2024-07-18	GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields	Xiufeng Huang et.al.	2407.13390	null
2024-07-18	KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter	Yifan Zhan et.al.	2407.13185	null
2024-07-17	Generalizable Human Gaussians for Sparse View Synthesis	Youngjoong Kwon et.al.	2407.12777	link
2024-07-17	SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization	Yiyang Chen et.al.	2407.12667	link
2024-07-17	InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction	Xulong Wang et.al.	2407.12661	link
2024-07-17	Invertible Neural Warp for NeRF	Shin-Fang Chng et.al.	2407.12354	null
2024-07-17	Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections	Congrong Xu et.al.	2407.12306	null
2024-07-18	Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling	Jaehyeok Kim et.al.	2407.11962	null
2024-07-18	IPA-NeRF: Illusory Poisoning Attack Against Neural Radiance Fields	Wenxiang Jiang et.al.	2407.11921	link
2024-07-16	DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation	Jiwook Kim et.al.	2407.11394	link
2024-07-15	Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method	Adam Korycki et.al.	2407.11238	null
2024-07-15	AirNeRF: 3D Reconstruction of Human with Drone and NeRF for Future Communication Systems	Alexey Kotcov et.al.	2407.10865	null
2024-07-15	Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis	Antoine Legrand et.al.	2407.10762	null
2024-07-15	IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild	Shuaixian Wang et.al.	2407.10695	null
2024-07-15	NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis	Yubin Hu et.al.	2407.10482	null
2024-07-15	Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering	Francesco Di Sario et.al.	2407.10389	null
2024-07-14	RS-NeRF: Neural Radiance Fields from Rolling Shutter Images	Muyao Niu et.al.	2407.10267	link
2024-07-14	SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion	Jiyuan Zhang et.al.	2407.10062	null
2024-07-12	Physics-Informed Learning of Characteristic Trajectories for Smoke Reconstruction	Yiming Wang et.al.	2407.09679	link
2024-07-12	Radiance Fields from Photons	Sacha Jungerman et.al.	2407.09386	null
2024-07-12	HPC: Hierarchical Progressive Coding Framework for Volumetric Video	Zihan Zheng et.al.	2407.09026	null
2024-07-11	Feasibility of Neural Radiance Fields for Crime Scene Video Reconstruction	Shariq Nadeem Malik et.al.	2407.08795	null
2024-07-11	WildGaussians: 3D Gaussian Splatting in the Wild	Jonas Kulhanek et.al.	2407.08447	link
2024-07-11	MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos	Yushuo Chen et.al.	2407.08414	link
2024-07-11	Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression	Yuke Xing et.al.	2407.08165	null
2024-07-11	Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields	Haojie Lian et.al.	2407.08154	null
2024-07-11	Survey on Fundamental Deep Learning 3D Reconstruction Techniques	Yonge Bai et.al.	2407.08137	null
2024-07-10	Protecting NeRFs’ Copyright via Plug-And-Play Watermarking Base Model	Qi Song et.al.	2407.07735	null
2024-07-10	Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field	Ganlin Yang et.al.	2407.07461	null
2024-07-09	Reference-based Controllable Scene Stylization with Gaussian Splatting	Yiqun Mei et.al.	2407.07220	null
2024-07-09	Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View	Dogyoon Lee et.al.	2407.06613	null
2024-07-08	RRM: Relightable assets using Radiance guided Material extraction	Diego Gomez et.al.	2407.06397	null
2024-07-08	PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes	Mohammad Reza Karimi Dastjerdi et.al.	2407.06150	null
2024-07-08	Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views	Jiawei Guo et.al.	2407.05666	null
2024-07-08	GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields	Weiyi Xue et.al.	2407.05597	null
2024-07-08	Dynamic Neural Radiance Field From Defocused Monocular Video	Xianrui Luo et.al.	2407.05586	null
2024-07-07	GaussReg: Fast 3D Registration with Gaussian Splatting	Jiahao Chang et.al.	2407.05254	null
2024-07-06	SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction	Weixing Xie et.al.	2407.05023	link
2024-07-04	CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images	Junghe Lee et.al.	2407.03923	null
2024-07-02	MomentsNeRF: Leveraging Orthogonal Moments for Few-Shot Neural Rendering	Ahmad AlMughrabi et.al.	2407.02668	null
2024-07-03	BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream	Wenpu Li et.al.	2407.02174	link
2024-07-01	Active Human Pose Estimation via an Autonomous UAV Agent	Jingxi Chen et.al.	2407.01811	null
2024-07-01	DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction	Yujin Ham et.al.	2407.01761	null
2024-07-01	Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation	Zihan Gao et.al.	2407.01220	link
2024-06-29	Intrinsic PAPR for Point-level 3D Scene Albedo and Shading Editing	Alireza Moazeni et.al.	2407.00500	null
2024-06-28	ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction	Ding-Jiun Huang et.al.	2406.20066	null
2024-06-28	EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting	Daiwei Zhang et.al.	2406.19811	null
2024-06-27	Shorter SPECT Scans Using Self-supervised Coordinate Learning to Synthesize Skipped Projection Views	Zongyu Li et.al.	2406.18840	null
2024-06-25	Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes	Qi Ma et.al.	2406.17438	link
2024-06-25	NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods	Jonas Kulhanek et.al.	2406.17345	null
2024-06-24	From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking	Xiaohao Xu et.al.	2406.16850	link
2024-06-24	Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis	Jianning Deng et.al.	2406.16623	null
2024-06-24	Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction	Tong Qin et.al.	2406.16289	null
2024-06-23	Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study	Zhe Wang et.al.	2406.16068	null
2024-06-23	Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction	Yangdi Lu et.al.	2406.15982	null
2024-06-22	psPRF:Pansharpening Planar Neural Radiance Field for Generalized 3D Reconstruction Satellite Imagery	Tongtong Zhang et.al.	2406.15707	null
2024-06-21	A3D: Does Diffusion Dream about 3D Alignment?	Savva Ignatyev et.al.	2406.15020	null
2024-06-21	E2GS: Event Enhanced Gaussian Splatting	Hiroyuki Deguchi et.al.	2406.14978	link
2024-06-21	Relighting Scenes with Object Insertions in Neural Radiance Fields	Xuening Zhu et.al.	2406.14806	null
2024-06-20	Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment	Yunshan Qi et.al.	2406.14360	null
2024-06-19	NeRF-Feat: 6D Object Pose Estimation using Feature Rendering	Shishir Reddy Vutukur et.al.	2406.13796	null
2024-06-19	Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images	Haruo Fujiwara et.al.	2406.13393	null
2024-06-19	Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields	Youngin Park et.al.	2406.13251	link
2024-06-18	Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models	Paul Henderson et.al.	2406.13099	null
2024-06-18	Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings	Ruijie Tang et.al.	2406.13048	null
2024-06-18	Fast Global Localization on Neural Radiance Field	Mangyu Kong et.al.	2406.12202	link
2024-06-20	TutteNet: Injective 3D Deformations by Composition of 2D Mesh Deformations	Bo Sun et.al.	2406.12121	null
2024-06-17	DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features	Letian Wang et.al.	2406.12095	null
2024-06-17	Uncertainty modeling for fine-tuned implicit functions	Anna Susmelj et.al.	2406.12082	null
2024-06-17	LLaNA: Large Language and NeRF Assistant	Andrea Amaduzzi et.al.	2406.11840	null
2024-06-17	Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization	Huaiji Zhou et.al.	2406.11766	null
2024-06-17	InterNeRF: Scaling Radiance Fields via Parameter Interpolation	Clinton Wang et.al.	2406.11737	null
2024-06-17	NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation	Niu Guanchen et.al.	2406.11259	null
2024-06-15	NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows	Zhenggang Tang et.al.	2406.10543	link
2024-06-15	Federated Neural Radiance Field for Distributed Intelligence	Yintian Zhang et.al.	2406.10474	null
2024-06-14	Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections	Jiacong Xu et.al.	2406.10373	null
2024-06-14	PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting	Alex Hanson et.al.	2406.10219	link
2024-06-14	GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors	Xiqian Yu et.al.	2406.10111	null
2024-06-14	OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control	Yuzhong Huang et.al.	2406.10000	null
2024-06-14	dGrasp: NeRF-Informed Implicit Grasp Policies with Supervised Optimization Slopes	Gergely Sóti et.al.	2406.09939	null
2024-06-14	RaNeuS: Ray-adaptive Neural Surface Reconstruction	Yida Wang et.al.	2406.09801	link
2024-06-13	Rethinking Score Distillation as a Bridge Between Image Distributions	David McAllister et.al.	2406.09417	null
2024-06-13	Preserving Identity with Variational Score for General-purpose 3D Editing	Duong H. Le et.al.	2406.08953	null
2024-06-13	Neural NeRF Compression	Tuan Pham et.al.	2406.08943	null
2024-06-14	AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis	Swapnil Bhosale et.al.	2406.08920	null
2024-06-13	NeRF Director: Revisiting View Selection in Neural Volume Rendering	Wenhui Xiao et.al.	2406.08839	link
2024-06-12	ICE-G: Image Conditional Editing of 3D Gaussian Splats	Vishnu Jaganathan et.al.	2406.08488	null
2024-06-12	OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding	Yinan Deng et.al.	2406.08009	link
2024-06-12	Spatial Annealing Smoothing for Efficient Few-shot Neural Rendering	Yuru Xiao et.al.	2406.07828	link
2024-06-11	C3DAG: Controlled 3D Animal Generation using 3D pose guidance	Sandeep Mishra et.al.	2406.07742	null
2024-06-11	M-LRM: Multi-view Large Reconstruction Model	Mengfei Li et.al.	2406.07648	null
2024-06-11	Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments	Christopher D. Hsu et.al.	2406.07431	null
2024-06-11	Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion	Xin Yuan et.al.	2406.06972	null
2024-06-11	Neural Visibility Field for Uncertainty-Driven Active Mapping	Shangjie Xue et.al.	2406.06948	null
2024-06-10	IllumiNeRF: 3D Relighting without Inverse Rendering	Xiaoming Zhao et.al.	2406.06527	null
2024-06-10	GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation	Haozhe Xie et.al.	2406.06526	link
2024-06-10	PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction	Danpeng Chen et.al.	2406.06521	null
2024-06-10	Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis	Xin Jin et.al.	2406.06216	link
2024-06-10	ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models	Meng-Li Shih et.al.	2406.06133	null
2024-06-09	GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement	Peiye Zhuang et.al.	2406.05649	null
2024-06-07	Multiplane Prior Guided Few-Shot Aerial Scene Rendering	Zihan Gao et.al.	2406.04961	null
2024-06-07	Multi-style Neural Radiance Field with AdaIN	Yu-Wen Pao et.al.	2406.04960	link
2024-06-06	Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization	Takuhiro Kaneko et.al.	2406.04155	null
2024-06-06	How Far Can We Compress Instant-NGP-Based NeRF?	Yihang Chen et.al.	2406.04101	link
2024-06-06	Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling	Xinhang Liu et.al.	2406.03723	null
2024-06-06	Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction	Diwen Wan et.al.	2406.03697	link
2024-06-04	3D-HGS: 3D Half-Gaussian Splatting	Haolin Li et.al.	2406.02720	link
2024-06-06	Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting	Inkyu Shin et.al.	2406.02541	null
2024-06-04	Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning	Jiaxu Wang et.al.	2406.02370	null
2024-06-03	Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting	Shaojie Ma et.al.	2406.01593	null
2024-06-03	Tetrahedron Splatting for 3D Generation	Chun Gu et.al.	2406.01579	link
2024-06-03	Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting	Fang Li et.al.	2406.01042	link
2024-06-02	PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency	Yeonsung Jung et.al.	2406.00798	null
2024-06-02	Representing Animatable Avatar via Factorized Neural Fields	Chunjin Song et.al.	2406.00637	null
2024-06-04	SuperGaussian: Repurposing Video Models for 3D Super Resolution	Yuan Shen et.al.	2406.00609	null
2024-06-02	Efficient Neural Light Fields (ENeLF) for Mobile Devices	Austin Peng et.al.	2406.00598	null
2024-06-01	Bilateral Guided Radiance Field Processing	Yuehao Wang et.al.	2406.00448	null
2024-05-31	R $^2$ -Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction	Ruyi Zha et.al.	2405.20693	link
2024-05-31	4Diffusion: Multi-view Video Diffusion Model for 4D Generation	Haiyu Zhang et.al.	2405.20674	null
2024-05-30	$\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving	Nan Huang et.al.	2405.20323	link
2024-05-30	TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes	Minghao Guo et.al.	2405.20283	null
2024-05-31	NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation	Pedro Martin et.al.	2405.20078	null
2024-05-30	IReNe: Instant Recoloring in Neural Radiance Fields	Alessio Mazzucchelli et.al.	2405.19876	null
2024-05-30	HINT: Learning Complete Human Neural Representations from Limited Viewpoints	Alessandro Sanvito et.al.	2405.19712	null
2024-05-30	View-Consistent Hierarchical 3D SegmentationUsing Ultrametric Feature Fields	Haodi He et.al.	2405.19678	link
2024-05-29	Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy	Zijie Jiang et.al.	2405.18863	null
2024-06-02	NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild	Weining Ren et.al.	2405.18715	link
2024-05-28	Self-supervised Pre-training for Transferable Multi-modal Perception	Xiaohao Xu et.al.	2405.17942	link
2024-05-28	A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction	Bin Zhang et.al.	2405.17891	null
2024-05-29	HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction	Haoyu Zhao et.al.	2405.17872	link
2024-05-28	Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh	Xiangjun Gao et.al.	2405.17811	null
2024-05-28	F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting	Xiangyu Sun et.al.	2405.17083	null
2024-05-29	PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting	Zipeng Wang et.al.	2405.16829	null
2024-05-26	Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors	Soumava Paul et.al.	2405.16517	null
2024-05-24	Neural Elevation Models for Terrain Mapping and Path Planning	Adam Dai et.al.	2405.15227	link
2024-05-27	HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting	Yuanhao Cai et.al.	2405.15125	link
2024-05-24	GS-Hider: Hiding Messages into 3D Gaussian Splatting	Xuanyu Zhang et.al.	2405.15118	null
2024-05-23	NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections	Dor Verbin et.al.	2405.14871	null
2024-05-23	Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling	Liwen Wu et.al.	2405.14847	null
2024-05-23	Camera Relocalization in Shadow-free Neural Radiance Fields	Shiyao Xu et.al.	2405.14824	link
2024-05-23	LDM: Large Tensorial SDF Model for Textured Mesh Generation	Rengan Xie et.al.	2405.14580	link
2024-05-23	JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression	Zihan Zheng et.al.	2405.14452	null
2024-05-22	DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus	Yu Chen et.al.	2405.13943	link
2024-05-22	Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances	Licheng Shen et.al.	2405.13694	null
2024-05-21	MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video	Hongsheng Wang et.al.	2405.12806	null
2024-05-21	Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations	Antoine Legrand et.al.	2405.12728	null
2024-05-20	Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo	Tianqi Liu et.al.	2405.12218	link
2024-05-20	Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents	Guanlin Wu et.al.	2405.12155	null
2024-05-20	NPLMV-PS: Neural Point-Light Multi-View Photometric Stereo	Fotios Logothetis et.al.	2405.12057	null
2024-05-19	Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems	Shengxiang Sun et.al.	2405.11629	null
2024-05-19	R-NeRF: Neural Radiance Fields for Modeling RIS-enabled Wireless Environments	Huiying Yang et.al.	2405.11541	link
2024-05-18	MotionGS : Compact Gaussian Splatting SLAM by Motion Filter	Xinli Guo et.al.	2405.11129	link
2024-05-16	When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models	Xianzheng Ma et.al.	2405.10255	link
2024-05-15	From NeRFs to Gaussian Splats, and Back	Siming He et.al.	2405.09717	link
2024-05-14	Dynamic NeRF: A Review	Jinwei Lin et.al.	2405.08609	null
2024-05-13	Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs	Mingyu Kim et.al.	2405.07857	link
2024-05-12	Point Resampling and Ray Transformation Aid to Editable NeRF Models	Zhenyang Li et.al.	2405.07306	null
2024-05-12	Hologram: Realtime Holographic Overlays via LiDAR Augmented Reconstruction	Ekansh Agrawal et.al.	2405.07178	null
2024-05-11	TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization	Zhen Tan et.al.	2405.07027	link
2024-05-10	LIVE: LaTex Interactive Visual Editing	Jinwei Lin et.al.	2405.06762	null
2024-05-14	SketchDream: Sketch-based Text-to-3D Generation and Editing	Feng-Lin Liu et.al.	2405.06461	null
2024-05-10	Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering	Xiaohan Zhang et.al.	2405.06214	null
2024-05-10	Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation	Bardienus P. Duisterhof et.al.	2405.06181	null
2024-05-09	DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation	Sitian Shen et.al.	2405.05800	null
2024-05-10	NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior	Gihoon Kim et.al.	2405.05749	null
2024-05-09	RPBG: Towards Robust Neural Point-based Graphics in the Wild	Qingtian Zhu et.al.	2405.05663	link
2024-05-09	Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview	Yuhang Ming et.al.	2405.05526	null
2024-05-08	${M^2D}$ NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields	Ning Wang et.al.	2405.05010	null
2024-05-08	DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid	Sidun Liu et.al.	2405.04416	null
2024-05-07	Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications	Markus Hillemann et.al.	2405.04345	null
2024-05-05	Blending Distributed NeRFs with Tri-stage Robust Pose Optimization	Baijun Ye et.al.	2405.02880	null
2024-05-05	MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior	Honghua Chen et.al.	2405.02859	null
2024-05-04	TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes	Christopher Maxey et.al.	2405.02762	null
2024-05-04	ActiveNeuS: Active 3D Reconstruction using Neural Implicit Surface Uncertainty	Hyunseo Kim et.al.	2405.02568	null
2024-05-03	Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning	Dhruva Tirumala et.al.	2405.02425	null
2024-05-03	Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids	Junchen Liu et.al.	2405.02386	link
2024-05-03	WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights	Youngdong Jang et.al.	2405.02066	null
2024-05-02	NeRF in Robotics: A Survey	Guangming Wang et.al.	2405.01333	null
2024-05-04	LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes	Shanlin Sun et.al.	2405.00900	null
2024-05-01	Depth Priors in Removal Neural Radiance Fields	Zhihao Guo et.al.	2405.00630	null
2024-05-01	NeRF-Guided Unsupervised Learning of RGB-D Registration	Zhinan Yu et.al.	2405.00507	null
2024-05-01	RTG-SLAM: Real-time 3D Reconstruction at Scale using Gaussian Splatting	Zhexi Peng et.al.	2404.19706	null
2024-04-30	NeRF-Insert: 3D Local Editing with Multimodal Control Signals	Benet Oriol Sabat et.al.	2404.19204	null
2024-04-29	SAGS: Structure-Aware 3D Gaussian Splatting	Evangelos Ververas et.al.	2404.19149	null
2024-04-29	GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting	Bo Chen et.al.	2404.19040	null
2024-04-29	Embedded Representation Learning Network for Animating Styled Video Portrait	Tianyong Wang et.al.	2404.19038	null
2024-04-29	Simple-RF: Regularizing Sparse Input Radiance Fields with Simpler Solutions	Nagabhushan Somraj et.al.	2404.19015	null
2024-04-28	S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM	Zhiyao Zhang et.al.	2404.18284	null
2024-04-27	DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction	Chenhe Du et.al.	2404.17890	null
2024-04-26	Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields	Tianqi Liu et.al.	2404.17528	link
2024-04-25	Depth Supervised Neural Surface Reconstruction from Airborne Imagery	Vincent Hackstein et.al.	2404.16429	null
2024-04-24	NeRF-XL: Scaling NeRFs with Multiple GPUs	Ruilong Li et.al.	2404.16221	null
2024-04-24	ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images	Jinseo Jeong et.al.	2404.15707	null
2024-04-23	DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft	Sam Earle et.al.	2404.15538	null
2024-04-28	GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting	Hongyun Yu et.al.	2404.14037	null
2024-04-22	NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation	Chi Huang et.al.	2404.13921	null
2024-04-23	CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory	Yunlong Ran et.al.	2404.13896	null
2024-04-26	Neural Radiance Field in Autonomous Driving: A Survey	Lei He et.al.	2404.13816	null
2024-04-26	ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis	Zichen Tang et.al.	2404.13711	link
2024-04-21	Generalizable Novel-View Synthesis using a Stereo Camera	Haechan Lee et.al.	2404.13541	null
2024-04-20	High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces	Baoru Huang et.al.	2404.13437	null
2024-04-20	EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment	Guanghao Li et.al.	2404.13346	link
2024-04-19	FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction	Maria Dronova et.al.	2404.12970	null
2024-04-22	Does Gaussian Splatting need SFM Initialization?	Yalda Foroutan et.al.	2404.12547	null
2024-04-18	MeshLRM: Large Reconstruction Model for High-Quality Mesh	Xinyue Wei et.al.	2404.12385	null
2024-04-18	AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering	Jingfeng Guo et.al.	2404.11897	link
2024-04-18	Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations	Yu Feng et.al.	2404.11852	null
2024-04-17	SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping	Vincent Cartillier et.al.	2404.11419	null
2024-04-16	Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks	Florian Barthel et.al.	2404.10625	null
2024-04-16	Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences	Seungwook Kim et.al.	2404.10603	null
2024-04-16	1st Place Solution for ICCV 2023 OmniObject3D Challenge: Sparse-View Reconstruction	Hang Du et.al.	2404.10441	null
2024-04-16	SRGS: Super-Resolution 3D Gaussian Splatting	Xiang Feng et.al.	2404.10318	link
2024-04-16	Plug-and-Play Acceleration of Occupancy Grid-based NeRF Rendering using VDB Grid and Hierarchical Ray Traversal	Yoshio Kato et.al.	2404.10272	link
2024-04-15	Taming Latent Diffusion Model for Neural Radiance Field Inpainting	Chieh Hubert Lin et.al.	2404.09995	null
2024-04-15	Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video	Hongchi Xia et.al.	2404.09833	null
2024-04-15	DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading	Tong Wu et.al.	2404.09412	null
2024-04-14	VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field	Fei Xue et.al.	2404.09271	link
2024-04-15	OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering	Jingrui Ye et.al.	2404.08449	null
2024-04-12	GPN: Generative Point-based NeRF	Haipeng Wang et.al.	2404.08312	link
2024-04-12	MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance	Yuqun Wu et.al.	2404.08252	null
2024-04-11	Connecting NeRFs, Images, and Text	Francesco Ballerini et.al.	2404.07993	link
2024-04-11	Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation	Keonhee Han et.al.	2404.07933	link
2024-04-12	NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving	William Ljungbergh et.al.	2404.07762	link
2024-04-11	G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images	Zixiong Huang et.al.	2404.07474	link
2024-04-10	SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection	Mathis Kruse et.al.	2404.06832	link
2024-04-10	MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views	Runfa Li et.al.	2404.06753	null
2024-04-10	Bayesian NeRF: Quantifying Uncertainty with Volume Density in Neural Radiance Fields	Sibeak Lee et.al.	2404.06727	link
2024-04-11	SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera	Gaole Dai et.al.	2404.06710	null
2024-04-09	Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion	Fan Yang et.al.	2404.06429	link
2024-04-09	3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis	Zhicheng Lu et.al.	2404.06270	null
2024-04-09	GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields	Arnab Dey et.al.	2404.06246	null
2024-04-09	HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields	Arnab Dey et.al.	2404.06152	null
2024-04-08	Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation	Y. Wang et.al.	2404.05236	null
2024-04-08	StylizedGS: Controllable Stylization for 3D Gaussian Splatting	Dingxi Zhang et.al.	2404.05220	null
2024-04-08	Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos	Fengrui Tian et.al.	2404.05163	link
2024-04-07	CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis	Gyeongjin Kang et.al.	2404.04913	null
2024-04-07	GauU-Scene V2: Expanse Lidar Image Dataset Shows Unreliable Geometric Reconstruction Using Gaussian Splatting and NeRF	Butian Xiong et.al.	2404.04880	null
2024-04-07	NeRF2Points: Large-Scale Point Cloud Generation From Street Views’ Radiance Field Optimization	Peng Tu et.al.	2404.04875	null
2024-04-06	DATENeRF: Depth-Aware Text-based Editing of NeRFs	Sara Rojas et.al.	2404.04526	null
2024-04-05	Robust Gaussian Splatting	François Darmon et.al.	2404.04211	null
2024-04-04	SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer	Zijie Wu et.al.	2404.03736	link
2024-04-07	RaFE: Generative Radiance Fields Restoration	Zhongkai Wu et.al.	2404.03654	null
2024-04-04	OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views	Francis Engelmann et.al.	2404.03650	null
2024-04-04	VF-NeRF: Viewshed Fields for Rigid NeRF Registration	Leo Segre et.al.	2404.03349	null
2024-04-03	GenN2N: Generative NeRF2NeRF Translation	Xiangyue Liu et.al.	2404.02788	link
2024-04-03	LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis	Zehan Zheng et.al.	2404.02742	link
2024-04-03	Neural Radiance Fields with Torch Units	Bingnan Ni et.al.	2404.02617	null
2024-04-03	Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition	Yisheng He et.al.	2404.02514	null
2024-04-02	NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation	Sicheng Li et.al.	2404.02185	null
2024-04-02	Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields	Joshua Ahn et.al.	2404.02155	null
2024-04-02	Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions	Saptarshi Dasgupta et.al.	2404.01812	null
2024-04-01	NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification	Juyeop Han et.al.	2404.01400	null
2024-04-01	NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields	Muhammad Zubair Irshad et.al.	2404.01300	link
2024-04-01	MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space	Armand Comas-Massagué et.al.	2404.01296	null
2024-04-02	StructLDM: Structured Latent Diffusion for 3D Human Generation	Tao Hu et.al.	2404.01241	null
2024-04-01	Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting	Jiarui Meng et.al.	2404.01168	null
2024-04-01	SGCNeRF: Few-Shot Neural Rendering via Sparse Geometric Consistency Guidance	Yuru Xiao et.al.	2404.00992	null
2024-04-01	FlexiDreamer: Single Image-to-3D Generation with FlexiCubes	Ruowen Zhao et.al.	2404.00987	link
2024-04-01	Marrying NeRF with Feature Matching for One-step Pose Estimation	Ronghan Chen et.al.	2404.00891	null
2024-03-29	HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes	Ke Wu et.al.	2403.20159	null
2024-03-29	Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior	Jaehoon Ko et.al.	2403.20153	link
2024-03-29	SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior	Zhongrui Yu et.al.	2403.20079	null
2024-03-29	NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising	Tianchen Deng et.al.	2403.20034	link
2024-03-29	SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image	Yunhao Li et.al.	2403.20018	link
2024-03-29	DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal	Yunhao Li et.al.	2403.20013	link
2024-03-29	Stable Surface Regularization for Fast Few-Shot NeRF	Byeongin Joung et.al.	2403.19985	null
2024-03-29	MI-NeRF: Learning a Single Face NeRF from Multiple Identities	Aggelina Chatziagapi et.al.	2403.19920	null
2024-03-28	Mitigating Motion Blur in Neural Radiance Fields with Events and Frames	Marco Cannici et.al.	2403.19780	link
2024-03-28	SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects	Avinash Ummadisingu et.al.	2403.19607	null
2024-03-28	CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians	Avinash Paliwal et.al.	2403.19495	link
2024-03-28	Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation	Yujin Chen et.al.	2403.19319	null
2024-03-28	Sine Activated Low-Rank Matrices for Parameter Efficient Learning	Yiping Ji et.al.	2403.19243	null
2024-03-29	Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction	Qiuhong Shen et.al.	2403.18795	link
2024-03-27	SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery	Camille Billouard et.al.	2403.18711	link
2024-03-27	Modeling uncertainty for Gaussian Splatting	Luca Savant et.al.	2403.18476	null
2024-03-26	Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians	Kerui Ren et.al.	2403.17898	link
2024-03-26	NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation	Jiahao Chen et.al.	2403.17537	null
2024-03-25	VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation	Yang Chen et.al.	2403.17001	null
2024-03-25	CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs	Yingji Zhong et.al.	2403.16885	null
2024-03-25	Spike-NeRF: Neural Radiance Field Based On Spike Camera	Yijia Guo et.al.	2403.16410	null
2024-03-24	Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields	Haoyuan Wang et.al.	2403.16224	null
2024-03-24	Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes	Takashi Otonari et.al.	2403.16141	null
2024-03-24	CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field	Jiarui Hu et.al.	2403.16095	null
2024-03-24	Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap	Carl Lindström et.al.	2403.16092	null
2024-03-26	PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling	Xiaoyun Zheng et.al.	2403.16080	link
2024-03-24	Semantic Is Enough: Only Semantic Information For NeRF Reconstruction	Ruibo Wang et.al.	2403.16043	null
2024-03-24	Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields	unhong Zhao et.al.	2403.15981	null
2024-03-23	DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation	Mu-Yi Shen et.al.	2403.15791	link
2024-03-23	UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation	Yuliang Guo et.al.	2403.15705	link
2024-03-22	WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization	Jialu Wang et.al.	2403.15272	null
2024-03-21	Hyperspectral Neural Radiance Fields	Gerry Chen et.al.	2403.14839	null
2024-03-21	ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition	Tianhao Wu et.al.	2403.14619	null
2024-03-21	CombiNeRF: A Combination of Regularization Techniques for Few-Shot Neural Radiance Field View Synthesis	Matteo Bonotto et.al.	2403.14412	link
2024-03-21	InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity	Jiabin Liang et.al.	2403.14376	null
2024-03-21	Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions	Jiacong Xu et.al.	2403.14053	link
2024-03-20	MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination	Weiying Wang et.al.	2403.13348	null
2024-03-19	Depth-guided NeRF Training via Earth Mover’s Distance	Anita Rau et.al.	2403.13206	null
2024-03-19	DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images	Zaid Tasneem et.al.	2403.13199	null
2024-03-19	Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering	Mingqi Shao et.al.	2403.12839	null
2024-03-19	Learning Neural Volumetric Pose Features for Camera Localization	Jingyu Lin et.al.	2403.12800	null
2024-03-19	IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model	Matteo Bortolon et.al.	2403.12682	null
2024-03-18	FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos	Florian Philipp Stilz et.al.	2403.12198	null
2024-03-18	ThermoNeRF: Multimodal Neural Radiance Fields for Thermal Novel View Synthesis	Mariam Hassan et.al.	2403.12154	link
2024-03-18	RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF	Sibi Catley-Chandar et.al.	2403.11909	null
2024-03-18	GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors	LI Yang et.al.	2403.11899	null
2024-03-18	Exploring Multi-modal Neural Scene Representations With Applications on Thermal Imaging	Mert Özer et.al.	2403.11865	null
2024-03-19	BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting	Lingzhe Zhao et.al.	2403.11831	link
2024-03-18	Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery	Yuqi Zhang et.al.	2403.11812	link
2024-03-18	DVN-SLAM: Dynamic Visual Neural SLAM Based on Local-Global Encoding	Wenhua Wu et.al.	2403.11776	null
2024-03-18	Exploring 3D-aware Latent Spaces for Efficiently Learning Numerous Scenes	Antoine Schnepf et.al.	2403.11678	null
2024-03-18	UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling	Yujiao Jiang et.al.	2403.11589	null
2024-03-18	Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem	Mincheol Chang et.al.	2403.11573	null
2024-03-17	Creating Seamless 3D Maps Using Radiance Fields	Sai Tarun Sathyan et.al.	2403.11364	null
2024-03-17	SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream	Lin Zhu et.al.	2403.11222	link
2024-03-17	Recent Advances in 3D Gaussian Splatting	Tong Wu et.al.	2403.11134	null
2024-03-17	Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications	Yonggan Fu et.al.	2403.11131	link
2024-03-16	Fast Sparse View Guided NeRF Update for Object Reconfigurations	Ziqi Lu et.al.	2403.11024	null
2024-03-16	HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering	Seunghyeon Seo et.al.	2403.10906	null
2024-03-15	FeatUp: A Model-Agnostic Framework for Features at Any Resolution	Stephanie Fu et.al.	2403.10516	link
2024-03-15	Thermal-NeRF: Neural Radiance Fields from an Infrared Camera	Tianxiang Ye et.al.	2403.10340	link
2024-03-15	Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression	Huy-Hoang Bui et.al.	2403.10297	link
2024-03-15	GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time	Hao Li et.al.	2403.10147	null
2024-03-15	URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields	Bo Xu et.al.	2403.10119	null
2024-03-15	DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video	Huiqiang Sun et.al.	2403.10103	null
2024-03-15	Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience	Xiaohang Yu et.al.	2403.09973	null
2024-03-14	GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping	Yuhang Zheng et.al.	2403.09637	link
2024-03-14	The NeRFect Match: Exploring NeRF Features for Visual Localization	Qunjie Zhou et.al.	2403.09577	null
2024-03-14	VIRUS-NeRF – Vision, InfraRed and UltraSonic based Neural Radiance Fields	Nicolaj Schmid et.al.	2403.09477	link
2024-03-14	3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation	Frank Zhang et.al.	2403.09439	null
2024-03-14	RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes	Thang-Anh-Quan Nguyen et.al.	2403.09419	null
2024-03-14	PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors	Tianyuan Yuan et.al.	2403.09079	link
2024-03-13	Gaussian Splatting in Style	Abhishek Saroha et.al.	2403.08498	null
2024-03-13	StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields	Hongbin Xu et.al.	2403.08310	link
2024-03-13	NeRF-Supervised Feature Point Detection and Description	Ali Youssef et.al.	2403.08156	link
2024-03-12	Q-SLAM: Quadric Representations for Monocular SLAM	Chensheng Peng et.al.	2403.08125	null
2024-03-12	SMURF: Continuous Dynamics for Motion-Deblurring Radiance Fields	Jungho Lee et.al.	2403.07547	link
2024-03-11	SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection	Yifu Tao et.al.	2403.06877	null
2024-03-11	Vosh: Voxel-Mesh Hybrid Representation for Real-Time View Synthesis	Chenhao Zhang et.al.	2403.06505	null
2024-03-13	FSViewFusion: Few-Shots View Generation of Novel Objects	Rukhshanda Hussain et.al.	2403.06394	null
2024-03-10	Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis?	Hanxin Zhu et.al.	2403.06092	null
2024-03-09	Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving	Junyi Cao et.al.	2403.05907	link
2024-03-09	Large Generative Model Assisted 3D Semantic Communication	Feibo Jiang et.al.	2403.05783	null
2024-03-08	GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting	Francesco Palandra et.al.	2403.05154	null
2024-03-08	Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces	Evangelos Skartados et.al.	2403.04508	null
2024-03-07	Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis	Yuanhao Cai et.al.	2403.04116	link
2024-03-08	DNAct: Diffusion Guided Multi-Task 3D Policy Learning	Ge Yan et.al.	2403.04115	null
2024-03-07	Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs	Nikhil Mishra et.al.	2403.04114	link
2024-03-06	GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding	Zi-Ting Chou et.al.	2403.03608	null
2024-03-05	A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel Prediction	Haofan Lu et.al.	2403.03241	null
2024-03-05	Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps	Timothy Chen et.al.	2403.02751	link
2024-03-04	DaReNeRF: Direction-aware Representation for Dynamic Scenes	Ange Lou et.al.	2403.02265	null
2024-03-04	Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views	Shuai Guo et.al.	2403.02063	null
2024-03-02	NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning	Linsheng Chen et.al.	2403.01325	link
2024-03-02	Neural radiance fields-based holography [Invited]	Minsung Kang et.al.	2403.01137	null
2024-03-02	Neural Field Classifiers via Target Encoding and Classification Loss	Xindi Yang et.al.	2403.01058	null
2024-03-01	DISORF: A Distributed Online NeRF Training and Rendering Framework for Mobile Robots	Chunlin Li et.al.	2403.00228	link
2024-02-28	NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images	Jingrui Yu et.al.	2402.18196	link
2024-02-26	Neural Radiance Fields in Medical Imaging: Challenges and Next Steps	Xin Wang et.al.	2402.17797	null
2024-02-27	Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning	Xiaoyu Zhang et.al.	2402.17768	null
2024-02-27	VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction	Jiaqi Lin et.al.	2402.17427	null
2024-02-27	Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis	Zicheng Zhang et.al.	2402.17364	link
2024-02-27	DivAvatar: Diverse 3D Avatar Generation with a Single Prompt	Weijing Tao et.al.	2402.17292	null
2024-02-27	CharNeRF: 3D Character Generation from Concept Art	Eddy Chu et.al.	2402.17115	null
2024-02-26	Disentangled 3D Scene Generation with Layout Learning	Dave Epstein et.al.	2402.16936	null
2024-02-26	CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency	Hanxin Zhu et.al.	2402.16407	null
2024-02-26	SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field	Zetian Song et.al.	2402.16366	null
2024-02-26	DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer	Yizhe Wu et.al.	2402.16308	null
2024-02-22	Consolidating Attention Features for Multi-view Image Editing	Or Patashnik et.al.	2402.14792	null
2024-02-26	FrameNeRF: A Simple and Efficient Framework for Few-shot Novel View Synthesis	Yan Xing et.al.	2402.14586	null
2024-02-22	NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection	Chenxi Huang et.al.	2402.14464	link
2024-02-22	TaylorGrid: Towards Fast and High-Quality Implicit Field Learning via Direct Taylor-based Grid Optimization	Renyi Mao et.al.	2402.14415	null
2024-02-22	Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields	Seungtae Nam et.al.	2402.14196	null
2024-02-21	Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting	Joongho Jo et.al.	2402.13827	null
2024-02-21	SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields	Zhentao Huang et.al.	2402.13510	null
2024-02-20	How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey	Fabio Tosi et.al.	2402.13255	link
2024-02-20	Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields	Bo-Yu Cheng et.al.	2402.13252	link
2024-02-20	NeRF Solves Undersampled MRI Reconstruction	Tae Jun Jang et.al.	2402.13226	null
2024-02-20	OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow	Simon Boeder et.al.	2402.12792	null
2024-02-19	Binary Opacity Grids: Capturing Fine Geometric Detail for Mesh-Based View Synthesis	Christian Reiser et.al.	2402.12377	null
2024-02-19	Colorizing Monochromatic Radiance Fields	Yean Cheng et.al.	2402.12184	null
2024-02-17	Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review	Thang-Anh-Quan Nguyen et.al.	2402.11141	link
2024-02-15	Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions	Muhammad Arbab Arshad et.al.	2402.10344	null
2024-02-14	PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments	Xiuzhong Hu et.al.	2402.09325	link
2024-02-13	Preconditioners for the Stochastic Training of Implicit Neural Representations	Shin-Fang Chng et.al.	2402.08784	null
2024-02-13	NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs	Michael Fischer et.al.	2402.08622	null
2024-02-13	H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields	Minyoung Park et.al.	2402.08138	null
2024-02-12	DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation	Chenchang Li et.al.	2402.07648	null
2024-02-11	BioNeRF: Biologically Plausible Neural Radiance Fields for View Synthesis	Leandro A. Passos et.al.	2402.07310	link
2024-02-11	3D Gaussian as a New Vision Era: A Survey	Ben Fei et.al.	2402.07181	null
2024-02-09	ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting	Georgii Stanishevskii et.al.	2402.06390	link
2024-02-07	NeRF as Non-Distant Environment Emitter in Physics-based Inverse Rendering	Jingwang Ling et.al.	2402.04829	null
2024-02-07	OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding	Guibiao Liao et.al.	2402.04648	link
2024-02-11	BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery	Huiqing Zhang et.al.	2402.04554	null
2024-02-06	Improved Generalization of Weight Space Networks via Augmentations	Aviv Shamsian et.al.	2402.04081	link
2024-02-05	ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis	Bernard Spiegl et.al.	2402.02906	link
2024-02-02	ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields	Xingyu Miao et.al.	2402.01950	link
2024-02-02	Robust Inverse Graphics via Probabilistic Inference	Tuan Anh Le et.al.	2402.01915	link
2024-02-02	HyperPlanes: Hypernetwork Approach to Rapid NeRF Adaptation	Paweł Batorski et.al.	2402.01524	link
2024-02-02	Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses	Mahboubeh Asadi et.al.	2402.01485	null
2024-02-06	GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting	Joanna Waczyńska et.al.	2402.01459	link
2024-02-02	Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization	Zhiyu Zhang et.al.	2402.01380	null
2024-02-06	Taming Uncertainty in Sparse-view Generalizable NeRF via Indirect Diffusion Guidance	Yaokun Li et.al.	2402.01217	null
2024-02-01	ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields	Jiahua Dong et.al.	2402.00864	link
2024-02-01	Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering	Pinxin Liu et.al.	2402.00827	link
2024-01-31	CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting	Jiezhi Yang et.al.	2401.18075	null
2024-02-01	Segment Anything in 3D Gaussians	Xu Hu et.al.	2401.17857	link
2024-01-30	Physical Priors Augmented Event-Based 3D Reconstruction	Jiaxu Wang et.al.	2401.17121	link
2024-01-31	Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting	Yiming Huang et.al.	2401.16416	link
2024-01-29	Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields	Rongkai Ma et.al.	2401.16144	null
2024-01-26	3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field	Zhenyu Bao et.al.	2401.14726	link
2024-01-25	Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation	Jiaxu Wang et.al.	2401.14354	null
2024-01-27	Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation	Minglin Chen et.al.	2401.14257	null
2024-01-24	EndoGaussians: Single View Dynamic Gaussian Splatting for Deformable Endoscopic Tissues Reconstruction	Yangsen Chen et.al.	2401.13352	null
2024-01-23	NeRF-AD: Neural Radiance Field with Attention-based Disentanglement for Talking Face Synthesis	Chongke Bi et.al.	2401.12568	null
2024-01-23	Exploration and Improvement of Nerf-based 3D Scene Editing Techniques	Shun Fang et.al.	2401.12456	null
2024-01-23	Methods and strategies for improving the novel view synthesis quality of neural radiation field	Shun Fang et.al.	2401.12451	null
2024-01-22	Single-View 3D Human Digitalization with Large Reconstruction Models	Zhenzhen Weng et.al.	2401.12175	null
2024-01-22	Scaling Face Interaction Graph Networks to Real World Scenes	Tatiana Lopez-Guevara et.al.	2401.11985	null
2024-01-22	HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs	Zelin Gao et.al.	2401.11711	null
2024-01-23	IPR-NeRF: Ownership Verification meets Neural Radiance Field	Win Kent Ong et.al.	2401.09495	null
2024-01-17	ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization	Weiyao Wang et.al.	2401.08937	null
2024-01-18	ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Process	Kiyohiro Nakayama et.al.	2401.08140	null
2024-01-16	Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities	Xu Yan et.al.	2401.08045	link
2024-01-15	6-DoF Grasp Pose Evaluation and Optimization via Transfer Learning from NeRFs	Gergely Sóti et.al.	2401.07935	null
2024-01-11	TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation	Rajaei Khatib et.al.	2401.06191	null
2024-01-11	Fast High Dynamic Range Radiance Fields for Dynamic Scenes	Guanjun Wu et.al.	2401.06052	null
2024-01-11	CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians	Bin Dou et.al.	2401.05925	null
2024-01-11	GO-NeRF: Generating Virtual Objects in Neural Radiance Fields	Peng Dai et.al.	2401.05750	null
2024-01-10	Diffusion Priors for Dynamic View Synthesis from Monocular Videos	Chaoyang Wang et.al.	2401.05583	null
2024-01-10	InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes	Mohamad Shahbazi et.al.	2401.05335	null
2024-01-10	CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video	Xingyu Miao et.al.	2401.04861	link
2024-01-08	A Survey on 3D Gaussian Splatting	Guikun Chen et.al.	2401.03890	link
2024-01-08	NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation	Casimir Feldmann et.al.	2401.03771	null
2024-01-06	RustNeRF: Robust Neural Radiance Field with Low-Quality Images	Mengfei Li et.al.	2401.03257	null
2024-01-06	Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping	Tongyan Hua et.al.	2401.03203	null
2024-01-05	Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human	Song Bai et.al.	2401.02620	null
2024-01-05	FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRF	Hao Zhang et.al.	2401.02616	link
2024-01-05	Characterizing Satellite Geometry via Accelerated 3D Gaussian Splatting	Van Minh Nguyen et.al.	2401.02588	null
2024-01-03	SIGNeRF: Scene Integrated Generation for Neural Radiance Fields	Jan-Niklas Dihlmann et.al.	2401.01647	null
2024-01-02	Street Gaussians for Modeling Dynamic Urban Scenes	Yunzhi Yan et.al.	2401.01339	link
2024-01-02	Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise	Qinglong Huang et.al.	2401.01216	null
2024-01-02	3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands	Xuan Huang et.al.	2401.00979	link
2024-01-01	Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness Prior	Byeonghyeon Lee et.al.	2401.00825	link
2024-01-02	GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields	Xiao Pan et.al.	2401.00616	null
2023-12-30	Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models	Han Jiang et.al.	2401.00208	null
2023-12-29	Informative Rays Selection for Few-Shot Neural Radiance Fields	Marco Orsingher et.al.	2312.17561	null
2023-12-27	City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web	Kaiwen Song et.al.	2312.16457	link
2023-12-26	DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision	Lu Ling et.al.	2312.16256	null
2023-12-24	SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition	Nikhil Behari et.al.	2312.16215	null
2023-12-23	INFAMOUS-NeRF: ImproviNg FAce MOdeling Using Semantically-Aligned Hypernetworks with Neural Radiance Fields	Andrew Hou et.al.	2312.16197	null
2023-12-26	LangSplat: 3D Language Gaussian Splatting	Minghan Qin et.al.	2312.16084	link
2023-12-26	2D-Guided 3D Gaussian Segmentation	Kun Lan et.al.	2312.16047	null
2023-12-26	Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images	Zhan Lu et.al.	2312.15942	link
2023-12-23	Human101: Training 100+FPS Human Gaussians in 100s from 1 View	Mingwei Li et.al.	2312.15258	link
2023-12-23	Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane	Chen Yang et.al.	2312.15253	link
2023-12-23	CaLDiff: Camera Localization in NeRF via Pose Diffusion	Rashik Shrestha et.al.	2312.15242	null
2023-12-22	PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF	Mohsen Gholami et.al.	2312.14915	link
2023-12-22	Density Uncertainty Quantification with NeRF-Ensembles: Impact of Data and Scene Constraints	Miriam Jäger et.al.	2312.14664	null
2023-12-21	PlatoNeRF: 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar	Tzofi Klinghoffer et.al.	2312.14239	null
2023-12-21	Virtual Pets: Animatable Animal Generation in 3D Scenes	Yen-Chi Cheng et.al.	2312.14154	null
2023-12-21	Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning	Desai Xie et.al.	2312.13980	null
2023-12-21	SyncDreamer for 3D Reconstruction of Endangered Animal Species with NeRF and NeuS	Ahmet Haydar Ornek et.al.	2312.13832	null
2023-12-22	Gaussian Splatting with NeRF-based Color and Opacity	Dawid Malarz et.al.	2312.13729	link
2023-12-21	DyBluRF: Dynamic Deblurring Neural Radiance Fields for Blurry Monocular Video	Minh-Quan Viet Bui et.al.	2312.13528	null
2023-12-21	Visual Tomography: Physically Faithful Volumetric Models of Partially Translucent Objects	David Nakath et.al.	2312.13494	null
2023-12-20	NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields	Jens Naumann et.al.	2312.13471	null
2023-12-20	Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM	Junru Lin et.al.	2312.13332	null
2023-12-20	ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors	Weijia Mao et.al.	2312.13324	null
2023-12-20	UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections	Fangjinhua Wang et.al.	2312.13285	null
2023-12-19	ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields	Xiang Feng et.al.	2312.12122	null
2023-12-19	LHManip: A Dataset for Long-Horizon Language-Grounded Manipulation Tasks in Cluttered Tabletop Environments	Federico Ceola et.al.	2312.12036	link
2023-12-19	MixRT: Mixed Neural Representations For Real-Time NeRF Rendering	Chaojian Li et.al.	2312.11841	null
2023-12-19	Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation	Yuze He et.al.	2312.11774	null
2023-12-15	FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline	Chien-Yu Lin et.al.	2312.11537	null
2023-12-15	Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior	Nan Huang et.al.	2312.11535	null
2023-12-18	GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning	Ye Yuan et.al.	2312.11461	null
2023-12-18	AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis	Dongze Li et.al.	2312.10921	null
2023-12-17	PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields	Boming Zhao et.al.	2312.10649	null
2023-12-19	Learning Dense Correspondence for NeRF-Based Face Reenactment	Songlin Yang et.al.	2312.10422	null
2023-12-15	SlimmeRF: Slimmable Radiance Fields	Shiran Yuan et.al.	2312.10034	link
2023-12-15	LAENeRF: Local Appearance Editing for Neural Radiance Fields	Lukas Radl et.al.	2312.09913	null
2023-12-15	SLS4D: Sparse Latent Space for 4D Novel View Synthesis	Qi-Yuan Feng et.al.	2312.09743	null
2023-12-15	Towards Transferable Targeted 3D Adversarial Attack in the Physical World	Yao Huang et.al.	2312.09558	link
2023-12-14	LatentEditor: Text Driven Local Editing of 3D Scenes	Umar Khalid et.al.	2312.09313	link
2023-12-14	Stable Score Distillation for High-Quality 3D Generation	Boshi Tang et.al.	2312.09305	null
2023-12-14	ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining	Ruoxi Shi et.al.	2312.09249	null
2023-12-15	3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting	Zhiyin Qian et.al.	2312.09228	null
2023-12-15	ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field	Zhangkai Ni et.al.	2312.09095	link
2023-12-15	Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption	Ziteng Cui et.al.	2312.09093	link
2023-12-14	iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching	Yuan Sun et.al.	2312.09031	null
2023-12-14	Scene 3-D Reconstruction System in Scattering Medium	Zhuoyifan Zhang et.al.	2312.09005	null
2023-12-14	CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning	Qingsong Yan et.al.	2312.08760	null
2023-12-14	SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field	Ru Li et.al.	2312.08692	link
2023-12-13	ProNeRF: Learning Efficient Projection-Aware Ray Sampling for Fine-Grained Implicit Neural Radiance Fields	Juan Luis Gonzalez Bello et.al.	2312.08136	null
2023-12-13	Neural Radiance Fields for Transparent Object Using Visual Hull	Heechan Yoon et.al.	2312.08118	null
2023-12-13	uSF: Learning Neural Semantic Field with Uncertainty	Vsevolod Skorokhodov et.al.	2312.08012	link
2023-12-12	COLMAP-Free 3D Gaussian Splatting	Yang Fu et.al.	2312.07504	link
2023-12-12	Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs	Sunghwan Hong et.al.	2312.07246	link
2023-12-12	WaterHE-NeRF: Water-ray Tracing Neural Radiance Fields for Underwater Scene Reconstruction	Jingchun Zhou et.al.	2312.06946	null
2023-12-10	TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video	Minye Wu et.al.	2312.06713	null
2023-12-11	CorresNeRF: Image Correspondence Priors for Neural Radiance Fields	Yixing Lao et.al.	2312.06642	link
2023-12-11	DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior	Tianyu Huang et.al.	2312.06439	link
2023-12-10	NeVRF: Neural Video-based Radiance Fields for Long-duration Sequences	Minye Wu et.al.	2312.05855	null
2023-12-10	IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment	Letian Zhang et.al.	2312.05748	null
2023-12-09	CoGS: Controllable Gaussian Splatting	Heng Yu et.al.	2312.05664	null
2023-12-09	R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning	Zhiling Ye et.al.	2312.05572	null
2023-12-08	Multi-view Inversion for 3D-aware Generative Adversarial Networks	Florian Barthel et.al.	2312.05330	link
2023-12-08	TriHuman : A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis	Heming Zhu et.al.	2312.05161	null
2023-12-08	Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting	Xiaofeng Yang et.al.	2312.04820	null
2023-12-08	Reality’s Canvas, Language’s Brush: Crafting 3D Avatars from Monocular Video	Yuchen Rao et.al.	2312.04784	null
2023-12-07	MuRF: Multi-Baseline Radiance Fields	Haofei Xu et.al.	2312.04565	link
2023-12-07	EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS	Sharath Girish et.al.	2312.04564	link
2023-12-07	Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection	Kohei Yamashita et.al.	2312.04527	null
2023-12-07	Multi-View Unsupervised Image Generation with Cross Attention Guidance	Llukman Cerkezi et.al.	2312.04337	null
2023-12-07	Towards 4D Human Video Stylization	Tiantian Wang et.al.	2312.04143	link
2023-12-07	Identity-Obscured Neural Radiance Fields: Privacy-Preserving 3D Facial Reconstruction	Jiayi Kong et.al.	2312.04106	null
2023-12-06	Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion	Kira Prabhu et.al.	2312.03869	null
2023-12-06	Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle	Youtian Lin et.al.	2312.03431	null
2023-12-06	Artist-Friendly Relightable and Animatable Neural Heads	Yingyan Xu et.al.	2312.03420	null
2023-12-06	Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method	Hongyu Huang et.al.	2312.03372	null
2023-12-06	RING-NeRF: A Versatile Architecture based on Residual Implicit Neural Grids	Doriand Petit et.al.	2312.03357	null
2023-12-06	SO-NeRF: Active View Planning for NeRF using Surrogate Objectives	Keifer Lee et.al.	2312.03266	null
2023-12-06	Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields	Shijie Zhou et.al.	2312.03203	link
2023-12-05	HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces	Haithem Turki et.al.	2312.03160	null
2023-12-05	ReconFusion: 3D Reconstruction with Diffusion Priors	Rundi Wu et.al.	2312.02981	null
2023-12-05	GauHuman: Articulated Gaussian Splatting from Monocular Human Videos	Shoukang Hu et.al.	2312.02973	link
2023-12-05	Alchemist: Parametric Control of Material Properties with Diffusion Models	Prafull Sharma et.al.	2312.02970	null
2023-12-05	MVHumanNet: A Large-scale Dataset of Multi-view Daily Dressing Human Captures	Zhangyang Xiong et.al.	2312.02963	null
2023-12-05	C-NERF: Representing Scene Changes as Directional Consistency Difference-based NeRF	Rui Huang et.al.	2312.02751	link
2023-12-05	Prompt2NeRF-PIL: Fast NeRF Generation via Pretrained Implicit Latent	Jianmeng Liu et.al.	2312.02568	null
2023-12-04	PointNeRF++: A multi-scale, point-based Neural Radiance Field	Weiwei Sun et.al.	2312.02362	null
2023-12-04	Calibrated Uncertainties for Neural Radiance Fields	Niki Amini-Naieni et.al.	2312.02350	null
2023-12-04	Re-Nerfing: Enforcing Geometric Constraints on Neural Radiance Fields through Novel Views Synthesis	Felix Tristram et.al.	2312.02255	null
2023-12-04	ColonNeRF: Neural Radiance Fields for High-Fidelity Long-Sequence Colonoscopy Reconstruction	Yufei Shi et.al.	2312.02015	null
2023-12-04	Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training	Runze He et.al.	2312.01663	null
2023-12-03	SANeRF-HQ: Segment Anything for NeRF in High Quality	Yichen Liu et.al.	2312.01531	null
2023-12-03	VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams	Liao Wang et.al.	2312.01407	null
2023-12-02	Self-Evolving Neural Radiance Fields	Jaewoo Jung et.al.	2312.01003	link
2023-12-01	Gaussian Grouping: Segment and Edit Anything in 3D Scenes	Mingqiao Ye et.al.	2312.00732	link
2023-11-30	LucidDreaming: Controllable Object-Centric 3D Generation	Zhaoning Wang et.al.	2312.00588	null
2023-12-01	FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting	Zehao Zhu et.al.	2312.00451	null
2023-11-30	PyNeRF: Pyramidal Neural Radiance Fields	Haithem Turki et.al.	2312.00252	link
2023-11-30	SparseGS: Real-Time 360° Sparse View Synthesis using Gaussian Splatting	Haolin Xiong et.al.	2312.00206	link
2023-11-30	Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing	Hyelin Nam et.al.	2311.18608	null
2023-11-30	ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs	Violeta Menéndez González et.al.	2311.18491	null
2023-11-30	Anisotropic Neural Representation Learning for High-Quality Neural Rendering	Y. Wang et.al.	2311.18311	null
2023-11-30	CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt	Haiyao Xiao et.al.	2311.18288	null
2023-11-30	Compact3D: Compressing Gaussian Splat Radiance Field Models with Vector Quantization	KL Navaneet et.al.	2311.18159	link
2023-11-29	GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces	Yingwenqi Jiang et.al.	2311.17977	null
2023-11-29	AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text	Jianfeng Zhang et.al.	2311.17917	null
2023-11-29	FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information	Wen Jiang et.al.	2311.17874	link
2023-11-29	Cinematic Behavior Transfer via NeRF-based Differentiable Filming	Xuekun Jiang et.al.	2311.17754	null
2023-11-29	SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis	Ziqiao Peng et.al.	2311.17590	link
2023-11-29	NeRFTAP: Enhancing Transferability of Adversarial Patches on Face Recognition using Neural Radiance Fields	Xiaoliang Liu et.al.	2311.17332	null
2023-11-28	LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS	Zhiwen Fan et.al.	2311.17245	link
2023-11-28	Continuous Pose for Monocular Cameras in Neural Implicit Representation	Qi Ma et.al.	2311.17119	link
2023-11-28	UC-NeRF: Neural Radiance Field for Under-Calibrated multi-view cameras in autonomous driving	Kai Cheng et.al.	2311.16945	null
2023-11-28	The Sky’s the Limit: Re-lightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility	James A. D. Gardner et.al.	2311.16937	link
2023-11-28	SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation	Jesus Zarzar et.al.	2311.16671	link
2023-11-28	DGNR: Density-Guided Neural Point Rendering of Large Driving Scenes	Zhuopeng Li et.al.	2311.16664	null
2023-11-28	SCALAR-NeRF: SCAlable LARge-scale Neural Radiance Fields for Scene Reconstruction	Yu Chen et.al.	2311.16657	null
2023-11-28	Rethinking Directional Integration in Neural Radiance Fields	Congyue Deng et.al.	2311.16504	null
2023-11-27	Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent Synthetic Images	Shiu-hong Kao et.al.	2311.16499	link
2023-11-27	Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling	Zhe Li et.al.	2311.16096	link
2023-11-27	SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using Neural Radiance Fields	Quentin Herau et.al.	2311.15803	null
2023-11-27	CaesarNeRF: Calibrated Semantic Representation for Few-shot Generalizable Neural Rendering	Haidong Zhu et.al.	2311.15510	link
2023-11-26	Efficient Encoding of Graphics Primitives with Simplex-based Structures	Yibo Wen et.al.	2311.15439	null
2023-11-26	Obj-NeRF: Extract Object NeRFs from Multi-view Images	Zhiyi Li et.al.	2311.15291	null
2023-11-26	NeuRAD: Neural Rendering for Autonomous Driving	Adam Tonderski et.al.	2311.15260	link
2023-11-24	Animate124: Animating One Image to 4D Dynamic Scene	Yuyang Zhao et.al.	2311.14603	null
2023-11-24	GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting	Yiwen Chen et.al.	2311.14521	link
2023-11-23	ECRF: Entropy-Constrained Neural Radiance Fields Compression with Frequency Domain Optimization	Soonbin Lee et.al.	2311.14208	null
2023-11-23	Tube-NeRF: Efficient Imitation Learning of Visuomotor Policies from MPC using Tube-Guided Data Augmentation and NeRFs	Andrea Tagliabue et.al.	2311.14153	null
2023-11-23	Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder	Xiaohao Xu et.al.	2311.13750	null
2023-11-22	Compact 3D Gaussian Representation for Radiance Field	Joo Chan Lee et.al.	2311.13681	link
2023-11-22	Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning	Kai Yu et.al.	2311.13617	null
2023-11-22	Animatable 3D Gaussians for High-fidelity Synthesis of Human Motions	Keyang Ye et.al.	2311.13404	null
2023-11-22	Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images	Jaeyoung Chung et.al.	2311.13398	link
2023-11-22	3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization	Jianwei Feng et.al.	2311.13168	null
2023-11-22	PIE-NeRF: Physics-based Interactive Elastodynamics with NeRF	Yutao Feng et.al.	2311.13099	null
2023-11-21	SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering	Antoine Guédon et.al.	2311.12775	link
2023-11-21	Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields	Yifan Wang et.al.	2311.12490	null
2023-11-18	Towards Function Space Mesh Watermarking: Protecting the Copyright of Signed Distance Fields	Xingyu Zhu et.al.	2311.12059	null
2023-11-20	GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding	Hao Li et.al.	2311.11863	null
2023-11-20	Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields	Zhiyuan Min et.al.	2311.11845	link
2023-11-19	GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise	Xinhai Li et.al.	2311.11221	null
2023-11-18	SNI-SLAM: Semantic Neural Implicit SLAM	Siting Zhu et.al.	2311.11016	link
2023-11-18	Structure-Aware Sparse-View X-ray 3D Reconstruction	Yuanhao Cai et.al.	2311.10959	link
2023-11-17	Removing Adverse Volumetric Effects From Trained Neural Radiance Fields	Andreas L. Teigen et.al.	2311.10523	null
2023-11-18	EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices	Jingnan Gao et.al.	2311.09806	null
2023-11-16	Reconstructing Continuous Light Field From Single Coded Image	Yuya Ishikawa et.al.	2311.09646	null
2023-11-15	Single-Image 3D Human Digitization with Shape-Guided Diffusion	Badour AlBahar et.al.	2311.09221	null
2023-11-15	DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model	Yinghao Xu et.al.	2311.09217	null
2023-11-15	Spiking NeRF: Representing the Real-World Geometry by a Discontinuous Representation	Zhanfeng Liao et.al.	2311.09077	link
2023-11-13	$L_0$-Sampler: An $L_{0}$ Model Guided Volume Sampling for NeRF	Liangchen Li et.al.	2311.07044	null
2023-11-11	Aria-NeRF: Multimodal Egocentric View Synthesis	Jiankai Sun et.al.	2311.06455	null
2023-11-10	Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model	Jiahao Li et.al.	2311.06214	null
2023-11-10	A Neural Height-Map Approach for the Binocular Photometric Stereo Problem	Fotios Logothetis et.al.	2311.05958	null
2023-11-09	BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis	Hao-Bin Duan et.al.	2311.05521	link
2023-11-09	Control3D: Towards Controllable Text-to-3D Generation	Yang Chen et.al.	2311.05461	null
2023-11-08	LRM: Large Reconstruction Model for Single Image to 3D	Yicong Hong et.al.	2311.04400	null
2023-11-07	ADFactory: Automated Data Factory for Optical Flow Tasks	Han Ling et.al.	2311.04246	null
2023-11-07	High-fidelity 3D Reconstruction of Plants using Neural Radiance Field	Kewei Hu et.al.	2311.04154	null
2023-11-07	Fast Sun-aligned Outdoor Scene Relighting based on TensoRF	Yeonjin Chang et.al.	2311.03965	null
2023-11-08	UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields	Injae Kim et.al.	2311.03784	link
2023-11-06	Osprey: Multi-Session Autonomous Aerial Mapping with LiDAR-based SLAM and Next Best View Planning	Rowan Border et.al.	2311.03484	null
2023-11-06	Animating NeRFs from Texture Space: A Framework for Pose-Dependent Rendering of Human Performances	Paul Knoll et.al.	2311.03140	null
2023-11-06	InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image	Jianhui Li et.al.	2311.02826	link
2023-11-03	Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields	Jianxiong Shen et.al.	2311.01815	null
2023-11-03	PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation	Yuhan Ding et.al.	2311.01773	null
2023-11-03	Efficient Cloud Pipelines for Neural Radiance Fields	Derek Jacoby et.al.	2311.01659	null
2023-11-02	Novel View Synthesis from a Single RGBD Image for Indoor Scenes	Congrui Hetang et.al.	2311.01065	null
2023-10-31	FPO++: Efficient Encoding and Rendering of Dynamic Neural Radiance Fields by Analyzing and Enhancing Fourier PlenOctrees	Saskia Rabich et.al.	2310.20710	link
2023-10-31	NeRF Revisited: Fixing Quadrature Instability in Volume Rendering	Mikaela Angelina Uy et.al.	2310.20685	null
2023-10-30	Generative Neural Fields by Mixtures of Neural Implicit Functions	Tackgeun You et.al.	2310.19464	null
2023-11-04	TiV-NeRF: Tracking and Mapping via Time-Varying Representation with Dynamic Neural Radiance Fields	Chengyao Duan et.al.	2310.18917	null
2023-10-28	INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings	Amirhossein Kazerouni et.al.	2310.18846	link
2023-10-27	ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image	Kyle Sargent et.al.	2310.17994	link
2023-10-27	Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations	Tristan Aumentado-Armstrong et.al.	2310.17880	null
2023-10-27	HyperFields: Towards Zero-Shot Generation of NeRFs from Text	Sudarshan Babu et.al.	2310.17075	null
2023-10-25	4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via 4D Semantic Segmentation	Dadong Jiang et.al.	2310.16858	null
2023-10-26	LightSpeed: Light and Fast Neural Light Fields on Mobile Devices	Aarush Gupta et.al.	2310.16832	link
2023-10-28	PERF: Panoramic Neural Radiance Field from a Single Panorama	Guangcong Wang et.al.	2310.16831	link
2023-10-25	Open-NeRF: Towards Open Vocabulary NeRF Decomposition	Hao Zhang et.al.	2310.16383	null
2023-10-25	UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception	Christopher Maxey et.al.	2310.16255	null
2023-10-24	Cross-view Self-localization from Synthesized Scene-graphs	Ryogo Yamamoto et.al.	2310.15504	null
2023-10-23	CAwa-NeRF: Instant Learning of Compression-Aware NeRF Features	Omnia Mahmoud et.al.	2310.14695	null
2023-10-23	VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations	Yiying Yang et.al.	2310.14487	null
2023-10-20	ManifoldNeRF: View-dependent Image Feature Supervision for Few-shot Neural Radiance Fields	Daiju Kanaoka et.al.	2310.13670	null
2023-10-20	Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos	Seoha Kim et.al.	2310.13356	link
2023-10-20	UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene	Jiaming Gu et.al.	2310.13263	null
2023-10-18	VQ-NeRF: Neural Reflectance Decomposition and Editing with Vector Quantization	Hongliang Zhong et.al.	2310.11864	null
2023-10-18	Towards Abdominal 3-D Scene Rendering from Laparoscopy Surgical Videos using NeRFs	Khoa Tuan Nguyen et.al.	2310.11645	null
2023-10-16	TraM-NeRF: Tracing Mirror and Near-Perfect Specular Reflections through Neural Radiance Fields	Leif Van Holland et.al.	2310.10650	link
2023-10-16	DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing	Jia-Wei Liu et.al.	2310.10624	null
2023-10-16	Self-supervised Fetal MRI 3D Reconstruction Based on Radiation Diffusion Generation Model	Junpeng Tan et.al.	2310.10209	null
2023-10-15	ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context	Binglun Wang et.al.	2310.09965	null
2023-10-15	Active Perception using Neural Radiance Fields	Siming He et.al.	2310.09892	link
2023-10-15	CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses	Hongyu Fu et.al.	2310.09776	null
2023-10-11	Dynamic Appearance Particle Neural Radiance Field	Ancheng Lin et.al.	2310.07916	null
2023-10-12	PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction	Jia-Wang Bian et.al.	2310.07449	link
2023-10-11	rpcPRF: Generalizable MPI Neural Radiance Field for Satellite Camera	Tongtong Zhang et.al.	2310.07179	null
2023-10-10	Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization	Le Chen et.al.	2310.06984	null
2023-10-10	High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field	Minghan Qin et.al.	2310.06275	null
2023-10-09	A Real-time Method for Inserting Virtual Objects into Neural Radiance Fields	Keyang Ye et.al.	2310.05837	null
2023-10-09	Neural Impostor: Editing Neural Radiance Fields with Explicit Shape Manipulation	Ruiyang Liu et.al.	2310.05391	null
2023-10-08	LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization	Artem Nenashev et.al.	2310.05134	null
2023-10-08	Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation	Dominik Hollidt et.al.	2310.05133	null
2023-10-06	Improving Neural Radiance Field using Near-Surface Sampling with Point Cloud Generation	Hye Bin Yoo et.al.	2310.04152	null
2023-10-05	Drag View: Generalizable Novel View Synthesis with Unposed Imagery	Zhiwen Fan et.al.	2310.03704	link
2023-10-05	Targeted Adversarial Attacks on Generalizable Neural Radiance Fields	Andras Horvath et.al.	2310.03578	null
2023-10-05	BID-NeRF: RGB-D image pose estimation with inverted Neural Radiance Fields	Ágoston István Csehi et.al.	2310.03563	null
2023-10-04	Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation	Yihan Wu et.al.	2310.03125	null
2023-10-04	T $^3$ Bench: Benchmarking Current Progress in Text-to-3D Generation	Yuze He et.al.	2310.02977	link
2023-10-04	ED-NeRF: Efficient Text-Guided Editing of 3D Scene using Latent Space NeRF	Jangho Park et.al.	2310.02712	null
2023-10-05	USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields	Moyang Li et.al.	2310.02687	link
2023-10-03	EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields	Anish Bhattacharya et.al.	2310.02437	link
2023-10-03	Adaptive Multi-NeRF: Exploit Efficient Parallelism in Adaptive Multiple Scale Neural Radiance Field Rendering	Tong Wang et.al.	2310.01881	null
2023-10-03	MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields	Takuhiro Kaneko et.al.	2310.01821	null
2023-10-02	PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments	Xiuzhong Hu et.al.	2310.00874	link
2023-10-01	How Many Views Are Needed to Reconstruct an Unknown Object Using NeRF?	Sicong Pan et.al.	2310.00684	link
2023-10-01	Enabling Neural Radiance Fields (NeRF) for Large-scale Aerial Images – A Multi-tiling Approaching and the Geometry Assessment of NeRF	Ningli Xu et.al.	2310.00530	null
2023-09-30	MMPI: a Flexible Radiance Field Representation by Multiple Multi-plane Images Blending	Yuze He et.al.	2310.00249	null
2023-09-29	Multi-task View Synthesis with Neural Radiance Fields	Shuhong Zheng et.al.	2309.17450	link
2023-09-29	Forward Flow for Novel View Synthesis of Dynamic Scenes	Xiang Guo et.al.	2309.17390	null
2023-09-29	HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field	Xiaochen Zhao et.al.	2309.17128	null
2023-09-28	Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis	Marcel C. Bühler et.al.	2309.16859	null
2023-09-28	MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond	Yixuan Li et.al.	2309.16553	null
2023-09-28	FG-NeRF: Flow-GAN based Probabilistic Neural Radiance Field for Independence-Assumption-Free Uncertainty Estimation	Songlin Wei et.al.	2309.16364	null
2023-09-28	Learning Effective NeRFs and SDFs Representations with 3D Generative Adversarial Networks for 3D Object Generation: Technical Report for ICCV 2023 OmniObject3D Challenge	Zheyuan Yang et.al.	2309.16110	null
2023-09-27	P2I-NET: Mapping Camera Pose to Image via Adversarial Learning for New View Synthesis in Real Indoor Environments	Xujie Kang et.al.	2309.15526	null
2023-09-27	BASED: Bundle-Adjusting Surgical Endoscopic Dynamic Video Reconstruction using Neural Radiance Fields	Shreya Saha et.al.	2309.15329	null
2023-09-26	3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction	Miriam Jäger et.al.	2309.14800	null
2023-09-25	NAS-NeRF: Generative Neural Architecture Search for Neural Radiance Fields	Saeejith Nair et.al.	2309.14293	null
2023-09-25	Variational Inference for Scalable 3D Object-centric Learning	Tianyu Wang et.al.	2309.14010	null
2023-09-24	MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field	Zijiang Yang et.al.	2309.13607	null
2023-09-23	NeRF-Enhanced Outpainting for Faithful Field-of-View Extrapolation	Rui Yu et.al.	2309.13240	null
2023-09-22	NeRRF: 3D Reconstruction and View Synthesis for Transparent and Specular Objects with Neural Refractive-Reflective Fields	Xiaoxue Chen et.al.	2309.13039	link
2023-09-21	ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average Texture and Mesh Encoding	Yu Cheng et.al.	2309.12183	null
2023-09-21	NeuralLabeling: A versatile toolset for labeling vision datasets using Neural Radiance Fields	Floris Erich et.al.	2309.11966	link
2023-09-21	Fast Satellite Tensorial Radiance Field for Multi-date Satellite Imagery of Large Size	Tongtong Zhang et.al.	2309.11767	null
2023-09-21	MarkNerf:Watermarking for Neural Radiance Field	Lifeng Chen et.al.	2309.11747	null
2023-09-21	Rendering stable features improves sampling-based localisation with Neural radiance fields	Boxuan Zhang et.al.	2309.11698	null
2023-09-20	GenLayNeRF: Generalizable Layered Representations with 3D Model Alignment for Multi-Human View Synthesis	Youssef Abdelkareem et.al.	2309.11627	null
2023-09-20	Light Field Diffusion for Single-View Novel View Synthesis	Yifeng Xiong et.al.	2309.11525	null
2023-09-21	Controllable Dynamic Appearance for Neural 3D Portraits	ShahRukh Athar et.al.	2309.11009	null
2023-09-20	Spiking NeRF: Making Bio-inspired Neural Networks See through the Real World	Xingting Yao et.al.	2309.10987	link
2023-09-19	Locally Stylized Neural Radiance Fields	Hong-Wing Pang et.al.	2309.10684	null
2023-09-19	Steganography for Neural Radiance Fields by Backdooring	Weina Dong et.al.	2309.10503	null
2023-09-18	Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach	Rong Liu et.al.	2309.10011	null
2023-09-18	RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision	Mingjie Pan et.al.	2309.09502	link
2023-09-17	NeRF-VINS: A Real-time Neural Radiance Field Map-based Visual-Inertial Navigation System	Saimouli Katragadda et.al.	2309.09295	null
2023-09-16	DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF	Mert Asim Karaoglu et.al.	2309.08927	link
2023-09-15	Robust e-NeRF: NeRF from Sparse & Noisy Events under Non-Uniform Motion	Weng Fei Low et.al.	2309.08596	link
2023-09-14	Gradient based Grasp Pose Optimization on a NeRF that Approximates Grasp Success	Gergely Sóti et.al.	2309.08040	null
2023-09-14	MC-NeRF: Muti-Camera Neural Radiance Fields for Muti-Camera Image Acquisition Systems	Yu Gao et.al.	2309.07846	null
2023-09-14	DT-NeRF: Decomposed Triplane-Hash Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis	Yaoyu Su et.al.	2309.07752	null
2023-09-14	CoRF : Colorizing Radiance Fields using Knowledge Distillation	Ankit Dhiman et.al.	2309.07668	null
2023-09-13	Text-Guided Generation and Editing of Compositional 3D Avatars	Hao Zhang et.al.	2309.07125	null
2023-09-13	Dynamic NeRFs for Soccer Scenes	Sacha Lewin et.al.	2309.06802	link
2023-09-12	Federated Learning for Large-Scale Scene Modeling with Neural Radiance Fields	Teppei Suzuki et.al.	2309.06030	null
2023-09-11	PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics	Claus Smitt et.al.	2309.05339	null
2023-09-10	Text-driven Editing of 3D Scenes without Retraining	Shuangkang Fang et.al.	2309.04917	link
2023-09-09	Mirror-Aware Neural Humans	Daniel Ajisafe et.al.	2309.04750	link
2023-09-08	Dynamic Mesh-Aware Radiance Fields	Yi-Ling Qiao et.al.	2309.04581	null
2023-09-08	DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields	Junzhe Zhang et.al.	2309.04410	link
2023-09-14	SimpleNeRF: Regularizing Sparse Input Neural Radiance Fields with Simpler Solutions	Nagabhushan Somraj et.al.	2309.03955	null
2023-09-07	BluNF: Blueprint Neural Field	Robin Courant et.al.	2309.03933	null
2023-09-07	Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model	Sungwon Hwang et.al.	2309.03550	null
2023-09-06	Bayes’ Rays: Uncertainty Quantification for Neural Radiance Fields	Lily Goli et.al.	2309.03185	link
2023-09-06	ResFields: Residual Neural Fields for Spatiotemporal Signals	Marko Mihajlovic et.al.	2309.03160	link
2023-09-06	Instant Continual Learning of Neural Radiance Fields	Ryan Po et.al.	2309.01811	null
2023-09-04	Adv3D: Generating 3D Adversarial Examples in Driving Scenarios with NeRF	Leheng Li et.al.	2309.01351	null
2023-09-01	SparseSat-NeRF: Dense Depth Supervised Neural Radiance Fields for Sparse Satellite Images	Lulin Zhang et.al.	2309.00277	link
2023-08-24	Improving NeRF Quality by Progressive Camera Placement for Unrestricted Navigation in Complex Environments	Georgios Kopanas et.al.	2309.00014	null
2023-09-03	GHuNeRF: Generalizable Human NeRF from a Monocular Video	Chen Li et.al.	2308.16576	link
2023-08-30	From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications	Shreyank N Gowda et.al.	2308.16041	null
2023-08-30	Drone-NeRF: Efficient NeRF Based 3D Scene Reconstruction for Large-Scale Drone Survey	Zhihao Jia et.al.	2308.15733	null
2023-08-29	Efficient Ray Sampling for Radiance Fields Reconstruction	Shilei Sun et.al.	2308.15547	null
2023-08-29	Pose-Free Neural Radiance Fields via Implicit Pose Regularization	Jiahui Zhang et.al.	2308.15049	null
2023-08-28	CLNeRF: Continual Learning Meets NeRF	Zhipeng Cai et.al.	2308.14816	link
2023-08-26	InsertNeRF: Instilling Generalizability into NeRF with HyperNet Modules	Yanqi Bao et.al.	2308.13897	link
2023-08-24	NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects	Dakshit Agrawal et.al.	2308.12560	link
2023-08-23	Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields	Hyeonseop Song et.al.	2308.11974	null
2023-08-25	Pose Modulated Avatars from Video	Chunjin Song et.al.	2308.11951	null
2023-08-22	Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts	Wenyan Cong et.al.	2308.11793	link
2023-08-22	SAMSNeRF: Segment Anything Model (SAM) Guides Dynamic Surgical Scene Reconstruction by Neural Radiance Field (NeRF)	Ange Lou et.al.	2308.11774	null
2023-08-22	Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views	Wentian Qu et.al.	2308.11198	null
2023-08-22	Efficient View Synthesis with Neural Radiance Distribution Field	Yushuang Wu et.al.	2308.11130	null
2023-08-21	CamP: Camera Preconditioning for Neural Radiance Fields	Keunhong Park et.al.	2308.10902	null
2023-08-20	Strata-NeRF : Neural Radiance Fields for Stratified Scenes	Ankit Dhiman et.al.	2308.10337	null
2023-08-19	HollowNeRF: Pruning Hashgrid-Based NeRFs with Trainable Collision Mitigation	Xiufeng Xie et.al.	2308.10122	null
2023-08-19	AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization	Kun Wang et.al.	2308.10001	null
2023-08-19	Semantic-Human: Neural Rendering of Humans from Monocular Video with Human Parsing	Jie Zhang et.al.	2308.09894	null
2023-08-18	MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection	Junkai Xu et.al.	2308.09421	link
2023-08-18	DReg-NeRF: Deep Registration for Neural Radiance Fields	Yu Chen et.al.	2308.09386	link
2023-08-17	Watch Your Steps: Local Image and Scene Editing by Text Instructions	Ashkan Mirzaei et.al.	2308.08947	null
2023-08-21	Ref-DVGO: Reflection-Aware Direct Voxel Grid Optimization for an Improved Quality-Efficiency Trade-Off in Reflective Scene Reconstruction	Georgios Kouros et.al.	2308.08530	link
2023-08-16	SceNeRFlow: Time-Consistent Reconstruction of General Dynamic Scenes	Edith Tretschk et.al.	2308.08258	null
2023-08-16	Neural radiance fields in the industrial and robotics domain: applications, research opportunities and use cases	Eugen Šlapak et.al.	2308.07118	link
2023-08-14	S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural Fields	Zeke Xie et.al.	2308.07032	link
2023-08-11	Focused Specific Objects NeRF	Yuesong Li et.al.	2308.05970	null
2023-08-11	VERF: Runtime Monitoring of Pose Estimation with Neural Radiance Fields	Dominic Maggio et.al.	2308.05939	null
2023-08-09	WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields	Muyu Xu et.al.	2308.04826	null
2023-08-14	A General Implicit Framework for Fast NeRF Composition and Rendering	Xinyu Gao et.al.	2308.04669	null
2023-08-08	Digging into Depth Priors for Outdoor Neural Radiance Fields	Chen Wang et.al.	2308.04413	null
2023-08-07	Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing	Junyi Zeng et.al.	2308.03280	null
2023-08-05	Where and How: Mitigating Confusion in Neural Radiance Fields from Sparse Inputs	Yanqi Bao et.al.	2308.02908	link
2023-08-05	Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis	Yuxin Wang et.al.	2308.02840	null
2023-08-05	NeRFs: The Search for the Best 3D Representation	Ravi Ramamoorthi et.al.	2308.02751	null
2023-08-04	ES-MVSNet: Efficient Framework for End-to-end Self-supervised Multi-View Stereo	Qiang Zhou et.al.	2308.02191	null
2023-08-02	Incorporating Season and Solar Specificity into Renderings made by a NeRF Architecture using Satellite Images	Michael Gableman et.al.	2308.01262	link
2023-08-01	High-Fidelity Eye Animatable Neural Radiance Fields for Human Face	Hengfei Wang et.al.	2308.00773	null
2023-08-01	Context-Aware Talking-Head Video Editing	Songlin Yang et.al.	2308.00462	null
2023-07-28	Dynamic PlenOctree for Adaptive Sampling Refinement in Explicit NeRF	Haotian Bai et.al.	2307.15333	null
2023-07-27	Seal-3D: Interactive Pixel-Level Editing for Neural Radiance Fields	Xiangyu Wang et.al.	2307.15131	link
2023-07-27	MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving	Zirui Wu et.al.	2307.15058	link
2023-07-27	NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection	Chenfeng Xu et.al.	2307.14620	link
2023-07-26	Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation	Chaohui Yu et.al.	2307.13908	null
2023-07-24	Dyn-E: Local Appearance Editing of Dynamic Neural Radiance Fields	Shangzhan Zhang et.al.	2307.12909	null
2023-07-24	CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components	Davide Di Nucci et.al.	2307.12718	null
2023-07-23	TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering	Xiao Pan et.al.	2307.12291	null
2023-07-29	CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields	Ziyuan Luo et.al.	2307.11526	link
2023-07-21	FaceCLIPNeRF: Text-driven 3D Face Manipulation using Deformable Neural Radiance Fields	Sungwon Hwang et.al.	2307.11418	null
2023-07-21	Tri-MipRF: Tri-Mip Representation for Efficient Anti-Aliasing Neural Radiance Fields	Wenbo Hu et.al.	2307.11335	null
2023-07-20	Urban Radiance Field Representation with Deformable Neural Mesh Primitives	Fan Lu et.al.	2307.10776	null
2023-07-20	Lighting up NeRF via Unsupervised Decomposition and Enhancement	Haoyuan Wang et.al.	2307.10664	link
2023-07-19	An Improved NeuMIP with Better Accuracy	Bowen Xue et.al.	2307.10135	null
2023-07-19	Magic NeRF Lens: Interactive Fusion of Neural Radiance Fields for Virtual Facility Inspection	Ke Li et.al.	2307.09860	link
2023-07-14	Transient Neural Radiance Fields for Lidar View Synthesis and 3D Reconstruction	Anagh Malik et.al.	2307.09555	null
2023-07-18	Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis	Jiahe Li et.al.	2307.09323	link
2023-07-16	Cross-Ray Neural Radiance Fields for Novel-view Synthesis from Unconstrained Image Collections	Yifan Yang et.al.	2307.08093	link
2023-07-15	Improving NeRF with Height Data for Utilization of GIS Data	Hinata Aoki et.al.	2307.07729	null
2023-07-11	SAR-NeRF: Neural Radiance Fields for Synthetic Aperture Radar Multi-View Representation	Zhengxin Lei et.al.	2307.05087	null
2023-07-07	NOFA: NeRF-based One-shot Facial Avatar Reconstruction	Wangbo Yu et.al.	2307.03441	null
2023-07-07	RGB-D Mapping and Tracking in a Plenoxel Radiance Field	Andreas L. Teigen et.al.	2307.03404	link
2023-07-16	FlipNeRF: Flipped Reflection Rays for Few-shot Novel View Synthesis	Seunghyeon Seo et.al.	2306.17723	link
2023-07-03	Sphere2Vec: A General-Purpose Location Representation Learning over a Spherical Surface for Large-Scale Geospatial Predictions	Gengchen Mai et.al.	2306.17624	null
2023-06-28	Envisioning a Next Generation Extended Reality Conferencing System with Efficient Photorealistic Human Rendering	Chuanyue Shen et.al.	2306.16541	null
2023-06-27	Unsupervised Polychromatic Neural Representation for CT Metal Artifact Reduction	Qing Wu et.al.	2306.15203	link
2023-06-22	Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields	Ori Gordon et.al.	2306.12760	link
2023-06-21	Local 3D Editing via 3D Distillation of CLIP Knowledge	Junha Hyung et.al.	2306.12570	null
2023-06-21	Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase	Qiuyu Wang et.al.	2306.12423	link
2023-06-21	DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation	Yukun Huang et.al.	2306.12422	null
2023-06-20	NeRF synthesis with shading guidance	Chenbin Li et.al.	2306.11556	null
2023-06-24	MA-NeRF: Motion-Assisted Neural Radiance Fields for Face Synthesis from Sparse Images	Weichen Zhang et.al.	2306.10350	null
2023-06-15	Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion Model	Lu Yu et.al.	2306.09551	null
2023-06-16	UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video	Zhi-Hao Lin et.al.	2306.09349	null
2023-06-13	DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$	Allan Jabri et.al.	2306.08068	null
2023-06-13	Binary Radiance Fields	Seungjoo Shin et.al.	2306.07581	null
2023-06-10	From NeRFLiX to NeRFLiX++: A General NeRF-Agnostic Restorer Paradigm	Kun Zhou et.al.	2306.06388	null
2023-06-15	NERFBK: A High-Quality Benchmark for NERF-Based 3D Reconstruction	Ali Karami et.al.	2306.06300	link
2023-06-09	HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork	Bipasha Sen et.al.	2306.06093	null
2023-06-09	GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields	Barbara Roessle et.al.	2306.06044	null
2023-06-09	RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models	Xingchen Zhou et.al.	2306.05668	null
2023-06-08	LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs	Zezhou Cheng et.al.	2306.05410	null
2023-06-08	Enhance-NeRF: Multiple Performance Evaluation for Neural Radiance Fields	Qianqiu Tan et.al.	2306.05303	link
2023-06-06	Towards Visual Foundational Models of Physical Scenes	Chethan Parameshwara et.al.	2306.03727	null
2023-06-06	Human 3D Avatar Modeling with Implicit Neural Representation: A Brief Survey	Mingyang Sun et.al.	2306.03576	null
2023-06-05	H2-Mapping: Real-time Dense Mapping Using Hierarchical Hybrid Representation	Chenxing Jiang et.al.	2306.03207	link
2023-06-05	BeyondPixels: A Comprehensive Review of the Evolution of Neural Radiance Fields	AKM Shahariar Azad Rabby et.al.	2306.03000	null
2023-06-05	ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields	Kanghyeok Ko et.al.	2306.02741	null
2023-06-01	FDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models	Hao Zhang et.al.	2306.00783	link
2023-06-01	Analyzing the Internals of Neural Radiance Fields	Lukas Radl et.al.	2306.00696	link
2023-06-02	AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars	Mohit Mendiratta et.al.	2306.00547	null
2023-05-30	DäRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation	Jiuhn Song et.al.	2305.19201	link
2023-05-30	Template-free Articulated Neural Point Clouds for Reposable View Synthesis	Lukas Uzolas et.al.	2305.19065	link
2023-05-31	HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance	Junzhe Zhu et.al.	2305.18766	link
2023-05-31	Towards a Robust Framework for NeRF Evaluation	Adrian Azzarelli et.al.	2305.18079	link
2023-05-31	Volume Feature Rendering for Fast Neural Radiance Field Reconstruction	Kang Han et.al.	2305.17916	null
2023-05-30	PlaNeRF: SVD Unsupervised 3D Plane Regularization for NeRF Large-Scale Scene Reconstruction	Fusang Wang et.al.	2305.16914	null
2023-05-25	ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image	Zhenzhen Weng et.al.	2305.16411	null
2023-05-25	Interactive Segment Anything NeRF with Feature Imitation	Xiaokang Chen et.al.	2305.16233	null
2023-05-25	ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation	Zhengyi Wang et.al.	2305.16213	link
2023-05-31	Deceptive-NeRF: Enhancing NeRF Reconstruction using Pseudo-Observations from Diffusion Models	Xinhang Liu et.al.	2305.15171	null
2023-05-24	InpaintNeRF360: Text-Guided 3D Inpainting on Unbounded Neural Radiance Fields	Dongqing Wang et.al.	2305.15094	null
2023-05-24	OD-NeRF: Efficient Training of On-the-Fly Dynamic Neural Radiance Fields	Zhiwen Yan et.al.	2305.14831	null
2023-05-24	3D Open-vocabulary Segmentation with Foundation Models	Kunhao Liu et.al.	2305.14093	link
2023-05-22	NeRFuser: Large-Scale Scene Representation by NeRF Fusion	Jiading Fang et.al.	2305.13307	link
2023-05-22	Registering Neural Radiance Fields as 3D Density Images	Han Jiang et.al.	2305.12843	null
2023-05-19	Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields	Jingbo Zhang et.al.	2305.11588	link
2023-05-18	MVPSNet: Fast Generalizable Multi-view Photometric Stereo	Dongxu Zhao et.al.	2305.11167	null
2023-05-18	ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis	Shoukang Hu et.al.	2305.11031	link
2023-05-17	MultiPlaneNeRF: Neural Radiance Field with Non-Trainable Representation	Dominik Zimny et.al.	2305.10579	link
2023-05-24	OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields	Youtan Yin et.al.	2305.10503	link
2023-05-16	NerfBridge: Bringing Real-time, Online Neural Radiance Field Training to Robotics	Javier Yu et.al.	2305.09761	link
2023-05-15	MV-Map: Offboard HD-Map Generation with Multi-view Consistency	Ziyang Xie et.al.	2305.08851	link
2023-05-12	BundleRecon: Ray Bundle-Based 3D Neural Reconstruction	Weikun Zhang et.al.	2305.07342	null
2023-05-10	Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era	Chenghao Li et.al.	2305.06131	null
2023-05-10	NeRF $^\textbf{2}$ : Neural Radio-Frequency Radiance Fields	Xiaopeng Zhao et.al.	2305.06118	null
2023-05-09	Instant-NeRF: Instant On-Device Neural Radiance Field Training via Algorithm-Accelerator Co-Designed Near-Memory Processing	Yang Zhao et.al.	2305.05766	null
2023-05-09	PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces	Yiqun Wang et.al.	2305.05594	link
2023-05-08	NerfAcc: Efficient Sampling Accelerates NeRFs	Ruilong Li et.al.	2305.04966	null
2023-05-08	AvatarReX: Real-time Expressive Full-body Avatars	Zerong Zheng et.al.	2305.04789	null
2023-05-07	HashCC: Lightweight Method to Improve the Quality of the Camera-less NeRF Scene Generation	Jan Olszewski et.al.	2305.04296	null
2023-05-07	Multi-Space Neural Radiance Fields	Ze-Xin Yin et.al.	2305.04268	null
2023-05-04	NeRF-QA: Neural Radiance Fields Quality Assessment Database	Pedro Martin et.al.	2305.03176	null
2023-05-04	NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds	Jun-Kun Chen et.al.	2305.03049	null
2023-05-04	Radiance Field Gradient Scaling for Unbiased Near-Camera Training	Julien Philip et.al.	2305.02756	link
2023-05-04	Semantic-aware Generation of Multi-view Portrait Drawings	Biao Ma et.al.	2305.02618	link
2023-05-02	Neural LiDAR Fields for Novel View Synthesis	Shengyu Huang et.al.	2305.01643	null
2023-05-03	LatentAvatar: Learning Latent Expression Code for Expressive Neural Head Avatar	Yuelang Xu et.al.	2305.01190	null
2023-05-02	Federated Neural Radiance Fields	Lachlan Holden et.al.	2305.01163	link
2023-05-01	GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation	Zhenhui Ye et.al.	2305.00787	null
2023-04-30	Neural Radiance Fields (NeRFs): A Review and Some Recent Developments	Mohamed Debbagh et.al.	2305.00375	null
2023-04-28	ViP-NeRF: Visibility Prior for Sparse Input Neural Radiance Fields	Nagabhushan Somraj et.al.	2305.00041	link
2023-04-28	NeRF-LiDAR: Generating Realistic LiDAR Point Clouds with Neural Radiance Fields	Junge Zhang et.al.	2304.14811	link
2023-04-27	Learning a Diffusion Prior for NeRFs	Guandao Yang et.al.	2304.14473	null
2023-04-27	ActorsNeRF: Animatable Few-shot Human Rendering with Generalizable NeRFs	Jiteng Mu et.al.	2304.14401	null
2023-05-03	Combining HoloLens with Instant-NeRFs: Advanced Real-Time 3D Mobile Mapping	Dennis Haitz et.al.	2304.14301	null
2023-04-27	Compositional 3D Human-Object Neural Animation	Zhi Hou et.al.	2304.14070	null
2023-04-26	Super-NeRF: View-consistent Detail Generation for NeRF super-resolution	Yuqi Han et.al.	2304.13518	null
2023-04-26	VGOS: Voxel Grid Optimization for View Synthesis from Sparse Inputs	Jiakai Sun et.al.	2304.13386	link
2023-04-25	Local Implicit Ray Function for Generalizable Radiance Field Representation	Xin Huang et.al.	2304.12746	null
2023-04-27	MF-NeRF: Memory Efficient NeRF with Mixed-Feature Hash Table	Yongjae Lee et.al.	2304.12587	link
2023-04-24	Instant-3D: Instant Neural Radiance Field Training Towards On-Device AR/VR 3D Reconstruction	Sixu Li et.al.	2304.12467	null
2023-04-24	TextMesh: Generation of Realistic 3D Meshes From Text Prompts	Christina Tsalicoglou et.al.	2304.12439	null
2023-04-26	Segment Anything in 3D with NeRFs	Jiazhong Cen et.al.	2304.12308	link
2023-04-24	Explicit Correspondence Matching for Generalizable Neural Radiance Fields	Yuedong Chen et.al.	2304.12294	link
2023-04-25	Gen-NeRF: Efficient and Generalizable Neural Radiance Fields via Algorithm-Hardware Co-Design	Yonggan Fu et.al.	2304.11842	null
2023-04-22	3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes	Haotian Xue et.al.	2304.11470	null
2023-04-22	Dehazing-NeRF: Neural Radiance Fields from Hazy Images	Tian Li et.al.	2304.11448	null
2023-04-22	NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation	Baao Xie et.al.	2304.11342	link
2023-04-21	AutoNeRF: Training Implicit Scene Representations with Autonomous Agents	Pierre Marza et.al.	2304.11241	link
2023-04-21	Omni-Line-of-Sight Imaging for Holistic Shape Reconstruction	Binbin Huang et.al.	2304.10780	null
2023-04-20	A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion	Miriam Jäger et.al.	2304.10664	null
2023-04-20	Learning Neural Duplex Radiance Fields for Real-Time View Synthesis	Ziyu Wan et.al.	2304.10537	null
2023-04-21	Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs	Frederik Warburg et.al.	2304.10532	link
2023-04-20	ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects	Marco Toschi et.al.	2304.10448	null
2023-04-20	LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields	Tang Tao et.al.	2304.10406	link
2023-04-20	Revisiting Implicit Neural Representations in Low-Level Vision	Wentian Xu et.al.	2304.10250	link
2023-04-20	Multiscale Representation for Real-Time Anti-Aliasing Neural Rendering	Dongting Hu et.al.	2304.10075	null
2023-04-20	Neural Radiance Fields: Past, Present, and Future	Ansh Mittal et.al.	2304.10050	link
2023-04-19	Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra	Jonas Kulhanek et.al.	2304.09987	link
2023-04-20	Reference-guided Controllable Inpainting of Neural Radiance Fields	Ashkan Mirzaei et.al.	2304.09677	null
2023-04-18	SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes	Yiming Gao et.al.	2304.08971	null
2023-04-18	NeAI: A Pre-convoluted Representation for Plug-and-Play Neural Ambient Illumination	Yiyu Zhuang et.al.	2304.08757	null
2023-04-17	MoDA: Modeling Deformable 3D Objects from Casual Videos	Chaoyue Song et.al.	2304.08279	link
2023-04-17	NeRF-Loc: Visual Localization with Conditional Neural Radiance Field	Jianlin Liu et.al.	2304.07979	link
2023-04-16	Likelihood-Based Generative Radiance Field with Latent Space Energy-Based Model for 3D-Aware Disentangled Image Representation	Yaxuan Zhu et.al.	2304.07918	null
2023-04-16	CAT-NeRF: Constancy-Aware Tx $^2$ Former for Dynamic Body Modeling	Haidong Zhu et.al.	2304.07915	link
2023-04-16	SeaThru-NeRF: Neural Radiance Fields in Scattering Media	Deborah Levy et.al.	2304.07743	link
2023-04-14	UVA: Towards Unified Volumetric Avatar for View Synthesis, Pose rendering, Geometry and Texture Editing	Jinlong Fan et.al.	2304.06969	null
2023-04-17	Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction	Hansheng Chen et.al.	2304.06714	link
2023-04-13	Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields	Jonathan T. Barron et.al.	2304.06706	null
2023-04-13	NeRFVS: Neural Radiance Fields for Free View Synthesis via Geometry Scaffolds	Chen Yang et.al.	2304.06287	null
2023-04-12	NutritionVerse-Thin: An Optimized Strategy for Enabling Improved Rendering of 3D Thin Food Models	Chi-en Amy Tai et.al.	2304.05620	null
2023-04-11	Improving Neural Radiance Fields with Depth-aware Optimization for Novel View Synthesis	Shu Chen et.al.	2304.05218	link
2023-04-11	One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field	Weichuang Li et.al.	2304.05097	null
2023-04-11	MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields	Ganlin Yang et.al.	2304.04962	link
2023-04-10	Neural Image-based Avatars: Generalizable Radiance Fields for Human Avatar Modeling	Youngjoong Kwon et.al.	2304.04897	null
2023-04-07	Event-based Camera Tracker by $\nabla$ t NeRF	Mana Masuda et.al.	2304.04559	null
2023-04-10	Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos	Liao Wang et.al.	2304.04452	null
2023-04-10	Inferring Fluid Dynamics via Inverse Rendering	Jinxian Liu et.al.	2304.04446	null
2023-04-10	Instance Neural Radiance Field	Benran Hu et.al.	2304.04395	link
2023-04-12	NeRF applied to satellite imagery for surface reconstruction	Federico Semeraro et.al.	2304.04133	link
2023-04-08	PVD-AL: Progressive Volume Distillation with Active Learning for Efficient Conversion Between Different NeRF Architectures	Shuangkang Fang et.al.	2304.04012	link
2023-04-07	Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field	Leheng Li et.al.	2304.03526	null
2023-04-06	Beyond NeRF Underwater: Learning Neural Reflectance Fields for True Color Correction of Marine Imagery	Tianyi Zhang et.al.	2304.03384	link
2023-04-06	LANe: Lighting-Aware Neural Fields for Compositional Scene Synthesis	Akshay Krishnan et.al.	2304.03280	null
2023-04-06	Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes	Zian Wang et.al.	2304.03266	null
2023-04-06	DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model	Hoigi Seo et.al.	2304.02827	null
2023-04-05	Image Stabilization for Hololens Camera in Remote Collaboration	Gowtham Senthil et.al.	2304.02736	null
2023-04-04	Generating Continual Human Motion in Diverse 3D Scenes	Aymen Mir et.al.	2304.02061	null
2023-04-04	MonoHuman: Animatable Human Neural Field from Monocular Video	Zhengming Yu et.al.	2304.02001	null
2023-04-06	DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models	Yukang Cao et.al.	2304.00916	link
2023-04-01	JacobiNeRF: NeRF Shaping with Mutual Information Gradients	Xiaomeng Xu et.al.	2304.00341	link
2023-03-31	VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization	Bingfan Zhu et.al.	2303.17968	link
2023-03-30	NeRF-Supervised Deep Stereo	Fabio Tosi et.al.	2303.17603	link
2023-03-30	SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling	Zhitao Yang et.al.	2303.17368	link
2023-03-30	NeILF++: Inter-Reflectable Light Fields for Geometry and Material Estimation	Jingyang Zhang et.al.	2303.17147	null
2023-03-30	Enhanced Stable View Synthesis	Nishant Jain et.al.	2303.17094	null
2023-03-29	TriVol: Point Cloud Rendering via Triple Volumes	Tao Hu et.al.	2303.16485	link
2023-03-29	Point2Pix: Photo-Realistic Point Cloud Rendering via Neural Radiance Fields	Tao Hu et.al.	2303.16482	null
2023-03-28	Flow supervision for Deformable NeRF	Chaoyang Wang et.al.	2303.16333	null
2023-03-28	SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis	Guangcong Wang et.al.	2303.16196	link
2023-03-28	VMesh: Hybrid Volume-Mesh Representation for Efficient View Synthesis	Yuan-Chen Guo et.al.	2303.16184	null
2023-03-30	Adaptive Voronoi NeRFs	Tim Elsner et.al.	2303.16001	null
2023-03-28	F $^{2}$ -NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories	Peng Wang et.al.	2303.15951	link
2023-03-27	JAWS: Just A Wild Shot for Cinematic Transfer in Neural Radiance Fields	Xi Wang et.al.	2303.15427	link
2023-03-27	Generalizable Neural Voxels for Fast Human Radiance Fields	Taoran Yi et.al.	2303.15387	null
2023-03-27	NeUDF: Learning Unsigned Distance Fields from Multi-view Images for Reconstructing Non-watertight Models	Fei Hou et.al.	2303.15368	link
2023-03-24	Perceptual Quality Assessment of NeRF and Neural View Synthesis Methods for Front-Facing Views	Hanxue Liang et.al.	2303.15206	null
2023-03-27	3D-Aware Multi-Class Image-to-Image Translation with NeRFs	Senmao Li et.al.	2303.15012	link
2023-03-26	Clean-NeRF: Reformulating NeRF to account for View-Dependent Observations	Xinhang Liu et.al.	2303.14707	null
2023-03-25	SUDS: Scalable Urban Dynamic Scenes	Haithem Turki et.al.	2303.14536	null
2023-03-25	DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields	Yu Chen et.al.	2303.14478	null
2023-03-25	NeRF-DS: Neural Radiance Fields for Dynamic Specular Objects	Zhiwen Yan et.al.	2303.14435	link
2023-03-24	Grid-guided Neural Radiance Fields for Large Urban Scenes	Linning Xu et.al.	2303.14001	null
2023-03-24	CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D Scene Layout	Yiqi Lin et.al.	2303.13843	null
2023-03-24	HandNeRF: Neural Radiance Fields for Animatable Interacting Hands	Zhiyang Guo et.al.	2303.13825	null
2023-03-24	ABLE-NeRF: Attention-Based Rendering with Learnable Embeddings for Neural Radiance Field	Zhe Jun Tang et.al.	2303.13817	link
2023-03-24	GM-NeRF: Learning Generalizable Model-based Neural Radiance Fields from Multi-view Images	Jianchuan Chen et.al.	2303.13777	null
2023-03-24	TEGLO: High Fidelity Canonical Texture Mapping from Single-View Images	Vishal Vinod et.al.	2303.13743	null
2023-03-23	SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates	Mikaela Angelina Uy et.al.	2303.13582	null
2023-03-23	TriPlaneNet: An Encoder for EG3D Inversion	Ananta R. Bhattarai et.al.	2303.13497	null
2023-03-23	Plotting Behind the Scenes: Towards Learnable Game Engines	Willi Menapace et.al.	2303.13472	null
2023-03-23	Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes	Dana Cohen-Bar et.al.	2303.13450	link
2023-03-23	SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field	Chong Bao et.al.	2303.13277	link
2023-03-23	Transforming Radiance Field with Lipschitz Network for Photorealistic 3D Scene Stylization	Zicheng Zhang et.al.	2303.13232	null
2023-03-23	Semantic Ray: Learning a Generalizable Semantic Field with Cross-Reprojection Attention	Fangfu Liu et.al.	2303.13014	link
2023-03-22	NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions	Mohamad Shahbazi et.al.	2303.12865	link
2023-03-22	SHERF: Generalizable Human NeRF from a Single Image	Shoukang Hu et.al.	2303.12791	link
2023-03-22	Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions	Ayaan Haque et.al.	2303.12789	null
2023-03-22	FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models	Jianglong Ye et.al.	2303.12786	link
2023-03-22	Balanced Spherical Grid for Egocentric View Synthesis	Changwoon Choi et.al.	2303.12408	link
2023-03-21	Pre-NeRF 360: Enriching Unbounded Appearances for Neural Radiance Fields	Ahmad AlMughrabi et.al.	2303.12234	link
2023-03-21	3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion	Yu-Jhe Li et.al.	2303.11938	null
2023-03-22	ExtremeNeRF: Few-shot Neural Radiance Fields Under Unconstrained Illumination	SeokYeong Lee et.al.	2303.11728	null
2023-03-20	DehazeNeRF: Multiple Image Haze Removal and 3D Shape Reconstruction using Neural Radiance Fields	Wei-Ting Chen et.al.	2303.11364	null
2023-03-20	ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-real Novel View Synthesis via Contrastive Learning	Hao Yang et.al.	2303.11052	null
2023-03-19	SKED: Sketch-guided Text-based 3D Editing	Aryan Mikaeili et.al.	2303.10735	null
2023-03-19	NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping	Junyuan Deng et.al.	2303.10709	link
2023-03-18	3D Data Augmentation for Driving Scenes on Camera	Wenwen Tong et.al.	2303.10340	null
2023-03-17	$α$ Surf: Implicit Surface Reconstruction for Semi-Transparent and Thin Objects with Decoupled Geometry and Opacity	Tianhao Wu et.al.	2303.10083	null
2023-03-17	Single-view Neural Radiance Fields with Depth Teacher	Yurui Chen et.al.	2303.09952	null
2023-03-21	PartNeRF: Generating Part-Aware Editable 3D Shapes without 3D Supervision	Konstantinos Tertikas et.al.	2303.09554	null
2023-03-16	LERF: Language Embedded Radiance Fields	Justin Kerr et.al.	2303.09553	null
2023-03-16	NeRFMeshing: Distilling Neural Radiance Fields into Geometrically-Accurate 3D Meshes	Marie-Julie Rakotosaona et.al.	2303.09431	null
2023-03-17	NeRFtrinsic Four: An End-To-End Trainable NeRF Jointly Optimizing Diverse Intrinsic and Extrinsic Camera Parameters	Hannah Schieber et.al.	2303.09412	link
2023-03-16	Reliable Image Dehazing by NeRF	Zheyan Jin et.al.	2303.09153	null
2023-03-15	Mesh Strikes Back: Fast and Efficient Human Reconstruction from RGB videos	Rohit Jena et.al.	2303.08808	null
2023-03-15	Re-ReND: Real-time Rendering of NeRFs across Devices	Sara Rojas et.al.	2303.08717	link
2023-03-15	RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters	Shuja Khalid et.al.	2303.08695	null
2023-03-15	Harnessing Low-Frequency Neural Fields for Few-Shot View Synthesis	Liangchen Song et.al.	2303.08370	link
2023-03-14	MELON: NeRF with Unposed Images Using Equivalence Class Estimation	Axel Levy et.al.	2303.08096	null
2023-03-16	Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation	Junyoung Seo et.al.	2303.07937	link
2023-03-16	NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction from Multi-view Images	Yunfan Ye et.al.	2303.07653	link
2023-03-14	Frequency-Modulated Point Cloud Rendering with Easy Editing	Yi Zhang et.al.	2303.07596	link
2023-03-13	FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization	Jiawei Yang et.al.	2303.07418	link
2023-03-13	NeRFLiX: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-viewpoint MiXer	Kun Zhou et.al.	2303.06919	link
2023-03-11	Just Flip: Flipped Observation Generation and Optimization for Neural Radiance Fields to Cover Unobserved View	Minjae Lee et.al.	2303.06335	link
2023-03-10	NeRFlame: FLAME-based conditioning of NeRF for 3D face rendering	Wojciech Zając et.al.	2303.06226	link
2023-03-10	You Only Train Once: Multi-Identity Free-Viewpoint Neural Human Rendering from Monocular Videos	Jaehyeok Kim et.al.	2303.05835	null
2023-03-10	Aleth-NeRF: Low-light Condition View Synthesis with Concealing Fields	Ziteng Cui et.al.	2303.05807	null
2023-03-10	Self-NeRF: A Self-Training Pipeline for Few-Shot Neural Radiance Fields	Jiayang Bai et.al.	2303.05775	null
2023-03-14	Hardware Acceleration of Neural Graphics	Muhammad Husnain Mubarik et.al.	2303.05735	null
2023-03-10	MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field	Kaizhi Yang et.al.	2303.05703	null
2023-03-09	PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification	Xuan Li et.al.	2303.05512	null
2023-03-08	FastSurf: Fast Neural RGB-D Surface Reconstruction using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning	Seunghwan Lee et.al.	2303.04508	link
2023-03-08	DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields	Dipam Patel et.al.	2303.04322	null
2023-03-07	NEPHELE: A Neural Platform for Highly Realistic Cloud Radiance Rendering	Haimin Luo et.al.	2303.04086	null
2023-03-05	Semantic-aware Occlusion Filtering Neural Radiance Fields in the Wild	Jaewon Lee et.al.	2303.03966	null
2023-03-07	Multiscale Tensor Decomposition and Rendering Equation Encoding for View Synthesis	Kang Han et.al.	2303.03808	link
2023-03-10	Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation from 2D Supervision	Xiaoshuai Zhang et.al.	2303.03361	null
2023-03-07	Efficient Large-scale Scene Representation with a Hybrid of High-resolution Grid and Plane Features	Yuqi Zhang et.al.	2303.03003	link
2023-03-03	Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement	Jiaxiang Tang et.al.	2303.02091	link
2023-03-03	Multi-Plane Neural Radiance Fields for Novel View Synthesis	Youssef Abdelkareem et.al.	2303.01736	null
2023-03-01	S-NeRF: Neural Radiance Fields for Street Views	Ziyang Xie et.al.	2303.00749	null
2023-02-28	IntrinsicNGP: Intrinsic Coordinate based Hash Encoding for Human NeRF	Bo Peng et.al.	2302.14683	null
2023-02-27	BaLi-RF: Bandlimited Radiance Fields for Dynamic Scene Modeling	Sameera Ramasinghe et.al.	2302.13543	null
2023-02-26	Efficient physics-informed neural networks using hash encoding	Xinquan Huang et.al.	2302.13397	null
2023-02-24	CATNIPS: Collision Avoidance Through Neural Implicit Probabilistic Scenes	Timothy Chen et.al.	2302.12931	link
2023-02-24	Learning Neural Volumetric Representations of Dynamic Humans in Minutes	Chen Geng et.al.	2302.12237	link
2023-02-23	DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models	Jamie Wynn et.al.	2302.12231	link
2023-02-20	NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion	Jiatao Gu et.al.	2302.10109	null
2023-02-19	LC-NeRF: Local Controllable Face Generation in Neural Randiance Field	Wenyang Zhou et.al.	2302.09486	null
2023-02-17	MixNeRF: Modeling a Ray with Mixture Density for Novel View Synthesis from Sparse Inputs	Seunghyeon Seo et.al.	2302.08788	link
2023-02-14	VQ3D: Learning a 3D-Aware Generative Model on ImageNet	Kyle Sargent et.al.	2302.06833	null
2023-02-13	3D-aware Blending with Generative NeRFs	Hyunsu Kim et.al.	2302.06608	link
2023-02-11	3D Colored Shape Reconstruction from a Single RGB Image through Diffusion	Bo Li et.al.	2302.05573	null
2023-02-08	Nerfstudio: A Modular Framework for Neural Radiance Field Development	Matthew Tancik et.al.	2302.04264	null
2023-02-07	AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis	Susan Liang et.al.	2302.02088	null
2023-02-03	Semantic 3D-aware Portrait Synthesis and Manipulation Based on Compositional Neural Radiance Field	Tianxiang Ma et.al.	2302.01579	link
2023-02-03	Robust Camera Pose Refinement for Multi-Resolution Hash Encoding	Hwan Heo et.al.	2302.01571	null
2023-02-03	INV: Towards Streaming Incremental Neural Videos	Shengze Wang et.al.	2302.01532	null
2023-02-02	Factor Fields: A Unified Framework for Neural Fields and Beyond	Anpei Chen et.al.	2302.01226	null
2023-02-02	RobustNeRF: Ignoring Distractors with Robust Losses	Sara Sabour et.al.	2302.00833	null
2023-01-31	GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis	Zhenhui Ye et.al.	2301.13430	null
2023-01-30	Equivariant Architectures for Learning in Deep Weight Spaces	Aviv Navon et.al.	2301.12780	link
2023-01-27	HyperNeRFGAN: Hypernetwork approach to 3D NeRF GAN	Adam Kania et.al.	2301.11631	link
2023-01-27	A Comparison of Tiny-nerf versus Spatial Representations for 3d Reconstruction	Saulo Abraham Gante et.al.	2301.11522	null
2023-01-27	SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning	Dongseok Shim et.al.	2301.11520	null
2023-01-26	Text-To-4D Dynamic Scene Generation	Uriel Singer et.al.	2301.11280	null
2023-01-26	GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency	Minseop Kwak et.al.	2301.10941	link
2023-01-23	HexPlane: A Fast Representation for Dynamic Scenes	Ang Cao et.al.	2301.09632	link
2023-01-22	3D Reconstruction of Non-cooperative Resident Space Objects using Instant NGP-accelerated NeRF and D-NeRF	Trupti Mahendrakar et.al.	2301.09060	null
2023-01-18	NeRF in the Palm of Your Hand: Corrective Augmentation for Robotics via Novel-View Synthesis	Allan Zhou et.al.	2301.08556	null
2023-01-19	RecolorNeRF: Layer Decomposed Radiance Field for Efficient Color Editing of 3D Scenes	Bingchen Gong et.al.	2301.07958	null
2023-01-18	Behind the Scenes: Density Fields for Single View Reconstruction	Felix Wimbauer et.al.	2301.07668	link
2023-01-17	A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction	Chongshan Lu et.al.	2301.06782	null
2023-01-13	Laser: Latent Set Representations for 3D Generative Modeling	Pol Moreno et.al.	2301.05747	null
2023-01-10	Benchmarking Robustness in Neural Radiance Fields	Chen Wang et.al.	2301.04075	null
2023-01-08	Towards Open World NeRF-Based SLAM	Daniil Lisus et.al.	2301.03102	null
2023-01-10	Traditional Readability Formulas Compared for English	Bruce W. Lee et.al.	2301.02975	null
2023-01-09	Class-Continuous Conditional Generative Neural Radiance Field	Jiwook Kim et.al.	2301.00950	link
2023-01-11	Detachable Novel Views Synthesis of Dynamic Scenes Using Distribution-Driven Neural Radiance Fields	Boyu Zhang et.al.	2301.00411	link
2022-12-26	MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos	Fengrui Tian et.al.	2212.13056	link
2022-12-25	PaletteNeRF: Palette-based Color Editing for NeRFs	Qiling Wu et.al.	2212.12871	null
2022-12-22	Removing Objects From Neural Radiance Fields	Silvan Weder et.al.	2212.11966	null
2022-12-21	Incremental Learning for Neural Radiance Field with Uncertainty-Filtered Knowledge Distillation	Mengqi Guo et.al.	2212.10950	link
2022-12-21	PaletteNeRF: Palette-based Appearance Editing of Neural Radiance Fields	Zhengfei Kuang et.al.	2212.10699	null
2022-12-20	Correspondence Distillation from NeRF-based GAN	Yushi Lan et.al.	2212.09735	null
2022-12-19	StyleTRF: Stylizing Tensorial Radiance Fields	Rahul Goel et.al.	2212.09330	null
2022-12-18	SPARF: Large-Scale Learning of 3D Sparse Radiance Fields from Few Input Images	Abdullah Hamdi et.al.	2212.09100	link
2022-12-18	Masked Wavelet Representation for Compact Neural Radiance Fields	Daniel Rho et.al.	2212.09069	link
2022-12-15	SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory	Sicheng Li et.al.	2212.08476	null
2022-12-16	MEIL-NeRF: Memory-Efficient Incremental Learning of Neural Radiance Fields	Jaeyoung Chung et.al.	2212.08328	null
2022-12-15	NeRF-Art: Text-Driven Neural Radiance Fields Stylization	Can Wang et.al.	2212.08070	link
2022-12-15	Real-Time Neural Light Field on Mobile Devices	Junli Cao et.al.	2212.08057	link
2022-12-14	NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior	Wenjing Bian et.al.	2212.07388	link
2022-12-08	GazeNeRF: 3D-Aware Gaze Redirection with Neural Radiance Fields	Alessandro Ruzzi et.al.	2212.04823	link
2022-12-09	4K-NeRF: High Fidelity Neural Radiance Fields at Ultra High Resolutions	Zhongshu Wang et.al.	2212.04701	link
2022-12-07	EditableNeRF: Editing Topologically Varying Neural Radiance Fields by Key Points	Chengwei Zheng et.al.	2212.04247	null
2022-12-08	NeRFEditor: Differentiable Style Decomposition for Full 3D Scene Editing	Chunyi Sun et.al.	2212.03848	null
2022-12-07	Non-uniform Sampling Strategies for NeRF on 360{\textdegree} images	Takashi Otonari et.al.	2212.03635	null
2022-12-07	SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields	Siddhant Ranade et.al.	2212.03406	null
2022-12-06	NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors	Congyue Deng et.al.	2212.03267	null
2022-12-05	SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields	Anh-Quan Cao et.al.	2212.02501	link
2022-12-05	Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields	Rohith Agaram et.al.	2212.02493	link
2022-12-06	D-TensoRF: Tensorial Radiance Fields for Dynamic Scenes	Hankyu Jang et.al.	2212.02375	null
2022-12-07	GARF:Geometry-Aware Generalized Neural Radiance Field	Yue Shi et.al.	2212.02280	null
2022-12-05	INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors	Chaojian Li et.al.	2212.01959	null
2022-12-03	MaRF: Representing Mars as Neural Radiance Fields	Lorenzo Giusti et.al.	2212.01672	link
2022-12-03	StegaNeRF: Embedding Invisible Information within Neural Radiance Fields	Chenxin Li et.al.	2212.01602	null
2022-12-02	RT-NeRF: Real-Time On-Device Neural Radiance Fields Towards Immersive AR/VR Rendering	Chaojian Li et.al.	2212.01120	null
2022-12-02	3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation	Zutao Jiang et.al.	2212.01103	null
2022-12-02	QFF: Quantized Fourier Features for Neural Field Representations	Jae Yong Lee et.al.	2212.00914	null
2022-12-01	ViewNeRF: Unsupervised Viewpoint Estimation Using Category-Level Neural Radiance Fields	Octave Mariotti et.al.	2212.00436	null
2022-11-30	NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-shot Real Image Animation	Yu Yin et.al.	2211.17235	null
2022-11-29	NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360° Views	Dejia Xu et.al.	2211.16431	link
2022-11-29	Compressing Volumetric Radiance Fields to 1 MB	Lingzhi Li et.al.	2211.16386	link
2022-11-28	In-Hand 3D Object Scanning from an RGB Sequence	Shreyas Hampali et.al.	2211.16193	null
2022-11-30	One is All: Bridging the Gap Between Neural Radiance Fields Architectures with Progressive Volume Distillation	Shuangkang Fang et.al.	2211.15977	link
2022-11-28	High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors	Yunpeng Bai et.al.	2211.15064	null
2022-11-27	SuNeRF: Validation of a 3D Global Reconstruction of the Solar Corona Using Simulated EUV Images	Kyriaki-Margarita Bintsi et.al.	2211.14879	null
2022-11-27	3D Scene Creation and Rendering via Rough Meshes: A Lighting Transfer Avenue	Yujie Li et.al.	2211.14823	null
2022-11-27	Sampling Neural Radiance Fields for Refractive Objects	Jen-I Pan et.al.	2211.14799	link
2022-11-25	3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models	Gang Li et.al.	2211.14108	null
2022-11-25	ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision	Jingwang Ling et.al.	2211.14086	link
2022-11-25	Dynamic Neural Portraits	Michail Christos Doukas et.al.	2211.13994	null
2022-11-25	Unsupervised Continual Semantic Adaptation through Neural Rendering	Zhizheng Liu et.al.	2211.13969	link
2022-11-25	TPA-Net: Generate A Dataset for Text to Physics-based Animation	Yuxing Qiu et.al.	2211.13887	null
2022-11-24	ScanNeRF: a Scalable Benchmark for Neural Radiance Fields	Luca De Luigi et.al.	2211.13762	null
2022-11-24	Immersive Neural Graphics Primitives	Ke Li et.al.	2211.13494	link
2022-11-23	CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields	Keqiang Sun et.al.	2211.13251	null
2022-11-26	ClimateNeRF: Physically-based Neural Rendering for Extreme Climate Synthesis	Yuan Li et.al.	2211.13226	null
2022-11-23	ManVatar : Fast 3D Head Avatar Reconstruction Using Motion-Aware Neural Voxels	Yuelang Xu et.al.	2211.13206	null
2022-11-23	BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields	Peng Wang et.al.	2211.12853	link
2022-11-23	PANeRF: Pseudo-view Augmentation for Improved Neural Radiance Fields Based on Few-shot Inputs	Young Chun Ahn et.al.	2211.12758	null
2022-11-23	ActiveRMAP: Radiance Field for Active Mapping And Planning	Huangying Zhan et.al.	2211.12656	null
2022-11-22	Zero NeRF: Registration with Zero Overlap	Casey Peat et.al.	2211.12544	null
2022-11-22	Depth-Supervised NeRF for Multi-View RGB-D Operating Room Images	Beerend G. A. Gerats et.al.	2211.12436	null
2022-11-22	Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition	Jiaxiang Tang et.al.	2211.12368	null
2022-11-22	Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields	Brian K. S. Isaac-Medina et.al.	2211.12285	link
2022-11-22	SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields	Ashkan Mirzaei et.al.	2211.12254	null
2022-11-22	Deblurred Neural Radiance Field with Physical Scene Priors	Dogyoon Lee et.al.	2211.12046	link
2022-11-22	ONeRF: Unsupervised 3D Object Segmentation from Multiple Views	Shengnan Liang et.al.	2211.12038	null
2022-11-21	Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques	David Ramirez et.al.	2211.11836	null
2022-11-21	SPARF: Neural Radiance Fields from Sparse and Noisy Poses	Prune Truong et.al.	2211.11738	link
2022-11-21	ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields	Mohammad Mahdi Johari et.al.	2211.11704	null
2022-11-21	Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion	Dario Pavllo et.al.	2211.11674	link
2022-11-18	Magic3D: High-Resolution Text-to-3D Content Creation	Chen-Hsuan Lin et.al.	2211.10440	null
2022-11-17	AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training	Yifan Jiang et.al.	2211.09682	null
2022-11-16	CoNFies: Controllable Neural Face Avatars	Heng Yu et.al.	2211.08610	null
2022-11-14	Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures	Gal Metzer et.al.	2211.07600	link
2022-11-12	3D-Aware Encoding for Style-based Neural Radiance Fields	Yu-Jhe Li et.al.	2211.06583	null
2022-11-11	ParticleNeRF: A Particle-Based Encoding for Online Neural Radiance Fields in Dynamic Scenes	Jad Abou-Chakra et.al.	2211.04041	null
2022-11-07	Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories	Samarth Sinha et.al.	2211.03889	null
2022-11-03	nerf2nerf: Pairwise Registration of Neural Radiance Fields	Lily Goli et.al.	2211.01600	null
2022-10-27	ProbNeRF: Uncertainty-Aware Inference of 3D Shapes from 2D Images	Matthew D. Hoffman et.al.	2210.17415	null
2022-10-27	Boosting Point Clouds Rendering via Radiance Mapping	Xiaoyang Huang et.al.	2210.15107	link
2022-10-24	Learning Neural Radiance Fields from Multi-View Geometry	Marco Orsingher et.al.	2210.13041	null
2022-10-23	Compressing Explicit Voxel Grid Representations: fast NeRFs become also small	Chenxi Lola Deng et.al.	2210.12782	null
2022-11-06	Joint Rigid Motion Correction and Sparse-View CT via Self-Calibrating Neural Field	Qing Wu et.al.	2210.12731	null
2022-10-21	An Exploration of Neural Radiance Field Scene Reconstruction: Synthetic, Real-world and Dynamic Scenes	Benedict Quartey et.al.	2210.12268	null
2022-11-06	Neural Fields for Robotic Object Manipulation from a Single Image	Valts Blukis et.al.	2210.12126	null
2022-10-21	HDHumans: A Hybrid Approach for High-fidelity Digital Humans	Marc Habermann et.al.	2210.12003	null
2022-10-21	RGB-Only Reconstruction of Tabletop Scenes for Collision-Free Manipulator Control	Zhenggang Tang et.al.	2210.11668	null
2022-10-21	Coordinates Are NOT Lonely – Codebook Prior Helps Implicit Neural 3D Representations	Fukun Yin et.al.	2210.11170	link
2022-10-18	Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation	Yunzhi Lin et.al.	2210.10108	link
2022-10-18	ARAH: Animatable Volume Rendering of Articulated Human SDFs	Shaofei Wang et.al.	2210.10036	null
2022-10-20	Differentiable Physics Simulation of Dynamics-Augmented Neural Objects	Simon Le Cleac’h et.al.	2210.09420	null
2022-10-15	SPIDR: SDF-based Neural Point Fields for Illumination and Deformation	Ruofan Liang et.al.	2210.08398	null
2022-10-15	IBL-NeRF: Image-Based Lighting Formulation of Neural Radiance Fields	Changwoon Choi et.al.	2210.08202	link
2022-10-17	3D GAN Inversion with Pose Optimization	Jaehoon Ko et.al.	2210.07301	link
2022-10-13	Multiplane NeRF-Supervised Disentanglement of Depth and Camera Pose from Videos	Yang Fu et.al.	2210.07181	null
2022-10-12	GraspNeRF: Multiview-based 6-DoF Grasp Detection for Transparent and Specular Objects Using Generalizable NeRF	Qiyu Dai et.al.	2210.06575	link
2022-10-12	Reconstructing Personalized Semantic Facial NeRF Models From Monocular Video	Xuan Gao et.al.	2210.06108	link
2022-10-11	X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360 $^{\circ}$ Insufficient RGB-D Views	Haoyi Zhu et.al.	2210.05135	link
2022-10-10	NeRF2Real: Sim2real Transfer of Vision-guided Bipedal Motion Skills using Neural Radiance Fields	Arunkumar Byravan et.al.	2210.04932	null
2022-10-10	EVA3D: Compositional 3D Human Generation from 2D Image Collections	Fangzhou Hong et.al.	2210.04888	link
2022-10-13	NerfAcc: A General NeRF Acceleration Toolbox	Ruilong Li et.al.	2210.04847	link
2022-10-10	SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction	Yitong Xia et.al.	2210.04553	link
2022-10-09	Robustifying the Multi-Scale Representation of Neural Radiance Fields	Nishant Jain et.al.	2210.04233	null
2022-10-09	Estimating Neural Reflectance Field from Radiance Field using Tree Structures	Xiu Li et.al.	2210.04217	null
2022-10-09	Data augmentation for NeRF: a geometric consistent solution based on view morphing	Matteo Bortolon et.al.	2210.04214	link
2022-10-09	Towards Efficient Neural Scene Graphs by Learning Consistency Fields	Yeji Song et.al.	2210.04127	null
2022-10-08	ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints	Yinpeng Dong et.al.	2210.03895	link
2022-10-04	SelfNeRF: Fast Training NeRF for Human from Monocular Self-rotating Video	Bo Peng et.al.	2210.01651	null
2022-10-03	NARF22: Neural Articulated Radiance Fields for Configuration-Aware Rendering	Stanley Lewis et.al.	2210.01166	null
2022-10-02	IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis	Weicai Ye et.al.	2210.00647	link
2022-10-02	Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation	Xinhang Liu et.al.	2210.00489	null
2022-10-01	NeRF: Neural Radiance Field in 3D Vision, A Comprehensive Review	Kyle Gao et.al.	2210.00379	null
2022-10-01	Structure-Aware NeRF without Posed Camera via Epipolar Constraint	Shu Chen et.al.	2210.00183	link
2022-09-30	Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator	Zifan Shi et.al.	2209.15637	null
2022-09-30	Understanding Pure CLIP Guidance for Voxel Grid NeRF Models	Han-Hung Lee et.al.	2209.15172	null
2022-09-29	DreamFusion: Text-to-3D using 2D Diffusion	Ben Poole et.al.	2209.14988	null
2022-09-29	SymmNeRF: Learning to Explore Symmetry Prior for Single-View View Synthesis	Xingyi Li et.al.	2209.14819	link
2022-10-03	360FusionNeRF: Panoramic Neural Radiance Fields with Joint Guidance	Shreyas Kulkarni et.al.	2209.14265	link
2022-09-27	OmniNeRF: Hybriding Omnidirectional Distance and Radiance fields for Neural Surface Reconstruction	Jiaming Shen et.al.	2209.13433	null
2022-09-27	Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping	Chi-Ming Chung et.al.	2209.13274	link
2022-09-27	WaterNeRF: Neural Radiance Fields for Underwater Scenes	Advaith Venkatramanan Sethuraman et.al.	2209.13091	null
2022-09-26	Baking in the Feature: Accelerating Volumetric Segmentation by Rendering Feature Maps	Kenneth Blomqvist et.al.	2209.12744	null
2022-09-25	Enforcing safety for vision-based controllers via Control Barrier Functions and Neural Radiance Fields	Mukun Tong et.al.	2209.12266	null
2022-09-24	NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields	Jiankai Sun et.al.	2209.12068	null
2022-09-19	Loc-NeRF: Monte Carlo Localization using Neural Radiance Fields	Dominic Maggio et.al.	2209.09050	link
2022-09-23	NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes	Zhiwen Fan et.al.	2209.08776	link
2022-09-19	Density-aware NeRF Ensembles: Quantifying Predictive Uncertainty in Neural Radiance Fields	Niko Sünderhauf et.al.	2209.08718	null
2022-09-18	ActiveNeRF: Learning where to See with Uncertainty Estimation	Xuran Pan et.al.	2209.08546	link
2022-09-18	LATITUDE: Robotic Global Localization with Truncated Dynamic Low-pass Filter in City-scale NeRF	Zhenxin Zhu et.al.	2209.08498	link
2022-09-16	iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking	Yuhang Ming et.al.	2209.07919	null
2022-09-12	StructNeRF: Neural Radiance Fields for Indoor Scenes with Structural Hints	Zheng Chen et.al.	2209.05277	null
2022-09-09	Generative Deformable Radiance Fields for Disentangled Image Synthesis of Topology-Varying Objects	Ziyu Wang et.al.	2209.04183	null
2022-09-08	im2nerf: Image to Neural Radiance Field in the Wild	Lu Mi et.al.	2209.04061	null
2022-09-08	PixTrack: Precise 6DoF Object Pose Tracking using NeRF Templates and Feature-metric Alignment	Prajwal Chidananda et.al.	2209.03910	link
2022-09-07	Neural Feature Fusion Fields: 3D Distillation of Self-Supervised 2D Image Representations	Vadim Tschernezki et.al.	2209.03494	null
2022-08-29	Volume Rendering Digest (for NeRF)	Andrea Tagliasacchi et.al.	2209.02417	null
2022-09-06	CLONeR: Camera-Lidar Fusion for Occupancy Grid-aided Neural Representations	Alexandra Carlson et.al.	2209.01194	null
2022-09-01	On Quantizing Implicit Neural Representations	Cameron Gordon et.al.	2209.01019	null
2022-08-31	Dual-Space NeRF: Learning Animatable Avatars and Scene Lighting in Separate Spaces	Yihao Zhi et.al.	2208.14851	link
2022-08-30	A Portable Multiscopic Camera for Novel View and Time Synthesis in Dynamic Scenes	Tianjia Zhang et.al.	2208.14433	null
2022-08-24	PeRFception: Perception using Radiance Fields	Yoonwoo Jeong et.al.	2208.11537	link
2022-08-24	E-NeRF: Neural Radiance Fields from a Moving Event Camera	Simon Klenk et.al.	2208.11300	link
2022-08-18	Neural Capture of Animatable 3D Human from Monocular Video	Gusi Te et.al.	2208.08728	null
2022-08-16	Casual Indoor HDR Radiance Capture from Omnidirectional Images	Pulkit Gera et.al.	2208.07903	null
2022-08-15	DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images	Bing Wang et.al.	2208.07227	link
2022-08-11	RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild	Jason Y. Zhang et.al.	2208.05963	null
2022-08-11	FDNeRF: Few-shot Dynamic Neural Radiance Fields for Face Reconstruction and Expression Editing	Jingbo Zhang et.al.	2208.05751	link
2022-08-04	360Roam: Real-Time Indoor Roaming Using Geometry-Aware ${360^\circ}$ Radiance Fields	Huajian Huang et.al.	2208.02705	null
2022-08-02	T4DT: Tensorizing Time for Learning Temporal 3D Visual Data	Mikhail Usvyatsov et.al.	2208.01421	link
2022-08-01	DoF-NeRF: Depth-of-Field Meets Neural Radiance Fields	Zijin Wu et.al.	2208.00945	link
2022-08-06	MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures	Zhiqin Chen et.al.	2208.00277	link
2022-07-30	Distilled Low Rank Neural Radiance Field with Quantization for Light Field Compression	Jinglei Shi et.al.	2208.00164	null
2022-08-01	End-to-end View Synthesis via NeRF Attention	Zelin Zhao et.al.	2207.14741	null
2022-07-29	Neural Density-Distance Fields	Itsuki Ueda et.al.	2207.14455	link
2022-07-27	Is Attention All NeRF Needs?	Mukund Varma T et.al.	2207.13298	null

Gaussian Splatting

Publish Date	Title	Authors	PDF	Code
2025-07-23	Temporal Smoothness-Aware Rate-Distortion Optimized 4D Gaussian Splatting	Hyeongmin Lee et.al.	2507.17336	null
2025-07-22	StreamME: Simplify 3D Gaussian Avatar within Live Stream	Luchuan Song et.al.	2507.17029	null
2025-07-23	EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion	Shang Liu et.al.	2507.16535	null
2025-07-22	Sparse-View 3D Reconstruction: Recent Advances and Open Challenges	Tanveer Younis et.al.	2507.16406	null
2025-07-22	LongSplat: Online Generalizable 3D Gaussian Splatting from Long Sequence Images	Guichen Huang et.al.	2507.16144	null
2025-07-21	Appearance Harmonization via Bilateral Grid Prediction with Transformers for 3DGS	Jisu Shin et.al.	2507.15748	null
2025-07-21	DWTGS: Rethinking Frequency Regularization for Sparse-view 3D Gaussian Splatting	Hung Nguyen et.al.	2507.15690	null
2025-07-21	Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing	Boni Hu et.al.	2507.15683	null
2025-07-21	Gaussian Splatting with Discretized SDF for Relightable Assets	Zuo-Liang Zhu et.al.	2507.15629	null
2025-07-21	SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting	Zihui Gao et.al.	2507.15602	null
2025-07-21	ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting	Ruijie Zhu et.al.	2507.15454	null
2025-07-22	GCC: A 3DGS Inference Architecture with Gaussian-Wise and Cross-Stage Conditional Processing	Minnan Pei et.al.	2507.15300	null
2025-07-20	Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction	Xiufeng Huang et.al.	2507.14921	null
2025-07-19	DCHM: Depth-Consistent Human Modeling for Multiview Detection	Jiahao Ma et.al.	2507.14505	null
2025-07-19	Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey	Jiahui Zhang et.al.	2507.14501	null
2025-07-18	Neural-GASh: A CGA-based neural radiance prediction pipeline for real-time shading	Efstratios Geronikolakis et.al.	2507.13917	null
2025-07-18	PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations	Yu Wei et.al.	2507.13891	null
2025-07-18	TexGS-VolVis: Expressive Scene Editing for Volume Visualization via Textured Gaussian Splatting	Kaiyuan Tang et.al.	2507.13586	null
2025-07-16	VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians	Siyuan Yao et.al.	2507.12667	null
2025-07-16	NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting	Kuangshi Ai et.al.	2507.12621	null
2025-07-16	Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition	Beizhen Zhao et.al.	2507.12498	null
2025-07-16	AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving	Jiawei Xu et.al.	2507.12137	null
2025-07-16	BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images	Davide Di Nucci et.al.	2507.12095	null
2025-07-16	SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation	Beining Xu et.al.	2507.12027	null
2025-07-16	Dark-EvGS: Event Camera as an Eye for Radiance Field in the Dark	Jingqian Wu et.al.	2507.11931	null
2025-07-15	A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction	Haoxuan Qu et.al.	2507.11321	null
2025-07-16	TRAN-D: 2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update	Jeongyun Kim et.al.	2507.11069	null
2025-07-15	Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling	Hayeon Kim et.al.	2507.11061	null
2025-07-14	ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions	Shivangi Aneja et.al.	2507.10542	null
2025-07-14	3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving	Yixun Zhang et.al.	2507.09993	null
2025-07-11	Learning human-to-robot handovers through 3D scene reconstruction	Yuekun Wu et.al.	2507.08726	null
2025-07-11	RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting	Ji Hyun Seo et.al.	2507.08434	null
2025-07-10	Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction	Hyungjun Doh et.al.	2507.08137	null
2025-07-10	RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration	Chong Cheng et.al.	2507.08136	null
2025-07-10	RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection	Yongyang Zhou et.al.	2507.07733	null
2025-07-10	MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation	Bangning Wei et.al.	2507.07519	null
2025-07-10	SD-GS: Structured Deformable 3D Gaussians for Efficient Dynamic Scene Reconstruction	Wei Yao et.al.	2507.07465	null
2025-07-10	Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image Collections	Yongtang Bao et.al.	2507.07395	null
2025-07-09	LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS	Wanhua Li et.al.	2507.07136	null
2025-07-09	Enhancing non-Rigid 3D Model Deformations Using Mesh-based Gaussian Splatting	Wijayathunga W. M. R. D. B et.al.	2507.07000	null
2025-07-09	Photometric Stereo using Gaussian Splatting and inverse rendering	Matéo Ducastel et.al.	2507.06684	null
2025-07-09	FlexGaussian: Flexible and Cost-Effective Training-Free Compression for 3D Gaussian Splatting	Boyuan Tian et.al.	2507.06671	null
2025-07-09	ClipGS: Clippable Gaussian Splatting for Interactive Cinematic Visualization of Volumetric Medical Data	Chengkun Li et.al.	2507.06647	null
2025-07-08	LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures	Seungoh Han et.al.	2507.06109	null
2025-07-08	Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering	Jiayi Song et.al.	2507.06103	null
2025-07-08	VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis	Alexandre Symeonidis-Herzig et.al.	2507.06060	null
2025-07-08	D-FCGS: Feedforward Compression of Dynamic Gaussian Splatting for Free-Viewpoint Videos	Wenkang Zhang et.al.	2507.05859	null
2025-07-08	DreamArt: Generating Interactable Articulated Objects from a Single Image	Ruijie Lu et.al.	2507.05763	null
2025-07-08	3DGS_LSR:Large_Scale Relocation for Autonomous Driving Based on 3D Gaussian Splatting	Haitao Lu et.al.	2507.05661	null
2025-07-07	Mastering Regional 3DGS: Locating, Initializing, and Editing with Diverse 2D Priors	Lanqing Guo et.al.	2507.05426	null
2025-07-07	SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation	Jiahao Zhu et.al.	2507.05256	null
2025-07-07	InterGSEdit: Interactive 3D Gaussian Splatting Editing with 3D Geometry-Consistent Attention Prior	Minghao Wen et.al.	2507.04961	null
2025-07-05	A3FR: Agile 3D Gaussian Splatting with Incremental Gaze Tracked Foveated Rendering in Virtual Reality	Shuo Xin et.al.	2507.04147	null
2025-07-05	Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM	Xiaolei Lang et.al.	2507.04004	null
2025-07-05	ArmGS: Composite Gaussian Appearance Refinement for Modeling Dynamic Urban Environments	Guile Wu et.al.	2507.03886	null
2025-07-04	Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps	Chong Cheng et.al.	2507.03737	null
2025-07-03	HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars	Gent Serifi et.al.	2507.02803	null
2025-07-03	ArtGS:3D Gaussian Splatting for Interactive Visual-Physical Modeling and Manipulation of Articulated Objects	Qiaojun Yu et.al.	2507.02600	null
2025-07-03	LocalDyGS: Multi-view Global Dynamic Scene Modeling via Adaptive Local Implicit Feature Decoupling	Jiahao Wu et.al.	2507.02363	null
2025-07-03	Gbake: Baking 3D Gaussian Splats into Reflection Probes	Stephen Pasch et.al.	2507.02257	null
2025-07-02	3D Gaussian Splatting Driven Multi-View Robust Physical Adversarial Camouflage Generation	Tianrui Lou et.al.	2507.01367	null
2025-07-01	VISTA: Open-Vocabulary, Task-Relevant Robot Exploration with Online Semantic Gaussian Splatting	Keiko Nagami et.al.	2507.01125	null
2025-07-01	A LoD of Gaussians: Unified Training and Rendering for Ultra-Large Scale Reconstruction with External Memory	Felix Windisch et.al.	2507.01110	null
2025-07-01	Masks make discriminative models great again!	Tianshi Cao et.al.	2507.00916	null
2025-07-01	GaussianVLM: Scene-centric 3D Vision-Language Models using Language-aligned Gaussian Splats for Embodied Reasoning and Beyond	Anna-Maria Halacheva et.al.	2507.00886	null
2025-07-01	LOD-GS: Level-of-Detail-Sensitive 3D Gaussian Splatting for Detail Conserved Anti-Aliasing	Zhenya Yang et.al.	2507.00554	null
2025-07-01	GDGS: 3D Gaussian Splatting Via Geometry-Guided Initialization And Dynamic Density Control	Xingjun Wang et.al.	2507.00363	null
2025-06-30	MILo: Mesh-In-the-Loop Gaussian Splatting for Detailed and Efficient Surface Reconstruction	Antoine Guédon et.al.	2506.24096	null
2025-06-30	GaVS: 3D-Grounded Video Stabilization via Temporally-Consistent Local Reconstruction and Rendering	Zinuo You et.al.	2506.23957	null
2025-06-30	AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention	Ziao Liu et.al.	2506.23611	null
2025-06-30	Instant GaussianImage: A Generalizable and Self-Adaptive Image Representation via 2D Gaussian Splatting	Zhaojie Zeng et.al.	2506.23479	null
2025-07-01	SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting	Yiming Huang et.al.	2506.23309	null
2025-06-29	Endo-4DGX: Robust Endoscopic Scene Reconstruction and Illumination Correction with Gaussian Splatting	Yiming Huang et.al.	2506.23308	null
2025-06-29	TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints	Zhen Tan et.al.	2506.23207	null
2025-06-29	STD-GS: Exploring Frame-Event Interaction for SpatioTemporal-Disentangled Gaussian Splatting to Reconstruct High-Dynamic Scene	Hanyu Zhou et.al.	2506.23157	null
2025-06-29	From Coarse to Fine: Learnable Discrete Wavelet Transforms for Efficient 3D Gaussian Splatting	Hung Nguyen et.al.	2506.23042	null
2025-06-28	Confident Splatting: Confidence-Based Compression of 3D Gaussian Splatting via Learnable Beta Distributions	AmirHossein Naghi Razlighi et.al.	2506.22973	null
2025-06-27	DIGS: Dynamic CBCT Reconstruction using Deformation-Informed 4D Gaussian Splatting and a Low-Rank Free-Form Deformation Model	Yuliang Huang et.al.	2506.22280	null
2025-06-27	BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting	Zipei Ma et.al.	2506.22099	null
2025-06-26	MADrive: Memory-Augmented Driving Scene Modeling	Polina Karpikova et.al.	2506.21520	null
2025-06-26	EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting	Taoyu Wu et.al.	2506.21420	null
2025-06-28	Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction	Zhirui Gao et.al.	2506.21401	null
2025-06-26	Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image	Pufan Li et.al.	2506.21152	null
2025-06-26	CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization	Jan Ackermann et.al.	2506.21117	null
2025-06-26	User-in-the-Loop View Sampling with Error Peaking Visualization	Ayaka Yasunaga et.al.	2506.21009	null
2025-06-26	DBMovi-GS: Dynamic View Synthesis from Blurry Monocular Video via Sparse-Controlled Gaussian Splatting	Yeon-Ji Song et.al.	2506.20998	null
2025-06-25	3DGH: 3D Head Generation with Composable Hair and Face	Chengan He et.al.	2506.20875	null
2025-06-25	RaRa Clipper: A Clipper for Gaussian Splatting Based on Ray Tracer and Rasterizer	Da Li et.al.	2506.20202	null
2025-06-24	ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model	Tengbo Yu et.al.	2506.19842	null
2025-06-24	Virtual Memory for 3D Gaussian Splatting	Jonathan Haberl et.al.	2506.19415	null
2025-06-24	HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis	Xiaoyuan Wang et.al.	2506.19291	null
2025-06-23	GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM	Annika Thomas et.al.	2506.18885	null
2025-06-23	ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs	Michal Nazarczuk et.al.	2506.18792	null
2025-06-23	3D Arena: An Open Platform for Generative 3D Evaluation	Dylan Ebert et.al.	2506.18787	null
2025-06-23	Reconstructing Tornadoes in 3D with Gaussian Splatting	Adam Yang et.al.	2506.18677	null
2025-06-21	3D Gaussian Splatting for Fine-Detailed Surface Reconstruction in Large-Scale Scene	Shihan Chen et.al.	2506.17636	null
2025-06-20	Part $^{2}$ GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting	Tianjiao Yu et.al.	2506.17212	null
2025-06-23	R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision	Weeyoung Kwon et.al.	2506.16262	link
2025-06-19	Information-computation trade-offs in non-linear transforms	Connor Ding et.al.	2506.15948	null
2025-06-18	Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos	Kaifeng Zhang et.al.	2506.15680	null
2025-06-18	RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories	Qingsong Yan et.al.	2506.15242	null
2025-06-17	Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction	Zhengquan Zhang et.al.	2506.14856	null
2025-06-17	SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting	Ziqiao Peng et.al.	2506.14742	null
2025-06-17	3DGS-IEval-15K: A Large-scale Image Quality Evaluation Database for 3D Gaussian-Splatting	Yuke Xing et.al.	2506.14642	link
2025-06-17	HRGS: Hierarchical Gaussian Splatting for Memory-Efficient High-Resolution 3D Reconstruction	Changbai Li et.al.	2506.14229	null
2025-06-17	GAF: Gaussian Action Field as a Dvnamic World Model for Robotic Mlanipulation	Ying Chai et.al.	2506.14135	null
2025-06-16	GRaD-Nav++: Vision-Language Model Enabled Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics	Qianzhong Chen et.al.	2506.14009	null
2025-06-16	PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images	Lingteng Qiu et.al.	2506.13766	null
2025-06-16	Micro-macro Gaussian Splatting with Enhanced Scalability for Unconstrained Scene Reconstruction	Yihui Li et.al.	2506.13516	link
2025-06-16	Multiview Geometric Regularization of Gaussian Splatting for Accurate Radiance Fields	Jungeon Kim et.al.	2506.13508	null
2025-06-16	TextureSplat: Per-Primitive Texture Mapping for Reflective Gaussian Splatting	Mae Younes et.al.	2506.13348	link
2025-06-16	GS-2DGS: Geometrically Supervised 2DGS for Reflective Object Reconstruction	Jinguang Tong et.al.	2506.13110	null
2025-06-15	Metropolis-Hastings Sampling for 3D Gaussian Reconstruction	Hyunjin Kim et.al.	2506.12945	null
2025-06-15	Rasterizing Wireless Radiance Field via Deformable 2D Gaussian Splatting	Mufan Liu et.al.	2506.12787	null
2025-06-17	Efficient multi-view training for 3D Gaussian Splatting	Minhyuk Choi et.al.	2506.12727	null
2025-06-15	Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors	Wen-Hsuan Chu et.al.	2506.12716	null
2025-06-14	Perceptual-GS: Scene-adaptive Perceptual Densification for Gaussian Splatting	Hongbi Zhou et.al.	2506.12400	link
2025-06-12	Anti-Aliased 2D Gaussian Splatting	Mae Younes et.al.	2506.11252	link
2025-06-12	PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting	Lintao Xiang et.al.	2506.10335	null
2025-06-11	DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos	Chieh Hubert Lin et.al.	2506.09997	null
2025-06-11	UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting	Ziyi Wang et.al.	2506.09952	link
2025-06-11	DynaSplat: Dynamic-Static Gaussian Splatting with Hierarchical Motion Decomposition for Scene Reconstruction	Junli Deng et.al.	2506.09836	null
2025-06-11	Self-Supervised Multi-Part Articulated Objects Modeling via Deformable Gaussian Splatting and Progressive Primitive Segmentation	Haowen Wang et.al.	2506.09663	null
2025-06-11	Gaussian Herding across Pens: An Optimal Transport Perspective on Global Gaussian Reduction for 3DGS	Tao Wang et.al.	2506.09534	null
2025-06-11	HAIF-GS: Hierarchical and Induced Flow-Guided Gaussian Splatting for Dynamic Scene	Jianing Chen et.al.	2506.09518	null
2025-06-11	TinySplat: Feedforward Approach for Generating Compact 3D Scene Representation	Zetian Song et.al.	2506.09479	null
2025-06-12	ODG: Occupancy Prediction Using Dual Gaussians	Yunxiao Shi et.al.	2506.09417	null
2025-06-11	UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images	Qijian Tian et.al.	2506.09378	null
2025-06-10	StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams	Zike Wu et.al.	2506.08862	link
2025-06-11	Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting	Keyi Liu et.al.	2506.08777	null
2025-06-10	SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting	Mengjiao Ma et.al.	2506.08710	null
2025-06-10	TraGraph-GS: Trajectory Graph-based Gaussian Splatting for Arbitrary Large-Scale Scene Rendering	Xiaohan Zhang et.al.	2506.08704	null
2025-06-10	Complex-Valued Holographic Radiance Fields	Yicheng Zhan et.al.	2506.08350	null
2025-06-09	Speedy Deformable 3D Gaussian Splatting: Fast Rendering and Compression of Dynamic Scenes	Allen Tu et.al.	2506.07917	link
2025-06-09	GaussianVAE: Adaptive Learning Dynamics of 3D Gaussians for High-Fidelity Super-Resolution	Shuja Khalid et.al.	2506.07897	null
2025-06-09	R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation	William Ljungbergh et.al.	2506.07826	null
2025-06-09	OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting	Jens Piekenbrinck et.al.	2506.07697	null
2025-06-09	ProSplat: Improved Feed-Forward 3D Gaussian Splatting for Wide-Baseline Sparse Views	Xiaohan Lu et.al.	2506.07670	null
2025-06-09	PIG: Physically-based Multi-Material Interaction with 3D Gaussians	Zeyu Xiao et.al.	2506.07657	null
2025-06-09	Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation	Yijie Deng et.al.	2506.07338	null
2025-06-08	Accelerating 3D Gaussian Splatting with Neural Sorting and Axis-Oriented Rasterization	Zhican Wang et.al.	2506.07069	null
2025-06-08	Hybrid Mesh-Gaussian Representation for Efficient Indoor Scene Reconstruction	Binxiao Huang et.al.	2506.06988	null
2025-06-07	Gaussian Mapping for Evolving Scenes	Vladimir Yugay et.al.	2506.06909	null
2025-06-06	Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments	Mingrui Li et.al.	2506.05965	null
2025-06-06	SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction	Yuchao Zheng et.al.	2506.05935	null
2025-06-06	Lumina: Real-Time Mobile Neural Rendering by Exploiting Computational Redundancy	Yu Feng et.al.	2506.05682	null
2025-06-05	VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction	Ziyue Zhu et.al.	2506.05563	null
2025-06-05	On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images	Andreas Meuleman et.al.	2506.05558	null
2025-06-05	ODE-GS: Latent ODEs for Dynamic Scene Extrapolation with 3D Gaussian Splatting	Daniel Wang et.al.	2506.05480	null
2025-06-05	Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting	Duochao Shi et.al.	2506.05327	null
2025-06-06	Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting	Nan Wang et.al.	2506.05280	link
2025-06-05	Synthetic Dataset Generation for Autonomous Mobile Robots Using 3D Gaussian Splatting for Vision Training	Aneesh Deogan et.al.	2506.05092	null
2025-06-05	UAV4D: Dynamic Neural Rendering of Human-Centric UAV Imagery using Gaussian Splatting	Jaehoon Choi et.al.	2506.05011	null
2025-06-05	Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting	Alfred T. Christiansen et.al.	2506.05009	null
2025-06-05	Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer	Filip Slezak et.al.	2506.04908	null
2025-06-05	Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations	Gaia Di Lorenzo et.al.	2506.04789	null
2025-06-04	Photoreal Scene Reconstruction from an Egocentric Device	Zhaoyang Lv et.al.	2506.04444	link
2025-06-04	HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting	Maksym Ivashechkin et.al.	2506.04351	null
2025-06-04	Pseudo-Simulation for Autonomous Driving	Wei Cao et.al.	2506.04218	link
2025-06-04	FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting	Hengyu Liu et.al.	2506.04174	null
2025-06-04	Splatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data	Ben Moran et.al.	2506.04120	null
2025-06-04	JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting	Yang Xiao et.al.	2506.03872	null
2025-06-04	SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting	Shengjie Lin et.al.	2506.03594	link
2025-06-04	Robust Neural Rendering in the Wild with Asymmetric Dual 3D Gaussian Splatting	Chengqi Li et.al.	2506.03538	null
2025-06-03	Multi-Spectral Gaussian Splatting with Neural Color Representation	Lukas Meyer et.al.	2506.03407	null
2025-06-03	LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM	Roman Titkov et.al.	2506.03073	null
2025-06-03	Large Processor Chip Model	Kaiyan Chang et.al.	2506.02929	null
2025-06-04	Voyager: Real-Time Splatting City-Scale 3D Gaussians on Your Phone	Zheng Liu et.al.	2506.02774	null
2025-06-03	RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS	Chuanyu Fu et.al.	2506.02751	null
2025-06-03	EyeNavGS: A 6-DoF Navigation Dataset and Record-n-Replay Software for Real-World 3DGS Scenes in VR	Zihao Ding et.al.	2506.02380	link
2025-06-02	GSCodec Studio: A Modular Framework for Gaussian Splat Compression	Sicheng Li et.al.	2506.01822	link
2025-06-02	WorldExplorer: Towards Generating Fully Navigable 3D Scenes	Manuel-Andreas Schneider et.al.	2506.01799	null
2025-06-02	WoMAP: World Models For Embodied Open-Vocabulary Object Localization	Tenny Yin et.al.	2506.01600	null
2025-06-02	RadarSplat: Radar Gaussian Splatting for High-Fidelity Data Synthesis and 3D Reconstruction of Autonomous Driving Scenes	Pou-Chun Kung et.al.	2506.01379	null
2025-06-01	CountingFruit: Real-Time 3D Fruit Counting with Language-Guided Semantic Gaussian Splatting	Fengze Li et.al.	2506.01109	null
2025-05-30	AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion	Yangyi Huang et.al.	2505.24877	null
2025-05-30	TC-GS: A Faster Gaussian Splatting Module Utilizing Tensor Cores	Zimu Liao et.al.	2505.24796	link
2025-05-30	Tackling View-Dependent Semantics in 3D Language Gaussian Splatting	Jiazhong Cen et.al.	2505.24746	link
2025-05-30	GARLIC: GAussian Representation LearnIng for spaCe partitioning	Panagiotis Rigas et.al.	2505.24608	null
2025-05-30	LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework	Xin Kang et.al.	2505.24245	null
2025-05-29	3DGEER: Exact and Efficient Volumetric Rendering with 3D Gaussians	Zixun Huang et.al.	2505.24053	link
2025-05-30	ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS	Weijie Wang et.al.	2505.23734	link
2025-05-29	AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views	Lihan Jiang et.al.	2505.23716	null
2025-05-29	Mobi- $π$ : Mobilizing Your Robot Learning Policy	Jingyun Yang et.al.	2505.23692	null
2025-05-29	Radiant Triangle Soup with Soft Connectivity Forces for 3D Reconstruction and Novel View Synthesis	Nathaniel Burgdorfer et.al.	2505.23642	null
2025-05-29	Holistic Large-Scale Scene Reconstruction via Mixed Gaussian Splatting	Chuandong Liu et.al.	2505.23280	link
2025-05-29	LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering	Jonas Kulhanek et.al.	2505.23158	null
2025-05-29	Pose-free 3D Gaussian splatting via shape-ray estimation	Youngju Na et.al.	2505.22978	null
2025-05-28	3DGS Compression with Sparsity-guided Hierarchical Transform Coding	Hao Xu et.al.	2505.22908	null
2025-05-28	CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting	Kornel Howil et.al.	2505.22854	link
2025-05-28	STDR: Spatio-Temporal Decoupling for Real-Time Dynamic Scene Rendering	Zehao Li et.al.	2505.22400	null
2025-05-28	UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments	Wancai Zheng et.al.	2505.22335	null
2025-05-28	Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss	Wenjun Lu et.al.	2505.22279	null
2025-05-28	Hyperspectral Gaussian Splatting	Sunil Kumar Narayanan et.al.	2505.21890	null
2025-05-27	Generalizable and Relightable Gaussian Splatting for Human Novel View Synthesis	Yipengjing Sun et.al.	2505.21502	null
2025-05-27	Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility	Yidi Li et.al.	2505.21377	link
2025-05-27	Structure from Collision	Takuhiro Kaneko et.al.	2505.21335	null
2025-05-29	3D-UIR: 3D Gaussian for Underwater 3D Scene Reconstruction via Physics Based Appearance-Medium Decoupling	Jieyu Yuan et.al.	2505.21238	null
2025-05-28	CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians	Weihang Liu et.al.	2505.21041	null
2025-05-27	Intern-GS: Vision Model Guided Sparse-View 3D Gaussian Splatting	Xiangyu Sun et.al.	2505.20729	null
2025-05-27	Wideband RF Radiance Field Modeling Using Frequency-embedded 3D Gaussian Splatting	Zechen Li et.al.	2505.20714	link
2025-05-26	CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting	Lei Tian et.al.	2505.20469	null
2025-05-26	ParticleGS: Particle-Based Dynamics Modeling of 3D Gaussians for Prior-free Motion Extrapolation	Jinsheng Quan et.al.	2505.20270	link
2025-05-26	HaloGS: Loose Coupling of Compact Geometry and Gaussian Splats for 3D Scenes	Changjian Jiang et.al.	2505.20267	null
2025-05-26	OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender	Shintaro Ito et.al.	2505.20126	link
2025-05-26	Weather-Magician: Reconstruction and Rendering Framework for 4D Weather Synthesis In Real Time	Chen Sang et.al.	2505.19919	null
2025-05-26	Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud	Natsuki Takama et.al.	2505.19854	null
2025-05-26	K-Buffers: A Plug-in Method for Enhancing Neural Fields with Multiple Buffers	Haofan Ren et.al.	2505.19564	link
2025-05-26	ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting	Wenhua Wu et.al.	2505.19420	null
2025-05-25	Improving Novel view synthesis of 360 $^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images	Guangan Chen et.al.	2505.19264	link
2025-05-25	Triangle Splatting for Real-Time Radiance Field Rendering	Jan Held et.al.	2505.19175	null
2025-05-25	FHGS: Feature-Homogenized Gaussian Splatting	Q. G. Duan et.al.	2505.19154	null
2025-05-25	Veta-GS: View-dependent deformable 3D Gaussian Splatting for thermal infrared Novel-view Synthesis	Myeongseok Nam et.al.	2505.19138	null
2025-05-25	VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes	Tianchen Deng et.al.	2505.18992	link
2025-05-23	SplatCo: Structure-View Collaborative Gaussian Splatting for Detail-Preserving Rendering of Large-Scale Unbounded Scenes	Haihong Xiao et.al.	2505.17951	null
2025-05-23	CGS-GAN: 3D Consistent Gaussian Splatting GANs for High Resolution Human Head Synthesis	Florian Barthel et.al.	2505.17590	link
2025-05-23	From Flight to Insight: Semantic 3D Reconstruction for Aerial Inspection via Gaussian Splatting and Language-Guided Segmentation	Mahmoud Chick Zaouali et.al.	2505.17402	null
2025-05-22	Render-FM: A Foundation Model for Real-time Photorealistic Volumetric Rendering	Zhongpai Gao et.al.	2505.17338	null
2025-05-22	SHaDe: Compact and Consistent Dynamic 3D Reconstruction via Tri-Plane Deformation and Latent Diffusion	Asrar Alruwayqi et.al.	2505.16535	null
2025-05-22	Motion Matters: Compact Gaussian Streaming for Free-Viewpoint Video Reconstruction	Jiacong Chen et.al.	2505.16533	null
2025-05-21	RUSplatting: Robust 3D Gaussian Splatting for Sparse-View Underwater Scene Reconstruction	Zhuodong Jiang et.al.	2505.15737	null
2025-05-21	PlantDreamer: Achieving Realistic 3D Plant Models with Diffusion-Guided Gaussian Splatting	Zane K J Hartley et.al.	2505.15528	null
2025-05-21	R3GS: Gaussian Splatting for Robust Reconstruction and Relocalization in Unconstrained Image Collections	Xu yan et.al.	2505.15294	null
2025-05-21	GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation	Yuchen Li et.al.	2505.15287	null
2025-05-21	X-GRM: Large Gaussian Reconstruction Model for Sparse-view X-rays to Computed Tomography	Yifan Liu et.al.	2505.15235	link
2025-05-21	GT^2-GS: Geometry-aware Texture Transfer for Gaussian Splatting	Wenjie Liu et.al.	2505.15208	null
2025-05-21	MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models	Yifan Liu et.al.	2505.15185	link
2025-05-20	Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning	Amine Elhafsi et.al.	2505.14938	null
2025-05-20	Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image	Yuxuan Wang et.al.	2505.14537	null
2025-05-20	MGStream: Motion-aware 3D Gaussian for Streamable Dynamic Scene Reconstruction	Zhenyu Bao et.al.	2505.13839	link
2025-05-19	Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos	Ruoyu Wang et.al.	2505.13440	link
2025-05-19	Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene Representation	Seungjun Oh et.al.	2505.13215	link
2025-05-19	3D Gaussian Adaptive Reconstruction for Fourier Light-Field Microscopy	Chenyu Xu et.al.	2505.12875	null
2025-05-19	TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy	Luyao Lei et.al.	2505.12693	null
2025-05-18	Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey	Calvin Galagain et.al.	2505.12384	null
2025-05-17	GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity	Takuya Ikeda et.al.	2505.11905	null
2025-05-17	MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos	Hongyi Zhou et.al.	2505.11868	null
2025-05-17	Gaussian Splatting as a Unified Representation for Autonomy in Unstructured Environments	Dexter Ong et.al.	2505.11794	null
2025-05-16	Exploiting Radiance Fields for Grasp Generation on Novel Synthetic Views	Abhishek Kashyap et.al.	2505.11467	null
2025-05-16	GrowSplat: Constructing Temporal Digital Twins of Plants with Gaussian Splats	Simeon Adebola et.al.	2505.10923	null
2025-05-16	EA-3DGS: Efficient and Adaptive 3D Gaussians with Highly Enhanced Quality for outdoor scenes	Jianlin Guo et.al.	2505.10787	link
2025-05-14	ExploreGS: a vision-based low overhead framework for 3D scene reconstruction	Yunji Feng et.al.	2505.10578	null
2025-05-15	Consistent Quantity-Quality Control across Scenes for Deployment-Aware Gaussian Splatting	Fengdi Zhang et.al.	2505.10473	link
2025-05-15	VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality	Xuechang Tu et.al.	2505.10144	link
2025-05-15	Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field	Jinlong Fan et.al.	2505.10049	link
2025-05-15	Large-Scale Gaussian Splatting SLAM	Zhe Xin et.al.	2505.09915	null
2025-05-14	Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware	Justin Yu et.al.	2505.09601	null
2025-05-14	Neural Video Compression using 2D Gaussian Splatting	Lakshya Gupta et.al.	2505.09324	null
2025-05-15	NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance	Wenzhe Cai et.al.	2505.08712	null
2025-05-13	DLO-Splatting: Tracking Deformable Linear Objects Using 3D Gaussian Splatting	Holly Dinkel et.al.	2505.08644	null
2025-05-13	FOCI: Trajectory Optimization on Gaussian Splats	Mario Gomez Andreu et.al.	2505.08510	null
2025-05-13	A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering	Chuanzhi Xu et.al.	2505.08438	null
2025-05-13	ADC-GS: Anchor-Driven Deformable and Compressed Gaussian Splatting for Dynamic Scene Reconstruction	He Huang et.al.	2505.08196	link
2025-05-12	SLAG: Scalable Language-Augmented Gaussian Splatting	Laszlo Szilagyi et.al.	2505.08124	null
2025-05-12	GIFStream: 4D Gaussian-based Immersive Video with Feature Stream	Hao Li et.al.	2505.07539	null
2025-05-13	TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset	Olaf Wysocki et.al.	2505.07396	null
2025-05-10	Virtualized 3D Gaussians: Flexible Cluster-based Level-of-Detail System for Real-Time Rendering of Composed Scenes	Xijie Yang et.al.	2505.06523	null
2025-05-08	TeGA: Texture Space Gaussian Avatars for High-Resolution Dynamic Head Modeling	Gengyan Li et.al.	2505.05672	null
2025-05-08	UltraGauss: Ultrafast Gaussian Reconstruction of 3D Ultrasound Volumes	Mark C. Eid et.al.	2505.05643	null
2025-05-08	QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization	Yueh-Cheng Liu et.al.	2505.05591	null
2025-05-08	Steepest Descent Density Control for Compact 3D Gaussian Splatting	Peihao Wang et.al.	2505.05587	null
2025-05-08	SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation	Yonwoo Choi et.al.	2505.05475	link
2025-05-08	Time of the Flight of the Gaussians: Optimizing Depth Indirectly in Dynamic Radiance Fields	Runfeng Li et.al.	2505.05356	null
2025-05-07	SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction	Xinran Yang et.al.	2505.04668	link
2025-05-07	GSsplat: Generalizable Semantic Gaussian Splatting for Novel-view Synthesis in 3D Scenes	Feng Xiao et.al.	2505.04659	link
2025-05-07	Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting	Feng Yang et.al.	2505.04262	null
2025-05-06	3D Gaussian Splatting Data Compression with Mixture of Priors	Lei Liu et.al.	2505.03310	null
2025-05-04	Sparfels: Fast Reconstruction from Sparse Unposed Imagery	Shubhendu Jena et.al.	2505.02178	null
2025-05-04	SparSplat: Fast Multi-View Reconstruction with Generalizable 2D Gaussian Splatting	Shubhendu Jena et.al.	2505.02175	null
2025-05-04	GarmentGS: Point-Cloud Guided Gaussian Splatting for High-Fidelity Non-Watertight 3D Garment Reconstruction	Zhihao Tang et.al.	2505.02126	null
2025-05-04	SignSplat: Rendering Sign Language via Gaussian Splatting	Maksym Ivashechkin et.al.	2505.02108	null
2025-05-03	HybridGS: High-Efficiency Gaussian Splatting Data Compression using Dual-Channel Sparse Representation and Point Cloud Encoder	Qi Yang et.al.	2505.01938	link
2025-05-03	GenSync: A Generalized Talking Head Framework for Audio-driven Multi-Subject Lip-Sync using 3D Gaussian Splatting	Anushka Agarwal et.al.	2505.01928	null
2025-05-03	Visual enhancement and 3D representation for underwater scenes: a review	Guoxi Huang et.al.	2505.01869	null
2025-05-03	AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting	Junhao Shi et.al.	2505.01799	null
2025-05-02	FalconWing: An Open-Source Platform for Ultra-Light Fixed-Wing Aircraft Research	Yan Miao et.al.	2505.01383	null
2025-05-02	Compensating Spatiotemporally Inconsistent Observations for Online Dynamic 3D Gaussian Splatting	Youngsik Yun et.al.	2505.01235	null
2025-04-30	A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond	Jiajia Li et.al.	2505.00737	link
2025-05-01	Real-Time Animatable 2DGS-Avatars with Detail Enhancement from Monocular Videos	Xia Yuan et.al.	2505.00421	null
2025-04-30	HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation	Haiyang Zhou et.al.	2504.21650	link
2025-04-29	GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction	Yuhan Xie et.al.	2504.21067	link
2025-04-29	GaussTrap: Stealthy Poisoning Attacks on 3D Gaussian Splatting for Targeted Scene Confusion	Jiaxin Hong et.al.	2504.20829	null
2025-04-29	EfficientHuman: Efficient Training and Reconstruction of Moving Human using Articulated 2D Gaussian	Hao Tian et.al.	2504.20607	null
2025-04-29	Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting	Hanxi Liu et.al.	2504.20403	null
2025-05-01	GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting	Jongwon Lee et.al.	2504.20379	null
2025-04-29	Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views	Jiang Wu et.al.	2504.20378	link
2025-04-28	Mesh-Learner: Texturing Mesh with Spherical Harmonics	Yunfei Wan et.al.	2504.19938	link
2025-04-28	CE-NPBG: Connectivity Enhanced Neural Point-Based Graphics for Novel View Synthesis in Autonomous Driving Scenes	Mohammad Altillawi et.al.	2504.19557	null
2025-04-28	GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field	Zuxing Lu et.al.	2504.19409	null
2025-04-27	Rendering Anywhere You See: Renderability Field-guided Gaussian Splatting	Xiaofeng Jin et.al.	2504.19261	null
2025-04-30	4DGS-CC: A Contextual Coding Framework for 4D Gaussian Splatting Data Compression	Zicong Chen et.al.	2504.18925	null
2025-04-26	TransparentGS: Fast Inverse Rendering of Transparent Objects with Gaussians	Letian Huang et.al.	2504.18768	null
2025-04-28	RGS-DR: Reflective Gaussian Surfels with Deferred Rendering for Shiny Objects	Georgios Kouros et.al.	2504.18468	null
2025-04-25	STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting	Yunze Deng et.al.	2504.18318	null
2025-04-25	PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models	Michel Gokan Khan et.al.	2504.18165	link
2025-04-24	iVR-GS: Inverse Volume Rendering for Explorable Visualization via Editable 3D Gaussian Splatting	Kaiyuan Tang et.al.	2504.17954	link
2025-04-23	Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning	Mingxuan Cui et.al.	2504.17815	link
2025-04-24	CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos	Shucheng Gong et.al.	2504.17728	link
2025-04-23	Gaussian Splatting is an Effective Data Generator for 3D Object Detection	Farhad G. Zanjani et.al.	2504.16740	null
2025-04-23	PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation	Wenxuan Li et.al.	2504.16693	null
2025-04-23	HUG: Hierarchical Urban Gaussian Splatting with Block-Based Reconstruction	Zhongtao Wang et.al.	2504.16606	null
2025-04-23	ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration	Andrea Conti et.al.	2504.16545	null
2025-04-21	StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians	Cailin Zhuang et.al.	2504.15281	null
2025-04-21	Immersive Teleoperation Framework for Locomanipulation Tasks	Takuya Boehringer et.al.	2504.15229	null
2025-04-21	MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video	Minh-Quan Viet Bui et.al.	2504.15122	null
2025-04-20	IXGS-Intraoperative 3D Reconstruction from Sparse, Arbitrarily Posed Real X-rays	Sascha Jecklin et.al.	2504.14699	null
2025-04-20	NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation	Junyuan Fang et.al.	2504.14638	null
2025-04-20	VGNC: Reducing the Overfitting of Sparse-view 3DGS via Validation-guided Gaussian Number Control	Lifeng Lin et.al.	2504.14548	null
2025-04-20	Metamon-GS: Enhancing Representability with Variance-Guided Densification and Light Encoding	Junyan Su et.al.	2504.14460	null
2025-04-23	SEGA: Drivable 3D Gaussian Head Avatar from a Single Image	Chen Guo et.al.	2504.14373	null
2025-04-21	SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM	Samuel Cerezo et.al.	2504.13713	link
2025-04-18	Green Robotic Mixed Reality with Gaussian Splatting	Chenxuan Liu et.al.	2504.13697	null
2025-04-18	EG-Gaussian: Epipolar Geometry and Graph Network Enhanced 3D Gaussian Splatting	Beizhen Zhao et.al.	2504.13540	null
2025-04-17	Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering	Landon Dyken et.al.	2504.13339	null
2025-04-17	Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation	Sizhe Yang et.al.	2504.13175	null
2025-04-18	ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos	Zetong Zhang et.al.	2504.13167	null
2025-04-17	Digital Twin Generation from Visual Data: A Survey	Andrew Melnik et.al.	2504.13159	link
2025-04-17	Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint Graphs	Shaohui Dai et.al.	2504.13153	link
2025-04-17	CompGS++: Compressed Gaussian Splatting for Static and Dynamic Scene Representation	Xiangrui Liu et.al.	2504.13022	null
2025-04-17	GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration	Rendong Zhang et.al.	2504.12999	link
2025-04-17	Second-order Optimization of Gaussian Splats with Importance Sampling	Hamza Pehlivan et.al.	2504.12905	null
2025-04-17	AAA-Gaussians: Anti-Aliased and Artifact-Free 3D Gaussian Rendering	Michael Steiner et.al.	2504.12811	null
2025-04-17	CAGE-GS: High-fidelity Cage Based 3D Gaussian Splatting Deformation	Yifei Tong et.al.	2504.12800	null
2025-04-17	TSGS: Improving Gaussian Splatting for Transparent Surface Reconstruction via Normal and De-lighting Priors	Mingwei Li et.al.	2504.12799	null
2025-04-16	CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting	Wei Sun et.al.	2504.11893	null
2025-04-16	3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians	Zeming Wei et.al.	2504.11218	link
2025-04-15	Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation	Andrea Simonelli et.al.	2504.11024	null
2025-04-15	3D Gabor Splatting: Reconstruction of High-frequency Surface Texture using Gabor Noise	Haato Watanabe et.al.	2504.11003	null
2025-04-15	GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR	Christophe Bolduc et.al.	2504.10809	null
2025-04-14	DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting	Zeren Jiang et.al.	2504.10486	link
2025-04-15	LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis	Hao Sun et.al.	2504.10331	null
2025-04-14	ESCT3D: Efficient and Selectively Controllable Text-Driven 3D Content Generation with Gaussian Splatting	Huiqi Wu et.al.	2504.10316	null
2025-04-14	EBAD-Gaussian: Event-driven Bundle Adjusted Deblur Gaussian Splatting	Yufei Deng et.al.	2504.10012	null
2025-04-16	GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting	Junlin Hao et.al.	2504.10001	null
2025-04-14	MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling	Yunpeng Tan et.al.	2504.09878	null
2025-04-13	TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting	Zhicong Wu et.al.	2504.09588	null
2025-04-13	DropoutGS: Dropping Out Gaussians for Better Sparse-view Rendering	Yexing Xu et.al.	2504.09491	null
2025-04-12	A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds	Jizong Peng et.al.	2504.09129	null
2025-04-12	BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting	Jeongwan On et.al.	2504.09097	null
2025-04-11	FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents	Xin Tan et.al.	2504.08581	null
2025-04-11	Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation	Bram Vanherle et.al.	2504.08473	link
2025-04-11	In-2-4D: Inbetweening from Two Single-View Images to 4D Generation	Sauradip Nag et.al.	2504.08366	null
2025-04-10	ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting	Junbang Liu et.al.	2504.08100	link
2025-04-10	InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians	Kefan Chen et.al.	2504.07949	null
2025-04-10	View-Dependent Uncertainty Estimation of 3D Gaussian Splatting	Chenyu Han et.al.	2504.07370	null
2025-04-09	Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting	Daiwei Zhang et.al.	2504.06978	null
2025-04-09	IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments	Can Zhang et.al.	2504.06827	null
2025-04-09	SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering	Hanxiao Sun et.al.	2504.06815	link
2025-04-09	GSta: Efficient Training Scheme with Siestaed Gaussians for Monocular 3D Scene Reconstruction	Anil Armagan et.al.	2504.06716	null
2025-04-09	Collision avoidance from monocular vision trained with novel view synthesis	Valentin Tordjman–Levavasseur et.al.	2504.06651	null
2025-04-10	Stochastic Ray Tracing of 3D Transparent Gaussians	Xin Sun et.al.	2504.06598	null
2025-04-08	Micro-splatting: Maximizing Isotropic Constraints for Refined Optimization in 3D Gaussian Splatting	Jee Won Lee et.al.	2504.05740	null
2025-04-07	View-Dependent Deformation Fields for 2D Editing of 3D Models	Martin El Mqirmi et.al.	2504.05544	null
2025-04-07	L3GS: Layered 3D Gaussian Splats for Efficient 3D Scene Delivery	Yi-Zhen Tsai et.al.	2504.05517	link
2025-04-07	Let it Snow! Animating Static Gaussian Scenes With Dynamic Weather Effects	Gal Fiebelman et.al.	2504.05296	null
2025-04-07	PanoDreamer: Consistent Text to 360-Degree Scene Generation	Zhexiao Xiong et.al.	2504.05152	null
2025-04-07	3D Gaussian Particle Approximation of VDB Datasets: A Study for Scientific Visualization	Isha Sharma et.al.	2504.04857	null
2025-04-07	Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM	Zhicong Sun et.al.	2504.04844	link
2025-04-07	DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal	Wanzhou Liu et.al.	2504.04679	null
2025-04-06	Tool-as-Interface: Learning Robot Policies from Human Tool Usage through Imitation Learning	Haonan Chen et.al.	2504.04612	null
2025-04-06	Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models	Etienne Chassaing et.al.	2504.04448	null
2025-04-05	3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS	Zhisheng Huang et.al.	2504.04294	null
2025-04-05	Interpretable Single-View 3D Gaussian Splatting using Unsupervised Hierarchical Disentangled Representation Learning	Yuyang Zhang et.al.	2504.04190	null
2025-04-04	WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments	Jianhao Zheng et.al.	2504.03886	null
2025-04-04	HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration	Boyuan Wang et.al.	2504.03536	null
2025-04-03	Compressing 3D Gaussian Splatting by Noise-Substituted Vector Quantization	Haishan Wang et.al.	2504.03059	link
2025-04-03	MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM	Renwu Li et.al.	2504.02437	null
2025-04-03	ConsDreamer: Advancing Multi-View Consistency for Zero-Shot Text-to-3D Generation	Yuan Zhou et.al.	2504.02316	link
2025-04-03	Digital-twin imaging based on descattering Gaussian splatting	Suguru Shimomura et.al.	2504.02278	null
2025-04-02	UAVTwin: Neural Digital Twins for UAVs using Gaussian Splatting	Jaehoon Choi et.al.	2504.02158	null
2025-04-02	WorldPrompter: Traversable Text-to-Scene Generation	Zhaoyang Zhang et.al.	2504.02045	null
2025-04-02	Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis	Niluthpol Chowdhury Mithun et.al.	2504.01960	null
2025-04-03	Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting	Shu-Wei Lu et.al.	2504.01957	null
2025-04-02	BOGausS: Better Optimized Gaussian Splatting	Stéphane Pateux et.al.	2504.01844	null
2025-04-02	FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking	Ulas Gunes et.al.	2504.01732	null
2025-04-02	FlowR: Flowing from Sparse to Dense 3D Reconstructions	Tobias Fischer et.al.	2504.01647	null
2025-04-02	3DBonsai: Structure-Aware Bonsai Modeling Using Conditioned 3D Gaussian Splatting	Hao Wu et.al.	2504.01619	null
2025-04-02	RealityAvatar: Towards Realistic Loose Clothing Modeling in Animatable 3D Gaussian Avatars	Yahui Li et.al.	2504.01559	null
2025-04-02	High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model	Yiyang Shen et.al.	2504.01512	null
2025-04-02	Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment	Ziteng Cui et.al.	2504.01503	link
2025-04-02	3D Gaussian Inverse Rendering with Approximated Global Illumination	Zirui Wu et.al.	2504.01358	null
2025-03-31	Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views	Chong Bao et.al.	2503.24382	null
2025-03-31	ERUPT: Efficient Rendering with Unposed Patch Transformer	Maxim V. Shugaev et.al.	2503.24374	null
2025-03-31	StochasticSplats: Stochastic Rasterization for Sorting-Free 3D Gaussian Splatting	Shakiba Kheradmand et.al.	2503.24366	null
2025-04-01	Visual Acoustic Fields	Yuelei Li et.al.	2503.24270	null
2025-03-31	DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting	Seungjun Lee et.al.	2503.24210	null
2025-03-31	Learning 3D-Gaussian Simulators from RGB Videos	Mikel Zhobro et.al.	2503.24009	null
2025-03-31	ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image	Tianyi Gong et.al.	2503.23881	null
2025-03-30	Gaussian Blending Unit: An Edge GPU Plug-in for Real-Time Gaussian-Based Rendering in AR/VR	Zhifan Ye et.al.	2503.23625	null
2025-03-30	Enhancing 3D Gaussian Splatting Compression via Spatial Condition-based Prediction	Jingui Ma et.al.	2503.23337	null
2025-03-30	ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning	Zhenyang Liu et.al.	2503.23297	null
2025-03-28	TranSplat: Lighting-Consistent Cross-Scene Object Transfer with 3D Gaussian Splatting	Boyang et.al.	2503.22676	null
2025-03-28	Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis	Shuai Shen et.al.	2503.22605	null
2025-03-28	EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting	Xu Wang et.al.	2503.22437	link
2025-03-28	AH-GS: Augmented 3D Gaussian Splatting for High-Frequency Detail Representation	Chenyang Xu et.al.	2503.22324	null
2025-03-28	Follow Your Motion: A Generic Temporal Consistency Portrait Editing Framework with Trajectory Guidance	Haijie Yang et.al.	2503.22225	null
2025-03-28	ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting	Wenjie Liu et.al.	2503.22218	null
2025-03-28	Segment then Splat: A Unified Approach for 3D Open-Vocabulary Segmentation based on Gaussian Splatting	Yiren Lu et.al.	2503.22204	null
2025-03-28	Disentangled 4D Gaussian Splatting: Towards Faster and More Efficient Dynamic Scene Rendering	Hao Feng et.al.	2503.22159	null
2025-03-27	X $^{2}$ -Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction	Weihao Yu et.al.	2503.21779	null
2025-03-27	Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying	Hairong Yin et.al.	2503.21767	null
2025-03-27	RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting	Qiyu Dai et.al.	2503.21442	null
2025-03-28	LandMarkSystem Technical Report	Zhenxiang Ma et.al.	2503.21364	link
2025-03-27	Frequency-Aware Gaussian Splatting Decomposition	Yishai Lavi et.al.	2503.21226	null
2025-03-27	StyledStreets: Multi-style Street Simulator with Spatial and Temporal Consistency	Yuyin Chen et.al.	2503.21104	null
2025-03-26	PGC: Physics-Based Gaussian Cloth from a Single Pose	Michelle Guo et.al.	2503.20779	null
2025-03-28	Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields	Shijie Zhou et.al.	2503.20776	null
2025-03-26	TC-GS: Tri-plane based compression for 3D Gaussian Splatting	Taorui Wang et.al.	2503.20221	link
2025-03-26	EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis	Sheng Miao et.al.	2503.20168	null
2025-03-25	Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields	Navami Kairanda et.al.	2503.19976	null
2025-03-26	A Survey on Event-driven 3D Reconstruction: Development under Different Categories	Chuanzhi Xu et.al.	2503.19753	null
2025-03-25	High-Quality Spatial Reconstruction and Orthoimage Generation Using Efficient 2D Gaussian Splatting	Qian Wang et.al.	2503.19703	null
2025-03-25	GaussianUDF: Inferring Unsigned Distance Functions through 3D Gaussian Splatting	Shujuan Li et.al.	2503.19458	null
2025-03-25	SparseGS-W: Sparse-View 3D Gaussian Splatting in the Wild with Generative Priors	Yiqing Li et.al.	2503.19452	null
2025-03-26	COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting	Jiaxin Zhang et.al.	2503.19443	link
2025-03-25	From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting	Zhiwei Huang et.al.	2503.19358	null
2025-03-25	Divide-and-Conquer: Dual-Hierarchical Optimization for Semantic 4D Gaussian Spatting	Zhiying Yan et.al.	2503.19332	null
2025-03-25	MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection	Jee Won Lee et.al.	2503.19330	null
2025-03-25	HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting	Xinpeng Liu et.al.	2503.19232	link
2025-03-24	NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting	Yulong Zheng et.al.	2503.18794	null
2025-03-24	GS-Marker: Generalizable and Robust Watermarking for 3D Gaussian Splatting	Lijiang Li et.al.	2503.18718	null
2025-03-24	Hardware-Rasterized Ray-Based Gaussian Splatting	Samuel Rota Bulò et.al.	2503.18682	null
2025-03-24	LLGS: Unsupervised Gaussian Splatting for Image Enhancement and Reconstruction in Pure Dark Environment	Haoran Wang et.al.	2503.18640	null
2025-03-25	StableGS: A Floater-Free Framework for 3D Gaussian Splatting	Luchao Wang et.al.	2503.18458	null
2025-03-24	4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video	Qiang Hu et.al.	2503.18421	null
2025-03-24	DashGaussian: Optimizing 3D Gaussian Splatting in 200 Seconds	Youyu Chen et.al.	2503.18402	null
2025-03-24	GI-SLAM: Gaussian-Inertial SLAM	Xulang Liu et.al.	2503.18275	null
2025-03-23	Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving	Junhao Ge et.al.	2503.18108	link
2025-03-23	PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding	Hongjia Zhai et.al.	2503.18107	null
2025-03-21	TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting	Jianchuan Chen et.al.	2503.17032	null
2025-03-21	Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting	Jinbo Yan et.al.	2503.16979	link
2025-03-21	DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery	Jiadong Tang et.al.	2503.16964	null
2025-03-21	Optimized Minimal 3D Gaussian Splatting	Joo Chan Lee et.al.	2503.16924	null
2025-03-20	SAGE: Semantic-Driven Adaptive Gaussian Splatting in Extended Reality	Chiara Schiavo et.al.	2503.16747	null
2025-03-20	4D Gaussian Splatting SLAM	Yanyan Li et.al.	2503.16710	null
2025-03-20	GauRast: Enhancing GPU Triangle Rasterizers to Accelerate 3D Gaussian Splatting	Sixu Li et.al.	2503.16681	null
2025-03-20	1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering	Yuheng Yuan et.al.	2503.16422	null
2025-03-20	M3: 3D-Spatial MultiModal Memory	Xueyan Zou et.al.	2503.16413	link
2025-03-20	Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images	Shengjun Zhang et.al.	2503.16338	null
2025-03-20	OccluGaussian: Occlusion-Aware Gaussian Splatting for Large Scene Reconstruction and Rendering	Shiyong Liu et.al.	2503.16177	null
2025-03-20	Enhancing Close-up Novel View Synthesis via Pseudo-labeling	Jiatong Xia et.al.	2503.15908	link
2025-03-20	VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling	Hyojun Go et.al.	2503.15855	null
2025-03-20	BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting	Yiren Lu et.al.	2503.15835	null
2025-03-18	HandSplat: Embedding-Driven Gaussian Splatting for High-Fidelity Hand Rendering	Yilan Dong et.al.	2503.14736	null
2025-03-18	SplatVoxel: History-Aware Novel View Streaming without Temporal Training	Yiming Wang et.al.	2503.14698	null
2025-03-18	Optimized 3D Gaussian Splatting using Coarse-to-Fine Image Frequency Modulation	Umar Farooq et.al.	2503.14475	null
2025-03-18	Improving Adaptive Density Control for 3D Gaussian Splatting	Glenn Grubert et.al.	2503.14274	link
2025-03-18	RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images	Junjin Xiao et.al.	2503.14198	link
2025-03-18	Lightweight Gradient-Aware Upscaling of 3D Gaussian Splatting Images	Simon Niedermayr et.al.	2503.14171	null
2025-03-18	Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting	Runsong Zhu et.al.	2503.14029	link
2025-03-18	Light4GS: Lightweight Compact 4D Gaussian Splatting Generation via Context Model	Mufan Liu et.al.	2503.13948	null
2025-03-17	Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors	Katja Schwarz et.al.	2503.13272	null
2025-03-17	DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction	Rui Wang et.al.	2503.13176	null
2025-03-17	Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization	Yiwei Xu et.al.	2503.13086	null
2025-03-17	CAT-3DGS Pro: A New Benchmark for Efficient 3DGS Compression	Yu-Ting Zhan et.al.	2503.12862	null
2025-03-17	CompMarkGS: Robust Watermarking for Compression 3D Gaussian Splatting	Sumin In et.al.	2503.12836	null
2025-03-17	AV-Surf: Surface-Enhanced Geometry-Aware Novel-View Acoustic Synthesis	Hadam Baek et.al.	2503.12806	null
2025-03-16	Deblur Gaussian Splatting SLAM	Francesco Girlanda et.al.	2503.12572	null
2025-03-16	MTGS: Multi-Traversal Gaussian Splatting	Tianyu Li et.al.	2503.12552	link
2025-03-16	SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs	Guibiao Liao et.al.	2503.12535	null
2025-03-16	VRsketch2Gaussian: 3D VR Sketch Guided 3D Object Generation with Gaussian Splatting	Songen Gu et.al.	2503.12383	null
2025-03-14	Advancing 3D Gaussian Splatting Editing with Complementary and Consensus Information	Xuanqi Zhang et.al.	2503.11601	null
2025-03-14	EgoSplat: Open-Vocabulary Egocentric Scene Understanding with Language Embedded 3D Gaussian Splatting	Di Li et.al.	2503.11345	null
2025-03-14	Uncertainty-Aware Normal-Guided Gaussian Splatting for Surface Reconstruction from Sparse Image Sequences	Zhen Tan et.al.	2503.11172	null
2025-03-13	RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors	Avinash Paliwal et.al.	2503.10860	link
2025-03-13	LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds	Lingteng Qiu et.al.	2503.10625	link
2025-03-13	MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction	Yingshuang Zou et.al.	2503.10604	null
2025-03-13	4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models	Wanhua Li et.al.	2503.10437	link
2025-03-13	VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames	Zhiqi Li et.al.	2503.10286	null
2025-03-13	ROODI: Reconstructing Occluded Objects with Denoising Inpainters	Yeonjin Chang et.al.	2503.10256	null
2025-03-13	GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction	Jianheng Liu et.al.	2503.10170	link
2025-03-13	3D Student Splatting and Scooping	Jialin Zhu et.al.	2503.10148	link
2025-03-13	GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping	Jinfeng Liu et.al.	2503.10143	null
2025-03-12	Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation	Máté Tóth et.al.	2503.09464	null
2025-03-12	Online Language Splatting	Saimouli Katragadda et.al.	2503.09447	null
2025-03-12	Close-up-GS: Enhancing Close-Up View Synthesis in 3D Gaussian Splatting with Progressive Self-Training	Jiatong Xia et.al.	2503.09396	null
2025-03-12	GASPACHO: Gaussian Splatting for Controllable Humans and Objects	Aymen Mir et.al.	2503.09342	null
2025-03-12	SDD-4DGS: Static-Dynamic Aware Decoupling in Gaussian Splatting for 4D Scene Reconstruction	Dai Sun et.al.	2503.09332	null
2025-03-12	Motion Blender Gaussian Splatting for Dynamic Reconstruction	Xinyu Zhang et.al.	2503.09040	link
2025-03-11	PCGS: Progressive Compression of 3D Gaussian Splatting	Yihang Chen et.al.	2503.08511	link
2025-03-11	TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting	Fengyi Zhang et.al.	2503.08485	null
2025-03-11	Mitigating Ambiguities in 3D Classification with Gaussian Splatting	Ruiqi Zhang et.al.	2503.08352	null
2025-03-11	Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios	Zikang Yuan et.al.	2503.08317	null
2025-03-11	ELECTRA: A Symmetry-breaking Cartesian Network for Charge Density Prediction with Floating Orbitals	Jonas Elsborg et.al.	2503.08305	link
2025-03-11	HRAvatar: High-Quality and Relightable Gaussian Head Avatar	Dongbin Zhang et.al.	2503.08224	null
2025-03-11	S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction	Guangting Zheng et.al.	2503.08217	null
2025-03-11	Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming	Jiaxuan Zhu et.al.	2503.08166	null
2025-03-11	ArticulatedGS: Self-supervised Digital Twin Modeling of Articulated Objects using 3D Gaussian Splatting	Junfu Guo et.al.	2503.08135	null
2025-03-11	MVGSR: Multi-View Consistency Gaussian Splatting for Robust Surface Reconstruction	Chenfeng Hou et.al.	2503.08093	null
2025-03-10	SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting	Jiahui Zhang et.al.	2503.07476	null
2025-03-10	EigenGS Representation: From Eigenspace to Gaussian Image Space	Lo-Wei Tai et.al.	2503.07446	null
2025-03-10	All That Glitters Is Not Gold: Key-Secured 3D Secrets within 3D Gaussian Splatting	Yan Ren et.al.	2503.07191	link
2025-03-10	Multi-Modal 3D Mesh Reconstruction from Images and Text	Melvin Reka et.al.	2503.07190	null
2025-03-10	Frequency-Aware Density Control via Reparameterization for High-Quality Rendering of 3D Gaussian Splatting	Zhaojie Zeng et.al.	2503.07000	link
2025-03-10	DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation	Xiaoliang Ju et.al.	2503.06900	null
2025-03-10	ActiveInitSplat: How Active Image Selection Helps Gaussian Splatting	Konstantinos D. Polyzos et.al.	2503.06859	null
2025-03-09	Gaussian RBFNet: Gaussian Radial Basis Functions for Fast and Accurate Representation and Reconstruction of Neural Fields	Abdelaziz Bouzidi et.al.	2503.06762	null
2025-03-09	CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving	Rui Song et.al.	2503.06744	null
2025-03-09	D3DR: Lighting-Aware Object Insertion in Gaussian Splatting	Vsevolod Skorokhodov et.al.	2503.06740	null
2025-03-07	D2GV: Deformable 2D Gaussian Splatting for Video Representation in 400FPS	Mufan Liu et.al.	2503.05600	link
2025-03-07	Free Your Hands: Lightweight Relightable Turntable Capture Pipeline	Jiahui Fan et.al.	2503.05511	null
2025-03-07	LiDAR-enhanced 3D Gaussian Splatting Mapping	Jian Shen et.al.	2503.05425	null
2025-03-07	Self-Modeling Robots by Photographing	Kejun Hu et.al.	2503.05398	null
2025-03-07	CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images	Jungho Lee et.al.	2503.05332	link
2025-03-07	STGA: Selective-Training Gaussian Head Avatars	Hanzhi Guo et.al.	2503.05196	null
2025-03-07	Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects	Justin Yu et.al.	2503.05189	null
2025-03-07	MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions	Qingyuan Zhou et.al.	2503.05182	null
2025-03-07	SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting	Linqi Yang et.al.	2503.05174	null
2025-03-07	SeeLe: A Unified Acceleration Framework for Real-Time Gaussian Splatting	Xiaotong Huang et.al.	2503.05168	null
2025-03-06	GaussianVideo: Efficient Video Representation and Compression by Gaussian Splatting	Inseo Lee et.al.	2503.04333	null
2025-03-06	S2Gaussian: Sparse-View Super-Resolution 3D Gaussian Splatting	Yecong Wan et.al.	2503.04314	null
2025-03-06	Instrument-Splatting: Controllable Photorealistic Reconstruction of Surgical Instruments Using Gaussian Splatting	Shuojue Yang et.al.	2503.04082	null
2025-03-06	Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details	Yifei Gao et.al.	2503.04037	null
2025-03-06	GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding	Xihan Wang et.al.	2503.04034	null
2025-03-06	GRaD-Nav: Efficiently Learning Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics	Qianzhong Chen et.al.	2503.03984	null
2025-03-05	LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation	Qian Feng et.al.	2503.03890	null
2025-03-05	NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics	Kun Yang et.al.	2503.03115	null
2025-03-04	2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting	Qipeng Yan et.al.	2503.02452	null
2025-03-04	DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting	Haoyuan Li et.al.	2503.02223	link
2025-03-03	Morpheus: Text-Driven 3D Gaussian Splat Shape and Color Stylization	Jamie Wynn et.al.	2503.02009	null
2025-03-03	Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models	Jay Zhangjie Wu et.al.	2503.01774	null
2025-03-03	OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding	Dianyi Yang et.al.	2503.01646	null
2025-03-03	LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training	Kaimin Liao et.al.	2503.01199	link
2025-03-03	FGS-SLAM: Fourier-based Gaussian Splatting for Real-time SLAM with Sparse and Dense Map Fusion	Yansong Xu et.al.	2503.01109	null
2025-03-02	Evolving High-Quality Rendering and Reconstruction in a Unified Framework with Contribution-Adaptive Regularization	You Shen et.al.	2503.00881	null
2025-03-02	Vid2Fluid: 3D Dynamic Fluid Assets from Single-View Videos with Generative Gaussian Splatting	Zhiwei Zhao et.al.	2503.00868	null
2025-03-02	PSRGS:Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery	BoCheng Li et.al.	2503.00848	null
2025-03-03	FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering	Jingqiu Zhou et.al.	2502.21093	null
2025-02-28	EndoPBR: Material and Lighting Estimation for Photorealistic Surgical Simulations via Physically-based Rendering	John J. Han et.al.	2502.20669	null
2025-02-27	ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting	Dexter Ong et.al.	2502.20386	null
2025-02-27	Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling	Hanyang Kong et.al.	2502.20378	null
2025-02-27	No Parameters, No Problem: 3D Gaussian Splatting without Camera Intrinsics and Extrinsics	Dongbo Shi et.al.	2502.19800	null
2025-02-27	Open-Vocabulary Semantic Part Segmentation of 3D Human	Keito Suzuki et.al.	2502.19782	null
2025-02-26	Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting	Yu Liu et.al.	2502.19459	link
2025-02-26	Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions	Muhammad Salman Ali et.al.	2502.19457	null
2025-02-26	Does 3D Gaussian Splatting Need Accurate Volumetric Rendering?	Adam Celarek et.al.	2502.19318	link
2025-02-28	OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation	Yunpeng Gao et.al.	2502.18041	null
2025-02-27	UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting	Haoyuan Li et.al.	2502.17860	null
2025-02-24	Laplace-Beltrami Operator for Gaussian Splatting	Hongyu Zhou et.al.	2502.17531	null
2025-02-24	Graph-Guided Scene Reconstruction from Images with 3D Gaussian Splatting	Chong Cheng et.al.	2502.17377	null
2025-02-25	GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow	Simon Boeder et.al.	2502.17288	null
2025-02-24	VR-Pipe: Streamlining Hardware Graphics Pipeline for Volume Rendering	Junseo Lee et.al.	2502.17078	null
2025-02-23	GS-TransUNet: Integrated 2D Gaussian Splatting and Transformer UNet for Accurate Skin Lesion Analysis	Anand Kumar et.al.	2502.16748	null
2025-02-23	Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration	Kim Jun-Seong et.al.	2502.16652	null
2025-02-23	Dragen3D: Multiview Geometry Consistent 3D Gaussian Generation with Drag-Based Control	Jinbo Yan et.al.	2502.16475	null
2025-02-21	RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes	Sicheng Yu et.al.	2502.15633	null
2025-02-24	DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation	Luzhou Ge et.al.	2502.15309	link
2025-02-20	GS-Cache: A GS-Cache Inference Framework for Large-scale Gaussian Splatting Models	Miao Tao et.al.	2502.14938	null
2025-02-20	Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2502.14931	null
2025-02-20	CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting	Qilin Zhang et.al.	2502.14684	link
2025-02-20	OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving	Yedong Shen et.al.	2502.14235	null
2025-02-19	GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian	Bang Du et.al.	2502.14129	null
2025-02-19	Inter3D: A Benchmark and Strong Baseline for Human-Interactive 3D Object Reconstruction	Gan Chen et.al.	2502.14004	link
2025-02-19	3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments	Vincent Ress et.al.	2502.13803	null
2025-02-18	GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis	Pedro Martin et.al.	2502.13196	null
2025-02-18	RadSplatter: Extending 3D Gaussian Splatting to Radio Frequencies for Wireless Radiomap Extrapolation	Yiheng Wang et.al.	2502.12686	null
2025-02-17	PUGS: Zero-shot Physical Understanding with Gaussian Splatting	Yinghao Shuai et.al.	2502.12231	link
2025-02-17	3D Gaussian Inpainting with Depth-Guided Cross-View Consistency	Sheng-Yu Huang et.al.	2502.11801	null
2025-02-17	Exploring the Versal AI Engine for 3D Gaussian Splatting	Kotaro Shimamura et.al.	2502.11782	null
2025-02-17	GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text	Gyumin Shim et.al.	2502.11642	null
2025-02-16	OMG: Opacity Matters in Material Modeling with Gaussian Splatting	Silong Yong et.al.	2502.10988	null
2025-02-16	GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting	Zelin Zhou et.al.	2502.10975	null
2025-02-15	E-3DGS: Event-Based Novel View Rendering of Large-Scale Scenes Using 3D Gaussian Splatting	Sohaib Zahid et.al.	2502.10827	null
2025-02-13	X-SG $^2$ S: Safe and Generalizable Gaussian Splatting with X-dimensional Watermarks	Zihang Cheng et.al.	2502.10475	null
2025-02-13	Self-Calibrating Gaussian Splatting for Large Field of View Reconstruction	Youming Deng et.al.	2502.09563	null
2025-02-13	DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior	Mingrui Li et.al.	2502.09111	null
2025-02-13	Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting	Lingting Zhu et.al.	2502.09039	link
2025-02-12	Interactive Holographic Visualization for 3D Facial Avatar	Tri Tung Nguyen Nguyen et.al.	2502.08085	null
2025-02-11	TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation	Jeongyun Kim et.al.	2502.07840	link
2025-02-11	MeshSplats: Mesh-Based Rendering with Gaussian Splatting Initialization	Rafał Tobiasz et.al.	2502.07754	link
2025-02-11	Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors	Lin-Zhuo Chen et.al.	2502.07615	null
2025-02-10	Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC	Siwei Meng et.al.	2502.07007	null
2025-02-10	SIREN: Semantic, Initialization-Free Registration of Multi-Robot Gaussian Splatting Maps	Ola Shorinwa et.al.	2502.06519	null
2025-02-10	Three-Dimensional MRI Reconstruction with Gaussian Representations: Tackling the Undersampling Problem	Tengya Peng et.al.	2502.06510	null
2025-02-11	Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform	Kyle Gao et.al.	2502.05769	null
2025-02-09	PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map	Yue Pan et.al.	2502.05752	link
2025-02-08	Vision-in-the-loop Simulation for Deep Monocular Pose Estimation of UAV in Ocean Environment	Maneesha Wickramasuriya et.al.	2502.05409	null
2025-02-07	AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting	Chung-Ho Wu et.al.	2502.05176	null
2025-02-07	GaussRender: Learning 3D Occupancy with Gaussian Rendering	Loick Chambon et.al.	2502.05040	link
2025-02-07	OccGS: Zero-shot 3D Occupancy Reconstruction with Semantic and Geometric-Aware Gaussian Splatting	Xiaoyu Zhou et.al.	2502.04981	null
2025-02-07	PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression	Feifei Li et.al.	2502.04843	null
2025-02-07	SC-OmniGS: Self-Calibrating Omnidirectional Gaussian Splatting	Huajian Huang et.al.	2502.04734	null
2025-02-07	High-Speed Dynamic 3D Imaging with Sensor Fusion Splatting	Zihao Zou et.al.	2502.04630	null
2025-02-05	GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM	Mingrui Li et.al.	2502.03228	null
2025-02-05	GP-GS: Gaussian Processes for Enhanced Gaussian Splatting	Zhihao Guo et.al.	2502.02283	link
2025-02-04	LAYOUTDREAMER: Physics-guided Layout for Text-to-3D Compositional Scene Generation	Yang Zhou et.al.	2502.01949	null
2025-02-03	UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping	Aashish Rai et.al.	2502.01846	null
2025-02-03	Scalable 3D Gaussian Splatting-Based RF Signal Spatial Propagation Modeling	Kang Yang et.al.	2502.01826	null
2025-02-03	VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion	Shaoting Zhu et.al.	2502.01536	null
2025-02-03	Radiant Foam: Real-Time Differentiable Ray Tracing	Shrisudhan Govindarajan et.al.	2502.01157	null
2025-02-02	EmoTalkingGaussian: Continuous Emotion-conditioned Talking Head Synthesis	Junuk Cha et.al.	2502.00654	null
2025-01-31	Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation	Rohan Chacko et.al.	2502.00173	null
2025-01-31	Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping	Yiming Huang et.al.	2501.19319	link
2025-01-31	RaySplats: Ray Tracing based Gaussian Splatting	Krzysztof Byrski et.al.	2501.19196	link
2025-01-31	JGHand: Joint-Driven Animatable Hand Avater via 3D Gaussian Splatting	Zhoutao Sun et.al.	2501.19088	null
2025-01-30	Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting	Yansong Qu et.al.	2501.18672	null
2025-01-29	3D Reconstruction of Shoes for Augmented Reality	Pratik Shrestha et.al.	2501.18643	null
2025-01-31	VoD-3DGS: View-opacity-Dependent 3D Gaussian Splatting	Mateusz Nowak et.al.	2501.17978	null
2025-01-29	CrowdSplat: Exploring Gaussian Splatting For Crowd Rendering	Xiaohan Sun et.al.	2501.17792	link
2025-01-29	FeatureGS: Eigenvalue-Feature Optimization in 3D Gaussian Splatting for Geometrically Accurate and Artifact-Reduced Reconstruction	Miriam Jäger et.al.	2501.17655	null
2025-01-28	Evaluating CrowdSplat: Perceived Level of Detail for Gaussian Crowds	Xiaohan Sun et.al.	2501.17085	null
2025-01-28	DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation	Chenguo Lin et.al.	2501.16764	null
2025-01-26	GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting	Jiajun Dong et.al.	2501.15619	link
2025-01-25	Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos	Zhen-Hui Dong et.al.	2501.15096	null
2025-01-25	HuGDiffusion: Generalizable Single-Image Human Rendering via 3D Gaussian Diffusion	Yingzhi Tang et.al.	2501.15008	null
2025-01-24	Trick-GS: A Balanced Bag of Tricks for Efficient Gaussian Splatting	Anil Armagan et.al.	2501.14534	null
2025-01-24	Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video	Xiaohao Xu et.al.	2501.14319	link
2025-01-24	Dense-SfM: Structure from Motion with Dense Consistent Matching	JongMin Lee et.al.	2501.14277	null
2025-01-24	Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images	Yihui Li et.al.	2501.14231	null
2025-01-24	HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting	Javier Yu et.al.	2501.14147	null
2025-01-23	GoDe: Gaussians on Demand for Progressive Level of Detail and Scalable Compression	Francesco Di Sario et.al.	2501.13558	null
2025-01-23	MultiDreamer3D: Multi-concept 3D Customization with Concept-Aware Diffusion Guidance	Wooseok Song et.al.	2501.13449	null
2025-01-23	GeomGS: LiDAR-Guided Geometry-Aware Gaussian Splatting for Robot Localization	Jaewon Lee et.al.	2501.13417	null
2025-01-23	VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM	Gyuhyeon Pak et.al.	2501.13402	null
2025-01-23	Deblur-Avatar: Animatable Avatars from Motion-Blurred Monocular Videos	Xianrui Luo et.al.	2501.13335	null
2025-01-22	Sketch and Patch: Efficient 3D Gaussian Representation for Man-Made Scenes	Yuang Shi et.al.	2501.13045	null
2025-01-21	DARB-Splatting: Generalizing Splatting with Decaying Anisotropic Radial Basis Functions	Vishagar Arunan et.al.	2501.12369	null
2025-01-22	HAC++: Towards 100X Compression of 3D Gaussian Splatting	Yihang Chen et.al.	2501.12255	link
2025-01-22	GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting	Longan Wang et.al.	2501.12060	null
2025-01-20	See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic Regularization	Zongqi He et.al.	2501.11508	null
2025-01-19	RDG-GS: Relative Depth Guidance with Gaussian Splatting for Real-time Sparse-View 3D Rendering	Chenlu Zhan et.al.	2501.11102	null
2025-01-18	Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting	Jiaqi Lin et.al.	2501.10788	null
2025-01-15	BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation	Xiaolu Hou et.al.	2501.10462	link
2025-01-20	GSTAR: Gaussian Surface Tracking and Reconstruction	Chengwei Zheng et.al.	2501.10283	null
2025-01-16	Creating Virtual Environments with 3D Gaussian Splatting: A Comparative Study	Shi Qiu et.al.	2501.09302	null
2025-01-15	CityLoc: 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation	Qi Ma et.al.	2501.08982	null
2025-01-15	GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping	Sheng Hong et.al.	2501.08672	null
2025-01-14	3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering	Meenakshi Krishnan et.al.	2501.08370	null
2025-01-14	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286	null
2025-01-14	Object-Centric 2D Gaussian Splatting: Background Removal and Occlusion-Aware Pruning for Compact Object Models	Marcel Rogge et.al.	2501.08174	null
2025-01-13	Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes	Yuhang Zhang et.al.	2501.08072	null
2025-01-13	UnCommon Objects in 3D	Xingchen Liu et.al.	2501.07574	link
2025-01-13	3DGS-to-PC: Convert a 3D Gaussian Splatting Scene into a Dense Point Cloud or Mesh	Lewis A G Stuart et.al.	2501.07478	link
2025-01-13	RMAvatar: Photorealistic Human Avatar Reconstruction from Monocular Video Based on Rectified Mesh-embedded Gaussians	Sen Peng et.al.	2501.07104	null
2025-01-14	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015	null
2025-01-12	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927	link
2025-01-12	Synthetic Prior for Few-Shot Drivable Head Avatar Inversion	Wojciech Zielonka et.al.	2501.06903	null
2025-01-12	ActiveGAMER: Active GAussian Mapping through Efficient Rendering	Liyan Chen et.al.	2501.06897	null
2025-01-12	Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution	Du Chen et.al.	2501.06838	link
2025-01-12	F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Consistent Gaussian Splatting	Yuxin Wang et.al.	2501.06714	null
2025-01-11	MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis	Hengyuan Zhang et.al.	2501.06660	null
2025-01-10	Locality-aware Gaussian Compression for Fast and High-quality Rendering	Seungjoo Shin et.al.	2501.05757	null
2025-01-09	Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation	Xuyi Meng et.al.	2501.05427	null
2025-01-09	Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance	Dimitrios Gerogiannis et.al.	2501.05379	null
2025-01-09	Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping	Wen Tianci et.al.	2501.05242	null
2025-01-08	GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting	Andrew Bond et.al.	2501.04782	null
2025-01-08	FatesGS: Fast and Accurate Sparse-View Surface Reconstruction using Gaussian Splatting with Depth-Feature Consistency	Han Huang et.al.	2501.04628	null
2025-01-07	ZDySS – Zero-Shot Dynamic Scene Stylization using Gaussian Splatting	Abhishek Saroha et.al.	2501.03875	null
2025-01-07	MoDec-GS: Global-to-Local Motion Decomposition and Temporal Interval Adjustment for Compact Dynamic 3D Gaussian Splatting	Sangwoon Kwak et.al.	2501.03714	null
2025-01-07	DehazeGS: Seeing Through Fog with 3D Gaussian Splatting	Jinze Yu et.al.	2501.03659	null
2025-01-07	ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting	Yifeng Yang et.al.	2501.03605	link
2025-01-06	Compression of 3D Gaussian Splatting with Optimized Feature Planes and Standard Video Codecs	Soonbin Lee et.al.	2501.03399	null
2025-01-06	Gaussian Masked Autoencoders	Jathushan Rajasegaran et.al.	2501.03229	null
2025-01-06	HOGSA: Bimanual Hand-Object Interaction Understanding with 3D Gaussian Splatting Based Data Augmentation	Wentian Qu et.al.	2501.02845	null
2025-01-05	GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking	Weikang Bian et.al.	2501.02690	null
2025-01-03	EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation	Siyuan Huang et.al.	2501.01895	null
2025-01-03	Cloth-Splatting: 3D Cloth State Estimation from RGB Supervision	Alberta Longhini et.al.	2501.01715	null
2025-01-03	CrossView-GS: Cross-view Gaussian Splatting For Large-scale Scene Reconstruction	Chenhao Zhang et.al.	2501.01695	null
2025-01-03	PG-SAG: Parallel Gaussian Splatting for Fine-Grained Large-Scale Urban Buildings Reconstruction via Semantic-Aware Grouping	Tengfei Wang et.al.	2501.01677	link
2025-01-02	Deformable Gaussian Splatting for Efficient and High-Fidelity Reconstruction of Surgical Scenes	Jiwei Shan et.al.	2501.01101	null
2025-01-02	EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy	Ao Gao et.al.	2501.01003	null
2024-12-31	Gaussian Building Mesh (GBM): Extract a Building’s 3D Mesh with Google Earth and Gaussian Splatting	Kyle Gao et.al.	2501.00625	null
2024-12-31	DreamDrive: Generative 4D Scene Modeling from Street View Images	Jiageng Mao et.al.	2501.00601	null
2024-12-31	PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM	Runnan Chen et.al.	2501.00352	null
2024-12-31	SG-Splatting: Accelerating 3D Gaussian Splatting with Spherical Gaussians	Yiwen Wang et.al.	2501.00342	null
2024-12-30	PERSE: Personalized 3D Generative Avatars from A Single Portrait	Hyunsoo Cha et.al.	2412.21206	null
2024-12-30	KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences	Keng-Wei Chang et.al.	2412.20767	null
2024-12-30	4D Gaussian Splatting: Modeling Dynamic Scenes with Native 4D Primitives	Zeyu Yang et.al.	2412.20720	null
2024-12-29	MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks	Yifei Liu et.al.	2412.20522	link
2024-12-28	DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis	Kaijun Deng et.al.	2412.20148	link
2024-12-28	GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting	Atticus J. Zeller et.al.	2412.20056	link
2024-12-27	DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction	Kai Xu et.al.	2412.19584	null
2024-12-27	Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images	Xudong Cai et.al.	2412.19518	null
2024-12-27	Learning Radiance Fields from a Single Snapshot Compressive Image	Yunhao Li et.al.	2412.19483	null
2024-12-26	BeSplat – Gaussian Splatting from a Single Blurry Image and Event Stream	Gopi Raju Matta et.al.	2412.19370	link
2024-12-26	Reflective Gaussian Splatting	Yuxuan Yao et.al.	2412.19282	null
2024-12-26	Generating Editable Head Avatars with 3D Gaussian GANs	Guohao Li et.al.	2412.19149	link
2024-12-26	CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting	Siyu Jiao et.al.	2412.19142	null
2024-12-26	MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo	Byeonggwon Lee et.al.	2412.19130	null
2024-12-25	WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian Splatting	Chenghao Qian et.al.	2412.18862	link
2024-12-25	GSAVS: Gaussian Splatting-based Autonomous Vehicle Simulator	Rami Wilson et.al.	2412.18816	null
2024-12-24	Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation	Anselm Krainovic et.al.	2412.18584	null
2024-12-24	RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis	Yiling Yao et.al.	2412.18380	null
2024-12-23	FaceLift: Single Image to 3D Head with View Generation and GS-LRM	Weijie Lyu et.al.	2412.17812	null
2024-12-23	ActiveGS: Active Scene Reconstruction using Gaussian Splatting	Liren Jin et.al.	2412.17769	link
2024-12-23	GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance	Jingqiu Zhou et.al.	2412.17715	null
2024-12-24	LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding	Hao Li et.al.	2412.17635	null
2024-12-23	CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction	Yuanyuan Gao et.al.	2412.17612	null
2024-12-23	Exploring Dynamic Novel View Synthesis Technologies for Cinematography	Adrian Azzarelli et.al.	2412.17532	null
2024-12-23	Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling	Hao Gui et.al.	2412.17378	null
2024-12-22	GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs	Xingrui Wang et.al.	2412.16932	link
2024-12-22	GeoTexDensifier: Geometry-Texture-Aware Densification for High-Quality Photorealistic 3D Gaussian Splatting	Hanqing Jiang et.al.	2412.16809	null
2024-12-21	Topology-Aware 3D Gaussian Splatting: Leveraging Persistent Homology for Optimized Structural Integrity	Tianqi Shen et.al.	2412.16619	link
2024-12-20	CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images	Jungho Lee et.al.	2412.16028	null
2024-12-20	IRGS: Inter-Reflective Gaussian Splatting with 2D Gaussian Ray Tracing	Chun Gu et.al.	2412.15867	null
2024-12-20	AvatarPerfect: User-Assisted 3D Gaussian Splatting Avatar Refinement with Automatic Pose Suggestion	Jotaro Sakamiya et.al.	2412.15609	null
2024-12-20	EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene	Yixiong Huo et.al.	2412.15550	link
2024-12-19	LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction	Pou-Chun Kung et.al.	2412.15447	null
2024-12-19	SolidGS: Consolidating Gaussian Surfel Splatting for Sparse-View Surface Reconstruction	Zhuowen Shen et.al.	2412.15400	null
2024-12-19	SqueezeMe: Efficient Gaussian Avatars for VR	Shunsuke Saito et.al.	2412.15171	null
2024-12-19	Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination	Leonardo Barcellona et.al.	2412.14957	null
2024-12-19	GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting	Qianpu Sun et.al.	2412.14579	null
2024-12-19	Improving Geometry in Sparse-View 3DGS via Reprojection-based DoF Separation	Yongsung Kim et.al.	2412.14568	null
2024-12-18	GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians	Xiaobao Wei et.al.	2412.13983	link
2024-12-18	GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting	Yuning Peng et.al.	2412.13654	null
2024-12-18	4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching	Fernando Amodeo et.al.	2412.13639	link
2024-12-18	Turbo-GS: Accelerating 3D Gaussian Fitting for High-Quality Radiance Fields	Tao Lu et.al.	2412.13547	null
2024-12-18	Vivar: A Generative AR System for Intuitive Multi-Modal Sensor Data Presentation	Yunqi Guo et.al.	2412.13509	null
2024-12-17	Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures	Guoxing Sun et.al.	2412.13183	null
2024-12-17	EOGS: Gaussian Splatting for Earth Observation	Luca Savant Aira et.al.	2412.13047	null
2024-12-17	4DRGS: 4D Radiative Gaussian Splatting for Efficient 3D Vessel Reconstruction from Sparse-View Dynamic DSA Images	Zhentao Liu et.al.	2412.12919	link
2024-12-17	CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image	Wonseok Roh et.al.	2412.12906	null
2024-12-17	HyperGS: Hyperspectral 3D Gaussian Splatting	Christopher Thirgood et.al.	2412.12849	null
2024-12-17	Gaussian Billboards: Expressive 2D Gaussian Splatting with Textures	Sebastian Weiss et.al.	2412.12734	null
2024-12-17	3DGUT: Enabling Distorted Cameras and Secondary Rays in Gaussian Splatting	Qi Wu et.al.	2412.12507	link
2024-12-16	PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting	Cheng Zhang et.al.	2412.12096	link
2024-12-16	Wonderland: Navigating 3D Scenes from a Single Image	Hanwen Liang et.al.	2412.12091	null
2024-12-16	GS-ProCams: Gaussian Splatting-based Projector-Camera Systems	Qingyue Deng et.al.	2412.11762	null
2024-12-16	Deformable Radial Kernel Splatting	Yi-Hua Huang et.al.	2412.11752	null
2024-12-16	SweepEvGS: Event-Based 3D Gaussian Splatting for Macro and Micro Radiance Field Rendering from a Single Sweep	Jingqian Wu et.al.	2412.11579	null
2024-12-16	EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting	Dong In Lee et.al.	2412.11520	null
2024-12-14	DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting	Luis Wiedmann et.al.	2412.10972	link
2024-12-13	SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians	Siyun Liang et.al.	2412.10231	null
2024-12-13	GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion	Jiapeng Tang et.al.	2412.10209	null
2024-12-13	TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views	Liang Zhao et.al.	2412.10051	link
2024-12-13	SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video	Jongmin Park et.al.	2412.09982	null
2024-12-13	RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting	Lizhi Bai et.al.	2412.09868	null
2024-12-12	MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction	Xiaohao Xu et.al.	2412.09723	link
2024-12-12	PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields	Sean Wu et.al.	2412.09680	link
2024-12-12	Feat2GS: Probing Visual Foundation Models with Gaussian Splatting	Yue Chen et.al.	2412.09606	null
2024-12-12	LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors	Yabo Chen et.al.	2412.09597	null
2024-12-12	FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction	Jiale Xu et.al.	2412.09573	null
2024-12-12	GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency	Dongyue Lu et.al.	2412.09511	link
2024-12-12	LIVE-GS: LLM Powers Interactive VR by Enhancing Gaussian Splatting	Haotian Mao et.al.	2412.09176	null
2024-12-11	SLGaussian: Fast Language Gaussian Splatting in Sparse Views	Kangjie Chen et.al.	2412.08331	null
2024-12-11	ProGDF: Progressive Gaussian Differential Field for Controllable and Flexible 3D Editing	Yian Zhao et.al.	2412.08152	null
2024-12-10	Diffusion-Based Attention Warping for Consistent 3D Scene Editing	Eyal Gomel et.al.	2412.07984	null
2024-12-10	GASP: Gaussian Avatars with Synthetic Priors	Jack Saunders et.al.	2412.07739	null
2024-12-10	Proc-GS: Procedural Building Generation for City Assembly with 3D Gaussians	Yixuan Li et.al.	2412.07660	null
2024-12-10	Faster and Better 3D Splatting via Group Training	Chengbo Wang et.al.	2412.07608	null
2024-12-10	ResGS: Residual Densification of 3D Gaussian for Efficient Detail Recovery	Yanzhe Lyu et.al.	2412.07494	null
2024-12-10	EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering	Toshiya Yura et.al.	2412.07293	null
2024-12-09	MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds	Zhenggang Tang et.al.	2412.06974	null
2024-12-09	Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video	Renlong Wu et.al.	2412.06424	link
2024-12-09	4D Gaussian Splatting with Scale-aware Residual Field and Adaptive Optimization for Real-time Rendering of Temporally Complex Dynamic Scenes	Jinbo Yan et.al.	2412.06299	null
2024-12-09	Advancing Extended Reality with 3D Gaussian Splatting: Innovations and Prospects	Shi Qiu et.al.	2412.06257	null
2024-12-09	Splatter-360: Generalizable 360 $^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images	Zheng Chen et.al.	2412.06250	link
2024-12-09	Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction	Seungtae Nam et.al.	2412.06234	null
2024-12-08	Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation	Zipeng Qi et.al.	2412.05969	null
2024-12-08	GBR: Generative Bundle Refinement for High-fidelity Gaussian Splatting and Meshing	Jianing Zhang et.al.	2412.05908	null
2024-12-07	Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes	Saqib Javed et.al.	2412.05700	null
2024-12-07	WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking	Yuqi Tan et.al.	2412.05695	null
2024-12-07	Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View Synthesis	Diwen Wan et.al.	2412.05570	null
2024-12-06	Extrapolated Urban View Synthesis Benchmark	Xiangyu Han et.al.	2412.05256	link
2024-12-06	MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting	Peng Chen et.al.	2412.04955	link
2024-12-06	Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction	Jixuan Fan et.al.	2412.04887	link
2024-12-06	WRF-GS: Wireless Radiation Field Reconstruction with 3D Gaussian Splatting	Chaozheng Wen et.al.	2412.04832	link
2024-12-06	Pushing Rendering Boundaries: Hard Gaussian Splatting	Qingshan Xu et.al.	2412.04826	null
2024-12-05	Turbo3D: Ultra-fast Text-to-3D Generation	Hanzhe Hu et.al.	2412.04470	null
2024-12-05	QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos	Sharath Girish et.al.	2412.04469	null
2024-12-05	Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering	Cheng Sun et.al.	2412.04459	link
2024-12-05	Monocular Dynamic Gaussian Splatting is Fast and Brittle but Smooth Motion Helps	Yiqing Liang et.al.	2412.04457	null
2024-12-05	PBDyG: Position Based Dynamic Gaussians for Motion-Aware Clothed Human Avatars	Shota Sasaki et.al.	2412.04433	null
2024-12-05	Multi-View Pose-Agnostic Change Localization with Zero Labels	Chamuditha Jayanga Galappaththige et.al.	2412.03911	link
2024-12-05	DGNS: Deformable Gaussian Splatting and Dynamic Neural Surface for Monocular Dynamic 3D Reconstruction	Xuesong Li et.al.	2412.03910	link
2024-12-05	HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting	Jingyu Lin et.al.	2412.03844	link
2024-12-04	Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos	Hanxue Liang et.al.	2412.03526	null
2024-12-04	Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter	Hermes McGriff et.al.	2412.03518	null
2024-12-04	Urban4D: Semantic-Guided 4D Gaussian Splatting for Urban Scene Reconstruction	Ziwen Li et.al.	2412.03473	null
2024-12-04	2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction	Wanting Zhang et.al.	2412.03428	null
2024-12-04	Volumetrically Consistent 3D Gaussian Rasterization	Chinmay Talegaonkar et.al.	2412.03378	link
2024-12-04	SGSST: Scaling Gaussian Splatting StyleTransfer	Bruno Galerne et.al.	2412.03371	link
2024-12-04	NeRF and Gaussian Splatting SLAM in the Wild	Fabian Schmidt et.al.	2412.03263	link
2024-12-04	Splats in Splats: Embedding Invisible 3D Watermark within Gaussian Splatting	Yijia Guo et.al.	2412.03121	null
2024-12-04	RoDyGS: Robust Dynamic Gaussian Splatting for Casual Videos	Yoonwoo Jeong et.al.	2412.03077	null
2024-12-03	Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3D Objects	Abdurrahman Zeybey et.al.	2412.02803	null
2024-12-03	AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction	Lingteng Qiu et.al.	2412.02684	null
2024-12-03	RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians	Qiankun Gao et.al.	2412.02493	link
2024-12-03	TimeWalker: Personalized Neural Space for Lifelong Head Avatars	Dongwei Pan et.al.	2412.02421	null
2024-12-03	GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos	Zhiyuan Chen et.al.	2412.02267	null
2024-12-03	Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance	Jing Zeng et.al.	2412.02249	null
2024-12-03	SparseLGS: Sparse View Language Embedded Gaussian Splatting	Jun Hu et.al.	2412.02245	null
2024-12-03	How to Use Diffusion Priors under Sparse Views?	Qisen Wang et.al.	2412.02225	link
2024-12-03	SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from Sparse Multi-View RGB Images	Junqiu Yu et.al.	2412.02140	null
2024-12-03	Gaussian Object Carver: Object-Compositional Gaussian Splatting with surfaces completion	Liu Liu et.al.	2412.02075	link
2024-12-02	Planar Gaussian Splatting	Farhad G. Zanjani et.al.	2412.01931	null
2024-12-02	GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting	Zixuan Chen et.al.	2411.19895	link
2024-11-29	DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering	Yihao Wang et.al.	2411.19756	null
2024-11-29	TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting	Bojun Xiong et.al.	2411.19654	link
2024-11-29	Tortho-Gaussian: Splatting True Digital Orthophoto Maps	Xin Wang et.al.	2411.19594	null
2024-11-29	Gaussian Splashing: Direct Volumetric Rendering Underwater	Nir Mualem et.al.	2411.19588	null
2024-11-29	Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding	Wenbo Zhang et.al.	2411.19551	link
2024-12-02	GausSurf: Geometry-Guided 3D Gaussian Splatting for Surface Reconstruction	Jiepeng Wang et.al.	2411.19454	null
2024-11-29	RF-3DGS: Wireless Channel Modeling with Radio Radiance Field and 3D Gaussian Splatting	Lihao Zhang et.al.	2411.19420	link
2024-11-28	SADG: Segment Any Dynamic Gaussian Without Object Trackers	Yun-Jin Li et.al.	2411.19290	link
2024-11-28	AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones	Xuqian Ren et.al.	2411.19271	null
2024-11-27	Textured Gaussians for Enhanced 3D Scene Appearance Modeling	Brian Chao et.al.	2411.18625	null
2024-11-27	PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image	Han Yan et.al.	2411.18548	null
2024-11-27	HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression	Lei Liu et.al.	2411.18473	null
2024-11-27	Neural Surface Priors for Editable Gaussian Splatting	Jakub Szymkowiak et.al.	2411.18311	link
2024-11-27	Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters	Zhiyang Guo et.al.	2411.18197	null
2024-11-27	SmileSplat: Generalizable Gaussian Splats for Unconstrained Sparse Images	Yanyan Li et.al.	2411.18072	null
2024-11-27	GLS: Geometry-aware 3D Language Gaussian Splatting	Jiaxiong Qiu et.al.	2411.18066	link
2024-11-27	HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction	Wei Zhang et.al.	2411.17982	link
2024-11-26	DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting	Christian Homeyer et.al.	2411.17660	link
2024-11-26	Distractor-free Generalizable 3D Gaussian Splatting	Yanqi Bao et.al.	2411.17605	link
2024-11-26	SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting	Gyeongjin Kang et.al.	2411.17190	null
2024-11-26	4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction	Woong Oh Cho et.al.	2411.17044	null
2024-11-25	G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs	Kunyi Li et.al.	2411.16898	null
2024-11-25	PreF3R: Pose-Free Feed-Forward 3D Gaussian Splatting from Variable-length Image Sequence	Zequn Chen et.al.	2411.16877	null
2024-11-25	SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving	Georg Hess et.al.	2411.16816	link
2024-11-25	SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis	Hyojun Go et.al.	2411.16443	link
2024-11-25	Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction	Ziyu Zhang et.al.	2411.16392	null
2024-11-25	Event-boosted Deformable 3D Gaussians for Fast Dynamic Scene Reconstruction	Wenhao Xu et.al.	2411.16180	null
2024-11-25	UnitedVLN: Generalizable Gaussian Splatting for Continuous Vision-Language Navigation	Guangzhao Dai et.al.	2411.16053	null
2024-11-24	PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments	Haoang Li et.al.	2411.15800	null
2024-11-24	ZeroGS: Training 3D Gaussian Splatting from Unposed Images	Yu Chen et.al.	2411.15779	null
2024-11-24	DynamicAvatars: Accurate Dynamic Facial Avatars Reconstruction and Precise Editing with Diffusion Models	Yangyang Qian et.al.	2411.15732	null
2024-11-24	GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision	Xu Baixin et.al.	2411.15723	link
2024-11-23	EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting	Xiaobao Wei et.al.	2411.15582	null
2024-11-23	SplatFlow: Self-Supervised Dynamic Gaussian Splatting in Neural Motion Flow Field for Autonomous Driving	Su Sun et.al.	2411.15482	null
2024-11-22	Neural 4D Evolution under Large Topological Changes from 2D Images	AmirHossein Naghi Razlighi et.al.	2411.15018	null
2024-11-22	3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes	Jan Held et.al.	2411.14974	link
2024-11-22	Dynamics-Aware Gaussian Splatting Streaming Towards Fast On-the-Fly Training for 4D Reconstruction	Zhening Liu et.al.	2411.14847	null
2024-11-22	VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving	Haiming Zhang et.al.	2411.14716	null
2024-11-21	NexusSplats: Efficient 3D Gaussian Splatting in the Wild	Yuzhou Tang et.al.	2411.14514	null
2024-11-21	Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation	Zhuoman Liu et.al.	2411.14423	null
2024-11-21	Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation	Yuanhao Cai et.al.	2411.14384	null
2024-11-21	SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching	Arjun P S et.al.	2411.14322	link
2024-11-20	FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting	Ola Shorinwa et.al.	2411.13753	null
2024-11-20	Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization	Hao Ju et.al.	2411.13610	null
2024-11-20	Generating 3D-Consistent Videos from Unposed Internet Photos	Gene Chou et.al.	2411.13549	null
2024-11-20	GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting	Xiaobao Wei et.al.	2411.12981	null
2024-11-19	Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting	Haoyu Zhao et.al.	2411.12789	null
2024-11-19	Mini-Splatting2: Building 360 Scenes within Minutes via Aggressive Gaussian Densification	Guangchi Fang et.al.	2411.12788	null
2024-11-19	PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy	Joanna Kaleta et.al.	2411.12510	link
2024-11-19	SCIGS: 3D Gaussians Splatting from a Snapshot Compressive Image	Zixu Wang et.al.	2411.12471	null
2024-11-20	Beyond Gaussians: Fast and High-Fidelity 3D Splatting with Linear Kernels	Haodong Chen et.al.	2411.12440	null
2024-11-19	LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2411.12185	null
2024-11-19	Sketch-guided Cage-based 3D Gaussian Splatting Deformation	Tianhao Xie et.al.	2411.12168	null
2024-11-18	FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting	Fangyu Wu et.al.	2411.12089	null
2024-11-18	TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction	DaDong Jiang et.al.	2411.11941	null
2024-11-18	DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes	Chensheng Peng et.al.	2411.11921	link
2024-11-18	RoboGSim: A Real2Sim2Real Robotic Gaussian Splatting Simulator	Xinhai Li et.al.	2411.11839	null
2024-11-18	GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views	Boyao Zhou et.al.	2411.11363	null
2024-11-17	VeGaS: Video Gaussian Splatting	Weronika Smolak-Dyżewska et.al.	2411.11024	link
2024-11-17	Direct and Explicit 3D Generation from a Single Image	Haoyu Wu et.al.	2411.10947	null
2024-11-16	DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment	Mangyu Kong et.al.	2411.10722	link
2024-11-15	The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods	Yifu Tao et.al.	2411.10546	null
2024-11-15	USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting	Kang Chen et.al.	2411.10504	link
2024-11-15	Efficient Density Control for 3D Gaussian Splatting	Xiaobin Deng et.al.	2411.10133	link
2024-11-15	GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization	Yanhao Sun et.al.	2411.10033	null
2024-11-15	GGAvatar: Reconstructing Garment-Separated 3D Gaussian Splatting Avatars from Monocular Video	Jingxuan Chen et.al.	2411.09952	link
2024-11-14	Adversarial Attacks Using Differentiable Rendering: A Survey	Matthew Hull et.al.	2411.09749	null
2024-11-14	DyGASR: Dynamic Generalized Exponential Splatting with Surface Alignment for Accelerated 3D Mesh Reconstruction	Shengchao Zhao et.al.	2411.09156	null
2024-11-13	4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization	Mijeong Kim et.al.	2411.08879	null
2024-11-13	Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models	Chengdong Dong et.al.	2411.08642	null
2024-11-13	BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis	David Svitov et.al.	2411.08508	link
2024-11-13	Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model	Yutao Shen et.al.	2411.08453	null
2024-11-13	DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization	Yueming Xu et.al.	2411.08373	null
2024-11-13	MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation	Peng Wang et.al.	2411.08279	link
2024-11-14	Projecting Gaussian Ellipsoids While Avoiding Affine Projection Approximation	Han Qi et.al.	2411.07579	null
2024-11-12	GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting	Umangi Jain et.al.	2411.07555	null
2024-11-12	HiCoM: Hierarchical Coherent Motion for Streamable Dynamic Scene with 3D Gaussian Splatting	Qiankun Gao et.al.	2411.07541	link
2024-11-12	GUS-IR: Gaussian Splatting with Unified Shading for Inverse Rendering	Zhihao Liang et.al.	2411.07478	null
2024-11-11	A Hierarchical Compression Technique for 3D Gaussian Splatting Compression	He Huang et.al.	2411.06976	null
2024-11-10	Adaptive and Temporally Consistent Gaussian Surfels for Multi-view Dynamic Reconstruction	Decai Chen et.al.	2411.06602	null
2024-11-12	SplatFormer: Point Transformer for Robust 3D Gaussian Splatting	Yutong Chen et.al.	2411.06390	link
2024-11-10	Through the Curved Cover: Synthesizing Cover Aberrated Scenes with Refractive Field	Liuyue Xie et.al.	2411.06365	null
2024-11-09	AI-Driven Stylization of 3D Environments	Yuanbo Chen et.al.	2411.06067	null
2024-11-09	GaussianSpa: An “Optimizing-Sparsifying” Simplification Framework for Compact and High-Quality 3D Gaussian Splatting	Yangming Zhang et.al.	2411.06019	null
2024-11-07	ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing	Jun-Kun Chen et.al.	2411.05006	null
2024-11-07	MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views	Yuedong Chen et.al.	2411.04924	link
2024-11-08	GS2Pose: Two-stage 6D Object Pose Estimation Guided by Gaussian Splatting	Jilan Mei et.al.	2411.03807	null
2024-11-06	3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object Rearrangement	Ziqi Lu et.al.	2411.03706	link
2024-11-06	Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis	Rui Peng et.al.	2411.03637	link
2024-11-05	Object and Contact Point Tracking in Demonstrations Using 3D Gaussian Splatting	Michael Büttner et.al.	2411.03555	null
2024-11-05	HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features	Arnab Dey et.al.	2411.03086	null
2024-11-05	LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting	Huibin Zhao et.al.	2411.02703	null
2024-11-04	Modeling Uncertainty in 3D Gaussian Splatting through Continuous Semantic Splatting	Joey Wilson et.al.	2411.02547	null
2024-11-06	SplatOverflow: Asynchronous Hardware Troubleshooting	Amritansh Kwatra et.al.	2411.02332	null
2024-11-05	FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training	Ruihong Yin et.al.	2411.02229	null
2024-11-06	GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes	Gaochao Song et.al.	2411.01853	null
2024-11-02	Real-Time Spatio-Temporal Reconstruction of Dynamic Endoscopic Scenes with 4D Gaussian Splatting	Fengze Li et.al.	2411.01218	null
2024-11-01	CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes	Yang Liu et.al.	2411.00771	null
2024-11-01	PCoTTA: Continual Test-Time Adaptation for Multi-Task Point Cloud Understanding	Jincen Jiang et.al.	2411.00632	null
2024-10-31	Aquatic-GS: A Hybrid 3D Representation for Underwater Scenes	Shaohua Liu et.al.	2411.00239	null
2024-10-31	Self-Ensembling Gaussian Splatting for Few-shot Novel View Synthesis	Chen Zhao et.al.	2411.00144	link
2024-10-31	No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images	Botao Ye et.al.	2410.24207	link
2024-11-01	GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering	Kai Ye et.al.	2410.24204	null
2024-10-31	GaussianMarker: Uncertainty-Aware Copyright Protection of 3D Gaussian Splatting	Xiufeng Huang et.al.	2410.23718	null
2024-10-31	GS-Blur: A 3D Scene-Based Dataset for Realistic Image Deblurring	Dongwoo Lee et.al.	2410.23658	link
2024-10-30	ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting	Muhammad Salman Ali et.al.	2410.23213	null
2024-10-31	Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis	Zhiyuan Min et.al.	2410.22817	null
2024-10-30	Geometry Cloak: Preventing TGS-based 3D Reconstruction from Copyrighted Images	Qi Song et.al.	2410.22705	null
2024-10-29	PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting	Sunghwan Hong et.al.	2410.22128	link
2024-10-29	FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives	Qizhi Chen et.al.	2410.22070	null
2024-10-29	ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting	Yuetao Li et.al.	2410.21955	link
2024-10-28	MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps	Yating Xu et.al.	2410.21566	link
2024-10-28	Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting	Jiawei Xu et.al.	2410.20815	null
2024-10-28	LoDAvatar: Hierarchical Embedding and Adaptive Levels of Detail with Gaussian Splatting for Enhanced Human Avatars	Xiaonuo Dongye et.al.	2410.20789	null
2024-10-28	CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians	Chongjian Ge et.al.	2410.20723	null
2024-10-28	ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings	Suyoung Lee et.al.	2410.20686	link
2024-10-27	Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering	Meng Wei et.al.	2410.20593	null
2024-10-26	Neural Fields in Robotics: A Survey	Muhammad Zubair Irshad et.al.	2410.20220	link
2024-10-25	DiffGS: Functional Gaussian Splatting Diffusion	Junsheng Zhou et.al.	2410.19657	null
2024-10-25	Robotic Learning in your Backyard: A Neural Simulator from Open Source Components	Liyou Zhou et.al.	2410.19564	link
2024-10-25	Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization	Weihang Liu et.al.	2410.19483	link
2024-10-24	3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation	Hansheng Chen et.al.	2410.18974	link
2024-10-24	Sort-free Gaussian Splatting via Weighted Sum Rendering	Qiqi Hou et.al.	2410.18931	null
2024-10-24	Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling	Mingtong Zhang et.al.	2410.18912	null
2024-10-27	Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis	Liang Han et.al.	2410.18822	null
2024-10-23	VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points	Linus Franke et.al.	2410.17932	null
2024-10-23	PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting	Yu Wang et.al.	2410.17505	null
2024-10-22	AG-SLAM: Active Gaussian Splatting SLAM	Wen Jiang et.al.	2410.17422	null
2024-10-22	SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes	Cheng-De Fan et.al.	2410.17249	null
2024-10-18	GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting	Yusen Xie et.al.	2410.17084	null
2024-10-22	E-3DGS: Gaussian Splatting with Exposure and Motion Events	Xiaoting Yin et.al.	2410.16995	link
2024-10-22	Multi-Layer Gaussian Splatting for Immersive Anatomy Visualization	Constantin Kleinbeck et.al.	2410.16978	link
2024-10-21	3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors	Xi Liu et.al.	2410.16266	null
2024-10-21	MSGField: A Unified Scene Representation Integrating Motion, Semantics, and Geometry for Robotic Manipulation	Yu Sheng et.al.	2410.15730	null
2024-10-22	Fully Explicit Dynamic Gaussian Splatting	Junoh Lee et.al.	2410.15629	null
2024-10-22	EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting	Bohao Liao et.al.	2410.15392	null
2024-10-18	LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes	Juliette Marrie et.al.	2410.14462	null
2024-10-18	Neural Signed Distance Function Inference through Splatting 3D Gaussians Pulled on Zero-Level Set	Wenyuan Zhang et.al.	2410.14189	null
2024-10-18	DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction	Ange Lou et.al.	2410.14169	null
2024-10-17	DepthSplat: Connecting Gaussian Splatting and Depth	Haofei Xu et.al.	2410.13862	link
2024-10-17	Differentiable Robot Rendering	Ruoshi Liu et.al.	2410.13851	null
2024-10-17	MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes	Xinjie Zhang et.al.	2410.13613	null
2024-10-17	DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering	Jiahao Lu et.al.	2410.13607	link
2024-10-17	GlossyGS: Inverse Rendering of Glossy Objects with 3D Gaussian Splatting	Shuichang Lai et.al.	2410.13349	null
2024-10-16	Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats	Chen Ziwen et.al.	2410.12781	null
2024-10-16	3D Gaussian Splatting in Robotics: A Survey	Siting Zhu et.al.	2410.12262	link
2024-10-15	SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection	Yizhe Liu et.al.	2410.12080	link
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505	null
2024-10-15	GS^3: Efficient Relighting with Triple Gaussian Splatting	Zoubin Bi et.al.	2410.11419	link
2024-10-15	MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields	Yuru Xiao et.al.	2410.11394	null
2024-10-15	GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information	Wancai Zheng et.al.	2410.11356	null
2024-10-15	Scalable Indoor Novel-View Synthesis using Drone-Captured 360 Imagery with 3D Gaussian Splatting	Yuanbo Chen et.al.	2410.11285	null
2024-10-14	Few-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting	Raja Kumar et.al.	2410.11080	link
2024-10-15	4-LEGS: 4D Language Embedded Gaussian Splatting	Gal Fiebelman et.al.	2410.10719	null
2024-10-14	4DStyleGaussian: Zero-shot 4D Style Transfer with Gaussian Splatting	Wanlin Liang et.al.	2410.10412	null
2024-10-13	Gaussian Splatting Visual MPC for Granular Media Manipulation	Wei-Cheng Tseng et.al.	2410.09740	null
2024-10-12	Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors	Hritam Basak et.al.	2410.09467	null
2024-10-11	SurgicalGS: Dynamic 3D Gaussian Splatting for Accurate Robotic-Assisted Surgical Scene Reconstruction	Jialei Chen et.al.	2410.09292	null
2024-10-11	MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering	Jaehoon Choi et.al.	2410.08941	null
2024-10-11	Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars	Xuan Huang et.al.	2410.08840	link
2024-10-11	Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose Initialization	Christian Schmidt et.al.	2410.08743	link
2024-10-10	FusionSense: Bridging Common Sense, Vision, and Touch for Robust Sparse-View Reconstruction	Irving Fang et.al.	2410.08282	null
2024-10-10	Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics	Junyi Cao et.al.	2410.08257	null
2024-10-10	Poison-splat: Computation Cost Attack on 3D Gaussian Splatting	Jiahao Lu et.al.	2410.08190	link
2024-10-10	DifFRelight: Diffusion-Based Facial Performance Relighting	Mingming He et.al.	2410.08188	null
2024-10-10	Efficient Perspective-Correct 3D Gaussian Splatting Using Hybrid Transparency	Florian Hahlbohm et.al.	2410.08129	null
2024-10-10	IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera	Jian Huang et.al.	2410.08107	link
2024-10-11	Fast Feedforward 3D Gaussian Splatting Compression	Yihang Chen et.al.	2410.08017	link
2024-10-10	L-VITeX: Light-weight Visual Intuition for Terrain Exploration	Antar Mazumder et.al.	2410.07872	null
2024-10-10	MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting	Ruijie Zhu et.al.	2410.07707	link
2024-10-10	3D Vision-Language Gaussian Splatting	Qucheng Peng et.al.	2410.07577	null
2024-10-09	DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation	Zhiqi Li et.al.	2410.06756	null
2024-10-09	ES-Gaussian: Gaussian Splatting Mapping via Error Space-Based Gaussian Completion	Lu Chen et.al.	2410.06613	null
2024-10-09	3D Representation Methods: A Survey	Zhengren Wang et.al.	2410.06475	null
2024-10-08	HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction	Shengji Tang et.al.	2410.06245	null
2024-10-10	RelitLRM: Generative Relightable Radiance for Large Reconstruction Models	Tianyuan Zhang et.al.	2410.06231	null
2024-10-08	GSLoc: Visual Localization with 3D Gaussian Splatting	Kazii Botashev et.al.	2410.06165	null
2024-10-08	SplaTraj: Camera Trajectory Generation with Semantic Gaussian Splatting	Xinyi Liu et.al.	2410.06014	null
2024-10-08	Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters	Guoji Tian et.al.	2410.05772	null
2024-10-07	PH-Dropout: Prctical Epistemic Uncertainty Quantification for View Synthesis	Chuanhao Sun et.al.	2410.05468	link
2024-10-07	GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting	Yukang Cao et.al.	2410.05259	null
2024-10-07	LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting	Qifeng Chen et.al.	2410.05111	null
2024-10-07	DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects	Nidhi Mathihalli et.al.	2410.05097	link
2024-10-07	PhotoReg: Photometrically Registering 3D Gaussian Splatting Models	Ziwen Yuan et.al.	2410.05044	null
2024-10-07	6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering	Zhongpai Gao et.al.	2410.04974	null
2024-10-07	Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting	Matthew Strong et.al.	2410.04680	link
2024-10-06	Mode-GS: Monocular Depth Guided Anchored 3D Gaussian Splatting for Robust Ground-View Scene Rendering	Yonghan Lee et.al.	2410.04646	null
2024-10-06	StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting	Xiao Cui et.al.	2410.04354	null
2024-10-04	Variational Bayes Gaussian Splatting	Toon Van de Maele et.al.	2410.03592	link
2024-10-03	Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats	Mingyang Xie et.al.	2410.02764	null
2024-10-03	GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering	Hongze Chen et.al.	2410.02619	null
2024-10-03	SuperGS: Super-Resolution 3D Gaussian Splatting via Latent Feature Field and Gradient-guided Splitting	Shiyun Xie et.al.	2410.02571	link
2024-10-02	MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis	Xiaobiao Du et.al.	2410.02103	link
2024-10-03	EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis	Alexander Mai et.al.	2410.01804	null
2024-10-02	3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection	Yang Cao et.al.	2410.01647	link
2024-10-02	Gaussian Splatting in Mirrors: Reflection-Aware Rendering via Virtual Camera Optimization	Zihan Wang et.al.	2410.01614	link
2024-10-02	GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians	Shuyi Jiang et.al.	2410.01535	null
2024-10-02	MiraGe: Editable 2D Images using Gaussian Splatting	Joanna Waczyńska et.al.	2410.01521	link
2024-10-02	UW-GS: Distractor-Aware 3D Gaussian Splatting for Enhanced Underwater Scene Reconstruction	Haoran Wang et.al.	2410.01517	link
2024-10-02	EVA-Gaussian: 3D Gaussian-based Real-time Human Novel View Synthesis under Diverse Camera Settings	Yingdong Hu et.al.	2410.01425	null
2024-10-02	Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection	Hongru Yan et.al.	2410.01404	null
2024-10-02	CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM	Dapeng Feng et.al.	2410.00486	link
2024-10-01	Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance	Hongchao Shu et.al.	2410.00386	null
2024-09-30	RL-GSBridge: 3D Gaussian Splatting Based Real2Sim2Real Method for Robotic Manipulation Learning	Yuxuan Wu et.al.	2409.20291	null
2024-09-30	Robust Gaussian Splatting SLAM by Leveraging Loop Closure	Zunjie Zhu et.al.	2409.20111	null
2024-10-01	RNG: Relightable Neural Gaussians	Jiahui Fan et.al.	2409.19702	null
2024-09-28	GS-EVT: Cross-Modal Event Camera Tracking based on Gaussian Splatting	Tao Liu et.al.	2409.19228	null
2024-09-28	1st Place Solution to the 8th HANDS Workshop Challenge – ARCTIC Track: 3DGS-based Bimanual Category-agnostic Interaction Reconstruction	Jeongwan On et.al.	2409.19215	null
2024-09-27	Gaussian Heritage: 3D Digitization of Cultural Heritage with Integrated Object Segmentation	Mahtab Dahaghin et.al.	2409.19039	null
2024-09-27	Space-time 2D Gaussian Splatting for Accurate Surface Reconstruction under Complex Dynamic Scenes	Shuo Wang et.al.	2409.18852	link
2024-09-26	RT-GuIDE: Real-Time Gaussian splatting for Information-Driven Exploration	Yuezhan Tao et.al.	2409.18122	null
2024-09-26	Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot	Justin Yu et.al.	2409.18108	null
2024-09-26	WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians	Dmytro Kotovenko et.al.	2409.17917	null
2024-09-26	HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian Splatting	Zijun Xu et.al.	2409.17624	null
2024-09-25	SeaSplat: Representing Underwater Scenes with 3D Gaussian Splatting and a Physically Grounded Image Formation Model	Daniel Yang et.al.	2409.17345	null
2024-09-25	Disco4D: Disentangled 4D Human Generation and Animation from a Single Image	Hui En Pang et.al.	2409.17280	null
2024-09-25	Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM	Phu Pham et.al.	2409.16944	null
2024-09-25	Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model	Hongliang Zhong et.al.	2409.16938	link
2024-09-25	Let’s Make a Splan: Risk-Aware Trajectory Optimization in a Normalized Gaussian Splat	Jonathan Michaux et.al.	2409.16915	null
2024-09-24	GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization	Gennady Sidorov et.al.	2409.16502	link
2024-09-24	Frequency-based View Selection in Gaussian Splatting Reconstruction	Monica M. Q. Li et.al.	2409.16470	null
2024-09-23	Gaussian Déjà-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities	Peizhi Yan et.al.	2409.16147	link
2024-09-24	Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction and Rendering in Virtual Reality	Hannah Schieber et.al.	2409.15959	null
2024-09-24	Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB	Jae Yong Lee et.al.	2409.15689	null
2024-09-23	Human Hair Reconstruction with Strand-Aligned 3D Gaussians	Egor Zakharov et.al.	2409.14778	null
2024-09-22	MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views	Wangze Xu et.al.	2409.14316	null
2024-09-21	SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality	Hongjia Zhai et.al.	2409.14067	null
2024-09-20	Elite-EvGS: Learning Event-based 3D Gaussian Splatting by Distilling Event-to-Video Priors	Zixin Zhang et.al.	2409.13392	null
2024-09-20	3D-GSW: 3D Gaussian Splatting Watermark for Protecting Copyrights in Radiance Fields	Youngdong Jang et.al.	2409.13222	null
2024-09-19	MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting	Yan Song Hu et.al.	2409.13055	null
2024-09-19	GStex: Per-Primitive Texturing of 2D Gaussian Splatting for Decoupled Appearance and Geometry Modeling	Victor Rong et.al.	2409.12954	link
2024-09-18	Vista3D: Unravel the 3D Darkside of a Single Image	Qiuhong Shen et.al.	2409.12193	link
2024-09-18	SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation	Mingze Sun et.al.	2409.11682	link
2024-09-18	Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks	Joji Joseph et.al.	2409.11681	link
2024-09-17	RenderWorld: World Model with Self-Supervised 3D Label	Ziyang Yan et.al.	2409.11356	null
2024-09-17	GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module	Yichen Zhang et.al.	2409.11307	null
2024-09-17	SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction	Marko Mihajlovic et.al.	2409.11211	null
2024-09-17	GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure	Ziheng Xu et.al.	2409.10982	null
2024-09-16	Phys3DGS: Physically-based 3D Gaussian Splatting for Inverse Rendering	Euntae Choi et.al.	2409.10335	null
2024-09-16	BEINGS: Bayesian Embodied Image-goal Navigation with Gaussian Splatting	Wugang Meng et.al.	2409.10216	link
2024-09-16	SplatSim: Zero-Shot Sim2Real Transfer of RGB Manipulation Policies Using Gaussian Splatting	Mohammad Nomaan Qureshi et.al.	2409.10161	null
2024-09-16	Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression	Yi-Hsin Li et.al.	2409.10101	null
2024-09-16	DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments	Mahmud A. Mohamad et.al.	2409.10041	link
2024-09-15	SAFER-Splat: A Control Barrier Function for Safe Navigation with Online Gaussian Splatting Maps	Timothy Chen et.al.	2409.09868	null
2024-09-15	MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation	Shuzhao Xie et.al.	2409.09756	null
2024-09-14	GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians	Dasong Gao et.al.	2409.09295	link
2024-09-13	A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis	Yohan Poirier-Ginter et.al.	2409.08947	null
2024-09-13	AdR-Gaussian: Accelerating Gaussian Splatting with Adaptive Radius	Xinzhe Wang et.al.	2409.08669	null
2024-09-13	Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints	Shan Chen et.al.	2409.08613	null
2024-09-13	CSS: Overcoming Pose and Scene Challenges in Crowd-Sourced 3D Gaussian Splatting	Runze Chen et.al.	2409.08562	null
2024-09-12	Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos	Yuheng Jiang et.al.	2409.08353	null
2024-09-12	FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally	Qiuhong Shen et.al.	2409.08270	link
2024-09-12	Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis	Qian Chen et.al.	2409.08042	link
2024-09-12	SwinGS: Sliding Window Gaussian Splatting for Volumetric Video Streaming with Arbitrary Length	Bangya Liu et.al.	2409.07759	null
2024-09-11	Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs	Sadra Safadoust et.al.	2409.07456	null
2024-09-11	Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models	Haibo Yang et.al.	2409.07452	link
2024-09-11	Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering	Dafei Qin et.al.	2409.07441	null
2024-09-11	Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks	Ruihan Xu et.al.	2409.07245	null
2024-09-11	ThermalGaussian: Thermal 3D Gaussian Splatting	Rongfeng Lu et.al.	2409.07200	link
2024-09-10	gsplat: An Open-Source Library for Gaussian Splatting	Vickie Ye et.al.	2409.06765	link
2024-09-10	GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction	Junyi Chen et.al.	2409.06685	null
2024-09-10	Sources of Uncertainty in 3D Scene Reconstruction	Marcus Klasson et.al.	2409.06407	link
2024-09-09	Online 3D reconstruction and dense tracking in endoscopic videos	Michel Hayoz et.al.	2409.06037	link
2024-09-09	GASP: Gaussian Splatting for Physic-Based Simulations	Piotr Borycki et.al.	2409.05819	link
2024-09-09	Lagrangian Hashing for Compressed Neural Field Representations	Shrisudhan Govindarajan et.al.	2409.05334	null
2024-09-08	DreamMapping: High-Fidelity Text-to-3D Generation via Variational Distribution Mapping	Zeyu Cai et.al.	2409.05099	null
2024-09-08	GS-PT: Exploiting 3D Gaussian Splatting for Comprehensive Point Cloud Understanding via Self-supervised Learning	Keyi Liu et.al.	2409.04963	null
2024-09-11	Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras	Zimu Liao et.al.	2409.04751	link
2024-09-06	GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers	Lorenza Prospero et.al.	2409.04196	link
2024-09-06	3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors	Yujun Huang et.al.	2409.04013	link
2024-09-05	LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors	Hanyang Yu et.al.	2409.03456	null
2024-09-05	Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction	Shen Chen et.al.	2409.03213	null
2024-09-04	Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models	Zhibin Liu et.al.	2409.02851	link
2024-09-04	Object Gaussian for Monocular 6D Pose Estimation from Sparse Views	Luqing Luo et.al.	2409.02581	null
2024-09-04	GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving	Huasong Han et.al.	2409.02382	null
2024-09-03	DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction	Jenny Seidenschwarz et.al.	2409.02104	null
2024-09-03	PRoGS: Progressive Rendering of Gaussian Splats	Brent Zoomers et.al.	2409.01761	null
2024-09-03	GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting	Zixuan Guo et.al.	2409.01581	null
2024-09-02	Free-DyGS: Camera-Pose-Free Scene Reconstruction based on Gaussian Splatting for Dynamic Surgical Videos	Qian Li et.al.	2409.01003	null
2024-08-31	3D Gaussian Splatting for Large-scale 3D Surface Reconstruction from Aerial Images	YuanZheng Wu et.al.	2409.00381	null
2024-08-31	UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM	Mostafa Mansour et.al.	2409.00362	null
2024-08-30	OG-Mapping: Octree-based Structured 3D Gaussians for Online Dense Mapping	Meng Wang et.al.	2408.17223	null
2024-08-30	2DGH: 2D Gaussian-Hermite Splatting for High-quality Rendering and Better Geometry Reconstruction	Ruihan Yu et.al.	2408.16982	null
2024-08-29	ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model	Fangfu Liu et.al.	2408.16767	null
2024-08-29	OmniRe: Omni Urban Scene Reconstruction	Ziyu Chen et.al.	2408.16760	null
2024-08-28	Towards Realistic Example-based Modeling via 3D Gaussian Stitching	Xinyu Gao et.al.	2408.15708	null
2024-08-28	G-Style: Stylized Gaussian Splatting	Áron Samuel Kovács et.al.	2408.15695	link
2024-08-27	Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty	Saining Zhang et.al.	2408.15242	link
2024-08-27	Learning-based Multi-View Stereo: A Survey	Fangjinhua Wang et.al.	2408.15235	null
2024-08-27	Robo-GS: A Physics Consistent Spatial-Temporal Model for Robotic Arm with Hybrid Representation	Haozhe Lou et.al.	2408.14873	null
2024-08-27	LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming	Yuang Shi et.al.	2408.14823	link
2024-08-26	Avatar Concept Slider: Manipulate Concepts In Your Human Avatar With Fine-grained Control	Yixuan He et.al.	2408.13995	null
2024-08-26	DynaSurfGS: Dynamic Surface Reconstruction with Planar-based Gaussian Splatting	Weiwei Cai et.al.	2408.13972	link
2024-08-27	Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs	Brandon Smart et.al.	2408.13912	null
2024-08-25	TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers	Chuanrui Zhang et.al.	2408.13770	null
2024-08-25	SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting	Wenrui Li et.al.	2408.13711	link
2024-08-23	BiGS: Bidirectional Gaussian Primitives for Relightable 3D Gaussian Splatting	Zhenyuan Liu et.al.	2408.13370	null
2024-08-23	S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points	Bing He et.al.	2408.13036	link
2024-08-23	FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering	Yunji Seo et.al.	2408.12894	null
2024-08-26	GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion	Jiaxin Wei et.al.	2408.12677	link
2024-08-22	Subsurface Scattering for 3D Gaussian Splatting	Jan-Niklas Dihlmann et.al.	2408.12282	null
2024-08-21	Robust 3D Gaussian Splatting for Novel View Synthesis in Presence of Distractors	Paul Ungermann et.al.	2408.11697	link
2024-08-22	DeRainGS: Gaussian Splatting for Enhanced Scene Reconstruction in Rainy Environments	Shuhong Liu et.al.	2408.11540	null
2024-08-21	GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting	Wanshui Gan et.al.	2408.11447	link
2024-08-21	Pano2Room: Novel View Synthesis from a Single Indoor Panorama	Guo Pu et.al.	2408.11413	link
2024-08-20	GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting	Changkun Liu et.al.	2408.11085	link
2024-08-20	ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining	Qi Ma et.al.	2408.10906	null
2024-08-20	DEGAS: Detailed Expressions on Full-Body Gaussian Avatars	Zhijing Shao et.al.	2408.10588	link
2024-08-20	LoopSplat: Loop Closure by Registering 3D Gaussian Splats	Liyuan Zhu et.al.	2408.10154	link
2024-08-19	Implicit Gaussian Splatting with Efficient Multi-Level Tri-Plane Representation	Minye Wu et.al.	2408.10041	null
2024-08-19	SG-GS: Photo-realistic Animatable Human Avatars with Semantically-Guided Gaussian Splatting	Haoyu Zhao et.al.	2408.09665	null
2024-08-20	CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning	Haoyu Zhao et.al.	2408.09663	null
2024-08-20	Gaussian in the Dark: Real-Time View Synthesis From Inconsistent Dark Images Using Gaussian Splatting	Sheng Ye et.al.	2408.09130	link
2024-08-16	Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS	Wei Sun et.al.	2408.08723	null
2024-08-16	GS-ID: Illumination Decomposition on Gaussian Splatting via Diffusion Prior and Parametric Light Source Optimization	Kang Du et.al.	2408.08524	link
2024-08-15	WaterSplatting: Fast Underwater 3D Scene Reconstruction Using Gaussian Splatting	Huapeng Li et.al.	2408.08206	null
2024-08-19	FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering	Guofeng Feng et.al.	2408.07967	link
2024-08-14	Progressive Radiance Distillation for Inverse Rendering with Gaussian Splatting	Keyang Ye et.al.	2408.07595	null
2024-08-14	3D Gaussian Editing with A Single Image	Guan Luo et.al.	2408.07540	null
2024-08-13	SpectralGaussians: Semantic, spectral 3D Gaussian splatting for multi-spectral scene representation, visualization and analysis	Saptarshi Neil Sinha et.al.	2408.06975	null
2024-08-13	HDRGS: High Dynamic Range Gaussian Splatting	Jiahao Wu et.al.	2408.06543	link
2024-08-12	Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering	Jiameng Li et.al.	2408.06286	link
2024-08-12	Developing Smart MAVs for Autonomous Inspection in GPS-denied Constructions	Paoqiang Pan et.al.	2408.06030	null
2024-08-12	HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors	Xiaozheng Zheng et.al.	2408.06019	null
2024-08-10	Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis	Zhongche Qu et.al.	2408.05635	null
2024-08-09	DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow	Hangyu Li et.al.	2408.05008	null
2024-08-14	Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction	Lingbei Meng et.al.	2408.04831	null
2024-08-06	LumiGauss: High-Fidelity Outdoor Relighting with 2D Gaussian Splatting	Joanna Kaleta et.al.	2408.04474	link
2024-08-08	A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery	Mengya Xu et.al.	2408.04426	link
2024-08-08	InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting	Xin-Yi Yu et.al.	2408.04249	null
2024-08-07	Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM	Yan Song Hu et.al.	2408.03825	null
2024-08-07	Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields	Joo Chan Lee et.al.	2408.03822	null
2024-08-07	3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting	Zhe Jun Tang et.al.	2408.03753	link
2024-08-07	PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting	Yijia Guo et.al.	2408.03538	null
2024-08-02	A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness	Lutao Jiang et.al.	2408.01269	null
2024-08-02	Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion	Ke Li et.al.	2408.01225	link
2024-08-07	IG-SLAM: Instant Gaussian SLAM	F. Aykut Sarikamis et.al.	2408.01126	null
2024-08-01	LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting	Zhenyu Bao et.al.	2408.00254	null
2024-07-31	Localized Gaussian Splatting Editing with Contextual Awareness	Hanyuan Xiao et.al.	2408.00083	null
2024-07-31	Expressive Whole-Body 3D Gaussian Avatar	Gyeongsik Moon et.al.	2407.21686	null
2024-07-30	SceneTeller: Language-to-3D Scene Generation	Başak Melis Öcal et.al.	2407.20727	null
2024-07-29	Registering Neural 4D Gaussians for Endoscopic Surgery	Yiming Huang et.al.	2407.20213	null
2024-07-29	Radiance Fields for Robotic Teleoperation	Maximum Wilder-Smith et.al.	2407.20194	link
2024-07-26	ScalingGaussian: Enhancing 3D Content Creation with Generative Gaussian Splatting	Shen Chen et.al.	2407.19035	null
2024-07-25	GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution	Jintong Hu et.al.	2407.18046	null
2024-07-24	3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities	Yanqi Bao et.al.	2407.17418	link
2024-07-29	DHGS: Decoupled Hybrid Gaussian Splatting for Driving Scene	Xi Shi et.al.	2407.16600	null
2024-07-23	HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images	Shreyas Singh et.al.	2407.16503	link
2024-07-23	Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance	Jiyeop Kim et.al.	2407.16173	null
2024-07-22	6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model	Matteo Bortolon et.al.	2407.15484	null
2024-07-22	Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures	Ruizhe Wang et.al.	2407.15435	null
2024-07-21	HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions	Haiyang Zhou et.al.	2407.15187	null
2024-07-20	Realistic Surgical Image Dataset Generation Based On 3D Gaussian Splatting	Tianle Zeng et.al.	2407.14846	null
2024-07-19	A Benchmark for Gaussian Splatting Compression and Quality Assessment Study	Qi Yang et.al.	2407.14197	link
2024-07-19	GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation	Florian Chabot et.al.	2407.14108	null
2024-07-19	DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays	Zongyuan Yang et.al.	2407.14053	null
2024-07-20	Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation	Zongrui Li et.al.	2407.13584	link
2024-07-18	EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting	Yuchen Weng et.al.	2407.13520	null
2024-07-17	Generalizable Human Gaussians for Sparse View Synthesis	Youngjoong Kwon et.al.	2407.12777	link
2024-07-17	Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections	Congrong Xu et.al.	2407.12306	null
2024-07-16	MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification	Zhuoxiao Li et.al.	2407.11840	null
2024-07-16	Click-Gaussian: Interactive Segmentation to Any 3D Gaussians	Seokhun Choi et.al.	2407.11793	null
2024-07-16	SlingBAG: Sliding ball adaptive growth algorithm with differentiable radiation enables super-efficient iterative 3D photoacoustic image reconstruction	Shuang Li et.al.	2407.11781	link
2024-07-16	I $^2$ -SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM	Gwangtak Bae et.al.	2407.11347	null
2024-07-16	Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering	Jingqian Wu et.al.	2407.11343	null
2024-07-16	Gaussian Splatting LK	Liuyue Xie et.al.	2407.11309	null
2024-07-15	iHuman: Instant Animatable Digital Humans From Monocular Videos	Pramish Paudel et.al.	2407.11174	link
2024-07-15	Scaling 3D Reasoning with LMMs to Large Robot Mission Environments Using Datagraphs	W. J. Meijer et.al.	2407.10743	null
2024-07-15	Interactive Rendering of Relightable and Animatable Gaussian Avatars	Youyi Zhan et.al.	2407.10707	link
2024-07-16	RecGS: Removing Water Caustic with Recurrent Gaussian Splatting	Tianyi Zhang et.al.	2407.10318	null
2024-07-14	3DEgo: 3D Editing on the Go!	Umar Khalid et.al.	2407.10102	null
2024-07-14	SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion	Jiyuan Zhang et.al.	2407.10062	null
2024-07-13	Textured-GS: Gaussian Splatting with Spatially Defined Color and Opacity	Zhentao Huang et.al.	2407.09733	link
2024-07-12	StyleSplat: 3D Object Style Transfer with Gaussian Splatting	Sahil Jain et.al.	2407.09473	null
2024-07-11	WildGaussians: 3D Gaussian Splatting in the Wild	Jonas Kulhanek et.al.	2407.08447	link
2024-07-11	Survey on Fundamental Deep Learning 3D Reconstruction Techniques	Yonge Bai et.al.	2407.08137	null
2024-07-10	MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition	Aggelina Chatziagapi et.al.	2407.07284	null
2024-07-09	Reference-based Controllable Scene Stylization with Gaussian Splatting	Yiqun Mei et.al.	2407.07220	null
2024-07-10	3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes	Nicolas Moenne-Loccoz et.al.	2407.07090	null
2024-07-07	PICA: Physics-Integrated Clothed Avatar	Bo Peng et.al.	2407.05324	null
2024-07-07	GaussReg: Fast 3D Registration with Gaussian Splatting	Jiahao Chang et.al.	2407.05254	null
2024-07-06	SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction	Weixing Xie et.al.	2407.05023	link
2024-07-05	Gaussian Eigen Models for Human Heads	Wojciech Zielonka et.al.	2407.04545	null
2024-07-12	Segment Any 4D Gaussians	Shengxiang Ji et.al.	2407.04504	null
2024-07-10	GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction	Yuxuan Mu et.al.	2407.04237	null
2024-07-04	CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images	Junghe Lee et.al.	2407.03923	null
2024-07-04	PFGS: High Fidelity Point Cloud Rendering via Feature Splatting	Jiaxu Wang et.al.	2407.03857	link
2024-07-04	SpikeGS: Reconstruct 3D scene via fast-moving bio-inspired sensors	Yijia Guo et.al.	2407.03771	null
2024-07-04	VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors	Sungwon Hwang et.al.	2407.02945	link
2024-07-03	Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction	Jiaxin Guo et.al.	2407.02918	link
2024-07-04	AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction	Mustafa Khan et.al.	2407.02598	null
2024-07-02	TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation	Chaofan Luo et.al.	2407.02034	null
2024-07-01	DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction	Yujin Ham et.al.	2407.01761	null
2024-07-01	GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting	Chenxin Li et.al.	2407.01301	null
2024-07-01	EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting	Chenxin Li et.al.	2407.01029	null
2024-07-02	RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering	Weikai Lin et.al.	2407.00435	link
2024-06-29	OccFusion: Rendering Occluded Humans with Generative Diffusion Priors	Adam Sun et.al.	2407.00316	null
2024-06-28	SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting	Sara Sabour et.al.	2406.20055	null
2024-06-28	EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting	Daiwei Zhang et.al.	2406.19811	null
2024-06-27	Lightweight Predictive 3D Gaussian Splats	Junli Cao et.al.	2406.19434	link
2024-06-26	Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos	Colton Stearns et.al.	2406.18717	link
2024-06-26	On Scaling Up 3D Gaussian Splatting Training	Hexu Zhao et.al.	2406.18533	link
2024-06-26	GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality	Taoran Yi et.al.	2406.18462	null
2024-06-26	Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning	Muhammad Salman Ali et.al.	2406.18214	link
2024-06-26	GS-Octree: Octree-based 3D Gaussian Splatting for Robust Object-level 3D Reconstruction Under Strong Lighting	Jiaze Li et.al.	2406.18199	null
2024-06-26	VDG: Vision-Only Dynamic Gaussian for Driving Simulation	Hao Li et.al.	2406.18198	null
2024-06-25	NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods	Jonas Kulhanek et.al.	2406.17345	null
2024-06-24	Reducing the Memory Footprint of 3D Gaussian Splatting	Panagiotis Papantonakis et.al.	2406.17074	null
2024-06-24	From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking	Xiaohao Xu et.al.	2406.16850	link
2024-06-24	ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians	Yufei Liu et.al.	2406.16815	null
2024-06-23	LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical Scene Reconstruction	Hengyu Liu et.al.	2406.16073	link
2024-06-23	Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction	Yangdi Lu et.al.	2406.15982	null
2024-06-21	Taming 3DGS: High-Quality Radiance Fields with Limited Resources	Saswat Subhajyoti Mallick et.al.	2406.15643	link
2024-06-21	Gaussian Splatting to Real World Flight Navigation Transfer with Liquid Networks	Alex Quach et.al.	2406.15149	null
2024-06-21	E2GS: Event Enhanced Gaussian Splatting	Hiroyuki Deguchi et.al.	2406.14978	link
2024-06-18	Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models	Paul Henderson et.al.	2406.13099	null
2024-06-18	HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors	Panwang Pan et.al.	2406.12459	link
2024-06-17	A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets	Bernhard Kerbl et.al.	2406.12080	null
2024-06-17	RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians	Bingling Li et.al.	2406.11836	null
2024-06-18	Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting	Junha Hyung et.al.	2406.11672	null
2024-06-16	Physically Embodied Gaussian Splatting: A Realtime Correctable World Model for Robotics	Jad Abou-Chakra et.al.	2406.10788	null
2024-06-14	Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections	Jiacong Xu et.al.	2406.10373	null
2024-06-14	L4GM: Large 4D Gaussian Reconstruction Model	Jiawei Ren et.al.	2406.10324	null
2024-06-14	PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting	Alex Hanson et.al.	2406.10219	link
2024-06-14	GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors	Xiqian Yu et.al.	2406.10111	null
2024-06-14	GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion	Trapoom Ukarapol et.al.	2406.09850	link
2024-06-14	Unified Gaussian Primitives for Scene Representation and Rendering	Yang Zhou et.al.	2406.09733	null
2024-06-13	Modeling Ambient Scene Dynamics for Free-view Synthesis	Meng-Li Shih et.al.	2406.09395	null
2024-06-13	GGHead: Fast and Generalizable 3D Gaussian Heads	Tobias Kirschstein et.al.	2406.09377	null
2024-06-14	AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis	Swapnil Bhosale et.al.	2406.08920	null
2024-06-13	Gaussian-Forest: Hierarchical-Hybrid 3D Gaussian Splatting for Compressed Scene Modeling	Fengyi Zhang et.al.	2406.08759	null
2024-06-12	ICE-G: Image Conditional Editing of 3D Gaussian Splats	Vishnu Jaganathan et.al.	2406.08488	null
2024-06-12	Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models	Yuxuan Xue et.al.	2406.08475	null
2024-06-12	From Chaos to Clarity: 3DGS in the Dark	Zhihao Li et.al.	2406.08300	null
2024-06-11	Trim 3D Gaussian Splatting for Accurate Geometry Representation	Lue Fan et.al.	2406.07499	null
2024-06-11	Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field	Chao Wang et.al.	2406.07329	null
2024-06-10	GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation	Haozhe Xie et.al.	2406.06526	link
2024-06-10	PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction	Danpeng Chen et.al.	2406.06521	null
2024-06-10	MVGamba: Unify 3D Content Generation as State Space Sequence Modeling	Xuanyu Yi et.al.	2406.06367	link
2024-06-10	Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis	Xin Jin et.al.	2406.06216	link
2024-06-09	RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering	Rui Zhang et.al.	2406.05852	null
2024-06-09	VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface Reconstruction	Hanlin Chen et.al.	2406.05774	null
2024-06-06	Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image	Stanislaw Szymanowicz et.al.	2406.04343	link
2024-06-06	A Survey on 3D Human Avatar Modeling – From Reconstruction to Generation	Ruihe Wang et.al.	2406.04253	null
2024-06-06	Localized Gaussian Point Management	Haosen Yang et.al.	2406.04251	null
2024-06-06	Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction	Diwen Wan et.al.	2406.03697	link
2024-06-05	Event3DGS: Event-based 3D Gaussian Splatting for Fast Egomotion	Tianyi Xiong et.al.	2406.02972	null
2024-06-05	Adversarial Generation of Hierarchical Gaussians for 3D Generative Model	Sangeek Hyun et.al.	2406.02968	link
2024-06-04	3D-HGS: 3D Half-Gaussian Splatting	Haolin Li et.al.	2406.02720	link
2024-06-06	Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting	Inkyu Shin et.al.	2406.02541	null
2024-06-04	SatSplatYOLO: 3D Gaussian Splatting-based Virtual Object Detection Ensembles for Satellite Feature Recognition	Van Minh Nguyen et.al.	2406.02533	null
2024-06-04	DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering	Zhongpai Gao et.al.	2406.02518	null
2024-06-04	WE-GS: An In-the-wild Efficient 3D Gaussian Representation for Unconstrained Photo Collections	Yuze Wang et.al.	2406.02407	null
2024-06-04	Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning	Jiaxu Wang et.al.	2406.02370	null
2024-06-04	OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding	Yanmin Wu et.al.	2406.02058	null
2024-06-04	FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping	Yuzhou Ji et.al.	2406.01916	null
2024-06-03	Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting	Shaojie Ma et.al.	2406.01593	null
2024-06-03	Tetrahedron Splatting for 3D Generation	Chun Gu et.al.	2406.01579	link
2024-06-03	DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors	Tianyu Huang et.al.	2406.01476	link
2024-05-31	ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model	Yufei Wang et.al.	2405.20721	link
2024-05-31	R $^2$ -Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction	Ruyi Zha et.al.	2405.20693	link
2024-05-30	$\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving	Nan Huang et.al.	2405.20323	link
2024-06-03	A Pixel Is Worth More Than One 3D Gaussians in Single-View 3D Reconstruction	Jianghao Shen et.al.	2405.20310	null
2024-05-29	EvaGaussians: Event Stream Assisted Gaussian Splatting from Blurry Images	Wangbo Yu et.al.	2405.20224	null
2024-05-30	Object-centric Reconstruction and Tracking of Dynamic Unknown Objects using 3D Gaussian Splatting	Kuldeep R Barad et.al.	2405.20104	null
2024-06-04	PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting	Qiaowei Miao et.al.	2405.19957	link
2024-05-30	GaussianRoom: Improving 3D Gaussian Splatting with SDF Guidance and Monocular Cues for Indoor Scene Reconstruction	Haodong Xiang et.al.	2405.19671	null
2024-05-30	Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian	Wei Sun et.al.	2405.19657	null
2024-05-30	TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM	Peifeng Jiang et.al.	2405.19614	null
2024-05-29	NPGA: Neural Parametric Gaussian Avatars	Simon Giebenhain et.al.	2405.19331	null
2024-05-29	LP-3DGS: Learning to Prune 3D Gaussian Splatting	Zhaoliang Zhang et.al.	2405.18784	link
2024-05-28	GFlow: Recovering 4D World from Monocular Video	Shizun Wang et.al.	2405.18426	null
2024-05-28	3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting	Qihang Zhang et.al.	2405.18424	null
2024-05-28	3D StreetUnveiler with Semantic-Aware 2DGS	Jingwei Xu et.al.	2405.18416	null
2024-05-28	NegGS: Negative Gaussian Splatting	Artur Kasymov et.al.	2405.18163	link
2024-05-28	A Grid-Free Fluid Solver based on Gaussian Spatial Representation	Jingrui Xing et.al.	2405.18133	null
2024-05-28	EG4D: Explicit Generation of 4D Object without Score Distillation	Qi Sun et.al.	2405.18132	link
2024-05-28	RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields	Mihnea-Bogdan Jurca et.al.	2405.18033	link
2024-05-28	FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes	Yunsong Wang et.al.	2405.17958	link
2024-05-28	A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction	Bin Zhang et.al.	2405.17891	null
2024-05-29	HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction	Haoyu Zhao et.al.	2405.17872	link
2024-05-27	MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds	Jiahui Lei et.al.	2405.17421	link
2024-05-27	DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Refocusing,Defocus Rendering and Blur Removal	Yujie Wang et.al.	2405.17351	null
2024-05-27	Memorize What Matters: Emergent Scene Decomposition from Multitraverse	Yiming Li et.al.	2405.17187	link
2024-05-27	F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting	Xiangyu Sun et.al.	2405.17083	null
2024-05-27	SA-GS: Semantic-Aware Gaussian Splatting for Large Scene Reconstruction with Geometry Constrain	Butian Xiong et.al.	2405.16923	null
2024-05-28	PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting	Zipeng Wang et.al.	2405.16829	null
2024-05-26	Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models	Hanwen Liang et.al.	2405.16645	null
2024-05-26	Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians	Erik Sandström et.al.	2405.16544	link
2024-05-24	Feature Splatting for Better Novel View Synthesis with Low Overlap	T. Berriel Martins et.al.	2405.15518	link
2024-05-24	GSDeformer: Direct Cage-based Deformation for 3D Gaussian Splatting	Jiajun Huang et.al.	2405.15491	null
2024-05-24	DisC-GS: Discontinuity-aware Gaussian Splatting	Haoxuan Qu et.al.	2405.15196	null
2024-05-24	HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting	Yuanhao Cai et.al.	2405.15125	link
2024-05-24	GS-Hider: Hiding Messages into 3D Gaussian Splatting	Xuanyu Zhang et.al.	2405.15118	null
2024-05-23	EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting	Jiaxu Wang et.al.	2405.14959	link
2024-05-23	Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras	Hanzhang Tu et.al.	2405.14866	null
2024-05-23	MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes	Ruiyuan Gao et.al.	2405.14475	null
2024-05-23	TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing	Teng Xu et.al.	2405.14455	null
2024-05-24	RoGS: Large Scale Road Surface Reconstruction based on 2D Gaussian Splatting	Zhiheng Feng et.al.	2405.14342	link
2024-05-23	D-MiSo: Editing Dynamic 3D Scenes using Multi-Gaussians Soup	Joanna Waczyńska et.al.	2405.14276	link
2024-05-22	DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus	Yu Chen et.al.	2405.13943	link
2024-05-22	Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances	Licheng Shen et.al.	2405.13694	null
2024-05-21	MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video	Hongsheng Wang et.al.	2405.12806	null
2024-05-21	LAGA: Layered 3D Avatar Generation and Customization via Gaussian Splatting	Jia Gong et.al.	2405.12663	null
2024-05-21	Gaussian Control with Hierarchical Semantic Graphs in 3D Human Recovery	Hongsheng Wang et.al.	2405.12477	null
2024-05-20	GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details	Boqian Li et.al.	2405.12420	link
2024-05-20	AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field	Rong Liu et.al.	2405.12369	link
2024-05-20	Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo	Tianqi Liu et.al.	2405.12218	link
2024-05-20	Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents	Guanlin Wu et.al.	2405.12155	null
2024-05-20	CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization	Jiawei Zhang et.al.	2405.12110	link
2024-05-21	Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping	Tianhao Wu et.al.	2405.12069	null
2024-05-20	MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections	Jiayue Liu et.al.	2405.11921	null
2024-05-18	Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching	Xingyu Miao et.al.	2405.11252	link
2024-05-18	MotionGS : Compact Gaussian Splatting SLAM by Motion Filter	Xinli Guo et.al.	2405.11129	link
2024-05-17	Photorealistic 3D Urban Scene Reconstruction and Point Cloud Extraction using Google Earth Imagery and Gaussian Splatting	Kyle Gao et.al.	2405.11021	null
2024-05-17	ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation	Pengzhi Li et.al.	2405.10508	null
2024-05-16	GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction	Rui Jin et.al.	2405.10142	null
2024-05-15	From NeRFs to Gaussian Splats, and Back	Siming He et.al.	2405.09717	link
2024-05-13	GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting	Haodong Chen et.al.	2405.07472	null
2024-05-11	Direct Learning of Mesh and Appearance via 3D Gaussian Splatting	Ancheng Lin et.al.	2405.06945	null
2024-05-10	OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation	Jinwei Lin et.al.	2405.06547	link
2024-05-10	I3DGS: Improve 3D Gaussian Splatting from Multiple Dimensions	Jinwei Lin et.al.	2405.06408	null
2024-05-10	MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization	Pengcheng Zhu et.al.	2405.06241	null
2024-05-09	DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation	Sitian Shen et.al.	2405.05800	null
2024-05-09	FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting	Yikun Ma et.al.	2405.05768	null
2024-05-09	NGM-SLAM: Gaussian Splatting SLAM with Radiance Field Submap	Mingrui Li et.al.	2405.05702	null

Stereo Matching

Publish Date	Title	Authors	PDF	Code
2025-07-23	Observation of Astrophysical Sources with SST-1M Telescopes – First Results	Jakub Juryšek et.al.	2507.17451	null
2025-07-23	Do male leading authors retract more articles than female leading authors?	Er-Te Zheng et.al.	2507.17127	null
2025-07-22	A Weighted Likelihood Approach Based on Statistical Data Depths	Claudio Agostinelli et.al.	2507.16998	null
2025-07-22	A resource theoretical unification of Mpemba effects: classical and quantum	Alessandro Summer et.al.	2507.16976	null
2025-07-22	Stereo performance of SST-1M at different altitudes	Patrik Čechvala et.al.	2507.16681	null
2025-07-22	confopt: A Library for Implementation and Evaluation of Gradient-based One-Shot NAS Methods	Abhash Kumar Jha et.al.	2507.16533	null
2025-07-22	Sparse-View 3D Reconstruction: Recent Advances and Open Challenges	Tanveer Younis et.al.	2507.16406	null
2025-07-22	ADCD-Net: Robust Document Image Forgery Localization via Adaptive DCT Feature and Hierarchical Content Disentanglement	Kahim Wong et.al.	2507.16397	null
2025-07-22	MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation	Yanchen Liu et.al.	2507.16310	null
2025-07-22	BDIViz: An Interactive Visualization System for Biomedical Schema Matching with LLM-Powered Validation	Eden Wu et.al.	2507.16117	null
2025-07-21	Double-offset Cassegrain telescopes for the Ultraviolet Type Ia (UVIa) mission concept	Fernando Cruz Aguirre et.al.	2507.16006	null
2025-07-21	Graph Attention Specialized Expert Fusion Model for Node Classification: Based on Cora and Pubmed Datasets	Zihang Ma et.al.	2507.15784	null
2025-07-21	Hierarchical Graph Information Bottleneck for Multi-Behavior Recommendation	Hengyu Zhang et.al.	2507.15395	null
2025-07-21	BenchDepth: Are We on the Right Way to Evaluate Depth Foundation Models?	Zhenyu Li et.al.	2507.15321	null
2025-07-20	Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction	Xiufeng Huang et.al.	2507.14921	null
2025-07-20	Towards Geometric and Textural Consistency 3D Scene Generation via Single Image-guided Model Generation and Layout Optimization	Xiang Tang et.al.	2507.14841	null
2025-07-20	An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks	Xinyi Wu et.al.	2507.14798	null
2025-07-19	Toward Inclusive AI-Driven Development: Exploring Gender Differences in Code Generation Tool Interactions	Manaal Basha et.al.	2507.14770	null
2025-07-19	Task Mode: Dynamic Filtering for Task-Specific Web Navigation using LLMs	Ananya Gubbi Mohanbabu et.al.	2507.14769	null
2025-07-19	Disparities in Peer Review Tone and the Role of Reviewer Anonymity	Maria Sahakyan et.al.	2507.14741	null
2025-07-19	MultiRetNet: A Multimodal Vision Model and Deferral System for Staging Diabetic Retinopathy	Jeannie She et.al.	2507.14738	null
2025-07-18	TimeNeRF: Building Generalizable Neural Radiance Fields across Time from Few-Shot Input Views	Hsiang-Hui Hung et.al.	2507.13929	null
2025-07-17	Uncertainty Quantification Framework for Aerial and UAV Photogrammetry through Error Propagation	Debao Huang et.al.	2507.13486	null
2025-07-17	Linking Multi-Site Sex Ad Data at the Individual Level to Aid Counter-Trafficking Efforts	Nickolas K. Freeman et.al.	2507.13477	null
2025-07-17	SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation	Weizhi Zhang et.al.	2507.13336	null
2025-07-17	$S^2M^2$ : Scalable Stereo Matching Model for Reliable Depth Estimation	Junhong Min et.al.	2507.13229	null
2025-07-17	A Classification of Six Functor Formalisms via Structured Spaces	Salash Tolan Nabaala et.al.	2507.13114	null
2025-07-17	Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities	Liuyi Wang et.al.	2507.13019	null
2025-07-17	FedGA: A Fair Federated Learning Framework Based on the Gini Coefficient	ShanBin Liu et.al.	2507.12983	null
2025-07-17	DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization	Huakang Chen et.al.	2507.12890	null
2025-07-16	Data Transformation Strategies to Remove Heterogeneity	Sangbong Yoo et.al.	2507.12677	null
2025-07-16	Wavelet-based Decoupling Framework for low-light Stereo Image Enhancement	Shuangli Du et.al.	2507.12188	null
2025-07-16	Stereo Sound Event Localization and Detection with Onscreen/offscreen Classification	Kazuki Shimada et.al.	2507.12042	null
2025-07-16	Dual form Complementary Masking for Domain-Adaptive Image Segmentation	Jiawen Wang et.al.	2507.12008	null
2025-07-15	Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation	Zhen Xu et.al.	2507.11540	null
2025-07-15	Uniting the World by Dividing it: Federated Maps to Enable Spatial Applications	Sagar Bharadwaj et.al.	2507.11437	null
2025-07-15	Caveats about measuring carbon abundances in stars using the CH band	Pablo Santos-Peral et.al.	2507.11351	null
2025-07-15	MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network	Jianfei Jiang et.al.	2507.11333	null
2025-07-15	Fairness-Aware Grouping for Continuous Sensitive Variables: Application for Debiasing Face Analysis with respect to Skin Tone	Veronika Shilova et.al.	2507.11247	null
2025-07-15	Generative Click-through Rate Prediction with Applications to Search Advertising	Lingwei Kong et.al.	2507.11246	null
2025-07-17	MMOne: Representing Multiple Modalities in One Scene	Zhifeng Gu et.al.	2507.11129	null
2025-07-15	Urban delineation through the lens of commute networks: Leveraging graph embeddings to distinguish socioeconomic groups in cities	Devashish Khulbe et.al.	2507.11057	null
2025-07-15	Uncertainty Aware Mapping for Vision-Based Underwater Robots	Abhimanyu Bhowmik et.al.	2507.10991	null
2025-07-15	Terms and Conditions (Do Not) Apply: Understanding Exploitation Disparities in Design of Mobile-Based Financial Services	Lindah Kotut et.al.	2507.10970	null
2025-07-14	Cameras as Relative Positional Encoding	Ruilong Li et.al.	2507.10496	null
2025-07-14	Rows and Capabilities as Modal Effects	Wenhao Tang et.al.	2507.10301	null
2025-07-14	Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures	Xinlong Ding et.al.	2507.10265	null
2025-07-14	Is Micro-expression Ethnic Leaning?	Huai-Qian Khor et.al.	2507.10209	null
2025-07-14	Minimizing the Pretraining Gap: Domain-aligned Text-Based Person Retrieval	Shuyu Yang et.al.	2507.10195	null
2025-07-14	Simulating Biases for Interpretable Fairness in Offline and Online Classifiers	Ricardo Inácio et.al.	2507.10154	null
2025-07-14	Efficient RF Chain Selection for MIMO Integrated Sensing and Communications: A Greedy Approach	Subin Shin et.al.	2507.09960	null
2025-07-13	EventHunter: Dynamic Clustering and Ranking of Security Events from Hacker Forum Discussions	Yasir Ech-Chammakhy et.al.	2507.09762	null
2025-07-13	Pre-trained Under Noise: A Framework for Robust Bone Fracture Detection in Medical Imaging	Robby Hoover et.al.	2507.09731	null
2025-07-13	Electric Vehicle Public Charging Equity Considerations: A Systematic Review	Boyou Chen et.al.	2507.09726	null
2025-07-11	Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT	Wei Zhang et.al.	2507.08448	null
2025-07-11	PanMatch: Unleashing the Potential of Large Vision Models for Unified Matching Models	Yongjian Zhang et.al.	2507.08400	null
2025-07-10	Highly accurate simulations of asymmetric black-hole scattering and cross validation of effective-one-body models	Oliver Long et.al.	2507.08071	null
2025-07-10	Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions	Longfei Li et.al.	2507.07978	null
2025-07-10	On-Manifold Low-Thrust Maneuvering of Quasi-Periodic Orbits	Ian M. Down et.al.	2507.07940	null
2025-07-10	TRIX- Trading Adversarial Fairness via Mixed Adversarial Training	Tejaswini Medi et.al.	2507.07768	null
2025-07-10	Prime Power Residues and Blocking Sets	Bhawesh Mishra et.al.	2507.07673	null
2025-07-10	Bridging the gap in FER: addressing age bias in deep learning	F. Xavier Gaya-Morey et.al.	2507.07638	null
2025-07-10	Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects	Yuqi Cheng et.al.	2507.07435	null
2025-07-09	Dirty Data in the Newsroom: Comparing Data Preparation in Journalism and Data Science	Stephen Kasica et.al.	2507.07238	null
2025-07-09	Combining Pre-Trained Models for Enhanced Feature Representation in Reinforcement Learning	Elia Piccoli et.al.	2507.07197	null
2025-07-09	Correlations between Dust Extinction Features across All Wavelength Scales: From Diffuse Interstellar Bands to R(V)	Andrew K. Saydjari et.al.	2507.07162	null
2025-07-09	Hierarchical Feature Alignment for Gloss-Free Sign Language Translation	Sobhan Asasi et.al.	2507.06732	null
2025-07-09	Photometric Stereo using Gaussian Splatting and inverse rendering	Matéo Ducastel et.al.	2507.06684	null
2025-07-09	Transferable Parasitic Estimation via Graph Contrastive Learning and Label Rebalancing in AMS Circuits	Shan Shen et.al.	2507.06535	null
2025-07-08	Bridging Sequential Deep Operator Network and Video Diffusion: Residual Refinement of Spatio-Temporal PDE Solutions	Jaewan Park et.al.	2507.06133	null
2025-07-08	Discontinuity-aware Normal Integration for Generic Central Camera Models	Francesco Milano et.al.	2507.06075	null
2025-07-08	Bridging Perception and Language: A Systematic Benchmark for LVLMs’ Understanding of Amodal Completion Reports	Amane Watahiki et.al.	2507.05799	null
2025-07-08	Fairness-Aware Static and Dynamic Assortment Optimization: Optimal Selection with Balanced Market Share	Omar El Housni et.al.	2507.05606	null
2025-07-08	SingLoRA: Low Rank Adaptation Using a Single Matrix	David Bensaïd et.al.	2507.05566	null
2025-07-07	Incorporating Interventional Independence Improves Robustness against Interventional Distribution Shift	Gautam Sreekumar et.al.	2507.05412	null
2025-07-07	Feature Geometry for Stereo Sidescan and Forward-looking Sonar	Kalin Norman et.al.	2507.05410	null
2025-07-07	Parametric Object Coding in IVAS: Efficient Coding of Multiple Audio Objects at Low Bit Rates	Andrea Eichenseer et.al.	2507.05409	null
2025-07-07	Stereo Reproduction in the Presence of Sample Rate Offsets	Srikanth Korse et.al.	2507.05402	null
2025-07-07	Untangling Selberg from the Wilson spool: 1-loop determinants and trace formulae in (A)dS $_{3}$	Samuel Haupfear et.al.	2507.05358	null
2025-07-07	Causal Impacts of Protected Bike Lanes on Cycling Behavior with Demographic Disparities	Marcel Moran et.al.	2507.04936	null
2025-07-07	Spatial and Semantic Embedding Integration for Stereo Sound Event Localization and Detection in Regular Videos	Davide Berghi et.al.	2507.04845	null
2025-07-07	Toward Valid Measurement Of (Un)fairness For Generative AI: A Proposal For Systematization Through The Lens Of Fair Equality of Chances	Kimberly Le Truong et.al.	2507.04641	null
2025-07-07	Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts	Yun Wang et.al.	2507.04631	null
2025-07-07	DisMS-TS: Eliminating Redundant Multi-Scale Features for Time Series Classification	Zhipeng Liu et.al.	2507.04600	null
2025-07-06	Thousand-Brains Systems: Sensorimotor Intelligence for Rapid, Robust Learning and Inference	Niels Leadholm et.al.	2507.04494	null
2025-07-05	Nested economies of scale in city mass	Kangning Huang et.al.	2507.03960	null
2025-07-04	Assessing the Viability of Wave Field Synthesis in VR-Based Cognitive Research	Benjamin Kahl et.al.	2507.03797	null
2025-07-04	Improving Social Determinants of Health Documentation in French EHRs Using Large Language Models	Adrien Bazoge et.al.	2507.03433	null
2025-07-04	CME activities on spotless days during descending phase of solar cycles 23 and 24	Dipali Burud et.al.	2507.03399	null
2025-07-02	The Illusion of Fairness: Auditing Fairness Interventions with Audit Studies	Disa Sariola et.al.	2507.02152	null
2025-07-02	The Thin Line Between Comprehension and Persuasion in LLMs	Adrian de Wynter et.al.	2507.01936	null
2025-07-02	How Do Vision-Language Models Process Conflicting Information Across Modalities?	Tianze Hua et.al.	2507.01790	null
2025-07-02	RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather	Yuran Wang et.al.	2507.01653	null
2025-07-02	Adapting Language Models to Indonesian Local Languages: An Empirical Study of Language Transferability on Zero-Shot Settings	Rifki Afina Putri et.al.	2507.01645	null
2025-07-02	Two Cases of Non-Radial Filament Eruption and Associated CME Deflection	Kostadinka Koleva et.al.	2507.01580	null
2025-07-02	Penalizing Transparency? How AI Disclosure and Author Demographics Shape Human and AI Judgments About Writing	Inyoung Cheong et.al.	2507.01418	null
2025-07-01	Improving Stereo 3D Sound Event Localization and Detection: Perceptual Features, Stereo-specific Data Augmentation, and Distance Normalization	Jun-Wei Yeow et.al.	2507.00874	null
2025-07-01	Impact of temperature asymmetry and small fraction of static positive ions on the relaxed states of a relativistic hot pair plasma	Usman Shazad et.al.	2507.00760	null
2025-07-01	Renormalization group based implicit function approach to connecting orbits	Pengfei Guo et.al.	2507.00749	null
2025-07-01	Self-organization of earth’s inner magnetospheric multi-ion plasma	Usman Shazad et.al.	2507.00734	null
2025-06-30	Development of Hybrid Artificial Intelligence Training on Real and Synthetic Data: Benchmark on Two Mixed Training Strategies	Paul Wachter et.al.	2506.24093	null
2025-06-30	Simultaneous Super-Resolution of Spatial and Spectral Imaging with a Camera Array and Notch Filters	Peng Lin et.al.	2506.24014	null
2025-06-30	Statistical Modeling for Accurate Characterization of Doppler Effect in LEO-Terrestrial Networks	Islam M. Tanash et.al.	2506.23817	null
2025-06-30	AdFair-CLIP: Adversarial Fair Contrastive Language-Image Pre-training for Chest X-rays	Chenlang Yi et.al.	2506.23467	null
2025-06-29	Zero-disparity Distribution Synthesis: Fast Exact Calculation of Chi-Squared Statistic Distribution for Discrete Uniform Histograms	Nikola Banić et.al.	2506.23416	null
2025-06-29	Datasets for Fairness in Language Models: An In-Depth Survey	Jiale Zhang et.al.	2506.23411	null
2025-06-29	Modeling European Electricity Market Integration during turbulent times	Francesco Ravazzolo et.al.	2506.23289	null
2025-06-29	Event-based Stereo Visual-Inertial Odometry with Voxel Map	Zhaoxing Zhang et.al.	2506.23078	null
2025-06-28	Feature-Wise Mixing for Mitigating Contextual Bias in Predictive Supervised Learning	Yash Vardhan Tomar et.al.	2506.23033	null
2025-06-28	SPICE-HL3: Single-Photon, Inertial, and Stereo Camera dataset for Exploration of High-Latitude Lunar Landscapes	David Rodríguez-Martínez et.al.	2506.22956	null
2025-06-27	Towards Fair Rankings: Leveraging LLMs for Gender Bias Detection and Measurement	Maryam Mousavian et.al.	2506.22372	null
2025-06-27	NoticeLight: Embracing Socio-Technical Asymmetry through Tangible Peripheral Robotic Embodiment in Hybrid Collaboration	Marie Altmann et.al.	2506.22125	null
2025-06-27	Quantifying Institutional Gender Inequality in Contemporary Visual Art	Xindi Wang et.al.	2506.22103	null
2025-06-27	Seismic resolution enhancement via deep Learning with Knowledge Distillation and Domain Adaptation	Hanpeng Cai et.al.	2506.22018	null
2025-06-27	SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images	Naftaly Wambugu et.al.	2506.21945	null
2025-06-26	Counterfactual Voting Adjustment for Quality Assessment and Fairer Voting in Online Platforms with Helpfulness Evaluation	Chang Liu et.al.	2506.21362	null
2025-06-26	ToosiCubix: Monocular 3D Cuboid Labeling via Vehicle Part Annotations	Behrooz Nasihatkon et.al.	2506.21358	null
2025-06-26	ESMStereo: Enhanced ShuffleMixer Disparity Upsampling for Real-Time and Accurate Stereo Matching	Mahmoud Tahmasebi et.al.	2506.21091	null
2025-06-26	The Role of Cyclopean-Eye in Stereo Vision	Sherlon Almeida da Silva et.al.	2506.20900	null
2025-06-25	THIRDEYE: Cue-Aware Monocular Depth Estimation via Brain-Inspired Multi-Stage Fusion	Calin Teodor Ioan et.al.	2506.20877	null
2025-06-25	StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation	Haodong Li et.al.	2506.20756	null
2025-06-25	Don’t Hash Me Like That: Exposing and Mitigating Hash-Induced Unfairness in Local Differential Privacy	Berkay Kemal Balioglu et.al.	2506.20290	null
2025-06-25	Effects of flame macrostructures on the combustion dynamics of novel counter-rotating radial swirl injector in a model can combustor	SK Thirumalaikumaran et.al.	2506.20138	null
2025-06-24	Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation	Jun Wang et.al.	2506.19774	null
2025-06-24	Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders	Matyas Bohacek et.al.	2506.19708	null
2025-06-24	Recurrent Visual Feature Extraction and Stereo Attentions for CT Report Generation	Yuanhe Tian et.al.	2506.19665	null
2025-06-24	AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models	Zeyu Li et.al.	2506.19505	null
2025-06-24	MuBench: Assessment of Multilingual Capabilities of Large Language Models Across 61 Languages	Wenhan Han et.al.	2506.19468	null
2025-06-24	Online camera-pose-free stereo endoscopic tissue deformation recovery with tissue-invariant vision-biomechanics consistency	Jiahe Chen et.al.	2506.19388	null
2025-06-23	MOSCARD – Causal Reasoning and De-confounding for Multimodal Opportunistic Screening of Cardiovascular Adverse Events	Jialu Pi et.al.	2506.19174	null
2025-06-23	Identifying Causally-Robust Mediators of Health Disparities: A Review and Simulation Studies With Directed Acyclic Graphs	Soojin Park et.al.	2506.19047	null
2025-06-23	Simulation-Based Sensitivity Analysis in Optimal Treatment Regimes and Causal Decomposition with Individualized Interventions	Soojin Park et.al.	2506.19010	null
2025-06-23	Causal Decomposition Analysis with Synergistic Interventions: A Triply-Robust Machine Learning Approach to Addressing Multiple Dimensions of Social Disparities	Soojin Park et.al.	2506.18994	null
2025-06-23	Light of Normals: Unified Feature Representation for Universal Photometric Stereo	Hong Li et.al.	2506.18882	null
2025-06-23	Evaluating Multichannel Speech Enhancement Algorithms at the Phoneme Scale Across Genders	Nasser-Eddine Monir et.al.	2506.18691	null
2025-06-23	NOVA: Navigation via Object-Centric Visual Autonomy for High-Speed Target Tracking in Unstructured GPS-Denied Environments	Alessandro Saviolo et.al.	2506.18689	null
2025-06-23	Bias vs Bias – Dawn of Justice: A Fair Fight in Recommendation Systems	Tahsin Alamgir Kheya et.al.	2506.18327	null
2025-06-22	Mental Health Equity in LLMs: Leveraging Multi-Hop Question Answering to Detect Amplified and Silenced Perspectives	Batool Haider et.al.	2506.18116	null
2025-06-22	StereoTacTip: Vision-based Tactile Sensing with Biomimetic Skin-Marker Arrangements	Chenghua Lu et.al.	2506.18040	null
2025-06-22	Feedback Driven Multi Stereo Vision System for Real-Time Event Analysis	Mohamed Benkedadra et.al.	2506.17910	null
2025-06-21	In-Context Learning Strategies Emerge Rationally	Daniel Wurgaft et.al.	2506.17859	null
2025-06-21	Learning to Dock: A Simulation-based Study on Closing the Sim2Real Gap in Autonomous Underwater Docking	Kevin Chang et.al.	2506.17823	null
2025-06-21	Optimization-Free Patch Attack on Stereo Depth Estimation	Hangcheng Liu et.al.	2506.17632	null
2025-06-20	YASMOT: Yet another stereo image multi-object tracker	Ketil Malde et.al.	2506.17186	link
2025-06-20	Are Bias Evaluation Methods Biased ?	Lina Berrayana et.al.	2506.17111	null
2025-06-20	Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping	Teng Guo et.al.	2506.17110	null
2025-06-20	Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks	Samer Lahoud et.al.	2506.17063	null
2025-06-20	LunarLoc: Segment-Based Global Localization on the Moon	Annika Thomas et.al.	2506.16940	link
2025-06-20	DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches	Yun Xing et.al.	2506.16690	null
2025-06-19	External Evaluation of Discrimination Mitigation Efforts in Meta’s Ad Delivery	Basileal Imana et.al.	2506.16560	null
2025-06-19	PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking	Yan Zhou et.al.	2506.16379	null
2025-06-19	Heterotopic energy for Sobolev mappings	Antoine Detaille et.al.	2506.16204	null
2025-06-19	Solar Transient Recognition Using Deep Learning (STRUDL) for heliospheric imager data	Maike Bauer et.al.	2506.16194	null
2025-06-18	Mono-Modalizing Extremely Heterogeneous Multi-Modal Medical Image Registration	Kyobin Choo et.al.	2506.15596	null
2025-06-18	SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models’ Knowledge of Indian Culture	Arijit Maji et.al.	2506.15355	null
2025-06-18	Dissecting the gender divide: Authorship and acknowledgment in scientific publications	Keigo Kusumegi et.al.	2506.15237	null
2025-06-18	Transit for All: Mapping Equitable Bike2Subway Connection using Region Representation Learning	Min Namgung et.al.	2506.15113	null
2025-06-18	3D Vision-tactile Reconstruction from Infrared and Visible Images for Robotic Fine-grained Tactile Perception	Yuankai Lin et.al.	2506.15087	null
2025-06-17	Time-Optimized Safe Navigation in Unstructured Environments through Learning Based Depth Completion	Jeffrey Mao et.al.	2506.14975	null
2025-06-17	Cost-Aware Routing for Efficient Text-To-Image Generation	Qinchan et.al.	2506.14753	null
2025-06-17	DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning	Kunal Swami et.al.	2506.14709	null
2025-06-17	One Size Fits None: Rethinking Fairness in Medical AI	Roland Roller et.al.	2506.14400	null
2025-06-17	Consensus Power Inequality: A Comparative Study of Blockchain Networks	Kamil Tylinski et.al.	2506.14393	null
2025-06-16	Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble	Zhiqi Wang et.al.	2506.13972	link
2025-06-16	Bias Delayed is Bias Denied? Assessing the Effect of Reporting Delays on Disparity Assessments	Jennah Gosciak et.al.	2506.13735	link
2025-06-16	Multiview Geometric Regularization of Gaussian Splatting for Accurate Radiance Fields	Jungeon Kim et.al.	2506.13508	null
2025-06-16	Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling	Wenmiao Gao et.al.	2506.13455	null
2025-06-16	Cloud-to-cloud velocity dispersions across a Local arm segment	Lixia Yuan et.al.	2506.13424	null
2025-06-16	DVP-MVS++: Synergize Depth-Normal-Edge and Harmonized Visibility Prior for Multi-View Stereo	Zhenlong Yuan et.al.	2506.13215	null
2025-06-16	Equitable Electronic Health Record Prediction with FAME: Fairness-Aware Multimodal Embedding	Nikkie Hooman et.al.	2506.13104	null
2025-06-14	Recent Advances and Future Directions in Literature-Based Discovery	Andrej Kastrin et.al.	2506.12385	null
2025-06-14	Path-specific effects for pulse-oximetry guided decisions in critical care	Kevin Zhang et.al.	2506.12371	null
2025-06-16	A Reference Model and Patterns for Production Event Data Enrichment	Mark van der Pas et.al.	2506.11502	null
2025-06-16	SemanticST: Spatially Informed Semantic Graph Learning for Clustering, Integration, and Scalable Analysis of Spatial Transcriptomics	Roxana Zahedi et.al.	2506.11491	link
2025-06-13	A Watermark for Auto-Regressive Image Generation Models	Yihan Wu et.al.	2506.11371	null
2025-06-12	Forbidden configurations for coherency	Victoria Gould et.al.	2506.11321	null
2025-06-12	Principled Approaches for Extending Neural Architectures to Function Spaces for Operator Learning	Julius Berner et.al.	2506.10973	link
2025-06-12	FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition	Jongsuk Kim et.al.	2506.10747	null
2025-06-12	Balancing Tails when Comparing Distributions: Comprehensive Equity Index (CEI) with Application to Bias Evaluation in Operational Face Biometrics	Imanol Solano et.al.	2506.10564	null
2025-06-12	EasyDRAM: An FPGA-based Infrastructure for Fast and Accurate End-to-End Evaluation of Emerging DRAM Techniques	Oğuzhan Canpolat et.al.	2506.10441	link
2025-06-12	Transcorrelated Theory for Transition Metal Atoms	Kristoffer Simula et.al.	2506.10429	null
2025-06-12	PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting	Lintao Xiang et.al.	2506.10335	null
2025-06-12	A Novel Feedforward Youla Parameterization Method for Avoiding Local Minima in Stereo Image Based Visual Servoing Control	Rongfei Li et.al.	2506.10252	null
2025-06-10	Down But Not Out: The Case of Long-Period Comet C/2021 O3 (Panstarrs)	David Jewitt. Jing Li et.al.	2506.09263	null
2025-06-10	Princeton365: A Diverse Dataset with Accurate Camera Pose	Karhan Kayan et.al.	2506.09035	null
2025-06-10	Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia	Katelyn Xiaoying Mei et.al.	2506.08846	link
2025-06-11	Towards Fair Representation: Clustering and Consensus	Diptarka Chakraborty et.al.	2506.08673	null
2025-06-09	Unmasking inequility: socio-economic determinants and gender disparities in Maharashtra and India’s health outcomes – Insights from NFHS-5	Sharmishtha Raghuvanshi et.al.	2506.08206	null
2025-06-09	GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors	Wenlong Meng et.al.	2506.08188	null
2025-06-09	Balanced Area Deprivation Index (bADI): Enhancing social determinants of health indices to strengthen their association with healthcare clinical outcomes, utilization and costs	Mohammad Amin Morid et.al.	2506.08131	null
2025-06-09	Unraveling Ethereum’s Mempool: The Impact of Fee Fairness, Transaction Prioritization, and Consensus Efficiency	S M Mostaq Hossain et.al.	2506.07988	null
2025-06-09	LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement	Dimitris Panagopoulos et.al.	2506.07915	null
2025-06-09	Erbium-implanted WS2 flakes with room-temperature photon emission at telecom wavelengths	Guadalupe García-Arellano et.al.	2506.07746	null
2025-06-09	Federated In-Context Learning: Iterative Refinement for Improved Answer Quality	Ruhan Wang et.al.	2506.07440	null
2025-06-09	The impact of extracurricular education on socioeconomic mobility in Japan: an application of causal machine learning	Yang Qiang et.al.	2506.07421	null
2025-06-08	Analyzing Breast Cancer Survival Disparities by Race and Demographic Location: A Survival Analysis Approach	Ramisa Farha et.al.	2506.07191	null
2025-06-08	Optimal Transport Driven Asymmetric Image-to-Image Translation for Nuclei Segmentation of Histological Images	Suman Mahapatra et.al.	2506.07023	null
2025-06-08	End-to-End Probabilistic Framework for Learning with Hard Constraints	Utkarsh Utkarsh et.al.	2506.07003	null
2025-06-07	Spatial Disparities in Fire Shelter Accessibility: Capacity Challenges in the Palisades and Eaton Fires	Su Yeon Han et.al.	2506.06803	null
2025-06-06	Enhancing Situational Awareness in Underwater Robotics with Multi-modal Spatial Perception	Pushyami Kaveti et.al.	2506.06476	null
2025-06-06	PyGemini: Unified Software Development towards Maritime Autonomy Systems	Kjetil Vasstein et.al.	2506.06262	null
2025-06-06	Masked Language Models are Good Heterogeneous Graph Generalizers	Jinyu Yang et.al.	2506.06157	link
2025-06-06	SVD: Spatial Video Dataset	M. H. Izadimehr et.al.	2506.06037	null
2025-06-06	Restereo: Diffusion stereo video generation and restoration	Xingchang Huang et.al.	2506.06023	null
2025-06-06	Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning	Fan Yang et.al.	2506.05997	null
2025-06-06	A Culturally-Rich Romanian NLP Dataset from “Who Wants to Be a Millionaire?” Videos	Alexandru-Gabriel Ganea et.al.	2506.05991	null
2025-06-06	NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces	Pierluigi Zama Ramirez et.al.	2506.05815	null
2025-06-06	Efficient Online RFT with Plug-and-Play LLM Judges: Unlocking State-of-the-Art Performance	Rudransh Agnihotri et.al.	2506.05748	null
2025-06-06	Aerial Multi-View Stereo via Adaptive Depth Range Inference and Normal Cues	Yimei Liu et.al.	2506.05655	null
2025-06-05	Planets similar in size are often dissimilar in interior	E. Mamonova et.al.	2506.05089	link
2025-06-05	Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer	Filip Slezak et.al.	2506.04908	null
2025-06-05	Is It JUST Semantics? A Case Study of Discourse Particle Understanding in LLMs	William Sheffield et.al.	2506.04534	null
2025-06-04	The Latent Space Hypothesis: Toward Universal Medical Representation Learning	Salil Patel et.al.	2506.04515	null
2025-06-04	Edge interventions can mitigate demographic and prestige disparities in the Computer Science coauthorship network	Kate Barnes et.al.	2506.04435	link
2025-06-04	MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale	Ran Xu et.al.	2506.04405	null
2025-06-06	Enduring Disparities in the Workplace: A Pilot Study in the AI Community	Yunusa Simpa Abdulsalam et.al.	2506.04305	null
2025-06-04	Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation	Tianyu Huang et.al.	2506.04225	null
2025-06-04	Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness	Stephen R. Pfohl et.al.	2506.04193	link
2025-06-04	Lions and Muons: Optimization via Stochastic Frank-Wolfe	Maria-Eleni Sfyraki et.al.	2506.04192	null
2025-06-04	Multi-view Surface Reconstruction Using Normal and Reflectance Cues	Robin Bruneau et.al.	2506.04115	link
2025-06-04	When Fairness Isn’t Statistical: The Limits of Machine Learning in Evaluating Legal Reasoning	Claire Barale et.al.	2506.03913	null
2025-06-04	FedFACT: A Provable Framework for Controllable Group-Fairness Calibration in Federated Learning	Li Zhang et.al.	2506.03777	null
2025-06-04	Analyzing Pension Fund Mortality with Gaussian Processes in a Sub Population Framework	Eduardo F. L. de Melo et.al.	2506.03584	null
2025-06-04	Time-Domain Excitation of Complex Resonances	Asaf Farhi et.al.	2506.03485	null
2025-06-03	Targeted Forgetting of Image Subgroups in CLIP Models	Zeliang Zhang et.al.	2506.03117	null
2025-06-03	A Multi-Agent Framework for Mitigating Dialect Biases in Privacy Policy Question-Answering Systems	Đorđe Klisura et.al.	2506.02998	null
2025-06-03	Towards a Japanese Full-duplex Spoken Dialogue System	Atsumoto Ohashi et.al.	2506.02979	null
2025-06-03	TaxAgent: How Large Language Model Designs Fiscal Policy	Jizhou Wang et.al.	2506.02838	null
2025-06-03	HORUS: A Mixed Reality Interface for Managing Teams of Mobile Robots	Omotoye Shamsudeen Adekoya et.al.	2506.02622	null
2025-06-03	On the Language and Gender Biases in PSTN, VoIP and Neural Audio Codecs	Kemal Altwlkany et.al.	2506.02545	null
2025-06-03	Gender Inequality in English Textbooks Around the World: an NLP Approach	Tairan Liu et.al.	2506.02425	null
2025-06-03	Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology	Wenhao Tang et.al.	2506.02408	link
2025-06-02	ImpRAG: Retrieval-Augmented Generation with Implicit Queries	Wenzheng Zhang et.al.	2506.02279	null
2025-06-02	Tunable magnons in a dual-gated 2D antiferromagnet	Nele Stetzuhn et.al.	2506.02185	null
2025-05-30	Predicting the Past: Estimating Historical Appraisals with OCR and Machine Learning	Mihir Bhaskar et.al.	2505.24676	link
2025-05-30	Thermodynamic Signatures of Gaussian Entanglement Beyond Entropy	Beatriz Polo et.al.	2505.24596	null
2025-05-30	50 years of spin glass theory	David Sherrington et.al.	2505.24432	null
2025-05-30	A Unified Scale Factor for the Cosmic Evolution -Motivated by Brane World Models-	Farzin Safarzadeh-Maleki et.al.	2505.24420	null
2025-05-30	Verifiable Weighted Secret Sharing	Kareem Shehata et.al.	2505.24289	null
2025-05-30	Evolution of Gas Velocity Dispersion in Discs from $z\sim8$ to $z\sim0.5$	E. Wisnioski et.al.	2505.24129	null
2025-05-30	CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs	Ai Jian et.al.	2505.24120	null
2025-05-29	Estimation of Gender Wage Gap in the University of North Carolina System	Zihan Zhang et.al.	2505.24078	null
2025-05-29	Can Emotion Fool Anti-spoofing?	Aurosweta Mahapatra et.al.	2505.23962	null
2025-05-29	Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts	Xuweiyi Chen et.al.	2505.23926	null
2025-05-29	ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks	Akashah Shabbir et.al.	2505.23752	link
2025-05-29	Let’s Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM’s Math Capability	Ruida Wang et.al.	2505.23703	null
2025-05-29	Errors in Stereo Geometry Induce Distance Misperception	Raffles Xingqi Zhu et.al.	2505.23685	null
2025-05-29	Dual-Task Graph Neural Network for Joint Seizure Onset Zone Localization and Outcome Prediction using Stereo EEG	Syeda Abeera Amir et.al.	2505.23669	null
2025-05-29	PAN-Crafter: Learning Modality-Consistent Alignment for PAN-Sharpening	Jeonghyeok Do et.al.	2505.23367	null
2025-05-29	Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data	Lingkai Kong et.al.	2505.23062	null
2025-05-29	Diverse Prototypical Ensembles Improve Robustness to Subpopulation Shift	Minh Nguyen Nhat To et.al.	2505.23027	link
2025-05-28	Talent or Luck? Evaluating Attribution Bias in Large Language Models	Chahat Raj et.al.	2505.22910	link
2025-05-28	Permissioned LLMs: Enforcing Access Control in Large Language Models	Bargav Jayaraman et.al.	2505.22860	null
2025-05-28	Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese	Hanjia Lyu et.al.	2505.22645	link
2025-05-28	Overpartitions and Kaur, Rana, and Eyyunni’s mex sequences	Brian Hopkins et.al.	2505.22588	null
2025-05-28	Beyond Leaders and Laggards: A Typology of Renewable Energy Adoption Trajectories with Evidence from Off-Grid Communities	Roni Blushtein-Livnon et.al.	2505.22456	null
2025-05-28	MObyGaze: a film dataset of multimodal objectification densely annotated by experts	Julie Tores et.al.	2505.22084	null
2025-05-28	D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples	Zijing Hu et.al.	2505.22002	null
2025-05-27	From prosthetic memory to prosthetic denial: Auditing whether large language models are prone to mass atrocity denialism	Roberto Ulloa et.al.	2505.21753	null
2025-05-27	MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs	Raoyuan Zhao et.al.	2505.21693	link
2025-05-27	Data and Technology for Equitable Public Administration: Understanding City Government Employees’ Challenges and Needs	Angie Zhang et.al.	2505.21682	null
2025-05-27	ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models	Dingming Li et.al.	2505.21500	null
2025-05-27	Subgroups Matter for Robust Bias Mitigation	Anissa Alloula et.al.	2505.21363	link
2025-05-27	The Multilingual Divide and Its Impact on Global AI Safety	Aidan Peppin et.al.	2505.21344	null
2025-05-27	Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models	Zhaoqing Li et.al.	2505.21237	null
2025-05-27	Interpreting Social Bias in LVLMs via Information Flow Analysis and Multi-Round Dialogue Evaluation	Zhengyang Ji et.al.	2505.21106	null
2025-05-27	On VLMs for Diverse Tasks in Multimodal Meme Classification	Deepesh Gavit et.al.	2505.20937	null
2025-05-28	Stereo Radargrammetry Using Deep Learning from Airborne SAR Images	Tatsuya Sasayama et.al.	2505.20876	null
2025-05-27	Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties	Jiyoung Lee et.al.	2505.20875	null
2025-05-27	Aggregation Buffer: Revisiting DropEdge with a New Parameter Block	Dooho Lee et.al.	2505.20840	null
2025-05-27	TrustSkin: A Fairness Pipeline for Trustworthy Facial Affect Analysis Across Skin Tone	Ana M. Cabanas et.al.	2505.20637	null
2025-05-26	Spurious Privacy Leakage in Neural Networks	Chenxiang Zhang et.al.	2505.20095	null
2025-05-26	Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud	Natsuki Takama et.al.	2505.19854	null
2025-05-26	Deep learning based spatial aliasing reduction in beamforming for audio capture	Mateusz Guzik et.al.	2505.19781	null
2025-05-26	SACM: SEEG-Audio Contrastive Matching for Chinese Speech Decoding	Hongbin Wang et.al.	2505.19652	link
2025-05-26	Evaluating Robustness of Large Audio Language Models to Audio Injection: An Empirical Study	Guanyu Hou et.al.	2505.19598	null
2025-05-26	VTBench: Comprehensive Benchmark Suite Towards Real-World Virtual Try-on Models	Hu Xiaobin et.al.	2505.19571	link
2025-05-26	AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and Healthcare	Ying Xiao et.al.	2505.19562	link
2025-05-26	SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams	Zhuoheng Gao et.al.	2505.19487	null
2025-05-25	Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding	Shiyue Wang et.al.	2505.19219	null
2025-05-25	MMATH: A Multilingual Benchmark for Mathematical Reasoning	Wenyang Luo et.al.	2505.19126	link
2025-05-23	Frankentext: Stitching random text fragments into long-form narratives	Chau Minh Pham et.al.	2505.18128	link
2025-05-23	A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency	Xiaobao Wei et.al.	2505.18024	null
2025-05-23	Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras	Masataka Kobayashi et.al.	2505.17582	null
2025-05-23	H2:Towards Efficient Large-Scale LLM Training on Hyper-Heterogeneous Cluster over 1,000 Chips	Ding Tang et.al.	2505.17548	null
2025-05-23	Learning Representational Disparities	Pavan Ravishankar et.al.	2505.17533	null
2025-05-23	Transparency and Proportionality in Post-Processing Algorithmic Bias Correction	Juliett Suárez Ferreira et.al.	2505.17525	null
2025-05-23	FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow	Haoyu Sun et.al.	2505.17399	link
2025-05-23	Pulse duration dependence of material response in ultrafast laser-induced surface-penetrating nanovoids in fused silica	Guodong Zhang et.al.	2505.17385	null
2025-05-22	Mitigate One, Skew Another? Tackling Intersectional Biases in Text-to-Image Models	Pushkar Shukla et.al.	2505.17280	null
2025-05-22	A Framework for Multi-View Multiple Object Tracking using Single-View Multi-Object Trackers on Fish Data	Chaim Chai Elchik et.al.	2505.17201	null
2025-05-22	NY Real Estate Racial Equity Analysis via Applied Machine Learning	Sanjana Chalavadi et.al.	2505.16946	null
2025-05-22	Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining	Shangquan Sun et.al.	2505.16811	null
2025-05-22	Optimising the decision threshold in a weighted voting system: The case of the IMF’s Board of Governors	Dóra Gréta Petróczy et.al.	2505.16654	null
2025-05-22	M2SVid: End-to-End Inpainting and Refinement for Monocular-to-Stereo Video Conversion	Nina Shvetsova et.al.	2505.16565	null
2025-05-22	Utilizing citation index and synthetic quality measure to compare Wikipedia languages across various topics	Włodzimierz Lewoniewski et.al.	2505.16506	null
2025-05-22	KoBALT: Korean Benchmark For Advanced Linguistic Tasks	Hyopil Shin et.al.	2505.16125	null
2025-05-22	Continually Self-Improving Language Models for Bariatric Surgery Question–Answering	Yash Kumar Atri et.al.	2505.16102	null
2025-05-21	In Silico Trials for Sex-Specific patient Inclusion Criteria in Cardiac Resynchronization Therapy: Advancing Precision in Heart Failure Treatment	Shuang Qian et.al.	2505.15708	null
2025-05-21	Kernel PCA for Out-of-Distribution Detection: Non-Linear Kernel Selections and Approximations	Kun Fang et.al.	2505.15284	link
2025-05-20	DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis	Prashanth Vijayaraghavan et.al.	2505.14971	null
2025-05-20	The Great Comets of 1843 and 1882 at Their Previous Return to Perihelion in the Twelfth Century: One Spectacular, the Other Dull	Zdenek Sekanina et.al.	2505.14662	null
2025-05-20	Early Diagnosis of Atrial Fibrillation Recurrence: A Large Tabular Model Approach with Structured and Unstructured Clinical Data	Ane G. Domingo-Aldama et.al.	2505.14643	null
2025-05-21	Mitigating Subgroup Disparities in Multi-Label Speech Emotion Recognition: A Pseudo-Labeling and Unsupervised Learning Approach	Yi-Cheng Lin et.al.	2505.14449	null
2025-05-20	MindVote: How LLMs Predict Human Decision-Making in Social Media Polls	Xutao Mao et.al.	2505.14422	null
2025-05-20	Diving into the Fusion of Monocular Priors for Generalized Stereo Matching	Chengtang Yao et.al.	2505.14414	link
2025-05-20	Accuracy and Fairness of Facial Recognition Technology in Low-Quality Police Images: An Experiment With Synthetic Faces	Maria Cuellar et.al.	2505.14320	null
2025-05-20	Breaking Language Barriers or Reinforcing Bias? A Study of Gender and Racial Disparities in Multilingual Contrastive Vision Language Models	Zahraa Al Sahili et.al.	2505.14160	null
2025-05-20	M3Depth: Wavelet-Enhanced Depth Estimation on Mars via Mutual Boosting of Dual-Modal Data	Junjie Li et.al.	2505.14159	null
2025-05-20	Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts	Xi Chen et.al.	2505.14088	null
2025-05-20	AppleGrowthVision: A large-scale stereo dataset for phenological analysis, fruit detection, and 3D reconstruction in apple orchards	Laura-Sophia von Hirschhausen et.al.	2505.14029	null
2025-05-19	The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation	David Stap et.al.	2505.13090	null
2025-05-19	Unifying concepts in information-theoretic time-series analysis	Annie G. Bryant et.al.	2505.13080	null
2025-05-20	3D Visual Illusion Depth Estimation	Chengtang Yao et.al.	2505.13061	link
2025-05-19	Multi-Level Aware Preference Learning: Enhancing RLHF for Complex Multi-Instruction Tasks	Ruopei Sun et.al.	2505.12845	null
2025-05-19	On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding	Haoyuan Wu et.al.	2505.12723	null
2025-05-19	IA-MVS: Instance-Focused Adaptive Depth Sampling for Multi-View Stereo	Yinzhe Wang et.al.	2505.12714	null
2025-05-19	Rethinking Predictive Modeling for LLM Routing: When Simple kNN Beats Complex Learned Routers	Yang Li et.al.	2505.12601	null
2025-05-18	On long-duration storage, weather uncertainty and limited foresight	Felix Schmidt et.al.	2505.12538	link
2025-05-18	Depth Transfer: Learning to See Like a Simulator for Real-World Drone Navigation	Hang Yu et.al.	2505.12428	null
2025-05-18	Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents	Shuo Han et.al.	2505.12204	null
2025-05-16	SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision	Utsav Rai et.al.	2505.11439	null
2025-05-16	MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection	Shrutarv Awasthi et.al.	2505.11282	link
2025-05-16	Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization	Yanhao Jia et.al.	2505.11217	null
2025-05-16	A Cautionary Tale on Integrating Studies with Disparate Outcome Measures for Causal Inference	Harsh Parikh et.al.	2505.11014	null
2025-05-16	Patient-Specific Dynamic Digital-Physical Twin for Coronary Intervention Training: An Integrated Mixed Reality Approach	Shuo Wang et.al.	2505.10902	null
2025-05-16	From Embeddings to Accuracy: Comparing Foundation Models for Radiographic Classification	Xue Li et.al.	2505.10823	null
2025-05-15	TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation	Manthan Patel et.al.	2505.10696	null
2025-05-15	Artificial Intelligence Bias on English Language Learners in Automatic Scoring	Shuchen Guo et.al.	2505.10643	null
2025-05-15	Multi-contrast laser endoscopy for in vivo gastrointestinal imaging	Taylor L. Bobrow et.al.	2505.10492	null
2025-05-15	ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention	Jintian Shao et.al.	2505.10222	null
2025-05-15	VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality	Xuechang Tu et.al.	2505.10144	link
2025-05-15	Large-Scale Gaussian Splatting SLAM	Zhe Xin et.al.	2505.09915	null
2025-05-14	ZENN: A Thermodynamics-Inspired Computational Framework for Heterogeneous Data-Driven Modeling	Shun Wang et.al.	2505.09851	null
2025-05-14	Should I Stay or Should I Go Now? An Investigation into Gender Differences in the Impact of Switching Jobs on Earnings	Emily Winskill et.al.	2505.09791	null
2025-05-14	Enabling Group Fairness in Graph Unlearning via Bi-level Debiasing	Yezi Liu et.al.	2505.09702	null
2025-05-14	Fairness-aware Bayes optimal functional classification	Xiaoyu Hu et.al.	2505.09471	null
2025-05-14	RobustSpring: Benchmarking Robustness to Image Corruptions for Optical Flow, Scene Flow and Stereo	Jenny Schmalfuss et.al.	2505.09368	null
2025-05-14	Toward Fair Federated Learning under Demographic Disparities and Data Imbalance	Qiming Wu et.al.	2505.09295	link
2025-05-14	Signatures of asymmetry: Gravitational wave memory and the parity violation	Indranil Chakraborty et.al.	2505.09096	null
2025-05-13	Ages and metallicities of quiescent galaxies: confronting broadband ( $UVJ$ ) colours with stellar absorption lines	Chloe M. Cheng et.al.	2505.08858	null
2025-05-13	Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World	Yuran Wang et.al.	2505.08607	null
2025-05-13	BizChat: Scaffolding AI-Powered Business Planning for Small Business Owners Across Digital Skill Levels	Quentin Romero Lauro et.al.	2505.08493	null
2025-05-13	A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering	Chuanzhi Xu et.al.	2505.08438	null
2025-05-13	Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion	Anle Ke et.al.	2505.08281	link
2025-05-13	Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images	Ziteng Liu et.al.	2505.08178	null
2025-05-14	Fast Text-to-Audio Generation with Adversarial Post-Training	Zachary Novack et.al.	2505.08175	link
2025-05-13	MoKD: Multi-Task Optimization for Knowledge Distillation	Zeeshan Hayder et.al.	2505.08170	null
2025-05-12	Unequal Journeys to Food Markets: Continental-Scale Evidence from Open Data in Africa	Robert Benassai-Dalmau et.al.	2505.07913	link
2025-05-12	Disparity in sound speeds: implications for unitarity and effective potential in quantum field theory	Dmitry S. Ageev et.al.	2505.07794	null
2025-05-12	Higher-Order Convolution Improves Neural Predictivity in the Retina	Simone Azeglio et.al.	2505.07620	null
2025-05-11	Empirical Analysis of Asynchronous Federated Learning on Heterogeneous Devices: Efficiency, Fairness, and Privacy Trade-offs	Samaneh Mohammadi et.al.	2505.07041	null
2025-05-11	Enhancing Monocular Height Estimation via Sparse LiDAR-Guided Correction	Jian Song et.al.	2505.06905	null
2025-05-11	ContribChain: A Stress-Balanced Blockchain Sharding Protocol with Node Contribution Awareness	Xinpeng Huang et.al.	2505.06899	null
2025-05-11	Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies	Zhengmi Tang et.al.	2505.06855	null
2025-05-11	Feedback-enhanced distant entanglement of magnon and phonon modes with atomic ensembles in coupled cavities	Muhammad Awais Altaf et.al.	2505.06838	null
2025-05-10	Behind the Byline: A Large-Scale Study of Scientific Author Contributions	Itai Assraf et.al.	2505.06721	null
2025-05-09	Adaptive Wiping: Adaptive contact-rich manipulation through few-shot imitation learning with Force-Torque feedback and pre-trained object representations	Chikaha Tsuji et.al.	2505.06451	null
2025-05-09	2D Quon Language: Unifying Framework for Cliffords, Matchgates, and Beyond	Byungmin Kang et.al.	2505.06336	null
2025-05-09	Who’s at Risk? Effects of Inflation on Unemployment Risk	Hie Joo Ahn et.al.	2505.05757	null
2025-05-08	Trends and Gender Disparities in Grades and Grade Penalties Among Bioscience and Health-Related Major Students Before, During, and After COVID-19 Remote Instruction	Alysa Malespina et.al.	2505.05667	null
2025-05-07	StereoINR: Cross-View Geometry Consistent Stereo Super Resolution with Implicit Neural Representation	Yi Liu et.al.	2505.05509	null
2025-05-08	Facets of Disparate Impact: Evaluating Legally Consistent Bias in Machine Learning	Jarren Briscoe et.al.	2505.05471	link
2025-05-08	Synthesis of innovation and obsolescence	Edward D. Lee et.al.	2505.05182	null
2025-05-08	DispBench: Benchmarking Disparity Estimation to Synthetic Corruptions	Shashank Agnihotri et.al.	2505.05091	link
2025-05-08	Learning Item Representations Directly from Multimodal Features for Effective Recommendation	Xin Zhou et.al.	2505.04960	link
2025-05-08	Enhancing Blockchain Cross Chain Interoperability: A Comprehensive Survey	Zhihong Deng et.al.	2505.04934	null
2025-05-08	Advanced 3D Imaging Approach to TSV/TGV Metrology and Inspection Using Only Optical Microscopy	Gugeong Sung et.al.	2505.04913	null
2025-05-06	Algorithmic Accountability in Small Data: Sample-Size-Induced Bias Within Classification Metrics	Jarren Briscoe et.al.	2505.03992	link
2025-05-06	Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach	Srecharan Selvam et.al.	2505.03702	null
2025-05-06	Blending 3D Geometry and Machine Learning for Multi-View Stereopsis	Vibhas Vats et.al.	2505.03470	link
2025-05-06	Domain Adversarial Training for Mitigating Gender Bias in Speech-based Mental Health Detection	June-Woo Kim et.al.	2505.03359	null
2025-05-06	The Impact of Large Language Models on K-12 Education in Rural India: A Thematic Analysis of Student Volunteer’s Perspectives	Harshita Goyal et.al.	2505.03163	null
2025-05-06	Towards Application-Specific Evaluation of Vision Models: Case Studies in Ecology and Biology	Alex Hoi Hang Chan et.al.	2505.02825	null
2025-05-05	Exceptional, but Separate: Precursors to Spontaneous Symmetry Breaking	Lewis Hill et.al.	2505.02691	null
2025-05-05	VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection	Hao Cheng et.al.	2505.02331	link
2025-05-04	SparSplat: Fast Multi-View Reconstruction with Generalizable 2D Gaussian Splatting	Shubhendu Jena et.al.	2505.02175	null
2025-05-04	Representation Learning of Limit Order Book: A Comprehensive Study and Benchmarking	Muyao Zhong et.al.	2505.02139	null
2025-05-04	Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents	Christian Schroeder de Witt et.al.	2505.02077	null
2025-05-03	Mitigating Group-Level Fairness Disparities in Federated Visual Language Models	Chaomeng Chen et.al.	2505.01851	null
2025-05-03	AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting	Junhao Shi et.al.	2505.01799	null
2025-05-03	T-REX: Vision-Based System for Autonomous Leaf Detection and Grasp Estimation	Srecharan Selvam et.al.	2505.01654	null
2025-05-02	Toward a Unified Theory of Catalysis	Frank Nelson Crespilho et.al.	2505.01213	null
2025-05-02	Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods	Mahdi Dhaini et.al.	2505.01198	link
2025-05-02	Enhancing MHD model accuracy and CME forecasting by constraining coronal plasma properties with Faraday rotation	Salvatore Mancuso et.al.	2505.01080	null
2025-05-02	Destructive Interference: Encoding Loss in the Overlap	Nik Aberle et.al.	2505.00987	null
2025-05-01	Quantum Modular Forms and Resurgence	Eleanor McSpirit et.al.	2505.00799	null
2025-05-01	HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection	Deanna Emery et.al.	2505.00506	null
2025-04-30	Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis	Michal Geyer et.al.	2505.00135	null
2025-04-30	Stereo X-ray tomography on deformed object tracking	Zhenduo Shang et.al.	2505.00122	null
2025-04-30	An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation	Yaming Ou et.al.	2504.21826	null
2025-04-30	Assessing Racial Disparities in Healthcare Expenditures Using Causal Path-Specific Effects	Xiaxian Ou et.al.	2504.21688	link
2025-04-30	Lights Out, Stress In: Assessing Stress Amidst Power and Energy Challenges in Bangladesh	Faisal Quaiyyum et.al.	2504.21541	null
2025-04-30	DGFNet: End-to-End Audio-Visual Source Separation Based on Dynamic Gating Fusion	Yinfeng Yu et.al.	2504.21366	null
2025-04-30	CMD: Constraining Multimodal Distribution for Domain Adaptation in Stereo Matching	Zhelun Shen et.al.	2504.21302	null
2025-04-30	LSTM+Geo with xgBoost Filtering: A Novel Approach for Race and Ethnicity Imputation with Reduced Bias	S. Chalavadi et.al.	2504.21259	null
2025-04-29	OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification	Shangyu Li et.al.	2504.20964	link
2025-04-29	Imaging on the Edge: Mapping Object Corners and Edges with Stereo X-ray Tomography	Zhenduo Shang et.al.	2504.20892	null
2025-04-29	Partitioned Memory Storage Inspired Few-Shot Class-Incremental learning	Renye Zhang et.al.	2504.20797	null
2025-04-29	The Anyonic Quantum Carnot Engine	H S Mani et.al.	2504.20596	null
2025-04-29	Mordell–Lang and disparate Selmer ranks of odd twists of some superelliptic curves over global function fields	Sun Woo Park et.al.	2504.20594	null
2025-04-29	Hetu v2: A General and Scalable Deep Learning System with Hierarchical and Heterogeneous Single Program Multiple Data Annotations	Haoyang Li et.al.	2504.20490	link
2025-04-29	The two-clock problem in population dynamics	Kaan Öcal et.al.	2504.20388	null
2025-04-29	Neural Stereo Video Compression with Hybrid Disparity Compensation	Shiyin Jiang et.al.	2504.20383	null
2025-04-29	Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views	Jiang Wu et.al.	2504.20378	link
2025-04-28	$\texttt{SAGE}$ : A Generic Framework for LLM Safety Evaluation	Madhur Jindal et.al.	2504.19674	link
2025-04-27	Mitigating Bias in Facial Recognition Systems: Centroid Fairness Loss Optimization	Jean-Rémy Conti et.al.	2504.19370	null
2025-04-27	Unscented Particle Filter for Visual-inertial Navigation using IMU and Landmark Measurements	Khashayar Ghanizadegan et.al.	2504.19318	null
2025-04-27	OPAL: Visibility-aware LiDAR-to-OpenStreetMap Place Recognition via Adaptive Radial Fusion	Shuhao Kang et.al.	2504.19258	null
2025-04-26	Minimum Cost Nowhere-zero Flows and Cut-balanced Orientations	Karthekeyan Chandrasekaran et.al.	2504.18767	null
2025-04-25	Fairness Is More Than Algorithms: Racial Disparities in Time-to-Recidivism	Jessy Xinyi Han et.al.	2504.18629	null
2025-04-25	Are We on the Same Page? Examining Developer Perception Alignment in Open Source Code Reviews	Yoseph Berhanu Alebachew et.al.	2504.18407	null
2025-04-25	Study on Real-Time Road Surface Reconstruction Using Stereo Vision	Deepak Ghimire et.al.	2504.18112	null
2025-04-29	Factorization Formula Connecting the Shape Functions of Heavy Meson in QCD and Heavy Quark Effective Theory	Wei Wang et.al.	2504.18018	null
2025-04-24	LLM Agent Swarm for Hypothesis-Driven Drug Discovery	Kevin Song et.al.	2504.17967	null
2025-04-24	Set Phasers to Stun: Beaming Power and Control to Mobile Robots with Laser Light	Charles J. Carver et.al.	2504.17865	null
2025-04-24	The Fourth Monocular Depth Estimation Challenge	Anton Obukhov et.al.	2504.17787	null
2025-04-24	Spectral Irradiance Variability in Lyman-Alpha Emission During Solar Flares	Luke Majury et.al.	2504.17667	null
2025-04-24	Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization	Guangyang Zeng et.al.	2504.17410	null
2025-04-24	StereoMamba: Real-time and Robust Intraoperative Stereo Disparity Estimation via Long-range Spatial Dependencies	Xu Wang et.al.	2504.17401	null
2025-04-24	Evaluating and Mitigating Bias in AI-Based Medical Text Generation	Xiuying Chen et.al.	2504.17279	null
2025-04-23	Structural roles and gender disparities in corruption networks	Arthur A. B. Pessa et.al.	2504.17086	null
2025-04-23	Procedural Dataset Generation for Zero-Shot Stereo Matching	David Yan et.al.	2504.16930	null
2025-04-23	An Accelerated Camera 3DMA Framework for Efficient Urban GNSS Multipath Estimation	Shiyao Lv et.al.	2504.16906	null
2025-04-23	A model of the heliocentric dust ring on Venus orbit	Ariane Courtot et.al.	2504.16610	null
2025-04-23	Tinkering Against Scaling	Bolun Zhang et.al.	2504.16546	null
2025-04-22	Long-term disparities in the recovery of urban mobility after COVID-19 in Latin America	Carmen Cabrera et.al.	2504.15871	null
2025-04-22	DERD-Net: Learning Depth from Event-based Ray Densities	Diego de Oliveira Hitzges et.al.	2504.15863	null
2025-04-22	Trustworthy Decentralized Autonomous Machines: A New Paradigm in Automation Economy	Fernando Castillo et.al.	2504.15676	null
2025-04-22	Multimodal Perception for Goal-oriented Navigation: A Survey	I-Tak Ieong et.al.	2504.15643	null
2025-04-22	Yet Another Diminishing Spark: Low-level Cyberattacks in the Israel-Gaza Conflict	Anh V. Vu et.al.	2504.15592	null
2025-04-22	The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks	Minghao Wu et.al.	2504.15521	null
2025-04-21	Real-Time Sentiment Insights from X Using VADER, DistilBERT, and Web-Scraped Data	Yanampally Abhiram Reddy et.al.	2504.15448	null
2025-04-21	MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video	Minh-Quan Viet Bui et.al.	2504.15122	null
2025-04-21	Robust and Real-time Surface Normal Estimation from Stereo Disparities using Affine Transformations	Csongor Csanad Kariko et.al.	2504.15121	null
2025-04-21	Sum-Rate Maximization for NOMA-Assisted Pinching-Antenna Systems	Ziwu Zhou et.al.	2504.15006	null
2025-04-21	Reliable Multi-Modal Object Re-Identification via Modality-Aware Graph Reasoning	Xixi Wan et.al.	2504.14847	null
2025-04-21	Aligning Beam with Imbalanced Multi-modality: A Generative Federated Learning Approach	Jiahui Liang et.al.	2504.14835	null
2025-04-20	Polynomial-Time Constant-Approximation for Fair Sum-of-Radii Clustering	Sina Bagheri Nezhad et.al.	2504.14683	null
2025-04-20	Regret-aware Re-ranking for Guaranteeing Two-sided Fairness and Accuracy in Recommender Systems	Xiaopeng Ye et.al.	2504.14550	null
2025-04-20	Anisotropic quark propagation and Zeeman effect in an external magnetic field	Minghui Ding et.al.	2504.14504	null
2025-04-20	sEEG-based Encoding for Sentence Retrieval: A Contrastive Learning Approach to Brain-Language Alignment	Yijun Liu et.al.	2504.14468	null
2025-04-19	Balancing Fairness and Performance in Healthcare AI: A Gradient Reconciliation Approach	Xiaoyang Wang et.al.	2504.14388	null
2025-04-18	Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion	Sandipan Dhar et.al.	2504.13791	null
2025-04-18	Predictors of Childhood Vaccination Uptake in England: An Explainable Machine Learning Analysis of Longitudinal Regional Data (2021-2024)	Amin Noroozi et.al.	2504.13755	null
2025-04-18	Divergent LLM Adoption and Heterogeneous Convergence Paths in Research Writing	Cong William Lin et.al.	2504.13629	null
2025-04-18	Open-Loop and Closed-Loop Strategies for Linear Quadratic Mean Field Games: The Direct Approach	Yong Liang et.al.	2504.13496	null
2025-04-17	Addressing the Minor-Embedding Problem in Quantum Annealing and Evaluating State-of-the-Art Algorithm Performance	Aitor Gómez-Tejedor et.al.	2504.13376	null
2025-04-17	Generalized Parton Distributions from Symbolic Regression	Anusha Reddy Singireddy et.al.	2504.13289	null
2025-04-17	Prospects for Detecting Signs of Life on Exoplanets in the JWST Era	Sara Seager et.al.	2504.12946	null
2025-04-17	Quantifying walkable accessibility to urban services: An application to Florence, Italy	Leonardo Boncinelli et.al.	2504.12934	null
2025-04-17	Unsupervised Cross-Domain 3D Human Pose Estimation via Pseudo-Label-Guided Global Transforms	Jingjing Liu et.al.	2504.12699	null
2025-04-16	Reinforcement Learning from Human Feedback	Nathan Lambert et.al.	2504.12501	link
2025-04-16	A Survey on Archetypal Analysis	Aleix Alcacer et.al.	2504.12392	null
2025-04-16	Regist3R: Incremental Registration with Stereo Foundation Model	Sidun Liu et.al.	2504.12356	null
2025-04-16	Towards Explainable Fusion and Balanced Learning in Multimodal Sentiment Analysis	Miaosen Luo et.al.	2504.12151	null
2025-04-16	Stochastic Quadrature Rules for Solving PDEs using Neural Networks	Jamie M. Taylor et.al.	2504.11976	link
2025-04-16	Benchmarking Mutual Information-based Loss Functions in Federated Learning	Sarang S et.al.	2504.11877	null
2025-04-16	Boosting Multi-View Stereo with Depth Foundation Model in the Absence of Real-World Labels	Jie Zhu et.al.	2504.11845	null
2025-04-15	Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models	Maria Teleki et.al.	2504.11431	link
2025-04-15	Breaking the TDD Flow for Over-the-Air Phase Synchronization in Distributed Antenna Systems	Khac-Hoang Ngo et.al.	2504.11411	null
2025-04-15	Towards global equity in political polarization research	Max Falkenberg et.al.	2504.11090	null
2025-04-15	Meta-learning For Few-Shot Time Series Crop Type Classification: A Benchmark On The EuroCropsML Dataset	Joana Reuss et.al.	2504.11022	null
2025-04-15	Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy	Botao Zhao et.al.	2504.10819	null
2025-04-14	FuzzSense: Towards A Modular Fuzzing Framework for Autonomous Driving Software	Andrew Roberts et.al.	2504.10717	null
2025-04-14	Emotion Alignment: Discovering the Gap Between Social Media and Real-World Sentiments in Persian Tweets and Images	Sina Elahimanesh et.al.	2504.10662	null
2025-04-14	Who Speaks for Ethics? How Demographics Shape Ethical Advocacy in Software Development	Lauren Olson et.al.	2504.10276	null
2025-04-14	Localized Cultural Knowledge is Conserved and Controllable in Large Language Models	Veniamin Veselovsky et.al.	2504.10191	null
2025-04-14	Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution	Yiwen Wang et.al.	2504.09887	link
2025-04-14	RAKG:Document-level Retrieval Augmented Knowledge Graph Construction	Hairong Zhang et.al.	2504.09823	link
2025-04-13	FastRSR: Efficient and Accurate Road Surface Reconstruction from Bird’s Eye View	Yuting Zhao et.al.	2504.09535	null
2025-04-12	“It’s not a representation of me”: Examining Accent Bias and Digital Exclusion in Synthetic AI Voice Services	Shira Michel et.al.	2504.09346	null
2025-04-12	CrossLink: A Decentralized Framework for Secure Cross-Chain Smart Contract Execution	Tahrim Hossain et.al.	2504.09319	link
2025-04-12	PathVLM-R1: A Reinforcement Learning-Driven Reasoning Model for Pathology Visual-Language Tasks	Jianyu Wu et.al.	2504.09258	null
2025-04-15	FairACE: Achieving Degree Fairness in Graph Neural Networks via Contrastive and Adversarial Group-Balanced Training	Jiaxin Liu et.al.	2504.09210	null
2025-04-12	Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence	Yuxu Lu et.al.	2504.09197	null
2025-04-11	Application of machine learning models to predict the relationship between air pollution, ecosystem degradation, and health disparities and lung cancer in Vietnam	Ngoc Hong Tran et.al.	2504.08651	null
2025-04-11	seeBias: A Comprehensive Tool for Assessing and Visualizing AI Fairness	Yilin Ning et.al.	2504.08418	link
2025-04-10	Adaptive Bounded Exploration and Intermediate Actions for Data Debiasing	Yifan Yang et.al.	2504.08151	link
2025-04-10	Experimental Analysis of Quadcopter Drone Hover Constraints for Localization Improvements	Uthman Olawoye et.al.	2504.07843	null
2025-04-10	FairEval: Evaluating Fairness in LLM-Based Recommendations with Personality Awareness	Chandan Kumar Sah et.al.	2504.07801	null
2025-04-10	MMLA: Multi-Environment, Multi-Species, Low-Altitude Aerial Footage Dataset	Jenna Kline et.al.	2504.07744	null
2025-04-10	Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation	Yanglin Huang et.al.	2504.07691	null
2025-04-10	Tuning chirality amplitude at ultrafast timescales	Hiroki Ueda et.al.	2504.07599	null
2025-04-10	Echoes of Disagreement: Measuring Disparity in Social Consensus	Marios Papachristou et.al.	2504.07480	link
2025-04-10	Continuity conditions weaker than lower semi-continuity	Jacob Westerhout et.al.	2504.07451	null
2025-04-10	ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement	Anning Hu et.al.	2504.07418	null
2025-04-10	FAIR-SIGHT: Fairness Assurance in Image Recognition via Simultaneous Conformal Thresholding and Dynamic Output Repair	Arya Fayyazi et.al.	2504.07395	null
2025-04-09	Universal neural wave functions for high-pressure hydrogen	David Linteau et.al.	2504.07062	null
2025-04-09	Identifying Key Challenges of Hardness-Based Resampling	Pawel Pukowski et.al.	2504.07031	null
2025-04-09	Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting	Daiwei Zhang et.al.	2504.06978	null
2025-04-09	Communicating complex statistical models to a public health audience: translating science into action with the FARSI approach	Mattia Stival et.al.	2504.06787	null
2025-04-09	A Novel Nonlinear Fertility Catastrophe Model Based on Thom’s Differential Equations of Morphogenesis	Rolando Gonzales Martinez et.al.	2504.06668	null
2025-04-08	Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring	José A. Pilartes-Congo et.al.	2504.06464	null
2025-04-08	Computing for Community-Based Economies: A Sociotechnical Ecosystem for Democratic, Egalitarian and Sustainable Futures	Kwame Porter Robinson et.al.	2504.06114	null
2025-04-08	Co-evolution of cooperation and resource allocation in the advantageous environment-based spatial multi-game using adaptive control	Chengbin Sun et.al.	2504.06112	null
2025-04-08	AI analysis of medical images at scale as a health disparities probe: a feasibility demonstration using chest radiographs	Heather M. Whitney et.al.	2504.05990	null
2025-04-08	Uncovering Fairness through Data Complexity as an Early Indicator	Juliett Suárez Ferreira et.al.	2504.05923	null
2025-04-08	Thermodynamic supercriticality and complex phase diagram for the AdS black hole	Zhen-Ming Xu et.al.	2504.05708	null
2025-04-08	Fairness in Machine Learning-based Hand Load Estimation: A Case Study on Load Carriage Tasks	Arafat Rahman et.al.	2504.05610	null
2025-04-07	Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation	Manvi Agarwal et.al.	2504.05364	null
2025-04-07	A BLE and UWB Beacon-Assist Framework for Multiuser Augmented Reality Synchronization Across Multiple Devices in Shared Environments	Maitree Hirunteeyakul et.al.	2504.05293	null
2025-04-07	CARE: Aligning Language Models for Regional Cultural Awareness	Geyang Guo et.al.	2504.05154	link
2025-04-07	Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and Semidensification	Yasuhiro Yao et.al.	2504.05148	link
2025-04-07	M-Prometheus: A Suite of Open Multilingual LLM Judges	José Pombal et.al.	2504.04953	link
2025-04-07	CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images	Cheng Chen et.al.	2504.04753	null
2025-04-06	eKalibr-Stereo: Continuous-Time Spatiotemporal Calibration for Event-Based Stereo Visual Systems	Shuolong Chen et.al.	2504.04451	link
2025-04-05	Exploration of Approaches for Robustness and Safety in a Low Code Open Environment for Factory Automation	Gustavo Quiros A. et.al.	2504.04224	null
2025-04-05	Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources	Zihao Li et.al.	2504.04152	null
2025-04-05	The Labor Market Incidence of New Technologies	Tianyu Fan et.al.	2504.04047	null
2025-04-05	Disparate Privacy Vulnerability: Targeted Attribute Inference Attacks and Defenses	Ehsanul Kabir et.al.	2504.04033	null
2025-04-04	SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding	Yimin Wei et.al.	2504.03254	link
2025-04-03	Bias in Large Language Models Across Clinical Applications: A Systematic Review	Thanathip Suenghataiphorn et.al.	2504.02917	null
2025-04-03	Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets	Chuning Zhu et.al.	2504.02792	null
2025-04-03	The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context	Nikhil Verma et.al.	2504.02708	null
2025-04-02	Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks	Ali Al-Kaswan et.al.	2504.01850	null
2025-04-02	SOLAQUA: SINTEF Ocean Large Aquaculture Robotics Dataset	Sveinung Johan Ohrem et.al.	2504.01790	null
2025-04-02	DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image	Jijun Xiang et.al.	2504.01596	link
2025-04-02	Hyperbolic Diffusion Recommender Model	Meng Yuan et.al.	2504.01541	null
2025-04-02	ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue	Thomas Pritchard et.al.	2504.01261	link
2025-04-01	Feature-Preserving Mesh Decimation for Normal Integration	Moritz Heep et.al.	2504.00867	null
2025-04-01	Bridging the Gap: Integrating Ethics and Environmental Sustainability in AI Research and Practice	Alexandra Sasha Luccioni et.al.	2504.00797	null
2025-04-01	Alleviating Performance Disparity in Adversarial Spatiotemporal Graph Learning Under Zero-Inflated Distribution	Songran Bai et.al.	2504.00721	null
2025-04-01	ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection	Xiaoxuan Zhu et.al.	2504.00695	link
2025-04-01	Using complex prompts to identify fine-grained biases in image generation through ChatGPT-4o	Marinus Ferreira et.al.	2504.00388	null
2025-03-31	Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views	Chong Bao et.al.	2503.24382	null
2025-03-31	BAR-Analytics: A Web-based Platform for Analyzing Information Spreading Barriers in News: Comparative Analysis Across Multiple Barriers and Events	Abdul Sittar et.al.	2503.24220	null
2025-03-31	Ride-Sourcing Vehicle Rebalancing with Service Accessibility Guarantees via Constrained Mean-Field Reinforcement Learning	Matej Jusup et.al.	2503.24183	link
2025-03-31	Is LLM the Silver Bullet to Low-Resource Languages Machine Translation?	Yewei Song et.al.	2503.24102	null
2025-03-31	Level the Level: Balancing Game Levels for Asymmetric Player Archetypes With Reinforcement Learning	Florian Rupp et.al.	2503.24099	link
2025-03-31	Multispacecraft Observations of the 2024 September 9 Backside Solar Eruption that Resulted in a Sustained Gamma Ray Emission Event	Nat Gopalswamy et.al.	2503.23852	null
2025-03-31	A PINN Methodology for Temperature Field Reconstruction in the PIV Measurement Plane: Case of Rayleigh-Bénard Convection	Marie-Christine Volk et.al.	2503.23801	null
2025-03-31	Consistency-aware Self-Training for Iterative-based Stereo Matching	Jingyi Zhou et.al.	2503.23747	null
2025-03-31	Detail-aware multi-view stereo network for depth estimation	Haitao Tian et.al.	2503.23684	null
2025-03-30	Third Harmonic Structure in an Interplanetary Type II Radio Burst and Other Energetic Phenomena During the 2024 September 14 Solar Eruption	Nat Gopalswamy et.al.	2503.23584	null
2025-03-28	Benchmarking Ultra-Low-Power $μ$ NPUs	Josh Millar et.al.	2503.22567	null
2025-03-28	A Causal Framework to Measure and Mitigate Non-binary Treatment Discrimination	Ayan Majumdar et.al.	2503.22454	link
2025-03-28	Scaling Laws of Scientific Discovery with AI and Robot Scientists	Pengsong Zhang et.al.	2503.22444	null
2025-03-28	MVSAnywhere: Zero-Shot Multi-View Stereo	Sergio Izquierdo et.al.	2503.22430	null
2025-03-28	Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion	Songsong Yu et.al.	2503.22262	null
2025-03-28	An Advanced Ensemble Deep Learning Framework for Stock Price Prediction Using VAE, Transformer, and LSTM Model	Anindya Sarkar et.al.	2503.22192	null
2025-03-28	Reflection on Code Contributor Demographics and Collaboration Patterns in the Rust Communit	Rohit Dandamudi et.al.	2503.22066	null
2025-03-28	Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges	Ukcheol Shin et.al.	2503.22060	link
2025-03-27	Improved Tomographic Reconstruction of 3D Global Coronal Density from STEREO/COR1 Observations	Tongjiang Wang et.al.	2503.22041	null
2025-03-27	The commutativity problem for effective varieties of formal series, and applications	Lorenzo Clemente et.al.	2503.21697	null
2025-03-27	Exploring the Energy Landscape of RBMs: Reciprocal Space Insights into Bosons, Hierarchical Learning and Symmetry Breaking	J. Quetzalcóatl Toledo-Marin et.al.	2503.21536	null
2025-03-27	ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo	Yuxi Hu et.al.	2503.21525	null
2025-03-27	Behavioral response to mobile phone evacuation alerts	Erick Elejalde et.al.	2503.21497	null
2025-03-27	GPU-Accelerated Charge-Equilibration for Shadow Molecular Dynamics in Python	Mehmet Cagri Kaymak et.al.	2503.21176	link
2025-03-26	Can Large Language Models Predict Associations Among Human Attitudes?	Ana Ma et.al.	2503.21011	null
2025-03-26	CH $_3$ OH as a User-Friendly Density Probe: Calibration and Beyond	A. Giannetti et.al.	2503.20944	null
2025-03-26	SaViD: Spectravista Aesthetic Vision Integration for Robust and Discerning 3D Object Detection in Challenging Environments	Tanmoy Dam et.al.	2503.20614	link
2025-03-26	Emergent properties and the multiscale characterization challenge in condensed matter, from crystals to complex materials: a Review	Elisabetta Nocerino et.al.	2503.20266	null
2025-03-26	Attention IoU: Examining Biases in CelebA using Attention Maps	Aaron Serianni et.al.	2503.19846	link
2025-03-26	A Survey on Event-driven 3D Reconstruction: Development under Different Categories	Chuanzhi Xu et.al.	2503.19753	null
2025-03-25	Fairness in Proof of Team Sprint (PoTS): Evaluating Reward Distribution Across Performance Levels	Naoki Yonezawa et.al.	2503.19301	null
2025-03-25	ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency	Yang Ren et.al.	2503.19283	link
2025-03-24	Information-Seeking Decision Strategies Mitigate Risk in Dynamic, Uncertain Environments	Nicholas W. Barendregt et.al.	2503.19107	link
2025-03-25	Learning to segment anatomy and lesions from disparately labeled sources in brain MRI	Meva Himmetoglu et.al.	2503.18840	null
2025-03-24	LeanStereo: A Leaner Backbone based Stereo Network	Rafia Rahim et.al.	2503.18557	link
2025-03-24	Distilling Stereo Networks for Performant and Efficient Leaner Networks	Rafia Rahim et.al.	2503.18544	link
2025-03-24	Natural Language Processing for Electronic Health Records in Scandinavian Languages: Norwegian, Swedish, and Danish	Ashenafi Zebene Woldaregay et.al.	2503.18539	null
2025-03-24	PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model	Junyuan Gao et.al.	2503.18484	link
2025-03-24	PS-EIP: Robust Photometric Stereo Based on Event Interval Profile	Kazuma Kitazawa et.al.	2503.18341	null
2025-03-24	Vision-Guided Loco-Manipulation with a Snake Robot	Adarsh Salagame et.al.	2503.18308	null
2025-03-24	RAU: Towards Regularized Alignment and Uniformity for Representation Learning in Recommendation	Xi Wu et.al.	2503.18300	null
2025-03-24	Fact-checking AI-generated news reports: Can LLMs catch their own lies?	Jiayi Yao et.al.	2503.18293	null
2025-03-24	GI-SLAM: Gaussian-Inertial SLAM	Xulang Liu et.al.	2503.18275	null
2025-03-21	Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors	Wonbong Jang et.al.	2503.17316	null
2025-03-21	Uncovering cooling center usage as an adaptation strategy for hurricane-blackout-heat compound hazards during Hurricane Beryl (2024)	Tianle Duan et.al.	2503.17292	null
2025-03-21	Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers	Gaojie Jin et.al.	2503.17172	null
2025-03-21	Exploring Few-Shot Object Detection on Blood Smear Images: A Case Study of Leukocytes and Schistocytes	Davide Antonio Mura et.al.	2503.17107	null
2025-03-21	TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting	Jianchuan Chen et.al.	2503.17032	null
2025-03-21	Exploring the Role of Women in Hugging Face Organizations	Maria Tubella Salinas et.al.	2503.17000	link
2025-03-21	DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery	Jiadong Tang et.al.	2503.16964	null
2025-03-21	A Flexible Fairness Framework with Surrogate Loss Reweighting for Addressing Sociodemographic Disparities	Wen Xu et.al.	2503.16836	null
2025-03-20	RESFL: An Uncertainty-Aware Framework for Responsible Federated Learning by Balancing Privacy, Fairness and Utility in Autonomous Vehicles	Dawood Wasif et.al.	2503.16251	null
2025-03-20	Variance-Aware Noisy Training: Hardening DNNs against Unstable Analog Computations	Xiao Wang et.al.	2503.16183	null
2025-03-19	Quantum entropy as a harbinger of factorizability	Henry Bloss et.al.	2503.15603	null
2025-03-19	Evaluating Bias in Retrieval-Augmented Medical Question-Answering Systems	Yuelyu Ji et.al.	2503.15454	null
2025-03-19	Beacon2Science: Enhancing STEREO/HI beacon data1 with machine learning for efficient CME tracking	Justin Le Louëdec et.al.	2503.15288	link
2025-03-19	EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds	Yuanchao Yue et.al.	2503.15284	link
2025-03-19	Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening	Zihan Cao et.al.	2503.14975	null
2025-03-19	Body-Hand Modality Expertized Networks with Cross-attention for Fine-grained Skeleton Action Recognition	Seungyeon Cho et.al.	2503.14960	null
2025-03-19	USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network	Joseph Emmanuel DL Dayo et.al.	2503.14950	null
2025-03-18	VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms	Seungwon Lim et.al.	2503.14427	link
2025-03-18	Exploring Disparity-Accuracy Trade-offs in Face Recognition Systems: The Role of Datasets, Architectures, and Loss Functions	Siddharth D Jaiswal et.al.	2503.14138	null
2025-03-17	SED-MVS: Segmentation-Driven and Edge-Aligned Deformation Multi-View Stereo with Depth Restoration and Occlusion Constraint	Zhenlong Yuan et.al.	2503.13721	null
2025-03-17	Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios	Iryna Repinetska et.al.	2503.13710	null
2025-03-17	A Circular Construction Product Ontology for End-of-Life Decision-Making	Kwabena Adu-Duodu et.al.	2503.13708	null
2025-03-17	Subgroup Performance of a Commercial Digital Breast Tomosynthesis Model for Breast Cancer Detection	Beatrice Brown-Mulry et.al.	2503.13581	null
2025-03-17	Scale Efficient Training for Large Datasets	Qing Zhou et.al.	2503.13385	link
2025-03-17	Financial Adviser Misconduct and Labor Market Penalties: Uncovering Racial Disparities in the Absence of Gender Gaps	Jun Honda et.al.	2503.12837	null
2025-03-17	Stereo Event-based, 6-DOF Pose Tracking for Uncooperative Spacecraft	Zibin Liu et.al.	2503.12732	link
2025-03-17	GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching	Feng Qiao et.al.	2503.12720	link
2025-03-16	A novel association and ranking approach identifies factors affecting educational outcomes of STEM majors	Kira Adaricheva et.al.	2503.12321	link
2025-03-15	Robust Isolation Forest using Soft Sparse Random Projection and Valley Emphasis Method	Hun Kang et.al.	2503.12125	null
2025-03-18	3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene Reconstruction	Peizhen Zheng et.al.	2503.12001	link
2025-03-14	Black Older Adults’ Perception of Using Voice Assistants to Enact a Medical Recovery Curriculum	Andrea Green et.al.	2503.11894	null
2025-03-14	Bridging the LLM Accessibility Divide? Performance, Fairness, and Cost of Closed versus Open LLMs for Automated Essay Scoring	Kezia Oketch et.al.	2503.11827	null
2025-03-14	Thermodynamics of the Hubbard Model on the Bethe Lattice	Jia-Lin Chen et.al.	2503.11598	link
2025-03-14	TikZero: Zero-Shot Text-Guided Graphics Program Synthesis	Jonas Belouadi et.al.	2503.11509	link
2025-03-14	An automated geometric space curve approach for designing dynamically corrected gates	Evangelos Piliouras et.al.	2503.11492	link
2025-03-14	ARCAS: Adaptive Runtime System for Chiplet-Aware Scheduling	Alessandro Fogli et.al.	2503.11460	null
2025-03-14	AQUA-SLAM: Tightly-Coupled Underwater Acoustic-Visual-Inertial SLAM with Sensor Calibration	Shida Xu et.al.	2503.11420	link
2025-03-14	Exploring Competitive and Collusive Behaviors in Algorithmic Pricing with Deep Reinforcement Learning	Shidi Deng et.al.	2503.11270	null
2025-03-14	NF-SLAM: Effective, Normalizing Flow-supported Neural Field representations for object-level visual SLAM in automotive applications	Li Cui et.al.	2503.11199	null
2025-03-14	SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets	Hao Liu et.al.	2503.11133	null
2025-03-14	TigerLLM – A Family of Bangla Large Language Models	Nishat Raihan et.al.	2503.10995	link
2025-03-13	Design and Development of the MeCO Open-Source Autonomous Underwater Vehicle	David Widhalm et.al.	2503.10928	null
2025-03-13	Controlling the dynamical phase diagram of a spinor BEC using time-dependent potentials	Q. Guan et.al.	2503.10563	null
2025-03-13	Subgroup Performance Analysis in Hidden Stratifications	Alceu Bissoto et.al.	2503.10382	null
2025-03-13	Identifying Trustworthiness Challenges in Deep Learning Models for Continental-Scale Water Quality Prediction	Xiaobo Xia et.al.	2503.09947	null
2025-03-12	Approximately Counting and Sampling Hamiltonian Motifs in Sublinear Time	Talya Eden et.al.	2503.09810	null
2025-03-12	How good are deep learning methods for automated road safety analysis using video data? An experimental study	Qingwu Liu et.al.	2503.09807	null
2025-03-12	BiasConnect: Investigating Bias Interactions in Text-to-Image Models	Pushkar Shukla et.al.	2503.09763	null
2025-03-12	Resolving the Kagome Origin of the Strange Metallicity in Ni $_3$ In	Jean C. Souza et.al.	2503.09704	null
2025-03-12	Edge AI for Real-time Fetal Assessment in Rural Guatemala	Nasim Katebi et.al.	2503.09659	null
2025-03-12	IUP: Integrated and Programmable User Plane for Next-Generation Mobile Networks	Chieh-Chun Chen et.al.	2503.09430	null
2025-03-12	OpenVidVRD: Open-Vocabulary Video Visual Relation Detection via Prompt-Driven Semantic Space Alignment	Qi Liu et.al.	2503.09416	null
2025-03-12	GRU: Mitigating the Trade-off between Unlearning and Retention for Large Language Models	Yue Wang et.al.	2503.09117	null
2025-03-12	StratIncon Detector: Analyzing Strategy Inconsistencies Between Real-Time Strategy and Preferred Professional Strategy in MOBA Esports	Ruofei Ma et.al.	2503.09060	null
2025-03-11	BoundarEase: Fostering Constructive Community Engagement to Inform More Equitable Student Assignment Policies	Cassandra Overney et.al.	2503.08543	link
2025-03-11	Does excellence correspond to universal inequality level? Evidences from scholarly citations and Olympic medal data	Soumyajyoti Biswas et.al.	2503.08480	null
2025-03-11	SegDesicNet: Lightweight Semantic Segmentation in Remote Sensing with Geo-Coordinate Embeddings for Domain Adaptation	Sachin Verma et.al.	2503.08290	null
2025-03-11	CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning	Kaiqiang Xiong et.al.	2503.08219	null
2025-03-10	The Janus Face of Innovation: Global Disparities and Divergent Options	Nihat Mugurtay et.al.	2503.07676	null
2025-03-10	VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models	Jen-tse Huang et.al.	2503.07575	link
2025-03-10	OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation	Ding Zhong et.al.	2503.07098	null
2025-03-10	SDFA: Structure Aware Discriminative Feature Aggregation for Efficient Human Fall Detection in Video	Sania Zahan et.al.	2503.07008	null
2025-03-10	Kinetic model and numerical method for multispecies radiation hydrodynamic system with multiscale nonequilibrium transport	Mingyu Quan et.al.	2503.06906	null
2025-03-09	DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning	Chengxuan Qian et.al.	2503.06456	link
2025-03-09	Socioeconomic centers in cities worldwide	Shuai Pang et.al.	2503.06445	link
2025-03-09	Global physics-informed neural networks (GPINNs): from local point-wise constraint to global nodal association	Feng Chen et.al.	2503.06403	null
2025-03-08	Mitigating Blockchain extractable value (BEV) threats by Distributed Transaction Sequencing in Blockchains	Xiongfei Zhao et.al.	2503.06279	null
2025-03-08	Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations	Meng Wang et.al.	2503.06222	null
2025-03-08	Generation of Optimized Solidity Code for Machine Learning Models using LLMs	Nikumbh Sarthak Sham et.al.	2503.06203	null
2025-03-07	Stereo Any Video: Temporally Consistent Stereo Matching	Junpeng Jing et.al.	2503.05549	null
2025-03-07	Asteroid phase curves and phase coloring effect using the ATLAS survey data	Colazo Milagros et.al.	2503.05412	null
2025-03-07	Preparing Tetra-Digit Long-Range Entangled States via Unified Sequential Quantum Circuit	Yu-Tao Hu et.al.	2503.05374	null
2025-03-07	Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects	Justin Yu et.al.	2503.05189	null
2025-03-07	RocketEval: Efficient Automated LLM Evaluation via Grading Checklist	Tianjun Wei et.al.	2503.05142	link
2025-03-06	Addressing the Subsumption Thesis: A Formal Bridge between Microeconomics and Active Inference	Noe Kuhn et.al.	2503.05048	null
2025-03-06	MIDAS: Modeling Ground-Truth Distributions with Dark Knowledge for Domain Generalized Stereo Matching	Peng Xu et.al.	2503.04376	null
2025-03-06	Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English	Runtao Zhou et.al.	2503.04099	null
2025-03-06	Uncovering inequalities in new knowledge learning by large language models across different languages	Chenglong Wang et.al.	2503.04064	link
2025-03-05	Connecting the dots: Tracing the evolutionary pathway of Polar Ring Galaxies in the cases of NGC 3718, NGC 2685, and NGC 4262	Krishna R. Akhil et.al.	2503.03709	null
2025-03-05	The Roles of Size, Packing, and Cohesion in the Emergence of Force Chains in Granular Packings	Ankit Shrivastava et.al.	2503.03668	null
2025-03-05	Improved FPT Approximation Algorithms for TSP	Jingyang Zhao et.al.	2503.03642	null
2025-03-05	Topo Goes Political: TDA-Based Controversy Detection in Imbalanced Reddit Political Data	Arvindh Arun et.al.	2503.03500	null
2025-03-05	BANet: Bilateral Aggregation Network for Mobile Stereo Matching	Gangwei Xu et.al.	2503.03259	link
2025-03-05	Transformer-Based Spatio-Temporal Association of Apple Fruitlets	Harry Freeman et.al.	2503.03200	null
2025-03-04	CADDI: An in-Class Activity Detection Dataset using IMU data from low-cost sensors	Luis Marquez-Carpintero et.al.	2503.02853	null
2025-03-04	Educational Assortative Mating and Household Income Inequality: Evidence from Brazil, Indonesia, Mexico, and South Africa	Ana Kujundzic et.al.	2503.02713	null
2025-03-04	XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification	Xiaoyu Zheng et.al.	2503.02619	null
2025-03-04	Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation	Dengke Zhang et.al.	2503.02459	link
2025-03-04	Tabby: Tabular Data Synthesis with Language Models	Sonia Cromp et.al.	2503.02152	null
2025-03-03	Building Machine Learning Challenges for Anomaly Detection in Science	Elizabeth G. Campolongo et.al.	2503.02112	null
2025-03-03	Understanding Urban-Rural Disparities in Mobility Inefficiency for Colombia, Mexico, and India	Nandini Iyer et.al.	2503.01810	link
2025-03-03	MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Yohann Cabon et.al.	2503.01661	link
2025-03-03	Unmasking Implicit Bias: Evaluating Persona-Prompted LLM Responses in Power-Disparate Social Scenarios	Bryan Chen Zhengyu Tan et.al.	2503.01532	null
2025-03-03	RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation	Shu Pan et.al.	2503.01434	null
2025-02-28	Back to the Future Cyclopean Stereo: a human perception approach unifying deep and geometric constraints	Sherlon Almeida da Silva et.al.	2502.21280	null
2025-02-28	An LLM-based Delphi Study to Predict GenAI Evolution	Francesco Bertolotti et.al.	2502.21092	null
2025-02-28	Modelling the Spatially Varying Non-Linear Effects of Heat Exposure	Xinyi Chen et.al.	2502.20745	null
2025-02-28	Displaying Fear, Sadness, and Joy in Public: Schizophrenia Vloggers’ Video Narration of Emotion and Online Care-Seeking	Jiaying “Lizzy” Liu et.al.	2502.20658	null
2025-02-28	FedConv: A Learning-on-Model Paradigm for Heterogeneous Federated Clients	Leming Shen et.al.	2502.20639	link
2025-02-27	Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis	Jeffrey Yang Fan Chiang et.al.	2502.20383	null
2025-02-27	UniTok: A Unified Tokenizer for Visual Generation and Understanding	Chuofan Ma et.al.	2502.20321	link
2025-02-27	Educator Attention: How computational tools can systematically identify the distribution of a key resource for students	Qingyang Zhang et.al.	2502.20135	null
2025-02-26	Treatment Non-Adherence Bias in Clinical Machine Learning: A Real-World Study on Hypertension Medication	Zhongyuan Liang et.al.	2502.19625	null
2025-02-26	Do LLMs exhibit demographic parity in responses to queries about Human Rights?	Rafiya Javed et.al.	2502.19463	null
2025-03-01	GraphBridge: Towards Arbitrary Transfer Learning in GNNs	Li Ju et.al.	2502.19252	link
2025-02-26	Improving the quality of Web-mined Parallel Corpora of Low-Resource Languages using Debiasing Heuristics	Aloka Fernando et.al.	2502.19074	null
2025-02-26	The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training	Jinbo Wang et.al.	2502.19002	null
2025-02-26	Disparities in Magnetic Cloud Observations Between Two Spacecraft Having Small Radial and Angular Separations Near 1 AU	Anjali Agarwal et.al.	2502.18919	null
2025-02-26	M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance	Qingpei Guo et.al.	2502.18778	null
2025-02-26	Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance	Xueqing Peng et.al.	2502.18772	null
2025-02-26	Deep-Bench: Deep Learning Benchmark Dataset for Code Generation	Alireza Daghighfarsoodeh et.al.	2502.18726	null
2025-02-25	Expected Variational Inequalities	Brian Hu Zhang et.al.	2502.18605	null
2025-02-25	Exploring Gender Disparities in Automatic Speech Recognition Technology	Hend ElGhazaly et.al.	2502.18434	null
2025-02-25	A Kinetic Model of Solar Wind Acceleration Driven by Ambipolar Electric Potential and Velocity-Space Diffusion	Maximilien Péters de Bonhome et.al.	2502.18132	null
2025-02-25	PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching	Han Nie et.al.	2502.18104	link
2025-02-25	Assessing Large Language Models in Agentic Multilingual National Bias	Qianying Liu et.al.	2502.17945	null
2025-02-25	Escaping the Subprime Trap in Algorithmic Lending	Adam Bouyamourn et.al.	2502.17816	null
2025-02-25	Radial dependence of ion fluences in the 2023 July 17 SEP event from Parker Solar Probe to STEREO and ACE	G. D. Muro et.al.	2502.17806	null
2025-02-25	FinP: Fairness-in-Privacy in Federated Learning by Addressing Disparities in Privacy Risk	Tianyu Zhao et.al.	2502.17748	null
2025-02-24	Homophilic Effects on Economic Inequality: A Dynamic Network Agent-Based Model	Gustavo L. Kohlrausch et.al.	2502.17705	null
2025-02-24	$A$-Norm and $A$ -numerical Radius Inequalities for Sums Of Operators in semi-Hilbertian spaces	M. H. M. Rashid et.al.	2502.17696	null
2025-02-24	The DECADE cosmic shear project III: validation of analysis pipeline using spatially inhomogeneous data	D. Anbajagane et.al.	2502.17676	null
2025-02-24	Kandinsky Conformal Prediction: Beyond Class- and Covariate-Conditional Coverage	Konstantina Bairaktari et.al.	2502.17264	null
2025-02-24	Determinants of the Spousal Age Gap in India: Analysis of Indian Microdata	Praveen et.al.	2502.17059	null
2025-02-24	Achieving Fair PCA Using Joint Eigenvalue Decomposition	Vidhi Rathore et.al.	2502.16933	null
2025-02-24	PulseBat: A field-accessible dataset for second-life battery diagnostics from realistic histories using multidimensional rapid pulse test	Shengyu Tao et.al.	2502.16848	null
2025-02-23	Optical appearance of a boson star with soliton potential	Ke-Jian He et.al.	2502.16623	null
2025-02-23	Unmasking Societal Biases in Respiratory Support for ICU Patients through Social Determinants of Health	Mira Moukheiber et.al.	2502.16477	link
2025-02-23	Make Literature-Based Discovery Great Again through Reproducible Pipelines	Bojan Cestnik et.al.	2502.16450	link
2025-02-23	Facilitating Emergency Vehicle Passage in Congested Urban Areas Using Multi-agent Deep Reinforcement Learning	Haoran Su et.al.	2502.16449	null
2025-02-22	Semantic Gaussian Mixture Variational Autoencoder for Sequential Recommendation	Beibei Li et.al.	2502.16140	link
2025-02-22	A Trust-Aware and Cost-Optimized Blockchain Oracle Selection Model with Deep Reinforcement Learning	Hengyang Zhang et.al.	2502.16133	link
2025-02-21	MoMa: A Modular Deep Learning Framework for Material Property Prediction	Botian Wang et.al.	2502.15483	null
2025-02-21	UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction	Chenyu Li et.al.	2502.15199	null
2025-02-21	Graph-Based Deep Learning on Stereo EEG for Predicting Seizure Freedom in Epilepsy Patients	Artur Agaronyan et.al.	2502.15198	null
2025-02-21	TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba	Xiuwei Chen et.al.	2502.15130	null
2025-02-20	Electron Beam Propagation and Radio-Wave Scattering in the Inner Heliosphere using Five Spacecraft	Luis Alberto Cañizares et.al.	2502.15067	null
2025-02-20	Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion	Jiangyuan Liu et.al.	2502.14616	link
2025-02-20	OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images	Zhichao Zheng et.al.	2502.14279	null
2025-02-20	Asymmetric Co-Training for Source-Free Few-Shot Domain Adaptation	Gengxu Li et.al.	2502.14214	link
2025-02-20	Stereo Image Coding for Machines with Joint Visual Feature Compression	Dengchao Jin et.al.	2502.14190	null
2025-02-19	The NavINST Dataset for Multi-Sensor Autonomous Navigation	Paulo Ricardo Marques de Araujo et.al.	2502.13863	null
2025-02-19	CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement	Zheng Wu et.al.	2502.13624	null
2025-02-18	Two Tickets are Better than One: Fair and Accurate Hiring Under Strategic LLM Manipulations	Lee Cohen et.al.	2502.13221	null
2025-02-18	Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks	Markus J. Buehler et.al.	2502.13025	link
2025-02-18	Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version)	Tianyi Zhang et.al.	2502.13017	null
2025-02-18	High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion	Xiang Zhang et.al.	2502.12752	null
2025-02-18	Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection	Zijian Cao et.al.	2502.12735	null
2025-02-18	Simulated Bifurcation with High-dimensional Expansion for Traffic Signal Optimization on Real-world Networks	Shengda Zhao et.al.	2502.12440	null
2025-02-17	The impact of job stability on monetary poverty in Italy: causal small area estimation	Katarzyna Reluga et.al.	2502.12376	null
2025-02-17	Healthcare cost prediction for heterogeneous patient profiles using deep learning models with administrative claims data	Mohammad Amin Morid et.al.	2502.12277	null
2025-02-17	A versatile experimental method to measure the traction forces at interfaces	Yingwei Hou et.al.	2502.12044	null
2025-02-17	pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM	Luigi Freda et.al.	2502.11955	link
2025-02-17	BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages	Shamsuddeen Hassan Muhammad et.al.	2502.11926	link
2025-02-17	Weak solutions and sharp interface limit of the anisotropic Cahn-Hilliard equation with disparate mobility and inhomogeneous potential	Charles Elbar et.al.	2502.11849	null
2025-02-17	Text Classification in the LLM Era - Where do we stand?	Sowmya Vajjala et.al.	2502.11830	null
2025-02-17	Deep Neural Networks for Accurate Depth Estimation with Latent Space Features	Siddiqui Muhammad Yasir et.al.	2502.11777	null
2025-02-17	SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking	Zijian Wu et.al.	2502.11534	null
2025-02-16	Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation	Kunal Swami et.al.	2502.11002	null
2025-02-15	Do Deepfake Detectors Work in Reality?	Simiao Ren et.al.	2502.10920	null
2025-02-15	Mobile Robotic Multi-View Photometric Stereo	Suryansh Kumar et.al.	2502.10842	null
2025-02-14	Enhancing Multilingual LLM Pretraining with Model-Based Data Selection	Bettina Messmer et.al.	2502.10361	null
2025-02-14	Merging public elementary schools to reduce racial/ethnic segregation	Madison Landry et.al.	2502.10193	link
2025-02-14	Evaluating and Improving Graph-based Explanation Methods for Multi-Agent Coordination	Siva Kailas et.al.	2502.09889	null
2025-02-13	Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages	Shreyan Biswas et.al.	2502.09532	null
2025-02-13	SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest	Jack Erhardt et.al.	2502.09528	null
2025-02-13	Diffusion Models Through a Global Lens: Are They Culturally Inclusive?	Zahra Bayramli et.al.	2502.08914	null
2025-02-13	Uncovering Disparities in Rideshare Drivers Earning and Work Patterns: A Case Study of Chicago	Hy Dang et.al.	2502.08893	null
2025-02-12	Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors	Vishwanath Pratap Singh et.al.	2502.08587	null
2025-02-12	An entropy based comparative study of regional and seasonal distributions of particulate matter in Indian cities	Suchismita Banerjee et.al.	2502.08491	null
2025-02-12	Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision	Tianle Liu et.al.	2502.08352	null
2025-02-12	Emergent dimer-model topological order and quasi-particle excitations in liquid crystals: combinatorial vortex lattices	Cuiling Meng et.al.	2502.08314	null
2025-02-12	Unlocking Scaling Law in Industrial Recommendation Systems with a Three-step Paradigm based Large User Model	Bencheng Yan et.al.	2502.08309	null
2025-02-12	From Individual Experience to Collective Evidence: A Reporting-Based Framework for Identifying Systemic Harms	Jessica Dai et.al.	2502.08166	link
2025-02-11	Federated Self-supervised Domain Generalization for Label-efficient Polyp Segmentation	Xinyi Tan et.al.	2502.07951	null
2025-02-11	Small Area Estimation of Education Levels in Low- and Middle-Income Countries	Yunhan Wu et.al.	2502.07946	link
2025-02-11	PFedDST: Personalized Federated Learning with Decentralized Selection Training	Mengchen Fan et.al.	2502.07750	null
2025-02-11	A Nonparametric and Functional Wombling Methodology	Luke A. Barratt et.al.	2502.07740	null
2025-02-11	HGTUL: A Hypergraph-based Model For Trajectory User Linking	Fengjie Chang et.al.	2502.07549	null
2025-02-11	MoENAS: Mixture-of-Expert based Neural Architecture Search for jointly Accurate, Fair, and Robust Edge Deep Neural Networks	Lotfi Abdelkrim Mecharbat et.al.	2502.07422	null
2025-02-11	BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models	Xu Huang et.al.	2502.07346	link
2025-02-11	Music for All: Exploring Multicultural Representations in Music Generation Models (Camera Ready)	Atharva Mehta et.al.	2502.07328	link
2025-02-11	Does Training on Synthetic Data Make Models Less Robust?	Lingze Zhang et.al.	2502.07164	null
2025-02-11	Feature Importance Depends on Properties of the Data: Towards Choosing the Correct Explanations for Your Data and Decision Trees based Models	Célia Wafa Ayad et.al.	2502.07153	null
2025-02-10	Using Contextually Aligned Online Reviews to Measure LLMs’ Performance Disparities Across Language Varieties	Zixin Tang et.al.	2502.07058	null
2025-02-10	A Compiler for Operations on Relations with Bag Semantics	James Dong et.al.	2502.06988	null
2025-02-10	Beyond Literal Token Overlap: Token Alignability for Multilinguality	Katharina Hämmerl et.al.	2502.06468	null
2025-02-10	On the reason for the widespread energetic storm particle event of 13 March 2023	N. Dresing et.al.	2502.06332	null
2025-02-10	The digital labour of artificial intelligence in Latin America: a comparison of Argentina, Brazil, and Venezuela	Paola Tubaro et.al.	2502.06317	null
2025-02-08	Knowledge is Power: Harnessing Large Language Models for Enhanced Cognitive Diagnosis	Zhiang Dong et.al.	2502.05556	null
2025-02-07	Point-Identifying Semiparametric Sample Selection Models with No Excluded Variable	Dongwoo Kim et.al.	2502.05353	null
2025-02-07	Differentiable Mobile Display Photometric Stereo	Gawoon Ban et.al.	2502.05055	null
2025-02-07	Unified Approaches in Self-Supervised Event Stream Modeling: Progress and Prospects	Levente Zólyomi et.al.	2502.04899	null
2025-02-07	Practical implementation of a chiral phononic crystal demonstrator with ultra-low frequency bandgap	Line Mardini et.al.	2502.04775	null
2025-02-06	Targeted Learning for Data Fairness	Alexander Asemota et.al.	2502.04309	null
2025-02-06	Online Learning of Counter Categories and Ratings in PvP Games	Chiu-Chou Lin et.al.	2502.03998	null
2025-02-06	Fairness Aware Reinforcement Learning via Proximal Policy Optimization	Gabriele La Malfa et.al.	2502.03953	null
2025-02-05	Large Teams Overshadow Individual Recognition	Lulin Yang et.al.	2502.03623	null
2025-02-04	How Inclusively do LMs Perceive Social and Moral Norms?	Michael Galarnyk et.al.	2502.02696	link
2025-02-04	Fairness in Survival Analysis: A Novel Conditional Mutual Information Augmentation Approach	Tianyang Xie et.al.	2502.02567	null
2025-02-04	Review of Demographic Bias in Face Recognition	Ketan Kotwal et.al.	2502.02309	null
2025-02-04	Ilargi: a GPU Compatible Factorized ML Model Training Framework	Wenbo Sun et.al.	2502.01985	null
2025-02-03	CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition	Martijn Bartelds et.al.	2502.01777	null
2025-02-03	Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool	Floris Holstege et.al.	2502.01713	null
2025-02-03	Comprehensive Modeling Approaches for Forecasting Bitcoin Transaction Fees: A Comparative Study	Jiangqin Ma et.al.	2502.01029	null
2025-02-02	Fruit Fly Classification (Diptera: Tephritidae) in Images, Applying Transfer Learning	Erick Andrew Bustamante Flores et.al.	2502.00939	null
2025-02-02	Psychometric-Based Evaluation for Theorem Proving with Large Language Models	Jianyu Zhang et.al.	2502.00855	null
2025-02-01	DeepUKF-VIN: Adaptively-tuned Deep Unscented Kalman Filter for 3D Visual-Inertial Navigation based on IMU-Vision-Net	Khashayar Ghanizadegan et.al.	2502.00575	null
2025-02-01	Evaluation of End-to-End Continuous Spanish Lipreading in Different Data Conditions	David Gimeno-Gómez et.al.	2502.00464	link
2025-01-31	Beyond checkmate: exploring the creative chokepoints in AI text	Nafis Irtiza Tripto et.al.	2501.19301	link
2025-02-03	DyPCL: Dynamic Phoneme-level Contrastive Learning for Dysarthric Speech Recognition	Wonjun Lee et.al.	2501.19010	null
2025-01-31	Examining the Impact of Income Inequality and Gender on School Completion in Malaysia: A Machine Learning Approach Utilizing Malaysia’s Public Sector Open Data	Muhammad Sukri Bin Ramli et.al.	2501.18868	null
2025-01-31	Systematic Uncertainties in the Measurement of Neutron lifetime Using Lunar Prospector Neutron Spectrometer	Akshatha Vydula et.al.	2501.18831	null
2025-01-30	Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion	Vitor Guizilini et.al.	2501.18804	null
2025-01-30	CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering	Yumeng Wang et.al.	2501.18457	null
2025-01-30	Surface Defect Identification using Bayesian Filtering on a 3D Mesh	Matteo Dalle Vedove et.al.	2501.18315	null
2025-01-29	From tools to thieves: Measuring and understanding public perceptions of AI through crowdsourced metaphors	Myra Cheng et.al.	2501.18045	null
2025-01-29	STGCN-LSTM for Olympic Medal Prediction: Dynamic Power Modeling and Causal Policy Optimization	Yiquan Wang et.al.	2501.17711	null
2025-01-29	Cross-Language Approach for Quranic QA	Islam Oshallah et.al.	2501.17449	null
2025-01-29	Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models	Yuxuan Li et.al.	2501.17420	null
2025-01-28	Stiff Transfer Learning for Physics-Informed Neural Networks	Emilien Seiler et.al.	2501.17281	null
2025-01-28	Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving	Evgenii Evstafev et.al.	2501.17084	null
2025-01-28	Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning	Xi Chen et.al.	2501.16966	null
2025-01-28	Hybrid Phenology Modeling for Predicting Temperature Effects on Tree Dormancy	Ron van Bree et.al.	2501.16848	link
2025-01-28	Strawberry Robotic Operation Interface: An Open-Source Device for Collecting Dexterous Manipulation Data in Robotic Strawberry Farming	Linsheng Hou et.al.	2501.16717	null
2025-01-27	BiFold: Bimanual Cloth Folding with Language Guidance	Oriol Barbany et.al.	2501.16458	null
2025-01-27	Will nanodust reappear in STEREO/WAVES data?	Nicole Meyer-Vernet et.al.	2501.16133	null
2025-01-27	SampleLLM: Optimizing Tabular Data Synthesis in Recommendations	Jingtong Gao et.al.	2501.16125	null
2025-01-27	Vienna Mosaic: Navigating Social Borders in a Melting Pot	Marc Sadurní et.al.	2501.15920	link
2025-01-26	Fuzzy-aware Loss for Source-free Domain Adaptation in Visual Emotion Recognition	Ying Zheng et.al.	2501.15519	null
2025-01-26	Dfilled: Repurposing Edge-Enhancing Diffusion for Guided DSM Void Filling	Daniel Panangian et.al.	2501.15440	null
2025-01-26	Evaluating Simple Debiasing Techniques in RoBERTa-based Hate Speech Detection Models	Diana Iftimie et.al.	2501.15430	null
2025-01-26	A General Approach to Relaxing Unconfoundedness	Matthew A. Masten et.al.	2501.15400	null
2025-01-25	Fairness in LLM-Generated Surveys	Andrés Abeliuk et.al.	2501.15351	null
2025-01-25	Fairness-aware Contextual Dynamic Pricing with Strategic Buyers	Pangpang Liu et.al.	2501.15338	null
2025-01-25	The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders?	Ayo Adedeji et.al.	2501.15310	null
2025-01-24	Fairness of Deep Ensembles: On the interplay between per-group task difficulty and under-representation	Estanislao Claucich et.al.	2501.14551	null
2025-01-24	SoK: What Makes Private Learning Unfair?	Kai Yao et.al.	2501.14414	null
2025-01-22	Synthetic CT image generation from CBCT: A Systematic Review	Alzahra Altalib et.al.	2501.13972	null
2025-01-23	Analysis of Indic Language Capabilities in LLMs	Aatman Vaidya et.al.	2501.13912	null
2025-01-23	You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain	Timothy Chase Jr et.al.	2501.13725	null
2025-01-23	Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers	Akshit Achara et.al.	2501.13302	link
2025-01-22	Flying shape and aerodynamics of a full-scale flexible Olympic windsurf sail	J. Zhang et.al.	2501.13254	null
2025-01-22	On the development of open geographical data infrastructures in Latin America: progress and challenges	Daniela Ballari et.al.	2501.13235	null
2025-01-22	Enhancing Multi-Attribute Fairness in Healthcare Predictive Modeling	Xiaoyang Wang et.al.	2501.13219	null
2025-01-22	Machine Learning Modeling for Multi-order Human Visual Motion Processing	Zitang Sun et.al.	2501.12810	link
2025-01-22	Exploring Wikipedia Gender Diversity Over Time $\unicode{x2013}$ The Wikipedia Gender Dashboard (WGD)	Yahya Yunus et.al.	2501.12610	null
2025-01-23	Academic Case Reports Lack Diversity: Assessing the Presence and Diversity of Sociodemographic and Behavioral Factors related to Post COVID-19 Condition	Juan Andres Medina Florez et.al.	2501.12538	null
2025-01-21	Decoherence of Schrödinger cat states in light of wave/particle duality	Th. K. Mavrogordatos et.al.	2501.12328	null
2025-01-21	Improving robot understanding using conversational AI: demonstration and feasibility study	Shikhar Kumar et.al.	2501.12214	null
2025-01-21	Towards autonomous photogrammetric forest inventory using a lightweight under-canopy robotic drone	Väinö Karjalainen et.al.	2501.12073	null
2025-01-21	Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging	Shuyi Hu et.al.	2501.11884	null
2025-01-21	FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients	Jiaqi Leng et.al.	2501.11876	link
2025-01-20	Are generative models fair? A study of racial bias in dermatological image generation	Miguel López-Pérez et.al.	2501.11752	null
2025-01-20	Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy	Saeid Asgari Taghanaki et.al.	2501.11721	link
2025-01-20	Multi-View Spectral Clustering for Graphs with Multiple View Structures	Yorgos Tsitsikas et.al.	2501.11422	link
2025-01-20	UniTrans: A Unified Vertical Federated Knowledge Transfer Framework for Enhancing Cross-Hospital Collaboration	Chung-ju Huang et.al.	2501.11388	link
2025-01-20	Mitigating Spatial Disparity in Urban Prediction Using Residual-Aware Spatiotemporal Graph Neural Networks: A Chicago Case Study	Dingyi Zhuang et.al.	2501.11214	null
2025-01-17	DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration	Huiyun Cao et.al.	2501.10325	null
2025-01-17	Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt	Qingcheng Zeng et.al.	2501.09950	null
2025-01-17	FoundationStereo: Zero-Shot Stereo Matching	Bowen Wen et.al.	2501.09898	link
2025-01-16	Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment	Maksim Filipenko et.al.	2501.09490	null
2025-01-16	DEFOM-Stereo: Depth Foundation Model Based Stereo Matching	Hualie Jiang et.al.	2501.09466	link
2025-01-15	TeV afterglow emission from a multi-component GRB jet using the kinetic approach	John P. Hope et.al.	2501.09093	null
2025-01-15	How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias	Tosin Fadahunsi et.al.	2501.09014	link
2025-01-15	StereoGen: High-quality Stereo Image Generation from a Single Image	Xianqi Wang et.al.	2501.08654	null
2025-01-15	MonSter: Marry Monodepth to Stereo Unleashes Power	Junda Cheng et.al.	2501.08643	link
2025-01-15	Image-to-Force Estimation for Soft Tissue Interaction in Robotic-Assisted Surgery Using Structured Light	Jiayin Wang et.al.	2501.08593	null
2025-01-15	Addressing Intersectionality, Explainability, and Ethics in AI-Driven Diagnostics: A Rebuttal and Call for Transdiciplinary Action	Myles Joshua Toledo Tan et.al.	2501.08497	null
2025-01-16	Navigating Gender Disparities in Communication Research Leadership: Academic Recognition, Career Development, and Compensation	Diego F. M. Oliveira et.al.	2501.08401	null
2025-01-14	TriMod Fusion for Multimodal Named Entity Recognition in Social Media	Mosab Alfaqeeh et.al.	2501.08267	null
2025-01-13	An Investigation of Experiences Engaging the Margins in Data-Centric Innovation	Gabriella Thompson et.al.	2501.07690	null
2025-01-13	Digital Twin for Smart Societies: A Catalyst for Inclusive and Accessible Healthcare	Joshit Mohanty et.al.	2501.07570	null
2025-01-13	TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models	Thales Sales Almeida et.al.	2501.07482	link
2025-01-13	PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations	Ting-Yu Dai et.al.	2501.07447	null
2025-01-13	Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion	Li Liang et.al.	2501.07260	link
2025-01-13	Depth and Image Fusion for Road Obstacle Detection Using Stereo Camera	Oleg Perezyabov et.al.	2501.07245	null
2025-01-13	Combined effect of incentives and coupling in multigames in two-layer networks	Luo-Luo Jiang et.al.	2501.07193	null
2025-01-13	Reducing Latency by Eliminating CSIT Feedback: FDD Downlink MIMO Precoding Without CSIT Feedback for Internet-of-Things Communications	Juntaek Han et.al.	2501.07094	null
2025-01-12	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927	link
2025-01-12	Integrators at War: Mediating in AI-assisted Resort-to-Force Decisions	Dennis Müller et.al.	2501.06861	null
2025-01-12	Enabling Cardiac Monitoring using In-ear Ballistocardiogram on COTS Wireless Earbuds	Yongjian Fu et.al.	2501.06744	null
2025-01-10	A monthly sub-national Harmonized Food Insecurity Dataset for comprehensive analysis and predictive modeling	Machefer Mélissande et.al.	2501.06076	null
2025-01-10	“Cause” is Mechanistic Narrative within Scientific Domains: An Ordinary Language Philosophical Critique of “Causal Machine Learning”	Vyacheslav Kungurtsev et.al.	2501.05844	null
2025-01-10	An Efficient Dual ADMM for Huber Regression with Fused Lasso Penalty	Mengjiao Shi et.al.	2501.05676	null
2025-01-10	The Impact of Model Scaling on Seen and Unseen Language Performance	Rhitabrat Pokharel et.al.	2501.05629	null
2025-01-09	Datasheets for Healthcare AI: A Framework for Transparency and Bias Mitigation	Marjia Siddik et.al.	2501.05617	null
2025-01-09	Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping	Wen Tianci et.al.	2501.05242	null
2025-01-09	An Algorithmic Approach for Causal Health Equity: A Look at Race Differentials in Intensive Care Unit (ICU) Outcomes	Drago Plecko et.al.	2501.05197	null
2025-01-09	A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision	Ali Rohan et.al.	2501.05147	null
2025-01-08	Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations	Kirandeep Kaur et.al.	2501.04762	null
2025-01-09	Do Automated Fixes Truly Mitigate Smart Contract Exploits?	Sofia Bobadilla et.al.	2501.04600	link
2025-01-08	Towards Fair Class-wise Robustness: Class Optimal Distribution Adversarial Training	Hongxin Zhi et.al.	2501.04527	null
2025-01-08	Neighborhood Disparities in Smart City Service Adoption	Shahaf Donio et.al.	2501.04363	null
2025-01-07	MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives	Wisdom O. Ikezogwo et.al.	2501.04184	null
2025-01-07	Unifying restart accelerated gradient and proximal bundle methods	Jiaming Liang et.al.	2501.04165	null
2025-01-07	Spanish heat waves curb discretionary mobility and alter work behavior	Andrew Renninger et.al.	2501.03978	null
2025-01-07	Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware	Hegel Pedroza et.al.	2501.03720	null
2025-01-06	Solar Cycle Variation of Axial Orientations and Favorable Locations of Eruptive MFRs	Hong Xie et.al.	2501.03346	null
2025-01-06	CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation	Yuanhong Chen et.al.	2501.02786	null
2025-01-05	Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera	Yuliang Guo et.al.	2501.02464	link
2025-01-05	Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap	Hyunwoo Ko et.al.	2501.02448	null
2025-01-05	Unsupervised Search for Ethnic Minorities’ Medical Segmentation Training Set	Yixiao Chen et.al.	2501.02442	link
2025-01-04	The Integration of Blockchain and Artificial Intelligence for Secure Healthcare Systems	Umar Safdar et.al.	2501.02169	null
2025-01-03	How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data	Ishaan Maitra et.al.	2501.02111	link
2025-01-03	VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment	Wenyan Cong et.al.	2501.01949	link
2025-01-03	Exploring Equality: An Investigation into Custom Loss Functions for Fairness Definitions	Gordon Lee et.al.	2501.01889	null
2025-01-03	CycleFlow: Leveraging Cycle Consistency in Flow Matching for Speaker Style Adaptation	Ziqi Liang et.al.	2501.01861	null
2025-01-03	MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling	Simon Rouard et.al.	2501.01757	null
2025-01-03	The Essence of Contextual Understanding in Theory of Mind: A Study on Question Answering with Story Characters	Chulun Zhou et.al.	2501.01705	null
2025-01-03	CrossView-GS: Cross-view Gaussian Splatting For Large-scale Scene Reconstruction	Chenhao Zhang et.al.	2501.01695	null
2025-01-03	Equity Impacts of Public Transit Network Redesign with Shared Autonomous Mobility Services	Max T. M. Ng et.al.	2501.01615	null
2025-01-02	CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries	Shudong Liu et.al.	2501.01282	null
2025-01-02	TS-SatMVSNet: Slope Aware Height Estimation for Large-Scale Earth Terrain Multi-view Stereo	Song Zhang et.al.	2501.01049	null
2025-01-02	Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer	Ziyang Chen et.al.	2501.01023	link
2025-01-01	High-Probability Polynomial-Time Complexity of Restarted PDHG for Linear Programming	Zikai Xiong et.al.	2501.00728	null
2024-12-31	H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters	Pedram Fekri et.al.	2501.00514	null
2024-12-31	Who Gets Recommended? Investigating Gender, Race, and Country Disparities in Paper Recommendations from Large Language Models	Yifan Tian et.al.	2501.00367	null
2024-12-31	SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation	Shi-Feng Peng et.al.	2501.00303	link
2024-12-30	A Data-Centric Approach to Detecting and Mitigating Demographic Bias in Pediatric Mental Health Text: A Case Study in Anxiety Detection	Julia Ive et.al.	2501.00129	null
2024-12-30	What Makes for a Good Stereoscopic Image?	Netanel Y. Tamir et.al.	2412.21127	null
2024-12-30	Closing Speed Computation using Stereo Camera and Applications in Unsignalized T-Intersection	Gautam Kumar et.al.	2412.20717	null
2024-12-30	MarsSQE: Stereo Quality Enhancement for Martian Images Using Bi-level Cross-view Attention	Mai Xu et.al.	2412.20685	null
2024-12-29	Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control	Bingliang Li et.al.	2412.20378	null
2024-12-29	Impact of Data Distribution on Fairness Guarantees in Equitable Deep Learning	Yan Luo et.al.	2412.20377	link
2024-12-29	FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation	Yan Luo et.al.	2412.20374	link
2024-12-29	Dual-Level Precision Edges Guided Multi-View Stereo with Accurate Planarization	Kehua Chen et.al.	2412.20328	link
2024-12-28	The impact of China’s economic growth on poverty alleviation: From absolute to relative poverty	Yixun Kang et.al.	2412.20176	null
2024-12-28	Neutron star stability beyond the mass peak: assessing the role of out-of-equilibrium perturbations	Martin O. Canullan-Pascual et.al.	2412.20133	null
2024-12-28	Incentivizing supplemental math assignments and using AI-generated hints improve exam performance, especially for racially minoritized students	Yifan Lu et.al.	2412.19961	null
2024-12-27	Analysis of Premature Death Rates in Texas Counties: The Impact of Air Quality, Socioeconomic Factors, and COPD Prevalence	Richard Rich et.al.	2412.19774	null
2024-12-27	Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis	Jiaqi Wang et.al.	2412.19654	link
2024-12-27	Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference	Keke Zhang et.al.	2412.19553	null
2024-12-27	Is Your Text-to-Image Model Robust to Caption Noise?	Weichen Yu et.al.	2412.19531	null
2024-12-27	Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images	Xudong Cai et.al.	2412.19518	null
2024-12-27	Disparate Model Performance and Stability in Machine Learning Clinical Support for Diabetes and Heart Diseases	Ioannis Bilionis et.al.	2412.19495	null
2024-12-27	Effects of Reynolds number and spatial resolution on the pressure source terms in turbulent boundary layers	Aditya Agarwal et.al.	2412.19474	null
2024-12-26	MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo	Byeonggwon Lee et.al.	2412.19130	null
2024-12-25	Evaluating authorship disambiguation quality through anomaly analysis on researchers’ career transition	Huaxia Zhou et.al.	2412.18757	null
2024-12-24	Uncertainty Quantification in Stereo Matching	Wenxiao Cai et.al.	2412.18703	link
2024-12-24	Topological phases protected by projective PT symmetry in alkaline-earth-like atoms	Xiaofan Zhou et.al.	2412.18494	null
2024-12-24	scReader: Prompting Large Language Models to Interpret scRNA-seq Data	Cong Li et.al.	2412.18156	null
2024-12-24	Fundamental Limits in the Search for Less Discriminatory Algorithms – and How to Avoid Them	Benjamin Laufer et.al.	2412.18138	null
2024-12-23	Shifted Composition III: Local Error Framework for KL Divergence	Jason M. Altschuler et.al.	2412.17997	null
2024-12-23	A Multimodal Fusion Framework for Bridge Defect Detection with Cross-Verification	Ravi Datta Rachuri et.al.	2412.17968	null
2024-12-23	Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective	Xinmiao Yu et.al.	2412.17787	null
2024-12-23	Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings	Jérémie Sublime et.al.	2412.17486	null
2024-12-24	Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement	Hyeonjin Kim et.al.	2412.17387	link
2024-12-22	Fairness in Reinforcement Learning with Bisimulation Metrics	Sahand Rezaei-Shoshtari et.al.	2412.17123	null
2024-12-22	Differentially Private Random Block Coordinate Descent	Artavazd Maranjyan et.al.	2412.17054	null
2024-12-22	Lightweight Design and Optimization methods for DCNNs: Progress and Futures	Hanhua Long et.al.	2412.16886	null
2024-12-21	Does calibration mean what they say it means; or, the reference class problem rises again	Lily Hu et.al.	2412.16769	null
2024-12-21	ViM-Disparity: Bridging the Gap of Speed, Accuracy and Memory for Disparity Map Generation	Maheswar Bora et.al.	2412.16745	link
2024-12-21	LUCES-MV: A Multi-View Dataset for Near-Field Point Light Source Photometric Stereo	Fotios Logothetis et.al.	2412.16737	null
2024-12-21	A Unifying Family of Data-Adaptive Partitioning Algorithms	Guy B. Oldaker IV et.al.	2412.16713	null
2024-12-20	Climate Impact Assessment Requires Weighting: Introducing the Weighted Climate Dataset	Marco Gortan et.al.	2412.15699	null
2024-12-20	Gender Disparities in Contributions, Leadership, and Collaboration: An Exploratory Study on Software Systems Research	Shamse Tasnim Cynthia et.al.	2412.15661	null
2024-12-20	Radio filaments as Z-pinched Galactic center wind	Fan Zhang et.al.	2412.15575	null
2024-12-20	SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation	Ke Yan et.al.	2412.15526	link
2024-12-19	Uncertainty-Guided Cross Attention Ensemble Mean Teacher for Semi-supervised Medical Image Segmentation	Meghana Karri et.al.	2412.15380	null
2024-12-19	Tiled Diffusion	Or Madar et.al.	2412.15185	null
2024-12-19	Improving Geometry in Sparse-View 3DGS via Reprojection-based DoF Separation	Yongsung Kim et.al.	2412.14568	null
2024-12-19	Provincial allocation of China’s commercial building operational carbon towards carbon neutrality	Yanqiao Deng et.al.	2412.14523	null
2024-12-19	Who is Helping Whom? Student Concerns about AI- Teacher Collaboration in Higher Education Classrooms	Bingyi Han et.al.	2412.14469	null
2024-12-19	An Immersive Multi-Elevation Multi-Seasonal Dataset for 3D Reconstruction and Visualization	Xijun Liu et.al.	2412.14418	null
2024-12-18	I0T: Embedding Standardization Method Towards Zero Modality Gap	Na Min An et.al.	2412.14384	link
2024-12-18	Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs	David Restrepo et.al.	2412.14304	null
2024-12-18	What Has Been Overlooked in Contrastive Source-Free Domain Adaptation: Leveraging Source-Informed Latent Augmentation within Neighborhood Context	Jing Wang et.al.	2412.14301	link
2024-12-18	On Calibration in Multi-Distribution Learning	Rajeev Verma et.al.	2412.14142	null
2024-12-18	LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research	Tianyang Gu et.al.	2412.14141	null
2024-12-18	Performance Gap in Entity Knowledge Extraction Across Modalities in Vision Language Models	Ido Cohen et.al.	2412.14133	link
2024-12-18	Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation	Rémi Marsal et.al.	2412.14103	null
2024-12-18	Neural Combinatorial Optimization for Stochastic Flexible Job Shop Scheduling Problems	Igor G. Smit et.al.	2412.14052	link
2024-12-18	What If: Causal Analysis with Graph Databases	Amedeo Pachera et.al.	2412.13965	null
2024-12-18	MobiFuse: A High-Precision On-device Depth Perception System with Multi-Data Fusion	Jinrui Zhang et.al.	2412.13848	null
2024-12-18	A2H: A UI Converter from Android to HarmonyOS Platform	Chen Wang et.al.	2412.13693	link
2024-12-18	Soft Modes as a Predictive Framework for Low Dimensional Biological Systems across Scales	Christopher Joel Russo et.al.	2412.13637	null
2024-12-18	SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation	Kazuki Shimada et.al.	2412.13462	null
2024-12-17	C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System	Parker Addison et.al.	2412.13163	null
2024-12-17	Unlocking the Potential of Digital Pathology: Novel Baselines for Compression	Maximilian Fischer et.al.	2412.13137	null
2024-12-17	Queries, Representation & Detection: The Next 100 Model Fingerprinting Schemes	Augustin Godinot et.al.	2412.13021	link
2024-12-17	AoI in Context-Aware Hybrid Radio-Optical IoT Networks	Aymen Hamrouni et.al.	2412.12914	null
2024-12-17	ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation	Shiqi Huang et.al.	2412.12798	link
2024-12-17	Preference Robust Ordinal Priority Approach and its Satisficing Extension for Multi-Attribute Decision-Making with Incomplete Information	Renlong Wang et.al.	2412.12690	null
2024-12-17	SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing	Chen Chen et.al.	2412.12685	link
2024-12-17	DriveTester: A Unified Platform for Simulation-Based Autonomous Driving Testing	Mingfei Cheng et.al.	2412.12656	link
2024-12-17	PBVS 2024 Solution: Self-Supervised Learning and Sampling Strategies for SAR Classification in Extreme Long-Tail Distribution	Yuhyun Kim et.al.	2412.12565	null
2024-12-17	Beyond Data Quantity: Key Factors Driving Performance in Multilingual Language Models	Sina Bagheri Nezhad et.al.	2412.12500	link
2024-12-16	CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models	Felix Taubner et.al.	2412.12093	null
2024-12-16	IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations	Zhibing Li et.al.	2412.12083	null
2024-12-16	Hybrid quantum network for sensing in the acoustic frequency range	Valeriy Novikov et.al.	2412.11824	null
2024-12-16	Image Gradient-Aided Photometric Stereo Network	Kaixuan Wang et.al.	2412.11650	null
2024-12-16	DVP-MVS: Synergize Depth-Edge and Visibility Prior for Multi-View Stereo	Zhenlong Yuan et.al.	2412.11578	null
2024-12-16	RoMeO: Robust Metric Visual Odometry	Junda Cheng et.al.	2412.11530	null
2024-12-16	SpatialMe: Stereo Video Conversion Using Depth-Warping and Blend-Inpainting	Jiale Zhang et.al.	2412.11512	null
2024-12-15	On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning	Pengfei Fang et.al.	2412.11017	null
2024-12-13	EvalGIM: A Library for Evaluating Generative Image Models	Melissa Hall et.al.	2412.10604	link
2024-12-13	Enhancing Fine-Grained Vision-Language Pretraining with Negative Augmented Samples	Yeyuan Wang et.al.	2412.10029	null
2024-12-13	All-in-One: Transferring Vision Foundation Models into Stereo Matching	Jingyi Zhou et.al.	2412.09912	null
2024-12-13	OpenForge: Probabilistic Metadata Integration	Tianji Cong et.al.	2412.09788	link
2024-12-12	Egyptian fractions meet the Sierpinski triangle	Laura De Carli et.al.	2412.09728	null
2024-12-12	Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos	Linyi Jin et.al.	2412.09621	null
2024-12-12	Learned Compression for Compressed Learning	Dan Jacobellis et.al.	2412.09405	link
2024-12-12	T-SVG: Text-Driven Stereoscopic Video Generation	Qiao Jin et.al.	2412.09323	null
2024-12-12	Multimodal Sentiment Analysis based on Video and Audio Inputs	Antonio Fernandez et.al.	2412.09317	null
2024-12-12	Pinpoint Counterfactuals: Reducing social bias in foundation models via localized counterfactual generation	Kirill Sirotkin et.al.	2412.09160	null
2024-12-12	LV-CadeNet: Long View Feature Convolution-Attention Fusion Encoder-Decoder Network for Clinical MEG Spike Detection	Kuntao Xiao et.al.	2412.08896	null
2024-12-11	jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images	Andreas Koukounas et.al.	2412.08802	null
2024-12-11	TGOSPA Metric Parameters Selection and Evaluation for Visual Multi-object Tracking	Jan Krejčí et.al.	2412.08321	null
2024-12-11	Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation	Marta R. Costa-jussà et.al.	2412.08279	null
2024-12-11	Neural Observation Field Guided Hybrid Optimization of Camera Placement	Yihan Cao et.al.	2412.08266	link
2024-12-11	Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions	Mohammadmostafa Rostamkhani et.al.	2412.08169	link
2024-12-11	Rigid Communication Topologies: Impact on Stability, Safety, Energy Consumption, Passenger Comfort, and Robustness of Vehicular Platoons	Amir Zakerimanesh et.al.	2412.08122	null
2024-12-11	Multilingual LLMs Inherently Reward In-Language Time-Sensitive Semantic Alignment for Low-Resource Languages	Ashutosh Bajpai et.al.	2412.08090	link
2024-12-10	A large language model-based approach to quantifying the effects of social determinants in liver transplant decisions	Emily Robitschek et.al.	2412.07924	null
2024-12-10	ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer	Jinyi Hu et.al.	2412.07720	link
2024-12-10	Access to care improves EHR reliability and clinical risk prediction model performance	Anna Zink et.al.	2412.07712	null
2024-12-10	Stereo Hand-Object Reconstruction for Human-to-Robot Handover	Yik Lung Pang et.al.	2412.07487	null
2024-12-10	PRM: Photometric Stereo based Large Reconstruction Model	Wenhang Ge et.al.	2412.07371	null
2024-12-10	A Bayesian Mixture Model Approach to Examining Neighborhood Social Determinants of Health Disparities in Endometrial Cancer Care in Massachusetts	Carmen B. Rodriguez et.al.	2412.07134	null
2024-12-10	TT-MPD: Test Time Model Pruning and Distillation	Haihang Wu et.al.	2412.07114	null
2024-12-09	MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds	Zhenggang Tang et.al.	2412.06974	null
2024-12-09	Bridging the Divide: Reconsidering Softmax and Linear Attention	Dongchen Han et.al.	2412.06590	link
2024-12-09	Emerging Challenges in Molecular Paleontology: Misapplication of Environmental DNA Fragments and Misconception of Deamination as a Key Criterion for In Situ DNA Identification	Wan-Qian Zhao et.al.	2412.06378	null
2024-12-09	SeFENet: Robust Deep Homography Estimation via Semantic-Driven Feature Enhancement	Zeru Shi et.al.	2412.06352	null
2024-12-08	DECO: Life-Cycle Management of Enterprise-Grade Chatbots	Yiwen Zhu et.al.	2412.06099	null
2024-12-08	Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors	Alex Rich et.al.	2412.05771	null
2024-12-07	On the effective transfer of knowledge from English to Hindi Wikipedia	Paramita Das et.al.	2412.05708	link
2024-12-07	A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions	Ola Shorinwa et.al.	2412.05563	null
2024-12-06	Excitation spectrum of a double supersolid in a trapped dipolar Bose mixture	Daniel Scheiermann et.al.	2412.05215	null
2024-12-06	Automatic Tissue Differentiation in Parotidectomy using Hyperspectral Imaging	Eric L. Wisotzky et.al.	2412.04879	null
2024-12-06	Differentially Private Random Feature Model	Chunyang Liao et.al.	2412.04785	link
2024-12-06	Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs	Kun Wu et.al.	2412.04747	null
2024-12-05	From Models to Systems: A Comprehensive Fairness Framework for Compositional Recommender Systems	Brian Hsu et.al.	2412.04655	null
2024-12-05	Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail	Luca Bartolomei et.al.	2412.04472	link
2024-12-05	Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird’s-Eye-View via Uncertainty Measure	Saheli Hazra et.al.	2412.04337	null
2024-12-05	Complexity of Vector-valued Prediction: From Linear Models to Stochastic Convex Optimization	Matan Schliserman et.al.	2412.04274	null
2024-12-05	Relationships between Keywords and Strong Beats in Lyrical Music	Callie C. Liao et.al.	2412.04202	null
2024-12-05	Adult Glioma Segmentation in Sub-Saharan Africa using Transfer Learning on Stratified Finetuning Data	Abhijeet Parida et.al.	2412.04111	link
2024-12-05	Augmenting Minds or Automating Skills: The Differential Role of Human Capital in Generative AI’s Impact on Creative Tasks	Meiling Huang et.al.	2412.03963	null
2024-12-05	BEFL: Balancing Energy Consumption in Federated Learning for Mobile Edge IoT	Zehao Ju et.al.	2412.03950	link
2024-12-05	MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application	Hyesu Jang et.al.	2412.03887	null
2024-12-05	E-Commerce in Africa: Divergent Impacts on Rural and Urban Economies	Jaelyn S. Liang et.al.	2412.03879	null
2024-12-05	Un-evaluated Solutions May Be Valuable in Expensive Optimization	Hao Hao et.al.	2412.03858	null
2024-12-04	Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter	Hermes McGriff et.al.	2412.03518	null
2024-12-04	NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images	Lingen Li et.al.	2412.03517	null
2024-12-04	Data Fusion of Semantic and Depth Information in the Context of Object Detection	Md Abu Yusuf et.al.	2412.03490	null
2024-12-04	Exploring trends in audio mixes and masters: Insights from a dataset analysis	Angeliki Mourgela et.al.	2412.03373	null
2024-12-04	TASR: Timestep-Aware Diffusion Model for Image Super-Resolution	Qinwei Lin et.al.	2412.03355	link
2024-12-04	Social media and suicide: empirical evidence from the quasi-exogenous geographical adoption of Twitter	Alexis Du et.al.	2412.03217	null
2024-12-04	MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras	Huai Yu et.al.	2412.03146	link
2024-12-03	Quaternion-based Unscented Kalman Filter for 6-DoF Vision-based Inertial Navigation in GPS-denied Regions	Khashayar Ghanizadegan et.al.	2412.02768	null
2024-12-03	ROVER: A Multi-Season Dataset for Visual SLAM	Fabian Schmidt et.al.	2412.02506	link
2024-12-03	Single-Shot Metric Depth from Focused Plenoptic Cameras	Blanca Lasheras-Hernandez et.al.	2412.02386	null
2024-12-03	Dual Exposure Stereo for Extended Dynamic Range 3D Imaging	Juhyung Choi et.al.	2412.02351	null
2024-12-03	SparseLGS: Sparse View Language Embedded Gaussian Splatting	Jun Hu et.al.	2412.02245	null
2024-12-03	Crash Severity Risk Modeling Strategies under Data Imbalance	Abdullah Al Mamun et.al.	2412.02094	null
2024-12-02	Mutli-View 3D Reconstruction using Knowledge Distillation	Aditya Dutt et.al.	2412.02039	link
2024-12-02	A Shared Standard for Valid Measurement of Generative AI Systems’ Capabilities, Risks, and Impacts	Alexandra Chouldechova et.al.	2412.01934	null
2024-12-02	World-consistent Video Diffusion with Explicit 3D Modeling	Qihang Zhang et.al.	2412.01821	null
2024-12-03	FairML: A Julia Package for Fair Classification	Jan Pablo Burgard et.al.	2412.01585	link
2024-12-02	Second FRCSyn-onGoing: Winning Solutions and Post-Challenge Analysis to Improve Face Recognition with Synthetic Data	Ivan DeAndres-Tame et.al.	2412.01383	null
2024-11-29	Quantifying the synthetic and real domain gap in aerial scene understanding	Alina Marcu et.al.	2411.19913	null
2024-11-29	Privacy-Preserving Orthogonal Aggregation for Guaranteeing Gender Fairness in Federated Recommendation	Siqing Zhang et.al.	2411.19678	null
2024-11-29	Subjective and Objective Quality Assessment Methods of Stereoscopic Videos with Visibility Affecting Distortions	Sria Biswas et.al.	2411.19522	null
2024-12-02	GausSurf: Geometry-Guided 3D Gaussian Splatting for Surface Reconstruction	Jiepeng Wang et.al.	2411.19454	null
2024-11-28	Cross-Spectral Attention for Unsupervised RGB-IR Face Verification and Person Re-identification	Kshitij Nikhal et.al.	2411.19215	null
2024-11-28	Examining Multimodal Gender and Content Bias in ChatGPT-4o	Roberto Balestri et.al.	2411.19140	null
2024-11-28	Tracking Progress Towards Sustainable Development Goal 6 Using Satellite Imagery	Othmane Echchabi et.al.	2411.19093	null
2024-11-28	Study on the Influence of Embodied Avatars on Gait Parameters in Virtual Environments and Real World	Tianyi Zhou et.al.	2411.18949	null
2024-11-27	A Talent-infused Policy-gradient Approach to Efficient Co-Design of Morphology and Task Allocation Behavior of Multi-Robot Systems	Prajit KrisshnaKumar et.al.	2411.18519	null
2024-11-27	A comparison of extended object tracking with multi-modal sensors in indoor environment	Jiangtao Shuai et.al.	2411.18476	null
2024-11-27	When does a bridge become an aeroplane?	Tina A. Dardeno et.al.	2411.18406	null
2024-11-27	Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation	Mehdi Zayene et.al.	2411.18335	link
2024-11-27	Pixel-aligned RGB-NIR Stereo Imaging and Dataset for Robot Vision	Jinnyeong Kim et.al.	2411.18025	null
2024-11-26	Updating the constraint on the quantum collapse models via kilogram masses	Qi Dai et.al.	2411.17588	null
2024-11-26	Navigating Spatial Inequities in Freight Truck Crash Severity via Counterfactual Inference in Los Angeles	Yichen Wang et.al.	2411.17554	null
2024-11-26	Variational Quantum Simulation of the Fokker-Planck Equation applied to Quantum Radiation Reaction	Óscar Amaro et.al.	2411.17517	link
2024-11-26	Object-centric proto-symbolic behavioural reasoning from pixels	Ruben van Bergen et.al.	2411.17438	link
2024-11-26	Enhancing Imbalance Learning: A Novel Slack-Factor Fuzzy SVM Approach	M. Tanveer et.al.	2411.17128	link
2024-11-26	Multimodal Alignment and Fusion: A Survey	Songtao Li et.al.	2411.17040	null
2024-11-24	PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation	Ziyao Zeng et.al.	2411.16750	null
2024-11-25	Location-Based Service (LBS) Data Quality Metrics and Effects on Mobility Inference	Xinhua Wu et.al.	2411.16595	null
2024-11-23	IRSKG: Unified Intrusion Response System Knowledge Graph Ontology for Cyber Defense	Damodar Panigrahi et.al.	2411.15672	null
2024-11-23	Elucidating the nature of axial-vector charm-antibottom tetraquark states	U. Özdem et.al.	2411.15508	null
2024-11-22	Adaptive Group Robust Ensemble Knowledge Distillation	Patrik Kenfack et.al.	2411.14984	null
2024-11-22	A Benchmark Dataset for Collaborative SLAM in Service Environments	Harin Park et.al.	2411.14775	link
2024-11-22	FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification	Zhengrui Guo et.al.	2411.14743	link
2024-11-22	Boson-fermion universality of mesoscopic entanglement fluctuations in free systems	Cunzhong Lou et.al.	2411.14687	null
2024-11-21	Learning Fair Robustness via Domain Mixup	Meiyu Zhong et.al.	2411.14424	null
2024-11-21	InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation	Marziyeh Bamdad et.al.	2411.14358	link
2024-11-21	StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart	Jian Shi et.al.	2411.14295	link
2024-11-21	Why do language models perform worse for morphologically complex languages?	Catherine Arnett et.al.	2411.14198	link
2024-11-21	Compact Visual Data Representation for Green Multimedia – A Human Visual System Perspective	Peilin Chen et.al.	2411.14135	null
2024-11-21	Stereo Anything: Unifying Stereo Matching with Large-Scale Mixed Data	Xianda Guo et.al.	2411.14053	link
2024-11-21	XAgents: A Framework for Interpretable Rule-Based Multi-Agents Cooperation	Hailong Yang et.al.	2411.13932	null
2024-11-20	Predictive Insights into LGBTQ+ Minority Stress: A Transductive Exploration of Social Media Discourse	S. Chapagain et.al.	2411.13534	link
2024-11-20	Non-Perturbative Corrections to Charged Black Hole Evaporation	Vyshnav Mohan et.al.	2411.13454	null
2024-11-20	BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation	Umamaheswaran Raman Kumar et.al.	2411.13251	null
2024-11-20	Asymptotic-Preserving schemes for the Boltzmann mixture model with disparate mass	Zhen Hao et.al.	2411.13240	null
2024-11-20	Superpixel Cost Volume Excitation for Stereo Matching	Shanglong Liu et.al.	2411.13105	null
2024-11-19	MLDGG: Meta-Learning for Domain Generalization on Graphs	Qin Tian et.al.	2411.12913	null
2024-11-19	Towards Fairness in AI for Melanoma Detection: Systemic Review and Recommendations	Laura N Montoya et.al.	2411.12846	null
2024-11-19	Human-Robot Dialogue Annotation for Multi-Modal Common Ground	Claire Bonial et.al.	2411.12829	link
2024-11-19	Probing the Capacity of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction	Sonny George et.al.	2411.12828	link
2024-11-19	Multivariate and Online Transfer Learning with Uncertainty Quantification	Jimmy Hickey et.al.	2411.12555	null
2024-11-19	Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution	Yang Zou et.al.	2411.12530	link
2024-11-19	Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph	Ziyang Chen et.al.	2411.12426	link
2024-11-19	Cities beyond proximity	Dan Hill et.al.	2411.12335	null
2024-11-19	Neuro-3D: Towards 3D Visual Decoding from EEG Signals	Zhanqiang Guo et.al.	2411.12248	null
2024-11-18	MMBind: Unleashing the Potential of Distributed and Heterogeneous Data for Multimodal Learning in IoT	Xiaomin Ouyang et.al.	2411.12126	null
2024-11-18	Fair Distillation: Teaching Fairness from Biased Teachers in Medical Imaging	Milad Masroor et.al.	2411.11939	null
2024-11-18	SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input	Zhen Lv et.al.	2411.11934	null
2024-11-18	The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather	Markus Schön et.al.	2411.11455	null
2024-11-18	Causal Effect of Group Diversity on Redundancy and Coverage in Peer-Reviewing	Navita Goyal et.al.	2411.11437	null
2024-11-17	Label Sharing Incremental Learning Framework for Independent Multi-Label Segmentation Tasks	Deepa Anand et.al.	2411.11105	null
2024-11-16	BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment	Sizhe Wang et.al.	2411.10914	null
2024-11-16	DEAL: Decoupled Classifier with Adaptive Linear Modulation for Group Robust Early Diagnosis of MCI to AD Conversion	Donggyu Lee et.al.	2411.10814	null
2024-11-16	LTCXNet: Advancing Chest X-Ray Analysis with Solutions for Long-Tailed Multi-Label Classification and Fairness Challenges	Chin-Wei Huang et.al.	2411.10746	null
2024-11-16	A Wearable Gait Monitoring System for 17 Gait Parameters Based on Computer Vision	Jiangang Chen et.al.	2411.10739	null
2024-11-15	The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods	Yifu Tao et.al.	2411.10546	null
2024-11-15	Debias-CLR: A Contrastive Learning Based Debiasing Method for Algorithmic Fairness in Healthcare Applications	Ankita Agarwal et.al.	2411.10544	null
2024-11-15	Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion	Haoran Wei et.al.	2411.10369	null
2024-11-15	Domain Adaptation-based Edge Computing for Cross-Conditions Fault Diagnosis	Yanzhi Wang et.al.	2411.10340	null
2024-11-15	Filament eruption deflection and associated CMEs	K. Koleva et.al.	2411.10110	null
2024-11-15	Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses	Yongfan Liu et.al.	2411.10013	link
2024-11-15	Assessing Response Disparities in California Wildland-Urban-Interface (WUI) Cities Using the Compartmental Model	Zihui Ma et.al.	2411.09946	null
2024-11-14	Propensity Score Matching: Should We Use It in Designing Observational Studies?	Fei Wan et.al.	2411.09579	null
2024-11-14	Everyone deserves their voice to be heard: Analyzing Predictive Gender Bias in ASR Models Applied to Dutch Speech Data	Rik Raes et.al.	2411.09431	null
2024-11-14	Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching	Yuran Wang et.al.	2411.09151	null
2024-11-14	Artificial Intelligence for Quantum Computing	Yuri Alexeev et.al.	2411.09131	null
2024-11-13	Fluoroformer: Scaling multiple instance learning to multiplexed images via attention-based channel fusion	Marc Harary et.al.	2411.08975	link
2024-11-13	Gendered Words and Grant Rates: A Textual Analysis of Disparate Outcomes in the Patent System	Deborah Gerhardt et.al.	2411.08526	null
2024-11-13	Anomalous Hall effect from inter-superlattice scattering in a noncollinear antiferromagnet	Lilia S. Xie et.al.	2411.08381	null
2024-11-12	Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset	Khaoula Chehbouni et.al.	2411.08243	null
2024-11-12	Detection asymmetry in solar energetic particle events	S. Dalla et.al.	2411.08211	null
2024-11-12	Estimating Variability in Hospital Charges: The Case of Cesarean Section	Anna Perfilyeva et.al.	2411.08174	null
2024-11-11	Identifying Differential Patient Care Through Inverse Intent Inference	Hyewon Jeong et.al.	2411.07372	null
2024-11-11	Targeting mediating mechanisms of social disparities with an interventional effects framework, applied to the gender pay gap in West Germany	Christiane Didden et.al.	2411.07368	null
2024-11-11	$SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation	Yinshuang Xu et.al.	2411.07326	null
2024-11-11	Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations	Kirti Bhagat et.al.	2411.07320	link
2024-11-10	Analysis of spatially clustered survival data with unobserved covariates using SBART	Durbadal Ghosh et.al.	2411.06591	null
2024-11-10	Image Segmentation from Shadow-Hints using Minimum Spanning Trees	Moritz Heep et.al.	2411.06530	null
2024-11-10	SymmeTac: Symmetric Color LED Driven Efficient Photometric Stereo Reconstruction Methods for Camera-based Tactile Sensors	Jieji Ren et.al.	2411.06377	link
2024-11-08	Characterizing Implementability of Global Protocols with Infinite States and Data	Elaine Li et.al.	2411.05722	null
2024-11-08	Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation	Peidong Liu et.al.	2411.05472	link
2024-11-08	From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$ -NeuS	Haoran Zhang et.al.	2411.05362	link
2024-11-07	Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?	Jonathan Roberts et.al.	2411.05000	null
2024-11-06	Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation	Teppei Kurita et.al.	2411.04714	null
2024-11-11	The Multiple Dimensions of Spuriousness in Machine Learning	Samuel J. Bell et.al.	2411.04696	null
2024-11-07	Comparing Fairness of Generative Mobility Models	Daniel Wang et.al.	2411.04453	null
2024-11-06	Topology Bench: Systematic Graph Based Benchmarking for Core Optical Networks	Robin Matzner et.al.	2411.04160	null
2024-11-06	Optimizing Quantum Circuits, Fast and Slow	Amanda Xu et.al.	2411.04104	null
2024-11-06	These Maps Are Made by Propagation: Adapting Deep Stereo Networks to Road Scenarios with Decisive Disparity Diffusion	Chuang-Wei Liu et.al.	2411.03717	null
2024-11-06	Physical Layer Deception in OFDM Systems	Wenwen Chen et.al.	2411.03677	null
2024-11-06	Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions	Zihan Qin et.al.	2411.03638	null
2024-11-05	Exploring the Cybersecurity-Resilience Gap: An Analysis of Student Attitudes and Behaviors in Higher Education	Steve Goliath et.al.	2411.03219	null
2024-11-05	Gender Differences in Comparative Advantage Matches: Evidence from Linked Employer-Employee Data	Hugo Sant’Anna et.al.	2411.03209	null
2024-11-04	Designing and Evaluating Sampling Strategies for Multiple-Forecast Visualization (MFV)	Ruishi Zou et.al.	2411.02576	null
2024-11-04	Gravitational wave energy spectral density properties from BPASS Galactic binary population in the Milky Way galaxy	Petra Tang et.al.	2411.02563	null
2024-11-04	Neural optical flow for planar and stereo PIV	Andrew I. Masker et.al.	2411.02373	null
2024-11-04	Can Personalized Medicine Coexist with Health Equity? Examining the Cost Barrier and Ethical Implications	Kishi Kobe Yee Francisco et.al.	2411.02307	null
2024-11-04	Constructing Emergent U(1) Symmetries in the Gamma-prime $\left(\bf Γ^{\prime} \right)$ model	Sagar Ramchandani et.al.	2411.02070	null
2024-11-04	Typicalness-Aware Learning for Failure Detection	Yijun Liu et.al.	2411.01981	link
2024-11-04	A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding	Yitong Dong et.al.	2411.01893	null
2024-11-03	Mitigating Matching Biases Through Score Calibration	Mohammad Hossein Moslemi et.al.	2411.01685	link
2024-11-03	One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection	Zhenyu Wang et.al.	2411.01584	null
2024-11-02	Visual Fourier Prompt Tuning	Runjia Zeng et.al.	2411.01327	link
2024-11-02	On The Influence Of The Solar Wind On The Propagation Of Earth-impacting Coronal Mass Ejections	Sandeep Kumar et.al.	2411.01165	null
2024-11-02	Why Does the Cortex Have Such a Vast Storage Capacity?	Hui Wei et.al.	2411.01164	null
2024-10-31	Matchmaker: Self-Improving Large Language Model Programs for Schema Matching	Nabeel Seedat et.al.	2410.24105	null
2024-10-31	A Multi-Modal Approach for Face Anti-Spoofing in Non-Calibrated Systems using Disparity Maps	Ariel Larey et.al.	2410.24031	null
2024-10-31	Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts	Xiang Deng et.al.	2410.23836	null
2024-10-30	Enhancing Image Resolution: A Simulation Study and Sensitivity Analysis of System Parameters for Resourcesat-3S/3SA	Ankur Garg et.al.	2410.23319	null
2024-10-30	TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models	Ziyao Shangguan et.al.	2410.23266	link
2024-10-30	Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe	Songyu Xu et.al.	2410.23154	null
2024-10-30	FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training	Tejaswini Medi et.al.	2410.23142	null
2024-10-30	Decarbonisation of industry and the energy system: exploring mutual impacts and investment planning	Quentin Raillard-Cazanove et.al.	2410.23025	null
2024-10-30	Improving Musical Accompaniment Co-creation via Diffusion Transformers	Javier Nistal et.al.	2410.23005	null
2024-10-30	Knowledge Graph Based Visual Search Application	Pawandeep Kaur Betz et.al.	2410.22846	null
2024-10-30	Price Regulation, Technology and Provider Redistribution	Piyush Akimitsu et.al.	2410.22616	null
2024-10-29	FairSkin: Fair Diffusion for Skin Disease Image Generation	Ruichen Zhang et.al.	2410.22551	null
2024-10-29	From Silos to Systems: Process-Oriented Hazard Analysis for AI Systems	Shalaleh Rismani et.al.	2410.22526	null
2024-10-29	Multimodal Structure Preservation Learning	Chang Liu et.al.	2410.22520	null
2024-10-29	Relieving scale disparity in binary black hole simulations	Nikolas A. Wittek et.al.	2410.22290	null
2024-10-29	Complex-Phase Extensions of Szegedy Quantum Walk on Graphs	Sergio A. Ortega et.al.	2410.22011	null
2024-10-29	Photonic systolic array for all-optical matrix-matrix multiplication	Jungmin Kim et.al.	2410.21671	null
2024-10-28	Intersectional inequalities in social networks	Samuel Martin-Gutierez et.al.	2410.21189	link
2024-10-28	Revealing the core-periphery structure of cities	Federica Fanelli et.al.	2410.21133	null
2024-10-28	BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment	Mehdi Hosseinzadeh et.al.	2410.20969	null
2024-10-28	The Zeno’s Paradox of `Low-Resource’ Languages	Hellina Hailu Nigatu et.al.	2410.20817	null
2024-10-28	Faster WIND: Accelerating Iterative Best-of- $N$ Distillation for LLM Alignment	Tong Yang et.al.	2410.20727	null
2024-10-28	Physics-Free Spectrally Multiplexed Photometric Stereo under Unknown Spectral Composition	Satoshi Ikehata et.al.	2410.20716	link
2024-10-27	Language Models And A Second Opinion Use Case: The Pocket Professional	David Noever et.al.	2410.20636	null
2024-10-27	TabDiff: a Multi-Modal Diffusion Model for Tabular Data Generation	Juntong Shi et.al.	2410.20626	link
2024-10-27	Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks?	Xuan He et.al.	2410.20533	link
2024-10-27	A Navier-Stokes asymptotic preserving Direct Simulation Monte Carlo method for multi-species gas flows	Fei Fei et.al.	2410.20322	null
2024-10-25	DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems	Muhammad Zaeem Shahzad et.al.	2410.19336	null
2024-10-24	Self-organized homogenization of flow networks	Julien Bouvard et.al.	2410.19089	null
2024-10-24	Bridge-Coder: Unlocking LLMs’ Potential to Overcome Language Gaps in Low-Resource Code	Jipeng Zhang et.al.	2410.18957	null
2024-10-27	Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis	Liang Han et.al.	2410.18822	null
2024-10-24	Rigid Single-Slice-in-Volume registration via rotation-equivariant 2D/3D feature matching	Stefan Brandstätter et.al.	2410.18683	null
2024-10-24	A Cranial-Feature-Based Registration Scheme for Robotic Micromanipulation Using a Microscopic Stereo Camera System	Xiaofeng Lin et.al.	2410.18630	null
2024-10-24	Spatial-Temporal Search for Spiking Neural Networks	Kaiwei Che et.al.	2410.18580	null
2024-10-24	Estimating early coronal mass ejection propagation direction with DIRECD during the severe May 8 and follow-up June 8, 2024 events	Shantanu Jain et.al.	2410.18549	null
2024-10-24	Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction	Hongxin Peng et.al.	2410.18433	null
2024-10-24	Large Language Models Reflect the Ideology of their Creators	Maarten Buyl et.al.	2410.18417	link
2024-10-23	Pathological Rheology of Non-Stretching Entangled Polymers: Finite-Time Blow-Up Predictions	Vickie Chen et.al.	2410.18306	null
2024-10-23	Rethinking Positive Pairs in Contrastive Learning	Jiantao Wu et.al.	2410.18200	null
2024-10-23	Continual Learning on a Data Diet	Elif Ceren Gok Yildirim et.al.	2410.17715	link
2024-10-23	Role of the argon and helium bath gases on the structure of H2/O2 detonations	Farzane Zangene et.al.	2410.17561	null
2024-10-22	Characterizing Robocalls with Multiple Vantage Points	Sathvik Prasad et.al.	2410.17361	null
2024-10-22	FairLoRA: Unpacking Bias Mitigation in Vision Models with Fairness-Driven Low-Rank Adaptation	Rohan Sukumaran et.al.	2410.17358	null
2024-10-22	Dhoroni: Exploring Bengali Climate Change and Environmental Views with a Multi-Perspective News Dataset and Natural Language Processing	Azmine Toushik Wasi et.al.	2410.17225	link
2024-10-22	Arabic Dataset for LLM Safeguard Evaluation	Yasser Ashraf et.al.	2410.17040	link
2024-10-22	DENOASR: Debiasing ASRs through Selective Denoising	Anand Kumar Rai et.al.	2410.16712	null
2024-10-21	GReFEL: Geometry-Aware Reliable Facial Expression Learning under Bias and Imbalanced Data Distribution	Azmine Toushik Wasi et.al.	2410.15927	null
2024-10-21	Analysis of short-run and long-run marginal costs of generation in the power market	Shamim Homaei et.al.	2410.15861	null
2024-10-20	A hybrid origin for the Martian atmosphere	Kaveh Pahlevan et.al.	2410.15508	null
2024-10-20	Investigating the Impact of Age and Sex on Cataract Surgery Complications and Outcomes	Hadas Ben-Eli Yaacov Cnaany et.al.	2410.15505	null
2024-10-20	CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts	Malvina Nikandrou et.al.	2410.15453	link
2024-10-20	ActiveNeuS: Neural Signed Distance Fields for Active Stereo	Kazuto Ichimaru et.al.	2410.15376	null
2024-10-19	A Semidefinite Relaxation Approach for Fair Graph Clustering	Sina Baharlouei et.al.	2410.15233	link
2024-10-19	Smart-optimism. Uncovering the Resilience of Romanian City Halls in Online Service Delivery	Catalin Vrabie et.al.	2410.15189	null
2024-10-19	Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation	Seulbi Lee et.al.	2410.14975	null
2024-10-18	A Complexity-Based Theory of Compositionality	Eric Elmoznino et.al.	2410.14817	null
2024-10-18	Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum	Ryan Soh-Eun Shim et.al.	2410.14589	null
2024-10-18	Sim2real Cattle Joint Estimation in 3D point clouds	Okour Mohammad et.al.	2410.14419	null
2024-10-18	Coded Water-Filling for Multi-User Interference Cancellation	Yuan Li et.al.	2410.14136	null
2024-10-17	Auditing and Enforcing Conditional Fairness via Optimal Transport	Mohsen Ghassemi et.al.	2410.14029	null
2024-10-17	A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models	Qiaoyu Tang et.al.	2410.13841	null
2024-10-17	The Disparate Benefits of Deep Ensembles	Kajetan Schweighofer et.al.	2410.13831	link
2024-10-18	Aggregation Artifacts in Subjective Tasks Collapse Large Language Models’ Posteriors	Georgios Chochlakis et.al.	2410.13776	null
2024-10-17	Material Fingerprinting: Identifying and Predicting Perceptual Attributes of Material Appearance	Jiri Filip et.al.	2410.13615	null
2024-10-17	SAda-Net: A Self-Supervised Adaptive Stereo Estimation CNN For Remote Sensing Image Data	Dominik Hirner et.al.	2410.13500	link
2024-10-17	Inner ear morphology in wild versus laboratory house mice	Sabrina Renaud et.al.	2410.13325	null
2024-10-17	Perceptions of Discriminatory Decisions of Artificial Intelligence: Unpacking the Role of Individual Characteristics	Soojong Kim et.al.	2410.13250	null
2024-10-16	A Location Validation Technique to Mitigate GPS Spoofing Attacks in IEEE 802.11p based Fleet Operator’s Network of Electric Vehicles	Ankita Samaddar et.al.	2410.13031	null
2024-10-16	Stability properties for subgroups generated by return words	France Gheeraert et.al.	2410.12534	null
2024-10-16	Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention	Weixuan Wang et.al.	2410.12462	link
2024-10-16	Real-time Stereo-based 3D Object Detection for Streaming Perception	Changcai Li et.al.	2410.12394	link
2024-10-16	Pyramid-Driven Alignment: Pyramid Principle Guided Integration of Large Language Models and Knowledge Graphs	Lei Sun et.al.	2410.12298	null
2024-10-15	A Software Engineering Capstone Course Facilitated By GitHub Templates	Spencer Smith et.al.	2410.12114	null
2024-10-15	DAXA: Traversing the X-ray desert by Democratising Archival X-ray Astronomy	David J. Turner et.al.	2410.11954	link
2024-10-15	Adaptive Coordinators and Prompts on Heterogeneous Graphs for Cross-Domain Recommendations	Hengyu Zhang et.al.	2410.11719	null
2024-10-15	Multiple scales homogenisation of a porous viscoelastic material with rigid inclusions: application to lithium-ion battery electrodes	J. M. Foster et.al.	2410.11699	null
2024-10-16	Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture	Dabbrata Das et.al.	2410.11610	link
2024-10-15	Towards a Healthy AI Tradition: Lessons from Biology and Biomedical Science	Simon Kasif et.al.	2410.11590	null
2024-10-15	MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields	Yuru Xiao et.al.	2410.11394	null
2024-10-15	Improving Bias in Facial Attribute Classification: A Combined Impact of KL Divergence induced Loss Function and Dual Attention	Shweta Patel et.al.	2410.11176	null
2024-10-14	Solving the Transient Dyson Equation with Quasilinear Complexity via Matrix Compression	Baptiste Lamic et.al.	2410.11057	null
2024-10-14	Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation	Emmanouil Zaranis et.al.	2410.10995	link
2024-10-14	Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation	Peiwen Sun et.al.	2410.10676	null
2024-10-14	MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator	Taozhe Li et.al.	2410.10669	null
2024-10-14	Double Jeopardy and Climate Impact in the Use of Large Language Models: Socio-economic Disparities and Reduced Utility for Non-English Speakers	Aivin V. Solatorio et.al.	2410.10665	link
2024-10-14	Energetic Analysis of Emerging Quantum Communication Protocols	Raja Yehia et.al.	2410.10661	link
2024-10-14	Dual-Path Mechanism of Amino Acid Racemization Mediated by Quantum Mechanical Tunneling	Xinrui Yang et.al.	2410.10544	null
2024-10-14	Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world	Han Ling et.al.	2410.10453	link
2024-10-14	Minimum Tuning to Unlock Long Output from LLMs with High Quality Data as the Key	Yingda Chen et.al.	2410.10210	null
2024-10-13	Robust 3D Point Clouds Classification based on Declarative Defenders	Kaidong Li et.al.	2410.09691	link
2024-10-12	Scito2M: A 2 Million, 30-Year Cross-disciplinary Dataset for Temporal Scientometric Analysis	Yiqiao Jin et.al.	2410.09510	link
2024-10-12	Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors	Hritam Basak et.al.	2410.09467	null
2024-10-11	Efficient Multi-Object Tracking on Edge Devices via Reconstruction-Based Channel Pruning	Jan Müller et.al.	2410.08769	null
2024-10-11	No Tick-Size Too Small: A General Method for Modelling Small Tick Limit Order Books	Konark Jain et.al.	2410.08744	null
2024-10-11	Bio-inspired reconfigurable stereo vision for robotics using omnidirectional cameras	Suchang Chen et.al.	2410.08691	null
2024-10-10	PubMed knowledge graph 2.0: Connecting papers, patents, and clinical trials in biomedical science	Jian Xu et.al.	2410.07969	null
2024-10-10	Determining the Magnetic Field in the Galactic Plane from New Arecibo Pulsar Faraday Rotation Measurements	Alice P. Curtin et.al.	2410.07967	null
2024-10-10	A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways	Jing Su et.al.	2410.07915	null
2024-10-10	Multi-Scale Deformable Transformers for Student Learning Behavior Detection in Smart Classroom	Zhifeng Wang et.al.	2410.07834	null
2024-10-09	ACDC: Automated Creation of Digital Cousins for Robust Policy Learning	Tianyuan Dai et.al.	2410.07408	null
2024-10-09	Enhancing Performance of Point Cloud Completion Networks with Consistency Loss	Kevin Tirta Wijaya et.al.	2410.07298	null
2024-10-09	IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation	Xinchen Zhang et.al.	2410.07171	link
2024-10-10	Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology	Xiangyu Wang et.al.	2410.07087	null
2024-10-09	CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models	Zi Gong et.al.	2410.06741	link
2024-10-09	Analysis of different disparity estimation techniques on aerial stereo image datasets	Ishan Narayan et.al.	2410.06711	null
2024-10-09	Decomposing Relationship from 1-to-N into N 1-to-1 for Text-Video Retrieval	Jian Xiao et.al.	2410.06618	link
2024-10-09	The Sampling-Gaussian for stereo matching	Baiyu Pan et.al.	2410.06527	null
2024-10-09	OledFL: Unleashing the Potential of Decentralized Federated Learning via Opposite Lookahead Enhancement	Qinglun Li et.al.	2410.06482	null
2024-10-08	Skin Cancer Machine Learning Model Tone Bias	James Pope et.al.	2410.06385	null
2024-10-08	HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction	Shengji Tang et.al.	2410.06245	null
2024-10-08	BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way	Jiazi Bu et.al.	2410.06241	null
2024-10-07	Studying and Mitigating Biases in Sign Language Understanding Models	Katherine Atwell et.al.	2410.05206	null
2024-10-07	Enhancing Equity in Large Language Models for Medical Applications	Yuelyu Ji et.al.	2410.05180	link
2024-10-07	Presto! Distilling Steps and Layers for Accelerating Music Generation	Zachary Novack et.al.	2410.05167	null
2024-10-07	Correcting for Popularity Bias in Recommender Systems via Item Loss Equalization	Juno Prent et.al.	2410.04830	null
2024-10-07	The divide between us: Internet access among people with and without disabilities in the post-pandemic era	Edgar Pacheco et.al.	2410.04825	null
2024-10-06	Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives	Carolina Veiga et.al.	2410.04318	null
2024-10-05	Fast Object Detection with a Machine Learning Edge Device	Richard C. Rodriguez et.al.	2410.04173	null
2024-10-05	High-Speed Stereo Visual SLAM for Low-Powered Computing Devices	Ashish Kumar et.al.	2410.04090	link
2024-10-05	Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy	Pengcheng Chen et.al.	2410.04041	null
2024-10-04	Improving Arabic Multi-Label Emotion Classification using Stacked Embeddings and Hybrid Loss Function	Nisar Ahmed et.al.	2410.03979	link
2024-10-04	Noncollinear ferrielectricity and hydrogen-induced ferromagnetic polar half-metallicity in MnO $_3$ Cl	Xinyu Yang et.al.	2410.03220	null
2024-10-03	Q-SCALE: Quantum computing-based Sensor Calibration for Advanced Learning and Efficiency	Lorenzo Bergadano et.al.	2410.02998	null
2024-10-03	Individuation of 3D perceptual units from neurogeometry of binocular cells	Maria Virginia Bolelli et.al.	2410.02870	null
2024-10-03	Pseudo-Stereo Inputs: A Solution to the Occlusion Challenge in Self-Supervised Stereo Matching	Ruizhi Yang et.al.	2410.02534	link
2024-10-03	Cooperative Semantic Knowledge Base Update Policy for Multiple Semantic Communication Pairs	Shuling Li et.al.	2410.02405	null
2024-10-03	Extracting the Potential of Emerging Hardware Accelerators for Symmetric Eigenvalue Decomposition	Hansheng Wang et.al.	2410.02170	null
2024-10-03	Quantum Mutual Information in Time	James Fullwood et.al.	2410.02137	null
2024-10-04	C-MELT: Contrastive Enhanced Masked Auto-Encoders for ECG-Language Pre-Training	Manh Pham et.al.	2410.02131	link
2024-10-02	Unified space-time description of pulsed twin beams	Alessandra Gatti et.al.	2410.01907	null
2024-10-02	Conformal Prediction Sets Can Cause Disparate Impact	Jesse C. Cresswell et.al.	2410.01888	link
2024-10-02	A Novel Framework of Horizontal-Vertical Hybrid Federated Learning for EdgeIoT	Kai Li et.al.	2410.01644	null
2024-10-02	Fair Class-Incremental Learning using Sample Weighting	Jaeyoung Park et.al.	2410.01324	null
2024-10-02	SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network	Ahmed Tawfik Aboukhadra et.al.	2410.01293	null
2024-10-02	Unifying the Scope of Bridging Anaphora Types in English: Bridging Annotations in ARRAU and GUM	Lauren Levine et.al.	2410.01170	null
2024-10-01	M2P2: A Multi-Modal Passive Perception Dataset for Off-Road Mobility in Extreme Low-Light Conditions	Aniket Datar et.al.	2410.01105	null
2024-10-01	A catalog of multi-vantage point observations of type-II bursts: Statistics and correlations	Atul Mohan et.al.	2410.00814	null
2024-10-01	CME-associated type-IV radio bursts: The solar paradigm and the unique case of AD Leo	Atul Mohan et.al.	2410.00787	null
2024-10-01	What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study	Beatrice Savoldi et.al.	2410.00545	link
2024-10-01	Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration	Yida Lin et.al.	2410.00503	null
2024-09-30	ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning	Jian Shi et.al.	2410.00262	link
2024-09-30	Uni $^2$ Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection	Yubin Wang et.al.	2409.20558	null
2024-09-30	Match Stereo Videos via Bidirectional Alignment	Junpeng Jing et.al.	2409.20283	null
2024-09-30	Understanding How Psychological Distance Influences User Preferences in Conversational Versus Web Search	Yitian Yang et.al.	2409.19982	null
2024-09-30	Positive-Sum Fairness: Leveraging Demographic Attributes to Achieve Fair AI Outcomes Without Sacrificing Group Gains	Samia Belhadj et.al.	2409.19940	null
2024-09-29	Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems	Xuyang Wu et.al.	2409.19804	link
2024-09-29	Fast-Convergent and Communication-Alleviated Heterogeneous Hierarchical Federated Learning in Autonomous Driving	Wei-Bin Kou et.al.	2409.19560	null
2024-09-29	Transforming Scholarly Landscapes: Influence of Large Language Models on Academic Fields beyond Computer Science	Aniket Pramanick et.al.	2409.19508	link
2024-09-29	KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation	Soofiyan Atar et.al.	2409.19490	null
2024-10-01	Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models	Seongmin Lee et.al.	2409.19382	null
2024-09-27	Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping	Anthony A. Song et.al.	2409.19153	null
2024-09-27	LW2G: Learning Whether to Grow for Prompt-based Continual Learning	Qian Feng et.al.	2409.18860	link
2024-09-27	Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation	Chaomin Shen et.al.	2409.18785	null
2024-09-27	Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds	Hanbin Bae et.al.	2409.18705	null
2024-09-27	Analysis of commissioning data from SST-1M : A Prototype of Single-Mirror Small Size Telescope	Thomas Tavernier et.al.	2409.18639	null
2024-09-27	ChARLES: Change-Aware Recovery of Latent Evolution Semantics in Relational Data	Shiyi He et.al.	2409.18386	null
2024-09-26	Realistic Evaluation of Model Merging for Compositional Generalization	Derek Tam et.al.	2409.18314	link
2024-09-26	Revisit Anything: Visual Place Recognition via Image Segment Retrieval	Kartik Garg et.al.	2409.18049	link
2024-09-26	LGFN: Lightweight Light Field Image Super-Resolution using Local Convolution Modulation and Global Attention Feature Extraction	Zhongxin Yu et.al.	2409.17759	null
2024-09-26	Efficient Bias Mitigation Without Privileged Information	Mateo Espinosa Zarlenga et.al.	2409.17691	null
2024-09-26	Event-based Stereo Depth Estimation: A Survey	Suman Ghosh et.al.	2409.17680	null
2024-09-26	Improving Fast Adversarial Training via Self-Knowledge Guidance	Chengze Jiang et.al.	2409.17589	null
2024-09-26	Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Integrating SGBM and Segmentation Models	Yida Lin et.al.	2409.17526	null
2024-09-26	Characteristics of Powerful Radio Galaxies	Chandra B. Singh et.al.	2409.17514	null
2024-09-26	Active Vision Might Be All You Need: Exploring Active Vision in Bimanual Robotic Manipulation	Ian Chuang et.al.	2409.17435	link
2024-09-25	NTIRE 2024 Challenge on Stereo Image Super-Resolution: Methods and Results	Longguang Wang et.al.	2409.16947	null
2024-09-25	The diverse star formation histories of early massive, quenched galaxies in modern galaxy formation simulations	Claudia del P. Lagos et.al.	2409.16916	link
2024-09-25	Pruning Multilingual Large Language Models for Multilingual Inference	Hwichan Kim et.al.	2409.16911	link
2024-09-25	An Adaptive Screen-Space Meshing Approach for Normal Integration	Moritz Heep et.al.	2409.16907	null
2024-09-25	GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning	Zhe-Rui Yang et.al.	2409.16670	link
2024-09-25	Task-driven SLAM Benchmarking	Yanwei Du et.al.	2409.16573	link
2024-09-24	Camera Calibration and Stereo via a Single Image of a Spherical Mirror	Nissim Barzilay et.al.	2409.16386	null
2024-09-24	Transient bubble rising in the presence of a surfactant at very low concentrations	D. Fernández-Martínez et.al.	2409.16029	null
2024-09-24	AutoCE: An Accurate and Efficient Model Advisor for Learned Cardinality Estimation	Jintao Zhang et.al.	2409.16027	null
2024-09-24	NER-Luxury: Named entity recognition for the fashion and luxury domain	Akim Mousterou et.al.	2409.15804	null
2024-09-24	Identified-and-Targeted: The First Early Evidence of the Privacy-Invasive Use of Browser Fingerprinting for Online Tracking	Zengrui Liu et.al.	2409.15656	null
2024-09-23	Rethinking Emotion Bias in Music via Frechet Audio Distance	Yuanchao Li et.al.	2409.15545	link
2024-09-23	Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras	Ming Li et.al.	2409.14766	null
2024-09-23	An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding	Wei-Bin Kou et.al.	2409.14737	null
2024-09-22	Exploring Multilingual Probing in Large Language Models: A Cross-Language Analysis	Daoyang Li et.al.	2409.14459	null
2024-09-22	Nonmodal stability analysis of the plane Poiseuille flow in a multilayer porous-fluid channel	Supriya Karmakar et.al.	2409.14420	null
2024-09-22	MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting	Chen Tessler et.al.	2409.14393	null
2024-09-23	Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping	Jaehyung Jung et.al.	2409.12051	null
2024-09-18	SymFace: Additional Facial Symmetry Loss for Deep Face Recognition	Pritesh Prakash et.al.	2409.11816	null
2024-09-17	A Pileup of Coronal Mass Ejections Produced the Largest Geomagnetic Storm in Two Decades	Ying D. Liu et.al.	2409.11492	null
2024-09-17	A generalized non-hourglass updated Lagrangian formulation for SPH solid dynamics	Shuaihao Zhang et.al.	2409.11474	null
2024-09-17	Connecting the Low to High Corona: Propagating Disturbances as Tracers of the Near-Sun Solar Wind	Nathalia Alzate et.al.	2409.11352	null
2024-09-17	The SST-1M imaging atmospheric Cherenkov telescope for gamma-ray astrophysics	C. Alispach et.al.	2409.11310	null
2024-09-17	SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models with Customisable Fairness Calibration	Xin Guan et.al.	2409.11149	link
2024-09-17	Optimal Investment under the Influence of Decision-changing Imitation	Huisheng Wang et.al.	2409.10933	null
2024-09-16	GPT takes the SAT: Tracing changes in Test Difficulty and Math Performance of Students	Vikram Krishnaveti et.al.	2409.10750	null
2024-09-16	Exploring 3D Face Reconstruction and Fusion Methods for Face Verification: A Case-Study in Video Surveillance	Simone Maurizio La Cava et.al.	2409.10481	null
2024-09-16	uniGasFoam: a particle-based OpenFOAM solver for multiscale rarefied gas flows	Nikos Vasileiadis et.al.	2409.10288	null
2024-09-16	SOLVR: Submap Oriented LiDAR-Visual Re-Localisation	Joshua Knights et.al.	2409.10247	null
2024-09-16	RF-GML: Reference-Free Generative Machine Listener	Arijit Biswas et.al.	2409.10210	null
2024-09-16	DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection	Kun Fang et.al.	2409.10094	null
2024-09-16	Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments	Wessel Ledder et.al.	2409.10048	link
2024-09-15	Estimating Wage Disparities Using Foundation Models	Keyon Vafa et.al.	2409.09894	null
2024-09-15	A Benchmark Dataset with Larger Context for Non-Factoid Question Answering over Islamic Text	Faiza Qamar et.al.	2409.09844	null
2024-09-15	Introducing DAIMYO: a first-time-right dynamic design architecture and its application to tail-sitter UAS development	Jolan Wauters et.al.	2409.09820	null
2024-09-14	An Augmentation-based Model Re-adaptation Framework for Robust Image Segmentation	Zheming Zuo et.al.	2409.09530	null
2024-09-13	ClearDepth: Enhanced Stereo Perception of Transparent Objects for Robotic Manipulation	Kaixin Bai et.al.	2409.08926	null
2024-09-12	The Impact of Large Language Models on Open-source Innovation: Evidence from GitHub Copilot	Doron Yeverechyahu et.al.	2409.08379	null
2024-09-12	Reducing Population-level Inequality Can Improve Demographic Group Fairness: a Twitter Case Study	Avijit Ghosh et.al.	2409.08135	null
2024-09-12	FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments	Devansh Dhrafani et.al.	2409.07715	null
2024-09-12	Modeling Information Narrative Detection and Evolution on Telegram during the Russia-Ukraine War	Patrick Gerard et.al.	2409.07684	null
2024-09-11	Unsupervised anomaly detection in spatio-temporal stream network sensor data	Edgar Santos-Fernandez et.al.	2409.07667	null
2024-09-11	Object Depth and Size Estimation using Stereo-vision and Integration with SLAM	Layth Hamad et.al.	2409.07623	null
2024-09-11	Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs	Sadra Safadoust et.al.	2409.07456	null
2024-09-11	StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos	Sijie Zhao et.al.	2409.07447	null
2024-09-11	Towards Fairer Health Recommendations: finding informative unbiased samples via Word Sense Disambiguation	Gavin Butts et.al.	2409.07424	null
2024-09-11	The microbiome science of composting and human excrement composting: a review	Jeff Meilander et.al.	2409.07376	null
2024-09-11	Constraining Genetic Symbolic Regression via Semantic Backpropagation	Maximilian Reissmann et.al.	2409.07369	link
2024-09-11	MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications	Praveen K Kanithi et.al.	2409.07314	null
2024-09-11	Learning Personalized Scoping for Graph Neural Networks under Heterophily	Gangda Deng et.al.	2409.06998	link
2024-09-11	Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention	Wenhao Zhao et.al.	2409.06985	null
2024-09-10	A Quality Diversity Approach to Automatically Generate Multi-Agent Path Finding Benchmark Maps	Cheng Qian et.al.	2409.06888	null
2024-09-10	Adversarial Attacks to Multi-Modal Models	Zhihao Dou et.al.	2409.06793	null
2024-09-10	Synchronization of wave-propelled capillary spinners	Jack-William Barotta et.al.	2409.06652	link
2024-09-10	Quantum-like approaches unveil the intrinsic limits of predictability in compartmental models	José Alejandro Rojas-Venegas et.al.	2409.06438	null
2024-09-09	LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo	Wei Zhi Tang et.al.	2409.06104	link
2024-09-09	Online 3D reconstruction and dense tracking in endoscopic videos	Michel Hayoz et.al.	2409.06037	link
2024-09-09	Dust-UV offsets in high-redshift galaxies in the Cosmic Dawn III simulation	Pierre Ocvirk et.al.	2409.05946	null
2024-09-09	The Influence of Task and Group Disparities over Users’ Attitudes Toward Using Large Language Models for Psychotherapy	Qihang He et.al.	2409.05703	null
2024-09-09	LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow	Hongyu Wen et.al.	2409.05688	null
2024-09-09	Adaptive Offloading and Enhancement for Low-Light Video Analytics on Mobile Devices	Yuanyi He et.al.	2409.05297	null
2024-09-08	PatchAlign:Fair and Accurate Skin Disease Image Classification by Alignment with Clinical Labels	Aayushman et.al.	2409.04975	link
2024-09-10	Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios	Zhiqiang Chen et.al.	2409.04961	link
2024-09-08	A Hetero-functional Graph Resilience Analysis for Convergent Systems-of-Systems	Amro M. Farid et.al.	2409.04936	null
2024-09-06	A Short Survey on Set-Based Aggregation Techniques for Single-Vector WSI Representation in Digital Pathology	S. Hemati et.al.	2409.04615	null
2024-09-06	AGR: Age Group fairness Reward for Bias Mitigation in LLMs	Shuirong Cao et.al.	2409.04340	null
2024-09-06	Calibration of Network Confidence for Unsupervised Domain Adaptation Using Estimated Accuracy	Coby Penso et.al.	2409.04241	link
2024-09-06	Confidence-Aware Document OCR Error Detection	Arthur Hemmer et.al.	2409.04117	null
2024-09-06	3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors	Yujun Huang et.al.	2409.04013	link
2024-09-05	An analysis of spectroscopic, seismological, astrometric, and photometric masses of pulsating white dwarf stars	Leila M. Calcaferro et.al.	2409.03896	null
2024-09-05	LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors	Hanyang Yu et.al.	2409.03456	null
2024-09-05	Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities	Wei Lu et.al.	2409.03444	link
2024-09-04	Fast algorithms to improve fair information access in networks	Dennis Robert Windham et.al.	2409.03127	link
2024-09-04	Incorporating dense metric depth into neural 3D representations for view synthesis and relighting	Arkadeep Narayan Chaudhury et.al.	2409.03061	null
2024-09-04	UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views	Jiaxin Guo et.al.	2409.02917	link
2024-09-04	MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling	Jihye Ahn et.al.	2409.02846	null
2024-09-04	Deep Learning Meets Satellite Images – An Evaluation on Handcrafted and Learning-based Features for Multi-date Satellite Stereo Images	Shuang Song et.al.	2409.02825	null
2024-09-04	Experimental Framework for Generating Reliable Ground Truth for Laryngeal Spatial Segmentation Tasks	Hamzeh Ghasemzadeh et.al.	2409.02809	null
2024-09-04	UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching	Soomin Kim et.al.	2409.02545	null
2024-09-04	Demographic parity in regression and classification within the unawareness framework	Vincent Divol et.al.	2409.02471	null
2024-09-04	Unified Framework with Consistency across Modalities for Human Activity Recognition	Tuyen Tran et.al.	2409.02385	link
2024-09-03	Collaboratively Learning Federated Models from Noisy Decentralized Data	Haoyuan Li et.al.	2409.02189	null
2024-09-03	Taming Randomness in Agent-Based Models using Common Random Numbers	Daniel J. Klein et.al.	2409.02086	link
2024-09-03	Observing Context Improves Disparity Estimation when Race is Unobserved	Kweku Kwegyir-Aggrey et.al.	2409.01984	null
2024-08-30	Semi-supervised permutation invariant particle-level anomaly detection	Gabriel Matos et.al.	2408.17409	link
2024-08-30	Fairness-Aware Estimation of Graphical Models	Zhuoping Zhou et.al.	2408.17396	link
2024-08-30	BioBricks.ai: A Versioned Data Registry for Life Sciences Data Assets	Yifan Gao et.al.	2408.17320	null
2024-08-30	Accelerating the discovery of steady-states of planetary interior dynamics with machine learning	Siddhant Agarwal et.al.	2408.17298	null
2024-08-30	A Generic and Automated Methodology to Simulate Melting Point	Fu-Zhi Dai et.al.	2408.17270	null
2024-08-30	Self-supervised learning for crystal property prediction via denoising	Alexander New et.al.	2408.17255	null
2024-08-30	EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs	Zhen Fan et.al.	2408.17168	null
2024-08-30	FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition	Chen Hu et.al.	2408.17090	link
2024-08-29	STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models	Koushik Srivatsan et.al.	2408.16807	link
2024-08-30	ARINC 429 Cyber-vulnerabilities and Voltage Data in a Hardware-in-the-Loop Simulator	Connor Trask et.al.	2408.16714	null
2024-08-29	Fibrations of algebras	Danel Ahman et.al.	2408.16581	null
2024-08-29	Spurfies: Sparse Surface Reconstruction using Local Geometry Priors	Kevin Raj et.al.	2408.16544	null
2024-08-29	Physical Similarity of Fluid Flow in Bimodal Porous Media: Part 1 – Basic Model and Solution Characteristics	Yuhe Wang et.al.	2408.16434	null
2024-08-28	Simulation and analysis of a high-k electron scale turbulence diagnostic for MAST-U	David C. Speirs et.al.	2408.15807	null
2024-08-28	Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions	Huachuan Qiu et.al.	2408.15787	link
2024-08-30	Addressing the challenges of loop detection in agricultural environments	Nicolás Soncini et.al.	2408.15761	link
2024-08-28	ES-PTAM: Event-based Stereo Parallel Tracking and Mapping	Suman Ghosh et.al.	2408.15605	link
2024-08-27	Regional emission dynamics across phases of the EU ETS	Marco Dueñas et.al.	2408.15438	null
2024-08-27	Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty	Saining Zhang et.al.	2408.15242	link
2024-08-27	Learning-based Multi-View Stereo: A Survey	Fangjinhua Wang et.al.	2408.15235	null
2024-08-27	Investigating Coverage Criteria in Large Language Models: An In-Depth Study Through Jailbreak Attacks	Shide Zhou et.al.	2408.15207	null
2024-08-27	Strategic Optimization and Challenges of Large Language Models in Object-Oriented Programming	Zinan Wang et.al.	2408.14834	null
2024-08-26	Towards Graph Prompt Learning: A Survey and Beyond	Qingqing Long et.al.	2408.14520	null
2024-08-26	Predictability and Causality in Spanish and English Natural Language Generation	Andrea Busto-Castiñeira et.al.	2408.14283	null
2024-08-26	Harnessing the Digital Revolution: A Comprehensive Review of mHealth Applications for Remote Monitoring in Transforming Healthcare Delivery	Avnish Singh Jat et.al.	2408.14190	null
2024-08-26	ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation	Ruohua Shi et.al.	2408.14114	null
2024-08-26	Bengali Sign Language Recognition through Hand Pose Estimation using Multi-Branch Spatial-Temporal Attention Model	Abu Saleh Musa Miah et.al.	2408.14111	null
2024-08-26	Fast Edge-Aware Occlusion Detection in the Context of Multispectral Camera Arrays	Frank Sippel et.al.	2408.14050	link
2024-08-26	More Pictures Say More: Visual Intersection Network for Open Set Object Detection	Bingcheng Dong et.al.	2408.14032	null
2024-08-25	Splatt3R: Zero-shot Gaussian Splatting from Uncalibarated Image Pairs	Brandon Smart et.al.	2408.13912	null
2024-08-24	Submodular Maximization Approaches for Equitable Client Selection in Federated Learning	Andrés Catalino Castillo Jiménez et.al.	2408.13683	null
2024-08-24	Outlier Detection Bias Busted: Understanding Sources of Algorithmic Bias through Data-centric Factors	Xueying Ding et.al.	2408.13667	null
2024-08-23	HEK-Omics: The promise of omics to optimize HEK293 for recombinant adeno-associated virus (rAAV) gene therapy manufacturing	Sai Guna Ranjan Gurazada et.al.	2408.13374	null
2024-08-23	Deep Learning at the Intersection: Certified Robustness as a Tool for 3D Vision	Gabriel Pérez S et.al.	2408.13135	null
2024-08-23	VCEMO: Multi-Modal Emotion Recognition for Chinese Voiceprints	Jinghua Tang et.al.	2408.13019	null
2024-08-23	Ada2I: Enhancing Modality Balance for Multimodal Conversational Emotion Recognition	Cam-Van Thi Nguyen et.al.	2408.12895	null
2024-08-23	Refining the isovector component of the Woods-Saxon potential	L. Xayavong et.al.	2408.12794	null
2024-08-22	Disentangled Structural and Featural Representation for Task-Agnostic Graph Valuation	Ali Falahati et.al.	2408.12659	null
2024-08-22	The Hybrid Hospital: Balancing On-Site and Remote Hospitalization	Noa Zychlinski et.al.	2408.12431	null
2024-08-22	Multi-Style Facial Sketch Synthesis through Masked Generative Modeling	Bowen Sun et.al.	2408.12400	null
2024-08-22	Aligning (Medical) LLMs for (Counterfactual) Fairness	Raphael Poulain et.al.	2408.12055	link
2024-08-21	Electrostatic Origins of the Dirichlet Principle	Steven Deckelman et.al.	2408.12002	null
2024-08-21	Time-Dependent Strategy for Improving Aortic Blood Flow Simulations with Boundary Control and Data Assimilation	Muhammad Adnan Anwar et.al.	2408.11617	null
2024-08-21	A Novel $δ$ -SBM-OPA Approach for Policy-Driven Analysis of Carbon Emission Efficiency under Uncertainty in the Chinese Industrial Sector	Shutian Cui et.al.	2408.11600	null
2024-08-21	GSTran: Joint Geometric and Semantic Coherence for Point Cloud Segmentation	Abiao Li et.al.	2408.11558	link
2024-08-21	Mutagenesis screen to map the functionals of parameters of Large Language Models	Yue Hu et.al.	2408.11494	link
2024-08-20	Quantum Inverse Contextual Vision Transformers (Q-ICVT): A New Frontier in 3D Object Detection for AVs	Sanjay Bhargav Dharavath et.al.	2408.11207	link
2024-08-20	SDI-Net: Toward Sufficient Dual-View Interaction for Low-light Stereo Image Enhancement	Linlin Hu et.al.	2408.10934	null
2024-08-20	A Noncontact Technique for Wave Measurement Based on Thermal Stereography and Deep Learning	Deyu Li et.al.	2408.10670	null
2024-08-20	Multi-view Hand Reconstruction with a Point-Embedded Transformer	Lixin Yang et.al.	2408.10581	link
2024-08-19	Customizing Language Models with Instance-wise LoRA for Sequential Recommendation	Xiaoyu Kong et.al.	2408.10159	link
2024-08-19	Envisioning Possibilities and Challenges of AI for Personalized Cancer Care	Elaine Kong et.al.	2408.10108	null
2024-08-19	ARMADA: Attribute-Based Multimodal Data Augmentation	Xiaomeng Jin et.al.	2408.10086	null
2024-08-19	Helical edge modes in a triangular Heisenberg antiferromagnet	Bastian Pradenas et.al.	2408.10062	null
2024-08-19	Bridging the Language Gap: Enhancing Multilingual Prompt-Based Code Generation in LLMs via Zero-Shot Cross-Lingual Transfer	Mingda Li et.al.	2408.09701	null
2024-08-17	Intuitive Human-Robot Interface: A 3-Dimensional Action Recognition and UAV Collaboration Framework	Akash Chaudhary et.al.	2408.09232	null
2024-08-17	TableBench: A Comprehensive and Complex Benchmark for Table Question Answering	Xianjie Wu et.al.	2408.09174	null
2024-08-17	GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation	Weiming Zhang et.al.	2408.09115	null
2024-08-17	Depth-guided Texture Diffusion for Image Semantic Segmentation	Wei Sun et.al.	2408.09097	null
2024-08-17	From Urban Clusters to Megaregions: Mapping Australia’s Evolving Urban Regions	M. K. M Ng et.al.	2408.09054	null
2024-08-16	An Empirical Examination of Balancing Strategy for Counterfactual Estimation on Time Series	Qiang Huang et.al.	2408.08815	null
2024-08-16	CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving	Shihan Peng et.al.	2408.08500	null
2024-08-16	Fishers Harvest Parallel Unlearning in Inherited Model Networks	Xiao Liu et.al.	2408.08493	null
2024-08-15	Comparing NASA Discovery and New Frontiers Class Mission Concepts for the Io Volcano Observer (IVO)	Christopher W. Hamilton et.al.	2408.08334	null
2024-08-15	Cluster Formations of Free and Congested Flows in Urban Road Networks	Yongsung Kwon et.al.	2408.08122	null
2024-08-15	Motif analysis and passing behavior in football passing networks	Ming-Xia Li et.al.	2408.07927	null
2024-08-14	Polarization dynamics: a study of individuals shifting between political communities on social media	Federico Albanese et.al.	2408.07731	null
2024-08-14	Hierarchical Working Memory and a New Magic Number	Weishun Zhong et.al.	2408.07637	null
2024-08-14	Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks	Liting Jiang et.al.	2408.07613	null
2024-08-15	DIffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution	Yuanbo Zhou et.al.	2408.07516	null
2024-08-14	M2L Translation Operators for Kernel Independent Fast Multipole Methods on Modern Architectures	Srinath Kailasa et.al.	2408.07436	null
2024-08-14	Unsupervised Stereo Matching Network For VHR Remote Sensing Images Based On Error Prediction	Liting Jiang et.al.	2408.07419	link
2024-08-14	MorphFader: Enabling Fine-grained Controllable Morphing with Text-to-Audio Models	Purnima Kamath et.al.	2408.07260	null
2024-08-12	Quantized Redshift and its significance for recent observations	Arindam Mal et.al.	2408.07101	null
2024-08-13	The News Comment Gap and Algorithmic Agenda Setting in Online Forums	Flora Böwing et.al.	2408.07052	link
2024-08-13	Quantifying the checkerboard problem to reduce numerical dissipation	Johannes Arend Hopman et.al.	2408.06821	null
2024-08-12	Observation of vortex stripes in UTe $_2$	Y. F. Wang et.al.	2408.06209	null
2024-08-12	IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI	Yash Rampuria et.al.	2408.06113	null
2024-08-12	Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models	Haifan Gong et.al.	2408.05985	null
2024-08-11	Predictors and Socio-Demographic Disparities in STEM Degree Outcomes: A ten-year UK study using Hierarchical Logistic Regression	Andrew M. Low et.al.	2408.05853	null
2024-08-10	EV-MGDispNet: Motion-Guided Event-Based Stereo Disparity Estimation Network with Left-Right Consistency	Junjie Jiang et.al.	2408.05452	null
2024-08-08	LiDAR-Event Stereo Fusion with Hallucinations	Luca Bartolomei et.al.	2408.04633	link
2024-08-08	Charmed hypernuclei within density-dependent relativistic mean-field theory	Wei Yang et.al.	2408.04527	null
2024-08-08	A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery	Mengya Xu et.al.	2408.04426	link
2024-08-07	A Framework for Assessing Cumulative Exposure to Extreme Temperatures During Transit Trip	Huiying Fan et.al.	2408.04081	null
2024-08-07	A Comparison of Fireball Luminous Efficiency Models using Acoustic Records	Luke McFadden et.al.	2408.04078	null
2024-08-07	A Blockchain-based Reliable Federated Meta-learning for Metaverse: A Dual Game Framework	Emna Baccour et.al.	2408.03694	null
2024-08-07	TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization	Kien T. Pham et.al.	2408.03637	null
2024-08-07	Unlocking Exocentric Video-Language Data for Egocentric Video Representation Learning	Zi-Yi Dou et.al.	2408.03567	null
2024-08-07	D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods	Onkar Susladkar et.al.	2408.03558	link
2024-08-07	Opening the Black Box of 3D Reconstruction Error Analysis with VECTOR	Racquel Fygenson et.al.	2408.03503	link
2024-08-06	Transit Rider Heat Stress in Atlanta, GA under Current and Future Climate Scenarios	Huiying Fan et.al.	2408.03457	null
2024-08-06	Fusing Forces: Deep-Human-Guided Refinement of Segmentation Masks	Rafael Sterzinger et.al.	2408.03304	link
2024-08-06	Measuring interconnectedness of infectious diseases in funded and unfunded research: a temporal network analysis on bibliometric data 1995-2022	Anbang Du et.al.	2408.03140	null
2024-08-06	Predictive Performance Test based on the Exhaustive Nested Cross-Validation for High-dimensional data	Iris Ivy Gauran et.al.	2408.03138	null
2024-08-06	Interoperability and Explicable AI-based Zero-Day Attacks Detection Process in Smart Community	Mohammad Sayduzzaman et.al.	2408.02921	null
2024-08-05	Phase Transitions in Anisotropic Turbulence	Adrian van Kan et.al.	2408.02844	null
2024-08-05	Gaussian Mixture based Evidential Learning for Stereo Matching	Weide Liu et.al.	2408.02796	null
2024-08-04	Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image	Xinlin Ren et.al.	2408.02079	link
2024-08-04	PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone	Xin Yang et.al.	2408.02053	null
2024-08-03	Are EU low-carbon structural funds efficient in reducing emissions?	Marco Dueñas et.al.	2408.01782	null
2024-08-03	MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas	Feng Qiao et.al.	2408.01653	null
2024-08-06	Three-dimensional Morphological Reconstruction of Millimeter-Scale Soft Continuum Robots based on Dual-Stereo-Vision	Tian-Ao Ren et.al.	2408.01615	null
2024-08-02	Decentralized Smoothing ADMM for Quantile Regression with Non-Convex Sparse Penalties	Reza Mirzaeifard et.al.	2408.01307	null
2024-08-02	The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models	Hannah Chen et.al.	2408.01285	null
2024-08-01	High-Impact Innovations and Hidden Gender Disparities in Inventor-Evaluator Networks	Tara Sowrirajan et.al.	2408.00905	null
2024-08-01	Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection	Ruiyang Zhang et.al.	2408.00619	link
2024-07-31	Machine Learning Boosted Entropy-Engineered Synthesis of stable Nanometric Solid Solution CuCo Alloys for Efficient Nitrate Reduction to Ammonia	Yao Hu et.al.	2408.00142	null
2024-07-31	A comparative study of radio signatures from winds and jets: Modelling synchrotron emission and polarization	Moun Meenakshi et.al.	2408.00099	null
2024-07-31	Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs	Shi Liu et.al.	2407.21771	null
2024-07-31	Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching	Pengjie Zhang et.al.	2407.21735	null
2024-07-31	Deep Learning-Based Longitudinal Prediction of Childhood Myopia Progression Using Fundus Image Sequences and Baseline Refraction Data	Mengtian Kang et.al.	2407.21467	null
2024-07-31	Modeling Urban Transport Choices: Incorporating Sociocultural Aspects	Kathleen Salazar-Serna et.al.	2407.21307	link
2024-07-30	Algorithm-Assisted Decision Making and Racial Disparities in Housing: A Study of the Allegheny Housing Assessment Tool	Lingwei Cheng et.al.	2407.21209	null
2024-07-30	Different behaviour of the gas-phase and stellar metallicity in the central part of MaNGA galaxies	I. A. Zinchenko et.al.	2407.21160	null
2024-07-30	Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings	Tianyi Zhang et.al.	2407.20870	null
2024-07-30	Planar network statistics for two-dimensional rupturing foams	Joseph Klobusicky et.al.	2407.20858	null
2024-07-30	Evaluating Fairness in Black-box Algorithmic Markets: A Case Study of Ride Sharing in Chicago	Yuhan Liu et.al.	2407.20522	null
2024-07-29	BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation	Kieran Saunders et.al.	2407.20437	null
2024-07-29	Solving QUBOs with a quantum-amenable branch and bound method	Thomas Häner et.al.	2407.20185	null
2024-07-29	Classification of Alzheimer’s Dementia vs. Healthy subjects by studying structural disparities in fMRI Time-Series of DMN	Sneha Noble et.al.	2407.19990	null
2024-07-29	Can I trust my anomaly detection system? A case study based on explainable AI	Muhammad Rashid et.al.	2407.19951	link
2024-07-29	Generalization bounds for regression and classification on adaptive covering input domains	Wen-Liang Hwang et.al.	2407.19715	null
2024-07-29	SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages	Wenxuan Zhang et.al.	2407.19672	link
2024-07-29	AI-Driven Healthcare: A Survey on Ensuring Fairness and Mitigating Bias	Sribala Vidyadhari Chinta et.al.	2407.19655	null
2024-07-28	On the Evaluation Consistency of Attribution-based Explanations	Jiarui Duan et.al.	2407.19471	null
2024-07-27	MSP-MVS: Multi-granularity Segmentation Prior Guided Multi-View Stereo	Zhenlong Yuan et.al.	2407.19323	null
2024-07-27	On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs	Nitay Calderon et.al.	2407.19200	null
2024-07-27	Assessing Spatial Disparities: A Bayesian Linear Regression Approach	Kyle Lin Wu et.al.	2407.19171	null
2024-07-26	PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis	Sohyeong Kim et.al.	2407.18695	null
2024-07-26	Direct observation of quantum vortex fractionalization in multiband superconductors	Yu Zheng et.al.	2407.18610	null
2024-07-26	Content-driven Magnitude-Derivative Spectrum Complementary Learning for Hyperspectral Image Classification	Huiyan Bai et.al.	2407.18593	null
2024-07-25	Unsupervised Training of Neural Cellular Automata on Edge Devices	John Kalkhof et.al.	2407.18114	link
2024-07-25	TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework	Guanfeng Tang et.al.	2407.18038	null
2024-07-25	Towards the Spectral bias Alleviation by Normalizations in Coordinate Networks	Zhicheng Cai et.al.	2407.17834	link
2024-07-25	Multi-modal Data Binding for Survival Analysis Modeling with Incomplete Data and Annotations	Linhao Qu et.al.	2407.17726	null
2024-07-24	Unveiling the structural content of NGC 6357 via kinematics and NIR variability	C. Ordenes-Huanca et.al.	2407.17577	null
2024-07-24	Gender disparities in the dissemination and acquisition of scientific knowledge	Chiara Zappalà et.al.	2407.17441	null
2024-07-25	Domain Generalized Recaptured Screen Image Identification Using SWIN Transformer	Preeti Mehta et.al.	2407.17170	null
2024-07-23	Balanced Multi-Relational Graph Clustering	Zhixiang Shen et.al.	2407.16863	link
2024-07-24	FCNR: Fast Compressive Neural Representation of Visualization Images	Yunfei Lu et.al.	2407.16369	link
2024-07-23	MHD activity induced coherent mode excitation in the edge plasma region of ADITYA-U Tokamak	Kaushlender Singh et.al.	2407.16301	null
2024-07-23	Representation Magnitude has a Liability to Privacy Vulnerability	Xingli Fang et.al.	2407.16164	link
2024-07-22	Inequalities in Computational Thinking Among Incoming Students in an STEM Chilean University	Felipe González-Pizarro et.al.	2407.15833	null
2024-07-22	Breaking the Global North Stereotype: A Global South-centric Benchmark Dataset for Auditing and Mitigating Biases in Facial Recognition Systems	Siddharth D Jaiswal et.al.	2407.15810	null
2024-07-22	Examining Inequality in Park Quality for Promoting Health Across 35 Global Cities	Linus W. Dietz et.al.	2407.15770	link
2024-07-23	Bidirectional skip-frame prediction for video anomaly detection with intra-domain disparity-driven attention	Jiahao Lyu et.al.	2407.15424	null
2024-07-22	Iterative approach to reconstructing neural disparity fields from light-field data	Ligen Shi et.al.	2407.15380	null
2024-07-22	Dissecting Multiplication in Transformers: Insights into LLMs	Luyu Qiu et.al.	2407.15360	link
2024-07-22	Efficient Multi-disparity Transformer for Light Field Image Super-resolution	Zeke Zexi Hu et.al.	2407.15329	null
2024-07-19	PolySinger: Singing-Voice to Singing-Voice Translation from English to Japanese	Silas Antonisen et.al.	2407.14399	null
2024-07-19	tidychangepoint: a unified framework for analyzing changepoint detection in univariate time series	Benjamin S. Baumer et.al.	2407.14369	null
2024-07-19	Stable Audio Open	Zach Evans et.al.	2407.14358	link
2024-07-19	SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization	Mae Younes et.al.	2407.14257	link
2024-07-19	Double-Shot 3D Shape Measurement with a Dual-Branch Network	Mingyang Lei et.al.	2407.14198	null
2024-07-19	Scale Disparity of Instances in Interactive Point Cloud Segmentation	Chenrui Han et.al.	2407.14009	null
2024-07-19	Reexamining Racial Disparities in Automatic Speech Recognition Performance: The Role of Confounding by Provenance	Changye Li et.al.	2407.13982	link
2024-07-19	The Group Robustness is in the Details: Revisiting Finetuning under Spurious Correlations	Tyler LaBonte et.al.	2407.13957	link
2024-07-18	Research on Tibetan Tourism Viewpoints information generation system based on LLM	Jinhu Qi et.al.	2407.13561	null
2024-07-18	CookAR: Affordance Augmentations in Wearable AR to Support Kitchen Tool Interactions for People with Low Vision	Jaewook Lee et.al.	2407.13515	link
2024-07-18	MIR laser CEP estimation using machine learning concepts in bulk high harmonic generation	Balázs Nagyillés et.al.	2407.13512	null
2024-07-18	From Words to Worlds: Compositionality for Cognitive Architectures	Ruchira Dhar et.al.	2407.13419	null
2024-07-18	Hybridization of terahertz phonons and magnons in disparate and spatially-separated material specimens	Marcin Białek et.al.	2407.13305	null
2024-07-18	FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection	Jianwei Zhao et.al.	2407.13133	null
2024-07-17	Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning	Minjae Cho et.al.	2407.13006	null
2024-07-17	Multi-Band Wi-Fi Neural Dynamic Fusion	Sorachi Kato et.al.	2407.12937	null
2024-07-17	Propagation of Interplanetary Shocks in the Heliosphere	Munkhjargal Lkhagvadorj et.al.	2407.12689	null
2024-07-16	Temporally Consistent Stereo Matching	Jiaxi Zeng et.al.	2407.11950	link
2024-07-16	Fairly Accurate: Optimizing Accuracy Parity in Fair Target-Group Detection	Soumyajit Gupta et.al.	2407.11933	null
2024-07-16	MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification	Zhuoxiao Li et.al.	2407.11840	null
2024-07-16	Robust Utility-Preserving Text Anonymization Based on Large Language Models	Tianyu Yang et.al.	2407.11770	link
2024-07-16	Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems	Jianzhu Huai et.al.	2407.11705	null
2024-07-16	Rethinking Fair Graph Neural Networks from Re-balancing	Zhixun Li et.al.	2407.11624	link
2024-07-17	QVD: Post-training Quantization for Video Diffusion Models	Shilong Tian et.al.	2407.11585	null
2024-07-16	Representation Bias in Political Sample Simulations with Large Language Models	Weihong Qi et.al.	2407.11409	null
2024-07-16	The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation	Muyang Qiu et.al.	2407.11356	link
2024-07-15	Benchmarking Vision Language Models for Cultural Understanding	Shravan Nayak et.al.	2407.10920	null
2024-07-15	Temporal Event Stereo via Joint Learning with Stereoscopic Flow	Hoonhee Cho et.al.	2407.10831	link
2024-07-15	Growth of Science: How long will the United States uphold its position?	Dipak Patra et.al.	2407.10771	null
2024-07-15	Socioeconomic factors of national representation in the global film festival circuit: skewed toward the large and wealthy, but small countries can beat the odds	Andres Karjus et.al.	2407.10755	null
2024-07-15	Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model	Zhening Liu et.al.	2407.10632	link
2024-07-15	Muon-induced collisional flavor instability in core-collapse supernova	Jiabao Liu et.al.	2407.10604	null
2024-07-15	A Unifying Approach to Product Constructions for Quantitative Temporal Inference	Kazuki Watanabe et.al.	2407.10465	null
2024-07-14	Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data	Tuo Feng et.al.	2407.10200	link
2024-07-14	Adaptive Model Predictive Control with Data-driven Error Model for Quadrupedal Locomotion	Xuanqi Zeng et.al.	2407.10124	null
2024-07-13	Characterizing Disparity Between Edge Models and High-Accuracy Base Models for Vision Tasks	Zhenyu Wang et.al.	2407.10016	null
2024-07-12	Self-organized multiscale structures in thermally relativistic electron-positron-ion plasmas	Usman Shazad et.al.	2407.09440	null
2024-07-12	Multi-Modal Dataset Creation for Federated~Learning with DICOM Structured Reports	Malte Tölle et.al.	2407.09064	null
2024-07-12	Tissue-Contrastive Semi-Masked Autoencoders for Segmentation Pretraining on Chest CT	Jie Zheng et.al.	2407.08961	null
2024-07-11	MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization	Orevaoghene Ahia et.al.	2407.08818	null
2024-07-11	Adaptive Smooth Non-Stationary Bandits	Joe Suk et.al.	2407.08654	link
2024-07-11	Multi-Group Proportional Representation	Alex Oesterling et.al.	2407.08571	link
2024-07-11	Vox Populi, Vox AI? Using Language Models to Estimate German Public Opinion	Leah von der Heyde et.al.	2407.08563	link
2024-07-11	Unveiling Disparities in Maternity Care: A Topic Modelling Approach to Analysing Maternity Incident Investigation Reports	Georgina Cosma et.al.	2407.08328	null
2024-07-11	DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing	Minghang Zhou et.al.	2407.08132	link
2024-07-10	Stretch your reach: Studying Self-Avatar and Controller Misalignment in Virtual Reality Interaction	Jose Luis Ponton et.al.	2407.08011	null
2024-07-10	A Survey on Deep Stereo Matching in the Twenties	Fabio Tosi et.al.	2407.07816	link
2024-07-10	Explicit inverse of symmetric, tridiagonal near Toeplitz matrices Part II: with weakly diagonally dominant Toeplitz	Bakytzhan Kurmanbek et.al.	2407.07654	null
2024-07-10	TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data	Siyi Du et.al.	2407.07582	link
2024-07-10	Causal Discovery-Driven Change Point Detection in Time Series	Shanyun Gao et.al.	2407.07290	null
2024-07-09	A Detailed Analysis of a Magnetic Island Observed by WISPR on Parker Solar Probe	Madison L. Ascione et.al.	2407.07216	null
2024-07-09	Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images	Chuanrui Zhang et.al.	2407.06984	null
2024-07-09	iASiS: Towards Heterogeneous Big Data Analysis for Personalized Medicine	Anastasia Krithara et.al.	2407.06748	null
2024-07-09	Computer vision tasks for intelligent aerospace missions: An overview	Huilin Chen et.al.	2407.06513	null
2024-07-09	LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration	Jiayi Liu et.al.	2407.06512	link
2024-07-08	Systematic time-coarse graining for driven quantum systems	Leon Bello et.al.	2407.06068	link
2024-07-08	CA-FedRC: Codebook Adaptation via Federated Reservoir Computing in 5G NR	Ziqiang Ye et.al.	2407.05928	null
2024-07-08	GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation	Chenxin Li et.al.	2407.05540	null
2024-07-07	GitHub Marketplace for Automation and Innovation in Software Production	SK Golam Saroar et.al.	2407.05519	null
2024-07-07	Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models	Nikhil Sharma et.al.	2407.05502	null
2024-07-07	CLIMB: A Benchmark of Clinical Bias in Large Language Models	Yubo Zhang et.al.	2407.05250	link
2024-07-06	SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention	Yunzhong Si et.al.	2407.05128	link
2024-07-06	Crowdsourced reviews reveal substantial disparities in public perceptions of parking	Lingyao Li et.al.	2407.05104	link
2024-07-06	SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions	Zaid A. El-Shair et.al.	2407.04908	null
2024-07-05	Balancing Operator’s Risk Averseness in Model Predictive Control of a Reservoir System	Ja-Ho Koo et.al.	2407.04506	null
2024-07-04	The SOHO LASCO CME Catalog – Version 2	Nat Gopalswamy et.al.	2407.04165	null
2024-07-04	Behavioural gap assessment of human-vehicle interaction in real and virtual reality-based scenarios in autonomous driving	Sergio. Martín Serrano et.al.	2407.04070	null
2024-07-04	Adversarial Robustness of VAEs across Intersectional Subgroups	Chethan Krishnamurthy Ramanaik et.al.	2407.03864	link
2024-07-04	M $\mathbf5$ – A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks	Florian Schneider et.al.	2407.03791	null
2024-07-04	High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching	Gael Le Lan et.al.	2407.03648	null
2024-07-04	ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution	Yuanbo Zhou et.al.	2407.03598	link
2024-07-03	Probing Perfection: The Relentless Art of Meddling for Pulmonary Airway Segmentation from HRCT via a Human-AI Collaboration Based Active Learning Method	Shiyi Wang et.al.	2407.03542	null
2024-07-03	How Does Quantization Affect Multilingual LLMs?	Kelly Marchisio et.al.	2407.03211	null
2024-07-03	Stereo Risk: A Continuous Modeling Approach to Stereo Matching	Ce Liu et.al.	2407.03152	null
2024-07-03	Effective Heterogeneous Federated Learning via Efficient Hypernetwork-based Weight Generation	Yujin Shin et.al.	2407.03086	link
2024-07-03	Early-Stage Anomaly Detection: A Study of Model Performance on Complete vs. Partial Flows	Adrian Pekar et.al.	2407.02856	link
2024-07-03	A Pairwise DomMix Attentive Adversarial Network for Unsupervised Domain Adaptive Object Detection	Jie Shao et.al.	2407.02835	null
2024-07-02	Practical Guide for Causal Pathways and Sub-group Disparity Analysis	Farnaz Kohankhaki et.al.	2407.02702	null
2024-07-02	Domain Generalizable Knowledge Tracing via Concept Aggregation and Relation-Based Attention	Yuquan Xie et.al.	2407.02547	null
2024-07-02	QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices	Juntao Zhao et.al.	2407.02327	link
2024-07-02	Crossroads of Continents: Automated Artifact Extraction for Cultural Adaptation with Large Multimodal Models	Anjishnu Mukherjee et.al.	2407.02067	link
2024-07-02	Privacy Risks of General-Purpose AI Systems: A Foundation for Investigating Practitioner Perspectives	Stephen Meisenbacher et.al.	2407.02027	null
2024-07-02	Investigating the Effects of Large-Scale Pseudo-Stereo Data and Different Speech Foundation Model on Dialogue Generative Spoken Language Model	Yu-Kuan Fu et.al.	2407.01911	null
2024-07-01	Race and Privacy in Broadcast Police Communications	Pranav Narayanan Venkit et.al.	2407.01817	null
2024-07-01	Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation	Lianjie Guo et.al.	2407.01292	link
2024-07-01	OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos	Yassine Benzakour et.al.	2407.01265	null
2024-07-01	FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models	Ruinan Jin et.al.	2407.00983	link
2024-06-30	Learning System Dynamics without Forgetting	Xikun Zhang et.al.	2407.00717	link
2024-06-30	Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP	Ayush Ranjan et.al.	2407.00592	null
2024-06-28	LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation	Xianda Guo et.al.	2406.19833	link
2024-06-28	Galaxy Group Ellipticity Confirms a Younger Cosmos	Yu Rong et.al.	2406.19612	null
2024-06-28	What’s the Weight? Estimating Controlled Outcome Differences in Complex Surveys for Health Disparities Research	Stephen Salerno et.al.	2406.19597	link
2024-06-27	Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects	Orevaoghene Ahia et.al.	2406.19564	link
2024-06-27	Stereo Vision Based Robot for Remote Monitoring with VR Support	Mohamed Fazil M. S. et.al.	2406.19498	null
2024-06-27	STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning	Yanan Zhang et.al.	2406.19362	null
2024-06-27	Revealing Fine-Grained Values and Opinions in Large Language Models	Dustin Wright et.al.	2406.19238	link
2024-06-27	RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulaiton	Fanfan Liu et.al.	2406.18977	link
2024-06-27	From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased Decisions	Trenton Chang et.al.	2406.18865	link
2024-06-27	Retain, Blend, and Exchange: A Quality-aware Spatial-Stereo Fusion Approach for Event Stream Recognition	Lan Chen et.al.	2406.18845	link
2024-06-26	DoubleTake: Geometry Guided Depth Estimation	Mohamed Sayed et.al.	2406.18387	null
2024-06-26	An interactive framework for the evaluation and detection of stereoacuity threshold under ambient lighting	Kritika Lohia et.al.	2406.18336	null
2024-06-26	Molecular Diffusion Models with Virtual Receptors	Matan Halfon et.al.	2406.18330	null
2024-06-28	SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance	Caishuang Huang et.al.	2406.18118	link
2024-06-25	Evaluating Fairness in Large Vision-Language Models Across Diverse Demographic Attributes and Prompts	Xuyang Wu et.al.	2406.17974	link
2024-06-25	Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals	Kentaro Seki et.al.	2406.17722	link
2024-06-25	Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2406.17679	null
2024-06-25	RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale	Beck LaBash et.al.	2406.16801	link
2024-06-24	Addressing Polarization and Unfairness in Performative Prediction	Kun Jin et.al.	2406.16756	null
2024-06-24	Lone Pair Induced 1D Character and Weak Cation-anion Interactions: Two Ingredients for Low Thermal Conductivity in Mixed-anion Metal Chalcohalides	Xingchen Shen et.al.	2406.16744	null
2024-06-24	Effective Elastic Properties of Multilayer Graphene	Yun Hwangbo et.al.	2406.16344	null
2024-06-23	Thinking beyond Bias: Analyzing Multifaceted Impacts and Implications of AI on Gendered Labour	Satyam Mohla et.al.	2406.16207	null
2024-06-23	The Persistence of Contrarianism on Twitter: Mapping users’ sharing habits for the Ukraine war, COVID-19 vaccination, and the 2020 Midterm Elections	David Axelrod et.al.	2406.16175	null
2024-06-23	Comparison of methods for mediation analysis with multiple correlated mediators	Mary Appah et.al.	2406.16174	null
2024-06-23	Quantitative Global Carbon Inequality Network	Yanming Guo et.al.	2406.16092	null
2024-06-23	Learning Accurate and Enriched Features for Stereo Image Super-Resolution	Hu Gao et.al.	2406.16001	link
2024-06-23	Generalized Measures of Population Synchrony	Francis C. Motta et.al.	2406.15987	null
2024-06-21	Bug In the Code Stack: Can LLMs Find Bugs in Large Python Code Stacks	Hokyung Lee et.al.	2406.15325	link
2024-06-21	Time-Domain Signatures of Distinct Correlated Insulators in a Moiré Superlattice	Eric A. Arsenault et.al.	2406.15067	null
2024-06-21	3D-Localization of Single Point-Like Gamma Sources with a Coded Aperture Camera	Tobias Meißner et.al.	2406.15048	null
2024-06-21	Trustworthy Enhanced Multi-view Multi-modal Alzheimer’s Disease Prediction with Brain-wide Imaging Transcriptomics Data	Shan Cong et.al.	2406.14977	link
2024-06-21	Direct Multi-Turn Preference Optimization for Language Agents	Wentao Shi et.al.	2406.14868	link
2024-06-21	Older and Wiser: The Marriage of Device Aging and Intellectual Property Protection of Deep Neural Networks	Ning Lin et.al.	2406.14863	null
2024-06-21	Non-Markovian Collective Emission of Giant emitters in the Zeno Regime	Qing-Yang Qiu et.al.	2406.14811	null
2024-06-20	1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators?	Yue Huang et.al.	2406.14721	null
2024-06-20	Population Activity Recovery: Milestones Unfolding, Temporal Interdependencies, and Relationship with Physical and Social Vulnerability	Flavia Ioana Patrascu et.al.	2406.14720	null
2024-06-20	Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data	Johannes Treutlein et.al.	2406.14546	link
2024-06-20	Towards Truthful Multilingual Large Language Models: Benchmarking and Alignment Strategies	Weihao Liu et.al.	2406.14434	link
2024-06-20	Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services	David Hartmann et.al.	2406.14154	null
2024-06-20	Novae: An Important Source of Lithium in the Galaxy	Jun Gao et.al.	2406.13986	null
2024-06-19	Open Generative Large Language Models for Galician	Pablo Gamallo et.al.	2406.13893	null
2024-06-19	Leveraging Large Language Models to Measure Gender Bias in Gendered Languages	Erik Derner et.al.	2406.13677	null
2024-06-19	Transferable Tactile Transformers for Representation Learning Across Diverse Sensors and Tasks	Jialiang Zhao et.al.	2406.13640	null
2024-06-19	Formation of a Magnetic Cloud from the Merging of Two Successive Coronal Mass Ejections	Chong Chen et.al.	2406.13603	null
2024-06-19	MVSBoost: An Efficient Point Cloud-based 3D Reconstruction	Umair Haroon et.al.	2406.13515	null
2024-06-19	Toward Structure Fairness in Dynamic Graph Embedding: A Trend-aware Dual Debiasing Approach	Yicong Li et.al.	2406.13201	link
2024-06-18	Stealth edits for provably fixing or attacking large language models	Oliver J. Sutton et.al.	2406.12670	link
2024-06-18	An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation	Qin Li et.al.	2406.12646	null
2024-06-18	Restorer: Solving Multiple Image Restoration Tasks with One Set of Parameters	Jiawei Mao et.al.	2406.12587	link
2024-06-18	Rastall gravity: accretion disk image in radiation fields context and visual transformations compared to Reissner-Nordstrom black holes	Yu-Xiang Huang et.al.	2406.12466	null
2024-06-18	Status of Astronomy Education in India: A Baseline Survey	Moupiya Maji et.al.	2406.12308	null
2024-06-17	Slicing Through Bias: Explaining Performance Gaps in Medical Image Analysis using Slice Discovery Methods	Vincent Olesen et.al.	2406.12142	link
2024-06-17	The Benefits and Risks of Transductive Approaches for AI Fairness	Muhammed Razzak et.al.	2406.12011	null
2024-06-17	Decomposed evaluations of geographic disparities in text-to-image models	Abhishek Sureddy et.al.	2406.11988	null
2024-06-17	Be careful in multi-messenger inference of the Hubble constant: A path forward for robust inference	Michael Müller et.al.	2406.11965	null
2024-06-17	Personalized Federated Knowledge Graph Embedding with Client-Wise Relation Graph	Xiaoxiong Zhang et.al.	2406.11943	null
2024-06-17	P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models	Shuo Yang et.al.	2406.11391	null
2024-06-17	Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network	Frank Sippel et.al.	2406.11284	link
2024-06-16	Physics-Informed Deep Learning and Partial Transfer Learning for Bearing Fault Diagnosis in the Presence of Highly Missing Data	Mohammadreza Kavianpour et.al.	2406.11023	null
2024-06-16	Rectified Iterative Disparity for Stereo Matching	Weiqing Xiao et.al.	2406.10943	null
2024-06-16	Quantifying Generative Media Bias with a Corpus of Real-world and Generated News Articles	Filip Trhlik et.al.	2406.10773	null
2024-06-15	Trapping of isotropic droplets by disclinations in nematic liquid crystals controlled by surface anchoring and elastic constant disparity	Nilanthi P. Haputhanthrige et.al.	2406.10684	null
2024-06-15	Functional Clustering for Longitudinal Associations between County-Level Social Determinants of Health and Stroke Mortality in the US	Fangzhi Luo et.al.	2406.10499	null
2024-06-15	A Label is Worth a Thousand Images in Dataset Distillation	Tian Qin et.al.	2406.10485	link
2024-06-14	Consistency-diversity-realism Pareto fronts of conditional image generative models	Pietro Astolfi et.al.	2406.10429	null
2024-06-14	Gender Representation in TV and Radio: Automatic Information Extraction methods versus Manual Analyses	David Doukhan et.al.	2406.10316	null
2024-06-14	Carbon Monoxide Cooling in Radiative Transfer Modeling of Supernovae	Collin McLeod et.al.	2406.10132	null
2024-06-14	DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications	Li Li et.al.	2406.10068	link
2024-06-14	Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness	Maximilian Spliethöver et.al.	2406.09977	null
2024-06-14	OpenCapBench: A Benchmark to Bridge Pose Estimation and Biomechanics	Yoni Gozlan et.al.	2406.09788	null
2024-06-14	Cross-view geo-localization: a survey	Abhilash Durgam et.al.	2406.09722	null
2024-06-14	MoME: Mixture of Multimodal Experts for Cancer Survival Prediction	Conghao Xiong et.al.	2406.09696	link
2024-06-13	Strain rate controls alignment in growing bacterial monolayers	Blake Langeslay et.al.	2406.09615	null
2024-06-13	AOC: Analysis of Orthologous Collections – an application for the characterization of natural selection in protein-coding sequences	Alexander Lucaci et.al.	2406.09522	link
2024-06-13	You are what you eat? Feeding foundation models a regionally diverse food dataset of World Wide Dishes	Jabez Magomere et.al.	2406.09496	link
2024-06-13	Scale-Invariant Monocular Depth Estimation via SSI Depth	S. Mahdi H. Miangoleh et.al.	2406.09374	link
2024-06-13	Less Cybersickness, Please: Demystifying and Detecting Stereoscopic Visual Inconsistencies in VR Apps	Shuqing Li et.al.	2406.09313	null
2024-06-13	Python-based DSL for generating Verilog model of Synchronous Digital Circuits	Mandar Datar et.al.	2406.09208	link
2024-06-13	Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns	Kaavya Rekanar et.al.	2406.09203	null
2024-06-13	Fine-Grained Domain Generalization with Feature Structuralization	Wenlong Yu et.al.	2406.09166	link
2024-06-13	Mean Field Study of Superconductivity in the Square Lattice $t$-$J$ Model with Three-Site Hopping	Ke Yang et.al.	2406.08780	null
2024-06-12	On Strongly-equitable Social Welfare Orders Without the Axiom of Choice	Luke Serafin et.al.	2406.08684	null
2024-06-12	Conditional Similarity Triplets Enable Covariate-Informed Representations of Single-Cell Data	Chi-Jane Chen et.al.	2406.08638	link
2024-06-12	Unraveling Code-Mixing Patterns in Migration Discourse: Automated Detection and Analysis of Online Conversations on Reddit	Fedor Vitiugin et.al.	2406.08633	link
2024-06-13	Real2Code: Reconstruct Articulated Objects via Code Generation	Zhao Mandi et.al.	2406.08474	null
2024-06-12	Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models	Javier Nistal et.al.	2406.08384	null
2024-06-12	Chemistry3D: Robotic Interaction Benchmark for Chemistry Experiments	Shoujie Li et.al.	2406.08160	link
2024-06-12	Generalizable Disaster Damage Assessment via Change Detection with Vision Foundation Model	Kyeongjin Ahn et.al.	2406.08020	null
2024-06-12	Automatic detection of large-scale flux ropes and their geoeffectiveness with a machine learning approach	Sanchita Pal et.al.	2406.07798	null
2024-06-11	PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow	Joshua Tokarsky et.al.	2406.07667	null
2024-06-11	Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling	Denis Blessing et.al.	2406.07423	link
2024-06-11	NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images	Yufei Han et.al.	2406.07111	null
2024-06-11	The evolution of coronal shock wave properties and their relation with solar energetic particles	Manon Jarry et.al.	2406.07058	null
2024-06-11	Bridging Language Gaps in Audio-Text Retrieval	Zhiyong Yan et.al.	2406.07012	link
2024-06-11	HPC Alongside User-space Kubernetes	Vanessa Sochat et.al.	2406.06995	null
2024-06-11	Stepwise Regression and Pre-trained Edge for Robust Stereo Matching	Weiqing Xiao et.al.	2406.06953	link
2024-06-10	Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies	Alex DeWeese et.al.	2406.06823	null
2024-06-10	The Legal Duty to Search for Less Discriminatory Algorithms	Emily Black et.al.	2406.06817	null
2024-06-10	Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests	T. Tony Cai et.al.	2406.06749	null
2024-06-10	The largest metallicity difference in twin systems: high-precision abundance analysis of the benchmark pair Krios & Kronos	P. Miquelarena et.al.	2406.06705	null
2024-06-10	Annotation alignment: Comparing LLM and human annotations of conversational safety	Rajiv Movva et.al.	2406.06369	null
2024-06-10	Shoulders of Giants: A Look at the Degree and Utility of Openness in NLP Research	Surangika Ranathunga et.al.	2406.06021	null
2024-06-10	Computational and Statistical Guarantees for Tensor-on-Tensor Regression with Tensor Train Decomposition	Zhen Qin et.al.	2406.06002	null
2024-06-10	Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context	Jingru Jia et.al.	2406.05972	null
2024-06-09	Predictors of the Sense of Presence in an Immersive Audio Storytelling Experience, a Mixed Methods Study. PREPRINT	Isabelle Verhulst et.al.	2406.05856	null
2024-06-09	SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion	Bingsong Bai et.al.	2406.05692	null
2024-06-09	MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations	Hemant Yadav et.al.	2406.05661	null
2024-06-09	Do LLMs Exhibit Human-Like Reasoning? Evaluating Theory of Mind in LLMs for Open-Ended Responses	Maryam Amirizaniani et.al.	2406.05659	null
2024-06-08	I-SIRch: AI-Powered Concept Annotation Tool For Equitable Extraction And Analysis Of Safety Insights From Maternity Investigations	Mohit Kumar Singh et.al.	2406.05505	null
2024-06-08	M3GIA: A Cognition Inspired Multilingual and Multimodal General Intelligence Ability Benchmark	Wei Song et.al.	2406.05343	link
2024-06-07	ProMotion: Prototypes As Motion Learners	Yawen Lu et.al.	2406.04999	null
2024-06-07	On the social bias of speech self-supervised models	Yi-Cheng Lin et.al.	2406.04997	null
2024-06-07	UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection	Yuchao Wang et.al.	2406.04647	null
2024-06-06	Function and form of U.S. cities	Sandro M. Reia et.al.	2406.04543	null
2024-06-06	TexIm FAST: Text-to-Image Representation for Semantic Similarity Evaluation using Transformers	Wazib Ansar et.al.	2406.04438	null
2024-06-06	Stereo-Depth Fusion through Virtual Pattern Projection	Luca Bartolomei et.al.	2406.04345	link
2024-06-06	Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation	Honglei Zhang et.al.	2406.03933	link
2024-06-06	Knowledge Transfer, Knowledge Gaps, and Knowledge Silos in Citation Networks	Eoghan Cunningham et.al.	2406.03921	link
2024-06-06	Transductive Off-policy Proximal Policy Optimization	Yaozhong Gan et.al.	2406.03894	null
2024-06-05	Does the Sun have a Dark Disk?	Gustavo F. S. Alves et.al.	2406.03607	null
2024-06-05	Reconciling Heterogeneous Effects in Causal Inference	Audrey Chang et.al.	2406.03575	null
2024-06-05	MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization	Xiaobo Guo et.al.	2406.03479	null
2024-06-05	A Flexible Recursive Network for Video Stereo Matching Based on Residual Estimation	Youchen Zhao et.al.	2406.03333	link
2024-06-05	On the Maximal Local Disparity of Fairness-Aware Classifiers	Jinqiu Jin et.al.	2406.03255	link
2024-06-05	MMCL: Boosting Deformable DETR-Based Detectors with Multi-Class Min-Margin Contrastive Learning for Superior Prohibited Item Detection	Mingyuan Li et.al.	2406.03176	link
2024-06-05	Instructing Prompt-to-Prompt Generation for Zero-Shot Learning	Man Liu et.al.	2406.03032	null
2024-06-05	GraphAlign: Pretraining One Graph Neural Network on Multiple Graphs via Feature Alignment	Zhenyu Hou et.al.	2406.02953	null
2024-06-04	Building Socially-Equitable Public Models	Yejia Liu et.al.	2406.02790	link
2024-06-04	VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors	Markus Plack et.al.	2406.02552	null
2024-06-04	The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding	Kenneth Enevoldsen et.al.	2406.02396	link
2024-06-04	Layer-2 Arbitrage: An Empirical Analysis of Swap Dynamics and Price Disparities on Rollups	Krzysztof Gogol et.al.	2406.02172	null
2024-06-04	A Multipurpose Interface for Close- and Far-Proximity Control of Mobile Collaborative Robots	Hamidreza Raei et.al.	2406.02171	link
2024-06-05	CondTSF: One-line Plugin of Dataset Condensation for Time Series Forecasting	Jianrong Ding et.al.	2406.02131	link
2024-06-04	Timescale bridging in atomistic simulations of epoxy polymer mechanics using non-affine deformation theory	Vinay Vaibhav et.al.	2406.02113	null
2024-06-03	Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities	Golnoosh Farnadi et.al.	2406.01757	null
2024-06-03	Inverse design of photonic surfaces on Inconel via multi-fidelity machine learning ensemble framework and high throughput femtosecond laser processing	Luka Grbcic et.al.	2406.01471	null
2024-06-03	Structural Interventions and the Dynamics of Inequality	Aurora Zhang et.al.	2406.01323	null
2024-06-03	Bridging the Digital Divide: Mapping Internet Connectivity Evolution, Inequalities, and Resilience in six Brazilian Cities	Nicolò Gozzi et.al.	2406.01113	null
2024-05-31	*Exploratory Preference Optimization: Harnessing Implicit Q-Approximation for Sample-Efficient RLHF**	Tengyang Xie et.al.	2405.21046	null
2024-05-31	GANcrop: A Contrastive Defense Against Backdoor Attacks in Federated Learning	Xiaoyun Gan et.al.	2405.20727	null
2024-05-31	Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation	Shuzhou Yang et.al.	2405.20669	link
2024-05-31	Weak-Form Inference for Hybrid Dynamical Systems in Ecology	Daniel Messenger et.al.	2405.20591	null
2024-05-31	The Point of View of a Sentiment: Towards Clinician Bias Detection in Psychiatric Notes	Alissa A. Valentine et.al.	2405.20582	null
2024-05-30	Impact of Connected and Automated Vehicles on Transport Injustices	Laura Martinez-Buelvas et.al.	2405.20530	null
2024-05-30	Bridging electronic and classical density-functional theory using universal machine-learned functional approximations	Michelle M. Kelley et.al.	2405.20270	null
2024-05-30	Object-centric Reconstruction and Tracking of Dynamic Unknown Objects using 3D Gaussian Splatting	Kuldeep R Barad et.al.	2405.20104	null
2024-05-30	Strategies to Counter Artificial Intelligence in Law Enforcement: Cross-Country Comparison of Citizens in Greece, Italy and Spain	Petra Saskia Bayerl et.al.	2405.19970	null
2024-05-29	X-ray and Radio campaign of the Z-source GX 340+0: discovery of X-ray polarization and its implications	Yash Bhargava et.al.	2405.19324	null
2024-05-29	Measuring and Mitigating Bias for Tabular Datasets with Multiple Protected Attributes	Manh Khoi Duong et.al.	2405.19300	link
2024-05-29	Mitigating Disparate Impact of Differential Privacy in Federated Learning through Robust Clustering	Saber Malekmohammadi et.al.	2405.19272	null
2024-05-29	MAGIC: Modular Auto-encoder for Generalisable Model Inversion with Bias Corrections	Yihang She et.al.	2405.18953	link
2024-05-29	UniPTS: A Unified Framework for Proficient Post-Training Sparsity	Jingjing Xie et.al.	2405.18810	link
2024-05-28	The Efficacy of the Connect America Fund in Addressing US Internet Access Inequities	Haarika Manda et.al.	2405.18657	null
2024-05-28	Aligning in a Compact Space: Contrastive Knowledge Distillation between Heterogeneous Architectures	Hongjun Wu et.al.	2405.18524	null
2024-05-28	Exploring the Evolution of Altruistic Punishment with a PDE Model of Cultural Multilevel Selection	Daniel B. Cooney et.al.	2405.18419	link
2024-05-28	A Calibration Tool for Refractive Underwater Vision	Felix Seegräber et.al.	2405.18018	null
2024-05-28	Cross-Context Backdoor Attacks against Graph Prompt Learning	Xiaoting Lyu et.al.	2405.17984	link
2024-05-28	FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes	Yunsong Wang et.al.	2405.17958	link
2024-05-28	Boosting Protein Language Models with Negative Sample Mining	Yaoyao Xu et.al.	2405.17902	link
2024-05-28	Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection	Yingwen Wu et.al.	2405.17816	null
2024-05-27	A Two-sided Model for EV Market Dynamics and Policy Implications	Haoxuan Ma et.al.	2405.17702	null
2024-05-27	Unifying Perspectives: Plausible Counterfactual Explanations on Global, Group-wise, and Local Levels	Patryk Wielopolski et.al.	2405.17642	null
2024-05-27	MindMerger: Efficient Boosting LLM Reasoning in non-English Languages	Zixian Huang et.al.	2405.17386	link
2024-05-27	EF-Calib: Spatiotemporal Calibration of Event- and Frame-Based Cameras Using Continuous-Time Trajectories	Shaoan Wang et.al.	2405.17278	link
2024-05-27	Highly inhomogeneous interactions between background climate and urban warming across typical local climate zones in heatwave and non-heatwave days	Jing Kong et.al.	2405.17213	null
2024-05-27	SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing	Yong-Qiang Mao et.al.	2405.17140	null
2024-05-27	Multi-view Disparity Estimation Using a Novel Gradient Consistency Model	James L. Gray et.al.	2405.17029	null
2024-05-27	Blind Data Adaptation to tackle Covariate Shift in Operational Steganalysis	Rony Abecidan et.al.	2405.16961	null
2024-05-27	Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models	Fengfan Zhou et.al.	2405.16940	null
2024-05-28	PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting	Zipeng Wang et.al.	2405.16829	null
2024-05-27	Addressing Discretization-Induced Bias in Demographic Prediction	Evan Dong et.al.	2405.16762	link
2024-05-26	Demystify Mamba in Vision: A Linear Attention Perspective	Dongchen Han et.al.	2405.16605	link
2024-05-24	Synthetic high angular momentum spin dynamics in a microwave oscillator	Saswata Roy et.al.	2405.15695	null
2024-05-24	Digital finance, Bargaining Power and Gender Wage Gap	Qing Guo et.al.	2405.15486	null
2024-05-24	Mind the Gap: A Causal Perspective on Bias Amplification in Prediction & Decision-Making	Drago Plecko et.al.	2405.15446	null
2024-05-24	Fairness-Accuracy Trade-Offs: A Causal Perspective	Drago Plecko et.al.	2405.15443	link
2024-05-23	ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization	Han Song et.al.	2405.15082	null
2024-05-23	Modularity, Higher-Order Recombination, and New Venture Success	Likun Cao et.al.	2405.15042	null
2024-05-23	Federated Online Adaptation for Deep Stereo	Matteo Poggi et.al.	2405.14873	null
2024-05-23	An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models	Jiahao Sun et.al.	2405.14870	link
2024-05-23	Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras	Hanzhang Tu et.al.	2405.14866	null
2024-05-23	A Systematic and Formal Study of the Impact of Local Differential Privacy on Fairness: Preliminary Results	Karima Makhlouf et.al.	2405.14725	null
2024-05-23	Is the EJRA proportionate and therefore justified? A critical review of the EJRA policy at Cambridge	Oliver Linton et.al.	2405.14611	null
2024-05-23	Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks	Xingguang Jiang et.al.	2405.14520	null
2024-05-22	Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation	Mykhailo Uss et.al.	2405.14024	null
2024-05-22	CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models	Giada Pistilli et.al.	2405.13974	null
2024-05-22	Multi-Dataset Multi-Task Learning for COVID-19 Prognosis	Filippo Ruffini et.al.	2405.13771	null
2024-05-22	Knowledge-Driven Cross-Document Relation Extraction	Monika Jain et.al.	2405.13546	link

Monocular Depth Estimation

Publish Date	Title	Authors	PDF	Code
2025-07-23	Monocular Semantic Scene Completion via Masked Recurrent Networks	Xuzhi Wang et.al.	2507.17661	null
2025-07-22	SDGOCC: Semantic and Depth-Guided Bird’s-Eye View Transformation for 3D Multimodal Occupancy Prediction	Zaipeng Duan et.al.	2507.17083	null
2025-07-21	DAViD: Data-efficient and Accurate Vision Models from Synthetic Data	Fatemeh Saleh et.al.	2507.15365	null
2025-07-21	BenchDepth: Are We on the Right Way to Evaluate Depth Foundation Models?	Zhenyu Li et.al.	2507.15321	null
2025-07-20	Region-aware Depth Scale Adaptation with Sparse Measurements	Rizhao Fan et.al.	2507.14879	null
2025-07-20	Training Self-Supervised Depth Completion Using Sparse Measurements and a Single Image	Rizhao Fan et.al.	2507.14845	null
2025-07-19	DCHM: Depth-Consistent Human Modeling for Multiview Detection	Jiahao Ma et.al.	2507.14505	null
2025-07-19	Motion Segmentation and Egomotion Estimation from Event-Based Normal Flow	Zhiyuan Hua et.al.	2507.14500	null
2025-07-18	Depth3DLane: Fusing Monocular 3D Lane Detection with Self-Supervised Monocular Depth Estimation	Max van den Hoven et.al.	2507.13857	null
2025-07-18	Augmented Reality in Cultural Heritage: A Dual-Model Pipeline for 3D Artwork Reconstruction	Daniele Pannone et.al.	2507.13719	null
2025-07-17	$π^3$ : Scalable Permutation-Equivariant Visual Geometry Learning	Yifan Wang et.al.	2507.13347	null
2025-07-17	$S^2M^2$ : Scalable Stereo Matching Model for Reliable Depth Estimation	Junhong Min et.al.	2507.13229	null
2025-07-19	SpatialTrackerV2: 3D Point Tracking Made Easy	Yuxi Xiao et.al.	2507.12462	null
2025-07-16	Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios	Van-Hoang-Anh Phan et.al.	2507.12449	null
2025-07-16	Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation	Antonio Finocchiaro et.al.	2507.12292	null
2025-07-15	Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation	Zhen Xu et.al.	2507.11540	null
2025-07-15	MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network	Jianfei Jiang et.al.	2507.11333	null
2025-07-15	Uncertainty Aware Mapping for Vision-Based Underwater Robots	Abhimanyu Bhowmik et.al.	2507.10991	null
2025-07-14	Static or Temporal? Semantic Scene Simplification to Aid Wayfinding in Immersive Simulations of Bionic Vision	Justin M. Kasowski et.al.	2507.10813	null
2025-07-14	Cameras as Relative Positional Encoding	Ruilong Li et.al.	2507.10496	null
2025-07-14	Spatial Lifting for Dense Prediction	Mingzhi Xu et.al.	2507.10222	null
2025-07-13	Prompt2DEM: High-Resolution DEMs for Urban and Open Environments from Global Prompts Using a Monocular Foundation Model	Osher Rafaeli et.al.	2507.09681	null
2025-07-11	ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way	Rajarshi Roy et.al.	2507.08679	null
2025-07-10	An Embedded Real-time Object Alert System for Visually Impaired: A Monocular Depth Estimation based Approach through Computer Vision	Jareen Anjom et.al.	2507.08165	null
2025-07-10	Tree-Mamba: A Tree-Aware Mamba for Underwater Monocular Depth Estimation	Peixian Zhuang et.al.	2507.07687	null
2025-07-10	HOTA: Hierarchical Overlap-Tiling Aggregation for Large-Area 3D Flood Mapping	Wenfeng Jia et.al.	2507.07585	null
2025-07-08	LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures	Seungoh Han et.al.	2507.06109	null
2025-07-14	Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation	Quanzhu Niu et.al.	2507.05948	null
2025-07-07	The Generalization Ridge: Information Flow in Natural Language Generation	Ruidi Chang et.al.	2507.05387	null
2025-07-10	VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting	Juyi Lin et.al.	2507.05116	null
2025-07-07	Estimating Object Physical Properties from RGB-D Vision and Depth Robot Sensors Using Deep Learning	Ricardo Cardoso et.al.	2507.05029	null
2025-07-06	A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields	Aoxiang Fan et.al.	2507.04408	null
2025-07-06	High-Resolution Sustain Pedal Depth Estimation from Piano Audio Across Room Acoustics	Kun Fang et.al.	2507.04230	null
2025-07-03	From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images	Danrong Zhang et.al.	2507.02781	null
2025-07-10	Underwater Monocular Metric Depth Estimation: Real-World Benchmarks and Synthetic Fine-Tuning with Vision Foundation Models	Zijie Cai et.al.	2507.02148	null
2025-07-02	RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather	Yuran Wang et.al.	2507.01653	null
2025-07-02	Depth Anything at Any Condition	Boyuan Sun et.al.	2507.01634	null
2025-07-02	DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation	Yue-Jiang Dong et.al.	2507.01603	null
2025-07-02	Evaluating Robustness of Monocular Depth Estimation with Procedural Scene Perturbations	Jack Nugent et.al.	2507.00981	null
2025-06-30	SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures	Fengyi Jiang et.al.	2507.00209	null
2025-06-30	OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving	Mingqian Ji et.al.	2506.23565	null
2025-06-26	ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation	Shruti Bansal et.al.	2506.20969	null
2025-06-25	THIRDEYE: Cue-Aware Monocular Depth Estimation via Brain-Inspired Multi-Stage Fusion	Calin Teodor Ioan et.al.	2506.20877	null
2025-06-30	StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation	Haodong Li et.al.	2506.20756	null
2025-06-24	Look to Locate: Vision-Based Multisensory Navigation with 3-D Digital Maps for GNSS-Challenged Environments	Ola Elmaghraby et.al.	2506.19827	null
2025-06-23	SOF: Sorted Opacity Fields for Fast Unbounded Surface Reconstruction	Lukas Radl et.al.	2506.19139	null
2025-06-23	BulletGen: Improving 4D Reconstruction with Bullet-Time Generation	Denys Rozumnyi et.al.	2506.18601	null
2025-06-21	Optimization-Free Patch Attack on Stereo Depth Estimation	Hangcheng Liu et.al.	2506.17632	null
2025-06-20	DreamCube: 3D Panorama Generation via Multi-plane Synchronization	Yukun Huang et.al.	2506.17206	null
2025-06-20	RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking	Teng Guo et.al.	2506.17119	link
2025-06-20	Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping	Teng Guo et.al.	2506.17110	null
2025-06-20	DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches	Yun Xing et.al.	2506.16690	null
2025-06-19	EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised Training	Liangjing Shao et.al.	2506.16017	link
2025-06-18	RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation	Xingrui Qin et.al.	2506.15560	null
2025-06-17	Time-Optimized Safe Navigation in Unstructured Environments through Learning Based Depth Completion	Jeffrey Mao et.al.	2506.14975	null
2025-06-17	DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning	Kunal Swami et.al.	2506.14709	null
2025-06-16	Test3R: Learning to Reconstruct 3D at Test Time	Yuheng Yuan et.al.	2506.13750	link
2025-06-16	Multiview Geometric Regularization of Gaussian Splatting for Accurate Radiance Fields	Jungeon Kim et.al.	2506.13508	null
2025-06-17	Self-Supervised Enhancement for Depth from a Lightweight ToF Sensor with Monocular Images	Laiyan Ding et.al.	2506.13444	link
2025-06-16	TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast	Beilei Cui et.al.	2506.13387	link
2025-06-17	3D Hand Mesh-Guided AI-Generated Malformed Hand Refinement with Hand Pose Transformation via Diffusion Model	Chen-Bin Feng et.al.	2506.12680	null
2025-06-12	Leveraging 6DoF Pose Foundation Models For Mapping Marine Sediment Burial	Jerry Yan et.al.	2506.10386	link
2025-06-11	DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects	Guanghu Xie et.al.	2506.09491	null
2025-06-11	MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning	Tong Wang et.al.	2506.09327	null
2025-06-10	AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models	Zheda Mai et.al.	2506.09082	null
2025-06-10	One Patch to Rule Them All: Transforming Static Patches into Dynamic Attacks in the Physical World	Xingshuo Han et.al.	2506.08482	null
2025-06-09	Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence	Octave Mariotti et.al.	2506.08220	null
2025-06-09	Hidden in plain sight: VLMs overlook their visual representations	Stephanie Fu et.al.	2506.08008	null
2025-06-09	EgoM2P: Egocentric Multimodal Multitask Pretraining	Gen Li et.al.	2506.07886	null
2025-06-09	Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images	Yingping Liang et.al.	2506.07740	null
2025-06-07	Dark Channel-Assisted Depth-from-Defocus from a Single Image	Moushumi Medhi et.al.	2506.06643	null
2025-06-06	NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces	Pierluigi Zama Ramirez et.al.	2506.05815	null
2025-06-06	Advancement and Field Evaluation of a Dual-arm Apple Harvesting Robot	Keyi Zhu et.al.	2506.05714	null
2025-06-06	Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration	Fanhu Zeng et.al.	2506.05709	null
2025-06-06	Aerial Multi-View Stereo via Adaptive Depth Range Inference and Normal Cues	Yimei Liu et.al.	2506.05655	null
2025-06-09	Structure-Aware Radar-Camera Depth Estimation	Fuyi Zhang et.al.	2506.05008	null
2025-06-05	Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer	Filip Slezak et.al.	2506.04908	null
2025-06-05	Toward Better SSIM Loss for Unsupervised Monocular Depth Estimation	Yijun Cao et.al.	2506.04758	null
2025-06-04	JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting	Yang Xiao et.al.	2506.03872	null
2025-06-04	Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation	Joonkyung Kim et.al.	2506.03834	null
2025-06-03	ViT-Split: Unleashing the Power of Vision Foundation Models via Efficient Splitting Heads	Yifan Li et.al.	2506.03433	null
2025-06-02	E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models	Wenyan Cong et.al.	2506.01933	null
2025-06-01	Perceptual Inductive Bias Is What You Need Before Contrastive Learning	Tianqin Li et.al.	2506.01201	null
2025-06-01	Depth-Aware Scoring and Hierarchical Alignment for Multiple Object Tracking	Milad Khanchi et.al.	2506.00774	null
2025-05-31	XYZ-IBD: High-precision Bin-picking Dataset for Object 6D Pose Estimation Capturing Real-world Industrial Complexity	Junwen Huang et.al.	2506.00599	null
2025-05-31	Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline	Zhaoying Wang et.al.	2506.00546	null
2025-05-31	Improving Optical Flow and Stereo Depth Estimation by Leveraging Uncertainty-Based Learning Difficulties	Jisoo Jeong et.al.	2506.00324	null
2025-05-30	Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization	Qingyao Tian et.al.	2505.24249	null
2025-05-29	Ultrafast High-Flux Single-Photon LiDAR Simulator via Neural Mapping	Weijian Zhang et.al.	2505.23992	null
2025-05-29	Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation	Sanggyun Ma et.al.	2505.23400	null
2025-05-29	GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion	Gwanghyun Kim et.al.	2505.23085	null
2025-05-28	Depth to magnetic source estimation using TDX contour	Hammed Oyekan et.al.	2505.22780	null
2025-05-28	Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss	Wenjun Lu et.al.	2505.22279	null
2025-05-27	Object Concepts Emerge from Motion	Haoqian Liang et.al.	2505.21635	null
2025-05-23	EvidenceMoE: A Physics-Guided Mixture-of-Experts with Evidential Critics for Advancing Fluorescence Light Detection and Ranging in Scattering Media	Ismail Erbas et.al.	2505.21532	null
2025-05-27	Occlusion Boundary and Depth: Mutual Enhancement via Multi-Task Learning	Lintao Xu et.al.	2505.21231	null
2025-05-27	Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing	Dehao Wang et.al.	2505.21049	null
2025-05-27	Spatial RoboGrasp: Generalized Robotic Grasping Control Policy	Yiqi Huang et.al.	2505.20814	null
2025-05-26	SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams	Zhuoheng Gao et.al.	2505.19487	null
2025-05-25	From Single Images to Motion Policies via Video-Generation Environment Representations	Weiming Zhi et.al.	2505.19306	null
2025-05-23	Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues	Chinmay Talegaonkar et.al.	2505.17358	null
2025-05-22	MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation	Bohan Zhou et.al.	2505.16602	null
2025-05-22	BadDepth: Backdoor Attacks Against Monocular Depth Estimation in the Physical World	Ji Guo et.al.	2505.16154	null
2025-05-21	RadarRGBD A Multi-Sensor Fusion Dataset for Perception with RGB-D and mmWave Radar	Tieshuai Song et.al.	2505.15860	null
2025-05-21	MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models	Yifan Liu et.al.	2505.15185	link
2025-05-20	Diving into the Fusion of Monocular Priors for Generalized Stereo Matching	Chengtang Yao et.al.	2505.14414	link
2025-05-20	M3Depth: Wavelet-Enhanced Depth Estimation on Mars via Mutual Boosting of Dual-Modal Data	Junjie Li et.al.	2505.14159	null
2025-05-20	Multi-Label Stereo Matching for Transparent Scene Depth Estimation	Zhidan Liu et.al.	2505.14008	link
2025-05-20	Event-Driven Dynamic Scene Depth Completion	Zhiqiang Yan et.al.	2505.13279	null
2025-05-19	DB3D-L: Depth-aware BEV Feature Transformation for Accurate 3D Lane Detection	Yehao Liu et.al.	2505.13266	null
2025-05-20	3D Visual Illusion Depth Estimation	Chengtang Yao et.al.	2505.13061	link
2025-05-19	IA-MVS: Instance-Focused Adaptive Depth Sampling for Multi-View Stereo	Yinzhe Wang et.al.	2505.12714	null
2025-05-18	Depth Transfer: Learning to See Like a Simulator for Real-World Drone Navigation	Hang Yu et.al.	2505.12428	null
2025-05-18	Always Clear Depth: Robust Monocular Depth Estimation under Adverse Weather	Kui Jiang et.al.	2505.12199	link
2025-05-17	SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations	Songchun Zhang et.al.	2505.11992	null
2025-05-17	MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos	Hongyi Zhou et.al.	2505.11868	null
2025-05-16	SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision	Utsav Rai et.al.	2505.11439	null
2025-05-16	Attention on the Sphere	Boris Bonev et.al.	2505.11157	link
2025-05-15	Depth Anything with Any Prior	Zehan Wang et.al.	2505.10565	null
2025-05-15	JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation	Tiancong Cheng et.al.	2505.10057	null
2025-05-14	Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis	Bingxin Ke et.al.	2505.09358	link
2025-05-13	Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World	Yuran Wang et.al.	2505.08607	null
2025-05-13	Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images	Ziteng Liu et.al.	2505.08178	null
2025-05-12	Some insights into depth estimators for location and scatter in the multivariate setting	Jorge G. Adrover et.al.	2505.07383	null
2025-05-11	Reinforcement Learning-Based Monocular Vision Approach for Autonomous UAV Landing	Tarik Houichime et.al.	2505.06963	null
2025-05-10	ElectricSight: 3D Hazard Monitoring for Power Lines Using Low-Cost Sensors	Xingchen Li et.al.	2505.06573	null
2025-05-09	Camera-Only Bird’s Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles	Anupkumar Bochare et.al.	2505.06113	null
2025-05-09	MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection	Zhihao Zhang et.al.	2505.04594	null
2025-05-13	Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach	Srecharan Selvam et.al.	2505.03702	null
2025-05-06	LiftFeat: 3D Geometry-Aware Local Feature Matching	Yepeng Liu et.al.	2505.03422	link
2025-05-06	VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery	Bojin Wu et.al.	2505.02704	link
2025-05-05	DELTA: Dense Depth from Events and LiDAR using Transformer’s Attention	Vincent Brebion et.al.	2505.02593	null
2025-05-03	PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth	Bu Jin et.al.	2505.01729	null
2025-05-02	LMDepth: Lightweight Mamba-based Monocular Depth Estimation for Real-World Deployment	Jiahuan Long et.al.	2505.00980	null
2025-05-01	JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers	Kwon Byung-Ki et.al.	2505.00482	link
2025-04-30	HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation	Haiyang Zhou et.al.	2504.21650	link
2025-04-30	eNCApsulate: NCA for Precision Diagnosis on Capsule Endoscopes	Henry John Krumb et.al.	2504.21562	null
2025-04-29	Real-Time Wayfinding Assistant for Blind and Low-Vision Users	Dabbrata Das et.al.	2504.20976	null
2025-04-29	Large-scale visual SLAM for in-the-wild videos	Shuo Sun et.al.	2504.20496	null
2025-04-28	MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion	Zador Pataki et.al.	2504.20040	link
2025-04-28	Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video	Hoang Chuong Nguyen et.al.	2504.19819	null
2025-04-27	Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection	Athul M. Mathew et.al.	2504.19271	null
2025-04-26	Depth as Points: Center Point-based Depth Estimation	Zhiheng Tu et.al.	2504.18773	null
2025-04-25	LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning	Rui Li et.al.	2504.18424	null
2025-04-25	Dense Geometry Supervision for Underwater Depth Estimation	Wenxiang Gua et.al.	2504.18233	null
2025-04-25	LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring	Raul David Dominguez Sanchez et.al.	2504.18203	null
2025-04-24	The Fourth Monocular Depth Estimation Challenge	Anton Obukhov et.al.	2504.17787	null
2025-04-24	Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images	Zebo Huang et.al.	2504.17582	null
2025-04-24	Invasion depth estimation of gastric cancer in early stage using circularly polarized light scattering: Phantom studies	Mike R. Maskey et.al.	2504.17161	null
2025-04-23	PPS-Ctrl: Controllable Sim-to-Real Translation for Colonoscopy Depth Estimation	Xinqi Xiong et.al.	2504.17067	null
2025-04-23	Helping Blind People Grasp: Enhancing a Tactile Bracelet with an Automated Hand Navigation System	Marcin Furtak et.al.	2504.16502	null
2025-04-21	MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation	Xingxing Zuo et.al.	2504.16127	null
2025-04-22	DERD-Net: Learning Depth from Event-based Ray Densities	Diego de Oliveira Hitzges et.al.	2504.15863	null
2025-04-22	VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation	Mingxia Zhan et.al.	2504.15095	null
2025-04-21	Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation	Chenjie Cao et.al.	2504.14899	link
2025-04-20	Seurat: From Moving Points to Depth	Seokju Cho et.al.	2504.14687	link
2025-04-18	Occlusion-Ordered Semantic Instance Segmentation	Soroosh Baselizadeh et.al.	2504.14054	null
2025-04-18	Enhancing Pothole Detection and Characterization: Integrated Segmentation and Depth Estimation in Road Anomaly Systems	Uthman Baroudi et.al.	2504.13648	null
2025-04-17	Perception Encoder: The best visual embeddings are not at the output of the network	Daniel Bolya et.al.	2504.13181	null
2025-04-17	TSGS: Improving Gaussian Splatting for Transparent Surface Reconstruction via Normal and De-lighting Priors	Mingwei Li et.al.	2504.12799	null
2025-04-17	Privacy-Preserving Operating Room Workflow Analysis using Digital Twins	Alejandra Perez et.al.	2504.12552	null
2025-04-16	Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image	Tao Wen et.al.	2504.12103	null
2025-04-16	TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion	Yiran Wang et.al.	2504.11773	null
2025-04-16	An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World	Xingwu Ji et.al.	2504.11698	link
2025-04-15	Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception	Ziqi Pang et.al.	2504.11457	link
2025-04-16	DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation	Soyoung Yoo et.al.	2504.11347	null
2025-04-18	Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting	Jiaxin Huang et.al.	2504.11092	null
2025-04-13	TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting	Zhicong Wu et.al.	2504.09588	null
2025-04-12	Text To 3D Object Generation For Scalable Room Assembly	Sonia Laguna et.al.	2504.09328	null
2025-04-11	Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation	Bram Vanherle et.al.	2504.08473	link
2025-04-10	Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction	Zeren Jiang et.al.	2504.07961	link
2025-04-09	FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution	Gene Chou et.al.	2504.07093	link
2025-04-08	POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction	Songyan Zhang et.al.	2504.05692	link
2025-04-07	Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and Semidensification	Yasuhiro Yao et.al.	2504.05148	link
2025-04-04	3D Scene Understanding Through Local Random Access Sequence Modeling	Wanhee Lee et.al.	2504.03875	null
2025-04-04	RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation	Hanbo Bi et.al.	2504.03166	null
2025-04-03	All-day Depth Completion via Thermal-LiDAR Fusion	Janghyun Kim et.al.	2504.02356	null
2025-04-02	FreSca: Unveiling the Scaling Space in Diffusion Models	Chao Huang et.al.	2504.02154	null
2025-04-02	Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis	Niluthpol Chowdhury Mithun et.al.	2504.01960	null
2025-04-03	Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting	Shu-Wei Lu et.al.	2504.01957	null
2025-04-02	A novel gesture interaction control method for rehabilitation lower extremity exoskeleton	Shuang Qiu et.al.	2504.01888	null
2025-04-02	DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image	Jijun Xiang et.al.	2504.01596	link
2025-04-01	GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors	Tian-Xing Xu et.al.	2504.01016	null
2025-04-01	Monocular and Generalizable Gaussian Talking Head Animation	Shengjie Gong et.al.	2504.00665	null
2025-03-31	ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image	Tianyi Gong et.al.	2503.23881	null
2025-03-31	Detail-aware multi-view stereo network for depth estimation	Haitao Tian et.al.	2503.23684	null
2025-03-30	Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries	Wei Xu et.al.	2503.23606	null
2025-03-30	Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model	Jannik Endres et.al.	2503.23502	link
2025-03-28	SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations	Krispin Wandel et.al.	2503.22462	null
2025-03-28	EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting	Xu Wang et.al.	2503.22437	link
2025-03-28	MVSAnywhere: Zero-Shot Multi-View Stereo	Sergio Izquierdo et.al.	2503.22430	null
2025-03-28	One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images	Byeongjun Kwon et.al.	2503.22351	null
2025-03-28	Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces	Wonhyeok Choi et.al.	2503.22209	null
2025-03-28	Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges	Ukcheol Shin et.al.	2503.22060	link
2025-03-27	A Unified Image-Dense Annotation Generation Model for Underwater Scenes	Hongkai Lin et.al.	2503.21771	link
2025-03-27	ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo	Yuxi Hu et.al.	2503.21525	null
2025-03-26	Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors	Weilong Yan et.al.	2503.20211	link
2025-03-26	FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion	Pihai Sun et.al.	2503.19739	link
2025-03-25	Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving	Yusen Xie et.al.	2503.19713	link
2025-03-25	StableGS: A Floater-Free Framework for 3D Gaussian Splatting	Luchao Wang et.al.	2503.18458	null
2025-03-24	PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes	Xinhua Xu et.al.	2503.18393	null
2025-03-24	MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction	Wenyuan Zhang et.al.	2503.18363	null
2025-03-23	Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images	Yara AlaaEldin et.al.	2503.17982	link
2025-03-21	Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image	Jerred Chen et.al.	2503.17358	null
2025-03-21	Radar-Guided Polynomial Fitting for Metric Depth Estimation	Patrick Rim et.al.	2503.17182	null
2025-03-21	AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process	Junjie Hu et.al.	2503.17029	null
2025-03-21	Distilling Monocular Foundation Model for Fine-grained Depth Completion	Yingping Liang et.al.	2503.16970	null
2025-03-20	QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge	Xuan Shen et.al.	2503.16709	link
2025-03-20	A Recipe for Generating 3D Worlds From a Single Image	Katja Schwarz et.al.	2503.16611	null
2025-03-20	DreamTexture: Shape from Virtual Texture with Analysis by Augmentation	Ananta R. Bhattarai et.al.	2503.16412	null
2025-03-20	Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors	Tian Yi Lim et.al.	2503.16275	null
2025-03-20	Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras	Beilei Cui et.al.	2503.15917	null
2025-03-20	Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation	Jiyuan Wang et.al.	2503.15905	null
2025-03-19	TULIP: Towards Unified Language-Image Pretraining	Zineng Tang et.al.	2503.15485	null
2025-03-19	EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining	Boshen Xu et.al.	2503.15470	link
2025-03-19	USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network	Joseph Emmanuel DL Dayo et.al.	2503.14950	null
2025-03-18	Multi-view Reconstruction via SfM-guided Monocular Depth Estimation	Haoyu Guo et.al.	2503.14483	null
2025-03-18	DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers	Mert Bulent Sariyildiz et.al.	2503.14405	null
2025-03-18	3D Densification for Multi-Map Monocular VSLAM in Endoscopy	X. Anadón et.al.	2503.14346	null
2025-03-17	MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models	Johannes Meier et.al.	2503.13743	null
2025-03-17	SED-MVS: Segmentation-Driven and Edge-Aligned Deformation Multi-View Stereo with Depth Restoration and Occlusion Constraint	Zhenlong Yuan et.al.	2503.13721	null
2025-03-17	Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios	Iryna Repinetska et.al.	2503.13710	null
2025-03-19	FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis	Luxi Chen et.al.	2503.13265	null
2025-03-17	MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs	Erik Daxberger et.al.	2503.13111	null
2025-03-17	TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image	Haoxiao Wang et.al.	2503.12779	null
2025-03-16	UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing	Tsu-Jui Fu et.al.	2503.12652	null
2025-03-16	Deblur Gaussian Splatting SLAM	Francesco Girlanda et.al.	2503.12572	null
2025-03-16	Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View	Xianzu Wu et.al.	2503.12553	link
2025-03-14	VGGT: Visual Geometry Grounded Transformer	Jianyuan Wang et.al.	2503.11651	link
2025-03-14	Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation	Hongyu Wen et.al.	2503.11633	null
2025-03-14	Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation	Fengchen He et.al.	2503.11213	link
2025-03-13	Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations	Xunzhi Zheng et.al.	2503.10464	null
2025-03-15	WonderVerse: Extendable 3D Scene Generation with Video Generative Models	Hao Feng et.al.	2503.09160	null
2025-03-11	Language-Depth Navigated Thermal and Visible Image Fusion	Jinchang Zhang et.al.	2503.08676	null
2025-03-11	CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning	Kaiqiang Xiong et.al.	2503.08219	null
2025-03-10	SIRE: SE(3) Intrinsic Rigidity Embeddings	Cameron Smith et.al.	2503.07739	null
2025-03-10	LBM: Latent Bridge Matching for Fast Image-to-Image Translation	Clément Chadebec et.al.	2503.07535	link
2025-03-12	Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion	Mona Sheikh Zeinoddin et.al.	2503.07204	null
2025-03-11	LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation	Quanjian Song et.al.	2503.06508	link
2025-03-08	Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity	Xiaohao Xu et.al.	2503.06014	link
2025-03-07	TomatoScanner: phenotyping tomato fruit based on only RGB image	Xiaobei Zhao et.al.	2503.05568	link
2025-03-07	Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects	Justin Yu et.al.	2503.05189	null
2025-03-05	RTFusion: A depth estimation network based on multimodal fusion in challenging scenarios	Zelin Meng et.al.	2503.04821	null
2025-03-06	A Novel Solution for Drone Photogrammetry with Low-overlap Aerial Images using Monocular Depth Estimation	Jiageng Zhong et.al.	2503.04513	null
2025-03-08	EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images	Rohit Menon et.al.	2503.04441	null
2025-03-06	H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision	Yunxiao Shi et.al.	2503.04059	null
2025-03-05	Task-Agnostic Attacks Against Vision Foundation Models	Brian Pulfer et.al.	2503.03842	link
2025-03-05	Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings	Xusheng Du et.al.	2503.03068	null
2025-03-04	RGBSQGrasp: Inferring Local Superquadric Primitives from Single RGB Image for Graspability-Aware Bin Picking	Yifeng Xu et.al.	2503.02387	null
2025-03-03	MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Yohann Cabon et.al.	2503.01661	link
2025-03-02	Bridging Spectral-wise and Multi-spectral Depth Estimation via Geometry-guided Contrastive Learning	Ukcheol Shin et.al.	2503.00793	link
2025-02-28	EndoPBR: Material and Lighting Estimation for Photorealistic Surgical Simulations via Physically-based Rendering	John J. Han et.al.	2502.20669	null
2025-02-27	UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler	Luigi Piccinelli et.al.	2502.20110	link
2025-02-26	Stellar Models Also Limit Exoplanet Atmosphere Studies in Emission	Thomas J. Fauchez et.al.	2502.19585	null
2025-02-26	Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator	Xiankang He et.al.	2502.19204	link
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932	null
2025-02-19	Physical Depth-aware Early Accident Anticipation: A Multi-dimensional Visual Feature Fusion Framework	Hongpu Huang et.al.	2502.18496	null
2025-02-21	RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes	Sicheng Yu et.al.	2502.15633	null
2025-02-20	CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting	Qilin Zhang et.al.	2502.14684	link
2025-03-03	Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion	Jiangyuan Liu et.al.	2502.14616	link
2025-02-20	Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining	Wonhyeok Choi et.al.	2502.14573	null
2025-02-20	OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images	Zhichao Zheng et.al.	2502.14279	null
2025-02-18	Pre-training Auto-regressive Robotic Models with 4D Representations	Dantong Niu et.al.	2502.13142	null
2025-02-18	SHADeS: Self-supervised Monocular Depth Estimation Through Non-Lambertian Image Decomposition	Rema Daher et.al.	2502.12994	link
2025-02-17	Deep Neural Networks for Accurate Depth Estimation with Latent Space Features	Siddiqui Muhammad Yasir et.al.	2502.11777	null
2025-02-16	Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation	Kunal Swami et.al.	2502.11002	null
2025-02-14	ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences	Liyuan Zhu et.al.	2502.10377	null
2025-02-14	RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control	Teng Li et.al.	2502.10059	null
2025-02-13	SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest	Jack Erhardt et.al.	2502.09528	null
2025-02-17	S $^2$ -Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation	Quantao Yang et.al.	2502.09389	null
2025-02-13	CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery	Chenghao Zhang et.al.	2502.08902	null
2025-02-13	Visual-based spatial audio generation system for multi-speaker environments	Xiaojing Liu et.al.	2502.07538	null
2025-02-11	Learning Inverse Laplacian Pyramid for Progressive Depth Completion	Kun Wang et.al.	2502.07289	null
2025-02-10	From Image to Video: An Empirical Study of Diffusion Representations	Pedro Vélez et.al.	2502.07001	null
2025-02-09	Revisiting Gradient-based Uncertainty for Monocular Depth Estimation	Julia Hornauer et.al.	2502.05964	null
2025-02-09	SphereFusion: Efficient Panorama Depth Estimation via Gated Fusion	Qingsong Yan et.al.	2502.05859	null
2025-02-05	MetaFE-DE: Learning Meta Feature Embedding for Depth Estimation from Monocular Endoscopic Images	Dawei Lu et.al.	2502.03493	null
2025-02-04	DOC-Depth: A novel approach for dense depth ground truth generation	Simon de Moreau et.al.	2502.02144	null
2025-02-01	Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding	Jingming Xia et.al.	2502.01666	null
2025-02-01	Exploring Representation-Aligned Latent Space for Better Generation	Wanghan Xu et.al.	2502.00359	null
2025-02-01	MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model	Jihyeok Kim et.al.	2502.00315	null
2025-01-30	Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion	Vitor Guizilini et.al.	2501.18804	null
2025-01-25	Snapshot Compressed Imaging Based Single-Measurement Computer Vision for Videos	Fengpu Pan et.al.	2501.15122	null
2025-01-24	Rethinking Encoder-Decoder Flow Through Shared Structures	Frederik Laboyrie et.al.	2501.14535	null
2025-01-23	IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models	Jiayi Lei et.al.	2501.13920	null
2025-01-23	PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments	Changhao Wang et.al.	2501.13796	null
2025-01-22	Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation	Akshay Krishnan et.al.	2501.13087	null
2025-01-22	Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks	Alessio Quercia et.al.	2501.12824	link
2025-01-22	Video Depth Anything: Consistent Depth Estimation for Super-Long Videos	Sili Chen et.al.	2501.12375	null
2025-01-21	Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging	Shuyi Hu et.al.	2501.11884	null
2025-01-21	Survey on Monocular Metric Depth Estimation	Jiuling Zhang et.al.	2501.11841	null
2025-01-19	RDG-GS: Relative Depth Guidance with Gaussian Splatting for Real-time Sparse-View 3D Rendering	Chenlu Zhan et.al.	2501.11102	null
2025-01-15	BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation	Xiaolu Hou et.al.	2501.10462	link
2025-01-20	Zero-Shot Monocular Scene Flow Estimation in the Wild	Yiqing Liang et.al.	2501.10357	null
2025-01-17	One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression	Keita Miwa et.al.	2501.10064	null
2025-01-17	Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography	Mohammed Salah et.al.	2501.09994	link
2025-01-21	FoundationStereo: Zero-Shot Stereo Matching	Bowen Wen et.al.	2501.09898	link
2025-01-16	DEFOM-Stereo: Depth Foundation Model Based Stereo Matching	Hualie Jiang et.al.	2501.09466	link
2025-01-15	StereoGen: High-quality Stereo Image Generation from a Single Image	Xianqi Wang et.al.	2501.08654	null
2025-01-15	MonSter: Marry Monodepth to Stereo Unleashes Power	Junda Cheng et.al.	2501.08643	link
2025-01-14	A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation	Steven Landgraf et.al.	2501.08188	null
2025-01-14	Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2	Seamie Hayes et.al.	2501.08118	null
2025-01-13	Fixing the Scale and Shift in Monocular Depth For Camera Pose Estimation	Yaqing Ding et.al.	2501.07742	link
2025-01-13	Matching Free Depth Recovery from Structured Light	Zhuohang Yu et.al.	2501.07113	null
2025-01-09	Relative Pose Estimation through Affine Corrections of Monocular Depth Priors	Yifan Yu et.al.	2501.05446	link
2025-01-09	*$DPF^$ : improved Depth Potential Function for scale-invariant sulcal depth estimation**	Maxime Dieudonné et.al.	2501.05436	link
2025-01-09	A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision	Ali Rohan et.al.	2501.05147	null
2025-01-08	FatesGS: Fast and Accurate Sparse-View Surface Reconstruction using Gaussian Splatting with Depth-Feature Consistency	Han Huang et.al.	2501.04628	null
2025-01-08	FrontierNet: Learning Visual Cues to Explore	Boyang Sun et.al.	2501.04597	link
2025-01-07	AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features	Ruochen Zhang et.al.	2501.03700	null
2025-01-05	DepthMaster: Taming Diffusion Models for Monocular Depth Estimation	Ziyang Song et.al.	2501.02576	link
2025-01-05	Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera	Yuliang Guo et.al.	2501.02464	link
2025-01-03	SafeAug: Safety-Critical Driving Data Augmentation from Naturalistic Datasets	Zhaobin Mo et.al.	2501.02143	null
2025-01-03	Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery	Baoru Huang et.al.	2501.01752	null
2025-01-03	IGAF: Incremental Guided Attention Fusion for Depth Super-Resolution	Athanasios Tragakis et.al.	2501.01723	null
2024-12-31	Tech Report: Divide and Conquer 3D Real-Time Reconstruction for Improved IGS	Yicheng Zhu et.al.	2501.01465	null
2025-01-02	TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions	Vriksha Srihari et.al.	2501.01156	null
2025-01-02	PatchRefiner V2: Fast and Lightweight Real-Domain High-Resolution Metric Depth Estimation	Zhenyu Li et.al.	2501.01121	null
2024-12-30	FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI	Zhengdong Li et.al.	2412.20974	null
2024-12-29	MetricDepth: Enhancing Monocular Depth Estimation with Deep Metric Learning	Chunpu Liu et.al.	2412.20390	null
2024-12-28	Multi-Modality Driven LoRA for Adverse Condition Depth Estimation	Guanglei Yang et.al.	2412.20162	null
2024-12-28	DepthMamba with Adaptive Fusion	Zelin Meng et.al.	2412.19964	null
2024-12-26	An End-to-End Depth-Based Pipeline for Selfie Image Rectification	Ahmed Alhawwary et.al.	2412.19189	null
2024-12-26	Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement	Qiude Zhang et.al.	2412.19165	null
2024-12-26	MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo	Byeonggwon Lee et.al.	2412.19130	null
2024-12-26	Learning Monocular Depth from Events via Egomotion Compensation	Haitao Meng et.al.	2412.19067	null
2024-12-24	RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis	Yiling Yao et.al.	2412.18380	null
2024-12-23	V $^2$ -SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy	Long Bai et.al.	2412.17595	null
2024-12-22	GeoTexDensifier: Geometry-Texture-Aware Densification for High-Quality Photorealistic 3D Gaussian Splatting	Hanqing Jiang et.al.	2412.16809	null
2024-12-27	LiRCDepth: Lightweight Radar-Camera Depth Estimation via Knowledge Distillation and Uncertainty Guidance	Huawei Sun et.al.	2412.16380	link
2024-12-19	Flowing from Words to Pixels: A Framework for Cross-Modality Evolution	Qihao Liu et.al.	2412.15213	null
2024-12-19	Scaling 4D Representations	João Carreira et.al.	2412.15212	null
2024-12-18	Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation	Rémi Marsal et.al.	2412.14103	null
2024-12-18	Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation	Haotong Lin et.al.	2412.14015	link
2024-12-18	Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion	Massimiliano Viola et.al.	2412.13389	null
2024-12-18	Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera	Zhengdi Yu et.al.	2412.12861	null
2024-12-17	PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts	Kun Guo et.al.	2412.12460	null
2024-12-16	V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations	Jin-Cheng Jhang et.al.	2412.11412	null
2024-12-16	Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video	Junkai Fan et.al.	2412.11395	null
2024-12-15	ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction	Yi Feng et.al.	2412.11210	link
2024-12-14	MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance	Wenjun Huang et.al.	2412.10730	null
2024-12-12	Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos	Linyi Jin et.al.	2412.09621	null
2024-12-12	T-SVG: Text-Driven Stereoscopic Video Generation	Qiao Jin et.al.	2412.09323	null
2024-12-12	Cross-View Completion Models are Zero-shot Correspondence Estimators	Honggyu An et.al.	2412.09072	null
2024-12-11	BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation	Shengze Wang et.al.	2412.08640	null
2024-12-13	Utilizing Multi-step Loss for Single Image Reflection Removal	Abdelrahman Elnenaey et.al.	2412.08582	link
2024-12-11	Combining Neural Fields and Deformation Models for Non-Rigid 3D Motion Reconstruction from Partial Data	Aymen Merrouche et.al.	2412.08511	null
2024-12-11	Dense Depth from Event Focal Stack	Kenta Horikawa et.al.	2412.08120	null
2024-12-10	Diffusion-Based Attention Warping for Consistent 3D Scene Editing	Eyal Gomel et.al.	2412.07984	null
2024-12-10	Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation	Kurt H. W. Stolle et.al.	2412.07966	null
2024-12-09	SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception	Yaniv Benny et.al.	2412.06968	null
2024-12-09	Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving	Xin Fei et.al.	2412.06777	link
2024-12-09	MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views	Antoine Guédon et.al.	2412.06767	null
2024-12-09	On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events	Jesse Hagenaars et.al.	2412.06359	null
2024-12-09	Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction	Dongxu Wei et.al.	2412.06273	null
2024-12-09	Event fields: Capturing light fields at high speed, resolution, and dynamic range	Ziyuan Qu et.al.	2412.06191	null
2024-12-08	GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion	Karlo Koledic et.al.	2412.06080	null
2024-12-08	Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors	Alex Rich et.al.	2412.05771	null
2024-12-10	TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action	Zixian Ma et.al.	2412.05479	link
2024-12-06	SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB Images	Jiahua Dong et.al.	2412.05274	null
2024-12-06	Penetrative rotating magnetoconvection subject to lateral variations in temperature gradients	Tirtharaj Barman et.al.	2412.05235	null
2024-12-06	PanoDreamer: 3D Panorama Synthesis from a Single Image	Avinash Paliwal et.al.	2412.04827	link
2024-12-05	LAA-Net: A Physical-prior-knowledge Based Network for Robust Nighttime Depth Estimation	Kebin Peng et.al.	2412.04666	null
2024-12-05	Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail	Luca Bartolomei et.al.	2412.04472	link
2024-12-05	MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Zhengqi Li et.al.	2412.04463	null
2024-12-05	MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction	Mithun Parab et.al.	2412.03928	null
2024-12-04	Perception Tokens Enhance Visual Reasoning in Multimodal Language Models	Mahtab Bigverdi et.al.	2412.03548	null
2024-12-04	Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter	Hermes McGriff et.al.	2412.03518	null
2024-12-04	2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction	Wanting Zhang et.al.	2412.03428	null
2024-12-04	MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction	Gangjian Zhang et.al.	2412.03103	null
2024-12-05	Align3R: Aligned Monocular Depth Estimation for Dynamic Videos	Jiahao Lu et.al.	2412.03079	null
2024-12-03	Single-Shot Metric Depth from Focused Plenoptic Cameras	Blanca Lasheras-Hernandez et.al.	2412.02386	null
2024-12-03	Dual Exposure Stereo for Extended Dynamic Range 3D Imaging	Juhyung Choi et.al.	2412.02351	null
2024-12-03	Amodal Depth Anything: Amodal Depth Estimation in the Wild	Zhenyu Li et.al.	2412.02336	null
2024-12-03	GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos	Zhiyuan Chen et.al.	2412.02267	null
2024-12-03	FoveaSPAD: Exploiting Depth Priors for Adaptive and Efficient Single-Photon 3D Imaging	Justin Folden et.al.	2412.02052	null
2024-12-02	Mutli-View 3D Reconstruction using Knowledge Distillation	Aditya Dutt et.al.	2412.02039	link
2024-12-02	AVS-Net: Audio-Visual Scale Net for Self-supervised Monocular Metric Depth Estimation	Xiaohu Liu et.al.	2412.01637	null
2024-12-02	STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation	Sunghun Yang et.al.	2412.01090	null
2024-12-01	FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation	Yunpeng Bai et.al.	2412.00671	null
2024-11-29	SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection	Philipp Wolters et.al.	2411.19860	null
2024-11-29	MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications	Gasser Elazab et.al.	2411.19717	null
2024-11-29	Gaussian Splashing: Direct Volumetric Rendering Underwater	Nir Mualem et.al.	2411.19588	null
2024-11-28	Learning Surrogate Rainfall-driven Inundation Models with Few Data	Marzieh Alireza Mirhoseini et.al.	2411.19323	null
2024-11-28	AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones	Xuqian Ren et.al.	2411.19271	null
2024-11-28	Video Depth without Video Models	Bingxin Ke et.al.	2411.19189	null
2024-11-28	360Recon: An Accurate Reconstruction Method Based on Depth Fusion from 360 Images	Zhongmiao Yan et.al.	2411.19102	null
2024-11-27	Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation	Mehdi Zayene et.al.	2411.18335	link
2024-11-27	GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation	Wenbo Cui et.al.	2411.18276	null
2024-11-27	SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation	Duc-Hai Pham et.al.	2411.18229	null
2024-11-26	Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation	Sudarshan Rajagopalan et.al.	2411.17814	null
2024-11-26	Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors	Ziang Xu et.al.	2411.17790	null
2024-11-26	DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting	Christian Homeyer et.al.	2411.17660	link
2024-11-26	Spatially Visual Perception for End-to-End Robotic Learning	Travis Davies et.al.	2411.17458	null
2024-11-26	DepthCues: Evaluating Monocular Depth Perception in Large Vision Models	Duolikun Danier et.al.	2411.17385	null
2024-11-26	Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration	Junyuan Deng et.al.	2411.17240	link
2024-11-25	G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs	Kunyi Li et.al.	2411.16898	null
2024-11-24	PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation	Ziyao Zeng et.al.	2411.16750	null
2024-11-25	Generative Omnimatte: Learning to Decompose Video into Layers	Yao-Chih Lee et.al.	2411.16683	null
2024-11-25	One Diffusion to Generate Them All	Duong H. Le et.al.	2411.16318	link
2024-11-24	Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors	Soumava Paul et.al.	2411.15966	null
2024-11-21	StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart	Jian Shi et.al.	2411.14295	link
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291	null
2024-11-20	OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging	Rajini Makam et.al.	2411.13230	link
2024-11-15	SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction	Yutao Tang et.al.	2411.12592	link
2024-11-18	Towards Degradation-Robust Reconstruction in Generalizable NeRF	Chan Ho Park et.al.	2411.11691	null
2024-11-18	MGNiceNet: Unified Monocular Geometric Scene Understanding	Markus Schön et.al.	2411.11466	null
2024-11-18	The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather	Markus Schön et.al.	2411.11455	null
2024-11-18	GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views	Boyao Zhou et.al.	2411.11363	null
2024-11-18	Scalable Autoregressive Monocular Depth Estimation	Jinhong Wang et.al.	2411.11361	null
2024-11-16	MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation	Ansh Shah et.al.	2411.10886	link
2024-11-19	EVT: Efficient View Transformation for Multi-Modal 3D Object Detection	Yongjin Lee et.al.	2411.10715	null
2024-11-15	Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses	Yongfan Liu et.al.	2411.10013	link
2024-11-14	Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting	Yian Wang et.al.	2411.09823	null
2024-11-14	Adversarial Attacks Using Differentiable Rendering: A Survey	Matthew Hull et.al.	2411.09749	null
2024-11-14	Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching	Yuran Wang et.al.	2411.09151	null
2024-11-13	OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances	Youqi Liao et.al.	2411.08665	link
2024-11-09	Online Collision Risk Estimation via Monocular Depth-Aware Object Detectors and Fuzzy Inference	Brian Hsuan-Cheng Liao et.al.	2411.08060	null
2024-11-13	Scaling Properties of Diffusion Models for Perceptual Tasks	Rahul Ravishankar et.al.	2411.08034	null
2024-11-11	$SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation	Yinshuang Xu et.al.	2411.07326	null
2024-11-08	Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning	Quang Truong Nguyen et.al.	2411.05344	null
2024-11-08	SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection	Yun Zhao et.al.	2411.05292	null
2024-11-07	D $^3$ epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes	Siyu Chen et.al.	2411.04826	null
2024-11-06	Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation	Teppei Kurita et.al.	2411.04714	null
2024-11-07	Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation	Qingyao Tian et.al.	2411.04404	null
2024-11-04	PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes	Kebin Peng et.al.	2411.04227	null
2024-11-06	Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions	Zihan Qin et.al.	2411.03638	null
2024-11-05	Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor	Anish Bhattacharya et.al.	2411.03303	null
2024-11-05	Correlation of Object Detection Performance with Visual Saliency and Depth Estimation	Matthias Bartolo et.al.	2411.02844	link
2024-11-05	FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training	Ruihong Yin et.al.	2411.02229	null
2024-11-05	Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training	Yuanqi Yao et.al.	2411.02149	null
2024-11-02	MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction	Wang Zhao et.al.	2411.01226	link
2024-11-01	MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes	Sanghyun Byun et.al.	2411.01048	null
2024-11-01	On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR	Li Li et.al.	2411.00600	link
2024-10-31	Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving	Ce Zhou et.al.	2411.00192	null
2024-10-31	ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images	Timing Yang et.al.	2410.24001	link
2024-10-30	Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe	Songyu Xu et.al.	2410.23154	null
2024-10-29	Active Event Alignment for Monocular Distance Estimation	Nan Cai et.al.	2410.22280	null
2024-10-29	PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting	Sunghwan Hong et.al.	2410.22128	link
2024-10-27	Unlocking Comics: The AI4VA Dataset for Visual Understanding	Peter Grönquist et.al.	2410.20459	link
2024-10-27	Depth Attention for Robust RGB Tracking	Yu Liu et.al.	2410.20395	link
2024-10-21	YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning	Ranjan Sapkota et.al.	2410.19846	null
2024-10-25	MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors	Fanqi Pu et.al.	2410.19590	link
2024-10-24	Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction	Hongxin Peng et.al.	2410.18433	null
2024-10-24	Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images	Dong-Guw Lee et.al.	2410.18340	link
2024-10-25	UnCLe: Unsupervised Continual Learning of Depth Completion	Suchisrit Gangopadhyay et.al.	2410.18074	null
2024-10-21	TIPS: Text-Image Pretraining with Spatial Awareness	Kevis-Kokitsi Maninis et.al.	2410.16512	null
2024-10-22	DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain	Kun Wang et.al.	2410.14980	link
2024-10-17	DepthSplat: Connecting Gaussian Splatting and Depth	Haofei Xu et.al.	2410.13862	link
2024-10-16	DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning	Jiabao Wei et.al.	2410.12501	null
2024-10-16	Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture	Dabbrata Das et.al.	2410.11610	link
2024-10-16	CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction	Pranav Gupta et.al.	2410.11211	link
2024-10-14	Few-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting	Raja Kumar et.al.	2410.11080	link
2024-10-14	When Does Perceptual Alignment Benefit Vision Representations?	Shobhita Sundaram et.al.	2410.10817	null
2024-10-14	Depth Any Video with Scalable Synthetic Data	Honghui Yang et.al.	2410.10815	link
2024-10-15	Improved Depth Estimation of Bayesian Neural Networks	Bart van Erp et.al.	2410.10395	link
2024-10-10	Color-Guided Flying Pixel Correction in Depth Images	Ekamresh Vasudevan et.al.	2410.08084	link
2024-10-09	Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models	Ange Lou et.al.	2410.07434	null
2024-10-09	Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation	Runze Chen et.al.	2410.06982	null
2024-10-09	Analysis of different disparity estimation techniques on aerial stereo image datasets	Ishan Narayan et.al.	2410.06711	null
2024-10-08	Vision Transformer based Random Walk for Group Re-Identification	Guoqing Zhang et.al.	2410.05808	null
2024-10-08	CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality	Wenjie Chang et.al.	2410.05735	null
2024-10-07	PhotoReg: Photometrically Registering 3D Gaussian Splatting Models	Ziwen Yuan et.al.	2410.05044	null
2024-10-06	Mode-GS: Monocular Depth Guided Anchored 3D Gaussian Splatting for Robust Ground-View Scene Rendering	Yonghan Lee et.al.	2410.04646	null
2024-10-10	Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy	Pengcheng Chen et.al.	2410.04041	null
2024-10-04	Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering	Laura Fink et.al.	2410.03861	link
2024-10-03	DecTrain: Deciding When to Train a DNN Online	Zih-Sing Fu et.al.	2410.02980	null
2024-10-03	RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions	Ziyao Zeng et.al.	2410.02924	link
2024-10-02	Depth Pro: Sharp Monocular Metric Depth in Less Than a Second	Aleksei Bochkovskii et.al.	2410.02073	link
2024-10-02	Learning from the Giants: A Practical Approach to Underwater Depth and Surface Normals Estimation	Alzayat Saleh et.al.	2410.02072	null
2024-10-02	SinkSAM: A Monocular Depth-Guided SAM Framework for Automatic Sinkhole Segmentation	Osher Rafaeli et.al.	2410.01473	link
2024-10-01	Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation	Shuting Zhao et.al.	2410.00979	null
2024-10-01	Radar Meets Vision: Robustifying Monocular Metric Depth Prediction for Mobile Robotics	Marco Job et.al.	2410.00736	null
2024-10-01	Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration	Yida Lin et.al.	2410.00503	null
2024-10-01	Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance	Hongchao Shu et.al.	2410.00386	null
2024-09-30	CCDepth: A Lightweight Self-supervised Depth Estimation Network with Enhanced Interpretability	Xi Zhang et.al.	2409.19933	null
2024-09-30	EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth Prediction	Ivan Reyes-Amezcua et.al.	2409.19930	link
2024-09-29	fCOP: Focal Length Estimation from Category-level Object Priors	Xinyue Zhang et.al.	2409.19641	null
2024-09-29	KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation	Soofiyan Atar et.al.	2409.19490	null
2024-09-27	Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping	Anthony A. Song et.al.	2409.19153	null
2024-09-26	Self-supervised Monocular Depth Estimation with Large Kernel Attention	Xuezhi Xiang et.al.	2409.17895	null
2024-09-26	Self-Distilled Depth Refinement with Noisy Poisson Fusion	Jiaqi Li et.al.	2409.17880	link
2024-09-27	A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts	Aurel Pjetri et.al.	2409.17851	null
2024-09-26	Event-based Stereo Depth Estimation: A Survey	Suman Ghosh et.al.	2409.17680	null
2024-09-26	CAMOT: Camera Angle-aware Multi-Object Tracking	Felix Limanta et.al.	2409.17533	null
2024-09-25	Optical Lens Attack on Deep Learning Based Monocular Depth Estimation	Ce Zhou et.al.	2409.17376	null
2024-09-25	Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation	Richard D. Paul et.al.	2409.17085	null
2024-09-25	EventHDR: from Event to High-Speed HDR Videos and Beyond	Yunhao Zou et.al.	2409.17029	null
2024-09-25	3DDX: Bone Surface Reconstruction from a Single Standard-Geometry Radiograph via Dual-Face Depth Estimation	Yi Gu et.al.	2409.16702	link
2024-09-24	MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling	Yifang Men et.al.	2409.16160	null
2024-09-24	Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data	An Wang et.al.	2409.16063	link
2024-09-23	FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera	Guoyang Zhao et.al.	2409.15054	link
2024-09-23	DepthART: Monocular Depth Estimation as Autoregressive Refinement Task	Bulat Gabdullin et.al.	2409.15010	null
2024-09-23	Generalizing monocular colonoscopy image depth estimation by uncertainty-based global and local fusion network	Sijia Du et.al.	2409.15006	null
2024-09-23	GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth	Aurélien Cecille et.al.	2409.14850	link
2024-09-23	Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras	Ming Li et.al.	2409.14766	null
2024-09-25	D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation	Songlin Wei et.al.	2409.14365	null
2024-09-22	MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views	Wangze Xu et.al.	2409.14316	null
2024-09-21	@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology	Xin Jiang et.al.	2409.14215	null
2024-09-18	Panoptic-Depth Forecasting	Juana Valeria Hurtado et.al.	2409.12008	null
2024-09-17	Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think	Gonzalo Martin Garcia et.al.	2409.11355	link
2024-09-15	GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion	Vitor Guizilini et.al.	2409.09896	null
2024-09-15	Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation	Xiaolong Qian et.al.	2409.09754	link
2024-09-13	PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage	Denis Zavadski et.al.	2409.09144	link
2024-09-23	Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding	Rania Hossam et.al.	2409.08695	link
2024-09-12	Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor	Andrea Conti et.al.	2409.08277	null
2024-09-12	LED: Light Enhanced Depth Estimation at Night	Simon de Moreau et.al.	2409.08031	link
2024-09-12	Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes	Ming Li et.al.	2409.07843	null
2024-09-12	Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy	Bojian Li et.al.	2409.07723	null
2024-09-12	FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments	Devansh Dhrafani et.al.	2409.07715	null
2024-09-10	Deep Neural Networks: Multi-Classification and Universal Approximation	Martín Hernández et.al.	2409.06555	null
2024-09-10	EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Nischal Khanal et.al.	2409.06183	link
2024-09-11	EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels	Qingyao Tian et.al.	2409.05442	link
2024-09-09	Spontaneous magnetic field and disorder effects in BaPtAs_1-x_Sb_x_ with honeycomb network	T. Adachi et.al.	2409.05266	null
2024-09-08	TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs	Horatiu Florea et.al.	2409.05142	null
2024-09-12	Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective	Tim Bader et.al.	2409.04086	link
2024-09-08	Estimating Indoor Scene Depth Maps from Ultrasonic Echoes	Junpei Honma et.al.	2409.03336	null
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-02	GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling	Huawei Sun et.al.	2409.02720	link
2024-09-04	Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects	Kyungmin Jo et.al.	2409.02653	null
2024-09-04	UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching	Soomin Kim et.al.	2409.02545	null
2024-09-04	SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction	Sumin Son et.al.	2409.02513	null
2024-09-04	Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation	Li Liu et.al.	2409.02494	link
2024-09-04	Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization	Cho-Ying Wu et.al.	2409.02486	null
2024-09-04	GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving	Huasong Han et.al.	2409.02382	null
2024-09-03	DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos	Wenbo Hu et.al.	2409.02095	link
2024-09-02	Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling	Haicheng Liao et.al.	2409.01256	null
2024-08-30	DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model	Mona Sheikh Zeinoddin et.al.	2408.17433	link
2024-08-30	Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method	Yuji Lin et.al.	2408.17339	link
2024-08-30	Synthetic Lunar Terrain: A Multimodal Open Dataset for Training and Evaluating Neuromorphic Vision Algorithms	Marcus Märtens et.al.	2408.16971	null
2024-08-29	EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More	Kanghao Chen et.al.	2408.16254	null
2024-08-30	Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective	Zhijie Shen et.al.	2408.16227	link
2024-08-27	Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack	Naufal Suryanto et.al.	2408.14879	link
2024-08-26	NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training	Albert Luginov et.al.	2408.14177	link
2024-08-26	Pixel-Aligned Multi-View Generation with Depth Guided Decoder	Zhenggang Tang et.al.	2408.14016	null
2024-08-25	TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers	Chuanrui Zhang et.al.	2408.13770	null
2024-08-25	InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth	Cho-Ying Wu et.al.	2408.13708	null
2024-08-25	SeeBelow: Sub-dermal 3D Reconstruction of Tumors with Surgical Robotic Palpation and Tactile Exploration	Raghava Uppuluri et.al.	2408.13699	null
2024-08-27	Sapiens: Foundation for Human Vision Models	Rawal Khirodkar et.al.	2408.12569	null
2024-08-21	LiFCal: Online Light Field Camera Calibration via Bundle Adjustment	Aymeric Fleith et.al.	2408.11682	null
2024-08-19	Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video	Shuxian Wang et.al.	2408.10153	link
2024-08-19	SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition	Wiktor Mucha et.al.	2408.10037	link
2024-08-19	P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders	Xuechao Chen et.al.	2408.10007	link
2024-08-14	Enhanced Scale-aware Depth Estimation for Monocular Endoscopic Scenes with Geometric Modeling	Ruofeng Wei et.al.	2408.07266	null
2024-08-12	Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces	Junrui Zhang et.al.	2408.06083	null
2024-08-08	Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation	Daniele Rege Cambrin et.al.	2408.04523	link
2024-08-08	Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework	Subhasis Dasgupta et.al.	2408.04360	null
2024-08-08	Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform	Daniel Vargas et.al.	2408.04195	null
2024-08-07	Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach	Benedikt W. Hosp et.al.	2408.03591	null
2024-08-06	BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications	G. Manni et.al.	2408.03078	link
2024-08-05	Gaussian Mixture based Evidential Learning for Stereo Matching	Weide Liu et.al.	2408.02796	null
2024-08-05	Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining	Dongyang Liu et.al.	2408.02657	link
2024-08-03	MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas	Feng Qiao et.al.	2408.01653	null
2024-08-02	Self-Supervised Depth Estimation Based on Camera Models	Jinchang Zhang et.al.	2408.01565	null
2024-08-01	MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection	Youjia Fu et.al.	2408.00438	null
2024-08-01	High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior	Wencheng Han et.al.	2408.00361	null
2024-08-01	LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting	Zhenyu Bao et.al.	2408.00254	null
2024-07-31	Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching	Pengjie Zhang et.al.	2407.21735	null
2024-07-29	BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation	Kieran Saunders et.al.	2407.20437	null
2024-07-29	Analysis and Improvement of Rank-Ordered Mean Algorithm in Single-Photon LiDAR	William C. Yau et.al.	2407.20399	null
2024-07-29	Improving 2D Feature Representations by 3D-Aware Fine-Tuning	Yuanwen Yue et.al.	2407.20229	null
2024-07-27	Revisit Self-supervised Depth Estimation with Local Structure-from-Motion	Shengjie Zhu et.al.	2407.19166	null
2024-07-27	RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry	Shengjie Zhu et.al.	2407.19154	null
2024-07-26	HybridDepth: Robust Depth Fusion for Mobile AR by Leveraging Depth from Focus and Single-Image Priors	Ashkan Ganj et.al.	2407.18443	link
2024-07-26	Enhanced Depth Estimation and 3D Geometry Reconstruction using Bayesian Helmholtz Stereopsis with Belief Propagation	Razieh Azizi et.al.	2407.18195	null
2024-07-25	BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation	Xiang Zhang et.al.	2407.17952	null
2024-07-25	UMono: Physical Model Informed Hybrid CNN-Transformer Framework for Underwater Monocular Depth Estimation	Jian Wang et.al.	2407.17838	null
2024-07-24	DarSwin-Unet: Distortion Aware Encoder-Decoder Architecture	Akshaya Athwale et.al.	2407.17328	null
2024-07-24	Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches	Chenxing Zhao et.al.	2407.17312	null
2024-07-23	SINDER: Repairing the Singular Defects of DINOv2	Haoqi Wang et.al.	2407.16826	link
2024-07-23	Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions	Fabio Tosi et.al.	2407.16698	link
2024-07-23	ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation	Zhenhua Wu et.al.	2407.16508	null
2024-07-19	Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation	Jinfeng Liu et.al.	2407.14126	link
2024-07-18	Unveiling the purely young star formation history of the SMC’s northeastern shell from colour-magnitude diagram fitting	Joanna D. Sakowska et.al.	2407.13876	null
2024-07-18	Many Perception Tasks are Highly Redundant Functions of their Input Data	Rahul Ramesh et.al.	2407.13841	null
2024-07-18	Shape of Motion: 4D Reconstruction from a Single Video	Qianqian Wang et.al.	2407.13764	null
2024-07-18	Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks	Antoni Kowalczuk et.al.	2407.12588	link
2024-07-16	Temporally Consistent Stereo Matching	Jiaxi Zeng et.al.	2407.11950	link
2024-07-15	IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation	Yuanhao Zhai et.al.	2407.10937	link
2024-07-15	OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection	Jinghua Hou et.al.	2407.10753	link
2024-07-15	Towards Scale-Aware Full Surround Monodepth with Transformers	Yuchen Yang et.al.	2407.10406	null
2024-07-12	ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion	Sungmin Woo et.al.	2407.09303	link
2024-07-11	ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation	Ruijie Zhu et.al.	2407.08187	link
2024-07-10	Controlling Space and Time with Diffusion Models	Daniel Watson et.al.	2407.07860	null
2024-07-07	SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning	Yi Feng et.al.	2407.05283	link
2024-07-05	A Physical Model-Guided Framework for Underwater Image Enhancement and Depth Estimation	Dazhao Du et.al.	2407.04230	link
2024-07-04	Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation	Laiyan Ding et.al.	2407.04041	link
2024-07-02	Parametric Modeling and Estimation of Photon Registrations for 3D Imaging	Weijian Zhang et.al.	2407.02712	null
2024-07-02	Depth-Aware Endoscopic Video Inpainting	Francis Xiatian Zhang et.al.	2407.02675	link
2024-07-04	Camera-LiDAR Cross-modality Gait Recognition	Wenxuan Guo et.al.	2407.02038	null
2024-07-07	CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation	Huawei Sun et.al.	2407.00697	link
2024-06-28	Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey	Uchitha Rajapaksha et.al.	2406.19675	null
2024-06-27	What Matters in Detecting AI-Generated Videos like Sora?	Chirui Chang et.al.	2406.19568	null
2024-07-05	360 in the Wild: Dataset for Depth Prediction and View Synthesis	Kibaek Park et.al.	2406.18898	null
2024-06-27	Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach	Yuxiang Huang et.al.	2406.18837	null
2024-06-26	MultiDiff: Consistent Novel View Synthesis from a Single Image	Norman Müller et.al.	2406.18524	null
2024-06-26	DoubleTake: Geometry Guided Depth Estimation	Mohamed Sayed et.al.	2406.18387	null
2024-06-25	Depth-Guided Semi-Supervised Instance Segmentation	Xin Chen et.al.	2406.17413	null
2024-06-20	Uncertainty and Self-Supervision in Single-View Depth	Javier Rodriguez-Puigvert et.al.	2406.14226	null
2024-06-19	WaterMono: Teacher-Guided Anomaly Masking and Enhancement Boosting for Robust Underwater Self-Supervised Monocular Depth Estimation	Yilin Ding et.al.	2406.13344	link
2024-06-18	Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation	Ning-Hsu Wang et.al.	2406.12849	null
2024-06-21	GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models	Yongtao Ge et.al.	2406.12671	link
2024-06-17	DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features	Letian Wang et.al.	2406.12095	null
2024-06-17	MEDeA: Multi-view Efficient Depth Adjustment	Mikhail Artemyev et.al.	2406.12048	null
2024-06-16	Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry	Boris Chidlovskii et.al.	2406.11019	null
2024-06-16	3D Gaze Tracking for Studying Collaborative Interactions in Mixed-Reality Environments	Eduardo Davalos et.al.	2406.11003	null
2024-06-15	GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR	Bharat Singh et.al.	2406.10722	null
2024-06-14	The BabyView dataset: High-resolution egocentric videos of infants’ and young children’s everyday experiences	Bria Long et.al.	2406.10447	null
2024-06-14	D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video	Moritz Kappel et.al.	2406.10078	null
2024-06-14	DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications	Li Li et.al.	2406.10068	link
2024-06-14	Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion	Runze Liu et.al.	2406.09782	null
2024-06-13	Depth Anything V2	Lihe Yang et.al.	2406.09414	link
2024-06-14	WonderWorld: Interactive 3D Scene Generation from a Single Image	Hong-Xing Yu et.al.	2406.09394	null
2024-06-13	Scale-Invariant Monocular Depth Estimation via SSI Depth	S. Mahdi H. Miangoleh et.al.	2406.09374	link
2024-06-13	Multiple Prior Representation Learning for Self-Supervised Monocular Depth Estimation via Hybrid Transformer	Guodong Sun et.al.	2406.08928	link
2024-06-13	ToSA: Token Selective Attention for Efficient Vision Transformers	Manish Kumar Singh et.al.	2406.08816	null
2024-06-11	Back to the Color: Learning Depth to Specific Color Transformation for Unsupervised Depth Estimation	Yufan Zhu et.al.	2406.07741	link
2024-06-11	PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow	Joshua Tokarsky et.al.	2406.07667	null
2024-06-11	RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks	Zhechao Wang et.al.	2406.07032	null
2024-06-10	PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation	Zhenyu Li et.al.	2406.06679	null
2024-06-10	Visual-Inertial SLAM as Simple as A, B, VINS	Nathaniel Merrill et.al.	2406.05969	null
2024-06-09	Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks	Zhiyuan Cheng et.al.	2406.05857	link
2024-06-09	RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering	Rui Zhang et.al.	2406.05852	null
2024-06-07	Normal-guided Detail-Preserving Neural Implicit Functions for High-Fidelity 3D Surface Reconstruction	Aarya Patel et.al.	2406.04861	null
2024-06-07	UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection	Yuchao Wang et.al.	2406.04647	null
2024-06-06	MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation	Ionuţ Grigore et.al.	2406.04532	null
2024-06-06	Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image	Stanislaw Szymanowicz et.al.	2406.04343	link
2024-06-06	Neural Surface Reconstruction from Sparse Views Using Epipolar Geometry	Kaichen Zhou et.al.	2406.04301	null
2024-06-04	VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors	Markus Plack et.al.	2406.02552	null
2024-06-03	L-MAGIC: Language Model Assisted Generation of Images with Coherence	Zhipeng Cai et.al.	2406.01843	link
2024-06-04	Learning Temporally Consistent Video Depth from Video Diffusion Priors	Jiahao Shao et.al.	2406.01493	null
2024-06-03	Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Takayuki Kanai et.al.	2406.00929	null
2024-06-01	MoDGS: Dynamic Gaussian Splatting from Causually-captured Monocular Videos	Qingming Liu et.al.	2406.00434	null
2024-05-30	Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian	Wei Sun et.al.	2405.19657	null
2024-05-28	Hybrid Multi-Head Physics-informed Neural Network for Depth Estimation in Terahertz Imaging	Mingjun Xiang et.al.	2405.18317	null
2024-05-27	Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation	Amir El-Ghoussani et.al.	2405.17704	link
2024-05-27	Benchmarking and Improving Bird’s Eye View Perception Robustness in Autonomous Driving	Shaoyuan Xie et.al.	2405.17426	link
2024-05-27	All-day Depth Completion	Vadim Ezhov et.al.	2405.17315	null
2024-05-27	GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping	Junyoung Seo et.al.	2405.17251	link
2024-05-27	SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing	Yong-Qiang Mao et.al.	2405.17140	null
2024-05-27	DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge	Yifan Mao et.al.	2405.17102	null
2024-05-27	Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation	Steven Landgraf et.al.	2405.17097	null
2024-05-27	DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation	Mengtan Zhang et.al.	2405.16960	link
2024-05-27	ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection	Ziying Song et.al.	2405.16873	null
2024-05-27	Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations	Jingguo Liu et.al.	2405.16858	null
2024-05-26	Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians	Erik Sandström et.al.	2405.16544	link
2024-05-24	Transparent Object Depth Completion	Yifan Zhou et.al.	2405.15299	null
2024-05-24	MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method	Pan Liao et.al.	2405.15176	null
2024-05-23	EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting	Jiaxu Wang et.al.	2405.14959	link
2024-05-23	Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks	Xingguang Jiang et.al.	2405.14520	null
2024-05-23	MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes	Ruiyuan Gao et.al.	2405.14475	null
2024-05-23	Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning	Zhenyu Wei et.al.	2405.14195	null
2024-05-21	Cross-spectral Gated-RGB Stereo Depth Estimation	Samuel Brucker et.al.	2405.12759	null
2024-05-20	Depth Reconstruction with Neural Signed Distance Fields in Structured Light Systems	Rukun Qiao et.al.	2405.12006	null
2024-05-20	Depth Prompting for Sensor-Agnostic Depth Estimation	Jin-Hwi Park et.al.	2405.11867	null
2024-05-19	CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs	Zidong Cao et.al.	2405.11564	null
2024-05-18	Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models	Madhu Vankadari et.al.	2405.11158	link
2024-05-17	FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation	Fei Wang et.al.	2405.10885	link
2024-05-17	Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory	Jonas Kälble et.al.	2405.10575	link
2024-05-16	Towards Task-Compatible Compressible Representations	Anderson de Andrade et.al.	2405.10244	link
2024-05-16	KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment	Zhengxu Shi et.al.	2405.09964	null
2024-05-14	CLIP with Quality Captions: A Strong Pretraining for Vision Tasks	Pavan Kumar Anasosalu Vasu et.al.	2405.08911	null
2024-05-14	The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition	Lingdong Kong et.al.	2405.08816	null
2024-05-14	EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera	Beilei Cui et.al.	2405.08672	link
2024-05-13	SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling	Yijun Yuan et.al.	2405.07847	null
2024-05-11	TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization	Zhen Tan et.al.	2405.07027	link
2024-05-11	Learning Monocular Depth from Focus with Event Focal Stack	Chenxu Jiang et.al.	2405.06944	null

Optical flow

Publish Date	Title	Authors	PDF	Code
2025-07-20	Systole-Conditioned Generative Cardiac Motion	Shahar Zuler et.al.	2507.15894	null
2025-07-23	EndoControlMag: Robust Endoscopic Vascular Motion Magnification with Periodic Reference Resetting and Hierarchical Tissue-aware Dual-Mask Contro	An Wang et.al.	2507.15292	null
2025-07-19	Motion Segmentation and Egomotion Estimation from Event-Based Normal Flow	Zhiyuan Hua et.al.	2507.14500	null
2025-07-18	DUSTrack: Semi-automated point tracking in ultrasound videos	Praneeth Namburi et.al.	2507.14368	null
2025-07-18	Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation	Masahiro Ogawa et.al.	2507.13628	null
2025-07-17	Latent Policy Steering with Embodiment-Agnostic Pretrained World Models	Yiqi Wang et.al.	2507.13340	null
2025-07-17	Channel-wise Motion Features for Efficient Motion Segmentation	Riku Inoue et.al.	2507.13082	null
2025-07-16	Understanding visual attention beehind bee-inspired UAV navigation	Pranav Rajbhandari et.al.	2507.11992	null
2025-07-14	Well-posedness of an optical flow based optimal control formulation for image registration	Johannes Haubner et.al.	2507.10188	null
2025-07-14	Taming Modern Point Tracking for Speckle Tracking Echocardiography via Impartial Motion	Md Abulkalam Azad et.al.	2507.10127	null
2025-07-11	Taming generative video models for zero-shot optical flow extraction	Seungwoo Kim et.al.	2507.09082	null
2025-07-11	An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan	Mengyuan Liu et.al.	2507.08690	null
2025-07-11	PanMatch: Unleashing the Potential of Large Vision Models for Unified Matching Models	Yongjian Zhang et.al.	2507.08400	null
2025-07-11	MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion	Jihao Gu et.al.	2507.08344	null
2025-07-10	X-RAFT: Cross-Modal Non-Rigid Registration of Blue and White Light Neurosurgical Hyperspectral Images	Charlie Budd et.al.	2507.07747	null
2025-07-09	mmFlux: Crowd Flow Analytics with Commodity mmWave MIMO Radar	Anurag Pallaprolu et.al.	2507.07331	null
2025-07-08	Learning to Track Any Points from Human Motion	Inès Hyeonsu Kim et.al.	2507.06233	null
2025-07-07	MoDiT: Learning Highly Consistent 3D Motion Coefficients with Diffusion Transformer for Talking Head Generation	Yucheng Wang et.al.	2507.05092	null
2025-07-07	TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation	Zonglin Lyu et.al.	2507.04984	null
2025-07-10	MCFormer: A Multi-Cost-Volume Network and Comprehensive Benchmark for Particle Image Velocimetry	Zicheng Lin et.al.	2507.04750	null
2025-07-06	FB-Diff: Fourier Basis-guided Diffusion for Temporal Interpolation of 4D Medical Imaging	Xin You et.al.	2507.04547	null
2025-07-03	Flow-CDNet: A Novel Network for Detecting Both Slow and Fast Changes in Bitemporal Images	Haoxuan Li et.al.	2507.02307	null
2025-07-01	TRACE: Temporally Reliable Anatomically-Conditioned 3D CT Generation with Enhanced Efficiency	Minye Shao et.al.	2507.00802	null
2025-07-01	DIJE: Dense Image Jacobian Estimation for Robust Robotic Self-Recognition and Visual Servoing	Yasunori Toshimitsu et.al.	2507.00446	null
2025-06-30	C3VDv2 – Colonoscopy 3D video dataset with enhanced realism	Mayank V. Golhar et.al.	2506.24074	null
2025-07-03	PriOr-Flow: Enhancing Primitive Panoramic Optical Flow with Orthogonal View	Longliang Liu et.al.	2506.23897	null
2025-06-30	Proteus-ID: ID-Consistent and Motion-Coherent Video Customization	Guiyu Zhang et.al.	2506.23729	null
2025-06-29	MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation	Vladislav Bargatin et.al.	2506.23151	null
2025-06-26	WAFT: Warping-Alone Field Transforms for Optical Flow	Yihan Wang et.al.	2506.21526	null
2025-06-26	EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting	Taoyu Wu et.al.	2506.21420	null
2025-06-25	Feature Hallucination for Self-supervised Action Recognition	Lei Wang et.al.	2506.20342	null
2025-06-24	Online camera-pose-free stereo endoscopic tissue deformation recovery with tissue-invariant vision-biomechanics consistency	Jiahe Chen et.al.	2506.19388	null
2025-06-23	Flow-Aware Diffusion for Real-Time VR Restoration: Enhancing Spatiotemporal Coherence and Efficiency	Yitong Zhu et.al.	2506.18786	null
2025-06-24	Multimodal Fusion SLAM with Fourier Attention	Youjie Zhou et.al.	2506.18204	null
2025-06-19	EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised Training	Liangjing Shao et.al.	2506.16017	link
2025-06-17	MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution	Zhiwen Shao et.al.	2506.14511	link
2025-06-21	Inference-Time Gaze Refinement for Micro-Expression Recognition: Enhancing Event-Based Eye Tracking with Motion-Aware Post-Processing	Nuwan Bandara et.al.	2506.12524	link
2025-06-13	MambaVSR: Content-Aware Scanning State Space Model for Video Super-Resolution	Linfeng He et.al.	2506.11768	null
2025-06-12	Post-Training Quantization for Video Matting	Tianrui Zhu et.al.	2506.10840	null
2025-06-10	UFM: A Simple Path towards Unified Dense Correspondence with Flow	Yuchen Zhang et.al.	2506.09278	null
2025-06-10	Princeton365: A Diverse Dataset with Accurate Camera Pose	Karhan Kayan et.al.	2506.09035	null
2025-06-09	Spatio-Temporal State Space Model For Efficient Event-Based Optical Flow	Muhammad Ahmed Humais et.al.	2506.07878	link
2025-06-09	Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images	Yingping Liang et.al.	2506.07740	null
2025-06-13	Consistent Video Editing as Flow-Driven Image-to-Video Generation	Ge Wang et.al.	2506.07713	null
2025-06-08	AllTracker: Efficient Dense Point Tracking at High Resolution	Adam W. Harley et.al.	2506.07310	null
2025-06-08	GoTrack: Generic 6DoF Object Pose Refinement and Tracking	Van Nguyen Nguyen et.al.	2506.07155	null
2025-06-07	EV-LayerSegNet: Self-supervised Motion Segmentation using Event Cameras	Youssef Farah et.al.	2506.06596	null
2025-06-06	3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model	Hongyan Zhi et.al.	2506.06199	link
2025-06-06	Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments	Mingrui Li et.al.	2506.05965	null
2025-06-05	DualX-VSR: Dual Axial Spatial $\times$ Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation	Shuo Cao et.al.	2506.04830	null
2025-06-04	JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting	Yang Xiao et.al.	2506.03872	null
2025-06-04	EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation	Daikun Liu et.al.	2506.03512	null
2025-06-03	Learning Optical Flow Field via Neural Ordinary Differential Equation	Leyla Mirvakhabova et.al.	2506.03290	null
2025-06-03	LinkTo-Anime: A 2D Animation Optical Flow Dataset from 3D Model Rendering	Xiaoyi Feng et.al.	2506.02733	null
2025-06-03	LumosFlow: Motion-Guided Long Video Generation	Jiahao Chen et.al.	2506.02497	null
2025-06-02	MS-RAFT-3D: A Multi-Scale Architecture for Recurrent Image-Based Scene Flow	Jakob Schmid et.al.	2506.01443	null
2025-06-01	MOOSE: Pay Attention to Temporal Dynamics for Video Understanding via Optical Flows	Hong Nguyen et.al.	2506.01119	null
2025-05-31	Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline	Zhaoying Wang et.al.	2506.00546	null
2025-05-31	Improving Optical Flow and Stereo Depth Estimation by Leveraging Uncertainty-Based Learning Difficulties	Jisoo Jeong et.al.	2506.00324	null
2025-05-30	Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction	Chenyou Fan et.al.	2505.24156	null
2025-05-29	Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing	Tongtong Su et.al.	2505.23134	link
2025-05-27	Object Concepts Emerge from Motion	Haoqian Liang et.al.	2505.21635	null
2025-05-26	A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking	Zixiang Zhao et.al.	2505.19858	null
2025-05-23	Brightness-Invariant Tracking Estimation in Tagged MRI	Zhangxing Bian et.al.	2505.18365	null
2025-05-31	CTRL-GS: Cascaded Temporal Residue Learning for 4D Gaussian Splatting	Karly Hou et.al.	2505.18306	null
2025-05-23	Real-time Traffic Accident Anticipation with Feature Reuse	Inpyo Song et.al.	2505.17449	null
2025-05-22	Efficient Correlation Volume Sampling for Ultra-High-Resolution Optical Flow Estimation	Karlis Martins Briedis et.al.	2505.16942	null
2025-05-22	V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation	Hanyue Lou et.al.	2505.16797	link
2025-05-21	SENSE – Sensor-Enhanced Neural Shear Stress Estimation for Quantitative Oilfilm Visualizations	Lennart Rohlfs et.al.	2505.15697	null
2025-05-19	RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers	Ahmet Berke Gokmen et.al.	2505.13344	null
2025-05-19	eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks	Jad Mansour et.al.	2505.13309	null
2025-05-19	FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching	Alp Eren Sari et.al.	2505.13174	null
2025-05-19	Just Dance with $π$ ! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection	Snehashis Majhi et.al.	2505.13123	null
2025-05-17	MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos	Hongyi Zhou et.al.	2505.11868	null
2025-05-16	Planar Velocity Estimation for Fast-Moving Mobile Robots Using Event-Based Optical Flow	Liam Boyle et.al.	2505.11116	null
2025-05-15	TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation	Manthan Patel et.al.	2505.10696	null
2025-05-15	A label-free sub-diffractive technique for 3D intracellular tomography using thermally induced convection currents	Jayesh Goswami et.al.	2505.10112	null
2025-05-14	FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling	Yue Wen et.al.	2505.09406	null
2025-05-14	RobustSpring: Benchmarking Robustness to Image Corruptions for Optical Flow, Scene Flow and Stereo	Jenny Schmalfuss et.al.	2505.09368	null
2025-05-13	Reinforcement Learning meets Masked Video Modeling : Trajectory-Guided Adaptive Token Selection	Ayush K. Rai et.al.	2505.08561	null
2025-05-13	TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection	Wenkui Yang et.al.	2505.08437	link
2025-05-13	EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation	Hanle Zheng et.al.	2505.08235	null
2025-05-13	Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images	Ziteng Liu et.al.	2505.08178	null
2025-05-12	Asynchronous Multi-Object Tracking with an Event Camera	Angus Apps et.al.	2505.08126	link
2025-05-11	MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception	Zhengye Zhang et.al.	2505.07007	link
2025-05-13	Detection of Moving Objects Using Self-motion Constraints on Optic Flow	Hope Lutwak et.al.	2505.06686	null
2025-05-08	Nonlinear Motion-Guided and Spatio-Temporal Aware Network for Unsupervised Event-Based Optical Flow	Zuntao Liu et.al.	2505.05089	null
2025-05-08	A Simple Detector with Frame Dynamics is a Strong Tracker	Chenxu Peng et.al.	2505.04917	link
2025-05-06	Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment	João Alves et.al.	2505.03554	link
2025-05-06	TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion	Haoyue Liu et.al.	2505.03116	null
2025-05-04	Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance	Yingkai Zhang et.al.	2505.02109	null
2025-05-02	Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation	Zhen Yao et.al.	2505.01548	link
2025-04-30	AnimalMotionCLIP: Embedding motion in CLIP for Animal Behavior Analysis	Enmin Zhong et.al.	2505.00569	null
2025-04-29	LPVIMO-SAM: Tightly-coupled LiDAR/Polarization Vision/Inertial/Magnetometer/Optical Flow Odometry via Smoothing and Mapping	Derui Shan et.al.	2504.20380	null
2025-04-25	RapidPIV: Full Flow-Field kHz PIV for Real-Time Display and Control	Scott A. Bollt et.al.	2504.17987	null
2025-04-22	Motion-Enhanced Nonlocal Similarity Implicit Neural Representation for Infrared Dim and Small Target Detection	Pei Liu et.al.	2504.15665	null
2025-04-22	DiTPainter: Efficient Video Inpainting with Diffusion Transformers	Xian Wu et.al.	2504.15661	null
2025-04-21	PIV-FlowDiffuser:Transfer-learning-based denoising diffusion models for PIV	Qianyu Zhu et.al.	2504.14952	link
2025-04-21	Multimodal Non-Semantic Feature Fusion for Predicting Segment Access Frequency in Lecture Archives	Ruozhu Sheng et.al.	2504.14927	null
2025-04-20	FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models	Kuanting Wu et.al.	2504.14535	null
2025-04-18	Neural Ganglion Sensors: Learning Task-specific Event Cameras Inspired by the Neural Circuit of the Human Retina	Haley M. So et.al.	2504.13457	null
2025-04-18	MicroFlow: Domain-Specific Optical Flow for Ground Deformation Estimation in Seismic Events	Juliette Bertrand et.al.	2504.13452	null
2025-04-18	Event-Enhanced Blurry Video Super-Resolution	Dachun Kai et.al.	2504.13042	link
2025-04-17	SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration	Xi Tong et.al.	2504.12869	null
2025-04-17	SAM-Based Building Change Detection with Distribution-Aware Fourier Adaptation and Edge-Constrained Warping	Yun-Cheng Li et.al.	2504.12619	null
2025-04-14	Perturbed State Space Feature Encoders for Optical Flow with Event Cameras	Gokul Raju Govinda Raju et.al.	2504.10669	null
2025-04-15	WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs	Nguyen Ngoc Dat et.al.	2504.10165	null
2025-04-11	Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review	Claudio Cimarelli et.al.	2504.08588	null
2025-04-10	Extending Visual Dynamics for Video-to-Music Generation	Xiaohao Liu et.al.	2504.07594	null
2025-04-08	Intrinsic Saliency Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation	Xiangyu Zheng et.al.	2504.05904	null
2025-04-07	Towards Efficient Real-Time Video Motion Transfer via Generative Time Series Modeling	Tasmiah Haque et.al.	2504.05537	null
2025-04-06	FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency	Shiyan Liu et.al.	2504.04427	null
2025-04-05	Simultaneous Motion And Noise Estimation with Event Cameras	Shintaro Shiba et.al.	2504.04029	null
2025-04-04	3D Scene Understanding Through Local Random Access Sequence Modeling	Wanhee Lee et.al.	2504.03875	null
2025-04-03	L-LBVC: Long-Term Motion Estimation and Prediction for Learned Bi-Directional Video Compression	Yongqi Zhai et.al.	2504.02560	null
2025-04-01	Beyond Wide-Angle Images: Unsupervised Video Portrait Correction via Spatiotemporal Diffusion Adaptation	Wenbo Nie et.al.	2504.00401	null
2025-04-01	Hierarchical Flow Diffusion for Efficient Frame Interpolation	Yang Hai et.al.	2504.00380	null
2025-03-31	Easi3R: Estimating Disentangled Motion from DUSt3R Without Training	Xingyu Chen et.al.	2503.24391	link
2025-04-03	Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey	Haoyang Wang et.al.	2503.22943	null
2025-03-28	Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision	Rulin Zhou et.al.	2503.22394	null
2025-03-28	Segment Any Motion in Videos	Nan Huang et.al.	2503.22268	null
2025-03-28	Synergistic Bleeding Region and Point Detection in Surgical Videos	Jialun Pei et.al.	2503.22174	null
2025-03-27	VADMamba: Exploring State Space Models for Fast Video Anomaly Detection	Jiahao Lyu et.al.	2503.21169	link
2025-03-27	Can Video Diffusion Model Reconstruct 4D Geometry?	Jinjie Mai et.al.	2503.21082	null
2025-03-25	Burst Image Super-Resolution with Mamba	Ozan Unal et.al.	2503.19634	null
2025-03-24	NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting	Yulong Zheng et.al.	2503.18794	null
2025-03-27	MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion	Yikun Ma et.al.	2503.17695	null
2025-03-21	Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks	Bhishma Dedhia et.al.	2503.17539	null
2025-03-21	Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras	Shuang Guo et.al.	2503.17262	link
2025-03-20	4D Gaussian Splatting SLAM	Yanyan Li et.al.	2503.16710	null
2025-03-20	EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation	Zihao Zhang et.al.	2503.15831	null
2025-03-19	DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework	Henrique Morimitsu et.al.	2503.14880	link
2025-03-19	Temporal-Consistent Video Restoration with Pre-trained Diffusion Models	Hengkang Wang et.al.	2503.14863	null
2025-03-18	GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics	Tingyang Xiao et.al.	2503.14247	link
2025-03-17	UCF-Crime-DVS: A Novel Event-Based Dataset for Video Anomaly Detection with Spiking Neural Networks	Yuanbin Qian et.al.	2503.12905	link
2025-03-16	ProbDiffFlow: An Efficient Learning-Free Framework for Probabilistic Single-Image Optical Flow Estimation	Mo Zhou et.al.	2503.12348	null
2025-03-17	EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation	Zengyu Wan et.al.	2503.11371	null
2025-03-14	FG-DFPN: Flow Guided Deformable Frame Prediction Network	M. Akın Yılmaz et.al.	2503.11343	link
2025-03-14	Zero-TIG: Temporal Consistency-Aware Zero-Shot Illumination-Guided Low-light Video Enhancement	Yini Li et.al.	2503.11175	link
2025-03-14	A High-Accuracy Alignment Approach for Solar Images of Different Wavelengths	Yun Wang et.al.	2503.11035	null
2025-03-13	Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations	Xunzhi Zheng et.al.	2503.10464	null
2025-03-13	Markerless Tracking-Based Registration for Medical Image Motion Correction	Luisa Neubig et.al.	2503.10260	null
2025-03-13	ST-FlowNet: An Efficient Spiking Neural Network for Event-Based Optical Flow Estimation	Hongze Sun et.al.	2503.10195	null
2025-03-12	Investigation of Frame Differences as Motion Cues for Video Object Segmentation	Sota Kawamura et.al.	2503.09132	null
2025-03-11	Feature Alignment with Equivariant Convolutions for Burst Image Super-Resolution	Xinyi Liu et.al.	2503.08300	null
2025-03-10	MambaFlow: A Mamba-Centric Architecture for End-to-End Optical Flow Estimation	Juntian Du et.al.	2503.07046	null
2025-03-11	Bridge Frame and Event: Common Spatiotemporal Fusion for High-Dynamic Scene Optical Flow	Hanyu Zhou et.al.	2503.06992	null
2025-03-09	Online Dense Point Tracking with Streaming Memory	Qiaole Dong et.al.	2503.06471	link
2025-03-10	VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control	Yuxuan Bian et.al.	2503.05639	link
2025-03-07	Stereo Any Video: Temporally Consistent Stereo Matching	Junpeng Jing et.al.	2503.05549	null
2025-03-06	Implicit Neural Representation for Video and Image Super-Resolution	Mary Aiyetigbo et.al.	2503.04665	null
2025-03-09	ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem	Yu-Hsi Chen et.al.	2503.04500	link
2025-03-05	Video Super-Resolution: All You Need is a Video Diffusion Model	Zhihao Zhan et.al.	2503.03355	null
2025-03-05	BAT: Learning Event-based Optical Flow with Bidirectional Adaptive Temporal Correlation	Gangwei Xu et.al.	2503.03256	null
2025-03-05	Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria	Asma A. Almutairi et.al.	2503.03100	null
2025-03-04	Anomaly detection in non-stationary videos using time-recursive differencing network based prediction	Gargi V. Pillai et.al.	2503.02234	null
2025-03-03	MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features	Chao Ye et.al.	2503.01571	link
2025-03-03	AI-Driven Relocation Tracking in Dynamic Kitchen Environments	Arash Nasr Esfahani et.al.	2503.01547	link
2025-03-02	Vid2Fluid: 3D Dynamic Fluid Assets from Single-View Videos with Generative Gaussian Splatting	Zhiwei Zhao et.al.	2503.00868	null
2025-02-28	EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration	Kuangyi Chen et.al.	2503.00167	link
2025-02-21	Peripheral Teleportation: A Rest Frame Design to Mitigate Cybersickness During Virtual Locomotion	Tongyu Nie et.al.	2502.15227	null
2025-02-20	Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance	Meng Wang et.al.	2502.14520	null
2025-02-18	L4P: Low-Level 4D Vision Perception Unified	Abhishek Badki et.al.	2502.13078	null
2025-02-18	Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection	Zijian Cao et.al.	2502.12735	null
2025-02-17	Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance	Jixiang Chen et.al.	2502.11971	null
2025-02-17	Stonefish: Supporting Machine Learning Research in Marine Robotics	Michele Grimaldi et.al.	2502.11887	link
2025-02-15	Super Resolution image reconstructs via total variation-based image deconvolution: a majorization-minimization approach	Mouhamad Chehaitly et.al.	2502.10876	null
2025-02-15	Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video	Runyang Feng et.al.	2502.10616	null
2025-02-11	A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision	Hao Ai et.al.	2502.10444	null
2025-02-12	FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis	Wonjoon Jin et.al.	2502.08244	null
2025-02-11	Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors	Lin-Zhuo Chen et.al.	2502.07615	null
2025-02-18	A Physical Coherence Benchmark for Evaluating Video Generation Models via Optical Flow-guided Frame Prediction	Yongfan Chen et.al.	2502.05503	link
2025-02-05	MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent	Xinyao Liao et.al.	2502.03207	null
2025-02-03	XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications	Shangjin Zhai et.al.	2502.01297	null
2025-01-28	Image Velocimetry using Direct Displacement Field estimation with Neural Networks for Fluids	Efraín Magaña et.al.	2501.18641	link
2025-02-02	REMOTE: Real-time Ego-motion Tracking for Various Endoscopes via Multimodal Visual Feature Learning	Liangjing Shao et.al.	2501.18124	null
2025-01-28	Improved Encoding for Overfitted Video Codecs	Thomas Leguay et.al.	2501.16976	null
2025-01-28	Assessing ultrasonic and optical flow velocimetry in a millifluidic device using oil-in-water emulsions as blood mimicking fluid	Estelle Lu et.al.	2501.16959	null
2025-01-28	Extending Information Bottleneck Attribution to Video Sequences	Veronika Solopova et.al.	2501.16889	link
2025-02-04	Event-Based Adaptive Koopman Framework for Optic Flow-Guided Landing on Moving Platforms	Bazeela Banday et.al.	2501.16868	null
2025-01-23	GC-ConsFlow: Leveraging Optical Flow Residuals and Global Context for Robust Deepfake Detection	Jiaxin Chen et.al.	2501.13435	null
2025-01-22	MONA: Moving Object Detection from Videos Shot by Dynamic Camera	Boxun Hu et.al.	2501.13183	null
2025-01-22	Machine Learning Modeling for Multi-order Human Visual Motion Processing	Zitang Sun et.al.	2501.12810	link
2025-01-21	Efficient Dynamic Image Reconstruction with motion estimation	Toluwani Okunola et.al.	2501.12497	null
2025-01-21	Learning segmentation from point trajectories	Laurynas Karazija et.al.	2501.12392	link
2025-01-22	Video Depth Anything: Consistent Depth Estimation for Super-Long Videos	Sili Chen et.al.	2501.12375	null
2025-01-21	VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models	Chaohao Xie et.al.	2501.12267	null
2025-01-20	Event-based vision for egomotion estimation using precise event timing	Hugh Greatorex et.al.	2501.11554	null
2025-01-19	BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution	Eunjin Kim et.al.	2501.11043	link
2025-01-25	Quadcopter Position Hold Function using Optical Flow in a Smartphone-based Flight Computer	Noel P. Caliston et.al.	2501.10752	null
2025-01-18	Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection	Yifang Xu et.al.	2501.10692	null
2025-01-17	DiffuEraser: A Diffusion Model for Video Inpainting	Xiaowen Li et.al.	2501.10018	link
2025-01-16	VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization	Zixun Fang et.al.	2501.09499	null
2025-01-16	Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise	Ryan Burgert et.al.	2501.08331	link
2025-01-13	Aligning First, Then Fusing: A Novel Weakly Supervised Multimodal Violence Detection Method	Wenping Jin et.al.	2501.07496	link
2025-01-08	Edit as You See: Image-guided Video Editing via Masked Motion Modeling	Zhi-Lin Huang et.al.	2501.04325	null
2025-01-06	TinySense: A Lighter Weight and More Power-efficient Avionics System for Flying Insect-scale Robots	Zhitao Yu et.al.	2501.03416	null
2025-01-06	ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking	Tingyang Zhang et.al.	2501.03220	null
2025-01-05	AHMSA-Net: Adaptive Hierarchical Multi-Scale Attention Network for Micro-Expression Recognition	Lijun Zhang et.al.	2501.02539	null
2025-01-01	Spatially-guided Temporal Aggregation for Robust Event-RGB Optical Flow Estimation	Qianang Zhou et.al.	2501.00838	null
2025-01-05	How Honeybees Perceive and Traverse Apertures	Timothy Jakobi et.al.	2501.00646	null
2024-12-29	Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition	Xiu-Feng Huang et.al.	2412.20327	link
2024-12-28	Enhancing Marine Debris Acoustic Monitoring by Optical Flow-Based Motion Vector Analysis	Xiaoteng Zhou et.al.	2412.20085	null
2024-12-27	Zero-shot Hazard Identification in Autonomous Driving: A Case Study on the COOOL Benchmark	Lukas Picek et.al.	2412.19944	null
2024-12-27	Generalized Uncertainty-Based Evidential Fusion with Hybrid Multi-Head Attention for Weak-Supervised Temporal Action Localization	Yuanpeng He et.al.	2412.19418	link
2025-01-03	Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry	Zhaoxing Zhang et.al.	2412.16923	link
2024-12-20	SOUS VIDE: Cooking Visual Drone Navigation Policies in a Gaussian Splatting Vacuum	JunEn Low et.al.	2412.16346	null
2024-12-20	MotiF: Making Text Count in Image Animation with Motion Focal Loss	Shijie Wang et.al.	2412.16153	null
2024-12-18	Dynamic semantic VSLAM with known and unknown objects	Sanghyoup Gu et.al.	2412.14359	null
2024-12-18	SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation	Tong Chen et.al.	2412.14018	null
2024-12-17	CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices	Andrei Znobishchev et.al.	2412.13273	null
2024-12-17	Complex extension of optical flow and its practical evaluation for undersampled dynamic MRI	Matthias J. Ehrhardt et.al.	2412.12711	null
2024-12-17	GG-SSMs: Graph-Generating State Space Models	Nikola Zubić et.al.	2412.12423	null
2024-12-16	Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising	Zikang Chen et.al.	2412.11820	link
2024-12-16	Exploring More from Multiple Gait Modalities for Human Identification	Dongyang Jin et.al.	2412.11495	link
2024-12-16	BiM-VFI: directional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions	Wonyong Seo et.al.	2412.11365	null
2024-12-15	Learning Normal Flow Directly From Event Neighborhoods	Dehao Yuan et.al.	2412.11284	link
2024-12-13	BatDeck – Ultra Low-power Ultrasonic Ego-velocity Estimation and Obstacle Avoidance on Nano-drones	Hanna Müller et.al.	2412.10048	null
2024-12-12	A Plug-and-Play Algorithm for 3D Video Super-Resolution of Single-Photon LiDAR data	Alice Ruget et.al.	2412.09427	null
2024-12-12	eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction	Jad Mansour et.al.	2412.09209	link
2024-12-12	ResFlow: Fine-tuning Residual Optical Flow for Event-based High Temporal Resolution Motion Estimation	Qianang Zhou et.al.	2412.09105	null
2024-12-12	Mojito: Motion Trajectory and Intensity Control for Video Generation	Xuehai He et.al.	2412.08948	null
2024-12-12	Labits: Layered Bidirectional Time Surfaces Representation for Event Camera-based Continuous Dense Trajectory Estimation	Zhongyang Zhang et.al.	2412.08849	null
2024-12-11	Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation	Zhigang Cen et.al.	2412.08034	null
2024-12-10	EvRepSL: Event-Stream Representation via Self-Supervised Learning for Event-Based Vision	Qiang Qu et.al.	2412.07080	link
2024-12-09	Local Attention Transformers for High-Detail Optical Flow Upsampling	Alexander Gielisse et.al.	2412.06439	null
2024-12-08	MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation	Shuwei Shi et.al.	2412.05848	null
2024-12-05	Deep Learning and Hybrid Approaches for Dynamic Scene Analysis, Object Detection and Motion Tracking	Shahran Rahman Alve et.al.	2412.05331	null
2024-12-04	Advancing Auto-Regressive Continuation for Video Frames	Ruibo Ming et.al.	2412.03758	null
2024-12-03	Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback	Hiroki Furuta et.al.	2412.02617	null
2024-12-02	STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation	Sunghun Yang et.al.	2412.01090	null
2024-12-01	Advanced Video Inpainting Using Optical Flow-Guided Efficient Diffusion	Bohai Gu et.al.	2412.00857	null
2024-11-30	A conditional Generative Adversarial network model for the Weather4Cast 2024 Challenge	Atharva Deshpande et.al.	2412.00451	null
2024-11-30	Hybrid Local-Global Context Learning for Neural Video Compression	Yongqi Zhai et.al.	2412.00446	null
2024-11-27	RoMo: Robust Motion Segmentation Improves Structure from Motion	Lily Goli et.al.	2411.18650	null
2024-11-27	ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching	Yangrui Dong et.al.	2411.18174	null
2024-11-27	An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition	Song-Jiang Lai et.al.	2411.18002	null
2024-11-26	Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors	Zhengfei Kuang et.al.	2411.17249	null
2024-11-25	Context-Aware Input Orchestration for Video Inpainting	Hoyoung Kim et.al.	2411.16926	null
2024-11-22	TSkips: Efficiency Through Explicit Temporal Delay Connections in Spiking Neural Networks	Prajna G. Malettira et.al.	2411.16711	null
2024-11-24	PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments	Haoang Li et.al.	2411.15800	null
2024-11-23	Optical-Flow Guided Prompt Optimization for Coherent Video Generation	Hyelin Nam et.al.	2411.15540	null
2024-11-22	Benchmarking the Robustness of Optical Flow Estimation to Corruptions	Zhonghua Yi et.al.	2411.14865	link
2024-11-21	EdgeFlowNet: 100FPS@1W Dense Optical Flow For Tiny Mobile Robots	Sai Ramana Kiran Pinnama Raju et.al.	2411.14576	null
2024-11-21	Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation	Zhuoman Liu et.al.	2411.14423	null
2024-11-21	Transforming Static Images Using Generative Models for Video Salient Object Detection	Suhwan Cho et.al.	2411.13975	link
2024-11-20	Sparse Input View Synthesis: 3D Representations and Reliable Priors	Nagabhushan Somraj et.al.	2411.13631	null
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291	null
2024-11-20	Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark	Bing Cao et.al.	2411.13056	null
2024-11-16	AnimateAnything: Consistent and Controllable Animation for Video Generation	Guojun Lei et.al.	2411.10836	null
2024-11-15	OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models	Mathis Koroglu et.al.	2411.10501	null
2024-11-14	Adversarial Attacks Using Differentiable Rendering: A Survey	Matthew Hull et.al.	2411.09749	null
2024-11-14	MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation	Jonas Serych et.al.	2411.09551	link
2024-11-12	DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection	Shawn Li et.al.	2411.08227	link
2024-11-17	Scaling Properties of Diffusion Models for Perceptual Tasks	Rahul Ravishankar et.al.	2411.08034	null
2024-11-11	Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters	Corwin Grant Jeon MacMillan et.al.	2411.05225	null
2024-11-07	Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera	Yu Hu et.al.	2411.04413	null
2024-11-07	AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation	Mingyu Sheng et.al.	2411.03695	link
2024-11-04	Neural optical flow for planar and stereo PIV	Andrew I. Masker et.al.	2411.02373	null
2024-11-03	Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation	Zhenbin Wang et.al.	2411.01647	null
2024-11-03	Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli	Matthias Tangemann et.al.	2411.01505	link
2024-11-02	Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks	Aarjav Kavathia et.al.	2411.01348	null
2024-10-29	Motion Graph Unleashed: A Novel Approach to Video Prediction	Yiqi Zhong et.al.	2410.22288	link
2024-10-29	FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives	Qizhi Chen et.al.	2410.22070	null
2024-10-29	Investigation of moving objects through atmospheric turbulence from a non-stationary platform	Nicholas Ferrante et.al.	2410.21639	null
2024-10-27	CloudCast – Total Cloud Cover Nowcasting with Machine Learning	Mikko Partio et.al.	2410.21329	link
2024-10-28	Enhancing Action Recognition by Leveraging the Hierarchical Structure of Actions and Textual Context	Manuel Benavent-Lledo et.al.	2410.21275	link
2024-10-27	BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events	Yijin Li et.al.	2410.20451	null
2024-10-26	UniVST: A Unified Framework for Training-free Localized Video Style Transfer	Quanjian Song et.al.	2410.20084	link
2024-10-23	Separating edges from microstructure in X-ray dark-field imaging: Evolving and devolving perspectives via the X-ray Fokker-Planck equation	Samantha J. Alloo et.al.	2410.18317	null
2024-10-16	Imagine2Servo: Intelligent Visual Servoing with Diffusion-Driven Goal Generation for Robotic Tasks	Pranjali Pathre et.al.	2410.12432	link
2024-10-14	Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world	Han Ling et.al.	2410.10453	link
2024-10-12	A Collaborative Team of UAV-Hexapod for an Autonomous Retrieval System in GNSS-Denied Maritime Environments	Seungwook Lee et.al.	2410.09606	null
2024-10-12	Robust Optical Flow Computation: A Higher-Order Differential Approach	Chanuka Algama et.al.	2410.09563	null
2024-10-10	MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting	Ruijie Zhu et.al.	2410.07707	link
2024-10-09	Z-upscaling: Optical Flow Guided Frame Interpolation for Isotropic Reconstruction of 3D EM Volumes	Fisseha A. Ferede et.al.	2410.07043	link
2024-10-08	Future frame prediction in chest cine MR imaging using the PCA respiratory motion model and dynamically trained recurrent neural networks	Michel Pohl et.al.	2410.05882	null
2024-10-01	Descriptor: Face Detection Dataset for Programmable Threshold-Based Sparse-Vision	Riadul Islam et.al.	2410.00368	link
2024-10-08	DressRecon: Freeform 4D Human Reconstruction from Monocular Video	Jeff Tan et.al.	2409.20563	null
2024-10-06	Visual collective behaviors on spherical robots	Diego Castro et.al.	2409.20539	null
2024-09-26	Subjective and Objective Quality-of-Experience Evaluation Study for Live Video Streaming	Zehao Zhu et.al.	2409.17596	null
2024-09-26	TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene	Sandika Biswas et.al.	2409.17459	link
2024-09-25	EventHDR: from Event to High-Speed HDR Videos and Beyond	Yunhao Zou et.al.	2409.17029	null
2024-09-25	Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation	Hanyu Zhou et.al.	2409.17001	null
2024-09-25	Pose-Guided Fine-Grained Sign Language Video Generation	Tongkai Shi et.al.	2409.16709	null
2024-09-21	BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow	EungGu Kang et.al.	2409.15384	link
2024-09-23	Skills Made to Order: Efficient Acquisition of Robot Cooking Skills Guided by Multiple Forms of Internet Data	Mrinal Verghese et.al.	2409.15172	null
2024-09-22	Secrets of Edge-Informed Contrast Maximization for Event-Based Vision	Pritam P. Karmokar et.al.	2409.14611	null
2024-09-18	Optical Flow Matters: an Empirical Comparative Study on Fusing Monocular Extracted Modalities for Better Steering	Fouad Makiyeh et.al.	2409.12716	null
2024-09-16	ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video	Han Ling et.al.	2409.12202	link
2024-09-16	Continual Learning of Conjugated Visual Representations through Higher-order Motion Flows	Simone Marullo et.al.	2409.11441	null
2024-09-17	Training Datasets Generation for Machine Learning: Application to Vision Based Navigation	Jérémy Lebreton et.al.	2409.11383	null
2024-09-17	Multimodal Attention-Enhanced Feature Fusion-based Weekly Supervised Anomaly Violence Detection	Yuta Kaneko et.al.	2409.11223	null
2024-09-16	SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning	Amogh Joshi et.al.	2409.09990	null
2024-09-15	Dynamic Layer Detection of a Thin Silk Cloth using DenseTact Optical Tactile Sensors	Ankush Kundan Dhawan et.al.	2409.09849	null
2024-09-15	Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings	Oriel Perl et.al.	2409.09841	null
2024-09-13	InstantDrag: Improving Interactivity in Drag-based Image Editing	Joonghyuk Shin et.al.	2409.08857	null
2024-09-11	Violence detection in videos using deep recurrent and convolutional neural networks	Abdarahmane Traoré et.al.	2409.07581	null
2024-09-11	Distance Measurement for UAVs in Deep Hazardous Tunnels	Vishal Choudhary et.al.	2409.07160	null
2024-09-09	LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow	Hongyu Wen et.al.	2409.05688	null
2024-09-11	Real-Time Human Action Recognition on Embedded Platforms	Ruiqi Wang et.al.	2409.05662	null
2024-09-15	HMAFlow: Learning More Accurate Optical Flow via Hierarchical Motion Field Alignment	Dianbo Ma et.al.	2409.05531	link
2024-09-09	FacialFlowNet: Advancing Facial Optical Flow Estimation with a Diverse Dataset and a Decomposed Model	Jianzhi Lu et.al.	2409.05396	link
2024-09-06	Hybrid Cost Volume for Memory-Efficient Optical Flow	Yang Zhao et.al.	2409.04243	link
2024-09-06	SDformerFlow: Spatiotemporal swin spikeformer for event-based optical flow estimation	Yi Tian et.al.	2409.04082	link
2024-09-03	DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos	Wenbo Hu et.al.	2409.02095	link
2024-08-29	FlowRetrieval: Flow-Guided Data Retrieval for Few-Shot Imitation Learning	Li-Heng Lin et.al.	2408.16944	null
2024-08-29	Estimating Dynamic Flow Features in Groups of Tracked Objects	Tanner D. Harms et.al.	2408.16190	null
2024-08-28	MMASD+: A Novel Dataset for Privacy-Preserving Behavior Analysis of Children with Autism Spectrum Disorder	Pavan Uttej Ravva et.al.	2408.15077	link
2024-08-21	Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars	Zhihao Lin et.al.	2408.11582	null
2024-08-21	SelfDRSC++: Self-Supervised Learning for Dual Reversed Rolling Shutter Correction	Wei Shang et.al.	2408.11411	link
2024-09-02	Video Diffusion Models are Strong Video Inpainter	Minhyeok Lee et.al.	2408.11402	null
2024-08-20	PooDLe: Pooled and dense self-supervised learning from naturalistic videos	Alex N. Wang et.al.	2408.11208	null
2024-08-21	NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices	Zhiyong Zhang et.al.	2408.10161	link
2024-08-19	Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data	Tao Yang et.al.	2408.10119	null
2024-08-18	Contactless seismocardiography via Gunnar-Farneback optical flow	Mohammad Muntasir Rahman et.al.	2408.09512	null
2024-08-18	OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare	Chen Long-fei et.al.	2408.09409	null
2024-08-16	CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving	Shihan Peng et.al.	2408.08500	null
2024-08-15	MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing	Chenjie Cao et.al.	2408.08000	null
2024-08-12	FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework	Lukas Meyer et.al.	2408.06190	link
2024-08-12	Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network	Kailai Sun et.al.	2408.05877	null
2024-08-11	Egocentric Vision Language Planning	Zhirui Fang et.al.	2408.05802	null
2024-08-08	KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance	Jingxian Lu et.al.	2408.02912	null
2024-08-02	NOLO: Navigate Only Look Once	Bohan Zhou et.al.	2408.01384	null
2024-07-31	RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining	Hongtao Wu et.al.	2407.21773	link
2024-07-31	Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching	Pengjie Zhang et.al.	2407.21735	null
2024-07-30	SpotFormer: Multi-Scale Spatio-Temporal Transformer for Facial Expression Spotting	Yicheng Deng et.al.	2407.20799	null
2024-07-29	Event-based Optical Flow on Neuromorphic Processor: ANN vs. SNN Comparison based on Activation Sparsification	Yingfu Xu et.al.	2407.20421	link
2024-07-26	Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations	Zipeng Wang et.al.	2407.18500	null
2024-07-23	Occlusion-Aware 3D Motion Interpretation for Abnormal Behavior Detection	Su Li et.al.	2407.16788	null
2024-07-23	SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging	Lingtong Kong et.al.	2407.16308	link
2024-07-18	Many Perception Tasks are Highly Redundant Functions of their Input Data	Rahul Ramesh et.al.	2407.13841	null
2024-07-18	Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain	Bach Nguyen Gia et.al.	2407.13159	link
2024-07-17	Fusion Flow-enhanced Graph Pooling Residual Networks for Unmanned Aerial Vehicles Surveillance in Day and Night Dual Visions	Alam Noor et.al.	2407.12647	null
2024-07-16	Improving Unsupervised Video Object Segmentation via Fake Flow Generation	Suhwan Cho et.al.	2407.11714	link
2024-07-16	ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment	Xinyi Wang et.al.	2407.11496	link
2024-07-16	Hybrid physics-AI outperforms numerical weather prediction for extreme precipitation nowcasting	Puja Das et.al.	2407.11317	null
2024-07-15	Temporal Event Stereo via Joint Learning with Stereoscopic Flow	Hoonhee Cho et.al.	2407.10831	link
2024-07-15	Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation	Friedhelm Hamann et.al.	2407.10802	link
2024-07-14	Research Experience of an Undergraduate Student in Computer Vision and Robotics	Ayush V. Gowda et.al.	2407.10044	null
2024-07-13	ScaleRAFT: Cross-Scale Recurrent All-Pairs Field Transforms for 3D Motion Estimation	Han Ling et.al.	2407.09797	link
2024-07-11	Generalizable Implicit Motion Modeling for Video Frame Interpolation	Zujin Guo et.al.	2407.08680	null
2024-07-11	Event-based vision on FPGAs – a survey	Tomasz Kryjak et.al.	2407.08356	null
2024-07-10	Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction	Yili Liu et.al.	2407.07587	null
2024-07-05	Unsupervised 4D Cardiac Motion Tracking with Spatiotemporal Optical Flow Networks	Long Teng et.al.	2407.04663	null
2024-07-04	CardioSpectrum: Comprehensive Myocardium Motion Analysis with 3D Deep Learning and Geometric Insights	Shahar Zuler et.al.	2407.03794	link
2024-07-03	Towards High Resolution Real-Time Optical Flow Particle Image Velocimetry	Juan Pimienta et.al.	2407.03057	null
2024-07-03	Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction	Jiaxin Guo et.al.	2407.02918	link
2024-07-01	DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models	Chang-Han Yeh et.al.	2407.01519	link
2024-07-01	RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields	Haochen Jiang et.al.	2407.01303	link
2024-06-27	What Matters in Detecting AI-Generated Videos like Sora?	Chirui Chang et.al.	2406.19568	null
2024-06-27	A Universal Railway Obstacle Detection System based on Semi-supervised Segmentation And Optical Flow	Qiushi Guo et.al.	2406.18908	null
2024-06-27	Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach	Yuxiang Huang et.al.	2406.18837	null
2024-06-25	Disentangled Motion Modeling for Video Frame Interpolation	Jaihyun Lew et.al.	2406.17256	link
2024-06-26	Splatter a Video: Video Gaussian Representation for Versatile Processing	Yang-Tian Sun et.al.	2406.13870	null
2024-06-19	Low Latency Visual Inertial Odometry with On-Sensor Accelerated Optical Flow for Resource-Constrained UAVs	Jonas Kühne et.al.	2406.13345	null
2024-06-17	MEDeA: Multi-view Efficient Depth Adjustment	Mikhail Artemyev et.al.	2406.12048	null
2024-06-13	Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion	Linzhan Mou et.al.	2406.09402	null
2024-06-11	PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow	Joshua Tokarsky et.al.	2406.07667	null
2024-06-11	Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring	Huicong Zhang et.al.	2406.07551	link
2024-06-07	DVOS: Self-Supervised Dense-Pattern Video Object Segmentation	Keyhan Najafian et.al.	2406.05131	null
2024-06-07	Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior	Tanvir Mahmud et.al.	2406.04873	link
2024-06-07	Interplay between preconditioning and regularization for linear ill-posed problems solved by conjugate gradient. Application to optical flow estimation	Ahmed Chabib et.al.	2406.04695	null
2024-06-04	Neural Representations of Dynamic Visual Stimuli	Jacob Yeung et.al.	2406.02659	null
2024-06-03	DeNVeR: Deformable Neural Vessel Representations for Unsupervised Video Vessel Segmentation	Chun-Hung Wu et.al.	2406.01591	null
2024-06-03	Prototypical Transformer as Unified Motion Learners	Cheng Han et.al.	2406.01559	null
2024-06-03	Enhancing Dynamic CT Image Reconstruction with Neural Fields Through Explicit Motion Regularizers	Pablo Arratia et.al.	2406.01299	null
2024-06-03	Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting	Fang Li et.al.	2406.01042	link
2024-06-03	Synthetic Data Generation for 3D Myocardium Deformation Analysis	Shahar Zuler et.al.	2406.01040	link
2024-05-30	EMAG: Ego-motion Aware and Generalizable 2D Hand Forecasting from Egocentric Videos	Masashi Hatano et.al.	2405.20030	null
2024-05-30	May the Dance be with You: Dance Generation Framework for Non-Humanoids	Hyemin Ahn et.al.	2405.19743	null
2024-05-28	GFlow: Recovering 4D World from Monocular Video	Shizun Wang et.al.	2405.18426	null
2024-05-28	Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition	Muhammad Adi Nugroho et.al.	2405.18012	null
2024-05-27	DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation	Mengtan Zhang et.al.	2405.16960	link
2024-05-27	SCSim: A Realistic Spike Cameras Simulator	Liwen Hu et.al.	2405.16790	link
2024-05-26	Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition	Tong Shi et.al.	2405.16701	null
2024-05-26	Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception	Shuangpeng Han et.al.	2405.16493	link
2024-05-24	Time-Harmonic Optical Flow with Applications in Elastography	Oleh Melnyk et.al.	2405.15507	link
2024-05-24	Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features	Lichuan Ji et.al.	2405.15343	null
2024-05-24	Unsupervised Motion Segmentation for Neuromorphic Aerial Surveillance	Sami Arja et.al.	2405.15209	link
2024-05-23	SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow	Yihan Wang et.al.	2405.14793	link
2024-05-23	OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance	Shuheng Ge et.al.	2405.14709	null
2024-05-23	Neuroexplicit Diffusion Models for Inpainting of Optical Flow Fields	Tom Fischer et.al.	2405.14599	null
2024-05-22	MotionCraft: Physics-based Zero-Shot Video Generation	Luca Savant Aira et.al.	2405.13557	link
2024-05-21	Weakly supervised alignment and registration of MR-CT for cervical cancer radiotherapy	Jjahao Zhang et.al.	2405.12850	null
2024-05-21	Rethink Predicting the Optical Flow with the Kinetics Perspective	Yuhao Cheng et.al.	2405.12512	link
2024-05-18	GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition	Mallika Garg et.al.	2405.11180	link
2024-05-17	MicroBundlePillarTrack, A Python package for automated segmentation, tracking, and analysis of pillar deflection in cardiac microbundles	Hiba Kobeissi et.al.	2405.11096	link
2024-05-16	Physics-incorporated Graph Neural Network for Multivariate Time Series Imputation	Guojun Liang et.al.	2405.10995	link
2024-05-15	Dance Any Beat: Blending Beats with Visuals in Dance Video Generation	Xuanchen Wang et.al.	2405.09266	null
2024-05-11	DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation	Volodymyr Fedynyak et.al.	2405.08715	null
2024-05-14	EchoTracker: Advancing Myocardial Point Tracking in Echocardiography	Md Abulkalam Azad et.al.	2405.08587	link
2024-05-15	Vector-Symbolic Architecture for Event-Based Optical Flow	Hongzhi You et.al.	2405.08300	null
2024-05-12	NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU	Yuhao Zhang et.al.	2405.07392	link
2024-05-11	Global Motion Understanding in Large-Scale Video Object Segmentation	Volodymyr Fedynyak et.al.	2405.07031	null
2024-05-09	A Survey on Backbones for Deep Video Action Recognition	Zixuan Tang et.al.	2405.05584	null
2024-05-08	Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection	Shengyang Sun et.al.	2405.05130	link
2024-05-07	Visually Guided Swarm Motion Coordination via Insect-inspired Small Target Motion Reactions	Md Arif Billah et.al.	2405.04591	null
2024-05-06	Diffeomorphic Template Registration for Atmospheric Turbulence Mitigation	Dong Lao et.al.	2405.03662	null

Object Tracking

Publish Date	Title	Authors	PDF	Code
2025-07-22	Benchmarking pig detection and tracking under diverse and challenging conditions	Jonathan Henrich et.al.	2507.16639	null
2025-07-21	Is Tracking really more challenging in First Person Egocentric Vision?	Matteo Dunnhofer et.al.	2507.16015	null
2025-07-20	BleedOrigin: Dynamic Bleeding Source Localization in Endoscopic Submucosal Dissection via Dual-Stage Detection and Tracking	Mengya Xu et.al.	2507.15094	null
2025-07-19	Depthwise-Dilated Convolutional Adapters for Medical Object Tracking and Segmentation Using the Segment Anything Model 2	Guoping Xu et.al.	2507.14613	null
2025-07-18	DUSTrack: Semi-automated point tracking in ultrasound videos	Praneeth Namburi et.al.	2507.14368	null
2025-07-18	Generalist Forecasting with Frozen Video Models via Latent Diffusion	Jacob C Walker et.al.	2507.13942	null
2025-07-18	GOSPA and T-GOSPA quasi-metrics for evaluation of multi-object tracking algorithms	Ángel F. García-Fernández et.al.	2507.13706	null
2025-07-17	MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results	Yuki Kondo et.al.	2507.12832	null
2025-07-19	SpatialTrackerV2: 3D Point Tracking Made Easy	Yuxi Xiao et.al.	2507.12462	null
2025-07-16	Integrated Switched Capacitor Array and Synchronous Charge Extraction with Adaptive Hybrid MPPT for Piezoelectric Harvesters	Pramit Karmakar et.al.	2507.12163	null
2025-07-20	YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association	Xiang Yu et.al.	2507.12087	null
2025-07-15	CharaConsist: Fine-Grained Consistent Character Generation	Mengyu Wang et.al.	2507.11533	null
2025-07-14	Taming Modern Point Tracking for Speckle Tracking Echocardiography via Impartial Motion	Md Abulkalam Azad et.al.	2507.10127	null
2025-07-14	MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second	Chenguo Lin et.al.	2507.10065	null
2025-07-14	OpenHuman4D: Open-Vocabulary 4D Human Parsing	Keito Suzuki et.al.	2507.09880	null
2025-07-12	Online Long-term Point Tracking in the Foundation Model Era	Görkay Aydemir et.al.	2507.09217	null
2025-07-12	On the Fragility of Multimodal Perception to Temporal Misalignment in Autonomous Driving	Md Hasan Shahriar et.al.	2507.09095	null
2025-07-11	SAM2RL: Towards Reinforcement Learning Memory Control in Segment Anything Model 2	Alen Adamyan et.al.	2507.08548	null
2025-07-14	HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking	Ruixiang Chen et.al.	2507.07603	null
2025-07-10	Temporal Unlearnable Examples: Preventing Personal Video Data from Unauthorized Exploitation by Object Tracking	Qiangqiang Wu et.al.	2507.07483	null
2025-07-08	When Trackers Date Fish: A Benchmark and Framework for Underwater Multiple Fish Tracking	Weiran Li et.al.	2507.06400	null
2025-07-08	Learning to Track Any Points from Human Motion	Inès Hyeonsu Kim et.al.	2507.06233	null
2025-07-08	Cooperative Mapping, Localization, and Beam Management via Multi-Modal SLAM in ISAC Systems	Hang Que et.al.	2507.05718	null
2025-07-07	Self-Supervised Real-Time Tracking of Military Vehicles in Low-FPS UAV Footage	Markiyan Kostiv et.al.	2507.05229	null
2025-07-07	Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking	Maria Damanaki et.al.	2507.04762	null
2025-07-05	Integrated Gaussian Processes for Robust and Adaptive Multi-Object Tracking	Fred Lydeard et.al.	2507.04116	null
2025-07-03	CrowdTrack: A Benchmark for Difficult Multiple Pedestrian Tracking in Real Scenarios	Teng Fu et.al.	2507.02479	null
2025-07-03	A Novel Tuning Method for Real-time Multiple-Object Tracking Utilizing Thermal Sensor with Complexity Motion Pattern	Duong Nguyen-Ngoc Tran et.al.	2507.02408	null
2025-07-03	PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection	Seokyeong Lee et.al.	2507.02393	null
2025-07-02	TrackingMiM: Efficient Mamba-in-Mamba Serialization for Real-time UAV Object Tracking	Bingxi Liu et.al.	2507.01535	null
2025-07-04	Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations	Shivansh Patel et.al.	2507.00990	null
2025-07-01	UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions	Siyuan Yao et.al.	2507.00648	null
2025-06-30	Visual and Memory Dual Adapter for Multi-Modal Object Tracking	Boyue Xu et.al.	2506.23972	null
2025-06-30	Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking	Shiao Wang et.al.	2506.23783	null
2025-06-28	Optimal Trajectory Planning for Space Object Tracking with Collision-Avoidance Constraints	Saif R. Kazi et.al.	2506.22797	null
2025-06-27	Improving Token-based Object Detection with Video	Abhineet Singh et.al.	2506.22562	null
2025-07-01	R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning	Biao Wang et.al.	2506.21980	null
2025-06-26	Linear and Second-order-cone Valid Inequalities for Problems with Storage	Juan M. Morales et.al.	2506.21470	null
2025-06-24	VideoPCDNet: Video Parsing and Prediction with Phase Correlation Networks	Noel José Rodrigues Vicente et.al.	2506.19621	null
2025-06-24	Trajectory Prediction in Dynamic Object Tracking: A Critical Study	Zhongping Dong et.al.	2506.19341	null
2025-06-23	Lightweight RGB-T Tracking with Mobile Vision Transformers	Mahdi Falaki et.al.	2506.19154	null
2025-06-23	USVTrack: USV-Based 4D Radar-Camera Tracking Dataset for Autonomous Driving in Inland Waterways	Shanliang Yao et.al.	2506.18737	null
2025-06-23	Emergent Temporal Correspondences from Video Diffusion Transformers	Jisu Nam et.al.	2506.17220	link
2025-06-20	RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking	Teng Guo et.al.	2506.17119	link
2025-06-19	From Theory to Practice: Identifying the Optimal Approach for Offset Point Tracking in the Context of Agricultural Robotics	Stephane Ngnepiepaye Wembe et.al.	2506.16143	null
2025-06-19	KARL: Kalman-Filter Assisted Reinforcement Learner for Dynamic Object Tracking and Grasping	Kowndinya Boyalakuntla et.al.	2506.15945	null
2025-06-18	Probabilistic Trajectory GOSPA: A Metric for Uncertainty-Aware Multi-Object Tracking Performance Evaluation	Yuxuan Xia et.al.	2506.15148	null
2025-06-17	Projected integral control of impedance passive nonlinear systems	Nicolas Vanspranghe et.al.	2506.14267	null
2025-06-16	Deep Learning-Based Multi-Object Tracking: A Comprehensive Survey from Foundations to State-of-the-Art	Momir Adžemović et.al.	2506.13457	null
2025-06-15	Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors	Wen-Hsuan Chu et.al.	2506.12716	null
2025-06-13	Multiple Object Tracking in Video SAR: A Benchmark and Tracking Baseline	Haoxiang Chen et.al.	2506.12105	null
2025-06-11	Optimizing Cooperative Multi-Object Tracking using Graph Signal Processing	Maria Damanaki et.al.	2506.09469	null
2025-06-10	MOSE: A Novel Orchestration Framework for Stateful Microservice Migration at the Edge	Antonio Calagna et.al.	2506.09159	null
2025-06-10	MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning	Mohammadreza Salehi et.al.	2506.08694	link
2025-06-09	SAM2Auto: Auto Annotation Using FLASH	Arash Rocky et.al.	2506.07850	null
2025-06-09	DragNeXt: Rethinking Drag-Based Image Editing	Yuan Zhou et.al.	2506.07611	null
2025-06-08	AllTracker: Efficient Dense Point Tracking at High Resolution	Adam W. Harley et.al.	2506.07310	null
2025-06-05	FRAME: Pre-Training Video Feature Representations via Anticipation and Memory	Sethuraman TV et.al.	2506.05543	null
2025-06-08	Context Is Not Comprehension	Alex Pan et.al.	2506.04907	null
2025-06-04	Contour Errors: An Ego-Centric Metric for Reliable 3D Multi-Object Tracking	Sharang Kaul et.al.	2506.04122	null
2025-06-03	SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports	Dheeraj Khanna et.al.	2506.03335	null
2025-06-03	IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation	Yuanze Lin et.al.	2506.03150	null
2025-06-03	MVTD: A Benchmark Dataset for Maritime Visual Object Tracking	Ahsan Baidar Bakht et.al.	2506.02866	null
2025-06-09	E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models	Wenyan Cong et.al.	2506.01933	null
2025-06-02	UMA: Ultra-detailed Human Avatars via Multi-level Surface Alignment	Heming Zhu et.al.	2506.01802	null
2025-06-02	No Train Yet Gain: Towards Generic Multi-Object Tracking in Sports and Beyond	Tomasz Stanczyk et.al.	2506.01373	null
2025-06-01	Depth-Aware Scoring and Hierarchical Alignment for Multiple Object Tracking	Milad Khanchi et.al.	2506.00774	null
2025-05-29	Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping	Justin Lazarow et.al.	2505.23756	null
2025-05-27	SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation	Claudia Cuttano et.al.	2505.21795	link
2025-05-27	Fully Spiking Neural Networks for Unified Frame-Event Object Tracking	Jingjun Yang et.al.	2505.20834	null
2025-05-26	Video-based Direct Time Series Measurement of Along-Strike Slip on the Coseismic Surface Rupture During the 2025 Mw7.7 Myanmar Earthquake	Jianhao Gao et.al.	2505.20494	null
2025-05-26	ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking	Sijia Chen et.al.	2505.20381	link
2025-05-28	Progressive Scaling Visual Object Tracking	Jack Hong et.al.	2505.19990	null
2025-05-24	Distributed Expectation Propagation for Multi-Object Tracking over Sensor Networks	Qing Li et.al.	2505.18795	null
2025-05-24	FusionTrack: End-to-End Multi-Object Tracking in Arbitrary Multi-View Environment	Xiaohe Li et.al.	2505.18727	null
2025-05-24	EOTNet: Deep Memory Aided Bayesian Filter for Extended Object Tracking	Zhixing Wang et.al.	2505.18684	link
2025-05-23	Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking	Cheng-Yen Yang et.al.	2505.18111	null
2025-05-22	A Framework for Multi-View Multiple Object Tracking using Single-View Multi-Object Trackers on Fish Data	Chaim Chai Elchik et.al.	2505.17201	null
2025-05-22	Temporal Object Captioning for Street Scene Videos from LiDAR Tracks	Vignesh Gopinathan et.al.	2505.16594	null
2025-05-21	Learning better representations for crowded pedestrians in offboard LiDAR-camera 3D tracking-by-detection	Shichao Li et.al.	2505.16029	link
2025-05-21	ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation	Tony Montes et.al.	2505.15928	link
2025-05-19	Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach	Shiao Wang et.al.	2505.12903	link
2025-05-22	LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking	Martha Teiko Teye et.al.	2505.12753	null
2025-05-19	Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking	Shiyu Xuan et.al.	2505.12606	null
2025-05-20	DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion Model	Siwei Xia et.al.	2505.12427	link
2025-05-18	DIMM: Decoupled Multi-hierarchy Kalman Filter for 3D Object Tracking	Jirong Zha et.al.	2505.12340	null
2025-05-17	GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity	Takuya Ikeda et.al.	2505.11905	null
2025-05-12	Asynchronous Multi-Object Tracking with an Event Camera	Angus Apps et.al.	2505.08126	link
2025-05-12	SAEN-BGS: Energy-Efficient Spiking AutoEncoder Network for Background Subtraction	Zhixuan Zhang et.al.	2505.07336	null
2025-05-12	Towards Accurate State Estimation: Kalman Filter Incorporating Motion Dynamics for 3D Multi-Object Tracking	Mohamed Nagy et.al.	2505.07254	null
2025-05-09	Hyperbolic and Elliptic Points Tracking Algorithm (HEPTA) in two-dimensional non-stationary velocity fields defined on a discrete grid	A. A. Udalov et.al.	2505.05975	null
2025-05-09	CGTrack: Cascade Gating Network with Hierarchical Feature Aggregation for UAV Tracking	Weihong Li et.al.	2505.05936	link
2025-05-09	You Are Your Best Teacher: Semi-Supervised Surgical Point Tracking with Cycle-Consistent Self-Distillation	Valay Bundele et.al.	2505.05722	null
2025-05-08	A Simple Detector with Frame Dynamics is a Strong Tracker	Chenxu Peng et.al.	2505.04917	link
2025-05-11	SMMT: Siamese Motion Mamba with Self-attention for Thermal Infrared Target Tracking	Shang Zhang et.al.	2505.04088	null
2025-05-06	Interactive Instance Annotation with Siamese Networks	Xiang Xu et.al.	2505.03184	null
2025-05-06	TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion	Haoyue Liu et.al.	2505.03116	null
2025-05-02	CAMELTrack: Context-Aware Multi-cue ExpLoitation for Online Multi-Object Tracking	Vladimir Somers et.al.	2505.01257	link
2025-05-02	Optimizing Indoor Farm Monitoring Efficiency Using UAV: Yield Estimation in a GNSS-Denied Cherry Tomato Greenhouse	Taewook Park et.al.	2505.00995	null
2025-04-30	MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection	Qiushi Yang et.al.	2505.00739	null
2025-05-01	A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic	Muhammad Imran Zaman et.al.	2505.00534	null
2025-04-30	Direct Motion Models for Assessing Generated Videos	Kelsey Allen et.al.	2505.00209	null
2025-04-30	Stereo X-ray tomography on deformed object tracking	Zhenduo Shang et.al.	2505.00122	null
2025-04-30	LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics	Marc Glocker et.al.	2504.21716	link
2025-04-30	Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction	Zihan Zhou et.al.	2504.21692	null
2025-04-30	Model-Free Two-Degree-of-Freedom PID Controller Design for Unknown LTI Systems	Taiga Kiyota et.al.	2504.21341	null
2025-04-29	The Mean of Multi-Object Trajectories	Tran Thien Dat Nguyen et.al.	2504.20391	null
2025-04-28	Improving trajectory continuity in drone-based crowd monitoring using a set of minimal-cost techniques and deep discriminative correlation filters	Bartosz Ptak et.al.	2504.20234	null
2025-04-28	A computer vision method to estimate ventilation rate of Atlantic salmon in sea fish farms	Lukas Folkman et.al.	2504.19719	null
2025-04-25	Decentralized Fusion of 3D Extended Object Tracking based on a B-Spline Shape Model	Longfei Han et.al.	2504.18708	null
2025-04-25	Multi-Sensor Fusion of Active and Passive Measurements for Extended Object Tracking	Hong Zhu et.al.	2504.18301	null
2025-04-25	PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models	Michel Gokan Khan et.al.	2504.18165	link
2025-04-25	S3MOT: Monocular 3D Object Tracking with Selective State Space Model	Zhuohao Yan et.al.	2504.18068	null
2025-04-24	Dynamic Camera Poses and Where to Find Them	Chris Rockwell et.al.	2504.17788	null
2025-04-23	PRaDA: Projective Radial Distortion Averaging	Daniil Sinitsyn et.al.	2504.16499	null
2025-04-22	SonarT165: A Large-scale Benchmark and STFTrack Framework for Acoustic Object Tracking	Yunfeng Li et.al.	2504.15609	link
2025-04-20	TAPIP3D: Tracking Any Point in Persistent 3D Geometry	Bowei Zhang et.al.	2504.14717	link
2025-04-20	Seurat: From Moving Points to Depth	Seokju Cho et.al.	2504.14687	link
2025-04-19	Adversarial Attack for RGB-Event based Visual Object Tracking	Qiang Chen et.al.	2504.14423	link
2025-04-17	St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World	Haiwen Feng et.al.	2504.13152	null
2025-04-17	Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving	Shumin Wang et.al.	2504.12709	null
2025-04-16	Robust Visual Servoing under Human Supervision for Assembly Tasks	Victor Nan Fernandez-Ayala et.al.	2504.12506	null
2025-04-13	Intelligent driving vehicle front multi-target tracking and detection based on YOLOv5 and point cloud 3D projection	Dayong Liu et.al.	2504.11310	null
2025-04-15	WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs	Nguyen Ngoc Dat et.al.	2504.10165	null
2025-04-14	LiteTracker: Leveraging Temporal Causality for Accurate Low-latency Tissue Tracking	Mert Asim Karaoglu et.al.	2504.09904	null
2025-04-12	PapMOT: Exploring Adversarial Patch Attack against Multiple Object Tracking	Jiahuan Long et.al.	2504.09361	null
2025-04-12	Text To 3D Object Generation For Scalable Room Assembly	Sonia Laguna et.al.	2504.09328	null
2025-04-12	ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking	Tzoulio Chamiti et.al.	2504.09195	null
2025-04-10	GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation	Lang Lin et.al.	2504.07962	null
2025-04-09	Multi-Object Tracking for Collision Avoidance Using Multiple Cameras in Open RAN Networks	Jordi Serra et.al.	2504.07163	null
2025-04-13	VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning	Xinhao Li et.al.	2504.06958	null
2025-04-08	POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction	Songyan Zhang et.al.	2504.05692	link
2025-04-06	SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation	Junjie Jiang et.al.	2504.04519	link
2025-04-05	Risk-Aware Robot Control in Dynamic Environments Using Belief Control Barrier Functions	Shaohang Han et.al.	2504.04097	link
2025-04-04	TQD-Track: Temporal Query Denoising for 3D Multi-Object Tracking	Shuxiao Ding et.al.	2504.03258	null
2025-04-03	Attention-Aware Multi-View Pedestrian Tracking	Reef Alturki et.al.	2504.03047	null
2025-04-03	Data-Driven Object Tracking: Integrating Modular Neural Networks into a Kalman Framework	Christian Alexander Holz et.al.	2504.02519	null
2025-04-02	Deep LG-Track: An Enhanced Localization-Confidence-Guided Multi-Object Tracker	Ting Meng et.al.	2504.01457	null
2025-04-02	COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking	Chunhui Zhang et.al.	2504.01321	link
2025-04-01	IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval	Bangwei Liu et.al.	2504.00954	null
2025-03-31	Point Tracking in Surgery–The 2024 Surgical Tattoos in Infrared (STIR) Challenge	Adam Schmidt et.al.	2503.24306	link
2025-04-03	Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey	Haoyang Wang et.al.	2503.22943	null
2025-03-28	Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision	Rulin Zhou et.al.	2503.22394	null
2025-03-28	Hyperspectral Adapter for Object Tracking based on Hyperspectral Video	Long Gao et.al.	2503.22199	null
2025-03-25	Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better	Zihang Lai et.al.	2503.19904	null
2025-03-24	TrackID3x3: A Dataset and Algorithm for Multi-Player Tracking with Identification and Pose Estimation in 3x3 Basketball Full-court Videos	Kazuhiro Yamada et.al.	2503.18282	link
2025-03-22	MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking	Haolin Qin et.al.	2503.17699	link
2025-03-21	Dynamic Attention Mechanism in Spatiotemporal Memory Networks for Object Tracking	Meng Zhou et.al.	2503.16768	null
2025-03-20	Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction	Edgar Sucar et.al.	2503.16318	null
2025-03-19	Toward Scalable, Flexible Scene Flow for Point Clouds	Kyle Vedder et.al.	2503.15666	null
2025-03-17	Real-Time Multi-Object Tracking using YOLOv8 and SORT on a SoC FPGA	Michal Danilowicz et.al.	2503.13023	null
2025-03-17	OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering	Guanhua Ding et.al.	2503.12968	null
2025-03-17	GIFT: Generated Indoor video frames for Texture-less point tracking	Jianzheng Huang et.al.	2503.12944	null
2025-03-17	UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Network	Siyuan Yao et.al.	2503.12888	link
2025-03-16	History-Aware Transformation of ReID Features for Multiple Object Tracking	Ruopeng Gao et.al.	2503.12562	link
2025-03-15	ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object	Zhe Shan et.al.	2503.12006	link
2025-03-14	VGGT: Visual Geometry Grounded Transformer	Jianyuan Wang et.al.	2503.11651	link
2025-03-14	Cognitive Disentanglement for Referring Multi-Object Tracking	Shaofeng Liang et.al.	2503.11496	null
2025-03-13	3D Extended Object Tracking based on Extruded B-Spline Side View Profiles	Longfei Han et.al.	2503.10730	null
2025-03-18	OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer	Jinyang Li et.al.	2503.10616	link
2025-03-13	Low Complexity Point Tracking of the Myocardium in 2D Echocardiography	Artem Chernyshov et.al.	2503.10431	link
2025-03-13	Target-aware Bidirectional Fusion Transformer for Aerial Object Tracking	Xinglong Sun et.al.	2503.09951	null
2025-03-12	How good are deep learning methods for automated road safety analysis using video data? An experimental study	Qingwu Liu et.al.	2503.09807	null
2025-03-11	TrackOcc: Camera-based 4D Panoptic Occupancy Tracking	Zhuoguang Chen et.al.	2503.08471	link
2025-03-11	Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking	Yunhao Li et.al.	2503.08145	null
2025-03-10	SIRE: SE(3) Intrinsic Rigidity Embeddings	Cameron Smith et.al.	2503.07739	null
2025-03-10	CPAny: Couple With Any Encoder to Refer Multi-Object Tracking	Weize Li et.al.	2503.07516	null
2025-03-09	Online Dense Point Tracking with Streaming Memory	Qiaole Dong et.al.	2503.06471	link
2025-03-06	A Novel Control Strategy for Offset Points Tracking in the Context of Agricultural Robotics	Stephane Ngnepiepaye Wembe et.al.	2503.05835	null
2025-03-06	Omnidirectional Multi-Object Tracking	Kai Luo et.al.	2503.04565	link
2025-03-09	ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem	Yu-Hsi Chen et.al.	2503.04500	link
2025-03-06	A Modular Pipeline for 3D Object Tracking Using RGB Cameras	Lars Bredereke et.al.	2503.04322	link
2025-03-03	AI-Driven Relocation Tracking in Dynamic Kitchen Environments	Arash Nasr Esfahani et.al.	2503.01547	link
2025-02-27	MITracker: Multi-View Integration for Visual Object Tracking	Mengjie Xu et.al.	2502.20111	null
2025-02-26	Spectral-Enhanced Transformers: Leveraging Large-Scale Pretrained Models for Hyperspectral Object Tracking	Shaheer Mohamed et.al.	2502.18748	null
2025-02-25	UASTrack: A Unified Adaptive Selection Framework with Modality-Customization in Single Object Tracking	He Wang et.al.	2502.18220	null
2025-02-26	Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking	Peng Zhang et.al.	2502.17822	null
2025-02-24	V-HOP: Visuo-Haptic 6D Object Pose Tracking	Hongyu Li et.al.	2502.17434	null
2025-02-24	Enriching Physical-Virtual Interaction in AR Gaming by Tracking Identical Real Objects	Liuchuan Yu et.al.	2502.17399	link
2025-02-24	CRTrack: Low-Light Semi-Supervised Multi-object Tracking Based on Consistency Regularization	Zijing Zhao et.al.	2502.16809	null
2025-02-23	Benchmarking Online Object Trackers for Underwater Robot Position Locking Applications	Ali Safa et.al.	2502.16569	null
2025-02-19	A Training-Free Framework for Precise Mobile Manipulation of Small Everyday Objects	Arjun Gupta et.al.	2502.13964	null
2025-02-19	MEX: Memory-efficient Approach to Referring Multi-Object Tracking	Huu-Thien Tran et.al.	2502.13875	null
2025-02-18	Pre-training Auto-regressive Robotic Models with 4D Representations	Dantong Niu et.al.	2502.13142	null
2025-02-13	IMM-MOT: A Novel 3D Multi-object Tracking Framework with Interacting Multiple Model Filter	Xiaohong Liu et.al.	2502.09672	null
2025-02-12	Control Barrier Function-Based Quadratic Programming for SafeOperation of Tethered UAVs	Samuel O. Folorunsho et.al.	2502.08129	null
2025-02-10	Adaptive Perception for Unified Visual Multi-modal Object Tracking	Xiantao Hu et.al.	2502.06583	null
2025-02-09	Energy-Efficient Autonomous Aerial Navigation with Dynamic Vision Sensors: A Physics-Guided Neuromorphic Approach	Sourav Sanyal et.al.	2502.05938	null
2025-02-08	Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark	Shiao Wang et.al.	2502.05574	link
2025-02-06	OneTrack-M: A multitask approach to transformer-based MOT models	Luiz C. S. de Araujo et.al.	2502.04478	null
2025-02-06	RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology	Nhat-Tan Do et.al.	2502.03760	null
2025-02-04	Rethinking Vision Transformer for Object Centric Foundation Models	Manuel Traub et.al.	2502.02763	null
2025-02-04	INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy	Nastaran Darabi et.al.	2502.01896	null
2025-02-03	Bayesian Approximation-Based Trajectory Prediction and Tracking with 4D Radar	Dong-In Kim et.al.	2502.01357	null
2025-02-03	Solgenia – A Test Vessel Toward Energy-Efficient Autonomous Water Taxi Applications	Hannes Homburger et.al.	2502.01207	link
2025-01-30	Track-On: Transformer-based Online Point Tracking with Memory	Görkay Aydemir et.al.	2501.18487	link
2025-01-28	Overcoming Semantic Dilution in Transformer-Based Next Frame Prediction	Hy Nguyen et.al.	2501.16753	null
2025-01-27	Understanding Long Videos via LLM-Powered Entity Relation Graphs	Meng Chu et.al.	2501.15953	null
2025-01-24	MATCHA:Towards Matching Anything	Fei Xue et.al.	2501.14945	null
2025-01-24	Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection	Viktor Kozák et.al.	2501.14587	null
2025-01-23	CSAOT: Cooperative Multi-Agent System for Active Object Tracking	Hy Nguyen et.al.	2501.13994	null
2025-01-23	YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID	Iñaki Erregue et.al.	2501.13710	link
2025-01-21	Learning segmentation from point trajectories	Laurynas Karazija et.al.	2501.12392	link
2025-01-22	InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling	Yi Wang et.al.	2501.12386	link
2025-01-21	Exploring Temporally-Aware Features for Point Tracking	Inès Hyeonsu Kim et.al.	2501.12218	link
2025-01-20	PD-SORT: Occlusion-Robust Multi-Object Tracking Using Pseudo-Depth Cues	Yanchao Wang et.al.	2501.11288	link
2025-01-17	Spatio-temporal Graph Learning on Adaptive Mined Key Frames for High-performance Multi-Object Tracking	Futian Wang et.al.	2501.10129	null
2025-01-13	SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing	Varun Biyyala et.al.	2501.07554	link
2025-01-13	TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations	Daniel Steininger et.al.	2501.07360	link
2025-01-13	Robust Single Object Tracking in LiDAR Point Clouds under Adverse Weather Conditions	Xiantong Zhao et.al.	2501.07133	null
2025-01-09	An Empirical Study of Autoregressive Pre-training from Videos	Jathushan Rajasegaran et.al.	2501.05453	null
2025-01-08	Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs	Zeyi Huang et.al.	2501.04336	null
2025-01-07	Neuromorphic Optical Tracking and Imaging of Randomly Moving Targets through Strongly Scattering Media	Ning Zhang et.al.	2501.03874	null
2025-01-06	ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking	Tingyang Zhang et.al.	2501.03220	null
2025-01-05	GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking	Weikang Bian et.al.	2501.02690	null
2025-01-05	DeTrack: In-model Latent Denoising Learning for Visual Object Tracking	Xinyu Zhou et.al.	2501.02467	null
2025-01-02	HybridTrack: A Hybrid Approach for Robust Multi-Object Tracking	Leandro Di Bella et.al.	2501.01275	link
2025-01-02	Sensitivity of Room Impulse Responses in Changing Acoustic Environment	Karolina Prawda et.al.	2501.01206	null
2025-01-01	Less is More: Token Context-aware Learning for Object Tracking	Chenlong Xu et.al.	2501.00758	link
2024-12-26	SUTrack: Towards Simple and Unified Single Object Tracking	Xin Chen et.al.	2412.19138	link
2024-12-23	Cross-View Referring Multi-Object Tracking	Sijia Chen et.al.	2412.17807	link
2024-12-20	Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking	Xiantao Hu et.al.	2412.15691	link
2024-12-19	Scaling 4D Representations	João Carreira et.al.	2412.15212	null
2024-12-18	Joint Perception and Prediction for Autonomous Driving: A Survey	Lucas Dal’Col et.al.	2412.14088	link
2024-12-18	MambaLCT: Boosting Tracking via Long-term Context State Space Model	Xiaohai Li et.al.	2412.13615	link
2024-12-17	CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices	Andrei Znobishchev et.al.	2412.13273	null
2024-12-17	Tell Me What to Track: Infusing Robust Language Guidance for Enhanced Referring Multi-Object Tracking	Wenjun Huang et.al.	2412.12561	null
2024-12-15	Exploring Enhanced Contextual Information for Video-Level Object Tracking	Ben Kang et.al.	2412.11023	link
2024-12-14	Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos	Qingyu Xu et.al.	2412.10861	link
2024-12-14	Patch-level Sounding Object Tracking for Audio-Visual Question Answering	Zhangbin Li et.al.	2412.10749	null
2024-12-12	Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach	Kailas PS et.al.	2412.10453	null
2024-12-13	Visual Object Tracking across Diverse Data Modalities: A Review	Mengmeng Wang et.al.	2412.09991	null
2024-12-12	NormalFlow: Fast, Robust, and Accurate Contact-based Object 6DoF Pose Tracking with Vision-based Tactile Sensors	Hung-Jui Huang et.al.	2412.09617	link
2024-12-12	Temporal-Assisted Beamforming and Trajectory Prediction in Sensing-Enabled UAV Communications	Shengcai Zhou et.al.	2412.09097	null
2024-12-11	TGOSPA Metric Parameters Selection and Evaluation for Visual Multi-object Tracking	Jan Krejčí et.al.	2412.08321	null
2024-12-11	Post-Hoc MOTS: Exploring the Capabilities of Time-Symmetric Multi-Object Tracking	Gergely Szabó et.al.	2412.08313	null
2024-12-11	DTAA: A Detect, Track and Avoid Architecture for navigation in spaces with Multiple Velocity Objects	Samuel Nordström et.al.	2412.08121	null
2024-12-10	Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation	Kurt H. W. Stolle et.al.	2412.07966	null
2024-12-10	Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments	Muhayy Ud Din et.al.	2412.07392	null
2024-12-10	Optical Levitation of Arrays of Microspheres	Benjamin Siegel et.al.	2412.07088	null
2024-12-09	Microcontroller-Driven MPPT System for Enhanced Photovoltaic Efficiency: An Experimental Approach in Nepal	Diwakar Khadka et.al.	2412.06956	null
2024-12-09	Enhanced Multi-Object Tracking Using Pose-based Virtual Markers in 3x3 Basketball	Li Yin et.al.	2412.06258	null
2024-12-10	Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation	Hyeonho Jeong et.al.	2412.06016	null
2024-12-07	Street Gaussians without 3D Object Tracker	Ruida Zhang et.al.	2412.05548	null
2024-12-06	HOLa: HoloLens Object Labeling	Michael Schwimmbeck et.al.	2412.04945	link
2024-12-06	Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection	Khurram Azeem Hashmi et.al.	2412.04915	null
2024-12-05	EgoPoints: Advancing Point Tracking for Egocentric Videos	Ahmad Darkhalil et.al.	2412.04592	null
2024-12-04	Distillation of Diffusion Features for Semantic Correspondence	Frank Fundel et.al.	2412.03512	null
2024-12-03	MVCTrack: Boosting 3D Point Cloud Tracking via Multimodal-Guided Virtual Cues	Zhaofeng Hu et.al.	2412.02734	link
2024-12-03	GSOT3D: Towards Generic 3D Single Object Tracking in the Wild	Yifan Jiao et.al.	2412.02129	link
2024-12-02	6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting	Yufeng Jin et.al.	2412.01543	null
2024-12-02	A2VIS: Amodal-Aware Approach to Video Instance Segmentation	Minh Tran et.al.	2412.01147	null
2024-12-02	Referring Video Object Segmentation via Language-aligned Track Selection	Seongchan Kim et.al.	2412.01136	link
2024-12-02	Eyes on the Road: State-of-the-Art Video Question Answering Models Assessment for Traffic Monitoring Tasks	Joseph Raj Vishal et.al.	2412.01132	link
2024-12-02	Object Tracking in a $360^o$ View: A Novel Perspective on Bridging the Gap to Biomedical Advancements	Mojtaba S. Fazli et.al.	2412.01119	null
2024-12-02	LiDAR SLAMMOT based on Confidence-guided Data Association	Susu Fang et.al.	2412.01041	null
2024-12-01	BEV-SUSHI: Multi-Target Multi-Camera 3D Detection and Tracking in Bird’s-Eye View	Yizhou Wang et.al.	2412.00692	null
2024-11-29	Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark	Joseph Heyward et.al.	2411.19941	null
2024-11-28	HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos	Prithviraj Banerjee et.al.	2411.19167	null
2024-11-28	Visual SLAMMOT Considering Multiple Motion Models	Peilin Tian et.al.	2411.19134	null
2024-11-28	CrossTracker: Robust Multi-modal 3D Multi-Object Tracking via Cross Correction	Lipeng Gu et.al.	2411.18850	null
2024-11-27	TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video	Jinyuan Qu et.al.	2411.18671	null
2024-11-27	A comparison of extended object tracking with multi-modal sensors in indoor environment	Jiangtao Shuai et.al.	2411.18476	null
2024-11-27	Efficient Dynamic LiDAR Odometry for Mobile Robots with Structured Point Clouds	Jonathan Lichtenfeld et.al.	2411.18443	link
2024-11-26	A Distractor-Aware Memory for Visual Object Tracking with SAM2	Jovana Videnovic et.al.	2411.17576	link
2024-11-24	FastTrackTr:Towards Fast Multi-Object Tracking with Transformers	Pan Liao et.al.	2411.15811	null
2024-11-23	How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking	Xuchen Li et.al.	2411.15600	null
2024-11-23	MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking	Xinqi Liu et.al.	2411.15459	null
2024-11-20	Gaze2AOI: Open Source Deep-learning Based System for Automatic Area of Interest Annotation with Eye Tracking Data	Karolina Trajkovska et.al.	2411.13346	null
2024-11-20	Teaching VLMs to Localize Specific Objects from In-context Examples	Sivan Doveh et.al.	2411.13317	link
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291	null
2024-11-24	ClickTrack: Towards Real-time Interactive Single Object Tracking	Kuiran Wang et.al.	2411.13183	null
2024-11-20	Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity	Wassim El Ahmar et.al.	2411.12943	link
2024-11-19	Resolution Improvement in OFDM-based Joint Communication and Sensing through Combined Tracking and Interpolation	Charlotte Muth et.al.	2411.12464	null
2024-11-18	SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory	Cheng-Yen Yang et.al.	2411.11922	link
2024-11-18	Learning a Neural Association Network for Self-supervised Multi-Object Tracking	Shuai Li et.al.	2411.11514	null
2024-11-15	Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras	Ishrath Ahamed et.al.	2411.10072	null
2024-11-21	MOT FCG++: Enhanced Representation of Spatio-temporal Motion and Appearance Features	Yanzhao Fang et.al.	2411.10028	null
2024-11-13	Predictive Visuo-Tactile Interactive Perception Framework for Object Properties Inference	Anirvan Dutta et.al.	2411.09020	null
2024-11-13	3D Multi-Object Tracking with Semi-Supervised GRU-Kalman Filter	Xiaoxiang Wang et.al.	2411.08433	null
2024-11-13	DEEGITS: Deep Learning based Framework for Measuring Heterogenous Traffic State in Challenging Traffic Scenarios	Muttahirul Islam et.al.	2411.08335	null
2024-11-12	GTA: Global Tracklet Association for Multi-Object Tracking in Sports	Jiacheng Sun et.al.	2411.08216	link
2024-11-11	BuckTales : A multi-UAV dataset for multi-object tracking and re-identification of wild antelopes	Hemal Naik et.al.	2411.06896	null
2024-11-11	HSTrack: Bootstrap End-to-End Multi-Camera 3D Multi-object Tracking with Hybrid Supervision	Shubo Lin et.al.	2411.06780	null
2024-11-11	Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs	Jia Syuen Lim et.al.	2411.06702	null
2024-11-10	PKF: Probabilistic Data Association Kalman Filter for Multi-Object Tracking	Hanwen Cao et.al.	2411.06378	link
2024-11-09	Multi-object Tracking by Detection and Query: an efficient end-to-end manner	Shukun Jia et.al.	2411.06197	null
2024-11-08	Agile UAV landing control on moving ship in adverse conditions	James Mordaunt et.al.	2411.05445	null
2024-11-06	Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving	Depanshu Sani et.al.	2411.03702	null
2024-11-05	Object and Contact Point Tracking in Demonstrations Using 3D Gaussian Splatting	Michael Büttner et.al.	2411.03555	null
2024-11-04	SIRA: Scalable Inter-frame Relation and Association for Radar Perception	Ryoma Yataka et.al.	2411.02220	null
2024-11-04	Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations	Thanh Nguyen Canh et.al.	2411.01816	null
2024-11-04	ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model	Yiming Sun et.al.	2411.01756	null
2024-11-01	HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices	Xiang Li et.al.	2411.00608	null
2024-11-01	Is Multiple Object Tracking a Matter of Specialization?	Gianluca Mancusi et.al.	2411.00553	null
2024-10-31	Extended Object Tracking and Classification based on Linear Splines	Matteo Tesori et.al.	2410.24183	null
2024-10-30	IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking	Run Luo et.al.	2410.23907	null
2024-10-28	Evaluating the Robustness of LiDAR Point Cloud Tracking Against Adversarial Attack	Shengjing Tian et.al.	2410.20893	null
2024-10-27	BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events	Yijin Li et.al.	2410.20451	null
2024-10-27	NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tracking	Yu Liu et.al.	2410.20421	link
2024-10-27	Depth Attention for Robust RGB Tracking	Yu Liu et.al.	2410.20395	link
2024-10-26	SFTrack: A Robust Scale and Motion Adaptive Algorithm for Tracking Small and Fast Moving Objects	InPyo Song et.al.	2410.20079	null
2024-10-25	A-MFST: Adaptive Multi-Flow Sparse Tracker for Real-Time Tissue Tracking Under Occlusion	Yuxin Chen et.al.	2410.19996	null
2024-10-23	ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting	Shaofei Cai et.al.	2410.17856	link
2024-10-23	Real-time Vehicle-to-Vehicle Communication Based Network Cooperative Control System through Distributed Database and Multimodal Perception: Demonstrated in Crossroads	Xinwen Zhu et.al.	2410.17576	link
2024-10-23	OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking	Haiji Liang et.al.	2410.17534	link
2024-10-22	MPT: A Large-scale Multi-Phytoplankton Tracking Benchmark	Yang Yu et.al.	2410.16695	link
2024-10-19	The Solution for Single Object Tracking Task of Perception Test Challenge 2024	Zhiqiang Zhong et.al.	2410.16329	null
2024-10-20	TrackMe:A Simple and Effective Multiple Object Tracking Annotation Tool	Thinh Phan et.al.	2410.15518	link
2024-10-20	Multiset Combinatorial Gray Codes with Application to Proximity Sensor Networks	Chung Shue Chen et.al.	2410.15428	null
2024-10-19	3D Multi-Object Tracking Employing MS-GLMB Filter for Autonomous Driving	Linh Van Ma et.al.	2410.14977	link
2024-10-18	Enhancing In-vehicle Multiple Object Tracking Systems with Embeddable Ising Machines	Kosuke Tatsumura et.al.	2410.14093	null
2024-10-17	Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation	Changcheng Xiao et.al.	2410.13437	null
2024-10-17	TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal	Yanpeng Jia et.al.	2410.13240	null
2024-10-15	CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos	Nikita Karaev et.al.	2410.11831	null
2024-10-17	UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles	Hui Ye et.al.	2410.11125	null
2024-10-14	Motion-guided small MAV detection in complex and non-planar scenes	Hanqing Guo et.al.	2410.10527	null
2024-10-14	SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments	Khaled Gabr et.al.	2410.10409	link
2024-10-14	DINTR: Tracking via Diffusion-based Interpolation	Pha Nguyen et.al.	2410.10053	null
2024-10-11	Enhanced Kalman with Adaptive Appearance Motion SORT for Grounded Generic Multiple Object Tracking	Duy Le Dinh Anh et.al.	2410.09243	null
2024-10-11	VideoSAM: Open-World Video Segmentation	Pinxue Guo et.al.	2410.08781	null
2024-10-11	Efficient Multi-Object Tracking on Edge Devices via Reconstruction-Based Channel Pruning	Jan Müller et.al.	2410.08769	null
2024-10-11	VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking	Zekun Qian et.al.	2410.08529	null
2024-10-05	ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments	Lorenzo Terenzi et.al.	2410.04250	null
2024-10-04	Combing Text-based and Drag-based Editing for Precise and Flexible Image Editing	Ziqi Jiang et.al.	2410.03097	null
2024-10-03	Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking	Fabian Herzog et.al.	2410.02638	link
2024-10-09	DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM	Xuchen Li et.al.	2410.02492	null
2024-10-03	Spiking Neural Network as Adaptive Event Stream Slicer	Jiahang Cao et.al.	2410.02249	link
2024-10-10	Tracking objects that change in appearance with phase synchrony	Sabine Muzellec et.al.	2410.02094	null
2024-10-02	Scene Flow as a Partial Differential Equation	Kyle Vedder et.al.	2410.02031	null
2024-10-02	Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking	Mattia Segu et.al.	2410.01806	null
2024-10-02	Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking	Ayesha Ishaq et.al.	2410.01678	link
2024-09-29	One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos	Zechen Bai et.al.	2409.19603	link
2024-09-27	Improving Visual Object Tracking through Visual Prompting	Shih-Fang Chen et.al.	2409.18901	link
2024-09-30	An Overview of Multi-Object Estimation via Labeled Random Finite Set	Ba-Ngu Vo et.al.	2409.18531	null
2024-09-26	BlinkTrack: Feature Tracking over 100 FPS via Events and Images	Yichen Shen et.al.	2409.17981	null
2024-09-26	General Compression Framework for Efficient Transformer Object Tracking	Lingyi Hong et.al.	2409.17564	null
2024-09-26	CAMOT: Camera Angle-aware Multi-Object Tracking	Felix Limanta et.al.	2409.17533	null
2024-09-25	Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Appearance Graphs	Mattia Segu et.al.	2409.17221	null
2024-09-25	Automated Surgical Skill Assessment in Endoscopic Pituitary Surgery using Real-time Instrument Tracking on a High-fidelity Bench-top Phantom	Adrito Das et.al.	2409.17025	null
2024-09-25	Towards Underwater Camouflaged Object Tracking: An Experimental Evaluation of SAM and SAM 2	Chunhui Zhang et.al.	2409.16902	link
2024-09-25	Conditional Generative Denoiser for Nighttime UAV Tracking	Yucheng Wang et.al.	2409.16834	link
2024-09-25	Progressive Representation Learning for Real-Time UAV Tracking	Changhong Fu et.al.	2409.16652	link
2024-09-25	Enhancing Nighttime UAV Tracking with Light Distribution Suppression	Liangliang Yao et.al.	2409.16631	link
2024-09-24	Transformer based time series prediction of the maximum power point for solar photovoltaic cells	Palaash Agrawal et.al.	2409.16342	null
2024-09-24	Self-Supervised Any-Point Tracking by Contrastive Random Walks	Ayush Shrivastava et.al.	2409.16288	link
2024-09-23	MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving	Xiyang Wang et.al.	2409.16149	link
2024-09-24	CloudTrack: Scalable UAV Tracking with Cloud Semantics	Yannik Blei et.al.	2409.16111	link
2024-09-22	TrackNetV4: Enhancing Fast Sports Object Tracking with Motion Attention Maps	Arjun Raj et.al.	2409.14543	null
2024-09-21	Masks and Boxes: Combining the Best of Both Worlds for Multi-Object Tracking	Tomasz Stanczyk et.al.	2409.14220	null
2024-09-21	Foundation Models for Amodal Video Instance Segmentation in Automated Driving	Jasmin Breitenstein et.al.	2409.14095	link
2024-09-18	Tracking Any Point with Frame-Event Fusion Network at High Frame Rate	Jiaxiong Liu et.al.	2409.11953	null
2024-09-18	RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework	Xiaoyu Li et.al.	2409.11749	null
2024-09-17	SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking	Siyuan Li et.al.	2409.11235	link
2024-09-17	STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking	Jianbo Ma et.al.	2409.11234	link
2024-09-17	TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection	Philip Jacobson et.al.	2409.10901	null
2024-09-15	Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings	Oriel Perl et.al.	2409.09841	null
2024-09-14	Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown	Zimeng Fang et.al.	2409.09293	link
2024-09-12	FACT: Feature Adaptive Continual-learning Tracker for Multiple Object Tracking	Rongzihan Song et.al.	2409.07904	null
2024-09-10	When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking	Emirhan Bayar et.al.	2409.06617	link
2024-09-09	Leveraging Object Priors for Point Tracking	Bikram Boote et.al.	2409.05786	link
2024-09-08	RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network	Zhiwei Lin et.al.	2409.04979	null
2024-09-06	LITE: A Paradigm Shift in Multi-Object Tracking with Efficient ReID Feature Integration	Jumabek Alikhanov et.al.	2409.04187	link
2024-09-05	Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints	Keisuke Toida et.al.	2409.03252	null
2024-09-04	TP-GMOT: Tracking Generic Multiple Object by Textual Prompt with Motion-Appearance Cost (MAC) SORT	Duy Le Dinh Anh et.al.	2409.02490	link
2024-09-03	DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction	Jenny Seidenschwarz et.al.	2409.02104	null
2024-09-01	YOLOO: You Only Learn from Others Once	Lipeng Gu et.al.	2409.00618	null
2024-09-10	TrackSSM: A General Motion Predictor by State-Space Model	Bin Hu et.al.	2409.00487	link
2024-08-31	Fish Tracking Challenge 2024: A Multi-Object Tracking Competition with Sweetfish Schooling Data	Makoto M. Itoh et.al.	2409.00339	null
2024-08-30	UTrack: Multi-Object Tracking with Uncertain Detections	Edgardo Solano-Carrillo et.al.	2408.17098	link
2024-08-29	Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks	Sierra Bonilla et.al.	2408.16445	link
2024-08-29	Estimating Dynamic Flow Features in Groups of Tracked Objects	Tanner D. Harms et.al.	2408.16190	null
2024-08-28	ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model	Lifan Jiang et.al.	2408.15548	link
2024-08-25	Camouflaged_Object_Tracking__A_Benchmark	Xiaoyu Guo et.al.	2408.13877	link
2024-08-24	Can Visual Foundation Models Achieve Long-term Point Tracking?	Görkay Aydemir et.al.	2408.13575	null
2024-08-23	MCTR: Multi Camera Tracking Transformer	Alexandru Niculescu-Mizil et.al.	2408.13243	null
2024-08-23	BoostTrack++: using tracklet information to detect more objects in multiple object tracking	Vukašin Stanojević et.al.	2408.13003	link
2024-08-22	BankTweak: Adversarial Attack against Multi-Object Trackers by Manipulating Feature Banks	Woojin Shin et.al.	2408.12727	null
2024-08-22	BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking	Hanzheng Wang et.al.	2408.12232	null
2024-08-21	CHOTA: A Higher Order Accuracy Metric for Cell Tracking	Timo Kaiser et.al.	2408.11571	link
2024-08-21	Low-Light Object Tracking: A Benchmark	Pengzhi Zhong et.al.	2408.11463	link
2024-08-20	MambaEVT: Event Stream based Visual Object Tracking using State Space Model	Xiao Wang et.al.	2408.10487	link
2024-08-17	GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System	Shuo Wang et.al.	2408.09191	null
2024-08-17	MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model	Changcheng Xiao et.al.	2408.09178	null
2024-08-14	Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving	Yuqing Wen et.al.	2408.07605	null
2024-08-14	RTAT: A Robust Two-stage Association Tracker for Multi-Object Tracking	Song Guo et.al.	2408.07344	null
2024-08-13	Object Tracking Incorporating Transfer Learning into Unscented and Cubature Kalman Filters	Omar Alotaibi et.al.	2408.07157	null
2024-08-12	FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework	Lukas Meyer et.al.	2408.06190	link
2024-08-11	A Training-Free Framework for Video License Plate Tracking and Recognition with Only One-Shot	Haoxuan Ding et.al.	2408.05729	link
2024-08-09	Mesh-based Object Tracking for Dynamic Semantic 3D Scene Graphs via Ray Tracing	Lennart Niecksch et.al.	2408.04979	null
2024-08-06	Quantum Imaging Using Spatially Entangled Photon Pairs from a Nonlinear Metasurface	Jinyong Ma et.al.	2408.02903	null
2024-08-05	VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking	Yuxuan Lu et.al.	2408.02263	null
2024-08-04	3D Single-object Tracking in Point Clouds with High Temporal Variation	Qiao Wu et.al.	2408.02049	null
2024-08-03	SiamMo: Siamese Motion-Centric 3D Object Tracking	Yuxiang Yang et.al.	2408.01688	link
2024-08-02	Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach	Yabin Zhu et.al.	2408.00969	link
2024-08-05	U2UData: A Large-scale Cooperative Perception Dataset for Swarm UAVs Autonomous Flight	Tongtong Feng et.al.	2408.00606	link
2024-08-01	A Batch Update Using Multiplicative Noise Modelling for Extended Object Tracking	Christian Gramsch et.al.	2408.00417	null
2024-07-30	Autogenic Language Embedding for Coherent Point Tracking	Zikai Song et.al.	2407.20730	link
2024-07-30	SharkTrack: an accurate, generalisable software for streamlining shark and ray underwater video analysis	Filippo Varini et.al.	2407.20623	null
2024-07-29	MEVDT: Multi-Modal Event-Based Vehicle Detection and Tracking Dataset	Zaid A. El Shair et.al.	2407.20446	null
2024-07-28	Progressive Domain Adaptation for Thermal Infrared Object Tracking	Qiao Li et.al.	2407.19430	null
2024-07-25	Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT	Niels G. Faber et.al.	2407.18288	link
2024-07-20	CORT: Class-Oriented Real-time Tracking for Embedded Systems	Edoardo Cittadini et.al.	2407.17521	null
2024-07-23	PlantTrack: Task-Driven Plant Keypoint Tracking with Zero-Shot Sim2Real Transfer	Samhita Marri et.al.	2407.16829	null
2024-07-23	Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos	Jiahe Liu et.al.	2407.16124	link
2024-07-22	Local All-Pair Correspondence for Point Tracking	Seokju Cho et.al.	2407.15420	link
2024-07-21	Multiple Object Detection and Tracking in Panoramic Videos for Cycling Safety Analysis	Jingwei Guo et.al.	2407.15199	link
2024-07-19	Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking	Yunfei Zhang et.al.	2407.14086	link
2024-07-19	OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking	Zekun Qian et.al.	2407.14047	null
2024-07-18	Boosting Online 3D Multi-Object Tracking through Camera-Radar Cross Check	Sheng-Yao Kuan et.al.	2407.13937	null
2024-07-18	Long-Term 3D Point Tracking By Cost Volume Fusion	Hung Nguyen et.al.	2407.13337	null
2024-07-17	Strawberry detection and counting based on YOLOv7 pruning and information based tracking algorithm	Shiyu Liu et.al.	2407.12614	null
2024-07-15	Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation	Friedhelm Hamann et.al.	2407.10802	link
2024-07-15	Effective Motion Modeling for UAV-platform Multiple Object Tracking with Re-Margin Loss	Mufeng Yao et.al.	2407.10485	link
2024-07-16	Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking	Lorenzo Vaquero et.al.	2407.10151	link
2024-07-14	Power System Architecture and Control for Green Hydrogen Production via Power Converter-less Photovoltaic-Electrolyser Integration	Aymeric Fabre et.al.	2407.10075	null
2024-07-12	DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects	Peng Wang et.al.	2407.09051	null
2024-07-11	Manipulating a Tetris-Inspired 3D Video Representation	Mihir Godbole et.al.	2407.08885	null
2024-07-11	Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets	Linh Van Ma et.al.	2407.08872	link
2024-07-11	CommRad: Context-Aware Sensing-Driven Millimeter-Wave Networks	Ish Kumar Jain et.al.	2407.08817	null
2024-07-10	Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors	Lei Cheng et.al.	2407.08049	null
2024-07-10	MSC-LIO: An MSCKF-Based LiDAR-Inertial Odometry with Same-Plane-Point Tracking	Tisheng Zhang et.al.	2407.07589	null
2024-07-09	Decomposition Betters Tracking Everything Everywhere	Rui Li et.al.	2407.06531	link
2024-07-08	GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images	Jon Crall et.al.	2407.06337	null
2024-07-08	TAPVid-3D: A Benchmark for Tracking Any Point in 3D	Skanda Koppula et.al.	2407.05921	link
2024-07-07	Addressing single object tracking in satellite imagery through prompt-engineered solutions	Athena Psalta et.al.	2407.05518	null
2024-07-09	P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds	Jiahao Nie et.al.	2407.05238	link
2024-07-06	VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking	Xuefeng Jiang et.al.	2407.05017	null
2024-07-05	TF-SASM: Training-free Spatial-aware Sparse Memory for Multi-object Tracking	Thuc Nguyen-Quang et.al.	2407.04327	null
2024-07-08	SSP-GNN: Learning to Track via Bilevel Optimization	Griffin Golias et.al.	2407.04308	null
2024-07-05	FeatureSORT: Essential Features for Effective Tracking	Hamidreza Hashempoor et.al.	2407.04249	null
2024-07-04	Attention Normalization Impacts Cardinality Generalization in Slot Attention	Markus Krimmel et.al.	2407.04170	link
2024-07-04	TrackPGD: A White-box Attack using Binary Masks against Robust Transformer Trackers	Fatemeh Nourilenjan Nokabadi et.al.	2407.03946	link
2024-07-03	Applying Extended Object Tracking for Self-Localization of Roadside Radar Sensors	Longfei Han et.al.	2407.03084	null
2024-07-02	FlowTrack: Point-level Flow Network for 3D Single Object Tracking	Shuo Li et.al.	2407.01959	null
2024-07-02	The Solution for the ICCV 2023 Perception Test Challenge 2023 – Task 6 – Grounded videoQA	Hailiang Zhang et.al.	2407.01907	null
2024-06-30	DroBoost: An Intelligent Score and Model Boosting Method for Drone Detection	Ogulcan Eryuksel et.al.	2407.00830	null
2024-06-30	Engineering an Efficient Object Tracker for Non-Linear Motion	Momir Adžemović et.al.	2407.00738	null
2024-06-28	PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators	Kuo-Hao Zeng et.al.	2406.20083	null
2024-06-28	eMoE-Tracker: Environmental MoE-based Transformer for Robust Event-guided Object Tracking	Yucheng Chen et.al.	2406.20024	null
2024-06-28	StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction	Jiaheng Zhuang et.al.	2406.19844	null
2024-06-28	Basketball-SORT: An Association Method for Complex Multi-object Occlusion Problems in Basketball Multi-object Tracking	Qingrui Hu et.al.	2406.19655	null
2024-06-28	Optimal Video Compression using Pixel Shift Tracking	Hitesh Saai Mananchery Panneerselvam et.al.	2406.19630	link
2024-06-26	Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos	Colton Stearns et.al.	2406.18717	link
2024-06-26	BiTrack: Bidirectional Offline 3D Multi-Object Tracking Using Camera-LiDAR Data	Kemiao Huang et.al.	2406.18414	link
2024-06-24	POPCat: Propagation of particles for complex annotation tasks	Adam Srebrnjak Yang et.al.	2406.17183	null
2024-06-24	A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking	Lorenzo Shaikewitz et.al.	2406.16837	link
2024-06-24	The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers	Abhi Kamboj et.al.	2406.16784	null
2024-06-21	LU2Net: A Lightweight Network for Real-time Underwater Image Enhancement	Haodong Yang et.al.	2406.14973	null
2024-06-22	Velocity Analysis of Moving Objects in Earth Observation Satellite Images Using Multi-Spectral Push Broom Scanning	Eric Keto et.al.	2406.13710	null
2024-06-19	Hierarchical IoU Tracking based on Interval	Yunhao Du et.al.	2406.13271	link
2024-06-19	Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models	Akchay Srivastava et.al.	2406.13232	null
2024-06-17	Deep HM-SORT: Enhancing Multi-Object Tracking in Sports with Deep Features, Harmonic Mean, and Expansion IOU	Matias Gran-Henriksen et.al.	2406.12081	null
2024-06-17	VideoVista: A Versatile Benchmark for Video Understanding and Reasoning	Yunxin Li et.al.	2406.11303	null
2024-06-14	Robust compressive tracking via online weighted multiple instance learning	Sandeep Singh Sengar et.al.	2406.09914	null
2024-06-13	Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking	Prithviraj Banerjee et.al.	2406.09598	null
2024-06-12	LaMOT: Language-Guided Multi-Object Tracking	Yunhao Li et.al.	2406.08324	link
2024-06-12	Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance	Yasod Ginige et.al.	2406.08294	null
2024-06-11	Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos	Duc Pham et.al.	2406.07680	null
2024-06-11	Haptic Repurposing with GenAI	Haoyu Wang et.al.	2406.07228	null
2024-06-11	UVIS: Unsupervised Video Instance Segmentation	Shuaiyi Huang et.al.	2406.06908	null
2024-06-09	ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving	Chen Ma et.al.	2406.05810	null
2024-06-09	SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving	Chen Ma et.al.	2406.05800	null
2024-06-08	Training-Free Robust Interactive Video Object Segmentation	Xiaoli Wei et.al.	2406.05485	null
2024-06-07	Bootstrapping Referring Multi-Object Tracking	Yani Zhang et.al.	2406.05039	link
2024-06-07	Multi-Granularity Language-Guided Multi-Object Tracking	Yuhao Li et.al.	2406.04844	link
2024-06-06	Matching Anything by Segmenting Anything	Siyuan Li et.al.	2406.04221	link
2024-06-06	ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints	Divij Handa et.al.	2406.04046	null
2024-06-04	UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking	Lijun Zhou et.al.	2406.02147	null
2024-06-03	Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers	Fatemeh Nourilenjan Nokabadi et.al.	2406.01765	link
2024-06-03	Prototypical Transformer as Unified Motion Learners	Cheng Han et.al.	2406.01559	null
2024-06-03	Convolutional Unscented Kalman Filter for Multi-Object Tracking with Outliers	Shiqi Liu et.al.	2406.01380	null
2024-06-03	Programmable Multi-input Buck-Boost Converter for Photovoltaics Arrays	Zhongting Tang et.al.	2406.01193	null
2024-06-03	Multi-Object Tracking based on Imaging Radar 3D Object Detection	Patrick Palmer et.al.	2406.01011	null
2024-06-01	Towards Generalizable Multi-Object Tracking	Zheng Qin et.al.	2406.00429	link
2024-05-30	WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark	Chunhui Zhang et.al.	2405.19818	link
2024-05-29	DGD: Dynamic 3D Gaussians Distillation	Isaac Labe et.al.	2405.19321	null
2024-05-28	Track Initialization and Re-Identification for~3D Multi-View Multi-Object Tracking	Linh Van Ma et.al.	2405.18606	link
2024-05-28	Reliable Object Tracking by Multimodal Hybrid Feature Extraction and Transformer-Based Fusion	Hongze Sun et.al.	2405.17903	link
2024-05-28	Towards a Generalist and Blind RGB-X Tracker	Yuedong Tan et.al.	2405.17773	link
2024-06-03	BaboonLand Dataset: Tracking Primates in the Wild and Automating Behaviour Recognition from Drone Videos	Isla Duporge et.al.	2405.17698	null
2024-05-27	Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association	Tingwei Liu et.al.	2405.17323	null
2024-05-24	ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking	Xudong Han et.al.	2405.15755	null
2024-05-24	Trackastra: Transformer-based cell tracking for live-cell microscopy	Benjamin Gallusser et.al.	2405.15700	link
2024-05-24	An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking	Pratyusha Musunuru et.al.	2405.15137	null
2024-05-23	Awesome Multi-modal Object Tracking	Chunhui Zhang et.al.	2405.14200	link
2024-05-23	Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning	Zhenyu Wei et.al.	2405.14195	null
2024-05-23	PuTR: A Pure Transformer for Decoupled and Online Multi-Object Tracking	Chongwei Liu et.al.	2405.14119	link
2024-05-22	Multi Player Tracking in Ice Hockey with Homographic Projections	Harish Prakash et.al.	2405.13397	null
2024-05-20	Building Temporal Kernels with Orthogonal Polynomials	Yan Ru Pei et.al.	2405.12179	link
2024-05-20	WiDRa – Enabling Millimeter-Level Differential Ranging Accuracy in Wi-Fi Using Carrier Phase	Vishnu V. Ratnam et.al.	2405.12168	null
2024-05-20	DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM	Xuchen Li et.al.	2405.12139	null
2024-05-20	A Vision on Open Science for the Evolution of Software Engineering Research and Practice	Edson OliveiraJr et.al.	2405.12132	null
2024-05-20	PATE: Proximity-Aware Time series anomaly Evaluation	Ramin Ghorbani et.al.	2405.12096	link
2024-05-20	SEMv3: A Fast and Robust Approach to Table Separation Line Detection	Chunxia Qin et.al.	2405.11862	link
2024-05-20	Online Learning Feedback Control Considering Hysteresis for Musculoskeletal Structures	Kento Kawaharazuka et.al.	2405.11808	null
2024-05-20	CDM-MPC: An Integrated Dynamic Planning and Control Framework for Bipedal Robots Jumping	Zhicheng He et.al.	2405.11773	null
2024-05-19	PBI: Position-Based Dynamics Handles Updated Lagrangian Inelasticity	Chang Yu et.al.	2405.11694	null
2024-05-19	Auto-Platoon : Freight by example	Tharun V. Puthanveettil et.al.	2405.11659	link
2024-05-19	Track Anything Rapter(TAR)	Tharun V. Puthanveettil et.al.	2405.11655	link
2024-05-19	RobMOT: Robust 3D Multi-Object Tracking by Observational Noise and State Estimation Drift Mitigation on LiDAR PointCloud	Mohamed Nagy et.al.	2405.11536	link
2024-05-17	Air Signing and Privacy-Preserving Signature Verification for Digital Documents	P. Sarveswarasarma et.al.	2405.10868	link
2024-05-17	Review on physical impedance models in perovskite solar cells	Rajat Kumar Goyal et.al.	2405.10855	null
2024-05-17	Model Predictive Contouring Control for Vehicle Obstacle Avoidance at the Limit of Handling Using Torque Vectoring	Alberto Bertipaglia et.al.	2405.10847	null
2024-05-17	Heterogeneity-Informed Meta-Parameter Learning for Spatiotemporal Time Series Forecasting	Zheng Dong et.al.	2405.10800	link
2024-05-17	Anomalous relaxation of coarsening foams with viscoelastic continuous phase	Chiara Guidolin et.al.	2405.10657	null
2024-05-17	Cyclical Weight Consolidation: Towards Solving Catastrophic Forgetting in Serial Federated Learning	Haoyue Song et.al.	2405.10647	null
2024-05-17	COMET: NFT Price Prediction with Wallet Profiling	Tianfu Wang et.al.	2405.10640	link
2024-05-17	Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track	Xiaoshuai Hao et.al.	2405.10567	null
2024-05-17	Dynamic Cluster Analysis to Detect and Track Novelty in Network Telescopes	Kai Huang et.al.	2405.10545	null
2024-05-17	Hawkes Models And Their Applications	Patrick J. Laub et.al.	2405.10527	null
2024-05-16	A Novel Bounding Box Regression Method for Single Object Tracking	Omar Abdelaziz et.al.	2405.10444	null
2024-05-16	Beyond Traditional Single Object Tracking: A Survey	Omar Abdelaziz et.al.	2405.10439	null
2024-05-16	Spatial Cognition: a Wave Hypothesis	Robert Worden et.al.	2405.10112	null
2024-05-14	Learning Correspondence for Deformable Objects	Priya Sundaresan et.al.	2405.08996	null
2024-05-14	ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association	Shuxiao Ding et.al.	2405.08909	link
2024-05-14	EchoTracker: Advancing Myocardial Point Tracking in Echocardiography	Md Abulkalam Azad et.al.	2405.08587	link

Defocus

Publish Date	Title	Authors	PDF	Code
2025-07-15	Digital defocus aberration interference for automated optical microscopy	Haowen Zhou et.al.	2507.10867	null
2025-07-01	Efficient Depth- and Spatially-Varying Image Simulation for Defocus Deblur	Xinge Yang et.al.	2507.00372	null
2025-07-09	High-quality metalens enables minimally invasive CFB endoscopy	Ruixiang Song et.al.	2506.21379	null
2025-06-26	Quantitative structure determination from experimental four-dimensional scanning transmission electron microscopy via the scattering matrix	Emmanuel W. C. Terzoudis-Lumsden et.al.	2506.21004	null
2025-06-22	On the Particle Image Overlap in Single Camera Defocusing Approaches	Christian Sax et.al.	2506.18170	null
2025-06-25	Dark Channel-Assisted Depth-from-Defocus from a Single Image	Moushumi Medhi et.al.	2506.06643	null
2025-05-29	Dc-EEMF: Pushing depth-of-field limit of photoacoustic microscopy via decision-level constrained learning	Wangting Zhou et.al.	2506.03181	null
2025-05-31	Fovea Stacking: Imaging with Dynamic Localized Aberration Correction	Shi Mao et.al.	2506.00716	null
2025-05-30	High resolution up-conversion imaging in the 10 μm band under incoherent illumination	Zhao-Qi-Zhi Han et.al.	2505.24367	null
2025-05-30	Fourier ptychographic microscopy aided with transport of intensity equation for robust full phase spectrum reconstruction	Mikołaj Rogalski et.al.	2505.24322	null
2025-07-02	Real-Time Blind Defocus Deblurring for Earth Observation: The IMAGIN-e Mission Approach	Alejandro D. Mousist et.al.	2505.22128	null
2025-05-27	Any-to-Bokeh: One-Step Video Bokeh via Multi-Plane Image Guided Diffusion	Yang Yang et.al.	2505.21593	null
2025-05-23	Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues	Chinmay Talegaonkar et.al.	2505.17358	null
2025-05-19	Combinatorial Sample-and Back-Focal-Plane (BFP) Imaging. Pt. I: Instrument and acquisition parameters affecting BFP images and their analysis	Omer Shavit et.al.	2505.13190	null
2025-05-12	Apple’s Synthetic Defocus Noise Pattern: Characterization and Forensic Applications	David Vázquez-Padín et.al.	2505.07380	null
2025-05-09	Development of precession Lorentz transmission electron microscopy	Shunsuke Hayashi et.al.	2505.05790	null
2025-05-07	Image Restoration via Multi-domain Learning	Xingyu Jiang et.al.	2505.05504	link
2025-05-08	Differentiation of Distinct Single Atoms via Multi-Defocus Fusion Method	Yangfan Li et.al.	2505.04078	null
2025-05-09	Back-illumination interference tomography for imaging weak scattering in thick tissues	Gregory N. McKay et.al.	2504.19278	null
2025-04-25	Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models	Patrick Müller et.al.	2504.18510	null
2025-04-24	Surface morphology and thickness variation estimation of zeolites via electron ptychography	Enci Zhang et.al.	2504.17501	null
2025-04-23	Dual-Camera All-in-Focus Neural Radiance Fields	Xianrui Luo et.al.	2504.16636	null
2025-04-15	Focal Split: Untethered Snapshot Depth from Differential Defocus	Junjie Luo et.al.	2504.11202	null
2025-04-15	Three-dimensional neural network driving self-interference digital holography enables high-fidelity, non-scanning volumetric fluorescence microscopy	Tianlong Man et.al.	2504.10769	null
2025-04-14	Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials	Jingyun Yang et.al.	2504.10281	null
2025-04-11	Optical vortex trajectories as probes for wavefront aberrations	Aleksandra K. Korzeniewska et.al.	2504.08643	null
2025-03-31	InstructRestore: Region-Customized Image Restoration with Human Instructions	Shuaizheng Liu et.al.	2503.24357	link
2025-03-30	Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries	Wei Xu et.al.	2503.23606	null
2025-03-26	Spectrum from Defocus: Fast Spectral Imaging with Chromatic Focal Stack	M. Kerem Aydin et.al.	2503.20184	null
2025-03-24	MaSS13K: A Matting-level Semantic Segmentation Benchmark	Chenxi Xie et.al.	2503.18364	link
2025-03-22	Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration	Yawei Li et.al.	2503.17825	null
2025-03-25	Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures	Tim Seizinger et.al.	2503.16067	link
2025-03-18	The Power of Context: How Multimodality Improves Image Super-Resolution	Kangfu Mei et.al.	2503.14503	null
2025-03-18	Intra and Inter Parser-Prompted Transformers for Effective Image Restoration	Cong Wang et.al.	2503.14037	link
2025-03-16	Pathology Image Restoration via Mixture of Prompts	Jiangdong Cai et.al.	2503.12399	link
2025-03-24	Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models	Armando Fortes et.al.	2503.08434	null
2025-03-12	Free Your Hands: Lightweight Relightable Turntable Capture Pipeline	Jiahui Fan et.al.	2503.05511	null
2025-03-03	Blind Augmentation: Calibration-free Camera Distortion Model Estimation for Real-time Mixed-reality Consistency	Siddhant Prakash et.al.	2503.01387	link
2025-03-13	DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting	Liao Shen et.al.	2503.00746	null
2025-01-24	Linnik point spread functions, time-reversed logarithmic diffusion equations, and blind deconvolution of electron microscope imagery	Alfred S. Carasso et.al.	2502.19420	null
2025-02-20	Exploiting Deblurring Networks for Radiance Fields	Haeyun Choi et.al.	2502.14454	link
2025-02-16	Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation	Kunal Swami et.al.	2502.11002	null
2025-02-11	CodePhys: Robust Video-based Remote Physiological Measurement through Latent Codebook Querying	Shuyang Chu et.al.	2502.07526	null
2025-02-10	SparseFocus: Learning-based One-shot Autofocus for Microscopy with Sparse Content	Yongping Zhai et.al.	2502.06452	null
2025-02-13	Self-similar Features in Sub-secondary Breakup of a Droplet and Ligament Mediated Fragmentation under Extreme Conditions	Saini Jatin Rao et.al.	2502.05976	null
2025-01-29	Five-dimensional single-shot fluorescence imaging using a polarized Fourier light-field microscope	Oumeng Zhang et.al.	2501.18047	null
2025-01-25	Image formation theory of optical coherence tomography with optical aberrations and its application for computational aberration correction	Shuichi Makita et.al.	2501.15011	null
2025-01-23	Theoretical analysis of performance limitation of computational refocusing in optical coherence tomography	Yue Zhu et.al.	2501.13874	null
2025-01-16	SE-BSFV: Online Subspace Learning based Shadow Enhancement and Background Suppression for ViSAR under Complex Background	Shangqu Yan et.al.	2501.09341	null
2025-02-23	Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks	Shuang Cui et.al.	2501.09052	null
2024-12-24	Dissecting CLIP: Decomposition with a Schur Complement-based Approach	Azim Ospanov et.al.	2412.18645	link
2024-12-20	CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images	Jungho Lee et.al.	2412.16028	null
2025-01-06	LEDiff: Latent Exposure Diffusion for HDR Generation	Chao Wang et.al.	2412.14456	null
2024-12-29	AKiRa: Augmentation Kit on Rays for optical video generation	Xi Wang et.al.	2412.14158	null
2024-12-17	Strain engineering of magnetic anisotropy in the kagome magnet Fe3Sn2	D. Kong et.al.	2412.12684	null
2024-12-16	Photoacoustic microscopy with meta-optics	Dorian S. H. Brandmüller et.al.	2412.11733	null
2024-12-11	Dense Depth from Event Focal Stack	Kenta Horikawa et.al.	2412.08120	null
2024-11-15	Resilient Stellarator Divertor Characteristics in the Helically Symmetric eXperiment	K. A. Garcia et.al.	2411.10611	null
2024-10-18	Variable Aperture Bokeh Rendering via Customized Focal Plane Guidance	Kang Chen et.al.	2410.14400	link
2024-11-15	Feature Extraction Reimagined: Achieving Superior Accuracy in Camera Calibration	Zezhun Shi et.al.	2410.13371	link
2024-10-08	First experimental study of multiple orientation muon tomography, with image optimization in sparse data environments	Jesus J. Valencia et.al.	2410.07264	null
2024-10-02	Recording dynamic facial micro-expressions with a multi-focus camera array	Lucas Kreiss et.al.	2410.01973	null
2024-10-29	EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis	Alexander Mai et.al.	2410.01804	null
2024-10-02	Frequency-Dependent F-Numbers Suppress Grating Lobes and Improve the Lateral Resolution in Line-by-Line Scanning	Martin F. Schiffner et.al.	2410.01593	null
2024-10-02	Estimating Atmospheric Wind Speeds From Gemini Planet Imager AO Telemetry	Zhenxi Du et.al.	2410.01193	null
2024-09-28	Extending Depth of Field for Varifocal Multiview Images	Zhilong Li et.al.	2409.19220	null
2024-09-26	PNR: Physics-informed Neural Representation for high-resolution LFM reconstruction	Jiayin Zhao et.al.	2409.18223	null
2024-09-26	Reblurring-Guided Single Image Defocus Deblurring: A Learning Framework with Misaligned Training Pairs	Xinya Shu et.al.	2409.17792	link
2024-09-18	Depth Estimation Based on 3D Gaussian Splatting Siamese Defocus	Jinchang Zhang et.al.	2409.12323	null
2024-09-16	Depth from Coupled Optical Differentiation	Junjie Luo et.al.	2409.10725	link
2024-09-16	Focus diverse phase retrieval test results on broadband continuous wavefront sensing in space telescope applications	Hyukmo Kang et.al.	2409.10500	null
2024-09-15	Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation	Xiaolong Qian et.al.	2409.09754	link
2024-09-14	Innovative schemes for Correlation Plenoptic Imaging	Gianlorenzo Massaro et.al.	2409.09459	null
2024-09-14	Plenoptic microscopy and photography from intensity correlations	Francesco V. Pepe et.al.	2409.09456	null
2024-09-03	F2former: When Fractional Fourier Meets Deep Wiener Deconvolution and Selective Frequency Transformer for Image Deblurring	Subhajit Paul et.al.	2409.02056	null
2024-08-17	Pupil-Adaptive 3D Holography Beyond Coherent Depth-of-Field	Yujie Wang et.al.	2409.00028	null
2024-08-05	Joint-Motion Mutual Learning for Pose Estimation in Videos	Sifan Wu et.al.	2408.02285	null
2024-08-28	Enhancing Quantitative Image Synthesis through Pretraining and Resolution Scaling for Bone Mineral Density Estimation from a Plain X-ray Image	Yi Gu et.al.	2407.20495	link
2024-07-26	3D Orbital Angular Momentum Nonlinear Holography	Feiyang Shen et.al.	2407.18696	null
2024-07-23	HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images	Shreyas Singh et.al.	2407.16503	link
2024-07-21	A Novel Method to Improve Quality Surface Coverage in Multi-View Capture	Wei-Lun Huang et.al.	2407.15883	null
2024-07-20	A New Dataset and Framework for Real-World Blurred Images Super-Resolution	Rui Qin et.al.	2407.14880	link
2024-07-15	Automated high-resolution backscattered-electron imaging at macroscopic scale	Zhiyuan Lang et.al.	2407.10628	null
2024-07-24	Inverse-designed 3D laser nanoprinted phase masks to extend the depth of field of imaging systems	T. J. Sturges et.al.	2407.08482	null
2024-07-11	GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views	Vinayak Gupta et.al.	2407.08221	link
2024-07-31	Dynamic Neural Radiance Field From Defocused Monocular Video	Xianrui Luo et.al.	2407.05586	null
2024-07-01	Point-Spread Function of the Optics in Scanning Electron Microscopes	Surya Kamal et.al.	2407.01439	null
2024-06-27	Super-resolution imaging using super-oscillatory diffractive neural networks	Hang Chen et.al.	2406.19126	null
2024-06-27	The Space Coronagraph Optical Bench (SCoOB): 5. End-to-end simulations of polarization aberrations	Ramya M Anche et.al.	2406.18886	null
2024-06-22	Robust Ptychographic Reconstruction with an Out-of-Focus Electron Probe	Shoucong Ning et.al.	2406.15879	null
2024-06-15	fNeRF: High Quality Radiance Fields from Practical Cameras	Yi Hua et.al.	2406.10633	null
2024-06-12	Striving towards robust phase diversity on-sky: Implementing LIFT for VLT/MUSE-NFM	Arseniy Kuznetsov et.al.	2406.08529	link
2024-06-21	Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field	Chao Wang et.al.	2406.07329	null
2024-06-06	Single Exposure Quantitative Phase Imaging with a Conventional Microscope using Diffusion Models	Gabriel della Maggiora et.al.	2406.04388	null
2024-06-03	Improved Three-Dimensional Reconstructions in Electron Ptychography through Defocus Series Measurements	Marcel Schloz et.al.	2406.01141	null
2024-06-02	End-to-End Hybrid Refractive-Diffractive Lens Design with Differentiable Ray-Wave Model	Xinge Yang et.al.	2406.00834	null
2024-06-10	In vivo fundus imaging and computational refocusing with a diffuser-based fundus camera	Corey Simmerer et.al.	2406.00122	null
2024-05-31	Axial HoloTile: Extended Depth-of-Focus of Dynamic Holographic Light Projections	Andreas Erik Gejl Madsen et.al.	2405.20997	null
2024-05-27	DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Refocusing,Defocus Rendering and Blur Removal	Yujie Wang et.al.	2405.17351	null
2024-05-20	Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction	Aryan Garg et.al.	2405.11823	null
2024-06-04	Single-shot volumetric fluorescence imaging with neural fields	Oumeng Zhang et.al.	2405.10463	null
2024-05-09	Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft	Debabrata Pal et.al.	2405.05574	null
2024-04-05	Robust Gaussian Splatting	François Darmon et.al.	2404.04211	null
2024-04-05	Deep Phase Coded Image Prior	Nimrod Shabtay et.al.	2404.03906	null
2024-04-02	Multiple scattering suppression for in vivo optical coherence tomography measurement using B-scan-wise multi-focus averaging method	Yiqiang Zhu et.al.	2404.01811	null
2024-03-29	Depth from Defocus Technique for High Number Densities and Non-spherical Particles	Rixin Xua et.al.	2403.20004	null
2024-04-01	Video-Based Human Pose Regression via Decoupled Space-Time Aggregation	Jijie He et.al.	2403.19926	link
2024-03-21	Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data	Michael John Fanous et.al.	2403.14324	null
2024-05-06	Expected Impact of Glints from Space Debris in the LSST	J. Anthony Tyson et.al.	2403.04942	null
2024-02-25	Forward and inverse modeling of depth-of-field effects in background-oriented schlieren	Joseph P. Molnar et.al.	2402.15954	null
2024-02-12	Roll-to-roll tomographic volumetric additive manufacturing for continuous production of microstructures on long flexible substrates	Joseph Toombs et.al.	2402.10955	null
2024-04-03	Ptycho-endoscopy on a lensless ultrathin fiber bundle tip	Pengming Song et.al.	2401.17213	null
2024-02-09	Exploring one giga electronvolt cosmic gamma rays with a Cherenkov plenoscope capable of recording atmospheric light fields, Part 1: Optics	Sebastian Achim Mueller et.al.	2401.16148	null
2024-01-29	Light-field imaging from position-momentum correlations	Davide Giannella et.al.	2401.16129	null
2024-01-25	Single- and multi-layer micro-scale diffractive lens fabrication for fiber imaging probes with versatile depth-of-field	Fei He et.al.	2401.14551	null