Updated on 2025.07.17

Usage instructions: here

SLAM

Publish Date Title Authors PDF Code
2025-07-11 Towards Robust Sensor-Fusion Ground SLAM: A Comprehensive Benchmark and A Resilient Framework Deteng Zhang et.al. 2507.08364 null
2025-07-10 Hardware-Aware Feature Extraction Quantisation for Real-Time Visual Odometry on FPGA Platforms Mateusz Wasala et.al. 2507.07903 null
2025-07-10 IRAF-SLAM: An Illumination-Robust and Adaptive Feature-Culling Front-End for Visual SLAM in Challenging Environments Thanh Nguyen Canh et.al. 2507.07752 null
2025-07-09 g2o vs. Ceres: Optimizing Scan Matching in Cartographer SLAM Quanjie Qiu et.al. 2507.07142 null
2025-07-08 Mapping the Catacombs: An Underwater Cave Segment of the Devil’s Eye System Michalis Chatzispyrou et.al. 2507.06397 null
2025-07-08 Cooperative Mapping, Localization, and Beam Management via Multi-Modal SLAM in ISAC Systems Hang Que et.al. 2507.05718 null
2025-07-07 Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR Tao Du et.al. 2507.04662 null
2025-07-06 Lidar Variability: A Novel Dataset and Comparative Study of Solid-State and Spinning Lidars Doumegna Mawuto Koudjo Felix et.al. 2507.04321 null
2025-07-09 Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM Xiaolei Lang et.al. 2507.04004 null
2025-07-04 Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps Chong Cheng et.al. 2507.03737 null
2025-07-01 RaGNNarok: A Light-Weight Graph Neural Network for Enhancing Radar Point Clouds on Unmanned Ground Vehicles David Hunt et.al. 2507.00937 null
2025-07-01 Generation of Indoor Open Street Maps for Robot Navigation from CAD Files Jiajie Zhang et.al. 2507.00552 null
2025-06-30 VOCAL: Visual Odometry via ContrAstive Learning Chi-Yao Huang et.al. 2507.00243 null
2025-06-29 TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints Zhen Tan et.al. 2506.23207 null
2025-06-29 Event-based Stereo Visual-Inertial Odometry with Voxel Map Zhaoxing Zhang et.al. 2506.23078 null
2025-06-26 Adaptive Multipath-Based SLAM for Distributed MIMO Systems Xuhong Li et.al. 2506.21798 null
2025-06-24 Ark: An Open-source Python-based Framework for Robot Learning Magnus Dierking et.al. 2506.21628 null
2025-06-26 EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting Taoyu Wu et.al. 2506.21420 null
2025-06-26 CURL-SLAM: Continuous and Compact LiDAR Mapping Kaicheng Zhang et.al. 2506.21077 null
2025-06-25 SPARK: Graph-Based Online Semantic Integration System for Robot Task Planning Mimo Shirasaka et.al. 2506.20394 null
2025-06-25 Real-Time Obstacle Avoidance Algorithms for Unmanned Aerial and Ground Vehicles Jingwen Wei et.al. 2506.20311 null
2025-06-24 Posterior Cramér-Rao Bounds on Localization and Mapping Errors in Distributed MIMO SLAM Benjamin J. B. Deutschmann et.al. 2506.19957 null
2025-06-23 GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM Annika Thomas et.al. 2506.18885 null
2025-06-23 MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation Tianchen Deng et.al. 2506.18678 null
2025-06-24 Multimodal Fusion SLAM with Fourier Attention Youjie Zhou et.al. 2506.18204 null
2025-06-22 ADA-DPM: A Neural Descriptors-based Adaptive Noise Point Filtering Strategy for SLAM Yongxin Shao et.al. 2506.18016 null
2025-06-21 Optimizing Exploration with a New Uncertainty Framework for Active SLAM Systems Sebastian Sansoni et.al. 2506.17775 null
2025-06-18 MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System Miaoxin Pan et.al. 2506.15402 null
2025-06-24 RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories Qingsong Yan et.al. 2506.15242 null
2025-06-18 SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization Hanjun Kim et.al. 2506.15175 null
2025-06-18 VIMS: A Visual-Inertial-Magnetic-Sonar SLAM System in Underwater Environments Bingbing Zhang et.al. 2506.15126 null
2025-06-16 Slanted light-sheet array microscopy for large volume imaging at rates exceeding 100 Hz Kai Long et.al. 2506.13664 null
2025-06-16 Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots Jaehong Oh et.al. 2506.13149 null
2025-06-16 A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method Zhanhua Xin et.al. 2506.13100 null
2025-06-16 SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure Shahram Najam Syed et.al. 2506.13089 link
2025-06-12 LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System Hongbeen Park et.al. 2506.10567 null
2025-06-11 VAULT: A Mobile Mapping System for ROS 2-based Autonomous Robots Miguel Á. González-Santamarta et.al. 2506.09583 null
2025-06-10 UFM: A Simple Path towards Unified Dense Correspondence with Flow Yuchen Zhang et.al. 2506.09278 null
2025-06-10 Princeton365: A Diverse Dataset with Accurate Camera Pose Karhan Kayan et.al. 2506.09035 null
2025-06-10 Planar Collisionless Shock Simulations with Semi-Implicit Particle-in-Cell Model FLEKS Hongyang Zhou et.al. 2506.08384 null
2025-06-09 ZeroVO: Visual Odometry with Minimal Assumptions Lei Lai et.al. 2506.08005 null
2025-06-08 Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs Qiong Chang et.al. 2506.07164 null
2025-06-08 UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment Wentao Zhao et.al. 2506.07013 null
2025-06-06 GS4: Generalizable Sparse Splatting Semantic SLAM Mingqi Jiang et.al. 2506.06517 null
2025-06-06 Enhancing Situational Awareness in Underwater Robotics with Multi-modal Spatial Perception Pushyami Kaveti et.al. 2506.06476 null
2025-06-06 Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments Mingrui Li et.al. 2506.05965 null
2025-06-06 Analysis of points outcome in ATP Grand Slam Tennis using big data and machine learning Martin Illum et.al. 2506.05866 null
2025-06-05 On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images Andreas Meuleman et.al. 2506.05558 null
2025-06-05 Deep Learning Reforms Image Matching: A Survey and Outlook Shihua Zhang et.al. 2506.04619 null
2025-06-04 cuVSLAM: CUDA accelerated visual odometry Alexander Korovko et.al. 2506.04359 link
2025-06-04 Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset Zirui Wang et.al. 2506.04224 null
2025-06-03 LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM Roman Titkov et.al. 2506.03073 null
2025-06-03 Online Performance Assessment of Multi-Source-Localization for Autonomous Driving Systems Using Subjective Logic Stefan Orf et.al. 2506.02932 null
2025-06-03 VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians Pengchong Hu et.al. 2506.02741 null
2025-06-03 GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal Shufan Qing et.al. 2506.02736 link
2025-06-03 Olfactory Inertial Odometry: Methodology for Effective Robot Navigation by Scent Kordel K. France et.al. 2506.02373 null
2025-06-01 Globally Consistent RGB-D SLAM with 2D Gaussian Splatting Xingguang Zhong et.al. 2506.00970 link
2025-05-30 Black-box Adversarial Attacks on CNN-based SLAM Algorithms Maria Rafaela Gkeka et.al. 2505.24654 null
2025-05-28 Semantic Exploration and Dense Mapping of Complex Environments using Ground Robots Equipped with LiDAR and Panoramic Camera Xiaoyang Zhan et.al. 2505.22880 null
2025-05-28 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians Hidenobu Matsuki et.al. 2505.22859 null
2025-05-28 UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments Wancai Zheng et.al. 2505.22335 null
2025-05-27 HS-SLAM: A Fast and Hybrid Strategy-Based SLAM Approach for Low-Speed Autonomous Driving Bingxiang Kang et.al. 2505.20906 null
2025-05-27 ProBA: Probabilistic Bundle Adjustment with the Bhattacharyya Coefficient Jason Chui et.al. 2505.20858 null
2025-05-26 ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Wenhua Wu et.al. 2505.19420 null
2025-05-25 VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes Tianchen Deng et.al. 2505.18992 link
2025-05-23 CU-Multi: A Dataset for Multi-Robot Data Association Doncey Albin et.al. 2505.17576 null
2025-05-22 TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition Oliver Grainge et.al. 2505.16447 null
2025-05-20 A Methodological Framework for Measuring Spatial Labeling Similarity Yihang Du et.al. 2505.14128 link
2025-05-22 Place Recognition: A Comprehensive Review, Current Challenges and Future Directions Zhenyu Li et.al. 2505.14068 link
2025-05-19 eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks Jad Mansour et.al. 2505.13309 null
2025-05-23 VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold Dominic Maggio et.al. 2505.12549 null
2025-05-18 Is Semantic SLAM Ready for Embedded Systems? A Comparative Survey Calvin Galagain et.al. 2505.12384 null
2025-05-18 Structureless VIO Junlin Song et.al. 2505.12337 null
2025-05-16 EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video Ryan Hoque et.al. 2505.11709 null
2025-05-16 Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization Aaron Wilhelm et.al. 2505.11620 null
2025-05-16 Robust 2D lidar-based SLAM in arboreal environments without IMU/GNSS Paola Nazate-Burgos et.al. 2505.10847 null
2025-05-15 TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation Manthan Patel et.al. 2505.10696 null
2025-05-15 A hybrid SLAM-Payne framework for atmospheric parameter and abundance determination of early-type Stars from LAMOST DR9 low-resolution Spectra Weijia Sun et.al. 2505.10310 null
2025-05-15 Large-Scale Gaussian Splatting SLAM Zhe Xin et.al. 2505.09915 null
2025-05-13 Automated Meta Prompt Engineering for Alignment with the Theory of Mind Aaron Baughman et.al. 2505.09024 null
2025-05-13 MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM Saqi Hussain Kalan et.al. 2505.08388 null
2025-05-13 SKiD-SLAM: Robust, Lightweight, and Distributed Multi-Robot LiDAR SLAM in Resource-Constrained Field Environments Hogyun Kim et.al. 2505.08230 null
2025-05-12 RDD: Robust Feature Detector and Descriptor using Deformable Transformer Gonglin Chen et.al. 2505.08013 null
2025-05-12 Ranking-aware Continual Learning for LiDAR Place Recognition Xufei Wang et.al. 2505.07198 null
2025-05-07 Scalable Aerial GNSS Localization for Marine Robots Shuo Wen et.al. 2505.04095 link
2025-05-06 Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions Lukas Schichler et.al. 2505.03565 null
2025-05-06 AquaticVision: Benchmarking Visual SLAM in Underwater Environment with Events and Frames Yifan Peng et.al. 2505.03448 null
2025-05-06 LiftFeat: 3D Geometry-Aware Local Feature Matching Yepeng Liu et.al. 2505.03422 link
2025-05-05 LiDAR-Inertial SLAM-Based Navigation and Safety-Oriented AI-Driven Control System for Skid-Steer Robots Mehdi Heydari Shahna et.al. 2505.02598 null
2025-05-04 Robust Localization, Mapping, and Navigation for Quadruped Robots Dyuman Aditya et.al. 2505.02272 null
2025-05-04 SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2505.01956 null
2025-05-03 GauS-SLAM: Dense RGB-D SLAM with Gaussian Surfels Yongxin Su et.al. 2505.01934 null
2025-05-02 Tightly Coupled Range Inertial Odometry and Mapping with Exact Point Cloud Downsampling Kenji Koide et.al. 2505.01017 null
2025-04-30 An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation Yaming Ou et.al. 2504.21826 null
2025-04-30 eNCApsulate: NCA for Precision Diagnosis on Capsule Endoscopes Henry John Krumb et.al. 2504.21562 null
2025-04-29 Large-scale visual SLAM for in-the-wild videos Shuo Sun et.al. 2504.20496 null
2025-04-28 Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM Leon Davies et.al. 2504.19654 null
2025-04-28 GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM Leon Davies et.al. 2504.19653 null
2025-04-28 GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field Zuxing Lu et.al. 2504.19409 null
2025-04-27 Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users Apurv Varshney et.al. 2504.19345 null
2025-04-27 NANO-SLAM: Natural Gradient Gaussian Approximation for Vehicle SLAM Tianyi Zhang et.al. 2504.19195 null
2025-04-27 MISO: Multiresolution Submap Optimization for Efficient Globally Consistent Neural Implicit Reconstruction Yulun Tian et.al. 2504.19104 null
2025-04-25 Certifiably-Correct Mapping for Safe Navigation Despite Odometry Drift Devansh R. Agrawal et.al. 2504.18713 null
2025-04-25 Range-based 6-DoF Monte Carlo SLAM with Gradient-guided Particle Filter on GPU Takumi Nakao et.al. 2504.18056 null
2025-04-24 Autonomous Navigation Of Quadrupeds Using Coverage Path Planning Alexander James Becoy et.al. 2504.17880 null
2025-04-22 SmallGS: Gaussian Splatting-based Camera Pose Estimation for Small-Baseline Videos Yuxin Yao et.al. 2504.17810 null
2025-04-24 BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring Asier Bikandi et.al. 2504.17693 null
2025-04-24 Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images Zebo Huang et.al. 2504.17582 null
2025-04-24 Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization Guangyang Zeng et.al. 2504.17410 null
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-23 ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration Andrea Conti et.al. 2504.16545 null
2025-04-22 DERD-Net: Learning Depth from Event-based Ray Densities Diego de Oliveira Hitzges et.al. 2504.15863 null
2025-04-23 SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems Abhishek Tyagi et.al. 2504.15305 null
2025-04-20 Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction Weirong Chen et.al. 2504.14516 null
2025-04-20 SG-Reg: Generalizable and Efficient Scene Graph Registration Chuhao Liu et.al. 2504.14440 link
2025-04-19 Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering Jonathan Embley-Riches et.al. 2504.14135 null
2025-04-21 SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM Samuel Cerezo et.al. 2504.13713 link
2025-04-16 An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World Xingwu Ji et.al. 2504.11698 link
2025-04-18 Doppler-SLAM: Doppler-Aided Radar-Inertial and LiDAR-Inertial Simultaneous Localization and Mapping Dong Wang et.al. 2504.11634 link
2025-04-14 Region Based SLAM-Aware Exploration: Efficient and Robust Autonomous Mapping Strategy That Can Scale Megha Maheshwari et.al. 2504.10416 null
2025-04-14 RoboCup Rescue 2025 Team Description Paper UruBots Kevin Farias et.al. 2504.09778 null
2025-04-11 FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment Sebastián Barbas Laina et.al. 2504.08603 null
2025-04-11 PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection Xiong Li et.al. 2504.08280 null
2025-04-11 II-NVM: Enhancing Map Accuracy and Consistency with Normal Vector-Assisted Mapping Chengwei Zhao et.al. 2504.08204 link
2025-04-10 UWB Anchor Based Localization of a Planetary Rover Andreas Nüchter et.al. 2504.07658 null
2025-04-10 Event Signal Filtering via Probability Flux Estimation Jinze Chen et.al. 2504.07503 null
2025-04-07 Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM Zhicong Sun et.al. 2504.04844 link
2025-04-06 SELC: Self-Supervised Efficient Local Correspondence Learning for Low Quality Images Yuqing Wang et.al. 2504.04497 null
2025-04-06 VSLAM-LAB: A Comprehensive Framework for Visual SLAM Methods and Datasets Alejandro Fontan et.al. 2504.04457 link
2025-04-05 Nonlinear Observer Design for Landmark-Inertial Simultaneous Localization and Mapping Mouaad Boughellaba et.al. 2504.04239 null
2025-04-04 WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments Jianhao Zheng et.al. 2504.03886 null
2025-04-03 SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections Prashant Kumar et.al. 2504.03089 null
2025-04-03 Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision Xiaofeng Han et.al. 2504.02477 null
2025-04-03 MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM Renwu Li et.al. 2504.02437 null
2025-04-02 A Chefs KISS – Utilizing semantic information in both ICP and SLAM framework Sven Ochs et.al. 2504.02086 null
2025-04-01 Semantic SLAM with Rolling-Shutter Cameras and Low-Precision INS in Outdoor Environments Yuchen Zhang et.al. 2504.01997 null
2025-04-02 Strengthening Multi-Robot Systems for SAR: Co-Designing Robotics and Communication Towards 6G Juan Bravo-Arrabal et.al. 2504.01940 null
2025-04-02 Dynamic Initialization for LiDAR-inertial SLAM Jie Xu et.al. 2504.01451 link
2025-04-02 ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue Thomas Pritchard et.al. 2504.01261 link
2025-03-31 SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection Yannick Burkhardt et.al. 2504.00139 null
2025-03-30 A Visual-Inertial Motion Prior SLAM for Dynamic Environments Weilong Sun et.al. 2503.23429 null
2025-03-30 AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos Felix Wimbauer et.al. 2503.23282 link
2025-03-29 Incorporating GNSS Information with LIDAR-Inertial Odometry for Accurate Land-Vehicle Localization Jintao Cheng et.al. 2503.23199 null
2025-03-29 Towards Mobile Sensing with Event Cameras on High-mobility Resource-constrained Devices: A Survey Haoyang Wang et.al. 2503.22943 null
2025-03-27 HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM Ziren Gong et.al. 2503.21778 null
2025-03-27 STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM Yongxu Wang et.al. 2503.21425 null
2025-03-25 Scene-agnostic Pose Regression for Visual Localization Junwei Zheng et.al. 2503.19543 null
2025-03-25 First Results on UAV-aided User Localization Using ToA and OpenAirInterface in 5G NR Omid Esrafilian et.al. 2503.19529 null
2025-03-25 MM-LINS: a Multi-Map LiDAR-Inertial System for Over-Degenerate Environments Yongxin Ma et.al. 2503.19506 link
2025-03-24 Cooperative Control of Multi-Quadrotors for Transporting Cable-Suspended Payloads: Obstacle-Aware Planning and Event-Based Nonlinear Model Predictive Control Tohid Kargar Tasooji et.al. 2503.19135 null
2025-03-24 GI-SLAM: Gaussian-Inertial SLAM Xulang Liu et.al. 2503.18275 null
2025-03-22 LightLoc: Learning Outdoor LiDAR Localization at Light Speed Wen Li et.al. 2503.17814 link
2025-03-21 Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions Muhua Zhang et.al. 2503.17005 null
2025-03-20 4D Gaussian Splatting SLAM Yanyan Li et.al. 2503.16710 null
2025-03-20 Speeding up design and making to reduce time-to-project and time-to-market: an AI-Enhanced approach in engineering education Giovanni Adorni et.al. 2503.16307 null
2025-03-20 Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors Tian Yi Lim et.al. 2503.16275 null
2025-03-19 A Sigma Point-based Low Complexity Algorithm for Multipath-based SLAM in MIMO Systems Anna Masiero et.al. 2503.15286 null
2025-03-19 ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents Hao Liang et.al. 2503.14948 null
2025-03-18 3D Densification for Multi-Map Monocular VSLAM in Endoscopy X. Anadón et.al. 2503.14346 null
2025-03-18 GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics Tingyang Xiao et.al. 2503.14247 link
2025-03-18 A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios Huy-Hoang Bui et.al. 2503.13982 link
2025-03-17 Digital Beamforming Enhanced Radar Odometry Jingqi Jiang et.al. 2503.13252 link
2025-03-17 Dynamic-Dark SLAM: RGB-Thermal Cooperative Robot Vision Strategy for Multi-Person Tracking in Both Well-Lit and Low-Light Scenes Tatsuro Sakai et.al. 2503.12768 null
2025-03-16 KISS-SLAM: A Simple, Robust, and Accurate 3D LiDAR SLAM System With Enhanced Generalization Capabilities Tiziano Guadagnino et.al. 2503.12660 null
2025-03-16 Deblur Gaussian Splatting SLAM Francesco Girlanda et.al. 2503.12572 null
2025-03-16 M2UD: A Multi-model, Multi-scenario, Uneven-terrain Dataset for Ground Robot with Localization and Mapping Evaluation Yanpeng Jia et.al. 2503.12387 null
2025-03-15 DynaGSLAM: Real-Time Gaussian-Splatting SLAM for Online Rendering, Tracking, Motion Predictions of Moving Objects in Dynamic Scenes Runfa Blark Li et.al. 2503.11979 null
2025-03-14 AQUA-SLAM: Tightly-Coupled Underwater Acoustic-Visual-Inertial SLAM with Sensor Calibration Shida Xu et.al. 2503.11420 link
2025-03-14 NF-SLAM: Effective, Normalizing Flow-supported Neural Field representations for object-level visual SLAM in automotive applications Li Cui et.al. 2503.11199 null
2025-03-14 Leveraging Semantic Graphs for Efficient and Robust LiDAR SLAM Neng Wang et.al. 2503.11145 link
2025-03-13 Rapidly Converging Time-Discounted Ergodicity on Graphs for Active Inspection of Confined Spaces Benjamin Wong et.al. 2503.10853 null
2025-03-13 OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions Maxim Popov et.al. 2503.10331 null
2025-03-12 Online Language Splatting Saimouli Katragadda et.al. 2503.09447 null
2025-03-12 MonoSLAM: Robust Monocular SLAM with Global Structure Optimization Bingzheng Jiang et.al. 2503.09296 null
2025-03-11 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-11 GigaSLAM: Large-Scale Monocular SLAM with Hierarchical Gaussian Splats Kai Deng et.al. 2503.08071 link
2025-03-10 POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality Joey Wilson et.al. 2503.07819 null
2025-03-08 HIPPO-MAT: Decentralized Task Allocation Using GraphSAGE and Multi-Agent Deep Reinforcement Learning Lavanya Ratnabala et.al. 2503.07662 null
2025-03-10 AirSwarm: Enabling Cost-Effective Multi-UAV Research with COTS drones Xiaowei Li et.al. 2503.06890 link
2025-03-08 InfoFusion Controller: Informed TRRT Star with Mutual Information based on Fusion of Pure Pursuit and MPC for Enhanced Path Planning Seongjun Choi et.al. 2503.06010 link
2025-03-07 THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks Chaoran Xiong et.al. 2503.05112 null
2025-03-07 Adaptive-LIO: Enhancing Robustness and Precision through Environmental Adaptation in LiDAR Inertial Odometry Chengwei Zhao et.al. 2503.05077 link
2025-03-06 MarsLGPR: Mars Rover Localization with Ground Penetrating Radar Anja Sheppard et.al. 2503.04944 null
2025-03-06 On the Connection Between Magnetic-Field Odometry Aided Inertial Navigation and Magnetic-Field SLAM Isaac Skog et.al. 2503.04286 null
2025-03-06 Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes Hui Zhang et.al. 2503.04235 null
2025-03-06 DVM-SLAM: Decentralized Visual Monocular Simultaneous Localization and Mapping for Multi-Agent Systems Joshua Bird et.al. 2503.04126 null
2025-03-05 Equivariant Filter Design for Range-only SLAM Yixiao Ge et.al. 2503.03973 null
2025-03-05 Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments Jie Deng et.al. 2503.03373 link
2025-03-05 OpenGV 2.0: Motion prior-assisted calibration and SLAM with vehicle-mounted surround-view systems Kun Huang et.al. 2503.03230 null
2025-03-05 Distributed Certifiably Correct Range-Aided SLAM Alexander Thoms et.al. 2503.03192 link
2025-03-04 Monocular visual simultaneous localization and mapping: (r)evolution from geometry to deep learning-based pipelines Olaya Alvarez-Tunon et.al. 2503.02955 link
2025-03-04 Introspective Loop Closure for SLAM with 4D Imaging Radar Maximilian Hilger et.al. 2503.02383 null
2025-03-04 DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting Haoyuan Li et.al. 2503.02223 link
2025-03-03 Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM Marco Giberna et.al. 2503.02050 null
2025-03-03 vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding Ali Tourani et.al. 2503.01783 null
2025-03-03 MUSt3R: Multi-view Network for Stereo 3D Reconstruction Yohann Cabon et.al. 2503.01661 link
2025-03-03 OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding Dianyi Yang et.al. 2503.01646 null
2025-03-03 MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features Chao Ye et.al. 2503.01571 link
2025-03-03 AI-Driven Relocation Tracking in Dynamic Kitchen Environments Arash Nasr Esfahani et.al. 2503.01547 link
2025-03-03 Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning Xintao Chao et.al. 2503.01543 null
2025-03-03 RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation Shu Pan et.al. 2503.01434 null
2025-02-28 A2DO: Adaptive Anti-Degradation Odometry with Deep Multi-Sensor Fusion for Autonomous Navigation Hui Lai et.al. 2502.20767 null
2025-02-27 BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground Yufei Wei et.al. 2502.20078 null
2025-02-26 Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects Petri Mäkinen et.al. 2502.19169 null
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932 null
2025-02-28 S-Graphs 2.0 – A Hierarchical-Semantic Optimization and Loop Closure for SLAM Hriday Bavle et.al. 2502.18044 link
2025-02-25 MegaLoc: One Retrieval to Place Them All Gabriele Berton et.al. 2502.17237 link
2025-02-24 SLABIM: A SLAM-BIM Coupled Dataset in HKUST Main Building Haoming Huang et.al. 2502.16856 link
2025-02-27 Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM Yao Zhang et.al. 2502.16495 null
2025-02-19 Slamming: Training a Speech Language Model on One GPU in a Day Gallil Maimon et.al. 2502.15814 link
2025-02-21 RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes Sicheng Yu et.al. 2502.15633 null
2025-02-20 Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting Boying Li et.al. 2502.14931 null
2025-02-19 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments Vincent Ress et.al. 2502.13803 null
2025-02-19 Active Illumination for Visual Ego-Motion Estimation in the Dark Francesco Crocetti et.al. 2502.13708 null
2025-02-17 From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations Matteo Scucchia et.al. 2502.12303 null
2025-02-19 pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM Luigi Freda et.al. 2502.11955 link
2025-02-17 Anti-Degeneracy Scheme for Lidar SLAM based on Particle Filter in Geometry Feature-Less Environments Yanbin Li et.al. 2502.11486 null
2025-02-16 GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting Zelin Zhou et.al. 2502.10975 null
2025-02-19 MonoForce: Learnable Image-conditioned Physics Engine Ruslan Agishev et.al. 2502.10156 link
2025-02-13 Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions Dario Pisanti et.al. 2502.09795 null
2025-02-13 DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior Mingrui Li et.al. 2502.09111 null
2025-02-12 LIR-LIVO: A Lightweight, Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features Shujie Zhou et.al. 2502.08676 link
2025-02-14 Occupancy-SLAM: An Efficient and Robust Algorithm for Simultaneously Optimizing Robot Poses and Occupancy Map Yingyu Wang et.al. 2502.06292 link
2025-02-09 PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map Yue Pan et.al. 2502.05752 link
2025-02-07 Joint State and Noise Covariance Estimation Kasra Khosoussi et.al. 2502.04584 null
2025-02-05 GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM Mingrui Li et.al. 2502.03228 null
2025-02-04 SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification Yifu Tao et.al. 2502.02657 null
2025-02-04 HeRCULES: Heterogeneous Radar Dataset in Complex Urban Environment for Multi-session Radar SLAM Hanjun Kim et.al. 2502.01946 null
2025-02-03 Statistical enhance learning for modeling and prediction tennis matches at Grand Slam tournaments Nourah Buhamra et.al. 2502.01613 null
2025-02-03 Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter Dabin Kim et.al. 2502.01092 null
2025-02-01 FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps Maximilian Leitenstern et.al. 2502.00395 link
2025-01-31 LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks Liudi Yang et.al. 2501.19382 link
2025-01-31 Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping Yiming Huang et.al. 2501.19319 link
2025-01-31 GO: The Great Outdoors Multimodal Dataset Peng Jiang et.al. 2501.19274 null
2025-01-30 Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems Liudi Yang et.al. 2501.18110 null
2025-01-28 SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios Yinqi Chen et.al. 2501.16754 null
2025-01-27 Visual-Lidar Map Alignment for Infrastructure Inspections Jake McLaughlin et.al. 2501.14486 link
2025-01-24 Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video Xiaohao Xu et.al. 2501.14319 link
2025-01-24 HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting Javier Yu et.al. 2501.14147 null
2025-01-23 FAST-LIVO2 on Resource-Constrained Platforms: LiDAR-Inertial-Visual Odometry with Efficient Memory and Computation Bingyang Zhou et.al. 2501.13876 null
2025-01-23 VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM Gyuhyeon Pak et.al. 2501.13402 null
2025-01-22 Grid-based Submap Joining: An Efficient Algorithm for Simultaneously Optimizing Global Occupancy Map and Local Submap Frames Yingyu Wang et.al. 2501.12764 null
2025-01-21 DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM Jesse Morris et.al. 2501.11893 link
2025-01-21 Survey on Monocular Metric Depth Estimation Jiuling Zhang et.al. 2501.11841 null
2025-01-19 OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors Dominik Kulmer et.al. 2501.11111 link
2025-01-19 Factor Graph-Based Active SLAM for Spacecraft Proximity Operations Lorenzo Ticozzi et.al. 2501.10950 null
2025-01-23 Mesh2SLAM in VR: A Fast Geometry-Based SLAM Framework for Rapid Prototyping in Virtual Reality Applications Carlos Augusto Pinheiro de Sousa et.al. 2501.09600 null
2025-01-16 Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment Maksim Filipenko et.al. 2501.09490 null
2025-01-15 Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures Pengru Deng et.al. 2501.09203 null
2025-01-15 AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning Assaf Lahiany et.al. 2501.09160 null
2025-01-15 SLC$^2$-SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM Yuhang Ming et.al. 2501.08880 null
2025-01-15 GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping Sheng Hong et.al. 2501.08672 null
2025-01-16 BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module Dongzhihan Wang et.al. 2501.08659 null
2025-01-15 Self-Organizing Edge Computing Distribution Framework for Visual SLAM Jussi Kalliola et.al. 2501.08629 null
2025-01-14 VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes Ke Wu et.al. 2501.08286 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2025-01-14 SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting Yue Hu et.al. 2501.07015 null
2025-01-12 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927 link
2025-01-11 SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors Zhen Hong et.al. 2501.06469 null
2025-01-09 Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping Wen Tianci et.al. 2501.05242 null
2025-01-07 SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment Yuchun Fan et.al. 2501.03681 link
2025-01-06 HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos Jinglei Zhang et.al. 2501.02973 null
2025-01-09 LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments Haosong Yue et.al. 2501.02580 link
2025-01-04 ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle Yinchuan Wang et.al. 2501.02166 link
2024-12-31 PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM Runnan Chen et.al. 2501.00352 null
2024-12-30 Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields Evgenii Kruzhkov et.al. 2412.20976 null
2024-12-28 MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing Shuo Wang et.al. 2412.20082 null
2024-12-27 DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction Kai Xu et.al. 2412.19584 null
2024-12-26 MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo Byeonggwon Lee et.al. 2412.19130 null
2024-12-23 End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework Fuhua Jia et.al. 2412.17343 null
2024-12-23 LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation Riku Uemura et.al. 2412.17282 null
2024-12-23 Selective Kalman Filter: When and How to Fuse Multi-Sensor Information to Overcome Degeneracy in SLAM Jie Xu et.al. 2412.17235 null
2025-01-03 Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry Zhaoxing Zhang et.al. 2412.16923 link
2024-12-21 Query Quantized Neural SLAM Sijia Jiang et.al. 2412.16476 link
2024-12-20 SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training Wenxi Chen et.al. 2412.15649 link
2024-12-18 Energy-Efficient SLAM via Joint Design of Sensing, Communication, and Exploration Speed Zidong Han et.al. 2412.13912 null
2024-12-18 Immersive Human-in-the-Loop Control: Real-Time 3D Surface Meshing and Physics Simulation Sait Akturk et.al. 2412.13752 null
2024-12-18 4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching Fernando Amodeo et.al. 2412.13639 link
2024-12-17 NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment Andrea Dunn Beltran et.al. 2412.13176 null
2024-12-18 Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera Zhengdi Yu et.al. 2412.12861 null
2024-12-16 Global SLAM in Visual-Inertial Systems with 5G Time-of-Arrival Integration Meisam Kabiri et.al. 2412.12406 null
2024-12-16 MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors Riku Murai et.al. 2412.12392 null
2024-12-16 Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges Martin Aubard et.al. 2412.11840 null
2024-12-19 RoMeO: Robust Metric Visual Odometry Junda Cheng et.al. 2412.11530 null
2024-12-14 Affine EKF: Exploring and Utilizing Sufficient and Necessary Conditions for Observability Maintenance to Improve EKF Consistency Yang Song et.al. 2412.10809 link
2024-12-13 RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting Lizhi Bai et.al. 2412.09868 null
2024-12-12 SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos Yuzheng Liu et.al. 2412.09401 link
2024-12-12 eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction Jad Mansour et.al. 2412.09209 link
2024-12-12 Drift-free Visual SLAM using Digital Twins Roxane Merat et.al. 2412.08496 null
2024-12-10 A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM Zongbo Liao et.al. 2412.07513 null
2024-12-08 DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments Juwon Kim et.al. 2412.05839 null
2024-12-06 MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos Zhengqi Li et.al. 2412.04463 null
2024-12-05 Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset Fuzhang Han et.al. 2412.04287 link
2024-12-10 MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application Hyesu Jang et.al. 2412.03887 null
2024-12-04 Large-Scale Dense 3D Mapping Using Submaps Derived From Orthogonal Imaging Sonars John McConnell et.al. 2412.03760 null
2024-12-04 BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement Miguel Arturo Vega Torres et.al. 2412.03434 link
2024-12-04 NeRF and Gaussian Splatting SLAM in the Wild Fabian Schmidt et.al. 2412.03263 link
2024-12-04 MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras Huai Yu et.al. 2412.03146 link
2024-12-04 An indoor DSO-based ceiling-vision odometry system for indoor industrial environments Abdelhak Bougouffa et.al. 2412.02950 null
2024-12-03 ROVER: A Multi-Season Dataset for Visual SLAM Fabian Schmidt et.al. 2412.02506 link
2024-12-04 RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting Zhenzhong Cao et.al. 2412.01217 link
2024-12-02 Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM Alejandro Fontan et.al. 2412.01116 null
2024-12-02 LiDAR SLAMMOT based on Confidence-guided Data Association Susu Fang et.al. 2412.01041 null
2024-12-01 FlashSLAM: Accelerated RGB-D SLAM for Real-Time 3D Scene Reconstruction with Gaussian Splatting Phu Pham et.al. 2412.00682 null
2024-11-29 Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction Shaoxiang Wang et.al. 2412.00242 null
2024-11-28 Visual SLAMMOT Considering Multiple Motion Models Peilin Tian et.al. 2411.19134 null
2024-11-27 ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching Yangrui Dong et.al. 2411.18174 null
2024-11-27 HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction Wei Zhang et.al. 2411.17982 link
2024-11-26 MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework Xiangcheng Hu et.al. 2411.17928 link
2024-11-29 DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting Christian Homeyer et.al. 2411.17660 link
2024-11-25 MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM Vladimir Yugay et.al. 2411.16785 null
2024-11-24 Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors Soumava Paul et.al. 2411.15966 null
2024-11-24 Near-Range Environmental Perception for Inland Waterway Vessels: A Comparative Study of LiDAR and Automotive FMCW RADAR Sensors R. Herrmann et.al. 2411.15901 null
2024-11-24 PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments Haoang Li et.al. 2411.15800 null
2024-11-23 Gassidy: Gaussian Splatting SLAM in Dynamic Environments Long Wen et.al. 2411.15476 null
2024-11-22 OVO-SLAM: Open-Vocabulary Online Simultaneous Localization and Mapping Tomas Berriel Martins et.al. 2411.15043 link
2024-11-22 A Benchmark Dataset for Collaborative SLAM in Service Environments Harin Park et.al. 2411.14775 link
2024-11-21 InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation Marziyeh Bamdad et.al. 2411.14358 link
2024-11-20 Robust Monocular Visual Odometry using Curriculum Learning Assaf Lahiany et.al. 2411.13438 null
2024-11-20 Moving Horizon Estimation for Simultaneous Localization and Mapping with Robust Estimation Error Bounds Jelena Trisovic et.al. 2411.13310 null
2024-11-19 3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality Hanbeom Chang et.al. 2411.12514 null
2024-11-19 LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments Renxiang Xiao et.al. 2411.12185 null
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-18 The Blue Horizontal-Branch Stars From the LAMOST Survey: Atmospheric Parameters Jie Ju et.al. 2411.11250 null
2024-11-17 A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality Wei-Hsiang Lien et.al. 2411.10940 null
2024-11-16 DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment Mangyu Kong et.al. 2411.10722 link
2024-11-15 The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods Yifu Tao et.al. 2411.10546 null
2024-11-15 BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation Yufei Wei et.al. 2411.10195 null
2024-11-13 DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization Yueming Xu et.al. 2411.08373 null
2024-11-13 MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation Peng Wang et.al. 2411.08279 link
2024-11-12 Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments Ankit Shaw et.al. 2411.08231 null
2024-11-12 NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN Sonia Raychaudhuri et.al. 2411.07848 null
2024-11-11 Lost in Tracking Translation: A Comprehensive Analysis of Visual SLAM in Human-Centered XR and IoT Ecosystems Yasra Chandio et.al. 2411.07146 null
2024-11-11 Learning from Feedback: Semantic Enhancement for Object SLAM Using Foundation Models Jungseok Hong et.al. 2411.06752 null
2024-11-11 HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation Xiaolong Wang et.al. 2411.06700 null
2024-11-08 Development of an indoor localization and navigation system based on monocular SLAM for mobile robots Thanh Nguyen Canh et.al. 2411.05337 null
2024-11-07 Development of a Service Robot for Hospital Environments in Rehabilitation Medicine with LiDAR Based Simultaneous Localization and Mapping Sayat Ibrayev et.al. 2411.04797 null
2024-11-07 MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation Sayan Paul et.al. 2411.04796 null
2024-11-09 DEIO: Deep Event Inertial Odometry Weipeng Guan et.al. 2411.03928 link
2024-11-06 Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward Shashi Kumar et.al. 2411.03866 null
2024-11-06 LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior Jiahui Wang et.al. 2411.03610 link
2024-11-05 LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting Huibin Zhao et.al. 2411.02703 null
2024-11-04 Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing Xinran Zhang et.al. 2411.02553 null
2024-11-04 Semantic Masking and Visual Feature Matching for Robust Localization Luisa Mao et.al. 2411.01804 null
2024-10-31 XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM Xiaomeng Wang et.al. 2410.23690 link
2024-10-30 LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM Yucheng Huang et.al. 2410.23231 link
2024-10-30 ISAC Prototype System for Multi-Domain Cooperative Communication Networks Jie Yang et.al. 2410.22956 null
2024-10-30 SCRREAM: SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark HyunJun Jung et.al. 2410.22715 link
2024-10-29 LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues Hanqing Jiang et.al. 2410.22213 null
2024-10-29 EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments Linus Nwankwo et.al. 2410.22200 null
2024-10-28 NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments Taiyi Pan et.al. 2410.21615 link
2024-10-28 coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM Emiliano Höss et.al. 2410.21149 link
2024-11-01 RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior Mingjiang Liang et.al. 2410.20358 null
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-22 AG-SLAM: Active Gaussian Splatting SLAM Wen Jiang et.al. 2410.17422 null
2024-10-22 Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study J. Jorge et.al. 2410.17171 null
2024-10-19 EndoMetric: Near-light metric scale monocular SLAM Raúl Iranzo et.al. 2410.15065 null
2024-10-17 Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot Dongkun Han et.al. 2410.13612 null
2024-10-17 TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal Yanpeng Jia et.al. 2410.13240 null
2024-10-16 QueensCAMP: an RGB-D dataset for robust Visual SLAM Hudson M. S. Bruno et.al. 2410.12520 link
2024-10-18 PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM Guanghao Li et.al. 2410.12324 null
2024-10-16 Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem Yichen Sha et.al. 2410.12169 null
2024-10-15 V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting Tuan Dang et.al. 2410.12068 link
2024-10-15 GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information Wancai Zheng et.al. 2410.11356 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 link
2024-10-14 MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator Taozhe Li et.al. 2410.10669 null
2024-10-13 Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph Benoit Casseau et.al. 2410.09896 null
2024-10-12 SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs Wenxi Chen et.al. 2410.09503 link
2024-10-12 An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation Wei Liang et.al. 2410.09443 null
2024-10-12 ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras Junkai Niu et.al. 2410.09374 link
2024-10-11 Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System Zheng Liu et.al. 2410.08935 link
2024-10-11 Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints Yicheng He et.al. 2410.08780 null
2024-10-10 ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization Mason B. Peterson et.al. 2410.08262 link
2024-10-10 IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera Jian Huang et.al. 2410.08107 link
2024-10-08 Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching Gongxin Yao et.al. 2410.06285 null
2024-10-08 Submodular Optimization for Keyframe Selection & Usage in SLAM David Thorne et.al. 2410.05576 null
2024-10-07 SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones Denis Davletshin et.al. 2410.05405 null
2024-10-07 Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection Ang He et.al. 2410.05017 null
2024-10-05 A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems Nikola Radulov et.al. 2410.04242 link
2024-10-05 High-Speed Stereo Visual SLAM for Low-Powered Computing Devices Ashish Kumar et.al. 2410.04090 link
2024-10-04 EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM Shi Chen et.al. 2410.03812 null
2024-10-04 Estimating Body and Hand Motion in an Ego-sensed World Brent Yi et.al. 2410.03665 null
2024-10-03 LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features Zihao Dong et.al. 2410.02961 null
2024-10-02 ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space Hogyun Kim et.al. 2410.01325 null
2024-10-01 Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency William Dubois et.al. 2410.00758 null
2024-10-02 CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM Dapeng Feng et.al. 2410.00486 link
2024-09-30 Additively Manufactured Open-Source Quadruped Robots for Multi-Robot SLAM Applications Zachary Fuge et.al. 2410.00122 null
2024-09-30 Direct Multipath-Based SLAM Mingchao Liang et.al. 2409.20552 null
2024-09-30 Robust Gaussian Splatting SLAM by Leveraging Loop Closure Zunjie Zhu et.al. 2409.20111 null
2024-09-30 DynORecon: Dynamic Object Reconstruction for Navigation Yiduo Wang et.al. 2409.19928 null
2024-09-29 CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation Yifan Duan et.al. 2409.19597 null
2024-09-29 CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought Yexing Du et.al. 2409.19510 link
2024-09-29 Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface Ziniu Wu et.al. 2409.19499 null
2024-09-27 Royal Reveals: LiDAR Mapping of Kronborg Castle, Echoes of Hamlet’s Halls Leon Davies et.al. 2409.18752 null
2024-09-26 BlinkTrack: Feature Tracking over 100 FPS via Events and Images Yichen Shen et.al. 2409.17981 null
2024-09-26 Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry Qi Zhang et.al. 2409.17729 null
2024-09-26 Event-based Stereo Depth Estimation: A Survey Suman Ghosh et.al. 2409.17680 null
2024-09-25 Efficient Submap-based Autonomous MAV Exploration using Visual-Inertial SLAM Configurable for LiDARs or Depth Cameras Sotiris Papatheodorou et.al. 2409.16972 null
2024-09-25 Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM Phu Pham et.al. 2409.16944 null
2024-09-25 Inline Photometrically Calibrated Hybrid Visual SLAM Nicolas Abboud et.al. 2409.16810 link
2024-09-25 Topological SLAM in colonoscopies leveraging deep features and topological priors Javier Morlana et.al. 2409.16806 link
2024-09-25 Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots Masoud Dayani Najafabadi et.al. 2409.16595 link
2024-09-25 Task-driven SLAM Benchmarking Yanwei Du et.al. 2409.16573 link
2024-09-24 SoMaSLAM: 2D Graph SLAM for Sparse Range Sensing with Soft Manhattan World Constraints Jeahn Han et.al. 2409.15736 null
2024-09-23 Spectral Graph Theoretic Methods for Enhancing Network Robustness in Robot Localization Neelkamal Somisetty et.al. 2409.15506 null
2024-09-22 SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms Niraj Pudasaini et.al. 2409.14515 null
2024-09-21 Point Cloud Structural Similarity-based Underwater Sonar Loop Detection Donghwi Jung et.al. 2409.14020 link
2024-09-20 HMD$^2$: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device Vladimir Guzov et.al. 2409.13426 null
2024-09-20 Learning Visual Information Utility with PIXER Yash Turkar et.al. 2409.13151 null
2024-09-19 MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting Yan Song Hu et.al. 2409.13055 null
2024-09-19 Hi-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting Boying Li et.al. 2409.12518 link
2024-09-18 Bundle Adjustment in the Eager Mode Zitong Zhan et.al. 2409.12190 null
2024-09-23 Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping Jaehyung Jung et.al. 2409.12051 null
2024-09-18 Metric-Semantic Factor Graph Generation based on Graph Neural Networks Jose Andres Millan-Romera et.al. 2409.11972 null
2024-09-18 Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments Lei Cheng et.al. 2409.11854 null
2024-09-18 ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation Yanlin Jin et.al. 2409.11692 null
2024-09-18 SLAM assisted 3D tracking system for laparoscopic surgery Jingwei Song et.al. 2409.11688 null
2024-09-17 GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure Ziheng Xu et.al. 2409.10982 null
2024-09-17 Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells Ankit Butola et.al. 2409.10971 null
2024-09-17 Evaluating and Improving the Robustness of LiDAR-based Localization and Mapping Bo Yang et.al. 2409.10824 link
2024-09-16 P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty Yufan Zhang et.al. 2409.10143 link
2024-09-16 SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning Amogh Joshi et.al. 2409.09990 null
2024-09-16 Enhancing Visual Inertial SLAM with Magnetic Measurements Bharat Joshi et.al. 2409.09904 null
2024-09-15 Marginalizing and Conditioning Gaussians onto Linear Approximations of Smooth Manifolds with Applications in Robotics Zi Cong Guo et.al. 2409.09871 link
2024-09-15 Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping Yi Liu et.al. 2409.09763 null
2024-09-15 High Definition Map Mapping and Update: A General Overview and Future Directions Benny Wijaya et.al. 2409.09726 null
2024-09-14 MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry Yuheng Qiu et.al. 2409.09479 null
2024-09-14 Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM Haoying Li et.al. 2409.09410 null
2024-09-14 GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians Dasong Gao et.al. 2409.09295 link
2024-09-14 Panoramic Direct LiDAR-assisted Visual Odometry Zikang Yuan et.al. 2409.09287 link
2024-09-11 Object Depth and Size Estimation using Stereo-vision and Integration with SLAM Layth Hamad et.al. 2409.07623 null
2024-09-11 Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry Anbo Tao et.al. 2409.06948 null
2024-09-10 Technical Report of Mobile Manipulator Robot for Industrial Environments Erfan Amoozad Khalili et.al. 2409.06693 null
2024-09-10 Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios Zhiqiang Chen et.al. 2409.04961 link
2024-09-08 FLAF: Focal Line and Feature-constrained Active View Planning for Visual Teach and Repeat Changfei Fu et.al. 2409.03457 null
2024-09-03 Integration of Augmented Reality and Mobile Robot Indoor SLAM for Enhanced Spatial Awareness Michael D. Friske et.al. 2409.01915 null
2024-09-03 Explicit Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric Tingchen Ma et.al. 2409.01856 null
2024-09-02 Saying goodbyes to rotating your phone: Magnetometer calibration during SLAM Ilari Vallivaara et.al. 2409.01242 null
2024-09-02 Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection Manon Kok et.al. 2409.01091 null
2024-09-02 Robust Vehicle Localization and Tracking in Rain using Street Maps Yu Xiang Tan et.al. 2409.01038 link
2024-08-31 UDGS-SLAM: UniDepth Assisted Gaussian Splatting for Monocular SLAM Mostafa Mansour et.al. 2409.00362 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-08-30 Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning Shuyang Zhang et.al. 2408.17005 link
2024-08-29 Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry Michael Adlerstein et.al. 2408.16472 null
2024-08-28 Single-Photon 3D Imaging with Equi-Depth Photon Histograms Kaustubh Sadekar et.al. 2408.16150 null
2024-08-28 BIM-SLAM: Integrating BIM Models in Multi-session SLAM for Lifelong Mapping using 3D LiDAR Miguel Arturo Vega Torres et.al. 2408.15870 link
2024-08-30 Addressing the challenges of loop detection in agricultural environments Nicolás Soncini et.al. 2408.15761 link
2024-08-28 ES-PTAM: Event-based Stereo Parallel Tracking and Mapping Suman Ghosh et.al. 2408.15605 link
2024-08-28 PointEMRay: A Novel Efficient SBR Framework on Point Based Geometry Kaiqiao Yang et.al. 2408.15583 null
2024-09-02 Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration Rongge Zhang et.al. 2408.14726 link
2024-08-26 A Survey on Reinforcement Learning Applications in SLAM Mohammad Dehghani Tezerjani et.al. 2408.14518 null
2024-08-28 FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry Chunran Zheng et.al. 2408.14035 link
2024-08-21 Informed, Constrained, Aligned: A Field Analysis on Degeneracy-aware Point Cloud Registration in the Wild Turcan Tuna et.al. 2408.11809 null
2024-08-21 LiFCal: Online Light Field Camera Calibration via Bundle Adjustment Aymeric Fleith et.al. 2408.11682 null
2024-08-21 Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars Zhihao Lin et.al. 2408.11582 null
2024-08-21 RaNDT SLAM: Radar SLAM Based on Intensity-Augmented Normal Distributions Transform Maximilian Hilger et.al. 2408.11576 link
2024-08-21 Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models Kento Kawaharazuka et.al. 2408.11380 null
2024-08-20 LoopSplat: Loop Closure by Registering 3D Gaussian Splats Liyuan Zhu et.al. 2408.10154 link
2024-08-19 Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM Sanghyun Hahn et.al. 2408.09727 link
2024-08-17 GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System Shuo Wang et.al. 2408.09191 null
2024-08-15 GOReloc: Graph-based Object-Level Relocalization for Visual SLAM Yutong Wang et.al. 2408.07917 link
2024-08-14 Inverse k-visibility for RSSI-based Indoor Geometric Mapping Junseo Kim et.al. 2408.07757 null
2024-08-14 Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition Hogyun Kim et.al. 2408.07330 link
2024-08-12 CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments Yanpeng Jia et.al. 2408.05981 null
2024-08-21 Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis Zhongche Qu et.al. 2408.05635 null
2024-08-10 TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping Seoyeon Jang et.al. 2408.05453 null
2024-08-08 Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods Yiming Zhou et.al. 2408.04268 null
2024-08-07 Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM Yan Song Hu et.al. 2408.03825 null
2024-08-07 AirSLAM: An Efficient and Illumination-Robust Point-Line Visual SLAM System Kuan Xu et.al. 2408.03520 link
2024-08-06 BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications G. Manni et.al. 2408.03078 link
2024-08-04 SLAMS-Propelled Electron Acceleration at High-Mach Number Astrophysical Shocks Vladimir Zeković et.al. 2408.02084 null
2024-08-03 Visual-Inertial SLAM for Agricultural Robotics: Benchmarking the Benefits and Computational Costs of Loop Closing Fabian Schmidt et.al. 2408.01716 link
2024-08-03 Deep Patch Visual SLAM Lahav Lipson et.al. 2408.01654 link
2024-08-02 Momentum Capture and Prediction System Based on Wimbledon Open2023 Tournament Data Chang Liu et.al. 2408.01544 null
2024-08-07 IG-SLAM: Instant Gaussian SLAM F. Aykut Sarikamis et.al. 2408.01126 null
2024-08-01 Collecting Large-Scale Robotic Datasets on a High-Speed Mobile Platform Yuxin Lin et.al. 2408.00545 null
2024-08-01 High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets Jian Li et.al. 2408.00538 link
2024-07-31 SuperVINS: A visual-inertial SLAM framework integrated deep learning features Hongkun Luo et.al. 2407.21348 link
2024-07-30 NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding Hongjia Zhai et.al. 2407.20853 null
2024-07-29 A flexible framework for accurate LiDAR odometry, map manipulation, and localization José Luis Blanco-Claraco et.al. 2407.20465 link
2024-07-28 Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data Azmyin Md. Kamal et.al. 2407.19518 null
2024-07-26 Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation Aditya Penumarti et.al. 2407.19046 null
2024-07-26 HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM Zhe Xin et.al. 2407.18813 null
2024-07-25 CodedVO: Coded Visual Odometry Sachin Shah et.al. 2407.18240 null
2024-07-28 HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation Zhenzhi Wang et.al. 2407.17438 link
2024-07-22 Memory Management for Real-Time Appearance-Based Loop Closure Detection Mathieu Labbé et.al. 2407.15890 null
2024-07-22 Reinforcement Learning Meets Visual Odometry Nico Messikommer et.al. 2407.15626 link
2024-07-22 Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM Mathieu Labbé et.al. 2407.15305 null
2024-07-21 Semi-Supervised Pipe Video Temporal Defect Interval Localization Zhu Huang et.al. 2407.15170 null
2024-07-21 VoxDepth: Rectification of Depth Images on Edge Devices Yashashwee Chakrabarty et.al. 2407.15067 null
2024-07-20 From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM Lorenzo Montano-Oliván et.al. 2407.14797 null
2024-07-19 MSSP: A Versatile Multi-Scenario Adaptable Intelligent Robot Simulation Platform Based on LIDAR-Inertial Fusion Qiyan Li et.al. 2407.14102 null
2024-07-18 A New Tightly-Coupled Dual-VIO for a Mobile Manipulator With Dynamic Locomotion Jianxiang Xu et.al. 2407.13878 link
2024-07-18 Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM Baicheng Li et.al. 2407.13338 null
2024-07-18 Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain Bach Nguyen Gia et.al. 2407.13159 link
2024-07-17 Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge Andrea Albanese et.al. 2407.12663 null
2024-07-17 Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM Markus Weißflog et.al. 2407.12408 null
2024-07-19 Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion Sangjun Lee et.al. 2407.12405 link
2024-07-17 Fusion LiDAR-Inertial-Encoder data for High-Accuracy SLAM Manh Do Duc et.al. 2407.11870 null
2024-07-17 GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection Jingwen Yu et.al. 2407.11736 link
2024-07-16 Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems Jianzhu Huai et.al. 2407.11705 null
2024-07-16 Batch SLAM with PMBM Data Association Sampling and Graph-Based Optimization Yu Ge et.al. 2407.11643 null
2024-07-16 I$^2$-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM Gwangtak Bae et.al. 2407.11347 null
2024-07-16 FR-SLAM: A SLAM Improvement Method Based on Floor Plan Registration Jiantao Feng et.al. 2407.11299 null
2024-07-15 Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method Adam Korycki et.al. 2407.11238 null
2024-07-12 An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural Networks Seyed Alireza Rahimi Azghadi et.al. 2407.09242 null
2024-07-11 SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM Neng Wang et.al. 2407.08106 link
2024-07-09 Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM David Hug et.al. 2407.07074 link
2024-07-15 A Neurosymbolic Approach to Adaptive Feature Extraction in SLAM Yasra Chandio et.al. 2407.06889 null
2024-07-08 Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots Siva Krishna Ravipati et.al. 2407.06077 link
2024-07-10 Co-RaL: Complementary Radar-Leg Odometry with 4-DoF Optimization and Rolling Contact Sangwoo Jung et.al. 2407.05820 null
2024-07-07 Active Collaborative Visual SLAM exploiting ORB Features Muhammad Farhan Ahmed et.al. 2407.05453 null
2024-07-06 VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking Xuefeng Jiang et.al. 2407.05017 null
2024-07-06 Symmetric Linear Arc Monadic Datalog and Gadget Reductions Manuel Bodirsky et.al. 2407.04924 null
2024-07-03 Ultra-Lightweight Collaborative Mapping for Robot Swarms Vlad Niculescu et.al. 2407.03136 null
2024-07-01 RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields Haochen Jiang et.al. 2407.01303 link
2024-07-01 Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation Lianjie Guo et.al. 2407.01292 link
2024-07-01 Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization Ruofei Bai et.al. 2407.01013 link
2024-06-30 Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation Adnan Abdullah et.al. 2407.00848 null
2024-06-30 OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration Fengyuan Yang et.al. 2407.00574 null
2024-06-24 Compressing Search with Language Models Thomas Mulc et.al. 2407.00085 null
2024-06-28 CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services DongKi Noh et.al. 2406.19634 null
2024-06-25 Benchmarking SLAM Algorithms in the Cloud: The SLAM Hive System Xinzhe Liu et.al. 2406.17586 null
2024-07-02 SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation Xu Liu et.al. 2406.17249 link
2024-06-24 From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking Xiaohao Xu et.al. 2406.16850 link
2024-06-23 Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy Chen Wang et.al. 2406.16087 null
2024-06-19 Simultaneous Map and Object Reconstruction Nathaniel Chodosh et.al. 2406.13896 null
2024-06-14 Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization Wonho Song et.al. 2406.11599 null
2024-06-16 Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry Boris Chidlovskii et.al. 2406.11019 null
2024-06-15 Detection and Utilization of Reflections in LiDAR Scans Through Plane Optimization and Plane SLAM Yinjie Li et.al. 2406.10494 link
2024-06-12 From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers Swaminathan Gurumurthy et.al. 2406.07785 link
2024-06-27 Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF) Gyubeom Im et.al. 2406.06427 null
2024-06-10 Notes on Various Errors and Jacobian Derivations for SLAM Gyubeom Im et.al. 2406.06422 null
2024-06-23 Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation Shenghao Li et.al. 2406.06374 link
2024-06-15 Visual-Inertial SLAM as Simple as A, B, VINS Nathaniel Merrill et.al. 2406.05969 null
2024-06-09 MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps Jianhao Zheng et.al. 2406.05849 null
2024-06-06 Open Problem: Active Representation Learning Nikola Milosevic et.al. 2406.03845 null
2024-06-04 ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization Chen Mao et.al. 2406.01906 link
2024-06-03 The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry Paolo Cudrano et.al. 2406.01797 null
2024-06-03 Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry Takayuki Kanai et.al. 2406.00929 null
2024-06-02 Visual place recognition for aerial imagery: A survey Ivan Moskalenko et.al. 2406.00885 link
2024-05-30 Structure Gaussian SLAM with Manhattan World Hypothesis Shuhong Liu et.al. 2405.20031 null
2024-05-30 Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar Wouter Jansen et.al. 2405.19869 null
2024-05-30 SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization Jiang Wang et.al. 2405.19813 link
2024-05-30 TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM Peifeng Jiang et.al. 2405.19614 null
2024-05-27 CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy Richard Elvira et.al. 2405.16932 null
2024-05-26 Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians Erik Sandström et.al. 2405.16544 link
2024-05-24 NeB-SLAM: Neural Blocks-based Scalable RGB-D SLAM for Unknown Scenes Lizhi Bai et.al. 2405.15151 null
2024-05-23 ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization Han Song et.al. 2405.15082 null
2024-05-23 Synergistic Global-space Camera and Human Reconstruction from Videos Yizhou Zhao et.al. 2405.14855 null
2024-05-23 CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments Yang Zhou et.al. 2405.14731 link
2024-05-23 Efficient Robot Learning for Perception and Mapping Niclas Vödisch et.al. 2405.14688 null
2024-05-22 Monocular Gaussian SLAM with Language Extended Loop Closure Tian Lan et.al. 2405.13748 null
2024-05-26 NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments Dongha Chung et.al. 2405.12563 link
2024-05-20 EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving Boyi Liu et.al. 2405.12120 null
2024-05-24 Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation Hyungtae Lim et.al. 2405.11176 null
2024-05-18 MotionGS : Compact Gaussian Splatting SLAM by Motion Filter Xinli Guo et.al. 2405.11129 link
2024-05-17 CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion Gang Wang et.al. 2405.10793 null
2024-05-17 Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map Liang Zhao et.al. 2405.10743 null
2024-05-10 MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization Pengcheng Zhu et.al. 2405.06241 null
2024-05-07 Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map Yuxuan Xia et.al. 2405.04290 null
2024-05-07 IMU-Aided Event-based Stereo Visual Odometry Junkai Niu et.al. 2405.04071 link
2024-04-27 An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation Olivier Brochu Dufour et.al. 2404.17745 null
2024-04-26 Camera Motion Estimation from RGB-D-Inertial Scene Flow Samuel Cerezo et.al. 2404.17251 link
2024-04-23 Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization Lahav Lipson et.al. 2404.15263 link
2024-04-18 SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints Spencer Carmichael et.al. 2404.12339 null
2024-04-17 VBR: A Vision Benchmark in Rome Leonardo Brizi et.al. 2404.11322 link
2024-04-14 Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration Yanhao Zhang et.al. 2404.09169 link
2024-04-06 Salient Sparse Visual Odometry With Pose-Only Supervision Siyu Chen et.al. 2404.04677 null
2024-03-25 A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments Gianluca D’Amico et.al. 2403.17084 null
2024-03-19 On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine Jagatpreet Singh Nir et.al. 2403.13170 null
2024-03-18 The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions Margaret Hansen et.al. 2403.12194 null
2024-03-18 An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation Zewen Xu et.al. 2403.11639 null
2024-03-16 Efficient Domain Adaptation for Endoscopic Visual Odometry Junyang Wu et.al. 2403.10860 null
2024-03-14 Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO) Matthew Lisondra et.al. 2403.09882 null
2024-03-02 Grid-based Fast and Structural Visual Odometry Zhang Zhihe et.al. 2403.01110 null
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961 link
2024-02-22 Secure Navigation using Landmark-based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2402.14280 null
2024-02-19 Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment Ganesh Sapkota et.al. 2402.12551 null
2024-02-07 Online and Certifiably Correct Visual Odometry and Mapping Devansh R Agrawal et.al. 2402.05254 null
2024-02-06 YOLOPoint Joint Keypoint and Object Detection Anton Backhaus et.al. 2402.03989 link
2024-01-19 Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning André O. Françani et.al. 2401.10857 null
2024-01-17 Event-Based Visual Odometry on Non-Holonomic Ground Vehicles Wanting Xu et.al. 2401.09331 link
2024-01-11 On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering Feng Zhu et.al. 2401.05836 null
2023-12-19 Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry Olaya Álvarez-Tuñón et.al. 2401.05396 link
2024-01-07 Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people Ali Samadzadeh et.al. 2401.03604 link
2024-01-03 LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry Weirong Chen et.al. 2401.01887 link
2023-12-28 SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction Zikang Yuan et.al. 2312.16800 link
2023-12-20 NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields Jens Naumann et.al. 2312.13471 null
2023-12-22 Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM Junru Lin et.al. 2312.13332 null
2023-12-20 Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach Habib Boloorchi Tabrizi et.al. 2312.13162 link
2023-12-20 Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera Abdulkadhem A. Abdulkadhem et.al. 2312.12680 null
2023-12-15 Deep Event Visual Odometry Simon Klenk et.al. 2312.09800 link
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889 null
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141 link
2023-11-30 Event-based Visual Inertial Velometer Xiuyuan Lu et.al. 2311.18189 null
2023-11-21 CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems Young-Hee Lee et.al. 2311.12580 null
2023-11-10 Dense Visual Odometry Using Genetic Algorithm Slimane Djema et.al. 2311.06149 null
2023-11-07 Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM Seongwook Yoon et.al. 2311.03722 null
2023-10-23 Converting Depth Images and Point Clouds for Feature-based Pose Estimation Robert Lösch et.al. 2310.14924 link
2023-10-17 Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms Yanyan Li et.al. 2310.10931 link
2023-10-12 Jointly Optimized Global-Local Visual Localization of UAVs Haoling Li et.al. 2310.08082 null
2023-10-10 l-dyno: framework to learn consistent visual features using robot’s motion Kartikeya Singh et.al. 2310.06249 link
2023-10-08 XVO: Generalized Visual Odometry via Cross-Modal Self-Training Lei Lai et.al. 2309.16772 null
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268 link
2023-09-23 Tag-based Visual Odometry Estimation for Indoor UAVs Localization Massimiliano Bertoni et.al. 2309.13311 null
2023-09-22 Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms Olivier Gamache et.al. 2309.13139 link
2023-09-20 Conformalized Multimodal Uncertainty Regression and Reasoning Domenico Parente et.al. 2309.11018 null
2023-09-20 OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving Heng Li et.al. 2309.11011 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436 link
2023-09-21 Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration Hongbo Zhao et.al. 2309.10314 null
2023-09-18 End-to-End Learned Event- and Image-based Visual Odometry Roberto Pellerito et.al. 2309.09947 link
2023-09-14 An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments Yehao Liu et.al. 2309.07408 null
2023-09-11 Evaluating Visual Odometry Methods for Autonomous Driving in Rain Yu Xiang Tan et.al. 2309.05249 null
2023-09-08 Robot Localization and Mapping Final Report – Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry Akankshya Kar et.al. 2309.04147 null
2023-09-04 EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity Zijie Jiang et.al. 2309.01296 null
2023-08-27 Deep Learning for Visual Localization and Mapping: A Survey Changhao Chen et.al. 2308.14039 null
2023-08-19 Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters Xiao Liu et.al. 2308.09870 link
2023-08-12 4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion Guirong Zhuo et.al. 2308.06573 null
2023-08-10 Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU U. V. B. L. Udugama et.al. 2308.05515 null
2023-08-02 A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry Cora A. Dimmig et.al. 2308.01398 null
2023-08-02 Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network Shenbagaraj Kannapiran et.al. 2308.01125 null
2023-08-02 Preliminary Design of the Dragonfly Navigation Filter Ben Schilling et.al. 2307.13513 null
2023-07-19 Optimizing the extended Fourier Mellin Transformation Algorithm Wenqing Jiang et.al. 2307.10015 link
2023-07-15 Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents Ke Cao et.al. 2307.07763 null
2023-07-26 Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression Jianeng Wang et.al. 2306.01188 null
2023-07-06 OSPC: Online Sequential Photometric Calibration Jawad Haidar et.al. 2305.17673 null
2023-05-15 Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface Shifan Zhu et.al. 2305.08962 null
2023-05-10 Transformer-based model for monocular visual odometry: a video understanding approach André O. Françani et.al. 2305.06121 link
2023-04-29 Modality-invariant Visual Odometry for Embodied Vision Marius Memmel et.al. 2305.00348 link
2023-04-21 FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving Yuxuan Liu et.al. 2304.10719 null
2023-07-08 Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Visual Bootstrapping Hanyu Cai et.al. 2304.08978 null
2023-04-12 SiLK – Simple Learned Keypoints Pierre Gleize et.al. 2304.06194 link
2023-04-11 ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster Yifei Dong et.al. 2304.04943 null
2023-03-21 Learning a Depth Covariance Function Eric Dexheimer et.al. 2303.12157 null
2023-03-21 Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network Alessandro Navone et.al. 2303.11725 null
2023-03-20 VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors Thien Hoang Nguyen et.al. 2303.10903 null
2023-03-17 CoVIO: Online Continual Learning for Visual-Inertial Odometry Niclas Vödisch et.al. 2303.10149 link
2023-03-15 UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry Chaoyang Jiang et.al. 2303.08550 null
2023-03-13 Discovering Multiple Algorithm Configurations Leonid Keselman et.al. 2303.07434 null
2023-03-09 Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation Masahiro Hirano et.al. 2303.05192 null
2023-03-16 Stereo Event-based Visual-Inertial Odometry Kunfeng Wang et.al. 2303.05086 link
2023-03-07 Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor Eduardo Gallo et.al. 2303.03804 null
2023-03-03 Lightweight, Uncertainty-Aware Conformalized Visual Odometry Alex C. Stutts et.al. 2303.02207 null
2023-02-24 FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets Yelena Randall et.al. 2302.12772 null
2023-02-27 CP+: Camera Poses Augmentation with Large-scale LiDAR Maps Jiadi Cui et.al. 2302.12198 null
2023-02-19 EdgeVO: An Efficient and Accurate Edge-based Visual Odometry Hui Zhao et.al. 2302.09493 null
2023-01-27 HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera Mostafa Ahmadi et.al. 2301.11823 null
2023-01-26 Distributed Optimization Methods for Multi-Robot Systems: Part I – A Tutorial Ola Shorinwa et.al. 2301.11313 null
2023-01-24 Generalized Object Search Kaiyu Zheng et.al. 2301.10121 null
2023-01-22 Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories Hanlin Chen et.al. 2301.09194 null
2023-01-21 Dense RGB SLAM with Neural Implicit Maps Heng Li et.al. 2301.08930 null
2023-01-18 Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information Junshi Chen et.al. 2301.07560 null
2023-01-17 COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM Manthan Patel et.al. 2301.07147 link
2023-01-31 Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems Pierre-Yves Lajoie et.al. 2301.06230 link
2023-01-13 A LiDAR-Inertial-Visual SLAM System with Loop Detection Kangcheng Liu et.al. 2301.05604 null
2023-01-11 AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization Ying Chen et.al. 2301.04620 link
2023-01-12 TBV Radar SLAM – trust but verify loop candidates Daniel Adolfsson et.al. 2301.04397 link
2022-12-31 Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges Maxwell McManus et.al. 2301.03359 null
2023-01-09 Motion Addition and Motion Optimization Liqun Qi et.al. 2301.03174 null
2023-01-08 Towards Open World NeRF-Based SLAM Daniil Lisus et.al. 2301.03102 null
2023-01-06 CyberLoc: Towards Accurate Long-term Visual Localization Liu Liu et.al. 2301.02403 null
2023-01-03 LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation Shreyansh Daftry et.al. 2301.01350 null
2022-12-31 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions Patrick Wenzel et.al. 2301.01147 null
2023-01-03 BS3D: Building-scale 3D Reconstruction from RGB-D Images Janne Mustaniemi et.al. 2301.01057 null
2023-01-10 An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping Masoud Dayani Najafabadi et.al. 2301.00618 link
2022-12-25 A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion Nadia Figueroa et.al. 2212.14772 null
2022-12-29 An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping Kangcheng Liu et.al. 2212.14209 link
2022-12-27 Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands Felipe Gómez-Cuba et.al. 2212.13477 link
2022-12-26 ESVIO: Event-based Stereo Visual Inertial Odometry Peiyu Chen et.al. 2212.13184 link
2022-12-24 A Comprehensive Review on Autonomous Navigation Saeid Nahavandi et.al. 2212.12808 null
2022-12-23 Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation Marina Lotti et.al. 2212.12388 null
2022-12-23 Implementation of a Blind navigation method in outdoors/indoors areas Mohammad Javadian Farzaneh et.al. 2212.12185 null
2022-12-22 S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations Hriday Bavle et.al. 2212.11770 link
2022-12-22 Active SLAM: A Review On Last Decade Muhammad Farhan Ahmed et.al. 2212.11654 null
2022-12-27 Motion, Unit Dual Quaternion and Motion Optimization Liqun Qi et.al. 2212.11593 null
2022-12-22 Vision-Based Environmental Perception for Autonomous Driving Fei Liu et.al. 2212.11453 null
2022-12-19 Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models Yong Cheng et.al. 2212.09553 null
2022-12-16 Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments Lasitha Weerakoon et.al. 2212.08633 null
2022-12-16 rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments Bo Wei et.al. 2212.08418 null
2023-03-02 AirVO: An Illumination-Robust Point-Line Visual Odometry Kuan Xu et.al. 2212.07595 link
2022-12-14 Autonomous Vehicle Navigation with LIDAR using Path Planning Rahul M K et.al. 2212.07155 null
2022-12-14 RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping Hyowon Kim et.al. 2212.07141 null
2022-12-13 Know What You Don’t Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version) Daniil Lisus et.al. 2212.06923 null
2022-12-13 SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance Chenyangguang Zhang et.al. 2212.06524 null
2022-12-13 Localization and Navigation System for Indoor Mobile Robot Yanbaihui Liu et.al. 2212.06391 null
2022-12-12 Evaluation of RGB-D SLAM in Large Indoor Environments Kirill Muravyev et.al. 2212.05980 null
2022-12-19 A Light-Weight LiDAR-Inertial SLAM System with Loop Closing Kangcheng Liu et.al. 2212.05743 link
2022-12-12 An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds Kangcheng Liu et.al. 2212.05705 link
2022-12-09 SLAM for Visually Impaired People: A Survey Marziyeh Bamdad et.al. 2212.04745 null
2022-12-09 Ego-Body Pose Estimation via Ego-Head Pose Estimation Jiaman Li et.al. 2212.04636 null
2022-12-06 Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles Sushant Veer et.al. 2212.03323 link
2022-12-06 PRISM: Probabilistic Real-Time Inference in Spatial World Models Atanas Mirchev et.al. 2212.02988 null
2022-12-06 RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps Florian Sauerbeck et.al. 2212.02085 link
2022-12-05 DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization Xuebo Tian et.al. 2212.02077 null
2022-12-05 ObjectMatch: Robust Registration using Canonical Object Correspondences Can Gümeli et.al. 2212.01985 null
2022-12-02 Sparse SPN: Depth Completion from Sparse Keypoints Yuqun Wu et.al. 2212.00987 null
2022-12-01 maplab 2.0 – A Modular and Multi-Modal Mapping Framework Andrei Cramariuc et.al. 2212.00654 link
2022-12-01 AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body – Theory and Experiments Mehregan Dor et.al. 2212.00350 null
2022-11-30 MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves Pranjali Pathre et.al. 2211.16882 null
2022-11-29 PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images Hartmut Surmann et.al. 2211.16266 link
2022-11-29 MmWave Mapping and SLAM for 5G and Beyond Yu Ge et.al. 2211.16024 null
2022-11-28 Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map Xi Zheng et.al. 2211.15127 null
2022-11-29 BALF: Simple and Efficient Blur Aware Local Feature Detector Zhenjun Zhao et.al. 2211.14731 null
2022-11-27 Development of a Modular Real-time Shared-control System for a Smart Wheelchair Vaishanth Ramaraj et.al. 2211.14711 null
2022-11-26 A1 SLAM: Quadruped SLAM using the A1’s Onboard Sensors Jerred Chen et.al. 2211.14432 link
2022-11-23 ActiveRMAP: Radiance Field for Active Mapping And Planning Huangying Zhan et.al. 2211.12656 null
2022-11-22 Vision-based localization methods under GPS-denied conditions Zihao Lu et.al. 2211.11988 null
2022-11-21 Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques David Ramirez et.al. 2211.11836 null
2022-11-21 ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields Mohammad Mahdi Johari et.al. 2211.11704 null
2022-11-24 Data Fusion for Multipath-Based SLAM: Combining Information from Multiple Propagation Paths Erik Leitinger et.al. 2211.09241 null
2022-11-16 Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery Hao Qu et.al. 2211.08904 null
2022-11-20 Detecting Line Segments in Motion-blurred Images with Events Huai Yu et.al. 2211.07365 link
2022-11-13 Automatic Eye-in-Hand Calibration using EKF Aditya Ramakrishnan et.al. 2211.06881 null
2022-11-12 Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling Zhihao Wang et.al. 2211.06557 link
2022-11-11 Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications Jie Yang et.al. 2211.05982 null
2022-11-10 Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time Ignacio Torroba et.al. 2211.05601 link
2022-11-07 When Geometry is not Enough: Using Reflector Markers in Lidar SLAM Gerhard Kurz et.al. 2211.03484 null
2022-11-07 Detecting Invalid Map Merges in Lifelong SLAM Matthias Holoch et.al. 2211.03423 null
2022-11-06 Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU Yibin Wu et.al. 2211.03174 link
2022-11-07 Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments Daniel Adolfsson et.al. 2211.02445 link
2022-11-03 DyOb-SLAM : Dynamic Object Tracking SLAM System Rushmian Annoy Wadud et.al. 2211.01941 null
2022-11-03 Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM Yang Chen et.al. 2211.01749 null
2022-11-04 $D^2$SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm Hao Xu et.al. 2211.01538 link
2022-11-02 Semantic SuperPoint: A Deep Semantic Descriptor Gabriel S. Gama et.al. 2211.01098 link
2022-11-02 Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation Myung-Hwan Jeon et.al. 2211.00960 link
2022-10-31 Mapping Extended Landmarks for Radar SLAM Shuai Sun et.al. 2210.17207 null
2022-10-25 MAROAM: Map-based Radar SLAM through Two-step Feature Selection Dequan Wang et.al. 2210.13797 null
2022-10-25 S3E: A Large-scale Multimodal Dataset for Collaborative SLAM Dapeng Feng et.al. 2210.13723 link
2022-10-24 NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields Antoni Rosinol et.al. 2210.13641 link
2022-10-24 Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging Geng Wang et.al. 2210.13556 null
2022-10-28 VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points Andreas Georgis et.al. 2210.12756 null
2022-10-22 SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation Junliang Chen et.al. 2210.12417 null
2022-10-21 DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm Shipeng Zhong et.al. 2210.11978 link
2022-10-21 Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments Shubham Kedia et.al. 2210.11652 null
2022-10-22 Visual SLAM: What are the Current Trends and What to Expect? Ali Tourani et.al. 2210.10491 null
2022-10-18 Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM Geon Choi et.al. 2210.09636 null
2022-10-16 D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments Ayman Beghdadi et.al. 2210.08647 null
2022-10-16 Indoor Smartphone SLAM with Learned Echoic Location Features Wenjie Luo et.al. 2210.08493 null
2022-10-15 Self-Improving SLAM in Dynamic Environments: Learning When to Mask Adrian Bojko et.al. 2210.08350 link
2022-10-13 Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems Pushyami Kaveti et.al. 2210.07315 link
2022-10-12 RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map Xuecheng Xu et.al. 2210.05984 link
2022-10-11 Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization Yuanzheng He et.al. 2210.05600 null
2022-10-11 Autonomous Asteroid Characterization Through Nanosatellite Swarming Kaitlin Dennison et.al. 2210.05518 null
2022-10-11 DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion Yuxi Xiao et.al. 2210.05517 null
2022-10-11 Multi-Object Navigation with dynamically learned neural implicit representations Pierre Marza et.al. 2210.05129 link
2022-10-12 Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation Yulun Tian et.al. 2210.05020 null
2022-10-10 Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios Xingyu Chen et.al. 2210.04562 null
2022-10-09 Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning Ali Safa et.al. 2210.04236 null
2022-10-06 SCORE: A Second-Order Conic Initialization for Range-Aided SLAM Alan Papalia et.al. 2210.03177 link
2022-10-06 Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding Kirill Mazur et.al. 2210.03043 null
2022-10-06 Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence Osian Morgan et.al. 2210.02642 null
2022-10-05 MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation Hanwei Zhang et.al. 2210.02038 null
2022-10-04 O2S: Open-source open shuttle Nwankwo Linus et.al. 2210.01627 null
2022-10-04 Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing Weiying Wang et.al. 2210.01320 null
2022-10-03 Probabilistic Volumetric Fusion for Dense Monocular SLAM Antoni Rosinol et.al. 2210.01276 null
2022-10-03 DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams John McConnell et.al. 2210.00867 link
2022-10-03 A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments Ha Sier et.al. 2210.00812 link
2022-10-01 Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2 Ali Eslamian et.al. 2210.00278 null
2022-09-30 PyPose: A Library for Robot Learning with Physics-based Optimization Chen Wang et.al. 2209.15428 link
2022-09-29 DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment Mariia Gladkova et.al. 2209.14965 null
2022-09-28 Robust Incremental Smoothing and Mapping (riSAM) Daniel McGann et.al. 2209.14359 null
2022-09-27 Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping Chi-Ming Chung et.al. 2209.13274 link
2022-09-24 Graph Neural Networks for Multi-Robot Active Information Acquisition Mariliza Tzes et.al. 2209.12091 null
2022-09-24 Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes Jonathan J. Y. Kim et.al. 2209.11894 null
2022-09-23 involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs Gilad Rotman et.al. 2209.11591 null
2022-09-23 Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot David Balaban et.al. 2209.11432 null
2022-09-22 SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation Xiao Han et.al. 2209.10817 null
2022-09-22 Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio Wenhao Qiu et.al. 2209.10726 null
2022-09-21 Visual Localization and Mapping in Dynamic and Changing Environments João Carlos Virgolino Soares et.al. 2209.10710 null
2022-09-20 Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM Sabir Hossain et.al. 2209.10047 null
2022-09-20 WGICP: Differentiable Weighted GICP-Based Lidar Odometry Sanghyun Son et.al. 2209.09777 null
2022-09-20 PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention José Arce et.al. 2209.09699 link
2022-09-19 MeSLAM: Memory Efficient SLAM based on Neural Fields Evgenii Kruzhkov et.al. 2209.09357 null
2022-09-19 LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM Letian Zhang et.al. 2209.08810 null
2022-09-18 HGI-SLAM: Loop Closure With Human and Geometric Importance Features Shuhul Mujoo et.al. 2209.08608 null
2022-09-18 Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM Jiarui Tan et.al. 2209.08578 link
2022-09-17 DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments Shihao Shen et.al. 2209.08430 link
2022-09-17 OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM Matthieu Zins et.al. 2209.08338 null
2022-09-17 PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments Adam Dai et.al. 2209.08248 link
2022-09-16 ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM Aditya Arun et.al. 2209.08091 null
2022-09-16 iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking Yuhang Ming et.al. 2209.07919 null
2022-09-16 TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM Mathieu Gonzalez et.al. 2209.07888 null
2022-09-15 Landmark Management in the Application of Radar SLAM Shuai Sun et.al. 2209.07199 link
2022-09-15 PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization Xianwei Meng et.al. 2209.07061 null
2022-09-14 Semantic Visual Simultaneous Localization and Mapping: A Survey Kaiqi Chen et.al. 2209.06428 null
2022-09-13 Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets Islam Ali et.al. 2209.06316 null
2022-09-12 A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding Tin Lai et.al. 2209.05222 null
2022-09-12 Attitude-Guided Loop Closure for Cameras with Negative Plane Ze Wang et.al. 2209.05167 link
2022-09-09 General Place Recognition Survey: Towards the Real-world Autonomy Age Peng Yin et.al. 2209.04497 link
2022-09-08 ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology Julio A. Placed et.al. 2209.03693 link
2022-09-08 R$^3$LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator Jiarong Lin et.al. 2209.03666 link
2022-09-06 Group-$k$ Consistent Measurement Set Maximization for Robust Outlier Detection Brendon Forsgren et.al. 2209.02658 link
2022-09-05 Neuromorphic Visual Odometry with Resonator Networks Alpha Renner et.al. 2209.02000 null
2022-09-05 MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM Pavel Karpyshev et.al. 2209.01936 null
2022-09-05 ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics Boyi Liu et.al. 2209.01774 null
2022-09-04 CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud Evgeny Yudin et.al. 2209.01605 null
2022-08-31 PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM Yifan Duan et.al. 2208.14848 null
2022-08-30 BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition Peng Yin et.al. 2208.14543 null
2022-08-27 Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes Ali Safa et.al. 2208.12997 null
2022-08-25 FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms Jianhao Jiao et.al. 2208.11865 null
2022-08-25 Lidar SLAM for Autonomous Driving Vehicles Farhad Aghili et.al. 2208.11855 null
2022-08-24 DynaVINS: A Visual-Inertial SLAM for Dynamic Environments Seungwon Song et.al. 2208.11500 link
2022-08-22 Doppler Exploitation in Bistatic mmWave Radio SLAM Yu Ge et.al. 2208.10204 null
2022-08-21 Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping Lintong Zhang et.al. 2208.09825 link
2022-08-26 JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario Longrui Dong et.al. 2208.09777 null
2022-08-15 BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM Yunge Cui et.al. 2208.07473 link
2022-08-12 Handling Constrained Optimization in Factor Graphs for Autonomous Navigation Barbara Bazzana et.al. 2208.06325 null
2022-08-11 RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild Jason Y. Zhang et.al. 2208.05963 null
2022-08-08 Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation Yifei Ren et.al. 2208.04274 link
2022-08-08 SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty Shuai Zhang et.al. 2208.03945 link
2022-08-05 A Survey on Visual Map Localization Using LiDARs and Cameras Elhousni Mahdi et.al. 2208.03376 null
2022-08-04 SROS2: Usable Cyber Security Tools for ROS 2 Victor Mayoral Vilches et.al. 2208.02615 link
2022-08-03 Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms Bharath Garigipati et.al. 2208.02063 null
2022-08-02 Present and Future of SLAM in Extreme Underground Environments Kamak Ebadi et.al. 2208.01787 null
2022-08-01 Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion Simon Boche et.al. 2208.00709 null
2022-07-29 Neural Density-Distance Fields Itsuki Ueda et.al. 2207.14455 link
2022-07-25 DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions Tristan Laidlow et.al. 2207.12244 null
2022-07-25 Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration Kenji Koide et.al. 2207.11942 null
2022-07-22 NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction Yunlong Ran et.al. 2207.10985 null
2022-07-22 Dense RGB-D-Inertial SLAM with Map Deformations Tristan Laidlow et.al. 2207.10940 null
2022-07-22 PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes BaoSheng Zhang et.al. 2207.10916 null
2022-07-21 Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion Suman Ghosh et.al. 2207.10494 link
2022-07-21 Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions Quentin Serdel et.al. 2207.10489 link
2022-07-21 On applicability of von Karman’s momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity Yujin Lu et.al. 2207.10413 null
2022-07-19 Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM Tuvy Lemberg et.al. 2207.09103 null
2022-07-18 DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM Weicai Ye et.al. 2207.08794 link
2022-07-18 Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction Marco Orsingher et.al. 2207.08439 null
2022-07-18 ORB-based SLAM accelerator on SoC FPGA Vibhakar Vemulapati et.al. 2207.08405 null
2022-07-14 Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset Riccardo Giubilato et.al. 2207.06815 null
2022-07-14 Semi-supervised Vector-Quantization in Visual SLAM using HGCN Amir Zarringhalam et.al. 2207.06738 null
2022-07-14 Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders Amir Zarringhalam et.al. 2207.06732 null
2022-07-13 SLAM: SLO-Aware Memory Optimization for Serverless Applications Gor Safaryan et.al. 2207.06183 null
2022-07-19 Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras Fangwen Shu et.al. 2207.06058 link
2022-07-12 Accelerating Certifiable Estimation with Preconditioned Eigensolvers David M. Rosen et.al. 2207.05257 null
2022-07-12 Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features Meiyu Zhi et.al. 2207.05244 null
2022-07-14 SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial Chih-Yuan Chiu et.al. 2207.05043 null
2022-07-08 BlindSpotNet: Seeing Where We Cannot See Taichi Fukuda et.al. 2207.03870 null
2022-07-08 Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints Philipp Glira et.al. 2207.03785 null
2022-07-08 Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements Ran Liu et.al. 2207.03700 null
2022-07-07 RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments Qihao Peng et.al. 2207.03539 null
2022-07-06 VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization Marius Laska et.al. 2207.02668 null
2022-07-06 A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models Axel Garcia-Vega et.al. 2207.02396 null
2022-07-04 VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM Ling Gao et.al. 2207.01404 null
2022-07-04 VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM Danpeng Chen et.al. 2207.01158 null
2022-07-03 Wireless Channel Prediction in Partially Observed Environments Mingsheng Yin et.al. 2207.00934 null
2022-07-01 A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers Julio A. Placed et.al. 2207.00254 null
2022-07-01 Keeping Less is More: Point Sparsification for Visual SLAM Yeonsoo Park et.al. 2207.00225 null
2022-06-30 Controlled and impulsive compression of an entrapped air bubble during impact Utkarsh Jain et.al. 2206.15297 null
2022-06-30 Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery Yuehao Wang et.al. 2206.15255 link
2022-06-27 IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments Abanob Soliman et.al. 2206.13455 link
2022-06-26 An Efficient Global Optimality Certificate for Landmark-Based SLAM Connor Holmes et.al. 2206.12961 link
2022-06-21 Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping Davide Tateo et.al. 2206.10263 link
2022-06-20 Data Fusion for Radio Frequency SLAM with Robust Sampling Erik Leitinger et.al. 2206.09746 null
2022-06-19 RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments Chenglong Qian et.al. 2206.09463 null
2022-06-17 Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments Khairuldanial Ismail et.al. 2206.08733 null
2022-06-17 An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions Yijun Yuan et.al. 2206.08712 link
2022-06-13 ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy Hao Bai et.al. 2206.06435 null
2022-06-10 Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming Javier Cremona et.al. 2206.05066 link
2022-06-09 SparseFormer: Attention-based Depth Completion Network Frederik Warburg et.al. 2206.04557 null
2022-06-07 Robot Self-Calibration Using Actuated 3D Sensors Arne Peters et.al. 2206.03430 null
2022-06-07 Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map Haodong Yuan et.al. 2206.03062 null
2022-06-05 DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions Alena Savinykh et.al. 2206.02199 null
2022-06-04 C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy Erez Posner et.al. 2206.01961 null
2022-06-01 PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry Dong-Uk Seo et.al. 2206.00266 link
2022-05-27 A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching Arno Solin et.al. 2205.13821 null
2022-05-31 LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments Yun Chang et.al. 2205.13135 link
2022-05-25 Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM Milad Ramezani et.al. 2205.12595 null
2022-05-24 Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM Christopher E. Denniston et.al. 2205.12402 link
2022-05-22 ALITA: A Large-scale Incremental Dataset for Long-term Autonomy Peng Yin et.al. 2205.10737 link
2022-05-19 FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2 Jeffrey Ichnowski et.al. 2205.09778 link
2022-05-17 Global Data Association for SLAM with 3D Grassmannian Manifold Objects Parker C. Lusk et.al. 2205.08556 null
2022-05-19 Cluster on Wheels Yuanyuan Yang et.al. 2205.08151 null
2022-05-12 Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry Shihao Shen et.al. 2205.05916 link
2022-05-12 S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization Ran Cheng et.al. 2205.05861 null
2022-05-14 Multi-modal Semantic SLAM for Complex Dynamic Environments Han Wang et.al. 2205.04300 link
2022-05-06 OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations Carmen Delgado et.al. 2205.03256 null
2022-05-05 CNN-Augmented Visual-Inertial SLAM with Planar Constraints Pan Ji et.al. 2205.02940 null
2022-05-05 PMBM-based SLAM Filters in 5G mmWave Vehicular Networks Hyowon Kim et.al. 2205.02502 null
2022-05-04 BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking Dorian Henning et.al. 2205.02301 null
2022-05-04 A Global Asymptotic Convergent Observer for SLAM Seyed Hamed Hashemi et.al. 2205.01953 null
2022-05-04 Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation Nathaniel Merrill et.al. 2205.01823 link
2022-05-03 GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping Pan Ji et.al. 2205.01656 null
2022-04-29 Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM Jinwoo Jeon et.al. 2204.13877 link
2022-04-27 The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection Konstantinos A. Tsintotas et.al. 2204.12831 null
2022-04-27 Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment Wenyu Li et.al. 2204.12769 null
2022-04-29 MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment Tingchen Ma et.al. 2204.11621 null
2022-04-23 Indoor simultaneous localization and mapping based on fringe projection profilometry Yang Zhao et.al. 2204.11020 null
2022-04-22 Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria Julio A. Placed et.al. 2204.10631 null
2022-04-22 Fast Autonomous Robotic Exploration Using the Underlying Graph Structure Julio A. Placed et.al. 2204.10610 null
2022-04-22 Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions Yutong Hu et.al. 2204.10552 null
2022-04-22 Implicit Object Mapping With Noisy Data Jad Abou-Chakra et.al. 2204.10516 link
2022-04-19 Photometric single-view dense 3D reconstruction in endoscopy Victor M. Batlle et.al. 2204.09083 null
2022-04-18 Pulsar skips: Understanding variations in the regular periods of rotating neutron stars Clayton Miller et.al. 2204.08449 null
2022-04-18 Tracking monocular camera pose and deformation for SLAM inside the human body Juan J. Gomez Rodriguez et.al. 2204.08309 null
2022-04-18 Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker Hanjing Ye et.al. 2204.08163 null
2022-04-14 ViViD++: Vision for Visibility Dataset Alex Junho Lee et.al. 2204.06183 null
2022-04-12 HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud Zhixing Hou et.al. 2204.05481 null
2022-04-12 RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room Cong Gao et.al. 2204.05467 null
2022-04-11 Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context Lizhou Liao et.al. 2204.04932 link
2022-04-04 Monitoring social distancing with single image depth estimation Alessio Mingozzi et.al. 2204.01693 null
2022-04-01 Bi-directional Loop Closure for Visual SLAM Ihtisham Ali et.al. 2204.01524 null
2022-04-04 IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers Lei Sun et.al. 2204.01324 link
2022-04-03 Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor Wenyan Ou et.al. 2204.01154 null
2022-04-02 UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps Ayyappa Swamy Thatavarthy et.al. 2204.00865 link
2022-03-31 Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects Yujie Lu et.al. 2204.00035 null
2022-03-30 GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios Chih-Yuan Chiu et.al. 2203.16690 null
2022-03-29 Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field Mostafa Osman et.al. 2203.15866 null
2022-03-29 Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform Mingjun Li et.al. 2203.15439 null
2022-03-29 Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots Pranay Mathur et.al. 2203.15272 null
2022-03-28 Are High-Resolution Event Cameras Really Needed? Daniel Gehrig et.al. 2203.14672 null
2022-03-25 Spectral Measurement Sparsification for Pose-Graph SLAM Kevin J. Doherty et.al. 2203.13897 link
2022-03-25 FD-SLAM: 3-D Reconstruction Using Features and Dense Matching Xingrui Yang et.al. 2203.13861 null
2022-03-25 Gravity-constrained point cloud registration Vladimír Kubelka et.al. 2203.13799 null
2022-03-24 MD-SLAM: Multi-cue Direct SLAM Luca Di Giammarino et.al. 2203.13237 link
2022-03-24 Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video Shun Taguchi et.al. 2203.12804 null
2022-03-19 Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems Jie Yang et.al. 2203.10267 null
2022-03-16 Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR Ian D. Miller et.al. 2203.08925 link
2022-03-15 Neural RF SLAM for unsupervised positioning and mapping with channel state information Shreya Kadambi et.al. 2203.08264 null
2022-03-15 Simultaneous Localisation and Mapping with Quadric Surfaces Tristan Laidlow et.al. 2203.08040 null
2022-03-14 Drift Reduced Navigation with Deep Explainable Features Mohd Omama et.al. 2203.06897 link
2022-03-11 An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs Keisuke Sugiura et.al. 2203.05763 null
2022-03-10 High Definition, Inexpensive, Underwater Mapping Bharat Joshi et.al. 2203.05640 link
2022-03-10 SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning Jaehoon Choi et.al. 2203.05332 null
2022-03-08 Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM Pierre-Yves Lajoie et.al. 2203.04446 link
2022-03-08 SLAM-Supported Self-Training for 6D Object Pose Estimation Ziqi Lu et.al. 2203.04424 link
2022-03-08 An Online Semantic Mapping System for Extending and Enhancing Visual SLAM Thorsten Hempel et.al. 2203.03944 null
2022-03-07 Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms Qingqing Li et.al. 2203.03454 link
2022-03-07 OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition Junyi Ma et.al. 2203.03397 link
2022-03-06 Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM Kazushi Aiba et.al. 2203.02887 null
2022-03-06 RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects Ran Long et.al. 2203.02882 null
2022-03-03 STUN: Self-Teaching Uncertainty Estimation for Place Recognition Kaiwen Cai et.al. 2203.01851 link
2022-03-03 Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning Niclas Vödisch et.al. 2203.01578 link
2022-03-02 FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry Chunran Zheng et.al. 2203.00893 link
2022-03-02 Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation Yulun Tian et.al. 2203.00851 null
2022-03-01 Descriptellation: Deep Learned Constellation Descriptors for SLAM Chunwei Xing et.al. 2203.00567 null
2022-03-01 Collaborative Robot Mapping using Spectral Graph Analysis Lukas Bernreiter et.al. 2203.00308 null
2022-02-26 RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization Nikolaos Kourtzanidis et.al. 2202.13221 link
2022-02-25 Probabilistic Data Association for Semantic SLAM at Scale Elad Michael et.al. 2202.12802 link
2022-02-24 TwistSLAM: Constrained SLAM in Dynamic Environment Mathieu Gonzalez et.al. 2202.12384 null
2022-02-24 Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion Hyeonsoo Jang et.al. 2202.12108 null
2022-02-23 MITI: SLAM Benchmark for Laparoscopic Surgery Regine Hartwig et.al. 2202.11496 null
2022-02-23 DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization Xuebo Tian et.al. 2202.11431 null
2022-02-23 Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets Islam Ali et.al. 2202.11312 null
2022-02-22 SAGE: SLAM with Appearance and Geometry Prior for Endoscopy Xingtong Liu et.al. 2202.09487 link
2022-02-18 OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure Stefan Leutenegger et.al. 2202.09199 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146 link
2022-02-18 An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems Qiang Liu et.al. 2202.08952 null
2022-02-17 Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study Giovanni Cioffi et.al. 2202.08894 link
2022-02-17 LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building Jiashi Zhang et.al. 2202.08487 null
2022-02-16 Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments Jinkun Wang et.al. 2202.08359 null
2022-02-11 Overhead Image Factors for Underwater Sonar-based SLAM John McConnell et.al. 2202.05811 null
2022-02-10 Scale Estimation with Dual Quadrics for Monocular Object SLAM Shuangfu Song et.al. 2202.04816 null
2022-02-08 A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition Nie Jiwei et.al. 2202.03677 null
2022-01-25 Autonomous Vehicles: Open-Source Technologies, Considerations, and Development Oussama Saoudi et.al. 2202.03148 null
2022-02-07 Temporal Point Cloud Completion with Pose Disturbance Jieqi Shi et.al. 2202.03084 null
2022-02-04 DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments Xinggang Hu et.al. 2202.01938 null
2022-02-01 A Model for Multi-View Residual Covariances based on Perspective Deformation Alejandro Fontan et.al. 2202.00765 null
2022-01-30 Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM Xinghe Chu et.al. 2201.12726 null
2022-01-28 RGB-D SLAM Using Attention Guided Frame Association Ali Caglayan et.al. 2201.12047 null
2022-02-04 Learning to Act with Affordance-Aware Multimodal Neural SLAM Zhiwei Jia et.al. 2201.09862 link
2022-01-22 Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems Xi Zheng et.al. 2201.09048 link
2022-01-17 SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System Giseop Kim et.al. 2201.06423 null
2022-01-14 SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions Ali Samadzadeh et.al. 2201.05386 link
2022-01-19 Multi-Hypothesis Scan Matching through Clustering Giorgio Iavicoli et.al. 2201.03814 null
2022-01-11 Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM Kevin J. Doherty et.al. 2201.03773 null
2022-01-10 High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM Brian M. Hopkinson et.al. 2201.03364 link
2022-01-10 Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition M. Usman Maqbool Bhutta et.al. 2201.03212 link
2022-01-04 Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds Xueliang Wen et.al. 2201.00959 null
2021-12-29 Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic Khen Elimelech et.al. 2112.14428 null
2021-12-19 M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots Jie Yin et.al. 2112.13659 link
2021-12-27 UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping Hyunjun Lim et.al. 2112.13515 link
2021-12-25 Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs Yusheng Wang et.al. 2112.13224 null
2021-12-25 Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping Peng Huang et.al. 2112.13222 null
2021-12-24 3D Point Cloud Reconstruction and SLAM as an Input Ziyu Li et.al. 2112.12907 null
2021-12-22 NICE-SLAM: Neural Implicit Scalable Encoding for SLAM Zihan Zhu et.al. 2112.12130 link
2021-12-18 Fast and Robust Registration of Partially Overlapping Point Clouds Eduardo Arnold et.al. 2112.09922 link
2021-12-17 Symmetry-aware Neural Architecture for Embodied Visual Navigation Shuang Liu et.al. 2112.09515 null
2021-12-27 Homography Decomposition Networks for Planar Object Tracking Xinrui Zhan et.al. 2112.07909 link
2021-12-14 Autonomous Navigation System from Simultaneous Localization and Mapping Micheal Caracciolo et.al. 2112.07723 link
2021-12-12 360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation Bolivar Solarte et.al. 2112.06180 link
2021-12-11 Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization Amay Saxena et.al. 2112.05921 null
2021-12-07 Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems Gideon Billings et.al. 2112.03826 link
2021-12-05 Iterated Posterior Linearization PMB Filter for 5G SLAM Yu Ge et.al. 2112.02575 null
2021-12-03 Fast Direct Stereo Visual SLAM Jiawei Mo et.al. 2112.01890 link
2021-12-02 MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment Jie Ren et.al. 2112.01349 link
2021-12-01 Research on Event Accumulator Settings for Event-Based SLAM Kun Xiao et.al. 2112.00427 link
2021-11-29 An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments Assem Sadek et.al. 2111.14666 null
2021-11-29 Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report Hartmut Surmann et.al. 2111.14542 null
2021-11-24 Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment V. Ayala-Alfaro et.al. 2111.12690 null
2021-11-24 Autonomous bot with ML-based reactive navigation for indoor environment Yash Srivastava et.al. 2111.12542 null
2021-11-22 A General Framework for Lifelong Localization and Mapping in Changing Environment Min Zhao et.al. 2111.10946 link
2021-11-17 Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network Xiaoming Zhao et.al. 2111.09006 null
2021-11-10 Comparing dominance of tennis’ big three via multiple-output Bayesian quantile regression models Bruno Santos et.al. 2111.05631 null
2021-11-10 TomoSLAM: factor graph optimization for rotation angle refinement in microtomography Mark Griguletskii et.al. 2111.05562 null
2021-11-07 Hierarchical Segment-based Optimization for SLAM Yuxin Tian et.al. 2111.04101 null
2021-11-07 Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM Shing Yan Loo et.al. 2111.04096 null
2021-11-05 MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry Joan P. Company-Corcoles et.al. 2111.03408 null
2021-10-31 Loop closure detection using local 3D deep descriptors Youjie Zhou et.al. 2111.00440 link
2021-10-27 Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification Mingsheng Yin et.al. 2110.14789 link
2021-10-27 Efficient Placard Discovery for Semantic Mapping During Frontier Exploration David Balaban et.al. 2110.14742 null
2021-10-26 Robust Multi-view Registration of Point Sets with Laplacian Mixture Model Jin Zhang et.al. 2110.13744 null
2021-10-25 WOLF: A modular estimation framework for robotics based on factor graphs Joan Sola et.al. 2110.12919 null
2021-10-21 Real-Time Ground-Plane Refined LiDAR SLAM Fan Yang et.al. 2110.11517 null
2021-10-21 SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words Jonathan J. Y. Kim et.al. 2110.11491 null
2021-10-21 InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion Zhenkun Zhu et.al. 2110.11040 null
2021-10-20 SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training Ankur Bapna et.al. 2110.10329 null
2021-10-18 Enhancing exploration algorithms for navigation with visual SLAM Kirill Muravyev et.al. 2110.09156 null
2021-10-18 Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment Rui Tian et.al. 2110.08977 null
2021-10-16 Partial Hierarchical Pose Graph Optimization for SLAM Alexander Korovko et.al. 2110.08639 null
2021-10-14 Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach Shumon Koga et.al. 2110.07546 null
2021-10-13 Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity Ran Liu et.al. 2110.06541 null
2021-10-12 Learning Efficient Multi-Agent Cooperative Visual Exploration Chao Yu et.al. 2110.05734 null
2021-10-07 Self-Supervised Depth Completion for Active Stereo Frederik Warburg et.al. 2110.03234 null
2021-10-06 InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes Zhenkun Zhu et.al. 2110.02593 null
2021-10-03 AEROS: Adaptive RObust least-Squares for Graph-Based SLAM Milad Ramezani et.al. 2110.02018 null
2021-10-04 Fast Uncertainty Quantification for Active Graph SLAM Julio A. Placed et.al. 2110.01289 link
2021-10-04 Geometry-based Graph Pruning for Lifelong SLAM Gerhard Kurz et.al. 2110.01286 null
2021-10-03 Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration Marcus Greiff et.al. 2110.01099 null
2021-10-02 Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows Qiangqiang Huang et.al. 2110.00876 link

SFM

Publish Date Title Authors PDF Code
2025-07-14 Supporting SENĆOTEN Language Documentation Efforts with Automatic Speech Recognition Mengzhe Geng et.al. 2507.10827 null
2025-07-11 Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT Wei Zhang et.al. 2507.08448 null
2025-07-04 MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion Peilin Tao et.al. 2507.03306 null
2025-06-30 Towards Initialization-free Calibrated Bundle Adjustment Carl Olsson et.al. 2506.23808 null
2025-06-30 AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention Ziao Liu et.al. 2506.23611 null
2025-06-27 Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras Petr Hruby et.al. 2506.22069 null
2025-06-24 ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes Chenhao Zhang et.al. 2506.21629 null
2025-07-08 Wild refitting for black box prediction Martin J. Wainwright et.al. 2506.21460 null
2025-06-24 Experimental Assessment of Neural 3D Reconstruction for Small UAV-based Applications Genís Castillo Gómez-Raya et.al. 2506.19491 null
2025-06-23 ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs Michal Nazarczuk et.al. 2506.18792 null
2025-06-23 Room temperature spin injection into commercial VCSELs at non-resonant wavelengths Timur Almabetov et.al. 2506.18376 null
2025-06-11 OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary Yui Sudo et.al. 2506.09448 null
2025-06-06 SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction Yuchao Zheng et.al. 2506.05935 null
2025-06-05 On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images Andreas Meuleman et.al. 2506.05558 null
2025-06-05 SupeRANSAC: One RANSAC to Rule Them All Daniel Barath et.al. 2506.04803 link
2025-06-04 Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Tianyu Huang et.al. 2506.04225 null
2025-06-04 Accelerating SfM-based Pose Estimation with Dominating Set Joji Joseph et.al. 2506.03667 null
2025-06-03 Nearby dwarf galaxies with extreme star formation rates: a window into dwarf-galaxy evolution in the early Universe S. Kaviraj et.al. 2506.03265 null
2025-06-02 Fast and Robust Rotation Averaging with Anisotropic Coordinate Descent Yaroslava Lochman et.al. 2506.01940 null
2025-06-03 Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC Qingzheng Wang et.al. 2505.24200 null
2025-05-29 Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping Justin Lazarow et.al. 2505.23756 null
2025-05-30 FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian Sara Papi et.al. 2505.22759 link
2025-05-28 UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images Junhuan Liu et.al. 2505.22098 null
2025-05-28 Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule San Jiang et.al. 2505.22089 null
2025-05-30 Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations Whenty Ariyanti et.al. 2505.21356 null
2025-05-27 Intern-GS: Vision Model Guided Sparse-View 3D Gaussian Splatting Xiangyu Sun et.al. 2505.20729 null
2025-05-26 Robust fine-tuning of speech recognition models via model merging: application to disordered speech Alexandre Ducorroy et.al. 2505.20477 null
2025-05-29 Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud Natsuki Takama et.al. 2505.19854 null
2025-05-25 Improving Novel view synthesis of 360$^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images Guangan Chen et.al. 2505.19264 link
2025-05-24 Token-Level Logits Matter: A Closer Look at Speech Foundation Models for Ambiguous Emotion Recognition Jule Valendo Halim et.al. 2505.18484 null
2025-05-23 To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models Simone Gaisbauer et.al. 2505.17973 link
2025-05-23 Corporate Needs You to Find the Difference: Revisiting Submodular and Supermodular Ratio Optimization Problems Elfarouk Harb et.al. 2505.17443 link
2025-05-23 Tracking the Flight: Exploring a Computational Framework for Analyzing Escape Responses in Plains Zebra (Equus quagga) Isla Duporge et.al. 2505.16882 link
2025-05-21 A Taxonomy of Structure from Motion Methods Federica Arrigoni et.al. 2505.15814 null
2025-05-18 Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis Dong Yang et.al. 2505.12226 null
2025-05-15 Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis Francisco Raverta Capua et.al. 2505.10751 link
2025-05-13 Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People Haoshuai Zhou et.al. 2505.08215 null
2025-05-12 RDD: Robust Feature Detector and Descriptor using Deformable Transformer Gonglin Chen et.al. 2505.08013 null
2025-05-12 Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild Lintao Xiang et.al. 2505.07373 null
2025-05-11 Symmetry in Fundamental Parameters of Galaxies on the Star-forming Main Sequence Zhicheng He et.al. 2505.06868 null
2025-05-10 TPK: Trustworthy Trajectory Prediction Integrating Prior Knowledge For Interpretability and Kinematic Feasibility Marius Baden et.al. 2505.06743 null
2025-05-08 DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion Qitao Zhao et.al. 2505.05473 null
2025-05-20 FastMap: Revisiting Dense and Scalable Structure from Motion Jiahao Li et.al. 2505.04612 link
2025-05-15 Estimating the Diameter at Breast Height of Trees in a Forest With a Single 360 Camera Siming He et.al. 2505.03093 null
2025-05-03 AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting Junhao Shi et.al. 2505.01799 null
2025-05-03 PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth Bu Jin et.al. 2505.01729 null
2025-05-01 Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation? Viktor Kocur et.al. 2505.00866 link
2025-04-29 Large-scale visual SLAM for in-the-wild videos Shuo Sun et.al. 2504.20496 null
2025-04-29 Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views Jiang Wu et.al. 2504.20378 link
2025-04-28 MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion Zador Pataki et.al. 2504.20040 link
2025-04-24 Dynamic Camera Poses and Where to Find Them Chris Rockwell et.al. 2504.17788 null
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-23 A Low-Cost Photogrammetry System for 3D Plant Modeling and Phenotyping Joe Hrzich et.al. 2504.16840 null
2025-04-23 PRaDA: Projective Radial Distortion Averaging Daniil Sinitsyn et.al. 2504.16499 null
2025-04-21 Traversing the Star-Forming Main Sequence with Molecular Gas Stacks of z~1.6 Cluster Galaxies Alex Pigarelli et.al. 2504.15381 null
2025-04-21 Towards Understanding Camera Motions in Any Video Zhiqiu Lin et.al. 2504.15376 null
2025-04-21 StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models Yeona Hong et.al. 2504.14915 null
2025-04-17 Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering Landon Dyken et.al. 2504.13339 null
2025-04-15 EDGS: Eliminating Densification for Efficient Convergence of 3DGS Dmytro Kotovenko et.al. 2504.13204 null
2025-04-15 Deep Learning-based Bathymetry Retrieval without In-situ Depths using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps Panagiotis Agrafiotis et.al. 2504.11416 link
2025-04-12 A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds Jizong Peng et.al. 2504.09129 null
2025-04-11 Stereophotoclinometry Revisited Travis Driver et.al. 2504.08252 null
2025-04-08 Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring José A. Pilartes-Congo et.al. 2504.06464 null
2025-04-07 Decoding the variability in the star-formation histories of z ~ 0.8 galaxies Jenny T. Wan et.al. 2504.05281 null
2025-04-05 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS Zhisheng Huang et.al. 2504.04294 null
2025-04-04 An Algebraic Geometry Approach to Viewing Graph Solvability Federica Arrigoni et.al. 2504.03637 null
2025-04-04 Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video Jiaxin Guo et.al. 2504.03198 null
2025-04-03 Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation Feng Gao et.al. 2504.02647 link
2025-04-09 FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking Ulas Gunes et.al. 2504.01732 null
2025-03-31 LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors Han Zhou et.al. 2504.00219 null
2025-03-30 AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos Felix Wimbauer et.al. 2503.23282 link
2025-03-24 Ground Penetrating Radar-Assisted Multimodal Robot Odometry Using Subsurface Feature Matrix Haifeng Li et.al. 2503.18301 null
2025-03-22 3D Modeling: Camera Movement Estimation and path Correction for SFM Model using the Combination of Modified A-SIFT and Stereo System Usha Kumari et.al. 2503.17668 null
2025-03-25 ProtoGS: Efficient and High-Quality Rendering with 3D Gaussian Prototypes Zhengqing Gao et.al. 2503.17486 null
2025-03-21 ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration Johan Edstedt et.al. 2503.17093 link
2025-03-20 From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction Ayberk Acar et.al. 2503.16263 null
2025-03-22 Euclid Quick Data Release (Q1). A first view of the star-forming main sequence in the Euclid Deep Fields Euclid Collaboration et.al. 2503.15314 null
2025-03-18 Multi-view Reconstruction via SfM-guided Monocular Depth Estimation Haoyu Guo et.al. 2503.14483 null
2025-03-18 A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios Huy-Hoang Bui et.al. 2503.13982 link
2025-03-17 Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios Iryna Repinetska et.al. 2503.13710 null
2025-03-17 Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization Yiwei Xu et.al. 2503.13086 null
2025-03-15 SFMNet: Sparse Focal Modulation for 3D Object Detection Oren Shrout et.al. 2503.12093 null
2025-03-11 A Framework for Reducing the Complexity of Geometric Vision Problems and its Application to Two-View Triangulation with Approximation Bounds Felix Rydell et.al. 2503.08142 null
2025-03-11 DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection Johan Edstedt et.al. 2503.07347 link
2025-03-18 Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion Mona Sheikh Zeinoddin et.al. 2503.07204 null
2025-03-10 VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation Hanzhi Chen et.al. 2503.07135 null
2025-03-09 AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation Yang Zou et.al. 2503.06660 null
2025-03-07 LiDAR-enhanced 3D Gaussian Splatting Mapping Jian Shen et.al. 2503.05425 null
2025-03-06 PLMP – Point-Line Minimal Problems for Projective SfM Kim Kiehn et.al. 2503.04351 null
2025-03-03 MUSt3R: Multi-view Network for Stereo 3D Reconstruction Yohann Cabon et.al. 2503.01661 link
2025-03-03 ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization Anas Abdelkarim et.al. 2503.01311 link
2025-03-05 A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping Jialei He et.al. 2503.01202 null
2025-03-02 MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain Rui Yi Yong et.al. 2503.00853 null
2025-03-02 PSRGS:Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery BoCheng Li et.al. 2503.00848 null
2025-03-02 Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration Jinjiang You et.al. 2503.00737 link
2025-02-28 The THESAN-ZOOM project: Burst, quench, repeat – unveiling the evolution of high-redshift galaxies along the star-forming main sequence William McClymont et.al. 2503.00106 null
2025-02-27 Best Foot Forward: Robust Foot Reconstruction in-the-wild Kyle Fogarty et.al. 2502.20511 null
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932 null
2025-03-04 Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model Yaxuan Huang et.al. 2502.16779 null
2025-02-20 CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting Qilin Zhang et.al. 2502.14684 link
2025-02-19 Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections Seong Jong Yoo et.al. 2502.13986 null
2025-02-19 IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360$^\circ$ Cameras Dongki Jung et.al. 2502.12545 null
2025-02-12 Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors Vishwanath Pratap Singh et.al. 2502.08587 null
2025-02-10 FOCUS – Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences Oliver Boyne et.al. 2502.06367 link
2025-02-09 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Jing-Xuan Zhang et.al. 2502.05766 link
2025-02-10 Building Rome with Convex Optimization Haoyu Han et.al. 2502.04640 null
2025-02-04 SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification Yifu Tao et.al. 2502.02657 null
2025-02-05 GP-GS: Gaussian Processes for Enhanced Gaussian Splatting Zhihao Guo et.al. 2502.02283 link
2025-02-03 XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications Shangjin Zhai et.al. 2502.01297 null
2025-01-29 Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment Zixue Zeng et.al. 2501.17690 link
2025-01-28 Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction Tim Flückiger et.al. 2501.16221 null
2025-01-25 Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos Zhen-Hui Dong et.al. 2501.15096 null
2025-01-24 MATCHA:Towards Matching Anything Fei Xue et.al. 2501.14945 null
2025-01-24 Light3R-SfM: Towards Feed-forward Structure-from-Motion Sven Elflein et.al. 2501.14914 null
2025-01-24 Dense-SfM: Structure from Motion with Dense Consistent Matching JongMin Lee et.al. 2501.14277 null
2025-01-21 Theory of quantum-geometric charge and spin Josephson diode effects in strongly spin-polarized hybrid structures with noncoplanar spin textures Niklas L. Schulz et.al. 2501.12232 null
2025-01-14 Selective Attention Merging for low resource tasks: A case study of Child ASR Natarajan Balaji Shankar et.al. 2501.08468 link
2025-01-14 SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting Yue Hu et.al. 2501.07015 null
2025-02-02 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927 link
2025-01-11 Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis Aditya Rauniyar et.al. 2501.06431 null
2025-01-09 Existence of dynamical fluctuation in AMPT generated data for Au+Au collisions at 10 AGeV Somen Gope et.al. 2501.05175 null
2025-01-06 Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation Yuezhang Lv et.al. 2501.02821 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-02 EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy Ao Gao et.al. 2501.01003 null
2024-12-30 KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences Keng-Wei Chang et.al. 2412.20767 null
2024-12-27 Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images Xudong Cai et.al. 2412.19518 null
2024-12-25 Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition Shujie Hu et.al. 2412.18832 null
2024-12-23 Reconstructing People, Places, and Cameras Lea Müller et.al. 2412.17806 link
2024-12-18 Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation Rémi Marsal et.al. 2412.14103 null
2024-12-16 Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection Beomseok Lee et.al. 2412.11978 null
2024-12-18 SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video Jongmin Park et.al. 2412.09982 null
2024-12-12 CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework Yushan Han et.al. 2412.08344 null
2024-12-10 Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling Hui Deng et.al. 2412.07230 null
2024-12-08 Unveiling True Talent: The Soccer Factor Model for Skill Evaluation Alexandre Andorra et.al. 2412.05911 null
2024-12-08 Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features Yuanbo Xiangli et.al. 2412.05826 null
2024-12-06 MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos Zhengqi Li et.al. 2412.04463 null
2024-12-03 ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification Pan Zhang et.al. 2412.02044 link
2024-12-02 SfM-Free 3D Gaussian Splatting via Hierarchical Training Bo Ji et.al. 2412.01553 link
2024-12-02 MVImgNet2.0: A Larger-scale Dataset of Multi-view Images Xiaoguang Han et.al. 2412.01430 null
2024-12-02 TAS-TsC: A Data-Driven Framework for Estimating Time of Arrival Using Temporal-Attribute-Spatial Tri-space Coordination of Truck Trajectories Mengran Li et.al. 2412.01122 null
2024-12-02 Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM Alejandro Fontan et.al. 2412.01116 null
2024-11-27 RoMo: Robust Motion Segmentation Improves Structure from Motion Lily Goli et.al. 2411.18650 null
2024-11-26 The MAGPI Survey: radial trends in star formation across different cosmological simulations in comparison with observations at $z \sim 0.3$ Marcie Mun et.al. 2411.17882 null
2024-11-25 Characterizing Stellar and Gas Properties in NGC 628: Spatial Distributions, Radial Gradients, and Resolved Scaling Relations Peng Wei et.al. 2411.16150 null
2024-11-24 ZeroGS: Training 3D Gaussian Splatting from Unposed Images Yu Chen et.al. 2411.15779 null
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291 null
2024-11-15 SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction Yutao Tang et.al. 2411.12592 link
2024-11-15 The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods Yifu Tao et.al. 2411.10546 null
2024-11-13 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization Mijeong Kim et.al. 2411.08879 null
2024-11-13 Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model Yutao Shen et.al. 2411.08453 null
2024-11-08 From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$-NeuS Haoran Zhang et.al. 2411.05362 link
2024-10-29 A Cascade Approach for APT Campaign Attribution in System Event Logs: Technique Hunting and Subgraph Matching Yi-Ting Huang et.al. 2410.22602 null
2024-10-29 LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues Hanqing Jiang et.al. 2410.22213 null
2024-10-17 Stochastic Flow Matching for Resolving Small-Scale Physics Stathi Fotiadis et.al. 2410.19814 null
2024-10-25 A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint Changshi Mu et.al. 2410.19473 link
2024-10-30 Large Spatial Model: End-to-end Unposed Images to Semantic 3D Zhiwen Fan et.al. 2410.18956 link
2024-10-23 CO-CAVITY project: Molecular gas and star formation in void galaxies M. I. Rodríguez et.al. 2410.18078 null
2024-10-23 PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting Yu Wang et.al. 2410.17505 null
2024-10-20 Neural Active Structure-from-Motion in Dark and Textureless Environment Kazuto Ichimaru et.al. 2410.15378 null
2024-10-17 SemSim: Revisiting Weak-to-Strong Consistency from a Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation Shiao Xie et.al. 2410.13486 null
2024-10-16 Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks Orchid Chetia Phukan et.al. 2410.12947 null
2024-10-16 Gravity-aligned Rotation Averaging with Circular Regression Linfei Pan et.al. 2410.12763 link
2024-10-16 Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals Orchid Chetia Phukan et.al. 2410.12645 null
2024-10-15 SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection Yizhe Liu et.al. 2410.12080 link
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 link
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-09 Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models Ange Lou et.al. 2410.07434 null
2024-10-09 Deep HI Mapping of M 106 Group with FAST Yao Liu et.al. 2410.07038 null
2024-10-09 MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data Mingu Kang et.al. 2410.06442 null
2024-10-08 Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation? Charalambos Tzamos et.al. 2410.05984 link
2024-10-04 Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering Laura Fink et.al. 2410.03861 link
2024-10-01 MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages Marco Gaido et.al. 2410.01036 link
2024-10-01 Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance Hongchao Shu et.al. 2410.00386 null
2024-09-29 Robust Incremental Structure-from-Motion with Hybrid Features Shaohui Liu et.al. 2409.19811 null
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152 null
2024-09-27 Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras Yipeng Lu et.al. 2409.18673 null
2024-09-26 BlinkTrack: Feature Tracking over 100 FPS via Events and Images Yichen Shen et.al. 2409.17981 null
2024-09-25 How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not Francesco Verdini et.al. 2409.17044 null
2024-09-24 Frequency-based View Selection in Gaussian Splatting Reconstruction Monica M. Q. Li et.al. 2409.16470 null
2024-10-07 Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion Juan-Diego Florez et.al. 2409.16465 null
2024-09-24 Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research Vandita Shukla et.al. 2409.15914 null
2024-09-23 Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments Francisco Roza de Moraes et.al. 2409.15602 null
2024-09-23 Evaluating Robot Influence on Pedestrian Behavior Models for Crowd Simulation and Benchmarking Subham Agrawal et.al. 2409.14844 null
2024-09-21 Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models Orchid Chetia Phukan et.al. 2409.14131 null
2024-09-17 GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module Yichen Zhang et.al. 2409.11307 null
2024-09-13 Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints Shan Chen et.al. 2409.08613 null
2024-09-09 KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction Davide Di Nucci et.al. 2409.05407 null
2024-09-06 The Arizona Molecular ISM Survey with the SMT: Variations in the CO(2-1)/CO(1-0) Line Ratio Across the Galaxy Population Ryan P. Keenan et.al. 2409.03963 null
2024-09-05 Active Galactic Nuclei in the Green Valley at $z \sim 0.7$ Charity Woodrum et.al. 2409.03197 null
2024-09-04 Object Gaussian for Monocular 6D Pose Estimation from Sparse Views Luqing Luo et.al. 2409.02581 null
2024-09-11 Geometry-aware Feature Matching for Large-Scale Structure from Motion Gonglin Chen et.al. 2409.02310 null
2024-09-04 The study of strongly intensive observables for $π^{\pm,0}$ in $pp$ collisions at LHC energy in the framework of PYTHIA model Tumpa Biswas et.al. 2409.00525 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-09-15 Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla et.al. 2408.16445 link
2024-08-21 Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations Lintong Zhang et.al. 2408.11966 null
2024-08-20 TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks Jinjie Mai et.al. 2408.10739 null
2024-08-16 Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS Wei Sun et.al. 2408.08723 null
2024-08-15 CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning Wei Zhu et.al. 2408.08134 link
2024-08-13 A Miniature Vision-Based Localization System for Indoor Blimps Shicong Ma et.al. 2408.06648 null
2024-08-07 Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM Yan Song Hu et.al. 2408.03825 null
2024-08-05 Context-aware Mamba-based Reinforcement Learning for social robot navigation Syed Muhammad Mustafa et.al. 2408.02661 null
2024-08-04 Birational geometry of critical loci in Algebraic Vision Marina Bertolini et.al. 2408.02067 null
2024-08-04 PanicleNeRF: low-cost, high-precision in-field phenotyping of rice panicles with smartphone Xin Yang et.al. 2408.02053 null
2024-08-02 Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris Kentaro Uno et.al. 2408.01035 null
2024-08-01 LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting Zhenyu Bao et.al. 2408.00254 null
2024-07-29 Global Structure-from-Motion Revisited Linfei Pan et.al. 2407.20219 link
2024-08-06 Revisit Self-supervised Depth Estimation with Local Structure-from-Motion Shengjie Zhu et.al. 2407.19166 null
2024-07-23 The Hidden Variables: Harnessing Half-Shell Potentials for Enhanced Precision in Nuclear Reaction Calculations Hao Liu et.al. 2407.16452 null
2024-07-22 Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures Ruizhe Wang et.al. 2407.15435 null
2024-07-16 NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models Francesco Milano et.al. 2407.12207 link
2024-07-15 LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning Zhuozhu Jian et.al. 2407.10782 null
2024-07-15 Towards Scale-Aware Full Surround Monodepth with Transformers Yuchen Yang et.al. 2407.10406 null
2024-07-14 3DEgo: 3D Editing on the Go! Umar Khalid et.al. 2407.10102 null
2024-07-10 Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization Jinjie Mai et.al. 2407.08023 link
2024-07-10 Euclid preparation. Forecasting the recovery of galaxy physical properties and their relations with template-fitting and machine-learning methods Euclid Collaboration et.al. 2407.07940 null
2024-07-10 Controlling Space and Time with Diffusion Models Daniel Watson et.al. 2407.07860 null
2024-07-09 Computer vision tasks for intelligent aerospace missions: An overview Huilin Chen et.al. 2407.06513 null
2024-07-08 Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views Jiawei Guo et.al. 2407.05666 null
2024-07-05 Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization Shaohan Li et.al. 2407.04260 null
2024-07-15 SfM on-the-fly: Get better 3D from What You Capture Zongqian Zhan et.al. 2407.03939 null
2024-07-03 Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction Jiaxin Guo et.al. 2407.02918 link
2024-07-02 Indoor 3D Reconstruction with an Unknown Camera-Projector Pair Zhaoshuai Qi et.al. 2407.01945 null
2024-06-27 SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas John Lambert et.al. 2406.19390 link
2024-06-27 STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning Yanan Zhang et.al. 2406.19362 null
2024-06-26 VDG: Vision-Only Dynamic Gaussian for Driving Simulation Hao Li et.al. 2406.18198 null
2024-06-25 Consensus Learning with Deep Sets for Essential Matrix Estimation Dror Moran et.al. 2406.17414 link
2024-06-24 Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction Tong Qin et.al. 2406.16289 null
2024-06-21 The importance of stochasticity in determining galaxy emissivities and UV LFs during cosmic dawn and reionization Ivan Nikolić et.al. 2406.15237 link
2024-06-19 MVSBoost: An Efficient Point Cloud-based 3D Reconstruction Umair Haroon et.al. 2406.13515 null
2024-06-17 MegaScenes: Scene-Level View Synthesis at Scale Joseph Tung et.al. 2406.11819 link
2024-06-15 Benchmarking Children’s ASR with Supervised and Self-supervised Speech Foundation Models Ruchao Fan et.al. 2406.10507 link
2024-06-14 On the Evaluation of Speech Foundation Models for Spoken Language Understanding Siddhant Arora et.al. 2406.10083 null
2024-06-12 Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement Maxime Pietrantoni et.al. 2406.08463 null
2024-06-12 SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models Chun Yin et.al. 2406.08445 null
2024-06-10 Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis Xin Jin et.al. 2406.06216 link
2024-06-07 The Star-Forming Main Sequence in JADES and CEERS at $z>1.4$: Investigating the Burstiness of Star Formation Leonardo Clarke et.al. 2406.05178 null
2024-06-13 Gaussian Splatting with Localized Points Management Haosen Yang et.al. 2406.04251 null
2024-06-05 L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration Yibo Liu et.al. 2406.03298 link
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509 null
2024-05-29 Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy Zijie Jiang et.al. 2405.18863 null
2024-05-29 3D Reconstruction with Fast Dipole Sums Hanyu Chen et.al. 2405.16788 null
2024-05-26 MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups Yusen Xie et.al. 2405.16599 null
2024-05-26 Categorical Flow Matching on Statistical Manifolds Chaoran Cheng et.al. 2405.16441 link
2024-05-22 Exploring Galaxy Properties of eCALIFA with Contrastive Learning G. Martínez-Solaeche et.al. 2405.13471 null
2024-05-23 Switched Flow Matching: Eliminating Singularities via Switching ODEs Qunxi Zhu et.al. 2405.11605 null
2024-05-28 NeRO: Neural Road Surface Reconstruction Ruibo Wang et.al. 2405.10554 link
2024-05-15 Three Dimensional Spatial Cognition: Bees and Bats Robert Worden et.al. 2405.09413 null
2024-05-09 Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media Zhizhen Zhang et.al. 2405.05760 null
2024-05-09 Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment Simon Weber et.al. 2405.05079 link
2024-05-07 Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications Markus Hillemann et.al. 2405.04345 null
2024-05-07 Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling Jiawei Shi et.al. 2405.04309 null
2024-05-06 Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion Yunfeng Li et.al. 2405.03177 link
2024-05-03 HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 Miriam Jäger et.al. 2405.02005 null
2024-04-25 The MAGPI Survey: Evolution of radial trends in star formation activity across cosmic time Marcie Mun et.al. 2404.16319 null
2024-04-22 Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer Eric Brachmann et.al. 2404.14351 null
2024-04-22 RESFM: Robust Equivariant Multiview Structure from Motion Fadi Khatib et.al. 2404.14280 null
2024-04-22 Does Gaussian Splatting need SFM Initialization? Yalda Foroutan et.al. 2404.12547 null
2024-05-07 A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion Feng Yu et.al. 2404.11590 link
2024-04-18 DeblurGS: Gaussian Splatting for Camera Motion Blur Jeongtaek Oh et.al. 2404.11358 null
2024-05-21 LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives Jiadi Cui et.al. 2404.09748 null
2024-04-12 MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance Yuqun Wu et.al. 2404.08252 null
2024-04-11 Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation Keonhee Han et.al. 2404.07933 null
2024-04-07 NeRF2Points: Large-Scale Point Cloud Generation From Street Views’ Radiance Field Optimization Peng Tu et.al. 2404.04875 null
2024-04-04 GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis Emmanouil Nikolakakis et.al. 2404.03126 null
2024-03-29 InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds Zhiwen Fan et.al. 2403.20309 link
2024-03-29 HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes Zhuopeng Li et.al. 2403.20032 null
2024-03-26 NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation Jiahao Chen et.al. 2403.17537 null
2024-03-25 INPC: Implicit Neural Point Clouds for Radiance Field Rendering Florian Hahlbohm et.al. 2403.16862 null
2024-03-18 An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation Zewen Xu et.al. 2403.11639 null
2024-03-14 Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting Jaewoo Jung et.al. 2403.09413 link
2024-03-13 Refractive COLMAP: Refractive Structure-from-Motion Revisited Mengkun She et.al. 2403.08640 null
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156 link
2024-03-11 SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection Yifu Tao et.al. 2403.06877 null
2024-03-24 BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling Cheng Peng et.al. 2403.04926 link
2024-02-22 GaussianPro: 3D Gaussian Splatting with Progressive Propagation Kai Cheng et.al. 2402.14650 null
2024-02-25 A Robust Error-Resistant View Selection Method for 3D Reconstruction Shaojie Zhang et.al. 2402.11431 null
2024-02-17 Dense Matchers for Dense Tracking Tomáš Jelínek et.al. 2402.11287 null
2024-03-11 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592 link
2024-01-22 HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs Zelin Gao et.al. 2401.11711 null
2024-01-19 SCENES: Subpixel Correspondence Estimation With Epipolar Supervision Dominik A. Kloepfer et.al. 2401.10886 null
2024-01-15 3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data Mathilde Letard et.al. 2401.09481 link
2024-01-17 3D Scene Geometry Estimation from 360$^\circ$ Imagery: A Survey Thiago Lopes Trugillo da Silveira et.al. 2401.09252 null
2024-01-17 ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization Weiyao Wang et.al. 2401.08937 null
2024-01-16 Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions Yi-Fan Zuo et.al. 2401.08043 link
2024-01-10 Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects Tianhang Cheng et.al. 2401.05236 link
2024-01-07 A Classification of Critical Configurations for any Number of Projective Views Martin Bråtelund et.al. 2401.03450 link
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-16 Transformers in Unsupervised Structure-from-Motion Hemang Chawla et.al. 2312.10529 link
2023-12-14 HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video Xueying Wang et.al. 2312.08863 null
2023-12-14 CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning Qingsong Yan et.al. 2312.08760 null
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865 link
2023-12-11 Gaussian Splatting SLAM Hidenobu Matsuki et.al. 2312.06741 null
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889 null
2023-12-07 Visual Geometry Grounded Deep Structure From Motion Jianyuan Wang et.al. 2312.04563 null
2023-11-30 Distributed Global Structure-from-Motion with a Deep Front-End Ayush Baid et.al. 2311.18801 link
2023-11-21 Robot Hand-Eye Calibration using Structure-from-Motion Nicolas Andreff et.al. 2311.11808 null
2023-11-18 LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation Sébastien Henry et.al. 2311.11171 null
2023-11-10 MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty Rémi Marsal et.al. 2311.06137 link
2023-11-08 VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering Linus Franke et.al. 2311.04634 link
2023-10-22 A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video Jan Emily Mangulabnan et.al. 2310.14364 null
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605 null
2023-10-09 Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration Chunge Bai et.al. 2310.05504 link
2023-10-08 LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization Artem Nenashev et.al. 2310.05134 null
2023-11-29 Pose-Free Generalizable Rendering Transformer Zhiwen Fan et.al. 2310.03704 link
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092 null
2023-10-01 Propagating Semantic Labels in Video Data David Balaban et.al. 2310.00783 null
2023-09-22 Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning Jonathan Sauder et.al. 2309.12804 null
2023-09-21 On-the-Fly SfM: What you capture is What you get Zongqian Zhan et.al. 2309.11883 link
2023-09-19 Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water Jayesh Tripathi et.al. 2309.10269 null
2023-09-16 DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF Mert Asim Karaoglu et.al. 2309.08927 link
2023-09-08 Robot Localization and Mapping Final Report – Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry Akankshya Kar et.al. 2309.04147 null
2023-09-01 SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation Youhong Wang et.al. 2309.00526 null
2023-09-01 Dense Voxel 3D Reconstruction Using a Monocular Event Camera Haodong Chen et.al. 2309.00385 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984 link
2023-08-26 Disjoint Pose and Shape for 3D Face Reconstruction Raja Kumar et.al. 2308.13903 null
2023-08-30 CamP: Camera Preconditioning for Neural Radiance Fields Keunhong Park et.al. 2308.10902 null
2023-08-18 Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling Haorui Ji et.al. 2308.10705 null
2023-08-14 Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation Tao Liu et.al. 2308.07231 link
2023-08-11 Efficient Large-scale AUV-based Visual Seafloor Mapping Mengkun She et.al. 2308.06147 null
2023-08-04 EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems Weihan Wang et.al. 2308.02670 null
2023-08-15 Tirtha – An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites Jyotirmaya Shivottam et.al. 2308.01246 link
2023-08-02 Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network Shenbagaraj Kannapiran et.al. 2308.01125 null
2023-07-27 PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking Yang Zheng et.al. 2307.15055 link
2023-07-28 SACReg: Scene-Agnostic Coordinate Regression for Visual Localization Jerome Revaud et.al. 2307.11702 null
2023-07-19 Lazy Visual Localization via Motion Averaging Siyan Dong et.al. 2307.09981 null
2023-07-10 Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor San Jiang et.al. 2307.04520 null
2023-07-07 RGB-D Mapping and Tracking in a Plenoxel Radiance Field Andreas L. Teigen et.al. 2307.03404 link
2023-06-29 The Drunkard’s Odometry: Estimating Camera Motion in Deforming Scenes David Recasens et.al. 2306.16917 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669 link
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667 null
2023-06-24 3D Reconstruction of Spherical Images based on Incremental Structure from Motion San Jiang et.al. 2306.12770 link
2023-06-15 NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations Varun Jampani et.al. 2306.09109 link
2023-06-15 Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization Dror Aiger et.al. 2306.09012 link
2023-06-10 3D reconstruction using Structure for Motion Kshitij Karnawat et.al. 2306.06360 link
2023-06-02 Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images Marcela Mera-Trujillo et.al. 2306.01938 null
2023-05-31 FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow Cameron Smith et.al. 2306.00180 null
2023-05-19 SIDAR: Synthetic Image Dataset for Alignment & Restoration Monika Kwiatkowski et.al. 2305.12036 link
2023-05-09 Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization Clémentin Boittiaux et.al. 2305.05301 link
2023-05-09 Rotation Synchronization via Deep Matrix Factorization Gk Tejus et.al. 2305.05268 link
2023-04-20 A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion Miriam Jäger et.al. 2304.10664 null
2023-04-14 Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments Felix Ott et.al. 2304.07250 null
2023-04-12 Visual Localization using Imperfect 3D Models from the Internet Vojtech Panek et.al. 2304.05947 link
2023-04-08 Photometric Correction for Infrared Sensors Jincheng Zhang et.al. 2304.03930 null
2023-04-07 DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium Antyanta Bangunharcana et.al. 2304.03560 link
2023-04-05 Semantic Validation in Structure from Motion Joseph Rowell et.al. 2304.02420 link
2023-03-31 Learning Internal Representations of 3D Transformations from 2D Projected Inputs Marissa Connor et.al. 2303.17776 null
2023-03-30 3D Line Mapping Revisited Shaohui Liu et.al. 2303.17504 link
2023-03-27 TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering Jaehoon Choi et.al. 2303.15060 null
2023-03-26 On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks HyunJun Jung et.al. 2303.14840 link
2023-03-24 Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container Jinguang Tong et.al. 2303.13805 link
2023-03-24 Progressively Optimized Local Radiance Fields for Robust View Synthesis Andreas Meuleman et.al. 2303.13791 null
2023-03-15 RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters Shuja Khalid et.al. 2303.08695 null
2023-03-09 Revisiting Rotation Averaging: Uncertainties and Robust Losses Ganlin Zhang et.al. 2303.05195 link
2023-02-28 Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images Zhongli Fan et.al. 2302.14239 link
2023-03-25 BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling Sameera Ramasinghe et.al. 2302.13543 null
2023-02-21 EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images Zhichao Ye et.al. 2302.10544 link
2023-02-18 Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering Tatsuro Yamane et.al. 2302.09208 null
2023-02-12 Uncertainty-Driven Dense Two-View Structure from Motion Weirong Chen et.al. 2302.00523 null
2023-01-28 AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion Yu Chen et.al. 2301.12135 null
2023-01-20 A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles Zhefan Xu et.al. 2301.08422 link
2023-03-21 Robust Dynamic Radiance Fields Yu-Lun Liu et.al. 2301.02239 link
2022-12-24 Polarimetric Multi-View Inverse Rendering Jinyu Zhao et.al. 2212.12721 null
2022-12-13 Accidental Turntables: Learning 3D Pose by Watching Objects Turn Zezhou Cheng et.al. 2212.06300 null
2022-12-04 3D Object Aided Self-Supervised Monocular Depth Estimation Songlin Wei et.al. 2212.01768 null
2022-12-02 High-Res Facial Appearance Capture from Polarized Smartphone Images Dejan Azinović et.al. 2212.01160 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069 link
2022-11-24 JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models Sepidehsadat Hosseini et.al. 2211.13785 null
2022-11-24 SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks Sergio Izquierdo et.al. 2211.13551 link
2022-11-22 Level-S$^2$fM: Structure from Motion on Neural Level Set of Implicit Surfaces Yuxi Xiao et.al. 2211.12018 link
2022-11-21 Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques David Ramirez et.al. 2211.11836 null
2022-11-14 Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion René Haas et.al. 2211.07195 null
2022-10-13 Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach Zhiang Chen et.al. 2210.07349 null
2022-10-11 DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion Yuxi Xiao et.al. 2210.05517 null
2022-10-07 Leveraging Structure from Motion to Localize Inaccessible Bus Stops Indu Panigrahi et.al. 2210.03646 link
2022-10-01 Structure-Aware NeRF without Posed Camera via Epipolar Constraint Shu Chen et.al. 2210.00183 link
2022-10-05 FAST-LIO, Then Bayesian ICP, Then GTSFM Jerred Chen et.al. 2210.00146 null
2022-09-20 BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction Ahalya Ravendran et.al. 2209.09470 null
2022-09-19 A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion Gerry Chen et.al. 2209.08690 null
2022-09-14 End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes Qiao Chen et.al. 2209.06926 null
2022-09-07 Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021 Hartmut Surmann et.al. 2209.03084 null
2022-08-27 Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data Thomas A. Ciarfuglia et.al. 2208.13001 null
2022-08-12 Handling Constrained Optimization in Factor Graphs for Autonomous Navigation Barbara Bazzana et.al. 2208.06325 null
2022-08-04 Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training Yao-Chih Lee et.al. 2208.02709 link
2022-07-31 One Object at a Time: Accurate and Robust Structure From Motion for Robots Aravind Battaje et.al. 2208.00487 null
2022-07-23 Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks Daniel Posada et.al. 2207.11413 null
2022-07-25 MeshLoc: Mesh-Based Visual Localization Vojtech Panek et.al. 2207.10762 link
2022-07-19 ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild Wang Zhao et.al. 2207.09137 link
2022-07-16 Organic Priors in Non-Rigid Structure from Motion Suryansh Kumar et.al. 2207.06262 null
2022-07-06 A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models Axel Garcia-Vega et.al. 2207.02396 null
2022-06-24 Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set San Jiang et.al. 2206.11499 null
2022-06-13 TC-SfM: Robust Track-Community-Based Structure-from-Motion Lei Wang et.al. 2206.05866 null
2022-06-10 EigenFairing: 3D Model Fairing using Image Coherence Pragyana Mishra et.al. 2206.05309 null
2022-06-01 Semantic Room Wireframe Detection from a Single View David Gillsjö et.al. 2206.00491 link
2022-05-31 Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction Qiancheng Fu et.al. 2205.15848 null
2022-05-09 Is my Depth Ground-Truth Good Enough? HAMMER – Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression HyunJun Jung et.al. 2205.04565 null
2022-05-07 Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs Pedro F. Proença et.al. 2205.03522 null
2022-05-06 EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms Levi Burner et.al. 2205.03467 null
2022-04-20 Learned Monocular Depth Priors in Visual-Inertial Initialization Yunwen Zhou et.al. 2204.09171 null
2022-04-10 Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective Hui Deng et.al. 2204.04730 null
2022-04-08 Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems Debao Huang et.al. 2204.04145 null
2022-04-07 SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation Yi Wei et.al. 2204.03636 link
2022-04-06 Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion Lukas Bommes et.al. 2204.02733 link
2022-04-05 Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows Sheng Liu et.al. 2204.02509 link
2022-03-31 Fast, Accurate and Memory-Efficient Partial Permutation Synchronization Shaohan Li et.al. 2203.16505 null
2022-03-28 Visual Odometry for RGB-D Cameras Afonso Fontes et.al. 2203.15119 null
2022-03-28 Optimizing Elimination Templates by Greedy Parameter Search Evgeniy Martyushev et.al. 2203.14901 link
2022-03-23 Event-Based Dense Reconstruction Pipeline Kun Xiao et.al. 2203.12270 null
2022-03-21 DiffPoseNet: Direct Differentiable Camera Pose Estimation Chethan M. Parameshwara et.al. 2203.11174 null
2022-03-02 Asynchronous Optimisation for Event-based Visual Odometry Daqi Liu et.al. 2203.01037 null
2022-03-02 Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation Yulun Tian et.al. 2203.00851 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146 link
2022-01-20 GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry Yunhan Zhao et.al. 2201.08131 null
2022-01-13 Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching Yunpeng Shi et.al. 2201.04797 link
2022-01-10 High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM Brian M. Hopkinson et.al. 2201.03364 link
2022-01-06 De-rendering 3D Objects in the Wild Felix Wimbauer et.al. 2201.02279 link
2021-12-29 On the Instability of Relative Pose Estimation and RANSAC’s Role Hongyi Fan et.al. 2112.14651 null
2021-12-16 Road-aware Monocular Structure from Motion and Homography Estimation Wei Sui et.al. 2112.08635 null
2021-12-10 Critical configurations for three projective views Martin Bråtelund et.al. 2112.05478 null
2021-12-09 Critical configurations for two projective views, a new approach Martin Bråtelund et.al. 2112.05074 null
2021-12-06 Dense Depth Priors for Neural Radiance Fields from Sparse Input Views Barbara Roessle et.al. 2112.03288 link
2021-12-10 MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment Jie Ren et.al. 2112.01349 link
2021-11-11 Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft Pascal Schoppmann et.al. 2111.06271 null
2021-11-10 Damage Estimation and Localization from Sparse Aerial Imagery Rene Garcia Franceschini et.al. 2111.03708 null
2021-11-03 Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems Swarnabja Bhaumik et.al. 2111.02064 null
2021-10-14 Modeling dynamic target deformation in camera calibration Annika Hagemann et.al. 2110.07322 null
2021-10-13 Hyperspectral 3D Mapping of Underwater Environments Maxime Ferrera et.al. 2110.06571 null
2021-09-24 Automatic Map Update Using Dashcam Videos Aziza Zhanabatyrova et.al. 2109.12131 null
2021-09-16 Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs Gabriel Moreira et.al. 2109.08046 link
2021-09-06 Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications Tejas Mane et.al. 2109.02740 null
2021-09-02 Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency Beatrix-Emőke Fülöp-Balogh et.al. 2109.01018 null
2021-09-01 On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation Eric Brachmann et.al. 2109.00524 link
2021-08-31 DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension Roman Shapovalov et.al. 2109.00033 null
2021-08-29 Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration Seyed-Mahdi Nasiri et.al. 2108.12876 null
2021-08-23 Burst Imaging for Light-Constrained Structure-From-Motion Ahalya Ravendran et.al. 2108.09895 null

Visual Localization

Publish Date Title Authors PDF Code
2025-07-09 Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning Konstantinos I. Roumeliotis et.al. 2507.10571 null
2025-07-14 GT-Loc: Unifying When and Where in Images Through a Joint Embedding Space David G. Shatwell et.al. 2507.10473 null
2025-07-14 Text-to-Remote-Sensing-Image Retrieval beyond RGB Sources Daniele Rege Cambrin et.al. 2507.10403 null
2025-07-14 Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures Xinlong Ding et.al. 2507.10265 null
2025-07-11 RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features Inye Na et.al. 2507.08546 null
2025-07-11 LiDAR, GNSS and IMU Sensor Alignment through Dynamic Time Warping to Construct 3D City Maps Haitian Wang et.al. 2507.08420 null
2025-07-11 Deep Hashing with Semantic Hash Centers for Image Retrieval Li Chen et.al. 2507.08404 null
2025-07-08 Unveiling Effective In-Context Configurations for Image Captioning: An External & Internal Analysis Li Li et.al. 2507.08021 null
2025-07-10 SCREP: Scene Coordinate Regression and Evidential Learning-based Perception-Aware Trajectory Generation Juyeop Han et.al. 2507.07467 null
2025-07-10 VP-SelDoA: Visual-prompted Selective DoA Estimation of Target Sound via Semantic-Spatial Matching Yu Chen et.al. 2507.07384 null
2025-07-08 FACap: A Large-scale Fashion Dataset for Fine-grained Composed Image Retrieval François Gardères et.al. 2507.07135 null
2025-07-09 Evaluating Attribute Confusion in Fashion Text-to-Image Generation Ziyue Liu et.al. 2507.07079 null
2025-07-09 MS-DPPs: Multi-Source Determinantal Point Processes for Contextual Diversity Refinement of Composite Attributes in Text to Image Retrieval Naoya Sogi et.al. 2507.06654 null
2025-07-08 Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval Haiwen Li et.al. 2507.05970 null
2025-07-08 OFFSET: Segmentation-based Focus Shift Revision for Composed Image Retrieval Zhiwei Chen et.al. 2507.05631 null
2025-07-07 Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model Mengyao Xu et.al. 2507.05513 null
2025-07-07 An analysis of vision-language models for fabric retrieval Francesco Giuliari et.al. 2507.04735 null
2025-07-08 What’s Making That Sound Right Now? Video-centric Audio-Visual Localization Hahyeon Choi et.al. 2507.04667 null
2025-07-07 Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR Tao Du et.al. 2507.04662 null
2025-07-06 U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration Xiaofan Li et.al. 2507.04503 null
2025-07-04 Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition Jiuhong Xiao et.al. 2507.03831 null
2025-07-01 LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment Juelin Zhu et.al. 2507.00659 null
2025-06-28 Utilizing a Novel Deep Learning Method for Scene Categorization in Remote Sensing Data Ghufran A. Omran et.al. 2506.22939 null
2025-06-28 Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval Li-Cheng Shen et.al. 2506.22864 null
2025-06-27 MatChA: Cross-Algorithm Matching with Feature Augmentation Paula Carbó Cubero et.al. 2506.22336 null
2025-06-26 OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography Caoshuo Li et.al. 2506.21101 null
2025-06-25 Visualizing intercalation effects in 2D materials using AFM based techniques Karmen Kapustić et.al. 2506.20467 null
2025-06-25 On the Burstiness of Faces in Set Jiong Wang et.al. 2506.20312 null
2025-06-24 jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval Michael Günther et.al. 2506.18902 null
2025-06-26 Referring Expression Instance Retrieval and A Strong End-to-End Baseline Xiangzhao Hao et.al. 2506.18246 null
2025-06-20 Class Agnostic Instance-level Descriptor for Visual Instance Search Qi-Ying Sun et.al. 2506.16745 null
2025-06-19 MambaHash: Visual State Space Deep Hashing Model for Large-Scale Image Retrieval Chao He et.al. 2506.16353 link
2025-06-19 Fine-grained Image Retrieval via Dual-Vision Adaptation Xin Jiang et.al. 2506.16273 null
2025-06-19 Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot Navigation Connor Malone et.al. 2506.15988 link
2025-06-18 Semantic and Feature Guided Uncertainty Quantification of Visual Localization for Autonomous Vehicles Qiyuan Wu et.al. 2506.15851 null
2025-06-18 ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections Ziling Huang et.al. 2506.15180 null
2025-06-17 HARMONY: A Scalable Distributed Vector Database for High-Throughput Approximate Nearest Neighbor Search Qian Xu et.al. 2506.14707 null
2025-06-17 TACS-Graphs: Traversability-Aware Consistent Scene Graphs for Ground Robot Indoor Localization and Mapping Jeewon Kim et.al. 2506.14178 null
2025-06-16 A Semantically-Aware Relevance Measure for Content-Based Medical Image Retrieval Evaluation Xiaoyang Wei et.al. 2506.13509 null
2025-06-19 Hierarchical Multi-Positive Contrastive Learning for Patent Image Retrieval Kshitij Kavimandan et.al. 2506.13496 null
2025-06-16 EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition Bingxi Liu et.al. 2506.13133 null
2025-06-16 SuperPlace: The Renaissance of Classical Feature Aggregation for Visual Place Recognition in the Era of Foundation Models Bingxi Liu et.al. 2506.13073 null
2025-06-14 Feature Complementation Architecture for Visual Place Recognition Weiwei Wang et.al. 2506.12401 null
2025-06-11 Towards a general-purpose foundation model for fMRI analysis Cheng Wang et.al. 2506.11167 null
2025-06-11 Improving Personalized Search with Regularized Low-Rank Parameter Updates Fiona Ryan et.al. 2506.10182 link
2025-06-10 Safeguarding Multimodal Knowledge Copyright in the RAG-as-a-Service Environment Tianyu Chen et.al. 2506.10030 link
2025-06-11 Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints Xiangkai Zhang et.al. 2506.09748 null
2025-06-10 Robust Visual Localization via Semantic-Guided Multi-Scale Transformer Zhongtao Tian et.al. 2506.08526 null
2025-06-08 Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs Yikun Ji et.al. 2506.07045 null
2025-06-07 Zero Shot Composed Image Retrieval Santhosh Kakarla et.al. 2506.06602 null
2025-06-06 GenIR: Generative Visual Feedback for Mental Image Retrieval Diji Yang et.al. 2506.06220 null
2025-06-06 Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning Sheng Chen et.al. 2506.06205 null
2025-06-05 HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition Suhan Woo et.al. 2506.04764 null
2025-06-05 Deep Learning Reforms Image Matching: A Survey and Outlook Shihua Zhang et.al. 2506.04619 null
2025-06-02 Entity Image and Mixed-Modal Image Retrieval Datasets Cristian-Ioan Blaga et.al. 2506.02291 null
2025-06-01 Quantization-based Bounds on the Wasserstein Metric Jonathan Bobrutsky et.al. 2506.00976 null
2025-05-30 SORCE: Small Object Retrieval in Complex Environments Chunxu Liu et.al. 2505.24441 link
2025-05-29 Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch Aneeshan Sain et.al. 2505.23763 null
2025-05-28 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians Hidenobu Matsuki et.al. 2505.22859 null
2025-05-28 UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images Junhuan Liu et.al. 2505.22098 null
2025-05-28 Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule San Jiang et.al. 2505.22089 null
2025-05-27 Visual Loop Closure Detection Through Deep Graph Consensus Martin Büchner et.al. 2505.21754 null
2025-05-27 QuARI: Query Adaptive Retrieval Improvement Eric Xing et.al. 2505.21647 null
2025-05-27 ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval Eric Xing et.al. 2505.20764 link
2025-05-26 Visualized Text-to-Image Retrieval Di Wu et.al. 2505.20291 link
2025-05-26 Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval Rong-Cheng Tu et.al. 2505.19952 null
2025-05-26 Can Visual Encoder Learn to See Arrows? Naoyuki Terashita et.al. 2505.19944 null
2025-05-26 MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval Rong-Cheng Tu et.al. 2505.19707 null
2025-05-24 Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU Yicheng Lin et.al. 2505.18652 link
2025-05-24 TNG-CLIP: Training-Time Negation Data Generation for Negation Awareness of CLIP Yuliang Cai et.al. 2505.18434 null
2025-05-23 ImLPR: Image-based LiDAR Place Recognition using Vision Foundation Models Minwoo Jung et.al. 2505.18364 null
2025-05-23 DART$^3$: Leveraging Distance for Test Time Adaptation in Person Re-Identification Rajarshi Bhattacharya et.al. 2505.18337 null
2025-05-23 To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models Simone Gaisbauer et.al. 2505.17973 link
2025-05-23 DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval Yuxin Yang et.al. 2505.17796 null
2025-05-23 CU-Multi: A Dataset for Multi-Robot Data Association Doncey Albin et.al. 2505.17576 null
2025-05-22 TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition Oliver Grainge et.al. 2505.16447 null
2025-05-21 Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval Siting Li et.al. 2505.15877 null
2025-05-21 SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph Retrieval Nikolaos Chaidos et.al. 2505.15867 link
2025-05-20 Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models Kiarash Naghavi Khanghah et.al. 2505.13828 null
2025-05-18 MMS-VPR: Multimodal Street-Level Visual Place Recognition Dataset and Benchmark Yiwei Ou et.al. 2505.12254 null
2025-05-16 Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization Aaron Wilhelm et.al. 2505.11620 null
2025-05-16 Redundancy-Aware Pretraining of Vision-Language Foundation Models in Remote Sensing Mathis Jürgen Adler et.al. 2505.11121 null
2025-05-04 OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery Chongsheng Zhang et.al. 2505.03836 link
2025-05-06 Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions Lukas Schichler et.al. 2505.03565 null
2025-05-06 LiftFeat: 3D Geometry-Aware Local Feature Matching Yepeng Liu et.al. 2505.03422 link
2025-05-06 Seeing the Abstract: Translating the Abstract Language for Vision Language Models Davide Talon et.al. 2505.03242 link
2025-05-13 SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2505.01956 null
2025-05-02 NeuroLoc: Encoding Navigation Cells for 6-DOF Camera Localization Xun Li et.al. 2505.01113 null
2025-05-01 GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting Jongwon Lee et.al. 2504.20379 null
2025-04-25 From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval Yabing Wang et.al. 2504.17990 null
2025-04-24 A Guide to Structureless Visual Localization Vojtech Panek et.al. 2504.17636 null
2025-04-23 Rethinking Vision Transformer for Large-Scale Fine-Grained Image Retrieval Xin Jiang et.al. 2504.16691 null
2025-04-22 Media Content Atlas: A Pipeline to Explore and Investigate Multidimensional Media Space using Multimodal LLMs Merve Cerit et.al. 2504.16323 link
2025-04-19 A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling Kyle Buettner et.al. 2504.14359 null
2025-04-17 SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs Haoxuan Li et.al. 2504.13172 null
2025-04-16 Generalized Visual Relation Detection with Diffusion Models Kaifeng Gao et.al. 2504.12100 null
2025-04-15 Visual Re-Ranking with Non-Visual Side Information Gustav Hanning et.al. 2504.11134 link
2025-04-15 TMCIR: Token Merge Benefits Composed Image Retrieval Chaoyang Wang et.al. 2504.10995 null
2025-04-14 Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition Changwei Wang et.al. 2504.09881 link
2025-04-12 Evolved Hierarchical Masking for Self-Supervised Learning Zhanzhou Feng et.al. 2504.09155 null
2025-04-11 HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields Asterios Reppas et.al. 2504.08901 null
2025-04-11 Hypergraph Vision Transformers: Images are More than Nodes, More than Edges Joshua Fixelle et.al. 2504.08710 null
2025-04-11 FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations Cheng-Yu Hsieh et.al. 2504.08368 null
2025-04-11 PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection Xiong Li et.al. 2504.08280 null
2025-04-10 Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval Zehong Ma et.al. 2504.07718 null
2025-04-09 A Pointcloud Registration Framework for Relocalization in Subterranean Environments David Akhihiero et.al. 2504.07231 null
2025-04-09 Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception Ruotian Peng et.al. 2504.06666 null
2025-04-08 To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition Davide Sferrazza et.al. 2504.06116 link
2025-04-06 NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval Peng Gao et.al. 2504.04339 null
2025-04-04 REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval Shabnam Choudhury et.al. 2504.03169 null
2025-04-06 Re-thinking Temporal Search for Long-Form Video Understanding Jinhui Ye et.al. 2504.02259 link
2025-04-02 A Chefs KISS – Utilizing semantic information in both ICP and SLAM framework Sven Ochs et.al. 2504.02086 null
2025-04-02 Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval Yuji Nozawa et.al. 2504.01348 null
2025-04-01 IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval Bangwei Liu et.al. 2504.00954 null
2025-04-01 Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data Yiqun Duan et.al. 2504.00812 null
2025-03-31 CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization Yingrui Ji et.al. 2503.24182 null
2025-03-31 LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds Masahiko Tsuji et.al. 2503.23664 null
2025-03-30 Multiview Image-Based Localization Cameron Fiore et.al. 2503.23577 null
2025-03-27 LOCORE: Image Re-ranking with Long-Context Sequence Modeling Zilin Xiao et.al. 2503.21772 link
2025-03-27 Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck Adrian Bulat et.al. 2503.21757 null
2025-03-27 UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation Yehui Shen et.al. 2503.21338 link
2025-03-27 FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval Zixu Li et.al. 2503.21309 link
2025-03-27 Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing Shuai Li et.al. 2503.21236 null
2025-03-25 CoLLM: A Large Language Model for Composed Image Retrieval Chuong Huynh et.al. 2503.19910 link
2025-03-25 Scene-agnostic Pose Regression for Visual Localization Junwei Zheng et.al. 2503.19543 null
2025-03-25 From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting Zhiwei Huang et.al. 2503.19358 null
2025-03-25 Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval Haoqiang Lin et.al. 2503.19296 link
2025-03-23 LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space Zhangyu Wang et.al. 2503.18142 null
2025-03-23 Selecting and Pruning: A Differentiable Causal Sequentialized State-Space Model for Two-View Correspondence Learning Xiang Fang et.al. 2503.17938 null
2025-03-23 What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images Dongheng Lin et.al. 2503.17899 null
2025-03-22 good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval Pranavi Kolouju et.al. 2503.17871 null
2025-03-21 Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2503.17109 link
2025-03-21 Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions Muhua Zhang et.al. 2503.17005 null
2025-03-20 PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval Qiang Zou et.al. 2503.16064 link
2025-03-20 Automating 3D Dataset Generation with Neural Radiance Fields P. Schulz et.al. 2503.15997 link
2025-03-18 3D Densification for Multi-Map Monocular VSLAM in Endoscopy X. Anadón et.al. 2503.14346 null
2025-03-18 A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios Huy-Hoang Bui et.al. 2503.13982 link
2025-03-17 Scale Efficient Training for Large Datasets Qing Zhou et.al. 2503.13385 link
2025-03-17 Multi-Platform Teach-and-Repeat Navigation by Visual Place Recognition Based on Deep-Learned Local Features Václav Truhlařík et.al. 2503.13090 null
2025-03-17 All You Need to Know About Training Image Retrieval Models Gabriele Berton et.al. 2503.13045 link
2025-03-12 Exploring the best way for UAV visual localization under Low-altitude Multi-view Observation Condition: a Benchmark Yibin Ye et.al. 2503.10692 link
2025-03-13 ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning Pengfei Luo et.al. 2503.10166 link
2025-03-12 Revisiting Medical Image Retrieval via Knowledge Consolidation Yang Nan et.al. 2503.09370 null
2025-03-11 CQVPR: Landmark-aware Contextual Queries for Visual Place Recognition Dongyue Li et.al. 2503.08170 null
2025-03-10 Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization Michael Green et.al. 2503.07038 null
2025-03-10 Zero-Shot Hashing Based on Reconstruction With Part Alignment Yan Jiang et.al. 2503.07037 null
2025-03-10 Improving Visual Place Recognition with Sequence-Matching Receptiveness Prediction Somayeh Hussaini et.al. 2503.06840 null
2025-03-09 RoboDesign1M: A Large-scale Dataset for Robot Design Understanding Tri Le et.al. 2503.06796 null
2025-03-09 StructVPR++: Distill Structural and Semantic Knowledge with Weighting Samples for Visual Place Recognition Yanqing Shen et.al. 2503.06601 link
2025-03-09 TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification Huaqi Tao et.al. 2503.06501 link
2025-03-08 NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features Hongjia Zhai et.al. 2503.06117 null
2025-03-07 Data-Efficient Generalization for Zero-shot Composed Image Retrieval Zining Chen et.al. 2503.05204 null
2025-03-06 RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining Tengfei Zhang et.al. 2503.04653 null
2025-03-06 ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images Yanqing Shen et.al. 2503.04475 link
2025-03-06 Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes Hui Zhang et.al. 2503.04235 null
2025-03-06 Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior Haitao Wu et.al. 2503.04207 link
2025-03-06 Image-Based Relocalization and Alignment for Long-Term Monitoring of Dynamic Underwater Environments Beverley Gorry et.al. 2503.04096 link
2025-03-04 TeTRA-VPR: A Ternary Transformer Approach for Compact Visual Place Recognition Oliver Grainge et.al. 2503.02511 null
2025-03-04 Introspective Loop Closure for SLAM with 4D Imaging Radar Maximilian Hilger et.al. 2503.02383 null
2025-03-04 Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models Kenta Tsukahara et.al. 2503.02256 null
2025-03-03 Composed Multi-modal Retrieval: A Survey of Approaches and Applications Kun Zhang et.al. 2503.01334 link
2025-03-03 AirRoom: Objects Matter in Room Reidentification Runmao Yao et.al. 2503.01130 null
2025-03-02 Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching Jinyu Miao et.al. 2503.00862 null
2025-03-01 Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning Songlin Dong et.al. 2503.00515 null
2025-02-28 EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration Kuangyi Chen et.al. 2503.00167 link
2025-02-28 CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval Zelong Sun et.al. 2502.20826 null
2025-02-28 SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition Shanshan Wan et.al. 2502.20676 null
2025-02-27 A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization Yejun Zhang et.al. 2502.20036 link
2025-02-27 On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation Ruben T. Lucassen et.al. 2502.19285 null
2025-02-26 BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure Haoxin Cai et.al. 2502.19242 link
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932 null
2025-02-19 A Comprehensive Survey on Composed Image Retrieval Xuemeng Song et.al. 2502.18495 link
2025-02-25 MegaLoc: One Retrieval to Place Them All Gabriele Berton et.al. 2502.17237 link
2025-02-23 Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries Yin Wu et.al. 2502.16636 link
2025-02-23 SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition Feng Lu et.al. 2502.16601 link
2025-02-21 ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval Guanqi Zhan et.al. 2502.15682 null
2025-02-20 Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition Tianyi Shang et.al. 2502.14195 link
2025-02-19 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments Vincent Ress et.al. 2502.13803 null
2025-02-18 Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization Shuo Xing et.al. 2502.13146 link
2025-02-19 IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360$^\circ$ Cameras Dongki Jung et.al. 2502.12545 null
2025-02-17 From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations Matteo Scucchia et.al. 2502.12303 null
2025-02-17 Descriminative-Generative Custom Tokens for Vision-Language Models Pramuditha Perera et.al. 2502.12095 null
2025-02-17 ILIAS: Instance-Level Image retrieval At Scale Giorgos Kordopatis-Zilos et.al. 2502.11748 null
2025-02-17 Range and Bird’s Eye View Fused Cross-Modal Visual Place Recognition Jianyi Peng et.al. 2502.11742 link
2025-02-17 Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics Francesco Croce et.al. 2502.11725 link
2025-02-17 Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization Yuanze Xu et.al. 2502.11408 null
2025-02-12 E2LVLM: Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection Junjie Wu et.al. 2502.10455 null
2025-02-11 Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning Yuhang Dong et.al. 2502.09649 null
2025-02-13 ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Rotem Shalev-Arkushin et.al. 2502.09411 null
2025-02-12 SpeechCompass: Enhancing Mobile Captioning with Diarization and Directional Guidance via Multi-Microphone Localization Artem Dementyev et.al. 2502.08848 null
2025-02-12 Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions Prajwal Gatti et.al. 2502.08438 null
2025-02-11 Captured by Captions: On Memorization and its Mitigation in CLIP Models Wenhao Wang et.al. 2502.07830 null
2025-02-11 Ultrafast 4D scanning transmission electron microscopy for imaging of localized optical fields Petr Koutenský et.al. 2502.07338 null
2025-02-11 Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos Haowen Gao et.al. 2502.07327 null
2025-02-11 PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval Osman Tursun et.al. 2502.07215 null
2025-02-10 AstroLoc: Robust Space to Ground Image Localizer Gabriele Berton et.al. 2502.07003 null
2025-02-09 Uni-Retrieval: A Multi-Style Retrieval Framework for STEM’s Education Yanhao Jia et.al. 2502.05863 null
2025-02-07 Learning Street View Representations with Spatiotemporal Contrast Yong Li et.al. 2502.04638 null
2025-02-06 Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion Marco Mistretta et.al. 2502.04263 link
2025-02-05 Human-Aligned Image Models Improve Visual Decoding from the Brain Nona Rajabi et.al. 2502.03081 null
2025-02-03 ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies Costin F. Ciusdel et.al. 2502.01335 null
2025-01-31 LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks Liudi Yang et.al. 2501.19382 link
2025-01-27 Freestyle Sketch-in-the-Loop Image Segmentation Subhadeep Koley et.al. 2501.16022 null
2025-01-26 Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations Zijun Long et.al. 2501.15379 null
2025-01-24 Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection Viktor Kozák et.al. 2501.14587 null
2025-01-23 Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models Jakob Krogh Petersen et.al. 2501.14051 link
2025-01-22 Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation Kenta Uesugi et.al. 2501.13968 null
2025-01-19 Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection Zhipeng Yu et.al. 2501.11063 link
2025-01-18 A Resource-Efficient Training Framework for Remote Sensing Text–Image Retrieval Weihang Zhang et.al. 2501.10638 null
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887 null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001 link
2025-01-12 SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval Bhavin Jawade et.al. 2501.08347 null
2025-01-14 VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes Ke Wu et.al. 2501.08286 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2025-01-12 Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation Zhenyang Feng et.al. 2501.06749 null
2025-01-06 Integrating Language-Image Prior into EEG Decoding for Cross-Task Zero-Calibration RSVP-BCI Xujin Li et.al. 2501.02841 null
2025-01-03 A Minimal Subset Approach for Efficient and Scalable Loop Closure Nikolaos Stathoulopoulos et.al. 2501.01791 link
2025-01-03 iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings Shuhei Tomoshige et.al. 2501.01642 null
2025-01-02 R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization Xudong Jiang et.al. 2501.01421 link
2025-01-02 Training Medical Large Vision-Language Models with Abnormal-Aware Feedback Yucheng Zhou et.al. 2501.01377 null
2025-01-02 Domain-invariant feature learning in brain MR imaging for content-based image retrieval Shuya Tobari et.al. 2501.01326 null
2024-12-28 GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting Atticus J. Zeller et.al. 2412.20056 link
2024-12-25 FOR: Finetuning for Object Level Open Vocabulary Image Retrieval Hila Levi et.al. 2412.18806 null
2024-12-24 ERVD: An Efficient and Robust ViT-Based Distillation Framework for Remote Sensing Image Retrieval Le Dong et.al. 2412.18136 link
2024-12-22 Where am I? Cross-View Geo-localization with Natural Language Descriptions Junyan Ye et.al. 2412.17007 null
2024-12-22 Large-Scale UWB Anchor Calibration and One-Shot Localization Using Gaussian Process Shenghai Yuan et.al. 2412.16880 null
2024-12-24 Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling Daichi Yashima et.al. 2412.16576 link
2024-12-20 A New Method to Capturing Compositional Knowledge in Linguistic Space Jiahe Wan et.al. 2412.15632 null
2024-12-20 Stabilizing Laplacian Inversion in Fokker-Planck Image Retrieval using the Transport-of-Intensity Equation Samantha J Alloo et.al. 2412.15513 null
2024-12-19 Learning Visual Composition through Improved Semantic Guidance Austin Stone et.al. 2412.15396 null
2024-12-19 MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Junjie Zhou et.al. 2412.14475 null
2024-12-18 Adversarial Hubness in Multi-Modal Retrieval Tingwei Zhang et.al. 2412.14113 link
2024-12-18 Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval Giacomo Pacini et.al. 2412.13834 null
2024-12-18 ConDo: Continual Domain Expansion for Absolute Pose Regression Zijun Li et.al. 2412.13452 link
2024-12-17 Three Things to Know about Deep Metric Learning Yash Patel et.al. 2412.12432 null
2024-12-15 Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval Zelong Sun et.al. 2412.11087 null
2024-12-20 Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2412.11077 link
2024-12-13 MVC-VPR: Mutual Learning of Viewpoint Classification and Visual Place Recognition Qiwen Gu et.al. 2412.09199 null
2024-12-12 A Flexible Plug-and-Play Module for Generating Variable-Length Liyang He et.al. 2412.08922 link
2024-12-11 Image Retrieval Methods in the Dissimilarity Space Madhu Kiran et.al. 2412.08618 null
2024-12-11 Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization Siyan Dong et.al. 2412.08376 link
2024-12-11 Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin Benjamin D. Killeen et.al. 2412.08020 null
2024-12-10 On Motion Blur and Deblurring in Visual Place Recognition Timur Ismagilov et.al. 2412.07751 null
2024-12-10 Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance Wanwen Chen et.al. 2412.07741 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488 link
2024-12-09 A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition Connor Malone et.al. 2412.06153 null
2024-12-07 Compositional Image Retrieval via Instruction-Aware Contrastive Learning Wenliang Zhong et.al. 2412.05756 link
2024-12-06 DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification Ying Jin et.al. 2412.04828 null
2024-12-04 Distillation of Diffusion Features for Semantic Correspondence Frank Fundel et.al. 2412.03512 null
2024-12-04 Composed Image Retrieval for Training-Free Domain Conversion Nikos Efthymiadis et.al. 2412.03297 link
2024-12-03 A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration Thulio Amorim et.al. 2412.02881 null
2024-12-03 Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval Leah Bar et.al. 2412.02310 link
2024-12-02 Mutli-View 3D Reconstruction using Knowledge Distillation Aditya Dutt et.al. 2412.02039 link
2024-12-02 Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features MD Shaikh Rahman et.al. 2412.01555 null
2024-12-02 Neuron Abandoning Attention Flow: Visual Explanation of Dynamics inside CNN Models Yi Liao et.al. 2412.01202 null
2024-12-01 EDTformer: An Efficient Decoder Transformer for Visual Place Recognition Tong Jin et.al. 2412.00784 link
2024-11-28 EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval Muhammad Huzaifa et.al. 2412.00139 null
2024-11-28 Unleashing the Power of Data Synthesis in Visual Localization Sihang Li et.al. 2412.00138 null
2024-11-28 Relation-Aware Meta-Learning for Zero-shot Sketch-Based Image Retrieval Yang Liu et.al. 2412.00120 null
2024-11-29 A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications Liqiang Zhang, Ye Tian, Dongyan Wei et.al. 2411.19845 null
2024-11-27 Optimizing Image Retrieval with an Extended b-Metric Space Abdelkader Belhenniche et.al. 2411.18800 null
2024-11-26 Learning Visual Hierarchies with Hyperbolic Embeddings Ziwei Wang et.al. 2411.17490 null
2024-12-02 Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy You Li et.al. 2411.16752 null
2024-12-02 AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks You Li et.al. 2411.16749 null
2024-11-25 Image Generation Diversity Issues and How to Tame Them Mischa Dombrowski et.al. 2411.16171 link
2024-11-24 PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments Haoang Li et.al. 2411.15800 null
2024-11-22 Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval Zengbao Sun et.al. 2411.14704 null
2024-11-20 Globally Correlation-Aware Hard Negative Generation Wenjie Peng et.al. 2411.13145 link
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-13 OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances Youqi Liao et.al. 2411.08665 link
2024-11-13 Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval Saul Santos et.al. 2411.08590 link
2024-11-22 Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments Ashkan Nejad et.al. 2411.08567 link
2024-11-13 MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation Peng Wang et.al. 2411.08279 link
2024-11-05 From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing Xintian Sun et.al. 2411.05826 null
2024-11-04 TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Maitreya Patel et.al. 2411.02545 null
2024-11-11 INQUIRE: A Natural World Text-to-Image Retrieval Benchmark Edward Vendrow et.al. 2411.02537 link
2024-11-20 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 Semantic Masking and Visual Feature Matching for Robust Localization Luisa Mao et.al. 2411.01804 null
2024-11-03 Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification MD Shaikh Rahman et.al. 2411.01473 null
2024-11-01 Identifying Implicit Social Biases in Vision-Language Models Kimia Hamidieh et.al. 2411.00997 null
2024-10-31 Nearest Neighbor Normalization Improves Multimodal Retrieval Neil Chowdhury et.al. 2410.24114 link
2024-10-31 MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval Haiwen Li et.al. 2410.23736 null
2024-10-30 Decoupling Semantic Similarity from Spatial Alignment for Neural Networks Tassilo Wald et.al. 2410.23107 link
2024-10-29 Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications Monica Riedler et.al. 2410.21943 link
2024-10-28 NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments Taiyi Pan et.al. 2410.21615 link
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-24 ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval Zijia Zhao et.al. 2410.18715 link
2024-10-25 On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features Tomáš Pivoňka et.al. 2410.18573 null
2024-10-22 Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2410.17393 null
2024-10-20 GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning Haiwen Diao et.al. 2410.15266 link
2024-10-19 Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway’s Digitised Book Collection Marie Roald et.al. 2410.14969 link
2024-10-16 Development of Image Collection Method Using YOLO and Siamese Network Chan Young Shin et.al. 2410.12561 null
2024-10-16 LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment Juelin Zhu et.al. 2410.12269 link
2024-10-16 Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization Nanda Febri Istighfarin et.al. 2410.12240 null
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 link
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-11 Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System Zheng Liu et.al. 2410.08935 link
2024-10-16 Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP Eunji Kim et.al. 2410.08469 null
2024-10-11 A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification Eugene P. W. Ang et.al. 2410.08456 null
2024-10-10 A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks Hoin Jung et.al. 2410.07593 link
2024-10-09 Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval Mohammad Omama et.al. 2410.07022 null
2024-10-09 Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers Stephen Hausler et.al. 2410.06614 link
2024-10-09 MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging Noel C. F. Codella et.al. 2410.06542 null
2024-10-08 Temporal Image Caption Retrieval Competition – Description and Results Jakub Pokrywka et.al. 2410.06314 null
2024-10-08 Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching Gongxin Yao et.al. 2410.06285 null
2024-10-08 GSLoc: Visual Localization with 3D Gaussian Splatting Kazii Botashev et.al. 2410.06165 null
2024-10-08 Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning Ayush Singh et.al. 2410.05928 null
2024-10-08 RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps Minsoo Kim et.al. 2410.05621 null
2024-10-09 LoTLIP: Improving Language-Image Pre-training for Long Text Understanding Wei Wu et.al. 2410.05249 null
2024-10-06 LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation Jianhao Jiao et.al. 2410.04419 null
2024-10-02 Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension Zaiquan Yang et.al. 2410.01544 null
2024-10-03 EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections Francesc Net et.al. 2410.01536 link
2024-10-04 CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment Safouane El Ghazouali et.al. 2410.01411 link
2024-09-30 Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Aleyna Kütük et.al. 2410.00266 null
2024-09-29 CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation Yifan Duan et.al. 2409.19597 null
2024-09-28 VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition Ahmad Khaliq et.al. 2409.19293 link
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152 null
2024-09-26 Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval Mankeerat Sidhu et.al. 2409.18733 null
2024-09-26 Revisit Anything: Visual Place Recognition via Image Segment Retrieval Kartik Garg et.al. 2409.18049 link
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-23 CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis Xiang Zhang et.al. 2409.15169 null
2024-09-21 Combining Absolute and Semi-Generalized Relative Poses for Visual Localization Vojtech Panek et.al. 2409.14269 null
2024-09-21 SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality Hongjia Zhai et.al. 2409.14067 null
2024-09-20 Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval Morris Florek et.al. 2409.13513 link
2024-09-18 Towards Global Localization using Multi-Modal Object-Instance Re-Identification Aneesh Chavan et.al. 2409.12002 link
2024-09-17 Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching Kurran Singh et.al. 2409.11555 null
2024-09-17 Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information Kunal Chelani et.al. 2409.11536 null
2024-09-17 Improving the Efficiency of Visually Augmented Language Models Paula Ontalvilla et.al. 2409.11148 link
2024-09-21 HGSLoc: 3DGS-based Heuristic Camera Pose Refinement Zhongyan Niu et.al. 2409.10925 null
2024-09-16 SOLVR: Submap Oriented LiDAR-Visual Re-Localisation Joshua Knights et.al. 2409.10247 null
2024-09-16 Garment Attribute Manipulation with Multi-level Attention Vittorio Casula et.al. 2409.10206 null
2024-09-14 Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval Amirreza Mahbod et.al. 2409.09430 link
2024-09-12 Structured Pruning for Efficient Visual Place Recognition Oliver Grainge et.al. 2409.07834 null
2024-09-10 GeoCalib: Learning Single-image Calibration with Geometric Optimization Alexander Veicht et.al. 2409.06704 link
2024-09-10 Weakly-supervised Camera Localization by Ground-to-satellite Image Registration Yujiao Shi et.al. 2409.06471 link
2024-09-10 A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions Zhicong Wu et.al. 2409.06381 null
2024-09-09 Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding Bram Willemsen et.al. 2409.05721 link
2024-09-09 Open-World Dynamic Prompt and Continual Visual Representation Learning Youngeun Kim et.al. 2409.05312 null
2024-09-12 Training-free ZS-CIR via Weighted Modality Fusion and Similarity Ren-Di Wu et.al. 2409.04918 link
2024-09-12 Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models Saghir Alfasly et.al. 2409.04631 null
2024-09-06 Reprojection Errors as Prompts for Efficient Scene Coordinate Regression Ting-Ru Liu et.al. 2409.04178 null
2024-09-06 Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments Therese Joseph et.al. 2409.03998 null
2024-09-04 Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications Abby Stylianou et.al. 2409.03012 null
2024-09-04 NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval Sepanta Zeighami et.al. 2409.02343 link
2024-09-03 Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment Konstantin Schall et.al. 2409.01936 link
2024-09-02 A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches Kim Jinwoo et.al. 2409.01219 null
2024-09-02 Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection Manon Kok et.al. 2409.01091 null
2024-09-02 Evidential Transformers for Improved Image Retrieval Danilo Dordevic et.al. 2409.01082 null
2024-09-05 EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System Bonan Liu et.al. 2409.00343 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-09-02 RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance Avideep Mukherjee et.al. 2408.17095 null
2024-08-29 A compact neuromorphic system for ultra energy-efficient, on-device robot localization Adam D. Hines et.al. 2408.16754 link
2024-08-29 Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models Kengo Nakata et.al. 2408.16296 null
2024-08-28 Temporal Attention for Cross-View Sequential Image Localization Dong Yuan et.al. 2408.15569 link
2024-08-27 Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild Tianqi Wei et.al. 2408.14723 null
2024-08-25 LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task Ali Asgarov et.al. 2408.13909 link
2024-08-15 Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval Lifeng Zhou et.al. 2408.13705 null
2024-08-15 Coarse-to-fine Alignment Makes Better Speech-image Retrieval Lifeng Zhou et.al. 2408.13119 null
2024-08-21 FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization Son Tung Nguyen et.al. 2408.12037 link
2024-08-21 Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations Lintong Zhang et.al. 2408.11966 null
2024-08-21 UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation Xiangyu Zhao et.al. 2408.11305 link
2024-08-20 GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting Changkun Liu et.al. 2408.11085 link
2024-08-19 BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval Zhenyu Lu et.al. 2408.10383 null
2024-08-23 Fashion Image-to-Image Translation for Complementary Item Retrieval Matteo Attimonelli et.al. 2408.09847 link
2024-08-20 MambaLoc: Efficient Camera Localisation via State Space Model Jialu Wang et.al. 2408.09680 null
2024-08-15 DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions Ryosuke Korekata et.al. 2408.07910 null
2024-08-13 A Miniature Vision-Based Localization System for Indoor Blimps Shicong Ma et.al. 2408.06648 null
2024-08-10 Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network Junyan Ye et.al. 2408.05475 link
2024-08-09 Spherical World-Locking for Audio-Visual Localization in Egocentric Videos Heeseung Yun et.al. 2408.05364 null
2024-08-06 AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval Pavel Suma et.al. 2408.03282 link
2024-08-05 CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration Gongxin Yao et.al. 2408.02394 null
2024-08-09 BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles Lun Luo et.al. 2408.01841 link
2024-08-02 On Validation of Search & Retrieval of Tissue Images in Digital Pathology H. R. Tizhoosh et.al. 2408.01570 null
2024-07-31 VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning Yuhang Ming et.al. 2407.21416 null
2024-07-31 SuperVINS: A visual-inertial SLAM framework integrated deep learning features Hongkun Luo et.al. 2407.21348 link
2024-07-30 Re-localization acceleration with Medoid Silhouette Clustering Hongyi Zhang et.al. 2407.20749 null
2024-07-29 A flexible framework for accurate LiDAR odometry, map manipulation, and localization José Luis Blanco-Claraco et.al. 2407.20465 link
2024-07-26 From 2D to 3D: AISG-SLA Visual Localization Challenge Jialin Gao et.al. 2407.18590 null
2024-07-24 Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation Yongqi Li et.al. 2407.17274 null
2024-07-24 Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments Wei Gao et.al. 2407.17078 null
2024-07-24 Pose Estimation from Camera Images for Underwater Inspection Luyuan Peng et.al. 2407.16961 null
2024-07-22 Memory Management for Real-Time Appearance-Based Loop Closure Detection Mathieu Labbé et.al. 2407.15890 null
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-22 Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM Mathieu Labbé et.al. 2407.15305 null
2024-07-22 Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation Mathieu Labbé et.al. 2407.15304 null
2024-07-19 Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization Yuehua Ding et.al. 2407.14643 null
2024-07-18 Visual Haystacks: Answering Harder Questions About Sets of Images Tsung-Han Wu et.al. 2407.13766 link
2024-07-17 Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM Markus Weißflog et.al. 2407.12408 null
2024-07-17 GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection Jingwen Yu et.al. 2407.11736 link
2024-07-16 EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis Ruijie Yang et.al. 2407.11401 null
2024-07-15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Walter Simoncini et.al. 2407.10964 link
2024-07-15 DINO Pre-training for Vision-based End-to-end Autonomous Driving Shubham Juneja et.al. 2407.10803 null
2024-07-15 Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval Youngsun Lim et.al. 2407.10683 null
2024-07-15 An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots J. J. Cabrera et.al. 2407.10596 link
2024-07-15 An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments J. J. Cabrera et.al. 2407.10536 null
2024-07-12 Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval Vaibhav Balloli et.al. 2407.08908 link
2024-07-11 Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates Owen Claxton et.al. 2407.08162 link
2024-07-12 Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal Xinyu Zhu et.al. 2407.08153 link
2024-07-11 SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM Neng Wang et.al. 2407.08106 link
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730 null
2024-07-09 CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding Wenhao Xu et.al. 2407.06611 null
2024-07-08 Pseudo-triplet Guided Few-shot Composed Image Retrieval Bohan Hou et.al. 2407.06001 null
2024-07-09 HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels Yingying Jiang et.al. 2407.05795 null
2024-07-05 Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning Mainak Singha et.al. 2407.04207 link
2024-07-04 Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models Chang-Sheng Kao et.al. 2407.03615 link
2024-07-03 Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach Pronay Debnath et.al. 2407.03486 null
2024-07-02 Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition Sergio Izquierdo et.al. 2407.02422 link
2024-07-01 Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval Aneeshan Sain et.al. 2407.01810 null
2024-07-01 Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval Hanwen Su et.al. 2407.00979 null
2024-07-01 Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios Connor Malone et.al. 2407.00863 null
2024-06-27 PathAlign: A vision-language model for whole slide images in histopathology Faruk Ahmed et.al. 2406.19578 null
2024-07-05 360 in the Wild: Dataset for Depth Prediction and View Synthesis Kibaek Park et.al. 2406.18898 null
2024-06-27 Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs Huaying Zhang et.al. 2406.18836 null
2024-06-26 WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images Yannik Glaser et.al. 2406.18765 link
2024-06-26 View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis Subin Varghese et.al. 2406.18012 null
2024-06-25 Tell Me Where You Are: Multimodal LLMs Meet Place Recognition Zonglin Lyu et.al. 2406.17520 null
2024-06-25 SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation Xu Liu et.al. 2406.17249 link
2024-06-23 Breaking the Frame: Image Retrieval by Visual Overlap Prediction Tong Wei et.al. 2406.16204 link
2024-06-19 Towards a multimodal framework for remote sensing image change retrieval and captioning Roger Ferrod et.al. 2406.13424 link
2024-06-19 CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval Christian Lülf et.al. 2406.13322 link
2024-06-17 Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization Huaiji Zhou et.al. 2406.11766 null
2024-06-22 Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment Jianan Jiang et.al. 2406.11551 link
2024-06-17 They’re All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias Salma Abdel Magid et.al. 2406.11331 null
2024-06-17 Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion Guoyuan An et.al. 2406.11242 null
2024-06-14 Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval Genc Hoxha et.al. 2406.10107 null
2024-06-14 BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval Imanol Miranda et.al. 2406.09952 link
2024-06-13 Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases Meng Wang et.al. 2406.09317 link
2024-06-13 Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval Jaeseok Byun et.al. 2406.09188 null
2024-06-13 DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification Zhengrui Xu et.al. 2406.08773 link
2024-06-12 Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement Maxime Pietrantoni et.al. 2406.08463 null
2024-06-12 ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery Kam Woh Ng et.al. 2406.08457 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502 link
2024-06-11 Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Shuvendu Roy et.al. 2406.07450 link
2024-06-11 Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval Adrià Molina et.al. 2406.07315 null
2024-06-10 Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation Shenghao Li et.al. 2406.06374 link
2024-06-09 Unified Text-to-Image Generation and Retrieval Leigang Qu et.al. 2406.05814 null
2024-06-07 The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better Scott Geng et.al. 2406.05184 link
2024-06-07 PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction Eduard Poesina et.al. 2406.04746 link
2024-06-06 GLACE: Global Local Accelerated Coordinate Encoding Fangjinhua Wang et.al. 2406.04340 link
2024-06-06 Monocular Localization with Semantics Map for Autonomous Vehicles Jixiang Wan et.al. 2406.03835 null
2024-06-05 Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach Saehyung Lee et.al. 2406.03411 link
2024-06-04 MeshVPR: Citywide Visual Place Recognition Using 3D Meshes Gabriele Berton et.al. 2406.02776 null
2024-06-04 Can CLIP help CLIP in learning 3D? Cristian Sbrolli et.al. 2406.02202 null
2024-06-03 Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP Sriram Balasubramanian et.al. 2406.01583 link
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315 link
2024-06-02 Visual place recognition for aerial imagery: A survey Ivan Moskalenko et.al. 2406.00885 link
2024-06-01 NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization Wugang Meng et.al. 2406.00312 null
2024-05-31 DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models Linli Yao et.al. 2405.20985 link
2024-05-29 Multi-Modal Generative Embedding Model Feipeng Ma et.al. 2405.19333 null
2024-05-29 ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions Honglin Lin et.al. 2405.19226 null
2024-05-30 CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval Xintong Jiang et.al. 2405.19149 link
2024-05-29 SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation Zhenbei Wu et.al. 2405.18801 null
2024-05-29 Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs Jialiang Xu et.al. 2405.18740 link
2024-05-28 EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition Issar Tzachor et.al. 2405.18065 null
2024-05-28 AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval Sihe Zhang et.al. 2405.17718 null
2024-05-26 MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups Yusen Xie et.al. 2405.16599 null
2024-05-29 Composed Image Retrieval for Remote Sensing Bill Psomas et.al. 2405.15587 link
2024-05-24 Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval Yiming Wu et.al. 2405.15451 null
2024-05-20 UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization Wenjia Xu et.al. 2405.11936 link
2024-05-19 Register assisted aggregation for Visual Place Recognition Xuan Yu et.al. 2405.11526 null
2024-05-26 CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion Gang Wang et.al. 2405.10793 null
2024-05-16 FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models Adrian Bulat et.al. 2405.10286 null
2024-05-15 Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study Farnaz Khun Jush et.al. 2405.09334 null
2024-05-14 BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment Lihong Jin et.al. 2405.09001 null
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-13 OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition Qiuchi Xiang et.al. 2405.07966 link
2024-05-14 HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval Chao He et.al. 2405.07524 link
2024-05-13 JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation Xubo Luo et.al. 2405.07429 link
2024-05-12 BoQ: A Place is Worth a Bag of Learnable Queries Amar Ali-bey et.al. 2405.07364 link
2024-05-07 Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction Nematollah Saeidi et.al. 2405.04211 null
2024-05-06 A New Robust Partial $p$-Wasserstein-Based Metric for Comparing Distributions Sharath Raghvendra et.al. 2405.03664 null
2024-05-06 Knowledge-aware Text-Image Retrieval for Remote Sensing Images Li Mi et.al. 2405.03373 null
2024-05-06 Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval Jiacheng Cheng et.al. 2405.03190 null
2024-05-05 iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval Lorenzo Agnolucci et.al. 2405.02951 link
2024-05-01 Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval Young Kyun Jang et.al. 2405.00571 null
2024-04-30 Large Language Model Informed Patent Image Retrieval Hao-Cheng Lo et.al. 2404.19360 null
2024-04-30 XFeat: Accelerated Features for Lightweight Image Matching Guilherme Potje et.al. 2404.19174 null
2024-04-29 Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models Hongyi Zhu et.al. 2404.18746 null
2024-04-29 Dual-Modal Prompting for Sketch-Based Image Retrieval Liying Gao et.al. 2404.18695 null
2024-05-01 Semantic Line Combination Detector Jinwon Ko et.al. 2404.18399 link
2024-04-26 Learning text-to-video retrieval from image captioning Lucas Ventura et.al. 2404.17498 null
2024-04-25 CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching Samia Shafique et.al. 2404.16972 link
2024-04-29 Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval Ryoya Nara et.al. 2404.16398 null
2024-04-24 Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval Haokun Wen et.al. 2404.15875 link
2024-04-24 DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines Xin Jiang et.al. 2404.15771 null
2024-04-23 Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval Young Kyun Jang et.al. 2404.15516 null
2024-04-22 EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models Mathias Thorsager et.al. 2404.14236 null
2024-04-22 Hierarchical localization with panoramic views and triplet loss functions Marcos Alfaro et.al. 2404.14117 link
2024-04-20 High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces Baoru Huang et.al. 2404.13437 null
2024-04-20 Collaborative Visual Place Recognition through Federated Learning Mattia Dutto et.al. 2404.13324 null
2024-04-18 SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints Spencer Carmichael et.al. 2404.12339 null
2024-04-17 Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives Zhangchi Feng et.al. 2404.11317 link
2024-04-17 Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing Sanggeon Yun et.al. 2404.11025 null
2024-04-16 SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments Niklas Gard et.al. 2404.10527 link
2024-04-20 CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning Haojian Huang et.al. 2404.09640 link
2024-04-11 PRAM: Place Recognition Anywhere Model for Efficient Visual Localization Fei Xue et.al. 2404.07785 null
2024-04-16 2DLIW-SLAM: 2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure Bin Zhang et.al. 2404.07644 link
2024-04-11 Semantically-correlated memories in a dense associative model Thomas F Burns et.al. 2404.07123 link
2024-04-09 Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation Luca Barsellotti et.al. 2404.06542 null
2024-04-09 Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping Anas Gouda et.al. 2404.06277 link
2024-04-07 Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval Jinpeng Wang et.al. 2404.04998 link
2024-04-06 Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning Juncheng Yang et.al. 2404.04538 link
2024-04-05 Towards introspective loop closure in 4D radar SLAM Maximilian Hilger et.al. 2404.03940 null
2024-04-02 TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation Yehui Shen et.al. 2404.01587 link
2024-04-01 On Train-Test Class Overlap and Detection for Image Retrieval Chull Hwan Song et.al. 2404.01524 link
2024-04-01 NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification Juyeop Han et.al. 2404.01400 null
2024-03-31 On the Estimation of Image-matching Uncertainty in Visual Place Recognition Mubariz Zaffar et.al. 2404.00546 null
2024-03-31 NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation Diwei Sheng et.al. 2404.00504 null
2024-03-30 SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs Yang Miao et.al. 2404.00469 null
2024-03-30 Do Vision-Language Models Understand Compound Nouns? Sonal Kumar et.al. 2404.00419 link
2024-04-05 FairRAG: Fair Human Generation via Fair Retrieval Augmentation Robik Shrestha et.al. 2403.19964 null
2024-03-28 JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition Gabriele Berton et.al. 2403.19787 link
2024-03-28 MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions Kai Zhang et.al. 2403.19651 link
2024-03-27 AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation Changkun Liu et.al. 2403.18281 null
2024-03-26 Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge Dongjin Kim et.al. 2403.17420 link
2024-03-25 Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras Gokul B. Nair et.al. 2403.16425 link
2024-03-24 Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval Yucheng Suo et.al. 2403.16005 link
2024-03-24 BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval Yinda Chen et.al. 2403.15992 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378 link
2024-03-22 A Multimodal Approach for Cross-Domain Image Retrieval Lucas Iijima et.al. 2403.15152 null
2024-03-22 Piecewise-Linear Manifolds for Deep Metric Learning Shubhang Bhatnagar et.al. 2403.14977 null
2024-03-21 Enhancing Historical Image Retrieval with Compositional Cues Tingyu Lin et.al. 2403.14287 link
2024-03-20 Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval Aymene Berriche et.al. 2403.13747 null
2024-03-20 Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval Haoyu Liu et.al. 2403.13317 null
2024-03-19 Learning Neural Volumetric Pose Features for Camera Localization Jingyu Lin et.al. 2403.12800 null
2024-03-19 Quantixar: High-performance Vector Data Management System Gulshan Yadav et.al. 2403.12583 null
2024-03-17 3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization Peng Jiang et.al. 2403.11367 null
2024-03-17 MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data Paul S. Scotti et.al. 2403.11207 link
2024-03-16 Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval Shunsuke Tsubaki et.al. 2403.10756 null
2024-03-16 Vector search with small radiuses Gergely Szilvasy et.al. 2403.10746 null
2024-03-13 Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer Kenta Tsukahara et.al. 2403.10552 null
2024-03-20 Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression Huy-Hoang Bui et.al. 2403.10297 link
2024-03-15 Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline Fangming Yuan et.al. 2403.10283 null
2024-03-14 The NeRFect Match: Exploring NeRF Features for Visual Localization Qunjie Zhou et.al. 2403.09577 null
2024-03-14 VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition Benjamin Ramtoula et.al. 2403.09025 null
2024-03-13 PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models Siddharth Mishra-Sharma et.al. 2403.08851 link
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156 link
2024-03-12 It’s All About Your Sketch: Democratising Sketch Control in Diffusion Models Subhadeep Koley et.al. 2403.07234 link
2024-03-12 You’ll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval Subhadeep Koley et.al. 2403.07222 null
2024-03-12 Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers Subhadeep Koley et.al. 2403.07214 null
2024-03-11 How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? Subhadeep Koley et.al. 2403.07203 null
2024-03-11 EarthLoc: Astronaut Photography Localization by Indexing Earth from Space Gabriele Berton et.al. 2403.06758 link
2024-03-11 BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues Fudong Ge et.al. 2403.06600 link
2024-03-11 Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology Stefan Denner et.al. 2403.06567 link
2024-03-10 RTAB-Map as an Open-Source Lidar and Visual SLAM Library for Large-Scale and Long-Term Online Operation Mathieu Labbé et.al. 2403.06341 null
2024-03-10 Texture image retrieval using a classification and contourlet-based features Asal Rouhafzay et.al. 2403.06048 null
2024-03-11 LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map Xinrui Wu et.al. 2403.05002 link
2024-03-11 Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed Yifan Wang et.al. 2403.04765 null
2024-03-07 mmPlace: Robust Place Recognition with Intermediate Frequency Signal of Low-cost Single-chip Millimeter Wave Radar Chengzhen Meng et.al. 2403.04703 null
2024-03-06 Self-supervised Photographic Image Layout Representation Learning Zhaoran Zhao et.al. 2403.03740 link
2024-03-04 Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models Benedikt Blumenstiel et.al. 2403.02059 link
2024-03-03 Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval Yongchao Du et.al. 2403.01431 null
2024-03-01 Asymmetric Feature Fusion for Image Retrieval Hui Wu et.al. 2403.00671 null
2024-03-01 Structure Similarity Preservation Learning for Asymmetric Image Retrieval Hui Wu et.al. 2403.00648 link
2024-02-29 CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition Feng Lu et.al. 2402.19231 link
2024-02-28 Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport Bin Li et.al. 2402.18411 link
2024-02-28 Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning Hanyao Wang et.al. 2402.18400 null
2024-02-28 Representing 3D sparse map points and lines for camera relocalization Bach-Thuan Bui et.al. 2402.18011 link
2024-02-27 Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control Thong Nguyen et.al. 2402.17535 link
2024-02-29 Active propulsion noise shaping for multi-rotor aircraft localization Gabriele Serussi et.al. 2402.17289 link
2024-02-27 NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer Bingxi Liu et.al. 2402.17159 link
2024-02-25 Deep Homography Estimation for Visual Place Recognition Feng Lu et.al. 2402.16086 link
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961 link
2024-02-28 Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries Zijun Long et.al. 2402.15276 null
2024-02-23 Fine-tuning CLIP Text Encoders with Two-step Paraphrasing Hyunjae Kim et.al. 2402.15120 null
2024-02-22 Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition Feng Lu et.al. 2402.14505 link
2024-02-16 Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition Chenming Hu et.al. 2402.10476 null
2024-02-15 Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task Mirko Nava et.al. 2402.09886 link
2024-02-14 Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency Yannis Kalantidis et.al. 2402.09237 null
2024-02-13 Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast Xiangming Gu et.al. 2402.08567 link
2024-02-13 Learning to Produce Semi-dense Correspondences for Visual Localization Khang Truong Giang et.al. 2402.08359 link
2024-02-10 Semantic Object-level Modeling for Robust Visual Camera Relocalization Yifan Zhu et.al. 2402.06951 null
2024-02-09 Large Language Models for Captioning and Retrieving Remote Sensing Images João Daniel Silva et.al. 2402.06475 null
2024-02-09 PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes Xinggang Hu et.al. 2402.06131 null
2024-02-21 MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction Heng Zhou et.al. 2402.03762 null
2024-02-04 Region-Based Representations Revisited Michal Shlapentokh-Rothman et.al. 2402.02352 link
2024-02-03 Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization Bo Yang et.al. 2402.02141 link
2024-02-01 BrainSLAM: SLAM on Neural Population Activity Data Kipp Freud et.al. 2402.00588 null
2024-02-01 Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering Tianxiao Gao et.al. 2402.00330 link
2024-01-31 Improved Scene Landmark Detection for Camera Localization Tien Do et.al. 2401.18083 link
2024-01-31 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592 link
2024-01-29 Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors Shiyin Dong et.al. 2401.16459 null
2024-01-29 Cross-Modal Coordination Across a Diverse Set of Input Modalities Jorge Sánchez et.al. 2401.16347 null
2024-01-29 Regressing Transformers for Data-efficient Visual Place Recognition María Leyva-Vallina et.al. 2401.16304 null
2024-01-27 Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval Ayush Dubey et.al. 2401.15362 null
2024-01-24 Enhancing Image Retrieval: A Comprehensive Study on Photo Search using the CLIP Mode Naresh Kumar Lahajal et.al. 2401.13613 null
2024-01-23 PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion Shyam Sundar Kannan et.al. 2401.13082 null
2024-01-23 SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization Mingyang Li et.al. 2401.13076 link
2024-01-25 CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios Xiangshuo Qiao et.al. 2401.10475 link
2024-01-19 PhotoScout: Synthesis-Powered Multi-Modal Image Search Celeste Barnaby et.al. 2401.10464 null
2024-01-19 Cross-Modality Perturbation Synergy Attack for Person Re-identification Yunpeng Gong et.al. 2401.10090 null
2024-01-16 Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging Zahra Tabatabaei et.al. 2401.08272 null
2024-01-16 Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments Bruno Arcanjo et.al. 2401.08263 null
2024-01-15 Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing Jakob Hackstein et.al. 2401.07782 link
2024-01-14 HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval Zexuan Qiu et.al. 2401.07212 link
2024-01-11 UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization Rouwan Wu et.al. 2401.05971 link
2024-01-10 Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval Eunyi Lyou et.al. 2401.04860 link
2024-01-05 Benchmarking PathCLIP for Pathology Image Analysis Sunyi Zheng et.al. 2401.02651 null
2024-01-03 DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM with Joint Semantic Encoding Mingrui Li et.al. 2401.01545 null
2024-01-02 BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving Dafeng Wei et.al. 2401.01065 null
2023-12-31 Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval Liang Wang et.al. 2401.00371 link
2023-12-29 Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering Long-Kun Du et.al. 2401.00032 null
2023-12-27 LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization Sai Shubodh Puligilla et.al. 2312.16648 null
2023-12-26 Recursive Distillation for Open-Set Distributed Robot Localization Kenta Tsukahara et.al. 2312.15897 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-23 CaLDiff: Camera Localization in NeRF via Pose Diffusion Rashik Shrestha et.al. 2312.15242 null
2023-12-20 Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition Bruno Arcanjo et.al. 2312.12995 null
2023-12-19 VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering Chun-Mei Feng et.al. 2312.12273 link
2023-12-18 Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback Boaz Lerner et.al. 2312.11078 link
2023-12-17 PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields Boming Zhao et.al. 2312.10649 null
2023-12-17 DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition Sijie Wang et.al. 2312.10616 link
2023-12-16 Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval Decheng Liu et.al. 2312.10320 link
2023-12-15 Data-Efficient Multimodal Fusion on a Single GPU Noël Vouitsis et.al. 2312.10144 link
2023-12-13 Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques Hamed Qazanfari et.al. 2312.10089 null
2023-12-15 Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval Zhe Ma et.al. 2312.09716 link
2023-12-14 Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition Oliver Grainge et.al. 2312.09028 null
2023-12-14 Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking Shitong Sun et.al. 2312.08924 null
2023-12-13 C-BEV: Contrastive Bird’s Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation Florian Fervers et.al. 2312.08060 null
2023-12-12 Contextually Affinitive Neighborhood Refinery for Deep Clustering Chunlin Yu et.al. 2312.07806 link
2023-12-12 Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval Qiwei Tian et.al. 2312.07364 link
2023-12-12 Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection Jonathan J. Y. Kim et.al. 2312.06991 null
2023-12-11 Dynamic Weighted Combiner for Mixed-Modal Image Retrieval Fuxiang Huang et.al. 2312.06179 link
2023-12-06 Lite-Mind: Towards Efficient and Versatile Brain Representation Network Zixuan Gong et.al. 2312.03781 link
2023-12-08 FreestyleRet: Retrieving Images from Style-Diversified Queries Hao Li et.al. 2312.02428 link
2023-12-04 Implicit Learning of Scene Geometry from Poses for Global Localization Mohammad Altillawi et.al. 2312.02029 null
2023-12-04 Language-only Efficient Training of Zero-shot Composed Image Retrieval Geonmo Gu et.al. 2312.01998 link
2023-12-03 G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training Che Liu et.al. 2312.01522 link
2023-12-01 Improve Supervised Representation Learning with Masked Image Modeling Kaifeng Chen et.al. 2312.00950 null
2023-12-05 Grounding Everything: Emerging Localization Properties in Vision-Language Transformers Walid Bousselham et.al. 2312.00878 link
2023-12-01 Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras Mohammad Altillawi et.al. 2312.00500 null
2023-11-30 HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance Zhuohao Yin et.al. 2311.18273 link
2023-11-30 Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models Raviteja Vemulapalli et.al. 2311.18237 link
2023-11-29 Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce Chang Liu et.al. 2311.17954 null
2023-11-28 Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames Chao Chen et.al. 2311.17940 null
2023-11-29 360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries Huajian Huang et.al. 2311.17389 link
2023-11-27 Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation Samuele Poppi et.al. 2311.16254 link
2023-11-27 Optimal Transport Aggregation for Visual Place Recognition Sergio Izquierdo et.al. 2311.15937 link
2023-11-27 AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval Shicheng Xu et.al. 2311.14084 link
2023-11-23 3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology Asma Ben Abacha et.al. 2311.13752 link
2023-11-22 Medical Image Retrieval Using Pretrained Embeddings Farnaz Khun Jush et.al. 2311.13547 null
2023-11-22 Applications of Spiking Neural Networks in Visual Place Recognition Somayeh Hussaini et.al. 2311.13186 link
2023-11-21 Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval Xiu-Shen Wei et.al. 2311.12894 null
2023-11-21 Towards Accurate Loop Closure Detection in Semantic SLAM with 3D Semantic Covisibility Graphs Zhentian Qian et.al. 2311.12245 null
2023-11-19 From Categories to Classifier: Name-Only Continual Learning by Exploring the Web Ameya Prabhu et.al. 2311.11293 null
2023-11-18 Lesion Search with Self-supervised Learning Kristin Qi et.al. 2311.11014 null
2023-11-15 Flow reconstruction and particle characterization from inertial Lagrangian tracks Ke Zhou et.al. 2311.09076 null
2023-11-15 Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval Junyang Chen et.al. 2311.07622 link
2023-11-13 VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search Shuting He et.al. 2311.07514 null
2023-11-10 Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval Xin Lu et.al. 2311.06067 null
2023-11-08 Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model Junya Shiraishi et.al. 2311.04788 null
2023-11-08 Training CLIP models on Data from Scientific Papers Calvin Metzger et.al. 2311.04711 link
2023-11-07 DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding Kehinde Ajayi et.al. 2311.04098 link
2023-11-06 Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences Zador Pataki et.al. 2311.03345 null
2023-11-06 FocusTune: Tuning Visual Localization through Focus-Guided Sampling Son Tung Nguyen et.al. 2311.02872 link
2023-11-01 DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing Gaoshuang Huang et.al. 2311.00230 link
2023-10-29 Identifiable Contrastive Learning with Automatic Feature Importance Discovery Qi Zhang et.al. 2310.18904 link
2023-10-27 LipSim: A Provably Robust Perceptual Similarity Metric Sara Ghazanfari et.al. 2310.18274 link
2023-10-27 Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation Susu Fang et.al. 2310.17879 null
2023-10-25 FoundLoc: Vision-based Onboard Aerial Localization in the Wild Yao He et.al. 2310.16299 null
2023-10-24 Cross-view Self-localization from Synthesized Scene-graphs Ryogo Yamamoto et.al. 2310.15504 null
2023-10-23 Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval Xu Yuan et.al. 2310.14637 link
2023-10-21 Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation Anastasia Kritharoula et.al. 2310.14025 link
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605 null
2023-10-20 CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants Shaoan Wang et.al. 2310.13320 link
2023-10-27 Representation Learning via Consistent Assignment of Views over Random Partitions Thalles Silva et.al. 2310.12692 link
2023-10-18 Evaluating the Fairness of Discriminative Foundation Models in Computer Vision Junaid Ali et.al. 2310.11867 link
2023-10-17 Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification Shuanglin Yan et.al. 2310.11210 null
2023-10-16 Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People Dharmateja Adapa et.al. 2310.10290 null
2023-10-16 EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge Tom Bryan et.al. 2310.10050 null
2023-10-15 CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes Yulei Qin et.al. 2310.09761 link
2023-10-13 Pairwise Similarity Learning is SimPLE Yandong Wen et.al. 2310.09449 link
2023-10-13 Vision-by-Language for Training-Free Compositional Image Retrieval Shyamgopal Karthik et.al. 2310.09291 link
2023-10-12 Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning Shiyang Yan et.al. 2310.08390 null
2023-10-12 Jointly Optimized Global-Local Visual Localization of UAVs Haoling Li et.al. 2310.08082 null
2023-10-10 Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization Le Chen et.al. 2310.06984 null
2023-10-10 Distillation Improves Visual Place Recognition for Low-Quality Queries Anbang Yang et.al. 2310.06906 link
2023-10-10 Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets Jiajun Zhang et.al. 2310.06566 null
2023-10-10 Topological RANSAC for instance verification and retrieval without fine-tuning Guoyuan An et.al. 2310.06486 null
2023-10-10 3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments Ghanta Sai Krishna et.al. 2310.06385 null
2023-10-09 Collaborative Visual Place Recognition Yiming Li et.al. 2310.05541 null
2023-10-09 Sentence-level Prompts Benefit Composed Image Retrieval Yang Bai et.al. 2310.05473 link
2023-10-08 AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition Feng Lu et.al. 2310.05184 link
2023-10-08 LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization Artem Nenashev et.al. 2310.05134 null
2023-10-12 ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer Yifan Xu et.al. 2310.04099 null
2023-10-06 Sub-token ViT Embedding via Stochastic Resonance Transformers Dong Lao et.al. 2310.03967 link
2023-10-04 Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach Matthew Hanlon et.al. 2310.02650 null
2023-10-02 NEUCORE: Neural Concept Reasoning for Composed Image Retrieval Shu Zhao et.al. 2310.01358 null
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092 null
2023-10-05 PlaceNav: Topological Navigation through Place Recognition Lauri Suomela et.al. 2309.17260 null
2023-09-29 Segment Anything Model is a Good Teacher for Local Feature Learning Jingqian Wu et.al. 2309.16992 link
2023-09-28 Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning Albert Mohwald et.al. 2309.16351 link
2023-09-28 FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding Pengxiang Wu et.al. 2309.16249 link
2023-09-28 Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2309.16137 link
2023-09-27 GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization Vicente Vivanco Cepeda et.al. 2309.16020 link
2023-09-27 Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization Zhenbo Song et.al. 2309.15556 null
2023-09-26 Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features Hila Levi et.al. 2309.14999 null
2023-09-23 Resolving References in Visually-Grounded Dialogue via Text Generation Bram Willemsen et.al. 2309.13430 link
2023-09-21 Face Identity-Aware Disentanglement in StyleGAN Adrian Suwała et.al. 2309.12033 null
2023-09-21 On-the-Fly SfM: What you capture is What you get Zongqian Zhan et.al. 2309.11883 link
2023-09-20 2D-3D Pose Tracking with Multi-View Constraints Huai Yu et.al. 2309.11335 null
2023-09-19 VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition Adam D. Hines et.al. 2309.10225 link
2023-09-18 DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach Chenghao Xu et.al. 2309.09879 null
2023-09-18 Decompose Semantic Shifts for Composed Image Retrieval Xingyu Yang et.al. 2309.09531 null
2023-09-16 Efficient Object Rearrangement via Multi-view Fusion Dehao Huang et.al. 2309.08994 null
2023-09-16 DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF Mert Asim Karaoglu et.al. 2309.08927 link
2023-09-16 Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning Pengyu Yin et.al. 2309.08914 link
2023-09-15 Active Learning for Fine-Grained Sketch-Based Image Retrieval Himanshu Thakur et.al. 2309.08743 null
2023-09-15 Optimization of Rank Losses for Image Retrieval Elias Ramzi et.al. 2309.08250 link
2023-09-18 Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer Yaoting Wang et.al. 2309.07929 link
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471 link
2023-09-13 RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline Mirko Usuelli et.al. 2309.07094 null
2023-09-11 Towards Content-based Pixel Retrieval in Revisited Oxford and Paris Guoyuan An et.al. 2309.05438 link
2023-09-08 Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning Hiroki Nakamura et.al. 2309.04148 null
2023-09-05 Magnetic Navigation using Attitude-Invariant Magnetic Field Information for Loop Closure Detection Natalia Pavlasek et.al. 2309.02394 null
2023-09-05 Dual Relation Alignment for Composed Image Retrieval Xintong Jiang et.al. 2309.02169 null
2023-09-04 NLLB-CLIP – train performant multilingual image retrieval model on a budget Alexander Visheratin et.al. 2309.01859 null
2023-09-04 Target-Guided Composed Image Retrieval Haokun Wen et.al. 2309.01366 null
2023-09-02 Deep supervised hashing for fast retrieval of radio image cubes Steven Ndung’u et.al. 2309.00932 null
2023-08-31 Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval Prateksha Udhayanan et.al. 2308.16649 null
2023-08-28 Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics Nils Böhne et.al. 2308.14786 null
2023-08-28 CoVR: Learning Composed Video Retrieval from Web Video Captions Lucas Ventura et.al. 2308.14746 link
2023-08-27 Deep Learning for Visual Localization and Mapping: A Survey Changhao Chen et.al. 2308.14039 null
2023-08-26 Learning Efficient Representations for Image-Based Patent Retrieval Hongsong Wang et.al. 2308.13749 null
2023-08-25 Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers Mohammad Javad Rajabi et.al. 2308.13671 null
2023-08-24 Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities Jinze Bai et.al. 2308.12966 link
2023-08-23 Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval Huafeng Li et.al. 2308.11994 null
2023-08-23 OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes Tao Xie et.al. 2308.11928 link
2023-08-22 Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features Alberto Baldrati et.al. 2308.11485 link
2023-08-22 GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training Xinchi Deng et.al. 2308.11331 null
2023-08-22 LDP-Feat: Image Features with Local Differential Privacy Francesco Pittaluga et.al. 2308.11223 null
2023-08-21 EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition Gabriele Berton et.al. 2308.10832 link
2023-08-20 FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory Anwesan Pal et.al. 2308.10170 null
2023-08-18 3D Model-free Visual localization System from Essential Matrix under Local Planar Motion Yanmei Jiao et.al. 2308.09566 null
2023-08-17 FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings Yulin Su et.al. 2308.09012 link
2023-08-16 Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval Aishwarya Venkataramanan et.al. 2308.08431 link
2023-08-16 Ranking-aware Uncertainty for Text-guided Image Retrieval Junyang Chen et.al. 2308.08131 null
2023-08-19 Global Features are All You Need for Image Retrieval and Reranking Shihao Shao et.al. 2308.06954 link
2023-08-14 MixBCT: Towards Self-Adapting Backward-Compatible Training Yu Liang et.al. 2308.06948 link
2023-08-10 KS-APR: Keyframe Selection for Robust Absolute Pose Regression Changkun Liu et.al. 2308.05459 null
2023-08-09 AspectMMKG: A Multi-modal Knowledge Graph with Aspect-aware Entities Jingdan Zhang et.al. 2308.04992 link
2023-08-08 Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval Yi Bin et.al. 2308.04343 link
2023-08-08 Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval Yunquan Zhu et.al. 2308.04008 link
2023-08-05 A Comprehensive Analysis of Real-World Image Captioning and Scene Identification Sai Suprabhanu Nallapaneni et.al. 2308.02833 null
2023-08-03 Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies Eunsuk Seo et.al. 2308.01871 null
2023-08-01 AnyLoc: Towards Universal Visual Place Recognition Nikhil Keetha et.al. 2308.00688 link
2023-07-31 Guiding Image Captioning Models Toward More Specific Captions Simon Kornblith et.al. 2307.16686 null
2023-07-31 Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks Kousik Rajesh et.al. 2307.16395 null
2023-07-28 D2S: Representing local descriptors and global scene coordinates for camera relocalization Bach-Thuan Bui et.al. 2307.15250 link
2023-07-26 Neural-based Cross-modal Search and Retrieval of Artwork Yan Gong et.al. 2307.14244 null
2023-07-26 Boon: A Neural Search Engine for Cross-Modal Information Retrieval Yan Gong et.al. 2307.14240 null
2023-07-25 Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network Chull Hwan Song et.al. 2307.13254 null
2023-07-28 SACReg: Scene-Agnostic Coordinate Regression for Visual Localization Jerome Revaud et.al. 2307.11702 null
2023-07-19 Lazy Visual Localization via Motion Averaging Siyan Dong et.al. 2307.09981 null
2023-07-19 Quantum Optics based Algorithm for Measuring the Similarity between Images Vivek Mehta et.al. 2307.09789 null
2023-07-18 Jean-Luc Picard at Touché 2023: Comparing Image Generation, Stance Detection and Feature Matching for Image Retrieval for Arguments Max Moebius et.al. 2307.09172 null
2023-07-18 3D-SeqMOS: A Novel Sequential 3D Moving Object Segmentation in Autonomous Driving Qipeng Li et.al. 2307.09044 null
2023-07-19 Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation Rundong Luo et.al. 2307.08779 null
2023-07-17 Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition Gabriele Trivigno et.al. 2307.08417 link
2023-07-17 Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification Tengfei Liang et.al. 2307.08316 link
2023-07-17 NDT-Map-Code: A 3D global descriptor for real-time loop closure detection in lidar SLAM Lizhou Liao et.al. 2307.08221 link
2023-07-20 Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer Yujiao Shi et.al. 2307.08015 link
2023-07-10 Phoneme-retrieval; voice recognition; vowels recognition Brunello Tirozzi et.al. 2307.07407 null
2023-07-14 Risk Controlled Image Retrieval Kaiwen Cai et.al. 2307.07336 link
2023-07-11 ResMatch: Residual Attention Learning for Local Feature Matching Yuxin Deng et.al. 2307.05180 link
2023-07-11 Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification Yi Liao et.al. 2307.05017 null
2023-07-10 Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor San Jiang et.al. 2307.04520 null
2023-07-10 RaPlace: Place Recognition for Imaging Radar using Radon Transform and Mutable Threshold Hyesu Jang et.al. 2307.04321 link
2023-07-08 Calibration-Aware Margin Loss: Pushing the Accuracy-Calibration Consistency Pareto Frontier for Deep Metric Learning Qin Zhang et.al. 2307.04047 null
2023-07-04 Unsupervised Quality Prediction for Improved Single-Frame and Weighted Sequential Visual Place Recognition Helen Carson et.al. 2307.01464 null
2023-07-04 Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network Zizhuo Li et.al. 2307.01447 null
2023-07-03 Cross-modal Place Recognition in Image Databases using Event-based Sensors Xiang Ji et.al. 2307.01047 null
2023-06-30 DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place Recognition under Adverse Conditions Stephen Hausler et.al. 2306.17536 null
2023-06-30 Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization Stephen Hausler et.al. 2306.17529 null
2023-06-27 Dental CLAIRES: Contrastive LAnguage Image REtrieval Search for Dental Research Tanjida Kabir et.al. 2306.15651 null
2023-06-27 Mean Field Theory in Deep Metric Learning Takuya Furusawa et.al. 2306.15368 null
2023-06-26 Hierarchical Matching and Reasoning for Multi-Query Image Retrieval Zhong Ji et.al. 2306.14460 link
2023-06-25 Enhancing Dynamic Image Advertising with Vision-Language Pre-training Zhoufutu Wen et.al. 2306.14112 null
2023-06-23 Catching Image Retrieval Generalization Maksim Zhdanov et.al. 2306.13357 null
2023-06-22 Deep Metric Learning with Soft Orthogonal Proxies Farshad Saberi-Movahed et.al. 2306.13055 null
2023-06-22 What to Learn: Features, Image Transformations, or Both? Yuxuan Chen et.al. 2306.13040 null
2023-06-22 Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval Katrin Glinka et.al. 2306.12843 null
2023-06-26 Annotation Cost Efficient Active Learning for Content Based Image Retrieval Julia Henkel et.al. 2306.11605 null
2023-06-19 Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning Shivaen Ramshetty et.al. 2306.11065 link
2023-06-18 LiDAR-Based Place Recognition For Autonomous Driving: A Survey Pengcheng Shi et.al. 2306.10561 link
2023-06-15 Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization Dror Aiger et.al. 2306.09012 link
2023-06-15 Prompt Performance Prediction for Generative IR Nicolas Bizzozzero et.al. 2306.08915 null
2023-06-15 Graph Convolution Based Efficient Re-Ranking for Visual Retrieval Yuqi Zhang et.al. 2306.08792 link
2023-06-13 GeneCIS: A Benchmark for General Conditional Image Similarity Sagar Vaze et.al. 2306.07969 null
2023-06-13 MOFI: Learning Image Representations from Noisy Entity Annotated Images Wentao Wu et.al. 2306.07952 link
2023-06-12 Zero-shot Composed Text-Image Retrieval Yikun Liu et.al. 2306.07272 link
2023-06-12 Sticker820K: Empowering Interactive Retrieval with Stickers Sijie Zhao et.al. 2306.06870 null
2023-06-11 Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models Yuguang Yang et.al. 2306.06691 null
2023-06-03 Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval Xu Zhang et.al. 2306.02092 null
2023-06-03 Class Anchor Margin Loss for Content-Based Image Retrieval Alexandru Ghita et.al. 2306.00630 null
2023-05-31 Chatting Makes Perfect – Chat-based Image Retrieval Matan Levy et.al. 2305.20062 link
2023-05-31 Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization Junan Chen et.al. 2305.20044 null
2023-05-30 A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation Omar Seddati et.al. 2305.18988 null
2023-05-29 Synfeal: A Data-Driven Simulator for End-to-End Camera Localization Daniel Coelho et.al. 2305.18260 link
2023-05-29 Nanoscale visualization of the thermally-driven evolution of antiferromagnetic domains in FeTe thin films Shrinkhala Sharma et.al. 2305.18197 null
2023-05-29 TReR: A Lightweight Transformer Re-Ranking Approach for 3D LiDAR Place Recognition Tiago Barros et.al. 2305.18013 null
2023-05-28 ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval Jiapeng Wang et.al. 2305.17652 null
2023-06-01 FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing Zhuang Li et.al. 2305.17497 link
2023-05-27 Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation Yueh-Cheng Huang et.al. 2305.17463 null
2023-05-26 Generating Images with Multimodal Language Models Jing Yu Koh et.al. 2305.17216 link
2023-05-25 Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder Zheyuan Liu et.al. 2305.16304 link
2023-05-23 Leveraging BEV Representation for 360-degree Visual Place Recognition Xuecheng Xu et.al. 2305.13814 link
2023-05-23 EDIS: Entity-Driven Image Search over Multimodal Web Content Siqi Liu et.al. 2305.13631 link
2023-05-20 DAC: Detector-Agnostic Spatial Covariances for Deep Local Features Javier Tirado-Garín et.al. 2305.12250 link
2023-05-19 Towards More Transparent and Accurate Cancer Diagnosis with an Unsupervised CAE Approach Zahra Tabatabaei et.al. 2305.11728 null
2023-05-19 Learning Sequence Descriptor based on Spatiotemporal Attention for Visual Place Recognition Fenglin Zhang et.al. 2305.11467 link
2023-05-12 IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images Varuna Krishna et.al. 2305.10438 link
2023-05-17 Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval Haokun Wen et.al. 2305.09979 null
2023-05-13 Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance Xinyu Lin et.al. 2305.07943 link
2023-05-11 Foundations of Spatial Perception for Robotics: Hierarchical Representations and Real-time Systems Nathan Hughes et.al. 2305.07154 link
2023-05-09 Visual Place Recognition with Low-Resolution Images Mihnea-Alexandru Tomita et.al. 2305.05776 null
2023-05-09 Vision-Language Models in Remote Sensing: Current Progress and Future Trends Congcong Wen et.al. 2305.05726 null
2023-05-09 An Evaluation and Ranking of Different Voting Schemes for Improved Visual Place Recognition Maria Waheed et.al. 2305.05705 null
2023-05-09 Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query Ho Hin Lee et.al. 2305.05598 null
2023-05-09 ColonMapper: topological mapping and localization for colonoscopy Javier Morlana et.al. 2305.05546 null
2023-05-09 Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization Clémentin Boittiaux et.al. 2305.05301 link
2023-05-09 Patch-DrosoNet: Classifying Image Partitions With Fly-Inspired Models For Lightweight Visual Place Recognition Bruno Arcanjo et.al. 2305.05256 null
2023-05-09 Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval Shiyin Dong et.al. 2305.05144 null
2023-05-08 Hierarchical Visual Localization Based on Sparse Feature Pyramid for Adaptive Reduction of Keypoint Map Size Andrei Potapov et.al. 2305.04856 null
2023-05-08 Privacy-Preserving Representations are not Enough – Recovering Scene Content from Camera Poses Kunal Chelani et.al. 2305.04603 link
2023-05-06 Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer Minyi Zhao et.al. 2305.04072 null
2023-05-06 Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing Swagatika Dash et.al. 2305.03881 link
2023-05-05 COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? Arijit Ray et.al. 2305.03689 link
2023-05-05 HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer Shuzhe Wang et.al. 2305.03595 null
2023-05-05 WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval Zahra Tabatabaei et.al. 2305.03383 null
2023-05-04 Boundary-aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval Tan Pan et.al. 2305.02610 link
2023-05-03 Learning-based Relational Object Matching Across Views Cathrin Elich et.al. 2305.02398 null
2023-05-05 A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text Yunxin Li et.al. 2305.02265 link
2023-05-03 AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation Shentong Mo et.al. 2305.01836 null
2023-04-30 Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection Jie Ren et.al. 2305.00435 null
2023-04-28 SFD2: Semantic-guided Feature Detection and Description Fei Xue et.al. 2304.14845 link
2023-04-28 Quantum enhanced non-interferometric quantitative phase imaging Giuseppe Ortolano et.al. 2304.14727 null
2023-04-26 Hydra-Multi: Collaborative Online Construction of 3D Scene Graphs with Multi-Robot Teams Yun Chang et.al. 2304.13487 null
2023-04-27 STIR: Siamese Transformer for Image Retrieval Postprocessing Aleksei Shabanov et.al. 2304.13393 null
2023-04-25 DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design Jiahao Weng et.al. 2304.12506 null
2023-04-24 Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning Lucas Pascotti Valem et.al. 2304.12448 link
2023-04-23 IDLL: Inverse Depth Line based Visual Localization in Challenging Environments Wanting Li et.al. 2304.11748 null
2023-04-23 Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval Mehdi Rafiei et.al. 2304.11734 null
2023-04-17 Features-over-the-Air: Contrastive Learning Enabled Cooperative Edge Inference Haotian Wu et.al. 2304.08221 null
2023-04-17 NeRF-Loc: Visual Localization with Conditional Neural Radiance Field Jianlin Liu et.al. 2304.07979 link
2023-04-16 Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification Luca Piano et.al. 2304.07883 null
2023-04-16 Language Guided Local Infiltration for Interactive Image Retrieval Fuxiang Huang et.al. 2304.07747 null
2023-04-16 Long-term Visual Localization with Mobile Sensors Shen Yan et.al. 2304.07691 null
2023-04-16 Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging Jielin Qiu et.al. 2304.07675 null
2023-04-14 CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression Mubariz Zaffar et.al. 2304.07426 null
2023-04-14 FM-Loc: Using Foundation Models for Improved Vision-based Localization Reihaneh Mirjalili et.al. 2304.07058 null
2023-04-17 Toward Real-Time Image Annotation Using Marginalized Coupled Dictionary Learning Seyed Mahdi Roostaiyan et.al. 2304.06907 link
2023-04-17 You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization problem and dataset Matteo Toso et.al. 2304.06373 link
2023-04-12 Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation Yifeng Shi et.al. 2304.06051 link
2023-04-12 Visual Localization using Imperfect 3D Models from the Internet Vojtech Panek et.al. 2304.05947 link
2023-04-12 Are Local Features All You Need for Cross-Domain Visual Place Recognition? Giovanni Barbarani et.al. 2304.05887 link
2023-04-12 Unicom: Universal and Compact Representation Learning for Image Retrieval Xiang An et.al. 2304.05884 link
2023-04-12 SGL: Structure Guidance Learning for Camera Localization Xudong Zhang et.al. 2304.05571 null
2023-04-14 Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency Xingwu Ji et.al. 2304.05146 link
2023-04-10 CAVL: Learning Contrastive and Adaptive Representations of Vision and Language Shentong Mo et.al. 2304.04399 null
2023-04-09 Unsupervised Multi-Criteria Adversarial Detection in Deep Image Retrieval Yanru Xiao et.al. 2304.04228 null
2023-04-08 SGIDN-LCD: An Appearance-based Loop Closure Detection Algorithm using Superpixel Grids and Incremental Dynamic Nodes Baosheng Zhang et.al. 2304.03872 null
2023-04-06 $R^{2}$Former: Unified $R$etrieval and $R$eranking Transformer for Place Recognition Sijie Zhu et.al. 2304.03410 null
2023-04-06 Distributed formation-enforcing control for UAVs robust to observation noise in relative pose measurements Viktor Walter et.al. 2304.03057 link
2023-04-05 Efficient OCR for Building a Diverse Digital History Jacob Carlson et.al. 2304.02737 link
2023-04-05 LogoNet: a fine-grained network for instance-level logo sketch retrieval Binbin Feng et.al. 2304.02214 link
2023-04-04 OrienterNet: Visual Localization in 2D Public Maps with Neural Matching Paul-Edouard Sarlin et.al. 2304.02009 link
2023-04-04 Cross-Domain Image Captioning with Discriminative Finetuning Roberto Dessì et.al. 2304.01662 link
2023-04-02 Learning Similarity between Scene Graphs and Images with Transformers Yuren Cong et.al. 2304.00590 link
2023-04-01 NPR: Nocturnal Place Recognition in Street Bingxi Liu et.al. 2304.00276 null
2023-03-31 Unsupervised crack detection on complex stone masonry surfaces Panagiotis Agrafiotis et.al. 2303.17989 null
2023-03-30 If At First You Don’t Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval Finlay G. C. Hudson et.al. 2303.17703 null
2023-03-30 Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime Rhydian Windsor et.al. 2303.17644 null
2023-03-30 3D Line Mapping Revisited Shaohui Liu et.al. 2303.17504 link
2023-03-30 Methods and advancement of content-based fashion image retrieval: A Review Amin Muhammad Shoib et.al. 2303.17371 null
2023-03-30 Adaptive Cross Batch Normalization for Metric Learning Thalaiyasingam Ajanthan et.al. 2303.17127 null
2023-03-30 MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks Weicheng Kuo et.al. 2303.16839 null
2023-03-29 Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval Leo Sampaio Ferraz Ribeiro et.al. 2303.16769 null
2023-03-29 Bi-directional Training for Composed Image Retrieval via Text Prompt Learning Zheyuan Liu et.al. 2303.16604 link
2023-03-27 Model Cascades for Efficient Image Search Robert Hönig et.al. 2303.15595 null
2023-03-27 Zero-Shot Composed Image Retrieval with Textual Inversion Alberto Baldrati et.al. 2303.15247 link
2023-03-27 What Can Human Sketches Do for Object Detection? Pinaki Nath Chowdhury et.al. 2303.15149 null
2023-03-25 Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style Fengyin Lin et.al. 2303.14348 link
2023-03-24 A-MuSIC: An Adaptive Ensemble System For Visual Place Recognition In Changing Environments Bruno Arcanjo et.al. 2303.14247 null
2023-03-24 PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View Ze Shi et.al. 2303.14095 link
2023-03-24 Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR Aneeshan Sain et.al. 2303.13779 null
2023-03-28 CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not Aneeshan Sain et.al. 2303.13440 null
2023-03-22 Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval Xunguang Wang et.al. 2303.12658 null
2023-03-21 CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion Geonmo Gu et.al. 2303.11916 link
2023-03-21 LIMITR: Leveraging Local Information for Medical Image-Text Representation Gefen Dawidowicz et.al. 2303.11755 null
2023-03-25 Data-efficient Large Scale Place Recognition with Graded Similarity Supervision Maria Leyva-Vallina et.al. 2303.11739 link
2023-03-20 Picture that Sketch: Photorealistic Image Generation from Abstract Sketches Subhadeep Koley et.al. 2303.11162 null
2023-03-19 Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths Ming Xu et.al. 2303.10778 link
2023-03-17 MRIS: A Multi-modal Retrieval Approach for Image Synthesis on Diverse Modalities Boqi Chen et.al. 2303.10249 null
2023-03-17 IRGen: Generative Modeling for Image Retrieval Yidan Zhang et.al. 2303.10126 link
2023-03-16 Data Roaming and Early Fusion for Composed Image Retrieval Matan Levy et.al. 2303.09429 link
2023-03-16 Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval Yi Xie et.al. 2303.09230 null
2023-03-16 Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space Yuhang He et.al. 2303.09192 null
2023-03-16 Unsupervised Facial Expression Representation Learning with Contrastive Local Warping Fanglei Xue et.al. 2303.09034 null
2023-03-15 A Triplet-loss Dilated Residual Network for High-Resolution Representation Learning in Image Retrieval Saeideh Yousefzadeh et.al. 2303.08398 null
2023-03-14 Data-Free Sketch-Based Image Retrieval Abhra Chaudhuri et.al. 2303.07775 link
2023-03-14 PATS: Patch Area Transportation with Subdivision for Local Feature Matching Junjie Ni et.al. 2303.07700 null
2023-03-10 Robotic Applications of Pre-Trained Vision-Language Models to Various Recognition Behaviors Kento Kawaharazuka et.al. 2303.05674 null
2023-03-09 Dominating Set Database Selection for Visual Place Recognition Anastasiia Kornilova et.al. 2303.05123 null
2023-03-07 Graph Neural Networks in Vision-Language Image Understanding: A Survey Henry Senior et.al. 2303.03761 null
2023-03-07 Sketch-based Medical Image Retrieval Kazuma Kobayashi et.al. 2303.03633 link
2023-03-06 Visual Place Recognition: A Tutorial Stefan Schubert et.al. 2303.03281 link
2023-03-06 MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval Rohit Agarwal et.al. 2303.03050 link
2023-03-06 Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints Chenjie Cao et.al. 2303.02885 link
2023-03-05 Composing Mood Board with User Feedback in Concept Space Shin Sano et.al. 2303.02547 null
2023-03-04 FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks Xiao Han et.al. 2303.02483 link
2023-03-09 Self-Supervised Learning for Place Representation Generalization across Appearance Changes Mohamed Adel Musallam et.al. 2303.02370 null
2023-03-03 MixVPR: Feature Mixing for Visual Place Recognition Amar Ali-bey et.al. 2303.02190 link
2023-03-01 A Complementarity-Based Switch-Fuse System for Improved Visual Place Recognition Maria Waheed et.al. 2303.00714 null
2023-03-01 ORCHNet: A Robust Global Feature Aggregation approach for 3D LiDAR-based Place recognition in Orchards T. Barros et.al. 2303.00477 link
2023-03-03 Renderable Neural Radiance Map for Visual Navigation Obin Kwon et.al. 2303.00304 null
2023-03-01 Region Prediction for Efficient Robot Localization on Large Maps Matteo Scucchia et.al. 2303.00295 link
2023-02-28 OEKG: The Open Event Knowledge Graph Simon Gottschalk et.al. 2302.14688 null
2023-02-28 Global Proxy-based Hard Mining for Visual Place Recognition Amar Ali-bey et.al. 2302.14217 link
2023-02-27 Efficient Informed Proposals for Discrete Distributions via Newton’s Series Approximation Yue Xiang et.al. 2302.13929 link
2023-02-26 Data-Efficient Sequence-Based Visual Place Recognition with Highly Compressed JPEG Images Mihnea-Alexandru Tomita et.al. 2302.13314 null
2023-02-26 Learning cross space mapping via DNN using large scale click-through logs Wei Yu et.al. 2302.13275 null
2023-02-25 DeepBrainPrint: A Novel Contrastive Framework for Brain MRI Re-Identification Lemuel Puglisi et.al. 2302.13057 null
2023-02-23 Teaching CLIP to Count to Ten Roni Paiss et.al. 2302.12066 null
2023-02-22 Steerable Equivariant Representation Learning Sangnie Bhardwaj et.al. 2302.11349 null
2023-02-21 iQPP: A Benchmark for Image Query Performance Prediction Eduard Poesina et.al. 2302.10126 link
2023-02-20 Ontology-aware Network for Zero-shot Sketch-based Image Retrieval Haoxiang Zhang et.al. 2302.10040 null
2023-02-20 TBPos: Dataset for Large-Scale Precision Visual Localization Masud Fahim et.al. 2302.09825 link
2023-02-17 Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts Zhihong Chen et.al. 2302.08958 link
2023-02-22 Fashion Image Retrieval with Multi-Granular Alignment Jinkuan Zhu et.al. 2302.08902 null
2023-02-15 Unsupervised Hashing via Similarity Distribution Calibration Kam Woh Ng et.al. 2302.07669 link
2023-02-13 Render-and-Compare: Cross-View 6 DoF Localization from Noisy Prior Shen Yan et.al. 2302.06287 link
2023-02-13 Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation Binqian Jiang et.al. 2302.06149 link
2023-02-13 Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval Xu Wang et.al. 2302.06081 link
2023-02-11 Sketch Less Face Image Retrieval: A New Challenge Dawei Dai et.al. 2302.05576 link
2023-02-10 Is multi-modal vision supervision beneficial to language? Avinash Madasu et.al. 2302.05016 link
2023-02-06 Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval Kuniaki Saito et.al. 2302.03084 link
2023-02-06 Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs Michael Kirchhof et.al. 2302.02865 link
2023-02-03 Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization Yingying Zhu et.al. 2302.01572 link
2023-02-04 Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval Frederik Warburg et.al. 2302.01332 link
2023-01-31 Grounding Language Models to Images for Multimodal Generation Jing Yu Koh et.al. 2301.13823 link
2023-01-31 UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers Dachuan Shi et.al. 2301.13741 link
2023-01-23 Lexi: Self-Supervised Learning of the UI Language Pratyay Banerjee et.al. 2301.10165 link
2023-01-17 Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval Yuchen Wu et.al. 2301.06685 null
2023-01-19 High-bandwidth Close-Range Information Transport through Light Pipes Joowon Lim et.al. 2301.06496 null
2023-01-13 A LiDAR-Inertial-Visual SLAM System with Loop Detection Kangcheng Liu et.al. 2301.05604 null
2023-01-12 GH-Feat: Learning Versatile Generative Hierarchical Features from GANs Yinghao Xu et.al. 2301.05315 null
2023-01-10 Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images Xindi Wu et.al. 2301.04224 null
2023-01-10 Collaborative Semantic Communication at the Edge Wing Fei Lo et.al. 2301.03996 null
2023-01-10 Online Backfilling with No Regret for Large-Scale Image Retrieval Seonguk Seo et.al. 2301.03767 null
2023-01-06 CyberLoc: Towards Accurate Long-term Visual Localization Liu Liu et.al. 2301.02403 null
2023-01-05 A Probabilistic Framework for Visual Localization in Ambiguous Scenes Fereidoon Zangeneh et.al. 2301.02086 link
2022-12-31 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions Patrick Wenzel et.al. 2301.01147 null
2022-12-30 HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images Dmitry Yudin et.al. 2212.14649 link
2022-12-27 Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning Wooyoung Kang et.al. 2212.13563 link
2022-12-23 SuperGF: Unifying Local and Global Features for Visual Localization Wenzheng Song et.al. 2212.13105 null
2022-12-24 GraffMatch: Global Matching of 3D Lines and Planes for Wide Baseline LiDAR Registration Parker C. Lusk et.al. 2212.12745 null
2022-12-19 From a Bird’s Eye View to See: Joint Camera and Subject Registration without the Camera Calibration Zekun Qian et.al. 2212.09298 link
2022-12-14 The Infinite Index: Information Retrieval on Generative Text-To-Image Models Niklas Deckers et.al. 2212.07476 null
2022-12-14 Shared Coupling-bridge for Weakly Supervised Local Feature Learning Jiayuan Sun et.al. 2212.07047 link
2022-12-08 Group Generalized Mean Pooling for Vision Transformer Byungsoo Ko et.al. 2212.04114 null
2022-12-12 Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models Gowthami Somepalli et.al. 2212.03860 null
2022-12-07 LSVL: Large-scale season-invariant visual localization for UAVs Jouko Kinnari et.al. 2212.03581 null
2022-12-06 ADIR: Adaptive Diffusion for Image Reconstruction Shady Abu-Hussein et.al. 2212.03221 null
2022-12-08 Privacy-Preserving Visual Localization with Event Cameras Junho Kim et.al. 2212.03177 link
2022-12-06 Semantic Communication for Internet of Vehicles: A Multi-User Cooperative Approach Wenjun Xu et.al. 2212.03037 null
2022-12-06 Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds Zhipeng Zhao et.al. 2212.02757 null
2022-12-04 Fast and Lightweight Scene Regressor for Camera Relocalization Thuan B. Bui et.al. 2212.01830 link
2022-12-02 Information Retrieval from the Digitized Books Riya Gupta et.al. 2212.00999 null
2022-12-09 StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition Yanqing Shen et.al. 2212.00937 null
2022-11-30 Self-Supervised Feature Learning for Long-Term Metric Visual Localization Yuxuan Chen et.al. 2212.00122 null
2022-11-30 SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation Tianyu Zhang et.al. 2211.16697 link
2022-11-28 SLAN: Self-Locator Aided Network for Cross-Modal Understanding Jiang-Tian Zhai et.al. 2211.16208 null
2022-11-29 RankDNN: Learning to Rank for Few-shot Learning Qianyu Guo et.al. 2211.15320 link
2022-11-28 Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map Xi Zheng et.al. 2211.15127 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069 link
2022-11-27 BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images Zhihuang Zhang et.al. 2211.14927 null
2022-11-27 A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition Rui Huang et.al. 2211.14864 null
2022-11-26 Visual Place Recognition Bailu Guo et.al. 2211.14533 null
2022-11-26 Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval Fan Yang et.al. 2211.14515 link
2022-11-30 Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark Floriana Ciaglia et.al. 2211.13523 link
2022-11-23 InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images Konstantin Kobs et.al. 2211.12760 link
2022-11-29 Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments Joshua Knights et.al. 2211.12732 link
2022-11-23 FE-Fusion-VPR: Attention-based Multi-Scale Network Architecture for Visual Place Recognition by Fusing Frames and Events Kuanxu Hou et.al. 2211.12244 null
2022-11-22 Multimorbidity Content-Based Medical Image Retrieval Using Proxies Yunyan Xing et.al. 2211.12185 null
2022-11-22 Vision-based localization methods under GPS-denied conditions Zihao Lu et.al. 2211.11988 null
2022-11-21 ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields Mohammad Mahdi Johari et.al. 2211.11704 null
2022-11-21 LISA: Localized Image Stylization with Audio via Implicit Neural Representation Seung Hyun Lee et.al. 2211.11381 null
2022-11-21 NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization Shitao Tang et.al. 2211.11177 link
2022-11-16 Improving Feature-based Visual Localization by Geometry-Aided Matching Hailin Yu et.al. 2211.08712 link
2022-11-15 LiePoseNet: Heterogeneous Loss Function Based on Lie Group for Significant Speed-up of PoseNet Training Process Mikhail Kurenkov et.al. 2211.08480 null
2022-11-14 Degeneracy removal of spin bands in antiferromagnets with non-interconvertible spin motif pair Lin-Ding Yuan et.al. 2211.07803 null
2022-11-14 Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition Farid Alijani et.al. 2211.07696 null
2022-11-14 Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization Yiyang Chen et.al. 2211.07394 link
2022-11-14 Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment Junyang Wang et.al. 2211.07275 null
2022-11-14 ContextCLIP: Contextual Alignment of Image-Text pairs on CLIP visual representations Chanda Grover et.al. 2211.07122 null
2022-11-14 Few-shot Metric Learning: Online Adaptation of Embedding for Retrieval Deunsol Jung et.al. 2211.07116 null
2022-11-12 Partial Visual-Semantic Embedding: Fashion Intelligence System with Sensitive Part-by-Part Learning Ryotaro Shimizu et.al. 2211.06688 null
2022-11-09 Visual Named Entity Linking: A New Dataset and A Baseline Wenxiang Sun et.al. 2211.04872 link
2022-11-07 Ultrafast Image Retrieval from a Holographic Memory Disc for High-Speed Operation of a Shift, Scale, and Rotation Invariant Target Recognition System Julian Gamboa et.al. 2211.03881 null
2022-11-06 A Geometrically Constrained Point Matching based on View-invariant Cross-ratios, and Homography Yueh-Cheng Huang et.al. 2211.03007 null
2022-11-02 Optimizing Fiducial Marker Placement for Improved Visual Localization Qiangqiang Huang et.al. 2211.01513 link
2022-11-02 A comparison of uncertainty estimation approaches for DNN-based camera localization Matteo Vaghi et.al. 2211.01234 null
2022-11-02 M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval Layne Berry et.al. 2211.01180 null
2022-11-11 Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality Anuj Diwan et.al. 2211.00768 link
2022-11-07 Fashion-Specific Attributes Interpretation via Dual Gaussian Visual-Semantic Embedding Ryotaro Shimizu et.al. 2210.17417 null
2022-10-27 Structuring User-Generated Content on Social Media with Multimodal Aspect-Based Sentiment Analysis Miriam Anschütz et.al. 2210.15377 link
2022-10-27 Leveraging Computer Vision Application in Visual Arts: A Case Study on the Use of Residual Neural Network to Classify and Analyze Baroque Paintings Daniel Kvak et.al. 2210.15300 null
2022-10-27 Towards Practicality of Sketch-Based Visual Understanding Ayan Kumar Bhunia et.al. 2210.15146 null
2022-10-27 MMFL-Net: Multi-scale and Multi-granularity Feature Learning for Cross-domain Fashion Retrieval Chen Bao et.al. 2210.15128 null
2022-10-26 FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning Suvir Mirchandani et.al. 2210.15028 null
2022-10-26 FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization Junyang Wang et.al. 2210.14562 null
2022-11-02 A Framework for Collaborative Multi-Robot Mapping using Spectral Graph Wavelets Lukas Bernreiter et.al. 2210.13856 null
2022-10-27 Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision Tzu-Jui Julius Wang et.al. 2210.13591 null
2022-10-24 Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval Zhaopeng Dou et.al. 2210.13440 link
2022-10-23 Neural Eigenfunctions Are Structured Representation Learners Zhijie Deng et.al. 2210.12637 link
2022-10-21 Boosting vision transformers for image retrieval Chull Hwan Song et.al. 2210.11909 link
2022-10-20 Communication breakdown: On the low mutual intelligibility between human and neural captioning Roberto Dessì et.al. 2210.11512 link
2022-10-19 Image Semantic Relation Generation Mingzhe Du et.al. 2210.11253 null
2022-10-20 General Image Descriptors for Open World Image Retrieval using ViT CLIP Marcos V. Conde et.al. 2210.11141 link
2022-10-20 DeepRING: Learning Roto-translation Invariant Representation for LiDAR based Place Recognition Sha Lu et.al. 2210.11029 null
2022-10-19 Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval Abhra Chaudhuri et.al. 2210.10486 link
2022-10-19 GSV-Cities: Toward Appropriate Supervised Visual Place Recognition Amar Ali-bey et.al. 2210.10239 link
2022-10-18 A Real-Time Fusion Framework for Long-term Visual Localization Yuchen Yang et.al. 2210.09757 null
2022-10-17 Bridging the Gap between Local Semantic Concepts and Bag of Visual Words for Natural Scene Image Retrieval Yousef Alqasrawi et.al. 2210.08875 null
2022-10-17 SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation Woo Suk Choi et.al. 2210.08675 null
2022-10-16 Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers Tao Tang et.al. 2210.08458 link
2022-10-14 Cross-Scale Context Extracted Hashing for Fine-Grained Image Binary Encoding Xuetong Xue et.al. 2210.07572 link
2022-10-14 Boosting Performance of a Baseline Visual Place Recognition Technique by Predicting the Maximally Complementary Technique Connor Malone et.al. 2210.07509 null
2022-10-11 Large-to-small Image Resolution Asymmetry in Deep Metric Learning Pavel Suma et.al. 2210.05463 link
2022-10-09 Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning Ali Safa et.al. 2210.04236 null
2022-10-05 Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features Deepak Gupta et.al. 2210.02401 link
2022-10-05 Granularity-aware Adaptation for Image Retrieval over Multiple Tasks Jon Almazán et.al. 2210.02254 null
2022-10-05 Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective Zijian Zhang et.al. 2210.02206 link
2022-10-04 Supervised Metric Learning for Retrieval via Contextual Similarity Optimization Christopher Liao et.al. 2210.01908 link
2022-10-04 Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing Weiying Wang et.al. 2210.01320 null
2022-10-03 Merging Classification Predictions with Sequential Information for Lightweight Visual Place Recognition in Changing Environments Bruno Arcanjo et.al. 2210.00834 null
2022-10-02 Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval Kei Nishimaki et.al. 2210.00506 null
2022-09-29 Guided Unsupervised Learning by Subaperture Decomposition for Ocean SAR Image Retrieval Nicolae-Cătălin Ristea et.al. 2209.15034 null
2022-09-28 TVLT: Textless Vision-Language Transformer Zineng Tang et.al. 2209.14156 link
2022-09-28 SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval Yang Shen et.al. 2209.13833 link
2022-09-28 Learning Deep Representations via Contrastive Learning for Instance Retrieval Tao Wu et.al. 2209.13832 null
2022-09-28 Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text Cheng-An Hsieh et.al. 2209.13764 link
2022-09-27 Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors Hao Dong et.al. 2209.13586 link
2022-09-27 Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability Peisong Wen et.al. 2209.13262 link
2022-09-26 NDD: A 3D Point Cloud Descriptor Based on Normal Distribution for Loop Closure Detection Ruihao Zhou et.al. 2209.12513 link
2022-09-25 Personalized Saliency in Task-Oriented Semantic Communications: Image Transmission and Performance Analysis Jiawen Kang et.al. 2209.12274 link
2022-09-24 Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes Jonathan J. Y. Kim et.al. 2209.11894 null
2022-09-23 Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs Youya Xia et.al. 2209.11673 null
2022-09-23 Query-based Hard-Image Retrieval for Object Detection at Test Time Edward Ayers et.al. 2209.11559 link
2022-09-23 Unsupervised Hashing with Semantic Concept Mining Rong-Cheng Tu et.al. 2209.11475 link
2022-09-22 UNav: An Infrastructure-Independent Vision-Based Navigation System for People with Blindness and Low vision Anbang Yang et.al. 2209.11336 null
2022-09-21 Visual Localization and Mapping in Dynamic and Changing Environments João Carlos Virgolino Soares et.al. 2209.10710 null
2022-09-20 PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention José Arce et.al. 2209.09699 link
2022-09-19 Deep Metric Learning with Chance Constraints Yeti Z. Gurbuz et.al. 2209.09060 link
2022-09-18 HGI-SLAM: Loop Closure With Human and Geometric Importance Features Shuhul Mujoo et.al. 2209.08608 null
2022-09-18 Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM Jiarui Tan et.al. 2209.08578 link
2022-09-17 Data Efficient Visual Place Recognition Using Extremely JPEG-Compressed Images Mihnea-Alexandru Tomita et.al. 2209.08343 null
2022-09-15 Efficient Planar Pose Estimation via UWB Measurements Haodong Jiang et.al. 2209.06779 link
2022-09-14 Transformers and CNNs both Beat Humans on SBIR Omar Seddati et.al. 2209.06629 null
2022-09-14 Tac2Structure: Object Surface Reconstruction Only through Multi Times Touch J. Lu et.al. 2209.06545 link
2022-09-14 iSimLoc: Visual Global Localization for Previously Unseen Environments with Simulated Images Peng Yin et.al. 2209.06376 null
2022-09-09 General Place Recognition Survey: Towards the Real-world Autonomy Age Peng Yin et.al. 2209.04497 link
2022-09-09 Retinal Image Restoration and Vessel Segmentation using Modified Cycle-CBAM and CBAM-UNet Alnur Alimanov et.al. 2209.04234 link
2022-09-13 Segment Augmentation and Differentiable Ranking for Logo Retrieval Feyza Yavuz et.al. 2209.02482 null
2022-09-12 ScaleFace: Uncertainty-aware Deep Metric Learning Roman Kail et.al. 2209.01880 link
2022-09-04 CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud Evgeny Yudin et.al. 2209.01605 null
2022-08-31 EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing Qihua Feng et.al. 2208.14657 link
2022-08-25 A Deep Perceptual Measure for Lens and Camera Calibration Yannick Hold-Geoffroy et.al. 2208.12300 null
2022-08-25 A Privacy-Preserving and End-to-End-Based Encrypted Image Retrieval Scheme Zhixun Lu et.al. 2208.11876 null
2022-08-23 Satellite Image Search in AgoraEO Ahmet Kerem Aksoy et.al. 2208.10830 null
2022-08-20 Fuse and Attend: Generalized Embedding Learning for Art and Sketches Ujjal Kr Dutta et.al. 2208.09698 null
2022-08-19 Self-Supervised Visual Place Recognition by Mining Temporal and Feature Neighborhoods Chao Chen et.al. 2208.09315 link
2022-08-19 TTT-UCDR: Test-time Training for Universal Cross-Domain Retrieval Soumava Paul et.al. 2208.09198 link
2022-08-17 Visual Cross-View Metric Localization with Dense Uncertainty Estimates Zimin Xia et.al. 2208.08519 link
2022-08-17 Understanding Attention for Vision-and-Language Tasks Feiqi Cao et.al. 2208.08104 link
2022-08-14 Visual Localization via Few-Shot Scene Region Classification Siyan Dong et.al. 2208.06933 link
2022-08-14 HyP$^2$ Loss: Beyond Hypersphere Metric Space for Multi-label Image Retrieval Chengyin Xu et.al. 2208.06866 link
2022-08-13 Finding Point with Image: An End-to-End Benchmark for Vision-based UAV Localization Ming Dai et.al. 2208.06561 link
2022-08-16 Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion Augmentation Georgios Kouros et.al. 2208.06195 link
2022-08-12 Instance Image Retrieval by Learning Purely From Within the Dataset Zhongyan Zhang et.al. 2208.06119 null
2022-08-07 CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization Yujiao Shi et.al. 2208.03660 null
2022-08-05 A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch Patsorn Sangkloy et.al. 2208.03354 null
2022-08-05 ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding Bingning Wang et.al. 2208.03030 link
2022-08-04 Pattern Spotting and Image Retrieval in Historical Documents using Deep Hashing Caio da S. Dias et.al. 2208.02397 null
2022-07-27 On the robustness of self-supervised representations for multi-view object classification David Torpey et.al. 2208.00787 null
2022-07-26 Multimodal Neural Machine Translation with Search Engine Based Image Retrieval ZhenHao Tang et.al. 2208.00767 null
2022-07-30 Towards Privacy-Preserving, Real-Time and Lossless Feature Matching Qiang Meng et.al. 2208.00214 link
2022-07-30 DAS: Densely-Anchored Sampling for Deep Metric Learning Lizhao Liu et.al. 2208.00119 link
2022-07-29 Curriculum Learning for Data-Efficient Vision-Language Alignment Tejas Srinivasan et.al. 2207.14525 null
2022-07-29 Neural Density-Distance Fields Itsuki Ueda et.al. 2207.14455 link
2022-07-27 Abstracting Sketches through Simple Primitives Stephan Alaniz et.al. 2207.13543 link
2022-07-27 Satellite Image Based Cross-view Localization for Autonomous Vehicle Shan Wang et.al. 2207.13506 null
2022-07-26 RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments Jiahui Zhang et.al. 2207.12579 null
2022-07-25 A hybrid-qudit representation of digital RGB images Sreetama Das et.al. 2207.12550 null
2022-07-19 ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization Ivan Cisneros et.al. 2207.12317 link
2022-07-22 PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes BaoSheng Zhang et.al. 2207.10916 null
2022-07-25 MeshLoc: Mesh-Based Visual Localization Vojtech Panek et.al. 2207.10762 link
2022-07-20 Revisiting Hotels-50K and Hotel-ID Aarash Feizi et.al. 2207.10200 link
2022-07-20 Feature Representation Learning for Unsupervised Cross-domain Image Retrieval Conghui Hu et.al. 2207.09721 link
2022-07-19 SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany Dominik Koßmann et.al. 2207.09507 null
2022-07-19 Context Unaware Knowledge Distillation for Image Retrieval Bytasandram Yaswanth Reddy et.al. 2207.09070 link
2022-07-17 FashionViL: Fashion-Focused Vision-and-Language Representation Learning Xiao Han et.al. 2207.08150 link
2022-07-14 AutoMerge: A Framework for Map Assembling and Smoothing in City-scale Environments Peng Yin et.al. 2207.06965 null
2022-07-14 Semi-supervised Vector-Quantization in Visual SLAM using HGCN Amir Zarringhalam et.al. 2207.06738 null
2022-07-14 Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders Amir Zarringhalam et.al. 2207.06732 null
2022-07-19 Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras Fangwen Shu et.al. 2207.06058 link
2022-07-12 CPO: Change Robust Panorama to Point Cloud Localization Junho Kim et.al. 2207.05317 link
2022-07-05 Hierarchical Average Precision Training for Pertinent Image Retrieval Elias Ramzi et.al. 2207.04873 link
2022-07-11 A clinically motivated self-supervised approach for content-based image retrieval of CT liver images Kristoffer Knutsen Wickstrøm et.al. 2207.04812 link
2022-07-09 BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval Wenqiao Zhang et.al. 2207.04211 null
2022-07-08 Learning Sequential Descriptors for Sequence-based Visual Place Recognition Riccardo Mereu et.al. 2207.03868 link
2022-07-08 GEMS: Scene Expansion using Generative Models of Graphs Rishi Agarwal et.al. 2207.03729 null
2022-07-05 Object-Level Targeted Selection via Deep Template Matching Suraj Kothawade et.al. 2207.01778 null
2022-07-06 Adaptive Fine-Grained Sketch-Based Image Retrieval Ayan Kumar Bhunia et.al. 2207.01723 link
2022-07-04 Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets Paul Albert et.al. 2207.01573 link
2022-07-08 Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval Keyu Wen et.al. 2207.00733 null
2022-07-01 DALG: Deep Attentive Local and Global Modeling for Image Retrieval Yuxin Song et.al. 2207.00287 null
2022-07-04 BadHash: Invisible Backdoor Attacks against Deep Hashing with Clean Label Shengshan Hu et.al. 2207.00278 link
2022-06-28 Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems Stephen Hausler et.al. 2206.13883 null
2022-07-08 How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels Tobias Fischer et.al. 2206.13673 link
2022-06-25 FreSCo: Frequency-Domain Scan Context for LiDAR-based Place Recognition with Translation and Rotation Invariance Yongzhi Fan et.al. 2206.12628 link
2022-06-25 Inverted Semantic-Index for Image Retrieval Ying Wang et.al. 2206.12623 null
2022-06-17 RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval Yihan Wu et.al. 2206.11225 null
2022-06-22 ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas Prathmesh Madhu et.al. 2206.11115 null
2022-06-20 Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval Guile Wu et.al. 2206.09806 null
2022-06-18 Attention-based Dynamic Subspace Learners for Medical Image Analysis Sukesh Adiga V et.al. 2206.09068 null
2022-06-17 Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments Khairuldanial Ismail et.al. 2206.08733 null
2022-06-06 Learning Treatment Plan Representations for Content Based Image Retrieval Charles Huang et.al. 2206.02912 null
2022-06-19 NORPPA: NOvel Ringed seal re-identification by Pelage Pattern Aggregation Ekaterina Nepovinnykh et.al. 2206.02498 link
2022-06-05 Autoregressive Model for Multi-Pass SAR Change Detection Based on Image Stacks B. G. Palm et.al. 2206.02278 null
2022-05-28 FaIRCoP: Facial Image Retrieval using Contrastive Personalization Devansh Gupta et.al. 2205.15870 null
2022-05-31 Investigating the Role of Image Retrieval for Visual Localization – An exhaustive benchmark Martin Humenberger et.al. 2205.15761 link
2022-05-27 Improving Road Segmentation in Challenging Domains Using Similar Place Priors Connor Malone et.al. 2205.14112 null
2022-05-31 LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments Yun Chang et.al. 2205.13135 link
2022-05-26 Fine-grained Image Captioning with CLIP Reward Jaemin Cho et.al. 2205.13115 link
2022-05-25 Deep Dense Local Feature Matching and Vehicle Removal for Indoor Visual Localization Kyung Ho Park et.al. 2205.12544 null
2022-05-24 OnePose: One-Shot Object Pose Estimation without CAD Models Jiaming Sun et.al. 2205.12257 link
2022-05-23 VPAIR – Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments Michael Schleiss et.al. 2205.11567 link
2022-05-23 VQA-GNN: Reasoning with Multimodal Semantic Graph for Visual Question Answering Yanan Wang et.al. 2205.11501 null
2022-05-23 Deep Image Retrieval is not Robust to Label Noise Stanislav Dereka et.al. 2205.11195 null
2022-05-22 Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval Zelong Zeng et.al. 2205.10878 link
2022-05-20 Visually-Augmented Language Modeling Weizhi Wang et.al. 2205.10178 link
2022-05-18 Deep Features for CBIR with Scarce Data using Hebbian Learning Gabriele Lagani et.al. 2205.08935 null
2022-05-19 Text Detection & Recognition in the Wild for Robot Localization Zobeir Raisi et.al. 2205.08565 null
2022-05-12 One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code Yong Dai et.al. 2205.06126 null
2022-05-11 Review on Panoramic Imaging and Its Applications in Scene Understanding Shaohua Gao et.al. 2205.05570 null
2022-05-18 Identical Image Retrieval using Deep Learning Sayan Nath et.al. 2205.04883 link
2022-05-09 Introspective Deep Metric Learning Chengkun Wang et.al. 2205.04449 link
2022-05-11 Improved Evaluation and Generation of Grid Layouts using Distance Preservation Quality and Linear Assignment Sorting Kai Uwe Barthel et.al. 2205.04255 link
2022-05-08 Adversarial Learning of Hard Positives for Place Recognition Wenxuan Fang et.al. 2205.03871 null
2022-05-10 AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching Khanh Nguyen et.al. 2205.02849 link
2022-04-29 Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval Shupeng Su et.al. 2204.13919 null
2022-04-29 Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval Siyu Ren et.al. 2204.13913 link
2022-04-28 Spatio-Temporal Graph Localization Networks for Image-based Navigation Takahiro Niwa et.al. 2204.13237 null
2022-04-27 The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection Konstantinos A. Tsintotas et.al. 2204.12831 null
2022-04-25 SceneTrilogy: On Scene Sketches and its Relationship with Text and Photo Pinaki Nath Chowdhury et.al. 2204.11964 null
2022-04-23 On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning Muhammad Umer Anwaar et.al. 2204.11848 null
2022-04-24 Progressive Learning for Image Retrieval with Hybrid-Modality Queries Yida Zhao et.al. 2204.11212 null
2022-04-23 Training and challenging models for text-guided fashion image retrieval Eric Dodds et.al. 2204.11004 link
2022-04-18 Centralized Adversarial Learning for Robust Deep Hashing Xunguang Wang et.al. 2204.10779 link
2022-04-22 Transferring ConvNet Features from Passive to Active Robot Self-Localization: The Use of Ego-Centric and World-Centric Views Kanya Kurauchi et.al. 2204.10497 null
2022-04-21 Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval Zhiqiang Yuan et.al. 2204.09868 link
2022-04-21 Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information Zhiqiang Yuan et.al. 2204.09860 link
2022-04-20 Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations Leila Pishdad et.al. 2204.09268 null
2022-04-19 Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing Georgii Mikriukov et.al. 2204.08707 null
2022-04-18 Multiple-environment Self-adaptive Network for Aerial-view Geo-localization Tingyu Wang et.al. 2204.08381 link
2022-04-15 Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder Hanjing Ye et.al. 2204.07350 link
2022-04-14 Composite Code Sparse Autoencoders for first stage retrieval Carlos Lassance et.al. 2204.07023 null
2022-04-13 Reuse your features: unifying retrieval and feature-metric alignment Javier Morlana et.al. 2204.06292 link
2022-04-12 Probabilistic Compositional Embeddings for Multimodal Image Retrieval Andrei Neculai et.al. 2204.05845 link
2022-04-12 Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval Yu-Wei Zhan et.al. 2204.05666 null
2022-04-12 HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud Zhixing Hou et.al. 2204.05481 null
2022-04-11 Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context Lizhou Liao et.al. 2204.04932 link
2022-04-10 Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image Yujiao Shi et.al. 2204.04752 link
2022-04-08 A Generic Image Retrieval Method for Date Estimation of Historical Document Collections Adrià Molina et.al. 2204.04028 null
2022-04-08 SnapMode: An Intelligent and Distributed Large-Scale Fashion Image Retrieval Platform Based On Big Data and Deep Generative Adversarial Network Technologies Narges Norouzi et.al. 2204.03998 null
2022-04-05 Leveraging Equivariant Features for Absolute Pose Regression Mohamed Adel Musallam et.al. 2204.02163 null
2022-04-04 “This is my unicorn, Fluffy”: Personalizing frozen vision-language representations Niv Cohen et.al. 2204.01694 link
2022-04-01 Bi-directional Loop Closure for Visual SLAM Ihtisham Ali et.al. 2204.01524 null
2022-04-01 LASER: LAtent SpacE Rendering for 2D Visual Localization Zhixiang Min et.al. 2204.00157 link
2022-03-31 Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning Semih Orhan et.al. 2203.16945 null
2022-03-30 AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift Burak Yildiz et.al. 2203.16291 link
2022-03-29 Long-term Visual Map Sparsification with Heterogeneous GNN Ming-Fang Chang et.al. 2203.15182 null
2022-04-01 A Simulation Benchmark for Vision-based Autonomous Navigation Lauri Suomela et.al. 2203.13048 link
2022-03-24 Is Geometry Enough for Matching in Visual Localization? Qunjie Zhou et.al. 2203.12979 link
2022-03-21 MatchFormer: Interleaving Attention in Transformers for Feature Matching Qing Wang et.al. 2203.09645 link
2022-03-10 ReF – Rotation Equivariant Features for Local Feature Matching Abhishek Peri et.al. 2203.05206 null
2022-03-09 Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction Matthieu Zins et.al. 2203.04613 null
2022-03-08 Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM Pierre-Yves Lajoie et.al. 2203.04446 link
2022-03-07 ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization Simon Maurer et.al. 2203.03610 link
2022-03-07 Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms Qingqing Li et.al. 2203.03454 link
2022-03-01 SwitchHit: A Probabilistic, Complementarity-Based Switching System for Improved Visual Place Recognition in Changing Environments Maria Waheed et.al. 2203.00591 null
2022-02-28 Deep Camera Pose Regression Using Pseudo-LiDAR Ali Raza et.al. 2203.00080 null
2022-02-25 RELMOBNET: A Robust Two-Stage End-To-End Training Approach For MOBILENETV3 Based Relative Camera Pose Estimation Praveen Kumar Rajendran et.al. 2202.12838 null
2022-02-24 Highly-Efficient Binary Neural Networks for Visual Place Recognition Bruno Ferrarini et.al. 2202.12375 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146 link
2022-02-14 Tightly Coupled Learning Strategy for Weakly Supervised Hierarchical Place Recognition Y. Shen et.al. 2202.06470 null
2022-02-11 Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition Yingfeng Cai et.al. 2202.05738 null
2022-02-09 Object-Guided Day-Night Visual Localization in Urban Scenes Assia Benbihi et.al. 2202.04445 null
2022-02-08 A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition Nie Jiwei et.al. 2202.03677 null
2022-02-25 CFP-SLAM: A Real-time Visual SLAM Based on Coarse-to-Fine Probability in Dynamic Environments Xinggang Hu et.al. 2202.01938 null
2022-02-03 Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization Andrea Vallone et.al. 2202.01821 null
2022-02-02 Training Semantic Descriptors for Image-Based Localization Ibrahim Cinaroglu et.al. 2202.01212 null
2022-01-31 Hydra: A Real-time Spatial Perception Engine for 3D Scene Graph Construction and Optimization Nathan Hughes et.al. 2201.13360 null
2022-01-31 Rigidity Preserving Image Transformations and Equivariance in Perspective Lucas Brynte et.al. 2201.13065 null
2022-01-25 Learning Semantics for Visual Place Recognition through Multi-Scale Attention Valerio Paolicelli et.al. 2201.09701 link
2022-01-22 Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems Xi Zheng et.al. 2201.09048 link
2022-01-15 A Critical Analysis of Image-based Camera Pose Estimation Techniques Meng Xu et.al. 2201.05816 null
2022-01-14 SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions Ali Samadzadeh et.al. 2201.05386 link
2021-12-23 NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning Tony Ng et.al. 2112.12785 null
2021-12-16 CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data Qi Yan et.al. 2112.09081 link
2021-12-05 RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather Jialu Wang et.al. 2112.02469 null
2021-11-25 MegLoc: A Robust and Accurate Visual Localization Pipeline Shuxue Peng et.al. 2111.13063 null
2021-10-08 Semantic Image Alignment for Vehicle Localization Markus Herb et.al. 2110.04162 null
2021-10-05 Season-invariant GNSS-denied visual localization for UAVs Jouko Kinnari et.al. 2110.01967 link
2021-09-30 Forming a sparse representation for visual place recognition using a neurorobotic approach Sylvain Colomer et.al. 2109.14916 null
2021-09-22 Audio-Visual Grounding Referring Expression for Robotic Manipulation Yefei Wang et.al. 2109.10571 null
2021-09-20 Efficient shape mapping through dense touch and vision Sudharshan Suresh et.al. 2109.09884 link
2021-09-15 S3LAM: Structured Scene SLAM Mathieu Gonzalez et.al. 2109.07339 null
2021-09-13 Monocular Camera Localization for Automated Vehicles Using Image Retrieval Eunhyek Joa et.al. 2109.06296 null
2021-09-10 Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization Sungho Yoon et.al. 2109.04753 link
2021-09-09 CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization Ara Jafarzadeh et.al. 2109.04527 null
2021-09-09 Keeping an Eye on Things: Deep Learned Features for Long-Term Visual Localization Mona Gridseth et.al. 2109.04041 link

Keypoint Detection

Publish Date Title Authors PDF Code
2025-07-15 KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model Jie Yang et.al. 2507.11102 null
2025-07-15 GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft Weizhao Ma et.al. 2507.11077 null
2025-07-14 FPC-Net: Revisiting SuperPoint with Descriptor-Free Keypoint Detection via Feature Pyramids and Consistency-Based Implicit Matching Ionuţ Grigore et.al. 2507.10770 null
2025-07-11 Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection Subhajit Maity et.al. 2507.07994 null
2025-07-09 Reading a Ruler in the Wild Yimu Pan et.al. 2507.07077 null
2025-07-09 MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning Yifan Yang et.al. 2507.06662 null
2025-06-27 MatChA: Cross-Algorithm Matching with Feature Augmentation Paula Carbó Cubero et.al. 2506.22336 null
2025-06-27 SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images Naftaly Wambugu et.al. 2506.21945 null
2025-05-29 TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning Ron Shapira Weber et.al. 2505.23475 link
2025-05-24 Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU Yicheng Lin et.al. 2505.18652 link
2025-05-18 SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving Muleilan Pei et.al. 2505.12246 null
2025-05-17 Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation Niaz Ahmad et.al. 2505.12130 null
2025-05-16 Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation Massimiliano Cassia et.al. 2505.11110 null
2025-06-19 RDD: Robust Feature Detector and Descriptor using Deformable Transformer Gonglin Chen et.al. 2505.08013 null
2025-05-12 Enabling Privacy-Aware AI-Based Ergonomic Analysis Sander De Coninck et.al. 2505.07306 null
2025-05-09 My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing Jingrui He et.al. 2505.06436 null
2025-05-05 Unsupervised training of keypoint-agnostic descriptors for flexible retinal image registration David Rivas-Villar et.al. 2505.02787 null
2025-05-05 Unsupervised Deep Learning-based Keypoint Localization Estimating Descriptor Matching Performance David Rivas-Villar et.al. 2505.02779 null
2025-05-04 Focus What Matters: Matchability-Based Reweighting for Local Feature Matching Dongyue Li et.al. 2505.02161 null
2025-05-04 Enhancing Lidar Point Cloud Sampling via Colorization and Super-Resolution of Lidar Imagery Sier Ha et.al. 2505.02049 null
2025-04-29 Emotion Recognition in Contemporary Dance Performances Using Laban Movement Analysis Muhammad Turab et.al. 2504.21154 null
2025-04-29 Learning a General Model: Folding Clothing with Topological Dynamics Yiming Liu et.al. 2504.20720 null
2025-04-26 VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation Niaz Ahmad et.al. 2504.19032 null
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-15 UKDM: Underwater keypoint detection and matching using underwater image enhancement techniques Pedro Diaz-Garcia et.al. 2504.11063 null
2025-04-15 Acquisition of high-quality images for camera calibration in robotics applications via speech prompts Timm Linder et.al. 2504.11031 null
2025-04-11 Stereophotoclinometry Revisited Travis Driver et.al. 2504.08252 null
2025-03-31 SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection Yannick Burkhardt et.al. 2504.00139 null
2025-03-29 Deep Visual Servoing of an Aerial Robot Using Keypoint Feature Extraction Shayan Sepahvand et.al. 2503.23171 null
2025-03-25 Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines Junle Liu et.al. 2503.19278 null
2025-03-05 Periodontal Bone Loss Analysis via Keypoint Detection With Heuristic Post-Processing Ryan Banks et.al. 2503.13477 null
2025-03-16 Histogram Transporter: Learning Rotation-Equivariant Orientation Histograms for High-Precision Robotic Kitting Jiadong Zhou et.al. 2503.12541 null
2025-04-12 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-10 REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding Yan Tai et.al. 2503.07413 link
2025-03-11 DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection Johan Edstedt et.al. 2503.07347 link
2025-03-07 Automatic determination of quasicrystalline patterns from microscopy images Tano Kim Kender et.al. 2503.05472 link
2025-03-07 Spatial regularisation for improved accuracy and interpretability in keypoint-based registration Benjamin Billot et.al. 2503.04499 link
2025-03-04 A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection Junyi Wang et.al. 2503.02481 null
2025-03-01 Autonomous Dissection in Robotic Cholecystectomy Ki-Hwan Oh et.al. 2503.00666 null
2025-02-28 CNSv2: Probabilistic Correspondence Encoded Neural Image Servo Anzhe Chen et.al. 2503.00132 null
2025-02-27 Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets Jisoo Lee et.al. 2502.19766 null
2025-02-23 Rewards-based image analysis in microscopy Kamyar Barakati et.al. 2502.18522 null
2025-02-19 2.5D U-Net with Depth Reduction for 3D CryoET Object Identification Yusuke Uchida et.al. 2502.13484 link
2025-01-30 Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images Wei-Lun Chen et.al. 2501.18453 null
2025-01-30 Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models Bhargav Ghanekar et.al. 2501.18361 null
2025-01-30 Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems Liudi Yang et.al. 2501.18110 null
2025-01-21 Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction Mengyuan Li et.al. 2501.11844 null
2025-01-20 MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Yepeng Liu et.al. 2501.11299 null
2025-01-19 Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation Shibang Liu et.al. 2501.11069 null
2025-01-13 Empirical Comparison of Four Stereoscopic Depth Sensing Cameras for Robotics Applications Lukas Rustler et.al. 2501.07421 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221 link
2024-12-21 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2412.16755 null
2024-12-19 Corn Ear Detection and Orientation Estimation Using Deep Learning Nathan Sprague et.al. 2412.14954 null
2024-12-12 Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models Faith Johnson et.al. 2412.09739 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488 link
2024-12-09 ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models Bingchen Gong et.al. 2412.06292 null
2024-12-07 Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures Muhammad Umar Farooq et.al. 2412.05487 null
2024-12-04 Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything Yongkyu Lee et.al. 2412.03472 link
2024-12-02 MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection Yonghao Dang et.al. 2412.01422 null
2024-11-23 OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs Chen Xin et.al. 2411.15653 link
2024-11-19 IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose Fei Ren et.al. 2411.12676 null
2024-11-04 Silver medal Solution for Image Matching Challenge 2024 Yian Wang et.al. 2411.01851 null
2024-11-04 KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension Jie Yang et.al. 2411.01846 null
2024-10-31 From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots Vasileios Tzouras et.al. 2410.23906 null
2024-10-04 Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation Aman Anand et.al. 2410.14700 null
2024-11-27 Sim2real Cattle Joint Estimation in 3D point clouds Mohammad Okour et.al. 2410.14419 null
2024-10-16 PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network Asish Bera et.al. 2410.12742 null
2024-10-16 RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition Asish Bera et.al. 2410.12718 null
2024-10-01 A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference Yuan Li et.al. 2410.11848 null
2024-10-11 Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image Marta Veganzones Rodriguez et.al. 2410.09155 null
2024-10-08 Unsupervised Model Diagnosis Yinong Oliver Wang et.al. 2410.06243 null
2024-10-08 Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration Xueyang Kang et.al. 2410.05729 link
2024-10-16 Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features Chengkai Hou et.al. 2410.02237 null
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404 null
2024-09-30 OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection Changsheng Lu et.al. 2409.19899 link
2024-10-07 SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation Xin Li et.al. 2409.18082 null
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-20 Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators Niloufar Amiri et.al. 2409.13668 null
2024-09-25 Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding Rania Hossam et.al. 2409.08695 link
2024-09-06 D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection Kentaro Hirahara et.al. 2409.04060 null
2024-10-01 Towards Practical Human Motion Prediction with LiDAR Point Clouds Xiao Han et.al. 2408.08202 null
2024-07-31 Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods Xusheng Luo et.al. 2408.00117 null
2024-07-26 SHIC: Shape-Image Correspondences with no Keypoint Supervision Aleksandar Shtedritski et.al. 2407.18907 null
2024-07-25 LION: Linear Group RNN for 3D Object Detection in Point Clouds Zhe Liu et.al. 2407.18232 link
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730 null
2024-07-04 PFGS: High Fidelity Point Cloud Rendering via Feature Splatting Jiaxu Wang et.al. 2407.03857 link
2024-07-03 A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes Li Fang et.al. 2407.02830 link
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-06-28 Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics Chengrui Gao et.al. 2406.19672 null
2024-07-23 A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking Lorenzo Shaikewitz et.al. 2406.16837 link
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315 link
2024-06-23 W-Net: A Facial Feature-Guided Face Super-Resolution Network Hao Liu et.al. 2406.00676 null
2024-05-25 Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration Junjie Gao et.al. 2405.16085 null
2024-06-01 Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection – Towards Precise Fish Morphological Assessment in Aquaculture Breeding Weizhen Liu et.al. 2405.12476 link
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-15 Vector-Symbolic Architecture for Event-Based Optical Flow Hongzhi You et.al. 2405.08300 null
2024-05-13 RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration Congjia Chen et.al. 2405.07594 null
2024-05-08 Unsupervised Skin Feature Tracking with Deep Neural Networks Jose Chang et.al. 2405.04943 null
2024-05-07 A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images László Kopácsi et.al. 2405.04650 null
2024-04-30 A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images Wang Zhang et.al. 2404.19311 null
2024-04-25 Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach Tahmim Hossain et.al. 2404.14560 null
2024-04-19 SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers Vandad Davoodnia et.al. 2404.12625 null
2024-04-17 Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images Junbiao Pang et.al. 2404.10985 null
2024-03-28 Towards Long Term SLAM on Thermal Imagery Colin Keil et.al. 2403.19885 link
2024-03-28 Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation Xiao Lin et.al. 2403.19527 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259 null
2024-03-18 FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events Xiangyuan Wang et.al. 2403.11662 link
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217 null
2024-02-22 A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets Chengzhang Yu et.al. 2402.14241 null
2024-02-25 A Feature Matching Method Based on Multi-Level Refinement Strategy Shaojie Zhang et.al. 2402.13488 null
2024-03-05 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172 null
2024-02-25 Region Feature Descriptor Adapted to High Affine Transformations Shaojie Zhang et.al. 2402.09724 null
2024-01-29 Reconstructing Close Human Interactions from Multiple Views Qing Shuai et.al. 2401.16173 link
2024-01-17 To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection Luyi Han et.al. 2401.09336 link
2024-01-08 Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach Huanyu Liu et.al. 2401.03742 link
2024-03-22 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029 null
2023-12-27 Bezier-based Regression Feature Descriptor for Deformable Linear Objects Fangqing Chen et.al. 2312.16502 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-22 BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions Elias Marks et.al. 2312.14706 null
2023-12-19 Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation Jiaming Liu et.al. 2312.12480 null
2023-12-19 An effective image copy-move forgery detection using entropy image Zhaowei Lu et.al. 2312.11793 link
2023-12-11 VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data Jian Shi et.al. 2312.08871 link
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865 link
2023-12-01 Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) Emma Cramer et.al. 2312.00592 link
2023-11-30 Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications Sahar Almahfouz Nasser et.al. 2311.18281 null
2023-11-29 Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features Thomas Wimmer et.al. 2311.18113 link
2023-11-28 Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features Niladri Shekhar Dutt et.al. 2311.17024 link
2023-11-28 Riemannian Self-Attention Mechanism for SPD Networks Rui Wang et.al. 2311.16738 null
2023-11-27 A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor Jialin Liu et.al. 2311.15609 null
2023-11-21 Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers Bo Sun et.al. 2311.12291 null
2023-11-20 CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement Boni Hu et.al. 2311.11604 link
2023-11-17 Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration Paul J. Claasen et.al. 2311.10361 link
2023-11-13 Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning Tomáš Kunzo et.al. 2311.07398 null
2023-11-11 CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer Haoyu Ma et.al. 2311.06443 link
2023-11-08 3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud Jianchao Ci et.al. 2311.04699 null
2023-11-06 TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains Alexander Naumann et.al. 2311.03124 link
2023-11-06 An invariant feature extraction for multi-modal images matching Chenzhong Gao et.al. 2311.02842 null
2023-10-20 Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification Mateus Roder et.al. 2310.13490 null
2023-10-12 UniPose: Detecting Any Keypoints Jie Yang et.al. 2310.08530 link
2023-10-10 l-dyno: framework to learn consistent visual features using robot’s motion Kartikeya Singh et.al. 2310.06249 link
2023-10-10 Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face Hao Zhang et.al. 2310.05056 link
2023-10-13 H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation Yanjie Ze et.al. 2310.01404 link
2023-10-04 Self-supervised Learning of Contextualized Local Visual Embeddings Thalles Santos Silva et.al. 2310.00527 link
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436 link
2023-09-18 RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy Mert Asim Karaoglu et.al. 2309.09563 null
2023-09-17 CryoAlign: feature-based method for global and local 3D alignment of EM density maps Bintao He et.al. 2309.09217 null
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471 link
2023-09-09 Mirror-Aware Neural Humans Daniel Ajisafe et.al. 2309.04750 link
2023-09-07 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks Zigang Geng et.al. 2309.03895 null
2023-09-04 SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras Himanshu Pahadia et.al. 2309.01324 null
2023-09-12 Improving the matching of deformable objects by learning to detect keypoints Felipe Cadar et.al. 2309.00434 link
2023-08-31 SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation Jiaben Chen et.al. 2308.16876 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984 link
2023-08-29 A lightweight 3D dense facial landmark estimation model from position map data Shubhajit Basak et.al. 2308.15170 link
2023-08-27 Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors Francesco Pirotti et.al. 2308.14047 null
2023-08-24 VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition Gengxuan Tian et.al. 2308.12870 null
2023-08-22 LDP-Feat: Image Features with Local Differential Privacy Francesco Pittaluga et.al. 2308.11223 null
2023-08-20 Neural Interactive Keypoint Detection Jie Yang et.al. 2308.10174 link
2023-08-19 ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment Bingyang Zhou et.al. 2308.09987 null
2023-09-03 DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching Johan Edstedt et.al. 2308.08479 link
2023-08-15 CoDeF: Content Deformation Fields for Temporally Consistent Video Processing Hao Ouyang et.al. 2308.07926 link
2023-08-15 ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition Wenyuan Xue et.al. 2308.07743 null
2023-08-14 DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport Sk Aziz Ali et.al. 2308.07153 null
2023-08-14 2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds Minhao Li et.al. 2308.05667 link
2023-08-02 Automated Hit-frame Detection for Badminton Match Analysis Yu-Hang Chien et.al. 2307.16000 link
2023-07-25 Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception Chuanyu Luo et.al. 2307.13300 null
2023-07-21 Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data Sahar Almahfouz Nasser et.al. 2307.10698 link
2023-07-19 SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid Zi Li et.al. 2307.09727 link
2023-07-01 SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation Fabian Duffhauss et.al. 2307.00306 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669 link
2023-06-26 CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild Li Ding et.al. 2306.15073 null
2023-06-28 Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset Ziqiao Weng et.al. 2306.07089 link
2023-06-07 Learning Probabilistic Coordinate Fields for Robust Correspondences Weiyue Zhao et.al. 2306.04231 null
2023-06-03 LDEB – Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues Amitabha Dey et.al. 2306.02193 null
2023-06-02 Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images Marcela Mera-Trujillo et.al. 2306.01938 null
2023-06-01 A Probabilistic Relaxation of the Two-Stage Object Pose Estimation Paradigm Onur Beker et.al. 2306.00892 null
2023-05-30 Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection Supeng Wang et.al. 2305.18714 link
2023-05-23 Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Grace Luo et.al. 2305.14334 null
2023-05-15 Non-Separable Multi-Dimensional Network Flows for Visual Computing Viktoria Ehm et.al. 2305.08628 null
2023-05-13 Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance Xinyu Lin et.al. 2305.07943 link
2023-05-05 HD2Reg: Hierarchical Descriptors and Detectors for Point Cloud Registration Canhui Tang et.al. 2305.03487 link
2023-04-17 Human Pose Estimation in Monocular Omnidirectional Top-View Images Jingrui Yu et.al. 2304.08186 null
2023-04-14 CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression Mubariz Zaffar et.al. 2304.07426 null
2023-04-12 SiLK – Simple Learned Keypoints Pierre Gleize et.al. 2304.06194 link
2023-04-06 From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection Changsheng Lu et.al. 2304.03140 null
2023-03-29 NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud Xiangyu Zhu et.al. 2303.16465 link
2023-03-24 PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View Ze Shi et.al. 2303.14095 link
2023-03-23 Semantic Image Attack for Visual Model Diagnosis Jinqi Luo et.al. 2303.13010 null
2023-03-22 Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation Heng Yang et.al. 2303.12246 link
2023-03-21 RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network Sangmin Yoo et.al. 2303.10770 null
2023-03-17 ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty Vanessa Wirth et.al. 2303.10042 null
2023-03-15 Descriptor Distillation for Efficient Multi-Robot SLAM Xiyue Guo et.al. 2303.08420 null
2023-03-15 From Local Binary Patterns to Pixel Difference Networks for Efficient Visual Representation Learning Zhuo Su et.al. 2303.08414 null
2023-03-16 KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF Grasp Synthesis on RGB-D input Yiye Chen et.al. 2303.05617 link
2023-03-07 External Camera-based Mobile Robot Pose Estimation for Collaborative Perception with Smart Edge Sensors Simon Bultmann et.al. 2303.03797 null
2023-02-26 PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints Detection Shenwei Xie et.al. 2302.13263 null
2023-02-24 Hybrid machine-learned homogenization: Bayesian data mining and convolutional neural networks Julian Lißner et.al. 2302.12545 null
2023-02-21 Deep Reinforcement Learning Based on Local GNN for Goal-conditioned Deformable Object Rearranging Yuhong Deng et.al. 2302.10446 null
2023-02-12 A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training Jingnan Shi et.al. 2302.06019 null
2023-02-11 Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing Zitong Yu et.al. 2302.05744 null
2023-02-09 MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection Yuhe Ding et.al. 2302.04589 link
2023-02-03 Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation Jie Yang et.al. 2302.01593 link
2023-02-03 Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization Yingying Zhu et.al. 2302.01572 link
2023-01-21 Vision Aided Environment Semantics Extraction and Its Application in mmWave Beam Selection Feiyang Wen et.al. 2301.08973 null
2023-01-18 OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models Xingyi He et.al. 2301.07673 null
2023-01-12 Towards High Performance One-Stage Human Pose Estimation Ling Li et.al. 2301.04842 null
2022-12-31 Rethinking Rotation Invariance with Point Cloud Registration Jianhui Yu et.al. 2301.00149 null
2023-02-06 Fruit Ripeness Classification: a Survey Matteo Rizzo et.al. 2212.14441 null
2022-12-28 NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action Kuan-Chieh Wang et.al. 2212.13660 link
2022-12-24 HandsOff: Labeled Dataset Generation With No Additional Human Annotations Austin Xu et.al. 2212.12645 null
2022-12-13 Learning to Detect Good Keypoints to Match Non-Rigid Objects in RGB Images Welerson Melo et.al. 2212.09589 link
2022-12-15 Learning Markerless Robot-Depth Camera Calibration and End-Effector Pose Estimation Bugra C. Sefercik et.al. 2212.07567 null
2023-02-01 DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization Xiangyu Xu et.al. 2212.04575 null
2022-12-07 ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation Yufei Xu et.al. 2212.04246 link
2022-12-15 Designing Feature Vector Representations: A case study from Chemistry Signe Sidwall Thygesen et.al. 2212.03731 null
2022-12-09 DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model Jeongjun Choi et.al. 2212.02796 link
2022-12-05 Images Speak in Images: A Generalist Painter for In-Context Visual Learning Xinlong Wang et.al. 2212.02499 link
2022-12-06 R2FD2: Fast and Robust Matching of Multimodal Remote Sensing Image via Repeatable Feature Detector and Rotation-invariant Feature Descriptor Bai Zhu et.al. 2212.02277 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069 link
2022-11-29 BALF: Simple and Efficient Blur Aware Local Feature Detector Zhenjun Zhao et.al. 2211.14731 null
2022-11-21 Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching Paul Roetzer et.al. 2211.11589 link
2022-11-07 Learning Feature Descriptors for Pre- and Intra-operative Point Cloud Matching for Laparoscopic Liver Registration Zixin Yang et.al. 2211.03688 null
2022-10-31 Tree Detection and Diameter Estimation Based on Deep Learning Vincent Grondin et.al. 2210.17424 link
2022-10-26 Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds Zhiyuan Zhang et.al. 2210.14899 null
2022-10-23 Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders Ömer Sümer et.al. 2210.12705 null
2022-10-21 Real-time Detection of 2D Tool Landmarks with Synthetic Training Data Bram Vanherle et.al. 2210.11991 null
2022-10-09 Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning Ali Safa et.al. 2210.04236 null
2022-10-04 Centroid Distance Keypoint Detector for Colored Point Clouds Hanzhe Teng et.al. 2210.01298 link
2022-09-28 Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences Jun-Jee Chao et.al. 2209.14419 null
2022-09-28 USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation Zhengrong Xue et.al. 2209.13864 null
2022-10-16 Suture Thread Spline Reconstruction from Endoscopic Images for Robotic Surgery with Reliability-driven Keypoint Detection Neelay Joglekar et.al. 2209.13657 link
2022-09-27 Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors Hao Dong et.al. 2209.13586 link
2022-09-26 Performance Evaluation of 3D Keypoint Detectors and Descriptors on Coloured Point Clouds in Subsea Environments Kyungmin Jung et.al. 2209.12881 null
2022-10-07 Long-Lived Accurate Keypoints in Event Streams Philippe Chiberre et.al. 2209.10385 null
2022-09-20 Integrative Feature and Cost Aggregation with Transformers for Dense Correspondence Sunghwan Hong et.al. 2209.08742 null
2022-09-15 Online Marker-free Extrinsic Camera Calibration using Person Keypoint Detections Bastian Pätzold et.al. 2209.07393 link
2022-09-07 Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip Yang Li et.al. 2209.03440 null
2022-08-27 Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes Ali Safa et.al. 2208.12997 null
2022-08-24 Self-Supervised Endoscopic Image Key-Points Matching Manel Farhat et.al. 2208.11424 link
2022-08-19 Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture Muhammad Muzammel et.al. 2208.08224 null
2022-08-08 MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware Ambidextrous Bin Picking via Physics-based Metaverse Synthesis Maximilian Gilles et.al. 2208.03963 null
2022-08-07 CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization Yujiao Shi et.al. 2208.03660 null
2022-07-29 Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation Qihao Liu et.al. 2208.00090 null
2022-07-25 Translating a Visual LEGO Manual to a Machine-Executable Plan Ruocheng Wang et.al. 2207.12572 null
2022-07-21 Multi-modal Retinal Image Registration Using a Keypoint-Based Vessel Structure Aligning Network Aline Sindel et.al. 2207.10506 null
2022-07-15 Human keypoint detection for close proximity human-robot interaction Jan Docekal et.al. 2207.07742 null
2022-07-15 Adversarial Focal Loss: Asking Your Discriminator for Hard Examples Chen Liu et.al. 2207.07739 null
2022-07-13 Rapid Person Re-Identification via Sub-space Consistency Regularization Qingze Yin et.al. 2207.05933 null
2022-07-07 RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments Qihao Peng et.al. 2207.03539 null
2022-08-15 Semi-supervised Human Pose Estimation in Art-historical Images Matthias Springstein et.al. 2207.02976 link
2022-07-01 Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling Jiamin Liang et.al. 2207.00474 null
2022-06-24 Motion Estimation for Large Displacements and Deformations Qiao Chen et.al. 2206.12464 null
2022-06-24 Deep embedded clustering algorithm for clustering PACS repositories Teo Manojlović et.al. 2206.12417 null
2022-06-21 KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences Xuanhan Wang et.al. 2206.10090 link
2022-06-20 Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval Guile Wu et.al. 2206.09806 null
2022-06-15 A Unified Sequence Interface for Vision Tasks Ting Chen et.al. 2206.07669 link
2022-06-09 Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields Mingtong Zhang et.al. 2206.04669 null
2022-06-03 SNAKE: Shape-aware Neural 3D Keypoint Field Chengliang Zhong et.al. 2206.01724 link
2022-05-17 MulT: An End-to-End Multitask Learning Transformer Deblina Bhattacharjee et.al. 2205.08303 null
2022-05-10 ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions In-the-Wild Chirag Raman et.al. 2205.05177 link
2022-04-28 Polarimetric imaging for the detection of synthetic models of SARS-CoV-2: a proof of concept Emilio Gomez-Gonzalez et.al. 2204.14050 null
2022-05-02 GRIT: General Robust Image Task Benchmark Tanmay Gupta et.al. 2204.13653 link
2022-05-24 ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation Yufei Xu et.al. 2204.12484 link
2022-04-26 Unified GCNs: Towards Connecting GCNs with CNNs Ziyan Zhang et.al. 2204.12300 null
2022-04-19 Self-Supervised Equivariant Learning for Oriented Keypoint Detection Jongmin Lee et.al. 2204.08613 link
2022-04-17 The Z-axis, X-axis, Weight and Disambiguation Methods for Constructing Local Reference Frame in 3D Registration: An Evaluation Bao Zhao et.al. 2204.08024 null
2022-04-15 2D Human Pose Estimation: A Survey Haoming Chen et.al. 2204.07370 null
2022-04-11 Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification Haojie Liu et.al. 2204.04842 null
2022-04-07 Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification Yanan Wang et.al. 2204.02611 link
2022-04-02 SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning Nilaksh Das et.al. 2204.00734 link
2022-04-01 MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration Chenzhong Gao et.al. 2204.00260 null
2022-03-29 Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning David Howard et.al. 2203.15172 null
2022-03-28 REGTR: End-to-end Point Cloud Correspondences with Transformers Zi Jian Yew et.al. 2203.14517 link
2022-03-27 UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection Ye Liu et.al. 2203.12745 link
2022-03-21 MatchFormer: Interleaving Attention in Transformers for Feature Matching Qing Wang et.al. 2203.09645 link
2022-03-16 PosePipe: Open-Source Human Pose Estimation Pipeline for Clinical Research R. James Cotton et.al. 2203.08792 link
2022-03-11 DRTAM: Dual Rank-1 Tensor Attention Module Hanxing Chi et.al. 2203.05893 null
2022-03-07 Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation Meng Tian et.al. 2203.03498 null
2022-02-10 Motion-Aware Transformer For Occluded Person Re-identification Mi Zhou et.al. 2202.04243 null
2022-02-03 Sim2Real Object-Centric Keypoint Detection and Description Chengliang Zhong et.al. 2202.00448 null
2022-01-16 Cross-Centroid Ripple Pattern for Facial Expression Recognition Monu Verma et.al. 2201.05958 null
2022-01-14 Reproducing BowNet: Learning Representations by Predicting Bags of Visual Words Harry Nguyen et.al. 2201.03556 link
2022-01-10 Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials Jinnavat Sanalohit et.al. 2201.03170 null
2022-01-06 A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration Aline Sindel et.al. 2201.02242 null
2021-12-28 Skin feature point tracking using deep feature encodings Jose Ramon Chang et.al. 2112.14159 null
2021-12-23 Data-efficient learning for 3D mirror symmetry detection Yancong Lin et.al. 2112.12579 null
2021-12-22 Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations – combining input rotations and a kinematic model Michael Zwölfer et.al. 2112.12193 null
2021-12-22 Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction Henrique Siqueira et.al. 2112.12002 link
2021-12-19 Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection Renjie Li et.al. 2112.10275 null
2021-12-19 GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor Jean-Baptiste Carluer et.al. 2112.10258 link
2021-12-16 Masked Feature Prediction for Self-Supervised Visual Pre-Training Chen Wei et.al. 2112.09133 link
2021-12-13 DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points Zhengfei Kuang et.al. 2112.06910 null
2021-12-12 Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species Changsheng Lu et.al. 2112.06183 link
2021-12-13 Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings Mel Vecerik et.al. 2112.04910 null
2021-12-06 ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction Xiaoming Zhao et.al. 2112.02906 link
2021-11-25 Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association Sen Yang et.al. 2111.12892 link
2021-11-08 Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object Images Jianfei Guo et.al. 2111.04237 null
2021-11-04 Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image Feng Liu et.al. 2111.03098 null
2021-11-01 Learning Event-based Spatio-Temporal Feature Descriptors via Local Synaptic Plasticity: A Biologically-realistic Perspective of Computer Vision Ali Safa et.al. 2111.00791 null
2021-10-30 Geometry-Aware Hierarchical Bayesian Learning on Manifolds Yonghui Fan et.al. 2111.00184 null
2021-10-26 CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration Hao Yu et.al. 2110.14076 link
2021-10-23 HWTool: Fully Automatic Mapping of an Extensible C++ Image Processing Language to Hardware James Hegarty et.al. 2110.12106 null
2021-10-18 Keypoint-Based Bimanual Shaping of Deformable Linear Objects under Environmental Constraints using Hierarchical Action Planning Shengzeng Huo et.al. 2110.08962 null
2021-10-11 High-order Tensor Pooling with Attention for Action Recognition Piotr Koniusz et.al. 2110.05216 null
2021-10-10 Digging Into Self-Supervised Learning of Feature Descriptors Iaroslav Melekhov et.al. 2110.04773 null
2021-10-04 BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion Zhaoqun Li et.al. 2110.01179 link
2021-10-01 Machine learning aided noise filtration and signal classification for CREDO experiment Łukasz Bibrzycki et.al. 2110.00297 null
2021-09-28 PDC-Net+: Enhanced Probabilistic Dense Correspondence Network Prune Truong et.al. 2109.13912 link
2021-09-27 HarrisZ$^+$: Harris Corner Selection for Next-Gen Image Matching Pipelines Fabio Bellavia et.al. 2109.12925 null
2021-09-24 Catadioptric Stereo on a Smartphone Kristijan Bartol et.al. 2109.11872 null
2021-09-20 Semi-supervised Dense Keypoints using Unlabeled Multiview Images Zhixuan Yu et.al. 2109.09299 null
2021-08-31 A Novel Dataset for Keypoint Detection of quadruped Animals from Images Prianka Banik et.al. 2108.13958 link
2021-08-27 A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images Xiaoteng Zhou et.al. 2108.12151 null

Image Matching

Publish Date Title Authors PDF Code
2025-07-09 Dual-Granularity Cross-Modal Identity Association for Weakly-Supervised Text-to-Person Image Matching Yafei Zhang et.al. 2507.06744 null
2025-07-05 From Query to Explanation: Uni-RAG for Multi-Modal Retrieval-Augmented Learning in STEM Xinyi Wu et.al. 2507.03868 null
2025-07-02 What does really matter in image goal navigation? Gianluca Monaci et.al. 2507.01667 null
2025-06-30 Efficient and Accurate Image Provenance Analysis: A Scalable Pipeline for Large-scale Images Jiewei Lai et.al. 2506.23707 null
2025-06-29 Dynamic Contrastive Learning for Hierarchical Retrieval: A Case Study of Distance-Aware Cross-View Geo-Localization Suofei Zhang et.al. 2506.23077 null
2025-06-27 MatChA: Cross-Algorithm Matching with Feature Augmentation Paula Carbó Cubero et.al. 2506.22336 null
2025-07-07 Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs Shaojie Zhang et.al. 2506.22139 null
2025-06-27 ZeroReg3D: A Zero-shot Registration Pipeline for 3D Consecutive Histopathology Image Reconstruction Juming Xiong et.al. 2506.21923 null
2025-06-25 Fast entropy-regularized SDP relaxations for permutation synchronization Michael Lindsey et.al. 2506.20191 null
2025-06-18 ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections Ziling Huang et.al. 2506.15180 null
2025-06-16 EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition Bingxi Liu et.al. 2506.13133 null
2025-06-12 RealKeyMorph: Keypoints in Real-world Coordinates for Resolution-agnostic Image Registration Mina C. Moghadam et.al. 2506.10344 null
2025-06-11 Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints Xiangkai Zhang et.al. 2506.09748 null
2025-06-11 ScaleLSD: Scalable Deep Line Segment Detection Streamlined Zeran Ke et.al. 2506.09369 link
2025-05-21 Anti-interrupted sampling repeater jamming via linear canonical Wigner distribution lightweight LFM detection Jia-Mian Li et.al. 2506.06302 null
2025-06-05 Vanishing arcs for isolated plane curve singularities Hanwool Bae et.al. 2506.04917 null
2025-06-05 Deep Learning Reforms Image Matching: A Survey and Outlook Shihua Zhang et.al. 2506.04619 null
2025-06-20 SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping Mingxu Zhang et.al. 2505.24305 null
2025-06-05 Universal Domain Adaptation for Semantic Segmentation Seun-An Choe et.al. 2505.22458 null
2025-05-23 To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models Simone Gaisbauer et.al. 2505.17973 link
2025-05-16 Multi-view dense image matching with similarity learning and geometry priors Mohamed Ali Chebbi et.al. 2505.11264 null
2025-05-12 Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection Yuqi Cheng et.al. 2505.07375 link
2025-05-04 OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery Chongsheng Zhang et.al. 2505.03836 link
2025-05-06 LiftFeat: 3D Geometry-Aware Local Feature Matching Yepeng Liu et.al. 2505.03422 link
2025-05-04 Focus What Matters: Matchability-Based Reweighting for Local Feature Matching Dongyue Li et.al. 2505.02161 null
2025-05-15 Mitigating Modality Bias in Multi-modal Entity Alignment from a Causal Perspective Taoyu Su et.al. 2504.19458 link
2025-04-28 Dynamic Arthroscopic Navigation System for Anterior Cruciate Ligament Reconstruction Based on Multi-level Memory Architecture Shuo Wang et.al. 2504.19398 null
2025-04-23 Road Similarity-Based BEV-Satellite Image Matching for UGV Localization Zhenping Sun et.al. 2504.16346 null
2025-04-18 Outlier-Robust Multi-Model Fitting on Quantum Annealers Saurabh Pandey et.al. 2504.13836 null
2025-04-11 Geometric Consistency Refinement for Single Image Novel View Synthesis via Test-Time Adaptation of Diffusion Models Josef Bengtson et.al. 2504.08348 null
2025-04-10 Image registration of 2D optical thin sections in a 3D porous medium: Application to a Berea sandstone digital rock image Jaehong Chung et.al. 2504.06604 link
2025-04-22 To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition Davide Sferrazza et.al. 2504.06116 link
2025-04-10 Learning Affine Correspondences by Integrating Geometric Constraints Pengju Sun et.al. 2504.04834 link
2025-04-01 Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data Yiqun Duan et.al. 2504.00812 null
2025-03-31 CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching Zizhuo Li et.al. 2503.23925 null
2025-03-28 Pairwise Matching of Intermediate Representations for Fine-grained Explainability Lauren Shrack et.al. 2503.22881 link
2025-03-26 Multimodal Image Matching based on Frequency-domain Information of Local Energy Response Meng Yang et.al. 2503.20827 null
2025-03-22 Normalized Matching Transformer Abtin Pourhadi et.al. 2503.17715 link
2025-03-20 Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors Tian Yi Lim et.al. 2503.16275 null
2025-03-20 MapGlue: Multimodal Remote Sensing Image Matching Peihao Wu et.al. 2503.16185 link
2025-03-19 PAPI-Reg: Patch-to-Pixel Solution for Efficient Cross-Modal Registration between LiDAR Point Cloud and Camera Image Yuanchao Yue et.al. 2503.15285 null
2025-04-07 Less Biased Noise Scale Estimation for Threshold-Robust RANSAC Johan Edstedt et.al. 2503.13433 null
2025-03-17 SatDepth: A Novel Dataset for Satellite Image Matching Rahul Deshmukh et.al. 2503.12706 link
2025-03-14 Refining Image Edge Detection via Linear Canonical Riesz Transforms Shuhui Yang et.al. 2503.11148 null
2025-03-13 Speedy MASt3R Jingxing Li et.al. 2503.10017 null
2025-03-11 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-06 Learning 3D Medical Image Models From Brain Functional Connectivity Network Supervision For Mental Disorder Diagnosis Xingcan Hu et.al. 2503.04205 null
2025-03-07 Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration Qianliang Wu et.al. 2503.04127 null
2025-03-05 JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba Xiaoyong Lu et.al. 2503.03437 null
2025-02-28 CNSv2: Probabilistic Correspondence Encoded Neural Image Servo Anzhe Chen et.al. 2503.00132 null
2025-02-27 A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization Yejun Zhang et.al. 2502.20036 link
2025-02-27 RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges Thibaut Loiseau et.al. 2502.19955 null
2025-02-26 BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure Haoxin Cai et.al. 2502.19242 link
2025-02-25 PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching Han Nie et.al. 2502.18104 link
2025-02-25 Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking Xin Tong et.al. 2502.17766 null
2025-03-04 Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model Yaxuan Huang et.al. 2502.16779 null
2025-02-16 FeaKM: Robust Collaborative Perception under Noisy Pose Conditions Jiuwu Hao et.al. 2502.11003 link
2025-02-24 Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation Emanuele Mule et.al. 2502.06288 link
2025-02-04 Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications William O’Donnell et.al. 2502.02624 null
2025-02-01 MambaGlue: Fast and Robust Local Feature Matching With Mamba Kihwan Ryoo et.al. 2502.00462 link
2025-01-24 Dense-SfM: Structure from Motion with Dense Consistent Matching JongMin Lee et.al. 2501.14277 null
2025-01-20 MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Yepeng Liu et.al. 2501.11299 null
2025-01-13 MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training Xingyi He et.al. 2501.07556 null
2025-01-13 Matching Free Depth Recovery from Structured Light Zhuohang Yu et.al. 2501.07113 null
2025-01-02 Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views Yulun Wu et.al. 2501.01196 null
2024-12-31 Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights Bharath Kumar Agnur et.al. 2412.20210 null
2024-12-27 MINIMA: Modality Invariant Image Matching Xingyu Jiang et.al. 2412.19412 link
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221 link
2024-12-17 Bringing Multimodality to Amazon Visual Search System Xinliang Zhu et.al. 2412.13364 null
2024-12-04 Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis Siyoon Jin et.al. 2412.03150 null
2024-11-20 DT-LSD: Deformable Transformer-based Line Segment Detection Sebastian Janampa et.al. 2411.13005 link
2024-11-15 Image Matching Filtering and Refinement by Planes and Beyond Fabio Bellavia et.al. 2411.09484 link
2024-11-11 XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration Ismail Can Yagmur et.al. 2411.07430 link
2024-11-07 The Impact of Semi-Supervised Learning on Line Segment Detection Johanna Engman et.al. 2411.04596 link
2024-11-04 Silver medal Solution for Image Matching Challenge 2024 Yian Wang et.al. 2411.01851 null
2024-10-30 Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants Azadeh Sharafi et.al. 2410.23329 null
2024-11-05 RelationBooth: Towards Relation-Aware Customized Object Generation Qingyu Shi et.al. 2410.23280 null
2024-10-31 ETO: Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses Junjie Ni et.al. 2410.22733 null
2024-10-30 LoFLAT: Local Feature Matching using Focused Linear Attention Transformer Naijian Cao et.al. 2410.22710 null
2024-10-26 Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification Yue Su et.al. 2410.20097 null
2024-10-01 A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference Yuan Li et.al. 2410.11848 null
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-09-27 Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras Yipeng Lu et.al. 2409.18673 null
2024-09-25 Game4Loc: A UAV Geo-Localization Benchmark from Game Data Yuxiang Ji et.al. 2409.16925 link
2024-09-24 Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge Marek Wodzinski et.al. 2409.15931 null
2024-09-10 Weakly-supervised Camera Localization by Ground-to-satellite Image Registration Yujiao Shi et.al. 2409.06471 link
2024-09-05 Enabling Practical and Privacy-Preserving Image Processing Chao Wang et.al. 2409.03568 null
2024-09-20 A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering Shuang Song et.al. 2409.03032 link
2024-08-29 Super-Resolution works for coastal simulations Zhi-Song Liu et.al. 2408.16553 null
2024-09-15 Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla et.al. 2408.16445 link
2024-08-26 Affine steerers for structured keypoint description Georg Bökman et.al. 2408.14186 link
2024-08-25 TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers Chuanrui Zhang et.al. 2408.13770 null
2024-09-11 Coarse-to-fine Alignment Makes Better Speech-image Retrieval Lifeng Zhou et.al. 2408.13119 null
2024-08-19 BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval Zhenyu Lu et.al. 2408.10383 null
2024-08-14 RSD-DOG: A New Image Descriptor based on Second Order Derivatives Darshan Venkatrayappa et.al. 2408.07687 null
2024-08-09 One Shot is Enough for Sequential Infrared Small Target Segmentation Bingbing Dan et.al. 2408.04823 link
2024-08-07 PRISM: PRogressive dependency maxImization for Scale-invariant image Matching Xudong Cai et.al. 2408.03598 null
2024-08-05 ConDL: Detector-Free Dense Image Matching Monika Kwiatkowski et.al. 2408.02766 null
2024-08-04 Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image Xinlin Ren et.al. 2408.02079 link
2024-07-29 Image-text matching for large-scale book collections Artemis Llabrés et.al. 2407.19812 link
2024-07-26 PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis Sohyeong Kim et.al. 2407.18695 null
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-17 GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection Jingwen Yu et.al. 2407.11736 link
2024-07-16 REMM: Rotation-Equivariant Framework for End-to-End Multimodal Image Matching Han Nie et.al. 2407.11637 link
2024-07-16 A Self-Correcting Strategy of the Digital Volume Correlation Displacement Field Based on Image Matching: Application to Poor Speckles Quality and Complex-Large Deformation Chengsheng Li et.al. 2407.11287 null
2024-07-14 Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching Xiaoyong Lu et.al. 2407.07789 null
2024-07-10 Mutual Information calculation on different appearances Jiecheng Liao et.al. 2407.07410 null
2024-07-15 SfM on-the-fly: Get better 3D from What You Capture Zongqian Zhan et.al. 2407.03939 null
2024-07-03 IMC 2024 Methods & Solutions Review Shyam Gupta et.al. 2407.03172 null
2024-06-21 High Resolution Surface Reconstruction of Cultural Heritage Objects Using Shape from Polarization Method F. S. Mortazavi et.al. 2406.15121 null
2024-06-16 Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models Yikai Zhang et.al. 2406.10902 link
2024-06-14 Grounding Image Matching in 3D with MASt3R Vincent Leroy et.al. 2406.09756 link
2024-06-05 A Self-Supervised Denoising Strategy for Underwater Acoustic Camera Imageries Xiaoteng Zhou et.al. 2406.02914 null
2024-05-22 Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching Hongkai Chen et.al. 2405.13874 null
2024-05-21 OmniGlue: Generalizable Feature Matching with Foundation Model Guidance Hanwen Jiang et.al. 2405.12979 link
2024-07-09 Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation Rezkellah Noureddine Khiati et.al. 2405.08556 link
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-13 Authentic Hand Avatar from a Phone Scan via Universal Hand Model Gyeongsik Moon et.al. 2405.07933 null
2024-04-30 A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images Wang Zhang et.al. 2404.19311 null
2024-04-30 XFeat: Accelerated Features for Lightweight Image Matching Guilherme Potje et.al. 2404.19174 null
2024-06-10 MinBackProp – Backpropagating through Minimal Solvers Diana Sungatullina et.al. 2404.17993 link
2024-04-25 Transformer-Based Local Feature Matching for Multimodal Image Registration Remi Delaunay et.al. 2404.16802 null
2024-04-23 FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction Hang Hua et.al. 2404.14715 null
2024-04-22 Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer Eric Brachmann et.al. 2404.14351 null
2024-04-17 A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching Francesco Pro et.al. 2404.11302 link
2024-04-16 Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction John Francis et.al. 2404.10626 null
2024-04-15 XoFTR: Cross-modal Feature Matching Transformer Önder Tuzcuoğlu et.al. 2404.09692 link
2024-04-13 DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector Johan Edstedt et.al. 2404.08928 link
2024-04-09 Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences Axel Barroso-Laguna et.al. 2404.06337 link
2024-04-01 Marrying NeRF with Feature Matching for One-step Pose Estimation Ronghan Chen et.al. 2404.00891 null
2024-04-01 3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching Yibin Ye et.al. 2404.00838 null
2024-03-31 On the Estimation of Image-matching Uncertainty in Visual Place Recognition Mubariz Zaffar et.al. 2404.00546 null
2024-03-30 Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation Yuan Wang et.al. 2404.00262 null
2024-03-26 Staircase Localization for Autonomous Exploration in Urban Environments Jinrae Kim et.al. 2403.17330 null
2024-03-23 MatchSeg: Towards Better Segmentation via Reference Image Matching Ruiqiang Xiao et.al. 2403.15901 link
2024-03-20 Unifying Local and Global Multimodal Features for Place Recognition in Aliased and Low-Texture Environments Alberto García-Hernández et.al. 2403.13395 link
2024-03-19 HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free Matching Ying Chen et.al. 2403.12543 null
2024-03-16 Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval Shunsuke Tsubaki et.al. 2403.10756 null
2024-03-16 Vector search with small radiuses Gergely Szilvasy et.al. 2403.10746 null
2024-03-15 Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline Fangming Yuan et.al. 2403.10283 null
2024-03-15 Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning Meixuan Li et.al. 2403.10252 null
2024-03-14 Virtual birefringence imaging and histological staining of amyloid deposits in label-free tissue using autofluorescence microscopy and deep learning Xilin Yang et.al. 2403.09100 null
2024-03-18 Matching Non-Identical Objects Yusuke Marumo et.al. 2403.08227 null
2024-03-11 Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed Yifan Wang et.al. 2403.04765 null
2024-03-07 Scene Depth Estimation from Traditional Oriental Landscape Paintings Sungho Kang et.al. 2403.03408 null
2024-02-21 Visual Style Prompting with Swapping Self-Attention Jaeseok Jeong et.al. 2402.12974 link
2024-02-16 GIM: Learning Generalizable Image Matcher From Internet Videos Xuelun Shen et.al. 2402.11095 link
2024-02-13 Are Semi-Dense Detector-Free Methods Good at Matching Local Features? Matthieu Vilain et.al. 2402.08671 null
2024-02-13 Learning to Produce Semi-dense Correspondences for Visual Localization Khang Truong Giang et.al. 2402.08359 link
2024-01-31 Improved Scene Landmark Detection for Camera Localization Tien Do et.al. 2401.18083 link
2024-03-11 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592 link
2024-01-24 Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry Qi Cai et.al. 2401.13357 null
2024-01-19 SCENES: Subpixel Correspondence Estimation With Epipolar Supervision Dominik A. Kloepfer et.al. 2401.10886 null
2024-01-18 Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation Songhe Deng et.al. 2401.09883 link
2024-01-26 RomniStereo: Recurrent Omnidirectional Stereo Matching Hualie Jiang et.al. 2401.04345 link
2024-01-05 CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs Daoan Zhang et.al. 2401.02582 null
2024-01-03 Local Adaptive Clustering Based Image Matching for Automatic Visual Identification Zhizhen Wang et.al. 2401.01720 null
2024-01-03 A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization Shishen Li et.al. 2401.01574 null
2023-12-23 BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation Tavis Shore et.al. 2312.15363 link
2023-12-22 Harnessing Diffusion Models for Visual Perception with Meta Prompts Qiang Wan et.al. 2312.14733 link
2024-01-05 MatchDet: A Collaborative Framework for Image Matching and Object Detection Jinxiang Lai et.al. 2312.10983 null
2023-12-07 Visual Geometry Grounded Deep Structure From Motion Jianyuan Wang et.al. 2312.04563 null
2023-12-04 Steerers: A framework for rotation equivariant keypoint descriptors Georg Bökman et.al. 2312.02152 link
2023-11-30 DSeg: Direct Line Segments Detection Berger Cyrille et.al. 2311.18344 null
2023-11-30 Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications Sahar Almahfouz Nasser et.al. 2311.18281 null
2023-11-29 LGFCTR: Local and Global Feature Convolutional Transformer for Image Matching Wenhao Zhong et.al. 2311.17571 link
2023-11-08 Zero-shot Translation of Attention Patterns in VQA Models to Natural Language Leonard Salewski et.al. 2311.05043 link
2023-11-06 An invariant feature extraction for multi-modal images matching Chenzhong Gao et.al. 2311.02842 null
2023-10-23 RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments Jinyu Li et.al. 2310.15072 link
2023-10-23 Player Re-Identification Using Body Part Appearences Mahesh Bhosale et.al. 2310.14469 null
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605 null
2023-11-14 RGM: A Robust Generalist Matching Model Songyan Zhang et.al. 2310.11755 link
2023-10-07 UFD-PRiME: Unsupervised Joint Learning of Optical Flow and Stereo Depth through Pixel-Level Rigid Motion Estimation Shuai Yuan et.al. 2310.04712 null
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092 null
2023-09-29 Segment Anything Model is a Good Teacher for Local Feature Learning Jingqian Wu et.al. 2309.16992 link
2023-09-27 KDD-LOAM: Jointly Learned Keypoint Detector and Descriptors Assisted LiDAR Odometry and Mapping Renlang Huang et.al. 2309.15394 null
2023-10-13 A Critical Analysis of Internal Reliability for Uncertainty Quantification of Dense Image Matching in Multi-view Stereo Debao Huang et.al. 2309.09379 null
2023-09-11 Towards Content-based Pixel Retrieval in Revisited Oxford and Paris Guoyuan An et.al. 2309.05438 link
2023-09-09 Neural Semantic Surface Maps Luca Morreale et.al. 2309.04836 null
2023-09-05 Doppelgangers: Learning to Disambiguate Images of Similar Structures Ruojin Cai et.al. 2309.02420 link
2023-08-14 Occ$^2$Net: Robust Image Matching Based on 3D Occupancy Estimation for Occluded Regions Miao Fan et.al. 2308.16160 null
2023-08-29 TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching Yun Liao et.al. 2308.15144 null
2023-08-27 LDL: Line Distance Functions for Panoramic Localization Junho Kim et.al. 2308.13989 link
2023-08-22 Scene-Aware Feature Matching Xiaoyong Lu et.al. 2308.09949 null
2023-09-03 DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching Johan Edstedt et.al. 2308.08479 link
2023-08-19 Global Features are All You Need for Image Retrieval and Reranking Shihao Shao et.al. 2308.06954 link
2023-08-02 ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation Bo Zhang et.al. 2308.00400 link
2023-07-28 Cross-Modal Concept Learning and Inference for Vision-Language Models Yi Zhang et.al. 2307.15460 null
2023-07-22 CryptoMask: Privacy-preserving Face Recognition Jianli Bai et.al. 2307.12010 null
2023-07-22 A Stronger Stitching Algorithm for Fisheye Images based on Deblurring and Registration Jing Hao et.al. 2307.11997 null
2023-07-21 Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data Sahar Almahfouz Nasser et.al. 2307.10698 link
2023-08-08 Balancing Privacy and Progress in Artificial Intelligence: Anonymization in Histopathology for Biomedical Research and Education Neel Kanwal et.al. 2307.09426 null
2023-08-01 Unsupervised Deep Graph Matching Based on Cycle Consistency Siddharth Tourani et.al. 2307.08930 link
2023-07-15 Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents Ke Cao et.al. 2307.07763 null
2023-07-09 Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion Jie S. Li et.al. 2307.05564 null
2023-07-11 ResMatch: Residual Attention Learning for Local Feature Matching Yuxin Deng et.al. 2307.05180 link
2023-07-11 TIAM – A Metric for Evaluating Alignment in Text-to-Image Generation Paul Grimal et.al. 2307.05134 link
2023-07-02 TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching Khang Truong Giang et.al. 2307.00485 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669 link
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667 null
2023-06-25 Enhancing Dynamic Image Advertising with Vision-Language Pre-training Zhoufutu Wen et.al. 2306.14112 null
2023-06-23 LightGlue: Local Feature Matching at Light Speed Philipp Lindenberger et.al. 2306.13643 link
2023-06-19 Graph Self-Supervised Learning for Endoscopic Image Matching Manel Farhat et.al. 2306.11141 link
2023-06-09 Leaving the Lines Behind: Vision-Based Crop Row Exit for Agricultural Robot Navigation Rajitha de Silva et.al. 2306.05869 null
2023-06-07 A2B: Anchor to Barycentric Coordinate for Robust Correspondence Weiyue Zhao et.al. 2306.02760 null
2023-05-27 Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation Yueh-Cheng Huang et.al. 2305.17463 null
2023-05-19 SIDAR: Synthetic Image Dataset for Alignment & Restoration Monika Kwiatkowski et.al. 2305.12036 link
2023-05-18 LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation Yujie Lu et.al. 2305.11116 link
2023-05-16 A Method for Training-free Person Image Picture Generation Tianyu Chen et.al. 2305.09817 null
2023-05-15 Image Matching by Bare Homography Fabio Bellavia et.al. 2305.08946 null
2023-05-12 CLIP-Count: Towards Text-Guided Zero-Shot Object Counting Ruixiang Jiang et.al. 2305.07304 link
2023-05-10 SENDD: Sparse Efficient Neural Depth and Deformation for Tissue Tracking Adam Schmidt et.al. 2305.06477 null
2023-05-10 Level-line Guided Edge Drawing for Robust Line Segment Detection Xinyu Lin et.al. 2305.05883 link
2023-05-09 ColonMapper: topological mapping and localization for colonoscopy Javier Morlana et.al. 2305.05546 null
2023-04-29 A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges Xinyu Lin et.al. 2305.00264 link
2023-04-28 SFD2: Semantic-guided Feature Detection and Description Fei Xue et.al. 2304.14845 link
2023-04-17 DeepSim-Nets: Deep Similarity Networks for Stereo Image Matching Mohamed Ali Chebbi et.al. 2304.08056 link
2023-04-16 Long-term Visual Localization with Mobile Sensors Shen Yan et.al. 2304.07691 null
2023-04-12 SiLK – Simple Learned Keypoints Pierre Gleize et.al. 2304.06194 link
2023-04-16 ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation Xiaoming Zhao et.al. 2304.03608 link
2023-04-04 GlueStick: Robust Image Matching by Sticking Points and Lines Together Rémi Pautrat et.al. 2304.02008 link
2023-04-03 PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching Pedro Castro et.al. 2304.01382 null
2023-04-02 Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints Guilherme Potje et.al. 2304.00583 link
2023-04-13 Structured Epipolar Matcher for Local Feature Matching Jiahao Chang et.al. 2303.16646 null
2023-03-29 Adaptive Spot-Guided Transformer for Consistent Local Feature Matching Jiahuan Yu et.al. 2303.16624 null
2023-03-28 ASIC: Aligning Sparse in-the-wild Image Collections Kamal Gupta et.al. 2303.16201 null
2023-03-25 Learning Rotation-Equivariant Features for Visual Correspondence Jongmin Lee et.al. 2303.15472 null
2023-03-27 Learnable Graph Matching: A Practical Paradigm for Data Association Jiawei He et.al. 2303.15414 link
2023-03-24 Efficient and Accurate Co-Visible Region Localization with Matching Key-Points Crop (MKPC): A Two-Stage Pipeline for Enhancing Image Matching Performance Hongjian Song et.al. 2303.13794 null
2023-03-15 Rethinking Optical Flow from Geometric Matching Consistent Perspective Qiaole Dong et.al. 2303.08384 link
2023-04-04 PATS: Patch Area Transportation with Subdivision for Local Feature Matching Junjie Ni et.al. 2303.07700 null
2023-03-07 Parsing Line Segments of Floor Plan Images Using Graph Neural Networks Mingxiang Chen et.al. 2303.03851 null
2023-03-06 Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints Chenjie Cao et.al. 2303.02885 link
2023-03-10 ParaFormer: Parallel Attention Transformer for Efficient Feature Matching Xiaoyong Lu et.al. 2303.00941 null
2023-03-01 RIFT2: Speeding-up RIFT with A New Rotation-Invariance Technique Jiayuan Li et.al. 2303.00319 link
2023-02-28 Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images Zhongli Fan et.al. 2302.14239 link
2023-02-25 BrainCLIP: Bridging Brain and Visual-Linguistic Representation via CLIP for Generic Natural Visual Stimulus Decoding from fMRI Yulong Liu et.al. 2302.12971 link
2023-02-24 Classification of structural building damage grades from multi-temporal photogrammetric point clouds using a machine learning model trained on virtual laser scanning data Vivien Zahs et.al. 2302.12591 null
2023-02-20 A Large Scale Homography Benchmark Daniel Barath et.al. 2302.09997 link
2023-02-12 OAMatcher: An Overlapping Areas-based Network for Accurate Local Feature Matching Kun Dai et.al. 2302.05846 link
2023-02-10 General, Single-shot, Target-less, and Automatic LiDAR-Camera Extrinsic Calibration Toolbox Kenji Koide et.al. 2302.05094 link
2023-02-03 Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization Yingying Zhu et.al. 2302.01572 link
2023-01-27 Harmonizing Flows: Unsupervised MR harmonization based on normalizing flows Farzad Beizaee et.al. 2301.11551 link
2023-01-25 Local Feature Extraction from Salient Regions by Feature Map Transformation Yerim Jung et.al. 2301.10413 null
2023-01-24 Feature-based Image Matching for Identifying Individual Kākā Fintan O’Sullivan et.al. 2301.06678 null
2023-01-18 Instance Segmentation Based Graph Extraction for Handwritten Circuit Diagram Images Johannes Bayer et.al. 2301.03155 null
2023-01-08 DeepMatcher: A Deep Transformer-based Network for Robust and Accurate Local Feature Matching Tao Xie et.al. 2301.02993 link
2023-01-07 Deep Learning-Based UAV Aerial Triangulation without Image Control Points Jiageng Zhong et.al. 2301.02869 null
2023-01-06 The UNCOVER Survey: A first-look HST+JWST catalog of 50,000 galaxies near Abell 2744 and beyond John R. Weaver et.al. 2301.02671 link
2023-02-13 Translating Text Synopses to Video Storyboards Xu Gu et.al. 2301.00135 link
2022-12-23 SuperGF: Unifying Local and Global Features for Visual Localization Wenzheng Song et.al. 2212.13105 null
2022-12-26 Transformer and GAN Based Super-Resolution Reconstruction Network for Medical Images Weizhi Du et.al. 2212.13068 null
2022-12-20 Seafloor-Invariant Caustics Removal from Underwater Imagery Panagiotis Agrafiotis et.al. 2212.10167 null
2022-12-15 DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients Rémi Pautrat et.al. 2212.07766 link
2022-12-14 Shared Coupling-bridge for Weakly Supervised Local Feature Learning Jiayuan Sun et.al. 2212.07047 link
2022-12-05 Real Time Incremental Image Mosaicking Without Use of Any Camera Parameter Suleyman Melih Portakal et.al. 2212.02302 null
2022-12-05 ObjectMatch: Robust Registration using Canonical Object Correspondences Can Gümeli et.al. 2212.01985 null
2022-12-07 Universe Points Representation Learning for Partial Multi-Graph Matching Zhakshylyk Nurlanov et.al. 2212.00780 null
2022-11-30 Self-Supervised Feature Learning for Long-Term Metric Visual Localization Yuxuan Chen et.al. 2212.00122 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069 link
2022-11-19 Person Text-Image Matching via Text-Feature Interpretability Embedding and External Attack Node Implantation Fan Li et.al. 2211.08657 link
2022-11-20 Detecting Line Segments in Motion-blurred Images with Events Huai Yu et.al. 2211.07365 link
2022-11-15 Fast Key Points Detection and Matching for Tree-Structured Images Hao Wang et.al. 2211.03242 null
2022-10-25 A Comparative Study on Deep-Learning Methods for Dense Image Matching of Multi-angle and Multi-date Remote Sensing Stereo Images Hessah Albanwan et.al. 2210.14031 null
2022-10-11 DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion Yuxi Xiao et.al. 2210.05517 null
2022-10-07 Mars Rover Localization Based on A2G Obstacle Distribution Pattern Matching Lang Zhou et.al. 2210.03398 link
2022-09-27 Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors Hao Dong et.al. 2209.13586 link
2022-09-25 ECO-TR: Efficient Correspondences Finding Via Coarse-to-Fine Refinement Dongli Tan et.al. 2209.12213 null
2022-09-22 DRKF: Distilled Rotated Kernel Fusion for Efficiently Boosting Rotation Invariance in Image Matching Chao Li et.al. 2209.10907 null
2022-11-15 Uncertainty-aware Efficient Subgraph Isomorphism using Graph Topology Arpan Kusari et.al. 2209.09090 null
2022-09-16 SRFeat: Learning Locally Accurate and Globally Consistent Non-Rigid Shape Correspondence Lei Li et.al. 2209.07806 link
2022-08-30 ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer Hongkai Chen et.al. 2208.14201 link
2022-08-25 A GIS Aided Approach for Geolocalizing an Unmanned Aerial System Using Deep Learning Jianli Wei et.al. 2208.12251 link
2022-08-25 UAS Navigation in the Real World Using Visual Observation Yuci Han et.al. 2208.12125 null
2022-08-24 Self-Supervised Endoscopic Image Key-Points Matching Manel Farhat et.al. 2208.11424 link
2022-08-22 Equivariant Hypergraph Neural Networks Jinwoo Kim et.al. 2208.10428 link
2022-09-22 Understanding Attention for Vision-and-Language Tasks Feiqi Cao et.al. 2208.08104 link
2022-08-16 Hierarchical Attention Network for Few-Shot Object Detection via Meta-Contrastive Learning Dongwoo Park et.al. 2208.07039 link
2022-08-04 Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification Xinyu Lin et.al. 2208.02450 link
2022-08-04 OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images Weijia Li et.al. 2208.00928 null
2022-07-29 Testing Relational Understanding in Text-Guided Image Generation Colin Conwell et.al. 2208.00005 null
2022-07-21 Pose for Everything: Towards Category-Agnostic Pose Estimation Lumin Xu et.al. 2207.10387 link
2022-07-20 Explaining Deepfake Detection by Analysing Image Matching Shichao Dong et.al. 2207.09679 link
2022-07-18 Adaptive Assignment for Geometry Aware Local Feature Matching Dihe Huang et.al. 2207.08427 link
2022-07-16 Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching Jiazhen Liu et.al. 2207.07932 link
2022-07-06 Virtual staining of defocused autofluorescence images of unlabeled tissue using deep neural networks Yijie Zhang et.al. 2207.02946 null
2022-07-01 TopicFM: Robust and Interpretable Feature Matching with Topic-assisted Khang Truong Giang et.al. 2207.00328 link
2022-06-16 Virtual Correspondence: Humans as a Cue for Extreme-View Geometry Wei-Chiu Ma et.al. 2206.08365 null
2022-06-15 Self-Supervised Learning of Image Scale and Orientation Jongmin Lee et.al. 2206.07259 link
2022-05-27 Image Keypoint Matching using Graph Neural Networks Nancy Xu et.al. 2205.14275 null
2022-05-27 Fine-tuning deep learning models for stereo matching using results from semi-global matching Hessah Albanwan et.al. 2205.14051 null
2022-05-23 TransforMatcher: Match-to-Match Attention for Semantic Correspondence Seungwook Kim et.al. 2205.11634 link
2022-05-16 ReDFeat: Recoupling Detection and Description for Multimodal Feature Learning Yuxin Deng et.al. 2205.07439 null
2022-05-06 BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo Surgical Image Matching Jingwei Song et.al. 2205.03133 link
2022-05-10 AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching Khanh Nguyen et.al. 2205.02849 link
2022-04-27 Gleo-Det: Deep Convolution Feature-Guided Detector with Local Entropy Optimization for Salient Points Chao Li et.al. 2204.12884 null
2022-04-22 SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite Runzhe Zhu et.al. 2204.10704 link
2022-04-20 Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations Leila Pishdad et.al. 2204.09268 null
2022-04-19 OpenGlue: Open Source Graph Neural Net Based Pipeline for Image Matching Ostap Viniavskyi et.al. 2204.08870 link
2022-04-19 Self-Supervised Equivariant Learning for Oriented Keypoint Detection Jongmin Lee et.al. 2204.08613 link
2022-04-22 Efficient Linear Attention for Fast and Accurate Keypoint Matching Suwichaya Suwanwimolkul et.al. 2204.07731 null
2022-04-08 Lightweight starshade position sensing with convolutional neural networks and simulation-based inference Andrew Chen et.al. 2204.03853 link
2022-03-30 AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift Burak Yildiz et.al. 2203.16291 link
2022-03-29 Photographic Visualization of Weather Forecasts with Generative Adversarial Networks Christian Sigg et.al. 2203.15601 link
2022-03-29 Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots Pranay Mathur et.al. 2203.15272 null
2022-03-28 Optimizing Elimination Templates by Greedy Parameter Search Evgeniy Martyushev et.al. 2203.14901 link
2022-03-28 S2-Net: Self-supervision Guided Feature Representation Learning for Cross-Modality Images Shasha Mei et.al. 2203.14581 null
2022-03-26 Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching Yujiao Shi et.al. 2203.14148 link
2022-03-24 Keypoints Tracking via Transformer Networks Oleksii Nasypanyi et.al. 2203.12848 link
2022-03-21 MatchFormer: Interleaving Attention in Transformers for Feature Matching Qing Wang et.al. 2203.09645 link
2022-03-14 There’s no difference: Convolutional Neural Networks for transient detection without template subtraction Tatiana Acero-Cuellar et.al. 2203.07390 link
2022-03-25 Cross Language Image Matching for Weakly Supervised Semantic Segmentation Jinheng Xie et.al. 2203.02668 link
2022-03-01 CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP Zihao Wang et.al. 2203.00386 null
2022-03-09 Time-resolved Imaging of Stochastic Cascade Reactions over a Submillisecond to Second Time Range at the Angstrom Level Toshiki Shimizu et.al. 2202.13332 null
2022-02-16 Cross-view and Cross-domain Underwater Localization based on Optical Aerial and Acoustic Underwater Images Matheus M. Dos Santos et.al. 2202.07817 null
2022-02-14 CATs++: Boosting Cost Aggregation with Convolutions and Transformers Seokju Cho et.al. 2202.06817 link
2022-02-11 Improving Image-recognition Edge Caches with a Generative Adversarial Network Guilherme B. Souza et.al. 2202.05929 null
2022-02-08 Learning Optical Flow with Adaptive Graph Reasoning Ao Luo et.al. 2202.03857 link
2022-02-03 Sim2Real Object-Centric Keypoint Detection and Description Chengliang Zhong et.al. 2202.00448 null
2022-01-27 Efficient divide-and-conquer registration of UAV and ground LiDAR point clouds through canopy shape context Jie Shao et.al. 2201.11296 null
2021-12-24 Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation Zhiwei Liu et.al. 2112.12917 null
2021-12-20 Scale-Net: Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching Yujie Fu et.al. 2112.10485 null
2021-12-19 GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor Jean-Baptiste Carluer et.al. 2112.10258 link
2021-12-14 More Control for Free! Image Synthesis with Semantic Diffusion Guidance Xihui Liu et.al. 2112.05744 null
2021-12-08 Label-free virtual HER2 immunohistochemical staining of breast tissue using deep learning Bijie Bai et.al. 2112.05240 null
2021-12-01 FaSS-MVS – Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery Boitumelo Ruf et.al. 2112.00821 null
2021-12-01 CLIPstyler: Image Style Transfer with a Single Text Condition Gihyun Kwon et.al. 2112.00374 link
2021-11-29 Nonlinear Intensity Underwater Sonar Image Matching Method Based on Phase Information and Deep Convolution Features Xiaoteng Zhou et.al. 2111.15514 null
2021-11-29 Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic Yoad Tewel et.al. 2111.14447 link
2021-11-29 Heterogeneous Visible-Thermal and Visible-Infrared Face Recognition using Unit-Class Loss and Cross-Modality Discriminator Usman Cheema et.al. 2111.14339 null
2021-11-17 Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network Xiaoming Zhao et.al. 2111.09006 null
2021-11-17 Nonlinear Intensity Sonar Image Matching based on Deep Convolution Features Xiaoteng Zhou et.al. 2111.08994 null
2021-10-30 A Deep Search for Faint Chandra X-ray Sources, Radio Sources, and Optical Counterparts in NGC 6752 Haldan N. Cohn et.al. 2111.00357 null
2021-10-01 Robustly Removing Deep Sea Lighting Effects for Visual Mapping of Abyssal Plains Kevin Köser et.al. 2110.00480 null
2021-09-29 Visually Grounded Concept Composition Bowen Zhang et.al. 2109.14115 null
2021-09-27 HarrisZ$^+$: Harris Corner Selection for Next-Gen Image Matching Pipelines Fabio Bellavia et.al. 2109.12925 null
2021-09-20 Viewpoint Invariant Dense Matching for Visual Geolocalization Gabriele Berton et.al. 2109.09827 link
2021-09-20 Image Subtraction in Fourier Space Lei Hu et.al. 2109.09334 link
2021-09-10 Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization Sungho Yoon et.al. 2109.04753 link
2021-09-08 Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes Wenzheng Song et.al. 2109.03585 null
2021-08-27 A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images Xiaoteng Zhou et.al. 2108.12151 null
2021-08-27 Matching Underwater Sonar Images by the Learned Descriptor Based on Style Transfer Method Xiaoteng Zhou et.al. 2108.12072 null
2021-08-26 Efficient Joint Object Matching via Linear Programming Antonio De Rosa et.al. 2108.11911 null

NeRF

Publish Date Title Authors PDF Code
2025-07-14 VoxelRF: Voxelized Radiance Field for Fast Wireless Channel Modeling Zihang Zeng et.al. 2507.09987 null
2025-07-12 Stable Score Distillation Haiming Zhu et.al. 2507.09168 null
2025-07-11 From images to properties: a NeRF-driven framework for granular material parameter inversion Cheng-Hsi Hsiao et.al. 2507.09005 null
2025-07-10 MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation Bangning Wei et.al. 2507.07519 null
2025-07-14 BayesSDF: Surface-Based Laplacian Uncertainty Estimation for 3D Geometry with Neural Signed Distance Fields Rushil Desai et.al. 2507.06269 null
2025-07-08 Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering Jiayi Song et.al. 2507.06103 null
2025-07-08 DreamArt: Generating Interactable Articulated Objects from a Single Image Ruijie Lu et.al. 2507.05763 null
2025-07-06 A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields Aoxiang Fan et.al. 2507.04408 null
2025-07-02 Tile and Slide: A New Framework for Scaling NeRF from Local to Global 3D Earth Observation Camille Billouard et.al. 2507.01631 null
2025-07-01 Surgical Neural Radiance Fields from One Image Alberto Neri et.al. 2507.00969 null
2025-07-01 PlantSegNeRF: A few-shot, cross-dataset method for plant 3D instance point cloud reconstruction via joint-channel NeRF with multi-view image instance matching Xin Yang et.al. 2507.00371 null
2025-06-30 AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention Ziao Liu et.al. 2506.23611 null
2025-06-29 Dynamic View Synthesis from Small Camera Motion Videos Huiqiang Sun et.al. 2506.23153 null
2025-06-27 UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields Fabian Perez et.al. 2506.21884 null
2025-06-24 ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes Chenhao Zhang et.al. 2506.21629 null
2025-06-26 PanSt3R: Multi-view Consistent Panoptic Segmentation Lojze Zust et.al. 2506.21348 null
2025-06-25 Joint attitude estimation and 3D neural reconstruction of non-cooperative space objects Clément Forray et.al. 2506.20638 null
2025-06-24 NeRF-based CBCT Reconstruction needs Normalization and Initialization Zhuowei Xu et.al. 2506.19742 null
2025-06-25 Self-Supervised Multimodal NeRF for Autonomous Driving Gaurav Sharma et.al. 2506.19615 null
2025-06-24 HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis Xiaoyuan Wang et.al. 2506.19291 null
2025-06-23 MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation Tianchen Deng et.al. 2506.18678 null
2025-06-26 2D Triangle Splatting for Direct Differentiable Mesh Training Kaifeng Sheng et.al. 2506.18575 link
2025-06-22 Limitations of NERF with pre-trained Vision Features for Few-Shot 3D Reconstruction Ankit Sanjyal et.al. 2506.18208 null
2025-06-21 3D Gaussian Splatting for Fine-Detailed Surface Reconstruction in Large-Scale Scene Shihan Chen et.al. 2506.17636 null
2025-06-23 R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision Weeyoung Kwon et.al. 2506.16262 link
2025-06-24 RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories Qingsong Yan et.al. 2506.15242 null
2025-06-17 Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction Zhengquan Zhang et.al. 2506.14856 null
2025-06-18 Rasterizing Wireless Radiance Field via Deformable 2D Gaussian Splatting Mufan Liu et.al. 2506.12787 null
2025-06-17 Efficient multi-view training for 3D Gaussian Splatting Minhyuk Choi et.al. 2506.12727 null
2025-06-12 PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting Lintao Xiang et.al. 2506.10335 null
2025-06-11 The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images without Any 3D Knowledge Haoru Wang et.al. 2506.09885 null
2025-06-10 A Probability-guided Sampler for Neural Implicit Surface Rendering Gonçalo Dias Pais et.al. 2506.08619 null
2025-06-09 Speedy Deformable 3D Gaussian Splatting: Fast Rendering and Compression of Dynamic Scenes Allen Tu et.al. 2506.07917 link
2025-06-20 Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency Xiangyu Guo et.al. 2506.07497 null
2025-06-07 SPC to 3D: Novel View Synthesis from Binary SPC via I2I translation Sumit Sharma et.al. 2506.06890 null
2025-06-06 Splat and Replace: 3D Reconstruction with Repetitive Elements Nicolás Violante et.al. 2506.06462 null
2025-06-06 NeurNCD: Novel Class Discovery via Implicit Neural Representation Junming Wang et.al. 2506.06412 null
2025-06-06 Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments Mingrui Li et.al. 2506.05965 null
2025-06-06 ProJo4D: Progressive Joint Optimization for Sparse-View Inverse Physics Estimation Daniel Rho et.al. 2506.05317 null
2025-06-06 Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting Nan Wang et.al. 2506.05280 link
2025-06-05 Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer Filip Slezak et.al. 2506.04908 null
2025-05-30 Hi-Dyna Graph: Hierarchical Dynamic Scene Graph for Robotic Autonomy in Human-Centric Environments Jiawei Hou et.al. 2506.00083 null
2025-05-29 PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views Mohamed Rayan Barhdadi et.al. 2505.23481 link
2025-05-29 LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering Jonas Kulhanek et.al. 2505.23158 null
2025-05-28 Can NeRFs See without Cameras? Chaitanya Amballa et.al. 2505.22441 null
2025-05-28 Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss Wenjun Lu et.al. 2505.22279 null
2025-05-28 Hyperspectral Gaussian Splatting Sunil Kumar Narayanan et.al. 2505.21890 null
2025-05-27 Structure from Collision Takuhiro Kaneko et.al. 2505.21335 null
2025-05-26 OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender Shintaro Ito et.al. 2505.20126 link
2025-05-30 ErpGS: Equirectangular Image Rendering enhanced with 3D Gaussian Regularization Shintaro Ito et.al. 2505.19883 null
2025-05-26 GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis You Wang et.al. 2505.19813 link
2025-05-26 Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction Li Fang et.al. 2505.19793 link
2025-05-26 ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Wenhua Wu et.al. 2505.19420 null
2025-05-25 Triangle Splatting for Real-Time Radiance Field Rendering Jan Held et.al. 2505.19175 null
2025-05-22 UAV See, UGV Do: Aerial Imagery and Virtual Teach Enabling Zero-Shot Ground Vehicle Repeat Desiree Fisker et.al. 2505.16912 null
2025-05-19 IPENS: Interactive Unsupervised Framework for Rapid Plant Phenotyping Extraction via NeRF-SAM2 Fusion Wentao Song et.al. 2505.13633 null
2025-05-19 3D Gaussian Adaptive Reconstruction for Fourier Light-Field Microscopy Chenyu Xu et.al. 2505.12875 null
2025-05-18 Is Semantic SLAM Ready for Embedded Systems? A Comparative Survey Calvin Galagain et.al. 2505.12384 null
2025-05-16 MutualNeRF: Improve the Performance of NeRF under Limited Samples with Mutual Information Theory Zifan Wang et.al. 2505.11386 null
2025-05-16 EA-3DGS: Efficient and Adaptive 3D Gaussians with Highly Enhanced Quality for outdoor scenes Jianlin Guo et.al. 2505.10787 link
2025-05-15 Large-Scale Gaussian Splatting SLAM Zhe Xin et.al. 2505.09915 null
2025-05-14 Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians Ma Changfeng et.al. 2505.09413 link
2025-05-14 FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling Yue Wen et.al. 2505.09406 null
2025-05-12 TUGS: Physics-based Compact Representation of Underwater Scenes by Tensorized Gaussian Shijie Lian et.al. 2505.08811 null
2025-05-13 FOCI: Trajectory Optimization on Gaussian Splats Mario Gomez Andreu et.al. 2505.08510 null
2025-05-13 TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset Olaf Wysocki et.al. 2505.07396 null
2025-05-12 Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild Lintao Xiang et.al. 2505.07373 null
2025-05-11 NeuGen: Amplifying the ‘Neural’ in Neural Radiance Fields for Domain Generalization Ahmed Qazi et.al. 2505.06894 null
2025-05-10 3D Characterization of Smoke Plume Dispersion Using Multi-View Drone Swarm Nikil Krishnakumar et.al. 2505.06638 null
2025-05-10 FlexNeRFer: A Multi-Dataflow, Adaptive Sparsity-Aware Accelerator for On-Device NeRF Rendering Seock-Hwan Noh et.al. 2505.06504 null
2025-05-08 3D Scene Generation: A Survey Beichen Wen et.al. 2505.05474 link
2025-05-04 HandOcc: NeRF-based Hand Rendering with Occupancy Networks Maksym Ivashechkin et.al. 2505.02079 null
2025-05-04 Learning Heterogeneous Mixture of Scene Experts for Large-scale Neural Radiance Fields Zhenxing Mi et.al. 2505.02005 link
2025-05-03 AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting Junhao Shi et.al. 2505.01799 null
2025-05-03 Unified Steganography via Implicit Neural Representation Qi Song et.al. 2505.01749 null
2025-04-30 A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond Jiajia Li et.al. 2505.00737 link
2025-05-01 Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation Feng Xue et.al. 2505.00378 null
2025-04-29 GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction Yuhan Xie et.al. 2504.21067 link
2025-04-29 Large-scale visual SLAM for in-the-wild videos Shuo Sun et.al. 2504.20496 null
2025-05-01 GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting Jongwon Lee et.al. 2504.20379 null
2025-04-29 Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views Jiang Wu et.al. 2504.20378 link
2025-04-28 Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video Hoang Chuong Nguyen et.al. 2504.19819 null
2025-04-27 Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users Apurv Varshney et.al. 2504.19345 null
2025-04-29 IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos Yuan Li et.al. 2504.19165 null
2025-04-28 RGS-DR: Reflective Gaussian Surfels with Deferred Rendering for Shiny Objects Georgios Kouros et.al. 2504.18468 null
2025-04-23 Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning Mingxuan Cui et.al. 2504.17815 link
2025-04-24 CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos Shucheng Gong et.al. 2504.17728 link
2025-04-23 Dual-Camera All-in-Focus Neural Radiance Fields Xianrui Luo et.al. 2504.16636 null
2025-04-23 Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks Murat Bilgehan Ertan et.al. 2504.16557 null
2025-04-23 SaENeRF: Suppressing Artifacts in Event-based Neural Radiance Fields Yuanjian Wang et.al. 2504.16389 link
2025-04-22 Pose Optimization for Autonomous Driving Datasets using Neural Rendering Models Quentin Herau et.al. 2504.15776 null
2025-04-21 StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians Cailin Zhuang et.al. 2504.15281 null
2025-04-18 Scaling LLaNA: Advancing NeRF-Language Understanding Through Large-Scale Training Andrea Amaduzzi et.al. 2504.13995 null
2025-04-21 SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM Samuel Cerezo et.al. 2504.13713 link
2025-04-16 BEV-GS: Feed-forward Gaussian Splatting in Bird’s-Eye-View for Road Reconstruction Wenhua Wu et.al. 2504.13207 null
2025-04-17 GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration Rendong Zhang et.al. 2504.12999 link
2025-04-16 R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors Haoyang Wang et.al. 2504.11946 null
2025-04-19 LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis Hao Sun et.al. 2504.10331 null
2025-04-14 MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling Yunpeng Tan et.al. 2504.09878 null
2025-04-14 NeRF-Based Transparent Object Grasping Enhanced by Shape Priors Yi Han et.al. 2504.09868 null
2025-04-11 HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields Asterios Reppas et.al. 2504.08901 null
2025-04-09 Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting Daiwei Zhang et.al. 2504.06978 null
2025-04-09 S-EO: A Large-Scale Dataset for Geometry-Aware Shadow Detection in Remote Sensing Applications Masquil Elías et.al. 2504.06920 null
2025-04-09 SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering Hanxiao Sun et.al. 2504.06815 link
2025-04-08 Meta-Continual Learning of Neural Fields Seungyoon Woo et.al. 2504.05806 null
2025-04-08 SE4Lip: Speech-Lip Encoder for Talking Head Synthesis to Solve Phoneme-Viseme Alignment Ambiguity Yihuan Huang et.al. 2504.05803 null
2025-04-08 InvNeRF-Seg: Fine-Tuning a Pre-Trained NeRF for 3D Object Segmentation Jiangsan Zhao et.al. 2504.05751 null
2025-04-07 DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal Wanzhou Liu et.al. 2504.04679 null
2025-04-06 Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models Etienne Chassaing et.al. 2504.04448 null
2025-04-04 NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices Zhe Wang et.al. 2504.03415 null
2025-04-03 MultiNeRF: Multiple Watermark Embedding for Neural Radiance Fields Yash Kulthe et.al. 2504.02517 null
2025-04-03 LPA3D: 3D Room-Level Scene Generation from In-the-Wild Images Ming-Jia Yang et.al. 2504.02337 null
2025-04-01 OccludeNeRF: Geometric-aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF Jingyu Shi et.al. 2504.02007 null
2025-04-02 Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis Niluthpol Chowdhury Mithun et.al. 2504.01960 null
2025-04-02 BOGausS: Better Optimized Gaussian Splatting Stéphane Pateux et.al. 2504.01844 null
2025-04-02 FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking Ulas Gunes et.al. 2504.01732 null
2025-04-02 RealityAvatar: Towards Realistic Loose Clothing Modeling in Animatable 3D Gaussian Avatars Yahui Li et.al. 2504.01559 null
2025-04-02 Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment Ziteng Cui et.al. 2504.01503 link
2025-04-01 Neural Pruning for 3D Scene Reconstruction: Efficient NeRF Acceleration Tianqi Ding et.al. 2504.00950 null
2025-04-01 NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds Mahan Rafidashti et.al. 2504.00859 null
2025-03-31 NeRF-Based defect detection Tianqi et.al. 2504.00270 null
2025-03-31 LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors Han Zhou et.al. 2504.00219 null
2025-03-31 ERUPT: Efficient Rendering with Unposed Patch Transformer Maxim V. Shugaev et.al. 2503.24374 null
2025-03-29 NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations Zhenyu Tang et.al. 2503.23162 null
2025-03-28 ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting Wenjie Liu et.al. 2503.22218 null
2025-03-27 NeRF-based Point Cloud Reconstruction using a Stationary Camera for Agricultural Applications Kibon Ku et.al. 2503.21958 null
2025-03-27 Refined Geometry-guided Head Avatar Reconstruction from Monocular RGB Video Pilseo Park et.al. 2503.21886 null
2025-03-27 HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM Ziren Gong et.al. 2503.21778 null
2025-04-01 RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting Qiyu Dai et.al. 2503.21442 null
2025-03-28 LandMarkSystem Technical Report Zhenxiang Ma et.al. 2503.21364 link
2025-03-27 UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation Yehui Shen et.al. 2503.21338 link
2025-03-25 CoMapGS: Covisibility Map-based Gaussian Splatting for Sparse Novel View Synthesis Youngkyoon Jang et.al. 2503.20998 null
2025-03-26 AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports Xiangwen Zhang et.al. 2503.20654 null
2025-03-26 EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis Sheng Miao et.al. 2503.20168 null
2025-03-25 Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals Zhirui Dai et.al. 2503.20066 null
2025-03-25 MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities Federico Lincetto et.al. 2503.19673 null
2025-03-24 NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting Yulong Zheng et.al. 2503.18794 null
2025-03-25 LookCloser: Frequency-aware Radiance Field for Tiny-Detail Scene Xiaoyu Zhang et.al. 2503.18513 null
2025-03-24 NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction Wenyuan Zhang et.al. 2503.18361 null
2025-03-23 End-to-End Implicit Neural Representations for Classification Alexander Gielisse et.al. 2503.18123 link
2025-03-23 Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving Junhao Ge et.al. 2503.18108 link
2025-03-23 PanopticSplatting: End-to-End Panoptic Gaussian Splatting Yuxuan Xie et.al. 2503.18073 null
2025-03-21 Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping Emanuele Giacomini et.al. 2503.17491 link
2025-03-21 FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields Kwan Yun et.al. 2503.17095 link
2025-03-21 DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery Jiadong Tang et.al. 2503.16964 null
2025-03-20 Digitally Prototype Your Eye Tracker: Simulating Hardware Performance using 3D Synthetic Data Esther Y. H. Lin et.al. 2503.16742 null
2025-03-20 Enhancing Close-up Novel View Synthesis via Pseudo-labeling Jiatong Xia et.al. 2503.15908 link
2025-03-19 SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints Weiwen Hu et.al. 2503.15712 null
2025-03-19 DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis Yuming Gu et.al. 2503.15667 link
2025-03-19 GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector Zechuan Li et.al. 2503.15211 null
2025-03-19 MultiBARF: Integrating Imagery of Different Wavelength Regions by Using Neural Radiance Fields Kana Kurata et.al. 2503.15070 null
2025-03-19 3D Engine-ready Photorealistic Avatars via Dynamic Textures Yifan Wang et.al. 2503.14943 null
2025-03-19 ClimateGS: Real-Time Climate Simulation with 3D Gaussian Style Transfer Yuezhen Xie et.al. 2503.14845 null
2025-03-18 Segmentation-Guided Neural Radiance Fields for Novel Street View Synthesis Yizhou Li et.al. 2503.14219 null
2025-03-17 Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios Iryna Repinetska et.al. 2503.13710 null
2025-03-17 TriDF: Triplane-Accelerated Density Fields for Few-Shot Remote Sensing Novel View Synthesis Jiaming Kang et.al. 2503.13347 null
2025-03-17 DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction Rui Wang et.al. 2503.13176 null
2025-03-17 DivCon-NeRF: Generating Augmented Rays with Diversity and Consistency for Few-shot View Synthesis Ingyun Lee et.al. 2503.12947 null
2025-03-15 FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields Rui Qian et.al. 2503.12086 null
2025-03-14 Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation Xianming Zeng et.al. 2503.11731 null
2025-03-13 Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations Xunzhi Zheng et.al. 2503.10464 null
2025-03-13 AI-assisted 3D Preservation and Reconstruction of Temple Arts Naai-Jung Shih et.al. 2503.10031 null
2025-03-12 Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation Máté Tóth et.al. 2503.09464 null
2025-03-11 GAS-NeRF: Geometry-Aware Stylization of Dynamic Radiance Fields Nhat Phuong Anh Vu et.al. 2503.08483 null
2025-03-17 Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios Zikang Yuan et.al. 2503.08317 null
2025-03-11 GigaSLAM: Large-Scale Monocular SLAM with Hierarchical Gaussian Splats Kai Deng et.al. 2503.08071 link
2025-03-11 NeRF-VIO: Map-Based Visual-Inertial Odometry with Initialization Leveraging Neural Radiance Fields Yanyu Zhang et.al. 2503.07952 null
2025-03-10 Neural Radiance and Gaze Fields for Visual Attention Modeling in 3D Environments Andrei Chubarau et.al. 2503.07828 null
2025-03-10 CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving Ziliang Xiong et.al. 2503.07425 null
2025-03-08 Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction Kai Li et.al. 2503.06161 null
2025-03-08 SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography Xuanyu Zhang et.al. 2503.06118 null
2025-03-08 NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features Hongjia Zhai et.al. 2503.06117 null
2025-03-06 Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering Idris O. Sunmola et.al. 2503.04079 null
2025-03-05 LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation Qian Feng et.al. 2503.03890 null
2025-03-04 Tracking-Aware Deformation Field Estimation for Non-rigid 3D Reconstruction in Robotic Surgeries Zeqing Wang et.al. 2503.02558 null
2025-03-04 2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting Qipeng Yan et.al. 2503.02452 null
2025-03-04 Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views Yingji Zhong et.al. 2503.02230 null
2025-03-04 Zero-Shot Sim-to-Real Visual Quadrotor Control with Hard Constraints Yan Miao et.al. 2503.02198 null
2025-03-03 Data Augmentation for NeRFs in the Low Data Limit Ayush Gaggar et.al. 2503.02092 null
2025-03-03 Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models Jay Zhangjie Wu et.al. 2503.01774 null
2025-03-05 Category-level Meta-learned NeRF Priors for Efficient Object Mapping Saad Ejaz et.al. 2503.01582 null
2025-03-03 LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training Kaimin Liao et.al. 2503.01199 link
2025-03-02 DreamPrinting: Volumetric Printing Primitives for High-Fidelity 3D Printing Youjia Wang et.al. 2503.00887 null
2025-03-01 Scalable Real2Sim: Physics-Aware Asset Generation Via Robotic Pick-and-Place Setups Nicholas Pfaff et.al. 2503.00370 link
2025-02-27 Identity-preserving Distillation Sampling by Fixed-Point Iterator SeonHwa Kim et.al. 2502.19930 null
2025-02-27 NeRFCom: Feature Transform Coding Meets Neural Radiance Field for Free-View 3D Scene Semantic Transmission Weijie Yue et.al. 2502.19873 null
2025-02-26 Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions Muhammad Salman Ali et.al. 2502.19457 null
2025-02-26 Does 3D Gaussian Splatting Need Accurate Volumetric Rendering? Adam Celarek et.al. 2502.19318 link
2025-02-26 The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields Ziyuan Luo et.al. 2502.19125 null
2025-02-24 Semantic Neural Radiance Fields for Multi-Date Satellite Data Valentin Wagner et.al. 2502.16992 link
2025-02-22 AquaNeRF: Neural Radiance Fields in Underwater Media with Distractor Removal Luca Gough et.al. 2502.16351 null
2025-02-22 DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation Yuxuan Xiong et.al. 2502.16302 null
2025-02-24 Para-Lane: Multi-Lane Dataset Registering Parallel Scans for Benchmarking Novel View Synthesis Ziqian Ni et.al. 2502.15635 null
2025-02-20 Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting Boying Li et.al. 2502.14931 null
2025-02-20 NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis Xiaoxing Liu et.al. 2502.14178 null
2025-02-19 GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian Bang Du et.al. 2502.14129 null
2025-02-18 Geometry-Aware Diffusion Models for Multiview Scene Inpainting Ahmad Salimi et.al. 2502.13335 null
2025-02-18 GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis Pedro Martin et.al. 2502.13196 null
2025-02-18 ROI-NeRFs: Hi-Fi Visualization of Objects of Interest within a Scene by NeRFs Composition Quoc-Anh Bui et.al. 2502.12673 null
2025-02-21 HumanGif: Single-View Human Diffusion with Generative Prior Shoukang Hu et.al. 2502.12080 link
2025-02-17 3D Gaussian Inpainting with Depth-Guided Cross-View Consistency Sheng-Yu Huang et.al. 2502.11801 null
2025-02-13 Embed Any NeRF: Graph Meta-Networks for Neural Tasks on Arbitrary NeRF Architectures Francesco Ballerini et.al. 2502.09623 null
2025-02-13 DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior Mingrui Li et.al. 2502.09111 null
2025-02-12 Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision Tianle Liu et.al. 2502.08352 null
2025-02-10 PrismAvatar: Real-time animated 3D neural head avatars on edge devices Prashant Raina et.al. 2502.07030 null
2025-02-10 Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC Siwei Meng et.al. 2502.07007 null
2025-02-08 GWRF: A Generalizable Wireless Radiance Field for Wireless Signal Propagation Modeling Kang Yang et.al. 2502.05708 null
2025-02-05 VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning Jayram Palamadai et.al. 2502.05222 null
2025-02-11 PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression Feifei Li et.al. 2502.04843 null
2025-02-04 SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification Yifu Tao et.al. 2502.02657 null
2025-02-04 MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning Shengbo Gu et.al. 2502.02372 null
2025-02-03 FourieRF: Few-Shot NeRFs via Progressive Fourier Frequency Control Diego Gomez et.al. 2502.01405 null
2025-01-31 VoD-3DGS: View-opacity-Dependent 3D Gaussian Splatting Mateusz Nowak et.al. 2501.17978 null
2025-01-28 LinPrim: Linear Primitives for Differentiable Volumetric Rendering Nicolas von Lützow et.al. 2501.16312 null
2025-01-24 SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation Yujian Liu et.al. 2501.14646 null
2025-02-05 GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting Junzhe Jiang et.al. 2501.13971 link
2025-01-23 VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM Gyuhyeon Pak et.al. 2501.13402 null
2025-01-22 Neural Radiance Fields for the Real World: A Survey Wenhui Xiao et.al. 2501.13104 null
2025-02-02 DWTNeRF: Boosting Few-shot Neural Radiance Fields via Discrete Wavelet Transform Hung Nguyen et.al. 2501.12637 null
2025-01-21 DNRSelect: Active Best View Selection for Deferred Neural Rendering Dongli Wu et.al. 2501.12150 null
2025-01-21 Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging Shuyi Hu et.al. 2501.11884 null
2025-01-16 Poxel: Voxel Reconstruction for 3D Printing Ruixiang Cao et.al. 2501.10474 null
2025-01-17 Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation Xiaoyun Zheng et.al. 2501.09947 link
2025-01-16 Normal-NeRF: Ambiguity-Robust Normal Estimation for Highly Reflective Scenes Ji Shi et.al. 2501.09460 link
2025-01-15 SLC$^2$-SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM Yuhang Ming et.al. 2501.08880 null
2025-01-14 VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes Ke Wu et.al. 2501.08286 null
2025-01-13 Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes Yuhang Zhang et.al. 2501.08072 null
2025-01-14 SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting Yue Hu et.al. 2501.07015 null
2025-01-12 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927 link
2025-01-12 ActiveGAMER: Active GAussian Mapping through Efficient Rendering Liyan Chen et.al. 2501.06897 null
2025-01-17 SuperNeRF-GAN: A Universal 3D-Consistent Super-Resolution Framework for Efficient and Enhanced 3D-Aware Image Synthesis Peng Zheng et.al. 2501.06770 null
2025-01-11 NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes without References Qiang Qu et.al. 2501.06488 link
2025-01-10 UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping Yanjie Li et.al. 2501.05783 null
2025-01-13 Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes Ludwic Leonard et.al. 2501.05226 link
2025-01-07 NeRFs are Mirror Detectors: Using Structural Similarity for Multi-View Mirror Scene Reconstruction with 3D Surface Primitives Leif Van Holland et.al. 2501.04074 link
2025-01-07 NeuralSVG: An Implicit Representation for Text-to-Vector Generation Sagi Polaczek et.al. 2501.03992 null
2025-01-07 DehazeGS: Seeing Through Fog with 3D Gaussian Splatting Jinze Yu et.al. 2501.03659 null
2025-01-07 ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting Yifeng Yang et.al. 2501.03605 link
2025-01-07 AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scene Chaoran Feng et.al. 2501.02807 null
2024-12-29 Bringing Objects to Life: 4D generation from 3D objects Ohad Rahamim et.al. 2412.20422 null
2024-12-27 Learning Radiance Fields from a Single Snapshot Compressive Image Yunhao Li et.al. 2412.19483 null
2025-01-05 BeSplat: Gaussian Splatting from a Single Blurry Image and Event Stream Gopi Raju Matta et.al. 2412.19370 link
2024-12-26 Generating Editable Head Avatars with 3D Gaussian GANs Guohao Li et.al. 2412.19149 link
2024-12-26 MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo Byeonggwon Lee et.al. 2412.19130 null
2024-12-26 Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos Changwoon Choi et.al. 2412.19089 null
2024-12-23 Editing Implicit and Explicit Representations of Radiance Fields: A Survey Arthur Hubert et.al. 2412.17628 null
2024-12-23 Exploring Dynamic Novel View Synthesis Technologies for Cinematography Adrian Azzarelli et.al. 2412.17532 null
2024-12-21 LUCES-MV: A Multi-View Dataset for Near-Field Point Light Source Photometric Stereo Fotios Logothetis et.al. 2412.16737 null
2024-12-20 NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems Laura Weihl et.al. 2412.16141 null
2024-12-20 NeuroPump: Simultaneous Geometric and Color Rectification for Underwater Images Yue Guo et.al. 2412.15890 null
2024-12-19 LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction Pou-Chun Kung et.al. 2412.15447 null
2024-12-18 DreaMark: Rooting Watermark in Score Distillation Sampling Generated Neural Radiance Fields Xingyu Zhu et.al. 2412.15278 null
2024-12-19 GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting Qianpu Sun et.al. 2412.14579 null
2024-12-19 Bright-NeRF: Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images Min Wang et.al. 2412.14547 null
2024-12-18 GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians Xiaobao Wei et.al. 2412.13983 link
2024-12-17 EOGS: Gaussian Splatting for Earth Observation Luca Savant Aira et.al. 2412.13047 null
2024-12-18 Optimize the Unseen – Fast NeRF Cleanup with Free Space Prior Leo Segre et.al. 2412.12772 null
2024-12-17 Towards a Training Free Approach for 3D Scene Editing Vivek Madhavaram et.al. 2412.12766 null
2024-12-16 GS-ProCams: Gaussian Splatting-based Projector-Camera Systems Qingyue Deng et.al. 2412.11762 null
2024-12-18 Sequence Matters: Harnessing Video Models in 3D Super-Resolution Hyun-kyu Ko et.al. 2412.11525 null
2024-12-16 VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression Qiang Hu et.al. 2412.11362 null
2024-12-13 NeRF-Texture: Synthesizing Neural Radiance Field Textures Yi-Hua Huang et.al. 2412.10004 null
2024-12-13 Sharpening Your Density Fields: Spiking Neuron Aided Fast Geometry Learning Yi Gu et.al. 2412.09881 null
2024-12-12 PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields Sean Wu et.al. 2412.09680 link
2024-12-11 GN-FR: Generalizable Neural Radiance Fields for Flare Removal Gopi Raju Matta et.al. 2412.08200 null
2024-12-11 NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods Qiang Qu et.al. 2412.08029 link
2024-12-10 EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering Toshiya Yura et.al. 2412.07293 null
2024-12-09 Diffusing Differentiable Representations Yash Savani et.al. 2412.06981 null
2024-12-09 Dynamic EventNeRF: Reconstructing General Dynamic Scenes from Multi-view Event Cameras Viktor Rudnev et.al. 2412.06770 null
2024-12-09 Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video Renlong Wu et.al. 2412.06424 link
2024-12-09 Splatter-360: Generalizable 360$^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images Zheng Chen et.al. 2412.06250 link
2024-12-07 WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking Yuqi Tan et.al. 2412.05695 null
2024-12-06 Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories Susung Hong et.al. 2412.05279 null
2024-12-11 MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting Peng Chen et.al. 2412.04955 link
2024-12-04 NeRF and Gaussian Splatting SLAM in the Wild Fabian Schmidt et.al. 2412.03263 link
2024-12-01 SAGA: Surface-Aligned Gaussian Avatar Ronghan Chen et.al. 2412.00845 null
2024-12-01 CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images Jian Liu et.al. 2412.00754 null
2024-11-30 Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives Alex Hanson et.al. 2412.00578 link
2024-11-30 Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects Amir Barda et.al. 2412.00518 null
2024-11-29 $C^{3}$-NeRF: Modeling Multiple Scenes via Conditional-cum-Continual Neural Radiance Fields Prajwal Singh et.al. 2411.19903 null
2024-11-29 Gaussian Splashing: Direct Volumetric Rendering Underwater Nir Mualem et.al. 2411.19588 null
2024-11-29 ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration Chaojun Ni et.al. 2411.19548 null
2024-11-29 LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis Tianqi Li et.al. 2411.19525 null
2024-11-28 SAMa: Material-aware 3D Selection and Segmentation Michael Fischer et.al. 2411.19322 null
2024-11-27 Surf-NeRF: Surface Regularised Neural Radiance Fields Jack Naylor et.al. 2411.18652 null
2024-11-26 MLI-NeRF: Multi-Light Intrinsic-Aware Neural Radiance Fields Yixiong Yang et.al. 2411.17235 link
2024-11-25 The Radiance of Neural Fields: Democratizing Photorealistic and Dynamic Robotic Simulation Georgina Nuthall et.al. 2411.16940 null
2024-11-27 SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving Georg Hess et.al. 2411.16816 link
2024-11-25 Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction Ziyu Zhang et.al. 2411.16392 null
2024-11-25 U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields Vinayak Gupta et.al. 2411.16172 null
2024-11-24 ZeroGS: Training 3D Gaussian Splatting from Unposed Images Yu Chen et.al. 2411.15779 null
2024-11-24 GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision Xu Baixin et.al. 2411.15723 link
2024-11-23 NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation Menglin Zhang et.al. 2411.15551 null
2024-11-23 SplatSDF: Boosting Neural Implicit SDF via Gaussian Splatting Fusion Runfa Blark Li et.al. 2411.15468 null
2024-11-20 Sparse Input View Synthesis: 3D Representations and Reliable Priors Nagabhushan Somraj et.al. 2411.13631 null
2024-11-20 Robust SG-NeRF: Robust Scene Graph Aided Neural Surface Reconstruction Yi Gu et.al. 2411.13620 null
2024-11-20 GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting Xiaobao Wei et.al. 2411.12981 null
2024-11-25 SCIGS: 3D Gaussians Splatting from a Snapshot Compressive Image Zixu Wang et.al. 2411.12471 null
2024-11-19 GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving Shaoqing Xu et.al. 2411.12452 link
2024-11-18 Towards Degradation-Robust Reconstruction in Generalizable NeRF Chan Ho Park et.al. 2411.11691 null
2024-11-18 LeC$^2$O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes Zhenxing Mi et.al. 2411.11374 null
2024-11-15 The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods Yifu Tao et.al. 2411.10546 null
2024-11-15 USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting Kang Chen et.al. 2411.10504 link
2024-11-15 GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization Yanhao Sun et.al. 2411.10033 null
2024-11-22 BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis David Svitov et.al. 2411.08508 link
2024-11-13 Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model Yutao Shen et.al. 2411.08453 null
2024-11-13 MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation Peng Wang et.al. 2411.08279 link
2024-11-12 TomoGRAF: A Robust and Generalizable Reconstruction Network for Single-View Computed Tomography Di Xu et.al. 2411.08158 null
2024-11-12 Material Transforms from Disentangled NeRF Representations Ivan Lopes et.al. 2411.08037 link
2024-11-11 LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes Zefan Qu et.al. 2411.06757 link
2024-11-10 Through the Curved Cover: Synthesizing Cover Aberrated Scenes with Refractive Field Liuyue Xie et.al. 2411.06365 null
2024-11-09 AI-Driven Stylization of 3D Environments Yuanbo Chen et.al. 2411.06067 null
2024-11-08 A Nerf-Based Color Consistency Method for Remote Sensing Images Zongcheng Zuo et.al. 2411.05557 null
2024-11-08 Rate-aware Compression for NeRF-based Volumetric Video Zhiyu Zhang et.al. 2411.05322 null
2024-11-07 Planar Reflection-Aware Neural Radiance Fields Chen Gao et.al. 2411.04984 null
2024-11-07 GANESH: Generalizable NeRF for Lensless Imaging Rakesh Raj Madavan et.al. 2411.04810 null
2024-11-08 SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation Xun Tu et.al. 2411.04386 null
2024-11-06 Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis Rui Peng et.al. 2411.03637 link
2024-11-05 Enhancing Exploratory Capability of Visual Navigation Using Uncertainty of Implicit Scene Representation Yichen Wang et.al. 2411.03487 link
2024-11-05 CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval Xin Wen et.al. 2411.02979 null
2024-11-05 Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery Liv Kåreborn et.al. 2411.02972 null
2024-11-05 Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation Xavier Timoneda et.al. 2411.02969 null
2024-11-04 NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields Eric Zhu et.al. 2411.02482 null
2024-11-05 FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training Ruihong Yin et.al. 2411.02229 null
2024-11-06 GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes Gaochao Song et.al. 2411.01853 null
2024-11-04 A Probabilistic Formulation of LiDAR Mapping with Neural Radiance Fields Matthew McDermott et.al. 2411.01725 link
2024-11-01 ZIM: Zero-Shot Image Matting for Anything Beomyoung Kim et.al. 2411.00626 link
2024-10-31 Scaled Inverse Graphics: Efficiently Learning Large Sets of 3D Scenes Karim Kassab et.al. 2410.23742 null
2024-10-31 Get a Grip: Multi-Finger Grasp Evaluation at Scale Enables Robust Sim-to-Real Transfer Tyler Ga Wei Lum et.al. 2410.23701 null
2024-10-31 XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM Xiaomeng Wang et.al. 2410.23690 link
2024-10-30 Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder Antoine Schnepf et.al. 2410.22936 null
2024-10-28 MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps Yating Xu et.al. 2410.21566 link
2024-10-29 EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior Xin Xiang et.al. 2410.20981 null
2024-10-28 ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings Suyoung Lee et.al. 2410.20686 link
2024-10-27 GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields Yusuke Sekikawa et.al. 2410.20306 null
2024-10-25 Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization Weihang Liu et.al. 2410.19483 link
2024-10-25 Evaluation of strategies for efficient rate-distortion NeRF streaming Pedro Martin et.al. 2410.19459 null
2024-10-27 Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis Liang Han et.al. 2410.18822 null
2024-10-24 Real-time 3D-aware Portrait Video Relighting Ziqi Cai et.al. 2410.18355 link
2024-10-22 Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies Shrey Vishen et.al. 2410.18137 link
2024-10-23 VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points Linus Franke et.al. 2410.17932 null
2024-10-23 Few-shot NeRF by Adaptive Rendering Loss Regularization Qingshan Xu et.al. 2410.17839 null
2024-10-23 Efficient Neural Implicit Representation for 3D Human Reconstruction Zexu Huang et.al. 2410.17741 link
2024-10-23 PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting Yu Wang et.al. 2410.17505 null
2024-10-22 LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias Haian Jin et.al. 2410.17242 null
2024-10-18 GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting Yusen Xie et.al. 2410.17084 null
2024-10-22 E-3DGS: Gaussian Splatting with Exposure and Motion Events Xiaoting Yin et.al. 2410.16995 link
2024-10-21 Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions Malte Prinzler et.al. 2410.16395 null
2024-10-21 FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors Chin-Yang Lin et.al. 2410.16271 null
2024-10-22 EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting Bohao Liao et.al. 2410.15392 null
2024-10-19 Neural Radiance Field Image Refinement through End-to-End Sampling Point Optimization Kazuhiro Ohta et.al. 2410.14958 null
2024-10-18 Learning autonomous driving from aerial imagery Varun Murali et.al. 2410.14177 null
2024-10-18 DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction Ange Lou et.al. 2410.14169 null
2024-10-17 DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering Jiahao Lu et.al. 2410.13607 link
2024-10-21 DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation Guosheng Zhao et.al. 2410.13571 null
2024-10-17 Object Pose Estimation Using Implicit Representation For Transparent Objects Varun Burde et.al. 2410.13465 null
2024-10-17 GlossyGS: Inverse Rendering of Glossy Objects with 3D Gaussian Splatting Shuichang Lai et.al. 2410.13349 null
2024-10-16 3D Gaussian Splatting in Robotics: A Survey Siting Zhu et.al. 2410.12262 link
2024-10-16 EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior for Sparse View Zhaorong Wang et.al. 2410.12242 null
2024-10-14 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications Eduardo R. Corral-Soto et.al. 2410.10782 null
2024-10-14 NeRF-enabled Analysis-Through-Synthesis for ISAR Imaging of Small Everyday Objects with Sparse and Noisy UWB Radar Data Md Farhan Tasnim Oshim et.al. 2410.10085 null
2024-10-13 Magnituder Layers for Implicit Neural Representations in 3D Sang Min Kim et.al. 2410.09771 null
2024-10-12 Improving 3D Finger Traits Recognition via Generalizable Neural Rendering Hongbin Xu et.al. 2410.09582 null
2024-10-11 SceneCraft: Layout-Guided 3D Scene Generation Xiuyu Yang et.al. 2410.09049 link
2024-10-11 MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering Jaehoon Choi et.al. 2410.08941 null
2024-10-11 Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints Yicheng He et.al. 2410.08780 null
2024-10-10 RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image Xiaoxue Chen et.al. 2410.08181 null
2024-10-10 IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera Jian Huang et.al. 2410.08107 link
2024-10-11 NeRF-Accelerated Ecological Monitoring in Mixed-Evergreen Redwood Forest Adam Korycki et.al. 2410.07418 link
2024-10-09 DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation Zhiqi Li et.al. 2410.06756 null
2024-10-09 MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes Zhenhui Ye et.al. 2410.06734 null
2024-10-09 3D Representation Methods: A Survey Zhengren Wang et.al. 2410.06475 null
2024-10-08 Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters Guoji Tian et.al. 2410.05772 null
2024-10-07 Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors Ziwei Liao et.al. 2410.05514 link
2024-10-07 PH-Dropout: Practical Epistemic Uncertainty Quantification for View Synthesis Chuanhao Sun et.al. 2410.05468 link
2024-10-07 LiDAR-GS: Real-time LiDAR Re-Simulation using Gaussian Splatting Qifeng Chen et.al. 2410.05111 null
2024-10-07 6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering Zhongpai Gao et.al. 2410.04974 null
2024-10-07 TeX-NeRF: Neural Radiance Fields from Pseudo-TeX Vision Chonghao Zhong et.al. 2410.04873 null
2024-10-06 Deformable NeRF using Recursively Subdivided Tetrahedra Zherui Qiu et.al. 2410.04402 null
2024-10-05 Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy Pengcheng Chen et.al. 2410.04041 null
2024-10-02 MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis Xiaobiao Du et.al. 2410.02103 link
2024-10-03 EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis Alexander Mai et.al. 2410.01804 null
2024-10-02 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection Yang Cao et.al. 2410.01647 link
2024-10-02 Gaussian Splatting in Mirrors: Reflection-Aware Rendering via Virtual Camera Optimization Zihan Wang et.al. 2410.01614 link
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404 null
2024-10-01 GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer Youngho Yoon et.al. 2410.00672 link
2024-09-30 Distributed NeRF Learning for Collaborative Multi-Robot Perception Hongrui Zhao et.al. 2409.20289 null
2024-09-30 Active Neural Mapping at Scale Zijia Kuang et.al. 2409.20276 null
2024-09-30 OPONeRF: One-Point-One NeRF for Robust Neural Rendering Yu Zheng et.al. 2409.20043 link
2024-09-28 G3R: Gradient Guided Generalizable Reconstruction Yun Chen et.al. 2409.19405 null
2024-09-26 LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field Huan Wang et.al. 2409.18057 link
2024-09-26 Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions Weng Fei Low et.al. 2409.17988 null
2024-09-26 Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry Qi Zhang et.al. 2409.17729 null
2024-09-26 TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene Sandika Biswas et.al. 2409.17459 link
2024-09-25 SeaSplat: Representing Underwater Scenes with 3D Gaussian Splatting and a Physically Grounded Image Formation Model Daniel Yang et.al. 2409.17345 null
2024-09-25 TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans Aggelina Chatziagapi et.al. 2409.16666 null
2024-09-26 Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities Peizhi Yan et.al. 2409.16147 link
2024-09-24 Disentangled Generation and Aggregation for Robust Radiance Fields Shihe Shen et.al. 2409.15715 null
2024-09-24 Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB Jae Yong Lee et.al. 2409.15689 null
2024-09-23 AgriNeRF: Neural Radiance Fields for Agriculture in Challenging Lighting Conditions Samarth Chopra et.al. 2409.15487 null
2024-09-22 MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views Wangze Xu et.al. 2409.14316 null
2024-09-21 MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors Zhenhua Du et.al. 2409.14019 null
2024-09-19 CrossRT: A cross platform programming technology for hardware-accelerated ray tracing in CG and CV applications Vladimir Frolov et.al. 2409.12617 null
2024-09-18 JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation Sai Tanmay Reddy Chakkera et.al. 2409.12156 null
2024-09-25 BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling Lulin Zhang et.al. 2409.12014 link
2024-09-17 RenderWorld: World Model with Self-Supervised 3D Label Ziyang Yan et.al. 2409.11356 null
2024-09-21 HGSLoc: 3DGS-based Heuristic Camera Pose Refinement Zhongyan Niu et.al. 2409.10925 null
2024-09-16 Baking Relightable NeRF for Real-time Direct/Indirect Illumination Rendering Euntae Choi et.al. 2409.10327 null
2024-09-16 DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments Mahmud A. Mohamad et.al. 2409.10041 link
2024-09-15 NARF24: Estimating Articulated Object Structure for Implicit Rendering Stanley Lewis et.al. 2409.09829 null
2024-09-12 DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors Thomas Hanwen Zhu et.al. 2409.08278 null
2024-09-11 DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation Haibo Yang et.al. 2409.07454 null
2024-09-11 ThermalGaussian: Thermal 3D Gaussian Splatting Rongfeng Lu et.al. 2409.07200 link
2024-09-10 LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation Archana Swaminathan et.al. 2409.06703 null
2024-09-10 Sources of Uncertainty in 3D Scene Reconstruction Marcus Klasson et.al. 2409.06407 link
2024-09-09 LSE-NeRF: Learning Sensor Modeling Errors for Deblurred Neural Radiance Fields with RGB-Event Stereo Wei Zhi Tang et.al. 2409.06104 link
2024-09-09 G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis Lutao Jiang et.al. 2409.05617 null
2024-09-09 From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models Tessa Pulli et.al. 2409.05413 null
2024-09-09 KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction Davide Di Nucci et.al. 2409.05407 null
2024-09-09 Lagrangian Hashing for Compressed Neural Field Representations Shrisudhan Govindarajan et.al. 2409.05334 null
2024-09-09 Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems Jianheng Liu et.al. 2409.05310 null
2024-09-06 SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields Yuze Wang et.al. 2409.04482 null
2024-09-05 Weight Conditioning for Smooth Optimization of Neural Networks Hemanth Saratchandran et.al. 2409.03424 null
2024-09-05 Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction Shen Chen et.al. 2409.03213 null
2024-09-04 UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views Jiaxin Guo et.al. 2409.02917 link
2024-09-03 GraspSplats: Efficient Manipulation with 3D Feature Splatting Mazeyu Ji et.al. 2409.02084 null
2024-09-03 $S^2$NeRF: Privacy-preserving Training Framework for NeRF Bokang Zhang et.al. 2409.01661 link
2024-08-30 ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images Xiaoshuai Zhang et.al. 2408.17027 null
2024-08-29 GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content Lebin Zhou et.al. 2408.16866 null
2024-09-01 Generic Objects as Pose Probes for Few-Shot View Synthesis Zhirui Gao et.al. 2408.16690 null
2024-08-29 Spurfies: Sparse Surface Reconstruction using Local Geometry Priors Kevin Raj et.al. 2408.16544 null
2024-08-29 NeRF-CA: Dynamic Reconstruction of X-ray Coronary Angiography with Extremely Sparse-views Kirsten W. H. Maas et.al. 2408.16355 link
2024-08-28 Towards Realistic Example-based Modeling via 3D Gaussian Stitching Xinyu Gao et.al. 2408.15708 null
2024-08-27 Learning-based Multi-View Stereo: A Survey Fangjinhua Wang et.al. 2408.15235 null
2024-08-27 GeoTransfer: Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning Shubhendu Jena et.al. 2408.14724 null
2024-08-28 FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry Chunran Zheng et.al. 2408.14035 link
2024-08-25 TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers Chuanrui Zhang et.al. 2408.13770 null
2024-08-24 G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles Adil Meric et.al. 2408.13508 null
2024-08-23 SIn-NeRF2NeRF: Editing 3D Scenes with Instructions through Segmentation and Inpainting Jiseung Hong et.al. 2408.13285 link
2024-08-21 Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations Lintong Zhang et.al. 2408.11966 null
2024-08-21 Irregularity Inspection using Neural Radiance Field Tianqi Ding et.al. 2408.11251 null
2024-08-20 GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting Changkun Liu et.al. 2408.11085 link
2024-08-20 Learning Part-aware 3D Representations by Fusing 2D Gaussians and Superquadrics Zhirui Gao et.al. 2408.10789 null
2024-08-20 TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks Jinjie Mai et.al. 2408.10739 null
2024-08-19 $R^2$-Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement Haoyang Wang et.al. 2408.10135 null
2024-08-19 DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery Corentin Dumery et.al. 2408.09928 null
2024-08-20 CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning Haoyu Zhao et.al. 2408.09663 null
2024-08-18 $S^3$D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis Dongze Li et.al. 2408.09347 null
2024-08-17 SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation Xiao Cao et.al. 2408.09144 null
2024-08-17 HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction Xiao Zhao et.al. 2408.09104 null
2024-08-16 VF-NeRF: Learning Neural Vector Fields for Indoor Scene Reconstruction Albert Gassol Puigjaner et.al. 2408.08766 link
2024-08-15 WaterSplatting: Fast Underwater 3D Scene Reconstruction Using Gaussian Splatting Huapeng Li et.al. 2408.08206 null
2024-08-18 Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space Hyunjee Lee et.al. 2408.07416 null
2024-08-13 Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture Yu Feng et.al. 2408.06608 null
2024-08-13 ActiveNeRF: Learning Accurate 3D Geometry by Active Pattern Projection Jianyu Tao et.al. 2408.06592 link
2024-08-13 HDRGS: High Dynamic Range Gaussian Splatting Jiahao Wu et.al. 2408.06543 link
2024-08-12 Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering Jiameng Li et.al. 2408.06286 link
2024-08-12 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) Jaydeep Rade et.al. 2408.06244 null
2024-08-10 Radiance Field Learners As UAV First-Person Viewers Liqi Yan et.al. 2408.05533 null
2024-08-09 DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow Hangyu Li et.al. 2408.05008 null
2024-08-09 FewShotNeRF: Meta-Learning-based Novel View Synthesis for Rapid Scene-Specific Adaptation Piraveen Sivakumar et.al. 2408.04803 null
2024-08-06 LumiGauss: High-Fidelity Outdoor Relighting with 2D Gaussian Splatting Joanna Kaleta et.al. 2408.04474 link
2024-08-08 A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery Mengya Xu et.al. 2408.04426 link
2024-08-08 Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods Yiming Zhou et.al. 2408.04268 null
2024-08-07 Goal-oriented Semantic Communication for the Metaverse Application Zhe Wang et.al. 2408.03646 null
2024-08-06 RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis Hugo Blanc et.al. 2408.03356 null
2024-08-06 Efficient NeRF Optimization – Not All Samples Remain Equally Hard Juuso Korhonen et.al. 2408.03193 null
2024-08-06 MGFs: Masked Gaussian Fields for Meshing Building based on Multi-View Images Tengfei Wang et.al. 2408.03060 null
2024-08-04 PanicleNeRF: low-cost, high-precision in-field phenotyping of rice panicles with smartphone Xin Yang et.al. 2408.02053 null
2024-08-03 FBINeRF: Feature-Based Integrated Recurrent Network for Pinhole and Fisheye Neural Radiance Fields Yifan Wu et.al. 2408.01878 null
2024-08-03 E$^3$NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images Yunshan Qi et.al. 2408.01840 null
2024-08-02 NeRFoot: Robot-Footprint Estimation for Image-Based Visual Servoing Daoxin Zhong et.al. 2408.01251 null
2024-08-05 UlRe-NeRF: 3D Ultrasound Imaging through Neural Rendering with Ultrasound Reflection Direction Parameterization Ziwen Guo et.al. 2408.00860 null
2024-07-31 StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization Kaiyuan Tang et.al. 2408.00150 null
2024-07-22 PAV: Personalized Head Avatar from Unstructured Video Collection Akin Caliskan et.al. 2407.21047 null
2024-07-30 Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering Yanpeng Zhao et.al. 2407.20908 link
2024-07-29 Radiance Fields for Robotic Teleoperation Maximum Wilder-Smith et.al. 2407.20194 link
2024-07-29 Garment Animation NeRF with Color Editing Renke Wang et.al. 2407.19774 link
2024-07-27 Revisit Self-supervised Depth Estimation with Local Structure-from-Motion Shengjie Zhu et.al. 2407.19166 null
2024-07-26 IOVS4NeRF: Incremental Optimal View Selection for Large-Scale NeRFs Jingpeng Xie et.al. 2407.18611 null
2024-07-24 SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency Yiming Xie et.al. 2407.17470 null
2024-07-23 HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images Shreyas Singh et.al. 2407.16503 link
2024-07-23 DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors Zizheng Yan et.al. 2407.16260 null
2024-07-22 BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes Chih-Hai Su et.al. 2407.15848 null
2024-07-22 Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures Ruizhe Wang et.al. 2407.15435 null
2024-07-19 HOTS3D: Hyper-Spherical Optimal Transport for Semantic Alignment of Text-to-3D Generation Zezeng Li et.al. 2407.14419 null
2024-07-19 DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays Zongyuan Yang et.al. 2407.14053 null
2024-07-19 Semantic Communications for 3D Human Face Transmission with Neural Radiance Fields Guanlin Wu et.al. 2407.13992 null
2024-07-18 EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting Yuchen Weng et.al. 2407.13520 null
2024-07-18 GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields Xiufeng Huang et.al. 2407.13390 null
2024-07-18 KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter Yifan Zhan et.al. 2407.13185 null
2024-07-17 Generalizable Human Gaussians for Sparse View Synthesis Youngjoong Kwon et.al. 2407.12777 link
2024-07-17 SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization Yiyang Chen et.al. 2407.12667 link
2024-07-17 InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction Xulong Wang et.al. 2407.12661 link
2024-07-17 Invertible Neural Warp for NeRF Shin-Fang Chng et.al. 2407.12354 null
2024-07-17 Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections Congrong Xu et.al. 2407.12306 null
2024-07-18 Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling Jaehyeok Kim et.al. 2407.11962 null
2024-07-18 IPA-NeRF: Illusory Poisoning Attack Against Neural Radiance Fields Wenxiang Jiang et.al. 2407.11921 link
2024-07-16 DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation Jiwook Kim et.al. 2407.11394 link
2024-07-15 Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method Adam Korycki et.al. 2407.11238 null
2024-07-15 AirNeRF: 3D Reconstruction of Human with Drone and NeRF for Future Communication Systems Alexey Kotcov et.al. 2407.10865 null
2024-07-15 Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis Antoine Legrand et.al. 2407.10762 null
2024-07-15 IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild Shuaixian Wang et.al. 2407.10695 null
2024-07-15 NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis Yubin Hu et.al. 2407.10482 null
2024-07-15 Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering Francesco Di Sario et.al. 2407.10389 null
2024-07-14 RS-NeRF: Neural Radiance Fields from Rolling Shutter Images Muyao Niu et.al. 2407.10267 link
2024-07-14 SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion Jiyuan Zhang et.al. 2407.10062 null
2024-07-12 Physics-Informed Learning of Characteristic Trajectories for Smoke Reconstruction Yiming Wang et.al. 2407.09679 link
2024-07-12 Radiance Fields from Photons Sacha Jungerman et.al. 2407.09386 null
2024-07-12 HPC: Hierarchical Progressive Coding Framework for Volumetric Video Zihan Zheng et.al. 2407.09026 null
2024-07-11 Feasibility of Neural Radiance Fields for Crime Scene Video Reconstruction Shariq Nadeem Malik et.al. 2407.08795 null
2024-07-11 WildGaussians: 3D Gaussian Splatting in the Wild Jonas Kulhanek et.al. 2407.08447 link
2024-07-11 MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos Yushuo Chen et.al. 2407.08414 link
2024-07-11 Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression Yuke Xing et.al. 2407.08165 null
2024-07-11 Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields Haojie Lian et.al. 2407.08154 null
2024-07-11 Survey on Fundamental Deep Learning 3D Reconstruction Techniques Yonge Bai et.al. 2407.08137 null
2024-07-10 Protecting NeRFs’ Copyright via Plug-And-Play Watermarking Base Model Qi Song et.al. 2407.07735 null
2024-07-10 Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field Ganlin Yang et.al. 2407.07461 null
2024-07-09 Reference-based Controllable Scene Stylization with Gaussian Splatting Yiqun Mei et.al. 2407.07220 null
2024-07-09 Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View Dogyoon Lee et.al. 2407.06613 null
2024-07-08 RRM: Relightable assets using Radiance guided Material extraction Diego Gomez et.al. 2407.06397 null
2024-07-08 PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes Mohammad Reza Karimi Dastjerdi et.al. 2407.06150 null
2024-07-08 Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views Jiawei Guo et.al. 2407.05666 null
2024-07-08 GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields Weiyi Xue et.al. 2407.05597 null
2024-07-08 Dynamic Neural Radiance Field From Defocused Monocular Video Xianrui Luo et.al. 2407.05586 null
2024-07-07 GaussReg: Fast 3D Registration with Gaussian Splatting Jiahao Chang et.al. 2407.05254 null
2024-07-06 SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction Weixing Xie et.al. 2407.05023 link
2024-07-04 CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images Junghe Lee et.al. 2407.03923 null
2024-07-02 MomentsNeRF: Leveraging Orthogonal Moments for Few-Shot Neural Rendering Ahmad AlMughrabi et.al. 2407.02668 null
2024-07-03 BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream Wenpu Li et.al. 2407.02174 link
2024-07-01 Active Human Pose Estimation via an Autonomous UAV Agent Jingxi Chen et.al. 2407.01811 null
2024-07-01 DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction Yujin Ham et.al. 2407.01761 null
2024-07-01 Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation Zihan Gao et.al. 2407.01220 link
2024-06-29 Intrinsic PAPR for Point-level 3D Scene Albedo and Shading Editing Alireza Moazeni et.al. 2407.00500 null
2024-06-28 ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction Ding-Jiun Huang et.al. 2406.20066 null
2024-06-28 EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting Daiwei Zhang et.al. 2406.19811 null
2024-06-27 Shorter SPECT Scans Using Self-supervised Coordinate Learning to Synthesize Skipped Projection Views Zongyu Li et.al. 2406.18840 null
2024-06-25 Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes Qi Ma et.al. 2406.17438 link
2024-06-25 NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods Jonas Kulhanek et.al. 2406.17345 null
2024-06-24 From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking Xiaohao Xu et.al. 2406.16850 link
2024-06-24 Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis Jianning Deng et.al. 2406.16623 null
2024-06-24 Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction Tong Qin et.al. 2406.16289 null
2024-06-23 Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study Zhe Wang et.al. 2406.16068 null
2024-06-23 Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction Yangdi Lu et.al. 2406.15982 null
2024-06-22 psPRF: Pansharpening Planar Neural Radiance Field for Generalized 3D Reconstruction Satellite Imagery Tongtong Zhang et.al. 2406.15707 null
2024-06-21 A3D: Does Diffusion Dream about 3D Alignment? Savva Ignatyev et.al. 2406.15020 null
2024-06-21 E2GS: Event Enhanced Gaussian Splatting Hiroyuki Deguchi et.al. 2406.14978 link
2024-06-21 Relighting Scenes with Object Insertions in Neural Radiance Fields Xuening Zhu et.al. 2406.14806 null
2024-06-20 Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment Yunshan Qi et.al. 2406.14360 null
2024-06-19 NeRF-Feat: 6D Object Pose Estimation using Feature Rendering Shishir Reddy Vutukur et.al. 2406.13796 null
2024-06-19 Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images Haruo Fujiwara et.al. 2406.13393 null
2024-06-19 Freq-Mip-AA: Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields Youngin Park et.al. 2406.13251 link
2024-06-18 Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models Paul Henderson et.al. 2406.13099 null
2024-06-18 Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings Ruijie Tang et.al. 2406.13048 null
2024-06-18 Fast Global Localization on Neural Radiance Field Mangyu Kong et.al. 2406.12202 link
2024-06-20 TutteNet: Injective 3D Deformations by Composition of 2D Mesh Deformations Bo Sun et.al. 2406.12121 null
2024-06-17 DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features Letian Wang et.al. 2406.12095 null
2024-06-17 Uncertainty modeling for fine-tuned implicit functions Anna Susmelj et.al. 2406.12082 null
2024-06-17 LLaNA: Large Language and NeRF Assistant Andrea Amaduzzi et.al. 2406.11840 null
2024-06-17 Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization Huaiji Zhou et.al. 2406.11766 null
2024-06-17 InterNeRF: Scaling Radiance Fields via Parameter Interpolation Clinton Wang et.al. 2406.11737 null
2024-06-17 NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation Niu Guanchen et.al. 2406.11259 null
2024-06-15 NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows Zhenggang Tang et.al. 2406.10543 link
2024-06-15 Federated Neural Radiance Field for Distributed Intelligence Yintian Zhang et.al. 2406.10474 null
2024-06-14 Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections Jiacong Xu et.al. 2406.10373 null
2024-06-14 PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting Alex Hanson et.al. 2406.10219 link
2024-06-14 GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors Xiqian Yu et.al. 2406.10111 null
2024-06-14 OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control Yuzhong Huang et.al. 2406.10000 null
2024-06-14 dGrasp: NeRF-Informed Implicit Grasp Policies with Supervised Optimization Slopes Gergely Sóti et.al. 2406.09939 null
2024-06-14 RaNeuS: Ray-adaptive Neural Surface Reconstruction Yida Wang et.al. 2406.09801 link
2024-06-13 Rethinking Score Distillation as a Bridge Between Image Distributions David McAllister et.al. 2406.09417 null
2024-06-13 Preserving Identity with Variational Score for General-purpose 3D Editing Duong H. Le et.al. 2406.08953 null
2024-06-13 Neural NeRF Compression Tuan Pham et.al. 2406.08943 null
2024-06-14 AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis Swapnil Bhosale et.al. 2406.08920 null
2024-06-13 NeRF Director: Revisiting View Selection in Neural Volume Rendering Wenhui Xiao et.al. 2406.08839 link
2024-06-12 ICE-G: Image Conditional Editing of 3D Gaussian Splats Vishnu Jaganathan et.al. 2406.08488 null
2024-06-12 OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding Yinan Deng et.al. 2406.08009 link
2024-06-12 Spatial Annealing Smoothing for Efficient Few-shot Neural Rendering Yuru Xiao et.al. 2406.07828 link
2024-06-11 C3DAG: Controlled 3D Animal Generation using 3D pose guidance Sandeep Mishra et.al. 2406.07742 null
2024-06-11 M-LRM: Multi-view Large Reconstruction Model Mengfei Li et.al. 2406.07648 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431 null
2024-06-11 Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion Xin Yuan et.al. 2406.06972 null
2024-06-11 Neural Visibility Field for Uncertainty-Driven Active Mapping Shangjie Xue et.al. 2406.06948 null
2024-06-10 IllumiNeRF: 3D Relighting without Inverse Rendering Xiaoming Zhao et.al. 2406.06527 null
2024-06-10 GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation Haozhe Xie et.al. 2406.06526 link
2024-06-10 PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction Danpeng Chen et.al. 2406.06521 null
2024-06-10 Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis Xin Jin et.al. 2406.06216 link
2024-06-10 ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models Meng-Li Shih et.al. 2406.06133 null
2024-06-09 GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement Peiye Zhuang et.al. 2406.05649 null
2024-06-07 Multiplane Prior Guided Few-Shot Aerial Scene Rendering Zihan Gao et.al. 2406.04961 null
2024-06-07 Multi-style Neural Radiance Field with AdaIN Yu-Wen Pao et.al. 2406.04960 link
2024-06-06 Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization Takuhiro Kaneko et.al. 2406.04155 null
2024-06-06 How Far Can We Compress Instant-NGP-Based NeRF? Yihang Chen et.al. 2406.04101 link
2024-06-06 Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling Xinhang Liu et.al. 2406.03723 null
2024-06-06 Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction Diwen Wan et.al. 2406.03697 link
2024-06-04 3D-HGS: 3D Half-Gaussian Splatting Haolin Li et.al. 2406.02720 link
2024-06-06 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin et.al. 2406.02541 null
2024-06-04 Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning Jiaxu Wang et.al. 2406.02370 null
2024-06-03 Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting Shaojie Ma et.al. 2406.01593 null
2024-06-03 Tetrahedron Splatting for 3D Generation Chun Gu et.al. 2406.01579 link
2024-06-03 Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting Fang Li et.al. 2406.01042 link
2024-06-02 PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency Yeonsung Jung et.al. 2406.00798 null
2024-06-02 Representing Animatable Avatar via Factorized Neural Fields Chunjin Song et.al. 2406.00637 null
2024-06-04 SuperGaussian: Repurposing Video Models for 3D Super Resolution Yuan Shen et.al. 2406.00609 null
2024-06-02 Efficient Neural Light Fields (ENeLF) for Mobile Devices Austin Peng et.al. 2406.00598 null
2024-06-01 Bilateral Guided Radiance Field Processing Yuehao Wang et.al. 2406.00448 null
2024-05-31 R$^2$-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction Ruyi Zha et.al. 2405.20693 link
2024-05-31 4Diffusion: Multi-view Video Diffusion Model for 4D Generation Haiyu Zhang et.al. 2405.20674 null
2024-05-30 $\textit{S}^3$Gaussian: Self-Supervised Street Gaussians for Autonomous Driving Nan Huang et.al. 2405.20323 link
2024-05-30 TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes Minghao Guo et.al. 2405.20283 null
2024-05-31 NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation Pedro Martin et.al. 2405.20078 null
2024-05-30 IReNe: Instant Recoloring in Neural Radiance Fields Alessio Mazzucchelli et.al. 2405.19876 null
2024-05-30 HINT: Learning Complete Human Neural Representations from Limited Viewpoints Alessandro Sanvito et.al. 2405.19712 null
2024-05-30 View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields Haodi He et.al. 2405.19678 link
2024-05-29 Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy Zijie Jiang et.al. 2405.18863 null
2024-06-02 NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild Weining Ren et.al. 2405.18715 link
2024-05-28 Self-supervised Pre-training for Transferable Multi-modal Perception Xiaohao Xu et.al. 2405.17942 link
2024-05-28 A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction Bin Zhang et.al. 2405.17891 null
2024-05-29 HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction Haoyu Zhao et.al. 2405.17872 link
2024-05-28 Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh Xiangjun Gao et.al. 2405.17811 null
2024-05-28 F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting Xiangyu Sun et.al. 2405.17083 null
2024-05-29 PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting Zipeng Wang et.al. 2405.16829 null
2024-05-26 Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors Soumava Paul et.al. 2405.16517 null
2024-05-24 Neural Elevation Models for Terrain Mapping and Path Planning Adam Dai et.al. 2405.15227 link
2024-05-27 HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting Yuanhao Cai et.al. 2405.15125 link
2024-05-24 GS-Hider: Hiding Messages into 3D Gaussian Splatting Xuanyu Zhang et.al. 2405.15118 null
2024-05-23 NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections Dor Verbin et.al. 2405.14871 null
2024-05-23 Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling Liwen Wu et.al. 2405.14847 null
2024-05-23 Camera Relocalization in Shadow-free Neural Radiance Fields Shiyao Xu et.al. 2405.14824 link
2024-05-23 LDM: Large Tensorial SDF Model for Textured Mesh Generation Rengan Xie et.al. 2405.14580 link
2024-05-23 JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression Zihan Zheng et.al. 2405.14452 null
2024-05-22 DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus Yu Chen et.al. 2405.13943 link
2024-05-22 Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances Licheng Shen et.al. 2405.13694 null
2024-05-21 MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video Hongsheng Wang et.al. 2405.12806 null
2024-05-21 Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations Antoine Legrand et.al. 2405.12728 null
2024-05-20 Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo Tianqi Liu et.al. 2405.12218 link
2024-05-20 Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents Guanlin Wu et.al. 2405.12155 null
2024-05-20 NPLMV-PS: Neural Point-Light Multi-View Photometric Stereo Fotios Logothetis et.al. 2405.12057 null
2024-05-19 Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems Shengxiang Sun et.al. 2405.11629 null
2024-05-19 R-NeRF: Neural Radiance Fields for Modeling RIS-enabled Wireless Environments Huiying Yang et.al. 2405.11541 link
2024-05-18 MotionGS: Compact Gaussian Splatting SLAM by Motion Filter Xinli Guo et.al. 2405.11129 link
2024-05-16 When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models Xianzheng Ma et.al. 2405.10255 link
2024-05-15 From NeRFs to Gaussian Splats, and Back Siming He et.al. 2405.09717 link
2024-05-14 Dynamic NeRF: A Review Jinwei Lin et.al. 2405.08609 null
2024-05-13 Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs Mingyu Kim et.al. 2405.07857 link
2024-05-12 Point Resampling and Ray Transformation Aid to Editable NeRF Models Zhenyang Li et.al. 2405.07306 null
2024-05-12 Hologram: Realtime Holographic Overlays via LiDAR Augmented Reconstruction Ekansh Agrawal et.al. 2405.07178 null
2024-05-11 TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization Zhen Tan et.al. 2405.07027 link
2024-05-10 LIVE: LaTex Interactive Visual Editing Jinwei Lin et.al. 2405.06762 null
2024-05-14 SketchDream: Sketch-based Text-to-3D Generation and Editing Feng-Lin Liu et.al. 2405.06461 null
2024-05-10 Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering Xiaohan Zhang et.al. 2405.06214 null
2024-05-10 Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation Bardienus P. Duisterhof et.al. 2405.06181 null
2024-05-09 DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation Sitian Shen et.al. 2405.05800 null
2024-05-10 NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior Gihoon Kim et.al. 2405.05749 null
2024-05-09 RPBG: Towards Robust Neural Point-based Graphics in the Wild Qingtian Zhu et.al. 2405.05663 link
2024-05-09 Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview Yuhang Ming et.al. 2405.05526 null
2024-05-08 ${M^2D}$NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields Ning Wang et.al. 2405.05010 null
2024-05-08 DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid Sidun Liu et.al. 2405.04416 null
2024-05-07 Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications Markus Hillemann et.al. 2405.04345 null
2024-05-05 Blending Distributed NeRFs with Tri-stage Robust Pose Optimization Baijun Ye et.al. 2405.02880 null
2024-05-05 MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior Honghua Chen et.al. 2405.02859 null
2024-05-04 TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes Christopher Maxey et.al. 2405.02762 null
2024-05-04 ActiveNeuS: Active 3D Reconstruction using Neural Implicit Surface Uncertainty Hyunseo Kim et.al. 2405.02568 null
2024-05-03 Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning Dhruva Tirumala et.al. 2405.02425 null
2024-05-03 Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids Junchen Liu et.al. 2405.02386 link
2024-05-03 WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights Youngdong Jang et.al. 2405.02066 null
2024-05-02 NeRF in Robotics: A Survey Guangming Wang et.al. 2405.01333 null
2024-05-04 LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes Shanlin Sun et.al. 2405.00900 null
2024-05-01 Depth Priors in Removal Neural Radiance Fields Zhihao Guo et.al. 2405.00630 null
2024-05-01 NeRF-Guided Unsupervised Learning of RGB-D Registration Zhinan Yu et.al. 2405.00507 null
2024-05-01 RTG-SLAM: Real-time 3D Reconstruction at Scale using Gaussian Splatting Zhexi Peng et.al. 2404.19706 null
2024-04-30 NeRF-Insert: 3D Local Editing with Multimodal Control Signals Benet Oriol Sabat et.al. 2404.19204 null
2024-04-29 SAGS: Structure-Aware 3D Gaussian Splatting Evangelos Ververas et.al. 2404.19149 null
2024-04-29 GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting Bo Chen et.al. 2404.19040 null
2024-04-29 Embedded Representation Learning Network for Animating Styled Video Portrait Tianyong Wang et.al. 2404.19038 null
2024-04-29 Simple-RF: Regularizing Sparse Input Radiance Fields with Simpler Solutions Nagabhushan Somraj et.al. 2404.19015 null
2024-04-28 S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM Zhiyao Zhang et.al. 2404.18284 null
2024-04-27 DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction Chenhe Du et.al. 2404.17890 null
2024-04-26 Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields Tianqi Liu et.al. 2404.17528 link
2024-04-25 Depth Supervised Neural Surface Reconstruction from Airborne Imagery Vincent Hackstein et.al. 2404.16429 null
2024-04-24 NeRF-XL: Scaling NeRFs with Multiple GPUs Ruilong Li et.al. 2404.16221 null
2024-04-24 ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images Jinseo Jeong et.al. 2404.15707 null
2024-04-23 DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft Sam Earle et.al. 2404.15538 null
2024-04-28 GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting Hongyun Yu et.al. 2404.14037 null
2024-04-22 NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation Chi Huang et.al. 2404.13921 null
2024-04-23 CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory Yunlong Ran et.al. 2404.13896 null
2024-04-26 Neural Radiance Field in Autonomous Driving: A Survey Lei He et.al. 2404.13816 null
2024-04-26 ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis Zichen Tang et.al. 2404.13711 link
2024-04-21 Generalizable Novel-View Synthesis using a Stereo Camera Haechan Lee et.al. 2404.13541 null
2024-04-20 High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces Baoru Huang et.al. 2404.13437 null
2024-04-20 EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment Guanghao Li et.al. 2404.13346 link
2024-04-19 FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction Maria Dronova et.al. 2404.12970 null
2024-04-22 Does Gaussian Splatting need SFM Initialization? Yalda Foroutan et.al. 2404.12547 null
2024-04-18 MeshLRM: Large Reconstruction Model for High-Quality Mesh Xinyue Wei et.al. 2404.12385 null
2024-04-18 AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering Jingfeng Guo et.al. 2404.11897 link
2024-04-18 Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations Yu Feng et.al. 2404.11852 null
2024-04-17 SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping Vincent Cartillier et.al. 2404.11419 null
2024-04-16 Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks Florian Barthel et.al. 2404.10625 null
2024-04-16 Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences Seungwook Kim et.al. 2404.10603 null
2024-04-16 1st Place Solution for ICCV 2023 OmniObject3D Challenge: Sparse-View Reconstruction Hang Du et.al. 2404.10441 null
2024-04-16 SRGS: Super-Resolution 3D Gaussian Splatting Xiang Feng et.al. 2404.10318 link
2024-04-16 Plug-and-Play Acceleration of Occupancy Grid-based NeRF Rendering using VDB Grid and Hierarchical Ray Traversal Yoshio Kato et.al. 2404.10272 link
2024-04-15 Taming Latent Diffusion Model for Neural Radiance Field Inpainting Chieh Hubert Lin et.al. 2404.09995 null
2024-04-15 Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video Hongchi Xia et.al. 2404.09833 null
2024-04-15 DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading Tong Wu et.al. 2404.09412 null
2024-04-14 VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field Fei Xue et.al. 2404.09271 link
2024-04-15 OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering Jingrui Ye et.al. 2404.08449 null
2024-04-12 GPN: Generative Point-based NeRF Haipeng Wang et.al. 2404.08312 link
2024-04-12 MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance Yuqun Wu et.al. 2404.08252 null
2024-04-11 Connecting NeRFs, Images, and Text Francesco Ballerini et.al. 2404.07993 link
2024-04-11 Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation Keonhee Han et.al. 2404.07933 link
2024-04-12 NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving William Ljungbergh et.al. 2404.07762 link
2024-04-11 G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images Zixiong Huang et.al. 2404.07474 link
2024-04-10 SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection Mathis Kruse et.al. 2404.06832 link
2024-04-10 MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views Runfa Li et.al. 2404.06753 null
2024-04-10 Bayesian NeRF: Quantifying Uncertainty with Volume Density in Neural Radiance Fields Sibeak Lee et.al. 2404.06727 link
2024-04-11 SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera Gaole Dai et.al. 2404.06710 null
2024-04-09 Magic-Boost: Boost 3D Generation with Multi-View Conditioned Diffusion Fan Yang et.al. 2404.06429 link
2024-04-09 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis Zhicheng Lu et.al. 2404.06270 null
2024-04-09 GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields Arnab Dey et.al. 2404.06246 null
2024-04-09 HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields Arnab Dey et.al. 2404.06152 null
2024-04-08 Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation Y. Wang et.al. 2404.05236 null
2024-04-08 StylizedGS: Controllable Stylization for 3D Gaussian Splatting Dingxi Zhang et.al. 2404.05220 null
2024-04-08 Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos Fengrui Tian et.al. 2404.05163 link
2024-04-07 CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis Gyeongjin Kang et.al. 2404.04913 null
2024-04-07 GauU-Scene V2: Expanse Lidar Image Dataset Shows Unreliable Geometric Reconstruction Using Gaussian Splatting and NeRF Butian Xiong et.al. 2404.04880 null
2024-04-07 NeRF2Points: Large-Scale Point Cloud Generation From Street Views’ Radiance Field Optimization Peng Tu et.al. 2404.04875 null
2024-04-06 DATENeRF: Depth-Aware Text-based Editing of NeRFs Sara Rojas et.al. 2404.04526 null
2024-04-05 Robust Gaussian Splatting François Darmon et.al. 2404.04211 null
2024-04-04 SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer Zijie Wu et.al. 2404.03736 link
2024-04-07 RaFE: Generative Radiance Fields Restoration Zhongkai Wu et.al. 2404.03654 null
2024-04-04 OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views Francis Engelmann et.al. 2404.03650 null
2024-04-04 VF-NeRF: Viewshed Fields for Rigid NeRF Registration Leo Segre et.al. 2404.03349 null
2024-04-03 GenN2N: Generative NeRF2NeRF Translation Xiangyue Liu et.al. 2404.02788 link
2024-04-03 LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis Zehan Zheng et.al. 2404.02742 link
2024-04-03 Neural Radiance Fields with Torch Units Bingnan Ni et.al. 2404.02617 null
2024-04-03 Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition Yisheng He et.al. 2404.02514 null
2024-04-02 NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation Sicheng Li et.al. 2404.02185 null
2024-04-02 Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields Joshua Ahn et.al. 2404.02155 null
2024-04-02 Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions Saptarshi Dasgupta et.al. 2404.01812 null
2024-04-01 NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification Juyeop Han et.al. 2404.01400 null
2024-04-01 NeRF-MAE: Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields Muhammad Zubair Irshad et.al. 2404.01300 link
2024-04-01 MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space Armand Comas-Massagué et.al. 2404.01296 null
2024-04-02 StructLDM: Structured Latent Diffusion for 3D Human Generation Tao Hu et.al. 2404.01241 null
2024-04-01 Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting Jiarui Meng et.al. 2404.01168 null
2024-04-01 SGCNeRF: Few-Shot Neural Rendering via Sparse Geometric Consistency Guidance Yuru Xiao et.al. 2404.00992 null
2024-04-01 FlexiDreamer: Single Image-to-3D Generation with FlexiCubes Ruowen Zhao et.al. 2404.00987 link
2024-04-01 Marrying NeRF with Feature Matching for One-step Pose Estimation Ronghan Chen et.al. 2404.00891 null
2024-03-29 HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes Ke Wu et.al. 2403.20159 null
2024-03-29 Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior Jaehoon Ko et.al. 2403.20153 link
2024-03-29 SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior Zhongrui Yu et.al. 2403.20079 null
2024-03-29 NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising Tianchen Deng et.al. 2403.20034 link
2024-03-29 SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image Yunhao Li et.al. 2403.20018 link
2024-03-29 DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal Yunhao Li et.al. 2403.20013 link
2024-03-29 Stable Surface Regularization for Fast Few-Shot NeRF Byeongin Joung et.al. 2403.19985 null
2024-03-29 MI-NeRF: Learning a Single Face NeRF from Multiple Identities Aggelina Chatziagapi et.al. 2403.19920 null
2024-03-28 Mitigating Motion Blur in Neural Radiance Fields with Events and Frames Marco Cannici et.al. 2403.19780 link
2024-03-28 SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects Avinash Ummadisingu et.al. 2403.19607 null
2024-03-28 CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians Avinash Paliwal et.al. 2403.19495 link
2024-03-28 Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation Yujin Chen et.al. 2403.19319 null
2024-03-28 Sine Activated Low-Rank Matrices for Parameter Efficient Learning Yiping Ji et.al. 2403.19243 null
2024-03-29 Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction Qiuhong Shen et.al. 2403.18795 link
2024-03-27 SAT-NGP: Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery Camille Billouard et.al. 2403.18711 link
2024-03-27 Modeling uncertainty for Gaussian Splatting Luca Savant et.al. 2403.18476 null
2024-03-26 Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians Kerui Ren et.al. 2403.17898 link
2024-03-26 NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation Jiahao Chen et.al. 2403.17537 null
2024-03-25 VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation Yang Chen et.al. 2403.17001 null
2024-03-25 CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs Yingji Zhong et.al. 2403.16885 null
2024-03-25 Spike-NeRF: Neural Radiance Field Based On Spike Camera Yijia Guo et.al. 2403.16410 null
2024-03-24 Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields Haoyuan Wang et.al. 2403.16224 null
2024-03-24 Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes Takashi Otonari et.al. 2403.16141 null
2024-03-24 CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field Jiarui Hu et.al. 2403.16095 null
2024-03-24 Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap Carl Lindström et.al. 2403.16092 null
2024-03-26 PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling Xiaoyun Zheng et.al. 2403.16080 link
2024-03-24 Semantic Is Enough: Only Semantic Information For NeRF Reconstruction Ruibo Wang et.al. 2403.16043 null
2024-03-24 Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields Junhong Zhao et.al. 2403.15981 null
2024-03-23 DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation Mu-Yi Shen et.al. 2403.15791 link
2024-03-23 UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation Yuliang Guo et.al. 2403.15705 link
2024-03-22 WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization Jialu Wang et.al. 2403.15272 null
2024-03-21 Hyperspectral Neural Radiance Fields Gerry Chen et.al. 2403.14839 null
2024-03-21 ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition Tianhao Wu et.al. 2403.14619 null
2024-03-21 CombiNeRF: A Combination of Regularization Techniques for Few-Shot Neural Radiance Field View Synthesis Matteo Bonotto et.al. 2403.14412 link
2024-03-21 InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity Jiabin Liang et.al. 2403.14376 null
2024-03-21 Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions Jiacong Xu et.al. 2403.14053 link
2024-03-20 MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination Weiying Wang et.al. 2403.13348 null
2024-03-19 Depth-guided NeRF Training via Earth Mover’s Distance Anita Rau et.al. 2403.13206 null
2024-03-19 DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images Zaid Tasneem et.al. 2403.13199 null
2024-03-19 Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering Mingqi Shao et.al. 2403.12839 null
2024-03-19 Learning Neural Volumetric Pose Features for Camera Localization Jingyu Lin et.al. 2403.12800 null
2024-03-19 IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model Matteo Bortolon et.al. 2403.12682 null
2024-03-18 FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos Florian Philipp Stilz et.al. 2403.12198 null
2024-03-18 ThermoNeRF: Multimodal Neural Radiance Fields for Thermal Novel View Synthesis Mariam Hassan et.al. 2403.12154 link
2024-03-18 RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF Sibi Catley-Chandar et.al. 2403.11909 null
2024-03-18 GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors LI Yang et.al. 2403.11899 null
2024-03-18 Exploring Multi-modal Neural Scene Representations With Applications on Thermal Imaging Mert Özer et.al. 2403.11865 null
2024-03-19 BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting Lingzhe Zhao et.al. 2403.11831 link
2024-03-18 Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery Yuqi Zhang et.al. 2403.11812 link
2024-03-18 DVN-SLAM: Dynamic Visual Neural SLAM Based on Local-Global Encoding Wenhua Wu et.al. 2403.11776 null
2024-03-18 Exploring 3D-aware Latent Spaces for Efficiently Learning Numerous Scenes Antoine Schnepf et.al. 2403.11678 null
2024-03-18 UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling Yujiao Jiang et.al. 2403.11589 null
2024-03-18 Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem Mincheol Chang et.al. 2403.11573 null
2024-03-17 Creating Seamless 3D Maps Using Radiance Fields Sai Tarun Sathyan et.al. 2403.11364 null
2024-03-17 SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream Lin Zhu et.al. 2403.11222 link
2024-03-17 Recent Advances in 3D Gaussian Splatting Tong Wu et.al. 2403.11134 null
2024-03-17 Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications Yonggan Fu et.al. 2403.11131 link
2024-03-16 Fast Sparse View Guided NeRF Update for Object Reconfigurations Ziqi Lu et.al. 2403.11024 null
2024-03-16 HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering Seunghyeon Seo et.al. 2403.10906 null
2024-03-15 FeatUp: A Model-Agnostic Framework for Features at Any Resolution Stephanie Fu et.al. 2403.10516 link
2024-03-15 Thermal-NeRF: Neural Radiance Fields from an Infrared Camera Tianxiang Ye et.al. 2403.10340 link
2024-03-15 Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression Huy-Hoang Bui et.al. 2403.10297 link
2024-03-15 GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time Hao Li et.al. 2403.10147 null
2024-03-15 URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields Bo Xu et.al. 2403.10119 null
2024-03-15 DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video Huiqiang Sun et.al. 2403.10103 null
2024-03-15 Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience Xiaohang Yu et.al. 2403.09973 null
2024-03-14 GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping Yuhang Zheng et.al. 2403.09637 link
2024-03-14 The NeRFect Match: Exploring NeRF Features for Visual Localization Qunjie Zhou et.al. 2403.09577 null
2024-03-14 VIRUS-NeRF – Vision, InfraRed and UltraSonic based Neural Radiance Fields Nicolaj Schmid et.al. 2403.09477 link
2024-03-14 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation Frank Zhang et.al. 2403.09439 null
2024-03-14 RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes Thang-Anh-Quan Nguyen et.al. 2403.09419 null
2024-03-14 PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors Tianyuan Yuan et.al. 2403.09079 link
2024-03-13 Gaussian Splatting in Style Abhishek Saroha et.al. 2403.08498 null
2024-03-13 StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields Hongbin Xu et.al. 2403.08310 link
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156 link
2024-03-12 Q-SLAM: Quadric Representations for Monocular SLAM Chensheng Peng et.al. 2403.08125 null
2024-03-12 SMURF: Continuous Dynamics for Motion-Deblurring Radiance Fields Jungho Lee et.al. 2403.07547 link
2024-03-11 SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection Yifu Tao et.al. 2403.06877 null
2024-03-11 Vosh: Voxel-Mesh Hybrid Representation for Real-Time View Synthesis Chenhao Zhang et.al. 2403.06505 null
2024-03-13 FSViewFusion: Few-Shots View Generation of Novel Objects Rukhshanda Hussain et.al. 2403.06394 null
2024-03-10 Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis? Hanxin Zhu et.al. 2403.06092 null
2024-03-09 Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving Junyi Cao et.al. 2403.05907 link
2024-03-09 Large Generative Model Assisted 3D Semantic Communication Feibo Jiang et.al. 2403.05783 null
2024-03-08 GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting Francesco Palandra et.al. 2403.05154 null
2024-03-08 Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces Evangelos Skartados et.al. 2403.04508 null
2024-03-07 Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis Yuanhao Cai et.al. 2403.04116 link
2024-03-08 DNAct: Diffusion Guided Multi-Task 3D Policy Learning Ge Yan et.al. 2403.04115 null
2024-03-07 Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs Nikhil Mishra et.al. 2403.04114 link
2024-03-06 GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding Zi-Ting Chou et.al. 2403.03608 null
2024-03-05 A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel Prediction Haofan Lu et.al. 2403.03241 null
2024-03-05 Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps Timothy Chen et.al. 2403.02751 link
2024-03-04 DaReNeRF: Direction-aware Representation for Dynamic Scenes Ange Lou et.al. 2403.02265 null
2024-03-04 Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views Shuai Guo et.al. 2403.02063 null
2024-03-02 NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning Linsheng Chen et.al. 2403.01325 link
2024-03-02 Neural radiance fields-based holography [Invited] Minsung Kang et.al. 2403.01137 null
2024-03-02 Neural Field Classifiers via Target Encoding and Classification Loss Xindi Yang et.al. 2403.01058 null
2024-03-01 DISORF: A Distributed Online NeRF Training and Rendering Framework for Mobile Robots Chunlin Li et.al. 2403.00228 link
2024-02-28 NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images Jingrui Yu et.al. 2402.18196 link
2024-02-26 Neural Radiance Fields in Medical Imaging: Challenges and Next Steps Xin Wang et.al. 2402.17797 null
2024-02-27 Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning Xiaoyu Zhang et.al. 2402.17768 null
2024-02-27 VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction Jiaqi Lin et.al. 2402.17427 null
2024-02-27 Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis Zicheng Zhang et.al. 2402.17364 link
2024-02-27 DivAvatar: Diverse 3D Avatar Generation with a Single Prompt Weijing Tao et.al. 2402.17292 null
2024-02-27 CharNeRF: 3D Character Generation from Concept Art Eddy Chu et.al. 2402.17115 null
2024-02-26 Disentangled 3D Scene Generation with Layout Learning Dave Epstein et.al. 2402.16936 null
2024-02-26 CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency Hanxin Zhu et.al. 2402.16407 null
2024-02-26 SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field Zetian Song et.al. 2402.16366 null
2024-02-26 DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer Yizhe Wu et.al. 2402.16308 null
2024-02-22 Consolidating Attention Features for Multi-view Image Editing Or Patashnik et.al. 2402.14792 null
2024-02-26 FrameNeRF: A Simple and Efficient Framework for Few-shot Novel View Synthesis Yan Xing et.al. 2402.14586 null
2024-02-22 NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection Chenxi Huang et.al. 2402.14464 link
2024-02-22 TaylorGrid: Towards Fast and High-Quality Implicit Field Learning via Direct Taylor-based Grid Optimization Renyi Mao et.al. 2402.14415 null
2024-02-22 Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields Seungtae Nam et.al. 2402.14196 null
2024-02-21 Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting Joongho Jo et.al. 2402.13827 null
2024-02-21 SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields Zhentao Huang et.al. 2402.13510 null
2024-02-20 How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey Fabio Tosi et.al. 2402.13255 link
2024-02-20 Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields Bo-Yu Cheng et.al. 2402.13252 link
2024-02-20 NeRF Solves Undersampled MRI Reconstruction Tae Jun Jang et.al. 2402.13226 null
2024-02-20 OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow Simon Boeder et.al. 2402.12792 null
2024-02-19 Binary Opacity Grids: Capturing Fine Geometric Detail for Mesh-Based View Synthesis Christian Reiser et.al. 2402.12377 null
2024-02-19 Colorizing Monochromatic Radiance Fields Yean Cheng et.al. 2402.12184 null
2024-02-17 Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review Thang-Anh-Quan Nguyen et.al. 2402.11141 link
2024-02-15 Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions Muhammad Arbab Arshad et.al. 2402.10344 null
2024-02-14 PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments Xiuzhong Hu et.al. 2402.09325 link
2024-02-13 Preconditioners for the Stochastic Training of Implicit Neural Representations Shin-Fang Chng et.al. 2402.08784 null
2024-02-13 NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs Michael Fischer et.al. 2402.08622 null
2024-02-13 H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields Minyoung Park et.al. 2402.08138 null
2024-02-12 DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation Chenchang Li et.al. 2402.07648 null
2024-02-11 BioNeRF: Biologically Plausible Neural Radiance Fields for View Synthesis Leandro A. Passos et.al. 2402.07310 link
2024-02-11 3D Gaussian as a New Vision Era: A Survey Ben Fei et.al. 2402.07181 null
2024-02-09 ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting Georgii Stanishevskii et.al. 2402.06390 link
2024-02-07 NeRF as Non-Distant Environment Emitter in Physics-based Inverse Rendering Jingwang Ling et.al. 2402.04829 null
2024-02-07 OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding Guibiao Liao et.al. 2402.04648 link
2024-02-11 BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery Huiqing Zhang et.al. 2402.04554 null
2024-02-06 Improved Generalization of Weight Space Networks via Augmentations Aviv Shamsian et.al. 2402.04081 link
2024-02-05 ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis Bernard Spiegl et.al. 2402.02906 link
2024-02-02 ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields Xingyu Miao et.al. 2402.01950 link
2024-02-02 Robust Inverse Graphics via Probabilistic Inference Tuan Anh Le et.al. 2402.01915 link
2024-02-02 HyperPlanes: Hypernetwork Approach to Rapid NeRF Adaptation Paweł Batorski et.al. 2402.01524 link
2024-02-02 Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses Mahboubeh Asadi et.al. 2402.01485 null
2024-02-06 GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting Joanna Waczyńska et.al. 2402.01459 link
2024-02-02 Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization Zhiyu Zhang et.al. 2402.01380 null
2024-02-06 Taming Uncertainty in Sparse-view Generalizable NeRF via Indirect Diffusion Guidance Yaokun Li et.al. 2402.01217 null
2024-02-01 ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields Jiahua Dong et.al. 2402.00864 link
2024-02-01 Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering Pinxin Liu et.al. 2402.00827 link
2024-01-31 CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting Jiezhi Yang et.al. 2401.18075 null
2024-02-01 Segment Anything in 3D Gaussians Xu Hu et.al. 2401.17857 link
2024-01-30 Physical Priors Augmented Event-Based 3D Reconstruction Jiaxu Wang et.al. 2401.17121 link
2024-01-31 Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting Yiming Huang et.al. 2401.16416 link
2024-01-29 Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields Rongkai Ma et.al. 2401.16144 null
2024-01-26 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field Zhenyu Bao et.al. 2401.14726 link
2024-01-25 Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation Jiaxu Wang et.al. 2401.14354 null
2024-01-27 Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation Minglin Chen et.al. 2401.14257 null
2024-01-24 EndoGaussians: Single View Dynamic Gaussian Splatting for Deformable Endoscopic Tissues Reconstruction Yangsen Chen et.al. 2401.13352 null
2024-01-23 NeRF-AD: Neural Radiance Field with Attention-based Disentanglement for Talking Face Synthesis Chongke Bi et.al. 2401.12568 null
2024-01-23 Exploration and Improvement of Nerf-based 3D Scene Editing Techniques Shun Fang et.al. 2401.12456 null
2024-01-23 Methods and strategies for improving the novel view synthesis quality of neural radiation field Shun Fang et.al. 2401.12451 null
2024-01-22 Single-View 3D Human Digitalization with Large Reconstruction Models Zhenzhen Weng et.al. 2401.12175 null
2024-01-22 Scaling Face Interaction Graph Networks to Real World Scenes Tatiana Lopez-Guevara et.al. 2401.11985 null
2024-01-22 HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs Zelin Gao et.al. 2401.11711 null
2024-01-23 IPR-NeRF: Ownership Verification meets Neural Radiance Field Win Kent Ong et.al. 2401.09495 null
2024-01-17 ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization Weiyao Wang et.al. 2401.08937 null
2024-01-18 ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Process Kiyohiro Nakayama et.al. 2401.08140 null
2024-01-16 Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities Xu Yan et.al. 2401.08045 link
2024-01-15 6-DoF Grasp Pose Evaluation and Optimization via Transfer Learning from NeRFs Gergely Sóti et.al. 2401.07935 null
2024-01-11 TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation Rajaei Khatib et.al. 2401.06191 null
2024-01-11 Fast High Dynamic Range Radiance Fields for Dynamic Scenes Guanjun Wu et.al. 2401.06052 null
2024-01-11 CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians Bin Dou et.al. 2401.05925 null
2024-01-11 GO-NeRF: Generating Virtual Objects in Neural Radiance Fields Peng Dai et.al. 2401.05750 null
2024-01-10 Diffusion Priors for Dynamic View Synthesis from Monocular Videos Chaoyang Wang et.al. 2401.05583 null
2024-01-10 InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Mohamad Shahbazi et.al. 2401.05335 null
2024-01-10 CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video Xingyu Miao et.al. 2401.04861 link
2024-01-08 A Survey on 3D Gaussian Splatting Guikun Chen et.al. 2401.03890 link
2024-01-08 NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation Casimir Feldmann et.al. 2401.03771 null
2024-01-06 RustNeRF: Robust Neural Radiance Field with Low-Quality Images Mengfei Li et.al. 2401.03257 null
2024-01-06 Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping Tongyan Hua et.al. 2401.03203 null
2024-01-05 Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human Song Bai et.al. 2401.02620 null
2024-01-05 FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRF Hao Zhang et.al. 2401.02616 link
2024-01-05 Characterizing Satellite Geometry via Accelerated 3D Gaussian Splatting Van Minh Nguyen et.al. 2401.02588 null
2024-01-03 SIGNeRF: Scene Integrated Generation for Neural Radiance Fields Jan-Niklas Dihlmann et.al. 2401.01647 null
2024-01-02 Street Gaussians for Modeling Dynamic Urban Scenes Yunzhi Yan et.al. 2401.01339 link
2024-01-02 Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise Qinglong Huang et.al. 2401.01216 null
2024-01-02 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands Xuan Huang et.al. 2401.00979 link
2024-01-01 Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness Prior Byeonghyeon Lee et.al. 2401.00825 link
2024-01-02 GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields Xiao Pan et.al. 2401.00616 null
2023-12-30 Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models Han Jiang et.al. 2401.00208 null
2023-12-29 Informative Rays Selection for Few-Shot Neural Radiance Fields Marco Orsingher et.al. 2312.17561 null
2023-12-27 City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web Kaiwen Song et.al. 2312.16457 link
2023-12-26 DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision Lu Ling et.al. 2312.16256 null
2023-12-24 SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition Nikhil Behari et.al. 2312.16215 null
2023-12-23 INFAMOUS-NeRF: ImproviNg FAce MOdeling Using Semantically-Aligned Hypernetworks with Neural Radiance Fields Andrew Hou et.al. 2312.16197 null
2023-12-26 LangSplat: 3D Language Gaussian Splatting Minghan Qin et.al. 2312.16084 link
2023-12-26 2D-Guided 3D Gaussian Segmentation Kun Lan et.al. 2312.16047 null
2023-12-26 Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images Zhan Lu et.al. 2312.15942 link
2023-12-23 Human101: Training 100+FPS Human Gaussians in 100s from 1 View Mingwei Li et.al. 2312.15258 link
2023-12-23 Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane Chen Yang et.al. 2312.15253 link
2023-12-23 CaLDiff: Camera Localization in NeRF via Pose Diffusion Rashik Shrestha et.al. 2312.15242 null
2023-12-22 PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF Mohsen Gholami et.al. 2312.14915 link
2023-12-22 Density Uncertainty Quantification with NeRF-Ensembles: Impact of Data and Scene Constraints Miriam Jäger et.al. 2312.14664 null
2023-12-21 PlatoNeRF: 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar Tzofi Klinghoffer et.al. 2312.14239 null
2023-12-21 Virtual Pets: Animatable Animal Generation in 3D Scenes Yen-Chi Cheng et.al. 2312.14154 null
2023-12-21 Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning Desai Xie et.al. 2312.13980 null
2023-12-21 SyncDreamer for 3D Reconstruction of Endangered Animal Species with NeRF and NeuS Ahmet Haydar Ornek et.al. 2312.13832 null
2023-12-22 Gaussian Splatting with NeRF-based Color and Opacity Dawid Malarz et.al. 2312.13729 link
2023-12-21 DyBluRF: Dynamic Deblurring Neural Radiance Fields for Blurry Monocular Video Minh-Quan Viet Bui et.al. 2312.13528 null
2023-12-21 Visual Tomography: Physically Faithful Volumetric Models of Partially Translucent Objects David Nakath et.al. 2312.13494 null
2023-12-20 NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields Jens Naumann et.al. 2312.13471 null
2023-12-20 Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM Junru Lin et.al. 2312.13332 null
2023-12-20 ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors Weijia Mao et.al. 2312.13324 null
2023-12-20 UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections Fangjinhua Wang et.al. 2312.13285 null
2023-12-19 ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields Xiang Feng et.al. 2312.12122 null
2023-12-19 LHManip: A Dataset for Long-Horizon Language-Grounded Manipulation Tasks in Cluttered Tabletop Environments Federico Ceola et.al. 2312.12036 link
2023-12-19 MixRT: Mixed Neural Representations For Real-Time NeRF Rendering Chaojian Li et.al. 2312.11841 null
2023-12-19 Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation Yuze He et.al. 2312.11774 null
2023-12-15 FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline Chien-Yu Lin et.al. 2312.11537 null
2023-12-15 Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior Nan Huang et.al. 2312.11535 null
2023-12-18 GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning Ye Yuan et.al. 2312.11461 null
2023-12-18 AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis Dongze Li et.al. 2312.10921 null
2023-12-17 PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields Boming Zhao et.al. 2312.10649 null
2023-12-19 Learning Dense Correspondence for NeRF-Based Face Reenactment Songlin Yang et.al. 2312.10422 null
2023-12-15 SlimmeRF: Slimmable Radiance Fields Shiran Yuan et.al. 2312.10034 link
2023-12-15 LAENeRF: Local Appearance Editing for Neural Radiance Fields Lukas Radl et.al. 2312.09913 null
2023-12-15 SLS4D: Sparse Latent Space for 4D Novel View Synthesis Qi-Yuan Feng et.al. 2312.09743 null
2023-12-15 Towards Transferable Targeted 3D Adversarial Attack in the Physical World Yao Huang et.al. 2312.09558 link
2023-12-14 LatentEditor: Text Driven Local Editing of 3D Scenes Umar Khalid et.al. 2312.09313 link
2023-12-14 Stable Score Distillation for High-Quality 3D Generation Boshi Tang et.al. 2312.09305 null
2023-12-14 ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining Ruoxi Shi et.al. 2312.09249 null
2023-12-15 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting Zhiyin Qian et.al. 2312.09228 null
2023-12-15 ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field Zhangkai Ni et.al. 2312.09095 link
2023-12-15 Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption Ziteng Cui et.al. 2312.09093 link
2023-12-14 iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching Yuan Sun et.al. 2312.09031 null
2023-12-14 Scene 3-D Reconstruction System in Scattering Medium Zhuoyifan Zhang et.al. 2312.09005 null
2023-12-14 CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning Qingsong Yan et.al. 2312.08760 null
2023-12-14 SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field Ru Li et.al. 2312.08692 link
2023-12-13 ProNeRF: Learning Efficient Projection-Aware Ray Sampling for Fine-Grained Implicit Neural Radiance Fields Juan Luis Gonzalez Bello et.al. 2312.08136 null
2023-12-13 Neural Radiance Fields for Transparent Object Using Visual Hull Heechan Yoon et.al. 2312.08118 null
2023-12-13 uSF: Learning Neural Semantic Field with Uncertainty Vsevolod Skorokhodov et.al. 2312.08012 link
2023-12-12 COLMAP-Free 3D Gaussian Splatting Yang Fu et.al. 2312.07504 link
2023-12-12 Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs Sunghwan Hong et.al. 2312.07246 link
2023-12-12 WaterHE-NeRF: Water-ray Tracing Neural Radiance Fields for Underwater Scene Reconstruction Jingchun Zhou et.al. 2312.06946 null
2023-12-10 TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video Minye Wu et.al. 2312.06713 null
2023-12-11 CorresNeRF: Image Correspondence Priors for Neural Radiance Fields Yixing Lao et.al. 2312.06642 link
2023-12-11 DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior Tianyu Huang et.al. 2312.06439 link
2023-12-10 NeVRF: Neural Video-based Radiance Fields for Long-duration Sequences Minye Wu et.al. 2312.05855 null
2023-12-10 IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment Letian Zhang et.al. 2312.05748 null
2023-12-09 CoGS: Controllable Gaussian Splatting Heng Yu et.al. 2312.05664 null
2023-12-09 R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning Zhiling Ye et.al. 2312.05572 null
2023-12-08 Multi-view Inversion for 3D-aware Generative Adversarial Networks Florian Barthel et.al. 2312.05330 link
2023-12-08 TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis Heming Zhu et.al. 2312.05161 null
2023-12-08 Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting Xiaofeng Yang et.al. 2312.04820 null
2023-12-08 Reality’s Canvas, Language’s Brush: Crafting 3D Avatars from Monocular Video Yuchen Rao et.al. 2312.04784 null
2023-12-07 MuRF: Multi-Baseline Radiance Fields Haofei Xu et.al. 2312.04565 link
2023-12-07 EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS Sharath Girish et.al. 2312.04564 link
2023-12-07 Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection Kohei Yamashita et.al. 2312.04527 null
2023-12-07 Multi-View Unsupervised Image Generation with Cross Attention Guidance Llukman Cerkezi et.al. 2312.04337 null
2023-12-07 Towards 4D Human Video Stylization Tiantian Wang et.al. 2312.04143 link
2023-12-07 Identity-Obscured Neural Radiance Fields: Privacy-Preserving 3D Facial Reconstruction Jiayi Kong et.al. 2312.04106 null
2023-12-06 Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion Kira Prabhu et.al. 2312.03869 null
2023-12-06 Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle Youtian Lin et.al. 2312.03431 null
2023-12-06 Artist-Friendly Relightable and Animatable Neural Heads Yingyan Xu et.al. 2312.03420 null
2023-12-06 Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method Hongyu Huang et.al. 2312.03372 null
2023-12-06 RING-NeRF: A Versatile Architecture based on Residual Implicit Neural Grids Doriand Petit et.al. 2312.03357 null
2023-12-06 SO-NeRF: Active View Planning for NeRF using Surrogate Objectives Keifer Lee et.al. 2312.03266 null
2023-12-06 Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields Shijie Zhou et.al. 2312.03203 link
2023-12-05 HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces Haithem Turki et.al. 2312.03160 null
2023-12-05 ReconFusion: 3D Reconstruction with Diffusion Priors Rundi Wu et.al. 2312.02981 null
2023-12-05 GauHuman: Articulated Gaussian Splatting from Monocular Human Videos Shoukang Hu et.al. 2312.02973 link
2023-12-05 Alchemist: Parametric Control of Material Properties with Diffusion Models Prafull Sharma et.al. 2312.02970 null
2023-12-05 MVHumanNet: A Large-scale Dataset of Multi-view Daily Dressing Human Captures Zhangyang Xiong et.al. 2312.02963 null
2023-12-05 C-NERF: Representing Scene Changes as Directional Consistency Difference-based NeRF Rui Huang et.al. 2312.02751 link
2023-12-05 Prompt2NeRF-PIL: Fast NeRF Generation via Pretrained Implicit Latent Jianmeng Liu et.al. 2312.02568 null
2023-12-04 PointNeRF++: A multi-scale, point-based Neural Radiance Field Weiwei Sun et.al. 2312.02362 null
2023-12-04 Calibrated Uncertainties for Neural Radiance Fields Niki Amini-Naieni et.al. 2312.02350 null
2023-12-04 Re-Nerfing: Enforcing Geometric Constraints on Neural Radiance Fields through Novel Views Synthesis Felix Tristram et.al. 2312.02255 null
2023-12-04 ColonNeRF: Neural Radiance Fields for High-Fidelity Long-Sequence Colonoscopy Reconstruction Yufei Shi et.al. 2312.02015 null
2023-12-04 Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training Runze He et.al. 2312.01663 null
2023-12-03 SANeRF-HQ: Segment Anything for NeRF in High Quality Yichen Liu et.al. 2312.01531 null
2023-12-03 VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams Liao Wang et.al. 2312.01407 null
2023-12-02 Self-Evolving Neural Radiance Fields Jaewoo Jung et.al. 2312.01003 link
2023-12-01 Gaussian Grouping: Segment and Edit Anything in 3D Scenes Mingqiao Ye et.al. 2312.00732 link
2023-11-30 LucidDreaming: Controllable Object-Centric 3D Generation Zhaoning Wang et.al. 2312.00588 null
2023-12-01 FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting Zehao Zhu et.al. 2312.00451 null
2023-11-30 PyNeRF: Pyramidal Neural Radiance Fields Haithem Turki et.al. 2312.00252 link
2023-11-30 SparseGS: Real-Time 360° Sparse View Synthesis using Gaussian Splatting Haolin Xiong et.al. 2312.00206 link
2023-11-30 Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing Hyelin Nam et.al. 2311.18608 null
2023-11-30 ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs Violeta Menéndez González et.al. 2311.18491 null
2023-11-30 Anisotropic Neural Representation Learning for High-Quality Neural Rendering Y. Wang et.al. 2311.18311 null
2023-11-30 CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt Haiyao Xiao et.al. 2311.18288 null
2023-11-30 Compact3D: Compressing Gaussian Splat Radiance Field Models with Vector Quantization KL Navaneet et.al. 2311.18159 link
2023-11-29 GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces Yingwenqi Jiang et.al. 2311.17977 null
2023-11-29 AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text Jianfeng Zhang et.al. 2311.17917 null
2023-11-29 FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information Wen Jiang et.al. 2311.17874 link
2023-11-29 Cinematic Behavior Transfer via NeRF-based Differentiable Filming Xuekun Jiang et.al. 2311.17754 null
2023-11-29 SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis Ziqiao Peng et.al. 2311.17590 link
2023-11-29 NeRFTAP: Enhancing Transferability of Adversarial Patches on Face Recognition using Neural Radiance Fields Xiaoliang Liu et.al. 2311.17332 null
2023-11-28 LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS Zhiwen Fan et.al. 2311.17245 link
2023-11-28 Continuous Pose for Monocular Cameras in Neural Implicit Representation Qi Ma et.al. 2311.17119 link
2023-11-28 UC-NeRF: Neural Radiance Field for Under-Calibrated multi-view cameras in autonomous driving Kai Cheng et.al. 2311.16945 null
2023-11-28 The Sky’s the Limit: Re-lightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility James A. D. Gardner et.al. 2311.16937 link
2023-11-28 SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation Jesus Zarzar et.al. 2311.16671 link
2023-11-28 DGNR: Density-Guided Neural Point Rendering of Large Driving Scenes Zhuopeng Li et.al. 2311.16664 null
2023-11-28 SCALAR-NeRF: SCAlable LARge-scale Neural Radiance Fields for Scene Reconstruction Yu Chen et.al. 2311.16657 null
2023-11-28 Rethinking Directional Integration in Neural Radiance Fields Congyue Deng et.al. 2311.16504 null
2023-11-27 Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent Synthetic Images Shiu-hong Kao et.al. 2311.16499 link
2023-11-27 Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling Zhe Li et.al. 2311.16096 link
2023-11-27 SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using Neural Radiance Fields Quentin Herau et.al. 2311.15803 null
2023-11-27 CaesarNeRF: Calibrated Semantic Representation for Few-shot Generalizable Neural Rendering Haidong Zhu et.al. 2311.15510 link
2023-11-26 Efficient Encoding of Graphics Primitives with Simplex-based Structures Yibo Wen et.al. 2311.15439 null
2023-11-26 Obj-NeRF: Extract Object NeRFs from Multi-view Images Zhiyi Li et.al. 2311.15291 null
2023-11-26 NeuRAD: Neural Rendering for Autonomous Driving Adam Tonderski et.al. 2311.15260 link
2023-11-24 Animate124: Animating One Image to 4D Dynamic Scene Yuyang Zhao et.al. 2311.14603 null
2023-11-24 GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting Yiwen Chen et.al. 2311.14521 link
2023-11-23 ECRF: Entropy-Constrained Neural Radiance Fields Compression with Frequency Domain Optimization Soonbin Lee et.al. 2311.14208 null
2023-11-23 Tube-NeRF: Efficient Imitation Learning of Visuomotor Policies from MPC using Tube-Guided Data Augmentation and NeRFs Andrea Tagliabue et.al. 2311.14153 null
2023-11-23 Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder Xiaohao Xu et.al. 2311.13750 null
2023-11-22 Compact 3D Gaussian Representation for Radiance Field Joo Chan Lee et.al. 2311.13681 link
2023-11-22 Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning Kai Yu et.al. 2311.13617 null
2023-11-22 Animatable 3D Gaussians for High-fidelity Synthesis of Human Motions Keyang Ye et.al. 2311.13404 null
2023-11-22 Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images Jaeyoung Chung et.al. 2311.13398 link
2023-11-22 3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization Jianwei Feng et.al. 2311.13168 null
2023-11-22 PIE-NeRF: Physics-based Interactive Elastodynamics with NeRF Yutao Feng et.al. 2311.13099 null
2023-11-21 SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering Antoine Guédon et.al. 2311.12775 link
2023-11-21 Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields Yifan Wang et.al. 2311.12490 null
2023-11-18 Towards Function Space Mesh Watermarking: Protecting the Copyright of Signed Distance Fields Xingyu Zhu et.al. 2311.12059 null
2023-11-20 GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding Hao Li et.al. 2311.11863 null
2023-11-20 Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields Zhiyuan Min et.al. 2311.11845 link
2023-11-19 GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise Xinhai Li et.al. 2311.11221 null
2023-11-18 SNI-SLAM: Semantic Neural Implicit SLAM Siting Zhu et.al. 2311.11016 link
2023-11-18 Structure-Aware Sparse-View X-ray 3D Reconstruction Yuanhao Cai et.al. 2311.10959 link
2023-11-17 Removing Adverse Volumetric Effects From Trained Neural Radiance Fields Andreas L. Teigen et.al. 2311.10523 null
2023-11-18 EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices Jingnan Gao et.al. 2311.09806 null
2023-11-16 Reconstructing Continuous Light Field From Single Coded Image Yuya Ishikawa et.al. 2311.09646 null
2023-11-15 Single-Image 3D Human Digitization with Shape-Guided Diffusion Badour AlBahar et.al. 2311.09221 null
2023-11-15 DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model Yinghao Xu et.al. 2311.09217 null
2023-11-15 Spiking NeRF: Representing the Real-World Geometry by a Discontinuous Representation Zhanfeng Liao et.al. 2311.09077 link
2023-11-13 $L_0$-Sampler: An $L_0$ Model Guided Volume Sampling for NeRF Liangchen Li et.al. 2311.07044 null
2023-11-11 Aria-NeRF: Multimodal Egocentric View Synthesis Jiankai Sun et.al. 2311.06455 null
2023-11-10 Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model Jiahao Li et.al. 2311.06214 null
2023-11-10 A Neural Height-Map Approach for the Binocular Photometric Stereo Problem Fotios Logothetis et.al. 2311.05958 null
2023-11-09 BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis Hao-Bin Duan et.al. 2311.05521 link
2023-11-09 Control3D: Towards Controllable Text-to-3D Generation Yang Chen et.al. 2311.05461 null
2023-11-08 LRM: Large Reconstruction Model for Single Image to 3D Yicong Hong et.al. 2311.04400 null
2023-11-07 ADFactory: Automated Data Factory for Optical Flow Tasks Han Ling et.al. 2311.04246 null
2023-11-07 High-fidelity 3D Reconstruction of Plants using Neural Radiance Field Kewei Hu et.al. 2311.04154 null
2023-11-07 Fast Sun-aligned Outdoor Scene Relighting based on TensoRF Yeonjin Chang et.al. 2311.03965 null
2023-11-08 UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields Injae Kim et.al. 2311.03784 link
2023-11-06 Osprey: Multi-Session Autonomous Aerial Mapping with LiDAR-based SLAM and Next Best View Planning Rowan Border et.al. 2311.03484 null
2023-11-06 Animating NeRFs from Texture Space: A Framework for Pose-Dependent Rendering of Human Performances Paul Knoll et.al. 2311.03140 null
2023-11-06 InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image Jianhui Li et.al. 2311.02826 link
2023-11-03 Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields Jianxiong Shen et.al. 2311.01815 null
2023-11-03 PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation Yuhan Ding et.al. 2311.01773 null
2023-11-03 Efficient Cloud Pipelines for Neural Radiance Fields Derek Jacoby et.al. 2311.01659 null
2023-11-02 Novel View Synthesis from a Single RGBD Image for Indoor Scenes Congrui Hetang et.al. 2311.01065 null
2023-10-31 FPO++: Efficient Encoding and Rendering of Dynamic Neural Radiance Fields by Analyzing and Enhancing Fourier PlenOctrees Saskia Rabich et.al. 2310.20710 link
2023-10-31 NeRF Revisited: Fixing Quadrature Instability in Volume Rendering Mikaela Angelina Uy et.al. 2310.20685 null
2023-10-30 Generative Neural Fields by Mixtures of Neural Implicit Functions Tackgeun You et.al. 2310.19464 null
2023-11-04 TiV-NeRF: Tracking and Mapping via Time-Varying Representation with Dynamic Neural Radiance Fields Chengyao Duan et.al. 2310.18917 null
2023-10-28 INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings Amirhossein Kazerouni et.al. 2310.18846 link
2023-10-27 ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Kyle Sargent et.al. 2310.17994 link
2023-10-27 Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations Tristan Aumentado-Armstrong et.al. 2310.17880 null
2023-10-27 HyperFields: Towards Zero-Shot Generation of NeRFs from Text Sudarshan Babu et.al. 2310.17075 null
2023-10-25 4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via 4D Semantic Segmentation Dadong Jiang et.al. 2310.16858 null
2023-10-26 LightSpeed: Light and Fast Neural Light Fields on Mobile Devices Aarush Gupta et.al. 2310.16832 link
2023-10-28 PERF: Panoramic Neural Radiance Field from a Single Panorama Guangcong Wang et.al. 2310.16831 link
2023-10-25 Open-NeRF: Towards Open Vocabulary NeRF Decomposition Hao Zhang et.al. 2310.16383 null
2023-10-25 UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception Christopher Maxey et.al. 2310.16255 null
2023-10-24 Cross-view Self-localization from Synthesized Scene-graphs Ryogo Yamamoto et.al. 2310.15504 null
2023-10-23 CAwa-NeRF: Instant Learning of Compression-Aware NeRF Features Omnia Mahmoud et.al. 2310.14695 null
2023-10-23 VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations Yiying Yang et.al. 2310.14487 null
2023-10-20 ManifoldNeRF: View-dependent Image Feature Supervision for Few-shot Neural Radiance Fields Daiju Kanaoka et.al. 2310.13670 null
2023-10-20 Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos Seoha Kim et.al. 2310.13356 link
2023-10-20 UE4-NeRF: Neural Radiance Field for Real-Time Rendering of Large-Scale Scene Jiaming Gu et.al. 2310.13263 null
2023-10-18 VQ-NeRF: Neural Reflectance Decomposition and Editing with Vector Quantization Hongliang Zhong et.al. 2310.11864 null
2023-10-18 Towards Abdominal 3-D Scene Rendering from Laparoscopy Surgical Videos using NeRFs Khoa Tuan Nguyen et.al. 2310.11645 null
2023-10-16 TraM-NeRF: Tracing Mirror and Near-Perfect Specular Reflections through Neural Radiance Fields Leif Van Holland et.al. 2310.10650 link
2023-10-16 DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing Jia-Wei Liu et.al. 2310.10624 null
2023-10-16 Self-supervised Fetal MRI 3D Reconstruction Based on Radiation Diffusion Generation Model Junpeng Tan et.al. 2310.10209 null
2023-10-15 ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context Binglun Wang et.al. 2310.09965 null
2023-10-15 Active Perception using Neural Radiance Fields Siming He et.al. 2310.09892 link
2023-10-15 CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses Hongyu Fu et.al. 2310.09776 null
2023-10-11 Dynamic Appearance Particle Neural Radiance Field Ancheng Lin et.al. 2310.07916 null
2023-10-12 PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction Jia-Wang Bian et.al. 2310.07449 link
2023-10-11 rpcPRF: Generalizable MPI Neural Radiance Field for Satellite Camera Tongtong Zhang et.al. 2310.07179 null
2023-10-10 Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization Le Chen et.al. 2310.06984 null
2023-10-10 High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field Minghan Qin et.al. 2310.06275 null
2023-10-09 A Real-time Method for Inserting Virtual Objects into Neural Radiance Fields Keyang Ye et.al. 2310.05837 null
2023-10-09 Neural Impostor: Editing Neural Radiance Fields with Explicit Shape Manipulation Ruiyang Liu et.al. 2310.05391 null
2023-10-08 LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization Artem Nenashev et.al. 2310.05134 null
2023-10-08 Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation Dominik Hollidt et.al. 2310.05133 null
2023-10-06 Improving Neural Radiance Field using Near-Surface Sampling with Point Cloud Generation Hye Bin Yoo et.al. 2310.04152 null
2023-10-05 Drag View: Generalizable Novel View Synthesis with Unposed Imagery Zhiwen Fan et.al. 2310.03704 link
2023-10-05 Targeted Adversarial Attacks on Generalizable Neural Radiance Fields Andras Horvath et.al. 2310.03578 null
2023-10-05 BID-NeRF: RGB-D image pose estimation with inverted Neural Radiance Fields Ágoston István Csehi et.al. 2310.03563 null
2023-10-04 Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation Yihan Wu et.al. 2310.03125 null
2023-10-04 T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation Yuze He et.al. 2310.02977 link
2023-10-04 ED-NeRF: Efficient Text-Guided Editing of 3D Scene using Latent Space NeRF Jangho Park et.al. 2310.02712 null
2023-10-05 USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields Moyang Li et.al. 2310.02687 link
2023-10-03 EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields Anish Bhattacharya et.al. 2310.02437 link
2023-10-03 Adaptive Multi-NeRF: Exploit Efficient Parallelism in Adaptive Multiple Scale Neural Radiance Field Rendering Tong Wang et.al. 2310.01881 null
2023-10-03 MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields Takuhiro Kaneko et.al. 2310.01821 null
2023-10-02 PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments Xiuzhong Hu et.al. 2310.00874 link
2023-10-01 How Many Views Are Needed to Reconstruct an Unknown Object Using NeRF? Sicong Pan et.al. 2310.00684 link
2023-10-01 Enabling Neural Radiance Fields (NeRF) for Large-scale Aerial Images – A Multi-tiling Approach and the Geometry Assessment of NeRF Ningli Xu et.al. 2310.00530 null
2023-09-30 MMPI: a Flexible Radiance Field Representation by Multiple Multi-plane Images Blending Yuze He et.al. 2310.00249 null
2023-09-29 Multi-task View Synthesis with Neural Radiance Fields Shuhong Zheng et.al. 2309.17450 link
2023-09-29 Forward Flow for Novel View Synthesis of Dynamic Scenes Xiang Guo et.al. 2309.17390 null
2023-09-29 HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field Xiaochen Zhao et.al. 2309.17128 null
2023-09-28 Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis Marcel C. Bühler et.al. 2309.16859 null
2023-09-28 MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond Yixuan Li et.al. 2309.16553 null
2023-09-28 FG-NeRF: Flow-GAN based Probabilistic Neural Radiance Field for Independence-Assumption-Free Uncertainty Estimation Songlin Wei et.al. 2309.16364 null
2023-09-28 Learning Effective NeRFs and SDFs Representations with 3D Generative Adversarial Networks for 3D Object Generation: Technical Report for ICCV 2023 OmniObject3D Challenge Zheyuan Yang et.al. 2309.16110 null
2023-09-27 P2I-NET: Mapping Camera Pose to Image via Adversarial Learning for New View Synthesis in Real Indoor Environments Xujie Kang et.al. 2309.15526 null
2023-09-27 BASED: Bundle-Adjusting Surgical Endoscopic Dynamic Video Reconstruction using Neural Radiance Fields Shreya Saha et.al. 2309.15329 null
2023-09-26 3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction Miriam Jäger et.al. 2309.14800 null
2023-09-25 NAS-NeRF: Generative Neural Architecture Search for Neural Radiance Fields Saeejith Nair et.al. 2309.14293 null
2023-09-25 Variational Inference for Scalable 3D Object-centric Learning Tianyu Wang et.al. 2309.14010 null
2023-09-24 MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field Zijiang Yang et.al. 2309.13607 null
2023-09-23 NeRF-Enhanced Outpainting for Faithful Field-of-View Extrapolation Rui Yu et.al. 2309.13240 null
2023-09-22 NeRRF: 3D Reconstruction and View Synthesis for Transparent and Specular Objects with Neural Refractive-Reflective Fields Xiaoxue Chen et.al. 2309.13039 link
2023-09-21 ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average Texture and Mesh Encoding Yu Cheng et.al. 2309.12183 null
2023-09-21 NeuralLabeling: A versatile toolset for labeling vision datasets using Neural Radiance Fields Floris Erich et.al. 2309.11966 link
2023-09-21 Fast Satellite Tensorial Radiance Field for Multi-date Satellite Imagery of Large Size Tongtong Zhang et.al. 2309.11767 null
2023-09-21 MarkNerf: Watermarking for Neural Radiance Field Lifeng Chen et.al. 2309.11747 null
2023-09-21 Rendering stable features improves sampling-based localisation with Neural radiance fields Boxuan Zhang et.al. 2309.11698 null
2023-09-20 GenLayNeRF: Generalizable Layered Representations with 3D Model Alignment for Multi-Human View Synthesis Youssef Abdelkareem et.al. 2309.11627 null
2023-09-20 Light Field Diffusion for Single-View Novel View Synthesis Yifeng Xiong et.al. 2309.11525 null
2023-09-21 Controllable Dynamic Appearance for Neural 3D Portraits ShahRukh Athar et.al. 2309.11009 null
2023-09-20 Spiking NeRF: Making Bio-inspired Neural Networks See through the Real World Xingting Yao et.al. 2309.10987 link
2023-09-19 Locally Stylized Neural Radiance Fields Hong-Wing Pang et.al. 2309.10684 null
2023-09-19 Steganography for Neural Radiance Fields by Backdooring Weina Dong et.al. 2309.10503 null
2023-09-18 Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach Rong Liu et.al. 2309.10011 null
2023-09-18 RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision Mingjie Pan et.al. 2309.09502 link
2023-09-17 NeRF-VINS: A Real-time Neural Radiance Field Map-based Visual-Inertial Navigation System Saimouli Katragadda et.al. 2309.09295 null
2023-09-16 DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF Mert Asim Karaoglu et.al. 2309.08927 link
2023-09-15 Robust e-NeRF: NeRF from Sparse & Noisy Events under Non-Uniform Motion Weng Fei Low et.al. 2309.08596 link
2023-09-14 Gradient based Grasp Pose Optimization on a NeRF that Approximates Grasp Success Gergely Sóti et.al. 2309.08040 null
2023-09-14 MC-NeRF: Multi-Camera Neural Radiance Fields for Multi-Camera Image Acquisition Systems Yu Gao et.al. 2309.07846 null
2023-09-14 DT-NeRF: Decomposed Triplane-Hash Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis Yaoyu Su et.al. 2309.07752 null
2023-09-14 CoRF: Colorizing Radiance Fields using Knowledge Distillation Ankit Dhiman et.al. 2309.07668 null
2023-09-13 Text-Guided Generation and Editing of Compositional 3D Avatars Hao Zhang et.al. 2309.07125 null
2023-09-13 Dynamic NeRFs for Soccer Scenes Sacha Lewin et.al. 2309.06802 link
2023-09-12 Federated Learning for Large-Scale Scene Modeling with Neural Radiance Fields Teppei Suzuki et.al. 2309.06030 null
2023-09-11 PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics Claus Smitt et.al. 2309.05339 null
2023-09-10 Text-driven Editing of 3D Scenes without Retraining Shuangkang Fang et.al. 2309.04917 link
2023-09-09 Mirror-Aware Neural Humans Daniel Ajisafe et.al. 2309.04750 link
2023-09-08 Dynamic Mesh-Aware Radiance Fields Yi-Ling Qiao et.al. 2309.04581 null
2023-09-08 DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields Junzhe Zhang et.al. 2309.04410 link
2023-09-14 SimpleNeRF: Regularizing Sparse Input Neural Radiance Fields with Simpler Solutions Nagabhushan Somraj et.al. 2309.03955 null
2023-09-07 BluNF: Blueprint Neural Field Robin Courant et.al. 2309.03933 null
2023-09-07 Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model Sungwon Hwang et.al. 2309.03550 null
2023-09-06 Bayes’ Rays: Uncertainty Quantification for Neural Radiance Fields Lily Goli et.al. 2309.03185 link
2023-09-06 ResFields: Residual Neural Fields for Spatiotemporal Signals Marko Mihajlovic et.al. 2309.03160 link
2023-09-06 Instant Continual Learning of Neural Radiance Fields Ryan Po et.al. 2309.01811 null
2023-09-04 Adv3D: Generating 3D Adversarial Examples in Driving Scenarios with NeRF Leheng Li et.al. 2309.01351 null
2023-09-01 SparseSat-NeRF: Dense Depth Supervised Neural Radiance Fields for Sparse Satellite Images Lulin Zhang et.al. 2309.00277 link
2023-08-24 Improving NeRF Quality by Progressive Camera Placement for Unrestricted Navigation in Complex Environments Georgios Kopanas et.al. 2309.00014 null
2023-09-03 GHuNeRF: Generalizable Human NeRF from a Monocular Video Chen Li et.al. 2308.16576 link
2023-08-30 From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications Shreyank N Gowda et.al. 2308.16041 null
2023-08-30 Drone-NeRF: Efficient NeRF Based 3D Scene Reconstruction for Large-Scale Drone Survey Zhihao Jia et.al. 2308.15733 null
2023-08-29 Efficient Ray Sampling for Radiance Fields Reconstruction Shilei Sun et.al. 2308.15547 null
2023-08-29 Pose-Free Neural Radiance Fields via Implicit Pose Regularization Jiahui Zhang et.al. 2308.15049 null
2023-08-28 CLNeRF: Continual Learning Meets NeRF Zhipeng Cai et.al. 2308.14816 link
2023-08-26 InsertNeRF: Instilling Generalizability into NeRF with HyperNet Modules Yanqi Bao et.al. 2308.13897 link
2023-08-24 NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects Dakshit Agrawal et.al. 2308.12560 link
2023-08-23 Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields Hyeonseop Song et.al. 2308.11974 null
2023-08-25 Pose Modulated Avatars from Video Chunjin Song et.al. 2308.11951 null
2023-08-22 Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts Wenyan Cong et.al. 2308.11793 link
2023-08-22 SAMSNeRF: Segment Anything Model (SAM) Guides Dynamic Surgical Scene Reconstruction by Neural Radiance Field (NeRF) Ange Lou et.al. 2308.11774 null
2023-08-22 Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views Wentian Qu et.al. 2308.11198 null
2023-08-22 Efficient View Synthesis with Neural Radiance Distribution Field Yushuang Wu et.al. 2308.11130 null
2023-08-21 CamP: Camera Preconditioning for Neural Radiance Fields Keunhong Park et.al. 2308.10902 null
2023-08-20 Strata-NeRF: Neural Radiance Fields for Stratified Scenes Ankit Dhiman et.al. 2308.10337 null
2023-08-19 HollowNeRF: Pruning Hashgrid-Based NeRFs with Trainable Collision Mitigation Xiufeng Xie et.al. 2308.10122 null
2023-08-19 AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization Kun Wang et.al. 2308.10001 null
2023-08-19 Semantic-Human: Neural Rendering of Humans from Monocular Video with Human Parsing Jie Zhang et.al. 2308.09894 null
2023-08-18 MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection Junkai Xu et.al. 2308.09421 link
2023-08-18 DReg-NeRF: Deep Registration for Neural Radiance Fields Yu Chen et.al. 2308.09386 link
2023-08-17 Watch Your Steps: Local Image and Scene Editing by Text Instructions Ashkan Mirzaei et.al. 2308.08947 null
2023-08-21 Ref-DVGO: Reflection-Aware Direct Voxel Grid Optimization for an Improved Quality-Efficiency Trade-Off in Reflective Scene Reconstruction Georgios Kouros et.al. 2308.08530 link
2023-08-16 SceNeRFlow: Time-Consistent Reconstruction of General Dynamic Scenes Edith Tretschk et.al. 2308.08258 null
2023-08-16 Neural radiance fields in the industrial and robotics domain: applications, research opportunities and use cases Eugen Šlapak et.al. 2308.07118 link
2023-08-14 S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural Fields Zeke Xie et.al. 2308.07032 link
2023-08-11 Focused Specific Objects NeRF Yuesong Li et.al. 2308.05970 null
2023-08-11 VERF: Runtime Monitoring of Pose Estimation with Neural Radiance Fields Dominic Maggio et.al. 2308.05939 null
2023-08-09 WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields Muyu Xu et.al. 2308.04826 null
2023-08-14 A General Implicit Framework for Fast NeRF Composition and Rendering Xinyu Gao et.al. 2308.04669 null
2023-08-08 Digging into Depth Priors for Outdoor Neural Radiance Fields Chen Wang et.al. 2308.04413 null
2023-08-07 Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing Junyi Zeng et.al. 2308.03280 null
2023-08-05 Where and How: Mitigating Confusion in Neural Radiance Fields from Sparse Inputs Yanqi Bao et.al. 2308.02908 link
2023-08-05 Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis Yuxin Wang et.al. 2308.02840 null
2023-08-05 NeRFs: The Search for the Best 3D Representation Ravi Ramamoorthi et.al. 2308.02751 null
2023-08-04 ES-MVSNet: Efficient Framework for End-to-end Self-supervised Multi-View Stereo Qiang Zhou et.al. 2308.02191 null
2023-08-02 Incorporating Season and Solar Specificity into Renderings made by a NeRF Architecture using Satellite Images Michael Gableman et.al. 2308.01262 link
2023-08-01 High-Fidelity Eye Animatable Neural Radiance Fields for Human Face Hengfei Wang et.al. 2308.00773 null
2023-08-01 Context-Aware Talking-Head Video Editing Songlin Yang et.al. 2308.00462 null
2023-07-28 Dynamic PlenOctree for Adaptive Sampling Refinement in Explicit NeRF Haotian Bai et.al. 2307.15333 null
2023-07-27 Seal-3D: Interactive Pixel-Level Editing for Neural Radiance Fields Xiangyu Wang et.al. 2307.15131 link
2023-07-27 MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving Zirui Wu et.al. 2307.15058 link
2023-07-27 NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection Chenfeng Xu et.al. 2307.14620 link
2023-07-26 Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation Chaohui Yu et.al. 2307.13908 null
2023-07-24 Dyn-E: Local Appearance Editing of Dynamic Neural Radiance Fields Shangzhan Zhang et.al. 2307.12909 null
2023-07-24 CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components Davide Di Nucci et.al. 2307.12718 null
2023-07-23 TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering Xiao Pan et.al. 2307.12291 null
2023-07-29 CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields Ziyuan Luo et.al. 2307.11526 link
2023-07-21 FaceCLIPNeRF: Text-driven 3D Face Manipulation using Deformable Neural Radiance Fields Sungwon Hwang et.al. 2307.11418 null
2023-07-21 Tri-MipRF: Tri-Mip Representation for Efficient Anti-Aliasing Neural Radiance Fields Wenbo Hu et.al. 2307.11335 null
2023-07-20 Urban Radiance Field Representation with Deformable Neural Mesh Primitives Fan Lu et.al. 2307.10776 null
2023-07-20 Lighting up NeRF via Unsupervised Decomposition and Enhancement Haoyuan Wang et.al. 2307.10664 link
2023-07-19 An Improved NeuMIP with Better Accuracy Bowen Xue et.al. 2307.10135 null
2023-07-19 Magic NeRF Lens: Interactive Fusion of Neural Radiance Fields for Virtual Facility Inspection Ke Li et.al. 2307.09860 link
2023-07-14 Transient Neural Radiance Fields for Lidar View Synthesis and 3D Reconstruction Anagh Malik et.al. 2307.09555 null
2023-07-18 Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis Jiahe Li et.al. 2307.09323 link
2023-07-16 Cross-Ray Neural Radiance Fields for Novel-view Synthesis from Unconstrained Image Collections Yifan Yang et.al. 2307.08093 link
2023-07-15 Improving NeRF with Height Data for Utilization of GIS Data Hinata Aoki et.al. 2307.07729 null
2023-07-11 SAR-NeRF: Neural Radiance Fields for Synthetic Aperture Radar Multi-View Representation Zhengxin Lei et.al. 2307.05087 null
2023-07-07 NOFA: NeRF-based One-shot Facial Avatar Reconstruction Wangbo Yu et.al. 2307.03441 null
2023-07-07 RGB-D Mapping and Tracking in a Plenoxel Radiance Field Andreas L. Teigen et.al. 2307.03404 link
2023-07-16 FlipNeRF: Flipped Reflection Rays for Few-shot Novel View Synthesis Seunghyeon Seo et.al. 2306.17723 link
2023-07-03 Sphere2Vec: A General-Purpose Location Representation Learning over a Spherical Surface for Large-Scale Geospatial Predictions Gengchen Mai et.al. 2306.17624 null
2023-06-28 Envisioning a Next Generation Extended Reality Conferencing System with Efficient Photorealistic Human Rendering Chuanyue Shen et.al. 2306.16541 null
2023-06-27 Unsupervised Polychromatic Neural Representation for CT Metal Artifact Reduction Qing Wu et.al. 2306.15203 link
2023-06-22 Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields Ori Gordon et.al. 2306.12760 link
2023-06-21 Local 3D Editing via 3D Distillation of CLIP Knowledge Junha Hyung et.al. 2306.12570 null
2023-06-21 Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase Qiuyu Wang et.al. 2306.12423 link
2023-06-21 DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation Yukun Huang et.al. 2306.12422 null
2023-06-20 NeRF synthesis with shading guidance Chenbin Li et.al. 2306.11556 null
2023-06-24 MA-NeRF: Motion-Assisted Neural Radiance Fields for Face Synthesis from Sparse Images Weichen Zhang et.al. 2306.10350 null
2023-06-15 Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion Model Lu Yu et.al. 2306.09551 null
2023-06-16 UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video Zhi-Hao Lin et.al. 2306.09349 null
2023-06-13 DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$ Allan Jabri et.al. 2306.08068 null
2023-06-13 Binary Radiance Fields Seungjoo Shin et.al. 2306.07581 null
2023-06-10 From NeRFLiX to NeRFLiX++: A General NeRF-Agnostic Restorer Paradigm Kun Zhou et.al. 2306.06388 null
2023-06-15 NERFBK: A High-Quality Benchmark for NERF-Based 3D Reconstruction Ali Karami et.al. 2306.06300 link
2023-06-09 HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork Bipasha Sen et.al. 2306.06093 null
2023-06-09 GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields Barbara Roessle et.al. 2306.06044 null
2023-06-09 RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models Xingchen Zhou et.al. 2306.05668 null
2023-06-08 LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs Zezhou Cheng et.al. 2306.05410 null
2023-06-08 Enhance-NeRF: Multiple Performance Evaluation for Neural Radiance Fields Qianqiu Tan et.al. 2306.05303 link
2023-06-06 Towards Visual Foundational Models of Physical Scenes Chethan Parameshwara et.al. 2306.03727 null
2023-06-06 Human 3D Avatar Modeling with Implicit Neural Representation: A Brief Survey Mingyang Sun et.al. 2306.03576 null
2023-06-05 H2-Mapping: Real-time Dense Mapping Using Hierarchical Hybrid Representation Chenxing Jiang et.al. 2306.03207 link
2023-06-05 BeyondPixels: A Comprehensive Review of the Evolution of Neural Radiance Fields AKM Shahariar Azad Rabby et.al. 2306.03000 null
2023-06-05 ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields Kanghyeok Ko et.al. 2306.02741 null
2023-06-01 FDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models Hao Zhang et.al. 2306.00783 link
2023-06-01 Analyzing the Internals of Neural Radiance Fields Lukas Radl et.al. 2306.00696 link
2023-06-02 AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars Mohit Mendiratta et.al. 2306.00547 null
2023-05-30 DäRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation Jiuhn Song et.al. 2305.19201 link
2023-05-30 Template-free Articulated Neural Point Clouds for Reposable View Synthesis Lukas Uzolas et.al. 2305.19065 link
2023-05-31 HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance Junzhe Zhu et.al. 2305.18766 link
2023-05-31 Towards a Robust Framework for NeRF Evaluation Adrian Azzarelli et.al. 2305.18079 link
2023-05-31 Volume Feature Rendering for Fast Neural Radiance Field Reconstruction Kang Han et.al. 2305.17916 null
2023-05-30 PlaNeRF: SVD Unsupervised 3D Plane Regularization for NeRF Large-Scale Scene Reconstruction Fusang Wang et.al. 2305.16914 null
2023-05-25 ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image Zhenzhen Weng et.al. 2305.16411 null
2023-05-25 Interactive Segment Anything NeRF with Feature Imitation Xiaokang Chen et.al. 2305.16233 null
2023-05-25 ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation Zhengyi Wang et.al. 2305.16213 link
2023-05-31 Deceptive-NeRF: Enhancing NeRF Reconstruction using Pseudo-Observations from Diffusion Models Xinhang Liu et.al. 2305.15171 null
2023-05-24 InpaintNeRF360: Text-Guided 3D Inpainting on Unbounded Neural Radiance Fields Dongqing Wang et.al. 2305.15094 null
2023-05-24 OD-NeRF: Efficient Training of On-the-Fly Dynamic Neural Radiance Fields Zhiwen Yan et.al. 2305.14831 null
2023-05-24 3D Open-vocabulary Segmentation with Foundation Models Kunhao Liu et.al. 2305.14093 link
2023-05-22 NeRFuser: Large-Scale Scene Representation by NeRF Fusion Jiading Fang et.al. 2305.13307 link
2023-05-22 Registering Neural Radiance Fields as 3D Density Images Han Jiang et.al. 2305.12843 null
2023-05-19 Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields Jingbo Zhang et.al. 2305.11588 link
2023-05-18 MVPSNet: Fast Generalizable Multi-view Photometric Stereo Dongxu Zhao et.al. 2305.11167 null
2023-05-18 ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis Shoukang Hu et.al. 2305.11031 link
2023-05-17 MultiPlaneNeRF: Neural Radiance Field with Non-Trainable Representation Dominik Zimny et.al. 2305.10579 link
2023-05-24 OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields Youtan Yin et.al. 2305.10503 link
2023-05-16 NerfBridge: Bringing Real-time, Online Neural Radiance Field Training to Robotics Javier Yu et.al. 2305.09761 link
2023-05-15 MV-Map: Offboard HD-Map Generation with Multi-view Consistency Ziyang Xie et.al. 2305.08851 link
2023-05-12 BundleRecon: Ray Bundle-Based 3D Neural Reconstruction Weikun Zhang et.al. 2305.07342 null
2023-05-10 Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era Chenghao Li et.al. 2305.06131 null
2023-05-10 NeRF$^\textbf{2}$: Neural Radio-Frequency Radiance Fields Xiaopeng Zhao et.al. 2305.06118 null
2023-05-09 Instant-NeRF: Instant On-Device Neural Radiance Field Training via Algorithm-Accelerator Co-Designed Near-Memory Processing Yang Zhao et.al. 2305.05766 null
2023-05-09 PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces Yiqun Wang et.al. 2305.05594 link
2023-05-08 NerfAcc: Efficient Sampling Accelerates NeRFs Ruilong Li et.al. 2305.04966 null
2023-05-08 AvatarReX: Real-time Expressive Full-body Avatars Zerong Zheng et.al. 2305.04789 null
2023-05-07 HashCC: Lightweight Method to Improve the Quality of the Camera-less NeRF Scene Generation Jan Olszewski et.al. 2305.04296 null
2023-05-07 Multi-Space Neural Radiance Fields Ze-Xin Yin et.al. 2305.04268 null
2023-05-04 NeRF-QA: Neural Radiance Fields Quality Assessment Database Pedro Martin et.al. 2305.03176 null
2023-05-04 NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds Jun-Kun Chen et.al. 2305.03049 null
2023-05-04 Radiance Field Gradient Scaling for Unbiased Near-Camera Training Julien Philip et.al. 2305.02756 link
2023-05-04 Semantic-aware Generation of Multi-view Portrait Drawings Biao Ma et.al. 2305.02618 link
2023-05-02 Neural LiDAR Fields for Novel View Synthesis Shengyu Huang et.al. 2305.01643 null
2023-05-03 LatentAvatar: Learning Latent Expression Code for Expressive Neural Head Avatar Yuelang Xu et.al. 2305.01190 null
2023-05-02 Federated Neural Radiance Fields Lachlan Holden et.al. 2305.01163 link
2023-05-01 GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation Zhenhui Ye et.al. 2305.00787 null
2023-04-30 Neural Radiance Fields (NeRFs): A Review and Some Recent Developments Mohamed Debbagh et.al. 2305.00375 null
2023-04-28 ViP-NeRF: Visibility Prior for Sparse Input Neural Radiance Fields Nagabhushan Somraj et.al. 2305.00041 link
2023-04-28 NeRF-LiDAR: Generating Realistic LiDAR Point Clouds with Neural Radiance Fields Junge Zhang et.al. 2304.14811 link
2023-04-27 Learning a Diffusion Prior for NeRFs Guandao Yang et.al. 2304.14473 null
2023-04-27 ActorsNeRF: Animatable Few-shot Human Rendering with Generalizable NeRFs Jiteng Mu et.al. 2304.14401 null
2023-05-03 Combining HoloLens with Instant-NeRFs: Advanced Real-Time 3D Mobile Mapping Dennis Haitz et.al. 2304.14301 null
2023-04-27 Compositional 3D Human-Object Neural Animation Zhi Hou et.al. 2304.14070 null
2023-04-26 Super-NeRF: View-consistent Detail Generation for NeRF super-resolution Yuqi Han et.al. 2304.13518 null
2023-04-26 VGOS: Voxel Grid Optimization for View Synthesis from Sparse Inputs Jiakai Sun et.al. 2304.13386 link
2023-04-25 Local Implicit Ray Function for Generalizable Radiance Field Representation Xin Huang et.al. 2304.12746 null
2023-04-27 MF-NeRF: Memory Efficient NeRF with Mixed-Feature Hash Table Yongjae Lee et.al. 2304.12587 link
2023-04-24 Instant-3D: Instant Neural Radiance Field Training Towards On-Device AR/VR 3D Reconstruction Sixu Li et.al. 2304.12467 null
2023-04-24 TextMesh: Generation of Realistic 3D Meshes From Text Prompts Christina Tsalicoglou et.al. 2304.12439 null
2023-04-26 Segment Anything in 3D with NeRFs Jiazhong Cen et.al. 2304.12308 link
2023-04-24 Explicit Correspondence Matching for Generalizable Neural Radiance Fields Yuedong Chen et.al. 2304.12294 link
2023-04-25 Gen-NeRF: Efficient and Generalizable Neural Radiance Fields via Algorithm-Hardware Co-Design Yonggan Fu et.al. 2304.11842 null
2023-04-22 3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes Haotian Xue et.al. 2304.11470 null
2023-04-22 Dehazing-NeRF: Neural Radiance Fields from Hazy Images Tian Li et.al. 2304.11448 null
2023-04-22 NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation Baao Xie et.al. 2304.11342 link
2023-04-21 AutoNeRF: Training Implicit Scene Representations with Autonomous Agents Pierre Marza et.al. 2304.11241 link
2023-04-21 Omni-Line-of-Sight Imaging for Holistic Shape Reconstruction Binbin Huang et.al. 2304.10780 null
2023-04-20 A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion Miriam Jäger et.al. 2304.10664 null
2023-04-20 Learning Neural Duplex Radiance Fields for Real-Time View Synthesis Ziyu Wan et.al. 2304.10537 null
2023-04-21 Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs Frederik Warburg et.al. 2304.10532 link
2023-04-20 ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects Marco Toschi et.al. 2304.10448 null
2023-04-20 LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields Tang Tao et.al. 2304.10406 link
2023-04-20 Revisiting Implicit Neural Representations in Low-Level Vision Wentian Xu et.al. 2304.10250 link
2023-04-20 Multiscale Representation for Real-Time Anti-Aliasing Neural Rendering Dongting Hu et.al. 2304.10075 null
2023-04-20 Neural Radiance Fields: Past, Present, and Future Ansh Mittal et.al. 2304.10050 link
2023-04-19 Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra Jonas Kulhanek et.al. 2304.09987 link
2023-04-20 Reference-guided Controllable Inpainting of Neural Radiance Fields Ashkan Mirzaei et.al. 2304.09677 null
2023-04-18 SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes Yiming Gao et.al. 2304.08971 null
2023-04-18 NeAI: A Pre-convoluted Representation for Plug-and-Play Neural Ambient Illumination Yiyu Zhuang et.al. 2304.08757 null
2023-04-17 MoDA: Modeling Deformable 3D Objects from Casual Videos Chaoyue Song et.al. 2304.08279 link
2023-04-17 NeRF-Loc: Visual Localization with Conditional Neural Radiance Field Jianlin Liu et.al. 2304.07979 link
2023-04-16 Likelihood-Based Generative Radiance Field with Latent Space Energy-Based Model for 3D-Aware Disentangled Image Representation Yaxuan Zhu et.al. 2304.07918 null
2023-04-16 CAT-NeRF: Constancy-Aware Tx$^2$Former for Dynamic Body Modeling Haidong Zhu et.al. 2304.07915 link
2023-04-16 SeaThru-NeRF: Neural Radiance Fields in Scattering Media Deborah Levy et.al. 2304.07743 link
2023-04-14 UVA: Towards Unified Volumetric Avatar for View Synthesis, Pose rendering, Geometry and Texture Editing Jinlong Fan et.al. 2304.06969 null
2023-04-17 Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction Hansheng Chen et.al. 2304.06714 link
2023-04-13 Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields Jonathan T. Barron et.al. 2304.06706 null
2023-04-13 NeRFVS: Neural Radiance Fields for Free View Synthesis via Geometry Scaffolds Chen Yang et.al. 2304.06287 null
2023-04-12 NutritionVerse-Thin: An Optimized Strategy for Enabling Improved Rendering of 3D Thin Food Models Chi-en Amy Tai et.al. 2304.05620 null
2023-04-11 Improving Neural Radiance Fields with Depth-aware Optimization for Novel View Synthesis Shu Chen et.al. 2304.05218 link
2023-04-11 One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field Weichuang Li et.al. 2304.05097 null
2023-04-11 MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields Ganlin Yang et.al. 2304.04962 link
2023-04-10 Neural Image-based Avatars: Generalizable Radiance Fields for Human Avatar Modeling Youngjoong Kwon et.al. 2304.04897 null
2023-04-07 Event-based Camera Tracker by $\nabla$t NeRF Mana Masuda et.al. 2304.04559 null
2023-04-10 Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos Liao Wang et.al. 2304.04452 null
2023-04-10 Inferring Fluid Dynamics via Inverse Rendering Jinxian Liu et.al. 2304.04446 null
2023-04-10 Instance Neural Radiance Field Benran Hu et.al. 2304.04395 link
2023-04-12 NeRF applied to satellite imagery for surface reconstruction Federico Semeraro et.al. 2304.04133 link
2023-04-08 PVD-AL: Progressive Volume Distillation with Active Learning for Efficient Conversion Between Different NeRF Architectures Shuangkang Fang et.al. 2304.04012 link
2023-04-07 Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field Leheng Li et.al. 2304.03526 null
2023-04-06 Beyond NeRF Underwater: Learning Neural Reflectance Fields for True Color Correction of Marine Imagery Tianyi Zhang et.al. 2304.03384 link
2023-04-06 LANe: Lighting-Aware Neural Fields for Compositional Scene Synthesis Akshay Krishnan et.al. 2304.03280 null
2023-04-06 Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes Zian Wang et.al. 2304.03266 null
2023-04-06 DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model Hoigi Seo et.al. 2304.02827 null
2023-04-05 Image Stabilization for Hololens Camera in Remote Collaboration Gowtham Senthil et.al. 2304.02736 null
2023-04-04 Generating Continual Human Motion in Diverse 3D Scenes Aymen Mir et.al. 2304.02061 null
2023-04-04 MonoHuman: Animatable Human Neural Field from Monocular Video Zhengming Yu et.al. 2304.02001 null
2023-04-06 DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models Yukang Cao et.al. 2304.00916 link
2023-04-01 JacobiNeRF: NeRF Shaping with Mutual Information Gradients Xiaomeng Xu et.al. 2304.00341 link
2023-03-31 VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization Bingfan Zhu et.al. 2303.17968 link
2023-03-30 NeRF-Supervised Deep Stereo Fabio Tosi et.al. 2303.17603 link
2023-03-30 SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling Zhitao Yang et.al. 2303.17368 link
2023-03-30 NeILF++: Inter-Reflectable Light Fields for Geometry and Material Estimation Jingyang Zhang et.al. 2303.17147 null
2023-03-30 Enhanced Stable View Synthesis Nishant Jain et.al. 2303.17094 null
2023-03-29 TriVol: Point Cloud Rendering via Triple Volumes Tao Hu et.al. 2303.16485 link
2023-03-29 Point2Pix: Photo-Realistic Point Cloud Rendering via Neural Radiance Fields Tao Hu et.al. 2303.16482 null
2023-03-28 Flow supervision for Deformable NeRF Chaoyang Wang et.al. 2303.16333 null
2023-03-28 SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis Guangcong Wang et.al. 2303.16196 link
2023-03-28 VMesh: Hybrid Volume-Mesh Representation for Efficient View Synthesis Yuan-Chen Guo et.al. 2303.16184 null
2023-03-30 Adaptive Voronoi NeRFs Tim Elsner et.al. 2303.16001 null
2023-03-28 F$^{2}$-NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories Peng Wang et.al. 2303.15951 link
2023-03-27 JAWS: Just A Wild Shot for Cinematic Transfer in Neural Radiance Fields Xi Wang et.al. 2303.15427 link
2023-03-27 Generalizable Neural Voxels for Fast Human Radiance Fields Taoran Yi et.al. 2303.15387 null
2023-03-27 NeUDF: Learning Unsigned Distance Fields from Multi-view Images for Reconstructing Non-watertight Models Fei Hou et.al. 2303.15368 link
2023-03-24 Perceptual Quality Assessment of NeRF and Neural View Synthesis Methods for Front-Facing Views Hanxue Liang et.al. 2303.15206 null
2023-03-27 3D-Aware Multi-Class Image-to-Image Translation with NeRFs Senmao Li et.al. 2303.15012 link
2023-03-26 Clean-NeRF: Reformulating NeRF to account for View-Dependent Observations Xinhang Liu et.al. 2303.14707 null
2023-03-25 SUDS: Scalable Urban Dynamic Scenes Haithem Turki et.al. 2303.14536 null
2023-03-25 DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields Yu Chen et.al. 2303.14478 null
2023-03-25 NeRF-DS: Neural Radiance Fields for Dynamic Specular Objects Zhiwen Yan et.al. 2303.14435 link
2023-03-24 Grid-guided Neural Radiance Fields for Large Urban Scenes Linning Xu et.al. 2303.14001 null
2023-03-24 CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D Scene Layout Yiqi Lin et.al. 2303.13843 null
2023-03-24 HandNeRF: Neural Radiance Fields for Animatable Interacting Hands Zhiyang Guo et.al. 2303.13825 null
2023-03-24 ABLE-NeRF: Attention-Based Rendering with Learnable Embeddings for Neural Radiance Field Zhe Jun Tang et.al. 2303.13817 link
2023-03-24 GM-NeRF: Learning Generalizable Model-based Neural Radiance Fields from Multi-view Images Jianchuan Chen et.al. 2303.13777 null
2023-03-24 TEGLO: High Fidelity Canonical Texture Mapping from Single-View Images Vishal Vinod et.al. 2303.13743 null
2023-03-23 SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates Mikaela Angelina Uy et.al. 2303.13582 null
2023-03-23 TriPlaneNet: An Encoder for EG3D Inversion Ananta R. Bhattarai et.al. 2303.13497 null
2023-03-23 Plotting Behind the Scenes: Towards Learnable Game Engines Willi Menapace et.al. 2303.13472 null
2023-03-23 Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes Dana Cohen-Bar et.al. 2303.13450 link
2023-03-23 SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field Chong Bao et.al. 2303.13277 link
2023-03-23 Transforming Radiance Field with Lipschitz Network for Photorealistic 3D Scene Stylization Zicheng Zhang et.al. 2303.13232 null
2023-03-23 Semantic Ray: Learning a Generalizable Semantic Field with Cross-Reprojection Attention Fangfu Liu et.al. 2303.13014 link
2023-03-22 NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions Mohamad Shahbazi et.al. 2303.12865 link
2023-03-22 SHERF: Generalizable Human NeRF from a Single Image Shoukang Hu et.al. 2303.12791 link
2023-03-22 Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions Ayaan Haque et.al. 2303.12789 null
2023-03-22 FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models Jianglong Ye et.al. 2303.12786 link
2023-03-22 Balanced Spherical Grid for Egocentric View Synthesis Changwoon Choi et.al. 2303.12408 link
2023-03-21 Pre-NeRF 360: Enriching Unbounded Appearances for Neural Radiance Fields Ahmad AlMughrabi et.al. 2303.12234 link
2023-03-21 3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion Yu-Jhe Li et.al. 2303.11938 null
2023-03-22 ExtremeNeRF: Few-shot Neural Radiance Fields Under Unconstrained Illumination SeokYeong Lee et.al. 2303.11728 null
2023-03-20 DehazeNeRF: Multiple Image Haze Removal and 3D Shape Reconstruction using Neural Radiance Fields Wei-Ting Chen et.al. 2303.11364 null
2023-03-20 ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-real Novel View Synthesis via Contrastive Learning Hao Yang et.al. 2303.11052 null
2023-03-19 SKED: Sketch-guided Text-based 3D Editing Aryan Mikaeili et.al. 2303.10735 null
2023-03-19 NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping Junyuan Deng et.al. 2303.10709 link
2023-03-18 3D Data Augmentation for Driving Scenes on Camera Wenwen Tong et.al. 2303.10340 null
2023-03-17 $α$Surf: Implicit Surface Reconstruction for Semi-Transparent and Thin Objects with Decoupled Geometry and Opacity Tianhao Wu et.al. 2303.10083 null
2023-03-17 Single-view Neural Radiance Fields with Depth Teacher Yurui Chen et.al. 2303.09952 null
2023-03-21 PartNeRF: Generating Part-Aware Editable 3D Shapes without 3D Supervision Konstantinos Tertikas et.al. 2303.09554 null
2023-03-16 LERF: Language Embedded Radiance Fields Justin Kerr et.al. 2303.09553 null
2023-03-16 NeRFMeshing: Distilling Neural Radiance Fields into Geometrically-Accurate 3D Meshes Marie-Julie Rakotosaona et.al. 2303.09431 null
2023-03-17 NeRFtrinsic Four: An End-To-End Trainable NeRF Jointly Optimizing Diverse Intrinsic and Extrinsic Camera Parameters Hannah Schieber et.al. 2303.09412 link
2023-03-16 Reliable Image Dehazing by NeRF Zheyan Jin et.al. 2303.09153 null
2023-03-15 Mesh Strikes Back: Fast and Efficient Human Reconstruction from RGB videos Rohit Jena et.al. 2303.08808 null
2023-03-15 Re-ReND: Real-time Rendering of NeRFs across Devices Sara Rojas et.al. 2303.08717 link
2023-03-15 RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters Shuja Khalid et.al. 2303.08695 null
2023-03-15 Harnessing Low-Frequency Neural Fields for Few-Shot View Synthesis Liangchen Song et.al. 2303.08370 link
2023-03-14 MELON: NeRF with Unposed Images Using Equivalence Class Estimation Axel Levy et.al. 2303.08096 null
2023-03-16 Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation Junyoung Seo et.al. 2303.07937 link
2023-03-16 NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction from Multi-view Images Yunfan Ye et.al. 2303.07653 link
2023-03-14 Frequency-Modulated Point Cloud Rendering with Easy Editing Yi Zhang et.al. 2303.07596 link
2023-03-13 FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization Jiawei Yang et.al. 2303.07418 link
2023-03-13 NeRFLiX: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-viewpoint MiXer Kun Zhou et.al. 2303.06919 link
2023-03-11 Just Flip: Flipped Observation Generation and Optimization for Neural Radiance Fields to Cover Unobserved View Minjae Lee et.al. 2303.06335 link
2023-03-10 NeRFlame: FLAME-based conditioning of NeRF for 3D face rendering Wojciech Zając et.al. 2303.06226 link
2023-03-10 You Only Train Once: Multi-Identity Free-Viewpoint Neural Human Rendering from Monocular Videos Jaehyeok Kim et.al. 2303.05835 null
2023-03-10 Aleth-NeRF: Low-light Condition View Synthesis with Concealing Fields Ziteng Cui et.al. 2303.05807 null
2023-03-10 Self-NeRF: A Self-Training Pipeline for Few-Shot Neural Radiance Fields Jiayang Bai et.al. 2303.05775 null
2023-03-14 Hardware Acceleration of Neural Graphics Muhammad Husnain Mubarik et.al. 2303.05735 null
2023-03-10 MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field Kaizhi Yang et.al. 2303.05703 null
2023-03-09 PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification Xuan Li et.al. 2303.05512 null
2023-03-08 FastSurf: Fast Neural RGB-D Surface Reconstruction using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning Seunghwan Lee et.al. 2303.04508 link
2023-03-08 DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields Dipam Patel et.al. 2303.04322 null
2023-03-07 NEPHELE: A Neural Platform for Highly Realistic Cloud Radiance Rendering Haimin Luo et.al. 2303.04086 null
2023-03-05 Semantic-aware Occlusion Filtering Neural Radiance Fields in the Wild Jaewon Lee et.al. 2303.03966 null
2023-03-07 Multiscale Tensor Decomposition and Rendering Equation Encoding for View Synthesis Kang Han et.al. 2303.03808 link
2023-03-10 Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation from 2D Supervision Xiaoshuai Zhang et.al. 2303.03361 null
2023-03-07 Efficient Large-scale Scene Representation with a Hybrid of High-resolution Grid and Plane Features Yuqi Zhang et.al. 2303.03003 link
2023-03-03 Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement Jiaxiang Tang et.al. 2303.02091 link
2023-03-03 Multi-Plane Neural Radiance Fields for Novel View Synthesis Youssef Abdelkareem et.al. 2303.01736 null
2023-03-01 S-NeRF: Neural Radiance Fields for Street Views Ziyang Xie et.al. 2303.00749 null
2023-02-28 IntrinsicNGP: Intrinsic Coordinate based Hash Encoding for Human NeRF Bo Peng et.al. 2302.14683 null
2023-02-27 BaLi-RF: Bandlimited Radiance Fields for Dynamic Scene Modeling Sameera Ramasinghe et.al. 2302.13543 null
2023-02-26 Efficient physics-informed neural networks using hash encoding Xinquan Huang et.al. 2302.13397 null
2023-02-24 CATNIPS: Collision Avoidance Through Neural Implicit Probabilistic Scenes Timothy Chen et.al. 2302.12931 link
2023-02-24 Learning Neural Volumetric Representations of Dynamic Humans in Minutes Chen Geng et.al. 2302.12237 link
2023-02-23 DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models Jamie Wynn et.al. 2302.12231 link
2023-02-20 NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion Jiatao Gu et.al. 2302.10109 null
2023-02-19 LC-NeRF: Local Controllable Face Generation in Neural Randiance Field Wenyang Zhou et.al. 2302.09486 null
2023-02-17 MixNeRF: Modeling a Ray with Mixture Density for Novel View Synthesis from Sparse Inputs Seunghyeon Seo et.al. 2302.08788 link
2023-02-14 VQ3D: Learning a 3D-Aware Generative Model on ImageNet Kyle Sargent et.al. 2302.06833 null
2023-02-13 3D-aware Blending with Generative NeRFs Hyunsu Kim et.al. 2302.06608 link
2023-02-11 3D Colored Shape Reconstruction from a Single RGB Image through Diffusion Bo Li et.al. 2302.05573 null
2023-02-08 Nerfstudio: A Modular Framework for Neural Radiance Field Development Matthew Tancik et.al. 2302.04264 null
2023-02-07 AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis Susan Liang et.al. 2302.02088 null
2023-02-03 Semantic 3D-aware Portrait Synthesis and Manipulation Based on Compositional Neural Radiance Field Tianxiang Ma et.al. 2302.01579 link
2023-02-03 Robust Camera Pose Refinement for Multi-Resolution Hash Encoding Hwan Heo et.al. 2302.01571 null
2023-02-03 INV: Towards Streaming Incremental Neural Videos Shengze Wang et.al. 2302.01532 null
2023-02-02 Factor Fields: A Unified Framework for Neural Fields and Beyond Anpei Chen et.al. 2302.01226 null
2023-02-02 RobustNeRF: Ignoring Distractors with Robust Losses Sara Sabour et.al. 2302.00833 null
2023-01-31 GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis Zhenhui Ye et.al. 2301.13430 null
2023-01-30 Equivariant Architectures for Learning in Deep Weight Spaces Aviv Navon et.al. 2301.12780 link
2023-01-27 HyperNeRFGAN: Hypernetwork approach to 3D NeRF GAN Adam Kania et.al. 2301.11631 link
2023-01-27 A Comparison of Tiny-nerf versus Spatial Representations for 3d Reconstruction Saulo Abraham Gante et.al. 2301.11522 null
2023-01-27 SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning Dongseok Shim et.al. 2301.11520 null
2023-01-26 Text-To-4D Dynamic Scene Generation Uriel Singer et.al. 2301.11280 null
2023-01-26 GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency Minseop Kwak et.al. 2301.10941 link
2023-01-23 HexPlane: A Fast Representation for Dynamic Scenes Ang Cao et.al. 2301.09632 link
2023-01-22 3D Reconstruction of Non-cooperative Resident Space Objects using Instant NGP-accelerated NeRF and D-NeRF Trupti Mahendrakar et.al. 2301.09060 null
2023-01-18 NeRF in the Palm of Your Hand: Corrective Augmentation for Robotics via Novel-View Synthesis Allan Zhou et.al. 2301.08556 null
2023-01-19 RecolorNeRF: Layer Decomposed Radiance Field for Efficient Color Editing of 3D Scenes Bingchen Gong et.al. 2301.07958 null
2023-01-18 Behind the Scenes: Density Fields for Single View Reconstruction Felix Wimbauer et.al. 2301.07668 link
2023-01-17 A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction Chongshan Lu et.al. 2301.06782 null
2023-01-13 Laser: Latent Set Representations for 3D Generative Modeling Pol Moreno et.al. 2301.05747 null
2023-01-10 Benchmarking Robustness in Neural Radiance Fields Chen Wang et.al. 2301.04075 null
2023-01-08 Towards Open World NeRF-Based SLAM Daniil Lisus et.al. 2301.03102 null
2023-01-10 Traditional Readability Formulas Compared for English Bruce W. Lee et.al. 2301.02975 null
2023-01-09 Class-Continuous Conditional Generative Neural Radiance Field Jiwook Kim et.al. 2301.00950 link
2023-01-11 Detachable Novel Views Synthesis of Dynamic Scenes Using Distribution-Driven Neural Radiance Fields Boyu Zhang et.al. 2301.00411 link
2022-12-26 MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos Fengrui Tian et.al. 2212.13056 link
2022-12-25 PaletteNeRF: Palette-based Color Editing for NeRFs Qiling Wu et.al. 2212.12871 null
2022-12-22 Removing Objects From Neural Radiance Fields Silvan Weder et.al. 2212.11966 null
2022-12-21 Incremental Learning for Neural Radiance Field with Uncertainty-Filtered Knowledge Distillation Mengqi Guo et.al. 2212.10950 link
2022-12-21 PaletteNeRF: Palette-based Appearance Editing of Neural Radiance Fields Zhengfei Kuang et.al. 2212.10699 null
2022-12-20 Correspondence Distillation from NeRF-based GAN Yushi Lan et.al. 2212.09735 null
2022-12-19 StyleTRF: Stylizing Tensorial Radiance Fields Rahul Goel et.al. 2212.09330 null
2022-12-18 SPARF: Large-Scale Learning of 3D Sparse Radiance Fields from Few Input Images Abdullah Hamdi et.al. 2212.09100 link
2022-12-18 Masked Wavelet Representation for Compact Neural Radiance Fields Daniel Rho et.al. 2212.09069 link
2022-12-15 SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory Sicheng Li et.al. 2212.08476 null
2022-12-16 MEIL-NeRF: Memory-Efficient Incremental Learning of Neural Radiance Fields Jaeyoung Chung et.al. 2212.08328 null
2022-12-15 NeRF-Art: Text-Driven Neural Radiance Fields Stylization Can Wang et.al. 2212.08070 link
2022-12-15 Real-Time Neural Light Field on Mobile Devices Junli Cao et.al. 2212.08057 link
2022-12-14 NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior Wenjing Bian et.al. 2212.07388 link
2022-12-08 GazeNeRF: 3D-Aware Gaze Redirection with Neural Radiance Fields Alessandro Ruzzi et.al. 2212.04823 link
2022-12-09 4K-NeRF: High Fidelity Neural Radiance Fields at Ultra High Resolutions Zhongshu Wang et.al. 2212.04701 link
2022-12-07 EditableNeRF: Editing Topologically Varying Neural Radiance Fields by Key Points Chengwei Zheng et.al. 2212.04247 null
2022-12-08 NeRFEditor: Differentiable Style Decomposition for Full 3D Scene Editing Chunyi Sun et.al. 2212.03848 null
2022-12-07 Non-uniform Sampling Strategies for NeRF on 360° images Takashi Otonari et.al. 2212.03635 null
2022-12-07 SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields Siddhant Ranade et.al. 2212.03406 null
2022-12-06 NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors Congyue Deng et.al. 2212.03267 null
2022-12-05 SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields Anh-Quan Cao et.al. 2212.02501 link
2022-12-05 Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields Rohith Agaram et.al. 2212.02493 link
2022-12-06 D-TensoRF: Tensorial Radiance Fields for Dynamic Scenes Hankyu Jang et.al. 2212.02375 null
2022-12-07 GARF: Geometry-Aware Generalized Neural Radiance Field Yue Shi et.al. 2212.02280 null
2022-12-05 INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors Chaojian Li et.al. 2212.01959 null
2022-12-03 MaRF: Representing Mars as Neural Radiance Fields Lorenzo Giusti et.al. 2212.01672 link
2022-12-03 StegaNeRF: Embedding Invisible Information within Neural Radiance Fields Chenxin Li et.al. 2212.01602 null
2022-12-02 RT-NeRF: Real-Time On-Device Neural Radiance Fields Towards Immersive AR/VR Rendering Chaojian Li et.al. 2212.01120 null
2022-12-02 3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation Zutao Jiang et.al. 2212.01103 null
2022-12-02 QFF: Quantized Fourier Features for Neural Field Representations Jae Yong Lee et.al. 2212.00914 null
2022-12-01 ViewNeRF: Unsupervised Viewpoint Estimation Using Category-Level Neural Radiance Fields Octave Mariotti et.al. 2212.00436 null
2022-11-30 NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-shot Real Image Animation Yu Yin et.al. 2211.17235 null
2022-11-29 NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360° Views Dejia Xu et.al. 2211.16431 link
2022-11-29 Compressing Volumetric Radiance Fields to 1 MB Lingzhi Li et.al. 2211.16386 link
2022-11-28 In-Hand 3D Object Scanning from an RGB Sequence Shreyas Hampali et.al. 2211.16193 null
2022-11-30 One is All: Bridging the Gap Between Neural Radiance Fields Architectures with Progressive Volume Distillation Shuangkang Fang et.al. 2211.15977 link
2022-11-28 High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors Yunpeng Bai et.al. 2211.15064 null
2022-11-27 SuNeRF: Validation of a 3D Global Reconstruction of the Solar Corona Using Simulated EUV Images Kyriaki-Margarita Bintsi et.al. 2211.14879 null
2022-11-27 3D Scene Creation and Rendering via Rough Meshes: A Lighting Transfer Avenue Yujie Li et.al. 2211.14823 null
2022-11-27 Sampling Neural Radiance Fields for Refractive Objects Jen-I Pan et.al. 2211.14799 link
2022-11-25 3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models Gang Li et.al. 2211.14108 null
2022-11-25 ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision Jingwang Ling et.al. 2211.14086 link
2022-11-25 Dynamic Neural Portraits Michail Christos Doukas et.al. 2211.13994 null
2022-11-25 Unsupervised Continual Semantic Adaptation through Neural Rendering Zhizheng Liu et.al. 2211.13969 link
2022-11-25 TPA-Net: Generate A Dataset for Text to Physics-based Animation Yuxing Qiu et.al. 2211.13887 null
2022-11-24 ScanNeRF: a Scalable Benchmark for Neural Radiance Fields Luca De Luigi et.al. 2211.13762 null
2022-11-24 Immersive Neural Graphics Primitives Ke Li et.al. 2211.13494 link
2022-11-23 CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields Keqiang Sun et.al. 2211.13251 null
2022-11-26 ClimateNeRF: Physically-based Neural Rendering for Extreme Climate Synthesis Yuan Li et.al. 2211.13226 null
2022-11-23 ManVatar: Fast 3D Head Avatar Reconstruction Using Motion-Aware Neural Voxels Yuelang Xu et.al. 2211.13206 null
2022-11-23 BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields Peng Wang et.al. 2211.12853 link
2022-11-23 PANeRF: Pseudo-view Augmentation for Improved Neural Radiance Fields Based on Few-shot Inputs Young Chun Ahn et.al. 2211.12758 null
2022-11-23 ActiveRMAP: Radiance Field for Active Mapping And Planning Huangying Zhan et.al. 2211.12656 null
2022-11-22 Zero NeRF: Registration with Zero Overlap Casey Peat et.al. 2211.12544 null
2022-11-22 Depth-Supervised NeRF for Multi-View RGB-D Operating Room Images Beerend G. A. Gerats et.al. 2211.12436 null
2022-11-22 Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition Jiaxiang Tang et.al. 2211.12368 null
2022-11-22 Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields Brian K. S. Isaac-Medina et.al. 2211.12285 link
2022-11-22 SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields Ashkan Mirzaei et.al. 2211.12254 null
2022-11-22 Deblurred Neural Radiance Field with Physical Scene Priors Dogyoon Lee et.al. 2211.12046 link
2022-11-22 ONeRF: Unsupervised 3D Object Segmentation from Multiple Views Shengnan Liang et.al. 2211.12038 null
2022-11-21 Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques David Ramirez et.al. 2211.11836 null
2022-11-21 SPARF: Neural Radiance Fields from Sparse and Noisy Poses Prune Truong et.al. 2211.11738 link
2022-11-21 ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields Mohammad Mahdi Johari et.al. 2211.11704 null
2022-11-21 Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion Dario Pavllo et.al. 2211.11674 link
2022-11-18 Magic3D: High-Resolution Text-to-3D Content Creation Chen-Hsuan Lin et.al. 2211.10440 null
2022-11-17 AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training Yifan Jiang et.al. 2211.09682 null
2022-11-16 CoNFies: Controllable Neural Face Avatars Heng Yu et.al. 2211.08610 null
2022-11-14 Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures Gal Metzer et.al. 2211.07600 link
2022-11-12 3D-Aware Encoding for Style-based Neural Radiance Fields Yu-Jhe Li et.al. 2211.06583 null
2022-11-11 ParticleNeRF: A Particle-Based Encoding for Online Neural Radiance Fields in Dynamic Scenes Jad Abou-Chakra et.al. 2211.04041 null
2022-11-07 Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories Samarth Sinha et.al. 2211.03889 null
2022-11-03 nerf2nerf: Pairwise Registration of Neural Radiance Fields Lily Goli et.al. 2211.01600 null
2022-10-27 ProbNeRF: Uncertainty-Aware Inference of 3D Shapes from 2D Images Matthew D. Hoffman et.al. 2210.17415 null
2022-10-27 Boosting Point Clouds Rendering via Radiance Mapping Xiaoyang Huang et.al. 2210.15107 link
2022-10-24 Learning Neural Radiance Fields from Multi-View Geometry Marco Orsingher et.al. 2210.13041 null
2022-10-23 Compressing Explicit Voxel Grid Representations: fast NeRFs become also small Chenxi Lola Deng et.al. 2210.12782 null
2022-11-06 Joint Rigid Motion Correction and Sparse-View CT via Self-Calibrating Neural Field Qing Wu et.al. 2210.12731 null
2022-10-21 An Exploration of Neural Radiance Field Scene Reconstruction: Synthetic, Real-world and Dynamic Scenes Benedict Quartey et.al. 2210.12268 null
2022-11-06 Neural Fields for Robotic Object Manipulation from a Single Image Valts Blukis et.al. 2210.12126 null
2022-10-21 HDHumans: A Hybrid Approach for High-fidelity Digital Humans Marc Habermann et.al. 2210.12003 null
2022-10-21 RGB-Only Reconstruction of Tabletop Scenes for Collision-Free Manipulator Control Zhenggang Tang et.al. 2210.11668 null
2022-10-21 Coordinates Are NOT Lonely – Codebook Prior Helps Implicit Neural 3D Representations Fukun Yin et.al. 2210.11170 link
2022-10-18 Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation Yunzhi Lin et.al. 2210.10108 link
2022-10-18 ARAH: Animatable Volume Rendering of Articulated Human SDFs Shaofei Wang et.al. 2210.10036 null
2022-10-20 Differentiable Physics Simulation of Dynamics-Augmented Neural Objects Simon Le Cleac’h et.al. 2210.09420 null
2022-10-15 SPIDR: SDF-based Neural Point Fields for Illumination and Deformation Ruofan Liang et.al. 2210.08398 null
2022-10-15 IBL-NeRF: Image-Based Lighting Formulation of Neural Radiance Fields Changwoon Choi et.al. 2210.08202 link
2022-10-17 3D GAN Inversion with Pose Optimization Jaehoon Ko et.al. 2210.07301 link
2022-10-13 Multiplane NeRF-Supervised Disentanglement of Depth and Camera Pose from Videos Yang Fu et.al. 2210.07181 null
2022-10-12 GraspNeRF: Multiview-based 6-DoF Grasp Detection for Transparent and Specular Objects Using Generalizable NeRF Qiyu Dai et.al. 2210.06575 link
2022-10-12 Reconstructing Personalized Semantic Facial NeRF Models From Monocular Video Xuan Gao et.al. 2210.06108 link
2022-10-11 X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360° Insufficient RGB-D Views Haoyi Zhu et.al. 2210.05135 link
2022-10-10 NeRF2Real: Sim2real Transfer of Vision-guided Bipedal Motion Skills using Neural Radiance Fields Arunkumar Byravan et.al. 2210.04932 null
2022-10-10 EVA3D: Compositional 3D Human Generation from 2D Image Collections Fangzhou Hong et.al. 2210.04888 link
2022-10-13 NerfAcc: A General NeRF Acceleration Toolbox Ruilong Li et.al. 2210.04847 link
2022-10-10 SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction Yitong Xia et.al. 2210.04553 link
2022-10-09 Robustifying the Multi-Scale Representation of Neural Radiance Fields Nishant Jain et.al. 2210.04233 null
2022-10-09 Estimating Neural Reflectance Field from Radiance Field using Tree Structures Xiu Li et.al. 2210.04217 null
2022-10-09 Data augmentation for NeRF: a geometric consistent solution based on view morphing Matteo Bortolon et.al. 2210.04214 link
2022-10-09 Towards Efficient Neural Scene Graphs by Learning Consistency Fields Yeji Song et.al. 2210.04127 null
2022-10-08 ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints Yinpeng Dong et.al. 2210.03895 link
2022-10-04 SelfNeRF: Fast Training NeRF for Human from Monocular Self-rotating Video Bo Peng et.al. 2210.01651 null
2022-10-03 NARF22: Neural Articulated Radiance Fields for Configuration-Aware Rendering Stanley Lewis et.al. 2210.01166 null
2022-10-02 IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis Weicai Ye et.al. 2210.00647 link
2022-10-02 Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation Xinhang Liu et.al. 2210.00489 null
2022-10-01 NeRF: Neural Radiance Field in 3D Vision, A Comprehensive Review Kyle Gao et.al. 2210.00379 null
2022-10-01 Structure-Aware NeRF without Posed Camera via Epipolar Constraint Shu Chen et.al. 2210.00183 link
2022-09-30 Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator Zifan Shi et.al. 2209.15637 null
2022-09-30 Understanding Pure CLIP Guidance for Voxel Grid NeRF Models Han-Hung Lee et.al. 2209.15172 null
2022-09-29 DreamFusion: Text-to-3D using 2D Diffusion Ben Poole et.al. 2209.14988 null
2022-09-29 SymmNeRF: Learning to Explore Symmetry Prior for Single-View View Synthesis Xingyi Li et.al. 2209.14819 link
2022-10-03 360FusionNeRF: Panoramic Neural Radiance Fields with Joint Guidance Shreyas Kulkarni et.al. 2209.14265 link
2022-09-27 OmniNeRF: Hybriding Omnidirectional Distance and Radiance fields for Neural Surface Reconstruction Jiaming Shen et.al. 2209.13433 null
2022-09-27 Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping Chi-Ming Chung et.al. 2209.13274 link
2022-09-27 WaterNeRF: Neural Radiance Fields for Underwater Scenes Advaith Venkatramanan Sethuraman et.al. 2209.13091 null
2022-09-26 Baking in the Feature: Accelerating Volumetric Segmentation by Rendering Feature Maps Kenneth Blomqvist et.al. 2209.12744 null
2022-09-25 Enforcing safety for vision-based controllers via Control Barrier Functions and Neural Radiance Fields Mukun Tong et.al. 2209.12266 null
2022-09-24 NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields Jiankai Sun et.al. 2209.12068 null
2022-09-19 Loc-NeRF: Monte Carlo Localization using Neural Radiance Fields Dominic Maggio et.al. 2209.09050 link
2022-09-23 NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes Zhiwen Fan et.al. 2209.08776 link
2022-09-19 Density-aware NeRF Ensembles: Quantifying Predictive Uncertainty in Neural Radiance Fields Niko Sünderhauf et.al. 2209.08718 null
2022-09-18 ActiveNeRF: Learning where to See with Uncertainty Estimation Xuran Pan et.al. 2209.08546 link
2022-09-18 LATITUDE: Robotic Global Localization with Truncated Dynamic Low-pass Filter in City-scale NeRF Zhenxin Zhu et.al. 2209.08498 link
2022-09-16 iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking Yuhang Ming et.al. 2209.07919 null
2022-09-12 StructNeRF: Neural Radiance Fields for Indoor Scenes with Structural Hints Zheng Chen et.al. 2209.05277 null
2022-09-09 Generative Deformable Radiance Fields for Disentangled Image Synthesis of Topology-Varying Objects Ziyu Wang et.al. 2209.04183 null
2022-09-08 im2nerf: Image to Neural Radiance Field in the Wild Lu Mi et.al. 2209.04061 null
2022-09-08 PixTrack: Precise 6DoF Object Pose Tracking using NeRF Templates and Feature-metric Alignment Prajwal Chidananda et.al. 2209.03910 link
2022-09-07 Neural Feature Fusion Fields: 3D Distillation of Self-Supervised 2D Image Representations Vadim Tschernezki et.al. 2209.03494 null
2022-08-29 Volume Rendering Digest (for NeRF) Andrea Tagliasacchi et.al. 2209.02417 null
2022-09-06 CLONeR: Camera-Lidar Fusion for Occupancy Grid-aided Neural Representations Alexandra Carlson et.al. 2209.01194 null
2022-09-01 On Quantizing Implicit Neural Representations Cameron Gordon et.al. 2209.01019 null
2022-08-31 Dual-Space NeRF: Learning Animatable Avatars and Scene Lighting in Separate Spaces Yihao Zhi et.al. 2208.14851 link
2022-08-30 A Portable Multiscopic Camera for Novel View and Time Synthesis in Dynamic Scenes Tianjia Zhang et.al. 2208.14433 null
2022-08-24 PeRFception: Perception using Radiance Fields Yoonwoo Jeong et.al. 2208.11537 link
2022-08-24 E-NeRF: Neural Radiance Fields from a Moving Event Camera Simon Klenk et.al. 2208.11300 link
2022-08-18 Neural Capture of Animatable 3D Human from Monocular Video Gusi Te et.al. 2208.08728 null
2022-08-16 Casual Indoor HDR Radiance Capture from Omnidirectional Images Pulkit Gera et.al. 2208.07903 null
2022-08-15 DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images Bing Wang et.al. 2208.07227 link
2022-08-11 RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild Jason Y. Zhang et.al. 2208.05963 null
2022-08-11 FDNeRF: Few-shot Dynamic Neural Radiance Fields for Face Reconstruction and Expression Editing Jingbo Zhang et.al. 2208.05751 link
2022-08-04 360Roam: Real-Time Indoor Roaming Using Geometry-Aware 360° Radiance Fields Huajian Huang et.al. 2208.02705 null
2022-08-02 T4DT: Tensorizing Time for Learning Temporal 3D Visual Data Mikhail Usvyatsov et.al. 2208.01421 link
2022-08-01 DoF-NeRF: Depth-of-Field Meets Neural Radiance Fields Zijin Wu et.al. 2208.00945 link
2022-08-06 MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures Zhiqin Chen et.al. 2208.00277 link
2022-07-30 Distilled Low Rank Neural Radiance Field with Quantization for Light Field Compression Jinglei Shi et.al. 2208.00164 null
2022-08-01 End-to-end View Synthesis via NeRF Attention Zelin Zhao et.al. 2207.14741 null
2022-07-29 Neural Density-Distance Fields Itsuki Ueda et.al. 2207.14455 link
2022-07-27 Is Attention All NeRF Needs? Mukund Varma T et.al. 2207.13298 null

Gaussian Splatting

Publish Date Title Authors PDF Code
2025-07-15 A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction Haoxuan Qu et.al. 2507.11321 null
2025-07-16 TRAN-D: 2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update Jeongyun Kim et.al. 2507.11069 null
2025-07-15 Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling Hayeon Kim et.al. 2507.11061 null
2025-07-14 ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions Shivangi Aneja et.al. 2507.10542 null
2025-07-14 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving Yixun Zhang et.al. 2507.09993 null
2025-07-11 Learning human-to-robot handovers through 3D scene reconstruction Yuekun Wu et.al. 2507.08726 null
2025-07-11 RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting Ji Hyun Seo et.al. 2507.08434 null
2025-07-10 Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction Hyungjun Doh et.al. 2507.08137 null
2025-07-10 RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration Chong Cheng et.al. 2507.08136 null
2025-07-10 RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection Yongyang Zhou et.al. 2507.07733 null
2025-07-10 MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation Bangning Wei et.al. 2507.07519 null
2025-07-10 SD-GS: Structured Deformable 3D Gaussians for Efficient Dynamic Scene Reconstruction Wei Yao et.al. 2507.07465 null
2025-07-10 Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image Collections Yongtang Bao et.al. 2507.07395 null
2025-07-09 LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS Wanhua Li et.al. 2507.07136 null
2025-07-09 Enhancing non-Rigid 3D Model Deformations Using Mesh-based Gaussian Splatting Wijayathunga W. M. R. D. B et.al. 2507.07000 null
2025-07-09 Photometric Stereo using Gaussian Splatting and inverse rendering Matéo Ducastel et.al. 2507.06684 null
2025-07-09 FlexGaussian: Flexible and Cost-Effective Training-Free Compression for 3D Gaussian Splatting Boyuan Tian et.al. 2507.06671 null
2025-07-09 ClipGS: Clippable Gaussian Splatting for Interactive Cinematic Visualization of Volumetric Medical Data Chengkun Li et.al. 2507.06647 null
2025-07-08 LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures Seungoh Han et.al. 2507.06109 null
2025-07-08 Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering Jiayi Song et.al. 2507.06103 null
2025-07-08 VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis Alexandre Symeonidis-Herzig et.al. 2507.06060 null
2025-07-08 D-FCGS: Feedforward Compression of Dynamic Gaussian Splatting for Free-Viewpoint Videos Wenkang Zhang et.al. 2507.05859 null
2025-07-08 DreamArt: Generating Interactable Articulated Objects from a Single Image Ruijie Lu et.al. 2507.05763 null
2025-07-08 3DGS_LSR: Large_Scale Relocation for Autonomous Driving Based on 3D Gaussian Splatting Haitao Lu et.al. 2507.05661 null
2025-07-07 Mastering Regional 3DGS: Locating, Initializing, and Editing with Diverse 2D Priors Lanqing Guo et.al. 2507.05426 null
2025-07-07 SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation Jiahao Zhu et.al. 2507.05256 null
2025-07-07 InterGSEdit: Interactive 3D Gaussian Splatting Editing with 3D Geometry-Consistent Attention Prior Minghao Wen et.al. 2507.04961 null
2025-07-05 A3FR: Agile 3D Gaussian Splatting with Incremental Gaze Tracked Foveated Rendering in Virtual Reality Shuo Xin et.al. 2507.04147 null
2025-07-05 Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM Xiaolei Lang et.al. 2507.04004 null
2025-07-05 ArmGS: Composite Gaussian Appearance Refinement for Modeling Dynamic Urban Environments Guile Wu et.al. 2507.03886 null
2025-07-04 Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps Chong Cheng et.al. 2507.03737 null
2025-07-03 HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars Gent Serifi et.al. 2507.02803 null
2025-07-03 ArtGS: 3D Gaussian Splatting for Interactive Visual-Physical Modeling and Manipulation of Articulated Objects Qiaojun Yu et.al. 2507.02600 null
2025-07-03 LocalDyGS: Multi-view Global Dynamic Scene Modeling via Adaptive Local Implicit Feature Decoupling Jiahao Wu et.al. 2507.02363 null
2025-07-03 Gbake: Baking 3D Gaussian Splats into Reflection Probes Stephen Pasch et.al. 2507.02257 null
2025-07-02 3D Gaussian Splatting Driven Multi-View Robust Physical Adversarial Camouflage Generation Tianrui Lou et.al. 2507.01367 null
2025-07-01 VISTA: Open-Vocabulary, Task-Relevant Robot Exploration with Online Semantic Gaussian Splatting Keiko Nagami et.al. 2507.01125 null
2025-07-01 A LoD of Gaussians: Unified Training and Rendering for Ultra-Large Scale Reconstruction with External Memory Felix Windisch et.al. 2507.01110 null
2025-07-01 Masks make discriminative models great again! Tianshi Cao et.al. 2507.00916 null
2025-07-01 GaussianVLM: Scene-centric 3D Vision-Language Models using Language-aligned Gaussian Splats for Embodied Reasoning and Beyond Anna-Maria Halacheva et.al. 2507.00886 null
2025-07-01 LOD-GS: Level-of-Detail-Sensitive 3D Gaussian Splatting for Detail Conserved Anti-Aliasing Zhenya Yang et.al. 2507.00554 null
2025-07-01 GDGS: 3D Gaussian Splatting Via Geometry-Guided Initialization And Dynamic Density Control Xingjun Wang et.al. 2507.00363 null
2025-06-30 MILo: Mesh-In-the-Loop Gaussian Splatting for Detailed and Efficient Surface Reconstruction Antoine Guédon et.al. 2506.24096 null
2025-06-30 GaVS: 3D-Grounded Video Stabilization via Temporally-Consistent Local Reconstruction and Rendering Zinuo You et.al. 2506.23957 null
2025-06-30 AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention Ziao Liu et.al. 2506.23611 null
2025-06-30 Instant GaussianImage: A Generalizable and Self-Adaptive Image Representation via 2D Gaussian Splatting Zhaojie Zeng et.al. 2506.23479 null
2025-07-01 SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting Yiming Huang et.al. 2506.23309 null
2025-06-29 Endo-4DGX: Robust Endoscopic Scene Reconstruction and Illumination Correction with Gaussian Splatting Yiming Huang et.al. 2506.23308 null
2025-06-29 TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints Zhen Tan et.al. 2506.23207 null
2025-06-29 STD-GS: Exploring Frame-Event Interaction for SpatioTemporal-Disentangled Gaussian Splatting to Reconstruct High-Dynamic Scene Hanyu Zhou et.al. 2506.23157 null
2025-06-29 From Coarse to Fine: Learnable Discrete Wavelet Transforms for Efficient 3D Gaussian Splatting Hung Nguyen et.al. 2506.23042 null
2025-06-28 Confident Splatting: Confidence-Based Compression of 3D Gaussian Splatting via Learnable Beta Distributions AmirHossein Naghi Razlighi et.al. 2506.22973 null
2025-06-27 DIGS: Dynamic CBCT Reconstruction using Deformation-Informed 4D Gaussian Splatting and a Low-Rank Free-Form Deformation Model Yuliang Huang et.al. 2506.22280 null
2025-06-27 BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting Zipei Ma et.al. 2506.22099 null
2025-06-26 MADrive: Memory-Augmented Driving Scene Modeling Polina Karpikova et.al. 2506.21520 null
2025-06-26 EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting Taoyu Wu et.al. 2506.21420 null
2025-06-28 Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction Zhirui Gao et.al. 2506.21401 null
2025-06-26 Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image Pufan Li et.al. 2506.21152 null
2025-06-26 CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization Jan Ackermann et.al. 2506.21117 null
2025-06-26 User-in-the-Loop View Sampling with Error Peaking Visualization Ayaka Yasunaga et.al. 2506.21009 null
2025-06-26 DBMovi-GS: Dynamic View Synthesis from Blurry Monocular Video via Sparse-Controlled Gaussian Splatting Yeon-Ji Song et.al. 2506.20998 null
2025-06-25 3DGH: 3D Head Generation with Composable Hair and Face Chengan He et.al. 2506.20875 null
2025-06-25 RaRa Clipper: A Clipper for Gaussian Splatting Based on Ray Tracer and Rasterizer Da Li et.al. 2506.20202 null
2025-06-24 ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model Tengbo Yu et.al. 2506.19842 null
2025-06-24 Virtual Memory for 3D Gaussian Splatting Jonathan Haberl et.al. 2506.19415 null
2025-06-24 HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis Xiaoyuan Wang et.al. 2506.19291 null
2025-06-23 GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM Annika Thomas et.al. 2506.18885 null
2025-06-23 ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs Michal Nazarczuk et.al. 2506.18792 null
2025-06-23 3D Arena: An Open Platform for Generative 3D Evaluation Dylan Ebert et.al. 2506.18787 null
2025-06-23 Reconstructing Tornadoes in 3D with Gaussian Splatting Adam Yang et.al. 2506.18677 null
2025-06-21 3D Gaussian Splatting for Fine-Detailed Surface Reconstruction in Large-Scale Scene Shihan Chen et.al. 2506.17636 null
2025-06-20 Part²GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting Tianjiao Yu et.al. 2506.17212 null
2025-06-23 R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision Weeyoung Kwon et.al. 2506.16262 link
2025-06-19 Information-computation trade-offs in non-linear transforms Connor Ding et.al. 2506.15948 null
2025-06-18 Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos Kaifeng Zhang et.al. 2506.15680 null
2025-06-18 RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories Qingsong Yan et.al. 2506.15242 null
2025-06-17 Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction Zhengquan Zhang et.al. 2506.14856 null
2025-06-17 SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting Ziqiao Peng et.al. 2506.14742 null
2025-06-17 3DGS-IEval-15K: A Large-scale Image Quality Evaluation Database for 3D Gaussian-Splatting Yuke Xing et.al. 2506.14642 link
2025-06-17 HRGS: Hierarchical Gaussian Splatting for Memory-Efficient High-Resolution 3D Reconstruction Changbai Li et.al. 2506.14229 null
2025-06-17 GAF: Gaussian Action Field as a Dynamic World Model for Robotic Manipulation Ying Chai et.al. 2506.14135 null
2025-06-16 GRaD-Nav++: Vision-Language Model Enabled Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics Qianzhong Chen et.al. 2506.14009 null
2025-06-16 PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images Lingteng Qiu et.al. 2506.13766 null
2025-06-16 Micro-macro Gaussian Splatting with Enhanced Scalability for Unconstrained Scene Reconstruction Yihui Li et.al. 2506.13516 link
2025-06-16 Multiview Geometric Regularization of Gaussian Splatting for Accurate Radiance Fields Jungeon Kim et.al. 2506.13508 null
2025-06-16 TextureSplat: Per-Primitive Texture Mapping for Reflective Gaussian Splatting Mae Younes et.al. 2506.13348 link
2025-06-16 GS-2DGS: Geometrically Supervised 2DGS for Reflective Object Reconstruction Jinguang Tong et.al. 2506.13110 null
2025-06-15 Metropolis-Hastings Sampling for 3D Gaussian Reconstruction Hyunjin Kim et.al. 2506.12945 null
2025-06-15 Rasterizing Wireless Radiance Field via Deformable 2D Gaussian Splatting Mufan Liu et.al. 2506.12787 null
2025-06-17 Efficient multi-view training for 3D Gaussian Splatting Minhyuk Choi et.al. 2506.12727 null
2025-06-15 Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors Wen-Hsuan Chu et.al. 2506.12716 null
2025-06-14 Perceptual-GS: Scene-adaptive Perceptual Densification for Gaussian Splatting Hongbi Zhou et.al. 2506.12400 link
2025-06-12 Anti-Aliased 2D Gaussian Splatting Mae Younes et.al. 2506.11252 link
2025-06-12 PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting Lintao Xiang et.al. 2506.10335 null
2025-06-11 DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos Chieh Hubert Lin et.al. 2506.09997 null
2025-06-11 UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting Ziyi Wang et.al. 2506.09952 link
2025-06-11 DynaSplat: Dynamic-Static Gaussian Splatting with Hierarchical Motion Decomposition for Scene Reconstruction Junli Deng et.al. 2506.09836 null
2025-06-11 Self-Supervised Multi-Part Articulated Objects Modeling via Deformable Gaussian Splatting and Progressive Primitive Segmentation Haowen Wang et.al. 2506.09663 null
2025-06-11 Gaussian Herding across Pens: An Optimal Transport Perspective on Global Gaussian Reduction for 3DGS Tao Wang et.al. 2506.09534 null
2025-06-11 HAIF-GS: Hierarchical and Induced Flow-Guided Gaussian Splatting for Dynamic Scene Jianing Chen et.al. 2506.09518 null
2025-06-11 TinySplat: Feedforward Approach for Generating Compact 3D Scene Representation Zetian Song et.al. 2506.09479 null
2025-06-12 ODG: Occupancy Prediction Using Dual Gaussians Yunxiao Shi et.al. 2506.09417 null
2025-06-11 UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images Qijian Tian et.al. 2506.09378 null
2025-06-10 StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams Zike Wu et.al. 2506.08862 link
2025-06-11 Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting Keyi Liu et.al. 2506.08777 null
2025-06-10 SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting Mengjiao Ma et.al. 2506.08710 null
2025-06-10 TraGraph-GS: Trajectory Graph-based Gaussian Splatting for Arbitrary Large-Scale Scene Rendering Xiaohan Zhang et.al. 2506.08704 null
2025-06-10 Complex-Valued Holographic Radiance Fields Yicheng Zhan et.al. 2506.08350 null
2025-06-09 Speedy Deformable 3D Gaussian Splatting: Fast Rendering and Compression of Dynamic Scenes Allen Tu et.al. 2506.07917 link
2025-06-09 GaussianVAE: Adaptive Learning Dynamics of 3D Gaussians for High-Fidelity Super-Resolution Shuja Khalid et.al. 2506.07897 null
2025-06-09 R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation William Ljungbergh et.al. 2506.07826 null
2025-06-09 OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting Jens Piekenbrinck et.al. 2506.07697 null
2025-06-09 ProSplat: Improved Feed-Forward 3D Gaussian Splatting for Wide-Baseline Sparse Views Xiaohan Lu et.al. 2506.07670 null
2025-06-09 PIG: Physically-based Multi-Material Interaction with 3D Gaussians Zeyu Xiao et.al. 2506.07657 null
2025-06-09 Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation Yijie Deng et.al. 2506.07338 null
2025-06-08 Accelerating 3D Gaussian Splatting with Neural Sorting and Axis-Oriented Rasterization Zhican Wang et.al. 2506.07069 null
2025-06-08 Hybrid Mesh-Gaussian Representation for Efficient Indoor Scene Reconstruction Binxiao Huang et.al. 2506.06988 null
2025-06-07 Gaussian Mapping for Evolving Scenes Vladimir Yugay et.al. 2506.06909 null
2025-06-06 Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments Mingrui Li et.al. 2506.05965 null
2025-06-06 SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction Yuchao Zheng et.al. 2506.05935 null
2025-06-06 Lumina: Real-Time Mobile Neural Rendering by Exploiting Computational Redundancy Yu Feng et.al. 2506.05682 null
2025-06-05 VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction Ziyue Zhu et.al. 2506.05563 null
2025-06-05 On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images Andreas Meuleman et.al. 2506.05558 null
2025-06-05 ODE-GS: Latent ODEs for Dynamic Scene Extrapolation with 3D Gaussian Splatting Daniel Wang et.al. 2506.05480 null
2025-06-05 Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting Duochao Shi et.al. 2506.05327 null
2025-06-06 Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting Nan Wang et.al. 2506.05280 link
2025-06-05 Synthetic Dataset Generation for Autonomous Mobile Robots Using 3D Gaussian Splatting for Vision Training Aneesh Deogan et.al. 2506.05092 null
2025-06-05 UAV4D: Dynamic Neural Rendering of Human-Centric UAV Imagery using Gaussian Splatting Jaehoon Choi et.al. 2506.05011 null
2025-06-05 Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting Alfred T. Christiansen et.al. 2506.05009 null
2025-06-05 Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer Filip Slezak et.al. 2506.04908 null
2025-06-05 Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations Gaia Di Lorenzo et.al. 2506.04789 null
2025-06-04 Photoreal Scene Reconstruction from an Egocentric Device Zhaoyang Lv et.al. 2506.04444 link
2025-06-04 HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting Maksym Ivashechkin et.al. 2506.04351 null
2025-06-04 Pseudo-Simulation for Autonomous Driving Wei Cao et.al. 2506.04218 link
2025-06-04 FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting Hengyu Liu et.al. 2506.04174 null
2025-06-04 Splatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data Ben Moran et.al. 2506.04120 null
2025-06-04 JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting Yang Xiao et.al. 2506.03872 null
2025-06-04 SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting Shengjie Lin et.al. 2506.03594 link
2025-06-04 Robust Neural Rendering in the Wild with Asymmetric Dual 3D Gaussian Splatting Chengqi Li et.al. 2506.03538 null
2025-06-03 Multi-Spectral Gaussian Splatting with Neural Color Representation Lukas Meyer et.al. 2506.03407 null
2025-06-03 LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM Roman Titkov et.al. 2506.03073 null
2025-06-03 Large Processor Chip Model Kaiyan Chang et.al. 2506.02929 null
2025-06-04 Voyager: Real-Time Splatting City-Scale 3D Gaussians on Your Phone Zheng Liu et.al. 2506.02774 null
2025-06-03 RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS Chuanyu Fu et.al. 2506.02751 null
2025-06-03 EyeNavGS: A 6-DoF Navigation Dataset and Record-n-Replay Software for Real-World 3DGS Scenes in VR Zihao Ding et.al. 2506.02380 link
2025-06-02 GSCodec Studio: A Modular Framework for Gaussian Splat Compression Sicheng Li et.al. 2506.01822 link
2025-06-02 WorldExplorer: Towards Generating Fully Navigable 3D Scenes Manuel-Andreas Schneider et.al. 2506.01799 null
2025-06-02 WoMAP: World Models For Embodied Open-Vocabulary Object Localization Tenny Yin et.al. 2506.01600 null
2025-06-02 RadarSplat: Radar Gaussian Splatting for High-Fidelity Data Synthesis and 3D Reconstruction of Autonomous Driving Scenes Pou-Chun Kung et.al. 2506.01379 null
2025-06-01 CountingFruit: Real-Time 3D Fruit Counting with Language-Guided Semantic Gaussian Splatting Fengze Li et.al. 2506.01109 null
2025-05-30 AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion Yangyi Huang et.al. 2505.24877 null
2025-05-30 TC-GS: A Faster Gaussian Splatting Module Utilizing Tensor Cores Zimu Liao et.al. 2505.24796 link
2025-05-30 Tackling View-Dependent Semantics in 3D Language Gaussian Splatting Jiazhong Cen et.al. 2505.24746 link
2025-05-30 GARLIC: GAussian Representation LearnIng for spaCe partitioning Panagiotis Rigas et.al. 2505.24608 null
2025-05-30 LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework Xin Kang et.al. 2505.24245 null
2025-05-29 3DGEER: Exact and Efficient Volumetric Rendering with 3D Gaussians Zixun Huang et.al. 2505.24053 link
2025-05-30 ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS Weijie Wang et.al. 2505.23734 link
2025-05-29 AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views Lihan Jiang et.al. 2505.23716 null
2025-05-29 Mobi-$π$: Mobilizing Your Robot Learning Policy Jingyun Yang et.al. 2505.23692 null
2025-05-29 Radiant Triangle Soup with Soft Connectivity Forces for 3D Reconstruction and Novel View Synthesis Nathaniel Burgdorfer et.al. 2505.23642 null
2025-05-29 Holistic Large-Scale Scene Reconstruction via Mixed Gaussian Splatting Chuandong Liu et.al. 2505.23280 link
2025-05-29 LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering Jonas Kulhanek et.al. 2505.23158 null
2025-05-29 Pose-free 3D Gaussian splatting via shape-ray estimation Youngju Na et.al. 2505.22978 null
2025-05-28 3DGS Compression with Sparsity-guided Hierarchical Transform Coding Hao Xu et.al. 2505.22908 null
2025-05-28 CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting Kornel Howil et.al. 2505.22854 link
2025-05-28 STDR: Spatio-Temporal Decoupling for Real-Time Dynamic Scene Rendering Zehao Li et.al. 2505.22400 null
2025-05-28 UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments Wancai Zheng et.al. 2505.22335 null
2025-05-28 Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss Wenjun Lu et.al. 2505.22279 null
2025-05-28 Hyperspectral Gaussian Splatting Sunil Kumar Narayanan et.al. 2505.21890 null
2025-05-27 Generalizable and Relightable Gaussian Splatting for Human Novel View Synthesis Yipengjing Sun et.al. 2505.21502 null
2025-05-27 Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility Yidi Li et.al. 2505.21377 link
2025-05-27 Structure from Collision Takuhiro Kaneko et.al. 2505.21335 null
2025-05-29 3D-UIR: 3D Gaussian for Underwater 3D Scene Reconstruction via Physics Based Appearance-Medium Decoupling Jieyu Yuan et.al. 2505.21238 null
2025-05-28 CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians Weihang Liu et.al. 2505.21041 null
2025-05-27 Intern-GS: Vision Model Guided Sparse-View 3D Gaussian Splatting Xiangyu Sun et.al. 2505.20729 null
2025-05-27 Wideband RF Radiance Field Modeling Using Frequency-embedded 3D Gaussian Splatting Zechen Li et.al. 2505.20714 link
2025-05-26 CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting Lei Tian et.al. 2505.20469 null
2025-05-26 ParticleGS: Particle-Based Dynamics Modeling of 3D Gaussians for Prior-free Motion Extrapolation Jinsheng Quan et.al. 2505.20270 link
2025-05-26 HaloGS: Loose Coupling of Compact Geometry and Gaussian Splats for 3D Scenes Changjian Jiang et.al. 2505.20267 null
2025-05-26 OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender Shintaro Ito et.al. 2505.20126 link
2025-05-26 Weather-Magician: Reconstruction and Rendering Framework for 4D Weather Synthesis In Real Time Chen Sang et.al. 2505.19919 null
2025-05-26 Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud Natsuki Takama et.al. 2505.19854 null
2025-05-26 K-Buffers: A Plug-in Method for Enhancing Neural Fields with Multiple Buffers Haofan Ren et.al. 2505.19564 link
2025-05-26 ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Wenhua Wu et.al. 2505.19420 null
2025-05-25 Improving Novel view synthesis of 360$^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images Guangan Chen et.al. 2505.19264 link
2025-05-25 Triangle Splatting for Real-Time Radiance Field Rendering Jan Held et.al. 2505.19175 null
2025-05-25 FHGS: Feature-Homogenized Gaussian Splatting Q. G. Duan et.al. 2505.19154 null
2025-05-25 Veta-GS: View-dependent deformable 3D Gaussian Splatting for thermal infrared Novel-view Synthesis Myeongseok Nam et.al. 2505.19138 null
2025-05-25 VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes Tianchen Deng et.al. 2505.18992 link
2025-05-23 SplatCo: Structure-View Collaborative Gaussian Splatting for Detail-Preserving Rendering of Large-Scale Unbounded Scenes Haihong Xiao et.al. 2505.17951 null
2025-05-23 CGS-GAN: 3D Consistent Gaussian Splatting GANs for High Resolution Human Head Synthesis Florian Barthel et.al. 2505.17590 link
2025-05-23 From Flight to Insight: Semantic 3D Reconstruction for Aerial Inspection via Gaussian Splatting and Language-Guided Segmentation Mahmoud Chick Zaouali et.al. 2505.17402 null
2025-05-22 Render-FM: A Foundation Model for Real-time Photorealistic Volumetric Rendering Zhongpai Gao et.al. 2505.17338 null
2025-05-22 SHaDe: Compact and Consistent Dynamic 3D Reconstruction via Tri-Plane Deformation and Latent Diffusion Asrar Alruwayqi et.al. 2505.16535 null
2025-05-22 Motion Matters: Compact Gaussian Streaming for Free-Viewpoint Video Reconstruction Jiacong Chen et.al. 2505.16533 null
2025-05-21 RUSplatting: Robust 3D Gaussian Splatting for Sparse-View Underwater Scene Reconstruction Zhuodong Jiang et.al. 2505.15737 null
2025-05-21 PlantDreamer: Achieving Realistic 3D Plant Models with Diffusion-Guided Gaussian Splatting Zane K J Hartley et.al. 2505.15528 null
2025-05-21 R3GS: Gaussian Splatting for Robust Reconstruction and Relocalization in Unconstrained Image Collections Xu Yan et.al. 2505.15294 null
2025-05-21 GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation Yuchen Li et.al. 2505.15287 null
2025-05-21 X-GRM: Large Gaussian Reconstruction Model for Sparse-view X-rays to Computed Tomography Yifan Liu et.al. 2505.15235 link
2025-05-21 GT^2-GS: Geometry-aware Texture Transfer for Gaussian Splatting Wenjie Liu et.al. 2505.15208 null
2025-05-21 MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models Yifan Liu et.al. 2505.15185 link
2025-05-20 Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning Amine Elhafsi et.al. 2505.14938 null
2025-05-20 Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image Yuxuan Wang et.al. 2505.14537 null
2025-05-20 MGStream: Motion-aware 3D Gaussian for Streamable Dynamic Scene Reconstruction Zhenyu Bao et.al. 2505.13839 link
2025-05-19 Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos Ruoyu Wang et.al. 2505.13440 link
2025-05-19 Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene Representation Seungjun Oh et.al. 2505.13215 link
2025-05-19 3D Gaussian Adaptive Reconstruction for Fourier Light-Field Microscopy Chenyu Xu et.al. 2505.12875 null
2025-05-19 TACOcc: Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy Luyao Lei et.al. 2505.12693 null
2025-05-18 Is Semantic SLAM Ready for Embedded Systems? A Comparative Survey Calvin Galagain et.al. 2505.12384 null
2025-05-17 GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity Takuya Ikeda et.al. 2505.11905 null
2025-05-17 MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos Hongyi Zhou et.al. 2505.11868 null
2025-05-17 Gaussian Splatting as a Unified Representation for Autonomy in Unstructured Environments Dexter Ong et.al. 2505.11794 null
2025-05-16 Exploiting Radiance Fields for Grasp Generation on Novel Synthetic Views Abhishek Kashyap et.al. 2505.11467 null
2025-05-16 GrowSplat: Constructing Temporal Digital Twins of Plants with Gaussian Splats Simeon Adebola et.al. 2505.10923 null
2025-05-16 EA-3DGS: Efficient and Adaptive 3D Gaussians with Highly Enhanced Quality for outdoor scenes Jianlin Guo et.al. 2505.10787 link
2025-05-14 ExploreGS: a vision-based low overhead framework for 3D scene reconstruction Yunji Feng et.al. 2505.10578 null
2025-05-15 Consistent Quantity-Quality Control across Scenes for Deployment-Aware Gaussian Splatting Fengdi Zhang et.al. 2505.10473 link
2025-05-15 VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality Xuechang Tu et.al. 2505.10144 link
2025-05-15 Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field Jinlong Fan et.al. 2505.10049 link
2025-05-15 Large-Scale Gaussian Splatting SLAM Zhe Xin et.al. 2505.09915 null
2025-05-14 Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware Justin Yu et.al. 2505.09601 null
2025-05-14 Neural Video Compression using 2D Gaussian Splatting Lakshya Gupta et.al. 2505.09324 null
2025-05-15 NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance Wenzhe Cai et.al. 2505.08712 null
2025-05-13 DLO-Splatting: Tracking Deformable Linear Objects Using 3D Gaussian Splatting Holly Dinkel et.al. 2505.08644 null
2025-05-13 FOCI: Trajectory Optimization on Gaussian Splats Mario Gomez Andreu et.al. 2505.08510 null
2025-05-13 A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering Chuanzhi Xu et.al. 2505.08438 null
2025-05-13 ADC-GS: Anchor-Driven Deformable and Compressed Gaussian Splatting for Dynamic Scene Reconstruction He Huang et.al. 2505.08196 link
2025-05-12 SLAG: Scalable Language-Augmented Gaussian Splatting Laszlo Szilagyi et.al. 2505.08124 null
2025-05-12 GIFStream: 4D Gaussian-based Immersive Video with Feature Stream Hao Li et.al. 2505.07539 null
2025-05-13 TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset Olaf Wysocki et.al. 2505.07396 null
2025-05-10 Virtualized 3D Gaussians: Flexible Cluster-based Level-of-Detail System for Real-Time Rendering of Composed Scenes Xijie Yang et.al. 2505.06523 null
2025-05-08 TeGA: Texture Space Gaussian Avatars for High-Resolution Dynamic Head Modeling Gengyan Li et.al. 2505.05672 null
2025-05-08 UltraGauss: Ultrafast Gaussian Reconstruction of 3D Ultrasound Volumes Mark C. Eid et.al. 2505.05643 null
2025-05-08 QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization Yueh-Cheng Liu et.al. 2505.05591 null
2025-05-08 Steepest Descent Density Control for Compact 3D Gaussian Splatting Peihao Wang et.al. 2505.05587 null
2025-05-08 SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation Yonwoo Choi et.al. 2505.05475 link
2025-05-08 Time of the Flight of the Gaussians: Optimizing Depth Indirectly in Dynamic Radiance Fields Runfeng Li et.al. 2505.05356 null
2025-05-07 SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction Xinran Yang et.al. 2505.04668 link
2025-05-07 GSsplat: Generalizable Semantic Gaussian Splatting for Novel-view Synthesis in 3D Scenes Feng Xiao et.al. 2505.04659 link
2025-05-07 Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting Feng Yang et.al. 2505.04262 null
2025-05-06 3D Gaussian Splatting Data Compression with Mixture of Priors Lei Liu et.al. 2505.03310 null
2025-05-04 Sparfels: Fast Reconstruction from Sparse Unposed Imagery Shubhendu Jena et.al. 2505.02178 null
2025-05-04 SparSplat: Fast Multi-View Reconstruction with Generalizable 2D Gaussian Splatting Shubhendu Jena et.al. 2505.02175 null
2025-05-04 GarmentGS: Point-Cloud Guided Gaussian Splatting for High-Fidelity Non-Watertight 3D Garment Reconstruction Zhihao Tang et.al. 2505.02126 null
2025-05-04 SignSplat: Rendering Sign Language via Gaussian Splatting Maksym Ivashechkin et.al. 2505.02108 null
2025-05-03 HybridGS: High-Efficiency Gaussian Splatting Data Compression using Dual-Channel Sparse Representation and Point Cloud Encoder Qi Yang et.al. 2505.01938 link
2025-05-03 GenSync: A Generalized Talking Head Framework for Audio-driven Multi-Subject Lip-Sync using 3D Gaussian Splatting Anushka Agarwal et.al. 2505.01928 null
2025-05-03 Visual enhancement and 3D representation for underwater scenes: a review Guoxi Huang et.al. 2505.01869 null
2025-05-03 AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting Junhao Shi et.al. 2505.01799 null
2025-05-02 FalconWing: An Open-Source Platform for Ultra-Light Fixed-Wing Aircraft Research Yan Miao et.al. 2505.01383 null
2025-05-02 Compensating Spatiotemporally Inconsistent Observations for Online Dynamic 3D Gaussian Splatting Youngsik Yun et.al. 2505.01235 null
2025-04-30 A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond Jiajia Li et.al. 2505.00737 link
2025-05-01 Real-Time Animatable 2DGS-Avatars with Detail Enhancement from Monocular Videos Xia Yuan et.al. 2505.00421 null
2025-04-30 HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation Haiyang Zhou et.al. 2504.21650 link
2025-04-29 GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction Yuhan Xie et.al. 2504.21067 link
2025-04-29 GaussTrap: Stealthy Poisoning Attacks on 3D Gaussian Splatting for Targeted Scene Confusion Jiaxin Hong et.al. 2504.20829 null
2025-04-29 EfficientHuman: Efficient Training and Reconstruction of Moving Human using Articulated 2D Gaussian Hao Tian et.al. 2504.20607 null
2025-04-29 Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting Hanxi Liu et.al. 2504.20403 null
2025-05-01 GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting Jongwon Lee et.al. 2504.20379 null
2025-04-29 Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views Jiang Wu et.al. 2504.20378 link
2025-04-28 Mesh-Learner: Texturing Mesh with Spherical Harmonics Yunfei Wan et.al. 2504.19938 link
2025-04-28 CE-NPBG: Connectivity Enhanced Neural Point-Based Graphics for Novel View Synthesis in Autonomous Driving Scenes Mohammad Altillawi et.al. 2504.19557 null
2025-04-28 GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field Zuxing Lu et.al. 2504.19409 null
2025-04-27 Rendering Anywhere You See: Renderability Field-guided Gaussian Splatting Xiaofeng Jin et.al. 2504.19261 null
2025-04-30 4DGS-CC: A Contextual Coding Framework for 4D Gaussian Splatting Data Compression Zicong Chen et.al. 2504.18925 null
2025-04-26 TransparentGS: Fast Inverse Rendering of Transparent Objects with Gaussians Letian Huang et.al. 2504.18768 null
2025-04-28 RGS-DR: Reflective Gaussian Surfels with Deferred Rendering for Shiny Objects Georgios Kouros et.al. 2504.18468 null
2025-04-25 STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting Yunze Deng et.al. 2504.18318 null
2025-04-25 PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models Michel Gokan Khan et.al. 2504.18165 link
2025-04-24 iVR-GS: Inverse Volume Rendering for Explorable Visualization via Editable 3D Gaussian Splatting Kaiyuan Tang et.al. 2504.17954 link
2025-04-23 Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning Mingxuan Cui et.al. 2504.17815 link
2025-04-24 CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos Shucheng Gong et.al. 2504.17728 link
2025-04-23 Gaussian Splatting is an Effective Data Generator for 3D Object Detection Farhad G. Zanjani et.al. 2504.16740 null
2025-04-23 PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation Wenxuan Li et.al. 2504.16693 null
2025-04-23 HUG: Hierarchical Urban Gaussian Splatting with Block-Based Reconstruction Zhongtao Wang et.al. 2504.16606 null
2025-04-23 ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration Andrea Conti et.al. 2504.16545 null
2025-04-21 StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians Cailin Zhuang et.al. 2504.15281 null
2025-04-21 Immersive Teleoperation Framework for Locomanipulation Tasks Takuya Boehringer et.al. 2504.15229 null
2025-04-21 MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video Minh-Quan Viet Bui et.al. 2504.15122 null
2025-04-20 IXGS-Intraoperative 3D Reconstruction from Sparse, Arbitrarily Posed Real X-rays Sascha Jecklin et.al. 2504.14699 null
2025-04-20 NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation Junyuan Fang et.al. 2504.14638 null
2025-04-20 VGNC: Reducing the Overfitting of Sparse-view 3DGS via Validation-guided Gaussian Number Control Lifeng Lin et.al. 2504.14548 null
2025-04-20 Metamon-GS: Enhancing Representability with Variance-Guided Densification and Light Encoding Junyan Su et.al. 2504.14460 null
2025-04-23 SEGA: Drivable 3D Gaussian Head Avatar from a Single Image Chen Guo et.al. 2504.14373 null
2025-04-21 SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM Samuel Cerezo et.al. 2504.13713 link
2025-04-18 Green Robotic Mixed Reality with Gaussian Splatting Chenxuan Liu et.al. 2504.13697 null
2025-04-18 EG-Gaussian: Epipolar Geometry and Graph Network Enhanced 3D Gaussian Splatting Beizhen Zhao et.al. 2504.13540 null
2025-04-17 Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering Landon Dyken et.al. 2504.13339 null
2025-04-17 Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation Sizhe Yang et.al. 2504.13175 null
2025-04-18 ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos Zetong Zhang et.al. 2504.13167 null
2025-04-17 Digital Twin Generation from Visual Data: A Survey Andrew Melnik et.al. 2504.13159 link
2025-04-17 Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint Graphs Shaohui Dai et.al. 2504.13153 link
2025-04-17 CompGS++: Compressed Gaussian Splatting for Static and Dynamic Scene Representation Xiangrui Liu et.al. 2504.13022 null
2025-04-17 GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration Rendong Zhang et.al. 2504.12999 link
2025-04-17 Second-order Optimization of Gaussian Splats with Importance Sampling Hamza Pehlivan et.al. 2504.12905 null
2025-04-17 AAA-Gaussians: Anti-Aliased and Artifact-Free 3D Gaussian Rendering Michael Steiner et.al. 2504.12811 null
2025-04-17 CAGE-GS: High-fidelity Cage Based 3D Gaussian Splatting Deformation Yifei Tong et.al. 2504.12800 null
2025-04-17 TSGS: Improving Gaussian Splatting for Transparent Surface Reconstruction via Normal and De-lighting Priors Mingwei Li et.al. 2504.12799 null
2025-04-16 CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting Wei Sun et.al. 2504.11893 null
2025-04-16 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians Zeming Wei et.al. 2504.11218 link
2025-04-15 Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation Andrea Simonelli et.al. 2504.11024 null
2025-04-15 3D Gabor Splatting: Reconstruction of High-frequency Surface Texture using Gabor Noise Haato Watanabe et.al. 2504.11003 null
2025-04-15 GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR Christophe Bolduc et.al. 2504.10809 null
2025-04-14 DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting Zeren Jiang et.al. 2504.10486 link
2025-04-15 LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis Hao Sun et.al. 2504.10331 null
2025-04-14 ESCT3D: Efficient and Selectively Controllable Text-Driven 3D Content Generation with Gaussian Splatting Huiqi Wu et.al. 2504.10316 null
2025-04-14 EBAD-Gaussian: Event-driven Bundle Adjusted Deblur Gaussian Splatting Yufei Deng et.al. 2504.10012 null
2025-04-16 GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting Junlin Hao et.al. 2504.10001 null
2025-04-14 MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling Yunpeng Tan et.al. 2504.09878 null
2025-04-13 TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting Zhicong Wu et.al. 2504.09588 null
2025-04-13 DropoutGS: Dropping Out Gaussians for Better Sparse-view Rendering Yexing Xu et.al. 2504.09491 null
2025-04-12 A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds Jizong Peng et.al. 2504.09129 null
2025-04-12 BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting Jeongwan On et.al. 2504.09097 null
2025-04-11 FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents Xin Tan et.al. 2504.08581 null
2025-04-11 Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation Bram Vanherle et.al. 2504.08473 link
2025-04-11 In-2-4D: Inbetweening from Two Single-View Images to 4D Generation Sauradip Nag et.al. 2504.08366 null
2025-04-10 ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting Junbang Liu et.al. 2504.08100 link
2025-04-10 InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians Kefan Chen et.al. 2504.07949 null
2025-04-10 View-Dependent Uncertainty Estimation of 3D Gaussian Splatting Chenyu Han et.al. 2504.07370 null
2025-04-09 Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting Daiwei Zhang et.al. 2504.06978 null
2025-04-09 IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments Can Zhang et.al. 2504.06827 null
2025-04-09 SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering Hanxiao Sun et.al. 2504.06815 link
2025-04-09 GSta: Efficient Training Scheme with Siestaed Gaussians for Monocular 3D Scene Reconstruction Anil Armagan et.al. 2504.06716 null
2025-04-09 Collision avoidance from monocular vision trained with novel view synthesis Valentin Tordjman–Levavasseur et.al. 2504.06651 null
2025-04-10 Stochastic Ray Tracing of 3D Transparent Gaussians Xin Sun et.al. 2504.06598 null
2025-04-08 Micro-splatting: Maximizing Isotropic Constraints for Refined Optimization in 3D Gaussian Splatting Jee Won Lee et.al. 2504.05740 null
2025-04-07 View-Dependent Deformation Fields for 2D Editing of 3D Models Martin El Mqirmi et.al. 2504.05544 null
2025-04-07 L3GS: Layered 3D Gaussian Splats for Efficient 3D Scene Delivery Yi-Zhen Tsai et.al. 2504.05517 link
2025-04-07 Let it Snow! Animating Static Gaussian Scenes With Dynamic Weather Effects Gal Fiebelman et.al. 2504.05296 null
2025-04-07 PanoDreamer: Consistent Text to 360-Degree Scene Generation Zhexiao Xiong et.al. 2504.05152 null
2025-04-07 3D Gaussian Particle Approximation of VDB Datasets: A Study for Scientific Visualization Isha Sharma et.al. 2504.04857 null
2025-04-07 Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM Zhicong Sun et.al. 2504.04844 link
2025-04-07 DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal Wanzhou Liu et.al. 2504.04679 null
2025-04-06 Tool-as-Interface: Learning Robot Policies from Human Tool Usage through Imitation Learning Haonan Chen et.al. 2504.04612 null
2025-04-06 Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models Etienne Chassaing et.al. 2504.04448 null
2025-04-05 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS Zhisheng Huang et.al. 2504.04294 null
2025-04-05 Interpretable Single-View 3D Gaussian Splatting using Unsupervised Hierarchical Disentangled Representation Learning Yuyang Zhang et.al. 2504.04190 null
2025-04-04 WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments Jianhao Zheng et.al. 2504.03886 null
2025-04-04 HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration Boyuan Wang et.al. 2504.03536 null
2025-04-03 Compressing 3D Gaussian Splatting by Noise-Substituted Vector Quantization Haishan Wang et.al. 2504.03059 link
2025-04-03 MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM Renwu Li et.al. 2504.02437 null
2025-04-03 ConsDreamer: Advancing Multi-View Consistency for Zero-Shot Text-to-3D Generation Yuan Zhou et.al. 2504.02316 link
2025-04-03 Digital-twin imaging based on descattering Gaussian splatting Suguru Shimomura et.al. 2504.02278 null
2025-04-02 UAVTwin: Neural Digital Twins for UAVs using Gaussian Splatting Jaehoon Choi et.al. 2504.02158 null
2025-04-02 WorldPrompter: Traversable Text-to-Scene Generation Zhaoyang Zhang et.al. 2504.02045 null
2025-04-02 Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis Niluthpol Chowdhury Mithun et.al. 2504.01960 null
2025-04-03 Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting Shu-Wei Lu et.al. 2504.01957 null
2025-04-02 BOGausS: Better Optimized Gaussian Splatting Stéphane Pateux et.al. 2504.01844 null
2025-04-02 FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking Ulas Gunes et.al. 2504.01732 null
2025-04-02 FlowR: Flowing from Sparse to Dense 3D Reconstructions Tobias Fischer et.al. 2504.01647 null
2025-04-02 3DBonsai: Structure-Aware Bonsai Modeling Using Conditioned 3D Gaussian Splatting Hao Wu et.al. 2504.01619 null
2025-04-02 RealityAvatar: Towards Realistic Loose Clothing Modeling in Animatable 3D Gaussian Avatars Yahui Li et.al. 2504.01559 null
2025-04-02 High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model Yiyang Shen et.al. 2504.01512 null
2025-04-02 Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment Ziteng Cui et.al. 2504.01503 link
2025-04-02 3D Gaussian Inverse Rendering with Approximated Global Illumination Zirui Wu et.al. 2504.01358 null
2025-03-31 Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views Chong Bao et.al. 2503.24382 null
2025-03-31 ERUPT: Efficient Rendering with Unposed Patch Transformer Maxim V. Shugaev et.al. 2503.24374 null
2025-03-31 StochasticSplats: Stochastic Rasterization for Sorting-Free 3D Gaussian Splatting Shakiba Kheradmand et.al. 2503.24366 null
2025-04-01 Visual Acoustic Fields Yuelei Li et.al. 2503.24270 null
2025-03-31 DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting Seungjun Lee et.al. 2503.24210 null
2025-03-31 Learning 3D-Gaussian Simulators from RGB Videos Mikel Zhobro et.al. 2503.24009 null
2025-03-31 ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image Tianyi Gong et.al. 2503.23881 null
2025-03-30 Gaussian Blending Unit: An Edge GPU Plug-in for Real-Time Gaussian-Based Rendering in AR/VR Zhifan Ye et.al. 2503.23625 null
2025-03-30 Enhancing 3D Gaussian Splatting Compression via Spatial Condition-based Prediction Jingui Ma et.al. 2503.23337 null
2025-03-30 ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning Zhenyang Liu et.al. 2503.23297 null
2025-03-28 TranSplat: Lighting-Consistent Cross-Scene Object Transfer with 3D Gaussian Splatting Boyang et.al. 2503.22676 null
2025-03-28 Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis Shuai Shen et.al. 2503.22605 null
2025-03-28 EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting Xu Wang et.al. 2503.22437 link
2025-03-28 AH-GS: Augmented 3D Gaussian Splatting for High-Frequency Detail Representation Chenyang Xu et.al. 2503.22324 null
2025-03-28 Follow Your Motion: A Generic Temporal Consistency Portrait Editing Framework with Trajectory Guidance Haijie Yang et.al. 2503.22225 null
2025-03-28 ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting Wenjie Liu et.al. 2503.22218 null
2025-03-28 Segment then Splat: A Unified Approach for 3D Open-Vocabulary Segmentation based on Gaussian Splatting Yiren Lu et.al. 2503.22204 null
2025-03-28 Disentangled 4D Gaussian Splatting: Towards Faster and More Efficient Dynamic Scene Rendering Hao Feng et.al. 2503.22159 null
2025-03-27 X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction Weihao Yu et.al. 2503.21779 null
2025-03-27 Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying Hairong Yin et.al. 2503.21767 null
2025-03-27 RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting Qiyu Dai et.al. 2503.21442 null
2025-03-28 LandMarkSystem Technical Report Zhenxiang Ma et.al. 2503.21364 link
2025-03-27 Frequency-Aware Gaussian Splatting Decomposition Yishai Lavi et.al. 2503.21226 null
2025-03-27 StyledStreets: Multi-style Street Simulator with Spatial and Temporal Consistency Yuyin Chen et.al. 2503.21104 null
2025-03-26 PGC: Physics-Based Gaussian Cloth from a Single Pose Michelle Guo et.al. 2503.20779 null
2025-03-28 Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields Shijie Zhou et.al. 2503.20776 null
2025-03-26 TC-GS: Tri-plane based compression for 3D Gaussian Splatting Taorui Wang et.al. 2503.20221 link
2025-03-26 EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis Sheng Miao et.al. 2503.20168 null
2025-03-25 Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields Navami Kairanda et.al. 2503.19976 null
2025-03-26 A Survey on Event-driven 3D Reconstruction: Development under Different Categories Chuanzhi Xu et.al. 2503.19753 null
2025-03-25 High-Quality Spatial Reconstruction and Orthoimage Generation Using Efficient 2D Gaussian Splatting Qian Wang et.al. 2503.19703 null
2025-03-25 GaussianUDF: Inferring Unsigned Distance Functions through 3D Gaussian Splatting Shujuan Li et.al. 2503.19458 null
2025-03-25 SparseGS-W: Sparse-View 3D Gaussian Splatting in the Wild with Generative Priors Yiqing Li et.al. 2503.19452 null
2025-03-26 COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting Jiaxin Zhang et.al. 2503.19443 link
2025-03-25 From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting Zhiwei Huang et.al. 2503.19358 null
2025-03-25 Divide-and-Conquer: Dual-Hierarchical Optimization for Semantic 4D Gaussian Spatting Zhiying Yan et.al. 2503.19332 null
2025-03-25 MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection Jee Won Lee et.al. 2503.19330 null
2025-03-25 HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting Xinpeng Liu et.al. 2503.19232 link
2025-03-24 NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting Yulong Zheng et.al. 2503.18794 null
2025-03-24 GS-Marker: Generalizable and Robust Watermarking for 3D Gaussian Splatting Lijiang Li et.al. 2503.18718 null
2025-03-24 Hardware-Rasterized Ray-Based Gaussian Splatting Samuel Rota Bulò et.al. 2503.18682 null
2025-03-24 LLGS: Unsupervised Gaussian Splatting for Image Enhancement and Reconstruction in Pure Dark Environment Haoran Wang et.al. 2503.18640 null
2025-03-25 StableGS: A Floater-Free Framework for 3D Gaussian Splatting Luchao Wang et.al. 2503.18458 null
2025-03-24 4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video Qiang Hu et.al. 2503.18421 null
2025-03-24 DashGaussian: Optimizing 3D Gaussian Splatting in 200 Seconds Youyu Chen et.al. 2503.18402 null
2025-03-24 GI-SLAM: Gaussian-Inertial SLAM Xulang Liu et.al. 2503.18275 null
2025-03-23 Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving Junhao Ge et.al. 2503.18108 link
2025-03-23 PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding Hongjia Zhai et.al. 2503.18107 null
2025-03-21 TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting Jianchuan Chen et.al. 2503.17032 null
2025-03-21 Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting Jinbo Yan et.al. 2503.16979 link
2025-03-21 DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery Jiadong Tang et.al. 2503.16964 null
2025-03-21 Optimized Minimal 3D Gaussian Splatting Joo Chan Lee et.al. 2503.16924 null
2025-03-20 SAGE: Semantic-Driven Adaptive Gaussian Splatting in Extended Reality Chiara Schiavo et.al. 2503.16747 null
2025-03-20 4D Gaussian Splatting SLAM Yanyan Li et.al. 2503.16710 null
2025-03-20 GauRast: Enhancing GPU Triangle Rasterizers to Accelerate 3D Gaussian Splatting Sixu Li et.al. 2503.16681 null
2025-03-20 1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering Yuheng Yuan et.al. 2503.16422 null
2025-03-20 M3: 3D-Spatial MultiModal Memory Xueyan Zou et.al. 2503.16413 link
2025-03-20 Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images Shengjun Zhang et.al. 2503.16338 null
2025-03-20 OccluGaussian: Occlusion-Aware Gaussian Splatting for Large Scene Reconstruction and Rendering Shiyong Liu et.al. 2503.16177 null
2025-03-20 Enhancing Close-up Novel View Synthesis via Pseudo-labeling Jiatong Xia et.al. 2503.15908 link
2025-03-20 VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling Hyojun Go et.al. 2503.15855 null
2025-03-20 BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting Yiren Lu et.al. 2503.15835 null
2025-03-18 HandSplat: Embedding-Driven Gaussian Splatting for High-Fidelity Hand Rendering Yilan Dong et.al. 2503.14736 null
2025-03-18 SplatVoxel: History-Aware Novel View Streaming without Temporal Training Yiming Wang et.al. 2503.14698 null
2025-03-18 Optimized 3D Gaussian Splatting using Coarse-to-Fine Image Frequency Modulation Umar Farooq et.al. 2503.14475 null
2025-03-18 Improving Adaptive Density Control for 3D Gaussian Splatting Glenn Grubert et.al. 2503.14274 link
2025-03-18 RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images Junjin Xiao et.al. 2503.14198 link
2025-03-18 Lightweight Gradient-Aware Upscaling of 3D Gaussian Splatting Images Simon Niedermayr et.al. 2503.14171 null
2025-03-18 Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting Runsong Zhu et.al. 2503.14029 link
2025-03-18 Light4GS: Lightweight Compact 4D Gaussian Splatting Generation via Context Model Mufan Liu et.al. 2503.13948 null
2025-03-17 Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors Katja Schwarz et.al. 2503.13272 null
2025-03-17 DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction Rui Wang et.al. 2503.13176 null
2025-03-17 Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization Yiwei Xu et.al. 2503.13086 null
2025-03-17 CAT-3DGS Pro: A New Benchmark for Efficient 3DGS Compression Yu-Ting Zhan et.al. 2503.12862 null
2025-03-17 CompMarkGS: Robust Watermarking for Compression 3D Gaussian Splatting Sumin In et.al. 2503.12836 null
2025-03-17 AV-Surf: Surface-Enhanced Geometry-Aware Novel-View Acoustic Synthesis Hadam Baek et.al. 2503.12806 null
2025-03-16 Deblur Gaussian Splatting SLAM Francesco Girlanda et.al. 2503.12572 null
2025-03-16 MTGS: Multi-Traversal Gaussian Splatting Tianyu Li et.al. 2503.12552 link
2025-03-16 SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs Guibiao Liao et.al. 2503.12535 null
2025-03-16 VRsketch2Gaussian: 3D VR Sketch Guided 3D Object Generation with Gaussian Splatting Songen Gu et.al. 2503.12383 null
2025-03-14 Advancing 3D Gaussian Splatting Editing with Complementary and Consensus Information Xuanqi Zhang et.al. 2503.11601 null
2025-03-14 EgoSplat: Open-Vocabulary Egocentric Scene Understanding with Language Embedded 3D Gaussian Splatting Di Li et.al. 2503.11345 null
2025-03-14 Uncertainty-Aware Normal-Guided Gaussian Splatting for Surface Reconstruction from Sparse Image Sequences Zhen Tan et.al. 2503.11172 null
2025-03-13 RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors Avinash Paliwal et.al. 2503.10860 link
2025-03-13 LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds Lingteng Qiu et.al. 2503.10625 link
2025-03-13 MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction Yingshuang Zou et.al. 2503.10604 null
2025-03-13 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Wanhua Li et.al. 2503.10437 link
2025-03-13 VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames Zhiqi Li et.al. 2503.10286 null
2025-03-13 ROODI: Reconstructing Occluded Objects with Denoising Inpainters Yeonjin Chang et.al. 2503.10256 null
2025-03-13 GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction Jianheng Liu et.al. 2503.10170 link
2025-03-13 3D Student Splatting and Scooping Jialin Zhu et.al. 2503.10148 link
2025-03-13 GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping Jinfeng Liu et.al. 2503.10143 null
2025-03-12 Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation Máté Tóth et.al. 2503.09464 null
2025-03-12 Online Language Splatting Saimouli Katragadda et.al. 2503.09447 null
2025-03-12 Close-up-GS: Enhancing Close-Up View Synthesis in 3D Gaussian Splatting with Progressive Self-Training Jiatong Xia et.al. 2503.09396 null
2025-03-12 GASPACHO: Gaussian Splatting for Controllable Humans and Objects Aymen Mir et.al. 2503.09342 null
2025-03-12 SDD-4DGS: Static-Dynamic Aware Decoupling in Gaussian Splatting for 4D Scene Reconstruction Dai Sun et.al. 2503.09332 null
2025-03-12 Motion Blender Gaussian Splatting for Dynamic Reconstruction Xinyu Zhang et.al. 2503.09040 link
2025-03-11 PCGS: Progressive Compression of 3D Gaussian Splatting Yihang Chen et.al. 2503.08511 link
2025-03-11 TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting Fengyi Zhang et.al. 2503.08485 null
2025-03-11 Mitigating Ambiguities in 3D Classification with Gaussian Splatting Ruiqi Zhang et.al. 2503.08352 null
2025-03-11 Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios Zikang Yuan et.al. 2503.08317 null
2025-03-11 ELECTRA: A Symmetry-breaking Cartesian Network for Charge Density Prediction with Floating Orbitals Jonas Elsborg et.al. 2503.08305 link
2025-03-11 HRAvatar: High-Quality and Relightable Gaussian Head Avatar Dongbin Zhang et.al. 2503.08224 null
2025-03-11 S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction Guangting Zheng et.al. 2503.08217 null
2025-03-11 Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming Jiaxuan Zhu et.al. 2503.08166 null
2025-03-11 ArticulatedGS: Self-supervised Digital Twin Modeling of Articulated Objects using 3D Gaussian Splatting Junfu Guo et.al. 2503.08135 null
2025-03-11 MVGSR: Multi-View Consistency Gaussian Splatting for Robust Surface Reconstruction Chenfeng Hou et.al. 2503.08093 null
2025-03-10 SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting Jiahui Zhang et.al. 2503.07476 null
2025-03-10 EigenGS Representation: From Eigenspace to Gaussian Image Space Lo-Wei Tai et.al. 2503.07446 null
2025-03-10 All That Glitters Is Not Gold: Key-Secured 3D Secrets within 3D Gaussian Splatting Yan Ren et.al. 2503.07191 link
2025-03-10 Multi-Modal 3D Mesh Reconstruction from Images and Text Melvin Reka et.al. 2503.07190 null
2025-03-10 Frequency-Aware Density Control via Reparameterization for High-Quality Rendering of 3D Gaussian Splatting Zhaojie Zeng et.al. 2503.07000 link
2025-03-10 DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation Xiaoliang Ju et.al. 2503.06900 null
2025-03-10 ActiveInitSplat: How Active Image Selection Helps Gaussian Splatting Konstantinos D. Polyzos et.al. 2503.06859 null
2025-03-09 Gaussian RBFNet: Gaussian Radial Basis Functions for Fast and Accurate Representation and Reconstruction of Neural Fields Abdelaziz Bouzidi et.al. 2503.06762 null
2025-03-09 CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving Rui Song et.al. 2503.06744 null
2025-03-09 D3DR: Lighting-Aware Object Insertion in Gaussian Splatting Vsevolod Skorokhodov et.al. 2503.06740 null
2025-03-07 D2GV: Deformable 2D Gaussian Splatting for Video Representation in 400FPS Mufan Liu et.al. 2503.05600 link
2025-03-07 Free Your Hands: Lightweight Relightable Turntable Capture Pipeline Jiahui Fan et.al. 2503.05511 null
2025-03-07 LiDAR-enhanced 3D Gaussian Splatting Mapping Jian Shen et.al. 2503.05425 null
2025-03-07 Self-Modeling Robots by Photographing Kejun Hu et.al. 2503.05398 null
2025-03-07 CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images Jungho Lee et.al. 2503.05332 link
2025-03-07 STGA: Selective-Training Gaussian Head Avatars Hanzhi Guo et.al. 2503.05196 null
2025-03-07 Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects Justin Yu et.al. 2503.05189 null
2025-03-07 MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions Qingyuan Zhou et.al. 2503.05182 null
2025-03-07 SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting Linqi Yang et.al. 2503.05174 null
2025-03-07 SeeLe: A Unified Acceleration Framework for Real-Time Gaussian Splatting Xiaotong Huang et.al. 2503.05168 null
2025-03-06 GaussianVideo: Efficient Video Representation and Compression by Gaussian Splatting Inseo Lee et.al. 2503.04333 null
2025-03-06 S2Gaussian: Sparse-View Super-Resolution 3D Gaussian Splatting Yecong Wan et.al. 2503.04314 null
2025-03-06 Instrument-Splatting: Controllable Photorealistic Reconstruction of Surgical Instruments Using Gaussian Splatting Shuojue Yang et.al. 2503.04082 null
2025-03-06 Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details Yifei Gao et.al. 2503.04037 null
2025-03-06 GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding Xihan Wang et.al. 2503.04034 null
2025-03-06 GRaD-Nav: Efficiently Learning Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics Qianzhong Chen et.al. 2503.03984 null
2025-03-05 LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation Qian Feng et.al. 2503.03890 null
2025-03-05 NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics Kun Yang et.al. 2503.03115 null
2025-03-04 2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting Qipeng Yan et.al. 2503.02452 null
2025-03-04 DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting Haoyuan Li et.al. 2503.02223 link
2025-03-03 Morpheus: Text-Driven 3D Gaussian Splat Shape and Color Stylization Jamie Wynn et.al. 2503.02009 null
2025-03-03 Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models Jay Zhangjie Wu et.al. 2503.01774 null
2025-03-03 OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding Dianyi Yang et.al. 2503.01646 null
2025-03-03 LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training Kaimin Liao et.al. 2503.01199 link
2025-03-03 FGS-SLAM: Fourier-based Gaussian Splatting for Real-time SLAM with Sparse and Dense Map Fusion Yansong Xu et.al. 2503.01109 null
2025-03-02 Evolving High-Quality Rendering and Reconstruction in a Unified Framework with Contribution-Adaptive Regularization You Shen et.al. 2503.00881 null
2025-03-02 Vid2Fluid: 3D Dynamic Fluid Assets from Single-View Videos with Generative Gaussian Splatting Zhiwei Zhao et.al. 2503.00868 null
2025-03-02 PSRGS: Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery BoCheng Li et.al. 2503.00848 null
2025-03-03 FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering Jingqiu Zhou et.al. 2502.21093 null
2025-02-28 EndoPBR: Material and Lighting Estimation for Photorealistic Surgical Simulations via Physically-based Rendering John J. Han et.al. 2502.20669 null
2025-02-27 ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting Dexter Ong et.al. 2502.20386 null
2025-02-27 Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling Hanyang Kong et.al. 2502.20378 null
2025-02-27 No Parameters, No Problem: 3D Gaussian Splatting without Camera Intrinsics and Extrinsics Dongbo Shi et.al. 2502.19800 null
2025-02-27 Open-Vocabulary Semantic Part Segmentation of 3D Human Keito Suzuki et.al. 2502.19782 null
2025-02-26 Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting Yu Liu et.al. 2502.19459 link
2025-02-26 Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions Muhammad Salman Ali et.al. 2502.19457 null
2025-02-26 Does 3D Gaussian Splatting Need Accurate Volumetric Rendering? Adam Celarek et.al. 2502.19318 link
2025-02-28 OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation Yunpeng Gao et.al. 2502.18041 null
2025-02-27 UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting Haoyuan Li et.al. 2502.17860 null
2025-02-24 Laplace-Beltrami Operator for Gaussian Splatting Hongyu Zhou et.al. 2502.17531 null
2025-02-24 Graph-Guided Scene Reconstruction from Images with 3D Gaussian Splatting Chong Cheng et.al. 2502.17377 null
2025-02-25 GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow Simon Boeder et.al. 2502.17288 null
2025-02-24 VR-Pipe: Streamlining Hardware Graphics Pipeline for Volume Rendering Junseo Lee et.al. 2502.17078 null
2025-02-23 GS-TransUNet: Integrated 2D Gaussian Splatting and Transformer UNet for Accurate Skin Lesion Analysis Anand Kumar et.al. 2502.16748 null
2025-02-23 Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration Kim Jun-Seong et.al. 2502.16652 null
2025-02-23 Dragen3D: Multiview Geometry Consistent 3D Gaussian Generation with Drag-Based Control Jinbo Yan et.al. 2502.16475 null
2025-02-21 RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes Sicheng Yu et.al. 2502.15633 null
2025-02-24 DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation Luzhou Ge et.al. 2502.15309 link
2025-02-20 GS-Cache: A GS-Cache Inference Framework for Large-scale Gaussian Splatting Models Miao Tao et.al. 2502.14938 null
2025-02-20 Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting Boying Li et.al. 2502.14931 null
2025-02-20 CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting Qilin Zhang et.al. 2502.14684 link
2025-02-20 OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving Yedong Shen et.al. 2502.14235 null
2025-02-19 GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian Bang Du et.al. 2502.14129 null
2025-02-19 Inter3D: A Benchmark and Strong Baseline for Human-Interactive 3D Object Reconstruction Gan Chen et.al. 2502.14004 link
2025-02-19 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments Vincent Ress et.al. 2502.13803 null
2025-02-18 GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis Pedro Martin et.al. 2502.13196 null
2025-02-18 RadSplatter: Extending 3D Gaussian Splatting to Radio Frequencies for Wireless Radiomap Extrapolation Yiheng Wang et.al. 2502.12686 null
2025-02-17 PUGS: Zero-shot Physical Understanding with Gaussian Splatting Yinghao Shuai et.al. 2502.12231 link
2025-02-17 3D Gaussian Inpainting with Depth-Guided Cross-View Consistency Sheng-Yu Huang et.al. 2502.11801 null
2025-02-17 Exploring the Versal AI Engine for 3D Gaussian Splatting Kotaro Shimamura et.al. 2502.11782 null
2025-02-17 GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text Gyumin Shim et.al. 2502.11642 null
2025-02-16 OMG: Opacity Matters in Material Modeling with Gaussian Splatting Silong Yong et.al. 2502.10988 null
2025-02-16 GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting Zelin Zhou et.al. 2502.10975 null
2025-02-15 E-3DGS: Event-Based Novel View Rendering of Large-Scale Scenes Using 3D Gaussian Splatting Sohaib Zahid et.al. 2502.10827 null
2025-02-13 X-SG$^2$S: Safe and Generalizable Gaussian Splatting with X-dimensional Watermarks Zihang Cheng et.al. 2502.10475 null
2025-02-13 Self-Calibrating Gaussian Splatting for Large Field of View Reconstruction Youming Deng et.al. 2502.09563 null
2025-02-13 DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior Mingrui Li et.al. 2502.09111 null
2025-02-13 Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting Lingting Zhu et.al. 2502.09039 link
2025-02-12 Interactive Holographic Visualization for 3D Facial Avatar Tri Tung Nguyen Nguyen et.al. 2502.08085 null
2025-02-11 TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation Jeongyun Kim et.al. 2502.07840 link
2025-02-11 MeshSplats: Mesh-Based Rendering with Gaussian Splatting Initialization Rafał Tobiasz et.al. 2502.07754 link
2025-02-11 Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors Lin-Zhuo Chen et.al. 2502.07615 null
2025-02-10 Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC Siwei Meng et.al. 2502.07007 null
2025-02-10 SIREN: Semantic, Initialization-Free Registration of Multi-Robot Gaussian Splatting Maps Ola Shorinwa et.al. 2502.06519 null
2025-02-10 Three-Dimensional MRI Reconstruction with Gaussian Representations: Tackling the Undersampling Problem Tengya Peng et.al. 2502.06510 null
2025-02-11 Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform Kyle Gao et.al. 2502.05769 null
2025-02-09 PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map Yue Pan et.al. 2502.05752 link
2025-02-08 Vision-in-the-loop Simulation for Deep Monocular Pose Estimation of UAV in Ocean Environment Maneesha Wickramasuriya et.al. 2502.05409 null
2025-02-07 AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting Chung-Ho Wu et.al. 2502.05176 null
2025-02-07 GaussRender: Learning 3D Occupancy with Gaussian Rendering Loick Chambon et.al. 2502.05040 link
2025-02-07 OccGS: Zero-shot 3D Occupancy Reconstruction with Semantic and Geometric-Aware Gaussian Splatting Xiaoyu Zhou et.al. 2502.04981 null
2025-02-07 PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression Feifei Li et.al. 2502.04843 null
2025-02-07 SC-OmniGS: Self-Calibrating Omnidirectional Gaussian Splatting Huajian Huang et.al. 2502.04734 null
2025-02-07 High-Speed Dynamic 3D Imaging with Sensor Fusion Splatting Zihao Zou et.al. 2502.04630 null
2025-02-05 GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM Mingrui Li et.al. 2502.03228 null
2025-02-05 GP-GS: Gaussian Processes for Enhanced Gaussian Splatting Zhihao Guo et.al. 2502.02283 link
2025-02-04 LAYOUTDREAMER: Physics-guided Layout for Text-to-3D Compositional Scene Generation Yang Zhou et.al. 2502.01949 null
2025-02-03 UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping Aashish Rai et.al. 2502.01846 null
2025-02-03 Scalable 3D Gaussian Splatting-Based RF Signal Spatial Propagation Modeling Kang Yang et.al. 2502.01826 null
2025-02-03 VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion Shaoting Zhu et.al. 2502.01536 null
2025-02-03 Radiant Foam: Real-Time Differentiable Ray Tracing Shrisudhan Govindarajan et.al. 2502.01157 null
2025-02-02 EmoTalkingGaussian: Continuous Emotion-conditioned Talking Head Synthesis Junuk Cha et.al. 2502.00654 null
2025-01-31 Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation Rohan Chacko et.al. 2502.00173 null
2025-01-31 Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping Yiming Huang et.al. 2501.19319 link
2025-01-31 RaySplats: Ray Tracing based Gaussian Splatting Krzysztof Byrski et.al. 2501.19196 link
2025-01-31 JGHand: Joint-Driven Animatable Hand Avater via 3D Gaussian Splatting Zhoutao Sun et.al. 2501.19088 null
2025-01-30 Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting Yansong Qu et.al. 2501.18672 null
2025-01-29 3D Reconstruction of Shoes for Augmented Reality Pratik Shrestha et.al. 2501.18643 null
2025-01-31 VoD-3DGS: View-opacity-Dependent 3D Gaussian Splatting Mateusz Nowak et.al. 2501.17978 null
2025-01-29 CrowdSplat: Exploring Gaussian Splatting For Crowd Rendering Xiaohan Sun et.al. 2501.17792 link
2025-01-29 FeatureGS: Eigenvalue-Feature Optimization in 3D Gaussian Splatting for Geometrically Accurate and Artifact-Reduced Reconstruction Miriam Jäger et.al. 2501.17655 null
2025-01-28 Evaluating CrowdSplat: Perceived Level of Detail for Gaussian Crowds Xiaohan Sun et.al. 2501.17085 null
2025-01-28 DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Chenguo Lin et.al. 2501.16764 null
2025-01-26 GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting Jiajun Dong et.al. 2501.15619 link
2025-01-25 Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos Zhen-Hui Dong et.al. 2501.15096 null
2025-01-25 HuGDiffusion: Generalizable Single-Image Human Rendering via 3D Gaussian Diffusion Yingzhi Tang et.al. 2501.15008 null
2025-01-24 Trick-GS: A Balanced Bag of Tricks for Efficient Gaussian Splatting Anil Armagan et.al. 2501.14534 null
2025-01-24 Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video Xiaohao Xu et.al. 2501.14319 link
2025-01-24 Dense-SfM: Structure from Motion with Dense Consistent Matching JongMin Lee et.al. 2501.14277 null
2025-01-24 Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images Yihui Li et.al. 2501.14231 null
2025-01-24 HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting Javier Yu et.al. 2501.14147 null
2025-01-23 GoDe: Gaussians on Demand for Progressive Level of Detail and Scalable Compression Francesco Di Sario et.al. 2501.13558 null
2025-01-23 MultiDreamer3D: Multi-concept 3D Customization with Concept-Aware Diffusion Guidance Wooseok Song et.al. 2501.13449 null
2025-01-23 GeomGS: LiDAR-Guided Geometry-Aware Gaussian Splatting for Robot Localization Jaewon Lee et.al. 2501.13417 null
2025-01-23 VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM Gyuhyeon Pak et.al. 2501.13402 null
2025-01-23 Deblur-Avatar: Animatable Avatars from Motion-Blurred Monocular Videos Xianrui Luo et.al. 2501.13335 null
2025-01-22 Sketch and Patch: Efficient 3D Gaussian Representation for Man-Made Scenes Yuang Shi et.al. 2501.13045 null
2025-01-21 DARB-Splatting: Generalizing Splatting with Decaying Anisotropic Radial Basis Functions Vishagar Arunan et.al. 2501.12369 null
2025-01-22 HAC++: Towards 100X Compression of 3D Gaussian Splatting Yihang Chen et.al. 2501.12255 link
2025-01-22 GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting Longan Wang et.al. 2501.12060 null
2025-01-20 See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic Regularization Zongqi He et.al. 2501.11508 null
2025-01-19 RDG-GS: Relative Depth Guidance with Gaussian Splatting for Real-time Sparse-View 3D Rendering Chenlu Zhan et.al. 2501.11102 null
2025-01-18 Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting Jiaqi Lin et.al. 2501.10788 null
2025-01-15 BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation Xiaolu Hou et.al. 2501.10462 link
2025-01-20 GSTAR: Gaussian Surface Tracking and Reconstruction Chengwei Zheng et.al. 2501.10283 null
2025-01-16 Creating Virtual Environments with 3D Gaussian Splatting: A Comparative Study Shi Qiu et.al. 2501.09302 null
2025-01-15 CityLoc: 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation Qi Ma et.al. 2501.08982 null
2025-01-15 GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping Sheng Hong et.al. 2501.08672 null
2025-01-14 3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering Meenakshi Krishnan et.al. 2501.08370 null
2025-01-14 VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes Ke Wu et.al. 2501.08286 null
2025-01-14 Object-Centric 2D Gaussian Splatting: Background Removal and Occlusion-Aware Pruning for Compact Object Models Marcel Rogge et.al. 2501.08174 null
2025-01-13 Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes Yuhang Zhang et.al. 2501.08072 null
2025-01-13 UnCommon Objects in 3D Xingchen Liu et.al. 2501.07574 link
2025-01-13 3DGS-to-PC: Convert a 3D Gaussian Splatting Scene into a Dense Point Cloud or Mesh Lewis A G Stuart et.al. 2501.07478 link
2025-01-13 RMAvatar: Photorealistic Human Avatar Reconstruction from Monocular Video Based on Rectified Mesh-embedded Gaussians Sen Peng et.al. 2501.07104 null
2025-01-14 SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting Yue Hu et.al. 2501.07015 null
2025-01-12 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927 link
2025-01-12 Synthetic Prior for Few-Shot Drivable Head Avatar Inversion Wojciech Zielonka et.al. 2501.06903 null
2025-01-12 ActiveGAMER: Active GAussian Mapping through Efficient Rendering Liyan Chen et.al. 2501.06897 null
2025-01-12 Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution Du Chen et.al. 2501.06838 link
2025-01-12 F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Consistent Gaussian Splatting Yuxin Wang et.al. 2501.06714 null
2025-01-11 MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis Hengyuan Zhang et.al. 2501.06660 null
2025-01-10 Locality-aware Gaussian Compression for Fast and High-quality Rendering Seungjoo Shin et.al. 2501.05757 null
2025-01-09 Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation Xuyi Meng et.al. 2501.05427 null
2025-01-09 Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance Dimitrios Gerogiannis et.al. 2501.05379 null
2025-01-09 Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping Wen Tianci et.al. 2501.05242 null
2025-01-08 GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting Andrew Bond et.al. 2501.04782 null
2025-01-08 FatesGS: Fast and Accurate Sparse-View Surface Reconstruction using Gaussian Splatting with Depth-Feature Consistency Han Huang et.al. 2501.04628 null
2025-01-07 ZDySS – Zero-Shot Dynamic Scene Stylization using Gaussian Splatting Abhishek Saroha et.al. 2501.03875 null
2025-01-07 MoDec-GS: Global-to-Local Motion Decomposition and Temporal Interval Adjustment for Compact Dynamic 3D Gaussian Splatting Sangwoon Kwak et.al. 2501.03714 null
2025-01-07 DehazeGS: Seeing Through Fog with 3D Gaussian Splatting Jinze Yu et.al. 2501.03659 null
2025-01-07 ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting Yifeng Yang et.al. 2501.03605 link
2025-01-06 Compression of 3D Gaussian Splatting with Optimized Feature Planes and Standard Video Codecs Soonbin Lee et.al. 2501.03399 null
2025-01-06 Gaussian Masked Autoencoders Jathushan Rajasegaran et.al. 2501.03229 null
2025-01-06 HOGSA: Bimanual Hand-Object Interaction Understanding with 3D Gaussian Splatting Based Data Augmentation Wentian Qu et.al. 2501.02845 null
2025-01-05 GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking Weikang Bian et.al. 2501.02690 null
2025-01-03 EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation Siyuan Huang et.al. 2501.01895 null
2025-01-03 Cloth-Splatting: 3D Cloth State Estimation from RGB Supervision Alberta Longhini et.al. 2501.01715 null
2025-01-03 CrossView-GS: Cross-view Gaussian Splatting For Large-scale Scene Reconstruction Chenhao Zhang et.al. 2501.01695 null
2025-01-03 PG-SAG: Parallel Gaussian Splatting for Fine-Grained Large-Scale Urban Buildings Reconstruction via Semantic-Aware Grouping Tengfei Wang et.al. 2501.01677 link
2025-01-02 Deformable Gaussian Splatting for Efficient and High-Fidelity Reconstruction of Surgical Scenes Jiwei Shan et.al. 2501.01101 null
2025-01-02 EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy Ao Gao et.al. 2501.01003 null
2024-12-31 Gaussian Building Mesh (GBM): Extract a Building’s 3D Mesh with Google Earth and Gaussian Splatting Kyle Gao et.al. 2501.00625 null
2024-12-31 DreamDrive: Generative 4D Scene Modeling from Street View Images Jiageng Mao et.al. 2501.00601 null
2024-12-31 PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM Runnan Chen et.al. 2501.00352 null
2024-12-31 SG-Splatting: Accelerating 3D Gaussian Splatting with Spherical Gaussians Yiwen Wang et.al. 2501.00342 null
2024-12-30 PERSE: Personalized 3D Generative Avatars from A Single Portrait Hyunsoo Cha et.al. 2412.21206 null
2024-12-30 KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences Keng-Wei Chang et.al. 2412.20767 null
2024-12-30 4D Gaussian Splatting: Modeling Dynamic Scenes with Native 4D Primitives Zeyu Yang et.al. 2412.20720 null
2024-12-29 MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks Yifei Liu et.al. 2412.20522 link
2024-12-28 DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis Kaijun Deng et.al. 2412.20148 link
2024-12-28 GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting Atticus J. Zeller et.al. 2412.20056 link
2024-12-27 DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction Kai Xu et.al. 2412.19584 null
2024-12-27 Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images Xudong Cai et.al. 2412.19518 null
2024-12-27 Learning Radiance Fields from a Single Snapshot Compressive Image Yunhao Li et.al. 2412.19483 null
2024-12-26 BeSplat – Gaussian Splatting from a Single Blurry Image and Event Stream Gopi Raju Matta et.al. 2412.19370 link
2024-12-26 Reflective Gaussian Splatting Yuxuan Yao et.al. 2412.19282 null
2024-12-26 Generating Editable Head Avatars with 3D Gaussian GANs Guohao Li et.al. 2412.19149 link
2024-12-26 CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting Siyu Jiao et.al. 2412.19142 null
2024-12-26 MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo Byeonggwon Lee et.al. 2412.19130 null
2024-12-25 WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian Splatting Chenghao Qian et.al. 2412.18862 link
2024-12-25 GSAVS: Gaussian Splatting-based Autonomous Vehicle Simulator Rami Wilson et.al. 2412.18816 null
2024-12-24 Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation Anselm Krainovic et.al. 2412.18584 null
2024-12-24 RSGaussian: 3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis Yiling Yao et.al. 2412.18380 null
2024-12-23 FaceLift: Single Image to 3D Head with View Generation and GS-LRM Weijie Lyu et.al. 2412.17812 null
2024-12-23 ActiveGS: Active Scene Reconstruction using Gaussian Splatting Liren Jin et.al. 2412.17769 link
2024-12-23 GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance Jingqiu Zhou et.al. 2412.17715 null
2024-12-24 LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding Hao Li et.al. 2412.17635 null
2024-12-23 CoSurfGS: Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction Yuanyuan Gao et.al. 2412.17612 null
2024-12-23 Exploring Dynamic Novel View Synthesis Technologies for Cinematography Adrian Azzarelli et.al. 2412.17532 null
2024-12-23 Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling Hao Gui et.al. 2412.17378 null
2024-12-22 GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs Xingrui Wang et.al. 2412.16932 link
2024-12-22 GeoTexDensifier: Geometry-Texture-Aware Densification for High-Quality Photorealistic 3D Gaussian Splatting Hanqing Jiang et.al. 2412.16809 null
2024-12-21 Topology-Aware 3D Gaussian Splatting: Leveraging Persistent Homology for Optimized Structural Integrity Tianqi Shen et.al. 2412.16619 link
2024-12-20 CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images Jungho Lee et.al. 2412.16028 null
2024-12-20 IRGS: Inter-Reflective Gaussian Splatting with 2D Gaussian Ray Tracing Chun Gu et.al. 2412.15867 null
2024-12-20 AvatarPerfect: User-Assisted 3D Gaussian Splatting Avatar Refinement with Automatic Pose Suggestion Jotaro Sakamiya et.al. 2412.15609 null
2024-12-20 EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene Yixiong Huo et.al. 2412.15550 link
2024-12-19 LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction Pou-Chun Kung et.al. 2412.15447 null
2024-12-19 SolidGS: Consolidating Gaussian Surfel Splatting for Sparse-View Surface Reconstruction Zhuowen Shen et.al. 2412.15400 null
2024-12-19 SqueezeMe: Efficient Gaussian Avatars for VR Shunsuke Saito et.al. 2412.15171 null
2024-12-19 Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination Leonardo Barcellona et.al. 2412.14957 null
2024-12-19 GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting Qianpu Sun et.al. 2412.14579 null
2024-12-19 Improving Geometry in Sparse-View 3DGS via Reprojection-based DoF Separation Yongsung Kim et.al. 2412.14568 null
2024-12-18 GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians Xiaobao Wei et.al. 2412.13983 link
2024-12-18 GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting Yuning Peng et.al. 2412.13654 null
2024-12-18 4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching Fernando Amodeo et.al. 2412.13639 link
2024-12-18 Turbo-GS: Accelerating 3D Gaussian Fitting for High-Quality Radiance Fields Tao Lu et.al. 2412.13547 null
2024-12-18 Vivar: A Generative AR System for Intuitive Multi-Modal Sensor Data Presentation Yunqi Guo et.al. 2412.13509 null
2024-12-17 Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures Guoxing Sun et.al. 2412.13183 null
2024-12-17 EOGS: Gaussian Splatting for Earth Observation Luca Savant Aira et.al. 2412.13047 null
2024-12-17 4DRGS: 4D Radiative Gaussian Splatting for Efficient 3D Vessel Reconstruction from Sparse-View Dynamic DSA Images Zhentao Liu et.al. 2412.12919 link
2024-12-17 CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image Wonseok Roh et.al. 2412.12906 null
2024-12-17 HyperGS: Hyperspectral 3D Gaussian Splatting Christopher Thirgood et.al. 2412.12849 null
2024-12-17 Gaussian Billboards: Expressive 2D Gaussian Splatting with Textures Sebastian Weiss et.al. 2412.12734 null
2024-12-17 3DGUT: Enabling Distorted Cameras and Secondary Rays in Gaussian Splatting Qi Wu et.al. 2412.12507 link
2024-12-16 PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting Cheng Zhang et.al. 2412.12096 link
2024-12-16 Wonderland: Navigating 3D Scenes from a Single Image Hanwen Liang et.al. 2412.12091 null
2024-12-16 GS-ProCams: Gaussian Splatting-based Projector-Camera Systems Qingyue Deng et.al. 2412.11762 null
2024-12-16 Deformable Radial Kernel Splatting Yi-Hua Huang et.al. 2412.11752 null
2024-12-16 SweepEvGS: Event-Based 3D Gaussian Splatting for Macro and Micro Radiance Field Rendering from a Single Sweep Jingqian Wu et.al. 2412.11579 null
2024-12-16 EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting Dong In Lee et.al. 2412.11520 null
2024-12-14 DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting Luis Wiedmann et.al. 2412.10972 link
2024-12-13 SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians Siyun Liang et.al. 2412.10231 null
2024-12-13 GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion Jiapeng Tang et.al. 2412.10209 null
2024-12-13 TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views Liang Zhao et.al. 2412.10051 link
2024-12-13 SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video Jongmin Park et.al. 2412.09982 null
2024-12-13 RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting Lizhi Bai et.al. 2412.09868 null
2024-12-12 MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction Xiaohao Xu et.al. 2412.09723 link
2024-12-12 PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields Sean Wu et.al. 2412.09680 link
2024-12-12 Feat2GS: Probing Visual Foundation Models with Gaussian Splatting Yue Chen et.al. 2412.09606 null
2024-12-12 LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors Yabo Chen et.al. 2412.09597 null
2024-12-12 FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction Jiale Xu et.al. 2412.09573 null
2024-12-12 GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency Dongyue Lu et.al. 2412.09511 link
2024-12-12 LIVE-GS: LLM Powers Interactive VR by Enhancing Gaussian Splatting Haotian Mao et.al. 2412.09176 null
2024-12-11 SLGaussian: Fast Language Gaussian Splatting in Sparse Views Kangjie Chen et.al. 2412.08331 null
2024-12-11 ProGDF: Progressive Gaussian Differential Field for Controllable and Flexible 3D Editing Yian Zhao et.al. 2412.08152 null
2024-12-10 Diffusion-Based Attention Warping for Consistent 3D Scene Editing Eyal Gomel et.al. 2412.07984 null
2024-12-10 GASP: Gaussian Avatars with Synthetic Priors Jack Saunders et.al. 2412.07739 null
2024-12-10 Proc-GS: Procedural Building Generation for City Assembly with 3D Gaussians Yixuan Li et.al. 2412.07660 null
2024-12-10 Faster and Better 3D Splatting via Group Training Chengbo Wang et.al. 2412.07608 null
2024-12-10 ResGS: Residual Densification of 3D Gaussian for Efficient Detail Recovery Yanzhe Lyu et.al. 2412.07494 null
2024-12-10 EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering Toshiya Yura et.al. 2412.07293 null
2024-12-09 MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds Zhenggang Tang et.al. 2412.06974 null
2024-12-09 Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video Renlong Wu et.al. 2412.06424 link
2024-12-09 4D Gaussian Splatting with Scale-aware Residual Field and Adaptive Optimization for Real-time Rendering of Temporally Complex Dynamic Scenes Jinbo Yan et.al. 2412.06299 null
2024-12-09 Advancing Extended Reality with 3D Gaussian Splatting: Innovations and Prospects Shi Qiu et.al. 2412.06257 null
2024-12-09 Splatter-360: Generalizable 360° Gaussian Splatting for Wide-baseline Panoramic Images Zheng Chen et.al. 2412.06250 link
2024-12-09 Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction Seungtae Nam et.al. 2412.06234 null
2024-12-08 Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation Zipeng Qi et.al. 2412.05969 null
2024-12-08 GBR: Generative Bundle Refinement for High-fidelity Gaussian Splatting and Meshing Jianing Zhang et.al. 2412.05908 null
2024-12-07 Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes Saqib Javed et.al. 2412.05700 null
2024-12-07 WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking Yuqi Tan et.al. 2412.05695 null
2024-12-07 Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View Synthesis Diwen Wan et.al. 2412.05570 null
2024-12-06 Extrapolated Urban View Synthesis Benchmark Xiangyu Han et.al. 2412.05256 link
2024-12-06 MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting Peng Chen et.al. 2412.04955 link
2024-12-06 Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction Jixuan Fan et.al. 2412.04887 link
2024-12-06 WRF-GS: Wireless Radiation Field Reconstruction with 3D Gaussian Splatting Chaozheng Wen et.al. 2412.04832 link
2024-12-06 Pushing Rendering Boundaries: Hard Gaussian Splatting Qingshan Xu et.al. 2412.04826 null
2024-12-05 Turbo3D: Ultra-fast Text-to-3D Generation Hanzhe Hu et.al. 2412.04470 null
2024-12-05 QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos Sharath Girish et.al. 2412.04469 null
2024-12-05 Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering Cheng Sun et.al. 2412.04459 link
2024-12-05 Monocular Dynamic Gaussian Splatting is Fast and Brittle but Smooth Motion Helps Yiqing Liang et.al. 2412.04457 null
2024-12-05 PBDyG: Position Based Dynamic Gaussians for Motion-Aware Clothed Human Avatars Shota Sasaki et.al. 2412.04433 null
2024-12-05 Multi-View Pose-Agnostic Change Localization with Zero Labels Chamuditha Jayanga Galappaththige et.al. 2412.03911 link
2024-12-05 DGNS: Deformable Gaussian Splatting and Dynamic Neural Surface for Monocular Dynamic 3D Reconstruction Xuesong Li et.al. 2412.03910 link
2024-12-05 HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting Jingyu Lin et.al. 2412.03844 link
2024-12-04 Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos Hanxue Liang et.al. 2412.03526 null
2024-12-04 Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter Hermes McGriff et.al. 2412.03518 null
2024-12-04 Urban4D: Semantic-Guided 4D Gaussian Splatting for Urban Scene Reconstruction Ziwen Li et.al. 2412.03473 null
2024-12-04 2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction Wanting Zhang et.al. 2412.03428 null
2024-12-04 Volumetrically Consistent 3D Gaussian Rasterization Chinmay Talegaonkar et.al. 2412.03378 link
2024-12-04 SGSST: Scaling Gaussian Splatting StyleTransfer Bruno Galerne et.al. 2412.03371 link
2024-12-04 NeRF and Gaussian Splatting SLAM in the Wild Fabian Schmidt et.al. 2412.03263 link
2024-12-04 Splats in Splats: Embedding Invisible 3D Watermark within Gaussian Splatting Yijia Guo et.al. 2412.03121 null
2024-12-04 RoDyGS: Robust Dynamic Gaussian Splatting for Casual Videos Yoonwoo Jeong et.al. 2412.03077 null
2024-12-03 Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3D Objects Abdurrahman Zeybey et.al. 2412.02803 null
2024-12-03 AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction Lingteng Qiu et.al. 2412.02684 null
2024-12-03 RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians Qiankun Gao et.al. 2412.02493 link
2024-12-03 TimeWalker: Personalized Neural Space for Lifelong Head Avatars Dongwei Pan et.al. 2412.02421 null
2024-12-03 GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos Zhiyuan Chen et.al. 2412.02267 null
2024-12-03 Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance Jing Zeng et.al. 2412.02249 null
2024-12-03 SparseLGS: Sparse View Language Embedded Gaussian Splatting Jun Hu et.al. 2412.02245 null
2024-12-03 How to Use Diffusion Priors under Sparse Views? Qisen Wang et.al. 2412.02225 link
2024-12-03 SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from Sparse Multi-View RGB Images Junqiu Yu et.al. 2412.02140 null
2024-12-03 Gaussian Object Carver: Object-Compositional Gaussian Splatting with surfaces completion Liu Liu et.al. 2412.02075 link
2024-12-02 Planar Gaussian Splatting Farhad G. Zanjani et.al. 2412.01931 null
2024-12-02 GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting Zixuan Chen et.al. 2411.19895 link
2024-11-29 DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering Yihao Wang et.al. 2411.19756 null
2024-11-29 TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting Bojun Xiong et.al. 2411.19654 link
2024-11-29 Tortho-Gaussian: Splatting True Digital Orthophoto Maps Xin Wang et.al. 2411.19594 null
2024-11-29 Gaussian Splashing: Direct Volumetric Rendering Underwater Nir Mualem et.al. 2411.19588 null
2024-11-29 Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Wenbo Zhang et.al. 2411.19551 link
2024-12-02 GausSurf: Geometry-Guided 3D Gaussian Splatting for Surface Reconstruction Jiepeng Wang et.al. 2411.19454 null
2024-11-29 RF-3DGS: Wireless Channel Modeling with Radio Radiance Field and 3D Gaussian Splatting Lihao Zhang et.al. 2411.19420 link
2024-11-28 SADG: Segment Any Dynamic Gaussian Without Object Trackers Yun-Jin Li et.al. 2411.19290 link
2024-11-28 AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones Xuqian Ren et.al. 2411.19271 null
2024-11-27 Textured Gaussians for Enhanced 3D Scene Appearance Modeling Brian Chao et.al. 2411.18625 null
2024-11-27 PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image Han Yan et.al. 2411.18548 null
2024-11-27 HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression Lei Liu et.al. 2411.18473 null
2024-11-27 Neural Surface Priors for Editable Gaussian Splatting Jakub Szymkowiak et.al. 2411.18311 link
2024-11-27 Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters Zhiyang Guo et.al. 2411.18197 null
2024-11-27 SmileSplat: Generalizable Gaussian Splats for Unconstrained Sparse Images Yanyan Li et.al. 2411.18072 null
2024-11-27 GLS: Geometry-aware 3D Language Gaussian Splatting Jiaxiong Qiu et.al. 2411.18066 link
2024-11-27 HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction Wei Zhang et.al. 2411.17982 link
2024-11-26 DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting Christian Homeyer et.al. 2411.17660 link
2024-11-26 Distractor-free Generalizable 3D Gaussian Splatting Yanqi Bao et.al. 2411.17605 link
2024-11-26 SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting Gyeongjin Kang et.al. 2411.17190 null
2024-11-26 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction Woong Oh Cho et.al. 2411.17044 null
2024-11-25 G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs Kunyi Li et.al. 2411.16898 null
2024-11-25 PreF3R: Pose-Free Feed-Forward 3D Gaussian Splatting from Variable-length Image Sequence Zequn Chen et.al. 2411.16877 null
2024-11-25 SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving Georg Hess et.al. 2411.16816 link
2024-11-25 SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis Hyojun Go et.al. 2411.16443 link
2024-11-25 Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction Ziyu Zhang et.al. 2411.16392 null
2024-11-25 Event-boosted Deformable 3D Gaussians for Fast Dynamic Scene Reconstruction Wenhao Xu et.al. 2411.16180 null
2024-11-25 UnitedVLN: Generalizable Gaussian Splatting for Continuous Vision-Language Navigation Guangzhao Dai et.al. 2411.16053 null
2024-11-24 PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments Haoang Li et.al. 2411.15800 null
2024-11-24 ZeroGS: Training 3D Gaussian Splatting from Unposed Images Yu Chen et.al. 2411.15779 null
2024-11-24 DynamicAvatars: Accurate Dynamic Facial Avatars Reconstruction and Precise Editing with Diffusion Models Yangyang Qian et.al. 2411.15732 null
2024-11-24 GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision Xu Baixin et.al. 2411.15723 link
2024-11-23 EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting Xiaobao Wei et.al. 2411.15582 null
2024-11-23 SplatFlow: Self-Supervised Dynamic Gaussian Splatting in Neural Motion Flow Field for Autonomous Driving Su Sun et.al. 2411.15482 null
2024-11-22 Neural 4D Evolution under Large Topological Changes from 2D Images AmirHossein Naghi Razlighi et.al. 2411.15018 null
2024-11-22 3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Jan Held et.al. 2411.14974 link
2024-11-22 Dynamics-Aware Gaussian Splatting Streaming Towards Fast On-the-Fly Training for 4D Reconstruction Zhening Liu et.al. 2411.14847 null
2024-11-22 VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving Haiming Zhang et.al. 2411.14716 null
2024-11-21 NexusSplats: Efficient 3D Gaussian Splatting in the Wild Yuzhou Tang et.al. 2411.14514 null
2024-11-21 Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation Zhuoman Liu et.al. 2411.14423 null
2024-11-21 Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation Yuanhao Cai et.al. 2411.14384 null
2024-11-21 SplatR: Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching Arjun P S et.al. 2411.14322 link
2024-11-20 FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting Ola Shorinwa et.al. 2411.13753 null
2024-11-20 Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization Hao Ju et.al. 2411.13610 null
2024-11-20 Generating 3D-Consistent Videos from Unposed Internet Photos Gene Chou et.al. 2411.13549 null
2024-11-20 GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting Xiaobao Wei et.al. 2411.12981 null
2024-11-19 Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting Haoyu Zhao et.al. 2411.12789 null
2024-11-19 Mini-Splatting2: Building 360 Scenes within Minutes via Aggressive Gaussian Densification Guangchi Fang et.al. 2411.12788 null
2024-11-19 PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy Joanna Kaleta et.al. 2411.12510 link
2024-11-19 SCIGS: 3D Gaussians Splatting from a Snapshot Compressive Image Zixu Wang et.al. 2411.12471 null
2024-11-20 Beyond Gaussians: Fast and High-Fidelity 3D Splatting with Linear Kernels Haodong Chen et.al. 2411.12440 null
2024-11-19 LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments Renxiang Xiao et.al. 2411.12185 null
2024-11-19 Sketch-guided Cage-based 3D Gaussian Splatting Deformation Tianhao Xie et.al. 2411.12168 null
2024-11-18 FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting Fangyu Wu et.al. 2411.12089 null
2024-11-18 TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction DaDong Jiang et.al. 2411.11941 null
2024-11-18 DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes Chensheng Peng et.al. 2411.11921 link
2024-11-18 RoboGSim: A Real2Sim2Real Robotic Gaussian Splatting Simulator Xinhai Li et.al. 2411.11839 null
2024-11-18 GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views Boyao Zhou et.al. 2411.11363 null
2024-11-17 VeGaS: Video Gaussian Splatting Weronika Smolak-Dyżewska et.al. 2411.11024 link
2024-11-17 Direct and Explicit 3D Generation from a Single Image Haoyu Wu et.al. 2411.10947 null
2024-11-16 DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment Mangyu Kong et.al. 2411.10722 link
2024-11-15 The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods Yifu Tao et.al. 2411.10546 null
2024-11-15 USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting Kang Chen et.al. 2411.10504 link
2024-11-15 Efficient Density Control for 3D Gaussian Splatting Xiaobin Deng et.al. 2411.10133 link
2024-11-15 GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization Yanhao Sun et.al. 2411.10033 null
2024-11-15 GGAvatar: Reconstructing Garment-Separated 3D Gaussian Splatting Avatars from Monocular Video Jingxuan Chen et.al. 2411.09952 link
2024-11-14 Adversarial Attacks Using Differentiable Rendering: A Survey Matthew Hull et.al. 2411.09749 null
2024-11-14 DyGASR: Dynamic Generalized Exponential Splatting with Surface Alignment for Accelerated 3D Mesh Reconstruction Shengchao Zhao et.al. 2411.09156 null
2024-11-13 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization Mijeong Kim et.al. 2411.08879 null
2024-11-13 Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models Chengdong Dong et.al. 2411.08642 null
2024-11-13 BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis David Svitov et.al. 2411.08508 link
2024-11-13 Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model Yutao Shen et.al. 2411.08453 null
2024-11-13 DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization Yueming Xu et.al. 2411.08373 null
2024-11-13 MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation Peng Wang et.al. 2411.08279 link
2024-11-14 Projecting Gaussian Ellipsoids While Avoiding Affine Projection Approximation Han Qi et.al. 2411.07579 null
2024-11-12 GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting Umangi Jain et.al. 2411.07555 null
2024-11-12 HiCoM: Hierarchical Coherent Motion for Streamable Dynamic Scene with 3D Gaussian Splatting Qiankun Gao et.al. 2411.07541 link
2024-11-12 GUS-IR: Gaussian Splatting with Unified Shading for Inverse Rendering Zhihao Liang et.al. 2411.07478 null
2024-11-11 A Hierarchical Compression Technique for 3D Gaussian Splatting Compression He Huang et.al. 2411.06976 null
2024-11-10 Adaptive and Temporally Consistent Gaussian Surfels for Multi-view Dynamic Reconstruction Decai Chen et.al. 2411.06602 null
2024-11-12 SplatFormer: Point Transformer for Robust 3D Gaussian Splatting Yutong Chen et.al. 2411.06390 link
2024-11-10 Through the Curved Cover: Synthesizing Cover Aberrated Scenes with Refractive Field Liuyue Xie et.al. 2411.06365 null
2024-11-09 AI-Driven Stylization of 3D Environments Yuanbo Chen et.al. 2411.06067 null
2024-11-09 GaussianSpa: An “Optimizing-Sparsifying” Simplification Framework for Compact and High-Quality 3D Gaussian Splatting Yangming Zhang et.al. 2411.06019 null
2024-11-07 ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing Jun-Kun Chen et.al. 2411.05006 null
2024-11-07 MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views Yuedong Chen et.al. 2411.04924 link
2024-11-08 GS2Pose: Two-stage 6D Object Pose Estimation Guided by Gaussian Splatting Jilan Mei et.al. 2411.03807 null
2024-11-06 3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object Rearrangement Ziqi Lu et.al. 2411.03706 link
2024-11-06 Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis Rui Peng et.al. 2411.03637 link
2024-11-05 Object and Contact Point Tracking in Demonstrations Using 3D Gaussian Splatting Michael Büttner et.al. 2411.03555 null
2024-11-05 HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features Arnab Dey et.al. 2411.03086 null
2024-11-05 LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting Huibin Zhao et.al. 2411.02703 null
2024-11-04 Modeling Uncertainty in 3D Gaussian Splatting through Continuous Semantic Splatting Joey Wilson et.al. 2411.02547 null
2024-11-06 SplatOverflow: Asynchronous Hardware Troubleshooting Amritansh Kwatra et.al. 2411.02332 null
2024-11-05 FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training Ruihong Yin et.al. 2411.02229 null
2024-11-06 GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes Gaochao Song et.al. 2411.01853 null
2024-11-02 Real-Time Spatio-Temporal Reconstruction of Dynamic Endoscopic Scenes with 4D Gaussian Splatting Fengze Li et.al. 2411.01218 null
2024-11-01 CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes Yang Liu et.al. 2411.00771 null
2024-11-01 PCoTTA: Continual Test-Time Adaptation for Multi-Task Point Cloud Understanding Jincen Jiang et.al. 2411.00632 null
2024-10-31 Aquatic-GS: A Hybrid 3D Representation for Underwater Scenes Shaohua Liu et.al. 2411.00239 null
2024-10-31 Self-Ensembling Gaussian Splatting for Few-shot Novel View Synthesis Chen Zhao et.al. 2411.00144 link
2024-10-31 No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images Botao Ye et.al. 2410.24207 link
2024-11-01 GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering Kai Ye et.al. 2410.24204 null
2024-10-31 GaussianMarker: Uncertainty-Aware Copyright Protection of 3D Gaussian Splatting Xiufeng Huang et.al. 2410.23718 null
2024-10-31 GS-Blur: A 3D Scene-Based Dataset for Realistic Image Deblurring Dongwoo Lee et.al. 2410.23658 link
2024-10-30 ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting Muhammad Salman Ali et.al. 2410.23213 null
2024-10-31 Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis Zhiyuan Min et.al. 2410.22817 null
2024-10-30 Geometry Cloak: Preventing TGS-based 3D Reconstruction from Copyrighted Images Qi Song et.al. 2410.22705 null
2024-10-29 PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting Sunghwan Hong et.al. 2410.22128 link
2024-10-29 FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives Qizhi Chen et.al. 2410.22070 null
2024-10-29 ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting Yuetao Li et.al. 2410.21955 link
2024-10-28 MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps Yating Xu et.al. 2410.21566 link
2024-10-28 Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting Jiawei Xu et.al. 2410.20815 null
2024-10-28 LoDAvatar: Hierarchical Embedding and Adaptive Levels of Detail with Gaussian Splatting for Enhanced Human Avatars Xiaonuo Dongye et.al. 2410.20789 null
2024-10-28 CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians Chongjian Ge et.al. 2410.20723 null
2024-10-28 ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings Suyoung Lee et.al. 2410.20686 link
2024-10-27 Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering Meng Wei et.al. 2410.20593 null
2024-10-26 Neural Fields in Robotics: A Survey Muhammad Zubair Irshad et.al. 2410.20220 link
2024-10-25 DiffGS: Functional Gaussian Splatting Diffusion Junsheng Zhou et.al. 2410.19657 null
2024-10-25 Robotic Learning in your Backyard: A Neural Simulator from Open Source Components Liyou Zhou et.al. 2410.19564 link
2024-10-25 Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization Weihang Liu et.al. 2410.19483 link
2024-10-24 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation Hansheng Chen et.al. 2410.18974 link
2024-10-24 Sort-free Gaussian Splatting via Weighted Sum Rendering Qiqi Hou et.al. 2410.18931 null
2024-10-24 Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling Mingtong Zhang et.al. 2410.18912 null
2024-10-27 Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis Liang Han et.al. 2410.18822 null
2024-10-23 VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points Linus Franke et.al. 2410.17932 null
2024-10-23 PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting Yu Wang et.al. 2410.17505 null
2024-10-22 AG-SLAM: Active Gaussian Splatting SLAM Wen Jiang et.al. 2410.17422 null
2024-10-22 SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes Cheng-De Fan et.al. 2410.17249 null
2024-10-18 GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting Yusen Xie et.al. 2410.17084 null
2024-10-22 E-3DGS: Gaussian Splatting with Exposure and Motion Events Xiaoting Yin et.al. 2410.16995 link
2024-10-22 Multi-Layer Gaussian Splatting for Immersive Anatomy Visualization Constantin Kleinbeck et.al. 2410.16978 link
2024-10-21 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors Xi Liu et.al. 2410.16266 null
2024-10-21 MSGField: A Unified Scene Representation Integrating Motion, Semantics, and Geometry for Robotic Manipulation Yu Sheng et.al. 2410.15730 null
2024-10-22 Fully Explicit Dynamic Gaussian Splatting Junoh Lee et.al. 2410.15629 null
2024-10-22 EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting Bohao Liao et.al. 2410.15392 null
2024-10-18 LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes Juliette Marrie et.al. 2410.14462 null
2024-10-18 Neural Signed Distance Function Inference through Splatting 3D Gaussians Pulled on Zero-Level Set Wenyuan Zhang et.al. 2410.14189 null
2024-10-18 DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction Ange Lou et.al. 2410.14169 null
2024-10-17 DepthSplat: Connecting Gaussian Splatting and Depth Haofei Xu et.al. 2410.13862 link
2024-10-17 Differentiable Robot Rendering Ruoshi Liu et.al. 2410.13851 null
2024-10-17 MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes Xinjie Zhang et.al. 2410.13613 null
2024-10-17 DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering Jiahao Lu et.al. 2410.13607 link
2024-10-17 GlossyGS: Inverse Rendering of Glossy Objects with 3D Gaussian Splatting Shuichang Lai et.al. 2410.13349 null
2024-10-16 Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats Chen Ziwen et.al. 2410.12781 null
2024-10-16 3D Gaussian Splatting in Robotics: A Survey Siting Zhu et.al. 2410.12262 link
2024-10-15 SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection Yizhe Liu et.al. 2410.12080 link
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-15 GS^3: Efficient Relighting with Triple Gaussian Splatting Zoubin Bi et.al. 2410.11419 link
2024-10-15 MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields Yuru Xiao et.al. 2410.11394 null
2024-10-15 GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information Wancai Zheng et.al. 2410.11356 null
2024-10-15 Scalable Indoor Novel-View Synthesis using Drone-Captured 360 Imagery with 3D Gaussian Splatting Yuanbo Chen et.al. 2410.11285 null
2024-10-14 Few-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting Raja Kumar et.al. 2410.11080 link
2024-10-15 4-LEGS: 4D Language Embedded Gaussian Splatting Gal Fiebelman et.al. 2410.10719 null
2024-10-14 4DStyleGaussian: Zero-shot 4D Style Transfer with Gaussian Splatting Wanlin Liang et.al. 2410.10412 null
2024-10-13 Gaussian Splatting Visual MPC for Granular Media Manipulation Wei-Cheng Tseng et.al. 2410.09740 null
2024-10-12 Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors Hritam Basak et.al. 2410.09467 null
2024-10-11 SurgicalGS: Dynamic 3D Gaussian Splatting for Accurate Robotic-Assisted Surgical Scene Reconstruction Jialei Chen et.al. 2410.09292 null
2024-10-11 MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering Jaehoon Choi et.al. 2410.08941 null
2024-10-11 Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars Xuan Huang et.al. 2410.08840 link
2024-10-11 Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose Initialization Christian Schmidt et.al. 2410.08743 link
2024-10-10 FusionSense: Bridging Common Sense, Vision, and Touch for Robust Sparse-View Reconstruction Irving Fang et.al. 2410.08282 null
2024-10-10 Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics Junyi Cao et.al. 2410.08257 null
2024-10-10 Poison-splat: Computation Cost Attack on 3D Gaussian Splatting Jiahao Lu et.al. 2410.08190 link
2024-10-10 DifFRelight: Diffusion-Based Facial Performance Relighting Mingming He et.al. 2410.08188 null
2024-10-10 Efficient Perspective-Correct 3D Gaussian Splatting Using Hybrid Transparency Florian Hahlbohm et.al. 2410.08129 null
2024-10-10 IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera Jian Huang et.al. 2410.08107 link
2024-10-11 Fast Feedforward 3D Gaussian Splatting Compression Yihang Chen et.al. 2410.08017 link
2024-10-10 L-VITeX: Light-weight Visual Intuition for Terrain Exploration Antar Mazumder et.al. 2410.07872 null
2024-10-10 MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting Ruijie Zhu et.al. 2410.07707 link
2024-10-10 3D Vision-Language Gaussian Splatting Qucheng Peng et.al. 2410.07577 null
2024-10-09 DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation Zhiqi Li et.al. 2410.06756 null
2024-10-09 ES-Gaussian: Gaussian Splatting Mapping via Error Space-Based Gaussian Completion Lu Chen et.al. 2410.06613 null
2024-10-09 3D Representation Methods: A Survey Zhengren Wang et.al. 2410.06475 null
2024-10-08 HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction Shengji Tang et.al. 2410.06245 null
2024-10-10 RelitLRM: Generative Relightable Radiance for Large Reconstruction Models Tianyuan Zhang et.al. 2410.06231 null
2024-10-08 GSLoc: Visual Localization with 3D Gaussian Splatting Kazii Botashev et.al. 2410.06165 null
2024-10-08 SplaTraj: Camera Trajectory Generation with Semantic Gaussian Splatting Xinyi Liu et.al. 2410.06014 null
2024-10-08 Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters Guoji Tian et.al. 2410.05772 null
2024-10-07 PH-Dropout: Practical Epistemic Uncertainty Quantification for View Synthesis Chuanhao Sun et.al. 2410.05468 link
2024-10-07 GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting Yukang Cao et.al. 2410.05259 null
2024-10-07 LiDAR-GS: Real-time LiDAR Re-Simulation using Gaussian Splatting Qifeng Chen et.al. 2410.05111 null
2024-10-07 DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects Nidhi Mathihalli et.al. 2410.05097 link
2024-10-07 PhotoReg: Photometrically Registering 3D Gaussian Splatting Models Ziwen Yuan et.al. 2410.05044 null
2024-10-07 6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering Zhongpai Gao et.al. 2410.04974 null
2024-10-07 Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting Matthew Strong et.al. 2410.04680 link
2024-10-06 Mode-GS: Monocular Depth Guided Anchored 3D Gaussian Splatting for Robust Ground-View Scene Rendering Yonghan Lee et.al. 2410.04646 null
2024-10-06 StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting Xiao Cui et.al. 2410.04354 null
2024-10-04 Variational Bayes Gaussian Splatting Toon Van de Maele et.al. 2410.03592 link
2024-10-03 Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats Mingyang Xie et.al. 2410.02764 null
2024-10-03 GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering Hongze Chen et.al. 2410.02619 null
2024-10-03 SuperGS: Super-Resolution 3D Gaussian Splatting via Latent Feature Field and Gradient-guided Splitting Shiyun Xie et.al. 2410.02571 link
2024-10-02 MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis Xiaobiao Du et.al. 2410.02103 link
2024-10-03 EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis Alexander Mai et.al. 2410.01804 null
2024-10-02 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection Yang Cao et.al. 2410.01647 link
2024-10-02 Gaussian Splatting in Mirrors: Reflection-Aware Rendering via Virtual Camera Optimization Zihan Wang et.al. 2410.01614 link
2024-10-02 GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians Shuyi Jiang et.al. 2410.01535 null
2024-10-02 MiraGe: Editable 2D Images using Gaussian Splatting Joanna Waczyńska et.al. 2410.01521 link
2024-10-02 UW-GS: Distractor-Aware 3D Gaussian Splatting for Enhanced Underwater Scene Reconstruction Haoran Wang et.al. 2410.01517 link
2024-10-02 EVA-Gaussian: 3D Gaussian-based Real-time Human Novel View Synthesis under Diverse Camera Settings Yingdong Hu et.al. 2410.01425 null
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404 null
2024-10-02 CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM Dapeng Feng et.al. 2410.00486 link
2024-10-01 Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance Hongchao Shu et.al. 2410.00386 null
2024-09-30 RL-GSBridge: 3D Gaussian Splatting Based Real2Sim2Real Method for Robotic Manipulation Learning Yuxuan Wu et.al. 2409.20291 null
2024-09-30 Robust Gaussian Splatting SLAM by Leveraging Loop Closure Zunjie Zhu et.al. 2409.20111 null
2024-10-01 RNG: Relightable Neural Gaussians Jiahui Fan et.al. 2409.19702 null
2024-09-28 GS-EVT: Cross-Modal Event Camera Tracking based on Gaussian Splatting Tao Liu et.al. 2409.19228 null
2024-09-28 1st Place Solution to the 8th HANDS Workshop Challenge – ARCTIC Track: 3DGS-based Bimanual Category-agnostic Interaction Reconstruction Jeongwan On et.al. 2409.19215 null
2024-09-27 Gaussian Heritage: 3D Digitization of Cultural Heritage with Integrated Object Segmentation Mahtab Dahaghin et.al. 2409.19039 null
2024-09-27 Space-time 2D Gaussian Splatting for Accurate Surface Reconstruction under Complex Dynamic Scenes Shuo Wang et.al. 2409.18852 link
2024-09-26 RT-GuIDE: Real-Time Gaussian splatting for Information-Driven Exploration Yuezhan Tao et.al. 2409.18122 null
2024-09-26 Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot Justin Yu et.al. 2409.18108 null
2024-09-26 WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians Dmytro Kotovenko et.al. 2409.17917 null
2024-09-26 HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian Splatting Zijun Xu et.al. 2409.17624 null
2024-09-25 SeaSplat: Representing Underwater Scenes with 3D Gaussian Splatting and a Physically Grounded Image Formation Model Daniel Yang et.al. 2409.17345 null
2024-09-25 Disco4D: Disentangled 4D Human Generation and Animation from a Single Image Hui En Pang et.al. 2409.17280 null
2024-09-25 Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM Phu Pham et.al. 2409.16944 null
2024-09-25 Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model Hongliang Zhong et.al. 2409.16938 link
2024-09-25 Let’s Make a Splan: Risk-Aware Trajectory Optimization in a Normalized Gaussian Splat Jonathan Michaux et.al. 2409.16915 null
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-24 Frequency-based View Selection in Gaussian Splatting Reconstruction Monica M. Q. Li et.al. 2409.16470 null
2024-09-23 Gaussian Déjà-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities Peizhi Yan et.al. 2409.16147 link
2024-09-24 Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction and Rendering in Virtual Reality Hannah Schieber et.al. 2409.15959 null
2024-09-24 Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB Jae Yong Lee et.al. 2409.15689 null
2024-09-23 Human Hair Reconstruction with Strand-Aligned 3D Gaussians Egor Zakharov et.al. 2409.14778 null
2024-09-22 MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views Wangze Xu et.al. 2409.14316 null
2024-09-21 SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality Hongjia Zhai et.al. 2409.14067 null
2024-09-20 Elite-EvGS: Learning Event-based 3D Gaussian Splatting by Distilling Event-to-Video Priors Zixin Zhang et.al. 2409.13392 null
2024-09-20 3D-GSW: 3D Gaussian Splatting Watermark for Protecting Copyrights in Radiance Fields Youngdong Jang et.al. 2409.13222 null
2024-09-19 MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting Yan Song Hu et.al. 2409.13055 null
2024-09-19 GStex: Per-Primitive Texturing of 2D Gaussian Splatting for Decoupled Appearance and Geometry Modeling Victor Rong et.al. 2409.12954 link
2024-09-18 Vista3D: Unravel the 3D Darkside of a Single Image Qiuhong Shen et.al. 2409.12193 link
2024-09-18 SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation Mingze Sun et.al. 2409.11682 link
2024-09-18 Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks Joji Joseph et.al. 2409.11681 link
2024-09-17 RenderWorld: World Model with Self-Supervised 3D Label Ziyang Yan et.al. 2409.11356 null
2024-09-17 GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module Yichen Zhang et.al. 2409.11307 null
2024-09-17 SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction Marko Mihajlovic et.al. 2409.11211 null
2024-09-17 GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure Ziheng Xu et.al. 2409.10982 null
2024-09-16 Phys3DGS: Physically-based 3D Gaussian Splatting for Inverse Rendering Euntae Choi et.al. 2409.10335 null
2024-09-16 BEINGS: Bayesian Embodied Image-goal Navigation with Gaussian Splatting Wugang Meng et.al. 2409.10216 link
2024-09-16 SplatSim: Zero-Shot Sim2Real Transfer of RGB Manipulation Policies Using Gaussian Splatting Mohammad Nomaan Qureshi et.al. 2409.10161 null
2024-09-16 Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression Yi-Hsin Li et.al. 2409.10101 null
2024-09-16 DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments Mahmud A. Mohamad et.al. 2409.10041 link
2024-09-15 SAFER-Splat: A Control Barrier Function for Safe Navigation with Online Gaussian Splatting Maps Timothy Chen et.al. 2409.09868 null
2024-09-15 MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation Shuzhao Xie et.al. 2409.09756 null
2024-09-14 GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians Dasong Gao et.al. 2409.09295 link
2024-09-13 A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis Yohan Poirier-Ginter et.al. 2409.08947 null
2024-09-13 AdR-Gaussian: Accelerating Gaussian Splatting with Adaptive Radius Xinzhe Wang et.al. 2409.08669 null
2024-09-13 Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints Shan Chen et.al. 2409.08613 null
2024-09-13 CSS: Overcoming Pose and Scene Challenges in Crowd-Sourced 3D Gaussian Splatting Runze Chen et.al. 2409.08562 null
2024-09-12 Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos Yuheng Jiang et.al. 2409.08353 null
2024-09-12 FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally Qiuhong Shen et.al. 2409.08270 link
2024-09-12 Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis Qian Chen et.al. 2409.08042 link
2024-09-12 SwinGS: Sliding Window Gaussian Splatting for Volumetric Video Streaming with Arbitrary Length Bangya Liu et.al. 2409.07759 null
2024-09-11 Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs Sadra Safadoust et.al. 2409.07456 null
2024-09-11 Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models Haibo Yang et.al. 2409.07452 link
2024-09-11 Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering Dafei Qin et.al. 2409.07441 null
2024-09-11 Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks Ruihan Xu et.al. 2409.07245 null
2024-09-11 ThermalGaussian: Thermal 3D Gaussian Splatting Rongfeng Lu et.al. 2409.07200 link
2024-09-10 gsplat: An Open-Source Library for Gaussian Splatting Vickie Ye et.al. 2409.06765 link
2024-09-10 GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction Junyi Chen et.al. 2409.06685 null
2024-09-10 Sources of Uncertainty in 3D Scene Reconstruction Marcus Klasson et.al. 2409.06407 link
2024-09-09 Online 3D reconstruction and dense tracking in endoscopic videos Michel Hayoz et.al. 2409.06037 link
2024-09-09 GASP: Gaussian Splatting for Physic-Based Simulations Piotr Borycki et.al. 2409.05819 link
2024-09-09 Lagrangian Hashing for Compressed Neural Field Representations Shrisudhan Govindarajan et.al. 2409.05334 null
2024-09-08 DreamMapping: High-Fidelity Text-to-3D Generation via Variational Distribution Mapping Zeyu Cai et.al. 2409.05099 null
2024-09-08 GS-PT: Exploiting 3D Gaussian Splatting for Comprehensive Point Cloud Understanding via Self-supervised Learning Keyi Liu et.al. 2409.04963 null
2024-09-11 Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras Zimu Liao et.al. 2409.04751 link
2024-09-06 GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers Lorenza Prospero et.al. 2409.04196 link
2024-09-06 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors Yujun Huang et.al. 2409.04013 link
2024-09-05 LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors Hanyang Yu et.al. 2409.03456 null
2024-09-05 Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction Shen Chen et.al. 2409.03213 null
2024-09-04 Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models Zhibin Liu et.al. 2409.02851 link
2024-09-04 Object Gaussian for Monocular 6D Pose Estimation from Sparse Views Luqing Luo et.al. 2409.02581 null
2024-09-04 GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving Huasong Han et.al. 2409.02382 null
2024-09-03 DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction Jenny Seidenschwarz et.al. 2409.02104 null
2024-09-03 PRoGS: Progressive Rendering of Gaussian Splats Brent Zoomers et.al. 2409.01761 null
2024-09-03 GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting Zixuan Guo et.al. 2409.01581 null
2024-09-02 Free-DyGS: Camera-Pose-Free Scene Reconstruction based on Gaussian Splatting for Dynamic Surgical Videos Qian Li et.al. 2409.01003 null
2024-08-31 3D Gaussian Splatting for Large-scale 3D Surface Reconstruction from Aerial Images YuanZheng Wu et.al. 2409.00381 null
2024-08-31 UDGS-SLAM: UniDepth Assisted Gaussian Splatting for Monocular SLAM Mostafa Mansour et.al. 2409.00362 null
2024-08-30 OG-Mapping: Octree-based Structured 3D Gaussians for Online Dense Mapping Meng Wang et.al. 2408.17223 null
2024-08-30 2DGH: 2D Gaussian-Hermite Splatting for High-quality Rendering and Better Geometry Reconstruction Ruihan Yu et.al. 2408.16982 null
2024-08-29 ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model Fangfu Liu et.al. 2408.16767 null
2024-08-29 OmniRe: Omni Urban Scene Reconstruction Ziyu Chen et.al. 2408.16760 null
2024-08-28 Towards Realistic Example-based Modeling via 3D Gaussian Stitching Xinyu Gao et.al. 2408.15708 null
2024-08-28 G-Style: Stylized Gaussian Splatting Áron Samuel Kovács et.al. 2408.15695 link
2024-08-27 Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty Saining Zhang et.al. 2408.15242 link
2024-08-27 Learning-based Multi-View Stereo: A Survey Fangjinhua Wang et.al. 2408.15235 null
2024-08-27 Robo-GS: A Physics Consistent Spatial-Temporal Model for Robotic Arm with Hybrid Representation Haozhe Lou et.al. 2408.14873 null
2024-08-27 LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming Yuang Shi et.al. 2408.14823 link
2024-08-26 Avatar Concept Slider: Manipulate Concepts In Your Human Avatar With Fine-grained Control Yixuan He et.al. 2408.13995 null
2024-08-26 DynaSurfGS: Dynamic Surface Reconstruction with Planar-based Gaussian Splatting Weiwei Cai et.al. 2408.13972 link
2024-08-27 Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs Brandon Smart et.al. 2408.13912 null
2024-08-25 TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers Chuanrui Zhang et.al. 2408.13770 null
2024-08-25 SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting Wenrui Li et.al. 2408.13711 link
2024-08-23 BiGS: Bidirectional Gaussian Primitives for Relightable 3D Gaussian Splatting Zhenyuan Liu et.al. 2408.13370 null
2024-08-23 S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points Bing He et.al. 2408.13036 link
2024-08-23 FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering Yunji Seo et.al. 2408.12894 null
2024-08-26 GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion Jiaxin Wei et.al. 2408.12677 link
2024-08-22 Subsurface Scattering for 3D Gaussian Splatting Jan-Niklas Dihlmann et.al. 2408.12282 null
2024-08-21 Robust 3D Gaussian Splatting for Novel View Synthesis in Presence of Distractors Paul Ungermann et.al. 2408.11697 link
2024-08-22 DeRainGS: Gaussian Splatting for Enhanced Scene Reconstruction in Rainy Environments Shuhong Liu et.al. 2408.11540 null
2024-08-21 GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting Wanshui Gan et.al. 2408.11447 link
2024-08-21 Pano2Room: Novel View Synthesis from a Single Indoor Panorama Guo Pu et.al. 2408.11413 link
2024-08-20 GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting Changkun Liu et.al. 2408.11085 link
2024-08-20 ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining Qi Ma et.al. 2408.10906 null
2024-08-20 DEGAS: Detailed Expressions on Full-Body Gaussian Avatars Zhijing Shao et.al. 2408.10588 link
2024-08-20 LoopSplat: Loop Closure by Registering 3D Gaussian Splats Liyuan Zhu et.al. 2408.10154 link
2024-08-19 Implicit Gaussian Splatting with Efficient Multi-Level Tri-Plane Representation Minye Wu et.al. 2408.10041 null
2024-08-19 SG-GS: Photo-realistic Animatable Human Avatars with Semantically-Guided Gaussian Splatting Haoyu Zhao et.al. 2408.09665 null
2024-08-20 CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning Haoyu Zhao et.al. 2408.09663 null
2024-08-20 Gaussian in the Dark: Real-Time View Synthesis From Inconsistent Dark Images Using Gaussian Splatting Sheng Ye et.al. 2408.09130 link
2024-08-16 Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS Wei Sun et.al. 2408.08723 null
2024-08-16 GS-ID: Illumination Decomposition on Gaussian Splatting via Diffusion Prior and Parametric Light Source Optimization Kang Du et.al. 2408.08524 link
2024-08-15 WaterSplatting: Fast Underwater 3D Scene Reconstruction Using Gaussian Splatting Huapeng Li et.al. 2408.08206 null
2024-08-19 FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering Guofeng Feng et.al. 2408.07967 link
2024-08-14 Progressive Radiance Distillation for Inverse Rendering with Gaussian Splatting Keyang Ye et.al. 2408.07595 null
2024-08-14 3D Gaussian Editing with A Single Image Guan Luo et.al. 2408.07540 null
2024-08-13 SpectralGaussians: Semantic, spectral 3D Gaussian splatting for multi-spectral scene representation, visualization and analysis Saptarshi Neil Sinha et.al. 2408.06975 null
2024-08-13 HDRGS: High Dynamic Range Gaussian Splatting Jiahao Wu et.al. 2408.06543 link
2024-08-12 Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering Jiameng Li et.al. 2408.06286 link
2024-08-12 Developing Smart MAVs for Autonomous Inspection in GPS-denied Constructions Paoqiang Pan et.al. 2408.06030 null
2024-08-12 HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors Xiaozheng Zheng et.al. 2408.06019 null
2024-08-10 Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis Zhongche Qu et.al. 2408.05635 null
2024-08-09 DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow Hangyu Li et.al. 2408.05008 null
2024-08-14 Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction Lingbei Meng et.al. 2408.04831 null
2024-08-06 LumiGauss: High-Fidelity Outdoor Relighting with 2D Gaussian Splatting Joanna Kaleta et.al. 2408.04474 link
2024-08-08 A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery Mengya Xu et.al. 2408.04426 link
2024-08-08 InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting Xin-Yi Yu et.al. 2408.04249 null
2024-08-07 Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM Yan Song Hu et.al. 2408.03825 null
2024-08-07 Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields Joo Chan Lee et.al. 2408.03822 null
2024-08-07 3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting Zhe Jun Tang et.al. 2408.03753 link
2024-08-07 PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting Yijia Guo et.al. 2408.03538 null
2024-08-02 A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness Lutao Jiang et.al. 2408.01269 null
2024-08-02 Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion Ke Li et.al. 2408.01225 link
2024-08-07 IG-SLAM: Instant Gaussian SLAM F. Aykut Sarikamis et.al. 2408.01126 null
2024-08-01 LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting Zhenyu Bao et.al. 2408.00254 null
2024-07-31 Localized Gaussian Splatting Editing with Contextual Awareness Hanyuan Xiao et.al. 2408.00083 null
2024-07-31 Expressive Whole-Body 3D Gaussian Avatar Gyeongsik Moon et.al. 2407.21686 null
2024-07-30 SceneTeller: Language-to-3D Scene Generation Başak Melis Öcal et.al. 2407.20727 null
2024-07-29 Registering Neural 4D Gaussians for Endoscopic Surgery Yiming Huang et.al. 2407.20213 null
2024-07-29 Radiance Fields for Robotic Teleoperation Maximum Wilder-Smith et.al. 2407.20194 link
2024-07-26 ScalingGaussian: Enhancing 3D Content Creation with Generative Gaussian Splatting Shen Chen et.al. 2407.19035 null
2024-07-25 GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution Jintong Hu et.al. 2407.18046 null
2024-07-24 3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities Yanqi Bao et.al. 2407.17418 link
2024-07-29 DHGS: Decoupled Hybrid Gaussian Splatting for Driving Scene Xi Shi et.al. 2407.16600 null
2024-07-23 HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images Shreyas Singh et.al. 2407.16503 link
2024-07-23 Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance Jiyeop Kim et.al. 2407.16173 null
2024-07-22 6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model Matteo Bortolon et.al. 2407.15484 null
2024-07-22 Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures Ruizhe Wang et.al. 2407.15435 null
2024-07-21 HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions Haiyang Zhou et.al. 2407.15187 null
2024-07-20 Realistic Surgical Image Dataset Generation Based On 3D Gaussian Splatting Tianle Zeng et.al. 2407.14846 null
2024-07-19 A Benchmark for Gaussian Splatting Compression and Quality Assessment Study Qi Yang et.al. 2407.14197 link
2024-07-19 GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation Florian Chabot et.al. 2407.14108 null
2024-07-19 DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays Zongyuan Yang et.al. 2407.14053 null
2024-07-20 Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation Zongrui Li et.al. 2407.13584 link
2024-07-18 EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting Yuchen Weng et.al. 2407.13520 null
2024-07-17 Generalizable Human Gaussians for Sparse View Synthesis Youngjoong Kwon et.al. 2407.12777 link
2024-07-17 Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections Congrong Xu et.al. 2407.12306 null
2024-07-16 MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification Zhuoxiao Li et.al. 2407.11840 null
2024-07-16 Click-Gaussian: Interactive Segmentation to Any 3D Gaussians Seokhun Choi et.al. 2407.11793 null
2024-07-16 SlingBAG: Sliding ball adaptive growth algorithm with differentiable radiation enables super-efficient iterative 3D photoacoustic image reconstruction Shuang Li et.al. 2407.11781 link
2024-07-16 I$^2$-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM Gwangtak Bae et.al. 2407.11347 null
2024-07-16 Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering Jingqian Wu et.al. 2407.11343 null
2024-07-16 Gaussian Splatting LK Liuyue Xie et.al. 2407.11309 null
2024-07-15 iHuman: Instant Animatable Digital Humans From Monocular Videos Pramish Paudel et.al. 2407.11174 link
2024-07-15 Scaling 3D Reasoning with LMMs to Large Robot Mission Environments Using Datagraphs W. J. Meijer et.al. 2407.10743 null
2024-07-15 Interactive Rendering of Relightable and Animatable Gaussian Avatars Youyi Zhan et.al. 2407.10707 link
2024-07-16 RecGS: Removing Water Caustic with Recurrent Gaussian Splatting Tianyi Zhang et.al. 2407.10318 null
2024-07-14 3DEgo: 3D Editing on the Go! Umar Khalid et.al. 2407.10102 null
2024-07-14 SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion Jiyuan Zhang et.al. 2407.10062 null
2024-07-13 Textured-GS: Gaussian Splatting with Spatially Defined Color and Opacity Zhentao Huang et.al. 2407.09733 link
2024-07-12 StyleSplat: 3D Object Style Transfer with Gaussian Splatting Sahil Jain et.al. 2407.09473 null
2024-07-11 WildGaussians: 3D Gaussian Splatting in the Wild Jonas Kulhanek et.al. 2407.08447 link
2024-07-11 Survey on Fundamental Deep Learning 3D Reconstruction Techniques Yonge Bai et.al. 2407.08137 null
2024-07-10 MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition Aggelina Chatziagapi et.al. 2407.07284 null
2024-07-09 Reference-based Controllable Scene Stylization with Gaussian Splatting Yiqun Mei et.al. 2407.07220 null
2024-07-10 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes Nicolas Moenne-Loccoz et.al. 2407.07090 null
2024-07-07 PICA: Physics-Integrated Clothed Avatar Bo Peng et.al. 2407.05324 null
2024-07-07 GaussReg: Fast 3D Registration with Gaussian Splatting Jiahao Chang et.al. 2407.05254 null
2024-07-06 SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction Weixing Xie et.al. 2407.05023 link
2024-07-05 Gaussian Eigen Models for Human Heads Wojciech Zielonka et.al. 2407.04545 null
2024-07-12 Segment Any 4D Gaussians Shengxiang Ji et.al. 2407.04504 null
2024-07-10 GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction Yuxuan Mu et.al. 2407.04237 null
2024-07-04 CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images Junghe Lee et.al. 2407.03923 null
2024-07-04 PFGS: High Fidelity Point Cloud Rendering via Feature Splatting Jiaxu Wang et.al. 2407.03857 link
2024-07-04 SpikeGS: Reconstruct 3D scene via fast-moving bio-inspired sensors Yijia Guo et.al. 2407.03771 null
2024-07-04 VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors Sungwon Hwang et.al. 2407.02945 link
2024-07-03 Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction Jiaxin Guo et.al. 2407.02918 link
2024-07-04 AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction Mustafa Khan et.al. 2407.02598 null
2024-07-02 TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation Chaofan Luo et.al. 2407.02034 null
2024-07-01 DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction Yujin Ham et.al. 2407.01761 null
2024-07-01 GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting Chenxin Li et.al. 2407.01301 null
2024-07-01 EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting Chenxin Li et.al. 2407.01029 null
2024-07-02 RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering Weikai Lin et.al. 2407.00435 link
2024-06-29 OccFusion: Rendering Occluded Humans with Generative Diffusion Priors Adam Sun et.al. 2407.00316 null
2024-06-28 SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting Sara Sabour et.al. 2406.20055 null
2024-06-28 EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting Daiwei Zhang et.al. 2406.19811 null
2024-06-27 Lightweight Predictive 3D Gaussian Splats Junli Cao et.al. 2406.19434 link
2024-06-26 Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos Colton Stearns et.al. 2406.18717 link
2024-06-26 On Scaling Up 3D Gaussian Splatting Training Hexu Zhao et.al. 2406.18533 link
2024-06-26 GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality Taoran Yi et.al. 2406.18462 null
2024-06-26 Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning Muhammad Salman Ali et.al. 2406.18214 link
2024-06-26 GS-Octree: Octree-based 3D Gaussian Splatting for Robust Object-level 3D Reconstruction Under Strong Lighting Jiaze Li et.al. 2406.18199 null
2024-06-26 VDG: Vision-Only Dynamic Gaussian for Driving Simulation Hao Li et.al. 2406.18198 null
2024-06-25 NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods Jonas Kulhanek et.al. 2406.17345 null
2024-06-24 Reducing the Memory Footprint of 3D Gaussian Splatting Panagiotis Papantonakis et.al. 2406.17074 null
2024-06-24 From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking Xiaohao Xu et.al. 2406.16850 link
2024-06-24 ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians Yufei Liu et.al. 2406.16815 null
2024-06-23 LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical Scene Reconstruction Hengyu Liu et.al. 2406.16073 link
2024-06-23 Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction Yangdi Lu et.al. 2406.15982 null
2024-06-21 Taming 3DGS: High-Quality Radiance Fields with Limited Resources Saswat Subhajyoti Mallick et.al. 2406.15643 link
2024-06-21 Gaussian Splatting to Real World Flight Navigation Transfer with Liquid Networks Alex Quach et.al. 2406.15149 null
2024-06-21 E2GS: Event Enhanced Gaussian Splatting Hiroyuki Deguchi et.al. 2406.14978 link
2024-06-18 Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models Paul Henderson et.al. 2406.13099 null
2024-06-18 HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors Panwang Pan et.al. 2406.12459 link
2024-06-17 A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets Bernhard Kerbl et.al. 2406.12080 null
2024-06-17 RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians Bingling Li et.al. 2406.11836 null
2024-06-18 Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting Junha Hyung et.al. 2406.11672 null
2024-06-16 Physically Embodied Gaussian Splatting: A Realtime Correctable World Model for Robotics Jad Abou-Chakra et.al. 2406.10788 null
2024-06-14 Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections Jiacong Xu et.al. 2406.10373 null
2024-06-14 L4GM: Large 4D Gaussian Reconstruction Model Jiawei Ren et.al. 2406.10324 null
2024-06-14 PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting Alex Hanson et.al. 2406.10219 link
2024-06-14 GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors Xiqian Yu et.al. 2406.10111 null
2024-06-14 GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion Trapoom Ukarapol et.al. 2406.09850 link
2024-06-14 Unified Gaussian Primitives for Scene Representation and Rendering Yang Zhou et.al. 2406.09733 null
2024-06-13 Modeling Ambient Scene Dynamics for Free-view Synthesis Meng-Li Shih et.al. 2406.09395 null
2024-06-13 GGHead: Fast and Generalizable 3D Gaussian Heads Tobias Kirschstein et.al. 2406.09377 null
2024-06-14 AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis Swapnil Bhosale et.al. 2406.08920 null
2024-06-13 Gaussian-Forest: Hierarchical-Hybrid 3D Gaussian Splatting for Compressed Scene Modeling Fengyi Zhang et.al. 2406.08759 null
2024-06-12 ICE-G: Image Conditional Editing of 3D Gaussian Splats Vishnu Jaganathan et.al. 2406.08488 null
2024-06-12 Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models Yuxuan Xue et.al. 2406.08475 null
2024-06-12 From Chaos to Clarity: 3DGS in the Dark Zhihao Li et.al. 2406.08300 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499 null
2024-06-11 Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field Chao Wang et.al. 2406.07329 null
2024-06-10 GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation Haozhe Xie et.al. 2406.06526 link
2024-06-10 PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction Danpeng Chen et.al. 2406.06521 null
2024-06-10 MVGamba: Unify 3D Content Generation as State Space Sequence Modeling Xuanyu Yi et.al. 2406.06367 link
2024-06-10 Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis Xin Jin et.al. 2406.06216 link
2024-06-09 RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering Rui Zhang et.al. 2406.05852 null
2024-06-09 VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface Reconstruction Hanlin Chen et.al. 2406.05774 null
2024-06-06 Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image Stanislaw Szymanowicz et.al. 2406.04343 link
2024-06-06 A Survey on 3D Human Avatar Modeling – From Reconstruction to Generation Ruihe Wang et.al. 2406.04253 null
2024-06-06 Localized Gaussian Point Management Haosen Yang et.al. 2406.04251 null
2024-06-06 Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction Diwen Wan et.al. 2406.03697 link
2024-06-05 Event3DGS: Event-based 3D Gaussian Splatting for Fast Egomotion Tianyi Xiong et.al. 2406.02972 null
2024-06-05 Adversarial Generation of Hierarchical Gaussians for 3D Generative Model Sangeek Hyun et.al. 2406.02968 link
2024-06-04 3D-HGS: 3D Half-Gaussian Splatting Haolin Li et.al. 2406.02720 link
2024-06-06 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin et.al. 2406.02541 null
2024-06-04 SatSplatYOLO: 3D Gaussian Splatting-based Virtual Object Detection Ensembles for Satellite Feature Recognition Van Minh Nguyen et.al. 2406.02533 null
2024-06-04 DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering Zhongpai Gao et.al. 2406.02518 null
2024-06-04 WE-GS: An In-the-wild Efficient 3D Gaussian Representation for Unconstrained Photo Collections Yuze Wang et.al. 2406.02407 null
2024-06-04 Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning Jiaxu Wang et.al. 2406.02370 null
2024-06-04 OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding Yanmin Wu et.al. 2406.02058 null
2024-06-04 FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping Yuzhou Ji et.al. 2406.01916 null
2024-06-03 Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting Shaojie Ma et.al. 2406.01593 null
2024-06-03 Tetrahedron Splatting for 3D Generation Chun Gu et.al. 2406.01579 link
2024-06-03 DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors Tianyu Huang et.al. 2406.01476 link
2024-05-31 ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model Yufei Wang et.al. 2405.20721 link
2024-05-31 R$^2$-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction Ruyi Zha et.al. 2405.20693 link
2024-05-30 $\textit{S}^3$Gaussian: Self-Supervised Street Gaussians for Autonomous Driving Nan Huang et.al. 2405.20323 link
2024-06-03 A Pixel Is Worth More Than One 3D Gaussians in Single-View 3D Reconstruction Jianghao Shen et.al. 2405.20310 null
2024-05-29 EvaGaussians: Event Stream Assisted Gaussian Splatting from Blurry Images Wangbo Yu et.al. 2405.20224 null
2024-05-30 Object-centric Reconstruction and Tracking of Dynamic Unknown Objects using 3D Gaussian Splatting Kuldeep R Barad et.al. 2405.20104 null
2024-06-04 PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting Qiaowei Miao et.al. 2405.19957 link
2024-05-30 GaussianRoom: Improving 3D Gaussian Splatting with SDF Guidance and Monocular Cues for Indoor Scene Reconstruction Haodong Xiang et.al. 2405.19671 null
2024-05-30 Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian Wei Sun et.al. 2405.19657 null
2024-05-30 TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM Peifeng Jiang et.al. 2405.19614 null
2024-05-29 NPGA: Neural Parametric Gaussian Avatars Simon Giebenhain et.al. 2405.19331 null
2024-05-29 LP-3DGS: Learning to Prune 3D Gaussian Splatting Zhaoliang Zhang et.al. 2405.18784 link
2024-05-28 GFlow: Recovering 4D World from Monocular Video Shizun Wang et.al. 2405.18426 null
2024-05-28 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting Qihang Zhang et.al. 2405.18424 null
2024-05-28 3D StreetUnveiler with Semantic-Aware 2DGS Jingwei Xu et.al. 2405.18416 null
2024-05-28 NegGS: Negative Gaussian Splatting Artur Kasymov et.al. 2405.18163 link
2024-05-28 A Grid-Free Fluid Solver based on Gaussian Spatial Representation Jingrui Xing et.al. 2405.18133 null
2024-05-28 EG4D: Explicit Generation of 4D Object without Score Distillation Qi Sun et.al. 2405.18132 link
2024-05-28 RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields Mihnea-Bogdan Jurca et.al. 2405.18033 link
2024-05-28 FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes Yunsong Wang et.al. 2405.17958 link
2024-05-28 A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction Bin Zhang et.al. 2405.17891 null
2024-05-29 HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction Haoyu Zhao et.al. 2405.17872 link
2024-05-27 MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds Jiahui Lei et.al. 2405.17421 link
2024-05-27 DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Refocusing, Defocus Rendering and Blur Removal Yujie Wang et.al. 2405.17351 null
2024-05-27 Memorize What Matters: Emergent Scene Decomposition from Multitraverse Yiming Li et.al. 2405.17187 link
2024-05-27 F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting Xiangyu Sun et.al. 2405.17083 null
2024-05-27 SA-GS: Semantic-Aware Gaussian Splatting for Large Scene Reconstruction with Geometry Constrain Butian Xiong et.al. 2405.16923 null
2024-05-28 PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting Zipeng Wang et.al. 2405.16829 null
2024-05-26 Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models Hanwen Liang et.al. 2405.16645 null
2024-05-26 Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians Erik Sandström et.al. 2405.16544 link
2024-05-24 Feature Splatting for Better Novel View Synthesis with Low Overlap T. Berriel Martins et.al. 2405.15518 link
2024-05-24 GSDeformer: Direct Cage-based Deformation for 3D Gaussian Splatting Jiajun Huang et.al. 2405.15491 null
2024-05-24 DisC-GS: Discontinuity-aware Gaussian Splatting Haoxuan Qu et.al. 2405.15196 null
2024-05-24 HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting Yuanhao Cai et.al. 2405.15125 link
2024-05-24 GS-Hider: Hiding Messages into 3D Gaussian Splatting Xuanyu Zhang et.al. 2405.15118 null
2024-05-23 EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting Jiaxu Wang et.al. 2405.14959 link
2024-05-23 Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras Hanzhang Tu et.al. 2405.14866 null
2024-05-23 MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes Ruiyuan Gao et.al. 2405.14475 null
2024-05-23 TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing Teng Xu et.al. 2405.14455 null
2024-05-24 RoGS: Large Scale Road Surface Reconstruction based on 2D Gaussian Splatting Zhiheng Feng et.al. 2405.14342 link
2024-05-23 D-MiSo: Editing Dynamic 3D Scenes using Multi-Gaussians Soup Joanna Waczyńska et.al. 2405.14276 link
2024-05-22 DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus Yu Chen et.al. 2405.13943 link
2024-05-22 Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances Licheng Shen et.al. 2405.13694 null
2024-05-21 MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video Hongsheng Wang et.al. 2405.12806 null
2024-05-21 LAGA: Layered 3D Avatar Generation and Customization via Gaussian Splatting Jia Gong et.al. 2405.12663 null
2024-05-21 Gaussian Control with Hierarchical Semantic Graphs in 3D Human Recovery Hongsheng Wang et.al. 2405.12477 null
2024-05-20 GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details Boqian Li et.al. 2405.12420 link
2024-05-20 AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field Rong Liu et.al. 2405.12369 link
2024-05-20 Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo Tianqi Liu et.al. 2405.12218 link
2024-05-20 Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents Guanlin Wu et.al. 2405.12155 null
2024-05-20 CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization Jiawei Zhang et.al. 2405.12110 link
2024-05-21 Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping Tianhao Wu et.al. 2405.12069 null
2024-05-20 MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections Jiayue Liu et.al. 2405.11921 null
2024-05-18 Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching Xingyu Miao et.al. 2405.11252 link
2024-05-18 MotionGS: Compact Gaussian Splatting SLAM by Motion Filter Xinli Guo et.al. 2405.11129 link
2024-05-17 Photorealistic 3D Urban Scene Reconstruction and Point Cloud Extraction using Google Earth Imagery and Gaussian Splatting Kyle Gao et.al. 2405.11021 null
2024-05-17 ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation Pengzhi Li et.al. 2405.10508 null
2024-05-16 GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction Rui Jin et.al. 2405.10142 null
2024-05-15 From NeRFs to Gaussian Splats, and Back Siming He et.al. 2405.09717 link
2024-05-13 GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting Haodong Chen et.al. 2405.07472 null
2024-05-11 Direct Learning of Mesh and Appearance via 3D Gaussian Splatting Ancheng Lin et.al. 2405.06945 null
2024-05-10 OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation Jinwei Lin et.al. 2405.06547 link
2024-05-10 I3DGS: Improve 3D Gaussian Splatting from Multiple Dimensions Jinwei Lin et.al. 2405.06408 null
2024-05-10 MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization Pengcheng Zhu et.al. 2405.06241 null
2024-05-09 DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation Sitian Shen et.al. 2405.05800 null
2024-05-09 FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting Yikun Ma et.al. 2405.05768 null
2024-05-09 NGM-SLAM: Gaussian Splatting SLAM with Radiance Field Submap Mingrui Li et.al. 2405.05702 null

Stereo Matching

Publish Date Title Authors PDF Code
2025-07-15 Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation Zhen Xu et.al. 2507.11540 null
2025-07-15 Uniting the World by Dividing it: Federated Maps to Enable Spatial Applications Sagar Bharadwaj et.al. 2507.11437 null
2025-07-15 Caveats about measuring carbon abundances in stars using the CH band Pablo Santos-Peral et.al. 2507.11351 null
2025-07-15 MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network Jianfei Jiang et.al. 2507.11333 null
2025-07-15 Fairness-Aware Grouping for Continuous Sensitive Variables: Application for Debiasing Face Analysis with respect to Skin Tone Veronika Shilova et.al. 2507.11247 null
2025-07-15 Generative Click-through Rate Prediction with Applications to Search Advertising Lingwei Kong et.al. 2507.11246 null
2025-07-15 MMOne: Representing Multiple Modalities in One Scene Zhifeng Gu et.al. 2507.11129 null
2025-07-15 Urban delineation through the lens of commute networks: Leveraging graph embeddings to distinguish socioeconomic groups in cities Devashish Khulbe et.al. 2507.11057 null
2025-07-15 Uncertainty Aware Mapping for Vision-Based Underwater Robots Abhimanyu Bhowmik et.al. 2507.10991 null
2025-07-15 Terms and Conditions (Do Not) Apply: Understanding Exploitation Disparities in Design of Mobile-Based Financial Services Lindah Kotut et.al. 2507.10970 null
2025-07-14 Cameras as Relative Positional Encoding Ruilong Li et.al. 2507.10496 null
2025-07-14 Rows and Capabilities as Modal Effects Wenhao Tang et.al. 2507.10301 null
2025-07-14 Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures Xinlong Ding et.al. 2507.10265 null
2025-07-14 Is Micro-expression Ethnic Leaning? Huai-Qian Khor et.al. 2507.10209 null
2025-07-14 Minimizing the Pretraining Gap: Domain-aligned Text-Based Person Retrieval Shuyu Yang et.al. 2507.10195 null
2025-07-14 Simulating Biases for Interpretable Fairness in Offline and Online Classifiers Ricardo Inácio et.al. 2507.10154 null
2025-07-14 Efficient RF Chain Selection for MIMO Integrated Sensing and Communications: A Greedy Approach Subin Shin et.al. 2507.09960 null
2025-07-13 EventHunter: Dynamic Clustering and Ranking of Security Events from Hacker Forum Discussions Yasir Ech-Chammakhy et.al. 2507.09762 null
2025-07-13 Pre-trained Under Noise: A Framework for Robust Bone Fracture Detection in Medical Imaging Robby Hoover et.al. 2507.09731 null
2025-07-13 Electric Vehicle Public Charging Equity Considerations: A Systematic Review Boyou Chen et.al. 2507.09726 null
2025-07-11 Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT Wei Zhang et.al. 2507.08448 null
2025-07-11 PanMatch: Unleashing the Potential of Large Vision Models for Unified Matching Models Yongjian Zhang et.al. 2507.08400 null
2025-07-10 Highly accurate simulations of asymmetric black-hole scattering and cross validation of effective-one-body models Oliver Long et.al. 2507.08071 null
2025-07-10 Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions Longfei Li et.al. 2507.07978 null
2025-07-10 On-Manifold Low-Thrust Maneuvering of Quasi-Periodic Orbits Ian M. Down et.al. 2507.07940 null
2025-07-10 TRIX- Trading Adversarial Fairness via Mixed Adversarial Training Tejaswini Medi et.al. 2507.07768 null
2025-07-10 Prime Power Residues and Blocking Sets Bhawesh Mishra et.al. 2507.07673 null
2025-07-10 Bridging the gap in FER: addressing age bias in deep learning F. Xavier Gaya-Morey et.al. 2507.07638 null
2025-07-10 Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects Yuqi Cheng et.al. 2507.07435 null
2025-07-09 Dirty Data in the Newsroom: Comparing Data Preparation in Journalism and Data Science Stephen Kasica et.al. 2507.07238 null
2025-07-09 Combining Pre-Trained Models for Enhanced Feature Representation in Reinforcement Learning Elia Piccoli et.al. 2507.07197 null
2025-07-09 Correlations between Dust Extinction Features across All Wavelength Scales: From Diffuse Interstellar Bands to R(V) Andrew K. Saydjari et.al. 2507.07162 null
2025-07-09 Hierarchical Feature Alignment for Gloss-Free Sign Language Translation Sobhan Asasi et.al. 2507.06732 null
2025-07-09 Photometric Stereo using Gaussian Splatting and inverse rendering Matéo Ducastel et.al. 2507.06684 null
2025-07-09 Transferable Parasitic Estimation via Graph Contrastive Learning and Label Rebalancing in AMS Circuits Shan Shen et.al. 2507.06535 null
2025-07-08 Bridging Sequential Deep Operator Network and Video Diffusion: Residual Refinement of Spatio-Temporal PDE Solutions Jaewan Park et.al. 2507.06133 null
2025-07-08 Discontinuity-aware Normal Integration for Generic Central Camera Models Francesco Milano et.al. 2507.06075 null
2025-07-08 Bridging Perception and Language: A Systematic Benchmark for LVLMs’ Understanding of Amodal Completion Reports Amane Watahiki et.al. 2507.05799 null
2025-07-08 Fairness-Aware Static and Dynamic Assortment Optimization: Optimal Selection with Balanced Market Share Omar El Housni et.al. 2507.05606 null
2025-07-08 SingLoRA: Low Rank Adaptation Using a Single Matrix David Bensaïd et.al. 2507.05566 null
2025-07-07 Incorporating Interventional Independence Improves Robustness against Interventional Distribution Shift Gautam Sreekumar et.al. 2507.05412 null
2025-07-07 Feature Geometry for Stereo Sidescan and Forward-looking Sonar Kalin Norman et.al. 2507.05410 null
2025-07-07 Parametric Object Coding in IVAS: Efficient Coding of Multiple Audio Objects at Low Bit Rates Andrea Eichenseer et.al. 2507.05409 null
2025-07-07 Stereo Reproduction in the Presence of Sample Rate Offsets Srikanth Korse et.al. 2507.05402 null
2025-07-07 Untangling Selberg from the Wilson spool: 1-loop determinants and trace formulae in (A)dS$_{3}$ Samuel Haupfear et.al. 2507.05358 null
2025-07-07 Causal Impacts of Protected Bike Lanes on Cycling Behavior with Demographic Disparities Marcel Moran et.al. 2507.04936 null
2025-07-07 Spatial and Semantic Embedding Integration for Stereo Sound Event Localization and Detection in Regular Videos Davide Berghi et.al. 2507.04845 null
2025-07-07 Toward Valid Measurement Of (Un)fairness For Generative AI: A Proposal For Systematization Through The Lens Of Fair Equality of Chances Kimberly Le Truong et.al. 2507.04641 null
2025-07-07 Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts Yun Wang et.al. 2507.04631 null
2025-07-07 DisMS-TS: Eliminating Redundant Multi-Scale Features for Time Series Classification Zhipeng Liu et.al. 2507.04600 null
2025-07-06 Thousand-Brains Systems: Sensorimotor Intelligence for Rapid, Robust Learning and Inference Niels Leadholm et.al. 2507.04494 null
2025-07-05 Nested economies of scale in city mass Kangning Huang et.al. 2507.03960 null
2025-07-04 Assessing the Viability of Wave Field Synthesis in VR-Based Cognitive Research Benjamin Kahl et.al. 2507.03797 null
2025-07-04 Improving Social Determinants of Health Documentation in French EHRs Using Large Language Models Adrien Bazoge et.al. 2507.03433 null
2025-07-04 CME activities on spotless days during descending phase of solar cycles 23 and 24 Dipali Burud et.al. 2507.03399 null
2025-07-02 The Illusion of Fairness: Auditing Fairness Interventions with Audit Studies Disa Sariola et.al. 2507.02152 null
2025-07-02 The Thin Line Between Comprehension and Persuasion in LLMs Adrian de Wynter et.al. 2507.01936 null
2025-07-02 How Do Vision-Language Models Process Conflicting Information Across Modalities? Tianze Hua et.al. 2507.01790 null
2025-07-02 RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather Yuran Wang et.al. 2507.01653 null
2025-07-02 Adapting Language Models to Indonesian Local Languages: An Empirical Study of Language Transferability on Zero-Shot Settings Rifki Afina Putri et.al. 2507.01645 null
2025-07-02 Two Cases of Non-Radial Filament Eruption and Associated CME Deflection Kostadinka Koleva et.al. 2507.01580 null
2025-07-02 Penalizing Transparency? How AI Disclosure and Author Demographics Shape Human and AI Judgments About Writing Inyoung Cheong et.al. 2507.01418 null
2025-07-01 Improving Stereo 3D Sound Event Localization and Detection: Perceptual Features, Stereo-specific Data Augmentation, and Distance Normalization Jun-Wei Yeow et.al. 2507.00874 null
2025-07-01 Impact of temperature asymmetry and small fraction of static positive ions on the relaxed states of a relativistic hot pair plasma Usman Shazad et.al. 2507.00760 null
2025-07-01 Renormalization group based implicit function approach to connecting orbits Pengfei Guo et.al. 2507.00749 null
2025-07-01 Self-organization of earth’s inner magnetospheric multi-ion plasma Usman Shazad et.al. 2507.00734 null
2025-06-30 Development of Hybrid Artificial Intelligence Training on Real and Synthetic Data: Benchmark on Two Mixed Training Strategies Paul Wachter et.al. 2506.24093 null
2025-06-30 Simultaneous Super-Resolution of Spatial and Spectral Imaging with a Camera Array and Notch Filters Peng Lin et.al. 2506.24014 null
2025-06-30 Statistical Modeling for Accurate Characterization of Doppler Effect in LEO-Terrestrial Networks Islam M. Tanash et.al. 2506.23817 null
2025-06-30 AdFair-CLIP: Adversarial Fair Contrastive Language-Image Pre-training for Chest X-rays Chenlang Yi et.al. 2506.23467 null
2025-06-29 Zero-disparity Distribution Synthesis: Fast Exact Calculation of Chi-Squared Statistic Distribution for Discrete Uniform Histograms Nikola Banić et.al. 2506.23416 null
2025-06-29 Datasets for Fairness in Language Models: An In-Depth Survey Jiale Zhang et.al. 2506.23411 null
2025-06-29 Modeling European Electricity Market Integration during turbulent times Francesco Ravazzolo et.al. 2506.23289 null
2025-06-29 Event-based Stereo Visual-Inertial Odometry with Voxel Map Zhaoxing Zhang et.al. 2506.23078 null
2025-06-28 Feature-Wise Mixing for Mitigating Contextual Bias in Predictive Supervised Learning Yash Vardhan Tomar et.al. 2506.23033 null
2025-06-28 SPICE-HL3: Single-Photon, Inertial, and Stereo Camera dataset for Exploration of High-Latitude Lunar Landscapes David Rodríguez-Martínez et.al. 2506.22956 null
2025-06-27 Towards Fair Rankings: Leveraging LLMs for Gender Bias Detection and Measurement Maryam Mousavian et.al. 2506.22372 null
2025-06-27 NoticeLight: Embracing Socio-Technical Asymmetry through Tangible Peripheral Robotic Embodiment in Hybrid Collaboration Marie Altmann et.al. 2506.22125 null
2025-06-27 Quantifying Institutional Gender Inequality in Contemporary Visual Art Xindi Wang et.al. 2506.22103 null
2025-06-27 Seismic resolution enhancement via deep Learning with Knowledge Distillation and Domain Adaptation Hanpeng Cai et.al. 2506.22018 null
2025-06-27 SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images Naftaly Wambugu et.al. 2506.21945 null
2025-06-26 Counterfactual Voting Adjustment for Quality Assessment and Fairer Voting in Online Platforms with Helpfulness Evaluation Chang Liu et.al. 2506.21362 null
2025-06-26 ToosiCubix: Monocular 3D Cuboid Labeling via Vehicle Part Annotations Behrooz Nasihatkon et.al. 2506.21358 null
2025-06-26 ESMStereo: Enhanced ShuffleMixer Disparity Upsampling for Real-Time and Accurate Stereo Matching Mahmoud Tahmasebi et.al. 2506.21091 null
2025-06-26 The Role of Cyclopean-Eye in Stereo Vision Sherlon Almeida da Silva et.al. 2506.20900 null
2025-06-25 THIRDEYE: Cue-Aware Monocular Depth Estimation via Brain-Inspired Multi-Stage Fusion Calin Teodor Ioan et.al. 2506.20877 null
2025-06-25 StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation Haodong Li et.al. 2506.20756 null
2025-06-25 Don’t Hash Me Like That: Exposing and Mitigating Hash-Induced Unfairness in Local Differential Privacy Berkay Kemal Balioglu et.al. 2506.20290 null
2025-06-25 Effects of flame macrostructures on the combustion dynamics of novel counter-rotating radial swirl injector in a model can combustor SK Thirumalaikumaran et.al. 2506.20138 null
2025-06-24 Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation Jun Wang et.al. 2506.19774 null
2025-06-24 Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders Matyas Bohacek et.al. 2506.19708 null
2025-06-24 Recurrent Visual Feature Extraction and Stereo Attentions for CT Report Generation Yuanhe Tian et.al. 2506.19665 null
2025-06-24 AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models Zeyu Li et.al. 2506.19505 null
2025-06-24 MuBench: Assessment of Multilingual Capabilities of Large Language Models Across 61 Languages Wenhan Han et.al. 2506.19468 null
2025-06-24 Online camera-pose-free stereo endoscopic tissue deformation recovery with tissue-invariant vision-biomechanics consistency Jiahe Chen et.al. 2506.19388 null
2025-06-23 MOSCARD – Causal Reasoning and De-confounding for Multimodal Opportunistic Screening of Cardiovascular Adverse Events Jialu Pi et.al. 2506.19174 null
2025-06-23 Identifying Causally-Robust Mediators of Health Disparities: A Review and Simulation Studies With Directed Acyclic Graphs Soojin Park et.al. 2506.19047 null
2025-06-23 Simulation-Based Sensitivity Analysis in Optimal Treatment Regimes and Causal Decomposition with Individualized Interventions Soojin Park et.al. 2506.19010 null
2025-06-23 Causal Decomposition Analysis with Synergistic Interventions: A Triply-Robust Machine Learning Approach to Addressing Multiple Dimensions of Social Disparities Soojin Park et.al. 2506.18994 null
2025-06-23 Light of Normals: Unified Feature Representation for Universal Photometric Stereo Hong Li et.al. 2506.18882 null
2025-06-23 Evaluating Multichannel Speech Enhancement Algorithms at the Phoneme Scale Across Genders Nasser-Eddine Monir et.al. 2506.18691 null
2025-06-23 NOVA: Navigation via Object-Centric Visual Autonomy for High-Speed Target Tracking in Unstructured GPS-Denied Environments Alessandro Saviolo et.al. 2506.18689 null
2025-06-23 Bias vs Bias – Dawn of Justice: A Fair Fight in Recommendation Systems Tahsin Alamgir Kheya et.al. 2506.18327 null
2025-06-22 Mental Health Equity in LLMs: Leveraging Multi-Hop Question Answering to Detect Amplified and Silenced Perspectives Batool Haider et.al. 2506.18116 null
2025-06-22 StereoTacTip: Vision-based Tactile Sensing with Biomimetic Skin-Marker Arrangements Chenghua Lu et.al. 2506.18040 null
2025-06-22 Feedback Driven Multi Stereo Vision System for Real-Time Event Analysis Mohamed Benkedadra et.al. 2506.17910 null
2025-06-21 In-Context Learning Strategies Emerge Rationally Daniel Wurgaft et.al. 2506.17859 null
2025-06-21 Learning to Dock: A Simulation-based Study on Closing the Sim2Real Gap in Autonomous Underwater Docking Kevin Chang et.al. 2506.17823 null
2025-06-21 Optimization-Free Patch Attack on Stereo Depth Estimation Hangcheng Liu et.al. 2506.17632 null
2025-06-20 YASMOT: Yet another stereo image multi-object tracker Ketil Malde et.al. 2506.17186 link
2025-06-20 Are Bias Evaluation Methods Biased? Lina Berrayana et.al. 2506.17111 null
2025-06-20 Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping Teng Guo et.al. 2506.17110 null
2025-06-20 Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks Samer Lahoud et.al. 2506.17063 null
2025-06-20 LunarLoc: Segment-Based Global Localization on the Moon Annika Thomas et.al. 2506.16940 link
2025-06-20 DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches Yun Xing et.al. 2506.16690 null
2025-06-19 External Evaluation of Discrimination Mitigation Efforts in Meta’s Ad Delivery Basileal Imana et.al. 2506.16560 null
2025-06-19 PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking Yan Zhou et.al. 2506.16379 null
2025-06-19 Heterotopic energy for Sobolev mappings Antoine Detaille et.al. 2506.16204 null
2025-06-19 Solar Transient Recognition Using Deep Learning (STRUDL) for heliospheric imager data Maike Bauer et.al. 2506.16194 null
2025-06-18 Mono-Modalizing Extremely Heterogeneous Multi-Modal Medical Image Registration Kyobin Choo et.al. 2506.15596 null
2025-06-18 SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models’ Knowledge of Indian Culture Arijit Maji et.al. 2506.15355 null
2025-06-18 Dissecting the gender divide: Authorship and acknowledgment in scientific publications Keigo Kusumegi et.al. 2506.15237 null
2025-06-18 Transit for All: Mapping Equitable Bike2Subway Connection using Region Representation Learning Min Namgung et.al. 2506.15113 null
2025-06-18 3D Vision-tactile Reconstruction from Infrared and Visible Images for Robotic Fine-grained Tactile Perception Yuankai Lin et.al. 2506.15087 null
2025-06-17 Time-Optimized Safe Navigation in Unstructured Environments through Learning Based Depth Completion Jeffrey Mao et.al. 2506.14975 null
2025-06-17 Cost-Aware Routing for Efficient Text-To-Image Generation Qinchan et.al. 2506.14753 null
2025-06-17 DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning Kunal Swami et.al. 2506.14709 null
2025-06-17 One Size Fits None: Rethinking Fairness in Medical AI Roland Roller et.al. 2506.14400 null
2025-06-17 Consensus Power Inequality: A Comparative Study of Blockchain Networks Kamil Tylinski et.al. 2506.14393 null
2025-06-16 Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble Zhiqi Wang et.al. 2506.13972 link
2025-06-16 Bias Delayed is Bias Denied? Assessing the Effect of Reporting Delays on Disparity Assessments Jennah Gosciak et.al. 2506.13735 link
2025-06-16 Multiview Geometric Regularization of Gaussian Splatting for Accurate Radiance Fields Jungeon Kim et.al. 2506.13508 null
2025-06-16 Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling Wenmiao Gao et.al. 2506.13455 null
2025-06-16 Cloud-to-cloud velocity dispersions across a Local arm segment Lixia Yuan et.al. 2506.13424 null
2025-06-16 DVP-MVS++: Synergize Depth-Normal-Edge and Harmonized Visibility Prior for Multi-View Stereo Zhenlong Yuan et.al. 2506.13215 null
2025-06-16 Equitable Electronic Health Record Prediction with FAME: Fairness-Aware Multimodal Embedding Nikkie Hooman et.al. 2506.13104 null
2025-06-14 Recent Advances and Future Directions in Literature-Based Discovery Andrej Kastrin et.al. 2506.12385 null
2025-06-14 Path-specific effects for pulse-oximetry guided decisions in critical care Kevin Zhang et.al. 2506.12371 null
2025-06-16 A Reference Model and Patterns for Production Event Data Enrichment Mark van der Pas et.al. 2506.11502 null
2025-06-16 SemanticST: Spatially Informed Semantic Graph Learning for Clustering, Integration, and Scalable Analysis of Spatial Transcriptomics Roxana Zahedi et.al. 2506.11491 link
2025-06-13 A Watermark for Auto-Regressive Image Generation Models Yihan Wu et.al. 2506.11371 null
2025-06-12 Forbidden configurations for coherency Victoria Gould et.al. 2506.11321 null
2025-06-12 Principled Approaches for Extending Neural Architectures to Function Spaces for Operator Learning Julius Berner et.al. 2506.10973 link
2025-06-12 FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition Jongsuk Kim et.al. 2506.10747 null
2025-06-12 Balancing Tails when Comparing Distributions: Comprehensive Equity Index (CEI) with Application to Bias Evaluation in Operational Face Biometrics Imanol Solano et.al. 2506.10564 null
2025-06-12 EasyDRAM: An FPGA-based Infrastructure for Fast and Accurate End-to-End Evaluation of Emerging DRAM Techniques Oğuzhan Canpolat et.al. 2506.10441 link
2025-06-12 Transcorrelated Theory for Transition Metal Atoms Kristoffer Simula et.al. 2506.10429 null
2025-06-12 PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting Lintao Xiang et.al. 2506.10335 null
2025-06-12 A Novel Feedforward Youla Parameterization Method for Avoiding Local Minima in Stereo Image Based Visual Servoing Control Rongfei Li et.al. 2506.10252 null
2025-06-10 Down But Not Out: The Case of Long-Period Comet C/2021 O3 (Panstarrs) David Jewitt. Jing Li et.al. 2506.09263 null
2025-06-10 Princeton365: A Diverse Dataset with Accurate Camera Pose Karhan Kayan et.al. 2506.09035 null
2025-06-10 Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia Katelyn Xiaoying Mei et.al. 2506.08846 link
2025-06-11 Towards Fair Representation: Clustering and Consensus Diptarka Chakraborty et.al. 2506.08673 null
2025-06-09 Unmasking inequility: socio-economic determinants and gender disparities in Maharashtra and India’s health outcomes – Insights from NFHS-5 Sharmishtha Raghuvanshi et.al. 2506.08206 null
2025-06-09 GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors Wenlong Meng et.al. 2506.08188 null
2025-06-09 Balanced Area Deprivation Index (bADI): Enhancing social determinants of health indices to strengthen their association with healthcare clinical outcomes, utilization and costs Mohammad Amin Morid et.al. 2506.08131 null
2025-06-09 Unraveling Ethereum’s Mempool: The Impact of Fee Fairness, Transaction Prioritization, and Consensus Efficiency S M Mostaq Hossain et.al. 2506.07988 null
2025-06-09 LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement Dimitris Panagopoulos et.al. 2506.07915 null
2025-06-09 Erbium-implanted WS2 flakes with room-temperature photon emission at telecom wavelengths Guadalupe García-Arellano et.al. 2506.07746 null
2025-06-09 Federated In-Context Learning: Iterative Refinement for Improved Answer Quality Ruhan Wang et.al. 2506.07440 null
2025-06-09 The impact of extracurricular education on socioeconomic mobility in Japan: an application of causal machine learning Yang Qiang et.al. 2506.07421 null
2025-06-08 Analyzing Breast Cancer Survival Disparities by Race and Demographic Location: A Survival Analysis Approach Ramisa Farha et.al. 2506.07191 null
2025-06-08 Optimal Transport Driven Asymmetric Image-to-Image Translation for Nuclei Segmentation of Histological Images Suman Mahapatra et.al. 2506.07023 null
2025-06-08 End-to-End Probabilistic Framework for Learning with Hard Constraints Utkarsh Utkarsh et.al. 2506.07003 null
2025-06-07 Spatial Disparities in Fire Shelter Accessibility: Capacity Challenges in the Palisades and Eaton Fires Su Yeon Han et.al. 2506.06803 null
2025-06-06 Enhancing Situational Awareness in Underwater Robotics with Multi-modal Spatial Perception Pushyami Kaveti et.al. 2506.06476 null
2025-06-06 PyGemini: Unified Software Development towards Maritime Autonomy Systems Kjetil Vasstein et.al. 2506.06262 null
2025-06-06 Masked Language Models are Good Heterogeneous Graph Generalizers Jinyu Yang et.al. 2506.06157 link
2025-06-06 SVD: Spatial Video Dataset M. H. Izadimehr et.al. 2506.06037 null
2025-06-06 Restereo: Diffusion stereo video generation and restoration Xingchang Huang et.al. 2506.06023 null
2025-06-06 Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning Fan Yang et.al. 2506.05997 null
2025-06-06 A Culturally-Rich Romanian NLP Dataset from “Who Wants to Be a Millionaire?” Videos Alexandru-Gabriel Ganea et.al. 2506.05991 null
2025-06-06 NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces Pierluigi Zama Ramirez et.al. 2506.05815 null
2025-06-06 Efficient Online RFT with Plug-and-Play LLM Judges: Unlocking State-of-the-Art Performance Rudransh Agnihotri et.al. 2506.05748 null
2025-06-06 Aerial Multi-View Stereo via Adaptive Depth Range Inference and Normal Cues Yimei Liu et.al. 2506.05655 null
2025-06-05 Planets similar in size are often dissimilar in interior E. Mamonova et.al. 2506.05089 link
2025-06-05 Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer Filip Slezak et.al. 2506.04908 null
2025-06-05 Is It JUST Semantics? A Case Study of Discourse Particle Understanding in LLMs William Sheffield et.al. 2506.04534 null
2025-06-04 The Latent Space Hypothesis: Toward Universal Medical Representation Learning Salil Patel et.al. 2506.04515 null
2025-06-04 Edge interventions can mitigate demographic and prestige disparities in the Computer Science coauthorship network Kate Barnes et.al. 2506.04435 link
2025-06-04 MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale Ran Xu et.al. 2506.04405 null
2025-06-06 Enduring Disparities in the Workplace: A Pilot Study in the AI Community Yunusa Simpa Abdulsalam et.al. 2506.04305 null
2025-06-04 Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Tianyu Huang et.al. 2506.04225 null
2025-06-04 Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness Stephen R. Pfohl et.al. 2506.04193 link
2025-06-04 Lions and Muons: Optimization via Stochastic Frank-Wolfe Maria-Eleni Sfyraki et.al. 2506.04192 null
2025-06-04 Multi-view Surface Reconstruction Using Normal and Reflectance Cues Robin Bruneau et.al. 2506.04115 link
2025-06-04 When Fairness Isn’t Statistical: The Limits of Machine Learning in Evaluating Legal Reasoning Claire Barale et.al. 2506.03913 null
2025-06-04 FedFACT: A Provable Framework for Controllable Group-Fairness Calibration in Federated Learning Li Zhang et.al. 2506.03777 null
2025-06-04 Analyzing Pension Fund Mortality with Gaussian Processes in a Sub Population Framework Eduardo F. L. de Melo et.al. 2506.03584 null
2025-06-04 Time-Domain Excitation of Complex Resonances Asaf Farhi et.al. 2506.03485 null
2025-06-03 Targeted Forgetting of Image Subgroups in CLIP Models Zeliang Zhang et.al. 2506.03117 null
2025-06-03 A Multi-Agent Framework for Mitigating Dialect Biases in Privacy Policy Question-Answering Systems Đorđe Klisura et.al. 2506.02998 null
2025-06-03 Towards a Japanese Full-duplex Spoken Dialogue System Atsumoto Ohashi et.al. 2506.02979 null
2025-06-03 TaxAgent: How Large Language Model Designs Fiscal Policy Jizhou Wang et.al. 2506.02838 null
2025-06-03 HORUS: A Mixed Reality Interface for Managing Teams of Mobile Robots Omotoye Shamsudeen Adekoya et.al. 2506.02622 null
2025-06-03 On the Language and Gender Biases in PSTN, VoIP and Neural Audio Codecs Kemal Altwlkany et.al. 2506.02545 null
2025-06-03 Gender Inequality in English Textbooks Around the World: an NLP Approach Tairan Liu et.al. 2506.02425 null
2025-06-03 Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology Wenhao Tang et.al. 2506.02408 link
2025-06-02 ImpRAG: Retrieval-Augmented Generation with Implicit Queries Wenzheng Zhang et.al. 2506.02279 null
2025-06-02 Tunable magnons in a dual-gated 2D antiferromagnet Nele Stetzuhn et.al. 2506.02185 null
2025-05-30 Predicting the Past: Estimating Historical Appraisals with OCR and Machine Learning Mihir Bhaskar et.al. 2505.24676 link
2025-05-30 Thermodynamic Signatures of Gaussian Entanglement Beyond Entropy Beatriz Polo et.al. 2505.24596 null
2025-05-30 50 years of spin glass theory David Sherrington et.al. 2505.24432 null
2025-05-30 A Unified Scale Factor for the Cosmic Evolution -Motivated by Brane World Models- Farzin Safarzadeh-Maleki et.al. 2505.24420 null
2025-05-30 Verifiable Weighted Secret Sharing Kareem Shehata et.al. 2505.24289 null
2025-05-30 Evolution of Gas Velocity Dispersion in Discs from $z\sim8$ to $z\sim0.5$ E. Wisnioski et.al. 2505.24129 null
2025-05-30 CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs Ai Jian et.al. 2505.24120 null
2025-05-29 Estimation of Gender Wage Gap in the University of North Carolina System Zihan Zhang et.al. 2505.24078 null
2025-05-29 Can Emotion Fool Anti-spoofing? Aurosweta Mahapatra et.al. 2505.23962 null
2025-05-29 Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts Xuweiyi Chen et.al. 2505.23926 null
2025-05-29 ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks Akashah Shabbir et.al. 2505.23752 link
2025-05-29 Let’s Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM’s Math Capability Ruida Wang et.al. 2505.23703 null
2025-05-29 Errors in Stereo Geometry Induce Distance Misperception Raffles Xingqi Zhu et.al. 2505.23685 null
2025-05-29 Dual-Task Graph Neural Network for Joint Seizure Onset Zone Localization and Outcome Prediction using Stereo EEG Syeda Abeera Amir et.al. 2505.23669 null
2025-05-29 PAN-Crafter: Learning Modality-Consistent Alignment for PAN-Sharpening Jeonghyeok Do et.al. 2505.23367 null
2025-05-29 Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data Lingkai Kong et.al. 2505.23062 null
2025-05-29 Diverse Prototypical Ensembles Improve Robustness to Subpopulation Shift Minh Nguyen Nhat To et.al. 2505.23027 link
2025-05-28 Talent or Luck? Evaluating Attribution Bias in Large Language Models Chahat Raj et.al. 2505.22910 link
2025-05-28 Permissioned LLMs: Enforcing Access Control in Large Language Models Bargav Jayaraman et.al. 2505.22860 null
2025-05-28 Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese Hanjia Lyu et.al. 2505.22645 link
2025-05-28 Overpartitions and Kaur, Rana, and Eyyunni’s mex sequences Brian Hopkins et.al. 2505.22588 null
2025-05-28 Beyond Leaders and Laggards: A Typology of Renewable Energy Adoption Trajectories with Evidence from Off-Grid Communities Roni Blushtein-Livnon et.al. 2505.22456 null
2025-05-28 MObyGaze: a film dataset of multimodal objectification densely annotated by experts Julie Tores et.al. 2505.22084 null
2025-05-28 D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples Zijing Hu et.al. 2505.22002 null
2025-05-27 From prosthetic memory to prosthetic denial: Auditing whether large language models are prone to mass atrocity denialism Roberto Ulloa et.al. 2505.21753 null
2025-05-27 MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs Raoyuan Zhao et.al. 2505.21693 link
2025-05-27 Data and Technology for Equitable Public Administration: Understanding City Government Employees’ Challenges and Needs Angie Zhang et.al. 2505.21682 null
2025-05-27 ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models Dingming Li et.al. 2505.21500 null
2025-05-27 Subgroups Matter for Robust Bias Mitigation Anissa Alloula et.al. 2505.21363 link
2025-05-27 The Multilingual Divide and Its Impact on Global AI Safety Aidan Peppin et.al. 2505.21344 null
2025-05-27 Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models Zhaoqing Li et.al. 2505.21237 null
2025-05-27 Interpreting Social Bias in LVLMs via Information Flow Analysis and Multi-Round Dialogue Evaluation Zhengyang Ji et.al. 2505.21106 null
2025-05-27 On VLMs for Diverse Tasks in Multimodal Meme Classification Deepesh Gavit et.al. 2505.20937 null
2025-05-28 Stereo Radargrammetry Using Deep Learning from Airborne SAR Images Tatsuya Sasayama et.al. 2505.20876 null
2025-05-27 Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties Jiyoung Lee et.al. 2505.20875 null
2025-05-27 Aggregation Buffer: Revisiting DropEdge with a New Parameter Block Dooho Lee et.al. 2505.20840 null
2025-05-27 TrustSkin: A Fairness Pipeline for Trustworthy Facial Affect Analysis Across Skin Tone Ana M. Cabanas et.al. 2505.20637 null
2025-05-26 Spurious Privacy Leakage in Neural Networks Chenxiang Zhang et.al. 2505.20095 null
2025-05-26 Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud Natsuki Takama et.al. 2505.19854 null
2025-05-26 Deep learning based spatial aliasing reduction in beamforming for audio capture Mateusz Guzik et.al. 2505.19781 null
2025-05-26 SACM: SEEG-Audio Contrastive Matching for Chinese Speech Decoding Hongbin Wang et.al. 2505.19652 link
2025-05-26 Evaluating Robustness of Large Audio Language Models to Audio Injection: An Empirical Study Guanyu Hou et.al. 2505.19598 null
2025-05-26 VTBench: Comprehensive Benchmark Suite Towards Real-World Virtual Try-on Models Hu Xiaobin et.al. 2505.19571 link
2025-05-26 AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and Healthcare Ying Xiao et.al. 2505.19562 link
2025-05-26 SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams Zhuoheng Gao et.al. 2505.19487 null
2025-05-25 Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding Shiyue Wang et.al. 2505.19219 null
2025-05-25 MMATH: A Multilingual Benchmark for Mathematical Reasoning Wenyang Luo et.al. 2505.19126 link
2025-05-23 Frankentext: Stitching random text fragments into long-form narratives Chau Minh Pham et.al. 2505.18128 link
2025-05-23 A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency Xiaobao Wei et.al. 2505.18024 null
2025-05-23 Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras Masataka Kobayashi et.al. 2505.17582 null
2025-05-23 H2: Towards Efficient Large-Scale LLM Training on Hyper-Heterogeneous Cluster over 1,000 Chips Ding Tang et.al. 2505.17548 null
2025-05-23 Learning Representational Disparities Pavan Ravishankar et.al. 2505.17533 null
2025-05-23 Transparency and Proportionality in Post-Processing Algorithmic Bias Correction Juliett Suárez Ferreira et.al. 2505.17525 null
2025-05-23 FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow Haoyu Sun et.al. 2505.17399 link
2025-05-23 Pulse duration dependence of material response in ultrafast laser-induced surface-penetrating nanovoids in fused silica Guodong Zhang et.al. 2505.17385 null
2025-05-22 Mitigate One, Skew Another? Tackling Intersectional Biases in Text-to-Image Models Pushkar Shukla et.al. 2505.17280 null
2025-05-22 A Framework for Multi-View Multiple Object Tracking using Single-View Multi-Object Trackers on Fish Data Chaim Chai Elchik et.al. 2505.17201 null
2025-05-22 NY Real Estate Racial Equity Analysis via Applied Machine Learning Sanjana Chalavadi et.al. 2505.16946 null
2025-05-22 Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining Shangquan Sun et.al. 2505.16811 null
2025-05-22 Optimising the decision threshold in a weighted voting system: The case of the IMF’s Board of Governors Dóra Gréta Petróczy et.al. 2505.16654 null
2025-05-22 M2SVid: End-to-End Inpainting and Refinement for Monocular-to-Stereo Video Conversion Nina Shvetsova et.al. 2505.16565 null
2025-05-22 Utilizing citation index and synthetic quality measure to compare Wikipedia languages across various topics Włodzimierz Lewoniewski et.al. 2505.16506 null
2025-05-22 KoBALT: Korean Benchmark For Advanced Linguistic Tasks Hyopil Shin et.al. 2505.16125 null
2025-05-22 Continually Self-Improving Language Models for Bariatric Surgery Question–Answering Yash Kumar Atri et.al. 2505.16102 null
2025-05-21 In Silico Trials for Sex-Specific patient Inclusion Criteria in Cardiac Resynchronization Therapy: Advancing Precision in Heart Failure Treatment Shuang Qian et.al. 2505.15708 null
2025-05-21 Kernel PCA for Out-of-Distribution Detection: Non-Linear Kernel Selections and Approximations Kun Fang et.al. 2505.15284 link
2025-05-20 DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis Prashanth Vijayaraghavan et.al. 2505.14971 null
2025-05-20 The Great Comets of 1843 and 1882 at Their Previous Return to Perihelion in the Twelfth Century: One Spectacular, the Other Dull Zdenek Sekanina et.al. 2505.14662 null
2025-05-20 Early Diagnosis of Atrial Fibrillation Recurrence: A Large Tabular Model Approach with Structured and Unstructured Clinical Data Ane G. Domingo-Aldama et.al. 2505.14643 null
2025-05-21 Mitigating Subgroup Disparities in Multi-Label Speech Emotion Recognition: A Pseudo-Labeling and Unsupervised Learning Approach Yi-Cheng Lin et.al. 2505.14449 null
2025-05-20 MindVote: How LLMs Predict Human Decision-Making in Social Media Polls Xutao Mao et.al. 2505.14422 null
2025-05-20 Diving into the Fusion of Monocular Priors for Generalized Stereo Matching Chengtang Yao et.al. 2505.14414 link
2025-05-20 Accuracy and Fairness of Facial Recognition Technology in Low-Quality Police Images: An Experiment With Synthetic Faces Maria Cuellar et.al. 2505.14320 null
2025-05-20 Breaking Language Barriers or Reinforcing Bias? A Study of Gender and Racial Disparities in Multilingual Contrastive Vision Language Models Zahraa Al Sahili et.al. 2505.14160 null
2025-05-20 M3Depth: Wavelet-Enhanced Depth Estimation on Mars via Mutual Boosting of Dual-Modal Data Junjie Li et.al. 2505.14159 null
2025-05-20 Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts Xi Chen et.al. 2505.14088 null
2025-05-20 AppleGrowthVision: A large-scale stereo dataset for phenological analysis, fruit detection, and 3D reconstruction in apple orchards Laura-Sophia von Hirschhausen et.al. 2505.14029 null
2025-05-19 The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation David Stap et.al. 2505.13090 null
2025-05-19 Unifying concepts in information-theoretic time-series analysis Annie G. Bryant et.al. 2505.13080 null
2025-05-20 3D Visual Illusion Depth Estimation Chengtang Yao et.al. 2505.13061 link
2025-05-19 Multi-Level Aware Preference Learning: Enhancing RLHF for Complex Multi-Instruction Tasks Ruopei Sun et.al. 2505.12845 null
2025-05-19 On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding Haoyuan Wu et.al. 2505.12723 null
2025-05-19 IA-MVS: Instance-Focused Adaptive Depth Sampling for Multi-View Stereo Yinzhe Wang et.al. 2505.12714 null
2025-05-19 Rethinking Predictive Modeling for LLM Routing: When Simple kNN Beats Complex Learned Routers Yang Li et.al. 2505.12601 null
2025-05-18 On long-duration storage, weather uncertainty and limited foresight Felix Schmidt et.al. 2505.12538 link
2025-05-18 Depth Transfer: Learning to See Like a Simulator for Real-World Drone Navigation Hang Yu et.al. 2505.12428 null
2025-05-18 Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents Shuo Han et.al. 2505.12204 null
2025-05-16 SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision Utsav Rai et.al. 2505.11439 null
2025-05-16 MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection Shrutarv Awasthi et.al. 2505.11282 link
2025-05-16 Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization Yanhao Jia et.al. 2505.11217 null
2025-05-16 A Cautionary Tale on Integrating Studies with Disparate Outcome Measures for Causal Inference Harsh Parikh et.al. 2505.11014 null
2025-05-16 Patient-Specific Dynamic Digital-Physical Twin for Coronary Intervention Training: An Integrated Mixed Reality Approach Shuo Wang et.al. 2505.10902 null
2025-05-16 From Embeddings to Accuracy: Comparing Foundation Models for Radiographic Classification Xue Li et.al. 2505.10823 null
2025-05-15 TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation Manthan Patel et.al. 2505.10696 null
2025-05-15 Artificial Intelligence Bias on English Language Learners in Automatic Scoring Shuchen Guo et.al. 2505.10643 null
2025-05-15 Multi-contrast laser endoscopy for in vivo gastrointestinal imaging Taylor L. Bobrow et.al. 2505.10492 null
2025-05-15 ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention Jintian Shao et.al. 2505.10222 null
2025-05-15 VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality Xuechang Tu et.al. 2505.10144 link
2025-05-15 Large-Scale Gaussian Splatting SLAM Zhe Xin et.al. 2505.09915 null
2025-05-14 ZENN: A Thermodynamics-Inspired Computational Framework for Heterogeneous Data-Driven Modeling Shun Wang et.al. 2505.09851 null
2025-05-14 Should I Stay or Should I Go Now? An Investigation into Gender Differences in the Impact of Switching Jobs on Earnings Emily Winskill et.al. 2505.09791 null
2025-05-14 Enabling Group Fairness in Graph Unlearning via Bi-level Debiasing Yezi Liu et.al. 2505.09702 null
2025-05-14 Fairness-aware Bayes optimal functional classification Xiaoyu Hu et.al. 2505.09471 null
2025-05-14 RobustSpring: Benchmarking Robustness to Image Corruptions for Optical Flow, Scene Flow and Stereo Jenny Schmalfuss et.al. 2505.09368 null
2025-05-14 Toward Fair Federated Learning under Demographic Disparities and Data Imbalance Qiming Wu et.al. 2505.09295 link
2025-05-14 Signatures of asymmetry: Gravitational wave memory and the parity violation Indranil Chakraborty et.al. 2505.09096 null
2025-05-13 Ages and metallicities of quiescent galaxies: confronting broadband ($UVJ$) colours with stellar absorption lines Chloe M. Cheng et.al. 2505.08858 null
2025-05-13 Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World Yuran Wang et.al. 2505.08607 null
2025-05-13 BizChat: Scaffolding AI-Powered Business Planning for Small Business Owners Across Digital Skill Levels Quentin Romero Lauro et.al. 2505.08493 null
2025-05-13 A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering Chuanzhi Xu et.al. 2505.08438 null
2025-05-13 Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion Anle Ke et.al. 2505.08281 link
2025-05-13 Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images Ziteng Liu et.al. 2505.08178 null
2025-05-14 Fast Text-to-Audio Generation with Adversarial Post-Training Zachary Novack et.al. 2505.08175 link
2025-05-13 MoKD: Multi-Task Optimization for Knowledge Distillation Zeeshan Hayder et.al. 2505.08170 null
2025-05-12 Unequal Journeys to Food Markets: Continental-Scale Evidence from Open Data in Africa Robert Benassai-Dalmau et.al. 2505.07913 link
2025-05-12 Disparity in sound speeds: implications for unitarity and effective potential in quantum field theory Dmitry S. Ageev et.al. 2505.07794 null
2025-05-12 Higher-Order Convolution Improves Neural Predictivity in the Retina Simone Azeglio et.al. 2505.07620 null
2025-05-11 Empirical Analysis of Asynchronous Federated Learning on Heterogeneous Devices: Efficiency, Fairness, and Privacy Trade-offs Samaneh Mohammadi et.al. 2505.07041 null
2025-05-11 Enhancing Monocular Height Estimation via Sparse LiDAR-Guided Correction Jian Song et.al. 2505.06905 null
2025-05-11 ContribChain: A Stress-Balanced Blockchain Sharding Protocol with Node Contribution Awareness Xinpeng Huang et.al. 2505.06899 null
2025-05-11 Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies Zhengmi Tang et.al. 2505.06855 null
2025-05-11 Feedback-enhanced distant entanglement of magnon and phonon modes with atomic ensembles in coupled cavities Muhammad Awais Altaf et.al. 2505.06838 null
2025-05-10 Behind the Byline: A Large-Scale Study of Scientific Author Contributions Itai Assraf et.al. 2505.06721 null
2025-05-09 Adaptive Wiping: Adaptive contact-rich manipulation through few-shot imitation learning with Force-Torque feedback and pre-trained object representations Chikaha Tsuji et.al. 2505.06451 null
2025-05-09 2D Quon Language: Unifying Framework for Cliffords, Matchgates, and Beyond Byungmin Kang et.al. 2505.06336 null
2025-05-09 Who’s at Risk? Effects of Inflation on Unemployment Risk Hie Joo Ahn et.al. 2505.05757 null
2025-05-08 Trends and Gender Disparities in Grades and Grade Penalties Among Bioscience and Health-Related Major Students Before, During, and After COVID-19 Remote Instruction Alysa Malespina et.al. 2505.05667 null
2025-05-07 StereoINR: Cross-View Geometry Consistent Stereo Super Resolution with Implicit Neural Representation Yi Liu et.al. 2505.05509 null
2025-05-08 Facets of Disparate Impact: Evaluating Legally Consistent Bias in Machine Learning Jarren Briscoe et.al. 2505.05471 link
2025-05-08 Synthesis of innovation and obsolescence Edward D. Lee et.al. 2505.05182 null
2025-05-08 DispBench: Benchmarking Disparity Estimation to Synthetic Corruptions Shashank Agnihotri et.al. 2505.05091 link
2025-05-08 Learning Item Representations Directly from Multimodal Features for Effective Recommendation Xin Zhou et.al. 2505.04960 link
2025-05-08 Enhancing Blockchain Cross Chain Interoperability: A Comprehensive Survey Zhihong Deng et.al. 2505.04934 null
2025-05-08 Advanced 3D Imaging Approach to TSV/TGV Metrology and Inspection Using Only Optical Microscopy Gugeong Sung et.al. 2505.04913 null
2025-05-06 Algorithmic Accountability in Small Data: Sample-Size-Induced Bias Within Classification Metrics Jarren Briscoe et.al. 2505.03992 link
2025-05-06 Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach Srecharan Selvam et.al. 2505.03702 null
2025-05-06 Blending 3D Geometry and Machine Learning for Multi-View Stereopsis Vibhas Vats et.al. 2505.03470 link
2025-05-06 Domain Adversarial Training for Mitigating Gender Bias in Speech-based Mental Health Detection June-Woo Kim et.al. 2505.03359 null
2025-05-06 The Impact of Large Language Models on K-12 Education in Rural India: A Thematic Analysis of Student Volunteer’s Perspectives Harshita Goyal et.al. 2505.03163 null
2025-05-06 Towards Application-Specific Evaluation of Vision Models: Case Studies in Ecology and Biology Alex Hoi Hang Chan et.al. 2505.02825 null
2025-05-05 Exceptional, but Separate: Precursors to Spontaneous Symmetry Breaking Lewis Hill et.al. 2505.02691 null
2025-05-05 VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection Hao Cheng et.al. 2505.02331 link
2025-05-04 SparSplat: Fast Multi-View Reconstruction with Generalizable 2D Gaussian Splatting Shubhendu Jena et.al. 2505.02175 null
2025-05-04 Representation Learning of Limit Order Book: A Comprehensive Study and Benchmarking Muyao Zhong et.al. 2505.02139 null
2025-05-04 Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents Christian Schroeder de Witt et.al. 2505.02077 null
2025-05-03 Mitigating Group-Level Fairness Disparities in Federated Visual Language Models Chaomeng Chen et.al. 2505.01851 null
2025-05-03 AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting Junhao Shi et.al. 2505.01799 null
2025-05-03 T-REX: Vision-Based System for Autonomous Leaf Detection and Grasp Estimation Srecharan Selvam et.al. 2505.01654 null
2025-05-02 Toward a Unified Theory of Catalysis Frank Nelson Crespilho et.al. 2505.01213 null
2025-05-02 Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods Mahdi Dhaini et.al. 2505.01198 link
2025-05-02 Enhancing MHD model accuracy and CME forecasting by constraining coronal plasma properties with Faraday rotation Salvatore Mancuso et.al. 2505.01080 null
2025-05-02 Destructive Interference: Encoding Loss in the Overlap Nik Aberle et.al. 2505.00987 null
2025-05-01 Quantum Modular Forms and Resurgence Eleanor McSpirit et.al. 2505.00799 null
2025-05-01 HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection Deanna Emery et.al. 2505.00506 null
2025-04-30 Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis Michal Geyer et.al. 2505.00135 null
2025-04-30 Stereo X-ray tomography on deformed object tracking Zhenduo Shang et.al. 2505.00122 null
2025-04-30 An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation Yaming Ou et.al. 2504.21826 null
2025-04-30 Assessing Racial Disparities in Healthcare Expenditures Using Causal Path-Specific Effects Xiaxian Ou et.al. 2504.21688 link
2025-04-30 Lights Out, Stress In: Assessing Stress Amidst Power and Energy Challenges in Bangladesh Faisal Quaiyyum et.al. 2504.21541 null
2025-04-30 DGFNet: End-to-End Audio-Visual Source Separation Based on Dynamic Gating Fusion Yinfeng Yu et.al. 2504.21366 null
2025-04-30 CMD: Constraining Multimodal Distribution for Domain Adaptation in Stereo Matching Zhelun Shen et.al. 2504.21302 null
2025-04-30 LSTM+Geo with xgBoost Filtering: A Novel Approach for Race and Ethnicity Imputation with Reduced Bias S. Chalavadi et.al. 2504.21259 null
2025-04-29 OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification Shangyu Li et.al. 2504.20964 link
2025-04-29 Imaging on the Edge: Mapping Object Corners and Edges with Stereo X-ray Tomography Zhenduo Shang et.al. 2504.20892 null
2025-04-29 Partitioned Memory Storage Inspired Few-Shot Class-Incremental learning Renye Zhang et.al. 2504.20797 null
2025-04-29 The Anyonic Quantum Carnot Engine H S Mani et.al. 2504.20596 null
2025-04-29 Mordell–Lang and disparate Selmer ranks of odd twists of some superelliptic curves over global function fields Sun Woo Park et.al. 2504.20594 null
2025-04-29 Hetu v2: A General and Scalable Deep Learning System with Hierarchical and Heterogeneous Single Program Multiple Data Annotations Haoyang Li et.al. 2504.20490 link
2025-04-29 The two-clock problem in population dynamics Kaan Öcal et.al. 2504.20388 null
2025-04-29 Neural Stereo Video Compression with Hybrid Disparity Compensation Shiyin Jiang et.al. 2504.20383 null
2025-04-29 Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views Jiang Wu et.al. 2504.20378 link
2025-04-28 $\texttt{SAGE}$: A Generic Framework for LLM Safety Evaluation Madhur Jindal et.al. 2504.19674 link
2025-04-27 Mitigating Bias in Facial Recognition Systems: Centroid Fairness Loss Optimization Jean-Rémy Conti et.al. 2504.19370 null
2025-04-27 Unscented Particle Filter for Visual-inertial Navigation using IMU and Landmark Measurements Khashayar Ghanizadegan et.al. 2504.19318 null
2025-04-27 OPAL: Visibility-aware LiDAR-to-OpenStreetMap Place Recognition via Adaptive Radial Fusion Shuhao Kang et.al. 2504.19258 null
2025-04-26 Minimum Cost Nowhere-zero Flows and Cut-balanced Orientations Karthekeyan Chandrasekaran et.al. 2504.18767 null
2025-04-25 Fairness Is More Than Algorithms: Racial Disparities in Time-to-Recidivism Jessy Xinyi Han et.al. 2504.18629 null
2025-04-25 Are We on the Same Page? Examining Developer Perception Alignment in Open Source Code Reviews Yoseph Berhanu Alebachew et.al. 2504.18407 null
2025-04-25 Study on Real-Time Road Surface Reconstruction Using Stereo Vision Deepak Ghimire et.al. 2504.18112 null
2025-04-29 Factorization Formula Connecting the Shape Functions of Heavy Meson in QCD and Heavy Quark Effective Theory Wei Wang et.al. 2504.18018 null
2025-04-24 LLM Agent Swarm for Hypothesis-Driven Drug Discovery Kevin Song et.al. 2504.17967 null
2025-04-24 Set Phasers to Stun: Beaming Power and Control to Mobile Robots with Laser Light Charles J. Carver et.al. 2504.17865 null
2025-04-24 The Fourth Monocular Depth Estimation Challenge Anton Obukhov et.al. 2504.17787 null
2025-04-24 Spectral Irradiance Variability in Lyman-Alpha Emission During Solar Flares Luke Majury et.al. 2504.17667 null
2025-04-24 Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization Guangyang Zeng et.al. 2504.17410 null
2025-04-24 StereoMamba: Real-time and Robust Intraoperative Stereo Disparity Estimation via Long-range Spatial Dependencies Xu Wang et.al. 2504.17401 null
2025-04-24 Evaluating and Mitigating Bias in AI-Based Medical Text Generation Xiuying Chen et.al. 2504.17279 null
2025-04-23 Structural roles and gender disparities in corruption networks Arthur A. B. Pessa et.al. 2504.17086 null
2025-04-23 Procedural Dataset Generation for Zero-Shot Stereo Matching David Yan et.al. 2504.16930 null
2025-04-23 An Accelerated Camera 3DMA Framework for Efficient Urban GNSS Multipath Estimation Shiyao Lv et.al. 2504.16906 null
2025-04-23 A model of the heliocentric dust ring on Venus orbit Ariane Courtot et.al. 2504.16610 null
2025-04-23 Tinkering Against Scaling Bolun Zhang et.al. 2504.16546 null
2025-04-22 Long-term disparities in the recovery of urban mobility after COVID-19 in Latin America Carmen Cabrera et.al. 2504.15871 null
2025-04-22 DERD-Net: Learning Depth from Event-based Ray Densities Diego de Oliveira Hitzges et.al. 2504.15863 null
2025-04-22 Trustworthy Decentralized Autonomous Machines: A New Paradigm in Automation Economy Fernando Castillo et.al. 2504.15676 null
2025-04-22 Multimodal Perception for Goal-oriented Navigation: A Survey I-Tak Ieong et.al. 2504.15643 null
2025-04-22 Yet Another Diminishing Spark: Low-level Cyberattacks in the Israel-Gaza Conflict Anh V. Vu et.al. 2504.15592 null
2025-04-22 The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks Minghao Wu et.al. 2504.15521 null
2025-04-21 Real-Time Sentiment Insights from X Using VADER, DistilBERT, and Web-Scraped Data Yanampally Abhiram Reddy et.al. 2504.15448 null
2025-04-21 MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video Minh-Quan Viet Bui et.al. 2504.15122 null
2025-04-21 Robust and Real-time Surface Normal Estimation from Stereo Disparities using Affine Transformations Csongor Csanad Kariko et.al. 2504.15121 null
2025-04-21 Sum-Rate Maximization for NOMA-Assisted Pinching-Antenna Systems Ziwu Zhou et.al. 2504.15006 null
2025-04-21 Reliable Multi-Modal Object Re-Identification via Modality-Aware Graph Reasoning Xixi Wan et.al. 2504.14847 null
2025-04-21 Aligning Beam with Imbalanced Multi-modality: A Generative Federated Learning Approach Jiahui Liang et.al. 2504.14835 null
2025-04-20 Polynomial-Time Constant-Approximation for Fair Sum-of-Radii Clustering Sina Bagheri Nezhad et.al. 2504.14683 null
2025-04-20 Regret-aware Re-ranking for Guaranteeing Two-sided Fairness and Accuracy in Recommender Systems Xiaopeng Ye et.al. 2504.14550 null
2025-04-20 Anisotropic quark propagation and Zeeman effect in an external magnetic field Minghui Ding et.al. 2504.14504 null
2025-04-20 sEEG-based Encoding for Sentence Retrieval: A Contrastive Learning Approach to Brain-Language Alignment Yijun Liu et.al. 2504.14468 null
2025-04-19 Balancing Fairness and Performance in Healthcare AI: A Gradient Reconciliation Approach Xiaoyang Wang et.al. 2504.14388 null
2025-04-18 Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion Sandipan Dhar et.al. 2504.13791 null
2025-04-18 Predictors of Childhood Vaccination Uptake in England: An Explainable Machine Learning Analysis of Longitudinal Regional Data (2021-2024) Amin Noroozi et.al. 2504.13755 null
2025-04-18 Divergent LLM Adoption and Heterogeneous Convergence Paths in Research Writing Cong William Lin et.al. 2504.13629 null
2025-04-18 Open-Loop and Closed-Loop Strategies for Linear Quadratic Mean Field Games: The Direct Approach Yong Liang et.al. 2504.13496 null
2025-04-17 Addressing the Minor-Embedding Problem in Quantum Annealing and Evaluating State-of-the-Art Algorithm Performance Aitor Gómez-Tejedor et.al. 2504.13376 null
2025-04-17 Generalized Parton Distributions from Symbolic Regression Anusha Reddy Singireddy et.al. 2504.13289 null
2025-04-17 Prospects for Detecting Signs of Life on Exoplanets in the JWST Era Sara Seager et.al. 2504.12946 null
2025-04-17 Quantifying walkable accessibility to urban services: An application to Florence, Italy Leonardo Boncinelli et.al. 2504.12934 null
2025-04-17 Unsupervised Cross-Domain 3D Human Pose Estimation via Pseudo-Label-Guided Global Transforms Jingjing Liu et.al. 2504.12699 null
2025-04-16 Reinforcement Learning from Human Feedback Nathan Lambert et.al. 2504.12501 link
2025-04-16 A Survey on Archetypal Analysis Aleix Alcacer et.al. 2504.12392 null
2025-04-16 Regist3R: Incremental Registration with Stereo Foundation Model Sidun Liu et.al. 2504.12356 null
2025-04-16 Towards Explainable Fusion and Balanced Learning in Multimodal Sentiment Analysis Miaosen Luo et.al. 2504.12151 null
2025-04-16 Stochastic Quadrature Rules for Solving PDEs using Neural Networks Jamie M. Taylor et.al. 2504.11976 link
2025-04-16 Benchmarking Mutual Information-based Loss Functions in Federated Learning Sarang S et.al. 2504.11877 null
2025-04-16 Boosting Multi-View Stereo with Depth Foundation Model in the Absence of Real-World Labels Jie Zhu et.al. 2504.11845 null
2025-04-15 Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models Maria Teleki et.al. 2504.11431 link
2025-04-15 Breaking the TDD Flow for Over-the-Air Phase Synchronization in Distributed Antenna Systems Khac-Hoang Ngo et.al. 2504.11411 null
2025-04-15 Towards global equity in political polarization research Max Falkenberg et.al. 2504.11090 null
2025-04-15 Meta-learning For Few-Shot Time Series Crop Type Classification: A Benchmark On The EuroCropsML Dataset Joana Reuss et.al. 2504.11022 null
2025-04-15 Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy Botao Zhao et.al. 2504.10819 null
2025-04-14 FuzzSense: Towards A Modular Fuzzing Framework for Autonomous Driving Software Andrew Roberts et.al. 2504.10717 null
2025-04-14 Emotion Alignment: Discovering the Gap Between Social Media and Real-World Sentiments in Persian Tweets and Images Sina Elahimanesh et.al. 2504.10662 null
2025-04-14 Who Speaks for Ethics? How Demographics Shape Ethical Advocacy in Software Development Lauren Olson et.al. 2504.10276 null
2025-04-14 Localized Cultural Knowledge is Conserved and Controllable in Large Language Models Veniamin Veselovsky et.al. 2504.10191 null
2025-04-14 Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution Yiwen Wang et.al. 2504.09887 link
2025-04-14 RAKG:Document-level Retrieval Augmented Knowledge Graph Construction Hairong Zhang et.al. 2504.09823 link
2025-04-13 FastRSR: Efficient and Accurate Road Surface Reconstruction from Bird’s Eye View Yuting Zhao et.al. 2504.09535 null
2025-04-12 “It’s not a representation of me”: Examining Accent Bias and Digital Exclusion in Synthetic AI Voice Services Shira Michel et.al. 2504.09346 null
2025-04-12 CrossLink: A Decentralized Framework for Secure Cross-Chain Smart Contract Execution Tahrim Hossain et.al. 2504.09319 link
2025-04-12 PathVLM-R1: A Reinforcement Learning-Driven Reasoning Model for Pathology Visual-Language Tasks Jianyu Wu et.al. 2504.09258 null
2025-04-15 FairACE: Achieving Degree Fairness in Graph Neural Networks via Contrastive and Adversarial Group-Balanced Training Jiaxin Liu et.al. 2504.09210 null
2025-04-12 Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence Yuxu Lu et.al. 2504.09197 null
2025-04-11 Application of machine learning models to predict the relationship between air pollution, ecosystem degradation, and health disparities and lung cancer in Vietnam Ngoc Hong Tran et.al. 2504.08651 null
2025-04-11 seeBias: A Comprehensive Tool for Assessing and Visualizing AI Fairness Yilin Ning et.al. 2504.08418 link
2025-04-10 Adaptive Bounded Exploration and Intermediate Actions for Data Debiasing Yifan Yang et.al. 2504.08151 link
2025-04-10 Experimental Analysis of Quadcopter Drone Hover Constraints for Localization Improvements Uthman Olawoye et.al. 2504.07843 null
2025-04-10 FairEval: Evaluating Fairness in LLM-Based Recommendations with Personality Awareness Chandan Kumar Sah et.al. 2504.07801 null
2025-04-10 MMLA: Multi-Environment, Multi-Species, Low-Altitude Aerial Footage Dataset Jenna Kline et.al. 2504.07744 null
2025-04-10 Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation Yanglin Huang et.al. 2504.07691 null
2025-04-10 Tuning chirality amplitude at ultrafast timescales Hiroki Ueda et.al. 2504.07599 null
2025-04-10 Echoes of Disagreement: Measuring Disparity in Social Consensus Marios Papachristou et.al. 2504.07480 link
2025-04-10 Continuity conditions weaker than lower semi-continuity Jacob Westerhout et.al. 2504.07451 null
2025-04-10 ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement Anning Hu et.al. 2504.07418 null
2025-04-10 FAIR-SIGHT: Fairness Assurance in Image Recognition via Simultaneous Conformal Thresholding and Dynamic Output Repair Arya Fayyazi et.al. 2504.07395 null
2025-04-09 Universal neural wave functions for high-pressure hydrogen David Linteau et.al. 2504.07062 null
2025-04-09 Identifying Key Challenges of Hardness-Based Resampling Pawel Pukowski et.al. 2504.07031 null
2025-04-09 Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting Daiwei Zhang et.al. 2504.06978 null
2025-04-09 Communicating complex statistical models to a public health audience: translating science into action with the FARSI approach Mattia Stival et.al. 2504.06787 null
2025-04-09 A Novel Nonlinear Fertility Catastrophe Model Based on Thom’s Differential Equations of Morphogenesis Rolando Gonzales Martinez et.al. 2504.06668 null
2025-04-08 Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring José A. Pilartes-Congo et.al. 2504.06464 null
2025-04-08 Computing for Community-Based Economies: A Sociotechnical Ecosystem for Democratic, Egalitarian and Sustainable Futures Kwame Porter Robinson et.al. 2504.06114 null
2025-04-08 Co-evolution of cooperation and resource allocation in the advantageous environment-based spatial multi-game using adaptive control Chengbin Sun et.al. 2504.06112 null
2025-04-08 AI analysis of medical images at scale as a health disparities probe: a feasibility demonstration using chest radiographs Heather M. Whitney et.al. 2504.05990 null
2025-04-08 Uncovering Fairness through Data Complexity as an Early Indicator Juliett Suárez Ferreira et.al. 2504.05923 null
2025-04-08 Thermodynamic supercriticality and complex phase diagram for the AdS black hole Zhen-Ming Xu et.al. 2504.05708 null
2025-04-08 Fairness in Machine Learning-based Hand Load Estimation: A Case Study on Load Carriage Tasks Arafat Rahman et.al. 2504.05610 null
2025-04-07 Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation Manvi Agarwal et.al. 2504.05364 null
2025-04-07 A BLE and UWB Beacon-Assist Framework for Multiuser Augmented Reality Synchronization Across Multiple Devices in Shared Environments Maitree Hirunteeyakul et.al. 2504.05293 null
2025-04-07 CARE: Aligning Language Models for Regional Cultural Awareness Geyang Guo et.al. 2504.05154 link
2025-04-07 Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and Semidensification Yasuhiro Yao et.al. 2504.05148 link
2025-04-07 M-Prometheus: A Suite of Open Multilingual LLM Judges José Pombal et.al. 2504.04953 link
2025-04-07 CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images Cheng Chen et.al. 2504.04753 null
2025-04-06 eKalibr-Stereo: Continuous-Time Spatiotemporal Calibration for Event-Based Stereo Visual Systems Shuolong Chen et.al. 2504.04451 link
2025-04-05 Exploration of Approaches for Robustness and Safety in a Low Code Open Environment for Factory Automation Gustavo Quiros A. et.al. 2504.04224 null
2025-04-05 Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources Zihao Li et.al. 2504.04152 null
2025-04-05 The Labor Market Incidence of New Technologies Tianyu Fan et.al. 2504.04047 null
2025-04-05 Disparate Privacy Vulnerability: Targeted Attribute Inference Attacks and Defenses Ehsanul Kabir et.al. 2504.04033 null
2025-04-04 SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding Yimin Wei et.al. 2504.03254 link
2025-04-03 Bias in Large Language Models Across Clinical Applications: A Systematic Review Thanathip Suenghataiphorn et.al. 2504.02917 null
2025-04-03 Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets Chuning Zhu et.al. 2504.02792 null
2025-04-03 The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context Nikhil Verma et.al. 2504.02708 null
2025-04-02 Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks Ali Al-Kaswan et.al. 2504.01850 null
2025-04-02 SOLAQUA: SINTEF Ocean Large Aquaculture Robotics Dataset Sveinung Johan Ohrem et.al. 2504.01790 null
2025-04-02 DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image Jijun Xiang et.al. 2504.01596 link
2025-04-02 Hyperbolic Diffusion Recommender Model Meng Yuan et.al. 2504.01541 null
2025-04-02 ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue Thomas Pritchard et.al. 2504.01261 link
2025-04-01 Feature-Preserving Mesh Decimation for Normal Integration Moritz Heep et.al. 2504.00867 null
2025-04-01 Bridging the Gap: Integrating Ethics and Environmental Sustainability in AI Research and Practice Alexandra Sasha Luccioni et.al. 2504.00797 null
2025-04-01 Alleviating Performance Disparity in Adversarial Spatiotemporal Graph Learning Under Zero-Inflated Distribution Songran Bai et.al. 2504.00721 null
2025-04-01 ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection Xiaoxuan Zhu et.al. 2504.00695 link
2025-04-01 Using complex prompts to identify fine-grained biases in image generation through ChatGPT-4o Marinus Ferreira et.al. 2504.00388 null
2025-03-31 Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views Chong Bao et.al. 2503.24382 null
2025-03-31 BAR-Analytics: A Web-based Platform for Analyzing Information Spreading Barriers in News: Comparative Analysis Across Multiple Barriers and Events Abdul Sittar et.al. 2503.24220 null
2025-03-31 Ride-Sourcing Vehicle Rebalancing with Service Accessibility Guarantees via Constrained Mean-Field Reinforcement Learning Matej Jusup et.al. 2503.24183 link
2025-03-31 Is LLM the Silver Bullet to Low-Resource Languages Machine Translation? Yewei Song et.al. 2503.24102 null
2025-03-31 Level the Level: Balancing Game Levels for Asymmetric Player Archetypes With Reinforcement Learning Florian Rupp et.al. 2503.24099 link
2025-03-31 Multispacecraft Observations of the 2024 September 9 Backside Solar Eruption that Resulted in a Sustained Gamma Ray Emission Event Nat Gopalswamy et.al. 2503.23852 null
2025-03-31 A PINN Methodology for Temperature Field Reconstruction in the PIV Measurement Plane: Case of Rayleigh-Bénard Convection Marie-Christine Volk et.al. 2503.23801 null
2025-03-31 Consistency-aware Self-Training for Iterative-based Stereo Matching Jingyi Zhou et.al. 2503.23747 null
2025-03-31 Detail-aware multi-view stereo network for depth estimation Haitao Tian et.al. 2503.23684 null
2025-03-30 Third Harmonic Structure in an Interplanetary Type II Radio Burst and Other Energetic Phenomena During the 2024 September 14 Solar Eruption Nat Gopalswamy et.al. 2503.23584 null
2025-03-28 Benchmarking Ultra-Low-Power $μ$NPUs Josh Millar et.al. 2503.22567 null
2025-03-28 A Causal Framework to Measure and Mitigate Non-binary Treatment Discrimination Ayan Majumdar et.al. 2503.22454 link
2025-03-28 Scaling Laws of Scientific Discovery with AI and Robot Scientists Pengsong Zhang et.al. 2503.22444 null
2025-03-28 MVSAnywhere: Zero-Shot Multi-View Stereo Sergio Izquierdo et.al. 2503.22430 null
2025-03-28 Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion Songsong Yu et.al. 2503.22262 null
2025-03-28 An Advanced Ensemble Deep Learning Framework for Stock Price Prediction Using VAE, Transformer, and LSTM Model Anindya Sarkar et.al. 2503.22192 null
2025-03-28 Reflection on Code Contributor Demographics and Collaboration Patterns in the Rust Community Rohit Dandamudi et.al. 2503.22066 null
2025-03-28 Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges Ukcheol Shin et.al. 2503.22060 link
2025-03-27 Improved Tomographic Reconstruction of 3D Global Coronal Density from STEREO/COR1 Observations Tongjiang Wang et.al. 2503.22041 null
2025-03-27 The commutativity problem for effective varieties of formal series, and applications Lorenzo Clemente et.al. 2503.21697 null
2025-03-27 Exploring the Energy Landscape of RBMs: Reciprocal Space Insights into Bosons, Hierarchical Learning and Symmetry Breaking J. Quetzalcóatl Toledo-Marin et.al. 2503.21536 null
2025-03-27 ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo Yuxi Hu et.al. 2503.21525 null
2025-03-27 Behavioral response to mobile phone evacuation alerts Erick Elejalde et.al. 2503.21497 null
2025-03-27 GPU-Accelerated Charge-Equilibration for Shadow Molecular Dynamics in Python Mehmet Cagri Kaymak et.al. 2503.21176 link
2025-03-26 Can Large Language Models Predict Associations Among Human Attitudes? Ana Ma et.al. 2503.21011 null
2025-03-26 CH$_3$OH as a User-Friendly Density Probe: Calibration and Beyond A. Giannetti et.al. 2503.20944 null
2025-03-26 SaViD: Spectravista Aesthetic Vision Integration for Robust and Discerning 3D Object Detection in Challenging Environments Tanmoy Dam et.al. 2503.20614 link
2025-03-26 Emergent properties and the multiscale characterization challenge in condensed matter, from crystals to complex materials: a Review Elisabetta Nocerino et.al. 2503.20266 null
2025-03-26 Attention IoU: Examining Biases in CelebA using Attention Maps Aaron Serianni et.al. 2503.19846 link
2025-03-26 A Survey on Event-driven 3D Reconstruction: Development under Different Categories Chuanzhi Xu et.al. 2503.19753 null
2025-03-25 Fairness in Proof of Team Sprint (PoTS): Evaluating Reward Distribution Across Performance Levels Naoki Yonezawa et.al. 2503.19301 null
2025-03-25 ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency Yang Ren et.al. 2503.19283 link
2025-03-24 Information-Seeking Decision Strategies Mitigate Risk in Dynamic, Uncertain Environments Nicholas W. Barendregt et.al. 2503.19107 link
2025-03-25 Learning to segment anatomy and lesions from disparately labeled sources in brain MRI Meva Himmetoglu et.al. 2503.18840 null
2025-03-24 LeanStereo: A Leaner Backbone based Stereo Network Rafia Rahim et.al. 2503.18557 link
2025-03-24 Distilling Stereo Networks for Performant and Efficient Leaner Networks Rafia Rahim et.al. 2503.18544 link
2025-03-24 Natural Language Processing for Electronic Health Records in Scandinavian Languages: Norwegian, Swedish, and Danish Ashenafi Zebene Woldaregay et.al. 2503.18539 null
2025-03-24 PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model Junyuan Gao et.al. 2503.18484 link
2025-03-24 PS-EIP: Robust Photometric Stereo Based on Event Interval Profile Kazuma Kitazawa et.al. 2503.18341 null
2025-03-24 Vision-Guided Loco-Manipulation with a Snake Robot Adarsh Salagame et.al. 2503.18308 null
2025-03-24 RAU: Towards Regularized Alignment and Uniformity for Representation Learning in Recommendation Xi Wu et.al. 2503.18300 null
2025-03-24 Fact-checking AI-generated news reports: Can LLMs catch their own lies? Jiayi Yao et.al. 2503.18293 null
2025-03-24 GI-SLAM: Gaussian-Inertial SLAM Xulang Liu et.al. 2503.18275 null
2025-03-21 Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors Wonbong Jang et.al. 2503.17316 null
2025-03-21 Uncovering cooling center usage as an adaptation strategy for hurricane-blackout-heat compound hazards during Hurricane Beryl (2024) Tianle Duan et.al. 2503.17292 null
2025-03-21 Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers Gaojie Jin et.al. 2503.17172 null
2025-03-21 Exploring Few-Shot Object Detection on Blood Smear Images: A Case Study of Leukocytes and Schistocytes Davide Antonio Mura et.al. 2503.17107 null
2025-03-21 TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting Jianchuan Chen et.al. 2503.17032 null
2025-03-21 Exploring the Role of Women in Hugging Face Organizations Maria Tubella Salinas et.al. 2503.17000 link
2025-03-21 DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery Jiadong Tang et.al. 2503.16964 null
2025-03-21 A Flexible Fairness Framework with Surrogate Loss Reweighting for Addressing Sociodemographic Disparities Wen Xu et.al. 2503.16836 null
2025-03-20 RESFL: An Uncertainty-Aware Framework for Responsible Federated Learning by Balancing Privacy, Fairness and Utility in Autonomous Vehicles Dawood Wasif et.al. 2503.16251 null
2025-03-20 Variance-Aware Noisy Training: Hardening DNNs against Unstable Analog Computations Xiao Wang et.al. 2503.16183 null
2025-03-19 Quantum entropy as a harbinger of factorizability Henry Bloss et.al. 2503.15603 null
2025-03-19 Evaluating Bias in Retrieval-Augmented Medical Question-Answering Systems Yuelyu Ji et.al. 2503.15454 null
2025-03-19 Beacon2Science: Enhancing STEREO/HI beacon data with machine learning for efficient CME tracking Justin Le Louëdec et.al. 2503.15288 link
2025-03-19 EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds Yuanchao Yue et.al. 2503.15284 link
2025-03-19 Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening Zihan Cao et.al. 2503.14975 null
2025-03-19 Body-Hand Modality Expertized Networks with Cross-attention for Fine-grained Skeleton Action Recognition Seungyeon Cho et.al. 2503.14960 null
2025-03-19 USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network Joseph Emmanuel DL Dayo et.al. 2503.14950 null
2025-03-18 VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms Seungwon Lim et.al. 2503.14427 link
2025-03-18 Exploring Disparity-Accuracy Trade-offs in Face Recognition Systems: The Role of Datasets, Architectures, and Loss Functions Siddharth D Jaiswal et.al. 2503.14138 null
2025-03-17 SED-MVS: Segmentation-Driven and Edge-Aligned Deformation Multi-View Stereo with Depth Restoration and Occlusion Constraint Zhenlong Yuan et.al. 2503.13721 null
2025-03-17 Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios Iryna Repinetska et.al. 2503.13710 null
2025-03-17 A Circular Construction Product Ontology for End-of-Life Decision-Making Kwabena Adu-Duodu et.al. 2503.13708 null
2025-03-17 Subgroup Performance of a Commercial Digital Breast Tomosynthesis Model for Breast Cancer Detection Beatrice Brown-Mulry et.al. 2503.13581 null
2025-03-17 Scale Efficient Training for Large Datasets Qing Zhou et.al. 2503.13385 link
2025-03-17 Financial Adviser Misconduct and Labor Market Penalties: Uncovering Racial Disparities in the Absence of Gender Gaps Jun Honda et.al. 2503.12837 null
2025-03-17 Stereo Event-based, 6-DOF Pose Tracking for Uncooperative Spacecraft Zibin Liu et.al. 2503.12732 link
2025-03-17 GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching Feng Qiao et.al. 2503.12720 link
2025-03-16 A novel association and ranking approach identifies factors affecting educational outcomes of STEM majors Kira Adaricheva et.al. 2503.12321 link
2025-03-15 Robust Isolation Forest using Soft Sparse Random Projection and Valley Emphasis Method Hun Kang et.al. 2503.12125 null
2025-03-18 3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene Reconstruction Peizhen Zheng et.al. 2503.12001 link
2025-03-14 Black Older Adults’ Perception of Using Voice Assistants to Enact a Medical Recovery Curriculum Andrea Green et.al. 2503.11894 null
2025-03-14 Bridging the LLM Accessibility Divide? Performance, Fairness, and Cost of Closed versus Open LLMs for Automated Essay Scoring Kezia Oketch et.al. 2503.11827 null
2025-03-14 Thermodynamics of the Hubbard Model on the Bethe Lattice Jia-Lin Chen et.al. 2503.11598 link
2025-03-14 TikZero: Zero-Shot Text-Guided Graphics Program Synthesis Jonas Belouadi et.al. 2503.11509 link
2025-03-14 An automated geometric space curve approach for designing dynamically corrected gates Evangelos Piliouras et.al. 2503.11492 link
2025-03-14 ARCAS: Adaptive Runtime System for Chiplet-Aware Scheduling Alessandro Fogli et.al. 2503.11460 null
2025-03-14 AQUA-SLAM: Tightly-Coupled Underwater Acoustic-Visual-Inertial SLAM with Sensor Calibration Shida Xu et.al. 2503.11420 link
2025-03-14 Exploring Competitive and Collusive Behaviors in Algorithmic Pricing with Deep Reinforcement Learning Shidi Deng et.al. 2503.11270 null
2025-03-14 NF-SLAM: Effective, Normalizing Flow-supported Neural Field representations for object-level visual SLAM in automotive applications Li Cui et.al. 2503.11199 null
2025-03-14 SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets Hao Liu et.al. 2503.11133 null
2025-03-14 TigerLLM – A Family of Bangla Large Language Models Nishat Raihan et.al. 2503.10995 link
2025-03-13 Design and Development of the MeCO Open-Source Autonomous Underwater Vehicle David Widhalm et.al. 2503.10928 null
2025-03-13 Controlling the dynamical phase diagram of a spinor BEC using time-dependent potentials Q. Guan et.al. 2503.10563 null
2025-03-13 Subgroup Performance Analysis in Hidden Stratifications Alceu Bissoto et.al. 2503.10382 null
2025-03-13 Identifying Trustworthiness Challenges in Deep Learning Models for Continental-Scale Water Quality Prediction Xiaobo Xia et.al. 2503.09947 null
2025-03-12 Approximately Counting and Sampling Hamiltonian Motifs in Sublinear Time Talya Eden et.al. 2503.09810 null
2025-03-12 How good are deep learning methods for automated road safety analysis using video data? An experimental study Qingwu Liu et.al. 2503.09807 null
2025-03-12 BiasConnect: Investigating Bias Interactions in Text-to-Image Models Pushkar Shukla et.al. 2503.09763 null
2025-03-12 Resolving the Kagome Origin of the Strange Metallicity in Ni$_3$In Jean C. Souza et.al. 2503.09704 null
2025-03-12 Edge AI for Real-time Fetal Assessment in Rural Guatemala Nasim Katebi et.al. 2503.09659 null
2025-03-12 IUP: Integrated and Programmable User Plane for Next-Generation Mobile Networks Chieh-Chun Chen et.al. 2503.09430 null
2025-03-12 OpenVidVRD: Open-Vocabulary Video Visual Relation Detection via Prompt-Driven Semantic Space Alignment Qi Liu et.al. 2503.09416 null
2025-03-12 GRU: Mitigating the Trade-off between Unlearning and Retention for Large Language Models Yue Wang et.al. 2503.09117 null
2025-03-12 StratIncon Detector: Analyzing Strategy Inconsistencies Between Real-Time Strategy and Preferred Professional Strategy in MOBA Esports Ruofei Ma et.al. 2503.09060 null
2025-03-11 BoundarEase: Fostering Constructive Community Engagement to Inform More Equitable Student Assignment Policies Cassandra Overney et.al. 2503.08543 link
2025-03-11 Does excellence correspond to universal inequality level? Evidences from scholarly citations and Olympic medal data Soumyajyoti Biswas et.al. 2503.08480 null
2025-03-11 SegDesicNet: Lightweight Semantic Segmentation in Remote Sensing with Geo-Coordinate Embeddings for Domain Adaptation Sachin Verma et.al. 2503.08290 null
2025-03-11 CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning Kaiqiang Xiong et.al. 2503.08219 null
2025-03-10 The Janus Face of Innovation: Global Disparities and Divergent Options Nihat Mugurtay et.al. 2503.07676 null
2025-03-10 VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models Jen-tse Huang et.al. 2503.07575 link
2025-03-10 OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation Ding Zhong et.al. 2503.07098 null
2025-03-10 SDFA: Structure Aware Discriminative Feature Aggregation for Efficient Human Fall Detection in Video Sania Zahan et.al. 2503.07008 null
2025-03-10 Kinetic model and numerical method for multispecies radiation hydrodynamic system with multiscale nonequilibrium transport Mingyu Quan et.al. 2503.06906 null
2025-03-09 DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning Chengxuan Qian et.al. 2503.06456 link
2025-03-09 Socioeconomic centers in cities worldwide Shuai Pang et.al. 2503.06445 link
2025-03-09 Global physics-informed neural networks (GPINNs): from local point-wise constraint to global nodal association Feng Chen et.al. 2503.06403 null
2025-03-08 Mitigating Blockchain extractable value (BEV) threats by Distributed Transaction Sequencing in Blockchains Xiongfei Zhao et.al. 2503.06279 null
2025-03-08 Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations Meng Wang et.al. 2503.06222 null
2025-03-08 Generation of Optimized Solidity Code for Machine Learning Models using LLMs Nikumbh Sarthak Sham et.al. 2503.06203 null
2025-03-07 Stereo Any Video: Temporally Consistent Stereo Matching Junpeng Jing et.al. 2503.05549 null
2025-03-07 Asteroid phase curves and phase coloring effect using the ATLAS survey data Colazo Milagros et.al. 2503.05412 null
2025-03-07 Preparing Tetra-Digit Long-Range Entangled States via Unified Sequential Quantum Circuit Yu-Tao Hu et.al. 2503.05374 null
2025-03-07 Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects Justin Yu et.al. 2503.05189 null
2025-03-07 RocketEval: Efficient Automated LLM Evaluation via Grading Checklist Tianjun Wei et.al. 2503.05142 link
2025-03-06 Addressing the Subsumption Thesis: A Formal Bridge between Microeconomics and Active Inference Noe Kuhn et.al. 2503.05048 null
2025-03-06 MIDAS: Modeling Ground-Truth Distributions with Dark Knowledge for Domain Generalized Stereo Matching Peng Xu et.al. 2503.04376 null
2025-03-06 Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English Runtao Zhou et.al. 2503.04099 null
2025-03-06 Uncovering inequalities in new knowledge learning by large language models across different languages Chenglong Wang et.al. 2503.04064 link
2025-03-05 Connecting the dots: Tracing the evolutionary pathway of Polar Ring Galaxies in the cases of NGC 3718, NGC 2685, and NGC 4262 Krishna R. Akhil et.al. 2503.03709 null
2025-03-05 The Roles of Size, Packing, and Cohesion in the Emergence of Force Chains in Granular Packings Ankit Shrivastava et.al. 2503.03668 null
2025-03-05 Improved FPT Approximation Algorithms for TSP Jingyang Zhao et.al. 2503.03642 null
2025-03-05 Topo Goes Political: TDA-Based Controversy Detection in Imbalanced Reddit Political Data Arvindh Arun et.al. 2503.03500 null
2025-03-05 BANet: Bilateral Aggregation Network for Mobile Stereo Matching Gangwei Xu et.al. 2503.03259 link
2025-03-05 Transformer-Based Spatio-Temporal Association of Apple Fruitlets Harry Freeman et.al. 2503.03200 null
2025-03-04 CADDI: An in-Class Activity Detection Dataset using IMU data from low-cost sensors Luis Marquez-Carpintero et.al. 2503.02853 null
2025-03-04 Educational Assortative Mating and Household Income Inequality: Evidence from Brazil, Indonesia, Mexico, and South Africa Ana Kujundzic et.al. 2503.02713 null
2025-03-04 XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification Xiaoyu Zheng et.al. 2503.02619 null
2025-03-04 Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation Dengke Zhang et.al. 2503.02459 link
2025-03-04 Tabby: Tabular Data Synthesis with Language Models Sonia Cromp et.al. 2503.02152 null
2025-03-03 Building Machine Learning Challenges for Anomaly Detection in Science Elizabeth G. Campolongo et.al. 2503.02112 null
2025-03-03 Understanding Urban-Rural Disparities in Mobility Inefficiency for Colombia, Mexico, and India Nandini Iyer et.al. 2503.01810 link
2025-03-03 MUSt3R: Multi-view Network for Stereo 3D Reconstruction Yohann Cabon et.al. 2503.01661 link
2025-03-03 Unmasking Implicit Bias: Evaluating Persona-Prompted LLM Responses in Power-Disparate Social Scenarios Bryan Chen Zhengyu Tan et.al. 2503.01532 null
2025-03-03 RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation Shu Pan et.al. 2503.01434 null
2025-02-28 Back to the Future Cyclopean Stereo: a human perception approach unifying deep and geometric constraints Sherlon Almeida da Silva et.al. 2502.21280 null
2025-02-28 An LLM-based Delphi Study to Predict GenAI Evolution Francesco Bertolotti et.al. 2502.21092 null
2025-02-28 Modelling the Spatially Varying Non-Linear Effects of Heat Exposure Xinyi Chen et.al. 2502.20745 null
2025-02-28 Displaying Fear, Sadness, and Joy in Public: Schizophrenia Vloggers’ Video Narration of Emotion and Online Care-Seeking Jiaying “Lizzy” Liu et.al. 2502.20658 null
2025-02-28 FedConv: A Learning-on-Model Paradigm for Heterogeneous Federated Clients Leming Shen et.al. 2502.20639 link
2025-02-27 Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis Jeffrey Yang Fan Chiang et.al. 2502.20383 null
2025-02-27 UniTok: A Unified Tokenizer for Visual Generation and Understanding Chuofan Ma et.al. 2502.20321 link
2025-02-27 Educator Attention: How computational tools can systematically identify the distribution of a key resource for students Qingyang Zhang et.al. 2502.20135 null
2025-02-26 Treatment Non-Adherence Bias in Clinical Machine Learning: A Real-World Study on Hypertension Medication Zhongyuan Liang et.al. 2502.19625 null
2025-02-26 Do LLMs exhibit demographic parity in responses to queries about Human Rights? Rafiya Javed et.al. 2502.19463 null
2025-03-01 GraphBridge: Towards Arbitrary Transfer Learning in GNNs Li Ju et.al. 2502.19252 link
2025-02-26 Improving the quality of Web-mined Parallel Corpora of Low-Resource Languages using Debiasing Heuristics Aloka Fernando et.al. 2502.19074 null
2025-02-26 The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training Jinbo Wang et.al. 2502.19002 null
2025-02-26 Disparities in Magnetic Cloud Observations Between Two Spacecraft Having Small Radial and Angular Separations Near 1 AU Anjali Agarwal et.al. 2502.18919 null
2025-02-26 M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance Qingpei Guo et.al. 2502.18778 null
2025-02-26 Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance Xueqing Peng et.al. 2502.18772 null
2025-02-26 Deep-Bench: Deep Learning Benchmark Dataset for Code Generation Alireza Daghighfarsoodeh et.al. 2502.18726 null
2025-02-25 Expected Variational Inequalities Brian Hu Zhang et.al. 2502.18605 null
2025-02-25 Exploring Gender Disparities in Automatic Speech Recognition Technology Hend ElGhazaly et.al. 2502.18434 null
2025-02-25 A Kinetic Model of Solar Wind Acceleration Driven by Ambipolar Electric Potential and Velocity-Space Diffusion Maximilien Péters de Bonhome et.al. 2502.18132 null
2025-02-25 PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching Han Nie et.al. 2502.18104 link
2025-02-25 Assessing Large Language Models in Agentic Multilingual National Bias Qianying Liu et.al. 2502.17945 null
2025-02-25 Escaping the Subprime Trap in Algorithmic Lending Adam Bouyamourn et.al. 2502.17816 null
2025-02-25 Radial dependence of ion fluences in the 2023 July 17 SEP event from Parker Solar Probe to STEREO and ACE G. D. Muro et.al. 2502.17806 null
2025-02-25 FinP: Fairness-in-Privacy in Federated Learning by Addressing Disparities in Privacy Risk Tianyu Zhao et.al. 2502.17748 null
2025-02-24 Homophilic Effects on Economic Inequality: A Dynamic Network Agent-Based Model Gustavo L. Kohlrausch et.al. 2502.17705 null
2025-02-24 $A$-Norm and $A$-numerical Radius Inequalities for Sums Of Operators in semi-Hilbertian spaces M. H. M. Rashid et.al. 2502.17696 null
2025-02-24 The DECADE cosmic shear project III: validation of analysis pipeline using spatially inhomogeneous data D. Anbajagane et.al. 2502.17676 null
2025-02-24 Kandinsky Conformal Prediction: Beyond Class- and Covariate-Conditional Coverage Konstantina Bairaktari et.al. 2502.17264 null
2025-02-24 Determinants of the Spousal Age Gap in India: Analysis of Indian Microdata Praveen et.al. 2502.17059 null
2025-02-24 Achieving Fair PCA Using Joint Eigenvalue Decomposition Vidhi Rathore et.al. 2502.16933 null
2025-02-24 PulseBat: A field-accessible dataset for second-life battery diagnostics from realistic histories using multidimensional rapid pulse test Shengyu Tao et.al. 2502.16848 null
2025-02-23 Optical appearance of a boson star with soliton potential Ke-Jian He et.al. 2502.16623 null
2025-02-23 Unmasking Societal Biases in Respiratory Support for ICU Patients through Social Determinants of Health Mira Moukheiber et.al. 2502.16477 link
2025-02-23 Make Literature-Based Discovery Great Again through Reproducible Pipelines Bojan Cestnik et.al. 2502.16450 link
2025-02-23 Facilitating Emergency Vehicle Passage in Congested Urban Areas Using Multi-agent Deep Reinforcement Learning Haoran Su et.al. 2502.16449 null
2025-02-22 Semantic Gaussian Mixture Variational Autoencoder for Sequential Recommendation Beibei Li et.al. 2502.16140 link
2025-02-22 A Trust-Aware and Cost-Optimized Blockchain Oracle Selection Model with Deep Reinforcement Learning Hengyang Zhang et.al. 2502.16133 link
2025-02-21 MoMa: A Modular Deep Learning Framework for Material Property Prediction Botian Wang et.al. 2502.15483 null
2025-02-21 UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction Chenyu Li et.al. 2502.15199 null
2025-02-21 Graph-Based Deep Learning on Stereo EEG for Predicting Seizure Freedom in Epilepsy Patients Artur Agaronyan et.al. 2502.15198 null
2025-02-21 TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba Xiuwei Chen et.al. 2502.15130 null
2025-02-20 Electron Beam Propagation and Radio-Wave Scattering in the Inner Heliosphere using Five Spacecraft Luis Alberto Cañizares et.al. 2502.15067 null
2025-02-20 Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion Jiangyuan Liu et.al. 2502.14616 link
2025-02-20 OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images Zhichao Zheng et.al. 2502.14279 null
2025-02-20 Asymmetric Co-Training for Source-Free Few-Shot Domain Adaptation Gengxu Li et.al. 2502.14214 link
2025-02-20 Stereo Image Coding for Machines with Joint Visual Feature Compression Dengchao Jin et.al. 2502.14190 null
2025-02-19 The NavINST Dataset for Multi-Sensor Autonomous Navigation Paulo Ricardo Marques de Araujo et.al. 2502.13863 null
2025-02-19 CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement Zheng Wu et.al. 2502.13624 null
2025-02-18 Two Tickets are Better than One: Fair and Accurate Hiring Under Strategic LLM Manipulations Lee Cohen et.al. 2502.13221 null
2025-02-18 Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks Markus J. Buehler et.al. 2502.13025 link
2025-02-18 Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version) Tianyi Zhang et.al. 2502.13017 null
2025-02-18 High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion Xiang Zhang et.al. 2502.12752 null
2025-02-18 Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection Zijian Cao et.al. 2502.12735 null
2025-02-18 Simulated Bifurcation with High-dimensional Expansion for Traffic Signal Optimization on Real-world Networks Shengda Zhao et.al. 2502.12440 null
2025-02-17 The impact of job stability on monetary poverty in Italy: causal small area estimation Katarzyna Reluga et.al. 2502.12376 null
2025-02-17 Healthcare cost prediction for heterogeneous patient profiles using deep learning models with administrative claims data Mohammad Amin Morid et.al. 2502.12277 null
2025-02-17 A versatile experimental method to measure the traction forces at interfaces Yingwei Hou et.al. 2502.12044 null
2025-02-17 pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM Luigi Freda et.al. 2502.11955 link
2025-02-17 BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages Shamsuddeen Hassan Muhammad et.al. 2502.11926 link
2025-02-17 Weak solutions and sharp interface limit of the anisotropic Cahn-Hilliard equation with disparate mobility and inhomogeneous potential Charles Elbar et.al. 2502.11849 null
2025-02-17 Text Classification in the LLM Era - Where do we stand? Sowmya Vajjala et.al. 2502.11830 null
2025-02-17 Deep Neural Networks for Accurate Depth Estimation with Latent Space Features Siddiqui Muhammad Yasir et.al. 2502.11777 null
2025-02-17 SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking Zijian Wu et.al. 2502.11534 null
2025-02-16 Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation Kunal Swami et.al. 2502.11002 null
2025-02-15 Do Deepfake Detectors Work in Reality? Simiao Ren et.al. 2502.10920 null
2025-02-15 Mobile Robotic Multi-View Photometric Stereo Suryansh Kumar et.al. 2502.10842 null
2025-02-14 Enhancing Multilingual LLM Pretraining with Model-Based Data Selection Bettina Messmer et.al. 2502.10361 null
2025-02-14 Merging public elementary schools to reduce racial/ethnic segregation Madison Landry et.al. 2502.10193 link
2025-02-14 Evaluating and Improving Graph-based Explanation Methods for Multi-Agent Coordination Siva Kailas et.al. 2502.09889 null
2025-02-13 Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages Shreyan Biswas et.al. 2502.09532 null
2025-02-13 SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest Jack Erhardt et.al. 2502.09528 null
2025-02-13 Diffusion Models Through a Global Lens: Are They Culturally Inclusive? Zahra Bayramli et.al. 2502.08914 null
2025-02-13 Uncovering Disparities in Rideshare Drivers Earning and Work Patterns: A Case Study of Chicago Hy Dang et.al. 2502.08893 null
2025-02-12 Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors Vishwanath Pratap Singh et.al. 2502.08587 null
2025-02-12 An entropy based comparative study of regional and seasonal distributions of particulate matter in Indian cities Suchismita Banerjee et.al. 2502.08491 null
2025-02-12 Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision Tianle Liu et.al. 2502.08352 null
2025-02-12 Emergent dimer-model topological order and quasi-particle excitations in liquid crystals: combinatorial vortex lattices Cuiling Meng et.al. 2502.08314 null
2025-02-12 Unlocking Scaling Law in Industrial Recommendation Systems with a Three-step Paradigm based Large User Model Bencheng Yan et.al. 2502.08309 null
2025-02-12 From Individual Experience to Collective Evidence: A Reporting-Based Framework for Identifying Systemic Harms Jessica Dai et.al. 2502.08166 link
2025-02-11 Federated Self-supervised Domain Generalization for Label-efficient Polyp Segmentation Xinyi Tan et.al. 2502.07951 null
2025-02-11 Small Area Estimation of Education Levels in Low- and Middle-Income Countries Yunhan Wu et.al. 2502.07946 link
2025-02-11 PFedDST: Personalized Federated Learning with Decentralized Selection Training Mengchen Fan et.al. 2502.07750 null
2025-02-11 A Nonparametric and Functional Wombling Methodology Luke A. Barratt et.al. 2502.07740 null
2025-02-11 HGTUL: A Hypergraph-based Model For Trajectory User Linking Fengjie Chang et.al. 2502.07549 null
2025-02-11 MoENAS: Mixture-of-Expert based Neural Architecture Search for jointly Accurate, Fair, and Robust Edge Deep Neural Networks Lotfi Abdelkrim Mecharbat et.al. 2502.07422 null
2025-02-11 BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Xu Huang et.al. 2502.07346 link
2025-02-11 Music for All: Exploring Multicultural Representations in Music Generation Models (Camera Ready) Atharva Mehta et.al. 2502.07328 link
2025-02-11 Does Training on Synthetic Data Make Models Less Robust? Lingze Zhang et.al. 2502.07164 null
2025-02-11 Feature Importance Depends on Properties of the Data: Towards Choosing the Correct Explanations for Your Data and Decision Trees based Models Célia Wafa Ayad et.al. 2502.07153 null
2025-02-10 Using Contextually Aligned Online Reviews to Measure LLMs’ Performance Disparities Across Language Varieties Zixin Tang et.al. 2502.07058 null
2025-02-10 A Compiler for Operations on Relations with Bag Semantics James Dong et.al. 2502.06988 null
2025-02-10 Beyond Literal Token Overlap: Token Alignability for Multilinguality Katharina Hämmerl et.al. 2502.06468 null
2025-02-10 On the reason for the widespread energetic storm particle event of 13 March 2023 N. Dresing et.al. 2502.06332 null
2025-02-10 The digital labour of artificial intelligence in Latin America: a comparison of Argentina, Brazil, and Venezuela Paola Tubaro et.al. 2502.06317 null
2025-02-08 Knowledge is Power: Harnessing Large Language Models for Enhanced Cognitive Diagnosis Zhiang Dong et.al. 2502.05556 null
2025-02-07 Point-Identifying Semiparametric Sample Selection Models with No Excluded Variable Dongwoo Kim et.al. 2502.05353 null
2025-02-07 Differentiable Mobile Display Photometric Stereo Gawoon Ban et.al. 2502.05055 null
2025-02-07 Unified Approaches in Self-Supervised Event Stream Modeling: Progress and Prospects Levente Zólyomi et.al. 2502.04899 null
2025-02-07 Practical implementation of a chiral phononic crystal demonstrator with ultra-low frequency bandgap Line Mardini et.al. 2502.04775 null
2025-02-06 Targeted Learning for Data Fairness Alexander Asemota et.al. 2502.04309 null
2025-02-06 Online Learning of Counter Categories and Ratings in PvP Games Chiu-Chou Lin et.al. 2502.03998 null
2025-02-06 Fairness Aware Reinforcement Learning via Proximal Policy Optimization Gabriele La Malfa et.al. 2502.03953 null
2025-02-05 Large Teams Overshadow Individual Recognition Lulin Yang et.al. 2502.03623 null
2025-02-04 How Inclusively do LMs Perceive Social and Moral Norms? Michael Galarnyk et.al. 2502.02696 link
2025-02-04 Fairness in Survival Analysis: A Novel Conditional Mutual Information Augmentation Approach Tianyang Xie et.al. 2502.02567 null
2025-02-04 Review of Demographic Bias in Face Recognition Ketan Kotwal et.al. 2502.02309 null
2025-02-04 Ilargi: a GPU Compatible Factorized ML Model Training Framework Wenbo Sun et.al. 2502.01985 null
2025-02-03 CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition Martijn Bartelds et.al. 2502.01777 null
2025-02-03 Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool Floris Holstege et.al. 2502.01713 null
2025-02-03 Comprehensive Modeling Approaches for Forecasting Bitcoin Transaction Fees: A Comparative Study Jiangqin Ma et.al. 2502.01029 null
2025-02-02 Fruit Fly Classification (Diptera: Tephritidae) in Images, Applying Transfer Learning Erick Andrew Bustamante Flores et.al. 2502.00939 null
2025-02-02 Psychometric-Based Evaluation for Theorem Proving with Large Language Models Jianyu Zhang et.al. 2502.00855 null
2025-02-01 DeepUKF-VIN: Adaptively-tuned Deep Unscented Kalman Filter for 3D Visual-Inertial Navigation based on IMU-Vision-Net Khashayar Ghanizadegan et.al. 2502.00575 null
2025-02-01 Evaluation of End-to-End Continuous Spanish Lipreading in Different Data Conditions David Gimeno-Gómez et.al. 2502.00464 link
2025-01-31 Beyond checkmate: exploring the creative chokepoints in AI text Nafis Irtiza Tripto et.al. 2501.19301 link
2025-02-03 DyPCL: Dynamic Phoneme-level Contrastive Learning for Dysarthric Speech Recognition Wonjun Lee et.al. 2501.19010 null
2025-01-31 Examining the Impact of Income Inequality and Gender on School Completion in Malaysia: A Machine Learning Approach Utilizing Malaysia’s Public Sector Open Data Muhammad Sukri Bin Ramli et.al. 2501.18868 null
2025-01-31 Systematic Uncertainties in the Measurement of Neutron lifetime Using Lunar Prospector Neutron Spectrometer Akshatha Vydula et.al. 2501.18831 null
2025-01-30 Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion Vitor Guizilini et.al. 2501.18804 null
2025-01-30 CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering Yumeng Wang et.al. 2501.18457 null
2025-01-30 Surface Defect Identification using Bayesian Filtering on a 3D Mesh Matteo Dalle Vedove et.al. 2501.18315 null
2025-01-29 From tools to thieves: Measuring and understanding public perceptions of AI through crowdsourced metaphors Myra Cheng et.al. 2501.18045 null
2025-01-29 STGCN-LSTM for Olympic Medal Prediction: Dynamic Power Modeling and Causal Policy Optimization Yiquan Wang et.al. 2501.17711 null
2025-01-29 Cross-Language Approach for Quranic QA Islam Oshallah et.al. 2501.17449 null
2025-01-29 Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models Yuxuan Li et.al. 2501.17420 null
2025-01-28 Stiff Transfer Learning for Physics-Informed Neural Networks Emilien Seiler et.al. 2501.17281 null
2025-01-28 Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving Evgenii Evstafev et.al. 2501.17084 null
2025-01-28 Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning Xi Chen et.al. 2501.16966 null
2025-01-28 Hybrid Phenology Modeling for Predicting Temperature Effects on Tree Dormancy Ron van Bree et.al. 2501.16848 link
2025-01-28 Strawberry Robotic Operation Interface: An Open-Source Device for Collecting Dexterous Manipulation Data in Robotic Strawberry Farming Linsheng Hou et.al. 2501.16717 null
2025-01-27 BiFold: Bimanual Cloth Folding with Language Guidance Oriol Barbany et.al. 2501.16458 null
2025-01-27 Will nanodust reappear in STEREO/WAVES data? Nicole Meyer-Vernet et.al. 2501.16133 null
2025-01-27 SampleLLM: Optimizing Tabular Data Synthesis in Recommendations Jingtong Gao et.al. 2501.16125 null
2025-01-27 Vienna Mosaic: Navigating Social Borders in a Melting Pot Marc Sadurní et.al. 2501.15920 link
2025-01-26 Fuzzy-aware Loss for Source-free Domain Adaptation in Visual Emotion Recognition Ying Zheng et.al. 2501.15519 null
2025-01-26 Dfilled: Repurposing Edge-Enhancing Diffusion for Guided DSM Void Filling Daniel Panangian et.al. 2501.15440 null
2025-01-26 Evaluating Simple Debiasing Techniques in RoBERTa-based Hate Speech Detection Models Diana Iftimie et.al. 2501.15430 null
2025-01-26 A General Approach to Relaxing Unconfoundedness Matthew A. Masten et.al. 2501.15400 null
2025-01-25 Fairness in LLM-Generated Surveys Andrés Abeliuk et.al. 2501.15351 null
2025-01-25 Fairness-aware Contextual Dynamic Pricing with Strategic Buyers Pangpang Liu et.al. 2501.15338 null
2025-01-25 The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders? Ayo Adedeji et.al. 2501.15310 null
2025-01-24 Fairness of Deep Ensembles: On the interplay between per-group task difficulty and under-representation Estanislao Claucich et.al. 2501.14551 null
2025-01-24 SoK: What Makes Private Learning Unfair? Kai Yao et.al. 2501.14414 null
2025-01-22 Synthetic CT image generation from CBCT: A Systematic Review Alzahra Altalib et.al. 2501.13972 null
2025-01-23 Analysis of Indic Language Capabilities in LLMs Aatman Vaidya et.al. 2501.13912 null
2025-01-23 You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain Timothy Chase Jr et.al. 2501.13725 null
2025-01-23 Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers Akshit Achara et.al. 2501.13302 link
2025-01-22 Flying shape and aerodynamics of a full-scale flexible Olympic windsurf sail J. Zhang et.al. 2501.13254 null
2025-01-22 On the development of open geographical data infrastructures in Latin America: progress and challenges Daniela Ballari et.al. 2501.13235 null
2025-01-22 Enhancing Multi-Attribute Fairness in Healthcare Predictive Modeling Xiaoyang Wang et.al. 2501.13219 null
2025-01-22 Machine Learning Modeling for Multi-order Human Visual Motion Processing Zitang Sun et.al. 2501.12810 link
2025-01-22 Exploring Wikipedia Gender Diversity Over Time – The Wikipedia Gender Dashboard (WGD) Yahya Yunus et.al. 2501.12610 null
2025-01-23 Academic Case Reports Lack Diversity: Assessing the Presence and Diversity of Sociodemographic and Behavioral Factors related to Post COVID-19 Condition Juan Andres Medina Florez et.al. 2501.12538 null
2025-01-21 Decoherence of Schrödinger cat states in light of wave/particle duality Th. K. Mavrogordatos et.al. 2501.12328 null
2025-01-21 Improving robot understanding using conversational AI: demonstration and feasibility study Shikhar Kumar et.al. 2501.12214 null
2025-01-21 Towards autonomous photogrammetric forest inventory using a lightweight under-canopy robotic drone Väinö Karjalainen et.al. 2501.12073 null
2025-01-21 Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging Shuyi Hu et.al. 2501.11884 null
2025-01-21 FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients Jiaqi Leng et.al. 2501.11876 link
2025-01-20 Are generative models fair? A study of racial bias in dermatological image generation Miguel López-Pérez et.al. 2501.11752 null
2025-01-20 Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy Saeid Asgari Taghanaki et.al. 2501.11721 link
2025-01-20 Multi-View Spectral Clustering for Graphs with Multiple View Structures Yorgos Tsitsikas et.al. 2501.11422 link
2025-01-20 UniTrans: A Unified Vertical Federated Knowledge Transfer Framework for Enhancing Cross-Hospital Collaboration Chung-ju Huang et.al. 2501.11388 link
2025-01-20 Mitigating Spatial Disparity in Urban Prediction Using Residual-Aware Spatiotemporal Graph Neural Networks: A Chicago Case Study Dingyi Zhuang et.al. 2501.11214 null
2025-01-17 DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration Huiyun Cao et.al. 2501.10325 null
2025-01-17 Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt Qingcheng Zeng et.al. 2501.09950 null
2025-01-17 FoundationStereo: Zero-Shot Stereo Matching Bowen Wen et.al. 2501.09898 link
2025-01-16 Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment Maksim Filipenko et.al. 2501.09490 null
2025-01-16 DEFOM-Stereo: Depth Foundation Model Based Stereo Matching Hualie Jiang et.al. 2501.09466 link
2025-01-15 TeV afterglow emission from a multi-component GRB jet using the kinetic approach John P. Hope et.al. 2501.09093 null
2025-01-15 How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias Tosin Fadahunsi et.al. 2501.09014 link
2025-01-15 StereoGen: High-quality Stereo Image Generation from a Single Image Xianqi Wang et.al. 2501.08654 null
2025-01-15 MonSter: Marry Monodepth to Stereo Unleashes Power Junda Cheng et.al. 2501.08643 link
2025-01-15 Image-to-Force Estimation for Soft Tissue Interaction in Robotic-Assisted Surgery Using Structured Light Jiayin Wang et.al. 2501.08593 null
2025-01-15 Addressing Intersectionality, Explainability, and Ethics in AI-Driven Diagnostics: A Rebuttal and Call for Transdiciplinary Action Myles Joshua Toledo Tan et.al. 2501.08497 null
2025-01-16 Navigating Gender Disparities in Communication Research Leadership: Academic Recognition, Career Development, and Compensation Diego F. M. Oliveira et.al. 2501.08401 null
2025-01-14 TriMod Fusion for Multimodal Named Entity Recognition in Social Media Mosab Alfaqeeh et.al. 2501.08267 null
2025-01-13 An Investigation of Experiences Engaging the Margins in Data-Centric Innovation Gabriella Thompson et.al. 2501.07690 null
2025-01-13 Digital Twin for Smart Societies: A Catalyst for Inclusive and Accessible Healthcare Joshit Mohanty et.al. 2501.07570 null
2025-01-13 TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models Thales Sales Almeida et.al. 2501.07482 link
2025-01-13 PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations Ting-Yu Dai et.al. 2501.07447 null
2025-01-13 Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion Li Liang et.al. 2501.07260 link
2025-01-13 Depth and Image Fusion for Road Obstacle Detection Using Stereo Camera Oleg Perezyabov et.al. 2501.07245 null
2025-01-13 Combined effect of incentives and coupling in multigames in two-layer networks Luo-Luo Jiang et.al. 2501.07193 null
2025-01-13 Reducing Latency by Eliminating CSIT Feedback: FDD Downlink MIMO Precoding Without CSIT Feedback for Internet-of-Things Communications Juntaek Han et.al. 2501.07094 null
2025-01-12 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927 link
2025-01-12 Integrators at War: Mediating in AI-assisted Resort-to-Force Decisions Dennis Müller et.al. 2501.06861 null
2025-01-12 Enabling Cardiac Monitoring using In-ear Ballistocardiogram on COTS Wireless Earbuds Yongjian Fu et.al. 2501.06744 null
2025-01-10 A monthly sub-national Harmonized Food Insecurity Dataset for comprehensive analysis and predictive modeling Machefer Mélissande et.al. 2501.06076 null
2025-01-10 “Cause” is Mechanistic Narrative within Scientific Domains: An Ordinary Language Philosophical Critique of “Causal Machine Learning” Vyacheslav Kungurtsev et.al. 2501.05844 null
2025-01-10 An Efficient Dual ADMM for Huber Regression with Fused Lasso Penalty Mengjiao Shi et.al. 2501.05676 null
2025-01-10 The Impact of Model Scaling on Seen and Unseen Language Performance Rhitabrat Pokharel et.al. 2501.05629 null
2025-01-09 Datasheets for Healthcare AI: A Framework for Transparency and Bias Mitigation Marjia Siddik et.al. 2501.05617 null
2025-01-09 Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping Wen Tianci et.al. 2501.05242 null
2025-01-09 An Algorithmic Approach for Causal Health Equity: A Look at Race Differentials in Intensive Care Unit (ICU) Outcomes Drago Plecko et.al. 2501.05197 null
2025-01-09 A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision Ali Rohan et.al. 2501.05147 null
2025-01-08 Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations Kirandeep Kaur et.al. 2501.04762 null
2025-01-09 Do Automated Fixes Truly Mitigate Smart Contract Exploits? Sofia Bobadilla et.al. 2501.04600 link
2025-01-08 Towards Fair Class-wise Robustness: Class Optimal Distribution Adversarial Training Hongxin Zhi et.al. 2501.04527 null
2025-01-08 Neighborhood Disparities in Smart City Service Adoption Shahaf Donio et.al. 2501.04363 null
2025-01-07 MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives Wisdom O. Ikezogwo et.al. 2501.04184 null
2025-01-07 Unifying restart accelerated gradient and proximal bundle methods Jiaming Liang et.al. 2501.04165 null
2025-01-07 Spanish heat waves curb discretionary mobility and alter work behavior Andrew Renninger et.al. 2501.03978 null
2025-01-07 Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware Hegel Pedroza et.al. 2501.03720 null
2025-01-06 Solar Cycle Variation of Axial Orientations and Favorable Locations of Eruptive MFRs Hong Xie et.al. 2501.03346 null
2025-01-06 CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation Yuanhong Chen et.al. 2501.02786 null
2025-01-05 Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera Yuliang Guo et.al. 2501.02464 link
2025-01-05 Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap Hyunwoo Ko et.al. 2501.02448 null
2025-01-05 Unsupervised Search for Ethnic Minorities’ Medical Segmentation Training Set Yixiao Chen et.al. 2501.02442 link
2025-01-04 The Integration of Blockchain and Artificial Intelligence for Secure Healthcare Systems Umar Safdar et.al. 2501.02169 null
2025-01-03 How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data Ishaan Maitra et.al. 2501.02111 link
2025-01-03 VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment Wenyan Cong et.al. 2501.01949 link
2025-01-03 Exploring Equality: An Investigation into Custom Loss Functions for Fairness Definitions Gordon Lee et.al. 2501.01889 null
2025-01-03 CycleFlow: Leveraging Cycle Consistency in Flow Matching for Speaker Style Adaptation Ziqi Liang et.al. 2501.01861 null
2025-01-03 MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling Simon Rouard et.al. 2501.01757 null
2025-01-03 The Essence of Contextual Understanding in Theory of Mind: A Study on Question Answering with Story Characters Chulun Zhou et.al. 2501.01705 null
2025-01-03 CrossView-GS: Cross-view Gaussian Splatting For Large-scale Scene Reconstruction Chenhao Zhang et.al. 2501.01695 null
2025-01-03 Equity Impacts of Public Transit Network Redesign with Shared Autonomous Mobility Services Max T. M. Ng et.al. 2501.01615 null
2025-01-02 CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries Shudong Liu et.al. 2501.01282 null
2025-01-02 TS-SatMVSNet: Slope Aware Height Estimation for Large-Scale Earth Terrain Multi-view Stereo Song Zhang et.al. 2501.01049 null
2025-01-02 Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer Ziyang Chen et.al. 2501.01023 link
2025-01-01 High-Probability Polynomial-Time Complexity of Restarted PDHG for Linear Programming Zikai Xiong et.al. 2501.00728 null
2024-12-31 H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters Pedram Fekri et.al. 2501.00514 null
2024-12-31 Who Gets Recommended? Investigating Gender, Race, and Country Disparities in Paper Recommendations from Large Language Models Yifan Tian et.al. 2501.00367 null
2024-12-31 SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation Shi-Feng Peng et.al. 2501.00303 link
2024-12-30 A Data-Centric Approach to Detecting and Mitigating Demographic Bias in Pediatric Mental Health Text: A Case Study in Anxiety Detection Julia Ive et.al. 2501.00129 null
2024-12-30 What Makes for a Good Stereoscopic Image? Netanel Y. Tamir et.al. 2412.21127 null
2024-12-30 Closing Speed Computation using Stereo Camera and Applications in Unsignalized T-Intersection Gautam Kumar et.al. 2412.20717 null
2024-12-30 MarsSQE: Stereo Quality Enhancement for Martian Images Using Bi-level Cross-view Attention Mai Xu et.al. 2412.20685 null
2024-12-29 Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control Bingliang Li et.al. 2412.20378 null
2024-12-29 Impact of Data Distribution on Fairness Guarantees in Equitable Deep Learning Yan Luo et.al. 2412.20377 link
2024-12-29 FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation Yan Luo et.al. 2412.20374 link
2024-12-29 Dual-Level Precision Edges Guided Multi-View Stereo with Accurate Planarization Kehua Chen et.al. 2412.20328 link
2024-12-28 The impact of China’s economic growth on poverty alleviation: From absolute to relative poverty Yixun Kang et.al. 2412.20176 null
2024-12-28 Neutron star stability beyond the mass peak: assessing the role of out-of-equilibrium perturbations Martin O. Canullan-Pascual et.al. 2412.20133 null
2024-12-28 Incentivizing supplemental math assignments and using AI-generated hints improve exam performance, especially for racially minoritized students Yifan Lu et.al. 2412.19961 null
2024-12-27 Analysis of Premature Death Rates in Texas Counties: The Impact of Air Quality, Socioeconomic Factors, and COPD Prevalence Richard Rich et.al. 2412.19774 null
2024-12-27 Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis Jiaqi Wang et.al. 2412.19654 link
2024-12-27 Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference Keke Zhang et.al. 2412.19553 null
2024-12-27 Is Your Text-to-Image Model Robust to Caption Noise? Weichen Yu et.al. 2412.19531 null
2024-12-27 Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images Xudong Cai et.al. 2412.19518 null
2024-12-27 Disparate Model Performance and Stability in Machine Learning Clinical Support for Diabetes and Heart Diseases Ioannis Bilionis et.al. 2412.19495 null
2024-12-27 Effects of Reynolds number and spatial resolution on the pressure source terms in turbulent boundary layers Aditya Agarwal et.al. 2412.19474 null
2024-12-26 MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo Byeonggwon Lee et.al. 2412.19130 null
2024-12-25 Evaluating authorship disambiguation quality through anomaly analysis on researchers’ career transition Huaxia Zhou et.al. 2412.18757 null
2024-12-24 Uncertainty Quantification in Stereo Matching Wenxiao Cai et.al. 2412.18703 link
2024-12-24 Topological phases protected by projective PT symmetry in alkaline-earth-like atoms Xiaofan Zhou et.al. 2412.18494 null
2024-12-24 scReader: Prompting Large Language Models to Interpret scRNA-seq Data Cong Li et.al. 2412.18156 null
2024-12-24 Fundamental Limits in the Search for Less Discriminatory Algorithms – and How to Avoid Them Benjamin Laufer et.al. 2412.18138 null
2024-12-23 Shifted Composition III: Local Error Framework for KL Divergence Jason M. Altschuler et.al. 2412.17997 null
2024-12-23 A Multimodal Fusion Framework for Bridge Defect Detection with Cross-Verification Ravi Datta Rachuri et.al. 2412.17968 null
2024-12-23 Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective Xinmiao Yu et.al. 2412.17787 null
2024-12-23 Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings Jérémie Sublime et.al. 2412.17486 null
2024-12-24 Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement Hyeonjin Kim et.al. 2412.17387 link
2024-12-22 Fairness in Reinforcement Learning with Bisimulation Metrics Sahand Rezaei-Shoshtari et.al. 2412.17123 null
2024-12-22 Differentially Private Random Block Coordinate Descent Artavazd Maranjyan et.al. 2412.17054 null
2024-12-22 Lightweight Design and Optimization methods for DCNNs: Progress and Futures Hanhua Long et.al. 2412.16886 null
2024-12-21 Does calibration mean what they say it means; or, the reference class problem rises again Lily Hu et.al. 2412.16769 null
2024-12-21 ViM-Disparity: Bridging the Gap of Speed, Accuracy and Memory for Disparity Map Generation Maheswar Bora et.al. 2412.16745 link
2024-12-21 LUCES-MV: A Multi-View Dataset for Near-Field Point Light Source Photometric Stereo Fotios Logothetis et.al. 2412.16737 null
2024-12-21 A Unifying Family of Data-Adaptive Partitioning Algorithms Guy B. Oldaker IV et.al. 2412.16713 null
2024-12-20 Climate Impact Assessment Requires Weighting: Introducing the Weighted Climate Dataset Marco Gortan et.al. 2412.15699 null
2024-12-20 Gender Disparities in Contributions, Leadership, and Collaboration: An Exploratory Study on Software Systems Research Shamse Tasnim Cynthia et.al. 2412.15661 null
2024-12-20 Radio filaments as Z-pinched Galactic center wind Fan Zhang et.al. 2412.15575 null
2024-12-20 SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation Ke Yan et.al. 2412.15526 link
2024-12-19 Uncertainty-Guided Cross Attention Ensemble Mean Teacher for Semi-supervised Medical Image Segmentation Meghana Karri et.al. 2412.15380 null
2024-12-19 Tiled Diffusion Or Madar et.al. 2412.15185 null
2024-12-19 Improving Geometry in Sparse-View 3DGS via Reprojection-based DoF Separation Yongsung Kim et.al. 2412.14568 null
2024-12-19 Provincial allocation of China’s commercial building operational carbon towards carbon neutrality Yanqiao Deng et.al. 2412.14523 null
2024-12-19 Who is Helping Whom? Student Concerns about AI-Teacher Collaboration in Higher Education Classrooms Bingyi Han et.al. 2412.14469 null
2024-12-19 An Immersive Multi-Elevation Multi-Seasonal Dataset for 3D Reconstruction and Visualization Xijun Liu et.al. 2412.14418 null
2024-12-18 I0T: Embedding Standardization Method Towards Zero Modality Gap Na Min An et.al. 2412.14384 link
2024-12-18 Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs David Restrepo et.al. 2412.14304 null
2024-12-18 What Has Been Overlooked in Contrastive Source-Free Domain Adaptation: Leveraging Source-Informed Latent Augmentation within Neighborhood Context Jing Wang et.al. 2412.14301 link
2024-12-18 On Calibration in Multi-Distribution Learning Rajeev Verma et.al. 2412.14142 null
2024-12-18 LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research Tianyang Gu et.al. 2412.14141 null
2024-12-18 Performance Gap in Entity Knowledge Extraction Across Modalities in Vision Language Models Ido Cohen et.al. 2412.14133 link
2024-12-18 Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation Rémi Marsal et.al. 2412.14103 null
2024-12-18 Neural Combinatorial Optimization for Stochastic Flexible Job Shop Scheduling Problems Igor G. Smit et.al. 2412.14052 link
2024-12-18 What If: Causal Analysis with Graph Databases Amedeo Pachera et.al. 2412.13965 null
2024-12-18 MobiFuse: A High-Precision On-device Depth Perception System with Multi-Data Fusion Jinrui Zhang et.al. 2412.13848 null
2024-12-18 A2H: A UI Converter from Android to HarmonyOS Platform Chen Wang et.al. 2412.13693 link
2024-12-18 Soft Modes as a Predictive Framework for Low Dimensional Biological Systems across Scales Christopher Joel Russo et.al. 2412.13637 null
2024-12-18 SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation Kazuki Shimada et.al. 2412.13462 null
2024-12-17 C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System Parker Addison et.al. 2412.13163 null
2024-12-17 Unlocking the Potential of Digital Pathology: Novel Baselines for Compression Maximilian Fischer et.al. 2412.13137 null
2024-12-17 Queries, Representation & Detection: The Next 100 Model Fingerprinting Schemes Augustin Godinot et.al. 2412.13021 link
2024-12-17 AoI in Context-Aware Hybrid Radio-Optical IoT Networks Aymen Hamrouni et.al. 2412.12914 null
2024-12-17 ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation Shiqi Huang et.al. 2412.12798 link
2024-12-17 Preference Robust Ordinal Priority Approach and its Satisficing Extension for Multi-Attribute Decision-Making with Incomplete Information Renlong Wang et.al. 2412.12690 null
2024-12-17 SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing Chen Chen et.al. 2412.12685 link
2024-12-17 DriveTester: A Unified Platform for Simulation-Based Autonomous Driving Testing Mingfei Cheng et.al. 2412.12656 link
2024-12-17 PBVS 2024 Solution: Self-Supervised Learning and Sampling Strategies for SAR Classification in Extreme Long-Tail Distribution Yuhyun Kim et.al. 2412.12565 null
2024-12-17 Beyond Data Quantity: Key Factors Driving Performance in Multilingual Language Models Sina Bagheri Nezhad et.al. 2412.12500 link
2024-12-16 CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models Felix Taubner et.al. 2412.12093 null
2024-12-16 IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations Zhibing Li et.al. 2412.12083 null
2024-12-16 Hybrid quantum network for sensing in the acoustic frequency range Valeriy Novikov et.al. 2412.11824 null
2024-12-16 Image Gradient-Aided Photometric Stereo Network Kaixuan Wang et.al. 2412.11650 null
2024-12-16 DVP-MVS: Synergize Depth-Edge and Visibility Prior for Multi-View Stereo Zhenlong Yuan et.al. 2412.11578 null
2024-12-16 RoMeO: Robust Metric Visual Odometry Junda Cheng et.al. 2412.11530 null
2024-12-16 SpatialMe: Stereo Video Conversion Using Depth-Warping and Blend-Inpainting Jiale Zhang et.al. 2412.11512 null
2024-12-15 On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning Pengfei Fang et.al. 2412.11017 null
2024-12-13 EvalGIM: A Library for Evaluating Generative Image Models Melissa Hall et.al. 2412.10604 link
2024-12-13 Enhancing Fine-Grained Vision-Language Pretraining with Negative Augmented Samples Yeyuan Wang et.al. 2412.10029 null
2024-12-13 All-in-One: Transferring Vision Foundation Models into Stereo Matching Jingyi Zhou et.al. 2412.09912 null
2024-12-13 OpenForge: Probabilistic Metadata Integration Tianji Cong et.al. 2412.09788 link
2024-12-12 Egyptian fractions meet the Sierpinski triangle Laura De Carli et.al. 2412.09728 null
2024-12-12 Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos Linyi Jin et.al. 2412.09621 null
2024-12-12 Learned Compression for Compressed Learning Dan Jacobellis et.al. 2412.09405 link
2024-12-12 T-SVG: Text-Driven Stereoscopic Video Generation Qiao Jin et.al. 2412.09323 null
2024-12-12 Multimodal Sentiment Analysis based on Video and Audio Inputs Antonio Fernandez et.al. 2412.09317 null
2024-12-12 Pinpoint Counterfactuals: Reducing social bias in foundation models via localized counterfactual generation Kirill Sirotkin et.al. 2412.09160 null
2024-12-12 LV-CadeNet: Long View Feature Convolution-Attention Fusion Encoder-Decoder Network for Clinical MEG Spike Detection Kuntao Xiao et.al. 2412.08896 null
2024-12-11 jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images Andreas Koukounas et.al. 2412.08802 null
2024-12-11 TGOSPA Metric Parameters Selection and Evaluation for Visual Multi-object Tracking Jan Krejčí et.al. 2412.08321 null
2024-12-11 Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation Marta R. Costa-jussà et.al. 2412.08279 null
2024-12-11 Neural Observation Field Guided Hybrid Optimization of Camera Placement Yihan Cao et.al. 2412.08266 link
2024-12-11 Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions Mohammadmostafa Rostamkhani et.al. 2412.08169 link
2024-12-11 Rigid Communication Topologies: Impact on Stability, Safety, Energy Consumption, Passenger Comfort, and Robustness of Vehicular Platoons Amir Zakerimanesh et.al. 2412.08122 null
2024-12-11 Multilingual LLMs Inherently Reward In-Language Time-Sensitive Semantic Alignment for Low-Resource Languages Ashutosh Bajpai et.al. 2412.08090 link
2024-12-10 A large language model-based approach to quantifying the effects of social determinants in liver transplant decisions Emily Robitschek et.al. 2412.07924 null
2024-12-10 ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Jinyi Hu et.al. 2412.07720 link
2024-12-10 Access to care improves EHR reliability and clinical risk prediction model performance Anna Zink et.al. 2412.07712 null
2024-12-10 Stereo Hand-Object Reconstruction for Human-to-Robot Handover Yik Lung Pang et.al. 2412.07487 null
2024-12-10 PRM: Photometric Stereo based Large Reconstruction Model Wenhang Ge et.al. 2412.07371 null
2024-12-10 A Bayesian Mixture Model Approach to Examining Neighborhood Social Determinants of Health Disparities in Endometrial Cancer Care in Massachusetts Carmen B. Rodriguez et.al. 2412.07134 null
2024-12-10 TT-MPD: Test Time Model Pruning and Distillation Haihang Wu et.al. 2412.07114 null
2024-12-09 MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds Zhenggang Tang et.al. 2412.06974 null
2024-12-09 Bridging the Divide: Reconsidering Softmax and Linear Attention Dongchen Han et.al. 2412.06590 link
2024-12-09 Emerging Challenges in Molecular Paleontology: Misapplication of Environmental DNA Fragments and Misconception of Deamination as a Key Criterion for In Situ DNA Identification Wan-Qian Zhao et.al. 2412.06378 null
2024-12-09 SeFENet: Robust Deep Homography Estimation via Semantic-Driven Feature Enhancement Zeru Shi et.al. 2412.06352 null
2024-12-08 DECO: Life-Cycle Management of Enterprise-Grade Chatbots Yiwen Zhu et.al. 2412.06099 null
2024-12-08 Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors Alex Rich et.al. 2412.05771 null
2024-12-07 On the effective transfer of knowledge from English to Hindi Wikipedia Paramita Das et.al. 2412.05708 link
2024-12-07 A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions Ola Shorinwa et.al. 2412.05563 null
2024-12-06 Excitation spectrum of a double supersolid in a trapped dipolar Bose mixture Daniel Scheiermann et.al. 2412.05215 null
2024-12-06 Automatic Tissue Differentiation in Parotidectomy using Hyperspectral Imaging Eric L. Wisotzky et.al. 2412.04879 null
2024-12-06 Differentially Private Random Feature Model Chunyang Liao et.al. 2412.04785 link
2024-12-06 Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs Kun Wu et.al. 2412.04747 null
2024-12-05 From Models to Systems: A Comprehensive Fairness Framework for Compositional Recommender Systems Brian Hsu et.al. 2412.04655 null
2024-12-05 Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail Luca Bartolomei et.al. 2412.04472 link
2024-12-05 Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird’s-Eye-View via Uncertainty Measure Saheli Hazra et.al. 2412.04337 null
2024-12-05 Complexity of Vector-valued Prediction: From Linear Models to Stochastic Convex Optimization Matan Schliserman et.al. 2412.04274 null
2024-12-05 Relationships between Keywords and Strong Beats in Lyrical Music Callie C. Liao et.al. 2412.04202 null
2024-12-05 Adult Glioma Segmentation in Sub-Saharan Africa using Transfer Learning on Stratified Finetuning Data Abhijeet Parida et.al. 2412.04111 link
2024-12-05 Augmenting Minds or Automating Skills: The Differential Role of Human Capital in Generative AI’s Impact on Creative Tasks Meiling Huang et.al. 2412.03963 null
2024-12-05 BEFL: Balancing Energy Consumption in Federated Learning for Mobile Edge IoT Zehao Ju et.al. 2412.03950 link
2024-12-05 MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application Hyesu Jang et.al. 2412.03887 null
2024-12-05 E-Commerce in Africa: Divergent Impacts on Rural and Urban Economies Jaelyn S. Liang et.al. 2412.03879 null
2024-12-05 Un-evaluated Solutions May Be Valuable in Expensive Optimization Hao Hao et.al. 2412.03858 null
2024-12-04 Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter Hermes McGriff et.al. 2412.03518 null
2024-12-04 NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images Lingen Li et.al. 2412.03517 null
2024-12-04 Data Fusion of Semantic and Depth Information in the Context of Object Detection Md Abu Yusuf et.al. 2412.03490 null
2024-12-04 Exploring trends in audio mixes and masters: Insights from a dataset analysis Angeliki Mourgela et.al. 2412.03373 null
2024-12-04 TASR: Timestep-Aware Diffusion Model for Image Super-Resolution Qinwei Lin et.al. 2412.03355 link
2024-12-04 Social media and suicide: empirical evidence from the quasi-exogenous geographical adoption of Twitter Alexis Du et.al. 2412.03217 null
2024-12-04 MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras Huai Yu et.al. 2412.03146 link
2024-12-03 Quaternion-based Unscented Kalman Filter for 6-DoF Vision-based Inertial Navigation in GPS-denied Regions Khashayar Ghanizadegan et.al. 2412.02768 null
2024-12-03 ROVER: A Multi-Season Dataset for Visual SLAM Fabian Schmidt et.al. 2412.02506 link
2024-12-03 Single-Shot Metric Depth from Focused Plenoptic Cameras Blanca Lasheras-Hernandez et.al. 2412.02386 null
2024-12-03 Dual Exposure Stereo for Extended Dynamic Range 3D Imaging Juhyung Choi et.al. 2412.02351 null
2024-12-03 SparseLGS: Sparse View Language Embedded Gaussian Splatting Jun Hu et.al. 2412.02245 null
2024-12-03 Crash Severity Risk Modeling Strategies under Data Imbalance Abdullah Al Mamun et.al. 2412.02094 null
2024-12-02 Mutli-View 3D Reconstruction using Knowledge Distillation Aditya Dutt et.al. 2412.02039 link
2024-12-02 A Shared Standard for Valid Measurement of Generative AI Systems’ Capabilities, Risks, and Impacts Alexandra Chouldechova et.al. 2412.01934 null
2024-12-02 World-consistent Video Diffusion with Explicit 3D Modeling Qihang Zhang et.al. 2412.01821 null
2024-12-03 FairML: A Julia Package for Fair Classification Jan Pablo Burgard et.al. 2412.01585 link
2024-12-02 Second FRCSyn-onGoing: Winning Solutions and Post-Challenge Analysis to Improve Face Recognition with Synthetic Data Ivan DeAndres-Tame et.al. 2412.01383 null
2024-11-29 Quantifying the synthetic and real domain gap in aerial scene understanding Alina Marcu et.al. 2411.19913 null
2024-11-29 Privacy-Preserving Orthogonal Aggregation for Guaranteeing Gender Fairness in Federated Recommendation Siqing Zhang et.al. 2411.19678 null
2024-11-29 Subjective and Objective Quality Assessment Methods of Stereoscopic Videos with Visibility Affecting Distortions Sria Biswas et.al. 2411.19522 null
2024-12-02 GausSurf: Geometry-Guided 3D Gaussian Splatting for Surface Reconstruction Jiepeng Wang et.al. 2411.19454 null
2024-11-28 Cross-Spectral Attention for Unsupervised RGB-IR Face Verification and Person Re-identification Kshitij Nikhal et.al. 2411.19215 null
2024-11-28 Examining Multimodal Gender and Content Bias in ChatGPT-4o Roberto Balestri et.al. 2411.19140 null
2024-11-28 Tracking Progress Towards Sustainable Development Goal 6 Using Satellite Imagery Othmane Echchabi et.al. 2411.19093 null
2024-11-28 Study on the Influence of Embodied Avatars on Gait Parameters in Virtual Environments and Real World Tianyi Zhou et.al. 2411.18949 null
2024-11-27 A Talent-infused Policy-gradient Approach to Efficient Co-Design of Morphology and Task Allocation Behavior of Multi-Robot Systems Prajit KrisshnaKumar et.al. 2411.18519 null
2024-11-27 A comparison of extended object tracking with multi-modal sensors in indoor environment Jiangtao Shuai et.al. 2411.18476 null
2024-11-27 When does a bridge become an aeroplane? Tina A. Dardeno et.al. 2411.18406 null
2024-11-27 Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation Mehdi Zayene et.al. 2411.18335 link
2024-11-27 Pixel-aligned RGB-NIR Stereo Imaging and Dataset for Robot Vision Jinnyeong Kim et.al. 2411.18025 null
2024-11-26 Updating the constraint on the quantum collapse models via kilogram masses Qi Dai et.al. 2411.17588 null
2024-11-26 Navigating Spatial Inequities in Freight Truck Crash Severity via Counterfactual Inference in Los Angeles Yichen Wang et.al. 2411.17554 null
2024-11-26 Variational Quantum Simulation of the Fokker-Planck Equation applied to Quantum Radiation Reaction Óscar Amaro et.al. 2411.17517 link
2024-11-26 Object-centric proto-symbolic behavioural reasoning from pixels Ruben van Bergen et.al. 2411.17438 link
2024-11-26 Enhancing Imbalance Learning: A Novel Slack-Factor Fuzzy SVM Approach M. Tanveer et.al. 2411.17128 link
2024-11-26 Multimodal Alignment and Fusion: A Survey Songtao Li et.al. 2411.17040 null
2024-11-24 PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation Ziyao Zeng et.al. 2411.16750 null
2024-11-25 Location-Based Service (LBS) Data Quality Metrics and Effects on Mobility Inference Xinhua Wu et.al. 2411.16595 null
2024-11-23 IRSKG: Unified Intrusion Response System Knowledge Graph Ontology for Cyber Defense Damodar Panigrahi et.al. 2411.15672 null
2024-11-23 Elucidating the nature of axial-vector charm-antibottom tetraquark states U. Özdem et.al. 2411.15508 null
2024-11-22 Adaptive Group Robust Ensemble Knowledge Distillation Patrik Kenfack et.al. 2411.14984 null
2024-11-22 A Benchmark Dataset for Collaborative SLAM in Service Environments Harin Park et.al. 2411.14775 link
2024-11-22 FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification Zhengrui Guo et.al. 2411.14743 link
2024-11-22 Boson-fermion universality of mesoscopic entanglement fluctuations in free systems Cunzhong Lou et.al. 2411.14687 null
2024-11-21 Learning Fair Robustness via Domain Mixup Meiyu Zhong et.al. 2411.14424 null
2024-11-21 InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation Marziyeh Bamdad et.al. 2411.14358 link
2024-11-21 StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart Jian Shi et.al. 2411.14295 link
2024-11-21 Why do language models perform worse for morphologically complex languages? Catherine Arnett et.al. 2411.14198 link
2024-11-21 Compact Visual Data Representation for Green Multimedia – A Human Visual System Perspective Peilin Chen et.al. 2411.14135 null
2024-11-21 Stereo Anything: Unifying Stereo Matching with Large-Scale Mixed Data Xianda Guo et.al. 2411.14053 link
2024-11-21 XAgents: A Framework for Interpretable Rule-Based Multi-Agents Cooperation Hailong Yang et.al. 2411.13932 null
2024-11-20 Predictive Insights into LGBTQ+ Minority Stress: A Transductive Exploration of Social Media Discourse S. Chapagain et.al. 2411.13534 link
2024-11-20 Non-Perturbative Corrections to Charged Black Hole Evaporation Vyshnav Mohan et.al. 2411.13454 null
2024-11-20 BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation Umamaheswaran Raman Kumar et.al. 2411.13251 null
2024-11-20 Asymptotic-Preserving schemes for the Boltzmann mixture model with disparate mass Zhen Hao et.al. 2411.13240 null
2024-11-20 Superpixel Cost Volume Excitation for Stereo Matching Shanglong Liu et.al. 2411.13105 null
2024-11-19 MLDGG: Meta-Learning for Domain Generalization on Graphs Qin Tian et.al. 2411.12913 null
2024-11-19 Towards Fairness in AI for Melanoma Detection: Systemic Review and Recommendations Laura N Montoya et.al. 2411.12846 null
2024-11-19 Human-Robot Dialogue Annotation for Multi-Modal Common Ground Claire Bonial et.al. 2411.12829 link
2024-11-19 Probing the Capacity of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction Sonny George et.al. 2411.12828 link
2024-11-19 Multivariate and Online Transfer Learning with Uncertainty Quantification Jimmy Hickey et.al. 2411.12555 null
2024-11-19 Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution Yang Zou et.al. 2411.12530 link
2024-11-19 Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph Ziyang Chen et.al. 2411.12426 link
2024-11-19 Cities beyond proximity Dan Hill et.al. 2411.12335 null
2024-11-19 Neuro-3D: Towards 3D Visual Decoding from EEG Signals Zhanqiang Guo et.al. 2411.12248 null
2024-11-18 MMBind: Unleashing the Potential of Distributed and Heterogeneous Data for Multimodal Learning in IoT Xiaomin Ouyang et.al. 2411.12126 null
2024-11-18 Fair Distillation: Teaching Fairness from Biased Teachers in Medical Imaging Milad Masroor et.al. 2411.11939 null
2024-11-18 SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input Zhen Lv et.al. 2411.11934 null
2024-11-18 The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather Markus Schön et.al. 2411.11455 null
2024-11-18 Causal Effect of Group Diversity on Redundancy and Coverage in Peer-Reviewing Navita Goyal et.al. 2411.11437 null
2024-11-17 Label Sharing Incremental Learning Framework for Independent Multi-Label Segmentation Tasks Deepa Anand et.al. 2411.11105 null
2024-11-16 BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment Sizhe Wang et.al. 2411.10914 null
2024-11-16 DEAL: Decoupled Classifier with Adaptive Linear Modulation for Group Robust Early Diagnosis of MCI to AD Conversion Donggyu Lee et.al. 2411.10814 null
2024-11-16 LTCXNet: Advancing Chest X-Ray Analysis with Solutions for Long-Tailed Multi-Label Classification and Fairness Challenges Chin-Wei Huang et.al. 2411.10746 null
2024-11-16 A Wearable Gait Monitoring System for 17 Gait Parameters Based on Computer Vision Jiangang Chen et.al. 2411.10739 null
2024-11-15 The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods Yifu Tao et.al. 2411.10546 null
2024-11-15 Debias-CLR: A Contrastive Learning Based Debiasing Method for Algorithmic Fairness in Healthcare Applications Ankita Agarwal et.al. 2411.10544 null
2024-11-15 Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion Haoran Wei et.al. 2411.10369 null
2024-11-15 Domain Adaptation-based Edge Computing for Cross-Conditions Fault Diagnosis Yanzhi Wang et.al. 2411.10340 null
2024-11-15 Filament eruption deflection and associated CMEs K. Koleva et.al. 2411.10110 null
2024-11-15 Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses Yongfan Liu et.al. 2411.10013 link
2024-11-15 Assessing Response Disparities in California Wildland-Urban-Interface (WUI) Cities Using the Compartmental Model Zihui Ma et.al. 2411.09946 null
2024-11-14 Propensity Score Matching: Should We Use It in Designing Observational Studies? Fei Wan et.al. 2411.09579 null
2024-11-14 Everyone deserves their voice to be heard: Analyzing Predictive Gender Bias in ASR Models Applied to Dutch Speech Data Rik Raes et.al. 2411.09431 null
2024-11-14 Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching Yuran Wang et.al. 2411.09151 null
2024-11-14 Artificial Intelligence for Quantum Computing Yuri Alexeev et.al. 2411.09131 null
2024-11-13 Fluoroformer: Scaling multiple instance learning to multiplexed images via attention-based channel fusion Marc Harary et.al. 2411.08975 link
2024-11-13 Gendered Words and Grant Rates: A Textual Analysis of Disparate Outcomes in the Patent System Deborah Gerhardt et.al. 2411.08526 null
2024-11-13 Anomalous Hall effect from inter-superlattice scattering in a noncollinear antiferromagnet Lilia S. Xie et.al. 2411.08381 null
2024-11-12 Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset Khaoula Chehbouni et.al. 2411.08243 null
2024-11-12 Detection asymmetry in solar energetic particle events S. Dalla et.al. 2411.08211 null
2024-11-12 Estimating Variability in Hospital Charges: The Case of Cesarean Section Anna Perfilyeva et.al. 2411.08174 null
2024-11-11 Identifying Differential Patient Care Through Inverse Intent Inference Hyewon Jeong et.al. 2411.07372 null
2024-11-11 Targeting mediating mechanisms of social disparities with an interventional effects framework, applied to the gender pay gap in West Germany Christiane Didden et.al. 2411.07368 null
2024-11-11 $SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation Yinshuang Xu et.al. 2411.07326 null
2024-11-11 Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations Kirti Bhagat et.al. 2411.07320 link
2024-11-10 Analysis of spatially clustered survival data with unobserved covariates using SBART Durbadal Ghosh et.al. 2411.06591 null
2024-11-10 Image Segmentation from Shadow-Hints using Minimum Spanning Trees Moritz Heep et.al. 2411.06530 null
2024-11-10 SymmeTac: Symmetric Color LED Driven Efficient Photometric Stereo Reconstruction Methods for Camera-based Tactile Sensors Jieji Ren et.al. 2411.06377 link
2024-11-08 Characterizing Implementability of Global Protocols with Infinite States and Data Elaine Li et.al. 2411.05722 null
2024-11-08 Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation Peidong Liu et.al. 2411.05472 link
2024-11-08 From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $\alpha$-NeuS Haoran Zhang et.al. 2411.05362 link
2024-11-07 Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks? Jonathan Roberts et.al. 2411.05000 null
2024-11-06 Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation Teppei Kurita et.al. 2411.04714 null
2024-11-11 The Multiple Dimensions of Spuriousness in Machine Learning Samuel J. Bell et.al. 2411.04696 null
2024-11-07 Comparing Fairness of Generative Mobility Models Daniel Wang et.al. 2411.04453 null
2024-11-06 Topology Bench: Systematic Graph Based Benchmarking for Core Optical Networks Robin Matzner et.al. 2411.04160 null
2024-11-06 Optimizing Quantum Circuits, Fast and Slow Amanda Xu et.al. 2411.04104 null
2024-11-06 These Maps Are Made by Propagation: Adapting Deep Stereo Networks to Road Scenarios with Decisive Disparity Diffusion Chuang-Wei Liu et.al. 2411.03717 null
2024-11-06 Physical Layer Deception in OFDM Systems Wenwen Chen et.al. 2411.03677 null
2024-11-06 Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions Zihan Qin et.al. 2411.03638 null
2024-11-05 Exploring the Cybersecurity-Resilience Gap: An Analysis of Student Attitudes and Behaviors in Higher Education Steve Goliath et.al. 2411.03219 null
2024-11-05 Gender Differences in Comparative Advantage Matches: Evidence from Linked Employer-Employee Data Hugo Sant’Anna et.al. 2411.03209 null
2024-11-04 Designing and Evaluating Sampling Strategies for Multiple-Forecast Visualization (MFV) Ruishi Zou et.al. 2411.02576 null
2024-11-04 Gravitational wave energy spectral density properties from BPASS Galactic binary population in the Milky Way galaxy Petra Tang et.al. 2411.02563 null
2024-11-04 Neural optical flow for planar and stereo PIV Andrew I. Masker et.al. 2411.02373 null
2024-11-04 Can Personalized Medicine Coexist with Health Equity? Examining the Cost Barrier and Ethical Implications Kishi Kobe Yee Francisco et.al. 2411.02307 null
2024-11-04 Constructing Emergent U(1) Symmetries in the Gamma-prime $\left(\bf \Gamma^{\prime}\right)$ model Sagar Ramchandani et.al. 2411.02070 null
2024-11-04 Typicalness-Aware Learning for Failure Detection Yijun Liu et.al. 2411.01981 link
2024-11-04 A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding Yitong Dong et.al. 2411.01893 null
2024-11-03 Mitigating Matching Biases Through Score Calibration Mohammad Hossein Moslemi et.al. 2411.01685 link
2024-11-03 One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection Zhenyu Wang et.al. 2411.01584 null
2024-11-02 Visual Fourier Prompt Tuning Runjia Zeng et.al. 2411.01327 link
2024-11-02 On The Influence Of The Solar Wind On The Propagation Of Earth-impacting Coronal Mass Ejections Sandeep Kumar et.al. 2411.01165 null
2024-11-02 Why Does the Cortex Have Such a Vast Storage Capacity? Hui Wei et.al. 2411.01164 null
2024-10-31 Matchmaker: Self-Improving Large Language Model Programs for Schema Matching Nabeel Seedat et.al. 2410.24105 null
2024-10-31 A Multi-Modal Approach for Face Anti-Spoofing in Non-Calibrated Systems using Disparity Maps Ariel Larey et.al. 2410.24031 null
2024-10-31 Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts Xiang Deng et.al. 2410.23836 null
2024-10-30 Enhancing Image Resolution: A Simulation Study and Sensitivity Analysis of System Parameters for Resourcesat-3S/3SA Ankur Garg et.al. 2410.23319 null
2024-10-30 TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models Ziyao Shangguan et.al. 2410.23266 link
2024-10-30 Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe Songyu Xu et.al. 2410.23154 null
2024-10-30 FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training Tejaswini Medi et.al. 2410.23142 null
2024-10-30 Decarbonisation of industry and the energy system: exploring mutual impacts and investment planning Quentin Raillard-Cazanove et.al. 2410.23025 null
2024-10-30 Improving Musical Accompaniment Co-creation via Diffusion Transformers Javier Nistal et.al. 2410.23005 null
2024-10-30 Knowledge Graph Based Visual Search Application Pawandeep Kaur Betz et.al. 2410.22846 null
2024-10-30 Price Regulation, Technology and Provider Redistribution Piyush Akimitsu et.al. 2410.22616 null
2024-10-29 FairSkin: Fair Diffusion for Skin Disease Image Generation Ruichen Zhang et.al. 2410.22551 null
2024-10-29 From Silos to Systems: Process-Oriented Hazard Analysis for AI Systems Shalaleh Rismani et.al. 2410.22526 null
2024-10-29 Multimodal Structure Preservation Learning Chang Liu et.al. 2410.22520 null
2024-10-29 Relieving scale disparity in binary black hole simulations Nikolas A. Wittek et.al. 2410.22290 null
2024-10-29 Complex-Phase Extensions of Szegedy Quantum Walk on Graphs Sergio A. Ortega et.al. 2410.22011 null
2024-10-29 Photonic systolic array for all-optical matrix-matrix multiplication Jungmin Kim et.al. 2410.21671 null
2024-10-28 Intersectional inequalities in social networks Samuel Martin-Gutierrez et.al. 2410.21189 link
2024-10-28 Revealing the core-periphery structure of cities Federica Fanelli et.al. 2410.21133 null
2024-10-28 BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment Mehdi Hosseinzadeh et.al. 2410.20969 null
2024-10-28 The Zeno’s Paradox of ‘Low-Resource’ Languages Hellina Hailu Nigatu et.al. 2410.20817 null
2024-10-28 Faster WIND: Accelerating Iterative Best-of-$N$ Distillation for LLM Alignment Tong Yang et.al. 2410.20727 null
2024-10-28 Physics-Free Spectrally Multiplexed Photometric Stereo under Unknown Spectral Composition Satoshi Ikehata et.al. 2410.20716 link
2024-10-27 Language Models And A Second Opinion Use Case: The Pocket Professional David Noever et.al. 2410.20636 null
2024-10-27 TabDiff: a Multi-Modal Diffusion Model for Tabular Data Generation Juntong Shi et.al. 2410.20626 link
2024-10-27 Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks? Xuan He et.al. 2410.20533 link
2024-10-27 A Navier-Stokes asymptotic preserving Direct Simulation Monte Carlo method for multi-species gas flows Fei Fei et.al. 2410.20322 null
2024-10-25 DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems Muhammad Zaeem Shahzad et.al. 2410.19336 null
2024-10-24 Self-organized homogenization of flow networks Julien Bouvard et.al. 2410.19089 null
2024-10-24 Bridge-Coder: Unlocking LLMs’ Potential to Overcome Language Gaps in Low-Resource Code Jipeng Zhang et.al. 2410.18957 null
2024-10-27 Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis Liang Han et.al. 2410.18822 null
2024-10-24 Rigid Single-Slice-in-Volume registration via rotation-equivariant 2D/3D feature matching Stefan Brandstätter et.al. 2410.18683 null
2024-10-24 A Cranial-Feature-Based Registration Scheme for Robotic Micromanipulation Using a Microscopic Stereo Camera System Xiaofeng Lin et.al. 2410.18630 null
2024-10-24 Spatial-Temporal Search for Spiking Neural Networks Kaiwei Che et.al. 2410.18580 null
2024-10-24 Estimating early coronal mass ejection propagation direction with DIRECD during the severe May 8 and follow-up June 8, 2024 events Shantanu Jain et.al. 2410.18549 null
2024-10-24 Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction Hongxin Peng et.al. 2410.18433 null
2024-10-24 Large Language Models Reflect the Ideology of their Creators Maarten Buyl et.al. 2410.18417 link
2024-10-23 Pathological Rheology of Non-Stretching Entangled Polymers: Finite-Time Blow-Up Predictions Vickie Chen et.al. 2410.18306 null
2024-10-23 Rethinking Positive Pairs in Contrastive Learning Jiantao Wu et.al. 2410.18200 null
2024-10-23 Continual Learning on a Data Diet Elif Ceren Gok Yildirim et.al. 2410.17715 link
2024-10-23 Role of the argon and helium bath gases on the structure of H2/O2 detonations Farzane Zangene et.al. 2410.17561 null
2024-10-22 Characterizing Robocalls with Multiple Vantage Points Sathvik Prasad et.al. 2410.17361 null
2024-10-22 FairLoRA: Unpacking Bias Mitigation in Vision Models with Fairness-Driven Low-Rank Adaptation Rohan Sukumaran et.al. 2410.17358 null
2024-10-22 Dhoroni: Exploring Bengali Climate Change and Environmental Views with a Multi-Perspective News Dataset and Natural Language Processing Azmine Toushik Wasi et.al. 2410.17225 link
2024-10-22 Arabic Dataset for LLM Safeguard Evaluation Yasser Ashraf et.al. 2410.17040 link
2024-10-22 DENOASR: Debiasing ASRs through Selective Denoising Anand Kumar Rai et.al. 2410.16712 null
2024-10-21 GReFEL: Geometry-Aware Reliable Facial Expression Learning under Bias and Imbalanced Data Distribution Azmine Toushik Wasi et.al. 2410.15927 null
2024-10-21 Analysis of short-run and long-run marginal costs of generation in the power market Shamim Homaei et.al. 2410.15861 null
2024-10-20 A hybrid origin for the Martian atmosphere Kaveh Pahlevan et.al. 2410.15508 null
2024-10-20 Investigating the Impact of Age and Sex on Cataract Surgery Complications and Outcomes Hadas Ben-Eli Yaacov Cnaany et.al. 2410.15505 null
2024-10-20 CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts Malvina Nikandrou et.al. 2410.15453 link
2024-10-20 ActiveNeuS: Neural Signed Distance Fields for Active Stereo Kazuto Ichimaru et.al. 2410.15376 null
2024-10-19 A Semidefinite Relaxation Approach for Fair Graph Clustering Sina Baharlouei et.al. 2410.15233 link
2024-10-19 Smart-optimism. Uncovering the Resilience of Romanian City Halls in Online Service Delivery Catalin Vrabie et.al. 2410.15189 null
2024-10-19 Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation Seulbi Lee et.al. 2410.14975 null
2024-10-18 A Complexity-Based Theory of Compositionality Eric Elmoznino et.al. 2410.14817 null
2024-10-18 Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum Ryan Soh-Eun Shim et.al. 2410.14589 null
2024-10-18 Sim2real Cattle Joint Estimation in 3D point clouds Okour Mohammad et.al. 2410.14419 null
2024-10-18 Coded Water-Filling for Multi-User Interference Cancellation Yuan Li et.al. 2410.14136 null
2024-10-17 Auditing and Enforcing Conditional Fairness via Optimal Transport Mohsen Ghassemi et.al. 2410.14029 null
2024-10-17 A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models Qiaoyu Tang et.al. 2410.13841 null
2024-10-17 The Disparate Benefits of Deep Ensembles Kajetan Schweighofer et.al. 2410.13831 link
2024-10-18 Aggregation Artifacts in Subjective Tasks Collapse Large Language Models’ Posteriors Georgios Chochlakis et.al. 2410.13776 null
2024-10-17 Material Fingerprinting: Identifying and Predicting Perceptual Attributes of Material Appearance Jiri Filip et.al. 2410.13615 null
2024-10-17 SAda-Net: A Self-Supervised Adaptive Stereo Estimation CNN For Remote Sensing Image Data Dominik Hirner et.al. 2410.13500 link
2024-10-17 Inner ear morphology in wild versus laboratory house mice Sabrina Renaud et.al. 2410.13325 null
2024-10-17 Perceptions of Discriminatory Decisions of Artificial Intelligence: Unpacking the Role of Individual Characteristics Soojong Kim et.al. 2410.13250 null
2024-10-16 A Location Validation Technique to Mitigate GPS Spoofing Attacks in IEEE 802.11p based Fleet Operator’s Network of Electric Vehicles Ankita Samaddar et.al. 2410.13031 null
2024-10-16 Stability properties for subgroups generated by return words France Gheeraert et.al. 2410.12534 null
2024-10-16 Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention Weixuan Wang et.al. 2410.12462 link
2024-10-16 Real-time Stereo-based 3D Object Detection for Streaming Perception Changcai Li et.al. 2410.12394 link
2024-10-16 Pyramid-Driven Alignment: Pyramid Principle Guided Integration of Large Language Models and Knowledge Graphs Lei Sun et.al. 2410.12298 null
2024-10-15 A Software Engineering Capstone Course Facilitated By GitHub Templates Spencer Smith et.al. 2410.12114 null
2024-10-15 DAXA: Traversing the X-ray desert by Democratising Archival X-ray Astronomy David J. Turner et.al. 2410.11954 link
2024-10-15 Adaptive Coordinators and Prompts on Heterogeneous Graphs for Cross-Domain Recommendations Hengyu Zhang et.al. 2410.11719 null
2024-10-15 Multiple scales homogenisation of a porous viscoelastic material with rigid inclusions: application to lithium-ion battery electrodes J. M. Foster et.al. 2410.11699 null
2024-10-16 Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture Dabbrata Das et.al. 2410.11610 link
2024-10-15 Towards a Healthy AI Tradition: Lessons from Biology and Biomedical Science Simon Kasif et.al. 2410.11590 null
2024-10-15 MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields Yuru Xiao et.al. 2410.11394 null
2024-10-15 Improving Bias in Facial Attribute Classification: A Combined Impact of KL Divergence induced Loss Function and Dual Attention Shweta Patel et.al. 2410.11176 null
2024-10-14 Solving the Transient Dyson Equation with Quasilinear Complexity via Matrix Compression Baptiste Lamic et.al. 2410.11057 null
2024-10-14 Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation Emmanouil Zaranis et.al. 2410.10995 link
2024-10-14 Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation Peiwen Sun et.al. 2410.10676 null
2024-10-14 MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator Taozhe Li et.al. 2410.10669 null
2024-10-14 Double Jeopardy and Climate Impact in the Use of Large Language Models: Socio-economic Disparities and Reduced Utility for Non-English Speakers Aivin V. Solatorio et.al. 2410.10665 link
2024-10-14 Energetic Analysis of Emerging Quantum Communication Protocols Raja Yehia et.al. 2410.10661 link
2024-10-14 Dual-Path Mechanism of Amino Acid Racemization Mediated by Quantum Mechanical Tunneling Xinrui Yang et.al. 2410.10544 null
2024-10-14 Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world Han Ling et.al. 2410.10453 link
2024-10-14 Minimum Tuning to Unlock Long Output from LLMs with High Quality Data as the Key Yingda Chen et.al. 2410.10210 null
2024-10-13 Robust 3D Point Clouds Classification based on Declarative Defenders Kaidong Li et.al. 2410.09691 link
2024-10-12 Scito2M: A 2 Million, 30-Year Cross-disciplinary Dataset for Temporal Scientometric Analysis Yiqiao Jin et.al. 2410.09510 link
2024-10-12 Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors Hritam Basak et.al. 2410.09467 null
2024-10-11 Efficient Multi-Object Tracking on Edge Devices via Reconstruction-Based Channel Pruning Jan Müller et.al. 2410.08769 null
2024-10-11 No Tick-Size Too Small: A General Method for Modelling Small Tick Limit Order Books Konark Jain et.al. 2410.08744 null
2024-10-11 Bio-inspired reconfigurable stereo vision for robotics using omnidirectional cameras Suchang Chen et.al. 2410.08691 null
2024-10-10 PubMed knowledge graph 2.0: Connecting papers, patents, and clinical trials in biomedical science Jian Xu et.al. 2410.07969 null
2024-10-10 Determining the Magnetic Field in the Galactic Plane from New Arecibo Pulsar Faraday Rotation Measurements Alice P. Curtin et.al. 2410.07967 null
2024-10-10 A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways Jing Su et.al. 2410.07915 null
2024-10-10 Multi-Scale Deformable Transformers for Student Learning Behavior Detection in Smart Classroom Zhifeng Wang et.al. 2410.07834 null
2024-10-09 ACDC: Automated Creation of Digital Cousins for Robust Policy Learning Tianyuan Dai et.al. 2410.07408 null
2024-10-09 Enhancing Performance of Point Cloud Completion Networks with Consistency Loss Kevin Tirta Wijaya et.al. 2410.07298 null
2024-10-09 IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation Xinchen Zhang et.al. 2410.07171 link
2024-10-10 Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology Xiangyu Wang et.al. 2410.07087 null
2024-10-09 CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models Zi Gong et.al. 2410.06741 link
2024-10-09 Analysis of different disparity estimation techniques on aerial stereo image datasets Ishan Narayan et.al. 2410.06711 null
2024-10-09 Decomposing Relationship from 1-to-N into N 1-to-1 for Text-Video Retrieval Jian Xiao et.al. 2410.06618 link
2024-10-09 The Sampling-Gaussian for stereo matching Baiyu Pan et.al. 2410.06527 null
2024-10-09 OledFL: Unleashing the Potential of Decentralized Federated Learning via Opposite Lookahead Enhancement Qinglun Li et.al. 2410.06482 null
2024-10-08 Skin Cancer Machine Learning Model Tone Bias James Pope et.al. 2410.06385 null
2024-10-08 HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction Shengji Tang et.al. 2410.06245 null
2024-10-08 BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way Jiazi Bu et.al. 2410.06241 null
2024-10-07 Studying and Mitigating Biases in Sign Language Understanding Models Katherine Atwell et.al. 2410.05206 null
2024-10-07 Enhancing Equity in Large Language Models for Medical Applications Yuelyu Ji et.al. 2410.05180 link
2024-10-07 Presto! Distilling Steps and Layers for Accelerating Music Generation Zachary Novack et.al. 2410.05167 null
2024-10-07 Correcting for Popularity Bias in Recommender Systems via Item Loss Equalization Juno Prent et.al. 2410.04830 null
2024-10-07 The divide between us: Internet access among people with and without disabilities in the post-pandemic era Edgar Pacheco et.al. 2410.04825 null
2024-10-06 Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives Carolina Veiga et.al. 2410.04318 null
2024-10-05 Fast Object Detection with a Machine Learning Edge Device Richard C. Rodriguez et.al. 2410.04173 null
2024-10-05 High-Speed Stereo Visual SLAM for Low-Powered Computing Devices Ashish Kumar et.al. 2410.04090 link
2024-10-05 Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy Pengcheng Chen et.al. 2410.04041 null
2024-10-04 Improving Arabic Multi-Label Emotion Classification using Stacked Embeddings and Hybrid Loss Function Nisar Ahmed et.al. 2410.03979 link
2024-10-04 Noncollinear ferrielectricity and hydrogen-induced ferromagnetic polar half-metallicity in MnO$_3$Cl Xinyu Yang et.al. 2410.03220 null
2024-10-03 Q-SCALE: Quantum computing-based Sensor Calibration for Advanced Learning and Efficiency Lorenzo Bergadano et.al. 2410.02998 null
2024-10-03 Individuation of 3D perceptual units from neurogeometry of binocular cells Maria Virginia Bolelli et.al. 2410.02870 null
2024-10-03 Pseudo-Stereo Inputs: A Solution to the Occlusion Challenge in Self-Supervised Stereo Matching Ruizhi Yang et.al. 2410.02534 link
2024-10-03 Cooperative Semantic Knowledge Base Update Policy for Multiple Semantic Communication Pairs Shuling Li et.al. 2410.02405 null
2024-10-03 Extracting the Potential of Emerging Hardware Accelerators for Symmetric Eigenvalue Decomposition Hansheng Wang et.al. 2410.02170 null
2024-10-03 Quantum Mutual Information in Time James Fullwood et.al. 2410.02137 null
2024-10-04 C-MELT: Contrastive Enhanced Masked Auto-Encoders for ECG-Language Pre-Training Manh Pham et.al. 2410.02131 link
2024-10-02 Unified space-time description of pulsed twin beams Alessandra Gatti et.al. 2410.01907 null
2024-10-02 Conformal Prediction Sets Can Cause Disparate Impact Jesse C. Cresswell et.al. 2410.01888 link
2024-10-02 A Novel Framework of Horizontal-Vertical Hybrid Federated Learning for EdgeIoT Kai Li et.al. 2410.01644 null
2024-10-02 Fair Class-Incremental Learning using Sample Weighting Jaeyoung Park et.al. 2410.01324 null
2024-10-02 SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network Ahmed Tawfik Aboukhadra et.al. 2410.01293 null
2024-10-02 Unifying the Scope of Bridging Anaphora Types in English: Bridging Annotations in ARRAU and GUM Lauren Levine et.al. 2410.01170 null
2024-10-01 M2P2: A Multi-Modal Passive Perception Dataset for Off-Road Mobility in Extreme Low-Light Conditions Aniket Datar et.al. 2410.01105 null
2024-10-01 A catalog of multi-vantage point observations of type-II bursts: Statistics and correlations Atul Mohan et.al. 2410.00814 null
2024-10-01 CME-associated type-IV radio bursts: The solar paradigm and the unique case of AD Leo Atul Mohan et.al. 2410.00787 null
2024-10-01 What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study Beatrice Savoldi et.al. 2410.00545 link
2024-10-01 Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration Yida Lin et.al. 2410.00503 null
2024-09-30 ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning Jian Shi et.al. 2410.00262 link
2024-09-30 Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection Yubin Wang et.al. 2409.20558 null
2024-09-30 Match Stereo Videos via Bidirectional Alignment Junpeng Jing et.al. 2409.20283 null
2024-09-30 Understanding How Psychological Distance Influences User Preferences in Conversational Versus Web Search Yitian Yang et.al. 2409.19982 null
2024-09-30 Positive-Sum Fairness: Leveraging Demographic Attributes to Achieve Fair AI Outcomes Without Sacrificing Group Gains Samia Belhadj et.al. 2409.19940 null
2024-09-29 Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems Xuyang Wu et.al. 2409.19804 link
2024-09-29 Fast-Convergent and Communication-Alleviated Heterogeneous Hierarchical Federated Learning in Autonomous Driving Wei-Bin Kou et.al. 2409.19560 null
2024-09-29 Transforming Scholarly Landscapes: Influence of Large Language Models on Academic Fields beyond Computer Science Aniket Pramanick et.al. 2409.19508 link
2024-09-29 KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation Soofiyan Atar et.al. 2409.19490 null
2024-10-01 Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models Seongmin Lee et.al. 2409.19382 null
2024-09-27 Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping Anthony A. Song et.al. 2409.19153 null
2024-09-27 LW2G: Learning Whether to Grow for Prompt-based Continual Learning Qian Feng et.al. 2409.18860 link
2024-09-27 Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation Chaomin Shen et.al. 2409.18785 null
2024-09-27 Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds Hanbin Bae et.al. 2409.18705 null
2024-09-27 Analysis of commissioning data from SST-1M: A Prototype of Single-Mirror Small Size Telescope Thomas Tavernier et.al. 2409.18639 null
2024-09-27 ChARLES: Change-Aware Recovery of Latent Evolution Semantics in Relational Data Shiyi He et.al. 2409.18386 null
2024-09-26 Realistic Evaluation of Model Merging for Compositional Generalization Derek Tam et.al. 2409.18314 link
2024-09-26 Revisit Anything: Visual Place Recognition via Image Segment Retrieval Kartik Garg et.al. 2409.18049 link
2024-09-26 LGFN: Lightweight Light Field Image Super-Resolution using Local Convolution Modulation and Global Attention Feature Extraction Zhongxin Yu et.al. 2409.17759 null
2024-09-26 Efficient Bias Mitigation Without Privileged Information Mateo Espinosa Zarlenga et.al. 2409.17691 null
2024-09-26 Event-based Stereo Depth Estimation: A Survey Suman Ghosh et.al. 2409.17680 null
2024-09-26 Improving Fast Adversarial Training via Self-Knowledge Guidance Chengze Jiang et.al. 2409.17589 null
2024-09-26 Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Integrating SGBM and Segmentation Models Yida Lin et.al. 2409.17526 null
2024-09-26 Characteristics of Powerful Radio Galaxies Chandra B. Singh et.al. 2409.17514 null
2024-09-26 Active Vision Might Be All You Need: Exploring Active Vision in Bimanual Robotic Manipulation Ian Chuang et.al. 2409.17435 link
2024-09-25 NTIRE 2024 Challenge on Stereo Image Super-Resolution: Methods and Results Longguang Wang et.al. 2409.16947 null
2024-09-25 The diverse star formation histories of early massive, quenched galaxies in modern galaxy formation simulations Claudia del P. Lagos et.al. 2409.16916 link
2024-09-25 Pruning Multilingual Large Language Models for Multilingual Inference Hwichan Kim et.al. 2409.16911 link
2024-09-25 An Adaptive Screen-Space Meshing Approach for Normal Integration Moritz Heep et.al. 2409.16907 null
2024-09-25 GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning Zhe-Rui Yang et.al. 2409.16670 link
2024-09-25 Task-driven SLAM Benchmarking Yanwei Du et.al. 2409.16573 link
2024-09-24 Camera Calibration and Stereo via a Single Image of a Spherical Mirror Nissim Barzilay et.al. 2409.16386 null
2024-09-24 Transient bubble rising in the presence of a surfactant at very low concentrations D. Fernández-Martínez et.al. 2409.16029 null
2024-09-24 AutoCE: An Accurate and Efficient Model Advisor for Learned Cardinality Estimation Jintao Zhang et.al. 2409.16027 null
2024-09-24 NER-Luxury: Named entity recognition for the fashion and luxury domain Akim Mousterou et.al. 2409.15804 null
2024-09-24 Identified-and-Targeted: The First Early Evidence of the Privacy-Invasive Use of Browser Fingerprinting for Online Tracking Zengrui Liu et.al. 2409.15656 null
2024-09-23 Rethinking Emotion Bias in Music via Frechet Audio Distance Yuanchao Li et.al. 2409.15545 link
2024-09-23 Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras Ming Li et.al. 2409.14766 null
2024-09-23 An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding Wei-Bin Kou et.al. 2409.14737 null
2024-09-22 Exploring Multilingual Probing in Large Language Models: A Cross-Language Analysis Daoyang Li et.al. 2409.14459 null
2024-09-22 Nonmodal stability analysis of the plane Poiseuille flow in a multilayer porous-fluid channel Supriya Karmakar et.al. 2409.14420 null
2024-09-22 MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting Chen Tessler et.al. 2409.14393 null
2024-09-23 Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping Jaehyung Jung et.al. 2409.12051 null
2024-09-18 SymFace: Additional Facial Symmetry Loss for Deep Face Recognition Pritesh Prakash et.al. 2409.11816 null
2024-09-17 A Pileup of Coronal Mass Ejections Produced the Largest Geomagnetic Storm in Two Decades Ying D. Liu et.al. 2409.11492 null
2024-09-17 A generalized non-hourglass updated Lagrangian formulation for SPH solid dynamics Shuaihao Zhang et.al. 2409.11474 null
2024-09-17 Connecting the Low to High Corona: Propagating Disturbances as Tracers of the Near-Sun Solar Wind Nathalia Alzate et.al. 2409.11352 null
2024-09-17 The SST-1M imaging atmospheric Cherenkov telescope for gamma-ray astrophysics C. Alispach et.al. 2409.11310 null
2024-09-17 SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models with Customisable Fairness Calibration Xin Guan et.al. 2409.11149 link
2024-09-17 Optimal Investment under the Influence of Decision-changing Imitation Huisheng Wang et.al. 2409.10933 null
2024-09-16 GPT takes the SAT: Tracing changes in Test Difficulty and Math Performance of Students Vikram Krishnaveti et.al. 2409.10750 null
2024-09-16 Exploring 3D Face Reconstruction and Fusion Methods for Face Verification: A Case-Study in Video Surveillance Simone Maurizio La Cava et.al. 2409.10481 null
2024-09-16 uniGasFoam: a particle-based OpenFOAM solver for multiscale rarefied gas flows Nikos Vasileiadis et.al. 2409.10288 null
2024-09-16 SOLVR: Submap Oriented LiDAR-Visual Re-Localisation Joshua Knights et.al. 2409.10247 null
2024-09-16 RF-GML: Reference-Free Generative Machine Listener Arijit Biswas et.al. 2409.10210 null
2024-09-16 DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection Kun Fang et.al. 2409.10094 null
2024-09-16 Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments Wessel Ledder et.al. 2409.10048 link
2024-09-15 Estimating Wage Disparities Using Foundation Models Keyon Vafa et.al. 2409.09894 null
2024-09-15 A Benchmark Dataset with Larger Context for Non-Factoid Question Answering over Islamic Text Faiza Qamar et.al. 2409.09844 null
2024-09-15 Introducing DAIMYO: a first-time-right dynamic design architecture and its application to tail-sitter UAS development Jolan Wauters et.al. 2409.09820 null
2024-09-14 An Augmentation-based Model Re-adaptation Framework for Robust Image Segmentation Zheming Zuo et.al. 2409.09530 null
2024-09-13 ClearDepth: Enhanced Stereo Perception of Transparent Objects for Robotic Manipulation Kaixin Bai et.al. 2409.08926 null
2024-09-12 The Impact of Large Language Models on Open-source Innovation: Evidence from GitHub Copilot Doron Yeverechyahu et.al. 2409.08379 null
2024-09-12 Reducing Population-level Inequality Can Improve Demographic Group Fairness: a Twitter Case Study Avijit Ghosh et.al. 2409.08135 null
2024-09-12 FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments Devansh Dhrafani et.al. 2409.07715 null
2024-09-12 Modeling Information Narrative Detection and Evolution on Telegram during the Russia-Ukraine War Patrick Gerard et.al. 2409.07684 null
2024-09-11 Unsupervised anomaly detection in spatio-temporal stream network sensor data Edgar Santos-Fernandez et.al. 2409.07667 null
2024-09-11 Object Depth and Size Estimation using Stereo-vision and Integration with SLAM Layth Hamad et.al. 2409.07623 null
2024-09-11 Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs Sadra Safadoust et.al. 2409.07456 null
2024-09-11 StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos Sijie Zhao et.al. 2409.07447 null
2024-09-11 Towards Fairer Health Recommendations: finding informative unbiased samples via Word Sense Disambiguation Gavin Butts et.al. 2409.07424 null
2024-09-11 The microbiome science of composting and human excrement composting: a review Jeff Meilander et.al. 2409.07376 null
2024-09-11 Constraining Genetic Symbolic Regression via Semantic Backpropagation Maximilian Reissmann et.al. 2409.07369 link
2024-09-11 MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications Praveen K Kanithi et.al. 2409.07314 null
2024-09-11 Learning Personalized Scoping for Graph Neural Networks under Heterophily Gangda Deng et.al. 2409.06998 link
2024-09-11 Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention Wenhao Zhao et.al. 2409.06985 null
2024-09-10 A Quality Diversity Approach to Automatically Generate Multi-Agent Path Finding Benchmark Maps Cheng Qian et.al. 2409.06888 null
2024-09-10 Adversarial Attacks to Multi-Modal Models Zhihao Dou et.al. 2409.06793 null
2024-09-10 Synchronization of wave-propelled capillary spinners Jack-William Barotta et.al. 2409.06652 link
2024-09-10 Quantum-like approaches unveil the intrinsic limits of predictability in compartmental models José Alejandro Rojas-Venegas et.al. 2409.06438 null
2024-09-09 LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo Wei Zhi Tang et.al. 2409.06104 link
2024-09-09 Online 3D reconstruction and dense tracking in endoscopic videos Michel Hayoz et.al. 2409.06037 link
2024-09-09 Dust-UV offsets in high-redshift galaxies in the Cosmic Dawn III simulation Pierre Ocvirk et.al. 2409.05946 null
2024-09-09 The Influence of Task and Group Disparities over Users’ Attitudes Toward Using Large Language Models for Psychotherapy Qihang He et.al. 2409.05703 null
2024-09-09 LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow Hongyu Wen et.al. 2409.05688 null
2024-09-09 Adaptive Offloading and Enhancement for Low-Light Video Analytics on Mobile Devices Yuanyi He et.al. 2409.05297 null
2024-09-08 PatchAlign: Fair and Accurate Skin Disease Image Classification by Alignment with Clinical Labels Aayushman et.al. 2409.04975 link
2024-09-10 Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios Zhiqiang Chen et.al. 2409.04961 link
2024-09-08 A Hetero-functional Graph Resilience Analysis for Convergent Systems-of-Systems Amro M. Farid et.al. 2409.04936 null
2024-09-06 A Short Survey on Set-Based Aggregation Techniques for Single-Vector WSI Representation in Digital Pathology S. Hemati et.al. 2409.04615 null
2024-09-06 AGR: Age Group fairness Reward for Bias Mitigation in LLMs Shuirong Cao et.al. 2409.04340 null
2024-09-06 Calibration of Network Confidence for Unsupervised Domain Adaptation Using Estimated Accuracy Coby Penso et.al. 2409.04241 link
2024-09-06 Confidence-Aware Document OCR Error Detection Arthur Hemmer et.al. 2409.04117 null
2024-09-06 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors Yujun Huang et.al. 2409.04013 link
2024-09-05 An analysis of spectroscopic, seismological, astrometric, and photometric masses of pulsating white dwarf stars Leila M. Calcaferro et.al. 2409.03896 null
2024-09-05 LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors Hanyang Yu et.al. 2409.03456 null
2024-09-05 Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities Wei Lu et.al. 2409.03444 link
2024-09-04 Fast algorithms to improve fair information access in networks Dennis Robert Windham et.al. 2409.03127 link
2024-09-04 Incorporating dense metric depth into neural 3D representations for view synthesis and relighting Arkadeep Narayan Chaudhury et.al. 2409.03061 null
2024-09-04 UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views Jiaxin Guo et.al. 2409.02917 link
2024-09-04 MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling Jihye Ahn et.al. 2409.02846 null
2024-09-04 Deep Learning Meets Satellite Images – An Evaluation on Handcrafted and Learning-based Features for Multi-date Satellite Stereo Images Shuang Song et.al. 2409.02825 null
2024-09-04 Experimental Framework for Generating Reliable Ground Truth for Laryngeal Spatial Segmentation Tasks Hamzeh Ghasemzadeh et.al. 2409.02809 null
2024-09-04 UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching Soomin Kim et.al. 2409.02545 null
2024-09-04 Demographic parity in regression and classification within the unawareness framework Vincent Divol et.al. 2409.02471 null
2024-09-04 Unified Framework with Consistency across Modalities for Human Activity Recognition Tuyen Tran et.al. 2409.02385 link
2024-09-03 Collaboratively Learning Federated Models from Noisy Decentralized Data Haoyuan Li et.al. 2409.02189 null
2024-09-03 Taming Randomness in Agent-Based Models using Common Random Numbers Daniel J. Klein et.al. 2409.02086 link
2024-09-03 Observing Context Improves Disparity Estimation when Race is Unobserved Kweku Kwegyir-Aggrey et.al. 2409.01984 null
2024-08-30 Semi-supervised permutation invariant particle-level anomaly detection Gabriel Matos et.al. 2408.17409 link
2024-08-30 Fairness-Aware Estimation of Graphical Models Zhuoping Zhou et.al. 2408.17396 link
2024-08-30 BioBricks.ai: A Versioned Data Registry for Life Sciences Data Assets Yifan Gao et.al. 2408.17320 null
2024-08-30 Accelerating the discovery of steady-states of planetary interior dynamics with machine learning Siddhant Agarwal et.al. 2408.17298 null
2024-08-30 A Generic and Automated Methodology to Simulate Melting Point Fu-Zhi Dai et.al. 2408.17270 null
2024-08-30 Self-supervised learning for crystal property prediction via denoising Alexander New et.al. 2408.17255 null
2024-08-30 EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs Zhen Fan et.al. 2408.17168 null
2024-08-30 FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition Chen Hu et.al. 2408.17090 link
2024-08-29 STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models Koushik Srivatsan et.al. 2408.16807 link
2024-08-30 ARINC 429 Cyber-vulnerabilities and Voltage Data in a Hardware-in-the-Loop Simulator Connor Trask et.al. 2408.16714 null
2024-08-29 Fibrations of algebras Danel Ahman et.al. 2408.16581 null
2024-08-29 Spurfies: Sparse Surface Reconstruction using Local Geometry Priors Kevin Raj et.al. 2408.16544 null
2024-08-29 Physical Similarity of Fluid Flow in Bimodal Porous Media: Part 1 – Basic Model and Solution Characteristics Yuhe Wang et.al. 2408.16434 null
2024-08-28 Simulation and analysis of a high-k electron scale turbulence diagnostic for MAST-U David C. Speirs et.al. 2408.15807 null
2024-08-28 Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions Huachuan Qiu et.al. 2408.15787 link
2024-08-30 Addressing the challenges of loop detection in agricultural environments Nicolás Soncini et.al. 2408.15761 link
2024-08-28 ES-PTAM: Event-based Stereo Parallel Tracking and Mapping Suman Ghosh et.al. 2408.15605 link
2024-08-27 Regional emission dynamics across phases of the EU ETS Marco Dueñas et.al. 2408.15438 null
2024-08-27 Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty Saining Zhang et.al. 2408.15242 link
2024-08-27 Learning-based Multi-View Stereo: A Survey Fangjinhua Wang et.al. 2408.15235 null
2024-08-27 Investigating Coverage Criteria in Large Language Models: An In-Depth Study Through Jailbreak Attacks Shide Zhou et.al. 2408.15207 null
2024-08-27 Strategic Optimization and Challenges of Large Language Models in Object-Oriented Programming Zinan Wang et.al. 2408.14834 null
2024-08-26 Towards Graph Prompt Learning: A Survey and Beyond Qingqing Long et.al. 2408.14520 null
2024-08-26 Predictability and Causality in Spanish and English Natural Language Generation Andrea Busto-Castiñeira et.al. 2408.14283 null
2024-08-26 Harnessing the Digital Revolution: A Comprehensive Review of mHealth Applications for Remote Monitoring in Transforming Healthcare Delivery Avnish Singh Jat et.al. 2408.14190 null
2024-08-26 ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation Ruohua Shi et.al. 2408.14114 null
2024-08-26 Bengali Sign Language Recognition through Hand Pose Estimation using Multi-Branch Spatial-Temporal Attention Model Abu Saleh Musa Miah et.al. 2408.14111 null
2024-08-26 Fast Edge-Aware Occlusion Detection in the Context of Multispectral Camera Arrays Frank Sippel et.al. 2408.14050 link
2024-08-26 More Pictures Say More: Visual Intersection Network for Open Set Object Detection Bingcheng Dong et.al. 2408.14032 null
2024-08-25 Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs Brandon Smart et.al. 2408.13912 null
2024-08-24 Submodular Maximization Approaches for Equitable Client Selection in Federated Learning Andrés Catalino Castillo Jiménez et.al. 2408.13683 null
2024-08-24 Outlier Detection Bias Busted: Understanding Sources of Algorithmic Bias through Data-centric Factors Xueying Ding et.al. 2408.13667 null
2024-08-23 HEK-Omics: The promise of omics to optimize HEK293 for recombinant adeno-associated virus (rAAV) gene therapy manufacturing Sai Guna Ranjan Gurazada et.al. 2408.13374 null
2024-08-23 Deep Learning at the Intersection: Certified Robustness as a Tool for 3D Vision Gabriel Pérez S et.al. 2408.13135 null
2024-08-23 VCEMO: Multi-Modal Emotion Recognition for Chinese Voiceprints Jinghua Tang et.al. 2408.13019 null
2024-08-23 Ada2I: Enhancing Modality Balance for Multimodal Conversational Emotion Recognition Cam-Van Thi Nguyen et.al. 2408.12895 null
2024-08-23 Refining the isovector component of the Woods-Saxon potential L. Xayavong et.al. 2408.12794 null
2024-08-22 Disentangled Structural and Featural Representation for Task-Agnostic Graph Valuation Ali Falahati et.al. 2408.12659 null
2024-08-22 The Hybrid Hospital: Balancing On-Site and Remote Hospitalization Noa Zychlinski et.al. 2408.12431 null
2024-08-22 Multi-Style Facial Sketch Synthesis through Masked Generative Modeling Bowen Sun et.al. 2408.12400 null
2024-08-22 Aligning (Medical) LLMs for (Counterfactual) Fairness Raphael Poulain et.al. 2408.12055 link
2024-08-21 Electrostatic Origins of the Dirichlet Principle Steven Deckelman et.al. 2408.12002 null
2024-08-21 Time-Dependent Strategy for Improving Aortic Blood Flow Simulations with Boundary Control and Data Assimilation Muhammad Adnan Anwar et.al. 2408.11617 null
2024-08-21 A Novel $δ$-SBM-OPA Approach for Policy-Driven Analysis of Carbon Emission Efficiency under Uncertainty in the Chinese Industrial Sector Shutian Cui et.al. 2408.11600 null
2024-08-21 GSTran: Joint Geometric and Semantic Coherence for Point Cloud Segmentation Abiao Li et.al. 2408.11558 link
2024-08-21 Mutagenesis screen to map the functionals of parameters of Large Language Models Yue Hu et.al. 2408.11494 link
2024-08-20 Quantum Inverse Contextual Vision Transformers (Q-ICVT): A New Frontier in 3D Object Detection for AVs Sanjay Bhargav Dharavath et.al. 2408.11207 link
2024-08-20 SDI-Net: Toward Sufficient Dual-View Interaction for Low-light Stereo Image Enhancement Linlin Hu et.al. 2408.10934 null
2024-08-20 A Noncontact Technique for Wave Measurement Based on Thermal Stereography and Deep Learning Deyu Li et.al. 2408.10670 null
2024-08-20 Multi-view Hand Reconstruction with a Point-Embedded Transformer Lixin Yang et.al. 2408.10581 link
2024-08-19 Customizing Language Models with Instance-wise LoRA for Sequential Recommendation Xiaoyu Kong et.al. 2408.10159 link
2024-08-19 Envisioning Possibilities and Challenges of AI for Personalized Cancer Care Elaine Kong et.al. 2408.10108 null
2024-08-19 ARMADA: Attribute-Based Multimodal Data Augmentation Xiaomeng Jin et.al. 2408.10086 null
2024-08-19 Helical edge modes in a triangular Heisenberg antiferromagnet Bastian Pradenas et.al. 2408.10062 null
2024-08-19 Bridging the Language Gap: Enhancing Multilingual Prompt-Based Code Generation in LLMs via Zero-Shot Cross-Lingual Transfer Mingda Li et.al. 2408.09701 null
2024-08-17 Intuitive Human-Robot Interface: A 3-Dimensional Action Recognition and UAV Collaboration Framework Akash Chaudhary et.al. 2408.09232 null
2024-08-17 TableBench: A Comprehensive and Complex Benchmark for Table Question Answering Xianjie Wu et.al. 2408.09174 null
2024-08-17 GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation Weiming Zhang et.al. 2408.09115 null
2024-08-17 Depth-guided Texture Diffusion for Image Semantic Segmentation Wei Sun et.al. 2408.09097 null
2024-08-17 From Urban Clusters to Megaregions: Mapping Australia’s Evolving Urban Regions M. K. M Ng et.al. 2408.09054 null
2024-08-16 An Empirical Examination of Balancing Strategy for Counterfactual Estimation on Time Series Qiang Huang et.al. 2408.08815 null
2024-08-16 CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving Shihan Peng et.al. 2408.08500 null
2024-08-16 Fishers Harvest Parallel Unlearning in Inherited Model Networks Xiao Liu et.al. 2408.08493 null
2024-08-15 Comparing NASA Discovery and New Frontiers Class Mission Concepts for the Io Volcano Observer (IVO) Christopher W. Hamilton et.al. 2408.08334 null
2024-08-15 Cluster Formations of Free and Congested Flows in Urban Road Networks Yongsung Kwon et.al. 2408.08122 null
2024-08-15 Motif analysis and passing behavior in football passing networks Ming-Xia Li et.al. 2408.07927 null
2024-08-14 Polarization dynamics: a study of individuals shifting between political communities on social media Federico Albanese et.al. 2408.07731 null
2024-08-14 Hierarchical Working Memory and a New Magic Number Weishun Zhong et.al. 2408.07637 null
2024-08-14 Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks Liting Jiang et.al. 2408.07613 null
2024-08-15 DiffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution Yuanbo Zhou et.al. 2408.07516 null
2024-08-14 M2L Translation Operators for Kernel Independent Fast Multipole Methods on Modern Architectures Srinath Kailasa et.al. 2408.07436 null
2024-08-14 Unsupervised Stereo Matching Network For VHR Remote Sensing Images Based On Error Prediction Liting Jiang et.al. 2408.07419 link
2024-08-14 MorphFader: Enabling Fine-grained Controllable Morphing with Text-to-Audio Models Purnima Kamath et.al. 2408.07260 null
2024-08-12 Quantized Redshift and its significance for recent observations Arindam Mal et.al. 2408.07101 null
2024-08-13 The News Comment Gap and Algorithmic Agenda Setting in Online Forums Flora Böwing et.al. 2408.07052 link
2024-08-13 Quantifying the checkerboard problem to reduce numerical dissipation Johannes Arend Hopman et.al. 2408.06821 null
2024-08-12 Observation of vortex stripes in UTe$_2$ Y. F. Wang et.al. 2408.06209 null
2024-08-12 IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI Yash Rampuria et.al. 2408.06113 null
2024-08-12 Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models Haifan Gong et.al. 2408.05985 null
2024-08-11 Predictors and Socio-Demographic Disparities in STEM Degree Outcomes: A ten-year UK study using Hierarchical Logistic Regression Andrew M. Low et.al. 2408.05853 null
2024-08-10 EV-MGDispNet: Motion-Guided Event-Based Stereo Disparity Estimation Network with Left-Right Consistency Junjie Jiang et.al. 2408.05452 null
2024-08-08 LiDAR-Event Stereo Fusion with Hallucinations Luca Bartolomei et.al. 2408.04633 link
2024-08-08 Charmed hypernuclei within density-dependent relativistic mean-field theory Wei Yang et.al. 2408.04527 null
2024-08-08 A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery Mengya Xu et.al. 2408.04426 link
2024-08-07 A Framework for Assessing Cumulative Exposure to Extreme Temperatures During Transit Trip Huiying Fan et.al. 2408.04081 null
2024-08-07 A Comparison of Fireball Luminous Efficiency Models using Acoustic Records Luke McFadden et.al. 2408.04078 null
2024-08-07 A Blockchain-based Reliable Federated Meta-learning for Metaverse: A Dual Game Framework Emna Baccour et.al. 2408.03694 null
2024-08-07 TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization Kien T. Pham et.al. 2408.03637 null
2024-08-07 Unlocking Exocentric Video-Language Data for Egocentric Video Representation Learning Zi-Yi Dou et.al. 2408.03567 null
2024-08-07 D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods Onkar Susladkar et.al. 2408.03558 link
2024-08-07 Opening the Black Box of 3D Reconstruction Error Analysis with VECTOR Racquel Fygenson et.al. 2408.03503 link
2024-08-06 Transit Rider Heat Stress in Atlanta, GA under Current and Future Climate Scenarios Huiying Fan et.al. 2408.03457 null
2024-08-06 Fusing Forces: Deep-Human-Guided Refinement of Segmentation Masks Rafael Sterzinger et.al. 2408.03304 link
2024-08-06 Measuring interconnectedness of infectious diseases in funded and unfunded research: a temporal network analysis on bibliometric data 1995-2022 Anbang Du et.al. 2408.03140 null
2024-08-06 Predictive Performance Test based on the Exhaustive Nested Cross-Validation for High-dimensional data Iris Ivy Gauran et.al. 2408.03138 null
2024-08-06 Interoperability and Explicable AI-based Zero-Day Attacks Detection Process in Smart Community Mohammad Sayduzzaman et.al. 2408.02921 null
2024-08-05 Phase Transitions in Anisotropic Turbulence Adrian van Kan et.al. 2408.02844 null
2024-08-05 Gaussian Mixture based Evidential Learning for Stereo Matching Weide Liu et.al. 2408.02796 null
2024-08-04 Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image Xinlin Ren et.al. 2408.02079 link
2024-08-04 PanicleNeRF: low-cost, high-precision in-field phenotyping of rice panicles with smartphone Xin Yang et.al. 2408.02053 null
2024-08-03 Are EU low-carbon structural funds efficient in reducing emissions? Marco Dueñas et.al. 2408.01782 null
2024-08-03 MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas Feng Qiao et.al. 2408.01653 null
2024-08-06 Three-dimensional Morphological Reconstruction of Millimeter-Scale Soft Continuum Robots based on Dual-Stereo-Vision Tian-Ao Ren et.al. 2408.01615 null
2024-08-02 Decentralized Smoothing ADMM for Quantile Regression with Non-Convex Sparse Penalties Reza Mirzaeifard et.al. 2408.01307 null
2024-08-02 The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models Hannah Chen et.al. 2408.01285 null
2024-08-01 High-Impact Innovations and Hidden Gender Disparities in Inventor-Evaluator Networks Tara Sowrirajan et.al. 2408.00905 null
2024-08-01 Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection Ruiyang Zhang et.al. 2408.00619 link
2024-07-31 Machine Learning Boosted Entropy-Engineered Synthesis of stable Nanometric Solid Solution CuCo Alloys for Efficient Nitrate Reduction to Ammonia Yao Hu et.al. 2408.00142 null
2024-07-31 A comparative study of radio signatures from winds and jets: Modelling synchrotron emission and polarization Moun Meenakshi et.al. 2408.00099 null
2024-07-31 Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs Shi Liu et.al. 2407.21771 null
2024-07-31 Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching Pengjie Zhang et.al. 2407.21735 null
2024-07-31 Deep Learning-Based Longitudinal Prediction of Childhood Myopia Progression Using Fundus Image Sequences and Baseline Refraction Data Mengtian Kang et.al. 2407.21467 null
2024-07-31 Modeling Urban Transport Choices: Incorporating Sociocultural Aspects Kathleen Salazar-Serna et.al. 2407.21307 link
2024-07-30 Algorithm-Assisted Decision Making and Racial Disparities in Housing: A Study of the Allegheny Housing Assessment Tool Lingwei Cheng et.al. 2407.21209 null
2024-07-30 Different behaviour of the gas-phase and stellar metallicity in the central part of MaNGA galaxies I. A. Zinchenko et.al. 2407.21160 null
2024-07-30 Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings Tianyi Zhang et.al. 2407.20870 null
2024-07-30 Planar network statistics for two-dimensional rupturing foams Joseph Klobusicky et.al. 2407.20858 null
2024-07-30 Evaluating Fairness in Black-box Algorithmic Markets: A Case Study of Ride Sharing in Chicago Yuhan Liu et.al. 2407.20522 null
2024-07-29 BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation Kieran Saunders et.al. 2407.20437 null
2024-07-29 Solving QUBOs with a quantum-amenable branch and bound method Thomas Häner et.al. 2407.20185 null
2024-07-29 Classification of Alzheimer’s Dementia vs. Healthy subjects by studying structural disparities in fMRI Time-Series of DMN Sneha Noble et.al. 2407.19990 null
2024-07-29 Can I trust my anomaly detection system? A case study based on explainable AI Muhammad Rashid et.al. 2407.19951 link
2024-07-29 Generalization bounds for regression and classification on adaptive covering input domains Wen-Liang Hwang et.al. 2407.19715 null
2024-07-29 SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages Wenxuan Zhang et.al. 2407.19672 link
2024-07-29 AI-Driven Healthcare: A Survey on Ensuring Fairness and Mitigating Bias Sribala Vidyadhari Chinta et.al. 2407.19655 null
2024-07-28 On the Evaluation Consistency of Attribution-based Explanations Jiarui Duan et.al. 2407.19471 null
2024-07-27 MSP-MVS: Multi-granularity Segmentation Prior Guided Multi-View Stereo Zhenlong Yuan et.al. 2407.19323 null
2024-07-27 On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs Nitay Calderon et.al. 2407.19200 null
2024-07-27 Assessing Spatial Disparities: A Bayesian Linear Regression Approach Kyle Lin Wu et.al. 2407.19171 null
2024-07-26 PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis Sohyeong Kim et.al. 2407.18695 null
2024-07-26 Direct observation of quantum vortex fractionalization in multiband superconductors Yu Zheng et.al. 2407.18610 null
2024-07-26 Content-driven Magnitude-Derivative Spectrum Complementary Learning for Hyperspectral Image Classification Huiyan Bai et.al. 2407.18593 null
2024-07-25 Unsupervised Training of Neural Cellular Automata on Edge Devices John Kalkhof et.al. 2407.18114 link
2024-07-25 TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework Guanfeng Tang et.al. 2407.18038 null
2024-07-25 Towards the Spectral bias Alleviation by Normalizations in Coordinate Networks Zhicheng Cai et.al. 2407.17834 link
2024-07-25 Multi-modal Data Binding for Survival Analysis Modeling with Incomplete Data and Annotations Linhao Qu et.al. 2407.17726 null
2024-07-24 Unveiling the structural content of NGC 6357 via kinematics and NIR variability C. Ordenes-Huanca et.al. 2407.17577 null
2024-07-24 Gender disparities in the dissemination and acquisition of scientific knowledge Chiara Zappalà et.al. 2407.17441 null
2024-07-25 Domain Generalized Recaptured Screen Image Identification Using SWIN Transformer Preeti Mehta et.al. 2407.17170 null
2024-07-23 Balanced Multi-Relational Graph Clustering Zhixiang Shen et.al. 2407.16863 link
2024-07-24 FCNR: Fast Compressive Neural Representation of Visualization Images Yunfei Lu et.al. 2407.16369 link
2024-07-23 MHD activity induced coherent mode excitation in the edge plasma region of ADITYA-U Tokamak Kaushlender Singh et.al. 2407.16301 null
2024-07-23 Representation Magnitude has a Liability to Privacy Vulnerability Xingli Fang et.al. 2407.16164 link
2024-07-22 Inequalities in Computational Thinking Among Incoming Students in an STEM Chilean University Felipe González-Pizarro et.al. 2407.15833 null
2024-07-22 Breaking the Global North Stereotype: A Global South-centric Benchmark Dataset for Auditing and Mitigating Biases in Facial Recognition Systems Siddharth D Jaiswal et.al. 2407.15810 null
2024-07-22 Examining Inequality in Park Quality for Promoting Health Across 35 Global Cities Linus W. Dietz et.al. 2407.15770 link
2024-07-23 Bidirectional skip-frame prediction for video anomaly detection with intra-domain disparity-driven attention Jiahao Lyu et.al. 2407.15424 null
2024-07-22 Iterative approach to reconstructing neural disparity fields from light-field data Ligen Shi et.al. 2407.15380 null
2024-07-22 Dissecting Multiplication in Transformers: Insights into LLMs Luyu Qiu et.al. 2407.15360 link
2024-07-22 Efficient Multi-disparity Transformer for Light Field Image Super-resolution Zeke Zexi Hu et.al. 2407.15329 null
2024-07-19 PolySinger: Singing-Voice to Singing-Voice Translation from English to Japanese Silas Antonisen et.al. 2407.14399 null
2024-07-19 tidychangepoint: a unified framework for analyzing changepoint detection in univariate time series Benjamin S. Baumer et.al. 2407.14369 null
2024-07-19 Stable Audio Open Zach Evans et.al. 2407.14358 link
2024-07-19 SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization Mae Younes et.al. 2407.14257 link
2024-07-19 Double-Shot 3D Shape Measurement with a Dual-Branch Network Mingyang Lei et.al. 2407.14198 null
2024-07-19 Scale Disparity of Instances in Interactive Point Cloud Segmentation Chenrui Han et.al. 2407.14009 null
2024-07-19 Reexamining Racial Disparities in Automatic Speech Recognition Performance: The Role of Confounding by Provenance Changye Li et.al. 2407.13982 link
2024-07-19 The Group Robustness is in the Details: Revisiting Finetuning under Spurious Correlations Tyler LaBonte et.al. 2407.13957 link
2024-07-18 Research on Tibetan Tourism Viewpoints information generation system based on LLM Jinhu Qi et.al. 2407.13561 null
2024-07-18 CookAR: Affordance Augmentations in Wearable AR to Support Kitchen Tool Interactions for People with Low Vision Jaewook Lee et.al. 2407.13515 link
2024-07-18 MIR laser CEP estimation using machine learning concepts in bulk high harmonic generation Balázs Nagyillés et.al. 2407.13512 null
2024-07-18 From Words to Worlds: Compositionality for Cognitive Architectures Ruchira Dhar et.al. 2407.13419 null
2024-07-18 Hybridization of terahertz phonons and magnons in disparate and spatially-separated material specimens Marcin Białek et.al. 2407.13305 null
2024-07-18 FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection Jianwei Zhao et.al. 2407.13133 null
2024-07-17 Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning Minjae Cho et.al. 2407.13006 null
2024-07-17 Multi-Band Wi-Fi Neural Dynamic Fusion Sorachi Kato et.al. 2407.12937 null
2024-07-17 Propagation of Interplanetary Shocks in the Heliosphere Munkhjargal Lkhagvadorj et.al. 2407.12689 null
2024-07-16 Temporally Consistent Stereo Matching Jiaxi Zeng et.al. 2407.11950 link
2024-07-16 Fairly Accurate: Optimizing Accuracy Parity in Fair Target-Group Detection Soumyajit Gupta et.al. 2407.11933 null
2024-07-16 MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification Zhuoxiao Li et.al. 2407.11840 null
2024-07-16 Robust Utility-Preserving Text Anonymization Based on Large Language Models Tianyu Yang et.al. 2407.11770 link
2024-07-16 Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems Jianzhu Huai et.al. 2407.11705 null
2024-07-16 Rethinking Fair Graph Neural Networks from Re-balancing Zhixun Li et.al. 2407.11624 link
2024-07-17 QVD: Post-training Quantization for Video Diffusion Models Shilong Tian et.al. 2407.11585 null
2024-07-16 Representation Bias in Political Sample Simulations with Large Language Models Weihong Qi et.al. 2407.11409 null
2024-07-16 The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation Muyang Qiu et.al. 2407.11356 link
2024-07-15 Benchmarking Vision Language Models for Cultural Understanding Shravan Nayak et.al. 2407.10920 null
2024-07-15 Temporal Event Stereo via Joint Learning with Stereoscopic Flow Hoonhee Cho et.al. 2407.10831 link
2024-07-15 Growth of Science: How long will the United States uphold its position? Dipak Patra et.al. 2407.10771 null
2024-07-15 Socioeconomic factors of national representation in the global film festival circuit: skewed toward the large and wealthy, but small countries can beat the odds Andres Karjus et.al. 2407.10755 null
2024-07-15 Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model Zhening Liu et.al. 2407.10632 link
2024-07-15 Muon-induced collisional flavor instability in core-collapse supernova Jiabao Liu et.al. 2407.10604 null
2024-07-15 A Unifying Approach to Product Constructions for Quantitative Temporal Inference Kazuki Watanabe et.al. 2407.10465 null
2024-07-14 Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data Tuo Feng et.al. 2407.10200 link
2024-07-14 Adaptive Model Predictive Control with Data-driven Error Model for Quadrupedal Locomotion Xuanqi Zeng et.al. 2407.10124 null
2024-07-13 Characterizing Disparity Between Edge Models and High-Accuracy Base Models for Vision Tasks Zhenyu Wang et.al. 2407.10016 null
2024-07-12 Self-organized multiscale structures in thermally relativistic electron-positron-ion plasmas Usman Shazad et.al. 2407.09440 null
2024-07-12 Multi-Modal Dataset Creation for Federated Learning with DICOM Structured Reports Malte Tölle et.al. 2407.09064 null
2024-07-12 Tissue-Contrastive Semi-Masked Autoencoders for Segmentation Pretraining on Chest CT Jie Zheng et.al. 2407.08961 null
2024-07-11 MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization Orevaoghene Ahia et.al. 2407.08818 null
2024-07-11 Adaptive Smooth Non-Stationary Bandits Joe Suk et.al. 2407.08654 link
2024-07-11 Multi-Group Proportional Representation Alex Oesterling et.al. 2407.08571 link
2024-07-11 Vox Populi, Vox AI? Using Language Models to Estimate German Public Opinion Leah von der Heyde et.al. 2407.08563 link
2024-07-11 Unveiling Disparities in Maternity Care: A Topic Modelling Approach to Analysing Maternity Incident Investigation Reports Georgina Cosma et.al. 2407.08328 null
2024-07-11 DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing Minghang Zhou et.al. 2407.08132 link
2024-07-10 Stretch your reach: Studying Self-Avatar and Controller Misalignment in Virtual Reality Interaction Jose Luis Ponton et.al. 2407.08011 null
2024-07-10 A Survey on Deep Stereo Matching in the Twenties Fabio Tosi et.al. 2407.07816 link
2024-07-10 Explicit inverse of symmetric, tridiagonal near Toeplitz matrices Part II: with weakly diagonally dominant Toeplitz Bakytzhan Kurmanbek et.al. 2407.07654 null
2024-07-10 TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data Siyi Du et.al. 2407.07582 link
2024-07-10 Causal Discovery-Driven Change Point Detection in Time Series Shanyun Gao et.al. 2407.07290 null
2024-07-09 A Detailed Analysis of a Magnetic Island Observed by WISPR on Parker Solar Probe Madison L. Ascione et.al. 2407.07216 null
2024-07-09 Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images Chuanrui Zhang et.al. 2407.06984 null
2024-07-09 iASiS: Towards Heterogeneous Big Data Analysis for Personalized Medicine Anastasia Krithara et.al. 2407.06748 null
2024-07-09 Computer vision tasks for intelligent aerospace missions: An overview Huilin Chen et.al. 2407.06513 null
2024-07-09 LuSNAR: A Lunar Segmentation, Navigation and Reconstruction Dataset based on Multi-sensor for Autonomous Exploration Jiayi Liu et.al. 2407.06512 link
2024-07-08 Systematic time-coarse graining for driven quantum systems Leon Bello et.al. 2407.06068 link
2024-07-08 CA-FedRC: Codebook Adaptation via Federated Reservoir Computing in 5G NR Ziqiang Ye et.al. 2407.05928 null
2024-07-08 GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation Chenxin Li et.al. 2407.05540 null
2024-07-07 GitHub Marketplace for Automation and Innovation in Software Production SK Golam Saroar et.al. 2407.05519 null
2024-07-07 Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models Nikhil Sharma et.al. 2407.05502 null
2024-07-07 CLIMB: A Benchmark of Clinical Bias in Large Language Models Yubo Zhang et.al. 2407.05250 link
2024-07-06 SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention Yunzhong Si et.al. 2407.05128 link
2024-07-06 Crowdsourced reviews reveal substantial disparities in public perceptions of parking Lingyao Li et.al. 2407.05104 link
2024-07-06 SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions Zaid A. El-Shair et.al. 2407.04908 null
2024-07-05 Balancing Operator’s Risk Averseness in Model Predictive Control of a Reservoir System Ja-Ho Koo et.al. 2407.04506 null
2024-07-04 The SOHO LASCO CME Catalog – Version 2 Nat Gopalswamy et.al. 2407.04165 null
2024-07-04 Behavioural gap assessment of human-vehicle interaction in real and virtual reality-based scenarios in autonomous driving Sergio Martín Serrano et.al. 2407.04070 null
2024-07-04 Adversarial Robustness of VAEs across Intersectional Subgroups Chethan Krishnamurthy Ramanaik et.al. 2407.03864 link
2024-07-04 M5 – A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks Florian Schneider et.al. 2407.03791 null
2024-07-04 High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching Gael Le Lan et.al. 2407.03648 null
2024-07-04 ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution Yuanbo Zhou et.al. 2407.03598 link
2024-07-03 Probing Perfection: The Relentless Art of Meddling for Pulmonary Airway Segmentation from HRCT via a Human-AI Collaboration Based Active Learning Method Shiyi Wang et.al. 2407.03542 null
2024-07-03 How Does Quantization Affect Multilingual LLMs? Kelly Marchisio et.al. 2407.03211 null
2024-07-03 Stereo Risk: A Continuous Modeling Approach to Stereo Matching Ce Liu et.al. 2407.03152 null
2024-07-03 Effective Heterogeneous Federated Learning via Efficient Hypernetwork-based Weight Generation Yujin Shin et.al. 2407.03086 link
2024-07-03 Early-Stage Anomaly Detection: A Study of Model Performance on Complete vs. Partial Flows Adrian Pekar et.al. 2407.02856 link
2024-07-03 A Pairwise DomMix Attentive Adversarial Network for Unsupervised Domain Adaptive Object Detection Jie Shao et.al. 2407.02835 null
2024-07-02 Practical Guide for Causal Pathways and Sub-group Disparity Analysis Farnaz Kohankhaki et.al. 2407.02702 null
2024-07-02 Domain Generalizable Knowledge Tracing via Concept Aggregation and Relation-Based Attention Yuquan Xie et.al. 2407.02547 null
2024-07-02 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices Juntao Zhao et.al. 2407.02327 link
2024-07-02 Crossroads of Continents: Automated Artifact Extraction for Cultural Adaptation with Large Multimodal Models Anjishnu Mukherjee et.al. 2407.02067 link
2024-07-02 Privacy Risks of General-Purpose AI Systems: A Foundation for Investigating Practitioner Perspectives Stephen Meisenbacher et.al. 2407.02027 null
2024-07-02 Investigating the Effects of Large-Scale Pseudo-Stereo Data and Different Speech Foundation Model on Dialogue Generative Spoken Language Model Yu-Kuan Fu et.al. 2407.01911 null
2024-07-01 Race and Privacy in Broadcast Police Communications Pranav Narayanan Venkit et.al. 2407.01817 null
2024-07-01 Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation Lianjie Guo et.al. 2407.01292 link
2024-07-01 OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos Yassine Benzakour et.al. 2407.01265 null
2024-07-01 FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models Ruinan Jin et.al. 2407.00983 link
2024-06-30 Learning System Dynamics without Forgetting Xikun Zhang et.al. 2407.00717 link
2024-06-30 Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP Ayush Ranjan et.al. 2407.00592 null
2024-06-28 LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation Xianda Guo et.al. 2406.19833 link
2024-06-28 Galaxy Group Ellipticity Confirms a Younger Cosmos Yu Rong et.al. 2406.19612 null
2024-06-28 What’s the Weight? Estimating Controlled Outcome Differences in Complex Surveys for Health Disparities Research Stephen Salerno et.al. 2406.19597 link
2024-06-27 Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects Orevaoghene Ahia et.al. 2406.19564 link
2024-06-27 Stereo Vision Based Robot for Remote Monitoring with VR Support Mohamed Fazil M. S. et.al. 2406.19498 null
2024-06-27 STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning Yanan Zhang et.al. 2406.19362 null
2024-06-27 Revealing Fine-Grained Values and Opinions in Large Language Models Dustin Wright et.al. 2406.19238 link
2024-06-27 RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation Fanfan Liu et.al. 2406.18977 link
2024-06-27 From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased Decisions Trenton Chang et.al. 2406.18865 link
2024-06-27 Retain, Blend, and Exchange: A Quality-aware Spatial-Stereo Fusion Approach for Event Stream Recognition Lan Chen et.al. 2406.18845 link
2024-06-26 DoubleTake: Geometry Guided Depth Estimation Mohamed Sayed et.al. 2406.18387 null
2024-06-26 An interactive framework for the evaluation and detection of stereoacuity threshold under ambient lighting Kritika Lohia et.al. 2406.18336 null
2024-06-26 Molecular Diffusion Models with Virtual Receptors Matan Halfon et.al. 2406.18330 null
2024-06-28 SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance Caishuang Huang et.al. 2406.18118 link
2024-06-25 Evaluating Fairness in Large Vision-Language Models Across Diverse Demographic Attributes and Prompts Xuyang Wu et.al. 2406.17974 link
2024-06-25 Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals Kentaro Seki et.al. 2406.17722 link
2024-06-25 Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation Xuming Zhang et.al. 2406.17679 null
2024-06-25 RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale Beck LaBash et.al. 2406.16801 link
2024-06-24 Addressing Polarization and Unfairness in Performative Prediction Kun Jin et.al. 2406.16756 null
2024-06-24 Lone Pair Induced 1D Character and Weak Cation-anion Interactions: Two Ingredients for Low Thermal Conductivity in Mixed-anion Metal Chalcohalides Xingchen Shen et.al. 2406.16744 null
2024-06-24 Effective Elastic Properties of Multilayer Graphene Yun Hwangbo et.al. 2406.16344 null
2024-06-23 Thinking beyond Bias: Analyzing Multifaceted Impacts and Implications of AI on Gendered Labour Satyam Mohla et.al. 2406.16207 null
2024-06-23 The Persistence of Contrarianism on Twitter: Mapping users’ sharing habits for the Ukraine war, COVID-19 vaccination, and the 2020 Midterm Elections David Axelrod et.al. 2406.16175 null
2024-06-23 Comparison of methods for mediation analysis with multiple correlated mediators Mary Appah et.al. 2406.16174 null
2024-06-23 Quantitative Global Carbon Inequality Network Yanming Guo et.al. 2406.16092 null
2024-06-23 Learning Accurate and Enriched Features for Stereo Image Super-Resolution Hu Gao et.al. 2406.16001 link
2024-06-23 Generalized Measures of Population Synchrony Francis C. Motta et.al. 2406.15987 null
2024-06-21 Bug In the Code Stack: Can LLMs Find Bugs in Large Python Code Stacks Hokyung Lee et.al. 2406.15325 link
2024-06-21 Time-Domain Signatures of Distinct Correlated Insulators in a Moiré Superlattice Eric A. Arsenault et.al. 2406.15067 null
2024-06-21 3D-Localization of Single Point-Like Gamma Sources with a Coded Aperture Camera Tobias Meißner et.al. 2406.15048 null
2024-06-21 Trustworthy Enhanced Multi-view Multi-modal Alzheimer’s Disease Prediction with Brain-wide Imaging Transcriptomics Data Shan Cong et.al. 2406.14977 link
2024-06-21 Direct Multi-Turn Preference Optimization for Language Agents Wentao Shi et.al. 2406.14868 link
2024-06-21 Older and Wiser: The Marriage of Device Aging and Intellectual Property Protection of Deep Neural Networks Ning Lin et.al. 2406.14863 null
2024-06-21 Non-Markovian Collective Emission of Giant emitters in the Zeno Regime Qing-Yang Qiu et.al. 2406.14811 null
2024-06-20 1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators? Yue Huang et.al. 2406.14721 null
2024-06-20 Population Activity Recovery: Milestones Unfolding, Temporal Interdependencies, and Relationship with Physical and Social Vulnerability Flavia Ioana Patrascu et.al. 2406.14720 null
2024-06-20 Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data Johannes Treutlein et.al. 2406.14546 link
2024-06-20 Towards Truthful Multilingual Large Language Models: Benchmarking and Alignment Strategies Weihao Liu et.al. 2406.14434 link
2024-06-20 Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services David Hartmann et.al. 2406.14154 null
2024-06-20 Novae: An Important Source of Lithium in the Galaxy Jun Gao et.al. 2406.13986 null
2024-06-19 Open Generative Large Language Models for Galician Pablo Gamallo et.al. 2406.13893 null
2024-06-19 Leveraging Large Language Models to Measure Gender Bias in Gendered Languages Erik Derner et.al. 2406.13677 null
2024-06-19 Transferable Tactile Transformers for Representation Learning Across Diverse Sensors and Tasks Jialiang Zhao et.al. 2406.13640 null
2024-06-19 Formation of a Magnetic Cloud from the Merging of Two Successive Coronal Mass Ejections Chong Chen et.al. 2406.13603 null
2024-06-19 MVSBoost: An Efficient Point Cloud-based 3D Reconstruction Umair Haroon et.al. 2406.13515 null
2024-06-19 Toward Structure Fairness in Dynamic Graph Embedding: A Trend-aware Dual Debiasing Approach Yicong Li et.al. 2406.13201 link
2024-06-18 Stealth edits for provably fixing or attacking large language models Oliver J. Sutton et.al. 2406.12670 link
2024-06-18 An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation Qin Li et.al. 2406.12646 null
2024-06-18 Restorer: Solving Multiple Image Restoration Tasks with One Set of Parameters Jiawei Mao et.al. 2406.12587 link
2024-06-18 Rastall gravity: accretion disk image in radiation fields context and visual transformations compared to Reissner-Nordstrom black holes Yu-Xiang Huang et.al. 2406.12466 null
2024-06-18 Status of Astronomy Education in India: A Baseline Survey Moupiya Maji et.al. 2406.12308 null
2024-06-17 Slicing Through Bias: Explaining Performance Gaps in Medical Image Analysis using Slice Discovery Methods Vincent Olesen et.al. 2406.12142 link
2024-06-17 The Benefits and Risks of Transductive Approaches for AI Fairness Muhammed Razzak et.al. 2406.12011 null
2024-06-17 Decomposed evaluations of geographic disparities in text-to-image models Abhishek Sureddy et.al. 2406.11988 null
2024-06-17 Be careful in multi-messenger inference of the Hubble constant: A path forward for robust inference Michael Müller et.al. 2406.11965 null
2024-06-17 Personalized Federated Knowledge Graph Embedding with Client-Wise Relation Graph Xiaoxiong Zhang et.al. 2406.11943 null
2024-06-17 P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models Shuo Yang et.al. 2406.11391 null
2024-06-17 Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network Frank Sippel et.al. 2406.11284 link
2024-06-16 Physics-Informed Deep Learning and Partial Transfer Learning for Bearing Fault Diagnosis in the Presence of Highly Missing Data Mohammadreza Kavianpour et.al. 2406.11023 null
2024-06-16 Rectified Iterative Disparity for Stereo Matching Weiqing Xiao et.al. 2406.10943 null
2024-06-16 Quantifying Generative Media Bias with a Corpus of Real-world and Generated News Articles Filip Trhlik et.al. 2406.10773 null
2024-06-15 Trapping of isotropic droplets by disclinations in nematic liquid crystals controlled by surface anchoring and elastic constant disparity Nilanthi P. Haputhanthrige et.al. 2406.10684 null
2024-06-15 Functional Clustering for Longitudinal Associations between County-Level Social Determinants of Health and Stroke Mortality in the US Fangzhi Luo et.al. 2406.10499 null
2024-06-15 A Label is Worth a Thousand Images in Dataset Distillation Tian Qin et.al. 2406.10485 link
2024-06-14 Consistency-diversity-realism Pareto fronts of conditional image generative models Pietro Astolfi et.al. 2406.10429 null
2024-06-14 Gender Representation in TV and Radio: Automatic Information Extraction methods versus Manual Analyses David Doukhan et.al. 2406.10316 null
2024-06-14 Carbon Monoxide Cooling in Radiative Transfer Modeling of Supernovae Collin McLeod et.al. 2406.10132 null
2024-06-14 DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications Li Li et.al. 2406.10068 link
2024-06-14 Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness Maximilian Spliethöver et.al. 2406.09977 null
2024-06-14 OpenCapBench: A Benchmark to Bridge Pose Estimation and Biomechanics Yoni Gozlan et.al. 2406.09788 null
2024-06-14 Cross-view geo-localization: a survey Abhilash Durgam et.al. 2406.09722 null
2024-06-14 MoME: Mixture of Multimodal Experts for Cancer Survival Prediction Conghao Xiong et.al. 2406.09696 link
2024-06-13 Strain rate controls alignment in growing bacterial monolayers Blake Langeslay et.al. 2406.09615 null
2024-06-13 AOC: Analysis of Orthologous Collections – an application for the characterization of natural selection in protein-coding sequences Alexander Lucaci et.al. 2406.09522 link
2024-06-13 You are what you eat? Feeding foundation models a regionally diverse food dataset of World Wide Dishes Jabez Magomere et.al. 2406.09496 link
2024-06-13 Scale-Invariant Monocular Depth Estimation via SSI Depth S. Mahdi H. Miangoleh et.al. 2406.09374 link
2024-06-13 Less Cybersickness, Please: Demystifying and Detecting Stereoscopic Visual Inconsistencies in VR Apps Shuqing Li et.al. 2406.09313 null
2024-06-13 Python-based DSL for generating Verilog model of Synchronous Digital Circuits Mandar Datar et.al. 2406.09208 link
2024-06-13 Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns Kaavya Rekanar et.al. 2406.09203 null
2024-06-13 Fine-Grained Domain Generalization with Feature Structuralization Wenlong Yu et.al. 2406.09166 link
2024-06-13 Mean Field Study of Superconductivity in the Square Lattice $t$-$J$ Model with Three-Site Hopping Ke Yang et.al. 2406.08780 null
2024-06-12 On Strongly-equitable Social Welfare Orders Without the Axiom of Choice Luke Serafin et.al. 2406.08684 null
2024-06-12 Conditional Similarity Triplets Enable Covariate-Informed Representations of Single-Cell Data Chi-Jane Chen et.al. 2406.08638 link
2024-06-12 Unraveling Code-Mixing Patterns in Migration Discourse: Automated Detection and Analysis of Online Conversations on Reddit Fedor Vitiugin et.al. 2406.08633 link
2024-06-13 Real2Code: Reconstruct Articulated Objects via Code Generation Zhao Mandi et.al. 2406.08474 null
2024-06-12 Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models Javier Nistal et.al. 2406.08384 null
2024-06-12 Chemistry3D: Robotic Interaction Benchmark for Chemistry Experiments Shoujie Li et.al. 2406.08160 link
2024-06-12 Generalizable Disaster Damage Assessment via Change Detection with Vision Foundation Model Kyeongjin Ahn et.al. 2406.08020 null
2024-06-12 Automatic detection of large-scale flux ropes and their geoeffectiveness with a machine learning approach Sanchita Pal et.al. 2406.07798 null
2024-06-11 PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow Joshua Tokarsky et.al. 2406.07667 null
2024-06-11 Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling Denis Blessing et.al. 2406.07423 link
2024-06-11 NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images Yufei Han et.al. 2406.07111 null
2024-06-11 The evolution of coronal shock wave properties and their relation with solar energetic particles Manon Jarry et.al. 2406.07058 null
2024-06-11 Bridging Language Gaps in Audio-Text Retrieval Zhiyong Yan et.al. 2406.07012 link
2024-06-11 HPC Alongside User-space Kubernetes Vanessa Sochat et.al. 2406.06995 null
2024-06-11 Stepwise Regression and Pre-trained Edge for Robust Stereo Matching Weiqing Xiao et.al. 2406.06953 link
2024-06-10 Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies Alex DeWeese et.al. 2406.06823 null
2024-06-10 The Legal Duty to Search for Less Discriminatory Algorithms Emily Black et.al. 2406.06817 null
2024-06-10 Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests T. Tony Cai et.al. 2406.06749 null
2024-06-10 The largest metallicity difference in twin systems: high-precision abundance analysis of the benchmark pair Krios & Kronos P. Miquelarena et.al. 2406.06705 null
2024-06-10 Annotation alignment: Comparing LLM and human annotations of conversational safety Rajiv Movva et.al. 2406.06369 null
2024-06-10 Shoulders of Giants: A Look at the Degree and Utility of Openness in NLP Research Surangika Ranathunga et.al. 2406.06021 null
2024-06-10 Computational and Statistical Guarantees for Tensor-on-Tensor Regression with Tensor Train Decomposition Zhen Qin et.al. 2406.06002 null
2024-06-10 Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context Jingru Jia et.al. 2406.05972 null
2024-06-09 Predictors of the Sense of Presence in an Immersive Audio Storytelling Experience, a Mixed Methods Study. PREPRINT Isabelle Verhulst et.al. 2406.05856 null
2024-06-09 SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion Bingsong Bai et.al. 2406.05692 null
2024-06-09 MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations Hemant Yadav et.al. 2406.05661 null
2024-06-09 Do LLMs Exhibit Human-Like Reasoning? Evaluating Theory of Mind in LLMs for Open-Ended Responses Maryam Amirizaniani et.al. 2406.05659 null
2024-06-08 I-SIRch: AI-Powered Concept Annotation Tool For Equitable Extraction And Analysis Of Safety Insights From Maternity Investigations Mohit Kumar Singh et.al. 2406.05505 null
2024-06-08 M3GIA: A Cognition Inspired Multilingual and Multimodal General Intelligence Ability Benchmark Wei Song et.al. 2406.05343 link
2024-06-07 ProMotion: Prototypes As Motion Learners Yawen Lu et.al. 2406.04999 null
2024-06-07 On the social bias of speech self-supervised models Yi-Cheng Lin et.al. 2406.04997 null
2024-06-07 UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection Yuchao Wang et.al. 2406.04647 null
2024-06-06 Function and form of U.S. cities Sandro M. Reia et.al. 2406.04543 null
2024-06-06 TexIm FAST: Text-to-Image Representation for Semantic Similarity Evaluation using Transformers Wazib Ansar et.al. 2406.04438 null
2024-06-06 Stereo-Depth Fusion through Virtual Pattern Projection Luca Bartolomei et.al. 2406.04345 link
2024-06-06 Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation Honglei Zhang et.al. 2406.03933 link
2024-06-06 Knowledge Transfer, Knowledge Gaps, and Knowledge Silos in Citation Networks Eoghan Cunningham et.al. 2406.03921 link
2024-06-06 Transductive Off-policy Proximal Policy Optimization Yaozhong Gan et.al. 2406.03894 null
2024-06-05 Does the Sun have a Dark Disk? Gustavo F. S. Alves et.al. 2406.03607 null
2024-06-05 Reconciling Heterogeneous Effects in Causal Inference Audrey Chang et.al. 2406.03575 null
2024-06-05 MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization Xiaobo Guo et.al. 2406.03479 null
2024-06-05 A Flexible Recursive Network for Video Stereo Matching Based on Residual Estimation Youchen Zhao et.al. 2406.03333 link
2024-06-05 On the Maximal Local Disparity of Fairness-Aware Classifiers Jinqiu Jin et.al. 2406.03255 link
2024-06-05 MMCL: Boosting Deformable DETR-Based Detectors with Multi-Class Min-Margin Contrastive Learning for Superior Prohibited Item Detection Mingyuan Li et.al. 2406.03176 link
2024-06-05 Instructing Prompt-to-Prompt Generation for Zero-Shot Learning Man Liu et.al. 2406.03032 null
2024-06-05 GraphAlign: Pretraining One Graph Neural Network on Multiple Graphs via Feature Alignment Zhenyu Hou et.al. 2406.02953 null
2024-06-04 Building Socially-Equitable Public Models Yejia Liu et.al. 2406.02790 link
2024-06-04 VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors Markus Plack et.al. 2406.02552 null
2024-06-04 The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding Kenneth Enevoldsen et.al. 2406.02396 link
2024-06-04 Layer-2 Arbitrage: An Empirical Analysis of Swap Dynamics and Price Disparities on Rollups Krzysztof Gogol et.al. 2406.02172 null
2024-06-04 A Multipurpose Interface for Close- and Far-Proximity Control of Mobile Collaborative Robots Hamidreza Raei et.al. 2406.02171 link
2024-06-05 CondTSF: One-line Plugin of Dataset Condensation for Time Series Forecasting Jianrong Ding et.al. 2406.02131 link
2024-06-04 Timescale bridging in atomistic simulations of epoxy polymer mechanics using non-affine deformation theory Vinay Vaibhav et.al. 2406.02113 null
2024-06-03 Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities Golnoosh Farnadi et.al. 2406.01757 null
2024-06-03 Inverse design of photonic surfaces on Inconel via multi-fidelity machine learning ensemble framework and high throughput femtosecond laser processing Luka Grbcic et.al. 2406.01471 null
2024-06-03 Structural Interventions and the Dynamics of Inequality Aurora Zhang et.al. 2406.01323 null
2024-06-03 Bridging the Digital Divide: Mapping Internet Connectivity Evolution, Inequalities, and Resilience in six Brazilian Cities Nicolò Gozzi et.al. 2406.01113 null
2024-05-31 Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF Tengyang Xie et.al. 2405.21046 null
2024-05-31 GANcrop: A Contrastive Defense Against Backdoor Attacks in Federated Learning Xiaoyun Gan et.al. 2405.20727 null
2024-05-31 Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation Shuzhou Yang et.al. 2405.20669 link
2024-05-31 Weak-Form Inference for Hybrid Dynamical Systems in Ecology Daniel Messenger et.al. 2405.20591 null
2024-05-31 The Point of View of a Sentiment: Towards Clinician Bias Detection in Psychiatric Notes Alissa A. Valentine et.al. 2405.20582 null
2024-05-30 Impact of Connected and Automated Vehicles on Transport Injustices Laura Martinez-Buelvas et.al. 2405.20530 null
2024-05-30 Bridging electronic and classical density-functional theory using universal machine-learned functional approximations Michelle M. Kelley et.al. 2405.20270 null
2024-05-30 Object-centric Reconstruction and Tracking of Dynamic Unknown Objects using 3D Gaussian Splatting Kuldeep R Barad et.al. 2405.20104 null
2024-05-30 Strategies to Counter Artificial Intelligence in Law Enforcement: Cross-Country Comparison of Citizens in Greece, Italy and Spain Petra Saskia Bayerl et.al. 2405.19970 null
2024-05-29 X-ray and Radio campaign of the Z-source GX 340+0: discovery of X-ray polarization and its implications Yash Bhargava et.al. 2405.19324 null
2024-05-29 Measuring and Mitigating Bias for Tabular Datasets with Multiple Protected Attributes Manh Khoi Duong et.al. 2405.19300 link
2024-05-29 Mitigating Disparate Impact of Differential Privacy in Federated Learning through Robust Clustering Saber Malekmohammadi et.al. 2405.19272 null
2024-05-29 MAGIC: Modular Auto-encoder for Generalisable Model Inversion with Bias Corrections Yihang She et.al. 2405.18953 link
2024-05-29 UniPTS: A Unified Framework for Proficient Post-Training Sparsity Jingjing Xie et.al. 2405.18810 link
2024-05-28 The Efficacy of the Connect America Fund in Addressing US Internet Access Inequities Haarika Manda et.al. 2405.18657 null
2024-05-28 Aligning in a Compact Space: Contrastive Knowledge Distillation between Heterogeneous Architectures Hongjun Wu et.al. 2405.18524 null
2024-05-28 Exploring the Evolution of Altruistic Punishment with a PDE Model of Cultural Multilevel Selection Daniel B. Cooney et.al. 2405.18419 link
2024-05-28 A Calibration Tool for Refractive Underwater Vision Felix Seegräber et.al. 2405.18018 null
2024-05-28 Cross-Context Backdoor Attacks against Graph Prompt Learning Xiaoting Lyu et.al. 2405.17984 link
2024-05-28 FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes Yunsong Wang et.al. 2405.17958 link
2024-05-28 Boosting Protein Language Models with Negative Sample Mining Yaoyao Xu et.al. 2405.17902 link
2024-05-28 Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection Yingwen Wu et.al. 2405.17816 null
2024-05-27 A Two-sided Model for EV Market Dynamics and Policy Implications Haoxuan Ma et.al. 2405.17702 null
2024-05-27 Unifying Perspectives: Plausible Counterfactual Explanations on Global, Group-wise, and Local Levels Patryk Wielopolski et.al. 2405.17642 null
2024-05-27 MindMerger: Efficient Boosting LLM Reasoning in non-English Languages Zixian Huang et.al. 2405.17386 link
2024-05-27 EF-Calib: Spatiotemporal Calibration of Event- and Frame-Based Cameras Using Continuous-Time Trajectories Shaoan Wang et.al. 2405.17278 link
2024-05-27 Highly inhomogeneous interactions between background climate and urban warming across typical local climate zones in heatwave and non-heatwave days Jing Kong et.al. 2405.17213 null
2024-05-27 SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing Yong-Qiang Mao et.al. 2405.17140 null
2024-05-27 Multi-view Disparity Estimation Using a Novel Gradient Consistency Model James L. Gray et.al. 2405.17029 null
2024-05-27 Blind Data Adaptation to tackle Covariate Shift in Operational Steganalysis Rony Abecidan et.al. 2405.16961 null
2024-05-27 Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models Fengfan Zhou et.al. 2405.16940 null
2024-05-28 PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting Zipeng Wang et.al. 2405.16829 null
2024-05-27 Addressing Discretization-Induced Bias in Demographic Prediction Evan Dong et.al. 2405.16762 link
2024-05-26 Demystify Mamba in Vision: A Linear Attention Perspective Dongchen Han et.al. 2405.16605 link
2024-05-24 Synthetic high angular momentum spin dynamics in a microwave oscillator Saswata Roy et.al. 2405.15695 null
2024-05-24 Digital finance, Bargaining Power and Gender Wage Gap Qing Guo et.al. 2405.15486 null
2024-05-24 Mind the Gap: A Causal Perspective on Bias Amplification in Prediction & Decision-Making Drago Plecko et.al. 2405.15446 null
2024-05-24 Fairness-Accuracy Trade-Offs: A Causal Perspective Drago Plecko et.al. 2405.15443 link
2024-05-23 ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization Han Song et.al. 2405.15082 null
2024-05-23 Modularity, Higher-Order Recombination, and New Venture Success Likun Cao et.al. 2405.15042 null
2024-05-23 Federated Online Adaptation for Deep Stereo Matteo Poggi et.al. 2405.14873 null
2024-05-23 An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models Jiahao Sun et.al. 2405.14870 link
2024-05-23 Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras Hanzhang Tu et.al. 2405.14866 null
2024-05-23 A Systematic and Formal Study of the Impact of Local Differential Privacy on Fairness: Preliminary Results Karima Makhlouf et.al. 2405.14725 null
2024-05-23 Is the EJRA proportionate and therefore justified? A critical review of the EJRA policy at Cambridge Oliver Linton et.al. 2405.14611 null
2024-05-23 Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks Xingguang Jiang et.al. 2405.14520 null
2024-05-22 Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation Mykhailo Uss et.al. 2405.14024 null
2024-05-22 CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models Giada Pistilli et.al. 2405.13974 null
2024-05-22 Multi-Dataset Multi-Task Learning for COVID-19 Prognosis Filippo Ruffini et.al. 2405.13771 null
2024-05-22 Knowledge-Driven Cross-Document Relation Extraction Monika Jain et.al. 2405.13546 link

Monocular Depth Estimation

Publish Date Title Authors PDF Code
2025-07-15 Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation Zhen Xu et.al. 2507.11540 null
2025-07-15 MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network Jianfei Jiang et.al. 2507.11333 null
2025-07-15 Uncertainty Aware Mapping for Vision-Based Underwater Robots Abhimanyu Bhowmik et.al. 2507.10991 null
2025-07-14 Static or Temporal? Semantic Scene Simplification to Aid Wayfinding in Immersive Simulations of Bionic Vision Justin M. Kasowski et.al. 2507.10813 null
2025-07-14 Cameras as Relative Positional Encoding Ruilong Li et.al. 2507.10496 null
2025-07-14 Spatial Lifting for Dense Prediction Mingzhi Xu et.al. 2507.10222 null
2025-07-13 Prompt2DEM: High-Resolution DEMs for Urban and Open Environments from Global Prompts Using a Monocular Foundation Model Osher Rafaeli et.al. 2507.09681 null
2025-07-11 ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way Rajarshi Roy et.al. 2507.08679 null
2025-07-10 An Embedded Real-time Object Alert System for Visually Impaired: A Monocular Depth Estimation based Approach through Computer Vision Jareen Anjom et.al. 2507.08165 null
2025-07-10 Tree-Mamba: A Tree-Aware Mamba for Underwater Monocular Depth Estimation Peixian Zhuang et.al. 2507.07687 null
2025-07-10 HOTA: Hierarchical Overlap-Tiling Aggregation for Large-Area 3D Flood Mapping Wenfeng Jia et.al. 2507.07585 null
2025-07-08 LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures Seungoh Han et.al. 2507.06109 null
2025-07-14 Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation Quanzhu Niu et.al. 2507.05948 null
2025-07-07 The Generalization Ridge: Information Flow in Natural Language Generation Ruidi Chang et.al. 2507.05387 null
2025-07-10 VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting Juyi Lin et.al. 2507.05116 null
2025-07-07 Estimating Object Physical Properties from RGB-D Vision and Depth Robot Sensors Using Deep Learning Ricardo Cardoso et.al. 2507.05029 null
2025-07-06 A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields Aoxiang Fan et.al. 2507.04408 null
2025-07-06 High-Resolution Sustain Pedal Depth Estimation from Piano Audio Across Room Acoustics Kun Fang et.al. 2507.04230 null
2025-07-03 From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images Danrong Zhang et.al. 2507.02781 null
2025-07-10 Underwater Monocular Metric Depth Estimation: Real-World Benchmarks and Synthetic Fine-Tuning with Vision Foundation Models Zijie Cai et.al. 2507.02148 null
2025-07-02 RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather Yuran Wang et.al. 2507.01653 null
2025-07-02 Depth Anything at Any Condition Boyuan Sun et.al. 2507.01634 null
2025-07-02 DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation Yue-Jiang Dong et.al. 2507.01603 null
2025-07-02 Evaluating Robustness of Monocular Depth Estimation with Procedural Scene Perturbations Jack Nugent et.al. 2507.00981 null
2025-06-30 SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures Fengyi Jiang et.al. 2507.00209 null
2025-06-30 OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving Mingqian Ji et.al. 2506.23565 null
2025-06-26 ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation Shruti Bansal et.al. 2506.20969 null
2025-06-25 THIRDEYE: Cue-Aware Monocular Depth Estimation via Brain-Inspired Multi-Stage Fusion Calin Teodor Ioan et.al. 2506.20877 null
2025-06-30 StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation Haodong Li et.al. 2506.20756 null
2025-06-24 Look to Locate: Vision-Based Multisensory Navigation with 3-D Digital Maps for GNSS-Challenged Environments Ola Elmaghraby et.al. 2506.19827 null
2025-06-23 SOF: Sorted Opacity Fields for Fast Unbounded Surface Reconstruction Lukas Radl et.al. 2506.19139 null
2025-06-23 BulletGen: Improving 4D Reconstruction with Bullet-Time Generation Denys Rozumnyi et.al. 2506.18601 null
2025-06-21 Optimization-Free Patch Attack on Stereo Depth Estimation Hangcheng Liu et.al. 2506.17632 null
2025-06-20 DreamCube: 3D Panorama Generation via Multi-plane Synchronization Yukun Huang et.al. 2506.17206 null
2025-06-20 RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking Teng Guo et.al. 2506.17119 link
2025-06-20 Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping Teng Guo et.al. 2506.17110 null
2025-06-20 DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches Yun Xing et.al. 2506.16690 null
2025-06-19 EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised Training Liangjing Shao et.al. 2506.16017 link
2025-06-18 RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation Xingrui Qin et.al. 2506.15560 null
2025-06-17 Time-Optimized Safe Navigation in Unstructured Environments through Learning Based Depth Completion Jeffrey Mao et.al. 2506.14975 null
2025-06-17 DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning Kunal Swami et.al. 2506.14709 null
2025-06-16 Test3R: Learning to Reconstruct 3D at Test Time Yuheng Yuan et.al. 2506.13750 link
2025-06-16 Multiview Geometric Regularization of Gaussian Splatting for Accurate Radiance Fields Jungeon Kim et.al. 2506.13508 null
2025-06-17 Self-Supervised Enhancement for Depth from a Lightweight ToF Sensor with Monocular Images Laiyan Ding et.al. 2506.13444 link
2025-06-16 TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast Beilei Cui et.al. 2506.13387 link
2025-06-17 3D Hand Mesh-Guided AI-Generated Malformed Hand Refinement with Hand Pose Transformation via Diffusion Model Chen-Bin Feng et.al. 2506.12680 null
2025-06-12 Leveraging 6DoF Pose Foundation Models For Mapping Marine Sediment Burial Jerry Yan et.al. 2506.10386 link
2025-06-11 DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects Guanghu Xie et.al. 2506.09491 null
2025-06-11 MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning Tong Wang et.al. 2506.09327 null
2025-06-10 AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models Zheda Mai et.al. 2506.09082 null
2025-06-10 One Patch to Rule Them All: Transforming Static Patches into Dynamic Attacks in the Physical World Xingshuo Han et.al. 2506.08482 null
2025-06-09 Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence Octave Mariotti et.al. 2506.08220 null
2025-06-09 Hidden in plain sight: VLMs overlook their visual representations Stephanie Fu et.al. 2506.08008 null
2025-06-09 EgoM2P: Egocentric Multimodal Multitask Pretraining Gen Li et.al. 2506.07886 null
2025-06-09 Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images Yingping Liang et.al. 2506.07740 null
2025-06-07 Dark Channel-Assisted Depth-from-Defocus from a Single Image Moushumi Medhi et.al. 2506.06643 null
2025-06-06 NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces Pierluigi Zama Ramirez et.al. 2506.05815 null
2025-06-06 Advancement and Field Evaluation of a Dual-arm Apple Harvesting Robot Keyi Zhu et.al. 2506.05714 null
2025-06-06 Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration Fanhu Zeng et.al. 2506.05709 null
2025-06-06 Aerial Multi-View Stereo via Adaptive Depth Range Inference and Normal Cues Yimei Liu et.al. 2506.05655 null
2025-06-09 Structure-Aware Radar-Camera Depth Estimation Fuyi Zhang et.al. 2506.05008 null
2025-06-05 Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer Filip Slezak et.al. 2506.04908 null
2025-06-05 Toward Better SSIM Loss for Unsupervised Monocular Depth Estimation Yijun Cao et.al. 2506.04758 null
2025-06-04 JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting Yang Xiao et.al. 2506.03872 null
2025-06-04 Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation Joonkyung Kim et.al. 2506.03834 null
2025-06-03 ViT-Split: Unleashing the Power of Vision Foundation Models via Efficient Splitting Heads Yifan Li et.al. 2506.03433 null
2025-06-02 E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models Wenyan Cong et.al. 2506.01933 null
2025-06-01 Perceptual Inductive Bias Is What You Need Before Contrastive Learning Tianqin Li et.al. 2506.01201 null
2025-06-01 Depth-Aware Scoring and Hierarchical Alignment for Multiple Object Tracking Milad Khanchi et.al. 2506.00774 null
2025-05-31 XYZ-IBD: High-precision Bin-picking Dataset for Object 6D Pose Estimation Capturing Real-world Industrial Complexity Junwen Huang et.al. 2506.00599 null
2025-05-31 Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline Zhaoying Wang et.al. 2506.00546 null
2025-05-31 Improving Optical Flow and Stereo Depth Estimation by Leveraging Uncertainty-Based Learning Difficulties Jisoo Jeong et.al. 2506.00324 null
2025-05-30 Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization Qingyao Tian et.al. 2505.24249 null
2025-05-29 Ultrafast High-Flux Single-Photon LiDAR Simulator via Neural Mapping Weijian Zhang et.al. 2505.23992 null
2025-05-29 Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation Sanggyun Ma et.al. 2505.23400 null
2025-05-29 GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion Gwanghyun Kim et.al. 2505.23085 null
2025-05-28 Depth to magnetic source estimation using TDX contour Hammed Oyekan et.al. 2505.22780 null
2025-05-28 Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss Wenjun Lu et.al. 2505.22279 null
2025-05-27 Object Concepts Emerge from Motion Haoqian Liang et.al. 2505.21635 null
2025-05-23 EvidenceMoE: A Physics-Guided Mixture-of-Experts with Evidential Critics for Advancing Fluorescence Light Detection and Ranging in Scattering Media Ismail Erbas et.al. 2505.21532 null
2025-05-27 Occlusion Boundary and Depth: Mutual Enhancement via Multi-Task Learning Lintao Xu et.al. 2505.21231 null
2025-05-27 Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing Dehao Wang et.al. 2505.21049 null
2025-05-27 Spatial RoboGrasp: Generalized Robotic Grasping Control Policy Yiqi Huang et.al. 2505.20814 null
2025-05-26 SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams Zhuoheng Gao et.al. 2505.19487 null
2025-05-25 From Single Images to Motion Policies via Video-Generation Environment Representations Weiming Zhi et.al. 2505.19306 null
2025-05-23 Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues Chinmay Talegaonkar et.al. 2505.17358 null
2025-05-22 MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation Bohan Zhou et.al. 2505.16602 null
2025-05-22 BadDepth: Backdoor Attacks Against Monocular Depth Estimation in the Physical World Ji Guo et.al. 2505.16154 null
2025-05-21 RadarRGBD A Multi-Sensor Fusion Dataset for Perception with RGB-D and mmWave Radar Tieshuai Song et.al. 2505.15860 null
2025-05-21 MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models Yifan Liu et.al. 2505.15185 link
2025-05-20 Diving into the Fusion of Monocular Priors for Generalized Stereo Matching Chengtang Yao et.al. 2505.14414 link
2025-05-20 M3Depth: Wavelet-Enhanced Depth Estimation on Mars via Mutual Boosting of Dual-Modal Data Junjie Li et.al. 2505.14159 null
2025-05-20 Multi-Label Stereo Matching for Transparent Scene Depth Estimation Zhidan Liu et.al. 2505.14008 link
2025-05-20 Event-Driven Dynamic Scene Depth Completion Zhiqiang Yan et.al. 2505.13279 null
2025-05-19 DB3D-L: Depth-aware BEV Feature Transformation for Accurate 3D Lane Detection Yehao Liu et.al. 2505.13266 null
2025-05-20 3D Visual Illusion Depth Estimation Chengtang Yao et.al. 2505.13061 link
2025-05-19 IA-MVS: Instance-Focused Adaptive Depth Sampling for Multi-View Stereo Yinzhe Wang et.al. 2505.12714 null
2025-05-18 Depth Transfer: Learning to See Like a Simulator for Real-World Drone Navigation Hang Yu et.al. 2505.12428 null
2025-05-18 Always Clear Depth: Robust Monocular Depth Estimation under Adverse Weather Kui Jiang et.al. 2505.12199 link
2025-05-17 SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations Songchun Zhang et.al. 2505.11992 null
2025-05-17 MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos Hongyi Zhou et.al. 2505.11868 null
2025-05-16 SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision Utsav Rai et.al. 2505.11439 null
2025-05-16 Attention on the Sphere Boris Bonev et.al. 2505.11157 link
2025-05-15 Depth Anything with Any Prior Zehan Wang et.al. 2505.10565 null
2025-05-15 JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation Tiancong Cheng et.al. 2505.10057 null
2025-05-14 Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis Bingxin Ke et.al. 2505.09358 link
2025-05-13 Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World Yuran Wang et.al. 2505.08607 null
2025-05-13 Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images Ziteng Liu et.al. 2505.08178 null
2025-05-12 Some insights into depth estimators for location and scatter in the multivariate setting Jorge G. Adrover et.al. 2505.07383 null
2025-05-11 Reinforcement Learning-Based Monocular Vision Approach for Autonomous UAV Landing Tarik Houichime et.al. 2505.06963 null
2025-05-10 ElectricSight: 3D Hazard Monitoring for Power Lines Using Low-Cost Sensors Xingchen Li et.al. 2505.06573 null
2025-05-09 Camera-Only Bird’s Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles Anupkumar Bochare et.al. 2505.06113 null
2025-05-09 MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection Zhihao Zhang et.al. 2505.04594 null
2025-05-13 Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach Srecharan Selvam et.al. 2505.03702 null
2025-05-06 LiftFeat: 3D Geometry-Aware Local Feature Matching Yepeng Liu et.al. 2505.03422 link
2025-05-06 VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery Bojin Wu et.al. 2505.02704 link
2025-05-05 DELTA: Dense Depth from Events and LiDAR using Transformer’s Attention Vincent Brebion et.al. 2505.02593 null
2025-05-03 PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth Bu Jin et.al. 2505.01729 null
2025-05-02 LMDepth: Lightweight Mamba-based Monocular Depth Estimation for Real-World Deployment Jiahuan Long et.al. 2505.00980 null
2025-05-01 JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers Kwon Byung-Ki et.al. 2505.00482 link
2025-04-30 HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation Haiyang Zhou et.al. 2504.21650 link
2025-04-30 eNCApsulate: NCA for Precision Diagnosis on Capsule Endoscopes Henry John Krumb et.al. 2504.21562 null
2025-04-29 Real-Time Wayfinding Assistant for Blind and Low-Vision Users Dabbrata Das et.al. 2504.20976 null
2025-04-29 Large-scale visual SLAM for in-the-wild videos Shuo Sun et.al. 2504.20496 null
2025-04-28 MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion Zador Pataki et.al. 2504.20040 link
2025-04-28 Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video Hoang Chuong Nguyen et.al. 2504.19819 null
2025-04-27 Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection Athul M. Mathew et.al. 2504.19271 null
2025-04-26 Depth as Points: Center Point-based Depth Estimation Zhiheng Tu et.al. 2504.18773 null
2025-04-25 LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning Rui Li et.al. 2504.18424 null
2025-04-25 Dense Geometry Supervision for Underwater Depth Estimation Wenxiang Gua et.al. 2504.18233 null
2025-04-25 LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring Raul David Dominguez Sanchez et.al. 2504.18203 null
2025-04-24 The Fourth Monocular Depth Estimation Challenge Anton Obukhov et.al. 2504.17787 null
2025-04-24 Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images Zebo Huang et.al. 2504.17582 null
2025-04-24 Invasion depth estimation of gastric cancer in early stage using circularly polarized light scattering: Phantom studies Mike R. Maskey et.al. 2504.17161 null
2025-04-23 PPS-Ctrl: Controllable Sim-to-Real Translation for Colonoscopy Depth Estimation Xinqi Xiong et.al. 2504.17067 null
2025-04-23 Helping Blind People Grasp: Enhancing a Tactile Bracelet with an Automated Hand Navigation System Marcin Furtak et.al. 2504.16502 null
2025-04-21 MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation Xingxing Zuo et.al. 2504.16127 null
2025-04-22 DERD-Net: Learning Depth from Event-based Ray Densities Diego de Oliveira Hitzges et.al. 2504.15863 null
2025-04-22 VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation Mingxia Zhan et.al. 2504.15095 null
2025-04-21 Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation Chenjie Cao et.al. 2504.14899 link
2025-04-20 Seurat: From Moving Points to Depth Seokju Cho et.al. 2504.14687 link
2025-04-18 Occlusion-Ordered Semantic Instance Segmentation Soroosh Baselizadeh et.al. 2504.14054 null
2025-04-18 Enhancing Pothole Detection and Characterization: Integrated Segmentation and Depth Estimation in Road Anomaly Systems Uthman Baroudi et.al. 2504.13648 null
2025-04-17 Perception Encoder: The best visual embeddings are not at the output of the network Daniel Bolya et.al. 2504.13181 null
2025-04-17 TSGS: Improving Gaussian Splatting for Transparent Surface Reconstruction via Normal and De-lighting Priors Mingwei Li et.al. 2504.12799 null
2025-04-17 Privacy-Preserving Operating Room Workflow Analysis using Digital Twins Alejandra Perez et.al. 2504.12552 null
2025-04-16 Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image Tao Wen et.al. 2504.12103 null
2025-04-16 TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion Yiran Wang et.al. 2504.11773 null
2025-04-16 An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World Xingwu Ji et.al. 2504.11698 link
2025-04-15 Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception Ziqi Pang et.al. 2504.11457 link
2025-04-16 DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation Soyoung Yoo et.al. 2504.11347 null
2025-04-18 Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting Jiaxin Huang et.al. 2504.11092 null
2025-04-13 TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting Zhicong Wu et.al. 2504.09588 null
2025-04-12 Text To 3D Object Generation For Scalable Room Assembly Sonia Laguna et.al. 2504.09328 null
2025-04-11 Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation Bram Vanherle et.al. 2504.08473 link
2025-04-10 Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction Zeren Jiang et.al. 2504.07961 link
2025-04-09 FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution Gene Chou et.al. 2504.07093 link
2025-04-08 POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction Songyan Zhang et.al. 2504.05692 link
2025-04-07 Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and Semidensification Yasuhiro Yao et.al. 2504.05148 link
2025-04-04 3D Scene Understanding Through Local Random Access Sequence Modeling Wanhee Lee et.al. 2504.03875 null
2025-04-04 RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation Hanbo Bi et.al. 2504.03166 null
2025-04-03 All-day Depth Completion via Thermal-LiDAR Fusion Janghyun Kim et.al. 2504.02356 null
2025-04-02 FreSca: Unveiling the Scaling Space in Diffusion Models Chao Huang et.al. 2504.02154 null
2025-04-02 Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis Niluthpol Chowdhury Mithun et.al. 2504.01960 null
2025-04-03 Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting Shu-Wei Lu et.al. 2504.01957 null
2025-04-02 A novel gesture interaction control method for rehabilitation lower extremity exoskeleton Shuang Qiu et.al. 2504.01888 null
2025-04-02 DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image Jijun Xiang et.al. 2504.01596 link
2025-04-01 GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors Tian-Xing Xu et.al. 2504.01016 null
2025-04-01 Monocular and Generalizable Gaussian Talking Head Animation Shengjie Gong et.al. 2504.00665 null
2025-03-31 ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image Tianyi Gong et.al. 2503.23881 null
2025-03-31 Detail-aware multi-view stereo network for depth estimation Haitao Tian et.al. 2503.23684 null
2025-03-30 Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries Wei Xu et.al. 2503.23606 null
2025-03-30 Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model Jannik Endres et.al. 2503.23502 link
2025-03-28 SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations Krispin Wandel et.al. 2503.22462 null
2025-03-28 EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting Xu Wang et.al. 2503.22437 link
2025-03-28 MVSAnywhere: Zero-Shot Multi-View Stereo Sergio Izquierdo et.al. 2503.22430 null
2025-03-28 One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images Byeongjun Kwon et.al. 2503.22351 null
2025-03-28 Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces Wonhyeok Choi et.al. 2503.22209 null
2025-03-28 Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges Ukcheol Shin et.al. 2503.22060 link
2025-03-27 A Unified Image-Dense Annotation Generation Model for Underwater Scenes Hongkai Lin et.al. 2503.21771 link
2025-03-27 ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo Yuxi Hu et.al. 2503.21525 null
2025-03-26 Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors Weilong Yan et.al. 2503.20211 link
2025-03-26 FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion Pihai Sun et.al. 2503.19739 link
2025-03-25 Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving Yusen Xie et.al. 2503.19713 link
2025-03-25 StableGS: A Floater-Free Framework for 3D Gaussian Splatting Luchao Wang et.al. 2503.18458 null
2025-03-24 PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes Xinhua Xu et.al. 2503.18393 null
2025-03-24 MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction Wenyuan Zhang et.al. 2503.18363 null
2025-03-23 Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images Yara AlaaEldin et.al. 2503.17982 link
2025-03-21 Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image Jerred Chen et.al. 2503.17358 null
2025-03-21 Radar-Guided Polynomial Fitting for Metric Depth Estimation Patrick Rim et.al. 2503.17182 null
2025-03-21 AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process Junjie Hu et.al. 2503.17029 null
2025-03-21 Distilling Monocular Foundation Model for Fine-grained Depth Completion Yingping Liang et.al. 2503.16970 null
2025-03-20 QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge Xuan Shen et.al. 2503.16709 link
2025-03-20 A Recipe for Generating 3D Worlds From a Single Image Katja Schwarz et.al. 2503.16611 null
2025-03-20 DreamTexture: Shape from Virtual Texture with Analysis by Augmentation Ananta R. Bhattarai et.al. 2503.16412 null
2025-03-20 Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors Tian Yi Lim et.al. 2503.16275 null
2025-03-20 Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras Beilei Cui et.al. 2503.15917 null
2025-03-20 Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation Jiyuan Wang et.al. 2503.15905 null
2025-03-19 TULIP: Towards Unified Language-Image Pretraining Zineng Tang et.al. 2503.15485 null
2025-03-19 EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining Boshen Xu et.al. 2503.15470 link
2025-03-19 USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network Joseph Emmanuel DL Dayo et.al. 2503.14950 null
2025-03-18 Multi-view Reconstruction via SfM-guided Monocular Depth Estimation Haoyu Guo et.al. 2503.14483 null
2025-03-18 DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers Mert Bulent Sariyildiz et.al. 2503.14405 null
2025-03-18 3D Densification for Multi-Map Monocular VSLAM in Endoscopy X. Anadón et.al. 2503.14346 null
2025-03-17 MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models Johannes Meier et.al. 2503.13743 null
2025-03-17 SED-MVS: Segmentation-Driven and Edge-Aligned Deformation Multi-View Stereo with Depth Restoration and Occlusion Constraint Zhenlong Yuan et.al. 2503.13721 null
2025-03-17 Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios Iryna Repinetska et.al. 2503.13710 null
2025-03-19 FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis Luxi Chen et.al. 2503.13265 null
2025-03-17 MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs Erik Daxberger et.al. 2503.13111 null
2025-03-17 TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image Haoxiao Wang et.al. 2503.12779 null
2025-03-16 UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing Tsu-Jui Fu et.al. 2503.12652 null
2025-03-16 Deblur Gaussian Splatting SLAM Francesco Girlanda et.al. 2503.12572 null
2025-03-16 Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View Xianzu Wu et.al. 2503.12553 link
2025-03-14 VGGT: Visual Geometry Grounded Transformer Jianyuan Wang et.al. 2503.11651 link
2025-03-14 Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation Hongyu Wen et.al. 2503.11633 null
2025-03-14 Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation Fengchen He et.al. 2503.11213 link
2025-03-13 Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations Xunzhi Zheng et.al. 2503.10464 null
2025-03-15 WonderVerse: Extendable 3D Scene Generation with Video Generative Models Hao Feng et.al. 2503.09160 null
2025-03-11 Language-Depth Navigated Thermal and Visible Image Fusion Jinchang Zhang et.al. 2503.08676 null
2025-03-11 CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning Kaiqiang Xiong et.al. 2503.08219 null
2025-03-10 SIRE: SE(3) Intrinsic Rigidity Embeddings Cameron Smith et.al. 2503.07739 null
2025-03-10 LBM: Latent Bridge Matching for Fast Image-to-Image Translation Clément Chadebec et.al. 2503.07535 link
2025-03-12 Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion Mona Sheikh Zeinoddin et.al. 2503.07204 null
2025-03-11 LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation Quanjian Song et.al. 2503.06508 link
2025-03-08 Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity Xiaohao Xu et.al. 2503.06014 link
2025-03-07 TomatoScanner: phenotyping tomato fruit based on only RGB image Xiaobei Zhao et.al. 2503.05568 link
2025-03-07 Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects Justin Yu et.al. 2503.05189 null
2025-03-05 RTFusion: A depth estimation network based on multimodal fusion in challenging scenarios Zelin Meng et.al. 2503.04821 null
2025-03-06 A Novel Solution for Drone Photogrammetry with Low-overlap Aerial Images using Monocular Depth Estimation Jiageng Zhong et.al. 2503.04513 null
2025-03-08 EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images Rohit Menon et.al. 2503.04441 null
2025-03-06 H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision Yunxiao Shi et.al. 2503.04059 null
2025-03-05 Task-Agnostic Attacks Against Vision Foundation Models Brian Pulfer et.al. 2503.03842 link
2025-03-05 Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings Xusheng Du et.al. 2503.03068 null
2025-03-04 RGBSQGrasp: Inferring Local Superquadric Primitives from Single RGB Image for Graspability-Aware Bin Picking Yifeng Xu et.al. 2503.02387 null
2025-03-03 MUSt3R: Multi-view Network for Stereo 3D Reconstruction Yohann Cabon et.al. 2503.01661 link
2025-03-02 Bridging Spectral-wise and Multi-spectral Depth Estimation via Geometry-guided Contrastive Learning Ukcheol Shin et.al. 2503.00793 link
2025-02-28 EndoPBR: Material and Lighting Estimation for Photorealistic Surgical Simulations via Physically-based Rendering John J. Han et.al. 2502.20669 null
2025-02-27 UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler Luigi Piccinelli et.al. 2502.20110 link
2025-02-26 Stellar Models Also Limit Exoplanet Atmosphere Studies in Emission Thomas J. Fauchez et.al. 2502.19585 null
2025-02-26 Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator Xiankang He et.al. 2502.19204 link
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932 null
2025-02-19 Physical Depth-aware Early Accident Anticipation: A Multi-dimensional Visual Feature Fusion Framework Hongpu Huang et.al. 2502.18496 null
2025-02-21 RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes Sicheng Yu et.al. 2502.15633 null
2025-02-20 CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting Qilin Zhang et.al. 2502.14684 link
2025-03-03 Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion Jiangyuan Liu et.al. 2502.14616 link
2025-02-20 Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining Wonhyeok Choi et.al. 2502.14573 null
2025-02-20 OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images Zhichao Zheng et.al. 2502.14279 null
2025-02-18 Pre-training Auto-regressive Robotic Models with 4D Representations Dantong Niu et.al. 2502.13142 null
2025-02-18 SHADeS: Self-supervised Monocular Depth Estimation Through Non-Lambertian Image Decomposition Rema Daher et.al. 2502.12994 link
2025-02-17 Deep Neural Networks for Accurate Depth Estimation with Latent Space Features Siddiqui Muhammad Yasir et.al. 2502.11777 null
2025-02-16 Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation Kunal Swami et.al. 2502.11002 null
2025-02-14 ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences Liyuan Zhu et.al. 2502.10377 null
2025-02-14 RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control Teng Li et.al. 2502.10059 null
2025-02-13 SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest Jack Erhardt et.al. 2502.09528 null
2025-02-17 S$^2$-Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation Quantao Yang et.al. 2502.09389 null
2025-02-13 CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery Chenghao Zhang et.al. 2502.08902 null
2025-02-13 Visual-based spatial audio generation system for multi-speaker environments Xiaojing Liu et.al. 2502.07538 null
2025-02-11 Learning Inverse Laplacian Pyramid for Progressive Depth Completion Kun Wang et.al. 2502.07289 null
2025-02-10 From Image to Video: An Empirical Study of Diffusion Representations Pedro Vélez et.al. 2502.07001 null
2025-02-09 Revisiting Gradient-based Uncertainty for Monocular Depth Estimation Julia Hornauer et.al. 2502.05964 null
2025-02-09 SphereFusion: Efficient Panorama Depth Estimation via Gated Fusion Qingsong Yan et.al. 2502.05859 null
2025-02-05 MetaFE-DE: Learning Meta Feature Embedding for Depth Estimation from Monocular Endoscopic Images Dawei Lu et.al. 2502.03493 null
2025-02-04 DOC-Depth: A novel approach for dense depth ground truth generation Simon de Moreau et.al. 2502.02144 null
2025-02-01 Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding Jingming Xia et.al. 2502.01666 null
2025-02-01 Exploring Representation-Aligned Latent Space for Better Generation Wanghan Xu et.al. 2502.00359 null
2025-02-01 MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model Jihyeok Kim et.al. 2502.00315 null
2025-01-30 Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion Vitor Guizilini et.al. 2501.18804 null
2025-01-25 Snapshot Compressed Imaging Based Single-Measurement Computer Vision for Videos Fengpu Pan et.al. 2501.15122 null
2025-01-24 Rethinking Encoder-Decoder Flow Through Shared Structures Frederik Laboyrie et.al. 2501.14535 null
2025-01-23 IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models Jiayi Lei et.al. 2501.13920 null
2025-01-23 PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments Changhao Wang et.al. 2501.13796 null
2025-01-22 Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation Akshay Krishnan et.al. 2501.13087 null
2025-01-22 Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks Alessio Quercia et.al. 2501.12824 link
2025-01-22 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Sili Chen et.al. 2501.12375 null
2025-01-21 Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging Shuyi Hu et.al. 2501.11884 null
2025-01-21 Survey on Monocular Metric Depth Estimation Jiuling Zhang et.al. 2501.11841 null
2025-01-19 RDG-GS: Relative Depth Guidance with Gaussian Splatting for Real-time Sparse-View 3D Rendering Chenlu Zhan et.al. 2501.11102 null
2025-01-15 BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation Xiaolu Hou et.al. 2501.10462 link
2025-01-20 Zero-Shot Monocular Scene Flow Estimation in the Wild Yiqing Liang et.al. 2501.10357 null
2025-01-17 One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression Keita Miwa et.al. 2501.10064 null
2025-01-17 Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography Mohammed Salah et.al. 2501.09994 link
2025-01-21 FoundationStereo: Zero-Shot Stereo Matching Bowen Wen et.al. 2501.09898 link
2025-01-16 DEFOM-Stereo: Depth Foundation Model Based Stereo Matching Hualie Jiang et.al. 2501.09466 link
2025-01-15 StereoGen: High-quality Stereo Image Generation from a Single Image Xianqi Wang et.al. 2501.08654 null
2025-01-15 MonSter: Marry Monodepth to Stereo Unleashes Power Junda Cheng et.al. 2501.08643 link
2025-01-14 A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation Steven Landgraf et.al. 2501.08188 null
2025-01-14 Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2 Seamie Hayes et.al. 2501.08118 null
2025-01-13 Fixing the Scale and Shift in Monocular Depth For Camera Pose Estimation Yaqing Ding et.al. 2501.07742 link
2025-01-13 Matching Free Depth Recovery from Structured Light Zhuohang Yu et.al. 2501.07113 null
2025-01-09 Relative Pose Estimation through Affine Corrections of Monocular Depth Priors Yifan Yu et.al. 2501.05446 link
2025-01-09 $DPF^*$: improved Depth Potential Function for scale-invariant sulcal depth estimation Maxime Dieudonné et.al. 2501.05436 link
2025-01-09 A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision Ali Rohan et.al. 2501.05147 null
2025-01-08 FatesGS: Fast and Accurate Sparse-View Surface Reconstruction using Gaussian Splatting with Depth-Feature Consistency Han Huang et.al. 2501.04628 null
2025-01-08 FrontierNet: Learning Visual Cues to Explore Boyang Sun et.al. 2501.04597 link
2025-01-07 AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features Ruochen Zhang et.al. 2501.03700 null
2025-01-05 DepthMaster: Taming Diffusion Models for Monocular Depth Estimation Ziyang Song et.al. 2501.02576 link
2025-01-05 Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera Yuliang Guo et.al. 2501.02464 link
2025-01-03 SafeAug: Safety-Critical Driving Data Augmentation from Naturalistic Datasets Zhaobin Mo et.al. 2501.02143 null
2025-01-03 Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery Baoru Huang et.al. 2501.01752 null
2025-01-03 IGAF: Incremental Guided Attention Fusion for Depth Super-Resolution Athanasios Tragakis et.al. 2501.01723 null
2024-12-31 Tech Report: Divide and Conquer 3D Real-Time Reconstruction for Improved IGS Yicheng Zhu et.al. 2501.01465 null
2025-01-02 TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions Vriksha Srihari et.al. 2501.01156 null
2025-01-02 PatchRefiner V2: Fast and Lightweight Real-Domain High-Resolution Metric Depth Estimation Zhenyu Li et.al. 2501.01121 null
2024-12-30 FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI Zhengdong Li et.al. 2412.20974 null
2024-12-29 MetricDepth: Enhancing Monocular Depth Estimation with Deep Metric Learning Chunpu Liu et.al. 2412.20390 null
2024-12-28 Multi-Modality Driven LoRA for Adverse Condition Depth Estimation Guanglei Yang et.al. 2412.20162 null
2024-12-28 DepthMamba with Adaptive Fusion Zelin Meng et.al. 2412.19964 null
2024-12-26 An End-to-End Depth-Based Pipeline for Selfie Image Rectification Ahmed Alhawwary et.al. 2412.19189 null
2024-12-26 Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement Qiude Zhang et.al. 2412.19165 null
2024-12-26 MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo Byeonggwon Lee et.al. 2412.19130 null
2024-12-26 Learning Monocular Depth from Events via Egomotion Compensation Haitao Meng et.al. 2412.19067 null
2024-12-24 RSGaussian: 3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis Yiling Yao et.al. 2412.18380 null
2024-12-23 V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy Long Bai et.al. 2412.17595 null
2024-12-22 GeoTexDensifier: Geometry-Texture-Aware Densification for High-Quality Photorealistic 3D Gaussian Splatting Hanqing Jiang et.al. 2412.16809 null
2024-12-27 LiRCDepth: Lightweight Radar-Camera Depth Estimation via Knowledge Distillation and Uncertainty Guidance Huawei Sun et.al. 2412.16380 link
2024-12-19 Flowing from Words to Pixels: A Framework for Cross-Modality Evolution Qihao Liu et.al. 2412.15213 null
2024-12-19 Scaling 4D Representations João Carreira et.al. 2412.15212 null
2024-12-18 Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation Rémi Marsal et.al. 2412.14103 null
2024-12-18 Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation Haotong Lin et.al. 2412.14015 link
2024-12-18 Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion Massimiliano Viola et.al. 2412.13389 null
2024-12-18 Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera Zhengdi Yu et.al. 2412.12861 null
2024-12-17 PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts Kun Guo et.al. 2412.12460 null
2024-12-16 V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations Jin-Cheng Jhang et.al. 2412.11412 null
2024-12-16 Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video Junkai Fan et.al. 2412.11395 null
2024-12-15 ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction Yi Feng et.al. 2412.11210 link
2024-12-14 MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance Wenjun Huang et.al. 2412.10730 null
2024-12-12 Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos Linyi Jin et.al. 2412.09621 null
2024-12-12 T-SVG: Text-Driven Stereoscopic Video Generation Qiao Jin et.al. 2412.09323 null
2024-12-12 Cross-View Completion Models are Zero-shot Correspondence Estimators Honggyu An et.al. 2412.09072 null
2024-12-11 BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation Shengze Wang et.al. 2412.08640 null
2024-12-13 Utilizing Multi-step Loss for Single Image Reflection Removal Abdelrahman Elnenaey et.al. 2412.08582 link
2024-12-11 Combining Neural Fields and Deformation Models for Non-Rigid 3D Motion Reconstruction from Partial Data Aymen Merrouche et.al. 2412.08511 null
2024-12-11 Dense Depth from Event Focal Stack Kenta Horikawa et.al. 2412.08120 null
2024-12-10 Diffusion-Based Attention Warping for Consistent 3D Scene Editing Eyal Gomel et.al. 2412.07984 null
2024-12-10 Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation Kurt H. W. Stolle et.al. 2412.07966 null
2024-12-09 SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception Yaniv Benny et.al. 2412.06968 null
2024-12-09 Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving Xin Fei et.al. 2412.06777 link
2024-12-09 MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views Antoine Guédon et.al. 2412.06767 null
2024-12-09 On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events Jesse Hagenaars et.al. 2412.06359 null
2024-12-09 Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction Dongxu Wei et.al. 2412.06273 null
2024-12-09 Event fields: Capturing light fields at high speed, resolution, and dynamic range Ziyuan Qu et.al. 2412.06191 null
2024-12-08 GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion Karlo Koledic et.al. 2412.06080 null
2024-12-08 Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors Alex Rich et.al. 2412.05771 null
2024-12-10 TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action Zixian Ma et.al. 2412.05479 link
2024-12-06 SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB Images Jiahua Dong et.al. 2412.05274 null
2024-12-06 Penetrative rotating magnetoconvection subject to lateral variations in temperature gradients Tirtharaj Barman et.al. 2412.05235 null
2024-12-06 PanoDreamer: 3D Panorama Synthesis from a Single Image Avinash Paliwal et.al. 2412.04827 link
2024-12-05 LAA-Net: A Physical-prior-knowledge Based Network for Robust Nighttime Depth Estimation Kebin Peng et.al. 2412.04666 null
2024-12-05 Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail Luca Bartolomei et.al. 2412.04472 link
2024-12-05 MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos Zhengqi Li et.al. 2412.04463 null
2024-12-05 MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction Mithun Parab et.al. 2412.03928 null
2024-12-04 Perception Tokens Enhance Visual Reasoning in Multimodal Language Models Mahtab Bigverdi et.al. 2412.03548 null
2024-12-04 Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter Hermes McGriff et.al. 2412.03518 null
2024-12-04 2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction Wanting Zhang et.al. 2412.03428 null
2024-12-04 MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction Gangjian Zhang et.al. 2412.03103 null
2024-12-05 Align3R: Aligned Monocular Depth Estimation for Dynamic Videos Jiahao Lu et.al. 2412.03079 null
2024-12-03 Single-Shot Metric Depth from Focused Plenoptic Cameras Blanca Lasheras-Hernandez et.al. 2412.02386 null
2024-12-03 Dual Exposure Stereo for Extended Dynamic Range 3D Imaging Juhyung Choi et.al. 2412.02351 null
2024-12-03 Amodal Depth Anything: Amodal Depth Estimation in the Wild Zhenyu Li et.al. 2412.02336 null
2024-12-03 GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos Zhiyuan Chen et.al. 2412.02267 null
2024-12-03 FoveaSPAD: Exploiting Depth Priors for Adaptive and Efficient Single-Photon 3D Imaging Justin Folden et.al. 2412.02052 null
2024-12-02 Multi-View 3D Reconstruction using Knowledge Distillation Aditya Dutt et.al. 2412.02039 link
2024-12-02 AVS-Net: Audio-Visual Scale Net for Self-supervised Monocular Metric Depth Estimation Xiaohu Liu et.al. 2412.01637 null
2024-12-02 STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation Sunghun Yang et.al. 2412.01090 null
2024-12-01 FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation Yunpeng Bai et.al. 2412.00671 null
2024-11-29 SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection Philipp Wolters et.al. 2411.19860 null
2024-11-29 MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications Gasser Elazab et.al. 2411.19717 null
2024-11-29 Gaussian Splashing: Direct Volumetric Rendering Underwater Nir Mualem et.al. 2411.19588 null
2024-11-28 Learning Surrogate Rainfall-driven Inundation Models with Few Data Marzieh Alireza Mirhoseini et.al. 2411.19323 null
2024-11-28 AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones Xuqian Ren et.al. 2411.19271 null
2024-11-28 Video Depth without Video Models Bingxin Ke et.al. 2411.19189 null
2024-11-28 360Recon: An Accurate Reconstruction Method Based on Depth Fusion from 360 Images Zhongmiao Yan et.al. 2411.19102 null
2024-11-27 Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation Mehdi Zayene et.al. 2411.18335 link
2024-11-27 GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation Wenbo Cui et.al. 2411.18276 null
2024-11-27 SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation Duc-Hai Pham et.al. 2411.18229 null
2024-11-26 Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation Sudarshan Rajagopalan et.al. 2411.17814 null
2024-11-26 Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors Ziang Xu et.al. 2411.17790 null
2024-11-26 DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting Christian Homeyer et.al. 2411.17660 link
2024-11-26 Spatially Visual Perception for End-to-End Robotic Learning Travis Davies et.al. 2411.17458 null
2024-11-26 DepthCues: Evaluating Monocular Depth Perception in Large Vision Models Duolikun Danier et.al. 2411.17385 null
2024-11-26 Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration Junyuan Deng et.al. 2411.17240 link
2024-11-25 G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs Kunyi Li et.al. 2411.16898 null
2024-11-24 PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation Ziyao Zeng et.al. 2411.16750 null
2024-11-25 Generative Omnimatte: Learning to Decompose Video into Layers Yao-Chih Lee et.al. 2411.16683 null
2024-11-25 One Diffusion to Generate Them All Duong H. Le et.al. 2411.16318 link
2024-11-24 Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors Soumava Paul et.al. 2411.15966 null
2024-11-21 StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart Jian Shi et.al. 2411.14295 link
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291 null
2024-11-20 OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging Rajini Makam et.al. 2411.13230 link
2024-11-15 SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction Yutao Tang et.al. 2411.12592 link
2024-11-18 Towards Degradation-Robust Reconstruction in Generalizable NeRF Chan Ho Park et.al. 2411.11691 null
2024-11-18 MGNiceNet: Unified Monocular Geometric Scene Understanding Markus Schön et.al. 2411.11466 null
2024-11-18 The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather Markus Schön et.al. 2411.11455 null
2024-11-18 GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views Boyao Zhou et.al. 2411.11363 null
2024-11-18 Scalable Autoregressive Monocular Depth Estimation Jinhong Wang et.al. 2411.11361 null
2024-11-16 MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation Ansh Shah et.al. 2411.10886 link
2024-11-19 EVT: Efficient View Transformation for Multi-Modal 3D Object Detection Yongjin Lee et.al. 2411.10715 null
2024-11-15 Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses Yongfan Liu et.al. 2411.10013 link
2024-11-14 Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting Yian Wang et.al. 2411.09823 null
2024-11-14 Adversarial Attacks Using Differentiable Rendering: A Survey Matthew Hull et.al. 2411.09749 null
2024-11-14 Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching Yuran Wang et.al. 2411.09151 null
2024-11-13 OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances Youqi Liao et.al. 2411.08665 link
2024-11-09 Online Collision Risk Estimation via Monocular Depth-Aware Object Detectors and Fuzzy Inference Brian Hsuan-Cheng Liao et.al. 2411.08060 null
2024-11-13 Scaling Properties of Diffusion Models for Perceptual Tasks Rahul Ravishankar et.al. 2411.08034 null
2024-11-11 $SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation Yinshuang Xu et.al. 2411.07326 null
2024-11-08 Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning Quang Truong Nguyen et.al. 2411.05344 null
2024-11-08 SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection Yun Zhao et.al. 2411.05292 null
2024-11-07 D$^3$epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes Siyu Chen et.al. 2411.04826 null
2024-11-06 Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation Teppei Kurita et.al. 2411.04714 null
2024-11-07 Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation Qingyao Tian et.al. 2411.04404 null
2024-11-04 PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes Kebin Peng et.al. 2411.04227 null
2024-11-06 Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions Zihan Qin et.al. 2411.03638 null
2024-11-05 Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor Anish Bhattacharya et.al. 2411.03303 null
2024-11-05 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation Matthias Bartolo et.al. 2411.02844 link
2024-11-05 FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training Ruihong Yin et.al. 2411.02229 null
2024-11-05 Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training Yuanqi Yao et.al. 2411.02149 null
2024-11-02 MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction Wang Zhao et.al. 2411.01226 link
2024-11-01 MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes Sanghyun Byun et.al. 2411.01048 null
2024-11-01 On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR Li Li et.al. 2411.00600 link
2024-10-31 Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving Ce Zhou et.al. 2411.00192 null
2024-10-31 ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images Timing Yang et.al. 2410.24001 link
2024-10-30 Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe Songyu Xu et.al. 2410.23154 null
2024-10-29 Active Event Alignment for Monocular Distance Estimation Nan Cai et.al. 2410.22280 null
2024-10-29 PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting Sunghwan Hong et.al. 2410.22128 link
2024-10-27 Unlocking Comics: The AI4VA Dataset for Visual Understanding Peter Grönquist et.al. 2410.20459 link
2024-10-27 Depth Attention for Robust RGB Tracking Yu Liu et.al. 2410.20395 link
2024-10-21 YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning Ranjan Sapkota et.al. 2410.19846 null
2024-10-25 MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors Fanqi Pu et.al. 2410.19590 link
2024-10-24 Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction Hongxin Peng et.al. 2410.18433 null
2024-10-24 Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images Dong-Guw Lee et.al. 2410.18340 link
2024-10-25 UnCLe: Unsupervised Continual Learning of Depth Completion Suchisrit Gangopadhyay et.al. 2410.18074 null
2024-10-21 TIPS: Text-Image Pretraining with Spatial Awareness Kevis-Kokitsi Maninis et.al. 2410.16512 null
2024-10-22 DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain Kun Wang et.al. 2410.14980 link
2024-10-17 DepthSplat: Connecting Gaussian Splatting and Depth Haofei Xu et.al. 2410.13862 link
2024-10-16 DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning Jiabao Wei et.al. 2410.12501 null
2024-10-16 Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture Dabbrata Das et.al. 2410.11610 link
2024-10-16 CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction Pranav Gupta et.al. 2410.11211 link
2024-10-14 Few-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting Raja Kumar et.al. 2410.11080 link
2024-10-14 When Does Perceptual Alignment Benefit Vision Representations? Shobhita Sundaram et.al. 2410.10817 null
2024-10-14 Depth Any Video with Scalable Synthetic Data Honghui Yang et.al. 2410.10815 link
2024-10-15 Improved Depth Estimation of Bayesian Neural Networks Bart van Erp et.al. 2410.10395 link
2024-10-10 Color-Guided Flying Pixel Correction in Depth Images Ekamresh Vasudevan et.al. 2410.08084 link
2024-10-09 Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models Ange Lou et.al. 2410.07434 null
2024-10-09 Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation Runze Chen et.al. 2410.06982 null
2024-10-09 Analysis of different disparity estimation techniques on aerial stereo image datasets Ishan Narayan et.al. 2410.06711 null
2024-10-08 Vision Transformer based Random Walk for Group Re-Identification Guoqing Zhang et.al. 2410.05808 null
2024-10-08 CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality Wenjie Chang et.al. 2410.05735 null
2024-10-07 PhotoReg: Photometrically Registering 3D Gaussian Splatting Models Ziwen Yuan et.al. 2410.05044 null
2024-10-06 Mode-GS: Monocular Depth Guided Anchored 3D Gaussian Splatting for Robust Ground-View Scene Rendering Yonghan Lee et.al. 2410.04646 null
2024-10-10 Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy Pengcheng Chen et.al. 2410.04041 null
2024-10-04 Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering Laura Fink et.al. 2410.03861 link
2024-10-03 DecTrain: Deciding When to Train a DNN Online Zih-Sing Fu et.al. 2410.02980 null
2024-10-03 RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions Ziyao Zeng et.al. 2410.02924 link
2024-10-02 Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Aleksei Bochkovskii et.al. 2410.02073 link
2024-10-02 Learning from the Giants: A Practical Approach to Underwater Depth and Surface Normals Estimation Alzayat Saleh et.al. 2410.02072 null
2024-10-02 SinkSAM: A Monocular Depth-Guided SAM Framework for Automatic Sinkhole Segmentation Osher Rafaeli et.al. 2410.01473 link
2024-10-01 Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation Shuting Zhao et.al. 2410.00979 null
2024-10-01 Radar Meets Vision: Robustifying Monocular Metric Depth Prediction for Mobile Robotics Marco Job et.al. 2410.00736 null
2024-10-01 Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration Yida Lin et.al. 2410.00503 null
2024-10-01 Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance Hongchao Shu et.al. 2410.00386 null
2024-09-30 CCDepth: A Lightweight Self-supervised Depth Estimation Network with Enhanced Interpretability Xi Zhang et.al. 2409.19933 null
2024-09-30 EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth Prediction Ivan Reyes-Amezcua et.al. 2409.19930 link
2024-09-29 fCOP: Focal Length Estimation from Category-level Object Priors Xinyue Zhang et.al. 2409.19641 null
2024-09-29 KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation Soofiyan Atar et.al. 2409.19490 null
2024-09-27 Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping Anthony A. Song et.al. 2409.19153 null
2024-09-26 Self-supervised Monocular Depth Estimation with Large Kernel Attention Xuezhi Xiang et.al. 2409.17895 null
2024-09-26 Self-Distilled Depth Refinement with Noisy Poisson Fusion Jiaqi Li et.al. 2409.17880 link
2024-09-27 A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts Aurel Pjetri et.al. 2409.17851 null
2024-09-26 Event-based Stereo Depth Estimation: A Survey Suman Ghosh et.al. 2409.17680 null
2024-09-26 CAMOT: Camera Angle-aware Multi-Object Tracking Felix Limanta et.al. 2409.17533 null
2024-09-25 Optical Lens Attack on Deep Learning Based Monocular Depth Estimation Ce Zhou et.al. 2409.17376 null
2024-09-25 Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation Richard D. Paul et.al. 2409.17085 null
2024-09-25 EventHDR: from Event to High-Speed HDR Videos and Beyond Yunhao Zou et.al. 2409.17029 null
2024-09-25 3DDX: Bone Surface Reconstruction from a Single Standard-Geometry Radiograph via Dual-Face Depth Estimation Yi Gu et.al. 2409.16702 link
2024-09-24 MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling Yifang Men et.al. 2409.16160 null
2024-09-24 Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data An Wang et.al. 2409.16063 link
2024-09-23 FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera Guoyang Zhao et.al. 2409.15054 link
2024-09-23 DepthART: Monocular Depth Estimation as Autoregressive Refinement Task Bulat Gabdullin et.al. 2409.15010 null
2024-09-23 Generalizing monocular colonoscopy image depth estimation by uncertainty-based global and local fusion network Sijia Du et.al. 2409.15006 null
2024-09-23 GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth Aurélien Cecille et.al. 2409.14850 link
2024-09-23 Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras Ming Li et.al. 2409.14766 null
2024-09-25 D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation Songlin Wei et.al. 2409.14365 null
2024-09-22 MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views Wangze Xu et.al. 2409.14316 null
2024-09-21 @Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology Xin Jiang et.al. 2409.14215 null
2024-09-18 Panoptic-Depth Forecasting Juana Valeria Hurtado et.al. 2409.12008 null
2024-09-17 Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Gonzalo Martin Garcia et.al. 2409.11355 link
2024-09-15 GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion Vitor Guizilini et.al. 2409.09896 null
2024-09-15 Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation Xiaolong Qian et.al. 2409.09754 link
2024-09-13 PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage Denis Zavadski et.al. 2409.09144 link
2024-09-23 Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding Rania Hossam et.al. 2409.08695 link
2024-09-12 Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor Andrea Conti et.al. 2409.08277 null
2024-09-12 LED: Light Enhanced Depth Estimation at Night Simon de Moreau et.al. 2409.08031 link
2024-09-12 Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes Ming Li et.al. 2409.07843 null
2024-09-12 Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy Bojian Li et.al. 2409.07723 null
2024-09-12 FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments Devansh Dhrafani et.al. 2409.07715 null
2024-09-10 Deep Neural Networks: Multi-Classification and Universal Approximation Martín Hernández et.al. 2409.06555 null
2024-09-10 EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation Nischal Khanal et.al. 2409.06183 link
2024-09-11 EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels Qingyao Tian et.al. 2409.05442 link
2024-09-09 Spontaneous magnetic field and disorder effects in BaPtAs$_{1-x}$Sb$_x$ with honeycomb network T. Adachi et.al. 2409.05266 null
2024-09-08 TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs Horatiu Florea et.al. 2409.05142 null
2024-09-12 Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective Tim Bader et.al. 2409.04086 link
2024-09-08 Estimating Indoor Scene Depth Maps from Ultrasonic Echoes Junpei Honma et.al. 2409.03336 null
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838 null
2024-09-02 GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling Huawei Sun et.al. 2409.02720 link
2024-09-04 Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects Kyungmin Jo et.al. 2409.02653 null
2024-09-04 UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching Soomin Kim et.al. 2409.02545 null
2024-09-04 SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction Sumin Son et.al. 2409.02513 null
2024-09-04 Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation Li Liu et.al. 2409.02494 link
2024-09-04 Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization Cho-Ying Wu et.al. 2409.02486 null
2024-09-04 GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving Huasong Han et.al. 2409.02382 null
2024-09-03 DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos Wenbo Hu et.al. 2409.02095 link
2024-09-02 Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling Haicheng Liao et.al. 2409.01256 null
2024-08-30 DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model Mona Sheikh Zeinoddin et.al. 2408.17433 link
2024-08-30 Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method Yuji Lin et.al. 2408.17339 link
2024-08-30 Synthetic Lunar Terrain: A Multimodal Open Dataset for Training and Evaluating Neuromorphic Vision Algorithms Marcus Märtens et.al. 2408.16971 null
2024-08-29 EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More Kanghao Chen et.al. 2408.16254 null
2024-08-30 Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective Zhijie Shen et.al. 2408.16227 link
2024-08-27 Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack Naufal Suryanto et.al. 2408.14879 link
2024-08-26 NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training Albert Luginov et.al. 2408.14177 link
2024-08-26 Pixel-Aligned Multi-View Generation with Depth Guided Decoder Zhenggang Tang et.al. 2408.14016 null
2024-08-25 TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers Chuanrui Zhang et.al. 2408.13770 null
2024-08-25 InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth Cho-Ying Wu et.al. 2408.13708 null
2024-08-25 SeeBelow: Sub-dermal 3D Reconstruction of Tumors with Surgical Robotic Palpation and Tactile Exploration Raghava Uppuluri et.al. 2408.13699 null
2024-08-27 Sapiens: Foundation for Human Vision Models Rawal Khirodkar et.al. 2408.12569 null
2024-08-21 LiFCal: Online Light Field Camera Calibration via Bundle Adjustment Aymeric Fleith et.al. 2408.11682 null
2024-08-19 Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video Shuxian Wang et.al. 2408.10153 link
2024-08-19 SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition Wiktor Mucha et.al. 2408.10037 link
2024-08-19 P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders Xuechao Chen et.al. 2408.10007 link
2024-08-14 Enhanced Scale-aware Depth Estimation for Monocular Endoscopic Scenes with Geometric Modeling Ruofeng Wei et.al. 2408.07266 null
2024-08-12 Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces Junrui Zhang et.al. 2408.06083 null
2024-08-08 Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation Daniele Rege Cambrin et.al. 2408.04523 link
2024-08-08 Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework Subhasis Dasgupta et.al. 2408.04360 null
2024-08-08 Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform Daniel Vargas et.al. 2408.04195 null
2024-08-07 Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach Benedikt W. Hosp et.al. 2408.03591 null
2024-08-06 BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications G. Manni et.al. 2408.03078 link
2024-08-05 Gaussian Mixture based Evidential Learning for Stereo Matching Weide Liu et.al. 2408.02796 null
2024-08-05 Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining Dongyang Liu et.al. 2408.02657 link
2024-08-03 MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas Feng Qiao et.al. 2408.01653 null
2024-08-02 Self-Supervised Depth Estimation Based on Camera Models Jinchang Zhang et.al. 2408.01565 null
2024-08-01 MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection Youjia Fu et.al. 2408.00438 null
2024-08-01 High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior Wencheng Han et.al. 2408.00361 null
2024-08-01 LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting Zhenyu Bao et.al. 2408.00254 null
2024-07-31 Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching Pengjie Zhang et.al. 2407.21735 null
2024-07-29 BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation Kieran Saunders et.al. 2407.20437 null
2024-07-29 Analysis and Improvement of Rank-Ordered Mean Algorithm in Single-Photon LiDAR William C. Yau et.al. 2407.20399 null
2024-07-29 Improving 2D Feature Representations by 3D-Aware Fine-Tuning Yuanwen Yue et.al. 2407.20229 null
2024-07-27 Revisit Self-supervised Depth Estimation with Local Structure-from-Motion Shengjie Zhu et.al. 2407.19166 null
2024-07-27 RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry Shengjie Zhu et.al. 2407.19154 null
2024-07-26 HybridDepth: Robust Depth Fusion for Mobile AR by Leveraging Depth from Focus and Single-Image Priors Ashkan Ganj et.al. 2407.18443 link
2024-07-26 Enhanced Depth Estimation and 3D Geometry Reconstruction using Bayesian Helmholtz Stereopsis with Belief Propagation Razieh Azizi et.al. 2407.18195 null
2024-07-25 BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation Xiang Zhang et.al. 2407.17952 null
2024-07-25 UMono: Physical Model Informed Hybrid CNN-Transformer Framework for Underwater Monocular Depth Estimation Jian Wang et.al. 2407.17838 null
2024-07-24 DarSwin-Unet: Distortion Aware Encoder-Decoder Architecture Akshaya Athwale et.al. 2407.17328 null
2024-07-24 Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches Chenxing Zhao et.al. 2407.17312 null
2024-07-23 SINDER: Repairing the Singular Defects of DINOv2 Haoqi Wang et.al. 2407.16826 link
2024-07-23 Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions Fabio Tosi et.al. 2407.16698 link
2024-07-23 ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation Zhenhua Wu et.al. 2407.16508 null
2024-07-19 Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation Jinfeng Liu et.al. 2407.14126 link
2024-07-18 Unveiling the purely young star formation history of the SMC’s northeastern shell from colour-magnitude diagram fitting Joanna D. Sakowska et.al. 2407.13876 null
2024-07-18 Many Perception Tasks are Highly Redundant Functions of their Input Data Rahul Ramesh et.al. 2407.13841 null
2024-07-18 Shape of Motion: 4D Reconstruction from a Single Video Qianqian Wang et.al. 2407.13764 null
2024-07-18 Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks Antoni Kowalczuk et.al. 2407.12588 link
2024-07-16 Temporally Consistent Stereo Matching Jiaxi Zeng et.al. 2407.11950 link
2024-07-15 IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation Yuanhao Zhai et.al. 2407.10937 link
2024-07-15 OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection Jinghua Hou et.al. 2407.10753 link
2024-07-15 Towards Scale-Aware Full Surround Monodepth with Transformers Yuchen Yang et.al. 2407.10406 null
2024-07-12 ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion Sungmin Woo et.al. 2407.09303 link
2024-07-11 ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation Ruijie Zhu et.al. 2407.08187 link
2024-07-10 Controlling Space and Time with Diffusion Models Daniel Watson et.al. 2407.07860 null
2024-07-07 SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning Yi Feng et.al. 2407.05283 link
2024-07-05 A Physical Model-Guided Framework for Underwater Image Enhancement and Depth Estimation Dazhao Du et.al. 2407.04230 link
2024-07-04 Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation Laiyan Ding et.al. 2407.04041 link
2024-07-02 Parametric Modeling and Estimation of Photon Registrations for 3D Imaging Weijian Zhang et.al. 2407.02712 null
2024-07-02 Depth-Aware Endoscopic Video Inpainting Francis Xiatian Zhang et.al. 2407.02675 link
2024-07-04 Camera-LiDAR Cross-modality Gait Recognition Wenxuan Guo et.al. 2407.02038 null
2024-07-07 CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation Huawei Sun et.al. 2407.00697 link
2024-06-28 Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey Uchitha Rajapaksha et.al. 2406.19675 null
2024-06-27 What Matters in Detecting AI-Generated Videos like Sora? Chirui Chang et.al. 2406.19568 null
2024-07-05 360 in the Wild: Dataset for Depth Prediction and View Synthesis Kibaek Park et.al. 2406.18898 null
2024-06-27 Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach Yuxiang Huang et.al. 2406.18837 null
2024-06-26 MultiDiff: Consistent Novel View Synthesis from a Single Image Norman Müller et.al. 2406.18524 null
2024-06-26 DoubleTake: Geometry Guided Depth Estimation Mohamed Sayed et.al. 2406.18387 null
2024-06-25 Depth-Guided Semi-Supervised Instance Segmentation Xin Chen et.al. 2406.17413 null
2024-06-20 Uncertainty and Self-Supervision in Single-View Depth Javier Rodriguez-Puigvert et.al. 2406.14226 null
2024-06-19 WaterMono: Teacher-Guided Anomaly Masking and Enhancement Boosting for Robust Underwater Self-Supervised Monocular Depth Estimation Yilin Ding et.al. 2406.13344 link
2024-06-18 Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation Ning-Hsu Wang et.al. 2406.12849 null
2024-06-21 GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models Yongtao Ge et.al. 2406.12671 link
2024-06-17 DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features Letian Wang et.al. 2406.12095 null
2024-06-17 MEDeA: Multi-view Efficient Depth Adjustment Mikhail Artemyev et.al. 2406.12048 null
2024-06-16 Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry Boris Chidlovskii et.al. 2406.11019 null
2024-06-16 3D Gaze Tracking for Studying Collaborative Interactions in Mixed-Reality Environments Eduardo Davalos et.al. 2406.11003 null
2024-06-15 GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR Bharat Singh et.al. 2406.10722 null
2024-06-14 The BabyView dataset: High-resolution egocentric videos of infants’ and young children’s everyday experiences Bria Long et.al. 2406.10447 null
2024-06-14 D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video Moritz Kappel et.al. 2406.10078 null
2024-06-14 DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications Li Li et.al. 2406.10068 link
2024-06-14 Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion Runze Liu et.al. 2406.09782 null
2024-06-13 Depth Anything V2 Lihe Yang et.al. 2406.09414 link
2024-06-14 WonderWorld: Interactive 3D Scene Generation from a Single Image Hong-Xing Yu et.al. 2406.09394 null
2024-06-13 Scale-Invariant Monocular Depth Estimation via SSI Depth S. Mahdi H. Miangoleh et.al. 2406.09374 link
2024-06-13 Multiple Prior Representation Learning for Self-Supervised Monocular Depth Estimation via Hybrid Transformer Guodong Sun et.al. 2406.08928 link
2024-06-13 ToSA: Token Selective Attention for Efficient Vision Transformers Manish Kumar Singh et.al. 2406.08816 null
2024-06-11 Back to the Color: Learning Depth to Specific Color Transformation for Unsupervised Depth Estimation Yufan Zhu et.al. 2406.07741 link
2024-06-11 PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow Joshua Tokarsky et.al. 2406.07667 null
2024-06-11 RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks Zhechao Wang et.al. 2406.07032 null
2024-06-10 PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation Zhenyu Li et.al. 2406.06679 null
2024-06-10 Visual-Inertial SLAM as Simple as A, B, VINS Nathaniel Merrill et.al. 2406.05969 null
2024-06-09 Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks Zhiyuan Cheng et.al. 2406.05857 link
2024-06-09 RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering Rui Zhang et.al. 2406.05852 null
2024-06-07 Normal-guided Detail-Preserving Neural Implicit Functions for High-Fidelity 3D Surface Reconstruction Aarya Patel et.al. 2406.04861 null
2024-06-07 UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection Yuchao Wang et.al. 2406.04647 null
2024-06-06 MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation Ionuţ Grigore et.al. 2406.04532 null
2024-06-06 Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image Stanislaw Szymanowicz et.al. 2406.04343 link
2024-06-06 Neural Surface Reconstruction from Sparse Views Using Epipolar Geometry Kaichen Zhou et.al. 2406.04301 null
2024-06-04 VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors Markus Plack et.al. 2406.02552 null
2024-06-03 L-MAGIC: Language Model Assisted Generation of Images with Coherence Zhipeng Cai et.al. 2406.01843 link
2024-06-04 Learning Temporally Consistent Video Depth from Video Diffusion Priors Jiahao Shao et.al. 2406.01493 null
2024-06-03 Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry Takayuki Kanai et.al. 2406.00929 null
2024-06-01 MoDGS: Dynamic Gaussian Splatting from Casually-captured Monocular Videos Qingming Liu et.al. 2406.00434 null
2024-05-30 Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian Wei Sun et.al. 2405.19657 null
2024-05-28 Hybrid Multi-Head Physics-informed Neural Network for Depth Estimation in Terahertz Imaging Mingjun Xiang et.al. 2405.18317 null
2024-05-27 Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation Amir El-Ghoussani et.al. 2405.17704 link
2024-05-27 Benchmarking and Improving Bird’s Eye View Perception Robustness in Autonomous Driving Shaoyuan Xie et.al. 2405.17426 link
2024-05-27 All-day Depth Completion Vadim Ezhov et.al. 2405.17315 null
2024-05-27 GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping Junyoung Seo et.al. 2405.17251 link
2024-05-27 SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing Yong-Qiang Mao et.al. 2405.17140 null
2024-05-27 DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge Yifan Mao et.al. 2405.17102 null
2024-05-27 Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation Steven Landgraf et.al. 2405.17097 null
2024-05-27 DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation Mengtan Zhang et.al. 2405.16960 link
2024-05-27 ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection Ziying Song et.al. 2405.16873 null
2024-05-27 Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations Jingguo Liu et.al. 2405.16858 null
2024-05-26 Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians Erik Sandström et.al. 2405.16544 link
2024-05-24 Transparent Object Depth Completion Yifan Zhou et.al. 2405.15299 null
2024-05-24 MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method Pan Liao et.al. 2405.15176 null
2024-05-23 EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting Jiaxu Wang et.al. 2405.14959 link
2024-05-23 Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks Xingguang Jiang et.al. 2405.14520 null
2024-05-23 MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes Ruiyuan Gao et.al. 2405.14475 null
2024-05-23 Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning Zhenyu Wei et.al. 2405.14195 null
2024-05-21 Cross-spectral Gated-RGB Stereo Depth Estimation Samuel Brucker et.al. 2405.12759 null
2024-05-20 Depth Reconstruction with Neural Signed Distance Fields in Structured Light Systems Rukun Qiao et.al. 2405.12006 null
2024-05-20 Depth Prompting for Sensor-Agnostic Depth Estimation Jin-Hwi Park et.al. 2405.11867 null
2024-05-19 CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs Zidong Cao et.al. 2405.11564 null
2024-05-18 Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models Madhu Vankadari et.al. 2405.11158 link
2024-05-17 FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation Fei Wang et.al. 2405.10885 link
2024-05-17 Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory Jonas Kälble et.al. 2405.10575 link
2024-05-16 Towards Task-Compatible Compressible Representations Anderson de Andrade et.al. 2405.10244 link
2024-05-16 KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment Zhengxu Shi et.al. 2405.09964 null
2024-05-14 CLIP with Quality Captions: A Strong Pretraining for Vision Tasks Pavan Kumar Anasosalu Vasu et.al. 2405.08911 null
2024-05-14 The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition Lingdong Kong et.al. 2405.08816 null
2024-05-14 EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera Beilei Cui et.al. 2405.08672 link
2024-05-13 SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling Yijun Yuan et.al. 2405.07847 null
2024-05-11 TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization Zhen Tan et.al. 2405.07027 link
2024-05-11 Learning Monocular Depth from Focus with Event Focal Stack Chenxu Jiang et.al. 2405.06944 null

Optical flow

Publish Date Title Authors PDF Code
2025-07-14 Well-posedness of an optical flow based optimal control formulation for image registration Johannes Haubner et.al. 2507.10188 null
2025-07-14 Taming Modern Point Tracking for Speckle Tracking Echocardiography via Impartial Motion Md Abulkalam Azad et.al. 2507.10127 null
2025-07-11 Taming generative video models for zero-shot optical flow extraction Seungwoo Kim et.al. 2507.09082 null
2025-07-11 An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan Mengyuan Liu et.al. 2507.08690 null
2025-07-11 PanMatch: Unleashing the Potential of Large Vision Models for Unified Matching Models Yongjian Zhang et.al. 2507.08400 null
2025-07-11 MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion Jihao Gu et.al. 2507.08344 null
2025-07-10 X-RAFT: Cross-Modal Non-Rigid Registration of Blue and White Light Neurosurgical Hyperspectral Images Charlie Budd et.al. 2507.07747 null
2025-07-09 mmFlux: Crowd Flow Analytics with Commodity mmWave MIMO Radar Anurag Pallaprolu et.al. 2507.07331 null
2025-07-08 Learning to Track Any Points from Human Motion Inès Hyeonsu Kim et.al. 2507.06233 null
2025-07-07 MoDiT: Learning Highly Consistent 3D Motion Coefficients with Diffusion Transformer for Talking Head Generation Yucheng Wang et.al. 2507.05092 null
2025-07-07 TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation Zonglin Lyu et.al. 2507.04984 null
2025-07-10 MCFormer: A Multi-Cost-Volume Network and Comprehensive Benchmark for Particle Image Velocimetry Zicheng Lin et.al. 2507.04750 null
2025-07-06 FB-Diff: Fourier Basis-guided Diffusion for Temporal Interpolation of 4D Medical Imaging Xin You et.al. 2507.04547 null
2025-07-03 Flow-CDNet: A Novel Network for Detecting Both Slow and Fast Changes in Bitemporal Images Haoxuan Li et.al. 2507.02307 null
2025-07-01 TRACE: Temporally Reliable Anatomically-Conditioned 3D CT Generation with Enhanced Efficiency Minye Shao et.al. 2507.00802 null
2025-07-01 DIJE: Dense Image Jacobian Estimation for Robust Robotic Self-Recognition and Visual Servoing Yasunori Toshimitsu et.al. 2507.00446 null
2025-06-30 C3VDv2 – Colonoscopy 3D video dataset with enhanced realism Mayank V. Golhar et.al. 2506.24074 null
2025-07-03 PriOr-Flow: Enhancing Primitive Panoramic Optical Flow with Orthogonal View Longliang Liu et.al. 2506.23897 null
2025-06-30 Proteus-ID: ID-Consistent and Motion-Coherent Video Customization Guiyu Zhang et.al. 2506.23729 null
2025-06-29 MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation Vladislav Bargatin et.al. 2506.23151 null
2025-06-26 WAFT: Warping-Alone Field Transforms for Optical Flow Yihan Wang et.al. 2506.21526 null
2025-06-26 EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting Taoyu Wu et.al. 2506.21420 null
2025-06-25 Feature Hallucination for Self-supervised Action Recognition Lei Wang et.al. 2506.20342 null
2025-06-24 Online camera-pose-free stereo endoscopic tissue deformation recovery with tissue-invariant vision-biomechanics consistency Jiahe Chen et.al. 2506.19388 null
2025-06-23 Flow-Aware Diffusion for Real-Time VR Restoration: Enhancing Spatiotemporal Coherence and Efficiency Yitong Zhu et.al. 2506.18786 null
2025-06-24 Multimodal Fusion SLAM with Fourier Attention Youjie Zhou et.al. 2506.18204 null
2025-06-19 EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised Training Liangjing Shao et.al. 2506.16017 link
2025-06-17 MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution Zhiwen Shao et.al. 2506.14511 link
2025-06-21 Inference-Time Gaze Refinement for Micro-Expression Recognition: Enhancing Event-Based Eye Tracking with Motion-Aware Post-Processing Nuwan Bandara et.al. 2506.12524 link
2025-06-13 MambaVSR: Content-Aware Scanning State Space Model for Video Super-Resolution Linfeng He et.al. 2506.11768 null
2025-06-12 Post-Training Quantization for Video Matting Tianrui Zhu et.al. 2506.10840 null
2025-06-10 UFM: A Simple Path towards Unified Dense Correspondence with Flow Yuchen Zhang et.al. 2506.09278 null
2025-06-10 Princeton365: A Diverse Dataset with Accurate Camera Pose Karhan Kayan et.al. 2506.09035 null
2025-06-09 Spatio-Temporal State Space Model For Efficient Event-Based Optical Flow Muhammad Ahmed Humais et.al. 2506.07878 link
2025-06-09 Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images Yingping Liang et.al. 2506.07740 null
2025-06-13 Consistent Video Editing as Flow-Driven Image-to-Video Generation Ge Wang et.al. 2506.07713 null
2025-06-08 AllTracker: Efficient Dense Point Tracking at High Resolution Adam W. Harley et.al. 2506.07310 null
2025-06-08 GoTrack: Generic 6DoF Object Pose Refinement and Tracking Van Nguyen Nguyen et.al. 2506.07155 null
2025-06-07 EV-LayerSegNet: Self-supervised Motion Segmentation using Event Cameras Youssef Farah et.al. 2506.06596 null
2025-06-06 3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model Hongyan Zhi et.al. 2506.06199 link
2025-06-06 Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments Mingrui Li et.al. 2506.05965 null
2025-06-05 DualX-VSR: Dual Axial Spatial $\times$ Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation Shuo Cao et.al. 2506.04830 null
2025-06-04 JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting Yang Xiao et.al. 2506.03872 null
2025-06-04 EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation Daikun Liu et.al. 2506.03512 null
2025-06-03 Learning Optical Flow Field via Neural Ordinary Differential Equation Leyla Mirvakhabova et.al. 2506.03290 null
2025-06-03 LinkTo-Anime: A 2D Animation Optical Flow Dataset from 3D Model Rendering Xiaoyi Feng et.al. 2506.02733 null
2025-06-03 LumosFlow: Motion-Guided Long Video Generation Jiahao Chen et.al. 2506.02497 null
2025-06-02 MS-RAFT-3D: A Multi-Scale Architecture for Recurrent Image-Based Scene Flow Jakob Schmid et.al. 2506.01443 null
2025-06-01 MOOSE: Pay Attention to Temporal Dynamics for Video Understanding via Optical Flows Hong Nguyen et.al. 2506.01119 null
2025-05-31 Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline Zhaoying Wang et.al. 2506.00546 null
2025-05-31 Improving Optical Flow and Stereo Depth Estimation by Leveraging Uncertainty-Based Learning Difficulties Jisoo Jeong et.al. 2506.00324 null
2025-05-30 Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction Chenyou Fan et.al. 2505.24156 null
2025-05-29 Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing Tongtong Su et.al. 2505.23134 link
2025-05-27 Object Concepts Emerge from Motion Haoqian Liang et.al. 2505.21635 null
2025-05-26 A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking Zixiang Zhao et.al. 2505.19858 null
2025-05-23 Brightness-Invariant Tracking Estimation in Tagged MRI Zhangxing Bian et.al. 2505.18365 null
2025-05-31 CTRL-GS: Cascaded Temporal Residue Learning for 4D Gaussian Splatting Karly Hou et.al. 2505.18306 null
2025-05-23 Real-time Traffic Accident Anticipation with Feature Reuse Inpyo Song et.al. 2505.17449 null
2025-05-22 Efficient Correlation Volume Sampling for Ultra-High-Resolution Optical Flow Estimation Karlis Martins Briedis et.al. 2505.16942 null
2025-05-22 V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation Hanyue Lou et.al. 2505.16797 link
2025-05-21 SENSE – Sensor-Enhanced Neural Shear Stress Estimation for Quantitative Oilfilm Visualizations Lennart Rohlfs et.al. 2505.15697 null
2025-05-19 RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers Ahmet Berke Gokmen et.al. 2505.13344 null
2025-05-19 eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks Jad Mansour et.al. 2505.13309 null
2025-05-19 FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching Alp Eren Sari et.al. 2505.13174 null
2025-05-19 Just Dance with $π$! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection Snehashis Majhi et.al. 2505.13123 null
2025-05-17 MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos Hongyi Zhou et.al. 2505.11868 null
2025-05-16 Planar Velocity Estimation for Fast-Moving Mobile Robots Using Event-Based Optical Flow Liam Boyle et.al. 2505.11116 null
2025-05-15 TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation Manthan Patel et.al. 2505.10696 null
2025-05-15 A label-free sub-diffractive technique for 3D intracellular tomography using thermally induced convection currents Jayesh Goswami et.al. 2505.10112 null
2025-05-14 FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling Yue Wen et.al. 2505.09406 null
2025-05-14 RobustSpring: Benchmarking Robustness to Image Corruptions for Optical Flow, Scene Flow and Stereo Jenny Schmalfuss et.al. 2505.09368 null
2025-05-13 Reinforcement Learning meets Masked Video Modeling: Trajectory-Guided Adaptive Token Selection Ayush K. Rai et.al. 2505.08561 null
2025-05-13 TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection Wenkui Yang et.al. 2505.08437 link
2025-05-13 EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation Hanle Zheng et.al. 2505.08235 null
2025-05-13 Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images Ziteng Liu et.al. 2505.08178 null
2025-05-12 Asynchronous Multi-Object Tracking with an Event Camera Angus Apps et.al. 2505.08126 link
2025-05-11 MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception Zhengye Zhang et.al. 2505.07007 link
2025-05-13 Detection of Moving Objects Using Self-motion Constraints on Optic Flow Hope Lutwak et.al. 2505.06686 null
2025-05-08 Nonlinear Motion-Guided and Spatio-Temporal Aware Network for Unsupervised Event-Based Optical Flow Zuntao Liu et.al. 2505.05089 null
2025-05-08 A Simple Detector with Frame Dynamics is a Strong Tracker Chenxu Peng et.al. 2505.04917 link
2025-05-06 Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment João Alves et.al. 2505.03554 link
2025-05-06 TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion Haoyue Liu et.al. 2505.03116 null
2025-05-04 Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance Yingkai Zhang et.al. 2505.02109 null
2025-05-02 Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation Zhen Yao et.al. 2505.01548 link
2025-04-30 AnimalMotionCLIP: Embedding motion in CLIP for Animal Behavior Analysis Enmin Zhong et.al. 2505.00569 null
2025-04-29 LPVIMO-SAM: Tightly-coupled LiDAR/Polarization Vision/Inertial/Magnetometer/Optical Flow Odometry via Smoothing and Mapping Derui Shan et.al. 2504.20380 null
2025-04-25 RapidPIV: Full Flow-Field kHz PIV for Real-Time Display and Control Scott A. Bollt et.al. 2504.17987 null
2025-04-22 Motion-Enhanced Nonlocal Similarity Implicit Neural Representation for Infrared Dim and Small Target Detection Pei Liu et.al. 2504.15665 null
2025-04-22 DiTPainter: Efficient Video Inpainting with Diffusion Transformers Xian Wu et.al. 2504.15661 null
2025-04-21 PIV-FlowDiffuser: Transfer-learning-based denoising diffusion models for PIV Qianyu Zhu et.al. 2504.14952 link
2025-04-21 Multimodal Non-Semantic Feature Fusion for Predicting Segment Access Frequency in Lecture Archives Ruozhu Sheng et.al. 2504.14927 null
2025-04-20 FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models Kuanting Wu et.al. 2504.14535 null
2025-04-18 Neural Ganglion Sensors: Learning Task-specific Event Cameras Inspired by the Neural Circuit of the Human Retina Haley M. So et.al. 2504.13457 null
2025-04-18 MicroFlow: Domain-Specific Optical Flow for Ground Deformation Estimation in Seismic Events Juliette Bertrand et.al. 2504.13452 null
2025-04-18 Event-Enhanced Blurry Video Super-Resolution Dachun Kai et.al. 2504.13042 link
2025-04-17 SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration Xi Tong et.al. 2504.12869 null
2025-04-17 SAM-Based Building Change Detection with Distribution-Aware Fourier Adaptation and Edge-Constrained Warping Yun-Cheng Li et.al. 2504.12619 null
2025-04-14 Perturbed State Space Feature Encoders for Optical Flow with Event Cameras Gokul Raju Govinda Raju et.al. 2504.10669 null
2025-04-15 WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs Nguyen Ngoc Dat et.al. 2504.10165 null
2025-04-11 Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review Claudio Cimarelli et.al. 2504.08588 null
2025-04-10 Extending Visual Dynamics for Video-to-Music Generation Xiaohao Liu et.al. 2504.07594 null
2025-04-08 Intrinsic Saliency Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation Xiangyu Zheng et.al. 2504.05904 null
2025-04-07 Towards Efficient Real-Time Video Motion Transfer via Generative Time Series Modeling Tasmiah Haque et.al. 2504.05537 null
2025-04-06 FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency Shiyan Liu et.al. 2504.04427 null
2025-04-05 Simultaneous Motion And Noise Estimation with Event Cameras Shintaro Shiba et.al. 2504.04029 null
2025-04-04 3D Scene Understanding Through Local Random Access Sequence Modeling Wanhee Lee et.al. 2504.03875 null
2025-04-03 L-LBVC: Long-Term Motion Estimation and Prediction for Learned Bi-Directional Video Compression Yongqi Zhai et.al. 2504.02560 null
2025-04-01 Beyond Wide-Angle Images: Unsupervised Video Portrait Correction via Spatiotemporal Diffusion Adaptation Wenbo Nie et.al. 2504.00401 null
2025-04-01 Hierarchical Flow Diffusion for Efficient Frame Interpolation Yang Hai et.al. 2504.00380 null
2025-03-31 Easi3R: Estimating Disentangled Motion from DUSt3R Without Training Xingyu Chen et.al. 2503.24391 link
2025-04-03 Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey Haoyang Wang et.al. 2503.22943 null
2025-03-28 Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision Rulin Zhou et.al. 2503.22394 null
2025-03-28 Segment Any Motion in Videos Nan Huang et.al. 2503.22268 null
2025-03-28 Synergistic Bleeding Region and Point Detection in Surgical Videos Jialun Pei et.al. 2503.22174 null
2025-03-27 VADMamba: Exploring State Space Models for Fast Video Anomaly Detection Jiahao Lyu et.al. 2503.21169 link
2025-03-27 Can Video Diffusion Model Reconstruct 4D Geometry? Jinjie Mai et.al. 2503.21082 null
2025-03-25 Burst Image Super-Resolution with Mamba Ozan Unal et.al. 2503.19634 null
2025-03-24 NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting Yulong Zheng et.al. 2503.18794 null
2025-03-27 MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion Yikun Ma et.al. 2503.17695 null
2025-03-21 Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks Bhishma Dedhia et.al. 2503.17539 null
2025-03-21 Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras Shuang Guo et.al. 2503.17262 link
2025-03-20 4D Gaussian Splatting SLAM Yanyan Li et.al. 2503.16710 null
2025-03-20 EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation Zihao Zhang et.al. 2503.15831 null
2025-03-19 DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework Henrique Morimitsu et.al. 2503.14880 link
2025-03-19 Temporal-Consistent Video Restoration with Pre-trained Diffusion Models Hengkang Wang et.al. 2503.14863 null
2025-03-18 GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics Tingyang Xiao et.al. 2503.14247 link
2025-03-17 UCF-Crime-DVS: A Novel Event-Based Dataset for Video Anomaly Detection with Spiking Neural Networks Yuanbin Qian et.al. 2503.12905 link
2025-03-16 ProbDiffFlow: An Efficient Learning-Free Framework for Probabilistic Single-Image Optical Flow Estimation Mo Zhou et.al. 2503.12348 null
2025-03-17 EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation Zengyu Wan et.al. 2503.11371 null
2025-03-14 FG-DFPN: Flow Guided Deformable Frame Prediction Network M. Akın Yılmaz et.al. 2503.11343 link
2025-03-14 Zero-TIG: Temporal Consistency-Aware Zero-Shot Illumination-Guided Low-light Video Enhancement Yini Li et.al. 2503.11175 link
2025-03-14 A High-Accuracy Alignment Approach for Solar Images of Different Wavelengths Yun Wang et.al. 2503.11035 null
2025-03-13 Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations Xunzhi Zheng et.al. 2503.10464 null
2025-03-13 Markerless Tracking-Based Registration for Medical Image Motion Correction Luisa Neubig et.al. 2503.10260 null
2025-03-13 ST-FlowNet: An Efficient Spiking Neural Network for Event-Based Optical Flow Estimation Hongze Sun et.al. 2503.10195 null
2025-03-12 Investigation of Frame Differences as Motion Cues for Video Object Segmentation Sota Kawamura et.al. 2503.09132 null
2025-03-11 Feature Alignment with Equivariant Convolutions for Burst Image Super-Resolution Xinyi Liu et.al. 2503.08300 null
2025-03-10 MambaFlow: A Mamba-Centric Architecture for End-to-End Optical Flow Estimation Juntian Du et.al. 2503.07046 null
2025-03-11 Bridge Frame and Event: Common Spatiotemporal Fusion for High-Dynamic Scene Optical Flow Hanyu Zhou et.al. 2503.06992 null
2025-03-09 Online Dense Point Tracking with Streaming Memory Qiaole Dong et.al. 2503.06471 link
2025-03-10 VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control Yuxuan Bian et.al. 2503.05639 link
2025-03-07 Stereo Any Video: Temporally Consistent Stereo Matching Junpeng Jing et.al. 2503.05549 null
2025-03-06 Implicit Neural Representation for Video and Image Super-Resolution Mary Aiyetigbo et.al. 2503.04665 null
2025-03-09 ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem Yu-Hsi Chen et.al. 2503.04500 link
2025-03-05 Video Super-Resolution: All You Need is a Video Diffusion Model Zhihao Zhan et.al. 2503.03355 null
2025-03-05 BAT: Learning Event-based Optical Flow with Bidirectional Adaptive Temporal Correlation Gangwei Xu et.al. 2503.03256 null
2025-03-05 Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria Asma A. Almutairi et.al. 2503.03100 null
2025-03-04 Anomaly detection in non-stationary videos using time-recursive differencing network based prediction Gargi V. Pillai et.al. 2503.02234 null
2025-03-03 MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features Chao Ye et.al. 2503.01571 link
2025-03-03 AI-Driven Relocation Tracking in Dynamic Kitchen Environments Arash Nasr Esfahani et.al. 2503.01547 link
2025-03-02 Vid2Fluid: 3D Dynamic Fluid Assets from Single-View Videos with Generative Gaussian Splatting Zhiwei Zhao et.al. 2503.00868 null
2025-02-28 EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration Kuangyi Chen et.al. 2503.00167 link
2025-02-21 Peripheral Teleportation: A Rest Frame Design to Mitigate Cybersickness During Virtual Locomotion Tongyu Nie et.al. 2502.15227 null
2025-02-20 Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance Meng Wang et.al. 2502.14520 null
2025-02-18 L4P: Low-Level 4D Vision Perception Unified Abhishek Badki et.al. 2502.13078 null
2025-02-18 Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection Zijian Cao et.al. 2502.12735 null
2025-02-17 Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance Jixiang Chen et.al. 2502.11971 null
2025-02-17 Stonefish: Supporting Machine Learning Research in Marine Robotics Michele Grimaldi et.al. 2502.11887 link
2025-02-15 Super Resolution image reconstructs via total variation-based image deconvolution: a majorization-minimization approach Mouhamad Chehaitly et.al. 2502.10876 null
2025-02-15 Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video Runyang Feng et.al. 2502.10616 null
2025-02-11 A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision Hao Ai et.al. 2502.10444 null
2025-02-12 FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis Wonjoon Jin et.al. 2502.08244 null
2025-02-11 Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors Lin-Zhuo Chen et.al. 2502.07615 null
2025-02-18 A Physical Coherence Benchmark for Evaluating Video Generation Models via Optical Flow-guided Frame Prediction Yongfan Chen et.al. 2502.05503 link
2025-02-05 MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent Xinyao Liao et.al. 2502.03207 null
2025-02-03 XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications Shangjin Zhai et.al. 2502.01297 null
2025-01-28 Image Velocimetry using Direct Displacement Field estimation with Neural Networks for Fluids Efraín Magaña et.al. 2501.18641 link
2025-02-02 REMOTE: Real-time Ego-motion Tracking for Various Endoscopes via Multimodal Visual Feature Learning Liangjing Shao et.al. 2501.18124 null
2025-01-28 Improved Encoding for Overfitted Video Codecs Thomas Leguay et.al. 2501.16976 null
2025-01-28 Assessing ultrasonic and optical flow velocimetry in a millifluidic device using oil-in-water emulsions as blood mimicking fluid Estelle Lu et.al. 2501.16959 null
2025-01-28 Extending Information Bottleneck Attribution to Video Sequences Veronika Solopova et.al. 2501.16889 link
2025-02-04 Event-Based Adaptive Koopman Framework for Optic Flow-Guided Landing on Moving Platforms Bazeela Banday et.al. 2501.16868 null
2025-01-23 GC-ConsFlow: Leveraging Optical Flow Residuals and Global Context for Robust Deepfake Detection Jiaxin Chen et.al. 2501.13435 null
2025-01-22 MONA: Moving Object Detection from Videos Shot by Dynamic Camera Boxun Hu et.al. 2501.13183 null
2025-01-22 Machine Learning Modeling for Multi-order Human Visual Motion Processing Zitang Sun et.al. 2501.12810 link
2025-01-21 Efficient Dynamic Image Reconstruction with motion estimation Toluwani Okunola et.al. 2501.12497 null
2025-01-21 Learning segmentation from point trajectories Laurynas Karazija et.al. 2501.12392 link
2025-01-22 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Sili Chen et.al. 2501.12375 null
2025-01-21 VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models Chaohao Xie et.al. 2501.12267 null
2025-01-20 Event-based vision for egomotion estimation using precise event timing Hugh Greatorex et.al. 2501.11554 null
2025-01-19 BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution Eunjin Kim et.al. 2501.11043 link
2025-01-25 Quadcopter Position Hold Function using Optical Flow in a Smartphone-based Flight Computer Noel P. Caliston et.al. 2501.10752 null
2025-01-18 Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection Yifang Xu et.al. 2501.10692 null
2025-01-17 DiffuEraser: A Diffusion Model for Video Inpainting Xiaowen Li et.al. 2501.10018 link
2025-01-16 VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization Zixun Fang et.al. 2501.09499 null
2025-01-16 Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise Ryan Burgert et.al. 2501.08331 link
2025-01-13 Aligning First, Then Fusing: A Novel Weakly Supervised Multimodal Violence Detection Method Wenping Jin et.al. 2501.07496 link
2025-01-08 Edit as You See: Image-guided Video Editing via Masked Motion Modeling Zhi-Lin Huang et.al. 2501.04325 null
2025-01-06 TinySense: A Lighter Weight and More Power-efficient Avionics System for Flying Insect-scale Robots Zhitao Yu et.al. 2501.03416 null
2025-01-06 ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking Tingyang Zhang et.al. 2501.03220 null
2025-01-05 AHMSA-Net: Adaptive Hierarchical Multi-Scale Attention Network for Micro-Expression Recognition Lijun Zhang et.al. 2501.02539 null
2025-01-01 Spatially-guided Temporal Aggregation for Robust Event-RGB Optical Flow Estimation Qianang Zhou et.al. 2501.00838 null
2025-01-05 How Honeybees Perceive and Traverse Apertures Timothy Jakobi et.al. 2501.00646 null
2024-12-29 Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition Xiu-Feng Huang et.al. 2412.20327 link
2024-12-28 Enhancing Marine Debris Acoustic Monitoring by Optical Flow-Based Motion Vector Analysis Xiaoteng Zhou et.al. 2412.20085 null
2024-12-27 Zero-shot Hazard Identification in Autonomous Driving: A Case Study on the COOOL Benchmark Lukas Picek et.al. 2412.19944 null
2024-12-27 Generalized Uncertainty-Based Evidential Fusion with Hybrid Multi-Head Attention for Weak-Supervised Temporal Action Localization Yuanpeng He et.al. 2412.19418 link
2025-01-03 Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry Zhaoxing Zhang et.al. 2412.16923 link
2024-12-20 SOUS VIDE: Cooking Visual Drone Navigation Policies in a Gaussian Splatting Vacuum JunEn Low et.al. 2412.16346 null
2024-12-20 MotiF: Making Text Count in Image Animation with Motion Focal Loss Shijie Wang et.al. 2412.16153 null
2024-12-18 Dynamic semantic VSLAM with known and unknown objects Sanghyoup Gu et.al. 2412.14359 null
2024-12-18 SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation Tong Chen et.al. 2412.14018 null
2024-12-17 CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices Andrei Znobishchev et.al. 2412.13273 null
2024-12-17 Complex extension of optical flow and its practical evaluation for undersampled dynamic MRI Matthias J. Ehrhardt et.al. 2412.12711 null
2024-12-17 GG-SSMs: Graph-Generating State Space Models Nikola Zubić et.al. 2412.12423 null
2024-12-16 Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising Zikang Chen et.al. 2412.11820 link
2024-12-16 Exploring More from Multiple Gait Modalities for Human Identification Dongyang Jin et.al. 2412.11495 link
2024-12-16 BiM-VFI: directional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions Wonyong Seo et.al. 2412.11365 null
2024-12-15 Learning Normal Flow Directly From Event Neighborhoods Dehao Yuan et.al. 2412.11284 link
2024-12-13 BatDeck – Ultra Low-power Ultrasonic Ego-velocity Estimation and Obstacle Avoidance on Nano-drones Hanna Müller et.al. 2412.10048 null
2024-12-12 A Plug-and-Play Algorithm for 3D Video Super-Resolution of Single-Photon LiDAR data Alice Ruget et.al. 2412.09427 null
2024-12-12 eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction Jad Mansour et.al. 2412.09209 link
2024-12-12 ResFlow: Fine-tuning Residual Optical Flow for Event-based High Temporal Resolution Motion Estimation Qianang Zhou et.al. 2412.09105 null
2024-12-12 Mojito: Motion Trajectory and Intensity Control for Video Generation Xuehai He et.al. 2412.08948 null
2024-12-12 Labits: Layered Bidirectional Time Surfaces Representation for Event Camera-based Continuous Dense Trajectory Estimation Zhongyang Zhang et.al. 2412.08849 null
2024-12-11 Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation Zhigang Cen et.al. 2412.08034 null
2024-12-10 EvRepSL: Event-Stream Representation via Self-Supervised Learning for Event-Based Vision Qiang Qu et.al. 2412.07080 link
2024-12-09 Local Attention Transformers for High-Detail Optical Flow Upsampling Alexander Gielisse et.al. 2412.06439 null
2024-12-08 MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation Shuwei Shi et.al. 2412.05848 null
2024-12-05 Deep Learning and Hybrid Approaches for Dynamic Scene Analysis, Object Detection and Motion Tracking Shahran Rahman Alve et.al. 2412.05331 null
2024-12-04 Advancing Auto-Regressive Continuation for Video Frames Ruibo Ming et.al. 2412.03758 null
2024-12-03 Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback Hiroki Furuta et.al. 2412.02617 null
2024-12-02 STATIC: Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation Sunghun Yang et.al. 2412.01090 null
2024-12-01 Advanced Video Inpainting Using Optical Flow-Guided Efficient Diffusion Bohai Gu et.al. 2412.00857 null
2024-11-30 A conditional Generative Adversarial network model for the Weather4Cast 2024 Challenge Atharva Deshpande et.al. 2412.00451 null
2024-11-30 Hybrid Local-Global Context Learning for Neural Video Compression Yongqi Zhai et.al. 2412.00446 null
2024-11-27 RoMo: Robust Motion Segmentation Improves Structure from Motion Lily Goli et.al. 2411.18650 null
2024-11-27 ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching Yangrui Dong et.al. 2411.18174 null
2024-11-27 An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition Song-Jiang Lai et.al. 2411.18002 null
2024-11-26 Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors Zhengfei Kuang et.al. 2411.17249 null
2024-11-25 Context-Aware Input Orchestration for Video Inpainting Hoyoung Kim et.al. 2411.16926 null
2024-11-22 TSkips: Efficiency Through Explicit Temporal Delay Connections in Spiking Neural Networks Prajna G. Malettira et.al. 2411.16711 null
2024-11-24 PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments Haoang Li et.al. 2411.15800 null
2024-11-23 Optical-Flow Guided Prompt Optimization for Coherent Video Generation Hyelin Nam et.al. 2411.15540 null
2024-11-22 Benchmarking the Robustness of Optical Flow Estimation to Corruptions Zhonghua Yi et.al. 2411.14865 link
2024-11-21 EdgeFlowNet: 100FPS@1W Dense Optical Flow For Tiny Mobile Robots Sai Ramana Kiran Pinnama Raju et.al. 2411.14576 null
2024-11-21 Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation Zhuoman Liu et.al. 2411.14423 null
2024-11-21 Transforming Static Images Using Generative Models for Video Salient Object Detection Suhwan Cho et.al. 2411.13975 link
2024-11-20 Sparse Input View Synthesis: 3D Representations and Reliable Priors Nagabhushan Somraj et.al. 2411.13631 null
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291 null
2024-11-20 Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark Bing Cao et.al. 2411.13056 null
2024-11-16 AnimateAnything: Consistent and Controllable Animation for Video Generation Guojun Lei et.al. 2411.10836 null
2024-11-15 OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models Mathis Koroglu et.al. 2411.10501 null
2024-11-14 Adversarial Attacks Using Differentiable Rendering: A Survey Matthew Hull et.al. 2411.09749 null
2024-11-14 MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation Jonas Serych et.al. 2411.09551 link
2024-11-12 DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection Shawn Li et.al. 2411.08227 link
2024-11-17 Scaling Properties of Diffusion Models for Perceptual Tasks Rahul Ravishankar et.al. 2411.08034 null
2024-11-11 Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters Corwin Grant Jeon MacMillan et.al. 2411.05225 null
2024-11-07 Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera Yu Hu et.al. 2411.04413 null
2024-11-07 AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation Mingyu Sheng et.al. 2411.03695 link
2024-11-04 Neural optical flow for planar and stereo PIV Andrew I. Masker et.al. 2411.02373 null
2024-11-03 Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation Zhenbin Wang et.al. 2411.01647 null
2024-11-03 Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli Matthias Tangemann et.al. 2411.01505 link
2024-11-02 Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks Aarjav Kavathia et.al. 2411.01348 null
2024-10-29 Motion Graph Unleashed: A Novel Approach to Video Prediction Yiqi Zhong et.al. 2410.22288 link
2024-10-29 FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives Qizhi Chen et.al. 2410.22070 null
2024-10-29 Investigation of moving objects through atmospheric turbulence from a non-stationary platform Nicholas Ferrante et.al. 2410.21639 null
2024-10-27 CloudCast – Total Cloud Cover Nowcasting with Machine Learning Mikko Partio et.al. 2410.21329 link
2024-10-28 Enhancing Action Recognition by Leveraging the Hierarchical Structure of Actions and Textual Context Manuel Benavent-Lledo et.al. 2410.21275 link
2024-10-27 BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events Yijin Li et.al. 2410.20451 null
2024-10-26 UniVST: A Unified Framework for Training-free Localized Video Style Transfer Quanjian Song et.al. 2410.20084 link
2024-10-23 Separating edges from microstructure in X-ray dark-field imaging: Evolving and devolving perspectives via the X-ray Fokker-Planck equation Samantha J. Alloo et.al. 2410.18317 null
2024-10-16 Imagine2Servo: Intelligent Visual Servoing with Diffusion-Driven Goal Generation for Robotic Tasks Pranjali Pathre et.al. 2410.12432 link
2024-10-14 Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world Han Ling et.al. 2410.10453 link
2024-10-12 A Collaborative Team of UAV-Hexapod for an Autonomous Retrieval System in GNSS-Denied Maritime Environments Seungwook Lee et.al. 2410.09606 null
2024-10-12 Robust Optical Flow Computation: A Higher-Order Differential Approach Chanuka Algama et.al. 2410.09563 null
2024-10-10 MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting Ruijie Zhu et.al. 2410.07707 link
2024-10-09 Z-upscaling: Optical Flow Guided Frame Interpolation for Isotropic Reconstruction of 3D EM Volumes Fisseha A. Ferede et.al. 2410.07043 link
2024-10-08 Future frame prediction in chest cine MR imaging using the PCA respiratory motion model and dynamically trained recurrent neural networks Michel Pohl et.al. 2410.05882 null
2024-10-01 Descriptor: Face Detection Dataset for Programmable Threshold-Based Sparse-Vision Riadul Islam et.al. 2410.00368 link
2024-10-08 DressRecon: Freeform 4D Human Reconstruction from Monocular Video Jeff Tan et.al. 2409.20563 null
2024-10-06 Visual collective behaviors on spherical robots Diego Castro et.al. 2409.20539 null
2024-09-26 Subjective and Objective Quality-of-Experience Evaluation Study for Live Video Streaming Zehao Zhu et.al. 2409.17596 null
2024-09-26 TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene Sandika Biswas et.al. 2409.17459 link
2024-09-25 EventHDR: from Event to High-Speed HDR Videos and Beyond Yunhao Zou et.al. 2409.17029 null
2024-09-25 Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation Hanyu Zhou et.al. 2409.17001 null
2024-09-25 Pose-Guided Fine-Grained Sign Language Video Generation Tongkai Shi et.al. 2409.16709 null
2024-09-21 BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow EungGu Kang et.al. 2409.15384 link
2024-09-23 Skills Made to Order: Efficient Acquisition of Robot Cooking Skills Guided by Multiple Forms of Internet Data Mrinal Verghese et.al. 2409.15172 null
2024-09-22 Secrets of Edge-Informed Contrast Maximization for Event-Based Vision Pritam P. Karmokar et.al. 2409.14611 null
2024-09-18 Optical Flow Matters: an Empirical Comparative Study on Fusing Monocular Extracted Modalities for Better Steering Fouad Makiyeh et.al. 2409.12716 null
2024-09-16 ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video Han Ling et.al. 2409.12202 link
2024-09-16 Continual Learning of Conjugated Visual Representations through Higher-order Motion Flows Simone Marullo et.al. 2409.11441 null
2024-09-17 Training Datasets Generation for Machine Learning: Application to Vision Based Navigation Jérémy Lebreton et.al. 2409.11383 null
2024-09-17 Multimodal Attention-Enhanced Feature Fusion-based Weekly Supervised Anomaly Violence Detection Yuta Kaneko et.al. 2409.11223 null
2024-09-16 SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning Amogh Joshi et.al. 2409.09990 null
2024-09-15 Dynamic Layer Detection of a Thin Silk Cloth using DenseTact Optical Tactile Sensors Ankush Kundan Dhawan et.al. 2409.09849 null
2024-09-15 Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings Oriel Perl et.al. 2409.09841 null
2024-09-13 InstantDrag: Improving Interactivity in Drag-based Image Editing Joonghyuk Shin et.al. 2409.08857 null
2024-09-11 Violence detection in videos using deep recurrent and convolutional neural networks Abdarahmane Traoré et.al. 2409.07581 null
2024-09-11 Distance Measurement for UAVs in Deep Hazardous Tunnels Vishal Choudhary et.al. 2409.07160 null
2024-09-09 LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow Hongyu Wen et.al. 2409.05688 null
2024-09-11 Real-Time Human Action Recognition on Embedded Platforms Ruiqi Wang et.al. 2409.05662 null
2024-09-15 HMAFlow: Learning More Accurate Optical Flow via Hierarchical Motion Field Alignment Dianbo Ma et.al. 2409.05531 link
2024-09-09 FacialFlowNet: Advancing Facial Optical Flow Estimation with a Diverse Dataset and a Decomposed Model Jianzhi Lu et.al. 2409.05396 link
2024-09-06 Hybrid Cost Volume for Memory-Efficient Optical Flow Yang Zhao et.al. 2409.04243 link
2024-09-06 SDformerFlow: Spatiotemporal swin spikeformer for event-based optical flow estimation Yi Tian et.al. 2409.04082 link
2024-09-03 DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos Wenbo Hu et.al. 2409.02095 link
2024-08-29 FlowRetrieval: Flow-Guided Data Retrieval for Few-Shot Imitation Learning Li-Heng Lin et.al. 2408.16944 null
2024-08-29 Estimating Dynamic Flow Features in Groups of Tracked Objects Tanner D. Harms et.al. 2408.16190 null
2024-08-28 MMASD+: A Novel Dataset for Privacy-Preserving Behavior Analysis of Children with Autism Spectrum Disorder Pavan Uttej Ravva et.al. 2408.15077 link
2024-08-21 Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars Zhihao Lin et.al. 2408.11582 null
2024-08-21 SelfDRSC++: Self-Supervised Learning for Dual Reversed Rolling Shutter Correction Wei Shang et.al. 2408.11411 link
2024-09-02 Video Diffusion Models are Strong Video Inpainter Minhyeok Lee et.al. 2408.11402 null
2024-08-20 PooDLe: Pooled and dense self-supervised learning from naturalistic videos Alex N. Wang et.al. 2408.11208 null
2024-08-21 NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices Zhiyong Zhang et.al. 2408.10161 link
2024-08-19 Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data Tao Yang et.al. 2408.10119 null
2024-08-18 Contactless seismocardiography via Gunnar-Farneback optical flow Mohammad Muntasir Rahman et.al. 2408.09512 null
2024-08-18 OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare Chen Long-fei et.al. 2408.09409 null
2024-08-16 CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving Shihan Peng et.al. 2408.08500 null
2024-08-15 MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing Chenjie Cao et.al. 2408.08000 null
2024-08-12 FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework Lukas Meyer et.al. 2408.06190 link
2024-08-12 Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network Kailai Sun et.al. 2408.05877 null
2024-08-11 Egocentric Vision Language Planning Zhirui Fang et.al. 2408.05802 null
2024-08-08 KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance Jingxian Lu et.al. 2408.02912 null
2024-08-02 NOLO: Navigate Only Look Once Bohan Zhou et.al. 2408.01384 null
2024-07-31 RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining Hongtao Wu et.al. 2407.21773 link
2024-07-31 Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching Pengjie Zhang et.al. 2407.21735 null
2024-07-30 SpotFormer: Multi-Scale Spatio-Temporal Transformer for Facial Expression Spotting Yicheng Deng et.al. 2407.20799 null
2024-07-29 Event-based Optical Flow on Neuromorphic Processor: ANN vs. SNN Comparison based on Activation Sparsification Yingfu Xu et.al. 2407.20421 link
2024-07-26 Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations Zipeng Wang et.al. 2407.18500 null
2024-07-23 Occlusion-Aware 3D Motion Interpretation for Abnormal Behavior Detection Su Li et.al. 2407.16788 null
2024-07-23 SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging Lingtong Kong et.al. 2407.16308 link
2024-07-18 Many Perception Tasks are Highly Redundant Functions of their Input Data Rahul Ramesh et.al. 2407.13841 null
2024-07-18 Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain Bach Nguyen Gia et.al. 2407.13159 link
2024-07-17 Fusion Flow-enhanced Graph Pooling Residual Networks for Unmanned Aerial Vehicles Surveillance in Day and Night Dual Visions Alam Noor et.al. 2407.12647 null
2024-07-16 Improving Unsupervised Video Object Segmentation via Fake Flow Generation Suhwan Cho et.al. 2407.11714 link
2024-07-16 ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment Xinyi Wang et.al. 2407.11496 link
2024-07-16 Hybrid physics-AI outperforms numerical weather prediction for extreme precipitation nowcasting Puja Das et.al. 2407.11317 null
2024-07-15 Temporal Event Stereo via Joint Learning with Stereoscopic Flow Hoonhee Cho et.al. 2407.10831 link
2024-07-15 Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation Friedhelm Hamann et.al. 2407.10802 link
2024-07-14 Research Experience of an Undergraduate Student in Computer Vision and Robotics Ayush V. Gowda et.al. 2407.10044 null
2024-07-13 ScaleRAFT: Cross-Scale Recurrent All-Pairs Field Transforms for 3D Motion Estimation Han Ling et.al. 2407.09797 link
2024-07-11 Generalizable Implicit Motion Modeling for Video Frame Interpolation Zujin Guo et.al. 2407.08680 null
2024-07-11 Event-based vision on FPGAs – a survey Tomasz Kryjak et.al. 2407.08356 null
2024-07-10 Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction Yili Liu et.al. 2407.07587 null
2024-07-05 Unsupervised 4D Cardiac Motion Tracking with Spatiotemporal Optical Flow Networks Long Teng et.al. 2407.04663 null
2024-07-04 CardioSpectrum: Comprehensive Myocardium Motion Analysis with 3D Deep Learning and Geometric Insights Shahar Zuler et.al. 2407.03794 link
2024-07-03 Towards High Resolution Real-Time Optical Flow Particle Image Velocimetry Juan Pimienta et.al. 2407.03057 null
2024-07-03 Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction Jiaxin Guo et.al. 2407.02918 link
2024-07-01 DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models Chang-Han Yeh et.al. 2407.01519 link
2024-07-01 RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields Haochen Jiang et.al. 2407.01303 link
2024-06-27 What Matters in Detecting AI-Generated Videos like Sora? Chirui Chang et.al. 2406.19568 null
2024-06-27 A Universal Railway Obstacle Detection System based on Semi-supervised Segmentation And Optical Flow Qiushi Guo et.al. 2406.18908 null
2024-06-27 Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach Yuxiang Huang et.al. 2406.18837 null
2024-06-25 Disentangled Motion Modeling for Video Frame Interpolation Jaihyun Lew et.al. 2406.17256 link
2024-06-26 Splatter a Video: Video Gaussian Representation for Versatile Processing Yang-Tian Sun et.al. 2406.13870 null
2024-06-19 Low Latency Visual Inertial Odometry with On-Sensor Accelerated Optical Flow for Resource-Constrained UAVs Jonas Kühne et.al. 2406.13345 null
2024-06-17 MEDeA: Multi-view Efficient Depth Adjustment Mikhail Artemyev et.al. 2406.12048 null
2024-06-13 Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion Linzhan Mou et.al. 2406.09402 null
2024-06-11 PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow Joshua Tokarsky et.al. 2406.07667 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551 link
2024-06-07 DVOS: Self-Supervised Dense-Pattern Video Object Segmentation Keyhan Najafian et.al. 2406.05131 null
2024-06-07 Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior Tanvir Mahmud et.al. 2406.04873 link
2024-06-07 Interplay between preconditioning and regularization for linear ill-posed problems solved by conjugate gradient. Application to optical flow estimation Ahmed Chabib et.al. 2406.04695 null
2024-06-04 Neural Representations of Dynamic Visual Stimuli Jacob Yeung et.al. 2406.02659 null
2024-06-03 DeNVeR: Deformable Neural Vessel Representations for Unsupervised Video Vessel Segmentation Chun-Hung Wu et.al. 2406.01591 null
2024-06-03 Prototypical Transformer as Unified Motion Learners Cheng Han et.al. 2406.01559 null
2024-06-03 Enhancing Dynamic CT Image Reconstruction with Neural Fields Through Explicit Motion Regularizers Pablo Arratia et.al. 2406.01299 null
2024-06-03 Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting Fang Li et.al. 2406.01042 link
2024-06-03 Synthetic Data Generation for 3D Myocardium Deformation Analysis Shahar Zuler et.al. 2406.01040 link
2024-05-30 EMAG: Ego-motion Aware and Generalizable 2D Hand Forecasting from Egocentric Videos Masashi Hatano et.al. 2405.20030 null
2024-05-30 May the Dance be with You: Dance Generation Framework for Non-Humanoids Hyemin Ahn et.al. 2405.19743 null
2024-05-28 GFlow: Recovering 4D World from Monocular Video Shizun Wang et.al. 2405.18426 null
2024-05-28 Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition Muhammad Adi Nugroho et.al. 2405.18012 null
2024-05-27 DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation Mengtan Zhang et.al. 2405.16960 link
2024-05-27 SCSim: A Realistic Spike Cameras Simulator Liwen Hu et.al. 2405.16790 link
2024-05-26 Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition Tong Shi et.al. 2405.16701 null
2024-05-26 Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception Shuangpeng Han et.al. 2405.16493 link
2024-05-24 Time-Harmonic Optical Flow with Applications in Elastography Oleh Melnyk et.al. 2405.15507 link
2024-05-24 Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features Lichuan Ji et.al. 2405.15343 null
2024-05-24 Unsupervised Motion Segmentation for Neuromorphic Aerial Surveillance Sami Arja et.al. 2405.15209 link
2024-05-23 SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow Yihan Wang et.al. 2405.14793 link
2024-05-23 OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance Shuheng Ge et.al. 2405.14709 null
2024-05-23 Neuroexplicit Diffusion Models for Inpainting of Optical Flow Fields Tom Fischer et.al. 2405.14599 null
2024-05-22 MotionCraft: Physics-based Zero-Shot Video Generation Luca Savant Aira et.al. 2405.13557 link
2024-05-21 Weakly supervised alignment and registration of MR-CT for cervical cancer radiotherapy Jjahao Zhang et.al. 2405.12850 null
2024-05-21 Rethink Predicting the Optical Flow with the Kinetics Perspective Yuhao Cheng et.al. 2405.12512 link
2024-05-18 GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition Mallika Garg et.al. 2405.11180 link
2024-05-17 MicroBundlePillarTrack, A Python package for automated segmentation, tracking, and analysis of pillar deflection in cardiac microbundles Hiba Kobeissi et.al. 2405.11096 link
2024-05-16 Physics-incorporated Graph Neural Network for Multivariate Time Series Imputation Guojun Liang et.al. 2405.10995 link
2024-05-15 Dance Any Beat: Blending Beats with Visuals in Dance Video Generation Xuanchen Wang et.al. 2405.09266 null
2024-05-11 DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation Volodymyr Fedynyak et.al. 2405.08715 null
2024-05-14 EchoTracker: Advancing Myocardial Point Tracking in Echocardiography Md Abulkalam Azad et.al. 2405.08587 link
2024-05-15 Vector-Symbolic Architecture for Event-Based Optical Flow Hongzhi You et.al. 2405.08300 null
2024-05-12 NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU Yuhao Zhang et.al. 2405.07392 link
2024-05-11 Global Motion Understanding in Large-Scale Video Object Segmentation Volodymyr Fedynyak et.al. 2405.07031 null
2024-05-09 A Survey on Backbones for Deep Video Action Recognition Zixuan Tang et.al. 2405.05584 null
2024-05-08 Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection Shengyang Sun et.al. 2405.05130 link
2024-05-07 Visually Guided Swarm Motion Coordination via Insect-inspired Small Target Motion Reactions Md Arif Billah et.al. 2405.04591 null
2024-05-06 Diffeomorphic Template Registration for Atmospheric Turbulence Mitigation Dong Lao et.al. 2405.03662 null

Object Tracking

Publish Date Title Authors PDF Code
2025-07-15 CharaConsist: Fine-Grained Consistent Character Generation Mengyu Wang et.al. 2507.11533 null
2025-07-14 Taming Modern Point Tracking for Speckle Tracking Echocardiography via Impartial Motion Md Abulkalam Azad et.al. 2507.10127 null
2025-07-14 MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second Chenguo Lin et.al. 2507.10065 null
2025-07-14 OpenHuman4D: Open-Vocabulary 4D Human Parsing Keito Suzuki et.al. 2507.09880 null
2025-07-12 Online Long-term Point Tracking in the Foundation Model Era Görkay Aydemir et.al. 2507.09217 null
2025-07-12 On the Fragility of Multimodal Perception to Temporal Misalignment in Autonomous Driving Md Hasan Shahriar et.al. 2507.09095 null
2025-07-11 SAM2RL: Towards Reinforcement Learning Memory Control in Segment Anything Model 2 Alen Adamyan et.al. 2507.08548 null
2025-07-14 HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking Ruixiang Chen et.al. 2507.07603 null
2025-07-10 Temporal Unlearnable Examples: Preventing Personal Video Data from Unauthorized Exploitation by Object Tracking Qiangqiang Wu et.al. 2507.07483 null
2025-07-08 When Trackers Date Fish: A Benchmark and Framework for Underwater Multiple Fish Tracking Weiran Li et.al. 2507.06400 null
2025-07-08 Learning to Track Any Points from Human Motion Inès Hyeonsu Kim et.al. 2507.06233 null
2025-07-08 Cooperative Mapping, Localization, and Beam Management via Multi-Modal SLAM in ISAC Systems Hang Que et.al. 2507.05718 null
2025-07-07 Self-Supervised Real-Time Tracking of Military Vehicles in Low-FPS UAV Footage Markiyan Kostiv et.al. 2507.05229 null
2025-07-07 Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking Maria Damanaki et.al. 2507.04762 null
2025-07-05 Integrated Gaussian Processes for Robust and Adaptive Multi-Object Tracking Fred Lydeard et.al. 2507.04116 null
2025-07-03 CrowdTrack: A Benchmark for Difficult Multiple Pedestrian Tracking in Real Scenarios Teng Fu et.al. 2507.02479 null
2025-07-03 A Novel Tuning Method for Real-time Multiple-Object Tracking Utilizing Thermal Sensor with Complexity Motion Pattern Duong Nguyen-Ngoc Tran et.al. 2507.02408 null
2025-07-03 PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection Seokyeong Lee et.al. 2507.02393 null
2025-07-02 TrackingMiM: Efficient Mamba-in-Mamba Serialization for Real-time UAV Object Tracking Bingxi Liu et.al. 2507.01535 null
2025-07-04 Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations Shivansh Patel et.al. 2507.00990 null
2025-07-01 UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions Siyuan Yao et.al. 2507.00648 null
2025-06-30 Visual and Memory Dual Adapter for Multi-Modal Object Tracking Boyue Xu et.al. 2506.23972 null
2025-06-30 Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking Shiao Wang et.al. 2506.23783 null
2025-06-28 Optimal Trajectory Planning for Space Object Tracking with Collision-Avoidance Constraints Saif R. Kazi et.al. 2506.22797 null
2025-06-27 Improving Token-based Object Detection with Video Abhineet Singh et.al. 2506.22562 null
2025-07-01 R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning Biao Wang et.al. 2506.21980 null
2025-06-26 Linear and Second-order-cone Valid Inequalities for Problems with Storage Juan M. Morales et.al. 2506.21470 null
2025-06-24 VideoPCDNet: Video Parsing and Prediction with Phase Correlation Networks Noel José Rodrigues Vicente et.al. 2506.19621 null
2025-06-24 Trajectory Prediction in Dynamic Object Tracking: A Critical Study Zhongping Dong et.al. 2506.19341 null
2025-06-23 Lightweight RGB-T Tracking with Mobile Vision Transformers Mahdi Falaki et.al. 2506.19154 null
2025-06-23 USVTrack: USV-Based 4D Radar-Camera Tracking Dataset for Autonomous Driving in Inland Waterways Shanliang Yao et.al. 2506.18737 null
2025-06-23 Emergent Temporal Correspondences from Video Diffusion Transformers Jisu Nam et.al. 2506.17220 link
2025-06-20 RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking Teng Guo et.al. 2506.17119 link
2025-06-19 From Theory to Practice: Identifying the Optimal Approach for Offset Point Tracking in the Context of Agricultural Robotics Stephane Ngnepiepaye Wembe et.al. 2506.16143 null
2025-06-19 KARL: Kalman-Filter Assisted Reinforcement Learner for Dynamic Object Tracking and Grasping Kowndinya Boyalakuntla et.al. 2506.15945 null
2025-06-18 Probabilistic Trajectory GOSPA: A Metric for Uncertainty-Aware Multi-Object Tracking Performance Evaluation Yuxuan Xia et.al. 2506.15148 null
2025-06-17 Projected integral control of impedance passive nonlinear systems Nicolas Vanspranghe et.al. 2506.14267 null
2025-06-16 Deep Learning-Based Multi-Object Tracking: A Comprehensive Survey from Foundations to State-of-the-Art Momir Adžemović et.al. 2506.13457 null
2025-06-15 Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors Wen-Hsuan Chu et.al. 2506.12716 null
2025-06-13 Multiple Object Tracking in Video SAR: A Benchmark and Tracking Baseline Haoxiang Chen et.al. 2506.12105 null
2025-06-11 Optimizing Cooperative Multi-Object Tracking using Graph Signal Processing Maria Damanaki et.al. 2506.09469 null
2025-06-10 MOSE: A Novel Orchestration Framework for Stateful Microservice Migration at the Edge Antonio Calagna et.al. 2506.09159 null
2025-06-10 MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning Mohammadreza Salehi et.al. 2506.08694 link
2025-06-09 SAM2Auto: Auto Annotation Using FLASH Arash Rocky et.al. 2506.07850 null
2025-06-09 DragNeXt: Rethinking Drag-Based Image Editing Yuan Zhou et.al. 2506.07611 null
2025-06-08 AllTracker: Efficient Dense Point Tracking at High Resolution Adam W. Harley et.al. 2506.07310 null
2025-06-05 FRAME: Pre-Training Video Feature Representations via Anticipation and Memory Sethuraman TV et.al. 2506.05543 null
2025-06-08 Context Is Not Comprehension Alex Pan et.al. 2506.04907 null
2025-06-04 Contour Errors: An Ego-Centric Metric for Reliable 3D Multi-Object Tracking Sharang Kaul et.al. 2506.04122 null
2025-06-03 SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports Dheeraj Khanna et.al. 2506.03335 null
2025-06-03 IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation Yuanze Lin et.al. 2506.03150 null
2025-06-03 MVTD: A Benchmark Dataset for Maritime Visual Object Tracking Ahsan Baidar Bakht et.al. 2506.02866 null
2025-06-09 E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models Wenyan Cong et.al. 2506.01933 null
2025-06-02 UMA: Ultra-detailed Human Avatars via Multi-level Surface Alignment Heming Zhu et.al. 2506.01802 null
2025-06-02 No Train Yet Gain: Towards Generic Multi-Object Tracking in Sports and Beyond Tomasz Stanczyk et.al. 2506.01373 null
2025-06-01 Depth-Aware Scoring and Hierarchical Alignment for Multiple Object Tracking Milad Khanchi et.al. 2506.00774 null
2025-05-29 Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping Justin Lazarow et.al. 2505.23756 null
2025-05-27 SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation Claudia Cuttano et.al. 2505.21795 link
2025-05-27 Fully Spiking Neural Networks for Unified Frame-Event Object Tracking Jingjun Yang et.al. 2505.20834 null
2025-05-26 Video-based Direct Time Series Measurement of Along-Strike Slip on the Coseismic Surface Rupture During the 2025 Mw7.7 Myanmar Earthquake Jianhao Gao et.al. 2505.20494 null
2025-05-26 ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking Sijia Chen et.al. 2505.20381 link
2025-05-28 Progressive Scaling Visual Object Tracking Jack Hong et.al. 2505.19990 null
2025-05-24 Distributed Expectation Propagation for Multi-Object Tracking over Sensor Networks Qing Li et.al. 2505.18795 null
2025-05-24 FusionTrack: End-to-End Multi-Object Tracking in Arbitrary Multi-View Environment Xiaohe Li et.al. 2505.18727 null
2025-05-24 EOTNet: Deep Memory Aided Bayesian Filter for Extended Object Tracking Zhixing Wang et.al. 2505.18684 link
2025-05-23 Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking Cheng-Yen Yang et.al. 2505.18111 null
2025-05-22 A Framework for Multi-View Multiple Object Tracking using Single-View Multi-Object Trackers on Fish Data Chaim Chai Elchik et.al. 2505.17201 null
2025-05-22 Temporal Object Captioning for Street Scene Videos from LiDAR Tracks Vignesh Gopinathan et.al. 2505.16594 null
2025-05-21 Learning better representations for crowded pedestrians in offboard LiDAR-camera 3D tracking-by-detection Shichao Li et.al. 2505.16029 link
2025-05-21 ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation Tony Montes et.al. 2505.15928 link
2025-05-19 Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach Shiao Wang et.al. 2505.12903 link
2025-05-22 LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking Martha Teiko Teye et.al. 2505.12753 null
2025-05-19 Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking Shiyu Xuan et.al. 2505.12606 null
2025-05-20 DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion Model Siwei Xia et.al. 2505.12427 link
2025-05-18 DIMM: Decoupled Multi-hierarchy Kalman Filter for 3D Object Tracking Jirong Zha et.al. 2505.12340 null
2025-05-17 GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity Takuya Ikeda et.al. 2505.11905 null
2025-05-12 Asynchronous Multi-Object Tracking with an Event Camera Angus Apps et.al. 2505.08126 link
2025-05-12 SAEN-BGS: Energy-Efficient Spiking AutoEncoder Network for Background Subtraction Zhixuan Zhang et.al. 2505.07336 null
2025-05-12 Towards Accurate State Estimation: Kalman Filter Incorporating Motion Dynamics for 3D Multi-Object Tracking Mohamed Nagy et.al. 2505.07254 null
2025-05-09 Hyperbolic and Elliptic Points Tracking Algorithm (HEPTA) in two-dimensional non-stationary velocity fields defined on a discrete grid A. A. Udalov et.al. 2505.05975 null
2025-05-09 CGTrack: Cascade Gating Network with Hierarchical Feature Aggregation for UAV Tracking Weihong Li et.al. 2505.05936 link
2025-05-09 You Are Your Best Teacher: Semi-Supervised Surgical Point Tracking with Cycle-Consistent Self-Distillation Valay Bundele et.al. 2505.05722 null
2025-05-08 A Simple Detector with Frame Dynamics is a Strong Tracker Chenxu Peng et.al. 2505.04917 link
2025-05-11 SMMT: Siamese Motion Mamba with Self-attention for Thermal Infrared Target Tracking Shang Zhang et.al. 2505.04088 null
2025-05-06 Interactive Instance Annotation with Siamese Networks Xiang Xu et.al. 2505.03184 null
2025-05-06 TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion Haoyue Liu et.al. 2505.03116 null
2025-05-02 CAMELTrack: Context-Aware Multi-cue ExpLoitation for Online Multi-Object Tracking Vladimir Somers et.al. 2505.01257 link
2025-05-02 Optimizing Indoor Farm Monitoring Efficiency Using UAV: Yield Estimation in a GNSS-Denied Cherry Tomato Greenhouse Taewook Park et.al. 2505.00995 null
2025-04-30 MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection Qiushi Yang et.al. 2505.00739 null
2025-05-01 A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic Muhammad Imran Zaman et.al. 2505.00534 null
2025-04-30 Direct Motion Models for Assessing Generated Videos Kelsey Allen et.al. 2505.00209 null
2025-04-30 Stereo X-ray tomography on deformed object tracking Zhenduo Shang et.al. 2505.00122 null
2025-04-30 LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics Marc Glocker et.al. 2504.21716 link
2025-04-30 Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction Zihan Zhou et.al. 2504.21692 null
2025-04-30 Model-Free Two-Degree-of-Freedom PID Controller Design for Unknown LTI Systems Taiga Kiyota et.al. 2504.21341 null
2025-04-29 The Mean of Multi-Object Trajectories Tran Thien Dat Nguyen et.al. 2504.20391 null
2025-04-28 Improving trajectory continuity in drone-based crowd monitoring using a set of minimal-cost techniques and deep discriminative correlation filters Bartosz Ptak et.al. 2504.20234 null
2025-04-28 A computer vision method to estimate ventilation rate of Atlantic salmon in sea fish farms Lukas Folkman et.al. 2504.19719 null
2025-04-25 Decentralized Fusion of 3D Extended Object Tracking based on a B-Spline Shape Model Longfei Han et.al. 2504.18708 null
2025-04-25 Multi-Sensor Fusion of Active and Passive Measurements for Extended Object Tracking Hong Zhu et.al. 2504.18301 null
2025-04-25 PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models Michel Gokan Khan et.al. 2504.18165 link
2025-04-25 S3MOT: Monocular 3D Object Tracking with Selective State Space Model Zhuohao Yan et.al. 2504.18068 null
2025-04-24 Dynamic Camera Poses and Where to Find Them Chris Rockwell et.al. 2504.17788 null
2025-04-23 PRaDA: Projective Radial Distortion Averaging Daniil Sinitsyn et.al. 2504.16499 null
2025-04-22 SonarT165: A Large-scale Benchmark and STFTrack Framework for Acoustic Object Tracking Yunfeng Li et.al. 2504.15609 link
2025-04-20 TAPIP3D: Tracking Any Point in Persistent 3D Geometry Bowei Zhang et.al. 2504.14717 link
2025-04-20 Seurat: From Moving Points to Depth Seokju Cho et.al. 2504.14687 link
2025-04-19 Adversarial Attack for RGB-Event based Visual Object Tracking Qiang Chen et.al. 2504.14423 link
2025-04-17 St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World Haiwen Feng et.al. 2504.13152 null
2025-04-17 Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving Shumin Wang et.al. 2504.12709 null
2025-04-16 Robust Visual Servoing under Human Supervision for Assembly Tasks Victor Nan Fernandez-Ayala et.al. 2504.12506 null
2025-04-13 Intelligent driving vehicle front multi-target tracking and detection based on YOLOv5 and point cloud 3D projection Dayong Liu et.al. 2504.11310 null
2025-04-15 WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs Nguyen Ngoc Dat et.al. 2504.10165 null
2025-04-14 LiteTracker: Leveraging Temporal Causality for Accurate Low-latency Tissue Tracking Mert Asim Karaoglu et.al. 2504.09904 null
2025-04-12 PapMOT: Exploring Adversarial Patch Attack against Multiple Object Tracking Jiahuan Long et.al. 2504.09361 null
2025-04-12 Text To 3D Object Generation For Scalable Room Assembly Sonia Laguna et.al. 2504.09328 null
2025-04-12 ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking Tzoulio Chamiti et.al. 2504.09195 null
2025-04-10 GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation Lang Lin et.al. 2504.07962 null
2025-04-09 Multi-Object Tracking for Collision Avoidance Using Multiple Cameras in Open RAN Networks Jordi Serra et.al. 2504.07163 null
2025-04-13 VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning Xinhao Li et.al. 2504.06958 null
2025-04-08 POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction Songyan Zhang et.al. 2504.05692 link
2025-04-06 SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation Junjie Jiang et.al. 2504.04519 link
2025-04-05 Risk-Aware Robot Control in Dynamic Environments Using Belief Control Barrier Functions Shaohang Han et.al. 2504.04097 link
2025-04-04 TQD-Track: Temporal Query Denoising for 3D Multi-Object Tracking Shuxiao Ding et.al. 2504.03258 null
2025-04-03 Attention-Aware Multi-View Pedestrian Tracking Reef Alturki et.al. 2504.03047 null
2025-04-03 Data-Driven Object Tracking: Integrating Modular Neural Networks into a Kalman Framework Christian Alexander Holz et.al. 2504.02519 null
2025-04-02 Deep LG-Track: An Enhanced Localization-Confidence-Guided Multi-Object Tracker Ting Meng et.al. 2504.01457 null
2025-04-02 COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking Chunhui Zhang et.al. 2504.01321 link
2025-04-01 IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval Bangwei Liu et.al. 2504.00954 null
2025-03-31 Point Tracking in Surgery–The 2024 Surgical Tattoos in Infrared (STIR) Challenge Adam Schmidt et.al. 2503.24306 link
2025-04-03 Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey Haoyang Wang et.al. 2503.22943 null
2025-03-28 Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision Rulin Zhou et.al. 2503.22394 null
2025-03-28 Hyperspectral Adapter for Object Tracking based on Hyperspectral Video Long Gao et.al. 2503.22199 null
2025-03-25 Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better Zihang Lai et.al. 2503.19904 null
2025-03-24 TrackID3x3: A Dataset and Algorithm for Multi-Player Tracking with Identification and Pose Estimation in 3x3 Basketball Full-court Videos Kazuhiro Yamada et.al. 2503.18282 link
2025-03-22 MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking Haolin Qin et.al. 2503.17699 link
2025-03-21 Dynamic Attention Mechanism in Spatiotemporal Memory Networks for Object Tracking Meng Zhou et.al. 2503.16768 null
2025-03-20 Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction Edgar Sucar et.al. 2503.16318 null
2025-03-19 Toward Scalable, Flexible Scene Flow for Point Clouds Kyle Vedder et.al. 2503.15666 null
2025-03-17 Real-Time Multi-Object Tracking using YOLOv8 and SORT on a SoC FPGA Michal Danilowicz et.al. 2503.13023 null
2025-03-17 OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering Guanhua Ding et.al. 2503.12968 null
2025-03-17 GIFT: Generated Indoor video frames for Texture-less point tracking Jianzheng Huang et.al. 2503.12944 null
2025-03-17 UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Network Siyuan Yao et.al. 2503.12888 link
2025-03-16 History-Aware Transformation of ReID Features for Multiple Object Tracking Ruopeng Gao et.al. 2503.12562 link
2025-03-15 ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object Zhe Shan et.al. 2503.12006 link
2025-03-14 VGGT: Visual Geometry Grounded Transformer Jianyuan Wang et.al. 2503.11651 link
2025-03-14 Cognitive Disentanglement for Referring Multi-Object Tracking Shaofeng Liang et.al. 2503.11496 null
2025-03-13 3D Extended Object Tracking based on Extruded B-Spline Side View Profiles Longfei Han et.al. 2503.10730 null
2025-03-18 OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer Jinyang Li et.al. 2503.10616 link
2025-03-13 Low Complexity Point Tracking of the Myocardium in 2D Echocardiography Artem Chernyshov et.al. 2503.10431 link
2025-03-13 Target-aware Bidirectional Fusion Transformer for Aerial Object Tracking Xinglong Sun et.al. 2503.09951 null
2025-03-12 How good are deep learning methods for automated road safety analysis using video data? An experimental study Qingwu Liu et.al. 2503.09807 null
2025-03-11 TrackOcc: Camera-based 4D Panoptic Occupancy Tracking Zhuoguang Chen et.al. 2503.08471 link
2025-03-11 Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking Yunhao Li et.al. 2503.08145 null
2025-03-10 SIRE: SE(3) Intrinsic Rigidity Embeddings Cameron Smith et.al. 2503.07739 null
2025-03-10 CPAny: Couple With Any Encoder to Refer Multi-Object Tracking Weize Li et.al. 2503.07516 null
2025-03-09 Online Dense Point Tracking with Streaming Memory Qiaole Dong et.al. 2503.06471 link
2025-03-06 A Novel Control Strategy for Offset Points Tracking in the Context of Agricultural Robotics Stephane Ngnepiepaye Wembe et.al. 2503.05835 null
2025-03-06 Omnidirectional Multi-Object Tracking Kai Luo et.al. 2503.04565 link
2025-03-09 ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem Yu-Hsi Chen et.al. 2503.04500 link
2025-03-06 A Modular Pipeline for 3D Object Tracking Using RGB Cameras Lars Bredereke et.al. 2503.04322 link
2025-03-03 AI-Driven Relocation Tracking in Dynamic Kitchen Environments Arash Nasr Esfahani et.al. 2503.01547 link
2025-02-27 MITracker: Multi-View Integration for Visual Object Tracking Mengjie Xu et.al. 2502.20111 null
2025-02-26 Spectral-Enhanced Transformers: Leveraging Large-Scale Pretrained Models for Hyperspectral Object Tracking Shaheer Mohamed et.al. 2502.18748 null
2025-02-25 UASTrack: A Unified Adaptive Selection Framework with Modality-Customization in Single Object Tracking He Wang et.al. 2502.18220 null
2025-02-26 Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking Peng Zhang et.al. 2502.17822 null
2025-02-24 V-HOP: Visuo-Haptic 6D Object Pose Tracking Hongyu Li et.al. 2502.17434 null
2025-02-24 Enriching Physical-Virtual Interaction in AR Gaming by Tracking Identical Real Objects Liuchuan Yu et.al. 2502.17399 link
2025-02-24 CRTrack: Low-Light Semi-Supervised Multi-object Tracking Based on Consistency Regularization Zijing Zhao et.al. 2502.16809 null
2025-02-23 Benchmarking Online Object Trackers for Underwater Robot Position Locking Applications Ali Safa et.al. 2502.16569 null
2025-02-19 A Training-Free Framework for Precise Mobile Manipulation of Small Everyday Objects Arjun Gupta et.al. 2502.13964 null
2025-02-19 MEX: Memory-efficient Approach to Referring Multi-Object Tracking Huu-Thien Tran et.al. 2502.13875 null
2025-02-18 Pre-training Auto-regressive Robotic Models with 4D Representations Dantong Niu et.al. 2502.13142 null
2025-02-13 IMM-MOT: A Novel 3D Multi-object Tracking Framework with Interacting Multiple Model Filter Xiaohong Liu et.al. 2502.09672 null
2025-02-12 Control Barrier Function-Based Quadratic Programming for Safe Operation of Tethered UAVs Samuel O. Folorunsho et.al. 2502.08129 null
2025-02-10 Adaptive Perception for Unified Visual Multi-modal Object Tracking Xiantao Hu et.al. 2502.06583 null
2025-02-09 Energy-Efficient Autonomous Aerial Navigation with Dynamic Vision Sensors: A Physics-Guided Neuromorphic Approach Sourav Sanyal et.al. 2502.05938 null
2025-02-08 Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark Shiao Wang et.al. 2502.05574 link
2025-02-06 OneTrack-M: A multitask approach to transformer-based MOT models Luiz C. S. de Araujo et.al. 2502.04478 null
2025-02-06 RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology Nhat-Tan Do et.al. 2502.03760 null
2025-02-04 Rethinking Vision Transformer for Object Centric Foundation Models Manuel Traub et.al. 2502.02763 null
2025-02-04 INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy Nastaran Darabi et.al. 2502.01896 null
2025-02-03 Bayesian Approximation-Based Trajectory Prediction and Tracking with 4D Radar Dong-In Kim et.al. 2502.01357 null
2025-02-03 Solgenia – A Test Vessel Toward Energy-Efficient Autonomous Water Taxi Applications Hannes Homburger et.al. 2502.01207 link
2025-01-30 Track-On: Transformer-based Online Point Tracking with Memory Görkay Aydemir et.al. 2501.18487 link
2025-01-28 Overcoming Semantic Dilution in Transformer-Based Next Frame Prediction Hy Nguyen et.al. 2501.16753 null
2025-01-27 Understanding Long Videos via LLM-Powered Entity Relation Graphs Meng Chu et.al. 2501.15953 null
2025-01-24 MATCHA: Towards Matching Anything Fei Xue et.al. 2501.14945 null
2025-01-24 Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection Viktor Kozák et.al. 2501.14587 null
2025-01-23 CSAOT: Cooperative Multi-Agent System for Active Object Tracking Hy Nguyen et.al. 2501.13994 null
2025-01-23 YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID Iñaki Erregue et.al. 2501.13710 link
2025-01-21 Learning segmentation from point trajectories Laurynas Karazija et.al. 2501.12392 link
2025-01-22 InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Yi Wang et.al. 2501.12386 link
2025-01-21 Exploring Temporally-Aware Features for Point Tracking Inès Hyeonsu Kim et.al. 2501.12218 link
2025-01-20 PD-SORT: Occlusion-Robust Multi-Object Tracking Using Pseudo-Depth Cues Yanchao Wang et.al. 2501.11288 link
2025-01-17 Spatio-temporal Graph Learning on Adaptive Mined Key Frames for High-performance Multi-Object Tracking Futian Wang et.al. 2501.10129 null
2025-01-13 SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing Varun Biyyala et.al. 2501.07554 link
2025-01-13 TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations Daniel Steininger et.al. 2501.07360 link
2025-01-13 Robust Single Object Tracking in LiDAR Point Clouds under Adverse Weather Conditions Xiantong Zhao et.al. 2501.07133 null
2025-01-09 An Empirical Study of Autoregressive Pre-training from Videos Jathushan Rajasegaran et.al. 2501.05453 null
2025-01-08 Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs Zeyi Huang et.al. 2501.04336 null
2025-01-07 Neuromorphic Optical Tracking and Imaging of Randomly Moving Targets through Strongly Scattering Media Ning Zhang et.al. 2501.03874 null
2025-01-06 ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking Tingyang Zhang et.al. 2501.03220 null
2025-01-05 GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking Weikang Bian et.al. 2501.02690 null
2025-01-05 DeTrack: In-model Latent Denoising Learning for Visual Object Tracking Xinyu Zhou et.al. 2501.02467 null
2025-01-02 HybridTrack: A Hybrid Approach for Robust Multi-Object Tracking Leandro Di Bella et.al. 2501.01275 link
2025-01-02 Sensitivity of Room Impulse Responses in Changing Acoustic Environment Karolina Prawda et.al. 2501.01206 null
2025-01-01 Less is More: Token Context-aware Learning for Object Tracking Chenlong Xu et.al. 2501.00758 link
2024-12-26 SUTrack: Towards Simple and Unified Single Object Tracking Xin Chen et.al. 2412.19138 link
2024-12-23 Cross-View Referring Multi-Object Tracking Sijia Chen et.al. 2412.17807 link
2024-12-20 Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking Xiantao Hu et.al. 2412.15691 link
2024-12-19 Scaling 4D Representations João Carreira et.al. 2412.15212 null
2024-12-18 Joint Perception and Prediction for Autonomous Driving: A Survey Lucas Dal’Col et.al. 2412.14088 link
2024-12-18 MambaLCT: Boosting Tracking via Long-term Context State Space Model Xiaohai Li et.al. 2412.13615 link
2024-12-17 CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices Andrei Znobishchev et.al. 2412.13273 null
2024-12-17 Tell Me What to Track: Infusing Robust Language Guidance for Enhanced Referring Multi-Object Tracking Wenjun Huang et.al. 2412.12561 null
2024-12-15 Exploring Enhanced Contextual Information for Video-Level Object Tracking Ben Kang et.al. 2412.11023 link
2024-12-14 Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos Qingyu Xu et.al. 2412.10861 link
2024-12-14 Patch-level Sounding Object Tracking for Audio-Visual Question Answering Zhangbin Li et.al. 2412.10749 null
2024-12-12 Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach Kailas PS et.al. 2412.10453 null
2024-12-13 Visual Object Tracking across Diverse Data Modalities: A Review Mengmeng Wang et.al. 2412.09991 null
2024-12-12 NormalFlow: Fast, Robust, and Accurate Contact-based Object 6DoF Pose Tracking with Vision-based Tactile Sensors Hung-Jui Huang et.al. 2412.09617 link
2024-12-12 Temporal-Assisted Beamforming and Trajectory Prediction in Sensing-Enabled UAV Communications Shengcai Zhou et.al. 2412.09097 null
2024-12-11 TGOSPA Metric Parameters Selection and Evaluation for Visual Multi-object Tracking Jan Krejčí et.al. 2412.08321 null
2024-12-11 Post-Hoc MOTS: Exploring the Capabilities of Time-Symmetric Multi-Object Tracking Gergely Szabó et.al. 2412.08313 null
2024-12-11 DTAA: A Detect, Track and Avoid Architecture for navigation in spaces with Multiple Velocity Objects Samuel Nordström et.al. 2412.08121 null
2024-12-10 Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation Kurt H. W. Stolle et.al. 2412.07966 null
2024-12-10 Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments Muhayy Ud Din et.al. 2412.07392 null
2024-12-10 Optical Levitation of Arrays of Microspheres Benjamin Siegel et.al. 2412.07088 null
2024-12-09 Microcontroller-Driven MPPT System for Enhanced Photovoltaic Efficiency: An Experimental Approach in Nepal Diwakar Khadka et.al. 2412.06956 null
2024-12-09 Enhanced Multi-Object Tracking Using Pose-based Virtual Markers in 3x3 Basketball Li Yin et.al. 2412.06258 null
2024-12-10 Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation Hyeonho Jeong et.al. 2412.06016 null
2024-12-07 Street Gaussians without 3D Object Tracker Ruida Zhang et.al. 2412.05548 null
2024-12-06 HOLa: HoloLens Object Labeling Michael Schwimmbeck et.al. 2412.04945 link
2024-12-06 Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection Khurram Azeem Hashmi et.al. 2412.04915 null
2024-12-05 EgoPoints: Advancing Point Tracking for Egocentric Videos Ahmad Darkhalil et.al. 2412.04592 null
2024-12-04 Distillation of Diffusion Features for Semantic Correspondence Frank Fundel et.al. 2412.03512 null
2024-12-03 MVCTrack: Boosting 3D Point Cloud Tracking via Multimodal-Guided Virtual Cues Zhaofeng Hu et.al. 2412.02734 link
2024-12-03 GSOT3D: Towards Generic 3D Single Object Tracking in the Wild Yifan Jiao et.al. 2412.02129 link
2024-12-02 6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting Yufeng Jin et.al. 2412.01543 null
2024-12-02 A2VIS: Amodal-Aware Approach to Video Instance Segmentation Minh Tran et.al. 2412.01147 null
2024-12-02 Referring Video Object Segmentation via Language-aligned Track Selection Seongchan Kim et.al. 2412.01136 link
2024-12-02 Eyes on the Road: State-of-the-Art Video Question Answering Models Assessment for Traffic Monitoring Tasks Joseph Raj Vishal et.al. 2412.01132 link
2024-12-02 Object Tracking in a $360^\circ$ View: A Novel Perspective on Bridging the Gap to Biomedical Advancements Mojtaba S. Fazli et.al. 2412.01119 null
2024-12-02 LiDAR SLAMMOT based on Confidence-guided Data Association Susu Fang et.al. 2412.01041 null
2024-12-01 BEV-SUSHI: Multi-Target Multi-Camera 3D Detection and Tracking in Bird’s-Eye View Yizhou Wang et.al. 2412.00692 null
2024-11-29 Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark Joseph Heyward et.al. 2411.19941 null
2024-11-28 HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos Prithviraj Banerjee et.al. 2411.19167 null
2024-11-28 Visual SLAMMOT Considering Multiple Motion Models Peilin Tian et.al. 2411.19134 null
2024-11-28 CrossTracker: Robust Multi-modal 3D Multi-Object Tracking via Cross Correction Lipeng Gu et.al. 2411.18850 null
2024-11-27 TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video Jinyuan Qu et.al. 2411.18671 null
2024-11-27 A comparison of extended object tracking with multi-modal sensors in indoor environment Jiangtao Shuai et.al. 2411.18476 null
2024-11-27 Efficient Dynamic LiDAR Odometry for Mobile Robots with Structured Point Clouds Jonathan Lichtenfeld et.al. 2411.18443 link
2024-11-26 A Distractor-Aware Memory for Visual Object Tracking with SAM2 Jovana Videnovic et.al. 2411.17576 link
2024-11-24 FastTrackTr: Towards Fast Multi-Object Tracking with Transformers Pan Liao et.al. 2411.15811 null
2024-11-23 How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking Xuchen Li et.al. 2411.15600 null
2024-11-23 MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking Xinqi Liu et.al. 2411.15459 null
2024-11-20 Gaze2AOI: Open Source Deep-learning Based System for Automatic Area of Interest Annotation with Eye Tracking Data Karolina Trajkovska et.al. 2411.13346 null
2024-11-20 Teaching VLMs to Localize Specific Objects from In-context Examples Sivan Doveh et.al. 2411.13317 link
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291 null
2024-11-24 ClickTrack: Towards Real-time Interactive Single Object Tracking Kuiran Wang et.al. 2411.13183 null
2024-11-20 Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity Wassim El Ahmar et.al. 2411.12943 link
2024-11-19 Resolution Improvement in OFDM-based Joint Communication and Sensing through Combined Tracking and Interpolation Charlotte Muth et.al. 2411.12464 null
2024-11-18 SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory Cheng-Yen Yang et.al. 2411.11922 link
2024-11-18 Learning a Neural Association Network for Self-supervised Multi-Object Tracking Shuai Li et.al. 2411.11514 null
2024-11-15 Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras Ishrath Ahamed et.al. 2411.10072 null
2024-11-21 MOT FCG++: Enhanced Representation of Spatio-temporal Motion and Appearance Features Yanzhao Fang et.al. 2411.10028 null
2024-11-13 Predictive Visuo-Tactile Interactive Perception Framework for Object Properties Inference Anirvan Dutta et.al. 2411.09020 null
2024-11-13 3D Multi-Object Tracking with Semi-Supervised GRU-Kalman Filter Xiaoxiang Wang et.al. 2411.08433 null
2024-11-13 DEEGITS: Deep Learning based Framework for Measuring Heterogenous Traffic State in Challenging Traffic Scenarios Muttahirul Islam et.al. 2411.08335 null
2024-11-12 GTA: Global Tracklet Association for Multi-Object Tracking in Sports Jiacheng Sun et.al. 2411.08216 link
2024-11-11 BuckTales: A multi-UAV dataset for multi-object tracking and re-identification of wild antelopes Hemal Naik et.al. 2411.06896 null
2024-11-11 HSTrack: Bootstrap End-to-End Multi-Camera 3D Multi-object Tracking with Hybrid Supervision Shubo Lin et.al. 2411.06780 null
2024-11-11 Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs Jia Syuen Lim et.al. 2411.06702 null
2024-11-10 PKF: Probabilistic Data Association Kalman Filter for Multi-Object Tracking Hanwen Cao et.al. 2411.06378 link
2024-11-09 Multi-object Tracking by Detection and Query: an efficient end-to-end manner Shukun Jia et.al. 2411.06197 null
2024-11-08 Agile UAV landing control on moving ship in adverse conditions James Mordaunt et.al. 2411.05445 null
2024-11-06 Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving Depanshu Sani et.al. 2411.03702 null
2024-11-05 Object and Contact Point Tracking in Demonstrations Using 3D Gaussian Splatting Michael Büttner et.al. 2411.03555 null
2024-11-04 SIRA: Scalable Inter-frame Relation and Association for Radar Perception Ryoma Yataka et.al. 2411.02220 null
2024-11-04 Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations Thanh Nguyen Canh et.al. 2411.01816 null
2024-11-04 ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model Yiming Sun et.al. 2411.01756 null
2024-11-01 HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices Xiang Li et.al. 2411.00608 null
2024-11-01 Is Multiple Object Tracking a Matter of Specialization? Gianluca Mancusi et.al. 2411.00553 null
2024-10-31 Extended Object Tracking and Classification based on Linear Splines Matteo Tesori et.al. 2410.24183 null
2024-10-30 IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking Run Luo et.al. 2410.23907 null
2024-10-28 Evaluating the Robustness of LiDAR Point Cloud Tracking Against Adversarial Attack Shengjing Tian et.al. 2410.20893 null
2024-10-27 BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events Yijin Li et.al. 2410.20451 null
2024-10-27 NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tracking Yu Liu et.al. 2410.20421 link
2024-10-27 Depth Attention for Robust RGB Tracking Yu Liu et.al. 2410.20395 link
2024-10-26 SFTrack: A Robust Scale and Motion Adaptive Algorithm for Tracking Small and Fast Moving Objects InPyo Song et.al. 2410.20079 null
2024-10-25 A-MFST: Adaptive Multi-Flow Sparse Tracker for Real-Time Tissue Tracking Under Occlusion Yuxin Chen et.al. 2410.19996 null
2024-10-23 ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting Shaofei Cai et.al. 2410.17856 link
2024-10-23 Real-time Vehicle-to-Vehicle Communication Based Network Cooperative Control System through Distributed Database and Multimodal Perception: Demonstrated in Crossroads Xinwen Zhu et.al. 2410.17576 link
2024-10-23 OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking Haiji Liang et.al. 2410.17534 link
2024-10-22 MPT: A Large-scale Multi-Phytoplankton Tracking Benchmark Yang Yu et.al. 2410.16695 link
2024-10-19 The Solution for Single Object Tracking Task of Perception Test Challenge 2024 Zhiqiang Zhong et.al. 2410.16329 null
2024-10-20 TrackMe: A Simple and Effective Multiple Object Tracking Annotation Tool Thinh Phan et.al. 2410.15518 link
2024-10-20 Multiset Combinatorial Gray Codes with Application to Proximity Sensor Networks Chung Shue Chen et.al. 2410.15428 null
2024-10-19 3D Multi-Object Tracking Employing MS-GLMB Filter for Autonomous Driving Linh Van Ma et.al. 2410.14977 link
2024-10-18 Enhancing In-vehicle Multiple Object Tracking Systems with Embeddable Ising Machines Kosuke Tatsumura et.al. 2410.14093 null
2024-10-17 Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation Changcheng Xiao et.al. 2410.13437 null
2024-10-17 TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal Yanpeng Jia et.al. 2410.13240 null
2024-10-15 CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos Nikita Karaev et.al. 2410.11831 null
2024-10-17 UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles Hui Ye et.al. 2410.11125 null
2024-10-14 Motion-guided small MAV detection in complex and non-planar scenes Hanqing Guo et.al. 2410.10527 null
2024-10-14 SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments Khaled Gabr et.al. 2410.10409 link
2024-10-14 DINTR: Tracking via Diffusion-based Interpolation Pha Nguyen et.al. 2410.10053 null
2024-10-11 Enhanced Kalman with Adaptive Appearance Motion SORT for Grounded Generic Multiple Object Tracking Duy Le Dinh Anh et.al. 2410.09243 null
2024-10-11 VideoSAM: Open-World Video Segmentation Pinxue Guo et.al. 2410.08781 null
2024-10-11 Efficient Multi-Object Tracking on Edge Devices via Reconstruction-Based Channel Pruning Jan Müller et.al. 2410.08769 null
2024-10-11 VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking Zekun Qian et.al. 2410.08529 null
2024-10-05 ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments Lorenzo Terenzi et.al. 2410.04250 null
2024-10-04 Combing Text-based and Drag-based Editing for Precise and Flexible Image Editing Ziqi Jiang et.al. 2410.03097 null
2024-10-03 Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking Fabian Herzog et.al. 2410.02638 link
2024-10-09 DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM Xuchen Li et.al. 2410.02492 null
2024-10-03 Spiking Neural Network as Adaptive Event Stream Slicer Jiahang Cao et.al. 2410.02249 link
2024-10-10 Tracking objects that change in appearance with phase synchrony Sabine Muzellec et.al. 2410.02094 null
2024-10-02 Scene Flow as a Partial Differential Equation Kyle Vedder et.al. 2410.02031 null
2024-10-02 Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking Mattia Segu et.al. 2410.01806 null
2024-10-02 Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking Ayesha Ishaq et.al. 2410.01678 link
2024-09-29 One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos Zechen Bai et.al. 2409.19603 link
2024-09-27 Improving Visual Object Tracking through Visual Prompting Shih-Fang Chen et.al. 2409.18901 link
2024-09-30 An Overview of Multi-Object Estimation via Labeled Random Finite Set Ba-Ngu Vo et.al. 2409.18531 null
2024-09-26 BlinkTrack: Feature Tracking over 100 FPS via Events and Images Yichen Shen et.al. 2409.17981 null
2024-09-26 General Compression Framework for Efficient Transformer Object Tracking Lingyi Hong et.al. 2409.17564 null
2024-09-26 CAMOT: Camera Angle-aware Multi-Object Tracking Felix Limanta et.al. 2409.17533 null
2024-09-25 Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Appearance Graphs Mattia Segu et.al. 2409.17221 null
2024-09-25 Automated Surgical Skill Assessment in Endoscopic Pituitary Surgery using Real-time Instrument Tracking on a High-fidelity Bench-top Phantom Adrito Das et.al. 2409.17025 null
2024-09-25 Towards Underwater Camouflaged Object Tracking: An Experimental Evaluation of SAM and SAM 2 Chunhui Zhang et.al. 2409.16902 link
2024-09-25 Conditional Generative Denoiser for Nighttime UAV Tracking Yucheng Wang et.al. 2409.16834 link
2024-09-25 Progressive Representation Learning for Real-Time UAV Tracking Changhong Fu et.al. 2409.16652 link
2024-09-25 Enhancing Nighttime UAV Tracking with Light Distribution Suppression Liangliang Yao et.al. 2409.16631 link
2024-09-24 Transformer based time series prediction of the maximum power point for solar photovoltaic cells Palaash Agrawal et.al. 2409.16342 null
2024-09-24 Self-Supervised Any-Point Tracking by Contrastive Random Walks Ayush Shrivastava et.al. 2409.16288 link
2024-09-23 MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving Xiyang Wang et.al. 2409.16149 link
2024-09-24 CloudTrack: Scalable UAV Tracking with Cloud Semantics Yannik Blei et.al. 2409.16111 link
2024-09-22 TrackNetV4: Enhancing Fast Sports Object Tracking with Motion Attention Maps Arjun Raj et.al. 2409.14543 null
2024-09-21 Masks and Boxes: Combining the Best of Both Worlds for Multi-Object Tracking Tomasz Stanczyk et.al. 2409.14220 null
2024-09-21 Foundation Models for Amodal Video Instance Segmentation in Automated Driving Jasmin Breitenstein et.al. 2409.14095 link
2024-09-18 Tracking Any Point with Frame-Event Fusion Network at High Frame Rate Jiaxiong Liu et.al. 2409.11953 null
2024-09-18 RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework Xiaoyu Li et.al. 2409.11749 null
2024-09-17 SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking Siyuan Li et.al. 2409.11235 link
2024-09-17 STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking Jianbo Ma et.al. 2409.11234 link
2024-09-17 TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection Philip Jacobson et.al. 2409.10901 null
2024-09-15 Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings Oriel Perl et.al. 2409.09841 null
2024-09-14 Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown Zimeng Fang et.al. 2409.09293 link
2024-09-12 FACT: Feature Adaptive Continual-learning Tracker for Multiple Object Tracking Rongzihan Song et.al. 2409.07904 null
2024-09-10 When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking Emirhan Bayar et.al. 2409.06617 link
2024-09-09 Leveraging Object Priors for Point Tracking Bikram Boote et.al. 2409.05786 link
2024-09-08 RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network Zhiwei Lin et.al. 2409.04979 null
2024-09-06 LITE: A Paradigm Shift in Multi-Object Tracking with Efficient ReID Feature Integration Jumabek Alikhanov et.al. 2409.04187 link
2024-09-05 Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints Keisuke Toida et.al. 2409.03252 null
2024-09-04 TP-GMOT: Tracking Generic Multiple Object by Textual Prompt with Motion-Appearance Cost (MAC) SORT Duy Le Dinh Anh et.al. 2409.02490 link
2024-09-03 DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction Jenny Seidenschwarz et.al. 2409.02104 null
2024-09-01 YOLOO: You Only Learn from Others Once Lipeng Gu et.al. 2409.00618 null
2024-09-10 TrackSSM: A General Motion Predictor by State-Space Model Bin Hu et.al. 2409.00487 link
2024-08-31 Fish Tracking Challenge 2024: A Multi-Object Tracking Competition with Sweetfish Schooling Data Makoto M. Itoh et.al. 2409.00339 null
2024-08-30 UTrack: Multi-Object Tracking with Uncertain Detections Edgardo Solano-Carrillo et.al. 2408.17098 link
2024-08-29 Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla et.al. 2408.16445 link
2024-08-29 Estimating Dynamic Flow Features in Groups of Tracked Objects Tanner D. Harms et.al. 2408.16190 null
2024-08-28 ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model Lifan Jiang et.al. 2408.15548 link
2024-08-25 Camouflaged Object Tracking: A Benchmark Xiaoyu Guo et.al. 2408.13877 link
2024-08-24 Can Visual Foundation Models Achieve Long-term Point Tracking? Görkay Aydemir et.al. 2408.13575 null
2024-08-23 MCTR: Multi Camera Tracking Transformer Alexandru Niculescu-Mizil et.al. 2408.13243 null
2024-08-23 BoostTrack++: using tracklet information to detect more objects in multiple object tracking Vukašin Stanojević et.al. 2408.13003 link
2024-08-22 BankTweak: Adversarial Attack against Multi-Object Trackers by Manipulating Feature Banks Woojin Shin et.al. 2408.12727 null
2024-08-22 BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking Hanzheng Wang et.al. 2408.12232 null
2024-08-21 CHOTA: A Higher Order Accuracy Metric for Cell Tracking Timo Kaiser et.al. 2408.11571 link
2024-08-21 Low-Light Object Tracking: A Benchmark Pengzhi Zhong et.al. 2408.11463 link
2024-08-20 MambaEVT: Event Stream based Visual Object Tracking using State Space Model Xiao Wang et.al. 2408.10487 link
2024-08-17 GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System Shuo Wang et.al. 2408.09191 null
2024-08-17 MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model Changcheng Xiao et.al. 2408.09178 null
2024-08-14 Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving Yuqing Wen et.al. 2408.07605 null
2024-08-14 RTAT: A Robust Two-stage Association Tracker for Multi-Object Tracking Song Guo et.al. 2408.07344 null
2024-08-13 Object Tracking Incorporating Transfer Learning into Unscented and Cubature Kalman Filters Omar Alotaibi et.al. 2408.07157 null
2024-08-12 FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework Lukas Meyer et.al. 2408.06190 link
2024-08-11 A Training-Free Framework for Video License Plate Tracking and Recognition with Only One-Shot Haoxuan Ding et.al. 2408.05729 link
2024-08-09 Mesh-based Object Tracking for Dynamic Semantic 3D Scene Graphs via Ray Tracing Lennart Niecksch et.al. 2408.04979 null
2024-08-06 Quantum Imaging Using Spatially Entangled Photon Pairs from a Nonlinear Metasurface Jinyong Ma et.al. 2408.02903 null
2024-08-05 VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking Yuxuan Lu et.al. 2408.02263 null
2024-08-04 3D Single-object Tracking in Point Clouds with High Temporal Variation Qiao Wu et.al. 2408.02049 null
2024-08-03 SiamMo: Siamese Motion-Centric 3D Object Tracking Yuxiang Yang et.al. 2408.01688 link
2024-08-02 Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach Yabin Zhu et.al. 2408.00969 link
2024-08-05 U2UData: A Large-scale Cooperative Perception Dataset for Swarm UAVs Autonomous Flight Tongtong Feng et.al. 2408.00606 link
2024-08-01 A Batch Update Using Multiplicative Noise Modelling for Extended Object Tracking Christian Gramsch et.al. 2408.00417 null
2024-07-30 Autogenic Language Embedding for Coherent Point Tracking Zikai Song et.al. 2407.20730 link
2024-07-30 SharkTrack: an accurate, generalisable software for streamlining shark and ray underwater video analysis Filippo Varini et.al. 2407.20623 null
2024-07-29 MEVDT: Multi-Modal Event-Based Vehicle Detection and Tracking Dataset Zaid A. El Shair et.al. 2407.20446 null
2024-07-28 Progressive Domain Adaptation for Thermal Infrared Object Tracking Qiao Li et.al. 2407.19430 null
2024-07-25 Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT Niels G. Faber et.al. 2407.18288 link
2024-07-20 CORT: Class-Oriented Real-time Tracking for Embedded Systems Edoardo Cittadini et.al. 2407.17521 null
2024-07-23 PlantTrack: Task-Driven Plant Keypoint Tracking with Zero-Shot Sim2Real Transfer Samhita Marri et.al. 2407.16829 null
2024-07-23 Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos Jiahe Liu et.al. 2407.16124 link
2024-07-22 Local All-Pair Correspondence for Point Tracking Seokju Cho et.al. 2407.15420 link
2024-07-21 Multiple Object Detection and Tracking in Panoramic Videos for Cycling Safety Analysis Jingwei Guo et.al. 2407.15199 link
2024-07-19 Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking Yunfei Zhang et.al. 2407.14086 link
2024-07-19 OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking Zekun Qian et.al. 2407.14047 null
2024-07-18 Boosting Online 3D Multi-Object Tracking through Camera-Radar Cross Check Sheng-Yao Kuan et.al. 2407.13937 null
2024-07-18 Long-Term 3D Point Tracking By Cost Volume Fusion Hung Nguyen et.al. 2407.13337 null
2024-07-17 Strawberry detection and counting based on YOLOv7 pruning and information based tracking algorithm Shiyu Liu et.al. 2407.12614 null
2024-07-15 Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation Friedhelm Hamann et.al. 2407.10802 link
2024-07-15 Effective Motion Modeling for UAV-platform Multiple Object Tracking with Re-Margin Loss Mufeng Yao et.al. 2407.10485 link
2024-07-16 Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking Lorenzo Vaquero et.al. 2407.10151 link
2024-07-14 Power System Architecture and Control for Green Hydrogen Production via Power Converter-less Photovoltaic-Electrolyser Integration Aymeric Fabre et.al. 2407.10075 null
2024-07-12 DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects Peng Wang et.al. 2407.09051 null
2024-07-11 Manipulating a Tetris-Inspired 3D Video Representation Mihir Godbole et.al. 2407.08885 null
2024-07-11 Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets Linh Van Ma et.al. 2407.08872 link
2024-07-11 CommRad: Context-Aware Sensing-Driven Millimeter-Wave Networks Ish Kumar Jain et.al. 2407.08817 null
2024-07-10 Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors Lei Cheng et.al. 2407.08049 null
2024-07-10 MSC-LIO: An MSCKF-Based LiDAR-Inertial Odometry with Same-Plane-Point Tracking Tisheng Zhang et.al. 2407.07589 null
2024-07-09 Decomposition Betters Tracking Everything Everywhere Rui Li et.al. 2407.06531 link
2024-07-08 GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images Jon Crall et.al. 2407.06337 null
2024-07-08 TAPVid-3D: A Benchmark for Tracking Any Point in 3D Skanda Koppula et.al. 2407.05921 link
2024-07-07 Addressing single object tracking in satellite imagery through prompt-engineered solutions Athena Psalta et.al. 2407.05518 null
2024-07-09 P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds Jiahao Nie et.al. 2407.05238 link
2024-07-06 VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking Xuefeng Jiang et.al. 2407.05017 null
2024-07-05 TF-SASM: Training-free Spatial-aware Sparse Memory for Multi-object Tracking Thuc Nguyen-Quang et.al. 2407.04327 null
2024-07-08 SSP-GNN: Learning to Track via Bilevel Optimization Griffin Golias et.al. 2407.04308 null
2024-07-05 FeatureSORT: Essential Features for Effective Tracking Hamidreza Hashempoor et.al. 2407.04249 null
2024-07-04 Attention Normalization Impacts Cardinality Generalization in Slot Attention Markus Krimmel et.al. 2407.04170 link
2024-07-04 TrackPGD: A White-box Attack using Binary Masks against Robust Transformer Trackers Fatemeh Nourilenjan Nokabadi et.al. 2407.03946 link
2024-07-03 Applying Extended Object Tracking for Self-Localization of Roadside Radar Sensors Longfei Han et.al. 2407.03084 null
2024-07-02 FlowTrack: Point-level Flow Network for 3D Single Object Tracking Shuo Li et.al. 2407.01959 null
2024-07-02 The Solution for the ICCV 2023 Perception Test Challenge 2023 – Task 6 – Grounded videoQA Hailiang Zhang et.al. 2407.01907 null
2024-06-30 DroBoost: An Intelligent Score and Model Boosting Method for Drone Detection Ogulcan Eryuksel et.al. 2407.00830 null
2024-06-30 Engineering an Efficient Object Tracker for Non-Linear Motion Momir Adžemović et.al. 2407.00738 null
2024-06-28 PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators Kuo-Hao Zeng et.al. 2406.20083 null
2024-06-28 eMoE-Tracker: Environmental MoE-based Transformer for Robust Event-guided Object Tracking Yucheng Chen et.al. 2406.20024 null
2024-06-28 StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction Jiaheng Zhuang et.al. 2406.19844 null
2024-06-28 Basketball-SORT: An Association Method for Complex Multi-object Occlusion Problems in Basketball Multi-object Tracking Qingrui Hu et.al. 2406.19655 null
2024-06-28 Optimal Video Compression using Pixel Shift Tracking Hitesh Saai Mananchery Panneerselvam et.al. 2406.19630 link
2024-06-26 Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos Colton Stearns et.al. 2406.18717 link
2024-06-26 BiTrack: Bidirectional Offline 3D Multi-Object Tracking Using Camera-LiDAR Data Kemiao Huang et.al. 2406.18414 link
2024-06-24 POPCat: Propagation of particles for complex annotation tasks Adam Srebrnjak Yang et.al. 2406.17183 null
2024-06-24 A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking Lorenzo Shaikewitz et.al. 2406.16837 link
2024-06-24 The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers Abhi Kamboj et.al. 2406.16784 null
2024-06-21 LU2Net: A Lightweight Network for Real-time Underwater Image Enhancement Haodong Yang et.al. 2406.14973 null
2024-06-22 Velocity Analysis of Moving Objects in Earth Observation Satellite Images Using Multi-Spectral Push Broom Scanning Eric Keto et.al. 2406.13710 null
2024-06-19 Hierarchical IoU Tracking based on Interval Yunhao Du et.al. 2406.13271 link
2024-06-19 Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models Akchay Srivastava et.al. 2406.13232 null
2024-06-17 Deep HM-SORT: Enhancing Multi-Object Tracking in Sports with Deep Features, Harmonic Mean, and Expansion IOU Matias Gran-Henriksen et.al. 2406.12081 null
2024-06-17 VideoVista: A Versatile Benchmark for Video Understanding and Reasoning Yunxin Li et.al. 2406.11303 null
2024-06-14 Robust compressive tracking via online weighted multiple instance learning Sandeep Singh Sengar et.al. 2406.09914 null
2024-06-13 Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking Prithviraj Banerjee et.al. 2406.09598 null
2024-06-12 LaMOT: Language-Guided Multi-Object Tracking Yunhao Li et.al. 2406.08324 link
2024-06-12 Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance Yasod Ginige et.al. 2406.08294 null
2024-06-11 Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos Duc Pham et.al. 2406.07680 null
2024-06-11 Haptic Repurposing with GenAI Haoyu Wang et.al. 2406.07228 null
2024-06-11 UVIS: Unsupervised Video Instance Segmentation Shuaiyi Huang et.al. 2406.06908 null
2024-06-09 ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving Chen Ma et.al. 2406.05810 null
2024-06-09 SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving Chen Ma et.al. 2406.05800 null
2024-06-08 Training-Free Robust Interactive Video Object Segmentation Xiaoli Wei et.al. 2406.05485 null
2024-06-07 Bootstrapping Referring Multi-Object Tracking Yani Zhang et.al. 2406.05039 link
2024-06-07 Multi-Granularity Language-Guided Multi-Object Tracking Yuhao Li et.al. 2406.04844 link
2024-06-06 Matching Anything by Segmenting Anything Siyuan Li et.al. 2406.04221 link
2024-06-06 ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints Divij Handa et.al. 2406.04046 null
2024-06-04 UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking Lijun Zhou et.al. 2406.02147 null
2024-06-03 Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers Fatemeh Nourilenjan Nokabadi et.al. 2406.01765 link
2024-06-03 Prototypical Transformer as Unified Motion Learners Cheng Han et.al. 2406.01559 null
2024-06-03 Convolutional Unscented Kalman Filter for Multi-Object Tracking with Outliers Shiqi Liu et.al. 2406.01380 null
2024-06-03 Programmable Multi-input Buck-Boost Converter for Photovoltaics Arrays Zhongting Tang et.al. 2406.01193 null
2024-06-03 Multi-Object Tracking based on Imaging Radar 3D Object Detection Patrick Palmer et.al. 2406.01011 null
2024-06-01 Towards Generalizable Multi-Object Tracking Zheng Qin et.al. 2406.00429 link
2024-05-30 WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark Chunhui Zhang et.al. 2405.19818 link
2024-05-29 DGD: Dynamic 3D Gaussians Distillation Isaac Labe et.al. 2405.19321 null
2024-05-28 Track Initialization and Re-Identification for 3D Multi-View Multi-Object Tracking Linh Van Ma et.al. 2405.18606 link
2024-05-28 Reliable Object Tracking by Multimodal Hybrid Feature Extraction and Transformer-Based Fusion Hongze Sun et.al. 2405.17903 link
2024-05-28 Towards a Generalist and Blind RGB-X Tracker Yuedong Tan et.al. 2405.17773 link
2024-06-03 BaboonLand Dataset: Tracking Primates in the Wild and Automating Behaviour Recognition from Drone Videos Isla Duporge et.al. 2405.17698 null
2024-05-27 Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association Tingwei Liu et.al. 2405.17323 null
2024-05-24 ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking Xudong Han et.al. 2405.15755 null
2024-05-24 Trackastra: Transformer-based cell tracking for live-cell microscopy Benjamin Gallusser et.al. 2405.15700 link
2024-05-24 An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking Pratyusha Musunuru et.al. 2405.15137 null
2024-05-23 Awesome Multi-modal Object Tracking Chunhui Zhang et.al. 2405.14200 link
2024-05-23 Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning Zhenyu Wei et.al. 2405.14195 null
2024-05-23 PuTR: A Pure Transformer for Decoupled and Online Multi-Object Tracking Chongwei Liu et.al. 2405.14119 link
2024-05-22 Multi Player Tracking in Ice Hockey with Homographic Projections Harish Prakash et.al. 2405.13397 null
2024-05-20 Building Temporal Kernels with Orthogonal Polynomials Yan Ru Pei et.al. 2405.12179 link
2024-05-20 WiDRa – Enabling Millimeter-Level Differential Ranging Accuracy in Wi-Fi Using Carrier Phase Vishnu V. Ratnam et.al. 2405.12168 null
2024-05-20 DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM Xuchen Li et.al. 2405.12139 null
2024-05-20 A Vision on Open Science for the Evolution of Software Engineering Research and Practice Edson OliveiraJr et.al. 2405.12132 null
2024-05-20 PATE: Proximity-Aware Time series anomaly Evaluation Ramin Ghorbani et.al. 2405.12096 link
2024-05-20 SEMv3: A Fast and Robust Approach to Table Separation Line Detection Chunxia Qin et.al. 2405.11862 link
2024-05-20 Online Learning Feedback Control Considering Hysteresis for Musculoskeletal Structures Kento Kawaharazuka et.al. 2405.11808 null
2024-05-20 CDM-MPC: An Integrated Dynamic Planning and Control Framework for Bipedal Robots Jumping Zhicheng He et.al. 2405.11773 null
2024-05-19 PBI: Position-Based Dynamics Handles Updated Lagrangian Inelasticity Chang Yu et.al. 2405.11694 null
2024-05-19 Auto-Platoon : Freight by example Tharun V. Puthanveettil et.al. 2405.11659 link
2024-05-19 Track Anything Rapter(TAR) Tharun V. Puthanveettil et.al. 2405.11655 link
2024-05-19 RobMOT: Robust 3D Multi-Object Tracking by Observational Noise and State Estimation Drift Mitigation on LiDAR PointCloud Mohamed Nagy et.al. 2405.11536 link
2024-05-17 Air Signing and Privacy-Preserving Signature Verification for Digital Documents P. Sarveswarasarma et.al. 2405.10868 link
2024-05-17 Review on physical impedance models in perovskite solar cells Rajat Kumar Goyal et.al. 2405.10855 null
2024-05-17 Model Predictive Contouring Control for Vehicle Obstacle Avoidance at the Limit of Handling Using Torque Vectoring Alberto Bertipaglia et.al. 2405.10847 null
2024-05-17 Heterogeneity-Informed Meta-Parameter Learning for Spatiotemporal Time Series Forecasting Zheng Dong et.al. 2405.10800 link
2024-05-17 Anomalous relaxation of coarsening foams with viscoelastic continuous phase Chiara Guidolin et.al. 2405.10657 null
2024-05-17 Cyclical Weight Consolidation: Towards Solving Catastrophic Forgetting in Serial Federated Learning Haoyue Song et.al. 2405.10647 null
2024-05-17 COMET: NFT Price Prediction with Wallet Profiling Tianfu Wang et.al. 2405.10640 link
2024-05-17 Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track Xiaoshuai Hao et.al. 2405.10567 null
2024-05-17 Dynamic Cluster Analysis to Detect and Track Novelty in Network Telescopes Kai Huang et.al. 2405.10545 null
2024-05-17 Hawkes Models And Their Applications Patrick J. Laub et.al. 2405.10527 null
2024-05-16 A Novel Bounding Box Regression Method for Single Object Tracking Omar Abdelaziz et.al. 2405.10444 null
2024-05-16 Beyond Traditional Single Object Tracking: A Survey Omar Abdelaziz et.al. 2405.10439 null
2024-05-16 Spatial Cognition: a Wave Hypothesis Robert Worden et.al. 2405.10112 null
2024-05-14 Learning Correspondence for Deformable Objects Priya Sundaresan et.al. 2405.08996 null
2024-05-14 ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association Shuxiao Ding et.al. 2405.08909 link
2024-05-14 EchoTracker: Advancing Myocardial Point Tracking in Echocardiography Md Abulkalam Azad et.al. 2405.08587 link

Defocus

Publish Date Title Authors PDF Code
2025-07-15 Digital defocus aberration interference for automated optical microscopy Haowen Zhou et.al. 2507.10867 null
2025-07-01 Efficient Depth- and Spatially-Varying Image Simulation for Defocus Deblur Xinge Yang et.al. 2507.00372 null
2025-07-09 High-quality metalens enables minimally invasive CFB endoscopy Ruixiang Song et.al. 2506.21379 null
2025-06-26 Quantitative structure determination from experimental four-dimensional scanning transmission electron microscopy via the scattering matrix Emmanuel W. C. Terzoudis-Lumsden et.al. 2506.21004 null
2025-06-22 On the Particle Image Overlap in Single Camera Defocusing Approaches Christian Sax et.al. 2506.18170 null
2025-06-25 Dark Channel-Assisted Depth-from-Defocus from a Single Image Moushumi Medhi et.al. 2506.06643 null
2025-05-29 Dc-EEMF: Pushing depth-of-field limit of photoacoustic microscopy via decision-level constrained learning Wangting Zhou et.al. 2506.03181 null
2025-05-31 Fovea Stacking: Imaging with Dynamic Localized Aberration Correction Shi Mao et.al. 2506.00716 null
2025-05-30 High resolution up-conversion imaging in the 10 μm band under incoherent illumination Zhao-Qi-Zhi Han et.al. 2505.24367 null
2025-05-30 Fourier ptychographic microscopy aided with transport of intensity equation for robust full phase spectrum reconstruction Mikołaj Rogalski et.al. 2505.24322 null
2025-07-02 Real-Time Blind Defocus Deblurring for Earth Observation: The IMAGIN-e Mission Approach Alejandro D. Mousist et.al. 2505.22128 null
2025-05-27 Any-to-Bokeh: One-Step Video Bokeh via Multi-Plane Image Guided Diffusion Yang Yang et.al. 2505.21593 null
2025-05-23 Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues Chinmay Talegaonkar et.al. 2505.17358 null
2025-05-19 Combinatorial Sample-and Back-Focal-Plane (BFP) Imaging. Pt. I: Instrument and acquisition parameters affecting BFP images and their analysis Omer Shavit et.al. 2505.13190 null
2025-05-12 Apple’s Synthetic Defocus Noise Pattern: Characterization and Forensic Applications David Vázquez-Padín et.al. 2505.07380 null
2025-05-09 Development of precession Lorentz transmission electron microscopy Shunsuke Hayashi et.al. 2505.05790 null
2025-05-07 Image Restoration via Multi-domain Learning Xingyu Jiang et.al. 2505.05504 link
2025-05-08 Differentiation of Distinct Single Atoms via Multi-Defocus Fusion Method Yangfan Li et.al. 2505.04078 null
2025-05-09 Back-illumination interference tomography for imaging weak scattering in thick tissues Gregory N. McKay et.al. 2504.19278 null
2025-04-25 Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models Patrick Müller et.al. 2504.18510 null
2025-04-24 Surface morphology and thickness variation estimation of zeolites via electron ptychography Enci Zhang et.al. 2504.17501 null
2025-04-23 Dual-Camera All-in-Focus Neural Radiance Fields Xianrui Luo et.al. 2504.16636 null
2025-04-15 Focal Split: Untethered Snapshot Depth from Differential Defocus Junjie Luo et.al. 2504.11202 null
2025-04-15 Three-dimensional neural network driving self-interference digital holography enables high-fidelity, non-scanning volumetric fluorescence microscopy Tianlong Man et.al. 2504.10769 null
2025-04-14 Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials Jingyun Yang et.al. 2504.10281 null
2025-04-11 Optical vortex trajectories as probes for wavefront aberrations Aleksandra K. Korzeniewska et.al. 2504.08643 null
2025-03-31 InstructRestore: Region-Customized Image Restoration with Human Instructions Shuaizheng Liu et.al. 2503.24357 link
2025-03-30 Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries Wei Xu et.al. 2503.23606 null
2025-03-26 Spectrum from Defocus: Fast Spectral Imaging with Chromatic Focal Stack M. Kerem Aydin et.al. 2503.20184 null
2025-03-24 MaSS13K: A Matting-level Semantic Segmentation Benchmark Chenxi Xie et.al. 2503.18364 link
2025-03-22 Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration Yawei Li et.al. 2503.17825 null
2025-03-25 Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures Tim Seizinger et.al. 2503.16067 link
2025-03-18 The Power of Context: How Multimodality Improves Image Super-Resolution Kangfu Mei et.al. 2503.14503 null
2025-03-18 Intra and Inter Parser-Prompted Transformers for Effective Image Restoration Cong Wang et.al. 2503.14037 link
2025-03-16 Pathology Image Restoration via Mixture of Prompts Jiangdong Cai et.al. 2503.12399 link
2025-03-24 Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models Armando Fortes et.al. 2503.08434 null
2025-03-12 Free Your Hands: Lightweight Relightable Turntable Capture Pipeline Jiahui Fan et.al. 2503.05511 null
2025-03-03 Blind Augmentation: Calibration-free Camera Distortion Model Estimation for Real-time Mixed-reality Consistency Siddhant Prakash et.al. 2503.01387 link
2025-03-13 DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting Liao Shen et.al. 2503.00746 null
2025-01-24 Linnik point spread functions, time-reversed logarithmic diffusion equations, and blind deconvolution of electron microscope imagery Alfred S. Carasso et.al. 2502.19420 null
2025-02-20 Exploiting Deblurring Networks for Radiance Fields Haeyun Choi et.al. 2502.14454 link
2025-02-16 Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation Kunal Swami et.al. 2502.11002 null
2025-02-11 CodePhys: Robust Video-based Remote Physiological Measurement through Latent Codebook Querying Shuyang Chu et.al. 2502.07526 null
2025-02-10 SparseFocus: Learning-based One-shot Autofocus for Microscopy with Sparse Content Yongping Zhai et.al. 2502.06452 null
2025-02-13 Self-similar Features in Sub-secondary Breakup of a Droplet and Ligament Mediated Fragmentation under Extreme Conditions Saini Jatin Rao et.al. 2502.05976 null
2025-01-29 Five-dimensional single-shot fluorescence imaging using a polarized Fourier light-field microscope Oumeng Zhang et.al. 2501.18047 null
2025-01-25 Image formation theory of optical coherence tomography with optical aberrations and its application for computational aberration correction Shuichi Makita et.al. 2501.15011 null
2025-01-23 Theoretical analysis of performance limitation of computational refocusing in optical coherence tomography Yue Zhu et.al. 2501.13874 null
2025-01-16 SE-BSFV: Online Subspace Learning based Shadow Enhancement and Background Suppression for ViSAR under Complex Background Shangqu Yan et.al. 2501.09341 null
2025-02-23 Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks Shuang Cui et.al. 2501.09052 null
2024-12-24 Dissecting CLIP: Decomposition with a Schur Complement-based Approach Azim Ospanov et.al. 2412.18645 link
2024-12-20 CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images Jungho Lee et.al. 2412.16028 null
2025-01-06 LEDiff: Latent Exposure Diffusion for HDR Generation Chao Wang et.al. 2412.14456 null
2024-12-29 AKiRa: Augmentation Kit on Rays for optical video generation Xi Wang et.al. 2412.14158 null
2024-12-17 Strain engineering of magnetic anisotropy in the kagome magnet Fe3Sn2 D. Kong et.al. 2412.12684 null
2024-12-16 Photoacoustic microscopy with meta-optics Dorian S. H. Brandmüller et.al. 2412.11733 null
2024-12-11 Dense Depth from Event Focal Stack Kenta Horikawa et.al. 2412.08120 null
2024-11-15 Resilient Stellarator Divertor Characteristics in the Helically Symmetric eXperiment K. A. Garcia et.al. 2411.10611 null
2024-10-18 Variable Aperture Bokeh Rendering via Customized Focal Plane Guidance Kang Chen et.al. 2410.14400 link
2024-11-15 Feature Extraction Reimagined: Achieving Superior Accuracy in Camera Calibration Zezhun Shi et.al. 2410.13371 link
2024-10-08 First experimental study of multiple orientation muon tomography, with image optimization in sparse data environments Jesus J. Valencia et.al. 2410.07264 null
2024-10-02 Recording dynamic facial micro-expressions with a multi-focus camera array Lucas Kreiss et.al. 2410.01973 null
2024-10-29 EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis Alexander Mai et.al. 2410.01804 null
2024-10-02 Frequency-Dependent F-Numbers Suppress Grating Lobes and Improve the Lateral Resolution in Line-by-Line Scanning Martin F. Schiffner et.al. 2410.01593 null
2024-10-02 Estimating Atmospheric Wind Speeds From Gemini Planet Imager AO Telemetry Zhenxi Du et.al. 2410.01193 null
2024-09-28 Extending Depth of Field for Varifocal Multiview Images Zhilong Li et.al. 2409.19220 null
2024-09-26 PNR: Physics-informed Neural Representation for high-resolution LFM reconstruction Jiayin Zhao et.al. 2409.18223 null
2024-09-26 Reblurring-Guided Single Image Defocus Deblurring: A Learning Framework with Misaligned Training Pairs Xinya Shu et.al. 2409.17792 link
2024-09-18 Depth Estimation Based on 3D Gaussian Splatting Siamese Defocus Jinchang Zhang et.al. 2409.12323 null
2024-09-16 Depth from Coupled Optical Differentiation Junjie Luo et.al. 2409.10725 link
2024-09-16 Focus diverse phase retrieval test results on broadband continuous wavefront sensing in space telescope applications Hyukmo Kang et.al. 2409.10500 null
2024-09-15 Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation Xiaolong Qian et.al. 2409.09754 link
2024-09-14 Innovative schemes for Correlation Plenoptic Imaging Gianlorenzo Massaro et.al. 2409.09459 null
2024-09-14 Plenoptic microscopy and photography from intensity correlations Francesco V. Pepe et.al. 2409.09456 null
2024-09-03 F2former: When Fractional Fourier Meets Deep Wiener Deconvolution and Selective Frequency Transformer for Image Deblurring Subhajit Paul et.al. 2409.02056 null
2024-08-17 Pupil-Adaptive 3D Holography Beyond Coherent Depth-of-Field Yujie Wang et.al. 2409.00028 null
2024-08-05 Joint-Motion Mutual Learning for Pose Estimation in Videos Sifan Wu et.al. 2408.02285 null
2024-08-28 Enhancing Quantitative Image Synthesis through Pretraining and Resolution Scaling for Bone Mineral Density Estimation from a Plain X-ray Image Yi Gu et.al. 2407.20495 link
2024-07-26 3D Orbital Angular Momentum Nonlinear Holography Feiyang Shen et.al. 2407.18696 null
2024-07-23 HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images Shreyas Singh et.al. 2407.16503 link
2024-07-21 A Novel Method to Improve Quality Surface Coverage in Multi-View Capture Wei-Lun Huang et.al. 2407.15883 null
2024-07-20 A New Dataset and Framework for Real-World Blurred Images Super-Resolution Rui Qin et.al. 2407.14880 link
2024-07-15 Automated high-resolution backscattered-electron imaging at macroscopic scale Zhiyuan Lang et.al. 2407.10628 null
2024-07-24 Inverse-designed 3D laser nanoprinted phase masks to extend the depth of field of imaging systems T. J. Sturges et.al. 2407.08482 null
2024-07-11 GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views Vinayak Gupta et.al. 2407.08221 link
2024-07-31 Dynamic Neural Radiance Field From Defocused Monocular Video Xianrui Luo et.al. 2407.05586 null
2024-07-01 Point-Spread Function of the Optics in Scanning Electron Microscopes Surya Kamal et.al. 2407.01439 null
2024-06-27 Super-resolution imaging using super-oscillatory diffractive neural networks Hang Chen et.al. 2406.19126 null
2024-06-27 The Space Coronagraph Optical Bench (SCoOB): 5. End-to-end simulations of polarization aberrations Ramya M Anche et.al. 2406.18886 null
2024-06-22 Robust Ptychographic Reconstruction with an Out-of-Focus Electron Probe Shoucong Ning et.al. 2406.15879 null
2024-06-15 fNeRF: High Quality Radiance Fields from Practical Cameras Yi Hua et.al. 2406.10633 null
2024-06-12 Striving towards robust phase diversity on-sky: Implementing LIFT for VLT/MUSE-NFM Arseniy Kuznetsov et.al. 2406.08529 link
2024-06-21 Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field Chao Wang et.al. 2406.07329 null
2024-06-06 Single Exposure Quantitative Phase Imaging with a Conventional Microscope using Diffusion Models Gabriel della Maggiora et.al. 2406.04388 null
2024-06-03 Improved Three-Dimensional Reconstructions in Electron Ptychography through Defocus Series Measurements Marcel Schloz et.al. 2406.01141 null
2024-06-02 End-to-End Hybrid Refractive-Diffractive Lens Design with Differentiable Ray-Wave Model Xinge Yang et.al. 2406.00834 null
2024-06-10 In vivo fundus imaging and computational refocusing with a diffuser-based fundus camera Corey Simmerer et.al. 2406.00122 null
2024-05-31 Axial HoloTile: Extended Depth-of-Focus of Dynamic Holographic Light Projections Andreas Erik Gejl Madsen et.al. 2405.20997 null
2024-05-27 DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Refocusing, Defocus Rendering and Blur Removal Yujie Wang et.al. 2405.17351 null
2024-05-20 Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction Aryan Garg et.al. 2405.11823 null
2024-06-04 Single-shot volumetric fluorescence imaging with neural fields Oumeng Zhang et.al. 2405.10463 null
2024-05-09 Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft Debabrata Pal et.al. 2405.05574 null
2024-04-05 Robust Gaussian Splatting François Darmon et.al. 2404.04211 null
2024-04-05 Deep Phase Coded Image Prior Nimrod Shabtay et.al. 2404.03906 null
2024-04-02 Multiple scattering suppression for in vivo optical coherence tomography measurement using B-scan-wise multi-focus averaging method Yiqiang Zhu et.al. 2404.01811 null
2024-03-29 Depth from Defocus Technique for High Number Densities and Non-spherical Particles Rixin Xua et.al. 2403.20004 null
2024-04-01 Video-Based Human Pose Regression via Decoupled Space-Time Aggregation Jijie He et.al. 2403.19926 link
2024-03-21 Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data Michael John Fanous et.al. 2403.14324 null
2024-05-06 Expected Impact of Glints from Space Debris in the LSST J. Anthony Tyson et.al. 2403.04942 null
2024-02-25 Forward and inverse modeling of depth-of-field effects in background-oriented schlieren Joseph P. Molnar et.al. 2402.15954 null
2024-02-12 Roll-to-roll tomographic volumetric additive manufacturing for continuous production of microstructures on long flexible substrates Joseph Toombs et.al. 2402.10955 null
2024-04-03 Ptycho-endoscopy on a lensless ultrathin fiber bundle tip Pengming Song et.al. 2401.17213 null
2024-02-09 Exploring one giga electronvolt cosmic gamma rays with a Cherenkov plenoscope capable of recording atmospheric light fields, Part 1: Optics Sebastian Achim Mueller et.al. 2401.16148 null
2024-01-29 Light-field imaging from position-momentum correlations Davide Giannella et.al. 2401.16129 null
2024-01-25 Single- and multi-layer micro-scale diffractive lens fabrication for fiber imaging probes with versatile depth-of-field Fei He et.al. 2401.14551 null