Updated on 2025.07.17
Usage instructions: here
SLAM
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-11 | Towards Robust Sensor-Fusion Ground SLAM: A Comprehensive Benchmark and A Resilient Framework | Deteng Zhang et.al. | 2507.08364 | null |
2025-07-10 | Hardware-Aware Feature Extraction Quantisation for Real-Time Visual Odometry on FPGA Platforms | Mateusz Wasala et.al. | 2507.07903 | null |
2025-07-10 | IRAF-SLAM: An Illumination-Robust and Adaptive Feature-Culling Front-End for Visual SLAM in Challenging Environments | Thanh Nguyen Canh et.al. | 2507.07752 | null |
2025-07-09 | g2o vs. Ceres: Optimizing Scan Matching in Cartographer SLAM | Quanjie Qiu et.al. | 2507.07142 | null |
2025-07-08 | Mapping the Catacombs: An Underwater Cave Segment of the Devil’s Eye System | Michalis Chatzispyrou et.al. | 2507.06397 | null |
2025-07-08 | Cooperative Mapping, Localization, and Beam Management via Multi-Modal SLAM in ISAC Systems | Hang Que et.al. | 2507.05718 | null |
2025-07-07 | Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR | Tao Du et.al. | 2507.04662 | null |
2025-07-06 | Lidar Variability: A Novel Dataset and Comparative Study of Solid-State and Spinning Lidars | Doumegna Mawuto Koudjo Felix et.al. | 2507.04321 | null |
2025-07-09 | Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM | Xiaolei Lang et.al. | 2507.04004 | null |
2025-07-04 | Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps | Chong Cheng et.al. | 2507.03737 | null |
2025-07-01 | RaGNNarok: A Light-Weight Graph Neural Network for Enhancing Radar Point Clouds on Unmanned Ground Vehicles | David Hunt et.al. | 2507.00937 | null |
2025-07-01 | Generation of Indoor Open Street Maps for Robot Navigation from CAD Files | Jiajie Zhang et.al. | 2507.00552 | null |
2025-06-30 | VOCAL: Visual Odometry via ContrAstive Learning | Chi-Yao Huang et.al. | 2507.00243 | null |
2025-06-29 | TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints | Zhen Tan et.al. | 2506.23207 | null |
2025-06-29 | Event-based Stereo Visual-Inertial Odometry with Voxel Map | Zhaoxing Zhang et.al. | 2506.23078 | null |
2025-06-26 | Adaptive Multipath-Based SLAM for Distributed MIMO Systems | Xuhong Li et.al. | 2506.21798 | null |
2025-06-24 | Ark: An Open-source Python-based Framework for Robot Learning | Magnus Dierking et.al. | 2506.21628 | null |
2025-06-26 | EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting | Taoyu Wu et.al. | 2506.21420 | null |
2025-06-26 | CURL-SLAM: Continuous and Compact LiDAR Mapping | Kaicheng Zhang et.al. | 2506.21077 | null |
2025-06-25 | SPARK: Graph-Based Online Semantic Integration System for Robot Task Planning | Mimo Shirasaka et.al. | 2506.20394 | null |
2025-06-25 | Real-Time Obstacle Avoidance Algorithms for Unmanned Aerial and Ground Vehicles | Jingwen Wei et.al. | 2506.20311 | null |
2025-06-24 | Posterior Cramér-Rao Bounds on Localization and Mapping Errors in Distributed MIMO SLAM | Benjamin J. B. Deutschmann et.al. | 2506.19957 | null |
2025-06-23 | GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM | Annika Thomas et.al. | 2506.18885 | null |
2025-06-23 | MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation | Tianchen Deng et.al. | 2506.18678 | null |
2025-06-24 | Multimodal Fusion SLAM with Fourier Attention | Youjie Zhou et.al. | 2506.18204 | null |
2025-06-22 | ADA-DPM: A Neural Descriptors-based Adaptive Noise Point Filtering Strategy for SLAM | Yongxin Shao et.al. | 2506.18016 | null |
2025-06-21 | Optimizing Exploration with a New Uncertainty Framework for Active SLAM Systems | Sebastian Sansoni et.al. | 2506.17775 | null |
2025-06-18 | MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System | Miaoxin Pan et.al. | 2506.15402 | null |
2025-06-24 | RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories | Qingsong Yan et.al. | 2506.15242 | null |
2025-06-18 | SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization | Hanjun Kim et.al. | 2506.15175 | null |
2025-06-18 | VIMS: A Visual-Inertial-Magnetic-Sonar SLAM System in Underwater Environments | Bingbing Zhang et.al. | 2506.15126 | null |
2025-06-16 | Slanted light-sheet array microscopy for large volume imaging at rates exceeding 100 Hz | Kai Long et.al. | 2506.13664 | null |
2025-06-16 | Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots | Jaehong Oh et.al. | 2506.13149 | null |
2025-06-16 | A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method | Zhanhua Xin et.al. | 2506.13100 | null |
2025-06-16 | SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure | Shahram Najam Syed et.al. | 2506.13089 | link |
2025-06-12 | LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System | Hongbeen Park et.al. | 2506.10567 | null |
2025-06-11 | VAULT: A Mobile Mapping System for ROS 2-based Autonomous Robots | Miguel Á. González-Santamarta et.al. | 2506.09583 | null |
2025-06-10 | UFM: A Simple Path towards Unified Dense Correspondence with Flow | Yuchen Zhang et.al. | 2506.09278 | null |
2025-06-10 | Princeton365: A Diverse Dataset with Accurate Camera Pose | Karhan Kayan et.al. | 2506.09035 | null |
2025-06-10 | Planar Collisionless Shock Simulations with Semi-Implicit Particle-in-Cell Model FLEKS | Hongyang Zhou et.al. | 2506.08384 | null |
2025-06-09 | ZeroVO: Visual Odometry with Minimal Assumptions | Lei Lai et.al. | 2506.08005 | null |
2025-06-08 | Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs | Qiong Chang et.al. | 2506.07164 | null |
2025-06-08 | UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment | Wentao Zhao et.al. | 2506.07013 | null |
2025-06-06 | GS4: Generalizable Sparse Splatting Semantic SLAM | Mingqi Jiang et.al. | 2506.06517 | null |
2025-06-06 | Enhancing Situational Awareness in Underwater Robotics with Multi-modal Spatial Perception | Pushyami Kaveti et.al. | 2506.06476 | null |
2025-06-06 | Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments | Mingrui Li et.al. | 2506.05965 | null |
2025-06-06 | Analysis of points outcome in ATP Grand Slam Tennis using big data and machine learning | Martin Illum et.al. | 2506.05866 | null |
2025-06-05 | On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images | Andreas Meuleman et.al. | 2506.05558 | null |
2025-06-05 | Deep Learning Reforms Image Matching: A Survey and Outlook | Shihua Zhang et.al. | 2506.04619 | null |
2025-06-04 | cuVSLAM: CUDA accelerated visual odometry | Alexander Korovko et.al. | 2506.04359 | link |
2025-06-04 | Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset | Zirui Wang et.al. | 2506.04224 | null |
2025-06-03 | LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM | Roman Titkov et.al. | 2506.03073 | null |
2025-06-03 | Online Performance Assessment of Multi-Source-Localization for Autonomous Driving Systems Using Subjective Logic | Stefan Orf et.al. | 2506.02932 | null |
2025-06-03 | VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians | Pengchong Hu et.al. | 2506.02741 | null |
2025-06-03 | GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Shufan Qing et.al. | 2506.02736 | link |
2025-06-03 | Olfactory Inertial Odometry: Methodology for Effective Robot Navigation by Scent | Kordel K. France et.al. | 2506.02373 | null |
2025-06-01 | Globally Consistent RGB-D SLAM with 2D Gaussian Splatting | Xingguang Zhong et.al. | 2506.00970 | link |
2025-05-30 | Black-box Adversarial Attacks on CNN-based SLAM Algorithms | Maria Rafaela Gkeka et.al. | 2505.24654 | null |
2025-05-28 | Semantic Exploration and Dense Mapping of Complex Environments using Ground Robots Equipped with LiDAR and Panoramic Camera | Xiaoyang Zhan et.al. | 2505.22880 | null |
2025-05-28 | 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians | Hidenobu Matsuki et.al. | 2505.22859 | null |
2025-05-28 | UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments | Wancai Zheng et.al. | 2505.22335 | null |
2025-05-27 | HS-SLAM: A Fast and Hybrid Strategy-Based SLAM Approach for Low-Speed Autonomous Driving | Bingxiang Kang et.al. | 2505.20906 | null |
2025-05-27 | ProBA: Probabilistic Bundle Adjustment with the Bhattacharyya Coefficient | Jason Chui et.al. | 2505.20858 | null |
2025-05-26 | ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting | Wenhua Wu et.al. | 2505.19420 | null |
2025-05-25 | VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes | Tianchen Deng et.al. | 2505.18992 | link |
2025-05-23 | CU-Multi: A Dataset for Multi-Robot Data Association | Doncey Albin et.al. | 2505.17576 | null |
2025-05-22 | TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition | Oliver Grainge et.al. | 2505.16447 | null |
2025-05-20 | A Methodological Framework for Measuring Spatial Labeling Similarity | Yihang Du et.al. | 2505.14128 | link |
2025-05-22 | Place Recognition: A Comprehensive Review, Current Challenges and Future Directions | Zhenyu Li et.al. | 2505.14068 | link |
2025-05-19 | eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks | Jad Mansour et.al. | 2505.13309 | null |
2025-05-23 | VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold | Dominic Maggio et.al. | 2505.12549 | null |
2025-05-18 | Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey | Calvin Galagain et.al. | 2505.12384 | null |
2025-05-18 | Structureless VIO | Junlin Song et.al. | 2505.12337 | null |
2025-05-16 | EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video | Ryan Hoque et.al. | 2505.11709 | null |
2025-05-16 | Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization | Aaron Wilhelm et.al. | 2505.11620 | null |
2025-05-16 | Robust 2D lidar-based SLAM in arboreal environments without IMU/GNSS | Paola Nazate-Burgos et.al. | 2505.10847 | null |
2025-05-15 | TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation | Manthan Patel et.al. | 2505.10696 | null |
2025-05-15 | A hybrid SLAM-Payne framework for atmospheric parameter and abundance determination of early-type Stars from LAMOST DR9 low-resolution Spectra | Weijia Sun et.al. | 2505.10310 | null |
2025-05-15 | Large-Scale Gaussian Splatting SLAM | Zhe Xin et.al. | 2505.09915 | null |
2025-05-13 | Automated Meta Prompt Engineering for Alignment with the Theory of Mind | Aaron Baughman et.al. | 2505.09024 | null |
2025-05-13 | MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM | Saqi Hussain Kalan et.al. | 2505.08388 | null |
2025-05-13 | SKiD-SLAM: Robust, Lightweight, and Distributed Multi-Robot LiDAR SLAM in Resource-Constrained Field Environments | Hogyun Kim et.al. | 2505.08230 | null |
2025-05-12 | RDD: Robust Feature Detector and Descriptor using Deformable Transformer | Gonglin Chen et.al. | 2505.08013 | null |
2025-05-12 | Ranking-aware Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2505.07198 | null |
2025-05-07 | Scalable Aerial GNSS Localization for Marine Robots | Shuo Wen et.al. | 2505.04095 | link |
2025-05-06 | Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions | Lukas Schichler et.al. | 2505.03565 | null |
2025-05-06 | AquaticVision: Benchmarking Visual SLAM in Underwater Environment with Events and Frames | Yifan Peng et.al. | 2505.03448 | null |
2025-05-06 | LiftFeat: 3D Geometry-Aware Local Feature Matching | Yepeng Liu et.al. | 2505.03422 | link |
2025-05-05 | LiDAR-Inertial SLAM-Based Navigation and Safety-Oriented AI-Driven Control System for Skid-Steer Robots | Mehdi Heydari Shahna et.al. | 2505.02598 | null |
2025-05-04 | Robust Localization, Mapping, and Navigation for Quadruped Robots | Dyuman Aditya et.al. | 2505.02272 | null |
2025-05-04 | SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment | Ganesh Sapkota et.al. | 2505.01956 | null |
2025-05-03 | GauS-SLAM: Dense RGB-D SLAM with Gaussian Surfels | Yongxin Su et.al. | 2505.01934 | null |
2025-05-02 | Tightly Coupled Range Inertial Odometry and Mapping with Exact Point Cloud Downsampling | Kenji Koide et.al. | 2505.01017 | null |
2025-04-30 | An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation | Yaming Ou et.al. | 2504.21826 | null |
2025-04-30 | eNCApsulate: NCA for Precision Diagnosis on Capsule Endoscopes | Henry John Krumb et.al. | 2504.21562 | null |
2025-04-29 | Large-scale visual SLAM for in-the-wild videos | Shuo Sun et.al. | 2504.20496 | null |
2025-04-28 | Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM | Leon Davies et.al. | 2504.19654 | null |
2025-04-28 | GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM | Leon Davies et.al. | 2504.19653 | null |
2025-04-28 | GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field | Zuxing Lu et.al. | 2504.19409 | null |
2025-04-27 | Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users | Apurv Varshney et.al. | 2504.19345 | null |
2025-04-27 | NANO-SLAM : Natural Gradient Gaussian Approximation for Vehicle SLAM | Tianyi Zhang et.al. | 2504.19195 | null |
2025-04-27 | MISO: Multiresolution Submap Optimization for Efficient Globally Consistent Neural Implicit Reconstruction | Yulun Tian et.al. | 2504.19104 | null |
2025-04-25 | Certifiably-Correct Mapping for Safe Navigation Despite Odometry Drift | Devansh R. Agrawal et.al. | 2504.18713 | null |
2025-04-25 | Range-based 6-DoF Monte Carlo SLAM with Gradient-guided Particle Filter on GPU | Takumi Nakao et.al. | 2504.18056 | null |
2025-04-24 | Autonomous Navigation Of Quadrupeds Using Coverage Path Planning | Alexander James Becoy et.al. | 2504.17880 | null |
2025-04-22 | SmallGS: Gaussian Splatting-based Camera Pose Estimation for Small-Baseline Videos | Yuxin Yao et.al. | 2504.17810 | null |
2025-04-24 | BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring | Asier Bikandi et.al. | 2504.17693 | null |
2025-04-24 | Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images | Zebo Huang et.al. | 2504.17582 | null |
2025-04-24 | Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization | Guangyang Zeng et.al. | 2504.17410 | null |
2025-04-24 | EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy | Haodi Yao et.al. | 2504.17280 | null |
2025-04-23 | ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration | Andrea Conti et.al. | 2504.16545 | null |
2025-04-22 | DERD-Net: Learning Depth from Event-based Ray Densities | Diego de Oliveira Hitzges et.al. | 2504.15863 | null |
2025-04-23 | SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems | Abhishek Tyagi et.al. | 2504.15305 | null |
2025-04-20 | Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction | Weirong Chen et.al. | 2504.14516 | null |
2025-04-20 | SG-Reg: Generalizable and Efficient Scene Graph Registration | Chuhao Liu et.al. | 2504.14440 | link |
2025-04-19 | Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering | Jonathan Embley-Riches et.al. | 2504.14135 | null |
2025-04-21 | SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM | Samuel Cerezo et.al. | 2504.13713 | link |
2025-04-16 | An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World | Xingwu Ji et.al. | 2504.11698 | link |
2025-04-18 | Doppler-SLAM: Doppler-Aided Radar-Inertial and LiDAR-Inertial Simultaneous Localization and Mapping | Dong Wang et.al. | 2504.11634 | link |
2025-04-14 | Region Based SLAM-Aware Exploration: Efficient and Robust Autonomous Mapping Strategy That Can Scale | Megha Maheshwari et.al. | 2504.10416 | null |
2025-04-14 | RoboCup Rescue 2025 Team Description Paper UruBots | Kevin Farias et.al. | 2504.09778 | null |
2025-04-11 | FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment | Sebastián Barbas Laina et.al. | 2504.08603 | null |
2025-04-11 | PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection | Xiong Li et.al. | 2504.08280 | null |
2025-04-11 | II-NVM: Enhancing Map Accuracy and Consistency with Normal Vector-Assisted Mapping | Chengwei Zhao et.al. | 2504.08204 | link |
2025-04-10 | UWB Anchor Based Localization of a Planetary Rover | Andreas Nüchter et.al. | 2504.07658 | null |
2025-04-10 | Event Signal Filtering via Probability Flux Estimation | Jinze Chen et.al. | 2504.07503 | null |
2025-04-07 | Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM | Zhicong Sun et.al. | 2504.04844 | link |
2025-04-06 | SELC: Self-Supervised Efficient Local Correspondence Learning for Low Quality Images | Yuqing Wang et.al. | 2504.04497 | null |
2025-04-06 | VSLAM-LAB: A Comprehensive Framework for Visual SLAM Methods and Datasets | Alejandro Fontan et.al. | 2504.04457 | link |
2025-04-05 | Nonlinear Observer Design for Landmark-Inertial Simultaneous Localization and Mapping | Mouaad Boughellaba et.al. | 2504.04239 | null |
2025-04-04 | WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments | Jianhao Zheng et.al. | 2504.03886 | null |
2025-04-03 | SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections | Prashant Kumar et.al. | 2504.03089 | null |
2025-04-03 | Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision | Xiaofeng Han et.al. | 2504.02477 | null |
2025-04-03 | MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM | Renwu Li et.al. | 2504.02437 | null |
2025-04-02 | A Chefs KISS – Utilizing semantic information in both ICP and SLAM framework | Sven Ochs et.al. | 2504.02086 | null |
2025-04-01 | Semantic SLAM with Rolling-Shutter Cameras and Low-Precision INS in Outdoor Environments | Yuchen Zhang et.al. | 2504.01997 | null |
2025-04-02 | Strengthening Multi-Robot Systems for SAR: Co-Designing Robotics and Communication Towards 6G | Juan Bravo-Arrabal et.al. | 2504.01940 | null |
2025-04-02 | Dynamic Initialization for LiDAR-inertial SLAM | Jie Xu et.al. | 2504.01451 | link |
2025-04-02 | ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue | Thomas Pritchard et.al. | 2504.01261 | link |
2025-03-31 | SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection | Yannick Burkhardt et.al. | 2504.00139 | null |
2025-03-30 | A Visual-Inertial Motion Prior SLAM for Dynamic Environments | Weilong Sun et.al. | 2503.23429 | null |
2025-03-30 | AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos | Felix Wimbauer et.al. | 2503.23282 | link |
2025-03-29 | Incorporating GNSS Information with LIDAR-Inertial Odometry for Accurate Land-Vehicle Localization | Jintao Cheng et.al. | 2503.23199 | null |
2025-03-29 | Towards Mobile Sensing with Event Cameras on High-mobility Resource-constrained Devices: A Survey | Haoyang Wang et.al. | 2503.22943 | null |
2025-03-27 | HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM | Ziren Gong et.al. | 2503.21778 | null |
2025-03-27 | STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM | Yongxu Wang et.al. | 2503.21425 | null |
2025-03-25 | Scene-agnostic Pose Regression for Visual Localization | Junwei Zheng et.al. | 2503.19543 | null |
2025-03-25 | First Results on UAV-aided User Localization Using ToA and OpenAirInterface in 5G NR | Omid Esrafilian et.al. | 2503.19529 | null |
2025-03-25 | MM-LINS: a Multi-Map LiDAR-Inertial System for Over-Degenerate Environments | Yongxin Ma et.al. | 2503.19506 | link |
2025-03-24 | Cooperative Control of Multi-Quadrotors for Transporting Cable-Suspended Payloads: Obstacle-Aware Planning and Event-Based Nonlinear Model Predictive Control | Tohid Kargar Tasooji et.al. | 2503.19135 | null |
2025-03-24 | GI-SLAM: Gaussian-Inertial SLAM | Xulang Liu et.al. | 2503.18275 | null |
2025-03-22 | LightLoc: Learning Outdoor LiDAR Localization at Light Speed | Wen Li et.al. | 2503.17814 | link |
2025-03-21 | Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions | Muhua Zhang et.al. | 2503.17005 | null |
2025-03-20 | 4D Gaussian Splatting SLAM | Yanyan Li et.al. | 2503.16710 | null |
2025-03-20 | Speeding up design and making to reduce time-to-project and time-to-market: an AI-Enhanced approach in engineering education | Giovanni Adorni et.al. | 2503.16307 | null |
2025-03-20 | Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors | Tian Yi Lim et.al. | 2503.16275 | null |
2025-03-19 | A Sigma Point-based Low Complexity Algorithm for Multipath-based SLAM in MIMO Systems | Anna Masiero et.al. | 2503.15286 | null |
2025-03-19 | ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents | Hao Liang et.al. | 2503.14948 | null |
2025-03-18 | 3D Densification for Multi-Map Monocular VSLAM in Endoscopy | X. Anadón et.al. | 2503.14346 | null |
2025-03-18 | GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics | Tingyang Xiao et.al. | 2503.14247 | link |
2025-03-18 | A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios | Huy-Hoang Bui et.al. | 2503.13982 | link |
2025-03-17 | Digital Beamforming Enhanced Radar Odometry | Jingqi Jiang et.al. | 2503.13252 | link |
2025-03-17 | Dynamic-Dark SLAM: RGB-Thermal Cooperative Robot Vision Strategy for Multi-Person Tracking in Both Well-Lit and Low-Light Scenes | Tatsuro Sakai et.al. | 2503.12768 | null |
2025-03-16 | KISS-SLAM: A Simple, Robust, and Accurate 3D LiDAR SLAM System With Enhanced Generalization Capabilities | Tiziano Guadagnino et.al. | 2503.12660 | null |
2025-03-16 | Deblur Gaussian Splatting SLAM | Francesco Girlanda et.al. | 2503.12572 | null |
2025-03-16 | M2UD: A Multi-model, Multi-scenario, Uneven-terrain Dataset for Ground Robot with Localization and Mapping Evaluation | Yanpeng Jia et.al. | 2503.12387 | null |
2025-03-15 | DynaGSLAM: Real-Time Gaussian-Splatting SLAM for Online Rendering, Tracking, Motion Predictions of Moving Objects in Dynamic Scenes | Runfa Blark Li et.al. | 2503.11979 | null |
2025-03-14 | AQUA-SLAM: Tightly-Coupled Underwater Acoustic-Visual-Inertial SLAM with Sensor Calibration | Shida Xu et.al. | 2503.11420 | link |
2025-03-14 | NF-SLAM: Effective, Normalizing Flow-supported Neural Field representations for object-level visual SLAM in automotive applications | Li Cui et.al. | 2503.11199 | null |
2025-03-14 | Leveraging Semantic Graphs for Efficient and Robust LiDAR SLAM | Neng Wang et.al. | 2503.11145 | link |
2025-03-13 | Rapidly Converging Time-Discounted Ergodicity on Graphs for Active Inspection of Confined Spaces | Benjamin Wong et.al. | 2503.10853 | null |
2025-03-13 | OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions | Maxim Popov et.al. | 2503.10331 | null |
2025-03-12 | Online Language Splatting | Saimouli Katragadda et.al. | 2503.09447 | null |
2025-03-12 | MonoSLAM: Robust Monocular SLAM with Global Structure Optimization | Bingzheng Jiang et.al. | 2503.09296 | null |
2025-03-11 | Keypoint Detection and Description for Raw Bayer Images | Jiakai Lin et.al. | 2503.08673 | null |
2025-03-11 | GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats | Kai Deng et.al. | 2503.08071 | link |
2025-03-10 | POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality | Joey Wilson et.al. | 2503.07819 | null |
2025-03-08 | HIPPO-MAT: Decentralized Task Allocation Using GraphSAGE and Multi-Agent Deep Reinforcement Learning | Lavanya Ratnabala et.al. | 2503.07662 | null |
2025-03-10 | AirSwarm: Enabling Cost-Effective Multi-UAV Research with COTS drones | Xiaowei Li et.al. | 2503.06890 | link |
2025-03-08 | InfoFusion Controller: Informed TRRT Star with Mutual Information based on Fusion of Pure Pursuit and MPC for Enhanced Path Planning | Seongjun Choi et.al. | 2503.06010 | link |
2025-03-07 | THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks | Chaoran Xiong et.al. | 2503.05112 | null |
2025-03-07 | Adaptive-LIO: Enhancing Robustness and Precision through Environmental Adaptation in LiDAR Inertial Odometry | Chengwei Zhao et.al. | 2503.05077 | link |
2025-03-06 | MarsLGPR: Mars Rover Localization with Ground Penetrating Radar | Anja Sheppard et.al. | 2503.04944 | null |
2025-03-06 | On the Connection Between Magnetic-Field Odometry Aided Inertial Navigation and Magnetic-Field SLAM | Isaac Skog et.al. | 2503.04286 | null |
2025-03-06 | Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes | Hui Zhang et.al. | 2503.04235 | null |
2025-03-06 | DVM-SLAM: Decentralized Visual Monocular Simultaneous Localization and Mapping for Multi-Agent Systems | Joshua Bird et.al. | 2503.04126 | null |
2025-03-05 | Equivariant Filter Design for Range-only SLAM | Yixiao Ge et.al. | 2503.03973 | null |
2025-03-05 | Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments | Jie Deng et.al. | 2503.03373 | link |
2025-03-05 | OpenGV 2.0: Motion prior-assisted calibration and SLAM with vehicle-mounted surround-view systems | Kun Huang et.al. | 2503.03230 | null |
2025-03-05 | Distributed Certifiably Correct Range-Aided SLAM | Alexander Thoms et.al. | 2503.03192 | link |
2025-03-04 | Monocular visual simultaneous localization and mapping: (r)evolution from geometry to deep learning-based pipelines | Olaya Alvarez-Tunon et.al. | 2503.02955 | link |
2025-03-04 | Introspective Loop Closure for SLAM with 4D Imaging Radar | Maximilian Hilger et.al. | 2503.02383 | null |
2025-03-04 | DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting | Haoyuan Li et.al. | 2503.02223 | link |
2025-03-03 | Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM | Marco Giberna et.al. | 2503.02050 | null |
2025-03-03 | vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding | Ali Tourani et.al. | 2503.01783 | null |
2025-03-03 | MUSt3R: Multi-view Network for Stereo 3D Reconstruction | Yohann Cabon et.al. | 2503.01661 | link |
2025-03-03 | OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding | Dianyi Yang et.al. | 2503.01646 | null |
2025-03-03 | MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features | Chao Ye et.al. | 2503.01571 | link |
2025-03-03 | AI-Driven Relocation Tracking in Dynamic Kitchen Environments | Arash Nasr Esfahani et.al. | 2503.01547 | link |
2025-03-03 | Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning | Xintao Chao et.al. | 2503.01543 | null |
2025-03-03 | RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation | Shu Pan et.al. | 2503.01434 | null |
2025-02-28 | A2DO: Adaptive Anti-Degradation Odometry with Deep Multi-Sensor Fusion for Autonomous Navigation | Hui Lai et.al. | 2502.20767 | null |
2025-02-27 | BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground | Yufei Wei et.al. | 2502.20078 | null |
2025-02-26 | Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects | Petri Mäkinen et.al. | 2502.19169 | null |
2025-02-26 | SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images | Yangfan Xu et.al. | 2502.18932 | null |
2025-02-28 | S-Graphs 2.0 – A Hierarchical-Semantic Optimization and Loop Closure for SLAM | Hriday Bavle et.al. | 2502.18044 | link |
2025-02-25 | MegaLoc: One Retrieval to Place Them All | Gabriele Berton et.al. | 2502.17237 | link |
2025-02-24 | SLABIM: A SLAM-BIM Coupled Dataset in HKUST Main Building | Haoming Huang et.al. | 2502.16856 | link |
2025-02-27 | Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM | Yao Zhang et.al. | 2502.16495 | null |
2025-02-19 | Slamming: Training a Speech Language Model on One GPU in a Day | Gallil Maimon et.al. | 2502.15814 | link |
2025-02-21 | RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes | Sicheng Yu et.al. | 2502.15633 | null |
2025-02-20 | Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2502.14931 | null |
2025-02-19 | 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments | Vincent Ress et.al. | 2502.13803 | null |
2025-02-19 | Active Illumination for Visual Ego-Motion Estimation in the Dark | Francesco Crocetti et.al. | 2502.13708 | null |
2025-02-17 | From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations | Matteo Scucchia et.al. | 2502.12303 | null |
2025-02-19 | pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM | Luigi Freda et.al. | 2502.11955 | link |
2025-02-17 | Anti-Degeneracy Scheme for Lidar SLAM based on Particle Filter in Geometry Feature-Less Environments | Yanbin Li et.al. | 2502.11486 | null |
2025-02-16 | GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting | Zelin Zhou et.al. | 2502.10975 | null |
2025-02-19 | MonoForce: Learnable Image-conditioned Physics Engine | Ruslan Agishev et.al. | 2502.10156 | link |
2025-02-13 | Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions | Dario Pisanti et.al. | 2502.09795 | null |
2025-02-13 | DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior | Mingrui Li et.al. | 2502.09111 | null |
2025-02-12 | LIR-LIVO: A Lightweight,Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features | Shujie Zhou et.al. | 2502.08676 | link |
2025-02-14 | Occupancy-SLAM: An Efficient and Robust Algorithm for Simultaneously Optimizing Robot Poses and Occupancy Map | Yingyu Wang et.al. | 2502.06292 | link |
2025-02-09 | PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map | Yue Pan et.al. | 2502.05752 | link |
2025-02-07 | Joint State and Noise Covariance Estimation | Kasra Khosoussi et.al. | 2502.04584 | null |
2025-02-05 | GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM | Mingrui Li et.al. | 2502.03228 | null |
2025-02-04 | SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification | Yifu Tao et.al. | 2502.02657 | null |
2025-02-04 | HeRCULES: Heterogeneous Radar Dataset in Complex Urban Environment for Multi-session Radar SLAM | Hanjun Kim et.al. | 2502.01946 | null |
2025-02-03 | Statistical enhance learning for modeling and prediction tennis matches at Grand Slam tournaments | Nourah Buhamra et.al. | 2502.01613 | null |
2025-02-03 | Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter | Dabin Kim et.al. | 2502.01092 | null |
2025-02-01 | FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps | Maximilian Leitenstern et.al. | 2502.00395 | link |
2025-01-31 | LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks | Liudi Yang et.al. | 2501.19382 | link |
2025-01-31 | Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping | Yiming Huang et.al. | 2501.19319 | link |
2025-01-31 | GO: The Great Outdoors Multimodal Dataset | Peng Jiang et.al. | 2501.19274 | null |
2025-01-30 | Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems | Liudi Yang et.al. | 2501.18110 | null |
2025-01-28 | SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios | Yinqi Chen et.al. | 2501.16754 | null |
2025-01-27 | Visual-Lidar Map Alignment for Infrastructure Inspections | Jake McLaughlin et.al. | 2501.14486 | link |
2025-01-24 | Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video | Xiaohao Xu et.al. | 2501.14319 | link |
2025-01-24 | HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting | Javier Yu et.al. | 2501.14147 | null |
2025-01-23 | FAST-LIVO2 on Resource-Constrained Platforms: LiDAR-Inertial-Visual Odometry with Efficient Memory and Computation | Bingyang Zhou et.al. | 2501.13876 | null |
2025-01-23 | VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM | Gyuhyeon Pak et.al. | 2501.13402 | null |
2025-01-22 | Grid-based Submap Joining: An Efficient Algorithm for Simultaneously Optimizing Global Occupancy Map and Local Submap Frames | Yingyu Wang et.al. | 2501.12764 | null |
2025-01-21 | DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM | Jesse Morris et.al. | 2501.11893 | link |
2025-01-21 | Survey on Monocular Metric Depth Estimation | Jiuling Zhang et.al. | 2501.11841 | null |
2025-01-19 | OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors | Dominik Kulmer et.al. | 2501.11111 | link |
2025-01-19 | Factor Graph-Based Active SLAM for Spacecraft Proximity Operations | Lorenzo Ticozzi et.al. | 2501.10950 | null |
2025-01-23 | Mesh2SLAM in VR: A Fast Geometry-Based SLAM Framework for Rapid Prototyping in Virtual Reality Applications | Carlos Augusto Pinheiro de Sousa et.al. | 2501.09600 | null |
2025-01-16 | Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment | Maksim Filipenko et.al. | 2501.09490 | null |
2025-01-15 | Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures | Pengru Deng et.al. | 2501.09203 | null |
2025-01-15 | AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning | Assaf Lahiany et.al. | 2501.09160 | null |
2025-01-15 | SLC $^2$ -SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM | Yuhang Ming et.al. | 2501.08880 | null |
2025-01-15 | GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping | Sheng Hong et.al. | 2501.08672 | null |
2025-01-16 | BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module | Dongzhihan Wang et.al. | 2501.08659 | null |
2025-01-15 | Self-Organizing Edge Computing Distribution Framework for Visual SLAM | Jussi Kalliola et.al. | 2501.08629 | null |
2025-01-14 | VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes | Ke Wu et.al. | 2501.08286 | null |
2025-01-13 | Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps | Saurabh Gupta et.al. | 2501.07399 | null |
2025-01-14 | SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting | Yue Hu et.al. | 2501.07015 | null |
2025-01-12 | CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications | Xinyi Zheng et.al. | 2501.06927 | link |
2025-01-11 | SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors | Zhen Hong et.al. | 2501.06469 | null |
2025-01-09 | Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping | Wen Tianci et.al. | 2501.05242 | null |
2025-01-07 | SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment | Yuchun Fan et.al. | 2501.03681 | link |
2025-01-06 | HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos | Jinglei Zhang et.al. | 2501.02973 | null |
2025-01-09 | LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments | Haosong Yue et.al. | 2501.02580 | link |
2025-01-04 | ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle | Yinchuan Wang et.al. | 2501.02166 | link |
2024-12-31 | PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM | Runnan Chen et.al. | 2501.00352 | null |
2024-12-30 | Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields | Evgenii Kruzhkov et.al. | 2412.20976 | null |
2024-12-28 | MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing | Shuo Wang et.al. | 2412.20082 | null |
2024-12-27 | DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction | Kai Xu et.al. | 2412.19584 | null |
2024-12-26 | MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo | Byeonggwon Lee et.al. | 2412.19130 | null |
2024-12-23 | End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework | Fuhua Jia et.al. | 2412.17343 | null |
2024-12-23 | LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation | Riku Uemura et.al. | 2412.17282 | null |
2024-12-23 | Selective Kalman Filter: When and How to Fuse Multi-Sensor Information to Overcome Degeneracy in SLAM | Jie Xu et.al. | 2412.17235 | null |
2025-01-03 | Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry | Zhaoxing Zhang et.al. | 2412.16923 | link |
2024-12-21 | Query Quantized Neural SLAM | Sijia Jiang et.al. | 2412.16476 | link |
2024-12-20 | SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training | Wenxi Chen et.al. | 2412.15649 | link |
2024-12-18 | Energy-Efficient SLAM via Joint Design of Sensing, Communication, and Exploration Speed | Zidong Han et.al. | 2412.13912 | null |
2024-12-18 | Immersive Human-in-the-Loop Control: Real-Time 3D Surface Meshing and Physics Simulation | Sait Akturk et.al. | 2412.13752 | null |
2024-12-18 | 4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching | Fernando Amodeo et.al. | 2412.13639 | link |
2024-12-17 | NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment | Andrea Dunn Beltran et.al. | 2412.13176 | null |
2024-12-18 | Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera | Zhengdi Yu et.al. | 2412.12861 | null |
2024-12-16 | Global SLAM in Visual-Inertial Systems with 5G Time-of-Arrival Integration | Meisam Kabiri et.al. | 2412.12406 | null |
2024-12-16 | MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors | Riku Murai et.al. | 2412.12392 | null |
2024-12-16 | Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges | Martin Aubard et.al. | 2412.11840 | null |
2024-12-19 | RoMeO: Robust Metric Visual Odometry | Junda Cheng et.al. | 2412.11530 | null |
2024-12-14 | Affine EKF: Exploring and Utilizing Sufficient and Necessary Conditions for Observability Maintenance to Improve EKF Consistency | Yang Song et.al. | 2412.10809 | link |
2024-12-13 | RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting | Lizhi Bai et.al. | 2412.09868 | null |
2024-12-12 | SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos | Yuzheng Liu et.al. | 2412.09401 | link |
2024-12-12 | eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction | Jad Mansour et.al. | 2412.09209 | link |
2024-12-12 | Drift-free Visual SLAM using Digital Twins | Roxane Merat et.al. | 2412.08496 | null |
2024-12-10 | A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM | Zongbo Liao et.al. | 2412.07513 | null |
2024-12-08 | DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments | Juwon Kim et.al. | 2412.05839 | null |
2024-12-06 | MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos | Zhengqi Li et.al. | 2412.04463 | null |
2024-12-05 | Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset | Fuzhang Han et.al. | 2412.04287 | link |
2024-12-10 | MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application | Hyesu Jang et.al. | 2412.03887 | null |
2024-12-04 | Large-Scale Dense 3D Mapping Using Submaps Derived From Orthogonal Imaging Sonars | John McConnell et.al. | 2412.03760 | null |
2024-12-04 | BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement | Miguel Arturo Vega Torres et.al. | 2412.03434 | link |
2024-12-04 | NeRF and Gaussian Splatting SLAM in the Wild | Fabian Schmidt et.al. | 2412.03263 | link |
2024-12-04 | MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras | Huai Yu et.al. | 2412.03146 | link |
2024-12-04 | An indoor DSO-based ceiling-vision odometry system for indoor industrial environments | Abdelhak Bougouffa et.al. | 2412.02950 | null |
2024-12-03 | ROVER: A Multi-Season Dataset for Visual SLAM | Fabian Schmidt et.al. | 2412.02506 | link |
2024-12-04 | RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting | Zhenzhong Cao et.al. | 2412.01217 | link |
2024-12-02 | Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM | Alejandro Fontan et.al. | 2412.01116 | null |
2024-12-02 | LiDAR SLAMMOT based on Confidence-guided Data Association | Susu Fang et.al. | 2412.01041 | null |
2024-12-01 | FlashSLAM: Accelerated RGB-D SLAM for Real-Time 3D Scene Reconstruction with Gaussian Splatting | Phu Pham et.al. | 2412.00682 | null |
2024-11-29 | Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction | Shaoxiang Wang et.al. | 2412.00242 | null |
2024-11-28 | Visual SLAMMOT Considering Multiple Motion Models | Peilin Tian et.al. | 2411.19134 | null |
2024-11-27 | ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching | Yangrui Dong et.al. | 2411.18174 | null |
2024-11-27 | HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction | Wei Zhang et.al. | 2411.17982 | link |
2024-11-26 | MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework | Xiangcheng Hu et.al. | 2411.17928 | link |
2024-11-29 | DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting | Christian Homeyer et.al. | 2411.17660 | link |
2024-11-25 | MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Vladimir Yugay et.al. | 2411.16785 | null |
2024-11-24 | Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors | Soumava Paul et.al. | 2411.15966 | null |
2024-11-24 | Near-Range Environmental Perception for Inland Waterway Vessels: A Comparative Study of LiDAR and Automotive FMCW RADAR Sensors | R. Herrmann et.al. | 2411.15901 | null |
2024-11-24 | PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments | Haoang Li et.al. | 2411.15800 | null |
2024-11-23 | Gassidy: Gaussian Splatting SLAM in Dynamic Environments | Long Wen et.al. | 2411.15476 | null |
2024-11-22 | OVO-SLAM: Open-Vocabulary Online Simultaneous Localization and Mapping | Tomas Berriel Martins et.al. | 2411.15043 | link |
2024-11-22 | A Benchmark Dataset for Collaborative SLAM in Service Environments | Harin Park et.al. | 2411.14775 | link |
2024-11-21 | InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation | Marziyeh Bamdad et.al. | 2411.14358 | link |
2024-11-20 | Robust Monocular Visual Odometry using Curriculum Learning | Assaf Lahiany et.al. | 2411.13438 | null |
2024-11-20 | Moving Horizon Estimation for Simultaneous Localization and Mapping with Robust Estimation Error Bounds | Jelena Trisovic et.al. | 2411.13310 | null |
2024-11-19 | 3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality | Hanbeom Chang et.al. | 2411.12514 | null |
2024-11-19 | LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2411.12185 | null |
2024-11-18 | Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos et.al. | 2411.11481 | null |
2024-11-18 | The Blue Horizontal-Branch Stars From the LAMOST Survey: Atmospheric Parameters | Jie Ju et.al. | 2411.11250 | null |
2024-11-17 | A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality | Wei-Hsiang Lien et.al. | 2411.10940 | null |
2024-11-16 | DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment | Mangyu Kong et.al. | 2411.10722 | link |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-15 | BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation | Yufei Wei et.al. | 2411.10195 | null |
2024-11-13 | DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization | Yueming Xu et.al. | 2411.08373 | null |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-12 | Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments | Ankit Shaw et.al. | 2411.08231 | null |
2024-11-12 | NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN | Sonia Raychaudhuri et.al. | 2411.07848 | null |
2024-11-11 | Lost in Tracking Translation: A Comprehensive Analysis of Visual SLAM in Human-Centered XR and IoT Ecosystems | Yasra Chandio et.al. | 2411.07146 | null |
2024-11-11 | Learning from Feedback: Semantic Enhancement for Object SLAM Using Foundation Models | Jungseok Hong et.al. | 2411.06752 | null |
2024-11-11 | HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation | Xiaolong Wang et.al. | 2411.06700 | null |
2024-11-08 | Development of an indoor localization and navigation system based on monocular SLAM for mobile robots | Thanh Nguyen Canh et.al. | 2411.05337 | null |
2024-11-07 | Development of a Service Robot for Hospital Environments in Rehabilitation Medicine with LiDAR Based Simultaneous Localization and Mapping | Sayat Ibrayev et.al. | 2411.04797 | null |
2024-11-07 | MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation | Sayan Paul et.al. | 2411.04796 | null |
2024-11-09 | DEIO: Deep Event Inertial Odometry | Weipeng Guan et.al. | 2411.03928 | link |
2024-11-06 | Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward | Shashi Kumar et.al. | 2411.03866 | null |
2024-11-06 | LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior | Jiahui Wang et.al. | 2411.03610 | link |
2024-11-05 | LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting | Huibin Zhao et.al. | 2411.02703 | null |
2024-11-04 | Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing | Xinran Zhang et.al. | 2411.02553 | null |
2024-11-04 | Semantic Masking and Visual Feature Matching for Robust Localization | Luisa Mao et.al. | 2411.01804 | null |
2024-10-31 | XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM | Xiaomeng Wang et.al. | 2410.23690 | link |
2024-10-30 | LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM | Yucheng Huang et.al. | 2410.23231 | link |
2024-10-30 | ISAC Prototype System for Multi-Domain Cooperative Communication Networks | Jie Yang et.al. | 2410.22956 | null |
2024-10-30 | SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark | HyunJun Jung et.al. | 2410.22715 | link |
2024-10-29 | LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues | Hanqing Jiang et.al. | 2410.22213 | null |
2024-10-29 | EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments | Linus Nwankwo et.al. | 2410.22200 | null |
2024-10-28 | NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments | Taiyi Pan et.al. | 2410.21615 | link |
2024-10-28 | coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM | Emiliano Höss et.al. | 2410.21149 | link |
2024-11-01 | RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior | Mingjiang Liang et.al. | 2410.20358 | null |
2024-10-25 | Context-Based Visual-Language Place Recognition | Soojin Woo et.al. | 2410.19341 | link |
2024-10-22 | AG-SLAM: Active Gaussian Splatting SLAM | Wen Jiang et.al. | 2410.17422 | null |
2024-10-22 | Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study | J. Jorge et.al. | 2410.17171 | null |
2024-10-19 | EndoMetric: Near-light metric scale monocular SLAM | Raúl Iranzo et.al. | 2410.15065 | null |
2024-10-17 | Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot | Dongkun Han et.al. | 2410.13612 | null |
2024-10-17 | TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal | Yanpeng Jia et.al. | 2410.13240 | null |
2024-10-16 | QueensCAMP: an RGB-D dataset for robust Visual SLAM | Hudson M. S. Bruno et.al. | 2410.12520 | link |
2024-10-18 | PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM | Guanghao Li et.al. | 2410.12324 | null |
2024-10-16 | Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem | Yichen Sha et.al. | 2410.12169 | null |
2024-10-15 | V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting | Tuan Dang et.al. | 2410.12068 | link |
2024-10-15 | GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information | Wancai Zheng et.al. | 2410.11356 | null |
2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | link |
2024-10-14 | MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator | Taozhe Li et.al. | 2410.10669 | null |
2024-10-13 | Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph | Benoit Casseau et.al. | 2410.09896 | null |
2024-10-12 | SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | Wenxi Chen et.al. | 2410.09503 | link |
2024-10-12 | An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation | Wei Liang et.al. | 2410.09443 | null |
2024-10-12 | ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras | Junkai Niu et.al. | 2410.09374 | link |
2024-10-11 | Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System | Zheng Liu et.al. | 2410.08935 | link |
2024-10-11 | Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints | Yicheng He et.al. | 2410.08780 | null |
2024-10-10 | ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization | Mason B. Peterson et.al. | 2410.08262 | link |
2024-10-10 | IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Jian Huang et.al. | 2410.08107 | link |
2024-10-08 | Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching | Gongxin Yao et.al. | 2410.06285 | null |
2024-10-08 | Submodular Optimization for Keyframe Selection & Usage in SLAM | David Thorne et.al. | 2410.05576 | null |
2024-10-07 | SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones | Denis Davletshin et.al. | 2410.05405 | null |
2024-10-07 | Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection | Ang He et.al. | 2410.05017 | null |
2024-10-05 | A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems | Nikola Radulov et.al. | 2410.04242 | link |
2024-10-05 | High-Speed Stereo Visual SLAM for Low-Powered Computing Devices | Ashish Kumar et.al. | 2410.04090 | link |
2024-10-04 | EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM | Shi Chen et.al. | 2410.03812 | null |
2024-10-04 | Estimating Body and Hand Motion in an Ego-sensed World | Brent Yi et.al. | 2410.03665 | null |
2024-10-03 | LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features | Zihao Dong et.al. | 2410.02961 | null |
2024-10-02 | ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space | Hogyun Kim et.al. | 2410.01325 | null |
2024-10-01 | Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency | William Dubois et.al. | 2410.00758 | null |
2024-10-02 | CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM | Dapeng Feng et.al. | 2410.00486 | link |
2024-09-30 | Additively Manufactured Open-Source Quadruped Robots for Multi-Robot SLAM Applications | Zachary Fuge et.al. | 2410.00122 | null |
2024-09-30 | Direct Multipath-Based SLAM | Mingchao Liang et.al. | 2409.20552 | null |
2024-09-30 | Robust Gaussian Splatting SLAM by Leveraging Loop Closure | Zunjie Zhu et.al. | 2409.20111 | null |
2024-09-30 | DynORecon: Dynamic Object Reconstruction for Navigation | Yiduo Wang et.al. | 2409.19928 | null |
2024-09-29 | CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation | Yifan Duan et.al. | 2409.19597 | null |
2024-09-29 | CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought | Yexing Du et.al. | 2409.19510 | link |
2024-09-29 | Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface | Ziniu Wu et.al. | 2409.19499 | null |
2024-09-27 | Royal Reveals: LiDAR Mapping of Kronborg Castle, Echoes of Hamlet’s Halls | Leon Davies et.al. | 2409.18752 | null |
2024-09-26 | BlinkTrack: Feature Tracking over 100 FPS via Events and Images | Yichen Shen et.al. | 2409.17981 | null |
2024-09-26 | Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry | Qi Zhang et.al. | 2409.17729 | null |
2024-09-26 | Event-based Stereo Depth Estimation: A Survey | Suman Ghosh et.al. | 2409.17680 | null |
2024-09-25 | Efficient Submap-based Autonomous MAV Exploration using Visual-Inertial SLAM Configurable for LiDARs or Depth Cameras | Sotiris Papatheodorou et.al. | 2409.16972 | null |
2024-09-25 | Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM | Phu Pham et.al. | 2409.16944 | null |
2024-09-25 | Inline Photometrically Calibrated Hybrid Visual SLAM | Nicolas Abboud et.al. | 2409.16810 | link |
2024-09-25 | Topological SLAM in colonoscopies leveraging deep features and topological priors | Javier Morlana et.al. | 2409.16806 | link |
2024-09-25 | Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots | Masoud Dayani Najafabadi et.al. | 2409.16595 | link |
2024-09-25 | Task-driven SLAM Benchmarking | Yanwei Du et.al. | 2409.16573 | link |
2024-09-24 | SoMaSLAM: 2D Graph SLAM for Sparse Range Sensing with Soft Manhattan World Constraints | Jeahn Han et.al. | 2409.15736 | null |
2024-09-23 | Spectral Graph Theoretic Methods for Enhancing Network Robustness in Robot Localization | Neelkamal Somisetty et.al. | 2409.15506 | null |
2024-09-22 | SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms | Niraj Pudasaini et.al. | 2409.14515 | null |
2024-09-21 | Point Cloud Structural Similarity-based Underwater Sonar Loop Detection | Donghwi Jung et.al. | 2409.14020 | link |
2024-09-20 | HMD $^2$ : Environment-aware Motion Generation from Single Egocentric Head-Mounted Device | Vladimir Guzov et.al. | 2409.13426 | null |
2024-09-20 | Learning Visual Information Utility with PIXER | Yash Turkar et.al. | 2409.13151 | null |
2024-09-19 | MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting | Yan Song Hu et.al. | 2409.13055 | null |
2024-09-19 | Hi-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2409.12518 | link |
2024-09-18 | Bundle Adjustment in the Eager Mode | Zitong Zhan et.al. | 2409.12190 | null |
2024-09-23 | Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping | Jaehyung Jung et.al. | 2409.12051 | null |
2024-09-18 | Metric-Semantic Factor Graph Generation based on Graph Neural Networks | Jose Andres Millan-Romera et.al. | 2409.11972 | null |
2024-09-18 | Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments | Lei Cheng et.al. | 2409.11854 | null |
2024-09-18 | ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation | Yanlin Jin et.al. | 2409.11692 | null |
2024-09-18 | SLAM assisted 3D tracking system for laparoscopic surgery | Jingwei Song et.al. | 2409.11688 | null |
2024-09-17 | GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure | Ziheng Xu et.al. | 2409.10982 | null |
2024-09-17 | Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells | Ankit Butola et.al. | 2409.10971 | null |
2024-09-17 | Evaluating and Improving the Robustness of LiDAR-based Localization and Mapping | Bo Yang et.al. | 2409.10824 | link |
2024-09-16 | P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty | Yufan Zhang et.al. | 2409.10143 | link |
2024-09-16 | SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning | Amogh Joshi et.al. | 2409.09990 | null |
2024-09-16 | Enhancing Visual Inertial SLAM with Magnetic Measurements | Bharat Joshi et.al. | 2409.09904 | null |
2024-09-15 | Marginalizing and Conditioning Gaussians onto Linear Approximations of Smooth Manifolds with Applications in Robotics | Zi Cong Guo et.al. | 2409.09871 | link |
2024-09-15 | Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping | Yi Liu et.al. | 2409.09763 | null |
2024-09-15 | High Definition Map Mapping and Update: A General Overview and Future Directions | Benny Wijaya et.al. | 2409.09726 | null |
2024-09-14 | MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry | Yuheng Qiu et.al. | 2409.09479 | null |
2024-09-14 | Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM | Haoying Li et.al. | 2409.09410 | null |
2024-09-14 | GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians | Dasong Gao et.al. | 2409.09295 | link |
2024-09-14 | Panoramic Direct LiDAR-assisted Visual Odometry | Zikang Yuan et.al. | 2409.09287 | link |
2024-09-11 | Object Depth and Size Estimation using Stereo-vision and Integration with SLAM | Layth Hamad et.al. | 2409.07623 | null |
2024-09-11 | Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry | Anbo Tao et.al. | 2409.06948 | null |
2024-09-10 | Technical Report of Mobile Manipulator Robot for Industrial Environments | Erfan Amoozad Khalili et.al. | 2409.06693 | null |
2024-09-10 | Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios | Zhiqiang Chen et.al. | 2409.04961 | link |
2024-09-08 | FLAF: Focal Line and Feature-constrained Active View Planning for Visual Teach and Repeat | Changfei Fu et.al. | 2409.03457 | null |
2024-09-03 | Integration of Augmented Reality and Mobile Robot Indoor SLAM for Enhanced Spatial Awareness | Michael D. Friske et.al. | 2409.01915 | null |
2024-09-03 | Explicit Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric | Tingchen Ma et.al. | 2409.01856 | null |
2024-09-02 | Saying goodbyes to rotating your phone: Magnetometer calibration during SLAM | Ilari Vallivaara et.al. | 2409.01242 | null |
2024-09-02 | Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection | Manon Kok et.al. | 2409.01091 | null |
2024-09-02 | Robust Vehicle Localization and Tracking in Rain using Street Maps | Yu Xiang Tan et.al. | 2409.01038 | link |
2024-08-31 | UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM | Mostafa Mansour et.al. | 2409.00362 | null |
2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
2024-08-30 | Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Shuyang Zhang et.al. | 2408.17005 | link |
2024-08-29 | Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry | Michael Adlerstein et.al. | 2408.16472 | null |
2024-08-28 | Single-Photon 3D Imaging with Equi-Depth Photon Histograms | Kaustubh Sadekar et.al. | 2408.16150 | null |
2024-08-28 | BIM-SLAM: Integrating BIM Models in Multi-session SLAM for Lifelong Mapping using 3D LiDAR | Miguel Arturo Vega Torres et.al. | 2408.15870 | link |
2024-08-30 | Addressing the challenges of loop detection in agricultural environments | Nicolás Soncini et.al. | 2408.15761 | link |
2024-08-28 | ES-PTAM: Event-based Stereo Parallel Tracking and Mapping | Suman Ghosh et.al. | 2408.15605 | link |
2024-08-28 | PointEMRay: A Novel Efficient SBR Framework on Point Based Geometry | Kaiqiao Yang et.al. | 2408.15583 | null |
2024-09-02 | Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration | Rongge Zhang et.al. | 2408.14726 | link |
2024-08-26 | A Survey on Reinforcement Learning Applications in SLAM | Mohammad Dehghani Tezerjani et.al. | 2408.14518 | null |
2024-08-28 | FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2408.14035 | link |
2024-08-21 | Informed, Constrained, Aligned: A Field Analysis on Degeneracy-aware Point Cloud Registration in the Wild | Turcan Tuna et.al. | 2408.11809 | null |
2024-08-21 | LiFCal: Online Light Field Camera Calibration via Bundle Adjustment | Aymeric Fleith et.al. | 2408.11682 | null |
2024-08-21 | Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars | Zhihao Lin et.al. | 2408.11582 | null |
2024-08-21 | RaNDT SLAM: Radar SLAM Based on Intensity-Augmented Normal Distributions Transform | Maximilian Hilger et.al. | 2408.11576 | link |
2024-08-21 | Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models | Kento Kawaharazuka et.al. | 2408.11380 | null |
2024-08-20 | LoopSplat: Loop Closure by Registering 3D Gaussian Splats | Liyuan Zhu et.al. | 2408.10154 | link |
2024-08-19 | Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM | Sanghyun Hahn et.al. | 2408.09727 | link |
2024-08-17 | GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System | Shuo Wang et.al. | 2408.09191 | null |
2024-08-15 | GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Yutong Wang et.al. | 2408.07917 | link |
2024-08-14 | Inverse k-visibility for RSSI-based Indoor Geometric Mapping | Junseo Kim et.al. | 2408.07757 | null |
2024-08-14 | Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition | Hogyun Kim et.al. | 2408.07330 | link |
2024-08-12 | CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments | Yanpeng Jia et.al. | 2408.05981 | null |
2024-08-21 | Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis | Zhongche Qu et.al. | 2408.05635 | null |
2024-08-10 | TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping | Seoyeon Jang et.al. | 2408.05453 | null |
2024-08-08 | Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods | Yiming Zhou et.al. | 2408.04268 | null |
2024-08-07 | Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM | Yan Song Hu et.al. | 2408.03825 | null |
2024-08-07 | AirSLAM: An Efficient and Illumination-Robust Point-Line Visual SLAM System | Kuan Xu et.al. | 2408.03520 | link |
2024-08-06 | BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications | G. Manni et.al. | 2408.03078 | link |
2024-08-04 | SLAMS-Propelled Electron Acceleration at High-Mach Number Astrophysical Shocks | Vladimir Zeković et.al. | 2408.02084 | null |
2024-08-03 | Visual-Inertial SLAM for Agricultural Robotics: Benchmarking the Benefits and Computational Costs of Loop Closing | Fabian Schmidt et.al. | 2408.01716 | link |
2024-08-03 | Deep Patch Visual SLAM | Lahav Lipson et.al. | 2408.01654 | link |
2024-08-02 | Momentum Capture and Prediction System Based on Wimbledon Open2023 Tournament Data | Chang Liu et.al. | 2408.01544 | null |
2024-08-07 | IG-SLAM: Instant Gaussian SLAM | F. Aykut Sarikamis et.al. | 2408.01126 | null |
2024-08-01 | Collecting Larg-Scale Robotic Datasets on a High-Speed Mobile Platform | Yuxin Lin et.al. | 2408.00545 | null |
2024-08-01 | High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets | Jian Li et.al. | 2408.00538 | link |
2024-07-31 | SuperVINS: A visual-inertial SLAM framework integrated deep learning features | Hongkun Luo et.al. | 2407.21348 | link |
2024-07-30 | NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding | Hongjia Zhai et.al. | 2407.20853 | null |
2024-07-29 | A flexible framework for accurate LiDAR odometry, map manipulation, and localization | José Luis Blanco-Claraco et.al. | 2407.20465 | link |
2024-07-28 | Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data | Azmyin Md. Kamal et.al. | 2407.19518 | null |
2024-07-26 | Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation | Aditya Penumarti et.al. | 2407.19046 | null |
2024-07-26 | HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM | Zhe Xin et.al. | 2407.18813 | null |
2024-07-25 | CodedVO: Coded Visual Odometry | Sachin Shah et.al. | 2407.18240 | null |
2024-07-28 | HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation | Zhenzhi Wang et.al. | 2407.17438 | link |
2024-07-22 | Memory Management for Real-Time Appearance-Based Loop Closure Detection | Mathieu Labbé et.al. | 2407.15890 | null |
2024-07-22 | Reinforcement Learning Meets Visual Odometry | Nico Messikommer et.al. | 2407.15626 | link |
2024-07-22 | Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM | Mathieu Labbe et.al. | 2407.15305 | null |
2024-07-21 | Semi-Supervised Pipe Video Temporal Defect Interval Localization | Zhu Huang et.al. | 2407.15170 | null |
2024-07-21 | VoxDepth: Rectification of Depth Images on Edge Devices | Yashashwee Chakrabarty et.al. | 2407.15067 | null |
2024-07-20 | From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM | Lorenzo Montano-Oliván et.al. | 2407.14797 | null |
2024-07-19 | MSSP : A Versatile Multi-Scenario Adaptable Intelligent Robot Simulation Platform Based on LIDAR-Inertial Fusion | Qiyan Li et.al. | 2407.14102 | null |
2024-07-18 | A New Tightly-Coupled Dual-VIO for a Mobile Manipulator With Dynamic Locomotion | Jianxiang Xu et.al. | 2407.13878 | link |
2024-07-18 | Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM | Baicheng Li et.al. | 2407.13338 | null |
2024-07-18 | Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain | Bach Nguyen Gia et.al. | 2407.13159 | link |
2024-07-17 | Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge | Andrea Albanese et.al. | 2407.12663 | null |
2024-07-17 | Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM | Markus Weißflog et.al. | 2407.12408 | null |
2024-07-19 | Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Sangjun Lee et.al. | 2407.12405 | link |
2024-07-17 | Fusion LiDAR-Inertial-Encoder data for High-Accuracy SLAM | Manh Do Duc et.al. | 2407.11870 | null |
2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
2024-07-16 | Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems | Jianzhu Huai et.al. | 2407.11705 | null |
2024-07-16 | Batch SLAM with PMBM Data Association Sampling and Graph-Based Optimization | Yu Ge et.al. | 2407.11643 | null |
2024-07-16 | I $^2$ -SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM | Gwangtak Bae et.al. | 2407.11347 | null |
2024-07-16 | FR-SLAM: A SLAM Improvement Method Based on Floor Plan Registration | Jiantao Feng et.al. | 2407.11299 | null |
2024-07-15 | Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method | Adam Korycki et.al. | 2407.11238 | null |
2024-07-12 | An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural Networks | Seyed Alireza Rahimi Azghadi et.al. | 2407.09242 | null |
2024-07-11 | SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM | Neng Wang et.al. | 2407.08106 | link |
2024-07-09 | Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM | David Hug et.al. | 2407.07074 | link |
2024-07-15 | A Neurosymbolic Approach to Adaptive Feature Extraction in SLAM | Yasra Chandio et.al. | 2407.06889 | null |
2024-07-08 | Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots | Siva Krishna Ravipati et.al. | 2407.06077 | link |
2024-07-10 | Co-RaL: Complementary Radar-Leg Odometry with 4-DoF Optimization and Rolling Contact | Sangwoo Jung et.al. | 2407.05820 | null |
2024-07-07 | Active Collaborative Visual SLAM exploiting ORB Features | Muhammad Farhan Ahmed et.al. | 2407.05453 | null |
2024-07-06 | VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking | Xuefeng Jiang et.al. | 2407.05017 | null |
2024-07-06 | Symmetric Linear Arc Monadic Datalog and Gadget Reductions | Manuel Bodirsky et.al. | 2407.04924 | null |
2024-07-03 | Ultra-Lightweight Collaborative Mapping for Robot Swarms | Vlad Niculescu et.al. | 2407.03136 | null |
2024-07-01 | RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields | Haochen Jiang et.al. | 2407.01303 | link |
2024-07-01 | Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation | Lianjie Guo et.al. | 2407.01292 | link |
2024-07-01 | Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization | Ruofei Bai et.al. | 2407.01013 | link |
2024-06-30 | Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation | Adnan Abdullah et.al. | 2407.00848 | null |
2024-06-30 | OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration | Fengyuan Yang et.al. | 2407.00574 | null |
2024-06-24 | Compressing Search with Language Models | Thomas Mulc et.al. | 2407.00085 | null |
2024-06-28 | CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services | DongKi Noh et.al. | 2406.19634 | null |
2024-06-25 | Benchmarking SLAM Algorithms in the Cloud: The SLAM Hive System | Xinzhe Liu et.al. | 2406.17586 | null |
2024-07-02 | SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation | Xu Liu et.al. | 2406.17249 | link |
2024-06-24 | From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking | Xiaohao Xu et.al. | 2406.16850 | link |
2024-06-23 | Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy | Chen Wang et.al. | 2406.16087 | null |
2024-06-19 | Simultaneous Map and Object Reconstruction | Nathaniel Chodosh et.al. | 2406.13896 | null |
2024-06-14 | Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization | Wonho Song et.al. | 2406.11599 | null |
2024-06-16 | Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry | Boris Chidlovskii et.al. | 2406.11019 | null |
2024-06-15 | Detection and Utilization of Reflections in LiDAR Scans Through Plane Optimization and Plane SLAM | Yinjie Li et.al. | 2406.10494 | link |
2024-06-12 | From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers | Swaminathan Gurumurthy et.al. | 2406.07785 | link |
2024-06-27 | Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF) | Gyubeom Im et.al. | 2406.06427 | null |
2024-06-10 | Notes on Various Errors and Jacobian Derivations for SLAM | Gyubeom Im et.al. | 2406.06422 | null |
2024-06-23 | Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation | Shenghao Li et.al. | 2406.06374 | link |
2024-06-15 | Visual-Inertial SLAM as Simple as A, B, VINS | Nathaniel Merrill et.al. | 2406.05969 | null |
2024-06-09 | MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps | Jianhao Zheng et.al. | 2406.05849 | null |
2024-06-06 | Open Problem: Active Representation Learning | Nikola Milosevic et.al. | 2406.03845 | null |
2024-06-04 | ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization | Chen Mao et.al. | 2406.01906 | link |
2024-06-03 | The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry | Paolo Cudrano et.al. | 2406.01797 | null |
2024-06-03 | Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry | Takayuki Kanai et.al. | 2406.00929 | null |
2024-06-02 | Visual place recognition for aerial imagery: A survey | Ivan Moskalenko et.al. | 2406.00885 | link |
2024-05-30 | Structure Gaussian SLAM with Manhattan World Hypothesis | Shuhong Liu et.al. | 2405.20031 | null |
2024-05-30 | Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar | Wouter Jansen et.al. | 2405.19869 | null |
2024-05-30 | SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization | Jiang Wang et.al. | 2405.19813 | link |
2024-05-30 | TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM | Peifeng Jiang et.al. | 2405.19614 | null |
2024-05-27 | CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy | Richard Elvira et.al. | 2405.16932 | null |
2024-05-26 | Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians | Erik Sandström et.al. | 2405.16544 | link |
2024-05-24 | NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes | Lizhi Bai et.al. | 2405.15151 | null |
2024-05-23 | ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization | Han Song et.al. | 2405.15082 | null |
2024-05-23 | Synergistic Global-space Camera and Human Reconstruction from Videos | Yizhou Zhao et.al. | 2405.14855 | null |
2024-05-23 | CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments | Yang Zhou et.al. | 2405.14731 | link |
2024-05-23 | Efficient Robot Learning for Perception and Mapping | Niclas Vödisch et.al. | 2405.14688 | null |
2024-05-22 | Monocular Gaussian SLAM with Language Extended Loop Closure | Tian Lan et.al. | 2405.13748 | null |
2024-05-26 | NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments | Dongha Chung et.al. | 2405.12563 | link |
2024-05-20 | EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving | Boyi Liu et.al. | 2405.12120 | null |
2024-05-24 | Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation | Hyungtae Lim et.al. | 2405.11176 | null |
2024-05-18 | MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Xinli Guo et.al. | 2405.11129 | link |
2024-05-17 | CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion | Gang Wang et.al. | 2405.10793 | null |
2024-05-17 | Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map | Liang Zhao et.al. | 2405.10743 | null |
2024-05-10 | MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization | Pengcheng Zhu et.al. | 2405.06241 | null |
2024-05-07 | Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map | Yuxuan Xia et.al. | 2405.04290 | null |
2024-05-07 | IMU-Aided Event-based Stereo Visual Odometry | Junkai Niu et.al. | 2405.04071 | link |
2024-04-27 | An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation | Olivier Brochu Dufour et.al. | 2404.17745 | null |
2024-04-26 | Camera Motion Estimation from RGB-D-Inertial Scene Flow | Samuel Cerezo et.al. | 2404.17251 | link |
2024-04-23 | Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization | Lahav Lipson et.al. | 2404.15263 | link |
2024-04-18 | SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints | Spencer Carmichael et.al. | 2404.12339 | null |
2024-04-17 | VBR: A Vision Benchmark in Rome | Leonardo Brizi et.al. | 2404.11322 | link |
2024-04-14 | Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration | Yanhao Zhang et.al. | 2404.09169 | link |
2024-04-06 | Salient Sparse Visual Odometry With Pose-Only Supervision | Siyu Chen et.al. | 2404.04677 | null |
2024-03-25 | A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments | Gianluca D’Amico et.al. | 2403.17084 | null |
2024-03-19 | On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine | Jagatpreet Singh Nir et.al. | 2403.13170 | null |
2024-03-18 | The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions | Margaret Hansen et.al. | 2403.12194 | null |
2024-03-18 | An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation | Zewen Xu et.al. | 2403.11639 | null |
2024-03-16 | Efficient Domain Adaptation for Endoscopic Visual Odometry | Junyang Wu et.al. | 2403.10860 | null |
2024-03-14 | Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO) | Matthew Lisondra et.al. | 2403.09882 | null |
2024-03-02 | Grid-based Fast and Structural Visual Odometry | Zhang Zhihe et.al. | 2403.01110 | null |
2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961 | link |
2024-02-22 | Secure Navigation using Landmark-based Localization in a GPS-denied Environment | Ganesh Sapkota et.al. | 2402.14280 | null |
2024-02-19 | Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment | Ganesh Sapkota et.al. | 2402.12551 | null |
2024-02-07 | Online and Certifiably Correct Visual Odometry and Mapping | Devansh R Agrawal et.al. | 2402.05254 | null |
2024-02-06 | YOLOPoint Joint Keypoint and Object Detection | Anton Backhaus et.al. | 2402.03989 | link |
2024-01-19 | Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning | André O. Françani et.al. | 2401.10857 | null |
2024-01-17 | Event-Based Visual Odometry on Non-Holonomic Ground Vehicles | Wanting Xu et.al. | 2401.09331 | link |
2024-01-11 | On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering | Feng Zhu et.al. | 2401.05836 | null |
2023-12-19 | Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry | Olaya Álvarez-Tuñón et.al. | 2401.05396 | link |
2024-01-07 | Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people | Ali Samadzadeh et.al. | 2401.03604 | link |
2024-01-03 | LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry | Weirong Chen et.al. | 2401.01887 | link |
2023-12-28 | SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction | Zikang Yuan et.al. | 2312.16800 | link |
2023-12-20 | NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields | Jens Naumann et.al. | 2312.13471 | null |
2023-12-22 | Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM | Junru Lin et.al. | 2312.13332 | null |
2023-12-20 | Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach | Habib Boloorchi Tabrizi et.al. | 2312.13162 | link |
2023-12-20 | Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera | Abdulkadhem A. Abdulkadhem et.al. | 2312.12680 | null |
2023-12-15 | Deep Event Visual Odometry | Simon Klenk et.al. | 2312.09800 | link |
2023-12-10 | SuperPrimitive: Scene Reconstruction at a Primitive Level | Kirill Mazur et.al. | 2312.05889 | null |
2023-12-04 | iMatching: Imperative Correspondence Learning | Zitong Zhan et.al. | 2312.02141 | link |
2023-11-30 | Event-based Visual Inertial Velometer | Xiuyuan Lu et.al. | 2311.18189 | null |
2023-11-21 | CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems | Young-Hee Lee et.al. | 2311.12580 | null |
2023-11-10 | Dense Visual Odometry Using Genetic Algorithm | Slimane Djema et.al. | 2311.06149 | null |
2023-11-07 | Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM | Seongwook Yoon et.al. | 2311.03722 | null |
2023-10-23 | Converting Depth Images and Point Clouds for Feature-based Pose Estimation | Robert Lösch et.al. | 2310.14924 | link |
2023-10-17 | Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms | Yanyan Li et.al. | 2310.10931 | link |
2023-10-12 | Jointly Optimized Global-Local Visual Localization of UAVs | Haoling Li et.al. | 2310.08082 | null |
2023-10-10 | l-dyno: framework to learn consistent visual features using robot’s motion | Kartikeya Singh et.al. | 2310.06249 | link |
2023-10-08 | XVO: Generalized Visual Odometry via Cross-Modal Self-Training | Lei Lai et.al. | 2309.16772 | null |
2023-10-22 | ObVi-SLAM: Long-Term Object-Visual SLAM | Amanda Adkins et.al. | 2309.15268 | link |
2023-09-23 | Tag-based Visual Odometry Estimation for Indoor UAVs Localization | Massimiliano Bertoni et.al. | 2309.13311 | null |
2023-09-22 | Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms | Olivier Gamache et.al. | 2309.13139 | link |
2023-09-20 | Conformalized Multimodal Uncertainty Regression and Reasoning | Domenico Parente et.al. | 2309.11018 | null |
2023-09-20 | OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving | Heng Li et.al. | 2309.11011 | link |
2023-09-19 | LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation | Haizhou Zhang et.al. | 2309.10436 | link |
2023-09-21 | Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration | Hongbo Zhao et.al. | 2309.10314 | null |
2023-09-18 | End-to-End Learned Event- and Image-based Visual Odometry | Roberto Pellerito et.al. | 2309.09947 | link |
2023-09-14 | An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments | Yehao Liu et.al. | 2309.07408 | null |
2023-09-11 | Evaluating Visual Odometry Methods for Autonomous Driving in Rain | Yu Xiang Tan et.al. | 2309.05249 | null |
2023-09-08 | Robot Localization and Mapping Final Report – Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry | Akankshya Kar et.al. | 2309.04147 | null |
2023-09-04 | EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity | Zijie Jiang et.al. | 2309.01296 | null |
2023-08-27 | Deep Learning for Visual Localization and Mapping: A Survey | Changhao Chen et.al. | 2308.14039 | null |
2023-08-19 | Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters | Xiao Liu et.al. | 2308.09870 | link |
2023-08-12 | 4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion | Guirong Zhuo et.al. | 2308.06573 | null |
2023-08-10 | Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU | U. V. B. L. Udugama et.al. | 2308.05515 | null |
2023-08-02 | A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry | Cora A. Dimmig et.al. | 2308.01398 | null |
2023-08-02 | Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network | Shenbagaraj Kannapiran et.al. | 2308.01125 | null |
2023-08-02 | Preliminary Design of the Dragonfly Navigation Filter | Ben Schilling et.al. | 2307.13513 | null |
2023-07-19 | Optimizing the extended Fourier Mellin Transformation Algorithm | Wenqing Jiang et.al. | 2307.10015 | link |
2023-07-15 | Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents | Ke Cao et.al. | 2307.07763 | null |
2023-07-26 | Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression | Jianeng Wang et.al. | 2306.01188 | null |
2023-07-06 | OSPC: Online Sequential Photometric Calibration | Jawad Haidar et.al. | 2305.17673 | null |
2023-05-15 | Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface | Shifan Zhu et.al. | 2305.08962 | null |
2023-05-10 | Transformer-based model for monocular visual odometry: a video understanding approach | André O. Françani et.al. | 2305.06121 | link |
2023-04-29 | Modality-invariant Visual Odometry for Embodied Vision | Marius Memmel et.al. | 2305.00348 | link |
2023-04-21 | FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving | Yuxuan Liu et.al. | 2304.10719 | null |
2023-07-08 | Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Visual Bootstrapping | Hanyu Cai et.al. | 2304.08978 | null |
2023-04-12 | SiLK – Simple Learned Keypoints | Pierre Gleize et.al. | 2304.06194 | link |
2023-04-11 | ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster | Yifei Dong et.al. | 2304.04943 | null |
2023-03-21 | Learning a Depth Covariance Function | Eric Dexheimer et.al. | 2303.12157 | null |
2023-03-21 | Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network | Alessandro Navone et.al. | 2303.11725 | null |
2023-03-20 | VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors | Thien Hoang Nguyen et.al. | 2303.10903 | null |
2023-03-17 | CoVIO: Online Continual Learning for Visual-Inertial Odometry | Niclas Vödisch et.al. | 2303.10149 | link |
2023-03-15 | UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry | Chaoyang Jiang et.al. | 2303.08550 | null |
2023-03-13 | Discovering Multiple Algorithm Configurations | Leonid Keselman et.al. | 2303.07434 | null |
2023-03-09 | Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation | Masahiro Hirano et.al. | 2303.05192 | null |
2023-03-16 | Stereo Event-based Visual-Inertial Odometry | Kunfeng Wang et.al. | 2303.05086 | link |
2023-03-07 | Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor | Eduardo Gallo et.al. | 2303.03804 | null |
2023-03-03 | Lightweight, Uncertainty-Aware Conformalized Visual Odometry | Alex C. Stutts et.al. | 2303.02207 | null |
2023-02-24 | FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets | Yelena Randall et.al. | 2302.12772 | null |
2023-02-27 | CP+: Camera Poses Augmentation with Large-scale LiDAR Maps | Jiadi Cui et.al. | 2302.12198 | null |
2023-02-19 | EdgeVO: An Efficient and Accurate Edge-based Visual Odometry | Hui Zhao et.al. | 2302.09493 | null |
2023-01-27 | HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera | Mostafa Ahmadi et.al. | 2301.11823 | null |
2023-01-26 | Distributed Optimization Methods for Multi-Robot Systems: Part I – A Tutorial | Ola Shorinwa et.al. | 2301.11313 | null |
2023-01-24 | Generalized Object Search | Kaiyu Zheng et.al. | 2301.10121 | null |
2023-01-22 | Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories | Hanlin Chen et.al. | 2301.09194 | null |
2023-01-21 | Dense RGB SLAM with Neural Implicit Maps | Heng Li et.al. | 2301.08930 | null |
2023-01-18 | Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information | Junshi Chen et.al. | 2301.07560 | null |
2023-01-17 | COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM | Manthan Patel et.al. | 2301.07147 | link |
2023-01-31 | Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems | Pierre-Yves Lajoie et.al. | 2301.06230 | link |
2023-01-13 | A LiDAR-Inertial-Visual SLAM System with Loop Detection | Kangcheng Liu et.al. | 2301.05604 | null |
2023-01-11 | AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization | Ying Chen et.al. | 2301.04620 | link |
2023-01-12 | TBV Radar SLAM – trust but verify loop candidates | Daniel Adolfsson et.al. | 2301.04397 | link |
2022-12-31 | Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges | Maxwell McManus et.al. | 2301.03359 | null |
2023-01-09 | Motion Addition and Motion Optimization | Liqun Qi et.al. | 2301.03174 | null |
2023-01-08 | Towards Open World NeRF-Based SLAM | Daniil Lisus et.al. | 2301.03102 | null |
2023-01-06 | CyberLoc: Towards Accurate Long-term Visual Localization | Liu Liu et.al. | 2301.02403 | null |
2023-01-03 | LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation | Shreyansh Daftry et.al. | 2301.01350 | null |
2022-12-31 | 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions | Patrick Wenzel et.al. | 2301.01147 | null |
2023-01-03 | BS3D: Building-scale 3D Reconstruction from RGB-D Images | Janne Mustaniemi et.al. | 2301.01057 | null |
2023-01-10 | An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping | Masoud Dayani Najafabadi et.al. | 2301.00618 | link |
2022-12-25 | A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion | Nadia Figueroa et.al. | 2212.14772 | null |
2022-12-29 | An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping | Kangcheng Liu et.al. | 2212.14209 | link |
2022-12-27 | Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands | Felipe Gómez-Cuba et.al. | 2212.13477 | link |
2022-12-26 | ESVIO: Event-based Stereo Visual Inertial Odometry | Peiyu Chen et.al. | 2212.13184 | link |
2022-12-24 | A Comprehensive Review on Autonomous Navigation | Saeid Nahavandi et.al. | 2212.12808 | null |
2022-12-23 | Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation | Marina Lotti et.al. | 2212.12388 | null |
2022-12-23 | Implementation of a Blind navigation method in outdoors/indoors areas | Mohammad Javadian Farzaneh et.al. | 2212.12185 | null |
2022-12-22 | S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations | Hriday Bavle et.al. | 2212.11770 | link |
2022-12-22 | Active SLAM: A Review On Last Decade | Muhammad Farhan Ahmed et.al. | 2212.11654 | null |
2022-12-27 | Motion, Unit Dual Quaternion and Motion Optimization | Liqun Qi et.al. | 2212.11593 | null |
2022-12-22 | Vision-Based Environmental Perception for Autonomous Driving | Fei Liu et.al. | 2212.11453 | null |
2022-12-19 | Mu $^{2}$ SLAM: Multitask, Multilingual Speech and Language Models | Yong Cheng et.al. | 2212.09553 | null |
2022-12-16 | Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments | Lasitha Weerakoon et.al. | 2212.08633 | null |
2022-12-16 | rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments | Bo Wei et.al. | 2212.08418 | null |
2023-03-02 | AirVO: An Illumination-Robust Point-Line Visual Odometry | Kuan Xu et.al. | 2212.07595 | link |
2022-12-14 | Autonomous Vehicle Navigation with LIDAR using Path Planning | Rahul M K et.al. | 2212.07155 | null |
2022-12-14 | RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping | Hyowon Kim et.al. | 2212.07141 | null |
2022-12-13 | Know What You Don’t Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version) | Daniil Lisus et.al. | 2212.06923 | null |
2022-12-13 | SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance | Chenyangguang Zhang et.al. | 2212.06524 | null |
2022-12-13 | Localization and Navigation System for Indoor Mobile Robot | Yanbaihui Liu et.al. | 2212.06391 | null |
2022-12-12 | Evaluation of RGB-D SLAM in Large Indoor Environments | Kirill Muravyev et.al. | 2212.05980 | null |
2022-12-19 | A Light-Weight LiDAR-Inertial SLAM System with Loop Closing | Kangcheng Liu et.al. | 2212.05743 | link |
2022-12-12 | An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds | Kangcheng Liu et.al. | 2212.05705 | link |
2022-12-09 | SLAM for Visually Impaired People: A Survey | Marziyeh Bamdad et.al. | 2212.04745 | null |
2022-12-09 | Ego-Body Pose Estimation via Ego-Head Pose Estimation | Jiaman Li et.al. | 2212.04636 | null |
2022-12-06 | Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles | Sushant Veer et.al. | 2212.03323 | link |
2022-12-06 | PRISM: Probabilistic Real-Time Inference in Spatial World Models | Atanas Mirchev et.al. | 2212.02988 | null |
2022-12-06 | RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps | Florian Sauerbeck et.al. | 2212.02085 | link |
2022-12-05 | DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization | Xuebo Tian et.al. | 2212.02077 | null |
2022-12-05 | ObjectMatch: Robust Registration using Canonical Object Correspondences | Can Gümeli et.al. | 2212.01985 | null |
2022-12-02 | Sparse SPN: Depth Completion from Sparse Keypoints | Yuqun Wu et.al. | 2212.00987 | null |
2022-12-01 | maplab 2.0 – A Modular and Multi-Modal Mapping Framework | Andrei Cramariuc et.al. | 2212.00654 | link |
2022-12-01 | AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body – Theory and Experiments | Mehregan Dor et.al. | 2212.00350 | null |
2022-11-30 | MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves | Pranjali Pathre et.al. | 2211.16882 | null |
2022-11-29 | PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images | Hartmut Surmann et.al. | 2211.16266 | link |
2022-11-29 | MmWave Mapping and SLAM for 5G and Beyond | Yu Ge et.al. | 2211.16024 | null |
2022-11-28 | Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map | Xi Zheng et.al. | 2211.15127 | null |
2022-11-29 | BALF: Simple and Efficient Blur Aware Local Feature Detector | Zhenjun Zhao et.al. | 2211.14731 | null |
2022-11-27 | Development of a Modular Real-time Shared-control System for a Smart Wheelchair | Vaishanth Ramaraj et.al. | 2211.14711 | null |
2022-11-26 | A1 SLAM: Quadruped SLAM using the A1’s Onboard Sensors | Jerred Chen et.al. | 2211.14432 | link |
2022-11-23 | ActiveRMAP: Radiance Field for Active Mapping And Planning | Huangying Zhan et.al. | 2211.12656 | null |
2022-11-22 | Vision-based localization methods under GPS-denied conditions | Zihao Lu et.al. | 2211.11988 | null |
2022-11-21 | Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques | David Ramirez et.al. | 2211.11836 | null |
2022-11-21 | ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari et.al. | 2211.11704 | null |
2022-11-24 | Data Fusion for Multipath-Based SLAM: Combing Information from Multiple Propagation Paths | Erik Leitinger et.al. | 2211.09241 | null |
2022-11-16 | Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery | Hao Qu et.al. | 2211.08904 | null |
2022-11-20 | Detecting Line Segments in Motion-blurred Images with Events | Huai Yu et.al. | 2211.07365 | link |
2022-11-13 | Automatic Eye-in-Hand Calibration using EKF | Aditya Ramakrishnan et.al. | 2211.06881 | null |
2022-11-12 | Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling | Zhihao Wang et.al. | 2211.06557 | link |
2022-11-11 | Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications | Jie Yang et.al. | 2211.05982 | null |
2022-11-10 | Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time | Ignacio Torroba et.al. | 2211.05601 | link |
2022-11-07 | When Geometry is not Enough: Using Reflector Markers in Lidar SLAM | Gerhard Kurz et.al. | 2211.03484 | null |
2022-11-07 | Detecting Invalid Map Merges in Lifelong SLAM | Matthias Holoch et.al. | 2211.03423 | null |
2022-11-06 | Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU | Yibin Wu et.al. | 2211.03174 | link |
2022-11-07 | Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments | Daniel Adolfsson et.al. | 2211.02445 | link |
2022-11-03 | DyOb-SLAM : Dynamic Object Tracking SLAM System | Rushmian Annoy Wadud et.al. | 2211.01941 | null |
2022-11-03 | Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM | Yang Chen et.al. | 2211.01749 | null |
2022-11-04 | $D^2$ SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm | Hao Xu et.al. | 2211.01538 | link |
2022-11-02 | Semantic SuperPoint: A Deep Semantic Descriptor | Gabriel S. Gama et.al. | 2211.01098 | link |
2022-11-02 | Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation | Myung-Hwan Jeon et.al. | 2211.00960 | link |
2022-10-31 | Mapping Extended Landmarks for Radar SLAM | Shuai Sun et.al. | 2210.17207 | null |
2022-10-25 | MAROAM: Map-based Radar SLAM through Two-step Feature Selection | Dequan Wang et.al. | 2210.13797 | null |
2022-10-25 | S3E: A Large-scale Multimodal Dataset for Collaborative SLAM | Dapeng Feng et.al. | 2210.13723 | link |
2022-10-24 | NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields | Antoni Rosinol et.al. | 2210.13641 | link |
2022-10-24 | Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging | Geng Wang et.al. | 2210.13556 | null |
2022-10-28 | VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points | Andreas Georgis et.al. | 2210.12756 | null |
2022-10-22 | SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation | Junliang Chen et.al. | 2210.12417 | null |
2022-10-21 | DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm | Shipeng Zhong et.al. | 2210.11978 | link |
2022-10-21 | Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments | Shubham Kedia et.al. | 2210.11652 | null |
2022-10-22 | Visual SLAM: What are the Current Trends and What to Expect? | Ali Tourani et.al. | 2210.10491 | null |
2022-10-18 | Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM | Geon Choi et.al. | 2210.09636 | null |
2022-10-16 | D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments | Ayman Beghdadi et.al. | 2210.08647 | null |
2022-10-16 | Indoor Smartphone SLAM with Learned Echoic Location Features | Wenjie Luo et.al. | 2210.08493 | null |
2022-10-15 | Self-Improving SLAM in Dynamic Environments: Learning When to Mask | Adrian Bojko et.al. | 2210.08350 | link |
2022-10-13 | Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems | Pushyami Kaveti et.al. | 2210.07315 | link |
2022-10-12 | RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map | Xuecheng Xu et.al. | 2210.05984 | link |
2022-10-11 | Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization | Yuanzheng He et.al. | 2210.05600 | null |
2022-10-11 | Autonomous Asteroid Characterization Through Nanosatellite Swarming | Kaitlin Dennison et.al. | 2210.05518 | null |
2022-10-11 | DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion | Yuxi Xiao et.al. | 2210.05517 | null |
2022-10-11 | Multi-Object Navigation with dynamically learned neural implicit representations | Pierre Marza et.al. | 2210.05129 | link |
2022-10-12 | Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation | Yulun Tian et.al. | 2210.05020 | null |
2022-10-10 | Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios | Xingyu Chen et.al. | 2210.04562 | null |
2022-10-09 | Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning | Ali Safa et.al. | 2210.04236 | null |
2022-10-06 | SCORE: A Second-Order Conic Initialization for Range-Aided SLAM | Alan Papalia et.al. | 2210.03177 | link |
2022-10-06 | Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding | Kirill Mazur et.al. | 2210.03043 | null |
2022-10-06 | Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence | Osian Morgan et.al. | 2210.02642 | null |
2022-10-05 | MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation | Hanwei Zhang et.al. | 2210.02038 | null |
2022-10-04 | O2S: Open-source open shuttle | Nwankwo Linus et.al. | 2210.01627 | null |
2022-10-04 | Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing | Weiying Wang et.al. | 2210.01320 | null |
2022-10-03 | Probabilistic Volumetric Fusion for Dense Monocular SLAM | Antoni Rosinol et.al. | 2210.01276 | null |
2022-10-03 | DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams | John McConnell et.al. | 2210.00867 | link |
2022-10-03 | A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments | Ha Sier et.al. | 2210.00812 | link |
2022-10-01 | Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2 | Ali Eslamian et.al. | 2210.00278 | null |
2022-09-30 | PyPose: A Library for Robot Learning with Physics-based Optimization | Chen Wang et.al. | 2209.15428 | link |
2022-09-29 | DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment | Mariia Gladkova et.al. | 2209.14965 | null |
2022-09-28 | Robust Incremental Smoothing and Mapping (riSAM) | Daniel McGann et.al. | 2209.14359 | null |
2022-09-27 | Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping | Chi-Ming Chung et.al. | 2209.13274 | link |
2022-09-24 | Graph Neural Networks for Multi-Robot Active Information Acquisition | Mariliza Tzes et.al. | 2209.12091 | null |
2022-09-24 | Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes | Jonathan J. Y. Kim et.al. | 2209.11894 | null |
2022-09-23 | involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs | Gilad Rotman et.al. | 2209.11591 | null |
2022-09-23 | Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot | David Balaban et.al. | 2209.11432 | null |
2022-09-22 | SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation | Xiao Han et.al. | 2209.10817 | null |
2022-09-22 | Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio | Wenhao Qiu et.al. | 2209.10726 | null |
2022-09-21 | Visual Localization and Mapping in Dynamic and Changing Environments | João Carlos Virgolino Soares et.al. | 2209.10710 | null |
2022-09-20 | Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM | Sabir Hossain et.al. | 2209.10047 | null |
2022-09-20 | WGICP: Differentiable Weighted GICP-Based Lidar Odometry | Sanghyun Son et.al. | 2209.09777 | null |
2022-09-20 | PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention | José Arce et.al. | 2209.09699 | link |
2022-09-19 | MeSLAM: Memory Efficient SLAM based on Neural Fields | Evgenii Kruzhkov et.al. | 2209.09357 | null |
2022-09-19 | LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM | Letian Zhang et.al. | 2209.08810 | null |
2022-09-18 | HGI-SLAM: Loop Closure With Human and Geometric Importance Features | Shuhul Mujoo et.al. | 2209.08608 | null |
2022-09-18 | Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM | Jiarui Tan et.al. | 2209.08578 | link |
2022-09-17 | DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments | Shihao Shen et.al. | 2209.08430 | link |
2022-09-17 | OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM | Matthieu Zins et.al. | 2209.08338 | null |
2022-09-17 | PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments | Adam Dai et.al. | 2209.08248 | link |
2022-09-16 | ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM | Aditya Arun et.al. | 2209.08091 | null |
2022-09-16 | iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking | Yuhang Ming et.al. | 2209.07919 | null |
2022-09-16 | TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM | Mathieu Gonzalez et.al. | 2209.07888 | null |
2022-09-15 | Landmark Management in the Application of Radar SLAM | Shuai Sun et.al. | 2209.07199 | link |
2022-09-15 | PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization | Xianwei Meng et.al. | 2209.07061 | null |
2022-09-14 | Semantic Visual Simultaneous Localization and Mapping: A Survey | Kaiqi Chen et.al. | 2209.06428 | null |
2022-09-13 | Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets | Islam Ali et.al. | 2209.06316 | null |
2022-09-12 | A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding | Tin Lai et.al. | 2209.05222 | null |
2022-09-12 | Attitude-Guided Loop Closure for Cameras with Negative Plane | Ze Wang et.al. | 2209.05167 | link |
2022-09-09 | General Place Recognition Survey: Towards the Real-world Autonomy Age | Peng Yin et.al. | 2209.04497 | link |
2022-09-08 | ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology | Julio A. Placed et.al. | 2209.03693 | link |
2022-09-08 | R $^3$ LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator | Jiarong Lin et.al. | 2209.03666 | link |
2022-09-06 | Group- $k$ Consistent Measurement Set Maximization for Robust Outlier Detection | Brendon Forsgren et.al. | 2209.02658 | link |
2022-09-05 | Neuromorphic Visual Odometry with Resonator Networks | Alpha Renner et.al. | 2209.02000 | null |
2022-09-05 | MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM | Pavel Karpyshev et.al. | 2209.01936 | null |
2022-09-05 | ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics | Boyi Liu et.al. | 2209.01774 | null |
2022-09-04 | CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud | Evgeny Yudin et.al. | 2209.01605 | null |
2022-08-31 | PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM | Yifan Duan et.al. | 2208.14848 | null |
2022-08-30 | BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition | Peng Yin et.al. | 2208.14543 | null |
2022-08-27 | Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes | Ali Safa et.al. | 2208.12997 | null |
2022-08-25 | FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms | Jianhao Jiao et.al. | 2208.11865 | null |
2022-08-25 | Lidar SLAM for Autonomous Driving Vehicles | Farhad Aghili et.al. | 2208.11855 | null |
2022-08-24 | DynaVINS: A Visual-Inertial SLAM for Dynamic Environments | Seungwon Song et.al. | 2208.11500 | link |
2022-08-22 | Doppler Exploitation in Bistatic mmWave Radio SLAM | Yu Ge et.al. | 2208.10204 | null |
2022-08-21 | Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping | Lintong Zhang et.al. | 2208.09825 | link |
2022-08-26 | JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario | Longrui Dong et.al. | 2208.09777 | null |
2022-08-15 | BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM | Yunge Cui et.al. | 2208.07473 | link |
2022-08-12 | Handling Constrained Optimization in Factor Graphs for Autonomous Navigation | Barbara Bazzana et.al. | 2208.06325 | null |
2022-08-11 | RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild | Jason Y. Zhang et.al. | 2208.05963 | null |
2022-08-08 | Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation | Yifei Ren et.al. | 2208.04274 | link |
2022-08-08 | SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty | Shuai Zhang et.al. | 2208.03945 | link |
2022-08-05 | A Survey on Visual Map Localization Using LiDARs and Cameras | Elhousni Mahdi et.al. | 2208.03376 | null |
2022-08-04 | SROS2: Usable Cyber Security Tools for ROS 2 | Victor Mayoral Vilches et.al. | 2208.02615 | link |
2022-08-03 | Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms | Bharath Garigipati et.al. | 2208.02063 | null |
2022-08-02 | Present and Future of SLAM in Extreme Underground Environments | Kamak Ebadi et.al. | 2208.01787 | null |
2022-08-01 | Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion | Simon Boche et.al. | 2208.00709 | null |
2022-07-29 | Neural Density-Distance Fields | Itsuki Ueda et.al. | 2207.14455 | link |
2022-07-25 | DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions | Tristan Laidlow et.al. | 2207.12244 | null |
2022-07-25 | Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration | Kenji Koide et.al. | 2207.11942 | null |
2022-07-22 | NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction | Yunlong Ran et.al. | 2207.10985 | null |
2022-07-22 | Dense RGB-D-Inertial SLAM with Map Deformations | Tristan Laidlow et.al. | 2207.10940 | null |
2022-07-22 | PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes | BaoSheng Zhang et.al. | 2207.10916 | null |
2022-07-21 | Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion | Suman Ghosh et.al. | 2207.10494 | link |
2022-07-21 | Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions | Quentin Serdel et.al. | 2207.10489 | link |
2022-07-21 | On applicability of von Karman’s momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity | Yujin Lu et.al. | 2207.10413 | null |
2022-07-19 | Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM | Tuvy Lemberg et.al. | 2207.09103 | null |
2022-07-18 | DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM | Weicai Ye et.al. | 2207.08794 | link |
2022-07-18 | Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction | Marco Orsingher et.al. | 2207.08439 | null |
2022-07-18 | ORB-based SLAM accelerator on SoC FPGA | Vibhakar Vemulapati et.al. | 2207.08405 | null |
2022-07-14 | Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset | Riccardo Giubilato et.al. | 2207.06815 | null |
2022-07-14 | Semi-supervised Vector-Quantization in Visual SLAM using HGCN | Amir Zarringhalam et.al. | 2207.06738 | null |
2022-07-14 | Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders | Amir Zarringhalam et.al. | 2207.06732 | null |
2022-07-13 | SLAM: SLO-Aware Memory Optimization for Serverless Applications | Gor Safaryan et.al. | 2207.06183 | null |
2022-07-19 | Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras | Fangwen Shu et.al. | 2207.06058 | link |
2022-07-12 | Accelerating Certifiable Estimation with Preconditioned Eigensolvers | David M. Rosen et.al. | 2207.05257 | null |
2022-07-12 | Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features | Meiyu Zhi et.al. | 2207.05244 | null |
2022-07-14 | SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial | Chih-Yuan Chiu et.al. | 2207.05043 | null |
2022-07-08 | BlindSpotNet: Seeing Where We Cannot See | Taichi Fukuda et.al. | 2207.03870 | null |
2022-07-08 | Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints | Philipp Glira et.al. | 2207.03785 | null |
2022-07-08 | Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements | Ran Liu et.al. | 2207.03700 | null |
2022-07-07 | RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments | Qihao Peng et.al. | 2207.03539 | null |
2022-07-06 | VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization | Marius Laska et.al. | 2207.02668 | null |
2022-07-06 | A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models | Axel Garcia-Vega et.al. | 2207.02396 | null |
2022-07-04 | VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM | Ling Gao et.al. | 2207.01404 | null |
2022-07-04 | VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM | Danpeng Chen et.al. | 2207.01158 | null |
2022-07-03 | Wireless Channel Prediction in Partially Observed Environments | Mingsheng Yin et.al. | 2207.00934 | null |
2022-07-01 | A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers | Julio A. Placed et.al. | 2207.00254 | null |
2022-07-01 | Keeping Less is More: Point Sparsification for Visual SLAM | Yeonsoo Park et.al. | 2207.00225 | null |
2022-06-30 | Controlled and impulsive compression of an entrapped air bubble during impact | Utkarsh Jain et.al. | 2206.15297 | null |
2022-06-30 | Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery | Yuehao Wang et.al. | 2206.15255 | link |
2022-06-27 | IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments | Abanob Soliman et.al. | 2206.13455 | link |
2022-06-26 | An Efficient Global Optimality Certificate for Landmark-Based SLAM | Connor Holmes et.al. | 2206.12961 | link |
2022-06-21 | Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping | Davide Tateo et.al. | 2206.10263 | link |
2022-06-20 | Data Fusion for Radio Frequency SLAM with Robust Sampling | Erik Leitinger et.al. | 2206.09746 | null |
2022-06-19 | RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments | Chenglong Qian et.al. | 2206.09463 | null |
2022-06-17 | Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments | Khairuldanial Ismail et.al. | 2206.08733 | null |
2022-06-17 | An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions | Yijun Yuan et.al. | 2206.08712 | link |
2022-06-13 | ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy | Hao Bai et.al. | 2206.06435 | null |
2022-06-10 | Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming | Javier Cremona et.al. | 2206.05066 | link |
2022-06-09 | SparseFormer: Attention-based Depth Completion Network | Frederik Warburg et.al. | 2206.04557 | null |
2022-06-07 | Robot Self-Calibration Using Actuated 3D Sensors | Arne Peters et.al. | 2206.03430 | null |
2022-06-07 | Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map | Haodong Yuan et.al. | 2206.03062 | null |
2022-06-05 | DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions | Alena Savinykh et.al. | 2206.02199 | null |
2022-06-04 | C $^3$ Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy | Erez Posner et.al. | 2206.01961 | null |
2022-06-01 | PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry | Dong-Uk Seo et.al. | 2206.00266 | link |
2022-05-27 | A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching | Arno Solin et.al. | 2205.13821 | null |
2022-05-31 | LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments | Yun Chang et.al. | 2205.13135 | link |
2022-05-25 | Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM | Milad Ramezani et.al. | 2205.12595 | null |
2022-05-24 | Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM | Christopher E. Denniston et.al. | 2205.12402 | link |
2022-05-22 | ALITA: A Large-scale Incremental Dataset for Long-term Autonomy | Peng Yin et.al. | 2205.10737 | link |
2022-05-19 | FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2 | Jeffrey Ichnowski et.al. | 2205.09778 | link |
2022-05-17 | Global Data Association for SLAM with 3D Grassmannian Manifold Objects | Parker C. Lusk et.al. | 2205.08556 | null |
2022-05-19 | Cluster on Wheels | Yuanyuan Yang et.al. | 2205.08151 | null |
2022-05-12 | Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry | Shihao Shen et.al. | 2205.05916 | link |
2022-05-12 | S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization | Ran Cheng et.al. | 2205.05861 | null |
2022-05-14 | Multi-modal Semantic SLAM for Complex Dynamic Environments | Han Wang et.al. | 2205.04300 | link |
2022-05-06 | OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations | Carmen Delgado et.al. | 2205.03256 | null |
2022-05-05 | CNN-Augmented Visual-Inertial SLAM with Planar Constraints | Pan Ji et.al. | 2205.02940 | null |
2022-05-05 | PMBM-based SLAM Filters in 5G mmWave Vehicular Networks | Hyowon Kim et.al. | 2205.02502 | null |
2022-05-04 | BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking | Dorian Henning et.al. | 2205.02301 | null |
2022-05-04 | A Global Asymptotic Convergent Observer for SLAM | Seyed Hamed Hashemi et.al. | 2205.01953 | null |
2022-05-04 | Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation | Nathaniel Merrill et.al. | 2205.01823 | link |
2022-05-03 | GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping | Pan Ji et.al. | 2205.01656 | null |
2022-04-29 | Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM | Jinwoo Jeon et.al. | 2204.13877 | link |
2022-04-27 | The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection | Konstantinos A. Tsintotas et.al. | 2204.12831 | null |
2022-04-27 | Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment | Wenyu Li et.al. | 2204.12769 | null |
2022-04-29 | MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment | Tingchen Ma et.al. | 2204.11621 | null |
2022-04-23 | Indoor simultaneous localization and mapping based on fringe projection profilometry | Yang Zhao et.al. | 2204.11020 | null |
2022-04-22 | Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria | Julio A. Placed et.al. | 2204.10631 | null |
2022-04-22 | Fast Autonomous Robotic Exploration Using the Underlying Graph Structure | Julio A. Placed et.al. | 2204.10610 | null |
2022-04-22 | Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions | Yutong Hu et.al. | 2204.10552 | null |
2022-04-22 | Implicit Object Mapping With Noisy Data | Jad Abou-Chakra et.al. | 2204.10516 | link |
2022-04-19 | Photometric single-view dense 3D reconstruction in endoscopy | Victor M. Batlle et.al. | 2204.09083 | null |
2022-04-18 | Pulsar skips: Understanding variations in the regular periods of rotating neutron stars | Clayton Miller et.al. | 2204.08449 | null |
2022-04-18 | Tracking monocular camera pose and deformation for SLAM inside the human body | Juan J. Gomez Rodriguez et.al. | 2204.08309 | null |
2022-04-18 | Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker | Hanjing Ye et.al. | 2204.08163 | null |
2022-04-14 | ViViD++: Vision for Visibility Dataset | Alex Junho Lee et.al. | 2204.06183 | null |
2022-04-12 | HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud | Zhixing Hou et.al. | 2204.05481 | null |
2022-04-12 | RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room | Cong Gao et.al. | 2204.05467 | null |
2022-04-11 | Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context | Lizhou Liao et.al. | 2204.04932 | link |
2022-04-04 | Monitoring social distancing with single image depth estimation | Alessio Mingozzi et.al. | 2204.01693 | null |
2022-04-01 | Bi-directional Loop Closure for Visual SLAM | Ihtisham Ali et.al. | 2204.01524 | null |
2022-04-04 | IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers | Lei Sun et.al. | 2204.01324 | link |
2022-04-03 | Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor | Wenyan Ou et.al. | 2204.01154 | null |
2022-04-02 | UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps | Ayyappa Swamy Thatavarthy et.al. | 2204.00865 | link |
2022-03-31 | Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects | Yujie Lu et.al. | 2204.00035 | null |
2022-03-30 | GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios | Chih-Yuan Chiu et.al. | 2203.16690 | null |
2022-03-29 | Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field | Mostafa Osman et.al. | 2203.15866 | null |
2022-03-29 | Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform | Mingjun Li et.al. | 2203.15439 | null |
2022-03-29 | Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots | Pranay Mathur et.al. | 2203.15272 | null |
2022-03-28 | Are High-Resolution Event Cameras Really Needed? | Daniel Gehrig et.al. | 2203.14672 | null |
2022-03-25 | Spectral Measurement Sparsification for Pose-Graph SLAM | Kevin J. Doherty et.al. | 2203.13897 | link |
2022-03-25 | FD-SLAM: 3-D Reconstruction Using Features and Dense Matching | Xingrui Yang et.al. | 2203.13861 | null |
2022-03-25 | Gravity-constrained point cloud registration | Vladimír Kubelka et.al. | 2203.13799 | null |
2022-03-24 | MD-SLAM: Multi-cue Direct SLAM | Luca Di Giammarino et.al. | 2203.13237 | link |
2022-03-24 | Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video | Shun Taguchi et.al. | 2203.12804 | null |
2022-03-19 | Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems | Jie Yang et.al. | 2203.10267 | null |
2022-03-16 | Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR | Ian D. Miller et.al. | 2203.08925 | link |
2022-03-15 | Neural RF SLAM for unsupervised positioning and mapping with channel state information | Shreya Kadambi et.al. | 2203.08264 | null |
2022-03-15 | Simultaneous Localisation and Mapping with Quadric Surfaces | Tristan Laidlow et.al. | 2203.08040 | null |
2022-03-14 | Drift Reduced Navigation with Deep Explainable Features | Mohd Omama et.al. | 2203.06897 | link |
2022-03-11 | An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs | Keisuke Sugiura et.al. | 2203.05763 | null |
2022-03-10 | High Definition, Inexpensive, Underwater Mapping | Bharat Joshi et.al. | 2203.05640 | link |
2022-03-10 | SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning | Jaehoon Choi et.al. | 2203.05332 | null |
2022-03-08 | Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM | Pierre-Yves Lajoie et.al. | 2203.04446 | link |
2022-03-08 | SLAM-Supported Self-Training for 6D Object Pose Estimation | Ziqi Lu et.al. | 2203.04424 | link |
2022-03-08 | An Online Semantic Mapping System for Extending and Enhancing Visual SLAM | Thorsten Hempel et.al. | 2203.03944 | null |
2022-03-07 | Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms | Qingqing Li et.al. | 2203.03454 | link |
2022-03-07 | OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition | Junyi Ma et.al. | 2203.03397 | link |
2022-03-06 | Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM | Kazushi Aiba et.al. | 2203.02887 | null |
2022-03-06 | RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects | Ran Long et.al. | 2203.02882 | null |
2022-03-03 | STUN: Self-Teaching Uncertainty Estimation for Place Recognition | Kaiwen Cai et.al. | 2203.01851 | link |
2022-03-03 | Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning | Niclas Vödisch et.al. | 2203.01578 | link |
2022-03-02 | FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2203.00893 | link |
2022-03-02 | Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation | Yulun Tian et.al. | 2203.00851 | null |
2022-03-01 | Descriptellation: Deep Learned Constellation Descriptors for SLAM | Chunwei Xing et.al. | 2203.00567 | null |
2022-03-01 | Collaborative Robot Mapping using Spectral Graph Analysis | Lukas Bernreiter et.al. | 2203.00308 | null |
2022-02-26 | RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization | Nikolaos Kourtzanidis et.al. | 2202.13221 | link |
2022-02-25 | Probabilistic Data Association for Semantic SLAM at Scale | Elad Michael et.al. | 2202.12802 | link |
2022-02-24 | TwistSLAM: Constrained SLAM in Dynamic Environment | Mathieu Gonzalez et.al. | 2202.12384 | null |
2022-02-24 | Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion | Hyeonsoo Jang et.al. | 2202.12108 | null |
2022-02-23 | MITI: SLAM Benchmark for Laparoscopic Surgery | Regine Hartwig et.al. | 2202.11496 | null |
2022-02-23 | DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization | Xuebo Tian et.al. | 2202.11431 | null |
2022-02-23 | Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets | Islam Ali et.al. | 2202.11312 | null |
2022-02-22 | SAGE: SLAM with Appearance and Geometry Prior for Endoscopy | Xingtong Liu et.al. | 2202.09487 | link |
2022-02-18 | OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure | Stefan Leutenegger et.al. | 2202.09199 | null |
2022-02-18 | MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery | Ahmad Khaliq et.al. | 2202.09146 | link |
2022-02-18 | An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems | Qiang Liu et.al. | 2202.08952 | null |
2022-02-17 | Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study | Giovanni Cioffi et.al. | 2202.08894 | link |
2022-02-17 | LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building | Jiashi Zhang et.al. | 2202.08487 | null |
2022-02-16 | Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments | Jinkun Wang et.al. | 2202.08359 | null |
2022-02-11 | Overhead Image Factors for Underwater Sonar-based SLAM | John McConnell et.al. | 2202.05811 | null |
2022-02-10 | Scale Estimation with Dual Quadrics for Monocular Object SLAM | Shuangfu Song et.al. | 2202.04816 | null |
2022-02-08 | A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition | Nie Jiwei et.al. | 2202.03677 | null |
2022-01-25 | Autonomous Vehicles: Open-Source Technologies, Considerations, and Development | Oussama Saoudi et.al. | 2202.03148 | null |
2022-02-07 | Temporal Point Cloud Completion with Pose Disturbance | Jieqi Shi et.al. | 2202.03084 | null |
2022-02-04 | DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments | Xinggang Hu et.al. | 2202.01938 | null |
2022-02-01 | A Model for Multi-View Residual Covariances based on Perspective Deformation | Alejandro Fontan et.al. | 2202.00765 | null |
2022-01-30 | Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM | Xinghe Chu et.al. | 2201.12726 | null |
2022-01-28 | RGB-D SLAM Using Attention Guided Frame Association | Ali Caglayan et.al. | 2201.12047 | null |
2022-02-04 | Learning to Act with Affordance-Aware Multimodal Neural SLAM | Zhiwei Jia et.al. | 2201.09862 | link |
2022-01-22 | Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems | Xi Zheng et.al. | 2201.09048 | link |
2022-01-17 | SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System | Giseop Kim et.al. | 2201.06423 | null |
2022-01-14 | SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions | Ali Samadzadeh et.al. | 2201.05386 | link |
2022-01-19 | Multi-Hypothesis Scan Matching through Clustering | Giorgio Iavicoli et.al. | 2201.03814 | null |
2022-01-11 | Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM | Kevin J. Doherty et.al. | 2201.03773 | null |
2022-01-10 | High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM | Brian M. Hopkinson et.al. | 2201.03364 | link |
2022-01-10 | Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition | M. Usman Maqbool Bhutta et.al. | 2201.03212 | link |
2022-01-04 | Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds | Xueliang Wen et.al. | 2201.00959 | null |
2021-12-29 | Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic | Khen Elimelech et.al. | 2112.14428 | null |
2021-12-19 | M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots | Jie Yin et.al. | 2112.13659 | link |
2021-12-27 | UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping | Hyunjun Lim et.al. | 2112.13515 | link |
2021-12-25 | Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs | Yusheng Wang et.al. | 2112.13224 | null |
2021-12-25 | Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping | Peng Huang et.al. | 2112.13222 | null |
2021-12-24 | 3D Point Cloud Reconstruction and SLAM as an Input | Ziyu Li et.al. | 2112.12907 | null |
2021-12-22 | NICE-SLAM: Neural Implicit Scalable Encoding for SLAM | Zihan Zhu et.al. | 2112.12130 | link |
2021-12-18 | Fast and Robust Registration of Partially Overlapping Point Clouds | Eduardo Arnold et.al. | 2112.09922 | link |
2021-12-17 | Symmetry-aware Neural Architecture for Embodied Visual Navigation | Shuang Liu et.al. | 2112.09515 | null |
2021-12-27 | Homography Decomposition Networks for Planar Object Tracking | Xinrui Zhan et.al. | 2112.07909 | link |
2021-12-14 | Autonomous Navigation System from Simultaneous Localization and Mapping | Micheal Caracciolo et.al. | 2112.07723 | link |
2021-12-12 | 360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation | Bolivar Solarte et.al. | 2112.06180 | link |
2021-12-11 | Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization | Amay Saxena et.al. | 2112.05921 | null |
2021-12-07 | Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems | Gideon Billings et.al. | 2112.03826 | link |
2021-12-05 | Iterated Posterior Linearization PMB Filter for 5G SLAM | Yu Ge et.al. | 2112.02575 | null |
2021-12-03 | Fast Direct Stereo Visual SLAM | Jiawei Mo et.al. | 2112.01890 | link |
2021-12-02 | MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment | Jie Ren et.al. | 2112.01349 | link |
2021-12-01 | Research on Event Accumulator Settings for Event-Based SLAM | Kun Xiao et.al. | 2112.00427 | link |
2021-11-29 | An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments | Assem Sadek et.al. | 2111.14666 | null |
2021-11-29 | Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report | Hartmut Surmann et.al. | 2111.14542 | null |
2021-11-24 | Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment | V. Ayala-Alfaro et.al. | 2111.12690 | null |
2021-11-24 | Autonomous bot with ML-based reactive navigation for indoor environment | Yash Srivastava et.al. | 2111.12542 | null |
2021-11-22 | A General Framework for Lifelong Localization and Mapping in Changing Environment | Min Zhao et.al. | 2111.10946 | link |
2021-11-17 | Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network | Xiaoming Zhao et.al. | 2111.09006 | null |
2021-11-10 | Comparing dominance of tennis’ big three via multiple-output Bayesian quantile regression models | Bruno Santos et.al. | 2111.05631 | null |
2021-11-10 | TomoSLAM: factor graph optimization for rotation angle refinement in microtomography | Mark Griguletskii et.al. | 2111.05562 | null |
2021-11-07 | Hierarchical Segment-based Optimization for SLAM | Yuxin Tian et.al. | 2111.04101 | null |
2021-11-07 | Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM | Shing Yan Loo et.al. | 2111.04096 | null |
2021-11-05 | MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry | Joan P. Company-Corcoles et.al. | 2111.03408 | null |
2021-10-31 | Loop closure detection using local 3D deep descriptors | Youjie Zhou et.al. | 2111.00440 | link |
2021-10-27 | Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification | Mingsheng Yin et.al. | 2110.14789 | link |
2021-10-27 | Efficient Placard Discovery for Semantic Mapping During Frontier Exploration | David Balaban et.al. | 2110.14742 | null |
2021-10-26 | Robust Multi-view Registration of Point Sets with Laplacian Mixture Model | Jin Zhang et.al. | 2110.13744 | null |
2021-10-25 | WOLF: A modular estimation framework for robotics based on factor graphs | Joan Sola et.al. | 2110.12919 | null |
2021-10-21 | Real-Time Ground-Plane Refined LiDAR SLAM | Fan Yang et.al. | 2110.11517 | null |
2021-10-21 | SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words | Jonathan J. Y. Kim et.al. | 2110.11491 | null |
2021-10-21 | InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion | Zhenkun Zhu et.al. | 2110.11040 | null |
2021-10-20 | SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training | Ankur Bapna et.al. | 2110.10329 | null |
2021-10-18 | Enhancing exploration algorithms for navigation with visual SLAM | Kirill Muravyev et.al. | 2110.09156 | null |
2021-10-18 | Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment | Rui Tian et.al. | 2110.08977 | null |
2021-10-16 | Partial Hierarchical Pose Graph Optimization for SLAM | Alexander Korovko et.al. | 2110.08639 | null |
2021-10-14 | Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach | Shumon Koga et.al. | 2110.07546 | null |
2021-10-13 | Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity | Ran Liu et.al. | 2110.06541 | null |
2021-10-12 | Learning Efficient Multi-Agent Cooperative Visual Exploration | Chao Yu et.al. | 2110.05734 | null |
2021-10-07 | Self-Supervised Depth Completion for Active Stereo | Frederik Warburg et.al. | 2110.03234 | null |
2021-10-06 | InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes | Zhenkun Zhu et.al. | 2110.02593 | null |
2021-10-03 | AEROS: Adaptive RObust least-Squares for Graph-Based SLAM | Milad Ramezani et.al. | 2110.02018 | null |
2021-10-04 | Fast Uncertainty Quantification for Active Graph SLAM | Julio A. Placed et.al. | 2110.01289 | link |
2021-10-04 | Geometry-based Graph Pruning for Lifelong SLAM | Gerhard Kurz et.al. | 2110.01286 | null |
2021-10-03 | Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration | Marcus Greiff et.al. | 2110.01099 | null |
2021-10-02 | Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows | Qiangqiang Huang et.al. | 2110.00876 | link |
SFM
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-14 | Supporting SENĆOTEN Language Documentation Efforts with Automatic Speech Recognition | Mengzhe Geng et.al. | 2507.10827 | null |
2025-07-11 | Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT | Wei Zhang et.al. | 2507.08448 | null |
2025-07-04 | MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion | Peilin Tao et.al. | 2507.03306 | null |
2025-06-30 | Towards Initialization-free Calibrated Bundle Adjustment | Carl Olsson et.al. | 2506.23808 | null |
2025-06-30 | AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention | Ziao Liu et.al. | 2506.23611 | null |
2025-06-27 | Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras | Petr Hruby et.al. | 2506.22069 | null |
2025-06-24 | ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes | Chenhao Zhang et.al. | 2506.21629 | null |
2025-07-08 | Wild refitting for black box prediction | Martin J. Wainwright et.al. | 2506.21460 | null |
2025-06-24 | Experimental Assessment of Neural 3D Reconstruction for Small UAV-based Applications | Genís Castillo Gómez-Raya et.al. | 2506.19491 | null |
2025-06-23 | ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs | Michal Nazarczuk et.al. | 2506.18792 | null |
2025-06-23 | Room temperature spin injection into commercial VCSELs at non-resonant wavelengths | Timur Almabetov et.al. | 2506.18376 | null |
2025-06-11 | OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary | Yui Sudo et.al. | 2506.09448 | null |
2025-06-06 | SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction | Yuchao Zheng et.al. | 2506.05935 | null |
2025-06-05 | On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images | Andreas Meuleman et.al. | 2506.05558 | null |
2025-06-05 | SupeRANSAC: One RANSAC to Rule Them All | Daniel Barath et.al. | 2506.04803 | link |
2025-06-04 | Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation | Tianyu Huang et.al. | 2506.04225 | null |
2025-06-04 | Accelerating SfM-based Pose Estimation with Dominating Set | Joji Joseph et.al. | 2506.03667 | null |
2025-06-03 | Nearby dwarf galaxies with extreme star formation rates: a window into dwarf-galaxy evolution in the early Universe | S. Kaviraj et.al. | 2506.03265 | null |
2025-06-02 | Fast and Robust Rotation Averaging with Anisotropic Coordinate Descent | Yaroslava Lochman et.al. | 2506.01940 | null |
2025-06-03 | Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC | Qingzheng Wang et.al. | 2505.24200 | null |
2025-05-29 | Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping | Justin Lazarow et.al. | 2505.23756 | null |
2025-05-30 | FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian | Sara Papi et.al. | 2505.22759 | link |
2025-05-28 | UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images | Junhuan Liu et.al. | 2505.22098 | null |
2025-05-28 | Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule | San Jiang et.al. | 2505.22089 | null |
2025-05-30 | Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations | Whenty Ariyanti et.al. | 2505.21356 | null |
2025-05-27 | Intern-GS: Vision Model Guided Sparse-View 3D Gaussian Splatting | Xiangyu Sun et.al. | 2505.20729 | null |
2025-05-26 | Robust fine-tuning of speech recognition models via model merging: application to disordered speech | Alexandre Ducorroy et.al. | 2505.20477 | null |
2025-05-29 | Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud | Natsuki Takama et.al. | 2505.19854 | null |
2025-05-25 | Improving Novel view synthesis of 360 $^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images | Guangan Chen et.al. | 2505.19264 | link |
2025-05-24 | Token-Level Logits Matter: A Closer Look at Speech Foundation Models for Ambiguous Emotion Recognition | Jule Valendo Halim et.al. | 2505.18484 | null |
2025-05-23 | To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models | Simone Gaisbauer et.al. | 2505.17973 | link |
2025-05-23 | Corporate Needs You to Find the Difference: Revisiting Submodular and Supermodular Ratio Optimization Problems | Elfarouk Harb et.al. | 2505.17443 | link |
2025-05-23 | Tracking the Flight: Exploring a Computational Framework for Analyzing Escape Responses in Plains Zebra (Equus quagga) | Isla Duporge et.al. | 2505.16882 | link |
2025-05-21 | A Taxonomy of Structure from Motion Methods | Federica Arrigoni et.al. | 2505.15814 | null |
2025-05-18 | Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis | Dong Yang et.al. | 2505.12226 | null |
2025-05-15 | Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis | Francisco Raverta Capua et.al. | 2505.10751 | link |
2025-05-13 | Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People | Haoshuai Zhou et.al. | 2505.08215 | null |
2025-05-12 | RDD: Robust Feature Detector and Descriptor using Deformable Transformer | Gonglin Chen et.al. | 2505.08013 | null |
2025-05-12 | Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild | Lintao Xiang et.al. | 2505.07373 | null |
2025-05-11 | Symmetry in Fundamental Parameters of Galaxies on the Star-forming Main Sequence | Zhicheng He et.al. | 2505.06868 | null |
2025-05-10 | TPK: Trustworthy Trajectory Prediction Integrating Prior Knowledge For Interpretability and Kinematic Feasibility | Marius Baden et.al. | 2505.06743 | null |
2025-05-08 | DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion | Qitao Zhao et.al. | 2505.05473 | null |
2025-05-20 | FastMap: Revisiting Dense and Scalable Structure from Motion | Jiahao Li et.al. | 2505.04612 | link |
2025-05-15 | Estimating the Diameter at Breast Height of Trees in a Forest With a Single 360 Camera | Siming He et.al. | 2505.03093 | null |
2025-05-03 | AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting | Junhao Shi et.al. | 2505.01799 | null |
2025-05-03 | PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth | Bu Jin et.al. | 2505.01729 | null |
2025-05-01 | Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation? | Viktor Kocur et.al. | 2505.00866 | link |
2025-04-29 | Large-scale visual SLAM for in-the-wild videos | Shuo Sun et.al. | 2504.20496 | null |
2025-04-29 | Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views | Jiang Wu et.al. | 2504.20378 | link |
2025-04-28 | MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion | Zador Pataki et.al. | 2504.20040 | link |
2025-04-24 | Dynamic Camera Poses and Where to Find Them | Chris Rockwell et.al. | 2504.17788 | null |
2025-04-24 | EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy | Haodi Yao et.al. | 2504.17280 | null |
2025-04-23 | A Low-Cost Photogrammetry System for 3D Plant Modeling and Phenotyping | Joe Hrzich et.al. | 2504.16840 | null |
2025-04-23 | PRaDA: Projective Radial Distortion Averaging | Daniil Sinitsyn et.al. | 2504.16499 | null |
2025-04-21 | Traversing the Star-Forming Main Sequence with Molecular Gas Stacks of z~1.6 Cluster Galaxies | Alex Pigarelli et.al. | 2504.15381 | null |
2025-04-21 | Towards Understanding Camera Motions in Any Video | Zhiqiu Lin et.al. | 2504.15376 | null |
2025-04-21 | StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models | Yeona Hong et.al. | 2504.14915 | null |
2025-04-17 | Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering | Landon Dyken et.al. | 2504.13339 | null |
2025-04-15 | EDGS: Eliminating Densification for Efficient Convergence of 3DGS | Dmytro Kotovenko et.al. | 2504.13204 | null |
2025-04-15 | Deep Learning-based Bathymetry Retrieval without In-situ Depths using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps | Panagiotis Agrafiotis et.al. | 2504.11416 | link |
2025-04-12 | A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds | Jizong Peng et.al. | 2504.09129 | null |
2025-04-11 | Stereophotoclinometry Revisited | Travis Driver et.al. | 2504.08252 | null |
2025-04-08 | Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring | José A. Pilartes-Congo et.al. | 2504.06464 | null |
2025-04-07 | Decoding the variability in the star-formation histories of z ~ 0.8 galaxies | Jenny T. Wan et.al. | 2504.05281 | null |
2025-04-05 | 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS | Zhisheng Huang et.al. | 2504.04294 | null |
2025-04-04 | An Algebraic Geometry Approach to Viewing Graph Solvability | Federica Arrigoni et.al. | 2504.03637 | null |
2025-04-04 | Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video | Jiaxin Guo et.al. | 2504.03198 | null |
2025-04-03 | Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation | Feng Gao et.al. | 2504.02647 | link |
2025-04-09 | FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking | Ulas Gunes et.al. | 2504.01732 | null |
2025-03-31 | LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors | Han Zhou et.al. | 2504.00219 | null |
2025-03-30 | AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos | Felix Wimbauer et.al. | 2503.23282 | link |
2025-03-24 | Ground Penetrating Radar-Assisted Multimodal Robot Odometry Using Subsurface Feature Matrix | Haifeng Li et.al. | 2503.18301 | null |
2025-03-22 | 3D Modeling: Camera Movement Estimation and path Correction for SFM Model using the Combination of Modified A-SIFT and Stereo System | Usha Kumari et.al. | 2503.17668 | null |
2025-03-25 | ProtoGS: Efficient and High-Quality Rendering with 3D Gaussian Prototypes | Zhengqing Gao et.al. | 2503.17486 | null |
2025-03-21 | ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration | Johan Edstedt et.al. | 2503.17093 | link |
2025-03-20 | From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction | Ayberk Acar et.al. | 2503.16263 | null |
2025-03-22 | Euclid Quick Data Release (Q1). A first view of the star-forming main sequence in the Euclid Deep Fields | Euclid Collaboration et.al. | 2503.15314 | null |
2025-03-18 | Multi-view Reconstruction via SfM-guided Monocular Depth Estimation | Haoyu Guo et.al. | 2503.14483 | null |
2025-03-18 | A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios | Huy-Hoang Bui et.al. | 2503.13982 | link |
2025-03-17 | Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios | Iryna Repinetska et.al. | 2503.13710 | null |
2025-03-17 | Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization | Yiwei Xu et.al. | 2503.13086 | null |
2025-03-15 | SFMNet: Sparse Focal Modulation for 3D Object Detection | Oren Shrout et.al. | 2503.12093 | null |
2025-03-11 | A Framework for Reducing the Complexity of Geometric Vision Problems and its Application to Two-View Triangulation with Approximation Bounds | Felix Rydell et.al. | 2503.08142 | null |
2025-03-11 | DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection | Johan Edstedt et.al. | 2503.07347 | link |
2025-03-18 | Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion | Mona Sheikh Zeinoddin et.al. | 2503.07204 | null |
2025-03-10 | VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation | Hanzhi Chen et.al. | 2503.07135 | null |
2025-03-09 | AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation | Yang Zou et.al. | 2503.06660 | null |
2025-03-07 | LiDAR-enhanced 3D Gaussian Splatting Mapping | Jian Shen et.al. | 2503.05425 | null |
2025-03-06 | PLMP – Point-Line Minimal Problems for Projective SfM | Kim Kiehn et.al. | 2503.04351 | null |
2025-03-03 | MUSt3R: Multi-view Network for Stereo 3D Reconstruction | Yohann Cabon et.al. | 2503.01661 | link |
2025-03-03 | ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization | Anas Abdelkarim et.al. | 2503.01311 | link |
2025-03-05 | A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping | Jialei He et.al. | 2503.01202 | null |
2025-03-02 | MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain | Rui Yi Yong et.al. | 2503.00853 | null |
2025-03-02 | PSRGS:Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery | BoCheng Li et.al. | 2503.00848 | null |
2025-03-02 | Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration | Jinjiang You et.al. | 2503.00737 | link |
2025-02-28 | The THESAN-ZOOM project: Burst, quench, repeat – unveiling the evolution of high-redshift galaxies along the star-forming main sequence | William McClymont et.al. | 2503.00106 | null |
2025-02-27 | Best Foot Forward: Robust Foot Reconstruction in-the-wild | Kyle Fogarty et.al. | 2502.20511 | null |
2025-02-26 | SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images | Yangfan Xu et.al. | 2502.18932 | null |
2025-03-04 | Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model | Yaxuan Huang et.al. | 2502.16779 | null |
2025-02-20 | CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting | Qilin Zhang et.al. | 2502.14684 | link |
2025-02-19 | Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections | Seong Jong Yoo et.al. | 2502.13986 | null |
2025-02-19 | IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360 $^\circ$ Cameras | Dongki Jung et.al. | 2502.12545 | null |
2025-02-12 | Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors | Vishwanath Pratap Singh et.al. | 2502.08587 | null |
2025-02-10 | FOCUS – Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences | Oliver Boyne et.al. | 2502.06367 | link |
2025-02-09 | Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models | Jing-Xuan Zhang et.al. | 2502.05766 | link |
2025-02-10 | Building Rome with Convex Optimization | Haoyu Han et.al. | 2502.04640 | null |
2025-02-04 | SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification | Yifu Tao et.al. | 2502.02657 | null |
2025-02-05 | GP-GS: Gaussian Processes for Enhanced Gaussian Splatting | Zhihao Guo et.al. | 2502.02283 | link |
2025-02-03 | XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications | Shangjin Zhai et.al. | 2502.01297 | null |
2025-01-29 | Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment | Zixue Zeng et.al. | 2501.17690 | link |
2025-01-28 | Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction | Tim Flückiger et.al. | 2501.16221 | null |
2025-01-25 | Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos | Zhen-Hui Dong et.al. | 2501.15096 | null |
2025-01-24 | MATCHA:Towards Matching Anything | Fei Xue et.al. | 2501.14945 | null |
2025-01-24 | Light3R-SfM: Towards Feed-forward Structure-from-Motion | Sven Elflein et.al. | 2501.14914 | null |
2025-01-24 | Dense-SfM: Structure from Motion with Dense Consistent Matching | JongMin Lee et.al. | 2501.14277 | null |
2025-01-21 | Theory of quantum-geometric charge and spin Josephson diode effects in strongly spin-polarized hybrid structures with noncoplanar spin textures | Niklas L. Schulz et.al. | 2501.12232 | null |
2025-01-14 | Selective Attention Merging for low resource tasks: A case study of Child ASR | Natarajan Balaji Shankar et.al. | 2501.08468 | link |
2025-01-14 | SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting | Yue Hu et.al. | 2501.07015 | null |
2025-02-02 | CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications | Xinyi Zheng et.al. | 2501.06927 | link |
2025-01-11 | Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis | Aditya Rauniyar et.al. | 2501.06431 | null |
2025-01-09 | Existence of dynamical fluctuation in AMPT generated data for Au+Au collisions at 10 AGeV | Somen Gope et.al. | 2501.05175 | null |
2025-01-06 | Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation | Yuezhang Lv et.al. | 2501.02821 | null |
2025-01-02 | On Unifying Video Generation and Camera Pose Estimation | Chun-Hao Paul Huang et.al. | 2501.01409 | null |
2025-01-02 | EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy | Ao Gao et.al. | 2501.01003 | null |
2024-12-30 | KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences | Keng-Wei Chang et.al. | 2412.20767 | null |
2024-12-27 | Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images | Xudong Cai et.al. | 2412.19518 | null |
2024-12-25 | Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition | Shujie Hu et.al. | 2412.18832 | null |
2024-12-23 | Reconstructing People, Places, and Cameras | Lea Müller et.al. | 2412.17806 | link |
2024-12-18 | Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation | Rémi Marsal et.al. | 2412.14103 | null |
2024-12-16 | Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection | Beomseok Lee et.al. | 2412.11978 | null |
2024-12-18 | SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video | Jongmin Park et.al. | 2412.09982 | null |
2024-12-12 | CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework | Yushan Han et.al. | 2412.08344 | null |
2024-12-10 | Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling | Hui Deng et.al. | 2412.07230 | null |
2024-12-08 | Unveiling True Talent: The Soccer Factor Model for Skill Evaluation | Alexandre Andorra et.al. | 2412.05911 | null |
2024-12-08 | Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features | Yuanbo Xiangli et.al. | 2412.05826 | null |
2024-12-06 | MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos | Zhengqi Li et.al. | 2412.04463 | null |
2024-12-03 | ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification | Pan Zhang et.al. | 2412.02044 | link |
2024-12-02 | SfM-Free 3D Gaussian Splatting via Hierarchical Training | Bo Ji et.al. | 2412.01553 | link |
2024-12-02 | MVImgNet2.0: A Larger-scale Dataset of Multi-view Images | Xiaoguang Han et.al. | 2412.01430 | null |
2024-12-02 | TAS-TsC: A Data-Driven Framework for Estimating Time of Arrival Using Temporal-Attribute-Spatial Tri-space Coordination of Truck Trajectories | Mengran Li et.al. | 2412.01122 | null |
2024-12-02 | Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM | Alejandro Fontan et.al. | 2412.01116 | null |
2024-11-27 | RoMo: Robust Motion Segmentation Improves Structure from Motion | Lily Goli et.al. | 2411.18650 | null |
2024-11-26 | The MAGPI Survey: radial trends in star formation across different cosmological simulations in comparison with observations at $z \sim$ 0.3 | Marcie Mun et.al. | 2411.17882 | null |
2024-11-25 | Characterizing Stellar and Gas Properties in NGC 628: Spatial Distributions, Radial Gradients, and Resolved Scaling Relations | Peng Wei et.al. | 2411.16150 | null |
2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
2024-11-20 | DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild | Weicai Ye et.al. | 2411.13291 | null |
2024-11-15 | SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction | Yutao Tang et.al. | 2411.12592 | link |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-13 | 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Mijeong Kim et.al. | 2411.08879 | null |
2024-11-13 | Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model | Yutao Shen et.al. | 2411.08453 | null |
2024-11-08 | From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$ -NeuS | Haoran Zhang et.al. | 2411.05362 | link |
2024-10-29 | A Cascade Approach for APT Campaign Attribution in System Event Logs: Technique Hunting and Subgraph Matching | Yi-Ting Huang et.al. | 2410.22602 | null |
2024-10-29 | LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues | Hanqing Jiang et.al. | 2410.22213 | null |
2024-10-17 | Stochastic Flow Matching for Resolving Small-Scale Physics | Stathi Fotiadis et.al. | 2410.19814 | null |
2024-10-25 | A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint | Changshi Mu et.al. | 2410.19473 | link |
2024-10-30 | Large Spatial Model: End-to-end Unposed Images to Semantic 3D | Zhiwen Fan et.al. | 2410.18956 | link |
2024-10-23 | CO-CAVITY project: Molecular gas and star formation in void galaxies | M. I. Rodríguez et.al. | 2410.18078 | null |
2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
2024-10-20 | Neural Active Structure-from-Motion in Dark and Textureless Environment | Kazuto Ichimaru et.al. | 2410.15378 | null |
2024-10-17 | SemSim: Revisiting Weak-to-Strong Consistency from a Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation | Shiao Xie et.al. | 2410.13486 | null |
2024-10-16 | Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks | Orchid Chetia Phukan et.al. | 2410.12947 | null |
2024-10-16 | Gravity-aligned Rotation Averaging with Circular Regression | Linfei Pan et.al. | 2410.12763 | link |
2024-10-16 | Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals | Orchid Chetia Phukan et.al. | 2410.12645 | null |
2024-10-15 | SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection | Yizhe Liu et.al. | 2410.12080 | link |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | link |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-10-09 | Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models | Ange Lou et.al. | 2410.07434 | null |
2024-10-09 | Deep HI Mapping of M 106 Group with FAST | Yao Liu et.al. | 2410.07038 | null |
2024-10-09 | MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data | Mingu Kang et.al. | 2410.06442 | null |
2024-10-08 | Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation? | Charalambos Tzamos et.al. | 2410.05984 | link |
2024-10-04 | Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering | Laura Fink et.al. | 2410.03861 | link |
2024-10-01 | MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages | Marco Gaido et.al. | 2410.01036 | link |
2024-10-01 | Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance | Hongchao Shu et.al. | 2410.00386 | null |
2024-09-29 | Robust Incremental Structure-from-Motion with Hybrid Features | Shaohui Liu et.al. | 2409.19811 | null |
2024-09-27 | MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion | Bardienus Duisterhof et.al. | 2409.19152 | null |
2024-09-27 | Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras | Yipeng Lu et.al. | 2409.18673 | null |
2024-09-26 | BlinkTrack: Feature Tracking over 100 FPS via Events and Images | Yichen Shen et.al. | 2409.17981 | null |
2024-09-25 | How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not | Francesco Verdini et.al. | 2409.17044 | null |
2024-09-24 | Frequency-based View Selection in Gaussian Splatting Reconstruction | Monica M. Q. Li et.al. | 2409.16470 | null |
2024-10-07 | Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion | Juan-Diego Florez et.al. | 2409.16465 | null |
2024-09-24 | Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research | Vandita Shukla et.al. | 2409.15914 | null |
2024-09-23 | Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments | Francisco Roza de Moraes et.al. | 2409.15602 | null |
2024-09-23 | Evaluating Robot Influence on Pedestrian Behavior Models for Crowd Simulation and Benchmarking | Subham Agrawal et.al. | 2409.14844 | null |
2024-09-21 | Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models | Orchid Chetia Phukan et.al. | 2409.14131 | null |
2024-09-17 | GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module | Yichen Zhang et.al. | 2409.11307 | null |
2024-09-13 | Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints | Shan Chen et.al. | 2409.08613 | null |
2024-09-09 | KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction | Davide Di Nucci et.al. | 2409.05407 | null |
2024-09-06 | The Arizona Molecular ISM Survey with the SMT: Variations in the CO(2-1)/CO(1-0) Line Ratio Across the Galaxy Population | Ryan P. Keenan et.al. | 2409.03963 | null |
2024-09-05 | Active Galactic Nuclei in the Green Valley at z $\sim$ 0.7 | Charity Woodrum et.al. | 2409.03197 | null |
2024-09-04 | Object Gaussian for Monocular 6D Pose Estimation from Sparse Views | Luqing Luo et.al. | 2409.02581 | null |
2024-09-11 | Geometry-aware Feature Matching for Large-Scale Structure from Motion | Gonglin Chen et.al. | 2409.02310 | null |
2024-09-04 | The study of strongly intensive observables for $π^{\pm,0}$ in $pp$ collisions at LHC energy in the framework of PYTHIA model | Tumpa Biswas et.al. | 2409.00525 | null |
2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
2024-09-15 | Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks | Sierra Bonilla et.al. | 2408.16445 | link |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-20 | TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Jinjie Mai et.al. | 2408.10739 | null |
2024-08-16 | Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS | Wei Sun et.al. | 2408.08723 | null |
2024-08-15 | CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning | Wei Zhu et.al. | 2408.08134 | link |
2024-08-13 | A Miniature Vision-Based Localization System for Indoor Blimps | Shicong Ma et.al. | 2408.06648 | null |
2024-08-07 | Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM | Yan Song Hu et.al. | 2408.03825 | null |
2024-08-05 | Context-aware Mamba-based Reinforcement Learning for social robot navigation | Syed Muhammad Mustafa et.al. | 2408.02661 | null |
2024-08-04 | Birational geometry of critical loci in Algebraic Vision | Marina Bertolini et.al. | 2408.02067 | null |
2024-08-04 | PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone | Xin Yang et.al. | 2408.02053 | null |
2024-08-02 | Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris | Kentaro Uno et.al. | 2408.01035 | null |
2024-08-01 | LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting | Zhenyu Bao et.al. | 2408.00254 | null |
2024-07-29 | Global Structure-from-Motion Revisited | Linfei Pan et.al. | 2407.20219 | link |
2024-08-06 | Revisit Self-supervised Depth Estimation with Local Structure-from-Motion | Shengjie Zhu et.al. | 2407.19166 | null |
2024-07-23 | The Hidden Variables: Harnessing Half-Shell Potentials for Enhanced Precision in Nuclear Reaction Calculations | Hao Liu et.al. | 2407.16452 | null |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-16 | NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models | Francesco Milano et.al. | 2407.12207 | link |
2024-07-15 | LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning | Zhuozhu Jian et.al. | 2407.10782 | null |
2024-07-15 | Towards Scale-Aware Full Surround Monodepth with Transformers | Yuchen Yang et.al. | 2407.10406 | null |
2024-07-14 | 3DEgo: 3D Editing on the Go! | Umar Khalid et.al. | 2407.10102 | null |
2024-07-10 | Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization | Jinjie Mai et.al. | 2407.08023 | link |
2024-07-10 | Euclid preparation. Forecasting the recovery of galaxy physical properties and their relations with template-fitting and machine-learning methods | Euclid Collaboration et.al. | 2407.07940 | null |
2024-07-10 | Controlling Space and Time with Diffusion Models | Daniel Watson et.al. | 2407.07860 | null |
2024-07-09 | Computer vision tasks for intelligent aerospace missions: An overview | Huilin Chen et.al. | 2407.06513 | null |
2024-07-08 | Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views | Jiawei Guo et.al. | 2407.05666 | null |
2024-07-05 | Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization | Shaohan Li et.al. | 2407.04260 | null |
2024-07-15 | SfM on-the-fly: Get better 3D from What You Capture | Zongqian Zhan et.al. | 2407.03939 | null |
2024-07-03 | Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jiaxin Guo et.al. | 2407.02918 | link |
2024-07-02 | Indoor 3D Reconstruction with an Unknown Camera-Projector Pair | Zhaoshuai Qi et.al. | 2407.01945 | null |
2024-06-27 | SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas | John Lambert et.al. | 2406.19390 | link |
2024-06-27 | STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning | Yanan Zhang et.al. | 2406.19362 | null |
2024-06-26 | VDG: Vision-Only Dynamic Gaussian for Driving Simulation | Hao Li et.al. | 2406.18198 | null |
2024-06-25 | Consensus Learning with Deep Sets for Essential Matrix Estimation | Dror Moran et.al. | 2406.17414 | link |
2024-06-24 | Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction | Tong Qin et.al. | 2406.16289 | null |
2024-06-21 | The importance of stochasticity in determining galaxy emissivities and UV LFs during cosmic dawn and reionization | Ivan Nikolić et.al. | 2406.15237 | link |
2024-06-19 | MVSBoost: An Efficient Point Cloud-based 3D Reconstruction | Umair Haroon et.al. | 2406.13515 | null |
2024-06-17 | MegaScenes: Scene-Level View Synthesis at Scale | Joseph Tung et.al. | 2406.11819 | link |
2024-06-15 | Benchmarking Children’s ASR with Supervised and Self-supervised Speech Foundation Models | Ruchao Fan et.al. | 2406.10507 | link |
2024-06-14 | On the Evaluation of Speech Foundation Models for Spoken Language Understanding | Siddhant Arora et.al. | 2406.10083 | null |
2024-06-12 | Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement | Maxime Pietrantoni et.al. | 2406.08463 | null |
2024-06-12 | SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models | Chun Yin et.al. | 2406.08445 | null |
2024-06-10 | Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis | Xin Jin et.al. | 2406.06216 | link |
2024-06-07 | The Star-Forming Main Sequence in JADES and CEERS at $z>1.4$ : Investigating the Burstiness of Star Formation | Leonardo Clarke et.al. | 2406.05178 | null |
2024-06-13 | Gaussian Splatting with Localized Points Management | Haosen Yang et.al. | 2406.04251 | null |
2024-06-05 | L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration | Yibo Liu et.al. | 2406.03298 | link |
2024-06-04 | CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation | Dejia Xu et.al. | 2406.02509 | null |
2024-05-29 | Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy | Zijie Jiang et.al. | 2405.18863 | null |
2024-05-29 | 3D Reconstruction with Fast Dipole Sums | Hanyu Chen et.al. | 2405.16788 | null |
2024-05-26 | MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups | Yusen Xie et.al. | 2405.16599 | null |
2024-05-26 | Categorical Flow Matching on Statistical Manifolds | Chaoran Cheng et.al. | 2405.16441 | link |
2024-05-22 | Exploring Galaxy Properties of eCALIFA with Contrastive Learning | G. Martínez-Solaeche et.al. | 2405.13471 | null |
2024-05-23 | Switched Flow Matching: Eliminating Singularities via Switching ODEs | Qunxi Zhu et.al. | 2405.11605 | null |
2024-05-28 | NeRO: Neural Road Surface Reconstruction | Ruibo Wang et.al. | 2405.10554 | link |
2024-05-15 | Three Dimensional Spatial Cognition: Bees and Bats | Robert Worden et.al. | 2405.09413 | null |
2024-05-09 | Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media | Zhizhen Zhang et.al. | 2405.05760 | null |
2024-05-09 | Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment | Simon Weber et.al. | 2405.05079 | link |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-07 | Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling | Jiawei Shi et.al. | 2405.04309 | null |
2024-05-06 | Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion | Yunfeng Li et.al. | 2405.03177 | link |
2024-05-03 | HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 | Miriam Jäger et.al. | 2405.02005 | null |
2024-04-25 | The MAGPI Survey: Evolution of radial trends in star formation activity across cosmic time | Marcie Mun et.al. | 2404.16319 | null |
2024-04-22 | Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer | Eric Brachmann et.al. | 2404.14351 | null |
2024-04-22 | RESFM: Robust Equivariant Multiview Structure from Motion | Fadi Khatib et.al. | 2404.14280 | null |
2024-04-22 | Does Gaussian Splatting need SFM Initialization? | Yalda Foroutan et.al. | 2404.12547 | null |
2024-05-07 | A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion | Feng Yu et.al. | 2404.11590 | link |
2024-04-18 | DeblurGS: Gaussian Splatting for Camera Motion Blur | Jeongtaek Oh et.al. | 2404.11358 | null |
2024-05-21 | LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives | Jiadi Cui et.al. | 2404.09748 | null |
2024-04-12 | MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance | Yuqun Wu et.al. | 2404.08252 | null |
2024-04-11 | Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation | Keonhee Han et.al. | 2404.07933 | null |
2024-04-07 | NeRF2Points: Large-Scale Point Cloud Generation From Street Views’ Radiance Field Optimization | Peng Tu et.al. | 2404.04875 | null |
2024-04-04 | GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis | Emmanouil Nikolakakis et.al. | 2404.03126 | null |
2024-03-29 | InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds | Zhiwen Fan et.al. | 2403.20309 | link |
2024-03-29 | HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes | Zhuopeng Li et.al. | 2403.20032 | null |
2024-03-26 | NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation | Jiahao Chen et.al. | 2403.17537 | null |
2024-03-25 | INPC: Implicit Neural Point Clouds for Radiance Field Rendering | Florian Hahlbohm et.al. | 2403.16862 | null |
2024-03-18 | An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation | Zewen Xu et.al. | 2403.11639 | null |
2024-03-14 | Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting | Jaewoo Jung et.al. | 2403.09413 | link |
2024-03-13 | Refractive COLMAP: Refractive Structure-from-Motion Revisited | Mengkun She et.al. | 2403.08640 | null |
2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
2024-03-11 | SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection | Yifu Tao et.al. | 2403.06877 | null |
2024-03-24 | BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Cheng Peng et.al. | 2403.04926 | link |
2024-02-22 | GaussianPro: 3D Gaussian Splatting with Progressive Propagation | Kai Cheng et.al. | 2402.14650 | null |
2024-02-25 | A Robust Error-Resistant View Selection Method for 3D Reconstruction | Shaojie Zhang et.al. | 2402.11431 | null |
2024-02-17 | Dense Matchers for Dense Tracking | Tomáš Jelínek et.al. | 2402.11287 | null |
2024-03-11 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
2024-01-22 | HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs | Zelin Gao et.al. | 2401.11711 | null |
2024-01-19 | SCENES: Subpixel Correspondence Estimation With Epipolar Supervision | Dominik A. Kloepfer et.al. | 2401.10886 | null |
2024-01-15 | 3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data | Mathilde Letard et.al. | 2401.09481 | link |
2024-01-17 | 3D Scene Geometry Estimation from 360 $^\circ$ Imagery: A Survey | Thiago Lopes Trugillo da Silveira et.al. | 2401.09252 | null |
2024-01-17 | ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization | Weiyao Wang et.al. | 2401.08937 | null |
2024-01-16 | Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions | Yi-Fan Zuo et.al. | 2401.08043 | link |
2024-01-10 | Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects | Tianhang Cheng et.al. | 2401.05236 | link |
2024-01-07 | A Classification of Critical Configurations for any Number of Projective Views | Martin Bråtelund et.al. | 2401.03450 | link |
2023-12-24 | Residual Learning for Image Point Descriptors | Rashik Shrestha et.al. | 2312.15471 | null |
2023-12-16 | Transformers in Unsupervised Structure-from-Motion | Hemang Chawla et.al. | 2312.10529 | link |
2023-12-14 | HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video | Xueying Wang et.al. | 2312.08863 | null |
2023-12-14 | CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning | Qingsong Yan et.al. | 2312.08760 | null |
2023-12-11 | Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach | Travis Driver et.al. | 2312.06865 | link |
2023-12-11 | Gaussian Splatting SLAM | Hidenobu Matsuki et.al. | 2312.06741 | null |
2023-12-10 | SuperPrimitive: Scene Reconstruction at a Primitive Level | Kirill Mazur et.al. | 2312.05889 | null |
2023-12-07 | Visual Geometry Grounded Deep Structure From Motion | Jianyuan Wang et.al. | 2312.04563 | null |
2023-11-30 | Distributed Global Structure-from-Motion with a Deep Front-End | Ayush Baid et.al. | 2311.18801 | link |
2023-11-21 | Robot Hand-Eye Calibration using Structure-from-Motion | Nicolas Andreff et.al. | 2311.11808 | null |
2023-11-18 | LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation | Sébastien Henry et.al. | 2311.11171 | null |
2023-11-10 | MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty | Rémi Marsal et.al. | 2311.06137 | link |
2023-11-08 | VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering | Linus Franke et.al. | 2311.04634 | link |
2023-10-22 | A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video | Jan Emily Mangulabnan et.al. | 2310.14364 | null |
2023-10-20 | FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer | Xinyu Zhang et.al. | 2310.13605 | null |
2023-10-09 | Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration | Chunge Bai et.al. | 2310.05504 | link |
2023-10-08 | LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization | Artem Nenashev et.al. | 2310.05134 | null |
2023-11-29 | Pose-Free Generalizable Rendering Transformer | Zhiwen Fan et.al. | 2310.03704 | link |
2023-10-02 | Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images | Georg Bökman et.al. | 2310.01092 | null |
2023-10-01 | Propagating Semantic Labels in Video Data | David Balaban et.al. | 2310.00783 | null |
2023-09-22 | Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning | Jonathan Sauder et.al. | 2309.12804 | null |
2023-09-21 | On-the-Fly SfM: What you capture is What you get | Zongqian Zhan et.al. | 2309.11883 | link |
2023-09-19 | Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water | Jayesh Tripathi et.al. | 2309.10269 | null |
2023-09-16 | DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF | Mert Asim Karaoglu et.al. | 2309.08927 | link |
2023-09-08 | Robot Localization and Mapping Final Report – Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry | Akankshya Kar et.al. | 2309.04147 | null |
2023-09-01 | SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation | Youhong Wang et.al. | 2309.00526 | null |
2023-09-01 | Dense Voxel 3D Reconstruction Using a Monocular Event Camera | Haodong Chen et.al. | 2309.00385 | null |
2023-08-30 | Learning Structure-from-Motion with Graph Attention Networks | Lucas Brynte et.al. | 2308.15984 | link |
2023-08-26 | Disjoint Pose and Shape for 3D Face Reconstruction | Raja Kumar et.al. | 2308.13903 | null |
2023-08-30 | CamP: Camera Preconditioning for Neural Radiance Fields | Keunhong Park et.al. | 2308.10902 | null |
2023-08-18 | Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling | Haorui Ji et.al. | 2308.10705 | null |
2023-08-14 | Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation | Tao Liu et.al. | 2308.07231 | link |
2023-08-11 | Efficient Large-scale AUV-based Visual Seafloor Mapping | Mengkun She et.al. | 2308.06147 | null |
2023-08-04 | EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems | Weihan Wang et.al. | 2308.02670 | null |
2023-08-15 | Tirtha – An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites | Jyotirmaya Shivottam et.al. | 2308.01246 | link |
2023-08-02 | Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network | Shenbagaraj Kannapiran et.al. | 2308.01125 | null |
2023-07-27 | PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking | Yang Zheng et.al. | 2307.15055 | link |
2023-07-28 | SACReg: Scene-Agnostic Coordinate Regression for Visual Localization | Jerome Revaud et.al. | 2307.11702 | null |
2023-07-19 | Lazy Visual Localization via Motion Averaging | Siyan Dong et.al. | 2307.09981 | null |
2023-07-10 | Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor | San Jiang et.al. | 2307.04520 | null |
2023-07-07 | RGB-D Mapping and Tracking in a Plenoxel Radiance Field | Andreas L. Teigen et.al. | 2307.03404 | link |
2023-06-29 | The Drunkard’s Odometry: Estimating Camera Motion in Deforming Scenes | David Recasens et.al. | 2306.16917 | link |
2023-06-27 | Detector-Free Structure from Motion | Xingyi He et.al. | 2306.15669 | link |
2023-06-28 | PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment | Jianyuan Wang et.al. | 2306.15667 | null |
2023-06-24 | 3D Reconstruction of Spherical Images based on Incremental Structure from Motion | San Jiang et.al. | 2306.12770 | link |
2023-06-15 | NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations | Varun Jampani et.al. | 2306.09109 | link |
2023-06-15 | Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization | Dror Aiger et.al. | 2306.09012 | link |
2023-06-10 | 3D reconstruction using Structure for Motion | Kshitij Karnawat et.al. | 2306.06360 | link |
2023-06-02 | Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images | Marcela Mera-Trujillo et.al. | 2306.01938 | null |
2023-05-31 | FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow | Cameron Smith et.al. | 2306.00180 | null |
2023-05-19 | SIDAR: Synthetic Image Dataset for Alignment & Restoration | Monika Kwiatkowski et.al. | 2305.12036 | link |
2023-05-09 | Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization | Clémentin Boittiaux et.al. | 2305.05301 | link |
2023-05-09 | Rotation Synchronization via Deep Matrix Factorization | Gk Tejus et.al. | 2305.05268 | link |
2023-04-20 | A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion | Miriam Jäger et.al. | 2304.10664 | null |
2023-04-14 | Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments | Felix Ott et.al. | 2304.07250 | null |
2023-04-12 | Visual Localization using Imperfect 3D Models from the Internet | Vojtech Panek et.al. | 2304.05947 | link |
2023-04-08 | Photometric Correction for Infrared Sensors | Jincheng Zhang et.al. | 2304.03930 | null |
2023-04-07 | DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium | Antyanta Bangunharcana et.al. | 2304.03560 | link |
2023-04-05 | Semantic Validation in Structure from Motion | Joseph Rowell et.al. | 2304.02420 | link |
2023-03-31 | Learning Internal Representations of 3D Transformations from 2D Projected Inputs | Marissa Connor et.al. | 2303.17776 | null |
2023-03-30 | 3D Line Mapping Revisited | Shaohui Liu et.al. | 2303.17504 | link |
2023-03-27 | TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering | Jaehoon Choi et.al. | 2303.15060 | null |
2023-03-26 | On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks | HyunJun Jung et.al. | 2303.14840 | link |
2023-03-24 | Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container | Jinguang Tong et.al. | 2303.13805 | link |
2023-03-24 | Progressively Optimized Local Radiance Fields for Robust View Synthesis | Andreas Meuleman et.al. | 2303.13791 | null |
2023-03-15 | RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters | Shuja Khalid et.al. | 2303.08695 | null |
2023-03-09 | Revisiting Rotation Averaging: Uncertainties and Robust Losses | Ganlin Zhang et.al. | 2303.05195 | link |
2023-02-28 | Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images | Zhongli Fan et.al. | 2302.14239 | link |
2023-03-25 | BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling | Sameera Ramasinghe et.al. | 2302.13543 | null |
2023-02-21 | EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images | Zhichao Ye et.al. | 2302.10544 | link |
2023-02-18 | Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering | Tatsuro Yamane et.al. | 2302.09208 | null |
2023-02-12 | Uncertainty-Driven Dense Two-View Structure from Motion | Weirong Chen et.al. | 2302.00523 | null |
2023-01-28 | AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion | Yu Chen et.al. | 2301.12135 | null |
2023-01-20 | A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles | Zhefan Xu et.al. | 2301.08422 | link |
2023-03-21 | Robust Dynamic Radiance Fields | Yu-Lun Liu et.al. | 2301.02239 | link |
2022-12-24 | Polarimetric Multi-View Inverse Rendering | Jinyu Zhao et.al. | 2212.12721 | null |
2022-12-13 | Accidental Turntables: Learning 3D Pose by Watching Objects Turn | Zezhou Cheng et.al. | 2212.06300 | null |
2022-12-04 | 3D Object Aided Self-Supervised Monocular Depth Estimation | Songlin Wei et.al. | 2212.01768 | null |
2022-12-02 | High-Res Facial Appearance Capture from Polarized Smartphone Images | Dejan Azinović et.al. | 2212.01160 | null |
2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | link |
2022-11-24 | JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models | Sepidehsadat Hosseini et.al. | 2211.13785 | null |
2022-11-24 | SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks | Sergio Izquierdo et.al. | 2211.13551 | link |
2022-11-22 | Level-S $^2$ fM: Structure from Motion on Neural Level Set of Implicit Surfaces | Yuxi Xiao et.al. | 2211.12018 | link |
2022-11-21 | Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques | David Ramirez et.al. | 2211.11836 | null |
2022-11-14 | Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion | René Haas et.al. | 2211.07195 | null |
2022-10-13 | Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach | Zhiang Chen et.al. | 2210.07349 | null |
2022-10-11 | DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion | Yuxi Xiao et.al. | 2210.05517 | null |
2022-10-07 | Leveraging Structure from Motion to Localize Inaccessible Bus Stops | Indu Panigrahi et.al. | 2210.03646 | link |
2022-10-01 | Structure-Aware NeRF without Posed Camera via Epipolar Constraint | Shu Chen et.al. | 2210.00183 | link |
2022-10-05 | FAST-LIO, Then Bayesian ICP, Then GTSFM | Jerred Chen et.al. | 2210.00146 | null |
2022-09-20 | BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction | Ahalya Ravendran et.al. | 2209.09470 | null |
2022-09-19 | A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion | Gerry Chen et.al. | 2209.08690 | null |
2022-09-14 | End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes | Qiao Chen et.al. | 2209.06926 | null |
2022-09-07 | Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021 | Hartmut Surmann et.al. | 2209.03084 | null |
2022-08-27 | Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data | Thomas A. Ciarfuglia et.al. | 2208.13001 | null |
2022-08-12 | Handling Constrained Optimization in Factor Graphs for Autonomous Navigation | Barbara Bazzana et.al. | 2208.06325 | null |
2022-08-04 | Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training | Yao-Chih Lee et.al. | 2208.02709 | link |
2022-07-31 | One Object at a Time: Accurate and Robust Structure From Motion for Robots | Aravind Battaje et.al. | 2208.00487 | null |
2022-07-23 | Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks | Daniel Posada et.al. | 2207.11413 | null |
2022-07-25 | MeshLoc: Mesh-Based Visual Localization | Vojtech Panek et.al. | 2207.10762 | link |
2022-07-19 | ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild | Wang Zhao et.al. | 2207.09137 | link |
2022-07-16 | Organic Priors in Non-Rigid Structure from Motion | Suryansh Kumar et.al. | 2207.06262 | null |
2022-07-06 | A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models | Axel Garcia-Vega et.al. | 2207.02396 | null |
2022-06-24 | Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set | San Jiang et.al. | 2206.11499 | null |
2022-06-13 | TC-SfM: Robust Track-Community-Based Structure-from-Motion | Lei Wang et.al. | 2206.05866 | null |
2022-06-10 | EigenFairing: 3D Model Fairing using Image Coherence | Pragyana Mishra et.al. | 2206.05309 | null |
2022-06-01 | Semantic Room Wireframe Detection from a Single View | David Gillsjö et.al. | 2206.00491 | link |
2022-05-31 | Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction | Qiancheng Fu et.al. | 2205.15848 | null |
2022-05-09 | Is my Depth Ground-Truth Good Enough? HAMMER – Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression | HyunJun Jung et.al. | 2205.04565 | null |
2022-05-07 | Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs | Pedro F. Proença et.al. | 2205.03522 | null |
2022-05-06 | EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms | Levi Burner et.al. | 2205.03467 | null |
2022-04-20 | Learned Monocular Depth Priors in Visual-Inertial Initialization | Yunwen Zhou et.al. | 2204.09171 | null |
2022-04-10 | Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective | Hui Deng et.al. | 2204.04730 | null |
2022-04-08 | Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems | Debao Huang et.al. | 2204.04145 | null |
2022-04-07 | SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation | Yi Wei et.al. | 2204.03636 | link |
2022-04-06 | Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion | Lukas Bommes et.al. | 2204.02733 | link |
2022-04-05 | Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows | Sheng Liu et.al. | 2204.02509 | link |
2022-03-31 | Fast, Accurate and Memory-Efficient Partial Permutation Synchronization | Shaohan Li et.al. | 2203.16505 | null |
2022-03-28 | Visual Odometry for RGB-D Cameras | Afonso Fontes et.al. | 2203.15119 | null |
2022-03-28 | Optimizing Elimination Templates by Greedy Parameter Search | Evgeniy Martyushev et.al. | 2203.14901 | link |
2022-03-23 | Event-Based Dense Reconstruction Pipeline | Kun Xiao et.al. | 2203.12270 | null |
2022-03-21 | DiffPoseNet: Direct Differentiable Camera Pose Estimation | Chethan M. Parameshwara et.al. | 2203.11174 | null |
2022-03-02 | Asynchronous Optimisation for Event-based Visual Odometry | Daqi Liu et.al. | 2203.01037 | null |
2022-03-02 | Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation | Yulun Tian et.al. | 2203.00851 | null |
2022-02-18 | MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery | Ahmad Khaliq et.al. | 2202.09146 | link |
2022-01-20 | GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry | Yunhan Zhao et.al. | 2201.08131 | null |
2022-01-13 | Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching | Yunpeng Shi et.al. | 2201.04797 | link |
2022-01-10 | High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM | Brian M. Hopkinson et.al. | 2201.03364 | link |
2022-01-06 | De-rendering 3D Objects in the Wild | Felix Wimbauer et.al. | 2201.02279 | link |
2021-12-29 | On the Instability of Relative Pose Estimation and RANSAC’s Role | Hongyi Fan et.al. | 2112.14651 | null |
2021-12-16 | Road-aware Monocular Structure from Motion and Homography Estimation | Wei Sui et.al. | 2112.08635 | null |
2021-12-10 | Critical configurations for three projective views | Martin Bråtelund et.al. | 2112.05478 | null |
2021-12-09 | Critical configurations for two projective views, a new approach | Martin Bråtelund et.al. | 2112.05074 | null |
2021-12-06 | Dense Depth Priors for Neural Radiance Fields from Sparse Input Views | Barbara Roessle et.al. | 2112.03288 | link |
2021-12-10 | MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment | Jie Ren et.al. | 2112.01349 | link |
2021-11-11 | Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft | Pascal Schoppmann et.al. | 2111.06271 | null |
2021-11-10 | Damage Estimation and Localization from Sparse Aerial Imagery | Rene Garcia Franceschini et.al. | 2111.03708 | null |
2021-11-03 | Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems | Swarnabja Bhaumik et.al. | 2111.02064 | null |
2021-10-14 | Modeling dynamic target deformation in camera calibration | Annika Hagemann et.al. | 2110.07322 | null |
2021-10-13 | Hyperspectral 3D Mapping of Underwater Environments | Maxime Ferrera et.al. | 2110.06571 | null |
2021-09-24 | Automatic Map Update Using Dashcam Videos | Aziza Zhanabatyrova et.al. | 2109.12131 | null |
2021-09-16 | Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs | Gabriel Moreira et.al. | 2109.08046 | link |
2021-09-06 | Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications | Tejas Mane et.al. | 2109.02740 | null |
2021-09-02 | Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency | Beatrix-Emőke Fülöp-Balogh et.al. | 2109.01018 | null |
2021-09-01 | On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation | Eric Brachmann et.al. | 2109.00524 | link |
2021-08-31 | DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension | Roman Shapovalov et.al. | 2109.00033 | null |
2021-08-29 | Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration | Seyed-Mahdi Nasiri et.al. | 2108.12876 | null |
2021-08-23 | Burst Imaging for Light-Constrained Structure-From-Motion | Ahalya Ravendran et.al. | 2108.09895 | null |
Visual Localization
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-09 | Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning | Konstantinos I. Roumeliotis et.al. | 2507.10571 | null |
2025-07-14 | GT-Loc: Unifying When and Where in Images Through a Joint Embedding Space | David G. Shatwell et.al. | 2507.10473 | null |
2025-07-14 | Text-to-Remote-Sensing-Image Retrieval beyond RGB Sources | Daniele Rege Cambrin et.al. | 2507.10403 | null |
2025-07-14 | Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures | Xinlong Ding et.al. | 2507.10265 | null |
2025-07-11 | RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features | Inye Na et.al. | 2507.08546 | null |
2025-07-11 | LiDAR, GNSS and IMU Sensor Alignment through Dynamic Time Warping to Construct 3D City Maps | Haitian Wang et.al. | 2507.08420 | null |
2025-07-11 | Deep Hashing with Semantic Hash Centers for Image Retrieval | Li Chen et.al. | 2507.08404 | null |
2025-07-08 | Unveiling Effective In-Context Configurations for Image Captioning: An External & Internal Analysis | Li Li et.al. | 2507.08021 | null |
2025-07-10 | SCREP: Scene Coordinate Regression and Evidential Learning-based Perception-Aware Trajectory Generation | Juyeop Han et.al. | 2507.07467 | null |
2025-07-10 | VP-SelDoA: Visual-prompted Selective DoA Estimation of Target Sound via Semantic-Spatial Matching | Yu Chen et.al. | 2507.07384 | null |
2025-07-08 | FACap: A Large-scale Fashion Dataset for Fine-grained Composed Image Retrieval | François Gardères et.al. | 2507.07135 | null |
2025-07-09 | Evaluating Attribute Confusion in Fashion Text-to-Image Generation | Ziyue Liu et.al. | 2507.07079 | null |
2025-07-09 | MS-DPPs: Multi-Source Determinantal Point Processes for Contextual Diversity Refinement of Composite Attributes in Text to Image Retrieval | Naoya Sogi et.al. | 2507.06654 | null |
2025-07-08 | Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval | Haiwen Li et.al. | 2507.05970 | null |
2025-07-08 | OFFSET: Segmentation-based Focus Shift Revision for Composed Image Retrieval | Zhiwei Chen et.al. | 2507.05631 | null |
2025-07-07 | Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model | Mengyao Xu et.al. | 2507.05513 | null |
2025-07-07 | An analysis of vision-language models for fabric retrieval | Francesco Giuliari et.al. | 2507.04735 | null |
2025-07-08 | What’s Making That Sound Right Now? Video-centric Audio-Visual Localization | Hahyeon Choi et.al. | 2507.04667 | null |
2025-07-07 | Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR | Tao Du et.al. | 2507.04662 | null |
2025-07-06 | U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration | Xiaofan Li et.al. | 2507.04503 | null |
2025-07-04 | Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition | Jiuhong Xiao et.al. | 2507.03831 | null |
2025-07-01 | LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment | Juelin Zhu et.al. | 2507.00659 | null |
2025-06-28 | Utilizing a Novel Deep Learning Method for Scene Categorization in Remote Sensing Data | Ghufran A. Omran et.al. | 2506.22939 | null |
2025-06-28 | Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval | Li-Cheng Shen et.al. | 2506.22864 | null |
2025-06-27 | MatChA: Cross-Algorithm Matching with Feature Augmentation | Paula Carbó Cubero et.al. | 2506.22336 | null |
2025-06-26 | OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography | Caoshuo Li et.al. | 2506.21101 | null |
2025-06-25 | Visualizing intercalation effects in 2D materials using AFM based techniques | Karmen Kapustić et.al. | 2506.20467 | null |
2025-06-25 | On the Burstiness of Faces in Set | Jiong Wang et.al. | 2506.20312 | null |
2025-06-24 | jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval | Michael Günther et.al. | 2506.18902 | null |
2025-06-26 | Referring Expression Instance Retrieval and A Strong End-to-End Baseline | Xiangzhao Hao et.al. | 2506.18246 | null |
2025-06-20 | Class Agnostic Instance-level Descriptor for Visual Instance Search | Qi-Ying Sun et.al. | 2506.16745 | null |
2025-06-19 | MambaHash: Visual State Space Deep Hashing Model for Large-Scale Image Retrieval | Chao He et.al. | 2506.16353 | link |
2025-06-19 | Fine-grained Image Retrieval via Dual-Vision Adaptation | Xin Jiang et.al. | 2506.16273 | null |
2025-06-19 | Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot Navigation | Connor Malone et.al. | 2506.15988 | link |
2025-06-18 | Semantic and Feature Guided Uncertainty Quantification of Visual Localization for Autonomous Vehicles | Qiyuan Wu et.al. | 2506.15851 | null |
2025-06-18 | ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections | Ziling Huang et.al. | 2506.15180 | null |
2025-06-17 | HARMONY: A Scalable Distributed Vector Database for High-Throughput Approximate Nearest Neighbor Search | Qian Xu et.al. | 2506.14707 | null |
2025-06-17 | TACS-Graphs: Traversability-Aware Consistent Scene Graphs for Ground Robot Indoor Localization and Mapping | Jeewon Kim et.al. | 2506.14178 | null |
2025-06-16 | A Semantically-Aware Relevance Measure for Content-Based Medical Image Retrieval Evaluation | Xiaoyang Wei et.al. | 2506.13509 | null |
2025-06-19 | Hierarchical Multi-Positive Contrastive Learning for Patent Image Retrieval | Kshitij Kavimandan et.al. | 2506.13496 | null |
2025-06-16 | EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition | Bingxi Liu et.al. | 2506.13133 | null |
2025-06-16 | SuperPlace: The Renaissance of Classical Feature Aggregation for Visual Place Recognition in the Era of Foundation Models | Bingxi Liu et.al. | 2506.13073 | null |
2025-06-14 | Feature Complementation Architecture for Visual Place Recognition | Weiwei Wang et.al. | 2506.12401 | null |
2025-06-11 | Towards a general-purpose foundation model for fMRI analysis | Cheng Wang et.al. | 2506.11167 | null |
2025-06-11 | Improving Personalized Search with Regularized Low-Rank Parameter Updates | Fiona Ryan et.al. | 2506.10182 | link |
2025-06-10 | Safeguarding Multimodal Knowledge Copyright in the RAG-as-a-Service Environment | Tianyu Chen et.al. | 2506.10030 | link |
2025-06-11 | Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints | Xiangkai Zhang et.al. | 2506.09748 | null |
2025-06-10 | Robust Visual Localization via Semantic-Guided Multi-Scale Transformer | Zhongtao Tian et.al. | 2506.08526 | null |
2025-06-08 | Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs | Yikun Ji et.al. | 2506.07045 | null |
2025-06-07 | Zero Shot Composed Image Retrieval | Santhosh Kakarla et.al. | 2506.06602 | null |
2025-06-06 | GenIR: Generative Visual Feedback for Mental Image Retrieval | Diji Yang et.al. | 2506.06220 | null |
2025-06-06 | Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning | Sheng Chen et.al. | 2506.06205 | null |
2025-06-05 | HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition | Suhan Woo et.al. | 2506.04764 | null |
2025-06-05 | Deep Learning Reforms Image Matching: A Survey and Outlook | Shihua Zhang et.al. | 2506.04619 | null |
2025-06-02 | Entity Image and Mixed-Modal Image Retrieval Datasets | Cristian-Ioan Blaga et.al. | 2506.02291 | null |
2025-06-01 | Quantization-based Bounds on the Wasserstein Metric | Jonathan Bobrutsky et.al. | 2506.00976 | null |
2025-05-30 | SORCE: Small Object Retrieval in Complex Environments | Chunxu Liu et.al. | 2505.24441 | link |
2025-05-29 | Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch | Aneeshan Sain et.al. | 2505.23763 | null |
2025-05-28 | 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians | Hidenobu Matsuki et.al. | 2505.22859 | null |
2025-05-28 | UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images | Junhuan Liu et.al. | 2505.22098 | null |
2025-05-28 | Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule | San Jiang et.al. | 2505.22089 | null |
2025-05-27 | Visual Loop Closure Detection Through Deep Graph Consensus | Martin Büchner et.al. | 2505.21754 | null |
2025-05-27 | QuARI: Query Adaptive Retrieval Improvement | Eric Xing et.al. | 2505.21647 | null |
2025-05-27 | ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval | Eric Xing et.al. | 2505.20764 | link |
2025-05-26 | Visualized Text-to-Image Retrieval | Di Wu et.al. | 2505.20291 | link |
2025-05-26 | Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval | Rong-Cheng Tu et.al. | 2505.19952 | null |
2025-05-26 | Can Visual Encoder Learn to See Arrows? | Naoyuki Terashita et.al. | 2505.19944 | null |
2025-05-26 | MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval | Rong-Cheng Tu et.al. | 2505.19707 | null |
2025-05-24 | Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU | Yicheng Lin et.al. | 2505.18652 | link |
2025-05-24 | TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP | Yuliang Cai et.al. | 2505.18434 | null |
2025-05-23 | ImLPR: Image-based LiDAR Place Recognition using Vision Foundation Models | Minwoo Jung et.al. | 2505.18364 | null |
2025-05-23 | DART $^3$ : Leveraging Distance for Test Time Adaptation in Person Re-Identification | Rajarshi Bhattacharya et.al. | 2505.18337 | null |
2025-05-23 | To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models | Simone Gaisbauer et.al. | 2505.17973 | link |
2025-05-23 | DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval | Yuxin Yang et.al. | 2505.17796 | null |
2025-05-23 | CU-Multi: A Dataset for Multi-Robot Data Association | Doncey Albin et.al. | 2505.17576 | null |
2025-05-22 | TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition | Oliver Grainge et.al. | 2505.16447 | null |
2025-05-21 | Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval | Siting Li et.al. | 2505.15877 | null |
2025-05-21 | SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph Retrieval | Nikolaos Chaidos et.al. | 2505.15867 | link |
2025-05-20 | Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | Kiarash Naghavi Khanghah et.al. | 2505.13828 | null |
2025-05-18 | MMS-VPR: Multimodal Street-Level Visual Place Recognition Dataset and Benchmark | Yiwei Ou et.al. | 2505.12254 | null |
2025-05-16 | Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization | Aaron Wilhelm et.al. | 2505.11620 | null |
2025-05-16 | Redundancy-Aware Pretraining of Vision-Language Foundation Models in Remote Sensing | Mathis Jürgen Adler et.al. | 2505.11121 | null |
2025-05-04 | OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery | Chongsheng Zhang et.al. | 2505.03836 | link |
2025-05-06 | Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions | Lukas Schichler et.al. | 2505.03565 | null |
2025-05-06 | LiftFeat: 3D Geometry-Aware Local Feature Matching | Yepeng Liu et.al. | 2505.03422 | link |
2025-05-06 | Seeing the Abstract: Translating the Abstract Language for Vision Language Models | Davide Talon et.al. | 2505.03242 | link |
2025-05-13 | SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment | Ganesh Sapkota et.al. | 2505.01956 | null |
2025-05-02 | NeuroLoc: Encoding Navigation Cells for 6-DOF Camera Localization | Xun Li et.al. | 2505.01113 | null |
2025-05-01 | GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting | Jongwon Lee et.al. | 2504.20379 | null |
2025-04-25 | From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval | Yabing Wang et.al. | 2504.17990 | null |
2025-04-24 | A Guide to Structureless Visual Localization | Vojtech Panek et.al. | 2504.17636 | null |
2025-04-23 | Rethinking Vision Transformer for Large-Scale Fine-Grained Image Retrieval | Xin Jiang et.al. | 2504.16691 | null |
2025-04-22 | Media Content Atlas: A Pipeline to Explore and Investigate Multidimensional Media Space using Multimodal LLMs | Merve Cerit et.al. | 2504.16323 | link |
2025-04-19 | A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling | Kyle Buettner et.al. | 2504.14359 | null |
2025-04-17 | SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs | Haoxuan Li et.al. | 2504.13172 | null |
2025-04-16 | Generalized Visual Relation Detection with Diffusion Models | Kaifeng Gao et.al. | 2504.12100 | null |
2025-04-15 | Visual Re-Ranking with Non-Visual Side Information | Gustav Hanning et.al. | 2504.11134 | link |
2025-04-15 | TMCIR: Token Merge Benefits Composed Image Retrieval | Chaoyang Wang et.al. | 2504.10995 | null |
2025-04-14 | Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition | Changwei Wang et.al. | 2504.09881 | link |
2025-04-12 | Evolved Hierarchical Masking for Self-Supervised Learning | Zhanzhou Feng et.al. | 2504.09155 | null |
2025-04-11 | HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields | Asterios Reppas et.al. | 2504.08901 | null |
2025-04-11 | Hypergraph Vision Transformers: Images are More than Nodes, More than Edges | Joshua Fixelle et.al. | 2504.08710 | null |
2025-04-11 | FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations | Cheng-Yu Hsieh et.al. | 2504.08368 | null |
2025-04-11 | PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection | Xiong Li et.al. | 2504.08280 | null |
2025-04-10 | Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval | Zehong Ma et.al. | 2504.07718 | null |
2025-04-09 | A Pointcloud Registration Framework for Relocalization in Subterranean Environments | David Akhihiero et.al. | 2504.07231 | null |
2025-04-09 | Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception | Ruotian Peng et.al. | 2504.06666 | null |
2025-04-08 | To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition | Davide Sferrazza et.al. | 2504.06116 | link |
2025-04-06 | NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval | Peng Gao et.al. | 2504.04339 | null |
2025-04-04 | REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval | Shabnam Choudhury et.al. | 2504.03169 | null |
2025-04-06 | Re-thinking Temporal Search for Long-Form Video Understanding | Jinhui Ye et.al. | 2504.02259 | link |
2025-04-02 | A Chefs KISS – Utilizing semantic information in both ICP and SLAM framework | Sven Ochs et.al. | 2504.02086 | null |
2025-04-02 | Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval | Yuji Nozawa et.al. | 2504.01348 | null |
2025-04-01 | IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval | Bangwei Liu et.al. | 2504.00954 | null |
2025-04-01 | Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data | Yiqun Duan et.al. | 2504.00812 | null |
2025-03-31 | CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization | Yingrui Ji et.al. | 2503.24182 | null |
2025-03-31 | LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds | Masahiko Tsuji et.al. | 2503.23664 | null |
2025-03-30 | Multiview Image-Based Localization | Cameron Fiore et.al. | 2503.23577 | null |
2025-03-27 | LOCORE: Image Re-ranking with Long-Context Sequence Modeling | Zilin Xiao et.al. | 2503.21772 | link |
2025-03-27 | Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck | Adrian Bulat et.al. | 2503.21757 | null |
2025-03-27 | UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation | Yehui Shen et.al. | 2503.21338 | link |
2025-03-27 | FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval | Zixu Li et.al. | 2503.21309 | link |
2025-03-27 | Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing | Shuai Li et.al. | 2503.21236 | null |
2025-03-25 | CoLLM: A Large Language Model for Composed Image Retrieval | Chuong Huynh et.al. | 2503.19910 | link |
2025-03-25 | Scene-agnostic Pose Regression for Visual Localization | Junwei Zheng et.al. | 2503.19543 | null |
2025-03-25 | From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting | Zhiwei Huang et.al. | 2503.19358 | null |
2025-03-25 | Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval | Haoqiang Lin et.al. | 2503.19296 | link |
2025-03-23 | LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space | Zhangyu Wang et.al. | 2503.18142 | null |
2025-03-23 | Selecting and Pruning: A Differentiable Causal Sequentialized State-Space Model for Two-View Correspondence Learning | Xiang Fang et.al. | 2503.17938 | null |
2025-03-23 | What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images | Dongheng Lin et.al. | 2503.17899 | null |
2025-03-22 | good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval | Pranavi Kolouju et.al. | 2503.17871 | null |
2025-03-21 | Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval | Yuanmin Tang et.al. | 2503.17109 | link |
2025-03-21 | Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions | Muhua Zhang et.al. | 2503.17005 | null |
2025-03-20 | PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval | Qiang Zou et.al. | 2503.16064 | link |
2025-03-20 | Automating 3D Dataset Generation with Neural Radiance Fields | P. Schulz et.al. | 2503.15997 | link |
2025-03-18 | 3D Densification for Multi-Map Monocular VSLAM in Endoscopy | X. Anadón et.al. | 2503.14346 | null |
2025-03-18 | A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios | Huy-Hoang Bui et.al. | 2503.13982 | link |
2025-03-17 | Scale Efficient Training for Large Datasets | Qing Zhou et.al. | 2503.13385 | link |
2025-03-17 | Multi-Platform Teach-and-Repeat Navigation by Visual Place Recognition Based on Deep-Learned Local Features | Václav Truhlařík et.al. | 2503.13090 | null |
2025-03-17 | All You Need to Know About Training Image Retrieval Models | Gabriele Berton et.al. | 2503.13045 | link |
2025-03-12 | Exploring the best way for UAV visual localization under Low-altitude Multi-view Observation Condition: a Benchmark | Yibin Ye et.al. | 2503.10692 | link |
2025-03-13 | ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning | Pengfei Luo et.al. | 2503.10166 | link |
2025-03-12 | Revisiting Medical Image Retrieval via Knowledge Consolidation | Yang Nan et.al. | 2503.09370 | null |
2025-03-11 | CQVPR: Landmark-aware Contextual Queries for Visual Place Recognition | Dongyue Li et.al. | 2503.08170 | null |
2025-03-10 | Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization | Michael Green et.al. | 2503.07038 | null |
2025-03-10 | Zero-Shot Hashing Based on Reconstruction With Part Alignment | Yan Jiang et.al. | 2503.07037 | null |
2025-03-10 | Improving Visual Place Recognition with Sequence-Matching Receptiveness Prediction | Somayeh Hussaini et.al. | 2503.06840 | null |
2025-03-09 | RoboDesign1M: A Large-scale Dataset for Robot Design Understanding | Tri Le et.al. | 2503.06796 | null |
2025-03-09 | StructVPR++: Distill Structural and Semantic Knowledge with Weighting Samples for Visual Place Recognition | Yanqing Shen et.al. | 2503.06601 | link |
2025-03-09 | TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification | Huaqi Tao et.al. | 2503.06501 | link |
2025-03-08 | NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features | Hongjia Zhai et.al. | 2503.06117 | null |
2025-03-07 | Data-Efficient Generalization for Zero-shot Composed Image Retrieval | Zining Chen et.al. | 2503.05204 | null |
2025-03-06 | RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining | Tengfei Zhang et.al. | 2503.04653 | null |
2025-03-06 | ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images | Yanqing Shen et.al. | 2503.04475 | link |
2025-03-06 | Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes | Hui Zhang et.al. | 2503.04235 | null |
2025-03-06 | Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior | Haitao Wu et.al. | 2503.04207 | link |
2025-03-06 | Image-Based Relocalization and Alignment for Long-Term Monitoring of Dynamic Underwater Environments | Beverley Gorry et.al. | 2503.04096 | link |
2025-03-04 | TeTRA-VPR: A Ternary Transformer Approach for Compact Visual Place Recognition | Oliver Grainge et.al. | 2503.02511 | null |
2025-03-04 | Introspective Loop Closure for SLAM with 4D Imaging Radar | Maximilian Hilger et.al. | 2503.02383 | null |
2025-03-04 | Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models | Kenta Tsukahara et.al. | 2503.02256 | null |
2025-03-03 | Composed Multi-modal Retrieval: A Survey of Approaches and Applications | Kun Zhang et.al. | 2503.01334 | link |
2025-03-03 | AirRoom: Objects Matter in Room Reidentification | Runmao Yao et.al. | 2503.01130 | null |
2025-03-02 | Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching | Jinyu Miao et.al. | 2503.00862 | null |
2025-03-01 | Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning | Songlin Dong et.al. | 2503.00515 | null |
2025-02-28 | EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration | Kuangyi Chen et.al. | 2503.00167 | link |
2025-02-28 | CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval | Zelong Sun et.al. | 2502.20826 | null |
2025-02-28 | SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition | Shanshan Wan et.al. | 2502.20676 | null |
2025-02-27 | A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization | Yejun Zhang et.al. | 2502.20036 | link |
2025-02-27 | On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation | Ruben T. Lucassen et.al. | 2502.19285 | null |
2025-02-26 | BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure | Haoxin Cai et.al. | 2502.19242 | link |
2025-02-26 | SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images | Yangfan Xu et.al. | 2502.18932 | null |
2025-02-19 | A Comprehensive Survey on Composed Image Retrieval | Xuemeng Song et.al. | 2502.18495 | link |
2025-02-25 | MegaLoc: One Retrieval to Place Them All | Gabriele Berton et.al. | 2502.17237 | link |
2025-02-23 | Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries | Yin Wu et.al. | 2502.16636 | link |
2025-02-23 | SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition | Feng Lu et.al. | 2502.16601 | link |
2025-02-21 | ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval | Guanqi Zhan et.al. | 2502.15682 | null |
2025-02-20 | Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition | Tianyi Shang et.al. | 2502.14195 | link |
2025-02-19 | 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments | Vincent Ress et.al. | 2502.13803 | null |
2025-02-18 | Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization | Shuo Xing et.al. | 2502.13146 | link |
2025-02-19 | IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360 $^\circ$ Cameras | Dongki Jung et.al. | 2502.12545 | null |
2025-02-17 | From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations | Matteo Scucchia et.al. | 2502.12303 | null |
2025-02-17 | Descriminative-Generative Custom Tokens for Vision-Language Models | Pramuditha Perera et.al. | 2502.12095 | null |
2025-02-17 | ILIAS: Instance-Level Image retrieval At Scale | Giorgos Kordopatis-Zilos et.al. | 2502.11748 | null |
2025-02-17 | Range and Bird’s Eye View Fused Cross-Modal Visual Place Recognition | Jianyi Peng et.al. | 2502.11742 | link |
2025-02-17 | Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics | Francesco Croce et.al. | 2502.11725 | link |
2025-02-17 | Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization | Yuanze Xu et.al. | 2502.11408 | null |
2025-02-12 | E2LVLM:Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection | Junjie Wu et.al. | 2502.10455 | null |
2025-02-11 | Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning | Yuhang Dong et.al. | 2502.09649 | null |
2025-02-13 | ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation | Rotem Shalev-Arkushin et.al. | 2502.09411 | null |
2025-02-12 | SpeechCompass: Enhancing Mobile Captioning with Diarization and Directional Guidance via Multi-Microphone Localization | Artem Dementyev et.al. | 2502.08848 | null |
2025-02-12 | Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions | Prajwal Gatti et.al. | 2502.08438 | null |
2025-02-11 | Captured by Captions: On Memorization and its Mitigation in CLIP Models | Wenhao Wang et.al. | 2502.07830 | null |
2025-02-11 | Ultrafast 4D scanning transmission electron microscopy for imaging of localized optical fields | Petr Koutenský et.al. | 2502.07338 | null |
2025-02-11 | Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos | Haowen Gao et.al. | 2502.07327 | null |
2025-02-11 | PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval | Osman Tursun et.al. | 2502.07215 | null |
2025-02-10 | AstroLoc: Robust Space to Ground Image Localizer | Gabriele Berton et.al. | 2502.07003 | null |
2025-02-09 | Uni-Retrieval: A Multi-Style Retrieval Framework for STEM’s Education | Yanhao Jia et.al. | 2502.05863 | null |
2025-02-07 | Learning Street View Representations with Spatiotemporal Contrast | Yong Li et.al. | 2502.04638 | null |
2025-02-06 | Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion | Marco Mistretta et.al. | 2502.04263 | link |
2025-02-05 | Human-Aligned Image Models Improve Visual Decoding from the Brain | Nona Rajabi et.al. | 2502.03081 | null |
2025-02-03 | ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies | Costin F. Ciusdel et.al. | 2502.01335 | null |
2025-01-31 | LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks | Liudi Yang et.al. | 2501.19382 | link |
2025-01-27 | Freestyle Sketch-in-the-Loop Image Segmentation | Subhadeep Koley et.al. | 2501.16022 | null |
2025-01-26 | Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations | Zijun Long et.al. | 2501.15379 | null |
2025-01-24 | Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection | Viktor Kozák et.al. | 2501.14587 | null |
2025-01-23 | Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models | Jakob Krogh Petersen et.al. | 2501.14051 | link |
2025-01-22 | Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation | Kenta Uesugi et.al. | 2501.13968 | null |
2025-01-19 | Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection | Zhipeng Yu et.al. | 2501.11063 | link |
2025-01-18 | A Resource-Efficient Training Framework for Remote Sensing Text–Image Retrieval | Weihang Zhang et.al. | 2501.10638 | null |
2025-01-17 | FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis | Zhe Chen et.al. | 2501.09887 | null |
2025-01-15 | Vision Foundation Models for Computed Tomography | Suraj Pai et.al. | 2501.09001 | link |
2025-01-12 | SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval | Bhavin Jawade et.al. | 2501.08347 | null |
2025-01-14 | VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes | Ke Wu et.al. | 2501.08286 | null |
2025-01-13 | Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps | Saurabh Gupta et.al. | 2501.07399 | null |
2025-01-12 | Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation | Zhenyang Feng et.al. | 2501.06749 | null |
2025-01-06 | Integrating Language-Image Prior into EEG Decoding for Cross-Task Zero-Calibration RSVP-BCI | Xujin Li et.al. | 2501.02841 | null |
2025-01-03 | A Minimal Subset Approach for Efficient and Scalable Loop Closure | Nikolaos Stathoulopoulos et.al. | 2501.01791 | link |
2025-01-03 | iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings | Shuhei Tomoshige et.al. | 2501.01642 | null |
2025-01-02 | R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization | Xudong Jiang et.al. | 2501.01421 | link |
2025-01-02 | Training Medical Large Vision-Language Models with Abnormal-Aware Feedback | Yucheng Zhou et.al. | 2501.01377 | null |
2025-01-02 | Domain-invariant feature learning in brain MR imaging for content-based image retrieval | Shuya Tobari et.al. | 2501.01326 | null |
2024-12-28 | GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting | Atticus J. Zeller et.al. | 2412.20056 | link |
2024-12-25 | FOR: Finetuning for Object Level Open Vocabulary Image Retrieval | Hila Levi et.al. | 2412.18806 | null |
2024-12-24 | ERVD: An Efficient and Robust ViT-Based Distillation Framework for Remote Sensing Image Retrieval | Le Dong et.al. | 2412.18136 | link |
2024-12-22 | Where am I? Cross-View Geo-localization with Natural Language Descriptions | Junyan Ye et.al. | 2412.17007 | null |
2024-12-22 | Large-Scale UWB Anchor Calibration and One-Shot Localization Using Gaussian Process | Shenghai Yuan et.al. | 2412.16880 | null |
2024-12-24 | Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling | Daichi Yashima et.al. | 2412.16576 | link |
2024-12-20 | A New Method to Capturing Compositional Knowledge in Linguistic Space | Jiahe Wan et.al. | 2412.15632 | null |
2024-12-20 | Stabilizing Laplacian Inversion in Fokker-Planck Image Retrieval using the Transport-of-Intensity Equation | Samantha J Alloo et.al. | 2412.15513 | null |
2024-12-19 | Learning Visual Composition through Improved Semantic Guidance | Austin Stone et.al. | 2412.15396 | null |
2024-12-19 | MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval | Junjie Zhou et.al. | 2412.14475 | null |
2024-12-18 | Adversarial Hubness in Multi-Modal Retrieval | Tingwei Zhang et.al. | 2412.14113 | link |
2024-12-18 | Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval | Giacomo Pacini et.al. | 2412.13834 | null |
2024-12-18 | ConDo: Continual Domain Expansion for Absolute Pose Regression | Zijun Li et.al. | 2412.13452 | link |
2024-12-17 | Three Things to Know about Deep Metric Learning | Yash Patel et.al. | 2412.12432 | null |
2024-12-15 | Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval | Zelong Sun et.al. | 2412.11087 | null |
2024-12-20 | Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval | Yuanmin Tang et.al. | 2412.11077 | link |
2024-12-13 | MVC-VPR: Mutual Learning of Viewpoint Classification and Visual Place Recognition | Qiwen Gu et.al. | 2412.09199 | null |
2024-12-12 | A Flexible Plug-and-Play Module for Generating Variable-Length | Liyang He et.al. | 2412.08922 | link |
2024-12-11 | Image Retrieval Methods in the Dissimilarity Space | Madhu Kiran et.al. | 2412.08618 | null |
2024-12-11 | Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization | Siyan Dong et.al. | 2412.08376 | link |
2024-12-11 | Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin | Benjamin D. Killeen et.al. | 2412.08020 | null |
2024-12-10 | On Motion Blur and Deblurring in Visual Place Recognition | Timur Ismagilov et.al. | 2412.07751 | null |
2024-12-10 | Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance | Wanwen Chen et.al. | 2412.07741 | null |
2024-12-09 | An Efficient Scene Coordinate Encoding and Relocalization Method | Kuan Xu et.al. | 2412.06488 | link |
2024-12-09 | A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition | Connor Malone et.al. | 2412.06153 | null |
2024-12-07 | Compositional Image Retrieval via Instruction-Aware Contrastive Learning | Wenliang Zhong et.al. | 2412.05756 | link |
2024-12-06 | DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification | Ying Jin et.al. | 2412.04828 | null |
2024-12-04 | Distillation of Diffusion Features for Semantic Correspondence | Frank Fundel et.al. | 2412.03512 | null |
2024-12-04 | Composed Image Retrieval for Training-Free Domain Conversion | Nikos Efthymiadis et.al. | 2412.03297 | link |
2024-12-03 | A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration | Thulio Amorim et.al. | 2412.02881 | null |
2024-12-03 | Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval | Leah Bar et.al. | 2412.02310 | link |
2024-12-02 | Mutli-View 3D Reconstruction using Knowledge Distillation | Aditya Dutt et.al. | 2412.02039 | link |
2024-12-02 | Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features | MD Shaikh Rahman et.al. | 2412.01555 | null |
2024-12-02 | Neuron Abandoning Attention Flow: Visual Explanation of Dynamics inside CNN Models | Yi Liao et.al. | 2412.01202 | null |
2024-12-01 | EDTformer: An Efficient Decoder Transformer for Visual Place Recognition | Tong Jin et.al. | 2412.00784 | link |
2024-11-28 | EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval | Muhammad Huzaifa et.al. | 2412.00139 | null |
2024-11-28 | Unleashing the Power of Data Synthesis in Visual Localization | Sihang Li et.al. | 2412.00138 | null |
2024-11-28 | Relation-Aware Meta-Learning for Zero-shot Sketch-Based Image Retrieval | Yang Liu et.al. | 2412.00120 | null |
2024-11-29 | A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications | Liqiang Zhang Ye Tian Dongyan Wei et.al. | 2411.19845 | null |
2024-11-27 | Optimizing Image Retrieval with an Extended b-Metric Space | Abdelkader Belhenniche et.al. | 2411.18800 | null |
2024-11-26 | Learning Visual Hierarchies with Hyperbolic Embeddings | Ziwei Wang et.al. | 2411.17490 | null |
2024-12-02 | Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy | You Li et.al. | 2411.16752 | null |
2024-12-02 | AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks | You Li et.al. | 2411.16749 | null |
2024-11-25 | Image Generation Diversity Issues and How to Tame Them | Mischa Dombrowski et.al. | 2411.16171 | link |
2024-11-24 | PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments | Haoang Li et.al. | 2411.15800 | null |
2024-11-22 | Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval | Zengbao Sun et.al. | 2411.14704 | null |
2024-11-20 | Globally Correlation-Aware Hard Negative Generation | Wenjie Peng et.al. | 2411.13145 | link |
2024-11-18 | Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos et.al. | 2411.11481 | null |
2024-11-13 | OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances | Youqi Liao et.al. | 2411.08665 | link |
2024-11-13 | Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval | Saul Santos et.al. | 2411.08590 | link |
2024-11-22 | Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments | Ashkan Nejad et.al. | 2411.08567 | link |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-05 | From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing | Xintian Sun et.al. | 2411.05826 | null |
2024-11-04 | TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives | Maitreya Patel et.al. | 2411.02545 | null |
2024-11-11 | INQUIRE: A Natural World Text-to-Image Retrieval Benchmark | Edward Vendrow et.al. | 2411.02537 | link |
2024-11-20 | Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models | Sharat Agarwal et.al. | 2411.01925 | null |
2024-11-04 | Semantic Masking and Visual Feature Matching for Robust Localization | Luisa Mao et.al. | 2411.01804 | null |
2024-11-03 | Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification | MD Shaikh Rahman et.al. | 2411.01473 | null |
2024-11-01 | Identifying Implicit Social Biases in Vision-Language Models | Kimia Hamidieh et.al. | 2411.00997 | null |
2024-10-31 | Nearest Neighbor Normalization Improves Multimodal Retrieval | Neil Chowdhury et.al. | 2410.24114 | link |
2024-10-31 | MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval | Haiwen Li et.al. | 2410.23736 | null |
2024-10-30 | Decoupling Semantic Similarity from Spatial Alignment for Neural Networks | Tassilo Wald et.al. | 2410.23107 | link |
2024-10-29 | Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications | Monica Riedler et.al. | 2410.21943 | link |
2024-10-28 | NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments | Taiyi Pan et.al. | 2410.21615 | link |
2024-10-25 | Context-Based Visual-Language Place Recognition | Soojin Woo et.al. | 2410.19341 | link |
2024-10-24 | ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval | Zijia Zhao et.al. | 2410.18715 | link |
2024-10-25 | On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features | Tomáš Pivoňka et.al. | 2410.18573 | null |
2024-10-22 | Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval | Yuanmin Tang et.al. | 2410.17393 | null |
2024-10-20 | GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning | Haiwen Diao et.al. | 2410.15266 | link |
2024-10-19 | Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway’s Digitised Book Collection | Marie Roald et.al. | 2410.14969 | link |
2024-10-16 | Development of Image Collection Method Using YOLO and Siamese Network | Chan Young Shin et.al. | 2410.12561 | null |
2024-10-16 | LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment | Juelin Zhu et.al. | 2410.12269 | link |
2024-10-16 | Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization | Nanda Febri Istighfarin et.al. | 2410.12240 | null |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | link |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-10-11 | Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System | Zheng Liu et.al. | 2410.08935 | link |
2024-10-16 | Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP | Eunji Kim et.al. | 2410.08469 | null |
2024-10-11 | A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification | Eugene P. W. Ang et.al. | 2410.08456 | null |
2024-10-10 | A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks | Hoin Jung et.al. | 2410.07593 | link |
2024-10-09 | Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval | Mohammad Omama et.al. | 2410.07022 | null |
2024-10-09 | Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers | Stephen Hausler et.al. | 2410.06614 | link |
2024-10-09 | MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging | Noel C. F. Codella et.al. | 2410.06542 | null |
2024-10-08 | Temporal Image Caption Retrieval Competition – Description and Results | Jakub Pokrywka et.al. | 2410.06314 | null |
2024-10-08 | Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching | Gongxin Yao et.al. | 2410.06285 | null |
2024-10-08 | GSLoc: Visual Localization with 3D Gaussian Splatting | Kazii Botashev et.al. | 2410.06165 | null |
2024-10-08 | Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning | Ayush Singh et.al. | 2410.05928 | null |
2024-10-08 | RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps | Minsoo Kim et.al. | 2410.05621 | null |
2024-10-09 | LoTLIP: Improving Language-Image Pre-training for Long Text Understanding | Wei Wu et.al. | 2410.05249 | null |
2024-10-06 | LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation | Jianhao Jiao et.al. | 2410.04419 | null |
2024-10-02 | Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension | Zaiquan Yang et.al. | 2410.01544 | null |
2024-10-03 | EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections | Francesc Net et.al. | 2410.01536 | link |
2024-10-04 | CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment | Safouane El Ghazouali et.al. | 2410.01411 | link |
2024-09-30 | Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation | Aleyna Kütük et.al. | 2410.00266 | null |
2024-09-29 | CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation | Yifan Duan et.al. | 2409.19597 | null |
2024-09-28 | VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition | Ahmad Khaliq et.al. | 2409.19293 | link |
2024-09-27 | MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion | Bardienus Duisterhof et.al. | 2409.19152 | null |
2024-09-26 | Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval | Mankeerat Sidhu et.al. | 2409.18733 | null |
2024-09-26 | Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Kartik Garg et.al. | 2409.18049 | link |
2024-09-24 | GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Gennady Sidorov et.al. | 2409.16502 | link |
2024-09-23 | CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis | Xiang Zhang et.al. | 2409.15169 | null |
2024-09-21 | Combining Absolute and Semi-Generalized Relative Poses for Visual Localization | Vojtech Panek et.al. | 2409.14269 | null |
2024-09-21 | SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality | Hongjia Zhai et.al. | 2409.14067 | null |
2024-09-20 | Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval | Morris Florek et.al. | 2409.13513 | link |
2024-09-18 | Towards Global Localization using Multi-Modal Object-Instance Re-Identification | Aneesh Chavan et.al. | 2409.12002 | link |
2024-09-17 | Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching | Kurran Singh et.al. | 2409.11555 | null |
2024-09-17 | Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information | Kunal Chelani et.al. | 2409.11536 | null |
2024-09-17 | Improving the Efficiency of Visually Augmented Language Models | Paula Ontalvilla et.al. | 2409.11148 | link |
2024-09-21 | HGSLoc: 3DGS-based Heuristic Camera Pose Refinement | Zhongyan Niu et.al. | 2409.10925 | null |
2024-09-16 | SOLVR: Submap Oriented LiDAR-Visual Re-Localisation | Joshua Knights et.al. | 2409.10247 | null |
2024-09-16 | Garment Attribute Manipulation with Multi-level Attention | Vittorio Casula et.al. | 2409.10206 | null |
2024-09-14 | Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval | Amirreza Mahbod et.al. | 2409.09430 | link |
2024-09-12 | Structured Pruning for Efficient Visual Place Recognition | Oliver Grainge et.al. | 2409.07834 | null |
2024-09-10 | GeoCalib: Learning Single-image Calibration with Geometric Optimization | Alexander Veicht et.al. | 2409.06704 | link |
2024-09-10 | Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Yujiao Shi et.al. | 2409.06471 | link |
2024-09-10 | A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions | Zhicong Wu et.al. | 2409.06381 | null |
2024-09-09 | Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding | Bram Willemsen et.al. | 2409.05721 | link |
2024-09-09 | Open-World Dynamic Prompt and Continual Visual Representation Learning | Youngeun Kim et.al. | 2409.05312 | null |
2024-09-12 | Training-free ZS-CIR via Weighted Modality Fusion and Similarity | Ren-Di Wu et.al. | 2409.04918 | link |
2024-09-12 | Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models | Saghir Alfasly et.al. | 2409.04631 | null |
2024-09-06 | Reprojection Errors as Prompts for Efficient Scene Coordinate Regression | Ting-Ru Liu et.al. | 2409.04178 | null |
2024-09-06 | Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments | Therese Joseph et.al. | 2409.03998 | null |
2024-09-04 | Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications | Abby Stylianou et.al. | 2409.03012 | null |
2024-09-04 | NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval | Sepanta Zeighami et.al. | 2409.02343 | link |
2024-09-03 | Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment | Konstantin Schall et.al. | 2409.01936 | link |
2024-09-02 | A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches | Kim Jinwoo et.al. | 2409.01219 | null |
2024-09-02 | Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection | Manon Kok et.al. | 2409.01091 | null |
2024-09-02 | Evidential Transformers for Improved Image Retrieval | Danilo Dordevic et.al. | 2409.01082 | null |
2024-09-05 | EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System | Bonan Liu et.al. | 2409.00343 | null |
2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
2024-09-02 | RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance | Avideep Mukherjee et.al. | 2408.17095 | null |
2024-08-29 | A compact neuromorphic system for ultra energy-efficient, on-device robot localization | Adam D. Hines et.al. | 2408.16754 | link |
2024-08-29 | Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models | Kengo Nakata et.al. | 2408.16296 | null |
2024-08-28 | Temporal Attention for Cross-View Sequential Image Localization | Dong Yuan et.al. | 2408.15569 | link |
2024-08-27 | Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild | Tianqi Wei et.al. | 2408.14723 | null |
2024-08-25 | LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task | Ali Asgarov et.al. | 2408.13909 | link |
2024-08-15 | Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval | Lifeng Zhou et.al. | 2408.13705 | null |
2024-08-15 | Coarse-to-fine Alignment Makes Better Speech-image Retrieval | Lifeng Zhou et.al. | 2408.13119 | null |
2024-08-21 | FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization | Son Tung Nguyen et.al. | 2408.12037 | link |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-21 | UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Xiangyu Zhao et.al. | 2408.11305 | link |
2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | link |
2024-08-19 | BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval | Zhenyu Lu et.al. | 2408.10383 | null |
2024-08-23 | Fashion Image-to-Image Translation for Complementary Item Retrieval | Matteo Attimonelli et.al. | 2408.09847 | link |
2024-08-20 | MambaLoc: Efficient Camera Localisation via State Space Model | Jialu Wang et.al. | 2408.09680 | null |
2024-08-15 | DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions | Ryosuke Korekata et.al. | 2408.07910 | null |
2024-08-13 | A Miniature Vision-Based Localization System for Indoor Blimps | Shicong Ma et.al. | 2408.06648 | null |
2024-08-10 | Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network | Junyan Ye et.al. | 2408.05475 | link |
2024-08-09 | Spherical World-Locking for Audio-Visual Localization in Egocentric Videos | Heeseung Yun et.al. | 2408.05364 | null |
2024-08-06 | AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval | Pavel Suma et.al. | 2408.03282 | link |
2024-08-05 | CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration | Gongxin Yao et.al. | 2408.02394 | null |
2024-08-09 | BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles | Lun Luo et.al. | 2408.01841 | link |
2024-08-02 | On Validation of Search & Retrieval of Tissue Images in Digital Pathology | H. R. Tizhoosh et.al. | 2408.01570 | null |
2024-07-31 | VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning | Yuhang Ming et.al. | 2407.21416 | null |
2024-07-31 | SuperVINS: A visual-inertial SLAM framework integrated deep learning features | Hongkun Luo et.al. | 2407.21348 | link |
2024-07-30 | Re-localization acceleration with Medoid Silhouette Clustering | Hongyi Zhang et.al. | 2407.20749 | null |
2024-07-29 | A flexible framework for accurate LiDAR odometry, map manipulation, and localization | José Luis Blanco-Claraco et.al. | 2407.20465 | link |
2024-07-26 | From 2D to 3D: AISG-SLA Visual Localization Challenge | Jialin Gao et.al. | 2407.18590 | null |
2024-07-24 | Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation | Yongqi Li et.al. | 2407.17274 | null |
2024-07-24 | Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments | Wei Gao et.al. | 2407.17078 | null |
2024-07-24 | Pose Estimation from Camera Images for Underwater Inspection | Luyuan Peng et.al. | 2407.16961 | null |
2024-07-22 | Memory Management for Real-Time Appearance-Based Loop Closure Detection | Mathieu Labbé et.al. | 2407.15890 | null |
2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
2024-07-22 | Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM | Mathieu Labbe et.al. | 2407.15305 | null |
2024-07-22 | Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation | Mathieu Labbé et.al. | 2407.15304 | null |
2024-07-19 | Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization | Yuehua Ding et.al. | 2407.14643 | null |
2024-07-18 | Visual Haystacks: Answering Harder Questions About Sets of Images | Tsung-Han Wu et.al. | 2407.13766 | link |
2024-07-17 | Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM | Markus Weißflog et.al. | 2407.12408 | null |
2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
2024-07-16 | EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis | Ruijie Yang et.al. | 2407.11401 | null |
2024-07-15 | No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations | Walter Simoncini et.al. | 2407.10964 | link |
2024-07-15 | DINO Pre-training for Vision-based End-to-end Autonomous Driving | Shubham Juneja et.al. | 2407.10803 | null |
2024-07-15 | Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Youngsun Lim et.al. | 2407.10683 | null |
2024-07-15 | An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots | J. J. Cabrera et.al. | 2407.10596 | link |
2024-07-15 | An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments | J. J. Cabrera et.al. | 2407.10536 | null |
2024-07-12 | Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval | Vaibhav Balloli et.al. | 2407.08908 | link |
2024-07-11 | Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates | Owen Claxton et.al. | 2407.08162 | link |
2024-07-12 | Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal | Xinyu Zhu et.al. | 2407.08153 | link |
2024-07-11 | SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM | Neng Wang et.al. | 2407.08106 | link |
2024-07-09 | LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition | Teng Wang et.al. | 2407.06730 | null |
2024-07-09 | CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding | Wenhao Xu et.al. | 2407.06611 | null |
2024-07-08 | Pseudo-triplet Guided Few-shot Composed Image Retrieval | Bohan Hou et.al. | 2407.06001 | null |
2024-07-09 | HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels | Yingying Jiang et.al. | 2407.05795 | null |
2024-07-05 | Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning | Mainak Singha et.al. | 2407.04207 | link |
2024-07-04 | Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models | Chang-Sheng Kao et.al. | 2407.03615 | link |
2024-07-03 | Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach | Pronay Debnath et.al. | 2407.03486 | null |
2024-07-02 | Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition | Sergio Izquierdo et.al. | 2407.02422 | link |
2024-07-01 | Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval | Aneeshan Sain et.al. | 2407.01810 | null |
2024-07-01 | Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval | Hanwen Su et.al. | 2407.00979 | null |
2024-07-01 | Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios | Connor Malone et.al. | 2407.00863 | null |
2024-06-27 | PathAlign: A vision-language model for whole slide images in histopathology | Faruk Ahmed et.al. | 2406.19578 | null |
2024-07-05 | 360 in the Wild: Dataset for Depth Prediction and View Synthesis | Kibaek Park et.al. | 2406.18898 | null |
2024-06-27 | Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs | Huaying Zhang et.al. | 2406.18836 | null |
2024-06-26 | WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images | Yannik Glaser et.al. | 2406.18765 | link |
2024-06-26 | View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis | Subin Varghese et.al. | 2406.18012 | null |
2024-06-25 | Tell Me Where You Are: Multimodal LLMs Meet Place Recognition | Zonglin Lyu et.al. | 2406.17520 | null |
2024-06-25 | SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation | Xu Liu et.al. | 2406.17249 | link |
2024-06-23 | Breaking the Frame: Image Retrieval by Visual Overlap Prediction | Tong Wei et.al. | 2406.16204 | link |
2024-06-19 | Towards a multimodal framework for remote sensing image change retrieval and captioning | Roger Ferrod et.al. | 2406.13424 | link |
2024-06-19 | CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval | Christian Lülf et.al. | 2406.13322 | link |
2024-06-17 | Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization | Huaiji Zhou et.al. | 2406.11766 | null |
2024-06-22 | Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment | Jianan Jiang et.al. | 2406.11551 | link |
2024-06-17 | They’re All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias | Salma Abdel Magid et.al. | 2406.11331 | null |
2024-06-17 | Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion | Guoyuan An et.al. | 2406.11242 | null |
2024-06-14 | Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval | Genc Hoxha et.al. | 2406.10107 | null |
2024-06-14 | BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval | Imanol Miranda et.al. | 2406.09952 | link |
2024-06-13 | Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases | Meng Wang et.al. | 2406.09317 | link |
2024-06-13 | Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval | Jaeseok Byun et.al. | 2406.09188 | null |
2024-06-13 | DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification | Zhengrui Xu et.al. | 2406.08773 | link |
2024-06-12 | Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement | Maxime Pietrantoni et.al. | 2406.08463 | null |
2024-06-12 | ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery | Kam Woh Ng et.al. | 2406.08457 | link |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502 | link |
2024-06-11 | Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Shuvendu Roy et.al. | 2406.07450 | link |
2024-06-11 | Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval | Adrià Molina et.al. | 2406.07315 | null |
2024-06-10 | Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation | Shenghao Li et.al. | 2406.06374 | link |
2024-06-09 | Unified Text-to-Image Generation and Retrieval | Leigang Qu et.al. | 2406.05814 | null |
2024-06-07 | The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better | Scott Geng et.al. | 2406.05184 | link |
2024-06-07 | PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction | Eduard Poesina et.al. | 2406.04746 | link |
2024-06-06 | GLACE: Global Local Accelerated Coordinate Encoding | Fangjinhua Wang et.al. | 2406.04340 | link |
2024-06-06 | Monocular Localization with Semantics Map for Autonomous Vehicles | Jixiang Wan et.al. | 2406.03835 | null |
2024-06-05 | Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach | Saehyung Lee et.al. | 2406.03411 | link |
2024-06-04 | MeshVPR: Citywide Visual Place Recognition Using 3D Meshes | Gabriele Berton et.al. | 2406.02776 | null |
2024-06-04 | Can CLIP help CLIP in learning 3D? | Cristian Sbrolli et.al. | 2406.02202 | null |
2024-06-03 | Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP | Sriram Balasubramanian et.al. | 2406.01583 | link |
2024-06-03 | Scale-Free Image Keypoints Using Differentiable Persistent Homology | Giovanni Barbarani et.al. | 2406.01315 | link |
2024-06-02 | Visual place recognition for aerial imagery: A survey | Ivan Moskalenko et.al. | 2406.00885 | link |
2024-06-01 | NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization | Wugang Meng et.al. | 2406.00312 | null |
2024-05-31 | DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models | Linli Yao et.al. | 2405.20985 | link |
2024-05-29 | Multi-Modal Generative Embedding Model | Feipeng Ma et.al. | 2405.19333 | null |
2024-05-29 | ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions | Honglin Lin et.al. | 2405.19226 | null |
2024-05-30 | CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval | Xintong Jiang et.al. | 2405.19149 | link |
2024-05-29 | SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation | Zhenbei Wu et.al. | 2405.18801 | null |
2024-05-29 | Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs | Jialiang Xu et.al. | 2405.18740 | link |
2024-05-28 | EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition | Issar Tzachor et.al. | 2405.18065 | null |
2024-05-28 | AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval | Sihe Zhang et.al. | 2405.17718 | null |
2024-05-26 | MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups | Yusen Xie et.al. | 2405.16599 | null |
2024-05-29 | Composed Image Retrieval for Remote Sensing | Bill Psomas et.al. | 2405.15587 | link |
2024-05-24 | Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval | Yiming Wu et.al. | 2405.15451 | null |
2024-05-20 | UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization | Wenjia Xu et.al. | 2405.11936 | link |
2024-05-19 | Register assisted aggregation for Visual Place Recognition | Xuan Yu et.al. | 2405.11526 | null |
2024-05-26 | CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion | Gang Wang et.al. | 2405.10793 | null |
2024-05-16 | FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models | Adrian Bulat et.al. | 2405.10286 | null |
2024-05-15 | Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study | Farnaz Khun Jush et.al. | 2405.09334 | null |
2024-05-14 | BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment | Lihong Jin et.al. | 2405.09001 | null |
2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
2024-05-13 | OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Qiuchi Xiang et.al. | 2405.07966 | link |
2024-05-14 | HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval | Chao He et.al. | 2405.07524 | link |
2024-05-13 | JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation | Xubo Luo et.al. | 2405.07429 | link |
2024-05-12 | BoQ: A Place is Worth a Bag of Learnable Queries | Amar Ali-bey et.al. | 2405.07364 | link |
2024-05-07 | Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction | Nematollah Saeidi et.al. | 2405.04211 | null |
2024-05-06 | A New Robust Partial $p$ -Wasserstein-Based Metric for Comparing Distributions | Sharath Raghvendra et.al. | 2405.03664 | null |
2024-05-06 | Knowledge-aware Text-Image Retrieval for Remote Sensing Images | Li Mi et.al. | 2405.03373 | null |
2024-05-06 | Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval | Jiacheng Cheng et.al. | 2405.03190 | null |
2024-05-05 | iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval | Lorenzo Agnolucci et.al. | 2405.02951 | link |
2024-05-01 | Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval | Young Kyun Jang et.al. | 2405.00571 | null |
2024-04-30 | Large Language Model Informed Patent Image Retrieval | Hao-Cheng Lo et.al. | 2404.19360 | null |
2024-04-30 | XFeat: Accelerated Features for Lightweight Image Matching | Guilherme Potje et.al. | 2404.19174 | null |
2024-04-29 | Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models | Hongyi Zhu et.al. | 2404.18746 | null |
2024-04-29 | Dual-Modal Prompting for Sketch-Based Image Retrieval | Liying Gao et.al. | 2404.18695 | null |
2024-05-01 | Semantic Line Combination Detector | Jinwon Ko et.al. | 2404.18399 | link |
2024-04-26 | Learning text-to-video retrieval from image captioning | Lucas Ventura et.al. | 2404.17498 | null |
2024-04-25 | CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching | Samia Shafique et.al. | 2404.16972 | link |
2024-04-29 | Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval | Ryoya Nara et.al. | 2404.16398 | null |
2024-04-24 | Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval | Haokun Wen et.al. | 2404.15875 | link |
2024-04-24 | DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines | Xin Jiang et.al. | 2404.15771 | null |
2024-04-23 | Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval | Young Kyun Jang et.al. | 2404.15516 | null |
2024-04-22 | EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models | Mathias Thorsager et.al. | 2404.14236 | null |
2024-04-22 | Hierarchical localization with panoramic views and triplet loss functions | Marcos Alfaro et.al. | 2404.14117 | link |
2024-04-20 | High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | Baoru Huang et.al. | 2404.13437 | null |
2024-04-20 | Collaborative Visual Place Recognition through Federated Learning | Mattia Dutto et.al. | 2404.13324 | null |
2024-04-18 | SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints | Spencer Carmichael et.al. | 2404.12339 | null |
2024-04-17 | Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives | Zhangchi Feng et.al. | 2404.11317 | link |
2024-04-17 | Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing | Sanggeon Yun et.al. | 2404.11025 | null |
2024-04-16 | SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments | Niklas Gard et.al. | 2404.10527 | link |
2024-04-20 | CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning | Haojian Huang et.al. | 2404.09640 | link |
2024-04-11 | PRAM: Place Recognition Anywhere Model for Efficient Visual Localization | Fei Xue et.al. | 2404.07785 | null |
2024-04-16 | 2DLIW-SLAM:2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure | Bin Zhang et.al. | 2404.07644 | link |
2024-04-11 | Semantically-correlated memories in a dense associative model | Thomas F Burns et.al. | 2404.07123 | link |
2024-04-09 | Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation | Luca Barsellotti et.al. | 2404.06542 | null |
2024-04-09 | Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Anas Gouda et.al. | 2404.06277 | link |
2024-04-07 | Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval | Jinpeng Wang et.al. | 2404.04998 | link |
2024-04-06 | Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning | Juncheng Yang et.al. | 2404.04538 | link |
2024-04-05 | Towards introspective loop closure in 4D radar SLAM | Maximilian Hilger et.al. | 2404.03940 | null |
2024-04-02 | TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation | Yehui Shen et.al. | 2404.01587 | link |
2024-04-01 | On Train-Test Class Overlap and Detection for Image Retrieval | Chull Hwan Song et.al. | 2404.01524 | link |
2024-04-01 | NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification | Juyeop Han et.al. | 2404.01400 | null |
2024-03-31 | On the Estimation of Image-matching Uncertainty in Visual Place Recognition | Mubariz Zaffar et.al. | 2404.00546 | null |
2024-03-31 | NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation | Diwei Sheng et.al. | 2404.00504 | null |
2024-03-30 | SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs | Yang Miao et.al. | 2404.00469 | null |
2024-03-30 | Do Vision-Language Models Understand Compound Nouns? | Sonal Kumar et.al. | 2404.00419 | link |
2024-04-05 | FairRAG: Fair Human Generation via Fair Retrieval Augmentation | Robik Shrestha et.al. | 2403.19964 | null |
2024-03-28 | JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition | Gabriele Berton et.al. | 2403.19787 | link |
2024-03-28 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | Kai Zhang et.al. | 2403.19651 | link |
2024-03-27 | AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation | Changkun Liu et.al. | 2403.18281 | null |
2024-03-26 | Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Dongjin Kim et.al. | 2403.17420 | link |
2024-03-25 | Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras | Gokul B. Nair et.al. | 2403.16425 | link |
2024-03-24 | Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval | Yucheng Suo et.al. | 2403.16005 | link |
2024-03-24 | BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval | Yinda Chen et.al. | 2403.15992 | null |
2024-03-22 | Long-CLIP: Unlocking the Long-Text Capability of CLIP | Beichen Zhang et.al. | 2403.15378 | link |
2024-03-22 | A Multimodal Approach for Cross-Domain Image Retrieval | Lucas Iijima et.al. | 2403.15152 | null |
2024-03-22 | Piecewise-Linear Manifolds for Deep Metric Learning | Shubhang Bhatnagar et.al. | 2403.14977 | null |
2024-03-21 | Enhancing Historical Image Retrieval with Compositional Cues | Tingyu Lin et.al. | 2403.14287 | link |
2024-03-20 | Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval | Aymene Berriche et.al. | 2403.13747 | null |
2024-03-20 | Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval | Haoyu Liu et.al. | 2403.13317 | null |
2024-03-19 | Learning Neural Volumetric Pose Features for Camera Localization | Jingyu Lin et.al. | 2403.12800 | null |
2024-03-19 | Quantixar: High-performance Vector Data Management System | Gulshan Yadav et.al. | 2403.12583 | null |
2024-03-17 | 3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization | Peng Jiang et.al. | 2403.11367 | null |
2024-03-17 | MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data | Paul S. Scotti et.al. | 2403.11207 | link |
2024-03-16 | Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval | Shunsuke Tsubaki et.al. | 2403.10756 | null |
2024-03-16 | Vector search with small radiuses | Gergely Szilvasy et.al. | 2403.10746 | null |
2024-03-13 | Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer | Kenta Tsukahara et.al. | 2403.10552 | null |
2024-03-20 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297 | link |
2024-03-15 | Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline | Fangming Yuan et.al. | 2403.10283 | null |
2024-03-14 | The NeRFect Match: Exploring NeRF Features for Visual Localization | Qunjie Zhou et.al. | 2403.09577 | null |
2024-03-14 | VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition | Benjamin Ramtoula et.al. | 2403.09025 | null |
2024-03-13 | PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models | Siddharth Mishra-Sharma et.al. | 2403.08851 | link |
2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
2024-03-12 | It’s All About Your Sketch: Democratising Sketch Control in Diffusion Models | Subhadeep Koley et.al. | 2403.07234 | link |
2024-03-12 | You’ll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval | Subhadeep Koley et.al. | 2403.07222 | null |
2024-03-12 | Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers | Subhadeep Koley et.al. | 2403.07214 | null |
2024-03-11 | How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? | Subhadeep Koley et.al. | 2403.07203 | null |
2024-03-11 | EarthLoc: Astronaut Photography Localization by Indexing Earth from Space | Gabriele Berton et.al. | 2403.06758 | link |
2024-03-11 | BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues | Fudong Ge et.al. | 2403.06600 | link |
2024-03-11 | Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology | Stefan Denner et.al. | 2403.06567 | link |
2024-03-10 | RTAB-Map as an Open-Source Lidar and Visual SLAM Library for Large-Scale and Long-Term Online Operation | Mathieu Labbé et.al. | 2403.06341 | null |
2024-03-10 | Texture image retrieval using a classification and contourlet-based features | Asal Rouhafzay et.al. | 2403.06048 | null |
2024-03-11 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002 | link |
2024-03-11 | Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed | Yifan Wang et.al. | 2403.04765 | null |
2024-03-07 | mmPlace: Robust Place Recognition with Intermediate Frequency Signal of Low-cost Single-chip Millimeter Wave Radar | Chengzhen Meng et.al. | 2403.04703 | null |
2024-03-06 | Self-supervised Photographic Image Layout Representation Learning | Zhaoran Zhao et.al. | 2403.03740 | link |
2024-03-04 | Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models | Benedikt Blumenstiel et.al. | 2403.02059 | link |
2024-03-03 | Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval | Yongchao Du et.al. | 2403.01431 | null |
2024-03-01 | Asymmetric Feature Fusion for Image Retrieval | Hui Wu et.al. | 2403.00671 | null |
2024-03-01 | Structure Similarity Preservation Learning for Asymmetric Image Retrieval | Hui Wu et.al. | 2403.00648 | link |
2024-02-29 | CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition | Feng Lu et.al. | 2402.19231 | link |
2024-02-28 | Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport | Bin Li et.al. | 2402.18411 | link |
2024-02-28 | Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning | Hanyao Wang et.al. | 2402.18400 | null |
2024-02-28 | Representing 3D sparse map points and lines for camera relocalization | Bach-Thuan Bui et.al. | 2402.18011 | link |
2024-02-27 | Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control | Thong Nguyen et.al. | 2402.17535 | link |
2024-02-29 | Active propulsion noise shaping for multi-rotor aircraft localization | Gabriele Serussi et.al. | 2402.17289 | link |
2024-02-27 | NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer | Bingxi Liu et.al. | 2402.17159 | link |
2024-02-25 | Deep Homography Estimation for Visual Place Recognition | Feng Lu et.al. | 2402.16086 | link |
2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961 | link |
2024-02-28 | Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries | Zijun Long et.al. | 2402.15276 | null |
2024-02-23 | Fine-tuning CLIP Text Encoders with Two-step Paraphrasing | Hyunjae Kim et.al. | 2402.15120 | null |
2024-02-22 | Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition | Feng Lu et.al. | 2402.14505 | link |
2024-02-16 | Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition | Chenming Hu et.al. | 2402.10476 | null |
2024-02-15 | Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task | Mirko Nava et.al. | 2402.09886 | link |
2024-02-14 | Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency | Yannis Kalantidis et.al. | 2402.09237 | null |
2024-02-13 | Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Xiangming Gu et.al. | 2402.08567 | link |
2024-02-13 | Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang et.al. | 2402.08359 | link |
2024-02-10 | Semantic Object-level Modeling for Robust Visual Camera Relocalization | Yifan Zhu et.al. | 2402.06951 | null |
2024-02-09 | Large Language Models for Captioning and Retrieving Remote Sensing Images | João Daniel Silva et.al. | 2402.06475 | null |
2024-02-09 | PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes | Xinggang Hu et.al. | 2402.06131 | null |
2024-02-21 | MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction | Heng Zhou et.al. | 2402.03762 | null |
2024-02-04 | Region-Based Representations Revisited | Michal Shlapentokh-Rothman et.al. | 2402.02352 | link |
2024-02-03 | Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization | Bo Yang et.al. | 2402.02141 | link |
2024-02-01 | BrainSLAM: SLAM on Neural Population Activity Data | Kipp Freud et.al. | 2402.00588 | null |
2024-02-01 | Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering | Tianxiao Gao et.al. | 2402.00330 | link |
2024-01-31 | Improved Scene Landmark Detection for Camera Localization | Tien Do et.al. | 2401.18083 | link |
2024-01-31 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
2024-01-29 | Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors | Shiyin Dong et.al. | 2401.16459 | null |
2024-01-29 | Cross-Modal Coordination Across a Diverse Set of Input Modalities | Jorge Sánchez et.al. | 2401.16347 | null |
2024-01-29 | Regressing Transformers for Data-efficient Visual Place Recognition | María Leyva-Vallina et.al. | 2401.16304 | null |
2024-01-27 | Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval | Ayush Dubey et.al. | 2401.15362 | null |
2024-01-24 | Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode | Naresh Kumar Lahajal et.al. | 2401.13613 | null |
2024-01-23 | PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion | Shyam Sundar Kannan et.al. | 2401.13082 | null |
2024-01-23 | SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization | Mingyang Li et.al. | 2401.13076 | link |
2024-01-25 | CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios | Xiangshuo Qiao et.al. | 2401.10475 | link |
2024-01-19 | PhotoScout: Synthesis-Powered Multi-Modal Image Search | Celeste Barnaby et.al. | 2401.10464 | null |
2024-01-19 | Cross-Modality Perturbation Synergy Attack for Person Re-identification | Yunpeng Gong et.al. | 2401.10090 | null |
2024-01-16 | Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging | Zahra Tabatabaei et.al. | 2401.08272 | null |
2024-01-16 | Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments | Bruno Arcanjo et.al. | 2401.08263 | null |
2024-01-15 | Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing | Jakob Hackstein et.al. | 2401.07782 | link |
2024-01-14 | HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval | Zexuan Qiu et.al. | 2401.07212 | link |
2024-01-11 | UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization | Rouwan Wu et.al. | 2401.05971 | link |
2024-01-10 | Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval | Eunyi Lyou et.al. | 2401.04860 | link |
2024-01-05 | Benchmarking PathCLIP for Pathology Image Analysis | Sunyi Zheng et.al. | 2401.02651 | null |
2024-01-03 | DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM with Joint Semantic Encoding | Mingrui Li et.al. | 2401.01545 | null |
2024-01-02 | BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving | Dafeng Wei et.al. | 2401.01065 | null |
2023-12-31 | Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval | Liang Wang et.al. | 2401.00371 | link |
2023-12-29 | Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering | Long-Kun Du et.al. | 2401.00032 | null |
2023-12-27 | LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization | Sai Shubodh Puligilla et.al. | 2312.16648 | null |
2023-12-26 | Recursive Distillation for Open-Set Distributed Robot Localization | Kenta Tsukahara et.al. | 2312.15897 | null |
2023-12-24 | Residual Learning for Image Point Descriptors | Rashik Shrestha et.al. | 2312.15471 | null |
2023-12-23 | CaLDiff: Camera Localization in NeRF via Pose Diffusion | Rashik Shrestha et.al. | 2312.15242 | null |
2023-12-20 | Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition | Bruno Arcanjo et.al. | 2312.12995 | null |
2023-12-19 | VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering | Chun-Mei Feng et.al. | 2312.12273 | link |
2023-12-18 | Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback | Boaz Lerner et.al. | 2312.11078 | link |
2023-12-17 | PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields | Boming Zhao et.al. | 2312.10649 | null |
2023-12-17 | DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition | Sijie Wang et.al. | 2312.10616 | link |
2023-12-16 | Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval | Decheng Liu et.al. | 2312.10320 | link |
2023-12-15 | Data-Efficient Multimodal Fusion on a Single GPU | Noël Vouitsis et.al. | 2312.10144 | link |
2023-12-13 | Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques | Hamed Qazanfari et.al. | 2312.10089 | null |
2023-12-15 | Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval | Zhe Ma et.al. | 2312.09716 | link |
2023-12-14 | Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition | Oliver Grainge et.al. | 2312.09028 | null |
2023-12-14 | Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking | Shitong Sun et.al. | 2312.08924 | null |
2023-12-13 | C-BEV: Contrastive Bird’s Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation | Florian Fervers et.al. | 2312.08060 | null |
2023-12-12 | Contextually Affinitive Neighborhood Refinery for Deep Clustering | Chunlin Yu et.al. | 2312.07806 | link |
2023-12-12 | Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval | Qiwei Tian et.al. | 2312.07364 | link |
2023-12-12 | Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection | Jonathan J. Y. Kim et.al. | 2312.06991 | null |
2023-12-11 | Dynamic Weighted Combiner for Mixed-Modal Image Retrieval | Fuxiang Huang et.al. | 2312.06179 | link |
2023-12-06 | Lite-Mind: Towards Efficient and Versatile Brain Representation Network | Zixuan Gong et.al. | 2312.03781 | link |
2023-12-08 | FreestyleRet: Retrieving Images from Style-Diversified Queries | Hao Li et.al. | 2312.02428 | link |
2023-12-04 | Implicit Learning of Scene Geometry from Poses for Global Localization | Mohammad Altillawi et.al. | 2312.02029 | null |
2023-12-04 | Language-only Efficient Training of Zero-shot Composed Image Retrieval | Geonmo Gu et.al. | 2312.01998 | link |
2023-12-03 | G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training | Che Liu et.al. | 2312.01522 | link |
2023-12-01 | Improve Supervised Representation Learning with Masked Image Modeling | Kaifeng Chen et.al. | 2312.00950 | null |
2023-12-05 | Grounding Everything: Emerging Localization Properties in Vision-Language Transformers | Walid Bousselham et.al. | 2312.00878 | link |
2023-12-01 | Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras | Mohammad Altillawi et.al. | 2312.00500 | null |
2023-11-30 | HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance | Zhuohao Yin et.al. | 2311.18273 | link |
2023-11-30 | Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models | Raviteja Vemulapalli et.al. | 2311.18237 | link |
2023-11-29 | Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce | Chang Liu et.al. | 2311.17954 | null |
2023-11-28 | Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames | Chao Chen et.al. | 2311.17940 | null |
2023-11-29 | 360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries | Huajian Huang et.al. | 2311.17389 | link |
2023-11-27 | Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation | Samuele Poppi et.al. | 2311.16254 | link |
2023-11-27 | Optimal Transport Aggregation for Visual Place Recognition | Sergio Izquierdo et.al. | 2311.15937 | link |
2023-11-27 | AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval | Shicheng Xu et.al. | 2311.14084 | link |
2023-11-23 | 3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology | Asma Ben Abacha et.al. | 2311.13752 | link |
2023-11-22 | Medical Image Retrieval Using Pretrained Embeddings | Farnaz Khun Jush et.al. | 2311.13547 | null |
2023-11-22 | Applications of Spiking Neural Networks in Visual Place Recognition | Somayeh Hussaini et.al. | 2311.13186 | link |
2023-11-21 | Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval | Xiu-Shen Wei et.al. | 2311.12894 | null |
2023-11-21 | Towards Accurate Loop Closure Detection in Semantic SLAM with 3D Semantic Covisibility Graphs | Zhentian Qian et.al. | 2311.12245 | null |
2023-11-19 | From Categories to Classifier: Name-Only Continual Learning by Exploring the Web | Ameya Prabhu et.al. | 2311.11293 | null |
2023-11-18 | Lesion Search with Self-supervised Learning | Kristin Qi et.al. | 2311.11014 | null |
2023-11-15 | Flow reconstruction and particle characterization from inertial Lagrangian tracks | Ke Zhou et.al. | 2311.09076 | null |
2023-11-15 | Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval | Junyang Chen et.al. | 2311.07622 | link |
2023-11-13 | VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search | Shuting He et.al. | 2311.07514 | null |
2023-11-10 | Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval | Xin Lu et.al. | 2311.06067 | null |
2023-11-08 | Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model | Junya Shiraishi et.al. | 2311.04788 | null |
2023-11-08 | Training CLIP models on Data from Scientific Papers | Calvin Metzger et.al. | 2311.04711 | link |
2023-11-07 | DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding | Kehinde Ajayi et.al. | 2311.04098 | link |
2023-11-06 | Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences | Zador Pataki et.al. | 2311.03345 | null |
2023-11-06 | FocusTune: Tuning Visual Localization through Focus-Guided Sampling | Son Tung Nguyen et.al. | 2311.02872 | link |
2023-11-01 | DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing | Gaoshuang Huang et.al. | 2311.00230 | link |
2023-10-29 | Identifiable Contrastive Learning with Automatic Feature Importance Discovery | Qi Zhang et.al. | 2310.18904 | link |
2023-10-27 | LipSim: A Provably Robust Perceptual Similarity Metric | Sara Ghazanfari et.al. | 2310.18274 | link |
2023-10-27 | Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation | Susu Fang et.al. | 2310.17879 | null |
2023-10-25 | FoundLoc: Vision-based Onboard Aerial Localization in the Wild | Yao He et.al. | 2310.16299 | null |
2023-10-24 | Cross-view Self-localization from Synthesized Scene-graphs | Ryogo Yamamoto et.al. | 2310.15504 | null |
2023-10-23 | Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval | Xu Yuan et.al. | 2310.14637 | link |
2023-10-21 | Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation | Anastasia Kritharoula et.al. | 2310.14025 | link |
2023-10-20 | FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer | Xinyu Zhang et.al. | 2310.13605 | null |
2023-10-20 | CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants | Shaoan Wang et.al. | 2310.13320 | link |
2023-10-27 | Representation Learning via Consistent Assignment of Views over Random Partitions | Thalles Silva et.al. | 2310.12692 | link |
2023-10-18 | Evaluating the Fairness of Discriminative Foundation Models in Computer Vision | Junaid Ali et.al. | 2310.11867 | link |
2023-10-17 | Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification | Shuanglin Yan et.al. | 2310.11210 | null |
2023-10-16 | Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People | Dharmateja Adapa et.al. | 2310.10290 | null |
2023-10-16 | EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge | Tom Bryan et.al. | 2310.10050 | null |
2023-10-15 | CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes | Yulei Qin et.al. | 2310.09761 | link |
2023-10-13 | Pairwise Similarity Learning is SimPLE | Yandong Wen et.al. | 2310.09449 | link |
2023-10-13 | Vision-by-Language for Training-Free Compositional Image Retrieval | Shyamgopal Karthik et.al. | 2310.09291 | link |
2023-10-12 | Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning | Shiyang Yan et.al. | 2310.08390 | null |
2023-10-12 | Jointly Optimized Global-Local Visual Localization of UAVs | Haoling Li et.al. | 2310.08082 | null |
2023-10-10 | Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization | Le Chen et.al. | 2310.06984 | null |
2023-10-10 | Distillation Improves Visual Place Recognition for Low-Quality Queries | Anbang Yang et.al. | 2310.06906 | link |
2023-10-10 | Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets | Jiajun Zhang et.al. | 2310.06566 | null |
2023-10-10 | Topological RANSAC for instance verification and retrieval without fine-tuning | Guoyuan An et.al. | 2310.06486 | null |
2023-10-10 | 3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments | Ghanta Sai Krishna et.al. | 2310.06385 | null |
2023-10-09 | Collaborative Visual Place Recognition | Yiming Li et.al. | 2310.05541 | null |
2023-10-09 | Sentence-level Prompts Benefit Composed Image Retrieval | Yang Bai et.al. | 2310.05473 | link |
2023-10-08 | AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition | Feng Lu et.al. | 2310.05184 | link |
2023-10-08 | LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization | Artem Nenashev et.al. | 2310.05134 | null |
2023-10-12 | ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer | Yifan Xu et.al. | 2310.04099 | null |
2023-10-06 | Sub-token ViT Embedding via Stochastic Resonance Transformers | Dong Lao et.al. | 2310.03967 | link |
2023-10-04 | Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach | Matthew Hanlon et.al. | 2310.02650 | null |
2023-10-02 | NEUCORE: Neural Concept Reasoning for Composed Image Retrieval | Shu Zhao et.al. | 2310.01358 | null |
2023-10-02 | Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images | Georg Bökman et.al. | 2310.01092 | null |
2023-10-05 | PlaceNav: Topological Navigation through Place Recognition | Lauri Suomela et.al. | 2309.17260 | null |
2023-09-29 | Segment Anything Model is a Good Teacher for Local Feature Learning | Jingqian Wu et.al. | 2309.16992 | link |
2023-09-28 | Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning | Albert Mohwald et.al. | 2309.16351 | link |
2023-09-28 | FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding | Pengxiang Wu et.al. | 2309.16249 | link |
2023-09-28 | Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval | Yuanmin Tang et.al. | 2309.16137 | link |
2023-09-27 | GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization | Vicente Vivanco Cepeda et.al. | 2309.16020 | link |
2023-09-27 | Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization | Zhenbo Song et.al. | 2309.15556 | null |
2023-09-26 | Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features | Hila Levi et.al. | 2309.14999 | null |
2023-09-23 | Resolving References in Visually-Grounded Dialogue via Text Generation | Bram Willemsen et.al. | 2309.13430 | link |
2023-09-21 | Face Identity-Aware Disentanglement in StyleGAN | Adrian Suwała et.al. | 2309.12033 | null |
2023-09-21 | On-the-Fly SfM: What you capture is What you get | Zongqian Zhan et.al. | 2309.11883 | link |
2023-09-20 | 2D-3D Pose Tracking with Multi-View Constraints | Huai Yu et.al. | 2309.11335 | null |
2023-09-19 | VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition | Adam D. Hines et.al. | 2309.10225 | link |
2023-09-18 | DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach | Chenghao Xu et.al. | 2309.09879 | null |
2023-09-18 | Decompose Semantic Shifts for Composed Image Retrieval | Xingyu Yang et.al. | 2309.09531 | null |
2023-09-16 | Efficient Object Rearrangement via Multi-view Fusion | Dehao Huang et.al. | 2309.08994 | null |
2023-09-16 | DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF | Mert Asim Karaoglu et.al. | 2309.08927 | link |
2023-09-16 | Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning | Pengyu Yin et.al. | 2309.08914 | link |
2023-09-15 | Active Learning for Fine-Grained Sketch-Based Image Retrieval | Himanshu Thakur et.al. | 2309.08743 | null |
2023-09-15 | Optimization of Rank Losses for Image Retrieval | Elias Ramzi et.al. | 2309.08250 | link |
2023-09-18 | Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer | Yaoting Wang et.al. | 2309.07929 | link |
2023-09-14 | EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization | Minjung Kim et.al. | 2309.07471 | link |
2023-09-13 | RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline | Mirko Usuelli et.al. | 2309.07094 | null |
2023-09-11 | Towards Content-based Pixel Retrieval in Revisited Oxford and Paris | Guoyuan An et.al. | 2309.05438 | link |
2023-09-08 | Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning | Hiroki Nakamura et.al. | 2309.04148 | null |
2023-09-05 | Magnetic Navigation using Attitude-Invariant Magnetic Field Information for Loop Closure Detection | Natalia Pavlasek et.al. | 2309.02394 | null |
2023-09-05 | Dual Relation Alignment for Composed Image Retrieval | Xintong Jiang et.al. | 2309.02169 | null |
2023-09-04 | NLLB-CLIP – train performant multilingual image retrieval model on a budget | Alexander Visheratin et.al. | 2309.01859 | null |
2023-09-04 | Target-Guided Composed Image Retrieval | Haokun Wen et.al. | 2309.01366 | null |
2023-09-02 | Deep supervised hashing for fast retrieval of radio image cubes | Steven Ndung’u et.al. | 2309.00932 | null |
2023-08-31 | Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval | Prateksha Udhayanan et.al. | 2308.16649 | null |
2023-08-28 | Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics | Nils Böhne et.al. | 2308.14786 | null |
2023-08-28 | CoVR: Learning Composed Video Retrieval from Web Video Captions | Lucas Ventura et.al. | 2308.14746 | link |
2023-08-27 | Deep Learning for Visual Localization and Mapping: A Survey | Changhao Chen et.al. | 2308.14039 | null |
2023-08-26 | Learning Efficient Representations for Image-Based Patent Retrieval | Hongsong Wang et.al. | 2308.13749 | null |
2023-08-25 | Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers | Mohammad Javad Rajabi et.al. | 2308.13671 | null |
2023-08-24 | Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities | Jinze Bai et.al. | 2308.12966 | link |
2023-08-23 | Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval | Huafeng Li et.al. | 2308.11994 | null |
2023-08-23 | OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes | Tao Xie et.al. | 2308.11928 | link |
2023-08-22 | Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features | Alberto Baldrati et.al. | 2308.11485 | link |
2023-08-22 | GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training | Xinchi Deng et.al. | 2308.11331 | null |
2023-08-22 | LDP-Feat: Image Features with Local Differential Privacy | Francesco Pittaluga et.al. | 2308.11223 | null |
2023-08-21 | EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition | Gabriele Berton et.al. | 2308.10832 | link |
2023-08-20 | FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory | Anwesan Pal et.al. | 2308.10170 | null |
2023-08-18 | 3D Model-free Visual localization System from Essential Matrix under Local Planar Motion | Yanmei Jiao et.al. | 2308.09566 | null |
2023-08-17 | FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings | Yulin Su et.al. | 2308.09012 | link |
2023-08-16 | Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval | Aishwarya Venkataramanan et.al. | 2308.08431 | link |
2023-08-16 | Ranking-aware Uncertainty for Text-guided Image Retrieval | Junyang Chen et.al. | 2308.08131 | null |
2023-08-19 | Global Features are All You Need for Image Retrieval and Reranking | Shihao Shao et.al. | 2308.06954 | link |
2023-08-14 | MixBCT: Towards Self-Adapting Backward-Compatible Training | Yu Liang et.al. | 2308.06948 | link |
2023-08-10 | KS-APR: Keyframe Selection for Robust Absolute Pose Regression | Changkun Liu et.al. | 2308.05459 | null |
2023-08-09 | AspectMMKG: A Multi-modal Knowledge Graph with Aspect-aware Entities | Jingdan Zhang et.al. | 2308.04992 | link |
2023-08-08 | Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval | Yi Bin et.al. | 2308.04343 | link |
2023-08-08 | Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval | Yunquan Zhu et.al. | 2308.04008 | link |
2023-08-05 | A Comprehensive Analysis of Real-World Image Captioning and Scene Identification | Sai Suprabhanu Nallapaneni et.al. | 2308.02833 | null |
2023-08-03 | Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies | Eunsuk Seo et.al. | 2308.01871 | null |
2023-08-01 | AnyLoc: Towards Universal Visual Place Recognition | Nikhil Keetha et.al. | 2308.00688 | link |
2023-07-31 | Guiding Image Captioning Models Toward More Specific Captions | Simon Kornblith et.al. | 2307.16686 | null |
2023-07-31 | Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks | Kousik Rajesh et.al. | 2307.16395 | null |
2023-07-28 | D2S: Representing local descriptors and global scene coordinates for camera relocalization | Bach-Thuan Bui et.al. | 2307.15250 | link |
2023-07-26 | Neural-based Cross-modal Search and Retrieval of Artwork | Yan Gong et.al. | 2307.14244 | null |
2023-07-26 | Boon: A Neural Search Engine for Cross-Modal Information Retrieval | Yan Gong et.al. | 2307.14240 | null |
2023-07-25 | Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network | Chull Hwan Song et.al. | 2307.13254 | null |
2023-07-28 | SACReg: Scene-Agnostic Coordinate Regression for Visual Localization | Jerome Revaud et.al. | 2307.11702 | null |
2023-07-19 | Lazy Visual Localization via Motion Averaging | Siyan Dong et.al. | 2307.09981 | null |
2023-07-19 | Quantum Optics based Algorithm for Measuring the Similarity between Images | Vivek Mehta et.al. | 2307.09789 | null |
2023-07-18 | Jean-Luc Picard at Touché 2023: Comparing Image Generation, Stance Detection and Feature Matching for Image Retrieval for Arguments | Max Moebius et.al. | 2307.09172 | null |
2023-07-18 | 3D-SeqMOS: A Novel Sequential 3D Moving Object Segmentation in Autonomous Driving | Qipeng Li et.al. | 2307.09044 | null |
2023-07-19 | Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation | Rundong Luo et.al. | 2307.08779 | null |
2023-07-17 | Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition | Gabriele Trivigno et.al. | 2307.08417 | link |
2023-07-17 | Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification | Tengfei Liang et.al. | 2307.08316 | link |
2023-07-17 | NDT-Map-Code: A 3D global descriptor for real-time loop closure detection in lidar SLAM | Lizhou Liao et.al. | 2307.08221 | link |
2023-07-20 | Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer | Yujiao Shi et.al. | 2307.08015 | link |
2023-07-10 | Phoneme-retrieval; voice recognition; vowels recognition | Brunello Tirozzi et.al. | 2307.07407 | null |
2023-07-14 | Risk Controlled Image Retrieval | Kaiwen Cai et.al. | 2307.07336 | link |
2023-07-11 | ResMatch: Residual Attention Learning for Local Feature Matching | Yuxin Deng et.al. | 2307.05180 | link |
2023-07-11 | Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification | Yi Liao et.al. | 2307.05017 | null |
2023-07-10 | Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor | San Jiang et.al. | 2307.04520 | null |
2023-07-10 | RaPlace: Place Recognition for Imaging Radar using Radon Transform and Mutable Threshold | Hyesu Jang et.al. | 2307.04321 | link |
2023-07-08 | Calibration-Aware Margin Loss: Pushing the Accuracy-Calibration Consistency Pareto Frontier for Deep Metric Learning | Qin Zhang et.al. | 2307.04047 | null |
2023-07-04 | Unsupervised Quality Prediction for Improved Single-Frame and Weighted Sequential Visual Place Recognition | Helen Carson et.al. | 2307.01464 | null |
2023-07-04 | Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network | Zizhuo Li et.al. | 2307.01447 | null |
2023-07-03 | Cross-modal Place Recognition in Image Databases using Event-based Sensors | Xiang Ji et.al. | 2307.01047 | null |
2023-06-30 | DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place Recognition under Adverse Conditions | Stephen Hausler et.al. | 2306.17536 | null |
2023-06-30 | Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization | Stephen Hausler et.al. | 2306.17529 | null |
2023-06-27 | Dental CLAIRES: Contrastive LAnguage Image REtrieval Search for Dental Research | Tanjida Kabir et.al. | 2306.15651 | null |
2023-06-27 | Mean Field Theory in Deep Metric Learning | Takuya Furusawa et.al. | 2306.15368 | null |
2023-06-26 | Hierarchical Matching and Reasoning for Multi-Query Image Retrieval | Zhong Ji et.al. | 2306.14460 | link |
2023-06-25 | Enhancing Dynamic Image Advertising with Vision-Language Pre-training | Zhoufutu Wen et.al. | 2306.14112 | null |
2023-06-23 | Catching Image Retrieval Generalization | Maksim Zhdanov et.al. | 2306.13357 | null |
2023-06-22 | Deep Metric Learning with Soft Orthogonal Proxies | Farshad Saberi-Movahed et.al. | 2306.13055 | null |
2023-06-22 | What to Learn: Features, Image Transformations, or Both? | Yuxuan Chen et.al. | 2306.13040 | null |
2023-06-22 | Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval | Katrin Glinka et.al. | 2306.12843 | null |
2023-06-26 | Annotation Cost Efficient Active Learning for Content Based Image Retrieval | Julia Henkel et.al. | 2306.11605 | null |
2023-06-19 | Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning | Shivaen Ramshetty et.al. | 2306.11065 | link |
2023-06-18 | LiDAR-Based Place Recognition For Autonomous Driving: A Survey | Pengcheng Shi et.al. | 2306.10561 | link |
2023-06-15 | Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization | Dror Aiger et.al. | 2306.09012 | link |
2023-06-15 | Prompt Performance Prediction for Generative IR | Nicolas Bizzozzero et.al. | 2306.08915 | null |
2023-06-15 | Graph Convolution Based Efficient Re-Ranking for Visual Retrieval | Yuqi Zhang et.al. | 2306.08792 | link |
2023-06-13 | GeneCIS: A Benchmark for General Conditional Image Similarity | Sagar Vaze et.al. | 2306.07969 | null |
2023-06-13 | MOFI: Learning Image Representations from Noisy Entity Annotated Images | Wentao Wu et.al. | 2306.07952 | link |
2023-06-12 | Zero-shot Composed Text-Image Retrieval | Yikun Liu et.al. | 2306.07272 | link |
2023-06-12 | Sticker820K: Empowering Interactive Retrieval with Stickers | Sijie Zhao et.al. | 2306.06870 | null |
2023-06-11 | Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models | Yuguang Yang et.al. | 2306.06691 | null |
2023-06-03 | Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval | Xu Zhang et.al. | 2306.02092 | null |
2023-06-03 | Class Anchor Margin Loss for Content-Based Image Retrieval | Alexandru Ghita et.al. | 2306.00630 | null |
2023-05-31 | Chatting Makes Perfect – Chat-based Image Retrieval | Matan Levy et.al. | 2305.20062 | link |
2023-05-31 | Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization | Junan Chen et.al. | 2305.20044 | null |
2023-05-30 | A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation | Omar Seddati et.al. | 2305.18988 | null |
2023-05-29 | Synfeal: A Data-Driven Simulator for End-to-End Camera Localization | Daniel Coelho et.al. | 2305.18260 | link |
2023-05-29 | Nanoscale visualization of the thermally-driven evolution of antiferromagnetic domains in FeTe thin films | Shrinkhala Sharma et.al. | 2305.18197 | null |
2023-05-29 | TReR: A Lightweight Transformer Re-Ranking Approach for 3D LiDAR Place Recognition | Tiago Barros et.al. | 2305.18013 | null |
2023-05-28 | ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval | Jiapeng Wang et.al. | 2305.17652 | null |
2023-06-01 | FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing | Zhuang Li et.al. | 2305.17497 | link |
2023-05-27 | Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation | Yueh-Cheng Huang et.al. | 2305.17463 | null |
2023-05-26 | Generating Images with Multimodal Language Models | Jing Yu Koh et.al. | 2305.17216 | link |
2023-05-25 | Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder | Zheyuan Liu et.al. | 2305.16304 | link |
2023-05-23 | Leveraging BEV Representation for 360-degree Visual Place Recognition | Xuecheng Xu et.al. | 2305.13814 | link |
2023-05-23 | EDIS: Entity-Driven Image Search over Multimodal Web Content | Siqi Liu et.al. | 2305.13631 | link |
2023-05-20 | DAC: Detector-Agnostic Spatial Covariances for Deep Local Features | Javier Tirado-Garín et.al. | 2305.12250 | link |
2023-05-19 | Towards More Transparent and Accurate Cancer Diagnosis with an Unsupervised CAE Approach | Zahra Tabatabaei et.al. | 2305.11728 | null |
2023-05-19 | Learning Sequence Descriptor based on Spatiotemporal Attention for Visual Place Recognition | Fenglin Zhang et.al. | 2305.11467 | link |
2023-05-12 | IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images | Varuna Krishna et.al. | 2305.10438 | link |
2023-05-17 | Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval | Haokun Wen et.al. | 2305.09979 | null |
2023-05-13 | Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance | Xinyu Lin et.al. | 2305.07943 | link |
2023-05-11 | Foundations of Spatial Perception for Robotics: Hierarchical Representations and Real-time Systems | Nathan Hughes et.al. | 2305.07154 | link |
2023-05-09 | Visual Place Recognition with Low-Resolution Images | Mihnea-Alexandru Tomita et.al. | 2305.05776 | null |
2023-05-09 | Vision-Language Models in Remote Sensing: Current Progress and Future Trends | Congcong Wen et.al. | 2305.05726 | null |
2023-05-09 | An Evaluation and Ranking of Different Voting Schemes for Improved Visual Place Recognition | Maria Waheed et.al. | 2305.05705 | null |
2023-05-09 | Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query | Ho Hin Lee et.al. | 2305.05598 | null |
2023-05-09 | ColonMapper: topological mapping and localization for colonoscopy | Javier Morlana et.al. | 2305.05546 | null |
2023-05-09 | Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization | Clémentin Boittiaux et.al. | 2305.05301 | link |
2023-05-09 | Patch-DrosoNet: Classifying Image Partitions With Fly-Inspired Models For Lightweight Visual Place Recognition | Bruno Arcanjo et.al. | 2305.05256 | null |
2023-05-09 | Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval | Shiyin Dong et.al. | 2305.05144 | null |
2023-05-08 | Hierarchical Visual Localization Based on Sparse Feature Pyramid for Adaptive Reduction of Keypoint Map Size | Andrei Potapov et.al. | 2305.04856 | null |
2023-05-08 | Privacy-Preserving Representations are not Enough – Recovering Scene Content from Camera Poses | Kunal Chelani et.al. | 2305.04603 | link |
2023-05-06 | Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer | Minyi Zhao et.al. | 2305.04072 | null |
2023-05-06 | Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing | Swagatika Dash et.al. | 2305.03881 | link |
2023-05-05 | COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? | Arijit Ray et.al. | 2305.03689 | link |
2023-05-05 | HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer | Shuzhe Wang et.al. | 2305.03595 | null |
2023-05-05 | WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval | Zahra Tabatabaei et.al. | 2305.03383 | null |
2023-05-04 | Boundary-aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval | Tan Pan et.al. | 2305.02610 | link |
2023-05-03 | Learning-based Relational Object Matching Across Views | Cathrin Elich et.al. | 2305.02398 | null |
2023-05-05 | A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text | Yunxin Li et.al. | 2305.02265 | link |
2023-05-03 | AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation | Shentong Mo et.al. | 2305.01836 | null |
2023-04-30 | Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection | Jie Ren et.al. | 2305.00435 | null |
2023-04-28 | SFD2: Semantic-guided Feature Detection and Description | Fei Xue et.al. | 2304.14845 | link |
2023-04-28 | Quantum enhanced non-interferometric quantitative phase imaging | Giuseppe Ortolano et.al. | 2304.14727 | null |
2023-04-26 | Hydra-Multi: Collaborative Online Construction of 3D Scene Graphs with Multi-Robot Teams | Yun Chang et.al. | 2304.13487 | null |
2023-04-27 | STIR: Siamese Transformer for Image Retrieval Postprocessing | Aleksei Shabanov et.al. | 2304.13393 | null |
2023-04-25 | DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design | Jiahao Weng et.al. | 2304.12506 | null |
2023-04-24 | Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning | Lucas Pascotti Valem et.al. | 2304.12448 | link |
2023-04-23 | IDLL: Inverse Depth Line based Visual Localization in Challenging Environments | Wanting Li et.al. | 2304.11748 | null |
2023-04-23 | Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval | Mehdi Rafiei et.al. | 2304.11734 | null |
2023-04-17 | Features-over-the-Air: Contrastive Learning Enabled Cooperative Edge Inference | Haotian Wu et.al. | 2304.08221 | null |
2023-04-17 | NeRF-Loc: Visual Localization with Conditional Neural Radiance Field | Jianlin Liu et.al. | 2304.07979 | link |
2023-04-16 | Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification | Luca Piano et.al. | 2304.07883 | null |
2023-04-16 | Language Guided Local Infiltration for Interactive Image Retrieval | Fuxiang Huang et.al. | 2304.07747 | null |
2023-04-16 | Long-term Visual Localization with Mobile Sensors | Shen Yan et.al. | 2304.07691 | null |
2023-04-16 | Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging | Jielin Qiu et.al. | 2304.07675 | null |
2023-04-14 | CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression | Mubariz Zaffar et.al. | 2304.07426 | null |
2023-04-14 | FM-Loc: Using Foundation Models for Improved Vision-based Localization | Reihaneh Mirjalili et.al. | 2304.07058 | null |
2023-04-17 | Toward Real-Time Image Annotation Using Marginalized Coupled Dictionary Learning | Seyed Mahdi Roostaiyan et.al. | 2304.06907 | link |
2023-04-17 | You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization problem and dataset | Matteo Toso et.al. | 2304.06373 | link |
2023-04-12 | Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation | Yifeng Shi et.al. | 2304.06051 | link |
2023-04-12 | Visual Localization using Imperfect 3D Models from the Internet | Vojtech Panek et.al. | 2304.05947 | link |
2023-04-12 | Are Local Features All You Need for Cross-Domain Visual Place Recognition? | Giovanni Barbarani et.al. | 2304.05887 | link |
2023-04-12 | Unicom: Universal and Compact Representation Learning for Image Retrieval | Xiang An et.al. | 2304.05884 | link |
2023-04-12 | SGL: Structure Guidance Learning for Camera Localization | Xudong Zhang et.al. | 2304.05571 | null |
2023-04-14 | Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency | Xingwu Ji et.al. | 2304.05146 | link |
2023-04-10 | CAVL: Learning Contrastive and Adaptive Representations of Vision and Language | Shentong Mo et.al. | 2304.04399 | null |
2023-04-09 | Unsupervised Multi-Criteria Adversarial Detection in Deep Image Retrieval | Yanru Xiao et.al. | 2304.04228 | null |
2023-04-08 | SGIDN-LCD: An Appearance-based Loop Closure Detection Algorithm using Superpixel Grids and Incremental Dynamic Nodes | Baosheng Zhang et.al. | 2304.03872 | null |
2023-04-06 | $R^{2}$Former: Unified $R$etrieval and $R$ eranking Transformer for Place Recognition | Sijie Zhu et.al. | 2304.03410 | null |
2023-04-06 | Distributed formation-enforcing control for UAVs robust to observation noise in relative pose measurements | Viktor Walter et.al. | 2304.03057 | link |
2023-04-05 | Efficient OCR for Building a Diverse Digital History | Jacob Carlson et.al. | 2304.02737 | link |
2023-04-05 | LogoNet: a fine-grained network for instance-level logo sketch retrieval | Binbin Feng et.al. | 2304.02214 | link |
2023-04-04 | OrienterNet: Visual Localization in 2D Public Maps with Neural Matching | Paul-Edouard Sarlin et.al. | 2304.02009 | link |
2023-04-04 | Cross-Domain Image Captioning with Discriminative Finetuning | Roberto Dessì et.al. | 2304.01662 | link |
2023-04-02 | Learning Similarity between Scene Graphs and Images with Transformers | Yuren Cong et.al. | 2304.00590 | link |
2023-04-01 | NPR: Nocturnal Place Recognition in Street | Bingxi Liu et.al. | 2304.00276 | null |
2023-03-31 | Unsupervised crack detection on complex stone masonry surfaces | Panagiotis Agrafiotis et.al. | 2303.17989 | null |
2023-03-30 | If At First You Don’t Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval | Finlay G. C. Hudson et.al. | 2303.17703 | null |
2023-03-30 | Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime | Rhydian Windsor et.al. | 2303.17644 | null |
2023-03-30 | 3D Line Mapping Revisited | Shaohui Liu et.al. | 2303.17504 | link |
2023-03-30 | Methods and advancement of content-based fashion image retrieval: A Review | Amin Muhammad Shoib et.al. | 2303.17371 | null |
2023-03-30 | Adaptive Cross Batch Normalization for Metric Learning | Thalaiyasingam Ajanthan et.al. | 2303.17127 | null |
2023-03-30 | MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks | Weicheng Kuo et.al. | 2303.16839 | null |
2023-03-29 | Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval | Leo Sampaio Ferraz Ribeiro et.al. | 2303.16769 | null |
2023-03-29 | Bi-directional Training for Composed Image Retrieval via Text Prompt Learning | Zheyuan Liu et.al. | 2303.16604 | link |
2023-03-27 | Model Cascades for Efficient Image Search | Robert Hönig et.al. | 2303.15595 | null |
2023-03-27 | Zero-Shot Composed Image Retrieval with Textual Inversion | Alberto Baldrati et.al. | 2303.15247 | link |
2023-03-27 | What Can Human Sketches Do for Object Detection? | Pinaki Nath Chowdhury et.al. | 2303.15149 | null |
2023-03-25 | Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style | Fengyin Lin et.al. | 2303.14348 | link |
2023-03-24 | A-MuSIC: An Adaptive Ensemble System For Visual Place Recognition In Changing Environments | Bruno Arcanjo et.al. | 2303.14247 | null |
2023-03-24 | PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View | Ze Shi et.al. | 2303.14095 | link |
2023-03-24 | Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR | Aneeshan Sain et.al. | 2303.13779 | null |
2023-03-28 | CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not | Aneeshan Sain et.al. | 2303.13440 | null |
2023-03-22 | Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval | Xunguang Wang et.al. | 2303.12658 | null |
2023-03-21 | CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion | Geonmo Gu et.al. | 2303.11916 | link |
2023-03-21 | LIMITR: Leveraging Local Information for Medical Image-Text Representation | Gefen Dawidowicz et.al. | 2303.11755 | null |
2023-03-25 | Data-efficient Large Scale Place Recognition with Graded Similarity Supervision | Maria Leyva-Vallina et.al. | 2303.11739 | link |
2023-03-20 | Picture that Sketch: Photorealistic Image Generation from Abstract Sketches | Subhadeep Koley et.al. | 2303.11162 | null |
2023-03-19 | Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths | Ming Xu et.al. | 2303.10778 | link |
2023-03-17 | MRIS: A Multi-modal Retrieval Approach for Image Synthesis on Diverse Modalities | Boqi Chen et.al. | 2303.10249 | null |
2023-03-17 | IRGen: Generative Modeling for Image Retrieval | Yidan Zhang et.al. | 2303.10126 | link |
2023-03-16 | Data Roaming and Early Fusion for Composed Image Retrieval | Matan Levy et.al. | 2303.09429 | link |
2023-03-16 | Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval | Yi Xie et.al. | 2303.09230 | null |
2023-03-16 | Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space | Yuhang He et.al. | 2303.09192 | null |
2023-03-16 | Unsupervised Facial Expression Representation Learning with Contrastive Local Warping | Fanglei Xue et.al. | 2303.09034 | null |
2023-03-15 | A Triplet-loss Dilated Residual Network for High-Resolution Representation Learning in Image Retrieval | Saeideh Yousefzadeh et.al. | 2303.08398 | null |
2023-03-14 | Data-Free Sketch-Based Image Retrieval | Abhra Chaudhuri et.al. | 2303.07775 | link |
2023-03-14 | PATS: Patch Area Transportation with Subdivision for Local Feature Matching | Junjie Ni et.al. | 2303.07700 | null |
2023-03-10 | Robotic Applications of Pre-Trained Vision-Language Models to Various Recognition Behaviors | Kento Kawaharazuka et.al. | 2303.05674 | null |
2023-03-09 | Dominating Set Database Selection for Visual Place Recognition | Anastasiia Kornilova et.al. | 2303.05123 | null |
2023-03-07 | Graph Neural Networks in Vision-Language Image Understanding: A Survey | Henry Senior et.al. | 2303.03761 | null |
2023-03-07 | Sketch-based Medical Image Retrieval | Kazuma Kobayashi et.al. | 2303.03633 | link |
2023-03-06 | Visual Place Recognition: A Tutorial | Stefan Schubert et.al. | 2303.03281 | link |
2023-03-06 | MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval | Rohit Agarwal et.al. | 2303.03050 | link |
2023-03-06 | Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints | Chenjie Cao et.al. | 2303.02885 | link |
2023-03-05 | Composing Mood Board with User Feedback in Concept Space | Shin Sano et.al. | 2303.02547 | null |
2023-03-04 | FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Xiao Han et.al. | 2303.02483 | link |
2023-03-09 | Self-Supervised Learning for Place Representation Generalization across Appearance Changes | Mohamed Adel Musallam et.al. | 2303.02370 | null |
2023-03-03 | MixVPR: Feature Mixing for Visual Place Recognition | Amar Ali-bey et.al. | 2303.02190 | link |
2023-03-01 | A Complementarity-Based Switch-Fuse System for Improved Visual Place Recognition | Maria Waheed et.al. | 2303.00714 | null |
2023-03-01 | ORCHNet: A Robust Global Feature Aggregation approach for 3D LiDAR-based Place recognition in Orchards | T. Barros et.al. | 2303.00477 | link |
2023-03-03 | Renderable Neural Radiance Map for Visual Navigation | Obin Kwon et.al. | 2303.00304 | null |
2023-03-01 | Region Prediction for Efficient Robot Localization on Large Maps | Matteo Scucchia et.al. | 2303.00295 | link |
2023-02-28 | OEKG: The Open Event Knowledge Graph | Simon Gottschalk et.al. | 2302.14688 | null |
2023-02-28 | Global Proxy-based Hard Mining for Visual Place Recognition | Amar Ali-bey et.al. | 2302.14217 | link |
2023-02-27 | Efficient Informed Proposals for Discrete Distributions via Newton’s Series Approximation | Yue Xiang et.al. | 2302.13929 | link |
2023-02-26 | Data-Efficient Sequence-Based Visual Place Recognition with Highly Compressed JPEG Images | Mihnea-Alexandru Tomita et.al. | 2302.13314 | null |
2023-02-26 | Learning cross space mapping via DNN using large scale click-through logs | Wei Yu et.al. | 2302.13275 | null |
2023-02-25 | DeepBrainPrint: A Novel Contrastive Framework for Brain MRI Re-Identification | Lemuel Puglisi et.al. | 2302.13057 | null |
2023-02-23 | Teaching CLIP to Count to Ten | Roni Paiss et.al. | 2302.12066 | null |
2023-02-22 | Steerable Equivariant Representation Learning | Sangnie Bhardwaj et.al. | 2302.11349 | null |
2023-02-21 | iQPP: A Benchmark for Image Query Performance Prediction | Eduard Poesina et.al. | 2302.10126 | link |
2023-02-20 | Ontology-aware Network for Zero-shot Sketch-based Image Retrieval | Haoxiang Zhang et.al. | 2302.10040 | null |
2023-02-20 | TBPos: Dataset for Large-Scale Precision Visual Localization | Masud Fahim et.al. | 2302.09825 | link |
2023-02-17 | Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts | Zhihong Chen et.al. | 2302.08958 | link |
2023-02-22 | Fashion Image Retrieval with Multi-Granular Alignment | Jinkuan Zhu et.al. | 2302.08902 | null |
2023-02-15 | Unsupervised Hashing via Similarity Distribution Calibration | Kam Woh Ng et.al. | 2302.07669 | link |
2023-02-13 | Render-and-Compare: Cross-View 6 DoF Localization from Noisy Prior | Shen Yan et.al. | 2302.06287 | link |
2023-02-13 | Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation | Binqian Jiang et.al. | 2302.06149 | link |
2023-02-13 | Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval | Xu Wang et.al. | 2302.06081 | link |
2023-02-11 | Sketch Less Face Image Retrieval: A New Challenge | Dawei Dai et.al. | 2302.05576 | link |
2023-02-10 | Is multi-modal vision supervision beneficial to language? | Avinash Madasu et.al. | 2302.05016 | link |
2023-02-06 | Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval | Kuniaki Saito et.al. | 2302.03084 | link |
2023-02-06 | Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs | Michael Kirchhof et.al. | 2302.02865 | link |
2023-02-03 | Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization | Yingying Zhu et.al. | 2302.01572 | link |
2023-02-04 | Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval | Frederik Warburg et.al. | 2302.01332 | link |
2023-01-31 | Grounding Language Models to Images for Multimodal Generation | Jing Yu Koh et.al. | 2301.13823 | link |
2023-01-31 | UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers | Dachuan Shi et.al. | 2301.13741 | link |
2023-01-23 | Lexi: Self-Supervised Learning of the UI Language | Pratyay Banerjee et.al. | 2301.10165 | link |
2023-01-17 | Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval | Yuchen Wu et.al. | 2301.06685 | null |
2023-01-19 | High-bandwidth Close-Range Information Transport through Light Pipes | Joowon Lim et.al. | 2301.06496 | null |
2023-01-13 | A LiDAR-Inertial-Visual SLAM System with Loop Detection | Kangcheng Liu et.al. | 2301.05604 | null |
2023-01-12 | GH-Feat: Learning Versatile Generative Hierarchical Features from GANs | Yinghao Xu et.al. | 2301.05315 | null |
2023-01-10 | Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images | Xindi Wu et.al. | 2301.04224 | null |
2023-01-10 | Collaborative Semantic Communication at the Edge | Wing Fei Lo et.al. | 2301.03996 | null |
2023-01-10 | Online Backfilling with No Regret for Large-Scale Image Retrieval | Seonguk Seo et.al. | 2301.03767 | null |
2023-01-06 | CyberLoc: Towards Accurate Long-term Visual Localization | Liu Liu et.al. | 2301.02403 | null |
2023-01-05 | A Probabilistic Framework for Visual Localization in Ambiguous Scenes | Fereidoon Zangeneh et.al. | 2301.02086 | link |
2022-12-31 | 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions | Patrick Wenzel et.al. | 2301.01147 | null |
2022-12-30 | HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images | Dmitry Yudin et.al. | 2212.14649 | link |
2022-12-27 | Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning | Wooyoung Kang et.al. | 2212.13563 | link |
2022-12-23 | SuperGF: Unifying Local and Global Features for Visual Localization | Wenzheng Song et.al. | 2212.13105 | null |
2022-12-24 | GraffMatch: Global Matching of 3D Lines and Planes for Wide Baseline LiDAR Registration | Parker C. Lusk et.al. | 2212.12745 | null |
2022-12-19 | From a Bird’s Eye View to See: Joint Camera and Subject Registration without the Camera Calibration | Zekun Qian et.al. | 2212.09298 | link |
2022-12-14 | The Infinite Index: Information Retrieval on Generative Text-To-Image Models | Niklas Deckers et.al. | 2212.07476 | null |
2022-12-14 | Shared Coupling-bridge for Weakly Supervised Local Feature Learning | Jiayuan Sun et.al. | 2212.07047 | link |
2022-12-08 | Group Generalized Mean Pooling for Vision Transformer | Byungsoo Ko et.al. | 2212.04114 | null |
2022-12-12 | Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models | Gowthami Somepalli et.al. | 2212.03860 | null |
2022-12-07 | LSVL: Large-scale season-invariant visual localization for UAVs | Jouko Kinnari et.al. | 2212.03581 | null |
2022-12-06 | ADIR: Adaptive Diffusion for Image Reconstruction | Shady Abu-Hussein et.al. | 2212.03221 | null |
2022-12-08 | Privacy-Preserving Visual Localization with Event Cameras | Junho Kim et.al. | 2212.03177 | link |
2022-12-06 | Semantic Communication for Internet of Vehicles: A Multi-User Cooperative Approach | Wenjun Xu et.al. | 2212.03037 | null |
2022-12-06 | Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds | Zhipeng Zhao et.al. | 2212.02757 | null |
2022-12-04 | Fast and Lightweight Scene Regressor for Camera Relocalization | Thuan B. Bui et.al. | 2212.01830 | link |
2022-12-02 | Information Retrieval from the Digitized Books | Riya Gupta et.al. | 2212.00999 | null |
2022-12-09 | StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition | Yanqing Shen et.al. | 2212.00937 | null |
2022-11-30 | Self-Supervised Feature Learning for Long-Term Metric Visual Localization | Yuxuan Chen et.al. | 2212.00122 | null |
2022-11-30 | SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation | Tianyu Zhang et.al. | 2211.16697 | link |
2022-11-28 | SLAN: Self-Locator Aided Network for Cross-Modal Understanding | Jiang-Tian Zhai et.al. | 2211.16208 | null |
2022-11-29 | RankDNN: Learning to Rank for Few-shot Learning | Qianyu Guo et.al. | 2211.15320 | link |
2022-11-28 | Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map | Xi Zheng et.al. | 2211.15127 | null |
2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | link |
2022-11-27 | BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images | Zhihuang Zhang et.al. | 2211.14927 | null |
2022-11-27 | A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition | Rui Huang et.al. | 2211.14864 | null |
2022-11-26 | Visual Place Recognition | Bailu Guo et.al. | 2211.14533 | null |
2022-11-26 | Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval | Fan Yang et.al. | 2211.14515 | link |
2022-11-30 | Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Floriana Ciaglia et.al. | 2211.13523 | link |
2022-11-23 | InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images | Konstantin Kobs et.al. | 2211.12760 | link |
2022-11-29 | Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments | Joshua Knights et.al. | 2211.12732 | link |
2022-11-23 | FE-Fusion-VPR: Attention-based Multi-Scale Network Architecture for Visual Place Recognition by Fusing Frames and Events | Kuanxu Hou et.al. | 2211.12244 | null |
2022-11-22 | Multimorbidity Content-Based Medical Image Retrieval Using Proxies | Yunyan Xing et.al. | 2211.12185 | null |
2022-11-22 | Vision-based localization methods under GPS-denied conditions | Zihao Lu et.al. | 2211.11988 | null |
2022-11-21 | ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari et.al. | 2211.11704 | null |
2022-11-21 | LISA: Localized Image Stylization with Audio via Implicit Neural Representation | Seung Hyun Lee et.al. | 2211.11381 | null |
2022-11-21 | NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization | Shitao Tang et.al. | 2211.11177 | link |
2022-11-16 | Improving Feature-based Visual Localization by Geometry-Aided Matching | Hailin Yu et.al. | 2211.08712 | link |
2022-11-15 | LiePoseNet: Heterogeneous Loss Function Based on Lie Group for Significant Speed-up of PoseNet Training Process | Mikhail Kurenkov et.al. | 2211.08480 | null |
2022-11-14 | Degeneracy removal of spin bands in antiferromagnets with non-interconvertible spin motif pair | Lin-Ding Yuan et.al. | 2211.07803 | null |
2022-11-14 | Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition | Farid Alijani et.al. | 2211.07696 | null |
2022-11-14 | Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization | Yiyang Chen et.al. | 2211.07394 | link |
2022-11-14 | Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment | Junyang Wang et.al. | 2211.07275 | null |
2022-11-14 | ContextCLIP: Contextual Alignment of Image-Text pairs on CLIP visual representations | Chanda Grover et.al. | 2211.07122 | null |
2022-11-14 | Few-shot Metric Learning: Online Adaptation of Embedding for Retrieval | Deunsol Jung et.al. | 2211.07116 | null |
2022-11-12 | Partial Visual-Semantic Embedding: Fashion Intelligence System with Sensitive Part-by-Part Learning | Ryotaro Shimizu et.al. | 2211.06688 | null |
2022-11-09 | Visual Named Entity Linking: A New Dataset and A Baseline | Wenxiang Sun et.al. | 2211.04872 | link |
2022-11-07 | Ultrafast Image Retrieval from a Holographic Memory Disc for High-Speed Operation of a Shift, Scale, and Rotation Invariant Target Recognition System | Julian Gamboa et.al. | 2211.03881 | null |
2022-11-06 | A Geometrically Constrained Point Matching based on View-invariant Cross-ratios, and Homography | Yueh-Cheng Huang et.al. | 2211.03007 | null |
2022-11-02 | Optimizing Fiducial Marker Placement for Improved Visual Localization | Qiangqiang Huang et.al. | 2211.01513 | link |
2022-11-02 | A comparison of uncertainty estimation approaches for DNN-based camera localization | Matteo Vaghi et.al. | 2211.01234 | null |
2022-11-02 | M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval | Layne Berry et.al. | 2211.01180 | null |
2022-11-11 | Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality | Anuj Diwan et.al. | 2211.00768 | link |
2022-11-07 | Fashion-Specific Attributes Interpretation via Dual Gaussian Visual-Semantic Embedding | Ryotaro Shimizu et.al. | 2210.17417 | null |
2022-10-27 | Structuring User-Generated Content on Social Media with Multimodal Aspect-Based Sentiment Analysis | Miriam Anschütz et.al. | 2210.15377 | link |
2022-10-27 | Leveraging Computer Vision Application in Visual Arts: A Case Study on the Use of Residual Neural Network to Classify and Analyze Baroque Paintings | Daniel Kvak et.al. | 2210.15300 | null |
2022-10-27 | Towards Practicality of Sketch-Based Visual Understanding | Ayan Kumar Bhunia et.al. | 2210.15146 | null |
2022-10-27 | MMFL-Net: Multi-scale and Multi-granularity Feature Learning for Cross-domain Fashion Retrieval | Chen Bao et.al. | 2210.15128 | null |
2022-10-26 | FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning | Suvir Mirchandani et.al. | 2210.15028 | null |
2022-10-26 | FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization | Junyang Wang et.al. | 2210.14562 | null |
2022-11-02 | A Framework for Collaborative Multi-Robot Mapping using Spectral Graph Wavelets | Lukas Bernreiter et.al. | 2210.13856 | null |
2022-10-27 | Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision | Tzu-Jui Julius Wang et.al. | 2210.13591 | null |
2022-10-24 | Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval | Zhaopeng Dou et.al. | 2210.13440 | link |
2022-10-23 | Neural Eigenfunctions Are Structured Representation Learners | Zhijie Deng et.al. | 2210.12637 | link |
2022-10-21 | Boosting vision transformers for image retrieval | Chull Hwan Song et.al. | 2210.11909 | link |
2022-10-20 | Communication breakdown: On the low mutual intelligibility between human and neural captioning | Roberto Dessì et.al. | 2210.11512 | link |
2022-10-19 | Image Semantic Relation Generation | Mingzhe Du et.al. | 2210.11253 | null |
2022-10-20 | General Image Descriptors for Open World Image Retrieval using ViT CLIP | Marcos V. Conde et.al. | 2210.11141 | link |
2022-10-20 | DeepRING: Learning Roto-translation Invariant Representation for LiDAR based Place Recognition | Sha Lu et.al. | 2210.11029 | null |
2022-10-19 | Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval | Abhra Chaudhuri et.al. | 2210.10486 | link |
2022-10-19 | GSV-Cities: Toward Appropriate Supervised Visual Place Recognition | Amar Ali-bey et.al. | 2210.10239 | link |
2022-10-18 | A Real-Time Fusion Framework for Long-term Visual Localization | Yuchen Yang et.al. | 2210.09757 | null |
2022-10-17 | Bridging the Gap between Local Semantic Concepts and Bag of Visual Words for Natural Scene Image Retrieval | Yousef Alqasrawi et.al. | 2210.08875 | null |
2022-10-17 | SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation | Woo Suk Choi et.al. | 2210.08675 | null |
2022-10-16 | Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers | Tao Tang et.al. | 2210.08458 | link |
2022-10-14 | Cross-Scale Context Extracted Hashing for Fine-Grained Image Binary Encoding | Xuetong Xue et.al. | 2210.07572 | link |
2022-10-14 | Boosting Performance of a Baseline Visual Place Recognition Technique by Predicting the Maximally Complementary Technique | Connor Malone et.al. | 2210.07509 | null |
2022-10-11 | Large-to-small Image Resolution Asymmetry in Deep Metric Learning | Pavel Suma et.al. | 2210.05463 | link |
2022-10-09 | Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning | Ali Safa et.al. | 2210.04236 | null |
2022-10-05 | Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features | Deepak Gupta et.al. | 2210.02401 | link |
2022-10-05 | Granularity-aware Adaptation for Image Retrieval over Multiple Tasks | Jon Almazán et.al. | 2210.02254 | null |
2022-10-05 | Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective | Zijian Zhang et.al. | 2210.02206 | link |
2022-10-04 | Supervised Metric Learning for Retrieval via Contextual Similarity Optimization | Christopher Liao et.al. | 2210.01908 | link |
2022-10-04 | Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing | Weiying Wang et.al. | 2210.01320 | null |
2022-10-03 | Merging Classification Predictions with Sequential Information for Lightweight Visual Place Recognition in Changing Environments | Bruno Arcanjo et.al. | 2210.00834 | null |
2022-10-02 | Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval | Kei Nishimaki et.al. | 2210.00506 | null |
2022-09-29 | Guided Unsupervised Learning by Subaperture Decomposition for Ocean SAR Image Retrieval | Nicolae-Cătălin Ristea et.al. | 2209.15034 | null |
2022-09-28 | TVLT: Textless Vision-Language Transformer | Zineng Tang et.al. | 2209.14156 | link |
2022-09-28 | SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval | Yang Shen et.al. | 2209.13833 | link |
2022-09-28 | Learning Deep Representations via Contrastive Learning for Instance Retrieval | Tao Wu et.al. | 2209.13832 | null |
2022-09-28 | Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text | Cheng-An Hsieh et.al. | 2209.13764 | link |
2022-09-27 | Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors | Hao Dong et.al. | 2209.13586 | link |
2022-09-27 | Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability | Peisong Wen et.al. | 2209.13262 | link |
2022-09-26 | NDD: A 3D Point Cloud Descriptor Based on Normal Distribution for Loop Closure Detection | Ruihao Zhou et.al. | 2209.12513 | link |
2022-09-25 | Personalized Saliency in Task-Oriented Semantic Communications: Image Transmission and Performance Analysis | Jiawen Kang et.al. | 2209.12274 | link |
2022-09-24 | Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes | Jonathan J. Y. Kim et.al. | 2209.11894 | null |
2022-09-23 | Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs | Youya Xia et.al. | 2209.11673 | null |
2022-09-23 | Query-based Hard-Image Retrieval for Object Detection at Test Time | Edward Ayers et.al. | 2209.11559 | link |
2022-09-23 | Unsupervised Hashing with Semantic Concept Mining | Rong-Cheng Tu et.al. | 2209.11475 | link |
2022-09-22 | UNav: An Infrastructure-Independent Vision-Based Navigation System for People with Blindness and Low vision | Anbang Yang et.al. | 2209.11336 | null |
2022-09-21 | Visual Localization and Mapping in Dynamic and Changing Environments | João Carlos Virgolino Soares et.al. | 2209.10710 | null |
2022-09-20 | PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention | José Arce et.al. | 2209.09699 | link |
2022-09-19 | Deep Metric Learning with Chance Constraints | Yeti Z. Gurbuz et.al. | 2209.09060 | link |
2022-09-18 | HGI-SLAM: Loop Closure With Human and Geometric Importance Features | Shuhul Mujoo et.al. | 2209.08608 | null |
2022-09-18 | Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM | Jiarui Tan et.al. | 2209.08578 | link |
2022-09-17 | Data Efficient Visual Place Recognition Using Extremely JPEG-Compressed Images | Mihnea-Alexandru Tomita et.al. | 2209.08343 | null |
2022-09-15 | Efficient Planar Pose Estimation via UWB Measurements | Haodong Jiang et.al. | 2209.06779 | link |
2022-09-14 | Transformers and CNNs both Beat Humans on SBIR | Omar Seddati et.al. | 2209.06629 | null |
2022-09-14 | Tac2Structure: Object Surface Reconstruction Only through Multi Times Touch | J. Lu et.al. | 2209.06545 | link |
2022-09-14 | iSimLoc: Visual Global Localization for Previously Unseen Environments with Simulated Images | Peng Yin et.al. | 2209.06376 | null |
2022-09-09 | General Place Recognition Survey: Towards the Real-world Autonomy Age | Peng Yin et.al. | 2209.04497 | link |
2022-09-09 | Retinal Image Restoration and Vessel Segmentation using Modified Cycle-CBAM and CBAM-UNet | Alnur Alimanov et.al. | 2209.04234 | link |
2022-09-13 | Segment Augmentation and Differentiable Ranking for Logo Retrieval | Feyza Yavuz et.al. | 2209.02482 | null |
2022-09-12 | ScaleFace: Uncertainty-aware Deep Metric Learning | Roman Kail et.al. | 2209.01880 | link |
2022-09-04 | CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud | Evgeny Yudin et.al. | 2209.01605 | null |
2022-08-31 | EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing | Qihua Feng et.al. | 2208.14657 | link |
2022-08-25 | A Deep Perceptual Measure for Lens and Camera Calibration | Yannick Hold-Geoffroy et.al. | 2208.12300 | null |
2022-08-25 | A Privacy-Preserving and End-to-End-Based Encrypted Image Retrieval Scheme | Zhixun Lu et.al. | 2208.11876 | null |
2022-08-23 | Satellite Image Search in AgoraEO | Ahmet Kerem Aksoy et.al. | 2208.10830 | null |
2022-08-20 | Fuse and Attend: Generalized Embedding Learning for Art and Sketches | Ujjal Kr Dutta et.al. | 2208.09698 | null |
2022-08-19 | Self-Supervised Visual Place Recognition by Mining Temporal and Feature Neighborhoods | Chao Chen et.al. | 2208.09315 | link |
2022-08-19 | TTT-UCDR: Test-time Training for Universal Cross-Domain Retrieval | Soumava Paul et.al. | 2208.09198 | link |
2022-08-17 | Visual Cross-View Metric Localization with Dense Uncertainty Estimates | Zimin Xia et.al. | 2208.08519 | link |
2022-08-17 | Understanding Attention for Vision-and-Language Tasks | Feiqi Cao et.al. | 2208.08104 | link |
2022-08-14 | Visual Localization via Few-Shot Scene Region Classification | Siyan Dong et.al. | 2208.06933 | link |
2022-08-14 | HyP $^2$ Loss: Beyond Hypersphere Metric Space for Multi-label Image Retrieval | Chengyin Xu et.al. | 2208.06866 | link |
2022-08-13 | Finding Point with Image: An End-to-End Benchmark for Vision-based UAV Localization | Ming Dai et.al. | 2208.06561 | link |
2022-08-16 | Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion Augmentation | Georgios Kouros et.al. | 2208.06195 | link |
2022-08-12 | Instance Image Retrieval by Learning Purely From Within the Dataset | Zhongyan Zhang et.al. | 2208.06119 | null |
2022-08-07 | CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization | Yujiao Shi et.al. | 2208.03660 | null |
2022-08-05 | A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch | Patsorn Sangkloy et.al. | 2208.03354 | null |
2022-08-05 | ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding | Bingning Wang et.al. | 2208.03030 | link |
2022-08-04 | Pattern Spotting and Image Retrieval in Historical Documents using Deep Hashing | Caio da S. Dias et.al. | 2208.02397 | null |
2022-07-27 | On the robustness of self-supervised representations for multi-view object classification | David Torpey et.al. | 2208.00787 | null |
2022-07-26 | Multimodal Neural Machine Translation with Search Engine Based Image Retrieval | ZhenHao Tang et.al. | 2208.00767 | null |
2022-07-30 | Towards Privacy-Preserving, Real-Time and Lossless Feature Matching | Qiang Meng et.al. | 2208.00214 | link |
2022-07-30 | DAS: Densely-Anchored Sampling for Deep Metric Learning | Lizhao Liu et.al. | 2208.00119 | link |
2022-07-29 | Curriculum Learning for Data-Efficient Vision-Language Alignment | Tejas Srinivasan et.al. | 2207.14525 | null |
2022-07-29 | Neural Density-Distance Fields | Itsuki Ueda et.al. | 2207.14455 | link |
2022-07-27 | Abstracting Sketches through Simple Primitives | Stephan Alaniz et.al. | 2207.13543 | link |
2022-07-27 | Satellite Image Based Cross-view Localization for Autonomous Vehicle | Shan Wang et.al. | 2207.13506 | null |
2022-07-26 | RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments | Jiahui Zhang et.al. | 2207.12579 | null |
2022-07-25 | A hybrid-qudit representation of digital RGB images | Sreetama Das et.al. | 2207.12550 | null |
2022-07-19 | ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization | Ivan Cisneros et.al. | 2207.12317 | link |
2022-07-22 | PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes | BaoSheng Zhang et.al. | 2207.10916 | null |
2022-07-25 | MeshLoc: Mesh-Based Visual Localization | Vojtech Panek et.al. | 2207.10762 | link |
2022-07-20 | Revisiting Hotels-50K and Hotel-ID | Aarash Feizi et.al. | 2207.10200 | link |
2022-07-20 | Feature Representation Learning for Unsupervised Cross-domain Image Retrieval | Conghui Hu et.al. | 2207.09721 | link |
2022-07-19 | SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany | Dominik Koßmann et.al. | 2207.09507 | null |
2022-07-19 | Context Unaware Knowledge Distillation for Image Retrieval | Bytasandram Yaswanth Reddy et.al. | 2207.09070 | link |
2022-07-17 | FashionViL: Fashion-Focused Vision-and-Language Representation Learning | Xiao Han et.al. | 2207.08150 | link |
2022-07-14 | AutoMerge: A Framework for Map Assembling and Smoothing in City-scale Environments | Peng Yin et.al. | 2207.06965 | null |
2022-07-14 | Semi-supervised Vector-Quantization in Visual SLAM using HGCN | Amir Zarringhalam et.al. | 2207.06738 | null |
2022-07-14 | Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders | Amir Zarringhalam et.al. | 2207.06732 | null |
2022-07-19 | Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras | Fangwen Shu et.al. | 2207.06058 | link |
2022-07-12 | CPO: Change Robust Panorama to Point Cloud Localization | Junho Kim et.al. | 2207.05317 | link |
2022-07-05 | Hierarchical Average Precision Training for Pertinent Image Retrieval | Elias Ramzi et.al. | 2207.04873 | link |
2022-07-11 | A clinically motivated self-supervised approach for content-based image retrieval of CT liver images | Kristoffer Knutsen Wickstrøm et.al. | 2207.04812 | link |
2022-07-09 | BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval | Wenqiao Zhang et.al. | 2207.04211 | null |
2022-07-08 | Learning Sequential Descriptors for Sequence-based Visual Place Recognition | Riccardo Mereu et.al. | 2207.03868 | link |
2022-07-08 | GEMS: Scene Expansion using Generative Models of Graphs | Rishi Agarwal et.al. | 2207.03729 | null |
2022-07-05 | Object-Level Targeted Selection via Deep Template Matching | Suraj Kothawade et.al. | 2207.01778 | null |
2022-07-06 | Adaptive Fine-Grained Sketch-Based Image Retrieval | Ayan Kumar Bhunia et.al. | 2207.01723 | link |
2022-07-04 | Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets | Paul Albert et.al. | 2207.01573 | link |
2022-07-08 | Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval | Keyu Wen et.al. | 2207.00733 | null |
2022-07-01 | DALG: Deep Attentive Local and Global Modeling for Image Retrieval | Yuxin Song et.al. | 2207.00287 | null |
2022-07-04 | BadHash: Invisible Backdoor Attacks against Deep Hashing with Clean Label | Shengshan Hu et.al. | 2207.00278 | link |
2022-06-28 | Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems | Stephen Hausler et.al. | 2206.13883 | null |
2022-07-08 | How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels | Tobias Fischer et.al. | 2206.13673 | link |
2022-06-25 | FreSCo: Frequency-Domain Scan Context for LiDAR-based Place Recognition with Translation and Rotation Invariance | Yongzhi Fan et.al. | 2206.12628 | link |
2022-06-25 | Inverted Semantic-Index for Image Retrieval | Ying Wang et.al. | 2206.12623 | null |
2022-06-17 | RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval | Yihan Wu et.al. | 2206.11225 | null |
2022-06-22 | ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas | Prathmesh Madhu et.al. | 2206.11115 | null |
2022-06-20 | Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval | Guile Wu et.al. | 2206.09806 | null |
2022-06-18 | Attention-based Dynamic Subspace Learners for Medical Image Analysis | Sukesh Adiga V et.al. | 2206.09068 | null |
2022-06-17 | Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments | Khairuldanial Ismail et.al. | 2206.08733 | null |
2022-06-06 | Learning Treatment Plan Representations for Content Based Image Retrieval | Charles Huang et.al. | 2206.02912 | null |
2022-06-19 | NORPPA: NOvel Ringed seal re-identification by Pelage Pattern Aggregation | Ekaterina Nepovinnykh et.al. | 2206.02498 | link |
2022-06-05 | Autoregressive Model for Multi-Pass SAR Change Detection Based on Image Stacks | B. G. Palm et.al. | 2206.02278 | null |
2022-05-28 | FaIRCoP: Facial Image Retrieval using Contrastive Personalization | Devansh Gupta et.al. | 2205.15870 | null |
2022-05-31 | Investigating the Role of Image Retrieval for Visual Localization – An exhaustive benchmark | Martin Humenberger et.al. | 2205.15761 | link |
2022-05-27 | Improving Road Segmentation in Challenging Domains Using Similar Place Priors | Connor Malone et.al. | 2205.14112 | null |
2022-05-31 | LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments | Yun Chang et.al. | 2205.13135 | link |
2022-05-26 | Fine-grained Image Captioning with CLIP Reward | Jaemin Cho et.al. | 2205.13115 | link |
2022-05-25 | Deep Dense Local Feature Matching and Vehicle Removal for Indoor Visual Localization | Kyung Ho Park et.al. | 2205.12544 | null |
2022-05-24 | OnePose: One-Shot Object Pose Estimation without CAD Models | Jiaming Sun et.al. | 2205.12257 | link |
2022-05-23 | VPAIR – Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments | Michael Schleiss et.al. | 2205.11567 | link |
2022-05-23 | VQA-GNN: Reasoning with Multimodal Semantic Graph for Visual Question Answering | Yanan Wang et.al. | 2205.11501 | null |
2022-05-23 | Deep Image Retrieval is not Robust to Label Noise | Stanislav Dereka et.al. | 2205.11195 | null |
2022-05-22 | Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval | Zelong Zeng et.al. | 2205.10878 | link |
2022-05-20 | Visually-Augmented Language Modeling | Weizhi Wang et.al. | 2205.10178 | link |
2022-05-18 | Deep Features for CBIR with Scarce Data using Hebbian Learning | Gabriele Lagani et.al. | 2205.08935 | null |
2022-05-19 | Text Detection & Recognition in the Wild for Robot Localization | Zobeir Raisi et.al. | 2205.08565 | null |
2022-05-12 | One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code | Yong Dai et.al. | 2205.06126 | null |
2022-05-11 | Review on Panoramic Imaging and Its Applications in Scene Understanding | Shaohua Gao et.al. | 2205.05570 | null |
2022-05-18 | Identical Image Retrieval using Deep Learning | Sayan Nath et.al. | 2205.04883 | link |
2022-05-09 | Introspective Deep Metric Learning | Chengkun Wang et.al. | 2205.04449 | link |
2022-05-11 | Improved Evaluation and Generation of Grid Layouts using Distance Preservation Quality and Linear Assignment Sorting | Kai Uwe Barthel et.al. | 2205.04255 | link |
2022-05-08 | Adversarial Learning of Hard Positives for Place Recognition | Wenxuan Fang et.al. | 2205.03871 | null |
2022-05-10 | AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching | Khanh Nguyen et.al. | 2205.02849 | link |
2022-04-29 | Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval | Shupeng Su et.al. | 2204.13919 | null |
2022-04-29 | Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval | Siyu Ren et.al. | 2204.13913 | link |
2022-04-28 | Spatio-Temporal Graph Localization Networks for Image-based Navigation | Takahiro Niwa et.al. | 2204.13237 | null |
2022-04-27 | The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection | Konstantinos A. Tsintotas et.al. | 2204.12831 | null |
2022-04-25 | SceneTrilogy: On Scene Sketches and its Relationship with Text and Photo | Pinaki Nath Chowdhury et.al. | 2204.11964 | null |
2022-04-23 | On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning | Muhammad Umer Anwaar et.al. | 2204.11848 | null |
2022-04-24 | Progressive Learning for Image Retrieval with Hybrid-Modality Queries | Yida Zhao et.al. | 2204.11212 | null |
2022-04-23 | Training and challenging models for text-guided fashion image retrieval | Eric Dodds et.al. | 2204.11004 | link |
2022-04-18 | Centralized Adversarial Learning for Robust Deep Hashing | Xunguang Wang et.al. | 2204.10779 | link |
2022-04-22 | Transferring ConvNet Features from Passive to Active Robot Self-Localization: The Use of Ego-Centric and World-Centric Views | Kanya Kurauchi et.al. | 2204.10497 | null |
2022-04-21 | Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval | Zhiqiang Yuan et.al. | 2204.09868 | link |
2022-04-21 | Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information | Zhiqiang Yuan et.al. | 2204.09860 | link |
2022-04-20 | Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations | Leila Pishdad et.al. | 2204.09268 | null |
2022-04-19 | Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing | Georgii Mikriukov et.al. | 2204.08707 | null |
2022-04-18 | Multiple-environment Self-adaptive Network for Aerial-view Geo-localization | Tingyu Wang et.al. | 2204.08381 | link |
2022-04-15 | Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder | Hanjing Ye et.al. | 2204.07350 | link |
2022-04-14 | Composite Code Sparse Autoencoders for first stage retrieval | Carlos Lassance et.al. | 2204.07023 | null |
2022-04-13 | Reuse your features: unifying retrieval and feature-metric alignment | Javier Morlana et.al. | 2204.06292 | link |
2022-04-12 | Probabilistic Compositional Embeddings for Multimodal Image Retrieval | Andrei Neculai et.al. | 2204.05845 | link |
2022-04-12 | Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval | Yu-Wei Zhan et.al. | 2204.05666 | null |
2022-04-12 | HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud | Zhixing Hou et.al. | 2204.05481 | null |
2022-04-11 | Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context | Lizhou Liao et.al. | 2204.04932 | link |
2022-04-10 | Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image | Yujiao Shi et.al. | 2204.04752 | link |
2022-04-08 | A Generic Image Retrieval Method for Date Estimation of Historical Document Collections | Adrià Molina et.al. | 2204.04028 | null |
2022-04-08 | SnapMode: An Intelligent and Distributed Large-Scale Fashion Image Retrieval Platform Based On Big Data and Deep Generative Adversarial Network Technologies | Narges Norouzi et.al. | 2204.03998 | null |
2022-04-05 | Leveraging Equivariant Features for Absolute Pose Regression | Mohamed Adel Musallam et.al. | 2204.02163 | null |
2022-04-04 | “This is my unicorn, Fluffy”: Personalizing frozen vision-language representations | Niv Cohen et.al. | 2204.01694 | link |
2022-04-01 | Bi-directional Loop Closure for Visual SLAM | Ihtisham Ali et.al. | 2204.01524 | null |
2022-04-01 | LASER: LAtent SpacE Rendering for 2D Visual Localization | Zhixiang Min et.al. | 2204.00157 | link |
2022-03-31 | Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning | Semih Orhan et.al. | 2203.16945 | null |
2022-03-30 | AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift | Burak Yildiz et.al. | 2203.16291 | link |
2022-03-29 | Long-term Visual Map Sparsification with Heterogeneous GNN | Ming-Fang Chang et.al. | 2203.15182 | null |
2022-04-01 | A Simulation Benchmark for Vision-based Autonomous Navigation | Lauri Suomela et.al. | 2203.13048 | link |
2022-03-24 | Is Geometry Enough for Matching in Visual Localization? | Qunjie Zhou et.al. | 2203.12979 | link |
2022-03-21 | MatchFormer: Interleaving Attention in Transformers for Feature Matching | Qing Wang et.al. | 2203.09645 | link |
2022-03-10 | ReF – Rotation Equivariant Features for Local Feature Matching | Abhishek Peri et.al. | 2203.05206 | null |
2022-03-09 | Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction | Matthieu Zins et.al. | 2203.04613 | null |
2022-03-08 | Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM | Pierre-Yves Lajoie et.al. | 2203.04446 | link |
2022-03-07 | ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization | Simon Maurer et.al. | 2203.03610 | link |
2022-03-07 | Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms | Qingqing Li et.al. | 2203.03454 | link |
2022-03-01 | SwitchHit: A Probabilistic, Complementarity-Based Switching System for Improved Visual Place Recognition in Changing Environments | Maria Waheed et.al. | 2203.00591 | null |
2022-02-28 | Deep Camera Pose Regression Using Pseudo-LiDAR | Ali Raza et.al. | 2203.00080 | null |
2022-02-25 | RELMOBNET: A Robust Two-Stage End-To-End Training Approach For MOBILENETV3 Based Relative Camera Pose Estimation | Praveen Kumar Rajendran et.al. | 2202.12838 | null |
2022-02-24 | Highly-Efficient Binary Neural Networks for Visual Place Recognition | Bruno Ferrarini et.al. | 2202.12375 | null |
2022-02-18 | MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery | Ahmad Khaliq et.al. | 2202.09146 | link |
2022-02-14 | Tightly Coupled Learning Strategy for Weakly Supervised Hierarchical Place Recognition | Y. Shen et.al. | 2202.06470 | null |
2022-02-11 | Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition | Yingfeng Cai et.al. | 2202.05738 | null |
2022-02-09 | Object-Guided Day-Night Visual Localization in Urban Scenes | Assia Benbihi et.al. | 2202.04445 | null |
2022-02-08 | A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition | Nie Jiwei et.al. | 2202.03677 | null |
2022-02-25 | CFP-SLAM: A Real-time Visual SLAM Based on Coarse-to-Fine Probability in Dynamic Environments | Xinggang Hu et.al. | 2202.01938 | null |
2022-02-03 | Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization | Andrea Vallone et.al. | 2202.01821 | null |
2022-02-02 | Training Semantic Descriptors for Image-Based Localization | Ibrahim Cinaroglu et.al. | 2202.01212 | null |
2022-01-31 | Hydra: A Real-time Spatial Perception Engine for 3D Scene Graph Construction and Optimization | Nathan Hughes et.al. | 2201.13360 | null |
2022-01-31 | Rigidity Preserving Image Transformations and Equivariance in Perspective | Lucas Brynte et.al. | 2201.13065 | null |
2022-01-25 | Learning Semantics for Visual Place Recognition through Multi-Scale Attention | Valerio Paolicelli et.al. | 2201.09701 | link |
2022-01-22 | Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems | Xi Zheng et.al. | 2201.09048 | link |
2022-01-15 | A Critical Analysis of Image-based Camera Pose Estimation Techniques | Meng Xu et.al. | 2201.05816 | null |
2022-01-14 | SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions | Ali Samadzadeh et.al. | 2201.05386 | link |
2021-12-23 | NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning | Tony Ng et.al. | 2112.12785 | null |
2021-12-16 | CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data | Qi Yan et.al. | 2112.09081 | link |
2021-12-05 | RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather | Jialu Wang et.al. | 2112.02469 | null |
2021-11-25 | MegLoc: A Robust and Accurate Visual Localization Pipeline | Shuxue Peng et.al. | 2111.13063 | null |
2021-10-08 | Semantic Image Alignment for Vehicle Localization | Markus Herb et.al. | 2110.04162 | null |
2021-10-05 | Season-invariant GNSS-denied visual localization for UAVs | Jouko Kinnari et.al. | 2110.01967 | link |
2021-09-30 | Forming a sparse representation for visual place recognition using a neurorobotic approach | Sylvain Colomer et.al. | 2109.14916 | null |
2021-09-22 | Audio-Visual Grounding Referring Expression for Robotic Manipulation | Yefei Wang et.al. | 2109.10571 | null |
2021-09-20 | Efficient shape mapping through dense touch and vision | Sudharshan Suresh et.al. | 2109.09884 | link |
2021-09-15 | S3LAM: Structured Scene SLAM | Mathieu Gonzalez et.al. | 2109.07339 | null |
2021-09-13 | Monocular Camera Localization for Automated Vehicles Using Image Retrieval | Eunhyek Joa et.al. | 2109.06296 | null |
2021-09-10 | Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization | Sungho Yoon et.al. | 2109.04753 | link |
2021-09-09 | CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization | Ara Jafarzadeh et.al. | 2109.04527 | null |
2021-09-09 | Keeping an Eye on Things: Deep Learned Features for Long-Term Visual Localization | Mona Gridseth et.al. | 2109.04041 | link |
Keypoint Detection
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-15 | KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model | Jie Yang et.al. | 2507.11102 | null |
2025-07-15 | GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft | Weizhao Ma et.al. | 2507.11077 | null |
2025-07-14 | FPC-Net: Revisiting SuperPoint with Descriptor-Free Keypoint Detection via Feature Pyramids and Consistency-Based Implicit Matching | Ionuţ Grigore et.al. | 2507.10770 | null |
2025-07-11 | Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection | Subhajit Maity et.al. | 2507.07994 | null |
2025-07-09 | Reading a Ruler in the Wild | Yimu Pan et.al. | 2507.07077 | null |
2025-07-09 | MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning | Yifan Yang et.al. | 2507.06662 | null |
2025-06-27 | MatChA: Cross-Algorithm Matching with Feature Augmentation | Paula Carbó Cubero et.al. | 2506.22336 | null |
2025-06-27 | SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images | Naftaly Wambugu et.al. | 2506.21945 | null |
2025-05-29 | TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning | Ron Shapira Weber et.al. | 2505.23475 | link |
2025-05-24 | Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU | Yicheng Lin et.al. | 2505.18652 | link |
2025-05-18 | SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving | Muleilan Pei et.al. | 2505.12246 | null |
2025-05-17 | Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation | Niaz Ahmad et.al. | 2505.12130 | null |
2025-05-16 | Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation | Massimiliano Cassia et.al. | 2505.11110 | null |
2025-06-19 | RDD: Robust Feature Detector and Descriptor using Deformable Transformer | Gonglin Chen et.al. | 2505.08013 | null |
2025-05-12 | Enabling Privacy-Aware AI-Based Ergonomic Analysis | Sander De Coninck et.al. | 2505.07306 | null |
2025-05-09 | My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing | Jingrui He et.al. | 2505.06436 | null |
2025-05-05 | Unsupervised training of keypoint-agnostic descriptors for flexible retinal image registration | David Rivas-Villar et.al. | 2505.02787 | null |
2025-05-05 | Unsupervised Deep Learning-based Keypoint Localization Estimating Descriptor Matching Performance | David Rivas-Villar et.al. | 2505.02779 | null |
2025-05-04 | Focus What Matters: Matchability-Based Reweighting for Local Feature Matching | Dongyue Li et.al. | 2505.02161 | null |
2025-05-04 | Enhancing Lidar Point Cloud Sampling via Colorization and Super-Resolution of Lidar Imagery | Sier Ha et.al. | 2505.02049 | null |
2025-04-29 | Emotion Recognition in Contemporary Dance Performances Using Laban Movement Analysis | Muhammad Turab et.al. | 2504.21154 | null |
2025-04-29 | Learning a General Model: Folding Clothing with Topological Dynamics | Yiming Liu et.al. | 2504.20720 | null |
2025-04-26 | VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation | Niaz Ahmad et.al. | 2504.19032 | null |
2025-04-24 | EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy | Haodi Yao et.al. | 2504.17280 | null |
2025-04-15 | UKDM: Underwater keypoint detection and matching using underwater image enhancement techniques | Pedro Diaz-Garcia et.al. | 2504.11063 | null |
2025-04-15 | Acquisition of high-quality images for camera calibration in robotics applications via speech prompts | Timm Linder et.al. | 2504.11031 | null |
2025-04-11 | Stereophotoclinometry Revisited | Travis Driver et.al. | 2504.08252 | null |
2025-03-31 | SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection | Yannick Burkhardt et.al. | 2504.00139 | null |
2025-03-29 | Deep Visual Servoing of an Aerial Robot Using Keypoint Feature Extraction | Shayan Sepahvand et.al. | 2503.23171 | null |
2025-03-25 | Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines | Junle Liu et.al. | 2503.19278 | null |
2025-03-05 | Periodontal Bone Loss Analysis via Keypoint Detection With Heuristic Post-Processing | Ryan Banks et.al. | 2503.13477 | null |
2025-03-16 | Histogram Transporter: Learning Rotation-Equivariant Orientation Histograms for High-Precision Robotic Kitting | Jiadong Zhou et.al. | 2503.12541 | null |
2025-04-12 | Keypoint Detection and Description for Raw Bayer Images | Jiakai Lin et.al. | 2503.08673 | null |
2025-03-10 | REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding | Yan Tai et.al. | 2503.07413 | link |
2025-03-11 | DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection | Johan Edstedt et.al. | 2503.07347 | link |
2025-03-07 | Automatic determination of quasicrystalline patterns from microscopy images | Tano Kim Kender et.al. | 2503.05472 | link |
2025-03-07 | Spatial regularisation for improved accuracy and interpretability in keypoint-based registration | Benjamin Billot et.al. | 2503.04499 | link |
2025-03-04 | A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection | Junyi Wang et.al. | 2503.02481 | null |
2025-03-01 | Autonomous Dissection in Robotic Cholecystectomy | Ki-Hwan Oh et.al. | 2503.00666 | null |
2025-02-28 | CNSv2: Probabilistic Correspondence Encoded Neural Image Servo | Anzhe Chen et.al. | 2503.00132 | null |
2025-02-27 | Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets | Jisoo Lee et.al. | 2502.19766 | null |
2025-02-23 | Rewards-based image analysis in microscopy | Kamyar Barakati et.al. | 2502.18522 | null |
2025-02-19 | 2.5D U-Net with Depth Reduction for 3D CryoET Object Identification | Yusuke Uchida et.al. | 2502.13484 | link |
2025-01-30 | Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images | Wei-Lun Chen et.al. | 2501.18453 | null |
2025-01-30 | Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models | Bhargav Ghanekar et.al. | 2501.18361 | null |
2025-01-30 | Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems | Liudi Yang et.al. | 2501.18110 | null |
2025-01-21 | Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction | Mengyuan Li et.al. | 2501.11844 | null |
2025-01-20 | MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching | Yepeng Liu et.al. | 2501.11299 | null |
2025-01-19 | Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation | Shibang Liu et.al. | 2501.11069 | null |
2025-01-13 | Empirical Comparison of Four Stereoscopic Depth Sensing Cameras for Robotics Applications | Lukas Rustler et.al. | 2501.07421 | null |
2025-01-13 | Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps | Saurabh Gupta et.al. | 2501.07399 | null |
2024-12-24 | GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network | Xianfeng Song et.al. | 2412.18221 | link |
2024-12-21 | A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection | Shahid Ansari et.al. | 2412.16755 | null |
2024-12-19 | Corn Ear Detection and Orientation Estimation Using Deep Learning | Nathan Sprague et.al. | 2412.14954 | null |
2024-12-12 | Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models | Faith Johnson et.al. | 2412.09739 | null |
2024-12-09 | An Efficient Scene Coordinate Encoding and Relocalization Method | Kuan Xu et.al. | 2412.06488 | link |
2024-12-09 | ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models | Bingchen Gong et.al. | 2412.06292 | null |
2024-12-07 | Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures | Muhammad Umar Farooq et.al. | 2412.05487 | null |
2024-12-04 | Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything | Yongkyu Lee et.al. | 2412.03472 | link |
2024-12-02 | MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection | Yonghao Dang et.al. | 2412.01422 | null |
2024-11-23 | OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs | Chen Xin et.al. | 2411.15653 | link |
2024-11-19 | IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose | Fei Ren et.al. | 2411.12676 | null |
2024-11-04 | Silver medal Solution for Image Matching Challenge 2024 | Yian Wang et.al. | 2411.01851 | null |
2024-11-04 | KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension | Jie Yang et.al. | 2411.01846 | null |
2024-10-31 | From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots | Vasileios Tzouras et.al. | 2410.23906 | null |
2024-10-04 | Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation | Aman Anand et.al. | 2410.14700 | null |
2024-11-27 | Sim2real Cattle Joint Estimation in 3D point clouds | Mohammad Okour et.al. | 2410.14419 | null |
2024-10-16 | PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network | Asish Bera et.al. | 2410.12742 | null |
2024-10-16 | RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition | Asish Bera et.al. | 2410.12718 | null |
2024-10-01 | A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference | Yuan Li et.al. | 2410.11848 | null |
2024-10-11 | Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image | Marta Veganzones Rodriguez et.al. | 2410.09155 | null |
2024-10-08 | Unsupervised Model Diagnosis | Yinong Oliver Wang et.al. | 2410.06243 | null |
2024-10-08 | Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration | Xueyang Kang et.al. | 2410.05729 | link |
2024-10-16 | Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features | Chengkai Hou et.al. | 2410.02237 | null |
2024-10-02 | Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection | Hongru Yan et.al. | 2410.01404 | null |
2024-09-30 | OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection | Changsheng Lu et.al. | 2409.19899 | link |
2024-10-07 | SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation | Xin Li et.al. | 2409.18082 | null |
2024-09-24 | GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Gennady Sidorov et.al. | 2409.16502 | link |
2024-09-20 | Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators | Niloufar Amiri et.al. | 2409.13668 | null |
2024-09-25 | Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding | Rania Hossam et.al. | 2409.08695 | link |
2024-09-06 | D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection | Kentaro Hirahara et.al. | 2409.04060 | null |
2024-10-01 | Towards Practical Human Motion Prediction with LiDAR Point Clouds | Xiao Han et.al. | 2408.08202 | null |
2024-07-31 | Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods | Xusheng Luo et.al. | 2408.00117 | null |
2024-07-26 | SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski et.al. | 2407.18907 | null |
2024-07-25 | LION: Linear Group RNN for 3D Object Detection in Point Clouds | Zhe Liu et.al. | 2407.18232 | link |
2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
2024-07-09 | LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition | Teng Wang et.al. | 2407.06730 | null |
2024-07-04 | PFGS: High Fidelity Point Cloud Rendering via Feature Splatting | Jiaxu Wang et.al. | 2407.03857 | link |
2024-07-03 | A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes | Li Fang et.al. | 2407.02830 | link |
2024-07-02 | Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning | Chengchao Shen et.al. | 2407.02014 | link |
2024-06-28 | Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics | Chengrui Gao et.al. | 2406.19672 | null |
2024-07-23 | A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking | Lorenzo Shaikewitz et.al. | 2406.16837 | link |
2024-06-03 | Scale-Free Image Keypoints Using Differentiable Persistent Homology | Giovanni Barbarani et.al. | 2406.01315 | link |
2024-06-23 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-05-25 | Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration | Junjie Gao et.al. | 2405.16085 | null |
2024-06-01 | Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection – Towards Precise Fish Morphological Assessment in Aquaculture Breeding | Weizhen Liu et.al. | 2405.12476 | link |
2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
2024-05-15 | Vector-Symbolic Architecture for Event-Based Optical Flow | Hongzhi You et.al. | 2405.08300 | null |
2024-05-13 | RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration | Congjia Chen et.al. | 2405.07594 | null |
2024-05-08 | Unsupervised Skin Feature Tracking with Deep Neural Networks | Jose Chang et.al. | 2405.04943 | null |
2024-05-07 | A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images | László Kopácsi et.al. | 2405.04650 | null |
2024-04-30 | A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images | Wang Zhang et.al. | 2404.19311 | null |
2024-04-25 | Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach | Tahmim Hossain et.al. | 2404.14560 | null |
2024-04-19 | SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers | Vandad Davoodnia et.al. | 2404.12625 | null |
2024-04-17 | Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images | Junbiao Pang et.al. | 2404.10985 | null |
2024-03-28 | Towards Long Term SLAM on Thermal Imagery | Colin Keil et.al. | 2403.19885 | link |
2024-03-28 | Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation | Xiao Lin et.al. | 2403.19527 | link |
2024-03-27 | RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation | Yang Tian et.al. | 2403.18259 | null |
2024-03-18 | FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events | Xiangyuan Wang et.al. | 2403.11662 | link |
2024-03-05 | Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion | Meng Zheng et.al. | 2403.03217 | null |
2024-02-22 | A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets | Chengzhang Yu et.al. | 2402.14241 | null |
2024-02-25 | A Feature Matching Method Based on Multi-Level Refinement Strategy | Shaojie Zhang et.al. | 2402.13488 | null |
2024-03-05 | 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data | Zhi-Yi Lin et.al. | 2402.13172 | null |
2024-02-25 | Region Feature Descriptor Adapted to High Affine Transformations | Shaojie Zhang et.al. | 2402.09724 | null |
2024-01-29 | Reconstructing Close Human Interactions from Multiple Views | Qing Shuai et.al. | 2401.16173 | link |
2024-01-17 | To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection | Luyi Han et.al. | 2401.09336 | link |
2024-01-08 | Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach | Huanyu Liu et.al. | 2401.03742 | link |
2024-03-22 | 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation | Li Xu et.al. | 2401.00029 | null |
2023-12-27 | Bezier-based Regression Feature Descriptor for Deformable Linear Objects | Fangqing Chen et.al. | 2312.16502 | null |
2023-12-24 | Residual Learning for Image Point Descriptors | Rashik Shrestha et.al. | 2312.15471 | null |
2023-12-22 | BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions | Elias Marks et.al. | 2312.14706 | null |
2023-12-19 | Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation | Jiaming Liu et.al. | 2312.12480 | null |
2023-12-19 | An effective image copy-move forgery detection using entropy image | Zhaowei Lu et.al. | 2312.11793 | link |
2023-12-11 | VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data | Jian Shi et.al. | 2312.08871 | link |
2023-12-11 | Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach | Travis Driver et.al. | 2312.06865 | link |
2023-12-01 | Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) | Emma Cramer et.al. | 2312.00592 | link |
2023-11-30 | Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications | Sahar Almahfouz Nasser et.al. | 2311.18281 | null |
2023-11-29 | Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features | Thomas Wimmer et.al. | 2311.18113 | link |
2023-11-28 | Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features | Niladri Shekhar Dutt et.al. | 2311.17024 | link |
2023-11-28 | Riemannian Self-Attention Mechanism for SPD Networks | Rui Wang et.al. | 2311.16738 | null |
2023-11-27 | A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor | Jialin Liu et.al. | 2311.15609 | null |
2023-11-21 | Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers | Bo Sun et.al. | 2311.12291 | null |
2023-11-20 | CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement | Boni Hu et.al. | 2311.11604 | link |
2023-11-17 | Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration | Paul J. Claasen et.al. | 2311.10361 | link |
2023-11-13 | Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning | Tomáš Kunzo et.al. | 2311.07398 | null |
2023-11-11 | CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer | Haoyu Ma et.al. | 2311.06443 | link |
2023-11-08 | 3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud | Jianchao Ci et.al. | 2311.04699 | null |
2023-11-06 | TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains | Alexander Naumann et.al. | 2311.03124 | link |
2023-11-06 | An invariant feature extraction for multi-modal images matching | Chenzhong Gao et.al. | 2311.02842 | null |
2023-10-20 | Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification | Mateus Roder et.al. | 2310.13490 | null |
2023-10-12 | UniPose: Detecting Any Keypoints | Jie Yang et.al. | 2310.08530 | link |
2023-10-10 | l-dyno: framework to learn consistent visual features using robot’s motion | Kartikeya Singh et.al. | 2310.06249 | link |
2023-10-10 | Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face | Hao Zhang et.al. | 2310.05056 | link |
2023-10-13 | H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation | Yanjie Ze et.al. | 2310.01404 | link |
2023-10-04 | Self-supervised Learning of Contextualized Local Visual Embeddings | Thalles Santos Silva et.al. | 2310.00527 | link |
2023-10-22 | ObVi-SLAM: Long-Term Object-Visual SLAM | Amanda Adkins et.al. | 2309.15268 | link |
2023-09-19 | LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation | Haizhou Zhang et.al. | 2309.10436 | link |
2023-09-18 | RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy | Mert Asim Karaoglu et.al. | 2309.09563 | null |
2023-09-17 | CryoAlign: feature-based method for global and local 3D alignment of EM density maps | Bintao He et.al. | 2309.09217 | null |
2023-09-14 | EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization | Minjung Kim et.al. | 2309.07471 | link |
2023-09-09 | Mirror-Aware Neural Humans | Daniel Ajisafe et.al. | 2309.04750 | link |
2023-09-07 | InstructDiffusion: A Generalist Modeling Interface for Vision Tasks | Zigang Geng et.al. | 2309.03895 | null |
2023-09-04 | SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras | Himanshu Pahadia et.al. | 2309.01324 | null |
2023-09-12 | Improving the matching of deformable objects by learning to detect keypoints | Felipe Cadar et.al. | 2309.00434 | link |
2023-08-31 | SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation | Jiaben Chen et.al. | 2308.16876 | null |
2023-08-30 | Learning Structure-from-Motion with Graph Attention Networks | Lucas Brynte et.al. | 2308.15984 | link |
2023-08-29 | A lightweight 3D dense facial landmark estimation model from position map data | Shubhajit Basak et.al. | 2308.15170 | link |
2023-08-27 | Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors | Francesco Pirotti et.al. | 2308.14047 | null |
2023-08-24 | VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition | Gengxuan Tian et.al. | 2308.12870 | null |
2023-08-22 | LDP-Feat: Image Features with Local Differential Privacy | Francesco Pittaluga et.al. | 2308.11223 | null |
2023-08-20 | Neural Interactive Keypoint Detection | Jie Yang et.al. | 2308.10174 | link |
2023-08-19 | ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment | Bingyang Zhou et.al. | 2308.09987 | null |
2023-09-03 | DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching | Johan Edstedt et.al. | 2308.08479 | link |
2023-08-15 | CoDeF: Content Deformation Fields for Temporally Consistent Video Processing | Hao Ouyang et.al. | 2308.07926 | link |
2023-08-15 | ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition | Wenyuan Xue et.al. | 2308.07743 | null |
2023-08-14 | DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport | Sk Aziz Ali et.al. | 2308.07153 | null |
2023-08-14 | 2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds | Minhao Li et.al. | 2308.05667 | link |
2023-08-02 | Automated Hit-frame Detection for Badminton Match Analysis | Yu-Hang Chien et.al. | 2307.16000 | link |
2023-07-25 | Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception | Chuanyu Luo et.al. | 2307.13300 | null |
2023-07-21 | Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data | Sahar Almahfouz Nasser et.al. | 2307.10698 | link |
2023-07-19 | SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid | Zi Li et.al. | 2307.09727 | link |
2023-07-01 | SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation | Fabian Duffhauss et.al. | 2307.00306 | link |
2023-06-27 | Detector-Free Structure from Motion | Xingyi He et.al. | 2306.15669 | link |
2023-06-26 | CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild | Li Ding et.al. | 2306.15073 | null |
2023-06-28 | Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset | Ziqiao Weng et.al. | 2306.07089 | link |
2023-06-07 | Learning Probabilistic Coordinate Fields for Robust Correspondences | Weiyue Zhao et.al. | 2306.04231 | null |
2023-06-03 | LDEB – Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues | Amitabha Dey et.al. | 2306.02193 | null |
2023-06-02 | Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images | Marcela Mera-Trujillo et.al. | 2306.01938 | null |
2023-06-01 | A Probabilistic Relaxation of the Two-Stage Object Pose Estimation Paradigm | Onur Beker et.al. | 2306.00892 | null |
2023-05-30 | Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection | Supeng Wang et.al. | 2305.18714 | link |
2023-05-23 | Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence | Grace Luo et.al. | 2305.14334 | null |
2023-05-15 | Non-Separable Multi-Dimensional Network Flows for Visual Computing | Viktoria Ehm et.al. | 2305.08628 | null |
2023-05-13 | Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance | Xinyu Lin et.al. | 2305.07943 | link |
2023-05-05 | HD2Reg: Hierarchical Descriptors and Detectors for Point Cloud Registration | Canhui Tang et.al. | 2305.03487 | link |
2023-04-17 | Human Pose Estimation in Monocular Omnidirectional Top-View Images | Jingrui Yu et.al. | 2304.08186 | null |
2023-04-14 | CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression | Mubariz Zaffar et.al. | 2304.07426 | null |
2023-04-12 | SiLK – Simple Learned Keypoints | Pierre Gleize et.al. | 2304.06194 | link |
2023-04-06 | From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection | Changsheng Lu et.al. | 2304.03140 | null |
2023-03-29 | NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud | Xiangyu Zhu et.al. | 2303.16465 | link |
2023-03-24 | PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View | Ze Shi et.al. | 2303.14095 | link |
2023-03-23 | Semantic Image Attack for Visual Model Diagnosis | Jinqi Luo et.al. | 2303.13010 | null |
2023-03-22 | Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation | Heng Yang et.al. | 2303.12246 | link |
2023-03-21 | RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network | Sangmin Yoo et.al. | 2303.10770 | null |
2023-03-17 | ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty | Vanessa Wirth et.al. | 2303.10042 | null |
2023-03-15 | Descriptor Distillation for Efficient Multi-Robot SLAM | Xiyue Guo et.al. | 2303.08420 | null |
2023-03-15 | From Local Binary Patterns to Pixel Difference Networks for Efficient Visual Representation Learning | Zhuo Su et.al. | 2303.08414 | null |
2023-03-16 | KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF Grasp Synthesis on RGB-D input | Yiye Chen et.al. | 2303.05617 | link |
2023-03-07 | External Camera-based Mobile Robot Pose Estimation for Collaborative Perception with Smart Edge Sensors | Simon Bultmann et.al. | 2303.03797 | null |
2023-02-26 | PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints Detection | Shenwei Xie et.al. | 2302.13263 | null |
2023-02-24 | Hybrid machine-learned homogenization: Bayesian data mining and convolutional neural networks | Julian Lißner et.al. | 2302.12545 | null |
2023-02-21 | Deep Reinforcement Learning Based on Local GNN for Goal-conditioned Deformable Object Rearranging | Yuhong Deng et.al. | 2302.10446 | null |
2023-02-12 | A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training | Jingnan Shi et.al. | 2302.06019 | null |
2023-02-11 | Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing | Zitong Yu et.al. | 2302.05744 | null |
2023-02-09 | MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection | Yuhe Ding et.al. | 2302.04589 | link |
2023-02-03 | Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation | Jie Yang et.al. | 2302.01593 | link |
2023-02-03 | Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization | Yingying Zhu et.al. | 2302.01572 | link |
2023-01-21 | Vision Aided Environment Semantics Extraction and Its Application in mmWave Beam Selection | Feiyang Wen et.al. | 2301.08973 | null |
2023-01-18 | OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models | Xingyi He et.al. | 2301.07673 | null |
2023-01-12 | Towards High Performance One-Stage Human Pose Estimation | Ling Li et.al. | 2301.04842 | null |
2022-12-31 | Rethinking Rotation Invariance with Point Cloud Registration | Jianhui Yu et.al. | 2301.00149 | null |
2023-02-06 | Fruit Ripeness Classification: a Survey | Matteo Rizzo et.al. | 2212.14441 | null |
2022-12-28 | NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action | Kuan-Chieh Wang et.al. | 2212.13660 | link |
2022-12-24 | HandsOff: Labeled Dataset Generation With No Additional Human Annotations | Austin Xu et.al. | 2212.12645 | null |
2022-12-13 | Learning to Detect Good Keypoints to Match Non-Rigid Objects in RGB Images | Welerson Melo et.al. | 2212.09589 | link |
2022-12-15 | Learning Markerless Robot-Depth Camera Calibration and End-Effector Pose Estimation | Bugra C. Sefercik et.al. | 2212.07567 | null |
2023-02-01 | DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization | Xiangyu Xu et.al. | 2212.04575 | null |
2022-12-07 | ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation | Yufei Xu et.al. | 2212.04246 | link |
2022-12-15 | Designing Feature Vector Representations: A case study from Chemistry | Signe Sidwall Thygesen et.al. | 2212.03731 | null |
2022-12-09 | DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model | Jeongjun Choi et.al. | 2212.02796 | link |
2022-12-05 | Images Speak in Images: A Generalist Painter for In-Context Visual Learning | Xinlong Wang et.al. | 2212.02499 | link |
2022-12-06 | R2FD2: Fast and Robust Matching of Multimodal Remote Sensing Image via Repeatable Feature Detector and Rotation-invariant Feature Descriptor | Bai Zhu et.al. | 2212.02277 | null |
2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | link |
2022-11-29 | BALF: Simple and Efficient Blur Aware Local Feature Detector | Zhenjun Zhao et.al. | 2211.14731 | null |
2022-11-21 | Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching | Paul Roetzer et.al. | 2211.11589 | link |
2022-11-07 | Learning Feature Descriptors for Pre- and Intra-operative Point Cloud Matching for Laparoscopic Liver Registration | Zixin Yang et.al. | 2211.03688 | null |
2022-10-31 | Tree Detection and Diameter Estimation Based on Deep Learning | Vincent Grondin et.al. | 2210.17424 | link |
2022-10-26 | Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds | Zhiyuan Zhang et.al. | 2210.14899 | null |
2022-10-23 | Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders | Ömer Sümer et.al. | 2210.12705 | null |
2022-10-21 | Real-time Detection of 2D Tool Landmarks with Synthetic Training Data | Bram Vanherle et.al. | 2210.11991 | null |
2022-10-09 | Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning | Ali Safa et.al. | 2210.04236 | null |
2022-10-04 | Centroid Distance Keypoint Detector for Colored Point Clouds | Hanzhe Teng et.al. | 2210.01298 | link |
2022-09-28 | Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences | Jun-Jee Chao et.al. | 2209.14419 | null |
2022-09-28 | USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation | Zhengrong Xue et.al. | 2209.13864 | null |
2022-10-16 | Suture Thread Spline Reconstruction from Endoscopic Images for Robotic Surgery with Reliability-driven Keypoint Detection | Neelay Joglekar et.al. | 2209.13657 | link |
2022-09-27 | Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors | Hao Dong et.al. | 2209.13586 | link |
2022-09-26 | Performance Evaluation of 3D Keypoint Detectors and Descriptors on Coloured Point Clouds in Subsea Environments | Kyungmin Jung et.al. | 2209.12881 | null |
2022-10-07 | Long-Lived Accurate Keypoints in Event Streams | Philippe Chiberre et.al. | 2209.10385 | null |
2022-09-20 | Integrative Feature and Cost Aggregation with Transformers for Dense Correspondence | Sunghwan Hong et.al. | 2209.08742 | null |
2022-09-15 | Online Marker-free Extrinsic Camera Calibration using Person Keypoint Detections | Bastian Pätzold et.al. | 2209.07393 | link |
2022-09-07 | Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip | Yang Li et.al. | 2209.03440 | null |
2022-08-27 | Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes | Ali Safa et.al. | 2208.12997 | null |
2022-08-24 | Self-Supervised Endoscopic Image Key-Points Matching | Manel Farhat et.al. | 2208.11424 | link |
2022-08-19 | Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture | Muhammad Muzammel et.al. | 2208.08224 | null |
2022-08-08 | MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware Ambidextrous Bin Picking via Physics-based Metaverse Synthesis | Maximilian Gilles et.al. | 2208.03963 | null |
2022-08-07 | CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization | Yujiao Shi et.al. | 2208.03660 | null |
2022-07-29 | Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation | Qihao Liu et.al. | 2208.00090 | null |
2022-07-25 | Translating a Visual LEGO Manual to a Machine-Executable Plan | Ruocheng Wang et.al. | 2207.12572 | null |
2022-07-21 | Multi-modal Retinal Image Registration Using a Keypoint-Based Vessel Structure Aligning Network | Aline Sindel et.al. | 2207.10506 | null |
2022-07-15 | Human keypoint detection for close proximity human-robot interaction | Jan Docekal et.al. | 2207.07742 | null |
2022-07-15 | Adversarial Focal Loss: Asking Your Discriminator for Hard Examples | Chen Liu et.al. | 2207.07739 | null |
2022-07-13 | Rapid Person Re-Identification via Sub-space Consistency Regularization | Qingze Yin et.al. | 2207.05933 | null |
2022-07-07 | RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments | Qihao Peng et.al. | 2207.03539 | null |
2022-08-15 | Semi-supervised Human Pose Estimation in Art-historical Images | Matthias Springstein et.al. | 2207.02976 | link |
2022-07-01 | Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling | Jiamin Liang et.al. | 2207.00474 | null |
2022-06-24 | Motion Estimation for Large Displacements and Deformations | Qiao Chen et.al. | 2206.12464 | null |
2022-06-24 | Deep embedded clustering algorithm for clustering PACS repositories | Teo Manojlović et.al. | 2206.12417 | null |
2022-06-21 | KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences | Xuanhan Wang et.al. | 2206.10090 | link |
2022-06-20 | Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval | Guile Wu et.al. | 2206.09806 | null |
2022-06-15 | A Unified Sequence Interface for Vision Tasks | Ting Chen et.al. | 2206.07669 | link |
2022-06-09 | Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields | Mingtong Zhang et.al. | 2206.04669 | null |
2022-06-03 | SNAKE: Shape-aware Neural 3D Keypoint Field | Chengliang Zhong et.al. | 2206.01724 | link |
2022-05-17 | MulT: An End-to-End Multitask Learning Transformer | Deblina Bhattacharjee et.al. | 2205.08303 | null |
2022-05-10 | ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions In-the-Wild | Chirag Raman et.al. | 2205.05177 | link |
2022-04-28 | Polarimetric imaging for the detection of synthetic models of SARS-CoV-2: a proof of concept | Emilio Gomez-Gonzalez et.al. | 2204.14050 | null |
2022-05-02 | GRIT: General Robust Image Task Benchmark | Tanmay Gupta et.al. | 2204.13653 | link |
2022-05-24 | ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation | Yufei Xu et.al. | 2204.12484 | link |
2022-04-26 | Unified GCNs: Towards Connecting GCNs with CNNs | Ziyan Zhang et.al. | 2204.12300 | null |
2022-04-19 | Self-Supervised Equivariant Learning for Oriented Keypoint Detection | Jongmin Lee et.al. | 2204.08613 | link |
2022-04-17 | The Z-axis, X-axis, Weight and Disambiguation Methods for Constructing Local Reference Frame in 3D Registration: An Evaluation | Bao Zhao et.al. | 2204.08024 | null |
2022-04-15 | 2D Human Pose Estimation: A Survey | Haoming Chen et.al. | 2204.07370 | null |
2022-04-11 | Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification | Haojie Liu et.al. | 2204.04842 | null |
2022-04-07 | Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification | Yanan Wang et.al. | 2204.02611 | link |
2022-04-02 | SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning | Nilaksh Das et.al. | 2204.00734 | link |
2022-04-01 | MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration | Chenzhong Gao et.al. | 2204.00260 | null |
2022-03-29 | Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning | David Howard et.al. | 2203.15172 | null |
2022-03-28 | REGTR: End-to-end Point Cloud Correspondences with Transformers | Zi Jian Yew et.al. | 2203.14517 | link |
2022-03-27 | UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection | Ye Liu et.al. | 2203.12745 | link |
2022-03-21 | MatchFormer: Interleaving Attention in Transformers for Feature Matching | Qing Wang et.al. | 2203.09645 | link |
2022-03-16 | PosePipe: Open-Source Human Pose Estimation Pipeline for Clinical Research | R. James Cotton et.al. | 2203.08792 | link |
2022-03-11 | DRTAM: Dual Rank-1 Tensor Attention Module | Hanxing Chi et.al. | 2203.05893 | null |
2022-03-07 | Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation | Meng Tian et.al. | 2203.03498 | null |
2022-02-10 | Motion-Aware Transformer For Occluded Person Re-identification | Mi Zhou et.al. | 2202.04243 | null |
2022-02-03 | Sim2Real Object-Centric Keypoint Detection and Description | Chengliang Zhong et.al. | 2202.00448 | null |
2022-01-16 | Cross-Centroid Ripple Pattern for Facial Expression Recognition | Monu Verma et.al. | 2201.05958 | null |
2022-01-14 | Reproducing BowNet: Learning Representations by Predicting Bags of Visual Words | Harry Nguyen et.al. | 2201.03556 | link |
2022-01-10 | TFS Recognition: Investigating MPH]{Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials | Jinnavat Sanalohit et.al. | 2201.03170 | null |
2022-01-06 | A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration | Aline Sindel et.al. | 2201.02242 | null |
2021-12-28 | Skin feature point tracking using deep feature encodings | Jose Ramon Chang et.al. | 2112.14159 | null |
2021-12-23 | Data-efficient learning for 3D mirror symmetry detection | Yancong Lin et.al. | 2112.12579 | null |
2021-12-22 | Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations – combining input rotations and a kinematic model | Michael Zwölfer et.al. | 2112.12193 | null |
2021-12-22 | Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction | Henrique Siqueira et.al. | 2112.12002 | link |
2021-12-19 | Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection | Renjie Li et.al. | 2112.10275 | null |
2021-12-19 | GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor | Jean-Baptiste Carluer et.al. | 2112.10258 | link |
2021-12-16 | Masked Feature Prediction for Self-Supervised Visual Pre-Training | Chen Wei et.al. | 2112.09133 | link |
2021-12-13 | DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points | Zhengfei Kuang et.al. | 2112.06910 | null |
2021-12-12 | Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species | Changsheng Lu et.al. | 2112.06183 | link |
2021-12-13 | Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings | Mel Vecerik et.al. | 2112.04910 | null |
2021-12-06 | ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction | Xiaoming Zhao et.al. | 2112.02906 | link |
2021-11-25 | Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association | Sen Yang et.al. | 2111.12892 | link |
2021-11-08 | Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object Images | Jianfei Guo et.al. | 2111.04237 | null |
2021-11-04 | Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image | Feng Liu et.al. | 2111.03098 | null |
2021-11-01 | Learning Event-based Spatio-Temporal Feature Descriptors via Local Synaptic Plasticity: A Biologically-realistic Perspective of Computer Vision | Ali Safa et.al. | 2111.00791 | null |
2021-10-30 | Geometry-Aware Hierarchical Bayesian Learning on Manifolds | Yonghui Fan et.al. | 2111.00184 | null |
2021-10-26 | CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration | Hao Yu et.al. | 2110.14076 | link |
2021-10-23 | HWTool: Fully Automatic Mapping of an Extensible C++ Image Processing Language to Hardware | James Hegarty et.al. | 2110.12106 | null |
2021-10-18 | Keypoint-Based Bimanual Shaping of Deformable Linear Objects under Environmental Constraints using Hierarchical Action Planning | Shengzeng Huo et.al. | 2110.08962 | null |
2021-10-11 | High-order Tensor Pooling with Attention for Action Recognition | Piotr Koniusz et.al. | 2110.05216 | null |
2021-10-10 | Digging Into Self-Supervised Learning of Feature Descriptors | Iaroslav Melekhov et.al. | 2110.04773 | null |
2021-10-04 | BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion | Zhaoqun Li et.al. | 2110.01179 | link |
2021-10-01 | Machine learning aided noise filtration and signal classification for CREDO experiment | Łukasz Bibrzycki et.al. | 2110.00297 | null |
2021-09-28 | PDC-Net+: Enhanced Probabilistic Dense Correspondence Network | Prune Truong et.al. | 2109.13912 | link |
2021-09-27 | HarrisZ $^+$ : Harris Corner Selection for Next-Gen Image Matching Pipelines | Fabio Bellavia et.al. | 2109.12925 | null |
2021-09-24 | Catadioptric Stereo on a Smartphone | Kristijan Bartol et.al. | 2109.11872 | null |
2021-09-20 | Semi-supervised Dense Keypointsusing Unlabeled Multiview Images | Zhixuan Yu et.al. | 2109.09299 | null |
2021-08-31 | A Novel Dataset for Keypoint Detection of quadruped Animals from Images | Prianka Banik et.al. | 2108.13958 | link |
2021-08-27 | A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images | Xiaoteng Zhou et.al. | 2108.12151 | null |
Image Matching
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-09 | Dual-Granularity Cross-Modal Identity Association for Weakly-Supervised Text-to-Person Image Matching | Yafei Zhang et.al. | 2507.06744 | null |
2025-07-05 | From Query to Explanation: Uni-RAG for Multi-Modal Retrieval-Augmented Learning in STEM | Xinyi Wu et.al. | 2507.03868 | null |
2025-07-02 | What does really matter in image goal navigation? | Gianluca Monaci et.al. | 2507.01667 | null |
2025-06-30 | Efficient and Accurate Image Provenance Analysis: A Scalable Pipeline for Large-scale Images | Jiewei Lai et.al. | 2506.23707 | null |
2025-06-29 | Dynamic Contrastive Learning for Hierarchical Retrieval: A Case Study of Distance-Aware Cross-View Geo-Localization | Suofei Zhang et.al. | 2506.23077 | null |
2025-06-27 | MatChA: Cross-Algorithm Matching with Feature Augmentation | Paula Carbó Cubero et.al. | 2506.22336 | null |
2025-07-07 | Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs | Shaojie Zhang et.al. | 2506.22139 | null |
2025-06-27 | ZeroReg3D: A Zero-shot Registration Pipeline for 3D Consecutive Histopathology Image Reconstruction | Juming Xiong et.al. | 2506.21923 | null |
2025-06-25 | Fast entropy-regularized SDP relaxations for permutation synchronization | Michael Lindsey et.al. | 2506.20191 | null |
2025-06-18 | ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections | Ziling Huang et.al. | 2506.15180 | null |
2025-06-16 | EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition | Bingxi Liu et.al. | 2506.13133 | null |
2025-06-12 | RealKeyMorph: Keypoints in Real-world Coordinates for Resolution-agnostic Image Registration | Mina C. Moghadam et.al. | 2506.10344 | null |
2025-06-11 | Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints | Xiangkai Zhang et.al. | 2506.09748 | null |
2025-06-11 | ScaleLSD: Scalable Deep Line Segment Detection Streamlined | Zeran Ke et.al. | 2506.09369 | link |
2025-05-21 | Anti-interrupted sampling repeater jamming via linear canonical Wigner distribution lightweight LFM detection | Jia-Mian Li et.al. | 2506.06302 | null |
2025-06-05 | Vanishing arcs for isolated plane curve singularities | Hanwool Bae et.al. | 2506.04917 | null |
2025-06-05 | Deep Learning Reforms Image Matching: A Survey and Outlook | Shihua Zhang et.al. | 2506.04619 | null |
2025-06-20 | SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping | Mingxu Zhang et.al. | 2505.24305 | null |
2025-06-05 | Universal Domain Adaptation for Semantic Segmentation | Seun-An Choe et.al. | 2505.22458 | null |
2025-05-23 | To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models | Simone Gaisbauer et.al. | 2505.17973 | link |
2025-05-16 | Multi-view dense image matching with similarity learning and geometry priors | Mohamed Ali Chebbi et.al. | 2505.11264 | null |
2025-05-12 | Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection | Yuqi Cheng et.al. | 2505.07375 | link |
2025-05-04 | OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery | Chongsheng Zhang et.al. | 2505.03836 | link |
2025-05-06 | LiftFeat: 3D Geometry-Aware Local Feature Matching | Yepeng Liu et.al. | 2505.03422 | link |
2025-05-04 | Focus What Matters: Matchability-Based Reweighting for Local Feature Matching | Dongyue Li et.al. | 2505.02161 | null |
2025-05-15 | Mitigating Modality Bias in Multi-modal Entity Alignment from a Causal Perspective | Taoyu Su et.al. | 2504.19458 | link |
2025-04-28 | Dynamic Arthroscopic Navigation System for Anterior Cruciate Ligament Reconstruction Based on Multi-level Memory Architecture | Shuo Wang et.al. | 2504.19398 | null |
2025-04-23 | Road Similarity-Based BEV-Satellite Image Matching for UGV Localization | Zhenping Sun et.al. | 2504.16346 | null |
2025-04-18 | Outlier-Robust Multi-Model Fitting on Quantum Annealers | Saurabh Pandey et.al. | 2504.13836 | null |
2025-04-11 | Geometric Consistency Refinement for Single Image Novel View Synthesis via Test-Time Adaptation of Diffusion Models | Josef Bengtson et.al. | 2504.08348 | null |
2025-04-10 | Image registration of 2D optical thin sections in a 3D porous medium: Application to a Berea sandstone digital rock image | Jaehong Chung et.al. | 2504.06604 | link |
2025-04-22 | To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition | Davide Sferrazza et.al. | 2504.06116 | link |
2025-04-10 | Learning Affine Correspondences by Integrating Geometric Constraints | Pengju Sun et.al. | 2504.04834 | link |
2025-04-01 | Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data | Yiqun Duan et.al. | 2504.00812 | null |
2025-03-31 | CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching | Zizhuo Li et.al. | 2503.23925 | null |
2025-03-28 | Pairwise Matching of Intermediate Representations for Fine-grained Explainability | Lauren Shrack et.al. | 2503.22881 | link |
2025-03-26 | Multimodal Image Matching based on Frequency-domain Information of Local Energy Response | Meng Yang et.al. | 2503.20827 | null |
2025-03-22 | Normalized Matching Transformer | Abtin Pourhadi et.al. | 2503.17715 | link |
2025-03-20 | Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors | Tian Yi Lim et.al. | 2503.16275 | null |
2025-03-20 | MapGlue: Multimodal Remote Sensing Image Matching | Peihao Wu et.al. | 2503.16185 | link |
2025-03-19 | PAPI-Reg: Patch-to-Pixel Solution for Efficient Cross-Modal Registration between LiDAR Point Cloud and Camera Image | Yuanchao Yue et.al. | 2503.15285 | null |
2025-04-07 | Less Biased Noise Scale Estimation for Threshold-Robust RANSAC | Johan Edstedt et.al. | 2503.13433 | null |
2025-03-17 | SatDepth: A Novel Dataset for Satellite Image Matching | Rahul Deshmukh et.al. | 2503.12706 | link |
2025-03-14 | Refining Image Edge Detection via Linear Canonical Riesz Transforms | Shuhui Yang et.al. | 2503.11148 | null |
2025-03-13 | Speedy MASt3R | Jingxing Li et.al. | 2503.10017 | null |
2025-03-11 | Keypoint Detection and Description for Raw Bayer Images | Jiakai Lin et.al. | 2503.08673 | null |
2025-03-06 | Learning 3D Medical Image Models From Brain Functional Connectivity Network Supervision For Mental Disorder Diagnosis | Xingcan Hu et.al. | 2503.04205 | null |
2025-03-07 | Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration | Qianliang Wu et.al. | 2503.04127 | null |
2025-03-05 | JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba | Xiaoyong Lu et.al. | 2503.03437 | null |
2025-02-28 | CNSv2: Probabilistic Correspondence Encoded Neural Image Servo | Anzhe Chen et.al. | 2503.00132 | null |
2025-02-27 | A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization | Yejun Zhang et.al. | 2502.20036 | link |
2025-02-27 | RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges | Thibaut Loiseau et.al. | 2502.19955 | null |
2025-02-26 | BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure | Haoxin Cai et.al. | 2502.19242 | link |
2025-02-25 | PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching | Han Nie et.al. | 2502.18104 | link |
2025-02-25 | Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking | Xin Tong et.al. | 2502.17766 | null |
2025-03-04 | Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model | Yaxuan Huang et.al. | 2502.16779 | null |
2025-02-16 | FeaKM: Robust Collaborative Perception under Noisy Pose Conditions | Jiuwu Hao et.al. | 2502.11003 | link |
2025-02-24 | Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation | Emanuele Mule et.al. | 2502.06288 | link |
2025-02-04 | Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications | William O’Donnell et.al. | 2502.02624 | null |
2025-02-01 | MambaGlue: Fast and Robust Local Feature Matching With Mamba | Kihwan Ryoo et.al. | 2502.00462 | link |
2025-01-24 | Dense-SfM: Structure from Motion with Dense Consistent Matching | JongMin Lee et.al. | 2501.14277 | null |
2025-01-20 | MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching | Yepeng Liu et.al. | 2501.11299 | null |
2025-01-13 | MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training | Xingyi He et.al. | 2501.07556 | null |
2025-01-13 | Matching Free Depth Recovery from Structured Light | Zhuohang Yu et.al. | 2501.07113 | null |
2025-01-02 | Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views | Yulun Wu et.al. | 2501.01196 | null |
2024-12-31 | Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights | Bharath Kumar Agnur et.al. | 2412.20210 | null |
2024-12-27 | MINIMA: Modality Invariant Image Matching | Xingyu Jiang et.al. | 2412.19412 | link |
2024-12-24 | GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network | Xianfeng Song et.al. | 2412.18221 | link |
2024-12-17 | Bringing Multimodality to Amazon Visual Search System | Xinliang Zhu et.al. | 2412.13364 | null |
2024-12-04 | Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis | Siyoon Jin et.al. | 2412.03150 | null |
2024-11-20 | DT-LSD: Deformable Transformer-based Line Segment Detection | Sebastian Janampa et.al. | 2411.13005 | link |
2024-11-15 | Image Matching Filtering and Refinement by Planes and Beyond | Fabio Bellavia et.al. | 2411.09484 | link |
2024-11-11 | XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration | Ismail Can Yagmur et.al. | 2411.07430 | link |
2024-11-07 | The Impact of Semi-Supervised Learning on Line Segment Detection | Johanna Engman et.al. | 2411.04596 | link |
2024-11-04 | Silver medal Solution for Image Matching Challenge 2024 | Yian Wang et.al. | 2411.01851 | null |
2024-10-30 | Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants | Azadeh Sharafi et.al. | 2410.23329 | null |
2024-11-05 | RelationBooth: Towards Relation-Aware Customized Object Generation | Qingyu Shi et.al. | 2410.23280 | null |
2024-10-31 | ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses | Junjie Ni et.al. | 2410.22733 | null |
2024-10-30 | LoFLAT: Local Feature Matching using Focused Linear Attention Transformer | Naijian Cao et.al. | 2410.22710 | null |
2024-10-26 | Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification | Yue Su et.al. | 2410.20097 | null |
2024-10-01 | A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference | Yuan Li et.al. | 2410.11848 | null |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-09-27 | Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras | Yipeng Lu et.al. | 2409.18673 | null |
2024-09-25 | Game4Loc: A UAV Geo-Localization Benchmark from Game Data | Yuxiang Ji et.al. | 2409.16925 | link |
2024-09-24 | Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge | Marek Wodzinski et.al. | 2409.15931 | null |
2024-09-10 | Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Yujiao Shi et.al. | 2409.06471 | link |
2024-09-05 | Enabling Practical and Privacy-Preserving Image Processing | Chao Wang et.al. | 2409.03568 | null |
2024-09-20 | A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering | Shuang Song et.al. | 2409.03032 | link |
2024-08-29 | Super-Resolution works for coastal simulations | Zhi-Song Liu et.al. | 2408.16553 | null |
2024-09-15 | Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks | Sierra Bonilla et.al. | 2408.16445 | link |
2024-08-26 | Affine steerers for structured keypoint description | Georg Bökman et.al. | 2408.14186 | link |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
2024-09-11 | Coarse-to-fine Alignment Makes Better Speech-image Retrieval | Lifeng Zhou et.al. | 2408.13119 | null |
2024-08-19 | BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval | Zhenyu Lu et.al. | 2408.10383 | null |
2024-08-14 | RSD-DOG : A New Image Descriptor based on Second Order Derivatives | Darshan Venkatrayappa et.al. | 2408.07687 | null |
2024-08-09 | One Shot is Enough for Sequential Infrared Small Target Segmentation | Bingbing Dan et.al. | 2408.04823 | link |
2024-08-07 | PRISM: PRogressive dependency maxImization for Scale-invariant image Matching | Xudong Cai et.al. | 2408.03598 | null |
2024-08-05 | ConDL: Detector-Free Dense Image Matching | Monika Kwiatkowski et.al. | 2408.02766 | null |
2024-08-04 | Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image | Xinlin Ren et.al. | 2408.02079 | link |
2024-07-29 | Image-text matching for large-scale book collections | Artemis Llabrés et.al. | 2407.19812 | link |
2024-07-26 | PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis | Sohyeong Kim et.al. | 2407.18695 | null |
2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
2024-07-16 | REMM:Rotation-Equivariant Framework for End-to-End Multimodal Image Matching | Han Nie et.al. | 2407.11637 | link |
2024-07-16 | A Self-Correcting Strategy of the Digital Volume Correlation Displacement Field Based on Image Matching: Application to Poor Speckles Quality and Complex-Large Deformation | Chengsheng Li et.al. | 2407.11287 | null |
2024-07-14 | Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching | Xiaoyong Lu et.al. | 2407.07789 | null |
2024-07-10 | Mutual Information calculation on different appearances | Jiecheng Liao et.al. | 2407.07410 | null |
2024-07-15 | SfM on-the-fly: Get better 3D from What You Capture | Zongqian Zhan et.al. | 2407.03939 | null |
2024-07-03 | IMC 2024 Methods & Solutions Review | Shyam Gupta et.al. | 2407.03172 | null |
2024-06-21 | High Resolution Surface Reconstruction of Cultural Heritage Objects Using Shape from Polarization Method | F. S. Mortazavi et.al. | 2406.15121 | null |
2024-06-16 | Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models | Yikai Zhang et.al. | 2406.10902 | link |
2024-06-14 | Grounding Image Matching in 3D with MASt3R | Vincent Leroy et.al. | 2406.09756 | link |
2024-06-05 | A Self-Supervised Denoising Strategy for Underwater Acoustic Camera Imageries | Xiaoteng Zhou et.al. | 2406.02914 | null |
2024-05-22 | Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching | Hongkai Chen et.al. | 2405.13874 | null |
2024-05-21 | OmniGlue: Generalizable Feature Matching with Foundation Model Guidance | Hanwen Jiang et.al. | 2405.12979 | link |
2024-07-09 | Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation | Rezkellah Noureddine Khiati et.al. | 2405.08556 | link |
2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
2024-05-13 | Authentic Hand Avatar from a Phone Scan via Universal Hand Model | Gyeongsik Moon et.al. | 2405.07933 | null |
2024-04-30 | A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images | Wang Zhang et.al. | 2404.19311 | null |
2024-04-30 | XFeat: Accelerated Features for Lightweight Image Matching | Guilherme Potje et.al. | 2404.19174 | null |
2024-06-10 | MinBackProp – Backpropagating through Minimal Solvers | Diana Sungatullina et.al. | 2404.17993 | link |
2024-04-25 | Transformer-Based Local Feature Matching for Multimodal Image Registration | Remi Delaunay et.al. | 2404.16802 | null |
2024-04-23 | FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction | Hang Hua et.al. | 2404.14715 | null |
2024-04-22 | Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer | Eric Brachmann et.al. | 2404.14351 | null |
2024-04-17 | A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching | Francesco Pro et.al. | 2404.11302 | link |
2024-04-16 | Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction | John Francis et.al. | 2404.10626 | null |
2024-04-15 | XoFTR: Cross-modal Feature Matching Transformer | Önder Tuzcuoğlu et.al. | 2404.09692 | link |
2024-04-13 | DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector | Johan Edstedt et.al. | 2404.08928 | link |
2024-04-09 | Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences | Axel Barroso-Laguna et.al. | 2404.06337 | link |
2024-04-01 | Marrying NeRF with Feature Matching for One-step Pose Estimation | Ronghan Chen et.al. | 2404.00891 | null |
2024-04-01 | 3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching | Yibin Ye et.al. | 2404.00838 | null |
2024-03-31 | On the Estimation of Image-matching Uncertainty in Visual Place Recognition | Mubariz Zaffar et.al. | 2404.00546 | null |
2024-03-30 | Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation | Yuan Wang et.al. | 2404.00262 | null |
2024-03-26 | Staircase Localization for Autonomous Exploration in Urban Environments | Jinrae Kim et.al. | 2403.17330 | null |
2024-03-23 | MatchSeg: Towards Better Segmentation via Reference Image Matching | Ruiqiang Xiao et.al. | 2403.15901 | link |
2024-03-20 | Unifying Local and Global Multimodal Features for Place Recognition in Aliased and Low-Texture Environments | Alberto García-Hernández et.al. | 2403.13395 | link |
2024-03-19 | HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free Matching | Ying Chen et.al. | 2403.12543 | null |
2024-03-16 | Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval | Shunsuke Tsubaki et.al. | 2403.10756 | null |
2024-03-16 | Vector search with small radiuses | Gergely Szilvasy et.al. | 2403.10746 | null |
2024-03-15 | Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline | Fangming Yuan et.al. | 2403.10283 | null |
2024-03-15 | Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning | Meixuan Li et.al. | 2403.10252 | null |
2024-03-14 | Virtual birefringence imaging and histological staining of amyloid deposits in label-free tissue using autofluorescence microscopy and deep learning | Xilin Yang et.al. | 2403.09100 | null |
2024-03-18 | Matching Non-Identical Objects | Yusuke Marumo et.al. | 2403.08227 | null |
2024-03-11 | Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed | Yifan Wang et.al. | 2403.04765 | null |
2024-03-07 | Scene Depth Estimation from Traditional Oriental Landscape Paintings | Sungho Kang et.al. | 2403.03408 | null |
2024-02-21 | Visual Style Prompting with Swapping Self-Attention | Jaeseok Jeong et.al. | 2402.12974 | link |
2024-02-16 | GIM: Learning Generalizable Image Matcher From Internet Videos | Xuelun Shen et.al. | 2402.11095 | link |
2024-02-13 | Are Semi-Dense Detector-Free Methods Good at Matching Local Features? | Matthieu Vilain et.al. | 2402.08671 | null |
2024-02-13 | Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang et.al. | 2402.08359 | link |
2024-01-31 | Improved Scene Landmark Detection for Camera Localization | Tien Do et.al. | 2401.18083 | link |
2024-03-11 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
2024-01-24 | Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry | Qi Cai et.al. | 2401.13357 | null |
2024-01-19 | SCENES: Subpixel Correspondence Estimation With Epipolar Supervision | Dominik A. Kloepfer et.al. | 2401.10886 | null |
2024-01-18 | Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Songhe Deng et.al. | 2401.09883 | link |
2024-01-26 | RomniStereo: Recurrent Omnidirectional Stereo Matching | Hualie Jiang et.al. | 2401.04345 | link |
2024-01-05 | CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs | Daoan Zhang et.al. | 2401.02582 | null |
2024-01-03 | Local Adaptive Clustering Based Image Matching for Automatic Visual Identification | Zhizhen Wang et.al. | 2401.01720 | null |
2024-01-03 | A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization | Shishen Li et.al. | 2401.01574 | null |
2023-12-23 | BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation | Tavis Shore et.al. | 2312.15363 | link |
2023-12-22 | Harnessing Diffusion Models for Visual Perception with Meta Prompts | Qiang Wan et.al. | 2312.14733 | link |
2024-01-05 | MatchDet: A Collaborative Framework for Image Matching and Object Detection | Jinxiang Lai et.al. | 2312.10983 | null |
2023-12-07 | Visual Geometry Grounded Deep Structure From Motion | Jianyuan Wang et.al. | 2312.04563 | null |
2023-12-04 | Steerers: A framework for rotation equivariant keypoint descriptors | Georg Bökman et.al. | 2312.02152 | link |
2023-11-30 | DSeg: Direct Line Segments Detection | Berger Cyrille et.al. | 2311.18344 | null |
2023-11-30 | Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications | Sahar Almahfouz Nasser et.al. | 2311.18281 | null |
2023-11-29 | LGFCTR: Local and Global Feature Convolutional Transformer for Image Matching | Wenhao Zhong et.al. | 2311.17571 | link |
2023-11-08 | Zero-shot Translation of Attention Patterns in VQA Models to Natural Language | Leonard Salewski et.al. | 2311.05043 | link |
2023-11-06 | An invariant feature extraction for multi-modal images matching | Chenzhong Gao et.al. | 2311.02842 | null |
2023-10-23 | RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments | Jinyu Li et.al. | 2310.15072 | link |
2023-10-23 | Player Re-Identification Using Body Part Appearences | Mahesh Bhosale et.al. | 2310.14469 | null |
2023-10-20 | FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer | Xinyu Zhang et.al. | 2310.13605 | null |
2023-11-14 | RGM: A Robust Generalist Matching Model | Songyan Zhang et.al. | 2310.11755 | link |
2023-10-07 | UFD-PRiME: Unsupervised Joint Learning of Optical Flow and Stereo Depth through Pixel-Level Rigid Motion Estimation | Shuai Yuan et.al. | 2310.04712 | null |
2023-10-02 | Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images | Georg Bökman et.al. | 2310.01092 | null |
2023-09-29 | Segment Anything Model is a Good Teacher for Local Feature Learning | Jingqian Wu et.al. | 2309.16992 | link |
2023-09-27 | KDD-LOAM: Jointly Learned Keypoint Detector and Descriptors Assisted LiDAR Odometry and Mapping | Renlang Huang et.al. | 2309.15394 | null |
2023-10-13 | A Critical Analysis of Internal Reliability for Uncertainty Quantification of Dense Image Matching in Multi-view Stereo | Debao Huang et.al. | 2309.09379 | null |
2023-09-11 | Towards Content-based Pixel Retrieval in Revisited Oxford and Paris | Guoyuan An et.al. | 2309.05438 | link |
2023-09-09 | Neural Semantic Surface Maps | Luca Morreale et.al. | 2309.04836 | null |
2023-09-05 | Doppelgangers: Learning to Disambiguate Images of Similar Structures | Ruojin Cai et.al. | 2309.02420 | link |
2023-08-14 | Occ $^2$ Net: Robust Image Matching Based on 3D Occupancy Estimation for Occluded Regions | Miao Fan et.al. | 2308.16160 | null |
2023-08-29 | TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching | Yun Liao et.al. | 2308.15144 | null |
2023-08-27 | LDL: Line Distance Functions for Panoramic Localization | Junho Kim et.al. | 2308.13989 | link |
2023-08-22 | Scene-Aware Feature Matching | Xiaoyong Lu et.al. | 2308.09949 | null |
2023-09-03 | DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching | Johan Edstedt et.al. | 2308.08479 | link |
2023-08-19 | Global Features are All You Need for Image Retrieval and Reranking | Shihao Shao et.al. | 2308.06954 | link |
2023-08-02 | ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation | Bo Zhang et.al. | 2308.00400 | link |
2023-07-28 | Cross-Modal Concept Learning and Inference for Vision-Language Models | Yi Zhang et.al. | 2307.15460 | null |
2023-07-22 | CryptoMask : Privacy-preserving Face Recognition | Jianli Bai et.al. | 2307.12010 | null |
2023-07-22 | A Stronger Stitching Algorithm for Fisheye Images based on Deblurring and Registration | Jing Hao et.al. | 2307.11997 | null |
2023-07-21 | Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data | Sahar Almahfouz Nasser et.al. | 2307.10698 | link |
2023-08-08 | Balancing Privacy and Progress in Artificial Intelligence: Anonymization in Histopathology for Biomedical Research and Education | Neel Kanwal et.al. | 2307.09426 | null |
2023-08-01 | Unsupervised Deep Graph Matching Based on Cycle Consistency | Siddharth Tourani et.al. | 2307.08930 | link |
2023-07-15 | Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents | Ke Cao et.al. | 2307.07763 | null |
2023-07-09 | Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion | Jie S. Li et.al. | 2307.05564 | null |
2023-07-11 | ResMatch: Residual Attention Learning for Local Feature Matching | Yuxin Deng et.al. | 2307.05180 | link |
2023-07-11 | TIAM – A Metric for Evaluating Alignment in Text-to-Image Generation | Paul Grimal et.al. | 2307.05134 | link |
2023-07-02 | TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching | Khang Truong Giang et.al. | 2307.00485 | link |
2023-06-27 | Detector-Free Structure from Motion | Xingyi He et.al. | 2306.15669 | link |
2023-06-28 | PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment | Jianyuan Wang et.al. | 2306.15667 | null |
2023-06-25 | Enhancing Dynamic Image Advertising with Vision-Language Pre-training | Zhoufutu Wen et.al. | 2306.14112 | null |
2023-06-23 | LightGlue: Local Feature Matching at Light Speed | Philipp Lindenberger et.al. | 2306.13643 | link |
2023-06-19 | Graph Self-Supervised Learning for Endoscopic Image Matching | Manel Farhat et.al. | 2306.11141 | link |
2023-06-09 | Leaving the Lines Behind: Vision-Based Crop Row Exit for Agricultural Robot Navigation | Rajitha de Silva et.al. | 2306.05869 | null |
2023-06-07 | A2B: Anchor to Barycentric Coordinate for Robust Correspondence | Weiyue Zhao et.al. | 2306.02760 | null |
2023-05-27 | Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation | Yueh-Cheng Huang et.al. | 2305.17463 | null |
2023-05-19 | SIDAR: Synthetic Image Dataset for Alignment & Restoration | Monika Kwiatkowski et.al. | 2305.12036 | link |
2023-05-18 | LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation | Yujie Lu et.al. | 2305.11116 | link |
2023-05-16 | A Method for Training-free Person Image Picture Generation | Tianyu Chen et.al. | 2305.09817 | null |
2023-05-15 | Image Matching by Bare Homography | Fabio Bellavia et.al. | 2305.08946 | null |
2023-05-12 | CLIP-Count: Towards Text-Guided Zero-Shot Object Counting | Ruixiang Jiang et.al. | 2305.07304 | link |
2023-05-10 | SENDD: Sparse Efficient Neural Depth and Deformation for Tissue Tracking | Adam Schmidt et.al. | 2305.06477 | null |
2023-05-10 | Level-line Guided Edge Drawing for Robust Line Segment Detection | Xinyu Lin et.al. | 2305.05883 | link |
2023-05-09 | ColonMapper: topological mapping and localization for colonoscopy | Javier Morlana et.al. | 2305.05546 | null |
2023-04-29 | A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges | Xinyu Lin et.al. | 2305.00264 | link |
2023-04-28 | SFD2: Semantic-guided Feature Detection and Description | Fei Xue et.al. | 2304.14845 | link |
2023-04-17 | DeepSim-Nets: Deep Similarity Networks for Stereo Image Matching | Mohamed Ali Chebbi et.al. | 2304.08056 | link |
2023-04-16 | Long-term Visual Localization with Mobile Sensors | Shen Yan et.al. | 2304.07691 | null |
2023-04-12 | SiLK – Simple Learned Keypoints | Pierre Gleize et.al. | 2304.06194 | link |
2023-04-16 | ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation | Xiaoming Zhao et.al. | 2304.03608 | link |
2023-04-04 | GlueStick: Robust Image Matching by Sticking Points and Lines Together | Rémi Pautrat et.al. | 2304.02008 | link |
2023-04-03 | PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching | Pedro Castro et.al. | 2304.01382 | null |
2023-04-02 | Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints | Guilherme Potje et.al. | 2304.00583 | link |
2023-04-13 | Structured Epipolar Matcher for Local Feature Matching | Jiahao Chang et.al. | 2303.16646 | null |
2023-03-29 | Adaptive Spot-Guided Transformer for Consistent Local Feature Matching | Jiahuan Yu et.al. | 2303.16624 | null |
2023-03-28 | ASIC: Aligning Sparse in-the-wild Image Collections | Kamal Gupta et.al. | 2303.16201 | null |
2023-03-25 | Learning Rotation-Equivariant Features for Visual Correspondence | Jongmin Lee et.al. | 2303.15472 | null |
2023-03-27 | Learnable Graph Matching: A Practical Paradigm for Data Association | Jiawei He et.al. | 2303.15414 | link |
2023-03-24 | Efficient and Accurate Co-Visible Region Localization with Matching Key-Points Crop (MKPC): A Two-Stage Pipeline for Enhancing Image Matching Performance | Hongjian Song et.al. | 2303.13794 | null |
2023-03-15 | Rethinking Optical Flow from Geometric Matching Consistent Perspective | Qiaole Dong et.al. | 2303.08384 | link |
2023-04-04 | PATS: Patch Area Transportation with Subdivision for Local Feature Matching | Junjie Ni et.al. | 2303.07700 | null |
2023-03-07 | Parsing Line Segments of Floor Plan Images Using Graph Neural Networks | Mingxiang Chen et.al. | 2303.03851 | null |
2023-03-06 | Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints | Chenjie Cao et.al. | 2303.02885 | link |
2023-03-10 | ParaFormer: Parallel Attention Transformer for Efficient Feature Matching | Xiaoyong Lu et.al. | 2303.00941 | null |
2023-03-01 | RIFT2: Speeding-up RIFT with A New Rotation-Invariance Technique | Jiayuan Li et.al. | 2303.00319 | link |
2023-02-28 | Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images | Zhongli Fan et.al. | 2302.14239 | link |
2023-02-25 | BrainCLIP: Bridging Brain and Visual-Linguistic Representation via CLIP for Generic Natural Visual Stimulus Decoding from fMRI | Yulong Liu et.al. | 2302.12971 | link |
2023-02-24 | Classification of structural building damage grades from multi-temporal photogrammetric point clouds using a machine learning model trained on virtual laser scanning data | Vivien Zahs et.al. | 2302.12591 | null |
2023-02-20 | A Large Scale Homography Benchmark | Daniel Barath et.al. | 2302.09997 | link |
2023-02-12 | OAMatcher: An Overlapping Areas-based Network for Accurate Local Feature Matching | Kun Dai et.al. | 2302.05846 | link |
2023-02-10 | General, Single-shot, Target-less, and Automatic LiDAR-Camera Extrinsic Calibration Toolbox | Kenji Koide et.al. | 2302.05094 | link |
2023-02-03 | Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization | Yingying Zhu et.al. | 2302.01572 | link |
2023-01-27 | Harmonizing Flows: Unsupervised MR harmonization based on normalizing flows | Farzad Beizaee et.al. | 2301.11551 | link |
2023-01-25 | Local Feature Extraction from Salient Regions by Feature Map Transformation | Yerim Jung et.al. | 2301.10413 | null |
2023-01-24 | Feature-based Image Matching for Identifying Individual Kākā | Fintan O’Sullivan et.al. | 2301.06678 | null |
2023-01-18 | Instance Segmentation Based Graph Extraction for Handwritten Circuit Diagram Images | Johannes Bayer et.al. | 2301.03155 | null |
2023-01-08 | DeepMatcher: A Deep Transformer-based Network for Robust and Accurate Local Feature Matching | Tao Xie et.al. | 2301.02993 | link |
2023-01-07 | Deep Learning-Based UAV Aerial Triangulation without Image Control Points | Jiageng Zhong et.al. | 2301.02869 | null |
2023-01-06 | The UNCOVER Survey: A first-look HST+JWST catalog of 50,000 galaxies near Abell 2744 and beyond | John R. Weaver et.al. | 2301.02671 | link |
2023-02-13 | Translating Text Synopses to Video Storyboards | Xu Gu et.al. | 2301.00135 | link |
2022-12-23 | SuperGF: Unifying Local and Global Features for Visual Localization | Wenzheng Song et.al. | 2212.13105 | null |
2022-12-26 | Transformer and GAN Based Super-Resolution Reconstruction Network for Medical Images | Weizhi Du et.al. | 2212.13068 | null |
2022-12-20 | Seafloor-Invariant Caustics Removal from Underwater Imagery | Panagiotis Agrafiotis et.al. | 2212.10167 | null |
2022-12-15 | DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients | Rémi Pautrat et.al. | 2212.07766 | link |
2022-12-14 | Shared Coupling-bridge for Weakly Supervised Local Feature Learning | Jiayuan Sun et.al. | 2212.07047 | link |
2022-12-05 | Real Time Incremental Image Mosaicking Without Use of Any Camera Parameter | Suleyman Melih Portakal et.al. | 2212.02302 | null |
2022-12-05 | ObjectMatch: Robust Registration using Canonical Object Correspondences | Can Gümeli et.al. | 2212.01985 | null |
2022-12-07 | Universe Points Representation Learning for Partial Multi-Graph Matching | Zhakshylyk Nurlanov et.al. | 2212.00780 | null |
2022-11-30 | Self-Supervised Feature Learning for Long-Term Metric Visual Localization | Yuxuan Chen et.al. | 2212.00122 | null |
2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | link |
2022-11-19 | Person Text-Image Matching via Text-Feature Interpretability Embedding and External Attack Node Implantation | Fan Li et.al. | 2211.08657 | link |
2022-11-20 | Detecting Line Segments in Motion-blurred Images with Events | Huai Yu et.al. | 2211.07365 | link |
2022-11-15 | Fast Key Points Detection and Matching for Tree-Structured Images | Hao Wang et.al. | 2211.03242 | null |
2022-10-25 | A Comparative Study on Deep-Learning Methods for Dense Image Matching of Multi-angle and Multi-date Remote Sensing Stereo Images | Hessah Albanwan et.al. | 2210.14031 | null |
2022-10-11 | DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion | Yuxi Xiao et.al. | 2210.05517 | null |
2022-10-07 | Mars Rover Localization Based on A2G Obstacle Distribution Pattern Matching | Lang Zhou et.al. | 2210.03398 | link |
2022-09-27 | Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors | Hao Dong et.al. | 2209.13586 | link |
2022-09-25 | ECO-TR: Efficient Correspondences Finding Via Coarse-to-Fine Refinement | Dongli Tan et.al. | 2209.12213 | null |
2022-09-22 | DRKF: Distilled Rotated Kernel Fusion for Efficiently Boosting Rotation Invariance in Image Matching | Chao Li et.al. | 2209.10907 | null |
2022-11-15 | Uncertainty-aware Efficient Subgraph Isomorphism using Graph Topology | Arpan Kusari et.al. | 2209.09090 | null |
2022-09-16 | SRFeat: Learning Locally Accurate and Globally Consistent Non-Rigid Shape Correspondence | Lei Li et.al. | 2209.07806 | link |
2022-08-30 | ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer | Hongkai Chen et.al. | 2208.14201 | link |
2022-08-25 | A Gis Aided Approach for Geolocalizing an Unmanned Aerial System Using Deep Learning | Jianli Wei et.al. | 2208.12251 | link |
2022-08-25 | UAS Navigation in the Real World Using Visual Observation | Yuci Han et.al. | 2208.12125 | null |
2022-08-24 | Self-Supervised Endoscopic Image Key-Points Matching | Manel Farhat et.al. | 2208.11424 | link |
2022-08-22 | Equivariant Hypergraph Neural Networks | Jinwoo Kim et.al. | 2208.10428 | link |
2022-09-22 | Understanding Attention for Vision-and-Language Tasks | Feiqi Cao et.al. | 2208.08104 | link |
2022-08-16 | Hierarchical Attention Network for Few-Shot Object Detection via Meta-Contrastive Learning | Dongwoo Park et.al. | 2208.07039 | link |
2022-08-04 | Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification | Xinyu Lin et.al. | 2208.02450 | link |
2022-08-04 | OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images | Weijia Li et.al. | 2208.00928 | null |
2022-07-29 | Testing Relational Understanding in Text-Guided Image Generation | Colin Conwell et.al. | 2208.00005 | null |
2022-07-21 | Pose for Everything: Towards Category-Agnostic Pose Estimation | Lumin Xu et.al. | 2207.10387 | link |
2022-07-20 | Explaining Deepfake Detection by Analysing Image Matching | Shichao Dong et.al. | 2207.09679 | link |
2022-07-18 | Adaptive Assignment for Geometry Aware Local Feature Matching | Dihe Huang et.al. | 2207.08427 | link |
2022-07-16 | Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching | Jiazhen Liu et.al. | 2207.07932 | link |
2022-07-06 | Virtual staining of defocused autofluorescence images of unlabeled tissue using deep neural networks | Yijie Zhang et.al. | 2207.02946 | null |
2022-07-01 | TopicFM: Robust and Interpretable Feature Matching with Topic-assisted | Khang Truong Giang et.al. | 2207.00328 | link |
2022-06-16 | Virtual Correspondence: Humans as a Cue for Extreme-View Geometry | Wei-Chiu Ma et.al. | 2206.08365 | null |
2022-06-15 | Self-Supervised Learning of Image Scale and Orientation | Jongmin Lee et.al. | 2206.07259 | link |
2022-05-27 | Image Keypoint Matching using Graph Neural Networks | Nancy Xu et.al. | 2205.14275 | null |
2022-05-27 | Fine-tuning deep learning models for stereo matching using results from semi-global matching | Hessah Albanwan et.al. | 2205.14051 | null |
2022-05-23 | TransforMatcher: Match-to-Match Attention for Semantic Correspondence | Seungwook Kim et.al. | 2205.11634 | link |
2022-05-16 | ReDFeat: Recoupling Detection and Description for Multimodal Feature Learning | Yuxin Deng et.al. | 2205.07439 | null |
2022-05-06 | BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo Surgical Image Matching | Jingwei Song et.al. | 2205.03133 | link |
2022-05-10 | AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching | Khanh Nguyen et.al. | 2205.02849 | link |
2022-04-27 | Gleo-Det: Deep Convolution Feature-Guided Detector with Local Entropy Optimization for Salient Points | Chao Li et.al. | 2204.12884 | null |
2022-04-22 | SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite | Runzhe Zhu et.al. | 2204.10704 | link |
2022-04-20 | Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations | Leila Pishdad et.al. | 2204.09268 | null |
2022-04-19 | OpenGlue: Open Source Graph Neural Net Based Pipeline for Image Matching | Ostap Viniavskyi et.al. | 2204.08870 | link |
2022-04-19 | Self-Supervised Equivariant Learning for Oriented Keypoint Detection | Jongmin Lee et.al. | 2204.08613 | link |
2022-04-22 | Efficient Linear Attention for Fast and Accurate Keypoint Matching | Suwichaya Suwanwimolkul et.al. | 2204.07731 | null |
2022-04-08 | Lightweight starshade position sensing with convolutional neural networks and simulation-based inference | Andrew Chen et.al. | 2204.03853 | link |
2022-03-30 | AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift | Burak Yildiz et.al. | 2203.16291 | link |
2022-03-29 | Photographic Visualization of Weather Forecasts with Generative Adversarial Networks | Christian Sigg et.al. | 2203.15601 | link |
2022-03-29 | Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots | Pranay Mathur et.al. | 2203.15272 | null |
2022-03-28 | Optimizing Elimination Templates by Greedy Parameter Search | Evgeniy Martyushev et.al. | 2203.14901 | link |
2022-03-28 | S2-Net: Self-supervision Guided Feature Representation Learning for Cross-Modality Images | Shasha Mei et.al. | 2203.14581 | null |
2022-03-26 | Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching | Yujiao Shi et.al. | 2203.14148 | link |
2022-03-24 | Keypoints Tracking via Transformer Networks | Oleksii Nasypanyi et.al. | 2203.12848 | link |
2022-03-21 | MatchFormer: Interleaving Attention in Transformers for Feature Matching | Qing Wang et.al. | 2203.09645 | link |
2022-03-14 | There’s no difference: Convolutional Neural Networks for transient detection without template subtraction | Tatiana Acero-Cuellar et.al. | 2203.07390 | link |
2022-03-25 | Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Jinheng Xie et.al. | 2203.02668 | link |
2022-03-01 | CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP | Zihao Wang et.al. | 2203.00386 | null |
2022-03-09 | Time-resolved Imaging of Stochastic Cascade Reactions over a Submillisecond to Second Time Range at the Angstrom Level | Toshiki Shimizu et.al. | 2202.13332 | null |
2022-02-16 | Cross-view and Cross-domain Underwater Localization based on Optical Aerial and Acoustic Underwater Images | Matheus M. Dos Santos et.al. | 2202.07817 | null |
2022-02-14 | CATs++: Boosting Cost Aggregation with Convolutions and Transformers | Seokju Cho et.al. | 2202.06817 | link |
2022-02-11 | Improving Image-recognition Edge Caches with a Generative Adversarial Network | Guilherme B. Souza et.al. | 2202.05929 | null |
2022-02-08 | Learning Optical Flow with Adaptive Graph Reasoning | Ao Luo et.al. | 2202.03857 | link |
2022-02-03 | Sim2Real Object-Centric Keypoint Detection and Description | Chengliang Zhong et.al. | 2202.00448 | null |
2022-01-27 | Efficient divide-and-conquer registration of UAV and ground LiDAR point clouds through canopy shape context | Jie Shao et.al. | 2201.11296 | null |
2021-12-24 | Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation | Zhiwei Liu et.al. | 2112.12917 | null |
2021-12-20 | Scale-Net: Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching | Yujie Fu et.al. | 2112.10485 | null |
2021-12-19 | GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor | Jean-Baptiste Carluer et.al. | 2112.10258 | link |
2021-12-14 | More Control for Free! Image Synthesis with Semantic Diffusion Guidance | Xihui Liu et.al. | 2112.05744 | null |
2021-12-08 | Label-free virtual HER2 immunohistochemical staining of breast tissue using deep learning | Bijie Bai et.al. | 2112.05240 | null |
2021-12-01 | FaSS-MVS – Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery | Boitumelo Ruf et.al. | 2112.00821 | null |
2021-12-01 | CLIPstyler: Image Style Transfer with a Single Text Condition | Gihyun Kwon et.al. | 2112.00374 | link |
2021-11-29 | Nonlinear Intensity Underwater Sonar Image Matching Method Based on Phase Information and Deep Convolution Features | Xiaoteng Zhou et.al. | 2111.15514 | null |
2021-11-29 | Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic | Yoad Tewel et.al. | 2111.14447 | link |
2021-11-29 | Heterogeneous Visible-Thermal and Visible-Infrared Face Recognition using Unit-Class Loss and Cross-Modality Discriminator | Usman Cheema et.al. | 2111.14339 | null |
2021-11-17 | Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network | Xiaoming Zhao et.al. | 2111.09006 | null |
2021-11-17 | Nonlinear Intensity Sonar Image Matching based on Deep Convolution Features | Xiaoteng Zhou et.al. | 2111.08994 | null |
2021-10-30 | A Deep Search for Faint Chandra X-ray Sources, Radio Sources, and Optical Counterparts in NGC 6752 | Haldan N. Cohn et.al. | 2111.00357 | null |
2021-10-01 | Robustly Removing Deep Sea Lighting Effects for Visual Mapping of Abyssal Plains | Kevin Köser et.al. | 2110.00480 | null |
2021-09-29 | Visually Grounded Concept Composition | Bowen Zhang et.al. | 2109.14115 | null |
2021-09-27 | HarrisZ $^+$ : Harris Corner Selection for Next-Gen Image Matching Pipelines | Fabio Bellavia et.al. | 2109.12925 | null |
2021-09-20 | Viewpoint Invariant Dense Matching for Visual Geolocalization | Gabriele Berton et.al. | 2109.09827 | link |
2021-09-20 | Image Subtraction in Fourier Space | Lei Hu et.al. | 2109.09334 | link |
2021-09-10 | Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization | Sungho Yoon et.al. | 2109.04753 | link |
2021-09-08 | Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes | Wenzheng Song et.al. | 2109.03585 | null |
2021-08-27 | A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images | Xiaoteng Zhou et.al. | 2108.12151 | null |
2021-08-27 | Matching Underwater Sonar Images by the Learned Descriptor Based on Style Transfer Method | Xiaoteng Zhou et.al. | 2108.12072 | null |
2021-08-26 | Efficient Joint Object Matching via Linear Programming | Antonio De Rosa et.al. | 2108.11911 | null |
NeRF
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-14 | VoxelRF: Voxelized Radiance Field for Fast Wireless Channel Modeling | Zihang Zeng et.al. | 2507.09987 | null |
2025-07-12 | Stable Score Distillation | Haiming Zhu et.al. | 2507.09168 | null |
2025-07-11 | From images to properties: a NeRF-driven framework for granular material parameter inversion | Cheng-Hsi Hsiao et.al. | 2507.09005 | null |
2025-07-10 | MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation | Bangning Wei et.al. | 2507.07519 | null |
2025-07-14 | BayesSDF: Surface-Based Laplacian Uncertainty Estimation for 3D Geometry with Neural Signed Distance Fields | Rushil Desai et.al. | 2507.06269 | null |
2025-07-08 | Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering | Jiayi Song et.al. | 2507.06103 | null |
2025-07-08 | DreamArt: Generating Interactable Articulated Objects from a Single Image | Ruijie Lu et.al. | 2507.05763 | null |
2025-07-06 | A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields | Aoxiang Fan et.al. | 2507.04408 | null |
2025-07-02 | Tile and Slide : A New Framework for Scaling NeRF from Local to Global 3D Earth Observation | Camille Billouard et.al. | 2507.01631 | null |
2025-07-01 | Surgical Neural Radiance Fields from One Image | Alberto Neri et.al. | 2507.00969 | null |
2025-07-01 | PlantSegNeRF: A few-shot, cross-dataset method for plant 3D instance point cloud reconstruction via joint-channel NeRF with multi-view image instance matching | Xin Yang et.al. | 2507.00371 | null |
2025-06-30 | AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention | Ziao Liu et.al. | 2506.23611 | null |
2025-06-29 | Dynamic View Synthesis from Small Camera Motion Videos | Huiqiang Sun et.al. | 2506.23153 | null |
2025-06-27 | UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields | Fabian Perez et.al. | 2506.21884 | null |
2025-06-24 | ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes | Chenhao Zhang et.al. | 2506.21629 | null |
2025-06-26 | PanSt3R: Multi-view Consistent Panoptic Segmentation | Lojze Zust et.al. | 2506.21348 | null |
2025-06-25 | Joint attitude estimation and 3D neural reconstruction of non-cooperative space objects | Clément Forray et.al. | 2506.20638 | null |
2025-06-24 | NeRF-based CBCT Reconstruction needs Normalization and Initialization | Zhuowei Xu et.al. | 2506.19742 | null |
2025-06-25 | Self-Supervised Multimodal NeRF for Autonomous Driving | Gaurav Sharma et.al. | 2506.19615 | null |
2025-06-24 | HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis | Xiaoyuan Wang et.al. | 2506.19291 | null |
2025-06-23 | MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation | Tianchen Deng et.al. | 2506.18678 | null |
2025-06-26 | 2D Triangle Splatting for Direct Differentiable Mesh Training | Kaifeng Sheng et.al. | 2506.18575 | link |
2025-06-22 | Limitations of NERF with pre-trained Vision Features for Few-Shot 3D Reconstruction | Ankit Sanjyal et.al. | 2506.18208 | null |
2025-06-21 | 3D Gaussian Splatting for Fine-Detailed Surface Reconstruction in Large-Scale Scene | Shihan Chen et.al. | 2506.17636 | null |
2025-06-23 | R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision | Weeyoung Kwon et.al. | 2506.16262 | link |
2025-06-24 | RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories | Qingsong Yan et.al. | 2506.15242 | null |
2025-06-17 | Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction | Zhengquan Zhang et.al. | 2506.14856 | null |
2025-06-18 | Rasterizing Wireless Radiance Field via Deformable 2D Gaussian Splatting | Mufan Liu et.al. | 2506.12787 | null |
2025-06-17 | Efficient multi-view training for 3D Gaussian Splatting | Minhyuk Choi et.al. | 2506.12727 | null |
2025-06-12 | PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting | Lintao Xiang et.al. | 2506.10335 | null |
2025-06-11 | The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images without Any 3D Knowledge | Haoru Wang et.al. | 2506.09885 | null |
2025-06-10 | A Probability-guided Sampler for Neural Implicit Surface Rendering | Gonçalo Dias Pais et.al. | 2506.08619 | null |
2025-06-09 | Speedy Deformable 3D Gaussian Splatting: Fast Rendering and Compression of Dynamic Scenes | Allen Tu et.al. | 2506.07917 | link |
2025-06-20 | Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency | Xiangyu Guo et.al. | 2506.07497 | null |
2025-06-07 | SPC to 3D: Novel View Synthesis from Binary SPC via I2I translation | Sumit Sharma et.al. | 2506.06890 | null |
2025-06-06 | Splat and Replace: 3D Reconstruction with Repetitive Elements | Nicolás Violante et.al. | 2506.06462 | null |
2025-06-06 | NeurNCD: Novel Class Discovery via Implicit Neural Representation | Junming Wang et.al. | 2506.06412 | null |
2025-06-06 | Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments | Mingrui Li et.al. | 2506.05965 | null |
2025-06-06 | ProJo4D: Progressive Joint Optimization for Sparse-View Inverse Physics Estimation | Daniel Rho et.al. | 2506.05317 | null |
2025-06-06 | Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting | Nan Wang et.al. | 2506.05280 | link |
2025-06-05 | Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer | Filip Slezak et.al. | 2506.04908 | null |
2025-05-30 | Hi-Dyna Graph: Hierarchical Dynamic Scene Graph for Robotic Autonomy in Human-Centric Environments | Jiawei Hou et.al. | 2506.00083 | null |
2025-05-29 | PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views | Mohamed Rayan Barhdadi et.al. | 2505.23481 | link |
2025-05-29 | LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering | Jonas Kulhanek et.al. | 2505.23158 | null |
2025-05-28 | Can NeRFs See without Cameras? | Chaitanya Amballa et.al. | 2505.22441 | null |
2025-05-28 | Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss | Wenjun Lu et.al. | 2505.22279 | null |
2025-05-28 | Hyperspectral Gaussian Splatting | Sunil Kumar Narayanan et.al. | 2505.21890 | null |
2025-05-27 | Structure from Collision | Takuhiro Kaneko et.al. | 2505.21335 | null |
2025-05-26 | OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender | Shintaro Ito et.al. | 2505.20126 | link |
2025-05-30 | ErpGS: Equirectangular Image Rendering enhanced with 3D Gaussian Regularization | Shintaro Ito et.al. | 2505.19883 | null |
2025-05-26 | GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis | You Wang et.al. | 2505.19813 | link |
2025-05-26 | Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction | Li Fang et.al. | 2505.19793 | link |
2025-05-26 | ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting | Wenhua Wu et.al. | 2505.19420 | null |
2025-05-25 | Triangle Splatting for Real-Time Radiance Field Rendering | Jan Held et.al. | 2505.19175 | null |
2025-05-22 | UAV See, UGV Do: Aerial Imagery and Virtual Teach Enabling Zero-Shot Ground Vehicle Repeat | Desiree Fisker et.al. | 2505.16912 | null |
2025-05-19 | IPENS:Interactive Unsupervised Framework for Rapid Plant Phenotyping Extraction via NeRF-SAM2 Fusion | Wentao Song et.al. | 2505.13633 | null |
2025-05-19 | 3D Gaussian Adaptive Reconstruction for Fourier Light-Field Microscopy | Chenyu Xu et.al. | 2505.12875 | null |
2025-05-18 | Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey | Calvin Galagain et.al. | 2505.12384 | null |
2025-05-16 | MutualNeRF: Improve the Performance of NeRF under Limited Samples with Mutual Information Theory | Zifan Wang et.al. | 2505.11386 | null |
2025-05-16 | EA-3DGS: Efficient and Adaptive 3D Gaussians with Highly Enhanced Quality for outdoor scenes | Jianlin Guo et.al. | 2505.10787 | link |
2025-05-15 | Large-Scale Gaussian Splatting SLAM | Zhe Xin et.al. | 2505.09915 | null |
2025-05-14 | Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians | Ma Changfeng et.al. | 2505.09413 | link |
2025-05-14 | FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling | Yue Wen et.al. | 2505.09406 | null |
2025-05-12 | TUGS: Physics-based Compact Representation of Underwater Scenes by Tensorized Gaussian | Shijie Lian et.al. | 2505.08811 | null |
2025-05-13 | FOCI: Trajectory Optimization on Gaussian Splats | Mario Gomez Andreu et.al. | 2505.08510 | null |
2025-05-13 | TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset | Olaf Wysocki et.al. | 2505.07396 | null |
2025-05-12 | Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild | Lintao Xiang et.al. | 2505.07373 | null |
2025-05-11 | NeuGen: Amplifying the ‘Neural’ in Neural Radiance Fields for Domain Generalization | Ahmed Qazi et.al. | 2505.06894 | null |
2025-05-10 | 3D Characterization of Smoke Plume Dispersion Using Multi-View Drone Swarm | Nikil Krishnakumar et.al. | 2505.06638 | null |
2025-05-10 | FlexNeRFer: A Multi-Dataflow, Adaptive Sparsity-Aware Accelerator for On-Device NeRF Rendering | Seock-Hwan Noh et.al. | 2505.06504 | null |
2025-05-08 | 3D Scene Generation: A Survey | Beichen Wen et.al. | 2505.05474 | link |
2025-05-04 | HandOcc: NeRF-based Hand Rendering with Occupancy Networks | Maksym Ivashechkin et.al. | 2505.02079 | null |
2025-05-04 | Learning Heterogeneous Mixture of Scene Experts for Large-scale Neural Radiance Fields | Zhenxing Mi et.al. | 2505.02005 | link |
2025-05-03 | AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting | Junhao Shi et.al. | 2505.01799 | null |
2025-05-03 | Unified Steganography via Implicit Neural Representation | Qi Song et.al. | 2505.01749 | null |
2025-04-30 | A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond | Jiajia Li et.al. | 2505.00737 | link |
2025-05-01 | Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation | Feng Xue et.al. | 2505.00378 | null |
2025-04-29 | GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction | Yuhan Xie et.al. | 2504.21067 | link |
2025-04-29 | Large-scale visual SLAM for in-the-wild videos | Shuo Sun et.al. | 2504.20496 | null |
2025-05-01 | GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting | Jongwon Lee et.al. | 2504.20379 | null |
2025-04-29 | Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views | Jiang Wu et.al. | 2504.20378 | link |
2025-04-28 | Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video | Hoang Chuong Nguyen et.al. | 2504.19819 | null |
2025-04-27 | Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users | Apurv Varshney et.al. | 2504.19345 | null |
2025-04-29 | IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos | Yuan Li et.al. | 2504.19165 | null |
2025-04-28 | RGS-DR: Reflective Gaussian Surfels with Deferred Rendering for Shiny Objects | Georgios Kouros et.al. | 2504.18468 | null |
2025-04-23 | Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning | Mingxuan Cui et.al. | 2504.17815 | link |
2025-04-24 | CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos | Shucheng Gong et.al. | 2504.17728 | link |
2025-04-23 | Dual-Camera All-in-Focus Neural Radiance Fields | Xianrui Luo et.al. | 2504.16636 | null |
2025-04-23 | Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks | Murat Bilgehan Ertan et.al. | 2504.16557 | null |
2025-04-23 | SaENeRF: Suppressing Artifacts in Event-based Neural Radiance Fields | Yuanjian Wang et.al. | 2504.16389 | link |
2025-04-22 | Pose Optimization for Autonomous Driving Datasets using Neural Rendering Models | Quentin Herau et.al. | 2504.15776 | null |
2025-04-21 | StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians | Cailin Zhuang et.al. | 2504.15281 | null |
2025-04-18 | Scaling LLaNA: Advancing NeRF-Language Understanding Through Large-Scale Training | Andrea Amaduzzi et.al. | 2504.13995 | null |
2025-04-21 | SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM | Samuel Cerezo et.al. | 2504.13713 | link |
2025-04-16 | BEV-GS: Feed-forward Gaussian Splatting in Bird’s-Eye-View for Road Reconstruction | Wenhua Wu et.al. | 2504.13207 | null |
2025-04-17 | GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration | Rendong Zhang et.al. | 2504.12999 | link |
2025-04-16 | R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors | Haoyang Wang et.al. | 2504.11946 | null |
2025-04-19 | LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis | Hao Sun et.al. | 2504.10331 | null |
2025-04-14 | MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling | Yunpeng Tan et.al. | 2504.09878 | null |
2025-04-14 | NeRF-Based Transparent Object Grasping Enhanced by Shape Priors | Yi Han et.al. | 2504.09868 | null |
2025-04-11 | HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields | Asterios Reppas et.al. | 2504.08901 | null |
2025-04-09 | Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting | Daiwei Zhang et.al. | 2504.06978 | null |
2025-04-09 | S-EO: A Large-Scale Dataset for Geometry-Aware Shadow Detection in Remote Sensing Applications | Masquil Elías et.al. | 2504.06920 | null |
2025-04-09 | SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering | Hanxiao Sun et.al. | 2504.06815 | link |
2025-04-08 | Meta-Continual Learning of Neural Fields | Seungyoon Woo et.al. | 2504.05806 | null |
2025-04-08 | SE4Lip: Speech-Lip Encoder for Talking Head Synthesis to Solve Phoneme-Viseme Alignment Ambiguity | Yihuan Huang et.al. | 2504.05803 | null |
2025-04-08 | InvNeRF-Seg: Fine-Tuning a Pre-Trained NeRF for 3D Object Segmentation | Jiangsan Zhao et.al. | 2504.05751 | null |
2025-04-07 | DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal | Wanzhou Liu et.al. | 2504.04679 | null |
2025-04-06 | Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models | Etienne Chassaing et.al. | 2504.04448 | null |
2025-04-04 | NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices | Zhe Wang et.al. | 2504.03415 | null |
2025-04-03 | MultiNeRF: Multiple Watermark Embedding for Neural Radiance Fields | Yash Kulthe et.al. | 2504.02517 | null |
2025-04-03 | LPA3D: 3D Room-Level Scene Generation from In-the-Wild Images | Ming-Jia Yang et.al. | 2504.02337 | null |
2025-04-01 | OccludeNeRF: Geometric-aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF | Jingyu Shi et.al. | 2504.02007 | null |
2025-04-02 | Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis | Niluthpol Chowdhury Mithun et.al. | 2504.01960 | null |
2025-04-02 | BOGausS: Better Optimized Gaussian Splatting | Stéphane Pateux et.al. | 2504.01844 | null |
2025-04-02 | FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking | Ulas Gunes et.al. | 2504.01732 | null |
2025-04-02 | RealityAvatar: Towards Realistic Loose Clothing Modeling in Animatable 3D Gaussian Avatars | Yahui Li et.al. | 2504.01559 | null |
2025-04-02 | Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment | Ziteng Cui et.al. | 2504.01503 | link |
2025-04-01 | Neural Pruning for 3D Scene Reconstruction: Efficient NeRF Acceleration | Tianqi Ding et.al. | 2504.00950 | null |
2025-04-01 | NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds | Mahan Rafidashti et.al. | 2504.00859 | null |
2025-03-31 | NeRF-Based defect detection | Tianqi et.al. | 2504.00270 | null |
2025-03-31 | LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors | Han Zhou et.al. | 2504.00219 | null |
2025-03-31 | ERUPT: Efficient Rendering with Unposed Patch Transformer | Maxim V. Shugaev et.al. | 2503.24374 | null |
2025-03-29 | NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations | Zhenyu Tang et.al. | 2503.23162 | null |
2025-03-28 | ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting | Wenjie Liu et.al. | 2503.22218 | null |
2025-03-27 | NeRF-based Point Cloud Reconstruction using a Stationary Camera for Agricultural Applications | Kibon Ku et.al. | 2503.21958 | null |
2025-03-27 | Refined Geometry-guided Head Avatar Reconstruction from Monocular RGB Video | Pilseo Park et.al. | 2503.21886 | null |
2025-03-27 | HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM | Ziren Gong et.al. | 2503.21778 | null |
2025-04-01 | RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting | Qiyu Dai et.al. | 2503.21442 | null |
2025-03-28 | LandMarkSystem Technical Report | Zhenxiang Ma et.al. | 2503.21364 | link |
2025-03-27 | UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation | Yehui Shen et.al. | 2503.21338 | link |
2025-03-25 | CoMapGS: Covisibility Map-based Gaussian Splatting for Sparse Novel View Synthesis | Youngkyoon Jang et.al. | 2503.20998 | null |
2025-03-26 | AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports | Xiangwen Zhang et.al. | 2503.20654 | null |
2025-03-26 | EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis | Sheng Miao et.al. | 2503.20168 | null |
2025-03-25 | Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals | Zhirui Dai et.al. | 2503.20066 | null |
2025-03-25 | MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities | Federico Lincetto et.al. | 2503.19673 | null |
2025-03-24 | NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting | Yulong Zheng et.al. | 2503.18794 | null |
2025-03-25 | LookCloser: Frequency-aware Radiance Field for Tiny-Detail Scene | Xiaoyu Zhang et.al. | 2503.18513 | null |
2025-03-24 | NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction | Wenyuan Zhang et.al. | 2503.18361 | null |
2025-03-23 | End-to-End Implicit Neural Representations for Classification | Alexander Gielisse et.al. | 2503.18123 | link |
2025-03-23 | Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving | Junhao Ge et.al. | 2503.18108 | link |
2025-03-23 | PanopticSplatting: End-to-End Panoptic Gaussian Splatting | Yuxuan Xie et.al. | 2503.18073 | null |
2025-03-21 | Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping | Emanuele Giacomini et.al. | 2503.17491 | link |
2025-03-21 | FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields | Kwan Yun et.al. | 2503.17095 | link |
2025-03-21 | DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery | Jiadong Tang et.al. | 2503.16964 | null |
2025-03-20 | Digitally Prototype Your Eye Tracker: Simulating Hardware Performance using 3D Synthetic Data | Esther Y. H. Lin et.al. | 2503.16742 | null |
2025-03-20 | Enhancing Close-up Novel View Synthesis via Pseudo-labeling | Jiatong Xia et.al. | 2503.15908 | link |
2025-03-19 | SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints | Weiwen Hu et.al. | 2503.15712 | null |
2025-03-19 | DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis | Yuming Gu et.al. | 2503.15667 | link |
2025-03-19 | GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector | Zechuan Li et.al. | 2503.15211 | null |
2025-03-19 | MultiBARF: Integrating Imagery of Different Wavelength Regions by Using Neural Radiance Fields | Kana Kurata et.al. | 2503.15070 | null |
2025-03-19 | 3D Engine-ready Photorealistic Avatars via Dynamic Textures | Yifan Wang et.al. | 2503.14943 | null |
2025-03-19 | ClimateGS: Real-Time Climate Simulation with 3D Gaussian Style Transfer | Yuezhen Xie et.al. | 2503.14845 | null |
2025-03-18 | Segmentation-Guided Neural Radiance Fields for Novel Street View Synthesis | Yizhou Li et.al. | 2503.14219 | null |
2025-03-17 | Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios | Iryna Repinetska et.al. | 2503.13710 | null |
2025-03-17 | TriDF: Triplane-Accelerated Density Fields for Few-Shot Remote Sensing Novel View Synthesis | Jiaming Kang et.al. | 2503.13347 | null |
2025-03-17 | DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction | Rui Wang et.al. | 2503.13176 | null |
2025-03-17 | DivCon-NeRF: Generating Augmented Rays with Diversity and Consistency for Few-shot View Synthesis | Ingyun Lee et.al. | 2503.12947 | null |
2025-03-15 | FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields | Rui Qian et.al. | 2503.12086 | null |
2025-03-14 | Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation | Xianming Zeng et.al. | 2503.11731 | null |
2025-03-13 | Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations | Xunzhi Zheng et.al. | 2503.10464 | null |
2025-03-13 | AI-assisted 3D Preservation and Reconstruction of Temple Arts | Naai-Jung Shih et.al. | 2503.10031 | null |
2025-03-12 | Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation | Máté Tóth et.al. | 2503.09464 | null |
2025-03-11 | GAS-NeRF: Geometry-Aware Stylization of Dynamic Radiance Fields | Nhat Phuong Anh Vu et.al. | 2503.08483 | null |
2025-03-17 | Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios | Zikang Yuan et.al. | 2503.08317 | null |
2025-03-11 | GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats | Kai Deng et.al. | 2503.08071 | link |
2025-03-11 | NeRF-VIO: Map-Based Visual-Inertial Odometry with Initialization Leveraging Neural Radiance Fields | Yanyu Zhang et.al. | 2503.07952 | null |
2025-03-10 | Neural Radiance and Gaze Fields for Visual Attention Modeling in 3D Environments | Andrei Chubarau et.al. | 2503.07828 | null |
2025-03-10 | CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving | Ziliang Xiong et.al. | 2503.07425 | null |
2025-03-08 | Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction | Kai Li et.al. | 2503.06161 | null |
2025-03-08 | SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography | Xuanyu Zhang et.al. | 2503.06118 | null |
2025-03-08 | NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features | Hongjia Zhai et.al. | 2503.06117 | null |
2025-03-06 | Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering | Idris O. Sunmola et.al. | 2503.04079 | null |
2025-03-05 | LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation | Qian Feng et.al. | 2503.03890 | null |
2025-03-04 | Tracking-Aware Deformation Field Estimation for Non-rigid 3D Reconstruction in Robotic Surgeries | Zeqing Wang et.al. | 2503.02558 | null |
2025-03-04 | 2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting | Qipeng Yan et.al. | 2503.02452 | null |
2025-03-04 | Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views | Yingji Zhong et.al. | 2503.02230 | null |
2025-03-04 | Zero-Shot Sim-to-Real Visual Quadrotor Control with Hard Constraints | Yan Miao et.al. | 2503.02198 | null |
2025-03-03 | Data Augmentation for NeRFs in the Low Data Limit | Ayush Gaggar et.al. | 2503.02092 | null |
2025-03-03 | Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models | Jay Zhangjie Wu et.al. | 2503.01774 | null |
2025-03-05 | Category-level Meta-learned NeRF Priors for Efficient Object Mapping | Saad Ejaz et.al. | 2503.01582 | null |
2025-03-03 | LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training | Kaimin Liao et.al. | 2503.01199 | link |
2025-03-02 | DreamPrinting: Volumetric Printing Primitives for High-Fidelity 3D Printing | Youjia Wang et.al. | 2503.00887 | null |
2025-03-01 | Scalable Real2Sim: Physics-Aware Asset Generation Via Robotic Pick-and-Place Setups | Nicholas Pfaff et.al. | 2503.00370 | link |
2025-02-27 | Identity-preserving Distillation Sampling by Fixed-Point Iterator | SeonHwa Kim et.al. | 2502.19930 | null |
2025-02-27 | NeRFCom: Feature Transform Coding Meets Neural Radiance Field for Free-View 3D Scene Semantic Transmission | Weijie Yue et.al. | 2502.19873 | null |
2025-02-26 | Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions | Muhammad Salman Ali et.al. | 2502.19457 | null |
2025-02-26 | Does 3D Gaussian Splatting Need Accurate Volumetric Rendering? | Adam Celarek et.al. | 2502.19318 | link |
2025-02-26 | The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields | Ziyuan Luo et.al. | 2502.19125 | null |
2025-02-24 | Semantic Neural Radiance Fields for Multi-Date Satellite Data | Valentin Wagner et.al. | 2502.16992 | link |
2025-02-22 | AquaNeRF: Neural Radiance Fields in Underwater Media with Distractor Removal | Luca Gough et.al. | 2502.16351 | null |
2025-02-22 | DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation | Yuxuan Xiong et.al. | 2502.16302 | null |
2025-02-24 | Para-Lane: Multi-Lane Dataset Registering Parallel Scans for Benchmarking Novel View Synthesis | Ziqian Ni et.al. | 2502.15635 | null |
2025-02-20 | Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2502.14931 | null |
2025-02-20 | NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis | Xiaoxing Liu et.al. | 2502.14178 | null |
2025-02-19 | GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian | Bang Du et.al. | 2502.14129 | null |
2025-02-18 | Geometry-Aware Diffusion Models for Multiview Scene Inpainting | Ahmad Salimi et.al. | 2502.13335 | null |
2025-02-18 | GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis | Pedro Martin et.al. | 2502.13196 | null |
2025-02-18 | ROI-NeRFs: Hi-Fi Visualization of Objects of Interest within a Scene by NeRFs Composition | Quoc-Anh Bui et.al. | 2502.12673 | null |
2025-02-21 | HumanGif: Single-View Human Diffusion with Generative Prior | Shoukang Hu et.al. | 2502.12080 | link |
2025-02-17 | 3D Gaussian Inpainting with Depth-Guided Cross-View Consistency | Sheng-Yu Huang et.al. | 2502.11801 | null |
2025-02-13 | Embed Any NeRF: Graph Meta-Networks for Neural Tasks on Arbitrary NeRF Architectures | Francesco Ballerini et.al. | 2502.09623 | null |
2025-02-13 | DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior | Mingrui Li et.al. | 2502.09111 | null |
2025-02-12 | Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision | Tianle Liu et.al. | 2502.08352 | null |
2025-02-10 | PrismAvatar: Real-time animated 3D neural head avatars on edge devices | Prashant Raina et.al. | 2502.07030 | null |
2025-02-10 | Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC | Siwei Meng et.al. | 2502.07007 | null |
2025-02-08 | GWRF: A Generalizable Wireless Radiance Field for Wireless Signal Propagation Modeling | Kang Yang et.al. | 2502.05708 | null |
2025-02-05 | VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning | Jayram Palamadai et.al. | 2502.05222 | null |
2025-02-11 | PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression | Feifei Li et.al. | 2502.04843 | null |
2025-02-04 | SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification | Yifu Tao et.al. | 2502.02657 | null |
2025-02-04 | MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning | Shengbo Gu et.al. | 2502.02372 | null |
2025-02-03 | FourieRF: Few-Shot NeRFs via Progressive Fourier Frequency Control | Diego Gomez et.al. | 2502.01405 | null |
2025-01-31 | VoD-3DGS: View-opacity-Dependent 3D Gaussian Splatting | Mateusz Nowak et.al. | 2501.17978 | null |
2025-01-28 | LinPrim: Linear Primitives for Differentiable Volumetric Rendering | Nicolas von Lützow et.al. | 2501.16312 | null |
2025-01-24 | SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation | Yujian Liu et.al. | 2501.14646 | null |
2025-02-05 | GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting | Junzhe Jiang et.al. | 2501.13971 | link |
2025-01-23 | VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM | Gyuhyeon Pak et.al. | 2501.13402 | null |
2025-01-22 | Neural Radiance Fields for the Real World: A Survey | Wenhui Xiao et.al. | 2501.13104 | null |
2025-02-02 | DWTNeRF: Boosting Few-shot Neural Radiance Fields via Discrete Wavelet Transform | Hung Nguyen et.al. | 2501.12637 | null |
2025-01-21 | DNRSelect: Active Best View Selection for Deferred Neural Rendering | Dongli Wu et.al. | 2501.12150 | null |
2025-01-21 | Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging | Shuyi Hu et.al. | 2501.11884 | null |
2025-01-16 | Poxel: Voxel Reconstruction for 3D Printing | Ruixiang Cao et.al. | 2501.10474 | null |
2025-01-17 | Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation | Xiaoyun Zheng et.al. | 2501.09947 | link |
2025-01-16 | Normal-NeRF: Ambiguity-Robust Normal Estimation for Highly Reflective Scenes | Ji Shi et.al. | 2501.09460 | link |
2025-01-15 | SLC $^2$ -SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM | Yuhang Ming et.al. | 2501.08880 | null |
2025-01-14 | VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes | Ke Wu et.al. | 2501.08286 | null |
2025-01-13 | Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes | Yuhang Zhang et.al. | 2501.08072 | null |
2025-01-14 | SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting | Yue Hu et.al. | 2501.07015 | null |
2025-01-12 | CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications | Xinyi Zheng et.al. | 2501.06927 | link |
2025-01-12 | ActiveGAMER: Active GAussian Mapping through Efficient Rendering | Liyan Chen et.al. | 2501.06897 | null |
2025-01-17 | SuperNeRF-GAN: A Universal 3D-Consistent Super-Resolution Framework for Efficient and Enhanced 3D-Aware Image Synthesis | Peng Zheng et.al. | 2501.06770 | null |
2025-01-11 | NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes without References | Qiang Qu et.al. | 2501.06488 | link |
2025-01-10 | UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping | Yanjie Li et.al. | 2501.05783 | null |
2025-01-13 | Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes | Ludwic Leonard et.al. | 2501.05226 | link |
2025-01-07 | NeRFs are Mirror Detectors: Using Structural Similarity for Multi-View Mirror Scene Reconstruction with 3D Surface Primitives | Leif Van Holland et.al. | 2501.04074 | link |
2025-01-07 | NeuralSVG: An Implicit Representation for Text-to-Vector Generation | Sagi Polaczek et.al. | 2501.03992 | null |
2025-01-07 | DehazeGS: Seeing Through Fog with 3D Gaussian Splatting | Jinze Yu et.al. | 2501.03659 | null |
2025-01-07 | ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting | Yifeng Yang et.al. | 2501.03605 | link |
2025-01-07 | AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scene | Chaoran Feng et.al. | 2501.02807 | null |
2024-12-29 | Bringing Objects to Life: 4D generation from 3D objects | Ohad Rahamim et.al. | 2412.20422 | null |
2024-12-27 | Learning Radiance Fields from a Single Snapshot Compressive Image | Yunhao Li et.al. | 2412.19483 | null |
2025-01-05 | BeSplat: Gaussian Splatting from a Single Blurry Image and Event Stream | Gopi Raju Matta et.al. | 2412.19370 | link |
2024-12-26 | Generating Editable Head Avatars with 3D Gaussian GANs | Guohao Li et.al. | 2412.19149 | link |
2024-12-26 | MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo | Byeonggwon Lee et.al. | 2412.19130 | null |
2024-12-26 | Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos | Changwoon Choi et.al. | 2412.19089 | null |
2024-12-23 | Editing Implicit and Explicit Representations of Radiance Fields: A Survey | Arthur Hubert et.al. | 2412.17628 | null |
2024-12-23 | Exploring Dynamic Novel View Synthesis Technologies for Cinematography | Adrian Azzarelli et.al. | 2412.17532 | null |
2024-12-21 | LUCES-MV: A Multi-View Dataset for Near-Field Point Light Source Photometric Stereo | Fotios Logothetis et.al. | 2412.16737 | null |
2024-12-20 | NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems | Laura Weihl et.al. | 2412.16141 | null |
2024-12-20 | NeuroPump: Simultaneous Geometric and Color Rectification for Underwater Images | Yue Guo et.al. | 2412.15890 | null |
2024-12-19 | LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction | Pou-Chun Kung et.al. | 2412.15447 | null |
2024-12-18 | DreaMark: Rooting Watermark in Score Distillation Sampling Generated Neural Radiance Fields | Xingyu Zhu et.al. | 2412.15278 | null |
2024-12-19 | GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting | Qianpu Sun et.al. | 2412.14579 | null |
2024-12-19 | Bright-NeRF:Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images | Min Wang et.al. | 2412.14547 | null |
2024-12-18 | GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians | Xiaobao Wei et.al. | 2412.13983 | link |
2024-12-17 | EOGS: Gaussian Splatting for Earth Observation | Luca Savant Aira et.al. | 2412.13047 | null |
2024-12-18 | Optimize the Unseen – Fast NeRF Cleanup with Free Space Prior | Leo Segre et.al. | 2412.12772 | null |
2024-12-17 | Towards a Training Free Approach for 3D Scene Editing | Vivek Madhavaram et.al. | 2412.12766 | null |
2024-12-16 | GS-ProCams: Gaussian Splatting-based Projector-Camera Systems | Qingyue Deng et.al. | 2412.11762 | null |
2024-12-18 | Sequence Matters: Harnessing Video Models in 3D Super-Resolution | Hyun-kyu Ko et.al. | 2412.11525 | null |
2024-12-16 | VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression | Qiang Hu et.al. | 2412.11362 | null |
2024-12-13 | NeRF-Texture: Synthesizing Neural Radiance Field Textures | Yi-Hua Huang et.al. | 2412.10004 | null |
2024-12-13 | Sharpening Your Density Fields: Spiking Neuron Aided Fast Geometry Learning | Yi Gu et.al. | 2412.09881 | null |
2024-12-12 | PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields | Sean Wu et.al. | 2412.09680 | link |
2024-12-11 | GN-FR:Generalizable Neural Radiance Fields for Flare Removal | Gopi Raju Matta et.al. | 2412.08200 | null |
2024-12-11 | NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods | Qiang Qu et.al. | 2412.08029 | link |
2024-12-10 | EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering | Toshiya Yura et.al. | 2412.07293 | null |
2024-12-09 | Diffusing Differentiable Representations | Yash Savani et.al. | 2412.06981 | null |
2024-12-09 | Dynamic EventNeRF: Reconstructing General Dynamic Scenes from Multi-view Event Cameras | Viktor Rudnev et.al. | 2412.06770 | null |
2024-12-09 | Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video | Renlong Wu et.al. | 2412.06424 | link |
2024-12-09 | Splatter-360: Generalizable 360 $^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images | Zheng Chen et.al. | 2412.06250 | link |
2024-12-07 | WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking | Yuqi Tan et.al. | 2412.05695 | null |
2024-12-06 | Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories | Susung Hong et.al. | 2412.05279 | null |
2024-12-11 | MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting | Peng Chen et.al. | 2412.04955 | link |
2024-12-04 | NeRF and Gaussian Splatting SLAM in the Wild | Fabian Schmidt et.al. | 2412.03263 | link |
2024-12-01 | SAGA: Surface-Aligned Gaussian Avatar | Ronghan Chen et.al. | 2412.00845 | null |
2024-12-01 | CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images | Jian Liu et.al. | 2412.00754 | null |
2024-11-30 | Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives | Alex Hanson et.al. | 2412.00578 | link |
2024-11-30 | Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects | Amir Barda et.al. | 2412.00518 | null |
2024-11-29 | $C^{3}$ -NeRF: Modeling Multiple Scenes via Conditional-cum-Continual Neural Radiance Fields | Prajwal Singh et.al. | 2411.19903 | null |
2024-11-29 | Gaussian Splashing: Direct Volumetric Rendering Underwater | Nir Mualem et.al. | 2411.19588 | null |
2024-11-29 | ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration | Chaojun Ni et.al. | 2411.19548 | null |
2024-11-29 | LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis | Tianqi Li et.al. | 2411.19525 | null |
2024-11-28 | SAMa: Material-aware 3D Selection and Segmentation | Michael Fischer et.al. | 2411.19322 | null |
2024-11-27 | Surf-NeRF: Surface Regularised Neural Radiance Fields | Jack Naylor et.al. | 2411.18652 | null |
2024-11-26 | MLI-NeRF: Multi-Light Intrinsic-Aware Neural Radiance Fields | Yixiong Yang et.al. | 2411.17235 | link |
2024-11-25 | The Radiance of Neural Fields: Democratizing Photorealistic and Dynamic Robotic Simulation | Georgina Nuthall et.al. | 2411.16940 | null |
2024-11-27 | SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving | Georg Hess et.al. | 2411.16816 | link |
2024-11-25 | Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction | Ziyu Zhang et.al. | 2411.16392 | null |
2024-11-25 | U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields | Vinayak Gupta et.al. | 2411.16172 | null |
2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
2024-11-24 | GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision | Xu Baixin et.al. | 2411.15723 | link |
2024-11-23 | NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation | Menglin Zhang et.al. | 2411.15551 | null |
2024-11-23 | SplatSDF: Boosting Neural Implicit SDF via Gaussian Splatting Fusion | Runfa Blark Li et.al. | 2411.15468 | null |
2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
2024-11-20 | Robust SG-NeRF: Robust Scene Graph Aided Neural Surface Reconstruction | Yi Gu et.al. | 2411.13620 | null |
2024-11-20 | GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting | Xiaobao Wei et.al. | 2411.12981 | null |
2024-11-25 | SCIGS: 3D Gaussians Splatting from a Snapshot Compressive Image | Zixu Wang et.al. | 2411.12471 | null |
2024-11-19 | GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Shaoqing Xu et.al. | 2411.12452 | link |
2024-11-18 | Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park et.al. | 2411.11691 | null |
2024-11-18 | LeC $^2$ O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes | Zhenxing Mi et.al. | 2411.11374 | null |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-15 | USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting | Kang Chen et.al. | 2411.10504 | link |
2024-11-15 | GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization | Yanhao Sun et.al. | 2411.10033 | null |
2024-11-22 | BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis | David Svitov et.al. | 2411.08508 | link |
2024-11-13 | Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model | Yutao Shen et.al. | 2411.08453 | null |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-12 | TomoGRAF: A Robust and Generalizable Reconstruction Network for Single-View Computed Tomography | Di Xu et.al. | 2411.08158 | null |
2024-11-12 | Material Transforms from Disentangled NeRF Representations | Ivan Lopes et.al. | 2411.08037 | link |
2024-11-11 | LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes | Zefan Qu et.al. | 2411.06757 | link |
2024-11-10 | Through the Curved Cover: Synthesizing Cover Aberrated Scenes with Refractive Field | Liuyue Xie et.al. | 2411.06365 | null |
2024-11-09 | AI-Driven Stylization of 3D Environments | Yuanbo Chen et.al. | 2411.06067 | null |
2024-11-08 | A Nerf-Based Color Consistency Method for Remote Sensing Images | Zongcheng Zuo et.al. | 2411.05557 | null |
2024-11-08 | Rate-aware Compression for NeRF-based Volumetric Video | Zhiyu Zhang et.al. | 2411.05322 | null |
2024-11-07 | Planar Reflection-Aware Neural Radiance Fields | Chen Gao et.al. | 2411.04984 | null |
2024-11-07 | GANESH: Generalizable NeRF for Lensless Imaging | Rakesh Raj Madavan et.al. | 2411.04810 | null |
2024-11-08 | SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation | Xun Tu et.al. | 2411.04386 | null |
2024-11-06 | Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Rui Peng et.al. | 2411.03637 | link |
2024-11-05 | Enhancing Exploratory Capability of Visual Navigation Using Uncertainty of Implicit Scene Representation | Yichen Wang et.al. | 2411.03487 | link |
2024-11-05 | CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval | Xin Wen et.al. | 2411.02979 | null |
2024-11-05 | Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery | Liv Kåreborn et.al. | 2411.02972 | null |
2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | null |
2024-11-04 | NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields | Eric Zhu et.al. | 2411.02482 | null |
2024-11-05 | FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Ruihong Yin et.al. | 2411.02229 | null |
2024-11-06 | GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes | Gaochao Song et.al. | 2411.01853 | null |
2024-11-04 | A Probabilistic Formulation of LiDAR Mapping with Neural Radiance Fields | Matthew McDermott et.al. | 2411.01725 | link |
2024-11-01 | ZIM: Zero-Shot Image Matting for Anything | Beomyoung Kim et.al. | 2411.00626 | link |
2024-10-31 | Scaled Inverse Graphics: Efficiently Learning Large Sets of 3D Scenes | Karim Kassab et.al. | 2410.23742 | null |
2024-10-31 | Get a Grip: Multi-Finger Grasp Evaluation at Scale Enables Robust Sim-to-Real Transfer | Tyler Ga Wei Lum et.al. | 2410.23701 | null |
2024-10-31 | XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM | Xiaomeng Wang et.al. | 2410.23690 | link |
2024-10-30 | Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder | Antoine Schnepf et.al. | 2410.22936 | null |
2024-10-28 | MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Yating Xu et.al. | 2410.21566 | link |
2024-10-29 | EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior | Xin Xiang et.al. | 2410.20981 | null |
2024-10-28 | ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Suyoung Lee et.al. | 2410.20686 | link |
2024-10-27 | GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields | Yusuke Sekikawa et.al. | 2410.20306 | null |
2024-10-25 | Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization | Weihang Liu et.al. | 2410.19483 | link |
2024-10-25 | Evaluation of strategies for efficient rate-distortion NeRF streaming | Pedro Martin et.al. | 2410.19459 | null |
2024-10-27 | Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis | Liang Han et.al. | 2410.18822 | null |
2024-10-24 | Real-time 3D-aware Portrait Video Relighting | Ziqi Cai et.al. | 2410.18355 | link |
2024-10-22 | Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies | Shrey Vishen et.al. | 2410.18137 | link |
2024-10-23 | VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points | Linus Franke et.al. | 2410.17932 | null |
2024-10-23 | Few-shot NeRF by Adaptive Rendering Loss Regularization | Qingshan Xu et.al. | 2410.17839 | null |
2024-10-23 | Efficient Neural Implicit Representation for 3D Human Reconstruction | Zexu Huang et.al. | 2410.17741 | link |
2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
2024-10-22 | LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias | Haian Jin et.al. | 2410.17242 | null |
2024-10-18 | GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting | Yusen Xie et.al. | 2410.17084 | null |
2024-10-22 | E-3DGS: Gaussian Splatting with Exposure and Motion Events | Xiaoting Yin et.al. | 2410.16995 | link |
2024-10-21 | Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions | Malte Prinzler et.al. | 2410.16395 | null |
2024-10-21 | FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors | Chin-Yang Lin et.al. | 2410.16271 | null |
2024-10-22 | EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting | Bohao Liao et.al. | 2410.15392 | null |
2024-10-19 | Neural Radiance Field Image Refinement through End-to-End Sampling Point Optimization | Kazuhiro Ohta et.al. | 2410.14958 | null |
2024-10-18 | Learning autonomous driving from aerial imagery | Varun Murali et.al. | 2410.14177 | null |
2024-10-18 | DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction | Ange Lou et.al. | 2410.14169 | null |
2024-10-17 | DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering | Jiahao Lu et.al. | 2410.13607 | link |
2024-10-21 | DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation | Guosheng Zhao et.al. | 2410.13571 | null |
2024-10-17 | Object Pose Estimation Using Implicit Representation For Transparent Objects | Varun Burde et.al. | 2410.13465 | null |
2024-10-17 | GlossyGS: Inverse Rendering of Glossy Objects with 3D Gaussian Splatting | Shuichang Lai et.al. | 2410.13349 | null |
2024-10-16 | 3D Gaussian Splatting in Robotics: A Survey | Siting Zhu et.al. | 2410.12262 | link |
2024-10-16 | EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior for Sparse View | Zhaorong Wang et.al. | 2410.12242 | null |
2024-10-14 | 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications | Eduardo R. Corral-Soto et.al. | 2410.10782 | null |
2024-10-14 | NeRF-enabled Analysis-Through-Synthesis for ISAR Imaging of Small Everyday Objects with Sparse and Noisy UWB Radar Data | Md Farhan Tasnim Oshim et.al. | 2410.10085 | null |
2024-10-13 | Magnituder Layers for Implicit Neural Representations in 3D | Sang Min Kim et.al. | 2410.09771 | null |
2024-10-12 | Improving 3D Finger Traits Recognition via Generalizable Neural Rendering | Hongbin Xu et.al. | 2410.09582 | null |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering | Jaehoon Choi et.al. | 2410.08941 | null |
2024-10-11 | Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints | Yicheng He et.al. | 2410.08780 | null |
2024-10-10 | RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image | Xiaoxue Chen et.al. | 2410.08181 | null |
2024-10-10 | IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Jian Huang et.al. | 2410.08107 | link |
2024-10-11 | NeRF-Accelerated Ecological Monitoring in Mixed-Evergreen Redwood Forest | Adam Korycki et.al. | 2410.07418 | link |
2024-10-09 | DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation | Zhiqi Li et.al. | 2410.06756 | null |
2024-10-09 | MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes | Zhenhui Ye et.al. | 2410.06734 | null |
2024-10-09 | 3D Representation Methods: A Survey | Zhengren Wang et.al. | 2410.06475 | null |
2024-10-08 | Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters | Guoji Tian et.al. | 2410.05772 | null |
2024-10-07 | Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors | Ziwei Liao et.al. | 2410.05514 | link |
2024-10-07 | PH-Dropout: Prctical Epistemic Uncertainty Quantification for View Synthesis | Chuanhao Sun et.al. | 2410.05468 | link |
2024-10-07 | LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting | Qifeng Chen et.al. | 2410.05111 | null |
2024-10-07 | 6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering | Zhongpai Gao et.al. | 2410.04974 | null |
2024-10-07 | TeX-NeRF: Neural Radiance Fields from Pseudo-TeX Vision | Chonghao Zhong et.al. | 2410.04873 | null |
2024-10-06 | Deformable NeRF using Recursively Subdivided Tetrahedra | Zherui Qiu et.al. | 2410.04402 | null |
2024-10-05 | Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy | Pengcheng Chen et.al. | 2410.04041 | null |
2024-10-02 | MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis | Xiaobiao Du et.al. | 2410.02103 | link |
2024-10-03 | EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis | Alexander Mai et.al. | 2410.01804 | null |
2024-10-02 | 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Yang Cao et.al. | 2410.01647 | link |
2024-10-02 | Gaussian Splatting in Mirrors: Reflection-Aware Rendering via Virtual Camera Optimization | Zihan Wang et.al. | 2410.01614 | link |
2024-10-02 | Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection | Hongru Yan et.al. | 2410.01404 | null |
2024-10-01 | GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer | Youngho Yoon et.al. | 2410.00672 | link |
2024-09-30 | Distributed NeRF Learning for Collaborative Multi-Robot Perception | Hongrui Zhao et.al. | 2409.20289 | null |
2024-09-30 | Active Neural Mapping at Scale | Zijia Kuang et.al. | 2409.20276 | null |
2024-09-30 | OPONeRF: One-Point-One NeRF for Robust Neural Rendering | Yu Zheng et.al. | 2409.20043 | link |
2024-09-28 | G3R: Gradient Guided Generalizable Reconstruction | Yun Chen et.al. | 2409.19405 | null |
2024-09-26 | LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Huan Wang et.al. | 2409.18057 | link |
2024-09-26 | Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions | Weng Fei Low et.al. | 2409.17988 | null |
2024-09-26 | Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry | Qi Zhang et.al. | 2409.17729 | null |
2024-09-26 | TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene | Sandika Biswas et.al. | 2409.17459 | link |
2024-09-25 | SeaSplat: Representing Underwater Scenes with 3D Gaussian Splatting and a Physically Grounded Image Formation Model | Daniel Yang et.al. | 2409.17345 | null |
2024-09-25 | TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans | Aggelina Chatziagapi et.al. | 2409.16666 | null |
2024-09-26 | Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities | Peizhi Yan et.al. | 2409.16147 | link |
2024-09-24 | Disentangled Generation and Aggregation for Robust Radiance Fields | Shihe Shen et.al. | 2409.15715 | null |
2024-09-24 | Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB | Jae Yong Lee et.al. | 2409.15689 | null |
2024-09-23 | AgriNeRF: Neural Radiance Fields for Agriculture in Challenging Lighting Conditions | Samarth Chopra et.al. | 2409.15487 | null |
2024-09-22 | MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views | Wangze Xu et.al. | 2409.14316 | null |
2024-09-21 | MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors | Zhenhua Du et.al. | 2409.14019 | null |
2024-09-19 | CrossRT: A cross platform programming technology for hardware-accelerated ray tracing in CG and CV applications | Vladimir Frolov et.al. | 2409.12617 | null |
2024-09-18 | JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation | Sai Tanmay Reddy Chakkera et.al. | 2409.12156 | null |
2024-09-25 | BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling | Lulin Zhang et.al. | 2409.12014 | link |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-21 | HGSLoc: 3DGS-based Heuristic Camera Pose Refinement | Zhongyan Niu et.al. | 2409.10925 | null |
2024-09-16 | Baking Relightable NeRF for Real-time Direct/Indirect Illumination Rendering | Euntae Choi et.al. | 2409.10327 | null |
2024-09-16 | DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments | Mahmud A. Mohamad et.al. | 2409.10041 | link |
2024-09-15 | NARF24: Estimating Articulated Object Structure for Implicit Rendering | Stanley Lewis et.al. | 2409.09829 | null |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278 | null |
2024-09-11 | DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation | Haibo Yang et.al. | 2409.07454 | null |
2024-09-11 | ThermalGaussian: Thermal 3D Gaussian Splatting | Rongfeng Lu et.al. | 2409.07200 | link |
2024-09-10 | LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation | Archana Swaminathan et.al. | 2409.06703 | null |
2024-09-10 | Sources of Uncertainty in 3D Scene Reconstruction | Marcus Klasson et.al. | 2409.06407 | link |
2024-09-09 | LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo | Wei Zhi Tang et.al. | 2409.06104 | link |
2024-09-09 | G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis | Lutao Jiang et.al. | 2409.05617 | null |
2024-09-09 | From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models | Tessa Pulli et.al. | 2409.05413 | null |
2024-09-09 | KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction | Davide Di Nucci et.al. | 2409.05407 | null |
2024-09-09 | Lagrangian Hashing for Compressed Neural Field Representations | Shrisudhan Govindarajan et.al. | 2409.05334 | null |
2024-09-09 | Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems | Jianheng Liu et.al. | 2409.05310 | null |
2024-09-06 | SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields | Yuze Wang et.al. | 2409.04482 | null |
2024-09-05 | Weight Conditioning for Smooth Optimization of Neural Networks | Hemanth Saratchandran et.al. | 2409.03424 | null |
2024-09-05 | Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction | Shen Chen et.al. | 2409.03213 | null |
2024-09-04 | UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views | Jiaxin Guo et.al. | 2409.02917 | link |
2024-09-03 | GraspSplats: Efficient Manipulation with 3D Feature Splatting | Mazeyu Ji et.al. | 2409.02084 | null |
2024-09-03 | $S^2$ NeRF: Privacy-preserving Training Framework for NeRF | Bokang Zhang et.al. | 2409.01661 | link |
2024-08-30 | ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images | Xiaoshuai Zhang et.al. | 2408.17027 | null |
2024-08-29 | GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content | Lebin Zhou et.al. | 2408.16866 | null |
2024-09-01 | Generic Objects as Pose Probes for Few-Shot View Synthesis | Zhirui Gao et.al. | 2408.16690 | null |
2024-08-29 | Spurfies: Sparse Surface Reconstruction using Local Geometry Priors | Kevin Raj et.al. | 2408.16544 | null |
2024-08-29 | NeRF-CA: Dynamic Reconstruction of X-ray Coronary Angiography with Extremely Sparse-views | Kirsten W. H. Maas et.al. | 2408.16355 | link |
2024-08-28 | Towards Realistic Example-based Modeling via 3D Gaussian Stitching | Xinyu Gao et.al. | 2408.15708 | null |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-08-27 | GeoTransfer : Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning | Shubhendu Jena et.al. | 2408.14724 | null |
2024-08-28 | FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2408.14035 | link |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
2024-08-24 | G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles | Adil Meric et.al. | 2408.13508 | null |
2024-08-23 | SIn-NeRF2NeRF: Editing 3D Scenes with Instructions through Segmentation and Inpainting | Jiseung Hong et.al. | 2408.13285 | link |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-21 | Irregularity Inspection using Neural Radiance Field | Tianqi Ding et.al. | 2408.11251 | null |
2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | link |
2024-08-20 | Learning Part-aware 3D Representations by Fusing 2D Gaussians and Superquadrics | Zhirui Gao et.al. | 2408.10789 | null |
2024-08-20 | TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Jinjie Mai et.al. | 2408.10739 | null |
2024-08-19 | $R^2$ -Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement | Haoyang Wang et.al. | 2408.10135 | null |
2024-08-19 | DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery | Corentin Dumery et.al. | 2408.09928 | null |
2024-08-20 | CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning | Haoyu Zhao et.al. | 2408.09663 | null |
2024-08-18 | S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis | Dongze Li et.al. | 2408.09347 | null |
2024-08-17 | SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation | Xiao Cao et.al. | 2408.09144 | null |
2024-08-17 | HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction | Xiao Zhao et.al. | 2408.09104 | null |
2024-08-16 | VF-NeRF: Learning Neural Vector Fields for Indoor Scene Reconstruction | Albert Gassol Puigjaner et.al. | 2408.08766 | link |
2024-08-15 | WaterSplatting: Fast Underwater 3D Scene Reconstruction Using Gaussian Splatting | Huapeng Li et.al. | 2408.08206 | null |
2024-08-18 | Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space | Hyunjee Lee et.al. | 2408.07416 | null |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-08-13 | ActiveNeRF: Learning Accurate 3D Geometry by Active Pattern Projection | Jianyu Tao et.al. | 2408.06592 | link |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering | Jiameng Li et.al. | 2408.06286 | link |
2024-08-12 | 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) | Jaydeep Rade et.al. | 2408.06244 | null |
2024-08-10 | Radiance Field Learners As UAV First-Person Viewers | Liqi Yan et.al. | 2408.05533 | null |
2024-08-09 | DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-09 | FewShotNeRF: Meta-Learning-based Novel View Synthesis for Rapid Scene-Specific Adaptation | Piraveen Sivakumar et.al. | 2408.04803 | null |
2024-08-06 | LumiGauss: High-Fidelity Outdoor Relighting with 2D Gaussian Splatting | Joanna Kaleta et.al. | 2408.04474 | link |
2024-08-08 | A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery | Mengya Xu et.al. | 2408.04426 | link |
2024-08-08 | Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods | Yiming Zhou et.al. | 2408.04268 | null |
2024-08-07 | Goal-oriented Semantic Communication for the Metaverse Application | Zhe Wang et.al. | 2408.03646 | null |
2024-08-06 | RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis | Hugo Blanc et.al. | 2408.03356 | null |
2024-08-06 | Efficient NeRF Optimization – Not All Samples Remain Equally Hard | Juuso Korhonen et.al. | 2408.03193 | null |
2024-08-06 | MGFs: Masked Gaussian Fields for Meshing Building based on Multi-View Images | Tengfei Wang et.al. | 2408.03060 | null |
2024-08-04 | PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone | Xin Yang et.al. | 2408.02053 | null |
2024-08-03 | FBINeRF: Feature-Based Integrated Recurrent Network for Pinhole and Fisheye Neural Radiance Fields | Yifan Wu et.al. | 2408.01878 | null |
2024-08-03 | E $^3$ NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images | Yunshan Qi et.al. | 2408.01840 | null |
2024-08-02 | NeRFoot: Robot-Footprint Estimation for Image-Based Visual Servoing | Daoxin Zhong et.al. | 2408.01251 | null |
2024-08-05 | UlRe-NeRF: 3D Ultrasound Imaging through Neural Rendering with Ultrasound Reflection Direction Parameterization | Ziwen Guo et.al. | 2408.00860 | null |
2024-07-31 | StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization | Kaiyuan Tang et.al. | 2408.00150 | null |
2024-07-22 | PAV: Personalized Head Avatar from Unstructured Video Collection | Akin Caliskan et.al. | 2407.21047 | null |
2024-07-30 | Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering | Yanpeng Zhao et.al. | 2407.20908 | link |
2024-07-29 | Radiance Fields for Robotic Teleoperation | Maximum Wilder-Smith et.al. | 2407.20194 | link |
2024-07-29 | Garment Animation NeRF with Color Editing | Renke Wang et.al. | 2407.19774 | link |
2024-07-27 | Revisit Self-supervised Depth Estimation with Local Structure-from-Motion | Shengjie Zhu et.al. | 2407.19166 | null |
2024-07-26 | IOVS4NeRF:Incremental Optimal View Selection for Large-Scale NeRFs | Jingpeng Xie et.al. | 2407.18611 | null |
2024-07-24 | SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency | Yiming Xie et.al. | 2407.17470 | null |
2024-07-23 | HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images | Shreyas Singh et.al. | 2407.16503 | link |
2024-07-23 | DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors | Zizheng Yan et.al. | 2407.16260 | null |
2024-07-22 | BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes | Chih-Hai Su et.al. | 2407.15848 | null |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-19 | HOTS3D: Hyper-Spherical Optimal Transport for Semantic Alignment of Text-to-3D Generation | Zezeng Li et.al. | 2407.14419 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-19 | Semantic Communications for 3D Human Face Transmission with Neural Radiance Fields | Guanlin Wu et.al. | 2407.13992 | null |
2024-07-18 | EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting | Yuchen Weng et.al. | 2407.13520 | null |
2024-07-18 | GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields | Xiufeng Huang et.al. | 2407.13390 | null |
2024-07-18 | KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter | Yifan Zhan et.al. | 2407.13185 | null |
2024-07-17 | Generalizable Human Gaussians for Sparse View Synthesis | Youngjoong Kwon et.al. | 2407.12777 | link |
2024-07-17 | SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization | Yiyang Chen et.al. | 2407.12667 | link |
2024-07-17 | InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction | Xulong Wang et.al. | 2407.12661 | link |
2024-07-17 | Invertible Neural Warp for NeRF | Shin-Fang Chng et.al. | 2407.12354 | null |
2024-07-17 | Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections | Congrong Xu et.al. | 2407.12306 | null |
2024-07-18 | Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling | Jaehyeok Kim et.al. | 2407.11962 | null |
2024-07-18 | IPA-NeRF: Illusory Poisoning Attack Against Neural Radiance Fields | Wenxiang Jiang et.al. | 2407.11921 | link |
2024-07-16 | DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation | Jiwook Kim et.al. | 2407.11394 | link |
2024-07-15 | Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method | Adam Korycki et.al. | 2407.11238 | null |
2024-07-15 | AirNeRF: 3D Reconstruction of Human with Drone and NeRF for Future Communication Systems | Alexey Kotcov et.al. | 2407.10865 | null |
2024-07-15 | Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis | Antoine Legrand et.al. | 2407.10762 | null |
2024-07-15 | IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild | Shuaixian Wang et.al. | 2407.10695 | null |
2024-07-15 | NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis | Yubin Hu et.al. | 2407.10482 | null |
2024-07-15 | Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering | Francesco Di Sario et.al. | 2407.10389 | null |
2024-07-14 | RS-NeRF: Neural Radiance Fields from Rolling Shutter Images | Muyao Niu et.al. | 2407.10267 | link |
2024-07-14 | SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion | Jiyuan Zhang et.al. | 2407.10062 | null |
2024-07-12 | Physics-Informed Learning of Characteristic Trajectories for Smoke Reconstruction | Yiming Wang et.al. | 2407.09679 | link |
2024-07-12 | Radiance Fields from Photons | Sacha Jungerman et.al. | 2407.09386 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-11 | Feasibility of Neural Radiance Fields for Crime Scene Video Reconstruction | Shariq Nadeem Malik et.al. | 2407.08795 | null |
2024-07-11 | WildGaussians: 3D Gaussian Splatting in the Wild | Jonas Kulhanek et.al. | 2407.08447 | link |
2024-07-11 | MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos | Yushuo Chen et.al. | 2407.08414 | link |
2024-07-11 | Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression | Yuke Xing et.al. | 2407.08165 | null |
2024-07-11 | Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields | Haojie Lian et.al. | 2407.08154 | null |
2024-07-11 | Survey on Fundamental Deep Learning 3D Reconstruction Techniques | Yonge Bai et.al. | 2407.08137 | null |
2024-07-10 | Protecting NeRFs’ Copyright via Plug-And-Play Watermarking Base Model | Qi Song et.al. | 2407.07735 | null |
2024-07-10 | Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field | Ganlin Yang et.al. | 2407.07461 | null |
2024-07-09 | Reference-based Controllable Scene Stylization with Gaussian Splatting | Yiqun Mei et.al. | 2407.07220 | null |
2024-07-09 | Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View | Dogyoon Lee et.al. | 2407.06613 | null |
2024-07-08 | RRM: Relightable assets using Radiance guided Material extraction | Diego Gomez et.al. | 2407.06397 | null |
2024-07-08 | PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes | Mohammad Reza Karimi Dastjerdi et.al. | 2407.06150 | null |
2024-07-08 | Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views | Jiawei Guo et.al. | 2407.05666 | null |
2024-07-08 | GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields | Weiyi Xue et.al. | 2407.05597 | null |
2024-07-08 | Dynamic Neural Radiance Field From Defocused Monocular Video | Xianrui Luo et.al. | 2407.05586 | null |
2024-07-07 | GaussReg: Fast 3D Registration with Gaussian Splatting | Jiahao Chang et.al. | 2407.05254 | null |
2024-07-06 | SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction | Weixing Xie et.al. | 2407.05023 | link |
2024-07-04 | CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images | Junghe Lee et.al. | 2407.03923 | null |
2024-07-02 | MomentsNeRF: Leveraging Orthogonal Moments for Few-Shot Neural Rendering | Ahmad AlMughrabi et.al. | 2407.02668 | null |
2024-07-03 | BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Wenpu Li et.al. | 2407.02174 | link |
2024-07-01 | Active Human Pose Estimation via an Autonomous UAV Agent | Jingxi Chen et.al. | 2407.01811 | null |
2024-07-01 | DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction | Yujin Ham et.al. | 2407.01761 | null |
2024-07-01 | Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation | Zihan Gao et.al. | 2407.01220 | link |
2024-06-29 | Intrinsic PAPR for Point-level 3D Scene Albedo and Shading Editing | Alireza Moazeni et.al. | 2407.00500 | null |
2024-06-28 | ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction | Ding-Jiun Huang et.al. | 2406.20066 | null |
2024-06-28 | EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting | Daiwei Zhang et.al. | 2406.19811 | null |
2024-06-27 | Shorter SPECT Scans Using Self-supervised Coordinate Learning to Synthesize Skipped Projection Views | Zongyu Li et.al. | 2406.18840 | null |
2024-06-25 | Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Qi Ma et.al. | 2406.17438 | link |
2024-06-25 | NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods | Jonas Kulhanek et.al. | 2406.17345 | null |
2024-06-24 | From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking | Xiaohao Xu et.al. | 2406.16850 | link |
2024-06-24 | Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis | Jianning Deng et.al. | 2406.16623 | null |
2024-06-24 | Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction | Tong Qin et.al. | 2406.16289 | null |
2024-06-23 | Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study | Zhe Wang et.al. | 2406.16068 | null |
2024-06-23 | Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | Yangdi Lu et.al. | 2406.15982 | null |
2024-06-22 | psPRF:Pansharpening Planar Neural Radiance Field for Generalized 3D Reconstruction Satellite Imagery | Tongtong Zhang et.al. | 2406.15707 | null |
2024-06-21 | A3D: Does Diffusion Dream about 3D Alignment? | Savva Ignatyev et.al. | 2406.15020 | null |
2024-06-21 | E2GS: Event Enhanced Gaussian Splatting | Hiroyuki Deguchi et.al. | 2406.14978 | link |
2024-06-21 | Relighting Scenes with Object Insertions in Neural Radiance Fields | Xuening Zhu et.al. | 2406.14806 | null |
2024-06-20 | Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment | Yunshan Qi et.al. | 2406.14360 | null |
2024-06-19 | NeRF-Feat: 6D Object Pose Estimation using Feature Rendering | Shishir Reddy Vutukur et.al. | 2406.13796 | null |
2024-06-19 | Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images | Haruo Fujiwara et.al. | 2406.13393 | null |
2024-06-19 | Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields | Youngin Park et.al. | 2406.13251 | link |
2024-06-18 | Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models | Paul Henderson et.al. | 2406.13099 | null |
2024-06-18 | Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings | Ruijie Tang et.al. | 2406.13048 | null |
2024-06-18 | Fast Global Localization on Neural Radiance Field | Mangyu Kong et.al. | 2406.12202 | link |
2024-06-20 | TutteNet: Injective 3D Deformations by Composition of 2D Mesh Deformations | Bo Sun et.al. | 2406.12121 | null |
2024-06-17 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-17 | Uncertainty modeling for fine-tuned implicit functions | Anna Susmelj et.al. | 2406.12082 | null |
2024-06-17 | LLaNA: Large Language and NeRF Assistant | Andrea Amaduzzi et.al. | 2406.11840 | null |
2024-06-17 | Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization | Huaiji Zhou et.al. | 2406.11766 | null |
2024-06-17 | InterNeRF: Scaling Radiance Fields via Parameter Interpolation | Clinton Wang et.al. | 2406.11737 | null |
2024-06-17 | NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | Niu Guanchen et.al. | 2406.11259 | null |
2024-06-15 | NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows | Zhenggang Tang et.al. | 2406.10543 | link |
2024-06-15 | Federated Neural Radiance Field for Distributed Intelligence | Yintian Zhang et.al. | 2406.10474 | null |
2024-06-14 | Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections | Jiacong Xu et.al. | 2406.10373 | null |
2024-06-14 | PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Alex Hanson et.al. | 2406.10219 | link |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-14 | OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control | Yuzhong Huang et.al. | 2406.10000 | null |
2024-06-14 | dGrasp: NeRF-Informed Implicit Grasp Policies with Supervised Optimization Slopes | Gergely Sóti et.al. | 2406.09939 | null |
2024-06-14 | RaNeuS: Ray-adaptive Neural Surface Reconstruction | Yida Wang et.al. | 2406.09801 | link |
2024-06-13 | Rethinking Score Distillation as a Bridge Between Image Distributions | David McAllister et.al. | 2406.09417 | null |
2024-06-13 | Preserving Identity with Variational Score for General-purpose 3D Editing | Duong H. Le et.al. | 2406.08953 | null |
2024-06-13 | Neural NeRF Compression | Tuan Pham et.al. | 2406.08943 | null |
2024-06-14 | AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis | Swapnil Bhosale et.al. | 2406.08920 | null |
2024-06-13 | NeRF Director: Revisiting View Selection in Neural Volume Rendering | Wenhui Xiao et.al. | 2406.08839 | link |
2024-06-12 | ICE-G: Image Conditional Editing of 3D Gaussian Splats | Vishnu Jaganathan et.al. | 2406.08488 | null |
2024-06-12 | OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Yinan Deng et.al. | 2406.08009 | link |
2024-06-12 | Spatial Annealing Smoothing for Efficient Few-shot Neural Rendering | Yuru Xiao et.al. | 2406.07828 | link |
2024-06-11 | C3DAG: Controlled 3D Animal Generation using 3D pose guidance | Sandeep Mishra et.al. | 2406.07742 | null |
2024-06-11 | M-LRM: Multi-view Large Reconstruction Model | Mengfei Li et.al. | 2406.07648 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431 | null |
2024-06-11 | Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion | Xin Yuan et.al. | 2406.06972 | null |
2024-06-11 | Neural Visibility Field for Uncertainty-Driven Active Mapping | Shangjie Xue et.al. | 2406.06948 | null |
2024-06-10 | IllumiNeRF: 3D Relighting without Inverse Rendering | Xiaoming Zhao et.al. | 2406.06527 | null |
2024-06-10 | GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation | Haozhe Xie et.al. | 2406.06526 | link |
2024-06-10 | PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction | Danpeng Chen et.al. | 2406.06521 | null |
2024-06-10 | Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis | Xin Jin et.al. | 2406.06216 | link |
2024-06-10 | ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models | Meng-Li Shih et.al. | 2406.06133 | null |
2024-06-09 | GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement | Peiye Zhuang et.al. | 2406.05649 | null |
2024-06-07 | Multiplane Prior Guided Few-Shot Aerial Scene Rendering | Zihan Gao et.al. | 2406.04961 | null |
2024-06-07 | Multi-style Neural Radiance Field with AdaIN | Yu-Wen Pao et.al. | 2406.04960 | link |
2024-06-06 | Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization | Takuhiro Kaneko et.al. | 2406.04155 | null |
2024-06-06 | How Far Can We Compress Instant-NGP-Based NeRF? | Yihang Chen et.al. | 2406.04101 | link |
2024-06-06 | Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling | Xinhang Liu et.al. | 2406.03723 | null |
2024-06-06 | Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Diwen Wan et.al. | 2406.03697 | link |
2024-06-04 | 3D-HGS: 3D Half-Gaussian Splatting | Haolin Li et.al. | 2406.02720 | link |
2024-06-06 | Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting | Inkyu Shin et.al. | 2406.02541 | null |
2024-06-04 | Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning | Jiaxu Wang et.al. | 2406.02370 | null |
2024-06-03 | Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting | Shaojie Ma et.al. | 2406.01593 | null |
2024-06-03 | Tetrahedron Splatting for 3D Generation | Chun Gu et.al. | 2406.01579 | link |
2024-06-03 | Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting | Fang Li et.al. | 2406.01042 | link |
2024-06-02 | PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency | Yeonsung Jung et.al. | 2406.00798 | null |
2024-06-02 | Representing Animatable Avatar via Factorized Neural Fields | Chunjin Song et.al. | 2406.00637 | null |
2024-06-04 | SuperGaussian: Repurposing Video Models for 3D Super Resolution | Yuan Shen et.al. | 2406.00609 | null |
2024-06-02 | Efficient Neural Light Fields (ENeLF) for Mobile Devices | Austin Peng et.al. | 2406.00598 | null |
2024-06-01 | Bilateral Guided Radiance Field Processing | Yuehao Wang et.al. | 2406.00448 | null |
2024-05-31 | R $^2$ -Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction | Ruyi Zha et.al. | 2405.20693 | link |
2024-05-31 | 4Diffusion: Multi-view Video Diffusion Model for 4D Generation | Haiyu Zhang et.al. | 2405.20674 | null |
2024-05-30 | $\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving | Nan Huang et.al. | 2405.20323 | link |
2024-05-30 | TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes | Minghao Guo et.al. | 2405.20283 | null |
2024-05-31 | NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation | Pedro Martin et.al. | 2405.20078 | null |
2024-05-30 | IReNe: Instant Recoloring in Neural Radiance Fields | Alessio Mazzucchelli et.al. | 2405.19876 | null |
2024-05-30 | HINT: Learning Complete Human Neural Representations from Limited Viewpoints | Alessandro Sanvito et.al. | 2405.19712 | null |
2024-05-30 | View-Consistent Hierarchical 3D SegmentationUsing Ultrametric Feature Fields | Haodi He et.al. | 2405.19678 | link |
2024-05-29 | Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy | Zijie Jiang et.al. | 2405.18863 | null |
2024-06-02 | NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild | Weining Ren et.al. | 2405.18715 | link |
2024-05-28 | Self-supervised Pre-training for Transferable Multi-modal Perception | Xiaohao Xu et.al. | 2405.17942 | link |
2024-05-28 | A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction | Bin Zhang et.al. | 2405.17891 | null |
2024-05-29 | HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction | Haoyu Zhao et.al. | 2405.17872 | link |
2024-05-28 | Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh | Xiangjun Gao et.al. | 2405.17811 | null |
2024-05-28 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-29 | PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting | Zipeng Wang et.al. | 2405.16829 | null |
2024-05-26 | Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors | Soumava Paul et.al. | 2405.16517 | null |
2024-05-24 | Neural Elevation Models for Terrain Mapping and Path Planning | Adam Dai et.al. | 2405.15227 | link |
2024-05-27 | HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting | Yuanhao Cai et.al. | 2405.15125 | link |
2024-05-24 | GS-Hider: Hiding Messages into 3D Gaussian Splatting | Xuanyu Zhang et.al. | 2405.15118 | null |
2024-05-23 | NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections | Dor Verbin et.al. | 2405.14871 | null |
2024-05-23 | Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling | Liwen Wu et.al. | 2405.14847 | null |
2024-05-23 | Camera Relocalization in Shadow-free Neural Radiance Fields | Shiyao Xu et.al. | 2405.14824 | link |
2024-05-23 | LDM: Large Tensorial SDF Model for Textured Mesh Generation | Rengan Xie et.al. | 2405.14580 | link |
2024-05-23 | JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression | Zihan Zheng et.al. | 2405.14452 | null |
2024-05-22 | DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus | Yu Chen et.al. | 2405.13943 | link |
2024-05-22 | Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances | Licheng Shen et.al. | 2405.13694 | null |
2024-05-21 | MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video | Hongsheng Wang et.al. | 2405.12806 | null |
2024-05-21 | Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations | Antoine Legrand et.al. | 2405.12728 | null |
2024-05-20 | Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo | Tianqi Liu et.al. | 2405.12218 | link |
2024-05-20 | Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents | Guanlin Wu et.al. | 2405.12155 | null |
2024-05-20 | NPLMV-PS: Neural Point-Light Multi-View Photometric Stereo | Fotios Logothetis et.al. | 2405.12057 | null |
2024-05-19 | Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems | Shengxiang Sun et.al. | 2405.11629 | null |
2024-05-19 | R-NeRF: Neural Radiance Fields for Modeling RIS-enabled Wireless Environments | Huiying Yang et.al. | 2405.11541 | link |
2024-05-18 | MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Xinli Guo et.al. | 2405.11129 | link |
2024-05-16 | When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models | Xianzheng Ma et.al. | 2405.10255 | link |
2024-05-15 | From NeRFs to Gaussian Splats, and Back | Siming He et.al. | 2405.09717 | link |
2024-05-14 | Dynamic NeRF: A Review | Jinwei Lin et.al. | 2405.08609 | null |
2024-05-13 | Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs | Mingyu Kim et.al. | 2405.07857 | link |
2024-05-12 | Point Resampling and Ray Transformation Aid to Editable NeRF Models | Zhenyang Li et.al. | 2405.07306 | null |
2024-05-12 | Hologram: Realtime Holographic Overlays via LiDAR Augmented Reconstruction | Ekansh Agrawal et.al. | 2405.07178 | null |
2024-05-11 | TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization | Zhen Tan et.al. | 2405.07027 | link |
2024-05-10 | LIVE: LaTex Interactive Visual Editing | Jinwei Lin et.al. | 2405.06762 | null |
2024-05-14 | SketchDream: Sketch-based Text-to-3D Generation and Editing | Feng-Lin Liu et.al. | 2405.06461 | null |
2024-05-10 | Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering | Xiaohan Zhang et.al. | 2405.06214 | null |
2024-05-10 | Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation | Bardienus P. Duisterhof et.al. | 2405.06181 | null |
2024-05-09 | DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation | Sitian Shen et.al. | 2405.05800 | null |
2024-05-10 | NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior | Gihoon Kim et.al. | 2405.05749 | null |
2024-05-09 | RPBG: Towards Robust Neural Point-based Graphics in the Wild | Qingtian Zhu et.al. | 2405.05663 | link |
2024-05-09 | Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview | Yuhang Ming et.al. | 2405.05526 | null |
2024-05-08 | ${M^2D}$ NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields | Ning Wang et.al. | 2405.05010 | null |
2024-05-08 | DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid | Sidun Liu et.al. | 2405.04416 | null |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-05 | Blending Distributed NeRFs with Tri-stage Robust Pose Optimization | Baijun Ye et.al. | 2405.02880 | null |
2024-05-05 | MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior | Honghua Chen et.al. | 2405.02859 | null |
2024-05-04 | TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes | Christopher Maxey et.al. | 2405.02762 | null |
2024-05-04 | ActiveNeuS: Active 3D Reconstruction using Neural Implicit Surface Uncertainty | Hyunseo Kim et.al. | 2405.02568 | null |
2024-05-03 | Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning | Dhruva Tirumala et.al. | 2405.02425 | null |
2024-05-03 | Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids | Junchen Liu et.al. | 2405.02386 | link |
2024-05-03 | WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights | Youngdong Jang et.al. | 2405.02066 | null |
2024-05-02 | NeRF in Robotics: A Survey | Guangming Wang et.al. | 2405.01333 | null |
2024-05-04 | LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes | Shanlin Sun et.al. | 2405.00900 | null |
2024-05-01 | Depth Priors in Removal Neural Radiance Fields | Zhihao Guo et.al. | 2405.00630 | null |
2024-05-01 | NeRF-Guided Unsupervised Learning of RGB-D Registration | Zhinan Yu et.al. | 2405.00507 | null |
2024-05-01 | RTG-SLAM: Real-time 3D Reconstruction at Scale using Gaussian Splatting | Zhexi Peng et.al. | 2404.19706 | null |
2024-04-30 | NeRF-Insert: 3D Local Editing with Multimodal Control Signals | Benet Oriol Sabat et.al. | 2404.19204 | null |
2024-04-29 | SAGS: Structure-Aware 3D Gaussian Splatting | Evangelos Ververas et.al. | 2404.19149 | null |
2024-04-29 | GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting | Bo Chen et.al. | 2404.19040 | null |
2024-04-29 | Embedded Representation Learning Network for Animating Styled Video Portrait | Tianyong Wang et.al. | 2404.19038 | null |
2024-04-29 | Simple-RF: Regularizing Sparse Input Radiance Fields with Simpler Solutions | Nagabhushan Somraj et.al. | 2404.19015 | null |
2024-04-28 | S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM | Zhiyao Zhang et.al. | 2404.18284 | null |
2024-04-27 | DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction | Chenhe Du et.al. | 2404.17890 | null |
2024-04-26 | Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields | Tianqi Liu et.al. | 2404.17528 | link |
2024-04-25 | Depth Supervised Neural Surface Reconstruction from Airborne Imagery | Vincent Hackstein et.al. | 2404.16429 | null |
2024-04-24 | NeRF-XL: Scaling NeRFs with Multiple GPUs | Ruilong Li et.al. | 2404.16221 | null |
2024-04-24 | ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images | Jinseo Jeong et.al. | 2404.15707 | null |
2024-04-23 | DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft | Sam Earle et.al. | 2404.15538 | null |
2024-04-28 | GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Hongyun Yu et.al. | 2404.14037 | null |
2024-04-22 | NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation | Chi Huang et.al. | 2404.13921 | null |
2024-04-23 | CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory | Yunlong Ran et.al. | 2404.13896 | null |
2024-04-26 | Neural Radiance Field in Autonomous Driving: A Survey | Lei He et.al. | 2404.13816 | null |
2024-04-26 | ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis | Zichen Tang et.al. | 2404.13711 | link |
2024-04-21 | Generalizable Novel-View Synthesis using a Stereo Camera | Haechan Lee et.al. | 2404.13541 | null |
2024-04-20 | High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | Baoru Huang et.al. | 2404.13437 | null |
2024-04-20 | EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment | Guanghao Li et.al. | 2404.13346 | link |
2024-04-19 | FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction | Maria Dronova et.al. | 2404.12970 | null |
2024-04-22 | Does Gaussian Splatting need SFM Initialization? | Yalda Foroutan et.al. | 2404.12547 | null |
2024-04-18 | MeshLRM: Large Reconstruction Model for High-Quality Mesh | Xinyue Wei et.al. | 2404.12385 | null |
2024-04-18 | AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering | Jingfeng Guo et.al. | 2404.11897 | link |
2024-04-18 | Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations | Yu Feng et.al. | 2404.11852 | null |
2024-04-17 | SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping | Vincent Cartillier et.al. | 2404.11419 | null |
2024-04-16 | Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2404.10625 | null |
2024-04-16 | Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences | Seungwook Kim et.al. | 2404.10603 | null |
2024-04-16 | 1st Place Solution for ICCV 2023 OmniObject3D Challenge: Sparse-View Reconstruction | Hang Du et.al. | 2404.10441 | null |
2024-04-16 | SRGS: Super-Resolution 3D Gaussian Splatting | Xiang Feng et.al. | 2404.10318 | link |
2024-04-16 | Plug-and-Play Acceleration of Occupancy Grid-based NeRF Rendering using VDB Grid and Hierarchical Ray Traversal | Yoshio Kato et.al. | 2404.10272 | link |
2024-04-15 | Taming Latent Diffusion Model for Neural Radiance Field Inpainting | Chieh Hubert Lin et.al. | 2404.09995 | null |
2024-04-15 | Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video | Hongchi Xia et.al. | 2404.09833 | null |
2024-04-15 | DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading | Tong Wu et.al. | 2404.09412 | null |
2024-04-14 | VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field | Fei Xue et.al. | 2404.09271 | link |
2024-04-15 | OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering | Jingrui Ye et.al. | 2404.08449 | null |
2024-04-12 | GPN: Generative Point-based NeRF | Haipeng Wang et.al. | 2404.08312 | link |
2024-04-12 | MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance | Yuqun Wu et.al. | 2404.08252 | null |
2024-04-11 | Connecting NeRFs, Images, and Text | Francesco Ballerini et.al. | 2404.07993 | link |
2024-04-11 | Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation | Keonhee Han et.al. | 2404.07933 | link |
2024-04-12 | NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | William Ljungbergh et.al. | 2404.07762 | link |
2024-04-11 | G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images | Zixiong Huang et.al. | 2404.07474 | link |
2024-04-10 | SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection | Mathis Kruse et.al. | 2404.06832 | link |
2024-04-10 | MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views | Runfa Li et.al. | 2404.06753 | null |
2024-04-10 | Bayesian NeRF: Quantifying Uncertainty with Volume Density in Neural Radiance Fields | Sibeak Lee et.al. | 2404.06727 | link |
2024-04-11 | SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera | Gaole Dai et.al. | 2404.06710 | null |
2024-04-09 | Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion | Fan Yang et.al. | 2404.06429 | link |
2024-04-09 | 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis | Zhicheng Lu et.al. | 2404.06270 | null |
2024-04-09 | GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields | Arnab Dey et.al. | 2404.06246 | null |
2024-04-09 | HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields | Arnab Dey et.al. | 2404.06152 | null |
2024-04-08 | Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation | Y. Wang et.al. | 2404.05236 | null |
2024-04-08 | StylizedGS: Controllable Stylization for 3D Gaussian Splatting | Dingxi Zhang et.al. | 2404.05220 | null |
2024-04-08 | Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos | Fengrui Tian et.al. | 2404.05163 | link |
2024-04-07 | CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis | Gyeongjin Kang et.al. | 2404.04913 | null |
2024-04-07 | GauU-Scene V2: Expanse Lidar Image Dataset Shows Unreliable Geometric Reconstruction Using Gaussian Splatting and NeRF | Butian Xiong et.al. | 2404.04880 | null |
2024-04-07 | NeRF2Points: Large-Scale Point Cloud Generation From Street Views’ Radiance Field Optimization | Peng Tu et.al. | 2404.04875 | null |
2024-04-06 | DATENeRF: Depth-Aware Text-based Editing of NeRFs | Sara Rojas et.al. | 2404.04526 | null |
2024-04-05 | Robust Gaussian Splatting | François Darmon et.al. | 2404.04211 | null |
2024-04-04 | SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer | Zijie Wu et.al. | 2404.03736 | link |
2024-04-07 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | null |
2024-04-04 | OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views | Francis Engelmann et.al. | 2404.03650 | null |
2024-04-04 | VF-NeRF: Viewshed Fields for Rigid NeRF Registration | Leo Segre et.al. | 2404.03349 | null |
2024-04-03 | GenN2N: Generative NeRF2NeRF Translation | Xiangyue Liu et.al. | 2404.02788 | link |
2024-04-03 | LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis | Zehan Zheng et.al. | 2404.02742 | link |
2024-04-03 | Neural Radiance Fields with Torch Units | Bingnan Ni et.al. | 2404.02617 | null |
2024-04-03 | Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition | Yisheng He et.al. | 2404.02514 | null |
2024-04-02 | NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li et.al. | 2404.02185 | null |
2024-04-02 | Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields | Joshua Ahn et.al. | 2404.02155 | null |
2024-04-02 | Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions | Saptarshi Dasgupta et.al. | 2404.01812 | null |
2024-04-01 | NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification | Juyeop Han et.al. | 2404.01400 | null |
2024-04-01 | NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields | Muhammad Zubair Irshad et.al. | 2404.01300 | link |
2024-04-01 | MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space | Armand Comas-Massagué et.al. | 2404.01296 | null |
2024-04-02 | StructLDM: Structured Latent Diffusion for 3D Human Generation | Tao Hu et.al. | 2404.01241 | null |
2024-04-01 | Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting | Jiarui Meng et.al. | 2404.01168 | null |
2024-04-01 | SGCNeRF: Few-Shot Neural Rendering via Sparse Geometric Consistency Guidance | Yuru Xiao et.al. | 2404.00992 | null |
2024-04-01 | FlexiDreamer: Single Image-to-3D Generation with FlexiCubes | Ruowen Zhao et.al. | 2404.00987 | link |
2024-04-01 | Marrying NeRF with Feature Matching for One-step Pose Estimation | Ronghan Chen et.al. | 2404.00891 | null |
2024-03-29 | HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes | Ke Wu et.al. | 2403.20159 | null |
2024-03-29 | Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior | Jaehoon Ko et.al. | 2403.20153 | link |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising | Tianchen Deng et.al. | 2403.20034 | link |
2024-03-29 | SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image | Yunhao Li et.al. | 2403.20018 | link |
2024-03-29 | DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal | Yunhao Li et.al. | 2403.20013 | link |
2024-03-29 | Stable Surface Regularization for Fast Few-Shot NeRF | Byeongin Joung et.al. | 2403.19985 | null |
2024-03-29 | MI-NeRF: Learning a Single Face NeRF from Multiple Identities | Aggelina Chatziagapi et.al. | 2403.19920 | null |
2024-03-28 | Mitigating Motion Blur in Neural Radiance Fields with Events and Frames | Marco Cannici et.al. | 2403.19780 | link |
2024-03-28 | SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects | Avinash Ummadisingu et.al. | 2403.19607 | null |
2024-03-28 | CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians | Avinash Paliwal et.al. | 2403.19495 | link |
2024-03-28 | Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation | Yujin Chen et.al. | 2403.19319 | null |
2024-03-28 | Sine Activated Low-Rank Matrices for Parameter Efficient Learning | Yiping Ji et.al. | 2403.19243 | null |
2024-03-29 | Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction | Qiuhong Shen et.al. | 2403.18795 | link |
2024-03-27 | SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery | Camille Billouard et.al. | 2403.18711 | link |
2024-03-27 | Modeling uncertainty for Gaussian Splatting | Luca Savant et.al. | 2403.18476 | null |
2024-03-26 | Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians | Kerui Ren et.al. | 2403.17898 | link |
2024-03-26 | NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation | Jiahao Chen et.al. | 2403.17537 | null |
2024-03-25 | VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation | Yang Chen et.al. | 2403.17001 | null |
2024-03-25 | CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs | Yingji Zhong et.al. | 2403.16885 | null |
2024-03-25 | Spike-NeRF: Neural Radiance Field Based On Spike Camera | Yijia Guo et.al. | 2403.16410 | null |
2024-03-24 | Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields | Haoyuan Wang et.al. | 2403.16224 | null |
2024-03-24 | Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes | Takashi Otonari et.al. | 2403.16141 | null |
2024-03-24 | CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Jiarui Hu et.al. | 2403.16095 | null |
2024-03-24 | Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap | Carl Lindström et.al. | 2403.16092 | null |
2024-03-26 | PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling | Xiaoyun Zheng et.al. | 2403.16080 | link |
2024-03-24 | Semantic Is Enough: Only Semantic Information For NeRF Reconstruction | Ruibo Wang et.al. | 2403.16043 | null |
2024-03-24 | Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields | unhong Zhao et.al. | 2403.15981 | null |
2024-03-23 | DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation | Mu-Yi Shen et.al. | 2403.15791 | link |
2024-03-23 | UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation | Yuliang Guo et.al. | 2403.15705 | link |
2024-03-22 | WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization | Jialu Wang et.al. | 2403.15272 | null |
2024-03-21 | Hyperspectral Neural Radiance Fields | Gerry Chen et.al. | 2403.14839 | null |
2024-03-21 | ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition | Tianhao Wu et.al. | 2403.14619 | null |
2024-03-21 | CombiNeRF: A Combination of Regularization Techniques for Few-Shot Neural Radiance Field View Synthesis | Matteo Bonotto et.al. | 2403.14412 | link |
2024-03-21 | InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity | Jiabin Liang et.al. | 2403.14376 | null |
2024-03-21 | Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions | Jiacong Xu et.al. | 2403.14053 | link |
2024-03-20 | MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination | Weiying Wang et.al. | 2403.13348 | null |
2024-03-19 | Depth-guided NeRF Training via Earth Mover’s Distance | Anita Rau et.al. | 2403.13206 | null |
2024-03-19 | DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images | Zaid Tasneem et.al. | 2403.13199 | null |
2024-03-19 | Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering | Mingqi Shao et.al. | 2403.12839 | null |
2024-03-19 | Learning Neural Volumetric Pose Features for Camera Localization | Jingyu Lin et.al. | 2403.12800 | null |
2024-03-19 | IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model | Matteo Bortolon et.al. | 2403.12682 | null |
2024-03-18 | FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos | Florian Philipp Stilz et.al. | 2403.12198 | null |
2024-03-18 | ThermoNeRF: Multimodal Neural Radiance Fields for Thermal Novel View Synthesis | Mariam Hassan et.al. | 2403.12154 | link |
2024-03-18 | RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF | Sibi Catley-Chandar et.al. | 2403.11909 | null |
2024-03-18 | GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors | LI Yang et.al. | 2403.11899 | null |
2024-03-18 | Exploring Multi-modal Neural Scene Representations With Applications on Thermal Imaging | Mert Özer et.al. | 2403.11865 | null |
2024-03-19 | BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting | Lingzhe Zhao et.al. | 2403.11831 | link |
2024-03-18 | Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Yuqi Zhang et.al. | 2403.11812 | link |
2024-03-18 | DVN-SLAM: Dynamic Visual Neural SLAM Based on Local-Global Encoding | Wenhua Wu et.al. | 2403.11776 | null |
2024-03-18 | Exploring 3D-aware Latent Spaces for Efficiently Learning Numerous Scenes | Antoine Schnepf et.al. | 2403.11678 | null |
2024-03-18 | UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling | Yujiao Jiang et.al. | 2403.11589 | null |
2024-03-18 | Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem | Mincheol Chang et.al. | 2403.11573 | null |
2024-03-17 | Creating Seamless 3D Maps Using Radiance Fields | Sai Tarun Sathyan et.al. | 2403.11364 | null |
2024-03-17 | SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream | Lin Zhu et.al. | 2403.11222 | link |
2024-03-17 | Recent Advances in 3D Gaussian Splatting | Tong Wu et.al. | 2403.11134 | null |
2024-03-17 | Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications | Yonggan Fu et.al. | 2403.11131 | link |
2024-03-16 | Fast Sparse View Guided NeRF Update for Object Reconfigurations | Ziqi Lu et.al. | 2403.11024 | null |
2024-03-16 | HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering | Seunghyeon Seo et.al. | 2403.10906 | null |
2024-03-15 | FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Stephanie Fu et.al. | 2403.10516 | link |
2024-03-15 | Thermal-NeRF: Neural Radiance Fields from an Infrared Camera | Tianxiang Ye et.al. | 2403.10340 | link |
2024-03-15 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297 | link |
2024-03-15 | GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time | Hao Li et.al. | 2403.10147 | null |
2024-03-15 | URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields | Bo Xu et.al. | 2403.10119 | null |
2024-03-15 | DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video | Huiqiang Sun et.al. | 2403.10103 | null |
2024-03-15 | Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience | Xiaohang Yu et.al. | 2403.09973 | null |
2024-03-14 | GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping | Yuhang Zheng et.al. | 2403.09637 | link |
2024-03-14 | The NeRFect Match: Exploring NeRF Features for Visual Localization | Qunjie Zhou et.al. | 2403.09577 | null |
2024-03-14 | VIRUS-NeRF – Vision, InfraRed and UltraSonic based Neural Radiance Fields | Nicolaj Schmid et.al. | 2403.09477 | link |
2024-03-14 | 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation | Frank Zhang et.al. | 2403.09439 | null |
2024-03-14 | RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes | Thang-Anh-Quan Nguyen et.al. | 2403.09419 | null |
2024-03-14 | PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Tianyuan Yuan et.al. | 2403.09079 | link |
2024-03-13 | Gaussian Splatting in Style | Abhishek Saroha et.al. | 2403.08498 | null |
2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | link |
2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
2024-03-12 | Q-SLAM: Quadric Representations for Monocular SLAM | Chensheng Peng et.al. | 2403.08125 | null |
2024-03-12 | SMURF: Continuous Dynamics for Motion-Deblurring Radiance Fields | Jungho Lee et.al. | 2403.07547 | link |
2024-03-11 | SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection | Yifu Tao et.al. | 2403.06877 | null |
2024-03-11 | Vosh: Voxel-Mesh Hybrid Representation for Real-Time View Synthesis | Chenhao Zhang et.al. | 2403.06505 | null |
2024-03-13 | FSViewFusion: Few-Shots View Generation of Novel Objects | Rukhshanda Hussain et.al. | 2403.06394 | null |
2024-03-10 | Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis? | Hanxin Zhu et.al. | 2403.06092 | null |
2024-03-09 | Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving | Junyi Cao et.al. | 2403.05907 | link |
2024-03-09 | Large Generative Model Assisted 3D Semantic Communication | Feibo Jiang et.al. | 2403.05783 | null |
2024-03-08 | GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting | Francesco Palandra et.al. | 2403.05154 | null |
2024-03-08 | Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | Evangelos Skartados et.al. | 2403.04508 | null |
2024-03-07 | Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis | Yuanhao Cai et.al. | 2403.04116 | link |
2024-03-08 | DNAct: Diffusion Guided Multi-Task 3D Policy Learning | Ge Yan et.al. | 2403.04115 | null |
2024-03-07 | Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs | Nikhil Mishra et.al. | 2403.04114 | link |
2024-03-06 | GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Zi-Ting Chou et.al. | 2403.03608 | null |
2024-03-05 | A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel Prediction | Haofan Lu et.al. | 2403.03241 | null |
2024-03-05 | Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps | Timothy Chen et.al. | 2403.02751 | link |
2024-03-04 | DaReNeRF: Direction-aware Representation for Dynamic Scenes | Ange Lou et.al. | 2403.02265 | null |
2024-03-04 | Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views | Shuai Guo et.al. | 2403.02063 | null |
2024-03-02 | NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning | Linsheng Chen et.al. | 2403.01325 | link |
2024-03-02 | Neural radiance fields-based holography [Invited] | Minsung Kang et.al. | 2403.01137 | null |
2024-03-02 | Neural Field Classifiers via Target Encoding and Classification Loss | Xindi Yang et.al. | 2403.01058 | null |
2024-03-01 | DISORF: A Distributed Online NeRF Training and Rendering Framework for Mobile Robots | Chunlin Li et.al. | 2403.00228 | link |
2024-02-28 | NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | Jingrui Yu et.al. | 2402.18196 | link |
2024-02-26 | Neural Radiance Fields in Medical Imaging: Challenges and Next Steps | Xin Wang et.al. | 2402.17797 | null |
2024-02-27 | Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning | Xiaoyu Zhang et.al. | 2402.17768 | null |
2024-02-27 | VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction | Jiaqi Lin et.al. | 2402.17427 | null |
2024-02-27 | Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis | Zicheng Zhang et.al. | 2402.17364 | link |
2024-02-27 | DivAvatar: Diverse 3D Avatar Generation with a Single Prompt | Weijing Tao et.al. | 2402.17292 | null |
2024-02-27 | CharNeRF: 3D Character Generation from Concept Art | Eddy Chu et.al. | 2402.17115 | null |
2024-02-26 | Disentangled 3D Scene Generation with Layout Learning | Dave Epstein et.al. | 2402.16936 | null |
2024-02-26 | CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency | Hanxin Zhu et.al. | 2402.16407 | null |
2024-02-26 | SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Zetian Song et.al. | 2402.16366 | null |
2024-02-26 | DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer | Yizhe Wu et.al. | 2402.16308 | null |
2024-02-22 | Consolidating Attention Features for Multi-view Image Editing | Or Patashnik et.al. | 2402.14792 | null |
2024-02-26 | FrameNeRF: A Simple and Efficient Framework for Few-shot Novel View Synthesis | Yan Xing et.al. | 2402.14586 | null |
2024-02-22 | NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection | Chenxi Huang et.al. | 2402.14464 | link |
2024-02-22 | TaylorGrid: Towards Fast and High-Quality Implicit Field Learning via Direct Taylor-based Grid Optimization | Renyi Mao et.al. | 2402.14415 | null |
2024-02-22 | Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields | Seungtae Nam et.al. | 2402.14196 | null |
2024-02-21 | Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting | Joongho Jo et.al. | 2402.13827 | null |
2024-02-21 | SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields | Zhentao Huang et.al. | 2402.13510 | null |
2024-02-20 | How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey | Fabio Tosi et.al. | 2402.13255 | link |
2024-02-20 | Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields | Bo-Yu Cheng et.al. | 2402.13252 | link |
2024-02-20 | NeRF Solves Undersampled MRI Reconstruction | Tae Jun Jang et.al. | 2402.13226 | null |
2024-02-20 | OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow | Simon Boeder et.al. | 2402.12792 | null |
2024-02-19 | Binary Opacity Grids: Capturing Fine Geometric Detail for Mesh-Based View Synthesis | Christian Reiser et.al. | 2402.12377 | null |
2024-02-19 | Colorizing Monochromatic Radiance Fields | Yean Cheng et.al. | 2402.12184 | null |
2024-02-17 | Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review | Thang-Anh-Quan Nguyen et.al. | 2402.11141 | link |
2024-02-15 | Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions | Muhammad Arbab Arshad et.al. | 2402.10344 | null |
2024-02-14 | PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2402.09325 | link |
2024-02-13 | Preconditioners for the Stochastic Training of Implicit Neural Representations | Shin-Fang Chng et.al. | 2402.08784 | null |
2024-02-13 | NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs | Michael Fischer et.al. | 2402.08622 | null |
2024-02-13 | H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields | Minyoung Park et.al. | 2402.08138 | null |
2024-02-12 | DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation | Chenchang Li et.al. | 2402.07648 | null |
2024-02-11 | BioNeRF: Biologically Plausible Neural Radiance Fields for View Synthesis | Leandro A. Passos et.al. | 2402.07310 | link |
2024-02-11 | 3D Gaussian as a New Vision Era: A Survey | Ben Fei et.al. | 2402.07181 | null |
2024-02-09 | ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting | Georgii Stanishevskii et.al. | 2402.06390 | link |
2024-02-07 | NeRF as Non-Distant Environment Emitter in Physics-based Inverse Rendering | Jingwang Ling et.al. | 2402.04829 | null |
2024-02-07 | OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding | Guibiao Liao et.al. | 2402.04648 | link |
2024-02-11 | BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery | Huiqing Zhang et.al. | 2402.04554 | null |
2024-02-06 | Improved Generalization of Weight Space Networks via Augmentations | Aviv Shamsian et.al. | 2402.04081 | link |
2024-02-05 | ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis | Bernard Spiegl et.al. | 2402.02906 | link |
2024-02-02 | ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields | Xingyu Miao et.al. | 2402.01950 | link |
2024-02-02 | Robust Inverse Graphics via Probabilistic Inference | Tuan Anh Le et.al. | 2402.01915 | link |
2024-02-02 | HyperPlanes: Hypernetwork Approach to Rapid NeRF Adaptation | Paweł Batorski et.al. | 2402.01524 | link |
2024-02-02 | Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses | Mahboubeh Asadi et.al. | 2402.01485 | null |
2024-02-06 | GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting | Joanna Waczyńska et.al. | 2402.01459 | link |
2024-02-02 | Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization | Zhiyu Zhang et.al. | 2402.01380 | null |
2024-02-06 | Taming Uncertainty in Sparse-view Generalizable NeRF via Indirect Diffusion Guidance | Yaokun Li et.al. | 2402.01217 | null |
2024-02-01 | ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | Jiahua Dong et.al. | 2402.00864 | link |
2024-02-01 | Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering | Pinxin Liu et.al. | 2402.00827 | link |
2024-01-31 | CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting | Jiezhi Yang et.al. | 2401.18075 | null |
2024-02-01 | Segment Anything in 3D Gaussians | Xu Hu et.al. | 2401.17857 | link |
2024-01-30 | Physical Priors Augmented Event-Based 3D Reconstruction | Jiaxu Wang et.al. | 2401.17121 | link |
2024-01-31 | Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting | Yiming Huang et.al. | 2401.16416 | link |
2024-01-29 | Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields | Rongkai Ma et.al. | 2401.16144 | null |
2024-01-26 | 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field | Zhenyu Bao et.al. | 2401.14726 | link |
2024-01-25 | Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation | Jiaxu Wang et.al. | 2401.14354 | null |
2024-01-27 | Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation | Minglin Chen et.al. | 2401.14257 | null |
2024-01-24 | EndoGaussians: Single View Dynamic Gaussian Splatting for Deformable Endoscopic Tissues Reconstruction | Yangsen Chen et.al. | 2401.13352 | null |
2024-01-23 | NeRF-AD: Neural Radiance Field with Attention-based Disentanglement for Talking Face Synthesis | Chongke Bi et.al. | 2401.12568 | null |
2024-01-23 | Exploration and Improvement of Nerf-based 3D Scene Editing Techniques | Shun Fang et.al. | 2401.12456 | null |
2024-01-23 | Methods and strategies for improving the novel view synthesis quality of neural radiation field | Shun Fang et.al. | 2401.12451 | null |
2024-01-22 | Single-View 3D Human Digitalization with Large Reconstruction Models | Zhenzhen Weng et.al. | 2401.12175 | null |
2024-01-22 | Scaling Face Interaction Graph Networks to Real World Scenes | Tatiana Lopez-Guevara et.al. | 2401.11985 | null |
2024-01-22 | HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs | Zelin Gao et.al. | 2401.11711 | null |
2024-01-23 | IPR-NeRF: Ownership Verification meets Neural Radiance Field | Win Kent Ong et.al. | 2401.09495 | null |
2024-01-17 | ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization | Weiyao Wang et.al. | 2401.08937 | null |
2024-01-18 | ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Process | Kiyohiro Nakayama et.al. | 2401.08140 | null |
2024-01-16 | Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities | Xu Yan et.al. | 2401.08045 | link |
2024-01-15 | 6-DoF Grasp Pose Evaluation and Optimization via Transfer Learning from NeRFs | Gergely Sóti et.al. | 2401.07935 | null |
2024-01-11 | TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation | Rajaei Khatib et.al. | 2401.06191 | null |
2024-01-11 | Fast High Dynamic Range Radiance Fields for Dynamic Scenes | Guanjun Wu et.al. | 2401.06052 | null |
2024-01-11 | CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians | Bin Dou et.al. | 2401.05925 | null |
2024-01-11 | GO-NeRF: Generating Virtual Objects in Neural Radiance Fields | Peng Dai et.al. | 2401.05750 | null |
2024-01-10 | Diffusion Priors for Dynamic View Synthesis from Monocular Videos | Chaoyang Wang et.al. | 2401.05583 | null |
2024-01-10 | InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes | Mohamad Shahbazi et.al. | 2401.05335 | null |
2024-01-10 | CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video | Xingyu Miao et.al. | 2401.04861 | link |
2024-01-08 | A Survey on 3D Gaussian Splatting | Guikun Chen et.al. | 2401.03890 | link |
2024-01-08 | NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation | Casimir Feldmann et.al. | 2401.03771 | null |
2024-01-06 | RustNeRF: Robust Neural Radiance Field with Low-Quality Images | Mengfei Li et.al. | 2401.03257 | null |
2024-01-06 | Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping | Tongyan Hua et.al. | 2401.03203 | null |
2024-01-05 | Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human | Song Bai et.al. | 2401.02620 | null |
2024-01-05 | FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRF | Hao Zhang et.al. | 2401.02616 | link |
2024-01-05 | Characterizing Satellite Geometry via Accelerated 3D Gaussian Splatting | Van Minh Nguyen et.al. | 2401.02588 | null |
2024-01-03 | SIGNeRF: Scene Integrated Generation for Neural Radiance Fields | Jan-Niklas Dihlmann et.al. | 2401.01647 | null |
2024-01-02 | Street Gaussians for Modeling Dynamic Urban Scenes | Yunzhi Yan et.al. | 2401.01339 | link |
2024-01-02 | Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise | Qinglong Huang et.al. | 2401.01216 | null |
2024-01-02 | 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands | Xuan Huang et.al. | 2401.00979 | link |
2024-01-01 | Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness Prior | Byeonghyeon Lee et.al. | 2401.00825 | link |
2024-01-02 | GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields | Xiao Pan et.al. | 2401.00616 | null |
2023-12-30 | Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models | Han Jiang et.al. | 2401.00208 | null |
2023-12-29 | Informative Rays Selection for Few-Shot Neural Radiance Fields | Marco Orsingher et.al. | 2312.17561 | null |
2023-12-27 | City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web | Kaiwen Song et.al. | 2312.16457 | link |
2023-12-26 | DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision | Lu Ling et.al. | 2312.16256 | null |
2023-12-24 | SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition | Nikhil Behari et.al. | 2312.16215 | null |
2023-12-23 | INFAMOUS-NeRF: ImproviNg FAce MOdeling Using Semantically-Aligned Hypernetworks with Neural Radiance Fields | Andrew Hou et.al. | 2312.16197 | null |
2023-12-26 | LangSplat: 3D Language Gaussian Splatting | Minghan Qin et.al. | 2312.16084 | link |
2023-12-26 | 2D-Guided 3D Gaussian Segmentation | Kun Lan et.al. | 2312.16047 | null |
2023-12-26 | Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images | Zhan Lu et.al. | 2312.15942 | link |
2023-12-23 | Human101: Training 100+FPS Human Gaussians in 100s from 1 View | Mingwei Li et.al. | 2312.15258 | link |
2023-12-23 | Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane | Chen Yang et.al. | 2312.15253 | link |
2023-12-23 | CaLDiff: Camera Localization in NeRF via Pose Diffusion | Rashik Shrestha et.al. | 2312.15242 | null |
2023-12-22 | PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF | Mohsen Gholami et.al. | 2312.14915 | link |
2023-12-22 | Density Uncertainty Quantification with NeRF-Ensembles: Impact of Data and Scene Constraints | Miriam Jäger et.al. | 2312.14664 | null |
2023-12-21 | PlatoNeRF: 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar | Tzofi Klinghoffer et.al. | 2312.14239 | null |
2023-12-21 | Virtual Pets: Animatable Animal Generation in 3D Scenes | Yen-Chi Cheng et.al. | 2312.14154 | null |
2023-12-21 | Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning | Desai Xie et.al. | 2312.13980 | null |
2023-12-21 | SyncDreamer for 3D Reconstruction of Endangered Animal Species with NeRF and NeuS | Ahmet Haydar Ornek et.al. | 2312.13832 | null |
2023-12-22 | Gaussian Splatting with NeRF-based Color and Opacity | Dawid Malarz et.al. | 2312.13729 | link |
2023-12-21 | DyBluRF: Dynamic Deblurring Neural Radiance Fields for Blurry Monocular Video | Minh-Quan Viet Bui et.al. | 2312.13528 | null |
2023-12-21 | Visual Tomography: Physically Faithful Volumetric Models of Partially Translucent Objects | David Nakath et.al. | 2312.13494 | null |
2023-12-20 | NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields | Jens Naumann et.al. | 2312.13471 | null |
2023-12-20 | Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM | Junru Lin et.al. | 2312.13332 | null |
2023-12-20 | ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors | Weijia Mao et.al. | 2312.13324 | null |
2023-12-20 | UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections | Fangjinhua Wang et.al. | 2312.13285 | null |
2023-12-19 | ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields | Xiang Feng et.al. | 2312.12122 | null |
2023-12-19 | LHManip: A Dataset for Long-Horizon Language-Grounded Manipulation Tasks in Cluttered Tabletop Environments | Federico Ceola et.al. | 2312.12036 | link |
2023-12-19 | MixRT: Mixed Neural Representations For Real-Time NeRF Rendering | Chaojian Li et.al. | 2312.11841 | null |
2023-12-19 | Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation | Yuze He et.al. | 2312.11774 | null |
2023-12-15 | FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline | Chien-Yu Lin et.al. | 2312.11537 | null |
2023-12-15 | Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior | Nan Huang et.al. | 2312.11535 | null |
2023-12-18 | GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning | Ye Yuan et.al. | 2312.11461 | null |
2023-12-18 | AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis | Dongze Li et.al. | 2312.10921 | null |
2023-12-17 | PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields | Boming Zhao et.al. | 2312.10649 | null |
2023-12-19 | Learning Dense Correspondence for NeRF-Based Face Reenactment | Songlin Yang et.al. | 2312.10422 | null |
2023-12-15 | SlimmeRF: Slimmable Radiance Fields | Shiran Yuan et.al. | 2312.10034 | link |
2023-12-15 | LAENeRF: Local Appearance Editing for Neural Radiance Fields | Lukas Radl et.al. | 2312.09913 | null |
2023-12-15 | SLS4D: Sparse Latent Space for 4D Novel View Synthesis | Qi-Yuan Feng et.al. | 2312.09743 | null |
2023-12-15 | Towards Transferable Targeted 3D Adversarial Attack in the Physical World | Yao Huang et.al. | 2312.09558 | link |
2023-12-14 | LatentEditor: Text Driven Local Editing of 3D Scenes | Umar Khalid et.al. | 2312.09313 | link |
2023-12-14 | Stable Score Distillation for High-Quality 3D Generation | Boshi Tang et.al. | 2312.09305 | null |
2023-12-14 | ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining | Ruoxi Shi et.al. | 2312.09249 | null |
2023-12-15 | 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting | Zhiyin Qian et.al. | 2312.09228 | null |
2023-12-15 | ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field | Zhangkai Ni et.al. | 2312.09095 | link |
2023-12-15 | Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption | Ziteng Cui et.al. | 2312.09093 | link |
2023-12-14 | iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching | Yuan Sun et.al. | 2312.09031 | null |
2023-12-14 | Scene 3-D Reconstruction System in Scattering Medium | Zhuoyifan Zhang et.al. | 2312.09005 | null |
2023-12-14 | CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning | Qingsong Yan et.al. | 2312.08760 | null |
2023-12-14 | SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field | Ru Li et.al. | 2312.08692 | link |
2023-12-13 | ProNeRF: Learning Efficient Projection-Aware Ray Sampling for Fine-Grained Implicit Neural Radiance Fields | Juan Luis Gonzalez Bello et.al. | 2312.08136 | null |
2023-12-13 | Neural Radiance Fields for Transparent Object Using Visual Hull | Heechan Yoon et.al. | 2312.08118 | null |
2023-12-13 | uSF: Learning Neural Semantic Field with Uncertainty | Vsevolod Skorokhodov et.al. | 2312.08012 | link |
2023-12-12 | COLMAP-Free 3D Gaussian Splatting | Yang Fu et.al. | 2312.07504 | link |
2023-12-12 | Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs | Sunghwan Hong et.al. | 2312.07246 | link |
2023-12-12 | WaterHE-NeRF: Water-ray Tracing Neural Radiance Fields for Underwater Scene Reconstruction | Jingchun Zhou et.al. | 2312.06946 | null |
2023-12-10 | TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video | Minye Wu et.al. | 2312.06713 | null |
2023-12-11 | CorresNeRF: Image Correspondence Priors for Neural Radiance Fields | Yixing Lao et.al. | 2312.06642 | link |
2023-12-11 | DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior | Tianyu Huang et.al. | 2312.06439 | link |
2023-12-10 | NeVRF: Neural Video-based Radiance Fields for Long-duration Sequences | Minye Wu et.al. | 2312.05855 | null |
2023-12-10 | IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment | Letian Zhang et.al. | 2312.05748 | null |
2023-12-09 | CoGS: Controllable Gaussian Splatting | Heng Yu et.al. | 2312.05664 | null |
2023-12-09 | R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning | Zhiling Ye et.al. | 2312.05572 | null |
2023-12-08 | Multi-view Inversion for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2312.05330 | link |
2023-12-08 | TriHuman : A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis | Heming Zhu et.al. | 2312.05161 | null |
2023-12-08 | Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting | Xiaofeng Yang et.al. | 2312.04820 | null |
2023-12-08 | Reality’s Canvas, Language’s Brush: Crafting 3D Avatars from Monocular Video | Yuchen Rao et.al. | 2312.04784 | null |
2023-12-07 | MuRF: Multi-Baseline Radiance Fields | Haofei Xu et.al. | 2312.04565 | link |
2023-12-07 | EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS | Sharath Girish et.al. | 2312.04564 | link |
2023-12-07 | Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection | Kohei Yamashita et.al. | 2312.04527 | null |
2023-12-07 | Multi-View Unsupervised Image Generation with Cross Attention Guidance | Llukman Cerkezi et.al. | 2312.04337 | null |
2023-12-07 | Towards 4D Human Video Stylization | Tiantian Wang et.al. | 2312.04143 | link |
2023-12-07 | Identity-Obscured Neural Radiance Fields: Privacy-Preserving 3D Facial Reconstruction | Jiayi Kong et.al. | 2312.04106 | null |
2023-12-06 | Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion | Kira Prabhu et.al. | 2312.03869 | null |
2023-12-06 | Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle | Youtian Lin et.al. | 2312.03431 | null |
2023-12-06 | Artist-Friendly Relightable and Animatable Neural Heads | Yingyan Xu et.al. | 2312.03420 | null |
2023-12-06 | Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method | Hongyu Huang et.al. | 2312.03372 | null |
2023-12-06 | RING-NeRF: A Versatile Architecture based on Residual Implicit Neural Grids | Doriand Petit et.al. | 2312.03357 | null |
2023-12-06 | SO-NeRF: Active View Planning for NeRF using Surrogate Objectives | Keifer Lee et.al. | 2312.03266 | null |
2023-12-06 | Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Shijie Zhou et.al. | 2312.03203 | link |
2023-12-05 | HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces | Haithem Turki et.al. | 2312.03160 | null |
2023-12-05 | ReconFusion: 3D Reconstruction with Diffusion Priors | Rundi Wu et.al. | 2312.02981 | null |
2023-12-05 | GauHuman: Articulated Gaussian Splatting from Monocular Human Videos | Shoukang Hu et.al. | 2312.02973 | link |
2023-12-05 | Alchemist: Parametric Control of Material Properties with Diffusion Models | Prafull Sharma et.al. | 2312.02970 | null |
2023-12-05 | MVHumanNet: A Large-scale Dataset of Multi-view Daily Dressing Human Captures | Zhangyang Xiong et.al. | 2312.02963 | null |
2023-12-05 | C-NERF: Representing Scene Changes as Directional Consistency Difference-based NeRF | Rui Huang et.al. | 2312.02751 | link |
2023-12-05 | Prompt2NeRF-PIL: Fast NeRF Generation via Pretrained Implicit Latent | Jianmeng Liu et.al. | 2312.02568 | null |
2023-12-04 | PointNeRF++: A multi-scale, point-based Neural Radiance Field | Weiwei Sun et.al. | 2312.02362 | null |
2023-12-04 | Calibrated Uncertainties for Neural Radiance Fields | Niki Amini-Naieni et.al. | 2312.02350 | null |
2023-12-04 | Re-Nerfing: Enforcing Geometric Constraints on Neural Radiance Fields through Novel Views Synthesis | Felix Tristram et.al. | 2312.02255 | null |
2023-12-04 | ColonNeRF: Neural Radiance Fields for High-Fidelity Long-Sequence Colonoscopy Reconstruction | Yufei Shi et.al. | 2312.02015 | null |
2023-12-04 | Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training | Runze He et.al. | 2312.01663 | null |
2023-12-03 | SANeRF-HQ: Segment Anything for NeRF in High Quality | Yichen Liu et.al. | 2312.01531 | null |
2023-12-03 | VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams | Liao Wang et.al. | 2312.01407 | null |
2023-12-02 | Self-Evolving Neural Radiance Fields | Jaewoo Jung et.al. | 2312.01003 | link |
2023-12-01 | Gaussian Grouping: Segment and Edit Anything in 3D Scenes | Mingqiao Ye et.al. | 2312.00732 | link |
2023-11-30 | LucidDreaming: Controllable Object-Centric 3D Generation | Zhaoning Wang et.al. | 2312.00588 | null |
2023-12-01 | FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting | Zehao Zhu et.al. | 2312.00451 | null |
2023-11-30 | PyNeRF: Pyramidal Neural Radiance Fields | Haithem Turki et.al. | 2312.00252 | link |
2023-11-30 | SparseGS: Real-Time 360° Sparse View Synthesis using Gaussian Splatting | Haolin Xiong et.al. | 2312.00206 | link |
2023-11-30 | Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing | Hyelin Nam et.al. | 2311.18608 | null |
2023-11-30 | ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs | Violeta Menéndez González et.al. | 2311.18491 | null |
2023-11-30 | Anisotropic Neural Representation Learning for High-Quality Neural Rendering | Y. Wang et.al. | 2311.18311 | null |
2023-11-30 | CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt | Haiyao Xiao et.al. | 2311.18288 | null |
2023-11-30 | Compact3D: Compressing Gaussian Splat Radiance Field Models with Vector Quantization | KL Navaneet et.al. | 2311.18159 | link |
2023-11-29 | GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces | Yingwenqi Jiang et.al. | 2311.17977 | null |
2023-11-29 | AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text | Jianfeng Zhang et.al. | 2311.17917 | null |
2023-11-29 | FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information | Wen Jiang et.al. | 2311.17874 | link |
2023-11-29 | Cinematic Behavior Transfer via NeRF-based Differentiable Filming | Xuekun Jiang et.al. | 2311.17754 | null |
2023-11-29 | SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Ziqiao Peng et.al. | 2311.17590 | link |
2023-11-29 | NeRFTAP: Enhancing Transferability of Adversarial Patches on Face Recognition using Neural Radiance Fields | Xiaoliang Liu et.al. | 2311.17332 | null |
2023-11-28 | LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Zhiwen Fan et.al. | 2311.17245 | link |
2023-11-28 | Continuous Pose for Monocular Cameras in Neural Implicit Representation | Qi Ma et.al. | 2311.17119 | link |
2023-11-28 | UC-NeRF: Neural Radiance Field for Under-Calibrated multi-view cameras in autonomous driving | Kai Cheng et.al. | 2311.16945 | null |
2023-11-28 | The Sky’s the Limit: Re-lightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility | James A. D. Gardner et.al. | 2311.16937 | link |
2023-11-28 | SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation | Jesus Zarzar et.al. | 2311.16671 | link |
2023-11-28 | DGNR: Density-Guided Neural Point Rendering of Large Driving Scenes | Zhuopeng Li et.al. | 2311.16664 | null |
2023-11-28 | SCALAR-NeRF: SCAlable LARge-scale Neural Radiance Fields for Scene Reconstruction | Yu Chen et.al. | 2311.16657 | null |
2023-11-28 | Rethinking Directional Integration in Neural Radiance Fields | Congyue Deng et.al. | 2311.16504 | null |
2023-11-27 | Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent Synthetic Images | Shiu-hong Kao et.al. | 2311.16499 | link |
2023-11-27 | Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling | Zhe Li et.al. | 2311.16096 | link |
2023-11-27 | SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using Neural Radiance Fields | Quentin Herau et.al. | 2311.15803 | null |
2023-11-27 | CaesarNeRF: Calibrated Semantic Representation for Few-shot Generalizable Neural Rendering | Haidong Zhu et.al. | 2311.15510 | link |
2023-11-26 | Efficient Encoding of Graphics Primitives with Simplex-based Structures | Yibo Wen et.al. | 2311.15439 | null |
2023-11-26 | Obj-NeRF: Extract Object NeRFs from Multi-view Images | Zhiyi Li et.al. | 2311.15291 | null |
2023-11-26 | NeuRAD: Neural Rendering for Autonomous Driving | Adam Tonderski et.al. | 2311.15260 | link |
2023-11-24 | Animate124: Animating One Image to 4D Dynamic Scene | Yuyang Zhao et.al. | 2311.14603 | null |
2023-11-24 | GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting | Yiwen Chen et.al. | 2311.14521 | link |
2023-11-23 | ECRF: Entropy-Constrained Neural Radiance Fields Compression with Frequency Domain Optimization | Soonbin Lee et.al. | 2311.14208 | null |
2023-11-23 | Tube-NeRF: Efficient Imitation Learning of Visuomotor Policies from MPC using Tube-Guided Data Augmentation and NeRFs | Andrea Tagliabue et.al. | 2311.14153 | null |
2023-11-23 | Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder | Xiaohao Xu et.al. | 2311.13750 | null |
2023-11-22 | Compact 3D Gaussian Representation for Radiance Field | Joo Chan Lee et.al. | 2311.13681 | link |
2023-11-22 | Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning | Kai Yu et.al. | 2311.13617 | null |
2023-11-22 | Animatable 3D Gaussians for High-fidelity Synthesis of Human Motions | Keyang Ye et.al. | 2311.13404 | null |
2023-11-22 | Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images | Jaeyoung Chung et.al. | 2311.13398 | link |
2023-11-22 | 3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization | Jianwei Feng et.al. | 2311.13168 | null |
2023-11-22 | PIE-NeRF: Physics-based Interactive Elastodynamics with NeRF | Yutao Feng et.al. | 2311.13099 | null |
2023-11-21 | SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering | Antoine Guédon et.al. | 2311.12775 | link |
2023-11-21 | Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields | Yifan Wang et.al. | 2311.12490 | null |
2023-11-18 | Towards Function Space Mesh Watermarking: Protecting the Copyright of Signed Distance Fields | Xingyu Zhu et.al. | 2311.12059 | null |
2023-11-20 | GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding | Hao Li et.al. | 2311.11863 | null |
2023-11-20 | Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields | Zhiyuan Min et.al. | 2311.11845 | link |
2023-11-19 | GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise | Xinhai Li et.al. | 2311.11221 | null |
2023-11-18 | SNI-SLAM: Semantic Neural Implicit SLAM | Siting Zhu et.al. | 2311.11016 | link |
2023-11-18 | Structure-Aware Sparse-View X-ray 3D Reconstruction | Yuanhao Cai et.al. | 2311.10959 | link |
2023-11-17 | Removing Adverse Volumetric Effects From Trained Neural Radiance Fields | Andreas L. Teigen et.al. | 2311.10523 | null |
2023-11-18 | EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices | Jingnan Gao et.al. | 2311.09806 | null |
2023-11-16 | Reconstructing Continuous Light Field From Single Coded Image | Yuya Ishikawa et.al. | 2311.09646 | null |
2023-11-15 | Single-Image 3D Human Digitization with Shape-Guided Diffusion | Badour AlBahar et.al. | 2311.09221 | null |
2023-11-15 | DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model | Yinghao Xu et.al. | 2311.09217 | null |
2023-11-15 | Spiking NeRF: Representing the Real-World Geometry by a Discontinuous Representation | Zhanfeng Liao et.al. | 2311.09077 | link |
2023-11-13 | $L_0$-Sampler: An $L_{0}$ Model Guided Volume Sampling for NeRF | Liangchen Li et.al. | 2311.07044 | null |
2023-11-11 | Aria-NeRF: Multimodal Egocentric View Synthesis | Jiankai Sun et.al. | 2311.06455 | null |
2023-11-10 | Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model | Jiahao Li et.al. | 2311.06214 | null |
2023-11-10 | A Neural Height-Map Approach for the Binocular Photometric Stereo Problem | Fotios Logothetis et.al. | 2311.05958 | null |
2023-11-09 | BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis | Hao-Bin Duan et.al. | 2311.05521 | link |
2023-11-09 | Control3D: Towards Controllable Text-to-3D Generation | Yang Chen et.al. | 2311.05461 | null |
2023-11-08 | LRM: Large Reconstruction Model for Single Image to 3D | Yicong Hong et.al. | 2311.04400 | null |
2023-11-07 | ADFactory: Automated Data Factory for Optical Flow Tasks | Han Ling et.al. | 2311.04246 | null |
2023-11-07 | High-fidelity 3D Reconstruction of Plants using Neural Radiance Field | Kewei Hu et.al. | 2311.04154 | null |
2023-11-07 | Fast Sun-aligned Outdoor Scene Relighting based on TensoRF | Yeonjin Chang et.al. | 2311.03965 | null |
2023-11-08 | UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields | Injae Kim et.al. | 2311.03784 | link |
2023-11-06 | Osprey: Multi-Session Autonomous Aerial Mapping with LiDAR-based SLAM and Next Best View Planning | Rowan Border et.al. | 2311.03484 | null |
2023-11-06 | Animating NeRFs from Texture Space: A Framework for Pose-Dependent Rendering of Human Performances | Paul Knoll et.al. | 2311.03140 | null |
2023-11-06 | InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image | Jianhui Li et.al. | 2311.02826 | link |
2023-11-03 | Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields | Jianxiong Shen et.al. | 2311.01815 | null |
2023-11-03 | PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation | Yuhan Ding et.al. | 2311.01773 | null |
2023-11-03 | Efficient Cloud Pipelines for Neural Radiance Fields | Derek Jacoby et.al. | 2311.01659 | null |
2023-11-02 | Novel View Synthesis from a Single RGBD Image for Indoor Scenes | Congrui Hetang et.al. | 2311.01065 | null |
2023-10-31 | FPO++: Efficient Encoding and Rendering of Dynamic Neural Radiance Fields by Analyzing and Enhancing Fourier PlenOctrees | Saskia Rabich et.al. | 2310.20710 | link |
2023-10-31 | NeRF Revisited: Fixing Quadrature Instability in Volume Rendering | Mikaela Angelina Uy et.al. | 2310.20685 | null |
2023-10-30 | Generative Neural Fields by Mixtures of Neural Implicit Functions | Tackgeun You et.al. | 2310.19464 | null |
2023-11-04 | TiV-NeRF: Tracking and Mapping via Time-Varying Representation with Dynamic Neural Radiance Fields | Chengyao Duan et.al. | 2310.18917 | null |
2023-10-28 | INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings | Amirhossein Kazerouni et.al. | 2310.18846 | link |
2023-10-27 | ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image | Kyle Sargent et.al. | 2310.17994 | link |
2023-10-27 | Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations | Tristan Aumentado-Armstrong et.al. | 2310.17880 | null |
2023-10-27 | HyperFields: Towards Zero-Shot Generation of NeRFs from Text | Sudarshan Babu et.al. | 2310.17075 | null |
2023-10-25 | 4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via 4D Semantic Segmentation | Dadong Jiang et.al. | 2310.16858 | null |
2023-10-26 | LightSpeed: Light and Fast Neural Light Fields on Mobile Devices | Aarush Gupta et.al. | 2310.16832 | link |
2023-10-28 | PERF: Panoramic Neural Radiance Field from a Single Panorama | Guangcong Wang et.al. | 2310.16831 | link |
2023-10-25 | Open-NeRF: Towards Open Vocabulary NeRF Decomposition | Hao Zhang et.al. | 2310.16383 | null |
2023-10-25 | UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception | Christopher Maxey et.al. | 2310.16255 | null |
2023-10-24 | Cross-view Self-localization from Synthesized Scene-graphs | Ryogo Yamamoto et.al. | 2310.15504 | null |
2023-10-23 | CAwa-NeRF: Instant Learning of Compression-Aware NeRF Features | Omnia Mahmoud et.al. | 2310.14695 | null |
2023-10-23 | VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations | Yiying Yang et.al. | 2310.14487 | null |
2023-10-20 | ManifoldNeRF: View-dependent Image Feature Supervision for Few-shot Neural Radiance Fields | Daiju Kanaoka et.al. | 2310.13670 | null |
2023-10-20 | Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos | Seoha Kim et.al. | 2310.13356 | link |
2023-10-20 | UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene | Jiaming Gu et.al. | 2310.13263 | null |
2023-10-18 | VQ-NeRF: Neural Reflectance Decomposition and Editing with Vector Quantization | Hongliang Zhong et.al. | 2310.11864 | null |
2023-10-18 | Towards Abdominal 3-D Scene Rendering from Laparoscopy Surgical Videos using NeRFs | Khoa Tuan Nguyen et.al. | 2310.11645 | null |
2023-10-16 | TraM-NeRF: Tracing Mirror and Near-Perfect Specular Reflections through Neural Radiance Fields | Leif Van Holland et.al. | 2310.10650 | link |
2023-10-16 | DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing | Jia-Wei Liu et.al. | 2310.10624 | null |
2023-10-16 | Self-supervised Fetal MRI 3D Reconstruction Based on Radiation Diffusion Generation Model | Junpeng Tan et.al. | 2310.10209 | null |
2023-10-15 | ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context | Binglun Wang et.al. | 2310.09965 | null |
2023-10-15 | Active Perception using Neural Radiance Fields | Siming He et.al. | 2310.09892 | link |
2023-10-15 | CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses | Hongyu Fu et.al. | 2310.09776 | null |
2023-10-11 | Dynamic Appearance Particle Neural Radiance Field | Ancheng Lin et.al. | 2310.07916 | null |
2023-10-12 | PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction | Jia-Wang Bian et.al. | 2310.07449 | link |
2023-10-11 | rpcPRF: Generalizable MPI Neural Radiance Field for Satellite Camera | Tongtong Zhang et.al. | 2310.07179 | null |
2023-10-10 | Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization | Le Chen et.al. | 2310.06984 | null |
2023-10-10 | High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field | Minghan Qin et.al. | 2310.06275 | null |
2023-10-09 | A Real-time Method for Inserting Virtual Objects into Neural Radiance Fields | Keyang Ye et.al. | 2310.05837 | null |
2023-10-09 | Neural Impostor: Editing Neural Radiance Fields with Explicit Shape Manipulation | Ruiyang Liu et.al. | 2310.05391 | null |
2023-10-08 | LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization | Artem Nenashev et.al. | 2310.05134 | null |
2023-10-08 | Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation | Dominik Hollidt et.al. | 2310.05133 | null |
2023-10-06 | Improving Neural Radiance Field using Near-Surface Sampling with Point Cloud Generation | Hye Bin Yoo et.al. | 2310.04152 | null |
2023-10-05 | Drag View: Generalizable Novel View Synthesis with Unposed Imagery | Zhiwen Fan et.al. | 2310.03704 | link |
2023-10-05 | Targeted Adversarial Attacks on Generalizable Neural Radiance Fields | Andras Horvath et.al. | 2310.03578 | null |
2023-10-05 | BID-NeRF: RGB-D image pose estimation with inverted Neural Radiance Fields | Ágoston István Csehi et.al. | 2310.03563 | null |
2023-10-04 | Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation | Yihan Wu et.al. | 2310.03125 | null |
2023-10-04 | T $^3$ Bench: Benchmarking Current Progress in Text-to-3D Generation | Yuze He et.al. | 2310.02977 | link |
2023-10-04 | ED-NeRF: Efficient Text-Guided Editing of 3D Scene using Latent Space NeRF | Jangho Park et.al. | 2310.02712 | null |
2023-10-05 | USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields | Moyang Li et.al. | 2310.02687 | link |
2023-10-03 | EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields | Anish Bhattacharya et.al. | 2310.02437 | link |
2023-10-03 | Adaptive Multi-NeRF: Exploit Efficient Parallelism in Adaptive Multiple Scale Neural Radiance Field Rendering | Tong Wang et.al. | 2310.01881 | null |
2023-10-03 | MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields | Takuhiro Kaneko et.al. | 2310.01821 | null |
2023-10-02 | PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2310.00874 | link |
2023-10-01 | How Many Views Are Needed to Reconstruct an Unknown Object Using NeRF? | Sicong Pan et.al. | 2310.00684 | link |
2023-10-01 | Enabling Neural Radiance Fields (NeRF) for Large-scale Aerial Images – A Multi-tiling Approaching and the Geometry Assessment of NeRF | Ningli Xu et.al. | 2310.00530 | null |
2023-09-30 | MMPI: a Flexible Radiance Field Representation by Multiple Multi-plane Images Blending | Yuze He et.al. | 2310.00249 | null |
2023-09-29 | Multi-task View Synthesis with Neural Radiance Fields | Shuhong Zheng et.al. | 2309.17450 | link |
2023-09-29 | Forward Flow for Novel View Synthesis of Dynamic Scenes | Xiang Guo et.al. | 2309.17390 | null |
2023-09-29 | HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field | Xiaochen Zhao et.al. | 2309.17128 | null |
2023-09-28 | Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis | Marcel C. Bühler et.al. | 2309.16859 | null |
2023-09-28 | MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond | Yixuan Li et.al. | 2309.16553 | null |
2023-09-28 | FG-NeRF: Flow-GAN based Probabilistic Neural Radiance Field for Independence-Assumption-Free Uncertainty Estimation | Songlin Wei et.al. | 2309.16364 | null |
2023-09-28 | Learning Effective NeRFs and SDFs Representations with 3D Generative Adversarial Networks for 3D Object Generation: Technical Report for ICCV 2023 OmniObject3D Challenge | Zheyuan Yang et.al. | 2309.16110 | null |
2023-09-27 | P2I-NET: Mapping Camera Pose to Image via Adversarial Learning for New View Synthesis in Real Indoor Environments | Xujie Kang et.al. | 2309.15526 | null |
2023-09-27 | BASED: Bundle-Adjusting Surgical Endoscopic Dynamic Video Reconstruction using Neural Radiance Fields | Shreya Saha et.al. | 2309.15329 | null |
2023-09-26 | 3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction | Miriam Jäger et.al. | 2309.14800 | null |
2023-09-25 | NAS-NeRF: Generative Neural Architecture Search for Neural Radiance Fields | Saeejith Nair et.al. | 2309.14293 | null |
2023-09-25 | Variational Inference for Scalable 3D Object-centric Learning | Tianyu Wang et.al. | 2309.14010 | null |
2023-09-24 | MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field | Zijiang Yang et.al. | 2309.13607 | null |
2023-09-23 | NeRF-Enhanced Outpainting for Faithful Field-of-View Extrapolation | Rui Yu et.al. | 2309.13240 | null |
2023-09-22 | NeRRF: 3D Reconstruction and View Synthesis for Transparent and Specular Objects with Neural Refractive-Reflective Fields | Xiaoxue Chen et.al. | 2309.13039 | link |
2023-09-21 | ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average Texture and Mesh Encoding | Yu Cheng et.al. | 2309.12183 | null |
2023-09-21 | NeuralLabeling: A versatile toolset for labeling vision datasets using Neural Radiance Fields | Floris Erich et.al. | 2309.11966 | link |
2023-09-21 | Fast Satellite Tensorial Radiance Field for Multi-date Satellite Imagery of Large Size | Tongtong Zhang et.al. | 2309.11767 | null |
2023-09-21 | MarkNerf:Watermarking for Neural Radiance Field | Lifeng Chen et.al. | 2309.11747 | null |
2023-09-21 | Rendering stable features improves sampling-based localisation with Neural radiance fields | Boxuan Zhang et.al. | 2309.11698 | null |
2023-09-20 | GenLayNeRF: Generalizable Layered Representations with 3D Model Alignment for Multi-Human View Synthesis | Youssef Abdelkareem et.al. | 2309.11627 | null |
2023-09-20 | Light Field Diffusion for Single-View Novel View Synthesis | Yifeng Xiong et.al. | 2309.11525 | null |
2023-09-21 | Controllable Dynamic Appearance for Neural 3D Portraits | ShahRukh Athar et.al. | 2309.11009 | null |
2023-09-20 | Spiking NeRF: Making Bio-inspired Neural Networks See through the Real World | Xingting Yao et.al. | 2309.10987 | link |
2023-09-19 | Locally Stylized Neural Radiance Fields | Hong-Wing Pang et.al. | 2309.10684 | null |
2023-09-19 | Steganography for Neural Radiance Fields by Backdooring | Weina Dong et.al. | 2309.10503 | null |
2023-09-18 | Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach | Rong Liu et.al. | 2309.10011 | null |
2023-09-18 | RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision | Mingjie Pan et.al. | 2309.09502 | link |
2023-09-17 | NeRF-VINS: A Real-time Neural Radiance Field Map-based Visual-Inertial Navigation System | Saimouli Katragadda et.al. | 2309.09295 | null |
2023-09-16 | DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF | Mert Asim Karaoglu et.al. | 2309.08927 | link |
2023-09-15 | Robust e-NeRF: NeRF from Sparse & Noisy Events under Non-Uniform Motion | Weng Fei Low et.al. | 2309.08596 | link |
2023-09-14 | Gradient based Grasp Pose Optimization on a NeRF that Approximates Grasp Success | Gergely Sóti et.al. | 2309.08040 | null |
2023-09-14 | MC-NeRF: Muti-Camera Neural Radiance Fields for Muti-Camera Image Acquisition Systems | Yu Gao et.al. | 2309.07846 | null |
2023-09-14 | DT-NeRF: Decomposed Triplane-Hash Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis | Yaoyu Su et.al. | 2309.07752 | null |
2023-09-14 | CoRF : Colorizing Radiance Fields using Knowledge Distillation | Ankit Dhiman et.al. | 2309.07668 | null |
2023-09-13 | Text-Guided Generation and Editing of Compositional 3D Avatars | Hao Zhang et.al. | 2309.07125 | null |
2023-09-13 | Dynamic NeRFs for Soccer Scenes | Sacha Lewin et.al. | 2309.06802 | link |
2023-09-12 | Federated Learning for Large-Scale Scene Modeling with Neural Radiance Fields | Teppei Suzuki et.al. | 2309.06030 | null |
2023-09-11 | PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics | Claus Smitt et.al. | 2309.05339 | null |
2023-09-10 | Text-driven Editing of 3D Scenes without Retraining | Shuangkang Fang et.al. | 2309.04917 | link |
2023-09-09 | Mirror-Aware Neural Humans | Daniel Ajisafe et.al. | 2309.04750 | link |
2023-09-08 | Dynamic Mesh-Aware Radiance Fields | Yi-Ling Qiao et.al. | 2309.04581 | null |
2023-09-08 | DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields | Junzhe Zhang et.al. | 2309.04410 | link |
2023-09-14 | SimpleNeRF: Regularizing Sparse Input Neural Radiance Fields with Simpler Solutions | Nagabhushan Somraj et.al. | 2309.03955 | null |
2023-09-07 | BluNF: Blueprint Neural Field | Robin Courant et.al. | 2309.03933 | null |
2023-09-07 | Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model | Sungwon Hwang et.al. | 2309.03550 | null |
2023-09-06 | Bayes’ Rays: Uncertainty Quantification for Neural Radiance Fields | Lily Goli et.al. | 2309.03185 | link |
2023-09-06 | ResFields: Residual Neural Fields for Spatiotemporal Signals | Marko Mihajlovic et.al. | 2309.03160 | link |
2023-09-06 | Instant Continual Learning of Neural Radiance Fields | Ryan Po et.al. | 2309.01811 | null |
2023-09-04 | Adv3D: Generating 3D Adversarial Examples in Driving Scenarios with NeRF | Leheng Li et.al. | 2309.01351 | null |
2023-09-01 | SparseSat-NeRF: Dense Depth Supervised Neural Radiance Fields for Sparse Satellite Images | Lulin Zhang et.al. | 2309.00277 | link |
2023-08-24 | Improving NeRF Quality by Progressive Camera Placement for Unrestricted Navigation in Complex Environments | Georgios Kopanas et.al. | 2309.00014 | null |
2023-09-03 | GHuNeRF: Generalizable Human NeRF from a Monocular Video | Chen Li et.al. | 2308.16576 | link |
2023-08-30 | From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications | Shreyank N Gowda et.al. | 2308.16041 | null |
2023-08-30 | Drone-NeRF: Efficient NeRF Based 3D Scene Reconstruction for Large-Scale Drone Survey | Zhihao Jia et.al. | 2308.15733 | null |
2023-08-29 | Efficient Ray Sampling for Radiance Fields Reconstruction | Shilei Sun et.al. | 2308.15547 | null |
2023-08-29 | Pose-Free Neural Radiance Fields via Implicit Pose Regularization | Jiahui Zhang et.al. | 2308.15049 | null |
2023-08-28 | CLNeRF: Continual Learning Meets NeRF | Zhipeng Cai et.al. | 2308.14816 | link |
2023-08-26 | InsertNeRF: Instilling Generalizability into NeRF with HyperNet Modules | Yanqi Bao et.al. | 2308.13897 | link |
2023-08-24 | NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects | Dakshit Agrawal et.al. | 2308.12560 | link |
2023-08-23 | Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields | Hyeonseop Song et.al. | 2308.11974 | null |
2023-08-25 | Pose Modulated Avatars from Video | Chunjin Song et.al. | 2308.11951 | null |
2023-08-22 | Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts | Wenyan Cong et.al. | 2308.11793 | link |
2023-08-22 | SAMSNeRF: Segment Anything Model (SAM) Guides Dynamic Surgical Scene Reconstruction by Neural Radiance Field (NeRF) | Ange Lou et.al. | 2308.11774 | null |
2023-08-22 | Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views | Wentian Qu et.al. | 2308.11198 | null |
2023-08-22 | Efficient View Synthesis with Neural Radiance Distribution Field | Yushuang Wu et.al. | 2308.11130 | null |
2023-08-21 | CamP: Camera Preconditioning for Neural Radiance Fields | Keunhong Park et.al. | 2308.10902 | null |
2023-08-20 | Strata-NeRF : Neural Radiance Fields for Stratified Scenes | Ankit Dhiman et.al. | 2308.10337 | null |
2023-08-19 | HollowNeRF: Pruning Hashgrid-Based NeRFs with Trainable Collision Mitigation | Xiufeng Xie et.al. | 2308.10122 | null |
2023-08-19 | AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization | Kun Wang et.al. | 2308.10001 | null |
2023-08-19 | Semantic-Human: Neural Rendering of Humans from Monocular Video with Human Parsing | Jie Zhang et.al. | 2308.09894 | null |
2023-08-18 | MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection | Junkai Xu et.al. | 2308.09421 | link |
2023-08-18 | DReg-NeRF: Deep Registration for Neural Radiance Fields | Yu Chen et.al. | 2308.09386 | link |
2023-08-17 | Watch Your Steps: Local Image and Scene Editing by Text Instructions | Ashkan Mirzaei et.al. | 2308.08947 | null |
2023-08-21 | Ref-DVGO: Reflection-Aware Direct Voxel Grid Optimization for an Improved Quality-Efficiency Trade-Off in Reflective Scene Reconstruction | Georgios Kouros et.al. | 2308.08530 | link |
2023-08-16 | SceNeRFlow: Time-Consistent Reconstruction of General Dynamic Scenes | Edith Tretschk et.al. | 2308.08258 | null |
2023-08-16 | Neural radiance fields in the industrial and robotics domain: applications, research opportunities and use cases | Eugen Šlapak et.al. | 2308.07118 | link |
2023-08-14 | S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural Fields | Zeke Xie et.al. | 2308.07032 | link |
2023-08-11 | Focused Specific Objects NeRF | Yuesong Li et.al. | 2308.05970 | null |
2023-08-11 | VERF: Runtime Monitoring of Pose Estimation with Neural Radiance Fields | Dominic Maggio et.al. | 2308.05939 | null |
2023-08-09 | WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields | Muyu Xu et.al. | 2308.04826 | null |
2023-08-14 | A General Implicit Framework for Fast NeRF Composition and Rendering | Xinyu Gao et.al. | 2308.04669 | null |
2023-08-08 | Digging into Depth Priors for Outdoor Neural Radiance Fields | Chen Wang et.al. | 2308.04413 | null |
2023-08-07 | Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing | Junyi Zeng et.al. | 2308.03280 | null |
2023-08-05 | Where and How: Mitigating Confusion in Neural Radiance Fields from Sparse Inputs | Yanqi Bao et.al. | 2308.02908 | link |
2023-08-05 | Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis | Yuxin Wang et.al. | 2308.02840 | null |
2023-08-05 | NeRFs: The Search for the Best 3D Representation | Ravi Ramamoorthi et.al. | 2308.02751 | null |
2023-08-04 | ES-MVSNet: Efficient Framework for End-to-end Self-supervised Multi-View Stereo | Qiang Zhou et.al. | 2308.02191 | null |
2023-08-02 | Incorporating Season and Solar Specificity into Renderings made by a NeRF Architecture using Satellite Images | Michael Gableman et.al. | 2308.01262 | link |
2023-08-01 | High-Fidelity Eye Animatable Neural Radiance Fields for Human Face | Hengfei Wang et.al. | 2308.00773 | null |
2023-08-01 | Context-Aware Talking-Head Video Editing | Songlin Yang et.al. | 2308.00462 | null |
2023-07-28 | Dynamic PlenOctree for Adaptive Sampling Refinement in Explicit NeRF | Haotian Bai et.al. | 2307.15333 | null |
2023-07-27 | Seal-3D: Interactive Pixel-Level Editing for Neural Radiance Fields | Xiangyu Wang et.al. | 2307.15131 | link |
2023-07-27 | MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving | Zirui Wu et.al. | 2307.15058 | link |
2023-07-27 | NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection | Chenfeng Xu et.al. | 2307.14620 | link |
2023-07-26 | Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation | Chaohui Yu et.al. | 2307.13908 | null |
2023-07-24 | Dyn-E: Local Appearance Editing of Dynamic Neural Radiance Fields | Shangzhan Zhang et.al. | 2307.12909 | null |
2023-07-24 | CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components | Davide Di Nucci et.al. | 2307.12718 | null |
2023-07-23 | TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering | Xiao Pan et.al. | 2307.12291 | null |
2023-07-29 | CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields | Ziyuan Luo et.al. | 2307.11526 | link |
2023-07-21 | FaceCLIPNeRF: Text-driven 3D Face Manipulation using Deformable Neural Radiance Fields | Sungwon Hwang et.al. | 2307.11418 | null |
2023-07-21 | Tri-MipRF: Tri-Mip Representation for Efficient Anti-Aliasing Neural Radiance Fields | Wenbo Hu et.al. | 2307.11335 | null |
2023-07-20 | Urban Radiance Field Representation with Deformable Neural Mesh Primitives | Fan Lu et.al. | 2307.10776 | null |
2023-07-20 | Lighting up NeRF via Unsupervised Decomposition and Enhancement | Haoyuan Wang et.al. | 2307.10664 | link |
2023-07-19 | An Improved NeuMIP with Better Accuracy | Bowen Xue et.al. | 2307.10135 | null |
2023-07-19 | Magic NeRF Lens: Interactive Fusion of Neural Radiance Fields for Virtual Facility Inspection | Ke Li et.al. | 2307.09860 | link |
2023-07-14 | Transient Neural Radiance Fields for Lidar View Synthesis and 3D Reconstruction | Anagh Malik et.al. | 2307.09555 | null |
2023-07-18 | Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis | Jiahe Li et.al. | 2307.09323 | link |
2023-07-16 | Cross-Ray Neural Radiance Fields for Novel-view Synthesis from Unconstrained Image Collections | Yifan Yang et.al. | 2307.08093 | link |
2023-07-15 | Improving NeRF with Height Data for Utilization of GIS Data | Hinata Aoki et.al. | 2307.07729 | null |
2023-07-11 | SAR-NeRF: Neural Radiance Fields for Synthetic Aperture Radar Multi-View Representation | Zhengxin Lei et.al. | 2307.05087 | null |
2023-07-07 | NOFA: NeRF-based One-shot Facial Avatar Reconstruction | Wangbo Yu et.al. | 2307.03441 | null |
2023-07-07 | RGB-D Mapping and Tracking in a Plenoxel Radiance Field | Andreas L. Teigen et.al. | 2307.03404 | link |
2023-07-16 | FlipNeRF: Flipped Reflection Rays for Few-shot Novel View Synthesis | Seunghyeon Seo et.al. | 2306.17723 | link |
2023-07-03 | Sphere2Vec: A General-Purpose Location Representation Learning over a Spherical Surface for Large-Scale Geospatial Predictions | Gengchen Mai et.al. | 2306.17624 | null |
2023-06-28 | Envisioning a Next Generation Extended Reality Conferencing System with Efficient Photorealistic Human Rendering | Chuanyue Shen et.al. | 2306.16541 | null |
2023-06-27 | Unsupervised Polychromatic Neural Representation for CT Metal Artifact Reduction | Qing Wu et.al. | 2306.15203 | link |
2023-06-22 | Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields | Ori Gordon et.al. | 2306.12760 | link |
2023-06-21 | Local 3D Editing via 3D Distillation of CLIP Knowledge | Junha Hyung et.al. | 2306.12570 | null |
2023-06-21 | Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase | Qiuyu Wang et.al. | 2306.12423 | link |
2023-06-21 | DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation | Yukun Huang et.al. | 2306.12422 | null |
2023-06-20 | NeRF synthesis with shading guidance | Chenbin Li et.al. | 2306.11556 | null |
2023-06-24 | MA-NeRF: Motion-Assisted Neural Radiance Fields for Face Synthesis from Sparse Images | Weichen Zhang et.al. | 2306.10350 | null |
2023-06-15 | Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion Model | Lu Yu et.al. | 2306.09551 | null |
2023-06-16 | UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video | Zhi-Hao Lin et.al. | 2306.09349 | null |
2023-06-13 | DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$ | Allan Jabri et.al. | 2306.08068 | null |
2023-06-13 | Binary Radiance Fields | Seungjoo Shin et.al. | 2306.07581 | null |
2023-06-10 | From NeRFLiX to NeRFLiX++: A General NeRF-Agnostic Restorer Paradigm | Kun Zhou et.al. | 2306.06388 | null |
2023-06-15 | NERFBK: A High-Quality Benchmark for NERF-Based 3D Reconstruction | Ali Karami et.al. | 2306.06300 | link |
2023-06-09 | HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork | Bipasha Sen et.al. | 2306.06093 | null |
2023-06-09 | GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields | Barbara Roessle et.al. | 2306.06044 | null |
2023-06-09 | RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models | Xingchen Zhou et.al. | 2306.05668 | null |
2023-06-08 | LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs | Zezhou Cheng et.al. | 2306.05410 | null |
2023-06-08 | Enhance-NeRF: Multiple Performance Evaluation for Neural Radiance Fields | Qianqiu Tan et.al. | 2306.05303 | link |
2023-06-06 | Towards Visual Foundational Models of Physical Scenes | Chethan Parameshwara et.al. | 2306.03727 | null |
2023-06-06 | Human 3D Avatar Modeling with Implicit Neural Representation: A Brief Survey | Mingyang Sun et.al. | 2306.03576 | null |
2023-06-05 | H2-Mapping: Real-time Dense Mapping Using Hierarchical Hybrid Representation | Chenxing Jiang et.al. | 2306.03207 | link |
2023-06-05 | BeyondPixels: A Comprehensive Review of the Evolution of Neural Radiance Fields | AKM Shahariar Azad Rabby et.al. | 2306.03000 | null |
2023-06-05 | ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields | Kanghyeok Ko et.al. | 2306.02741 | null |
2023-06-01 | FDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models | Hao Zhang et.al. | 2306.00783 | link |
2023-06-01 | Analyzing the Internals of Neural Radiance Fields | Lukas Radl et.al. | 2306.00696 | link |
2023-06-02 | AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars | Mohit Mendiratta et.al. | 2306.00547 | null |
2023-05-30 | DäRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation | Jiuhn Song et.al. | 2305.19201 | link |
2023-05-30 | Template-free Articulated Neural Point Clouds for Reposable View Synthesis | Lukas Uzolas et.al. | 2305.19065 | link |
2023-05-31 | HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance | Junzhe Zhu et.al. | 2305.18766 | link |
2023-05-31 | Towards a Robust Framework for NeRF Evaluation | Adrian Azzarelli et.al. | 2305.18079 | link |
2023-05-31 | Volume Feature Rendering for Fast Neural Radiance Field Reconstruction | Kang Han et.al. | 2305.17916 | null |
2023-05-30 | PlaNeRF: SVD Unsupervised 3D Plane Regularization for NeRF Large-Scale Scene Reconstruction | Fusang Wang et.al. | 2305.16914 | null |
2023-05-25 | ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image | Zhenzhen Weng et.al. | 2305.16411 | null |
2023-05-25 | Interactive Segment Anything NeRF with Feature Imitation | Xiaokang Chen et.al. | 2305.16233 | null |
2023-05-25 | ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation | Zhengyi Wang et.al. | 2305.16213 | link |
2023-05-31 | Deceptive-NeRF: Enhancing NeRF Reconstruction using Pseudo-Observations from Diffusion Models | Xinhang Liu et.al. | 2305.15171 | null |
2023-05-24 | InpaintNeRF360: Text-Guided 3D Inpainting on Unbounded Neural Radiance Fields | Dongqing Wang et.al. | 2305.15094 | null |
2023-05-24 | OD-NeRF: Efficient Training of On-the-Fly Dynamic Neural Radiance Fields | Zhiwen Yan et.al. | 2305.14831 | null |
2023-05-24 | 3D Open-vocabulary Segmentation with Foundation Models | Kunhao Liu et.al. | 2305.14093 | link |
2023-05-22 | NeRFuser: Large-Scale Scene Representation by NeRF Fusion | Jiading Fang et.al. | 2305.13307 | link |
2023-05-22 | Registering Neural Radiance Fields as 3D Density Images | Han Jiang et.al. | 2305.12843 | null |
2023-05-19 | Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields | Jingbo Zhang et.al. | 2305.11588 | link |
2023-05-18 | MVPSNet: Fast Generalizable Multi-view Photometric Stereo | Dongxu Zhao et.al. | 2305.11167 | null |
2023-05-18 | ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis | Shoukang Hu et.al. | 2305.11031 | link |
2023-05-17 | MultiPlaneNeRF: Neural Radiance Field with Non-Trainable Representation | Dominik Zimny et.al. | 2305.10579 | link |
2023-05-24 | OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields | Youtan Yin et.al. | 2305.10503 | link |
2023-05-16 | NerfBridge: Bringing Real-time, Online Neural Radiance Field Training to Robotics | Javier Yu et.al. | 2305.09761 | link |
2023-05-15 | MV-Map: Offboard HD-Map Generation with Multi-view Consistency | Ziyang Xie et.al. | 2305.08851 | link |
2023-05-12 | BundleRecon: Ray Bundle-Based 3D Neural Reconstruction | Weikun Zhang et.al. | 2305.07342 | null |
2023-05-10 | Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era | Chenghao Li et.al. | 2305.06131 | null |
2023-05-10 | NeRF $^\textbf{2}$ : Neural Radio-Frequency Radiance Fields | Xiaopeng Zhao et.al. | 2305.06118 | null |
2023-05-09 | Instant-NeRF: Instant On-Device Neural Radiance Field Training via Algorithm-Accelerator Co-Designed Near-Memory Processing | Yang Zhao et.al. | 2305.05766 | null |
2023-05-09 | PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces | Yiqun Wang et.al. | 2305.05594 | link |
2023-05-08 | NerfAcc: Efficient Sampling Accelerates NeRFs | Ruilong Li et.al. | 2305.04966 | null |
2023-05-08 | AvatarReX: Real-time Expressive Full-body Avatars | Zerong Zheng et.al. | 2305.04789 | null |
2023-05-07 | HashCC: Lightweight Method to Improve the Quality of the Camera-less NeRF Scene Generation | Jan Olszewski et.al. | 2305.04296 | null |
2023-05-07 | Multi-Space Neural Radiance Fields | Ze-Xin Yin et.al. | 2305.04268 | null |
2023-05-04 | NeRF-QA: Neural Radiance Fields Quality Assessment Database | Pedro Martin et.al. | 2305.03176 | null |
2023-05-04 | NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds | Jun-Kun Chen et.al. | 2305.03049 | null |
2023-05-04 | Radiance Field Gradient Scaling for Unbiased Near-Camera Training | Julien Philip et.al. | 2305.02756 | link |
2023-05-04 | Semantic-aware Generation of Multi-view Portrait Drawings | Biao Ma et.al. | 2305.02618 | link |
2023-05-02 | Neural LiDAR Fields for Novel View Synthesis | Shengyu Huang et.al. | 2305.01643 | null |
2023-05-03 | LatentAvatar: Learning Latent Expression Code for Expressive Neural Head Avatar | Yuelang Xu et.al. | 2305.01190 | null |
2023-05-02 | Federated Neural Radiance Fields | Lachlan Holden et.al. | 2305.01163 | link |
2023-05-01 | GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation | Zhenhui Ye et.al. | 2305.00787 | null |
2023-04-30 | Neural Radiance Fields (NeRFs): A Review and Some Recent Developments | Mohamed Debbagh et.al. | 2305.00375 | null |
2023-04-28 | ViP-NeRF: Visibility Prior for Sparse Input Neural Radiance Fields | Nagabhushan Somraj et.al. | 2305.00041 | link |
2023-04-28 | NeRF-LiDAR: Generating Realistic LiDAR Point Clouds with Neural Radiance Fields | Junge Zhang et.al. | 2304.14811 | link |
2023-04-27 | Learning a Diffusion Prior for NeRFs | Guandao Yang et.al. | 2304.14473 | null |
2023-04-27 | ActorsNeRF: Animatable Few-shot Human Rendering with Generalizable NeRFs | Jiteng Mu et.al. | 2304.14401 | null |
2023-05-03 | Combining HoloLens with Instant-NeRFs: Advanced Real-Time 3D Mobile Mapping | Dennis Haitz et.al. | 2304.14301 | null |
2023-04-27 | Compositional 3D Human-Object Neural Animation | Zhi Hou et.al. | 2304.14070 | null |
2023-04-26 | Super-NeRF: View-consistent Detail Generation for NeRF super-resolution | Yuqi Han et.al. | 2304.13518 | null |
2023-04-26 | VGOS: Voxel Grid Optimization for View Synthesis from Sparse Inputs | Jiakai Sun et.al. | 2304.13386 | link |
2023-04-25 | Local Implicit Ray Function for Generalizable Radiance Field Representation | Xin Huang et.al. | 2304.12746 | null |
2023-04-27 | MF-NeRF: Memory Efficient NeRF with Mixed-Feature Hash Table | Yongjae Lee et.al. | 2304.12587 | link |
2023-04-24 | Instant-3D: Instant Neural Radiance Field Training Towards On-Device AR/VR 3D Reconstruction | Sixu Li et.al. | 2304.12467 | null |
2023-04-24 | TextMesh: Generation of Realistic 3D Meshes From Text Prompts | Christina Tsalicoglou et.al. | 2304.12439 | null |
2023-04-26 | Segment Anything in 3D with NeRFs | Jiazhong Cen et.al. | 2304.12308 | link |
2023-04-24 | Explicit Correspondence Matching for Generalizable Neural Radiance Fields | Yuedong Chen et.al. | 2304.12294 | link |
2023-04-25 | Gen-NeRF: Efficient and Generalizable Neural Radiance Fields via Algorithm-Hardware Co-Design | Yonggan Fu et.al. | 2304.11842 | null |
2023-04-22 | 3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes | Haotian Xue et.al. | 2304.11470 | null |
2023-04-22 | Dehazing-NeRF: Neural Radiance Fields from Hazy Images | Tian Li et.al. | 2304.11448 | null |
2023-04-22 | NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation | Baao Xie et.al. | 2304.11342 | link |
2023-04-21 | AutoNeRF: Training Implicit Scene Representations with Autonomous Agents | Pierre Marza et.al. | 2304.11241 | link |
2023-04-21 | Omni-Line-of-Sight Imaging for Holistic Shape Reconstruction | Binbin Huang et.al. | 2304.10780 | null |
2023-04-20 | A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion | Miriam Jäger et.al. | 2304.10664 | null |
2023-04-20 | Learning Neural Duplex Radiance Fields for Real-Time View Synthesis | Ziyu Wan et.al. | 2304.10537 | null |
2023-04-21 | Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs | Frederik Warburg et.al. | 2304.10532 | link |
2023-04-20 | ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects | Marco Toschi et.al. | 2304.10448 | null |
2023-04-20 | LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields | Tang Tao et.al. | 2304.10406 | link |
2023-04-20 | Revisiting Implicit Neural Representations in Low-Level Vision | Wentian Xu et.al. | 2304.10250 | link |
2023-04-20 | Multiscale Representation for Real-Time Anti-Aliasing Neural Rendering | Dongting Hu et.al. | 2304.10075 | null |
2023-04-20 | Neural Radiance Fields: Past, Present, and Future | Ansh Mittal et.al. | 2304.10050 | link |
2023-04-19 | Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra | Jonas Kulhanek et.al. | 2304.09987 | link |
2023-04-20 | Reference-guided Controllable Inpainting of Neural Radiance Fields | Ashkan Mirzaei et.al. | 2304.09677 | null |
2023-04-18 | SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes | Yiming Gao et.al. | 2304.08971 | null |
2023-04-18 | NeAI: A Pre-convoluted Representation for Plug-and-Play Neural Ambient Illumination | Yiyu Zhuang et.al. | 2304.08757 | null |
2023-04-17 | MoDA: Modeling Deformable 3D Objects from Casual Videos | Chaoyue Song et.al. | 2304.08279 | link |
2023-04-17 | NeRF-Loc: Visual Localization with Conditional Neural Radiance Field | Jianlin Liu et.al. | 2304.07979 | link |
2023-04-16 | Likelihood-Based Generative Radiance Field with Latent Space Energy-Based Model for 3D-Aware Disentangled Image Representation | Yaxuan Zhu et.al. | 2304.07918 | null |
2023-04-16 | CAT-NeRF: Constancy-Aware Tx $^2$ Former for Dynamic Body Modeling | Haidong Zhu et.al. | 2304.07915 | link |
2023-04-16 | SeaThru-NeRF: Neural Radiance Fields in Scattering Media | Deborah Levy et.al. | 2304.07743 | link |
2023-04-14 | UVA: Towards Unified Volumetric Avatar for View Synthesis, Pose rendering, Geometry and Texture Editing | Jinlong Fan et.al. | 2304.06969 | null |
2023-04-17 | Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction | Hansheng Chen et.al. | 2304.06714 | link |
2023-04-13 | Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields | Jonathan T. Barron et.al. | 2304.06706 | null |
2023-04-13 | NeRFVS: Neural Radiance Fields for Free View Synthesis via Geometry Scaffolds | Chen Yang et.al. | 2304.06287 | null |
2023-04-12 | NutritionVerse-Thin: An Optimized Strategy for Enabling Improved Rendering of 3D Thin Food Models | Chi-en Amy Tai et.al. | 2304.05620 | null |
2023-04-11 | Improving Neural Radiance Fields with Depth-aware Optimization for Novel View Synthesis | Shu Chen et.al. | 2304.05218 | link |
2023-04-11 | One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field | Weichuang Li et.al. | 2304.05097 | null |
2023-04-11 | MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields | Ganlin Yang et.al. | 2304.04962 | link |
2023-04-10 | Neural Image-based Avatars: Generalizable Radiance Fields for Human Avatar Modeling | Youngjoong Kwon et.al. | 2304.04897 | null |
2023-04-07 | Event-based Camera Tracker by $\nabla$ t NeRF | Mana Masuda et.al. | 2304.04559 | null |
2023-04-10 | Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos | Liao Wang et.al. | 2304.04452 | null |
2023-04-10 | Inferring Fluid Dynamics via Inverse Rendering | Jinxian Liu et.al. | 2304.04446 | null |
2023-04-10 | Instance Neural Radiance Field | Benran Hu et.al. | 2304.04395 | link |
2023-04-12 | NeRF applied to satellite imagery for surface reconstruction | Federico Semeraro et.al. | 2304.04133 | link |
2023-04-08 | PVD-AL: Progressive Volume Distillation with Active Learning for Efficient Conversion Between Different NeRF Architectures | Shuangkang Fang et.al. | 2304.04012 | link |
2023-04-07 | Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field | Leheng Li et.al. | 2304.03526 | null |
2023-04-06 | Beyond NeRF Underwater: Learning Neural Reflectance Fields for True Color Correction of Marine Imagery | Tianyi Zhang et.al. | 2304.03384 | link |
2023-04-06 | LANe: Lighting-Aware Neural Fields for Compositional Scene Synthesis | Akshay Krishnan et.al. | 2304.03280 | null |
2023-04-06 | Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes | Zian Wang et.al. | 2304.03266 | null |
2023-04-06 | DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model | Hoigi Seo et.al. | 2304.02827 | null |
2023-04-05 | Image Stabilization for Hololens Camera in Remote Collaboration | Gowtham Senthil et.al. | 2304.02736 | null |
2023-04-04 | Generating Continual Human Motion in Diverse 3D Scenes | Aymen Mir et.al. | 2304.02061 | null |
2023-04-04 | MonoHuman: Animatable Human Neural Field from Monocular Video | Zhengming Yu et.al. | 2304.02001 | null |
2023-04-06 | DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models | Yukang Cao et.al. | 2304.00916 | link |
2023-04-01 | JacobiNeRF: NeRF Shaping with Mutual Information Gradients | Xiaomeng Xu et.al. | 2304.00341 | link |
2023-03-31 | VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization | Bingfan Zhu et.al. | 2303.17968 | link |
2023-03-30 | NeRF-Supervised Deep Stereo | Fabio Tosi et.al. | 2303.17603 | link |
2023-03-30 | SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling | Zhitao Yang et.al. | 2303.17368 | link |
2023-03-30 | NeILF++: Inter-Reflectable Light Fields for Geometry and Material Estimation | Jingyang Zhang et.al. | 2303.17147 | null |
2023-03-30 | Enhanced Stable View Synthesis | Nishant Jain et.al. | 2303.17094 | null |
2023-03-29 | TriVol: Point Cloud Rendering via Triple Volumes | Tao Hu et.al. | 2303.16485 | link |
2023-03-29 | Point2Pix: Photo-Realistic Point Cloud Rendering via Neural Radiance Fields | Tao Hu et.al. | 2303.16482 | null |
2023-03-28 | Flow supervision for Deformable NeRF | Chaoyang Wang et.al. | 2303.16333 | null |
2023-03-28 | SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis | Guangcong Wang et.al. | 2303.16196 | link |
2023-03-28 | VMesh: Hybrid Volume-Mesh Representation for Efficient View Synthesis | Yuan-Chen Guo et.al. | 2303.16184 | null |
2023-03-30 | Adaptive Voronoi NeRFs | Tim Elsner et.al. | 2303.16001 | null |
2023-03-28 | F $^{2}$ -NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories | Peng Wang et.al. | 2303.15951 | link |
2023-03-27 | JAWS: Just A Wild Shot for Cinematic Transfer in Neural Radiance Fields | Xi Wang et.al. | 2303.15427 | link |
2023-03-27 | Generalizable Neural Voxels for Fast Human Radiance Fields | Taoran Yi et.al. | 2303.15387 | null |
2023-03-27 | NeUDF: Learning Unsigned Distance Fields from Multi-view Images for Reconstructing Non-watertight Models | Fei Hou et.al. | 2303.15368 | link |
2023-03-24 | Perceptual Quality Assessment of NeRF and Neural View Synthesis Methods for Front-Facing Views | Hanxue Liang et.al. | 2303.15206 | null |
2023-03-27 | 3D-Aware Multi-Class Image-to-Image Translation with NeRFs | Senmao Li et.al. | 2303.15012 | link |
2023-03-26 | Clean-NeRF: Reformulating NeRF to account for View-Dependent Observations | Xinhang Liu et.al. | 2303.14707 | null |
2023-03-25 | SUDS: Scalable Urban Dynamic Scenes | Haithem Turki et.al. | 2303.14536 | null |
2023-03-25 | DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields | Yu Chen et.al. | 2303.14478 | null |
2023-03-25 | NeRF-DS: Neural Radiance Fields for Dynamic Specular Objects | Zhiwen Yan et.al. | 2303.14435 | link |
2023-03-24 | Grid-guided Neural Radiance Fields for Large Urban Scenes | Linning Xu et.al. | 2303.14001 | null |
2023-03-24 | CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D Scene Layout | Yiqi Lin et.al. | 2303.13843 | null |
2023-03-24 | HandNeRF: Neural Radiance Fields for Animatable Interacting Hands | Zhiyang Guo et.al. | 2303.13825 | null |
2023-03-24 | ABLE-NeRF: Attention-Based Rendering with Learnable Embeddings for Neural Radiance Field | Zhe Jun Tang et.al. | 2303.13817 | link |
2023-03-24 | GM-NeRF: Learning Generalizable Model-based Neural Radiance Fields from Multi-view Images | Jianchuan Chen et.al. | 2303.13777 | null |
2023-03-24 | TEGLO: High Fidelity Canonical Texture Mapping from Single-View Images | Vishal Vinod et.al. | 2303.13743 | null |
2023-03-23 | SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates | Mikaela Angelina Uy et.al. | 2303.13582 | null |
2023-03-23 | TriPlaneNet: An Encoder for EG3D Inversion | Ananta R. Bhattarai et.al. | 2303.13497 | null |
2023-03-23 | Plotting Behind the Scenes: Towards Learnable Game Engines | Willi Menapace et.al. | 2303.13472 | null |
2023-03-23 | Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes | Dana Cohen-Bar et.al. | 2303.13450 | link |
2023-03-23 | SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field | Chong Bao et.al. | 2303.13277 | link |
2023-03-23 | Transforming Radiance Field with Lipschitz Network for Photorealistic 3D Scene Stylization | Zicheng Zhang et.al. | 2303.13232 | null |
2023-03-23 | Semantic Ray: Learning a Generalizable Semantic Field with Cross-Reprojection Attention | Fangfu Liu et.al. | 2303.13014 | link |
2023-03-22 | NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions | Mohamad Shahbazi et.al. | 2303.12865 | link |
2023-03-22 | SHERF: Generalizable Human NeRF from a Single Image | Shoukang Hu et.al. | 2303.12791 | link |
2023-03-22 | Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions | Ayaan Haque et.al. | 2303.12789 | null |
2023-03-22 | FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models | Jianglong Ye et.al. | 2303.12786 | link |
2023-03-22 | Balanced Spherical Grid for Egocentric View Synthesis | Changwoon Choi et.al. | 2303.12408 | link |
2023-03-21 | Pre-NeRF 360: Enriching Unbounded Appearances for Neural Radiance Fields | Ahmad AlMughrabi et.al. | 2303.12234 | link |
2023-03-21 | 3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion | Yu-Jhe Li et.al. | 2303.11938 | null |
2023-03-22 | ExtremeNeRF: Few-shot Neural Radiance Fields Under Unconstrained Illumination | SeokYeong Lee et.al. | 2303.11728 | null |
2023-03-20 | DehazeNeRF: Multiple Image Haze Removal and 3D Shape Reconstruction using Neural Radiance Fields | Wei-Ting Chen et.al. | 2303.11364 | null |
2023-03-20 | ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-real Novel View Synthesis via Contrastive Learning | Hao Yang et.al. | 2303.11052 | null |
2023-03-19 | SKED: Sketch-guided Text-based 3D Editing | Aryan Mikaeili et.al. | 2303.10735 | null |
2023-03-19 | NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping | Junyuan Deng et.al. | 2303.10709 | link |
2023-03-18 | 3D Data Augmentation for Driving Scenes on Camera | Wenwen Tong et.al. | 2303.10340 | null |
2023-03-17 | $α$ Surf: Implicit Surface Reconstruction for Semi-Transparent and Thin Objects with Decoupled Geometry and Opacity | Tianhao Wu et.al. | 2303.10083 | null |
2023-03-17 | Single-view Neural Radiance Fields with Depth Teacher | Yurui Chen et.al. | 2303.09952 | null |
2023-03-21 | PartNeRF: Generating Part-Aware Editable 3D Shapes without 3D Supervision | Konstantinos Tertikas et.al. | 2303.09554 | null |
2023-03-16 | LERF: Language Embedded Radiance Fields | Justin Kerr et.al. | 2303.09553 | null |
2023-03-16 | NeRFMeshing: Distilling Neural Radiance Fields into Geometrically-Accurate 3D Meshes | Marie-Julie Rakotosaona et.al. | 2303.09431 | null |
2023-03-17 | NeRFtrinsic Four: An End-To-End Trainable NeRF Jointly Optimizing Diverse Intrinsic and Extrinsic Camera Parameters | Hannah Schieber et.al. | 2303.09412 | link |
2023-03-16 | Reliable Image Dehazing by NeRF | Zheyan Jin et.al. | 2303.09153 | null |
2023-03-15 | Mesh Strikes Back: Fast and Efficient Human Reconstruction from RGB videos | Rohit Jena et.al. | 2303.08808 | null |
2023-03-15 | Re-ReND: Real-time Rendering of NeRFs across Devices | Sara Rojas et.al. | 2303.08717 | link |
2023-03-15 | RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters | Shuja Khalid et.al. | 2303.08695 | null |
2023-03-15 | Harnessing Low-Frequency Neural Fields for Few-Shot View Synthesis | Liangchen Song et.al. | 2303.08370 | link |
2023-03-14 | MELON: NeRF with Unposed Images Using Equivalence Class Estimation | Axel Levy et.al. | 2303.08096 | null |
2023-03-16 | Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation | Junyoung Seo et.al. | 2303.07937 | link |
2023-03-16 | NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction from Multi-view Images | Yunfan Ye et.al. | 2303.07653 | link |
2023-03-14 | Frequency-Modulated Point Cloud Rendering with Easy Editing | Yi Zhang et.al. | 2303.07596 | link |
2023-03-13 | FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization | Jiawei Yang et.al. | 2303.07418 | link |
2023-03-13 | NeRFLiX: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-viewpoint MiXer | Kun Zhou et.al. | 2303.06919 | link |
2023-03-11 | Just Flip: Flipped Observation Generation and Optimization for Neural Radiance Fields to Cover Unobserved View | Minjae Lee et.al. | 2303.06335 | link |
2023-03-10 | NeRFlame: FLAME-based conditioning of NeRF for 3D face rendering | Wojciech Zając et.al. | 2303.06226 | link |
2023-03-10 | You Only Train Once: Multi-Identity Free-Viewpoint Neural Human Rendering from Monocular Videos | Jaehyeok Kim et.al. | 2303.05835 | null |
2023-03-10 | Aleth-NeRF: Low-light Condition View Synthesis with Concealing Fields | Ziteng Cui et.al. | 2303.05807 | null |
2023-03-10 | Self-NeRF: A Self-Training Pipeline for Few-Shot Neural Radiance Fields | Jiayang Bai et.al. | 2303.05775 | null |
2023-03-14 | Hardware Acceleration of Neural Graphics | Muhammad Husnain Mubarik et.al. | 2303.05735 | null |
2023-03-10 | MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field | Kaizhi Yang et.al. | 2303.05703 | null |
2023-03-09 | PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification | Xuan Li et.al. | 2303.05512 | null |
2023-03-08 | FastSurf: Fast Neural RGB-D Surface Reconstruction using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning | Seunghwan Lee et.al. | 2303.04508 | link |
2023-03-08 | DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields | Dipam Patel et.al. | 2303.04322 | null |
2023-03-07 | NEPHELE: A Neural Platform for Highly Realistic Cloud Radiance Rendering | Haimin Luo et.al. | 2303.04086 | null |
2023-03-05 | Semantic-aware Occlusion Filtering Neural Radiance Fields in the Wild | Jaewon Lee et.al. | 2303.03966 | null |
2023-03-07 | Multiscale Tensor Decomposition and Rendering Equation Encoding for View Synthesis | Kang Han et.al. | 2303.03808 | link |
2023-03-10 | Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation from 2D Supervision | Xiaoshuai Zhang et.al. | 2303.03361 | null |
2023-03-07 | Efficient Large-scale Scene Representation with a Hybrid of High-resolution Grid and Plane Features | Yuqi Zhang et.al. | 2303.03003 | link |
2023-03-03 | Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement | Jiaxiang Tang et.al. | 2303.02091 | link |
2023-03-03 | Multi-Plane Neural Radiance Fields for Novel View Synthesis | Youssef Abdelkareem et.al. | 2303.01736 | null |
2023-03-01 | S-NeRF: Neural Radiance Fields for Street Views | Ziyang Xie et.al. | 2303.00749 | null |
2023-02-28 | IntrinsicNGP: Intrinsic Coordinate based Hash Encoding for Human NeRF | Bo Peng et.al. | 2302.14683 | null |
2023-02-27 | BaLi-RF: Bandlimited Radiance Fields for Dynamic Scene Modeling | Sameera Ramasinghe et.al. | 2302.13543 | null |
2023-02-26 | Efficient physics-informed neural networks using hash encoding | Xinquan Huang et.al. | 2302.13397 | null |
2023-02-24 | CATNIPS: Collision Avoidance Through Neural Implicit Probabilistic Scenes | Timothy Chen et.al. | 2302.12931 | link |
2023-02-24 | Learning Neural Volumetric Representations of Dynamic Humans in Minutes | Chen Geng et.al. | 2302.12237 | link |
2023-02-23 | DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models | Jamie Wynn et.al. | 2302.12231 | link |
2023-02-20 | NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion | Jiatao Gu et.al. | 2302.10109 | null |
2023-02-19 | LC-NeRF: Local Controllable Face Generation in Neural Randiance Field | Wenyang Zhou et.al. | 2302.09486 | null |
2023-02-17 | MixNeRF: Modeling a Ray with Mixture Density for Novel View Synthesis from Sparse Inputs | Seunghyeon Seo et.al. | 2302.08788 | link |
2023-02-14 | VQ3D: Learning a 3D-Aware Generative Model on ImageNet | Kyle Sargent et.al. | 2302.06833 | null |
2023-02-13 | 3D-aware Blending with Generative NeRFs | Hyunsu Kim et.al. | 2302.06608 | link |
2023-02-11 | 3D Colored Shape Reconstruction from a Single RGB Image through Diffusion | Bo Li et.al. | 2302.05573 | null |
2023-02-08 | Nerfstudio: A Modular Framework for Neural Radiance Field Development | Matthew Tancik et.al. | 2302.04264 | null |
2023-02-07 | AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis | Susan Liang et.al. | 2302.02088 | null |
2023-02-03 | Semantic 3D-aware Portrait Synthesis and Manipulation Based on Compositional Neural Radiance Field | Tianxiang Ma et.al. | 2302.01579 | link |
2023-02-03 | Robust Camera Pose Refinement for Multi-Resolution Hash Encoding | Hwan Heo et.al. | 2302.01571 | null |
2023-02-03 | INV: Towards Streaming Incremental Neural Videos | Shengze Wang et.al. | 2302.01532 | null |
2023-02-02 | Factor Fields: A Unified Framework for Neural Fields and Beyond | Anpei Chen et.al. | 2302.01226 | null |
2023-02-02 | RobustNeRF: Ignoring Distractors with Robust Losses | Sara Sabour et.al. | 2302.00833 | null |
2023-01-31 | GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis | Zhenhui Ye et.al. | 2301.13430 | null |
2023-01-30 | Equivariant Architectures for Learning in Deep Weight Spaces | Aviv Navon et.al. | 2301.12780 | link |
2023-01-27 | HyperNeRFGAN: Hypernetwork approach to 3D NeRF GAN | Adam Kania et.al. | 2301.11631 | link |
2023-01-27 | A Comparison of Tiny-nerf versus Spatial Representations for 3d Reconstruction | Saulo Abraham Gante et.al. | 2301.11522 | null |
2023-01-27 | SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning | Dongseok Shim et.al. | 2301.11520 | null |
2023-01-26 | Text-To-4D Dynamic Scene Generation | Uriel Singer et.al. | 2301.11280 | null |
2023-01-26 | GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency | Minseop Kwak et.al. | 2301.10941 | link |
2023-01-23 | HexPlane: A Fast Representation for Dynamic Scenes | Ang Cao et.al. | 2301.09632 | link |
2023-01-22 | 3D Reconstruction of Non-cooperative Resident Space Objects using Instant NGP-accelerated NeRF and D-NeRF | Trupti Mahendrakar et.al. | 2301.09060 | null |
2023-01-18 | NeRF in the Palm of Your Hand: Corrective Augmentation for Robotics via Novel-View Synthesis | Allan Zhou et.al. | 2301.08556 | null |
2023-01-19 | RecolorNeRF: Layer Decomposed Radiance Field for Efficient Color Editing of 3D Scenes | Bingchen Gong et.al. | 2301.07958 | null |
2023-01-18 | Behind the Scenes: Density Fields for Single View Reconstruction | Felix Wimbauer et.al. | 2301.07668 | link |
2023-01-17 | A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction | Chongshan Lu et.al. | 2301.06782 | null |
2023-01-13 | Laser: Latent Set Representations for 3D Generative Modeling | Pol Moreno et.al. | 2301.05747 | null |
2023-01-10 | Benchmarking Robustness in Neural Radiance Fields | Chen Wang et.al. | 2301.04075 | null |
2023-01-08 | Towards Open World NeRF-Based SLAM | Daniil Lisus et.al. | 2301.03102 | null |
2023-01-10 | Traditional Readability Formulas Compared for English | Bruce W. Lee et.al. | 2301.02975 | null |
2023-01-09 | Class-Continuous Conditional Generative Neural Radiance Field | Jiwook Kim et.al. | 2301.00950 | link |
2023-01-11 | Detachable Novel Views Synthesis of Dynamic Scenes Using Distribution-Driven Neural Radiance Fields | Boyu Zhang et.al. | 2301.00411 | link |
2022-12-26 | MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos | Fengrui Tian et.al. | 2212.13056 | link |
2022-12-25 | PaletteNeRF: Palette-based Color Editing for NeRFs | Qiling Wu et.al. | 2212.12871 | null |
2022-12-22 | Removing Objects From Neural Radiance Fields | Silvan Weder et.al. | 2212.11966 | null |
2022-12-21 | Incremental Learning for Neural Radiance Field with Uncertainty-Filtered Knowledge Distillation | Mengqi Guo et.al. | 2212.10950 | link |
2022-12-21 | PaletteNeRF: Palette-based Appearance Editing of Neural Radiance Fields | Zhengfei Kuang et.al. | 2212.10699 | null |
2022-12-20 | Correspondence Distillation from NeRF-based GAN | Yushi Lan et.al. | 2212.09735 | null |
2022-12-19 | StyleTRF: Stylizing Tensorial Radiance Fields | Rahul Goel et.al. | 2212.09330 | null |
2022-12-18 | SPARF: Large-Scale Learning of 3D Sparse Radiance Fields from Few Input Images | Abdullah Hamdi et.al. | 2212.09100 | link |
2022-12-18 | Masked Wavelet Representation for Compact Neural Radiance Fields | Daniel Rho et.al. | 2212.09069 | link |
2022-12-15 | SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory | Sicheng Li et.al. | 2212.08476 | null |
2022-12-16 | MEIL-NeRF: Memory-Efficient Incremental Learning of Neural Radiance Fields | Jaeyoung Chung et.al. | 2212.08328 | null |
2022-12-15 | NeRF-Art: Text-Driven Neural Radiance Fields Stylization | Can Wang et.al. | 2212.08070 | link |
2022-12-15 | Real-Time Neural Light Field on Mobile Devices | Junli Cao et.al. | 2212.08057 | link |
2022-12-14 | NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior | Wenjing Bian et.al. | 2212.07388 | link |
2022-12-08 | GazeNeRF: 3D-Aware Gaze Redirection with Neural Radiance Fields | Alessandro Ruzzi et.al. | 2212.04823 | link |
2022-12-09 | 4K-NeRF: High Fidelity Neural Radiance Fields at Ultra High Resolutions | Zhongshu Wang et.al. | 2212.04701 | link |
2022-12-07 | EditableNeRF: Editing Topologically Varying Neural Radiance Fields by Key Points | Chengwei Zheng et.al. | 2212.04247 | null |
2022-12-08 | NeRFEditor: Differentiable Style Decomposition for Full 3D Scene Editing | Chunyi Sun et.al. | 2212.03848 | null |
2022-12-07 | Non-uniform Sampling Strategies for NeRF on 360{\textdegree} images | Takashi Otonari et.al. | 2212.03635 | null |
2022-12-07 | SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields | Siddhant Ranade et.al. | 2212.03406 | null |
2022-12-06 | NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors | Congyue Deng et.al. | 2212.03267 | null |
2022-12-05 | SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields | Anh-Quan Cao et.al. | 2212.02501 | link |
2022-12-05 | Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields | Rohith Agaram et.al. | 2212.02493 | link |
2022-12-06 | D-TensoRF: Tensorial Radiance Fields for Dynamic Scenes | Hankyu Jang et.al. | 2212.02375 | null |
2022-12-07 | GARF:Geometry-Aware Generalized Neural Radiance Field | Yue Shi et.al. | 2212.02280 | null |
2022-12-05 | INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors | Chaojian Li et.al. | 2212.01959 | null |
2022-12-03 | MaRF: Representing Mars as Neural Radiance Fields | Lorenzo Giusti et.al. | 2212.01672 | link |
2022-12-03 | StegaNeRF: Embedding Invisible Information within Neural Radiance Fields | Chenxin Li et.al. | 2212.01602 | null |
2022-12-02 | RT-NeRF: Real-Time On-Device Neural Radiance Fields Towards Immersive AR/VR Rendering | Chaojian Li et.al. | 2212.01120 | null |
2022-12-02 | 3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation | Zutao Jiang et.al. | 2212.01103 | null |
2022-12-02 | QFF: Quantized Fourier Features for Neural Field Representations | Jae Yong Lee et.al. | 2212.00914 | null |
2022-12-01 | ViewNeRF: Unsupervised Viewpoint Estimation Using Category-Level Neural Radiance Fields | Octave Mariotti et.al. | 2212.00436 | null |
2022-11-30 | NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-shot Real Image Animation | Yu Yin et.al. | 2211.17235 | null |
2022-11-29 | NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360° Views | Dejia Xu et.al. | 2211.16431 | link |
2022-11-29 | Compressing Volumetric Radiance Fields to 1 MB | Lingzhi Li et.al. | 2211.16386 | link |
2022-11-28 | In-Hand 3D Object Scanning from an RGB Sequence | Shreyas Hampali et.al. | 2211.16193 | null |
2022-11-30 | One is All: Bridging the Gap Between Neural Radiance Fields Architectures with Progressive Volume Distillation | Shuangkang Fang et.al. | 2211.15977 | link |
2022-11-28 | High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors | Yunpeng Bai et.al. | 2211.15064 | null |
2022-11-27 | SuNeRF: Validation of a 3D Global Reconstruction of the Solar Corona Using Simulated EUV Images | Kyriaki-Margarita Bintsi et.al. | 2211.14879 | null |
2022-11-27 | 3D Scene Creation and Rendering via Rough Meshes: A Lighting Transfer Avenue | Yujie Li et.al. | 2211.14823 | null |
2022-11-27 | Sampling Neural Radiance Fields for Refractive Objects | Jen-I Pan et.al. | 2211.14799 | link |
2022-11-25 | 3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models | Gang Li et.al. | 2211.14108 | null |
2022-11-25 | ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision | Jingwang Ling et.al. | 2211.14086 | link |
2022-11-25 | Dynamic Neural Portraits | Michail Christos Doukas et.al. | 2211.13994 | null |
2022-11-25 | Unsupervised Continual Semantic Adaptation through Neural Rendering | Zhizheng Liu et.al. | 2211.13969 | link |
2022-11-25 | TPA-Net: Generate A Dataset for Text to Physics-based Animation | Yuxing Qiu et.al. | 2211.13887 | null |
2022-11-24 | ScanNeRF: a Scalable Benchmark for Neural Radiance Fields | Luca De Luigi et.al. | 2211.13762 | null |
2022-11-24 | Immersive Neural Graphics Primitives | Ke Li et.al. | 2211.13494 | link |
2022-11-23 | CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields | Keqiang Sun et.al. | 2211.13251 | null |
2022-11-26 | ClimateNeRF: Physically-based Neural Rendering for Extreme Climate Synthesis | Yuan Li et.al. | 2211.13226 | null |
2022-11-23 | ManVatar : Fast 3D Head Avatar Reconstruction Using Motion-Aware Neural Voxels | Yuelang Xu et.al. | 2211.13206 | null |
2022-11-23 | BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields | Peng Wang et.al. | 2211.12853 | link |
2022-11-23 | PANeRF: Pseudo-view Augmentation for Improved Neural Radiance Fields Based on Few-shot Inputs | Young Chun Ahn et.al. | 2211.12758 | null |
2022-11-23 | ActiveRMAP: Radiance Field for Active Mapping And Planning | Huangying Zhan et.al. | 2211.12656 | null |
2022-11-22 | Zero NeRF: Registration with Zero Overlap | Casey Peat et.al. | 2211.12544 | null |
2022-11-22 | Depth-Supervised NeRF for Multi-View RGB-D Operating Room Images | Beerend G. A. Gerats et.al. | 2211.12436 | null |
2022-11-22 | Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition | Jiaxiang Tang et.al. | 2211.12368 | null |
2022-11-22 | Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields | Brian K. S. Isaac-Medina et.al. | 2211.12285 | link |
2022-11-22 | SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields | Ashkan Mirzaei et.al. | 2211.12254 | null |
2022-11-22 | Deblurred Neural Radiance Field with Physical Scene Priors | Dogyoon Lee et.al. | 2211.12046 | link |
2022-11-22 | ONeRF: Unsupervised 3D Object Segmentation from Multiple Views | Shengnan Liang et.al. | 2211.12038 | null |
2022-11-21 | Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques | David Ramirez et.al. | 2211.11836 | null |
2022-11-21 | SPARF: Neural Radiance Fields from Sparse and Noisy Poses | Prune Truong et.al. | 2211.11738 | link |
2022-11-21 | ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari et.al. | 2211.11704 | null |
2022-11-21 | Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion | Dario Pavllo et.al. | 2211.11674 | link |
2022-11-18 | Magic3D: High-Resolution Text-to-3D Content Creation | Chen-Hsuan Lin et.al. | 2211.10440 | null |
2022-11-17 | AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training | Yifan Jiang et.al. | 2211.09682 | null |
2022-11-16 | CoNFies: Controllable Neural Face Avatars | Heng Yu et.al. | 2211.08610 | null |
2022-11-14 | Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures | Gal Metzer et.al. | 2211.07600 | link |
2022-11-12 | 3D-Aware Encoding for Style-based Neural Radiance Fields | Yu-Jhe Li et.al. | 2211.06583 | null |
2022-11-11 | ParticleNeRF: A Particle-Based Encoding for Online Neural Radiance Fields in Dynamic Scenes | Jad Abou-Chakra et.al. | 2211.04041 | null |
2022-11-07 | Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories | Samarth Sinha et.al. | 2211.03889 | null |
2022-11-03 | nerf2nerf: Pairwise Registration of Neural Radiance Fields | Lily Goli et.al. | 2211.01600 | null |
2022-10-27 | ProbNeRF: Uncertainty-Aware Inference of 3D Shapes from 2D Images | Matthew D. Hoffman et.al. | 2210.17415 | null |
2022-10-27 | Boosting Point Clouds Rendering via Radiance Mapping | Xiaoyang Huang et.al. | 2210.15107 | link |
2022-10-24 | Learning Neural Radiance Fields from Multi-View Geometry | Marco Orsingher et.al. | 2210.13041 | null |
2022-10-23 | Compressing Explicit Voxel Grid Representations: fast NeRFs become also small | Chenxi Lola Deng et.al. | 2210.12782 | null |
2022-11-06 | Joint Rigid Motion Correction and Sparse-View CT via Self-Calibrating Neural Field | Qing Wu et.al. | 2210.12731 | null |
2022-10-21 | An Exploration of Neural Radiance Field Scene Reconstruction: Synthetic, Real-world and Dynamic Scenes | Benedict Quartey et.al. | 2210.12268 | null |
2022-11-06 | Neural Fields for Robotic Object Manipulation from a Single Image | Valts Blukis et.al. | 2210.12126 | null |
2022-10-21 | HDHumans: A Hybrid Approach for High-fidelity Digital Humans | Marc Habermann et.al. | 2210.12003 | null |
2022-10-21 | RGB-Only Reconstruction of Tabletop Scenes for Collision-Free Manipulator Control | Zhenggang Tang et.al. | 2210.11668 | null |
2022-10-21 | Coordinates Are NOT Lonely – Codebook Prior Helps Implicit Neural 3D Representations | Fukun Yin et.al. | 2210.11170 | link |
2022-10-18 | Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation | Yunzhi Lin et.al. | 2210.10108 | link |
2022-10-18 | ARAH: Animatable Volume Rendering of Articulated Human SDFs | Shaofei Wang et.al. | 2210.10036 | null |
2022-10-20 | Differentiable Physics Simulation of Dynamics-Augmented Neural Objects | Simon Le Cleac’h et.al. | 2210.09420 | null |
2022-10-15 | SPIDR: SDF-based Neural Point Fields for Illumination and Deformation | Ruofan Liang et.al. | 2210.08398 | null |
2022-10-15 | IBL-NeRF: Image-Based Lighting Formulation of Neural Radiance Fields | Changwoon Choi et.al. | 2210.08202 | link |
2022-10-17 | 3D GAN Inversion with Pose Optimization | Jaehoon Ko et.al. | 2210.07301 | link |
2022-10-13 | Multiplane NeRF-Supervised Disentanglement of Depth and Camera Pose from Videos | Yang Fu et.al. | 2210.07181 | null |
2022-10-12 | GraspNeRF: Multiview-based 6-DoF Grasp Detection for Transparent and Specular Objects Using Generalizable NeRF | Qiyu Dai et.al. | 2210.06575 | link |
2022-10-12 | Reconstructing Personalized Semantic Facial NeRF Models From Monocular Video | Xuan Gao et.al. | 2210.06108 | link |
2022-10-11 | X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360 $^{\circ}$ Insufficient RGB-D Views | Haoyi Zhu et.al. | 2210.05135 | link |
2022-10-10 | NeRF2Real: Sim2real Transfer of Vision-guided Bipedal Motion Skills using Neural Radiance Fields | Arunkumar Byravan et.al. | 2210.04932 | null |
2022-10-10 | EVA3D: Compositional 3D Human Generation from 2D Image Collections | Fangzhou Hong et.al. | 2210.04888 | link |
2022-10-13 | NerfAcc: A General NeRF Acceleration Toolbox | Ruilong Li et.al. | 2210.04847 | link |
2022-10-10 | SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction | Yitong Xia et.al. | 2210.04553 | link |
2022-10-09 | Robustifying the Multi-Scale Representation of Neural Radiance Fields | Nishant Jain et.al. | 2210.04233 | null |
2022-10-09 | Estimating Neural Reflectance Field from Radiance Field using Tree Structures | Xiu Li et.al. | 2210.04217 | null |
2022-10-09 | Data augmentation for NeRF: a geometric consistent solution based on view morphing | Matteo Bortolon et.al. | 2210.04214 | link |
2022-10-09 | Towards Efficient Neural Scene Graphs by Learning Consistency Fields | Yeji Song et.al. | 2210.04127 | null |
2022-10-08 | ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints | Yinpeng Dong et.al. | 2210.03895 | link |
2022-10-04 | SelfNeRF: Fast Training NeRF for Human from Monocular Self-rotating Video | Bo Peng et.al. | 2210.01651 | null |
2022-10-03 | NARF22: Neural Articulated Radiance Fields for Configuration-Aware Rendering | Stanley Lewis et.al. | 2210.01166 | null |
2022-10-02 | IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis | Weicai Ye et.al. | 2210.00647 | link |
2022-10-02 | Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation | Xinhang Liu et.al. | 2210.00489 | null |
2022-10-01 | NeRF: Neural Radiance Field in 3D Vision, A Comprehensive Review | Kyle Gao et.al. | 2210.00379 | null |
2022-10-01 | Structure-Aware NeRF without Posed Camera via Epipolar Constraint | Shu Chen et.al. | 2210.00183 | link |
2022-09-30 | Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator | Zifan Shi et.al. | 2209.15637 | null |
2022-09-30 | Understanding Pure CLIP Guidance for Voxel Grid NeRF Models | Han-Hung Lee et.al. | 2209.15172 | null |
2022-09-29 | DreamFusion: Text-to-3D using 2D Diffusion | Ben Poole et.al. | 2209.14988 | null |
2022-09-29 | SymmNeRF: Learning to Explore Symmetry Prior for Single-View View Synthesis | Xingyi Li et.al. | 2209.14819 | link |
2022-10-03 | 360FusionNeRF: Panoramic Neural Radiance Fields with Joint Guidance | Shreyas Kulkarni et.al. | 2209.14265 | link |
2022-09-27 | OmniNeRF: Hybriding Omnidirectional Distance and Radiance fields for Neural Surface Reconstruction | Jiaming Shen et.al. | 2209.13433 | null |
2022-09-27 | Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping | Chi-Ming Chung et.al. | 2209.13274 | link |
2022-09-27 | WaterNeRF: Neural Radiance Fields for Underwater Scenes | Advaith Venkatramanan Sethuraman et.al. | 2209.13091 | null |
2022-09-26 | Baking in the Feature: Accelerating Volumetric Segmentation by Rendering Feature Maps | Kenneth Blomqvist et.al. | 2209.12744 | null |
2022-09-25 | Enforcing safety for vision-based controllers via Control Barrier Functions and Neural Radiance Fields | Mukun Tong et.al. | 2209.12266 | null |
2022-09-24 | NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields | Jiankai Sun et.al. | 2209.12068 | null |
2022-09-19 | Loc-NeRF: Monte Carlo Localization using Neural Radiance Fields | Dominic Maggio et.al. | 2209.09050 | link |
2022-09-23 | NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes | Zhiwen Fan et.al. | 2209.08776 | link |
2022-09-19 | Density-aware NeRF Ensembles: Quantifying Predictive Uncertainty in Neural Radiance Fields | Niko Sünderhauf et.al. | 2209.08718 | null |
2022-09-18 | ActiveNeRF: Learning where to See with Uncertainty Estimation | Xuran Pan et.al. | 2209.08546 | link |
2022-09-18 | LATITUDE: Robotic Global Localization with Truncated Dynamic Low-pass Filter in City-scale NeRF | Zhenxin Zhu et.al. | 2209.08498 | link |
2022-09-16 | iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking | Yuhang Ming et.al. | 2209.07919 | null |
2022-09-12 | StructNeRF: Neural Radiance Fields for Indoor Scenes with Structural Hints | Zheng Chen et.al. | 2209.05277 | null |
2022-09-09 | Generative Deformable Radiance Fields for Disentangled Image Synthesis of Topology-Varying Objects | Ziyu Wang et.al. | 2209.04183 | null |
2022-09-08 | im2nerf: Image to Neural Radiance Field in the Wild | Lu Mi et.al. | 2209.04061 | null |
2022-09-08 | PixTrack: Precise 6DoF Object Pose Tracking using NeRF Templates and Feature-metric Alignment | Prajwal Chidananda et.al. | 2209.03910 | link |
2022-09-07 | Neural Feature Fusion Fields: 3D Distillation of Self-Supervised 2D Image Representations | Vadim Tschernezki et.al. | 2209.03494 | null |
2022-08-29 | Volume Rendering Digest (for NeRF) | Andrea Tagliasacchi et.al. | 2209.02417 | null |
2022-09-06 | CLONeR: Camera-Lidar Fusion for Occupancy Grid-aided Neural Representations | Alexandra Carlson et.al. | 2209.01194 | null |
2022-09-01 | On Quantizing Implicit Neural Representations | Cameron Gordon et.al. | 2209.01019 | null |
2022-08-31 | Dual-Space NeRF: Learning Animatable Avatars and Scene Lighting in Separate Spaces | Yihao Zhi et.al. | 2208.14851 | link |
2022-08-30 | A Portable Multiscopic Camera for Novel View and Time Synthesis in Dynamic Scenes | Tianjia Zhang et.al. | 2208.14433 | null |
2022-08-24 | PeRFception: Perception using Radiance Fields | Yoonwoo Jeong et.al. | 2208.11537 | link |
2022-08-24 | E-NeRF: Neural Radiance Fields from a Moving Event Camera | Simon Klenk et.al. | 2208.11300 | link |
2022-08-18 | Neural Capture of Animatable 3D Human from Monocular Video | Gusi Te et.al. | 2208.08728 | null |
2022-08-16 | Casual Indoor HDR Radiance Capture from Omnidirectional Images | Pulkit Gera et.al. | 2208.07903 | null |
2022-08-15 | DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images | Bing Wang et.al. | 2208.07227 | link |
2022-08-11 | RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild | Jason Y. Zhang et.al. | 2208.05963 | null |
2022-08-11 | FDNeRF: Few-shot Dynamic Neural Radiance Fields for Face Reconstruction and Expression Editing | Jingbo Zhang et.al. | 2208.05751 | link |
2022-08-04 | 360Roam: Real-Time Indoor Roaming Using Geometry-Aware ${360^\circ}$ Radiance Fields | Huajian Huang et.al. | 2208.02705 | null |
2022-08-02 | T4DT: Tensorizing Time for Learning Temporal 3D Visual Data | Mikhail Usvyatsov et.al. | 2208.01421 | link |
2022-08-01 | DoF-NeRF: Depth-of-Field Meets Neural Radiance Fields | Zijin Wu et.al. | 2208.00945 | link |
2022-08-06 | MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures | Zhiqin Chen et.al. | 2208.00277 | link |
2022-07-30 | Distilled Low Rank Neural Radiance Field with Quantization for Light Field Compression | Jinglei Shi et.al. | 2208.00164 | null |
2022-08-01 | End-to-end View Synthesis via NeRF Attention | Zelin Zhao et.al. | 2207.14741 | null |
2022-07-29 | Neural Density-Distance Fields | Itsuki Ueda et.al. | 2207.14455 | link |
2022-07-27 | Is Attention All NeRF Needs? | Mukund Varma T et.al. | 2207.13298 | null |
Gaussian Splatting
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-15 | A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction | Haoxuan Qu et.al. | 2507.11321 | null |
2025-07-16 | TRAN-D: 2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update | Jeongyun Kim et.al. | 2507.11069 | null |
2025-07-15 | Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling | Hayeon Kim et.al. | 2507.11061 | null |
2025-07-14 | ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions | Shivangi Aneja et.al. | 2507.10542 | null |
2025-07-14 | 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving | Yixun Zhang et.al. | 2507.09993 | null |
2025-07-11 | Learning human-to-robot handovers through 3D scene reconstruction | Yuekun Wu et.al. | 2507.08726 | null |
2025-07-11 | RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting | Ji Hyun Seo et.al. | 2507.08434 | null |
2025-07-10 | Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction | Hyungjun Doh et.al. | 2507.08137 | null |
2025-07-10 | RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration | Chong Cheng et.al. | 2507.08136 | null |
2025-07-10 | RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection | Yongyang Zhou et.al. | 2507.07733 | null |
2025-07-10 | MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation | Bangning Wei et.al. | 2507.07519 | null |
2025-07-10 | SD-GS: Structured Deformable 3D Gaussians for Efficient Dynamic Scene Reconstruction | Wei Yao et.al. | 2507.07465 | null |
2025-07-10 | Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image Collections | Yongtang Bao et.al. | 2507.07395 | null |
2025-07-09 | LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS | Wanhua Li et.al. | 2507.07136 | null |
2025-07-09 | Enhancing non-Rigid 3D Model Deformations Using Mesh-based Gaussian Splatting | Wijayathunga W. M. R. D. B et.al. | 2507.07000 | null |
2025-07-09 | Photometric Stereo using Gaussian Splatting and inverse rendering | Matéo Ducastel et.al. | 2507.06684 | null |
2025-07-09 | FlexGaussian: Flexible and Cost-Effective Training-Free Compression for 3D Gaussian Splatting | Boyuan Tian et.al. | 2507.06671 | null |
2025-07-09 | ClipGS: Clippable Gaussian Splatting for Interactive Cinematic Visualization of Volumetric Medical Data | Chengkun Li et.al. | 2507.06647 | null |
2025-07-08 | LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures | Seungoh Han et.al. | 2507.06109 | null |
2025-07-08 | Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering | Jiayi Song et.al. | 2507.06103 | null |
2025-07-08 | VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis | Alexandre Symeonidis-Herzig et.al. | 2507.06060 | null |
2025-07-08 | D-FCGS: Feedforward Compression of Dynamic Gaussian Splatting for Free-Viewpoint Videos | Wenkang Zhang et.al. | 2507.05859 | null |
2025-07-08 | DreamArt: Generating Interactable Articulated Objects from a Single Image | Ruijie Lu et.al. | 2507.05763 | null |
2025-07-08 | 3DGS_LSR:Large_Scale Relocation for Autonomous Driving Based on 3D Gaussian Splatting | Haitao Lu et.al. | 2507.05661 | null |
2025-07-07 | Mastering Regional 3DGS: Locating, Initializing, and Editing with Diverse 2D Priors | Lanqing Guo et.al. | 2507.05426 | null |
2025-07-07 | SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation | Jiahao Zhu et.al. | 2507.05256 | null |
2025-07-07 | InterGSEdit: Interactive 3D Gaussian Splatting Editing with 3D Geometry-Consistent Attention Prior | Minghao Wen et.al. | 2507.04961 | null |
2025-07-05 | A3FR: Agile 3D Gaussian Splatting with Incremental Gaze Tracked Foveated Rendering in Virtual Reality | Shuo Xin et.al. | 2507.04147 | null |
2025-07-05 | Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM | Xiaolei Lang et.al. | 2507.04004 | null |
2025-07-05 | ArmGS: Composite Gaussian Appearance Refinement for Modeling Dynamic Urban Environments | Guile Wu et.al. | 2507.03886 | null |
2025-07-04 | Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps | Chong Cheng et.al. | 2507.03737 | null |
2025-07-03 | HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars | Gent Serifi et.al. | 2507.02803 | null |
2025-07-03 | ArtGS:3D Gaussian Splatting for Interactive Visual-Physical Modeling and Manipulation of Articulated Objects | Qiaojun Yu et.al. | 2507.02600 | null |
2025-07-03 | LocalDyGS: Multi-view Global Dynamic Scene Modeling via Adaptive Local Implicit Feature Decoupling | Jiahao Wu et.al. | 2507.02363 | null |
2025-07-03 | Gbake: Baking 3D Gaussian Splats into Reflection Probes | Stephen Pasch et.al. | 2507.02257 | null |
2025-07-02 | 3D Gaussian Splatting Driven Multi-View Robust Physical Adversarial Camouflage Generation | Tianrui Lou et.al. | 2507.01367 | null |
2025-07-01 | VISTA: Open-Vocabulary, Task-Relevant Robot Exploration with Online Semantic Gaussian Splatting | Keiko Nagami et.al. | 2507.01125 | null |
2025-07-01 | A LoD of Gaussians: Unified Training and Rendering for Ultra-Large Scale Reconstruction with External Memory | Felix Windisch et.al. | 2507.01110 | null |
2025-07-01 | Masks make discriminative models great again! | Tianshi Cao et.al. | 2507.00916 | null |
2025-07-01 | GaussianVLM: Scene-centric 3D Vision-Language Models using Language-aligned Gaussian Splats for Embodied Reasoning and Beyond | Anna-Maria Halacheva et.al. | 2507.00886 | null |
2025-07-01 | LOD-GS: Level-of-Detail-Sensitive 3D Gaussian Splatting for Detail Conserved Anti-Aliasing | Zhenya Yang et.al. | 2507.00554 | null |
2025-07-01 | GDGS: 3D Gaussian Splatting Via Geometry-Guided Initialization And Dynamic Density Control | Xingjun Wang et.al. | 2507.00363 | null |
2025-06-30 | MILo: Mesh-In-the-Loop Gaussian Splatting for Detailed and Efficient Surface Reconstruction | Antoine Guédon et.al. | 2506.24096 | null |
2025-06-30 | GaVS: 3D-Grounded Video Stabilization via Temporally-Consistent Local Reconstruction and Rendering | Zinuo You et.al. | 2506.23957 | null |
2025-06-30 | AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention | Ziao Liu et.al. | 2506.23611 | null |
2025-06-30 | Instant GaussianImage: A Generalizable and Self-Adaptive Image Representation via 2D Gaussian Splatting | Zhaojie Zeng et.al. | 2506.23479 | null |
2025-07-01 | SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting | Yiming Huang et.al. | 2506.23309 | null |
2025-06-29 | Endo-4DGX: Robust Endoscopic Scene Reconstruction and Illumination Correction with Gaussian Splatting | Yiming Huang et.al. | 2506.23308 | null |
2025-06-29 | TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints | Zhen Tan et.al. | 2506.23207 | null |
2025-06-29 | STD-GS: Exploring Frame-Event Interaction for SpatioTemporal-Disentangled Gaussian Splatting to Reconstruct High-Dynamic Scene | Hanyu Zhou et.al. | 2506.23157 | null |
2025-06-29 | From Coarse to Fine: Learnable Discrete Wavelet Transforms for Efficient 3D Gaussian Splatting | Hung Nguyen et.al. | 2506.23042 | null |
2025-06-28 | Confident Splatting: Confidence-Based Compression of 3D Gaussian Splatting via Learnable Beta Distributions | AmirHossein Naghi Razlighi et.al. | 2506.22973 | null |
2025-06-27 | DIGS: Dynamic CBCT Reconstruction using Deformation-Informed 4D Gaussian Splatting and a Low-Rank Free-Form Deformation Model | Yuliang Huang et.al. | 2506.22280 | null |
2025-06-27 | BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting | Zipei Ma et.al. | 2506.22099 | null |
2025-06-26 | MADrive: Memory-Augmented Driving Scene Modeling | Polina Karpikova et.al. | 2506.21520 | null |
2025-06-26 | EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting | Taoyu Wu et.al. | 2506.21420 | null |
2025-06-28 | Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction | Zhirui Gao et.al. | 2506.21401 | null |
2025-06-26 | Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image | Pufan Li et.al. | 2506.21152 | null |
2025-06-26 | CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization | Jan Ackermann et.al. | 2506.21117 | null |
2025-06-26 | User-in-the-Loop View Sampling with Error Peaking Visualization | Ayaka Yasunaga et.al. | 2506.21009 | null |
2025-06-26 | DBMovi-GS: Dynamic View Synthesis from Blurry Monocular Video via Sparse-Controlled Gaussian Splatting | Yeon-Ji Song et.al. | 2506.20998 | null |
2025-06-25 | 3DGH: 3D Head Generation with Composable Hair and Face | Chengan He et.al. | 2506.20875 | null |
2025-06-25 | RaRa Clipper: A Clipper for Gaussian Splatting Based on Ray Tracer and Rasterizer | Da Li et.al. | 2506.20202 | null |
2025-06-24 | ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model | Tengbo Yu et.al. | 2506.19842 | null |
2025-06-24 | Virtual Memory for 3D Gaussian Splatting | Jonathan Haberl et.al. | 2506.19415 | null |
2025-06-24 | HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis | Xiaoyuan Wang et.al. | 2506.19291 | null |
2025-06-23 | GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM | Annika Thomas et.al. | 2506.18885 | null |
2025-06-23 | ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs | Michal Nazarczuk et.al. | 2506.18792 | null |
2025-06-23 | 3D Arena: An Open Platform for Generative 3D Evaluation | Dylan Ebert et.al. | 2506.18787 | null |
2025-06-23 | Reconstructing Tornadoes in 3D with Gaussian Splatting | Adam Yang et.al. | 2506.18677 | null |
2025-06-21 | 3D Gaussian Splatting for Fine-Detailed Surface Reconstruction in Large-Scale Scene | Shihan Chen et.al. | 2506.17636 | null |
2025-06-20 | Part $^{2}$ GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212 | null |
2025-06-23 | R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision | Weeyoung Kwon et.al. | 2506.16262 | link |
2025-06-19 | Information-computation trade-offs in non-linear transforms | Connor Ding et.al. | 2506.15948 | null |
2025-06-18 | Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos | Kaifeng Zhang et.al. | 2506.15680 | null |
2025-06-18 | RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories | Qingsong Yan et.al. | 2506.15242 | null |
2025-06-17 | Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction | Zhengquan Zhang et.al. | 2506.14856 | null |
2025-06-17 | SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting | Ziqiao Peng et.al. | 2506.14742 | null |
2025-06-17 | 3DGS-IEval-15K: A Large-scale Image Quality Evaluation Database for 3D Gaussian-Splatting | Yuke Xing et.al. | 2506.14642 | link |
2025-06-17 | HRGS: Hierarchical Gaussian Splatting for Memory-Efficient High-Resolution 3D Reconstruction | Changbai Li et.al. | 2506.14229 | null |
2025-06-17 | GAF: Gaussian Action Field as a Dvnamic World Model for Robotic Mlanipulation | Ying Chai et.al. | 2506.14135 | null |
2025-06-16 | GRaD-Nav++: Vision-Language Model Enabled Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics | Qianzhong Chen et.al. | 2506.14009 | null |
2025-06-16 | PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images | Lingteng Qiu et.al. | 2506.13766 | null |
2025-06-16 | Micro-macro Gaussian Splatting with Enhanced Scalability for Unconstrained Scene Reconstruction | Yihui Li et.al. | 2506.13516 | link |
2025-06-16 | Multiview Geometric Regularization of Gaussian Splatting for Accurate Radiance Fields | Jungeon Kim et.al. | 2506.13508 | null |
2025-06-16 | TextureSplat: Per-Primitive Texture Mapping for Reflective Gaussian Splatting | Mae Younes et.al. | 2506.13348 | link |
2025-06-16 | GS-2DGS: Geometrically Supervised 2DGS for Reflective Object Reconstruction | Jinguang Tong et.al. | 2506.13110 | null |
2025-06-15 | Metropolis-Hastings Sampling for 3D Gaussian Reconstruction | Hyunjin Kim et.al. | 2506.12945 | null |
2025-06-15 | Rasterizing Wireless Radiance Field via Deformable 2D Gaussian Splatting | Mufan Liu et.al. | 2506.12787 | null |
2025-06-17 | Efficient multi-view training for 3D Gaussian Splatting | Minhyuk Choi et.al. | 2506.12727 | null |
2025-06-15 | Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors | Wen-Hsuan Chu et.al. | 2506.12716 | null |
2025-06-14 | Perceptual-GS: Scene-adaptive Perceptual Densification for Gaussian Splatting | Hongbi Zhou et.al. | 2506.12400 | link |
2025-06-12 | Anti-Aliased 2D Gaussian Splatting | Mae Younes et.al. | 2506.11252 | link |
2025-06-12 | PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting | Lintao Xiang et.al. | 2506.10335 | null |
2025-06-11 | DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos | Chieh Hubert Lin et.al. | 2506.09997 | null |
2025-06-11 | UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting | Ziyi Wang et.al. | 2506.09952 | link |
2025-06-11 | DynaSplat: Dynamic-Static Gaussian Splatting with Hierarchical Motion Decomposition for Scene Reconstruction | Junli Deng et.al. | 2506.09836 | null |
2025-06-11 | Self-Supervised Multi-Part Articulated Objects Modeling via Deformable Gaussian Splatting and Progressive Primitive Segmentation | Haowen Wang et.al. | 2506.09663 | null |
2025-06-11 | Gaussian Herding across Pens: An Optimal Transport Perspective on Global Gaussian Reduction for 3DGS | Tao Wang et.al. | 2506.09534 | null |
2025-06-11 | HAIF-GS: Hierarchical and Induced Flow-Guided Gaussian Splatting for Dynamic Scene | Jianing Chen et.al. | 2506.09518 | null |
2025-06-11 | TinySplat: Feedforward Approach for Generating Compact 3D Scene Representation | Zetian Song et.al. | 2506.09479 | null |
2025-06-12 | ODG: Occupancy Prediction Using Dual Gaussians | Yunxiao Shi et.al. | 2506.09417 | null |
2025-06-11 | UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images | Qijian Tian et.al. | 2506.09378 | null |
2025-06-10 | StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams | Zike Wu et.al. | 2506.08862 | link |
2025-06-11 | Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting | Keyi Liu et.al. | 2506.08777 | null |
2025-06-10 | SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting | Mengjiao Ma et.al. | 2506.08710 | null |
2025-06-10 | TraGraph-GS: Trajectory Graph-based Gaussian Splatting for Arbitrary Large-Scale Scene Rendering | Xiaohan Zhang et.al. | 2506.08704 | null |
2025-06-10 | Complex-Valued Holographic Radiance Fields | Yicheng Zhan et.al. | 2506.08350 | null |
2025-06-09 | Speedy Deformable 3D Gaussian Splatting: Fast Rendering and Compression of Dynamic Scenes | Allen Tu et.al. | 2506.07917 | link |
2025-06-09 | GaussianVAE: Adaptive Learning Dynamics of 3D Gaussians for High-Fidelity Super-Resolution | Shuja Khalid et.al. | 2506.07897 | null |
2025-06-09 | R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation | William Ljungbergh et.al. | 2506.07826 | null |
2025-06-09 | OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting | Jens Piekenbrinck et.al. | 2506.07697 | null |
2025-06-09 | ProSplat: Improved Feed-Forward 3D Gaussian Splatting for Wide-Baseline Sparse Views | Xiaohan Lu et.al. | 2506.07670 | null |
2025-06-09 | PIG: Physically-based Multi-Material Interaction with 3D Gaussians | Zeyu Xiao et.al. | 2506.07657 | null |
2025-06-09 | Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation | Yijie Deng et.al. | 2506.07338 | null |
2025-06-08 | Accelerating 3D Gaussian Splatting with Neural Sorting and Axis-Oriented Rasterization | Zhican Wang et.al. | 2506.07069 | null |
2025-06-08 | Hybrid Mesh-Gaussian Representation for Efficient Indoor Scene Reconstruction | Binxiao Huang et.al. | 2506.06988 | null |
2025-06-07 | Gaussian Mapping for Evolving Scenes | Vladimir Yugay et.al. | 2506.06909 | null |
2025-06-06 | Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments | Mingrui Li et.al. | 2506.05965 | null |
2025-06-06 | SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction | Yuchao Zheng et.al. | 2506.05935 | null |
2025-06-06 | Lumina: Real-Time Mobile Neural Rendering by Exploiting Computational Redundancy | Yu Feng et.al. | 2506.05682 | null |
2025-06-05 | VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction | Ziyue Zhu et.al. | 2506.05563 | null |
2025-06-05 | On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images | Andreas Meuleman et.al. | 2506.05558 | null |
2025-06-05 | ODE-GS: Latent ODEs for Dynamic Scene Extrapolation with 3D Gaussian Splatting | Daniel Wang et.al. | 2506.05480 | null |
2025-06-05 | Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting | Duochao Shi et.al. | 2506.05327 | null |
2025-06-06 | Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting | Nan Wang et.al. | 2506.05280 | link |
2025-06-05 | Synthetic Dataset Generation for Autonomous Mobile Robots Using 3D Gaussian Splatting for Vision Training | Aneesh Deogan et.al. | 2506.05092 | null |
2025-06-05 | UAV4D: Dynamic Neural Rendering of Human-Centric UAV Imagery using Gaussian Splatting | Jaehoon Choi et.al. | 2506.05011 | null |
2025-06-05 | Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting | Alfred T. Christiansen et.al. | 2506.05009 | null |
2025-06-05 | Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer | Filip Slezak et.al. | 2506.04908 | null |
2025-06-05 | Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations | Gaia Di Lorenzo et.al. | 2506.04789 | null |
2025-06-04 | Photoreal Scene Reconstruction from an Egocentric Device | Zhaoyang Lv et.al. | 2506.04444 | link |
2025-06-04 | HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting | Maksym Ivashechkin et.al. | 2506.04351 | null |
2025-06-04 | Pseudo-Simulation for Autonomous Driving | Wei Cao et.al. | 2506.04218 | link |
2025-06-04 | FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting | Hengyu Liu et.al. | 2506.04174 | null |
2025-06-04 | Splatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data | Ben Moran et.al. | 2506.04120 | null |
2025-06-04 | JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting | Yang Xiao et.al. | 2506.03872 | null |
2025-06-04 | SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting | Shengjie Lin et.al. | 2506.03594 | link |
2025-06-04 | Robust Neural Rendering in the Wild with Asymmetric Dual 3D Gaussian Splatting | Chengqi Li et.al. | 2506.03538 | null |
2025-06-03 | Multi-Spectral Gaussian Splatting with Neural Color Representation | Lukas Meyer et.al. | 2506.03407 | null |
2025-06-03 | LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM | Roman Titkov et.al. | 2506.03073 | null |
2025-06-03 | Large Processor Chip Model | Kaiyan Chang et.al. | 2506.02929 | null |
2025-06-04 | Voyager: Real-Time Splatting City-Scale 3D Gaussians on Your Phone | Zheng Liu et.al. | 2506.02774 | null |
2025-06-03 | RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS | Chuanyu Fu et.al. | 2506.02751 | null |
2025-06-03 | EyeNavGS: A 6-DoF Navigation Dataset and Record-n-Replay Software for Real-World 3DGS Scenes in VR | Zihao Ding et.al. | 2506.02380 | link |
2025-06-02 | GSCodec Studio: A Modular Framework for Gaussian Splat Compression | Sicheng Li et.al. | 2506.01822 | link |
2025-06-02 | WorldExplorer: Towards Generating Fully Navigable 3D Scenes | Manuel-Andreas Schneider et.al. | 2506.01799 | null |
2025-06-02 | WoMAP: World Models For Embodied Open-Vocabulary Object Localization | Tenny Yin et.al. | 2506.01600 | null |
2025-06-02 | RadarSplat: Radar Gaussian Splatting for High-Fidelity Data Synthesis and 3D Reconstruction of Autonomous Driving Scenes | Pou-Chun Kung et.al. | 2506.01379 | null |
2025-06-01 | CountingFruit: Real-Time 3D Fruit Counting with Language-Guided Semantic Gaussian Splatting | Fengze Li et.al. | 2506.01109 | null |
2025-05-30 | AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion | Yangyi Huang et.al. | 2505.24877 | null |
2025-05-30 | TC-GS: A Faster Gaussian Splatting Module Utilizing Tensor Cores | Zimu Liao et.al. | 2505.24796 | link |
2025-05-30 | Tackling View-Dependent Semantics in 3D Language Gaussian Splatting | Jiazhong Cen et.al. | 2505.24746 | link |
2025-05-30 | GARLIC: GAussian Representation LearnIng for spaCe partitioning | Panagiotis Rigas et.al. | 2505.24608 | null |
2025-05-30 | LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework | Xin Kang et.al. | 2505.24245 | null |
2025-05-29 | 3DGEER: Exact and Efficient Volumetric Rendering with 3D Gaussians | Zixun Huang et.al. | 2505.24053 | link |
2025-05-30 | ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS | Weijie Wang et.al. | 2505.23734 | link |
2025-05-29 | AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views | Lihan Jiang et.al. | 2505.23716 | null |
2025-05-29 | Mobi- $π$ : Mobilizing Your Robot Learning Policy | Jingyun Yang et.al. | 2505.23692 | null |
2025-05-29 | Radiant Triangle Soup with Soft Connectivity Forces for 3D Reconstruction and Novel View Synthesis | Nathaniel Burgdorfer et.al. | 2505.23642 | null |
2025-05-29 | Holistic Large-Scale Scene Reconstruction via Mixed Gaussian Splatting | Chuandong Liu et.al. | 2505.23280 | link |
2025-05-29 | LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering | Jonas Kulhanek et.al. | 2505.23158 | null |
2025-05-29 | Pose-free 3D Gaussian splatting via shape-ray estimation | Youngju Na et.al. | 2505.22978 | null |
2025-05-28 | 3DGS Compression with Sparsity-guided Hierarchical Transform Coding | Hao Xu et.al. | 2505.22908 | null |
2025-05-28 | CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting | Kornel Howil et.al. | 2505.22854 | link |
2025-05-28 | STDR: Spatio-Temporal Decoupling for Real-Time Dynamic Scene Rendering | Zehao Li et.al. | 2505.22400 | null |
2025-05-28 | UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments | Wancai Zheng et.al. | 2505.22335 | null |
2025-05-28 | Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss | Wenjun Lu et.al. | 2505.22279 | null |
2025-05-28 | Hyperspectral Gaussian Splatting | Sunil Kumar Narayanan et.al. | 2505.21890 | null |
2025-05-27 | Generalizable and Relightable Gaussian Splatting for Human Novel View Synthesis | Yipengjing Sun et.al. | 2505.21502 | null |
2025-05-27 | Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility | Yidi Li et.al. | 2505.21377 | link |
2025-05-27 | Structure from Collision | Takuhiro Kaneko et.al. | 2505.21335 | null |
2025-05-29 | 3D-UIR: 3D Gaussian for Underwater 3D Scene Reconstruction via Physics Based Appearance-Medium Decoupling | Jieyu Yuan et.al. | 2505.21238 | null |
2025-05-28 | CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians | Weihang Liu et.al. | 2505.21041 | null |
2025-05-27 | Intern-GS: Vision Model Guided Sparse-View 3D Gaussian Splatting | Xiangyu Sun et.al. | 2505.20729 | null |
2025-05-27 | Wideband RF Radiance Field Modeling Using Frequency-embedded 3D Gaussian Splatting | Zechen Li et.al. | 2505.20714 | link |
2025-05-26 | CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting | Lei Tian et.al. | 2505.20469 | null |
2025-05-26 | ParticleGS: Particle-Based Dynamics Modeling of 3D Gaussians for Prior-free Motion Extrapolation | Jinsheng Quan et.al. | 2505.20270 | link |
2025-05-26 | HaloGS: Loose Coupling of Compact Geometry and Gaussian Splats for 3D Scenes | Changjian Jiang et.al. | 2505.20267 | null |
2025-05-26 | OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender | Shintaro Ito et.al. | 2505.20126 | link |
2025-05-26 | Weather-Magician: Reconstruction and Rendering Framework for 4D Weather Synthesis In Real Time | Chen Sang et.al. | 2505.19919 | null |
2025-05-26 | Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud | Natsuki Takama et.al. | 2505.19854 | null |
2025-05-26 | K-Buffers: A Plug-in Method for Enhancing Neural Fields with Multiple Buffers | Haofan Ren et.al. | 2505.19564 | link |
2025-05-26 | ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting | Wenhua Wu et.al. | 2505.19420 | null |
2025-05-25 | Improving Novel view synthesis of 360 $^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images | Guangan Chen et.al. | 2505.19264 | link |
2025-05-25 | Triangle Splatting for Real-Time Radiance Field Rendering | Jan Held et.al. | 2505.19175 | null |
2025-05-25 | FHGS: Feature-Homogenized Gaussian Splatting | Q. G. Duan et.al. | 2505.19154 | null |
2025-05-25 | Veta-GS: View-dependent deformable 3D Gaussian Splatting for thermal infrared Novel-view Synthesis | Myeongseok Nam et.al. | 2505.19138 | null |
2025-05-25 | VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes | Tianchen Deng et.al. | 2505.18992 | link |
2025-05-23 | SplatCo: Structure-View Collaborative Gaussian Splatting for Detail-Preserving Rendering of Large-Scale Unbounded Scenes | Haihong Xiao et.al. | 2505.17951 | null |
2025-05-23 | CGS-GAN: 3D Consistent Gaussian Splatting GANs for High Resolution Human Head Synthesis | Florian Barthel et.al. | 2505.17590 | link |
2025-05-23 | From Flight to Insight: Semantic 3D Reconstruction for Aerial Inspection via Gaussian Splatting and Language-Guided Segmentation | Mahmoud Chick Zaouali et.al. | 2505.17402 | null |
2025-05-22 | Render-FM: A Foundation Model for Real-time Photorealistic Volumetric Rendering | Zhongpai Gao et.al. | 2505.17338 | null |
2025-05-22 | SHaDe: Compact and Consistent Dynamic 3D Reconstruction via Tri-Plane Deformation and Latent Diffusion | Asrar Alruwayqi et.al. | 2505.16535 | null |
2025-05-22 | Motion Matters: Compact Gaussian Streaming for Free-Viewpoint Video Reconstruction | Jiacong Chen et.al. | 2505.16533 | null |
2025-05-21 | RUSplatting: Robust 3D Gaussian Splatting for Sparse-View Underwater Scene Reconstruction | Zhuodong Jiang et.al. | 2505.15737 | null |
2025-05-21 | PlantDreamer: Achieving Realistic 3D Plant Models with Diffusion-Guided Gaussian Splatting | Zane K J Hartley et.al. | 2505.15528 | null |
2025-05-21 | R3GS: Gaussian Splatting for Robust Reconstruction and Relocalization in Unconstrained Image Collections | Xu yan et.al. | 2505.15294 | null |
2025-05-21 | GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation | Yuchen Li et.al. | 2505.15287 | null |
2025-05-21 | X-GRM: Large Gaussian Reconstruction Model for Sparse-view X-rays to Computed Tomography | Yifan Liu et.al. | 2505.15235 | link |
2025-05-21 | GT^2-GS: Geometry-aware Texture Transfer for Gaussian Splatting | Wenjie Liu et.al. | 2505.15208 | null |
2025-05-21 | MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models | Yifan Liu et.al. | 2505.15185 | link |
2025-05-20 | Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning | Amine Elhafsi et.al. | 2505.14938 | null |
2025-05-20 | Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image | Yuxuan Wang et.al. | 2505.14537 | null |
2025-05-20 | MGStream: Motion-aware 3D Gaussian for Streamable Dynamic Scene Reconstruction | Zhenyu Bao et.al. | 2505.13839 | link |
2025-05-19 | Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos | Ruoyu Wang et.al. | 2505.13440 | link |
2025-05-19 | Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene Representation | Seungjun Oh et.al. | 2505.13215 | link |
2025-05-19 | 3D Gaussian Adaptive Reconstruction for Fourier Light-Field Microscopy | Chenyu Xu et.al. | 2505.12875 | null |
2025-05-19 | TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy | Luyao Lei et.al. | 2505.12693 | null |
2025-05-18 | Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey | Calvin Galagain et.al. | 2505.12384 | null |
2025-05-17 | GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity | Takuya Ikeda et.al. | 2505.11905 | null |
2025-05-17 | MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos | Hongyi Zhou et.al. | 2505.11868 | null |
2025-05-17 | Gaussian Splatting as a Unified Representation for Autonomy in Unstructured Environments | Dexter Ong et.al. | 2505.11794 | null |
2025-05-16 | Exploiting Radiance Fields for Grasp Generation on Novel Synthetic Views | Abhishek Kashyap et.al. | 2505.11467 | null |
2025-05-16 | GrowSplat: Constructing Temporal Digital Twins of Plants with Gaussian Splats | Simeon Adebola et.al. | 2505.10923 | null |
2025-05-16 | EA-3DGS: Efficient and Adaptive 3D Gaussians with Highly Enhanced Quality for outdoor scenes | Jianlin Guo et.al. | 2505.10787 | link |
2025-05-14 | ExploreGS: a vision-based low overhead framework for 3D scene reconstruction | Yunji Feng et.al. | 2505.10578 | null |
2025-05-15 | Consistent Quantity-Quality Control across Scenes for Deployment-Aware Gaussian Splatting | Fengdi Zhang et.al. | 2505.10473 | link |
2025-05-15 | VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality | Xuechang Tu et.al. | 2505.10144 | link |
2025-05-15 | Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field | Jinlong Fan et.al. | 2505.10049 | link |
2025-05-15 | Large-Scale Gaussian Splatting SLAM | Zhe Xin et.al. | 2505.09915 | null |
2025-05-14 | Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware | Justin Yu et.al. | 2505.09601 | null |
2025-05-14 | Neural Video Compression using 2D Gaussian Splatting | Lakshya Gupta et.al. | 2505.09324 | null |
2025-05-15 | NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance | Wenzhe Cai et.al. | 2505.08712 | null |
2025-05-13 | DLO-Splatting: Tracking Deformable Linear Objects Using 3D Gaussian Splatting | Holly Dinkel et.al. | 2505.08644 | null |
2025-05-13 | FOCI: Trajectory Optimization on Gaussian Splats | Mario Gomez Andreu et.al. | 2505.08510 | null |
2025-05-13 | A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering | Chuanzhi Xu et.al. | 2505.08438 | null |
2025-05-13 | ADC-GS: Anchor-Driven Deformable and Compressed Gaussian Splatting for Dynamic Scene Reconstruction | He Huang et.al. | 2505.08196 | link |
2025-05-12 | SLAG: Scalable Language-Augmented Gaussian Splatting | Laszlo Szilagyi et.al. | 2505.08124 | null |
2025-05-12 | GIFStream: 4D Gaussian-based Immersive Video with Feature Stream | Hao Li et.al. | 2505.07539 | null |
2025-05-13 | TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset | Olaf Wysocki et.al. | 2505.07396 | null |
2025-05-10 | Virtualized 3D Gaussians: Flexible Cluster-based Level-of-Detail System for Real-Time Rendering of Composed Scenes | Xijie Yang et.al. | 2505.06523 | null |
2025-05-08 | TeGA: Texture Space Gaussian Avatars for High-Resolution Dynamic Head Modeling | Gengyan Li et.al. | 2505.05672 | null |
2025-05-08 | UltraGauss: Ultrafast Gaussian Reconstruction of 3D Ultrasound Volumes | Mark C. Eid et.al. | 2505.05643 | null |
2025-05-08 | QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization | Yueh-Cheng Liu et.al. | 2505.05591 | null |
2025-05-08 | Steepest Descent Density Control for Compact 3D Gaussian Splatting | Peihao Wang et.al. | 2505.05587 | null |
2025-05-08 | SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation | Yonwoo Choi et.al. | 2505.05475 | link |
2025-05-08 | Time of the Flight of the Gaussians: Optimizing Depth Indirectly in Dynamic Radiance Fields | Runfeng Li et.al. | 2505.05356 | null |
2025-05-07 | SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction | Xinran Yang et.al. | 2505.04668 | link |
2025-05-07 | GSsplat: Generalizable Semantic Gaussian Splatting for Novel-view Synthesis in 3D Scenes | Feng Xiao et.al. | 2505.04659 | link |
2025-05-07 | Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting | Feng Yang et.al. | 2505.04262 | null |
2025-05-06 | 3D Gaussian Splatting Data Compression with Mixture of Priors | Lei Liu et.al. | 2505.03310 | null |
2025-05-04 | Sparfels: Fast Reconstruction from Sparse Unposed Imagery | Shubhendu Jena et.al. | 2505.02178 | null |
2025-05-04 | SparSplat: Fast Multi-View Reconstruction with Generalizable 2D Gaussian Splatting | Shubhendu Jena et.al. | 2505.02175 | null |
2025-05-04 | GarmentGS: Point-Cloud Guided Gaussian Splatting for High-Fidelity Non-Watertight 3D Garment Reconstruction | Zhihao Tang et.al. | 2505.02126 | null |
2025-05-04 | SignSplat: Rendering Sign Language via Gaussian Splatting | Maksym Ivashechkin et.al. | 2505.02108 | null |
2025-05-03 | HybridGS: High-Efficiency Gaussian Splatting Data Compression using Dual-Channel Sparse Representation and Point Cloud Encoder | Qi Yang et.al. | 2505.01938 | link |
2025-05-03 | GenSync: A Generalized Talking Head Framework for Audio-driven Multi-Subject Lip-Sync using 3D Gaussian Splatting | Anushka Agarwal et.al. | 2505.01928 | null |
2025-05-03 | Visual enhancement and 3D representation for underwater scenes: a review | Guoxi Huang et.al. | 2505.01869 | null |
2025-05-03 | AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting | Junhao Shi et.al. | 2505.01799 | null |
2025-05-02 | FalconWing: An Open-Source Platform for Ultra-Light Fixed-Wing Aircraft Research | Yan Miao et.al. | 2505.01383 | null |
2025-05-02 | Compensating Spatiotemporally Inconsistent Observations for Online Dynamic 3D Gaussian Splatting | Youngsik Yun et.al. | 2505.01235 | null |
2025-04-30 | A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond | Jiajia Li et.al. | 2505.00737 | link |
2025-05-01 | Real-Time Animatable 2DGS-Avatars with Detail Enhancement from Monocular Videos | Xia Yuan et.al. | 2505.00421 | null |
2025-04-30 | HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation | Haiyang Zhou et.al. | 2504.21650 | link |
2025-04-29 | GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction | Yuhan Xie et.al. | 2504.21067 | link |
2025-04-29 | GaussTrap: Stealthy Poisoning Attacks on 3D Gaussian Splatting for Targeted Scene Confusion | Jiaxin Hong et.al. | 2504.20829 | null |
2025-04-29 | EfficientHuman: Efficient Training and Reconstruction of Moving Human using Articulated 2D Gaussian | Hao Tian et.al. | 2504.20607 | null |
2025-04-29 | Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting | Hanxi Liu et.al. | 2504.20403 | null |
2025-05-01 | GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting | Jongwon Lee et.al. | 2504.20379 | null |
2025-04-29 | Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views | Jiang Wu et.al. | 2504.20378 | link |
2025-04-28 | Mesh-Learner: Texturing Mesh with Spherical Harmonics | Yunfei Wan et.al. | 2504.19938 | link |
2025-04-28 | CE-NPBG: Connectivity Enhanced Neural Point-Based Graphics for Novel View Synthesis in Autonomous Driving Scenes | Mohammad Altillawi et.al. | 2504.19557 | null |
2025-04-28 | GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field | Zuxing Lu et.al. | 2504.19409 | null |
2025-04-27 | Rendering Anywhere You See: Renderability Field-guided Gaussian Splatting | Xiaofeng Jin et.al. | 2504.19261 | null |
2025-04-30 | 4DGS-CC: A Contextual Coding Framework for 4D Gaussian Splatting Data Compression | Zicong Chen et.al. | 2504.18925 | null |
2025-04-26 | TransparentGS: Fast Inverse Rendering of Transparent Objects with Gaussians | Letian Huang et.al. | 2504.18768 | null |
2025-04-28 | RGS-DR: Reflective Gaussian Surfels with Deferred Rendering for Shiny Objects | Georgios Kouros et.al. | 2504.18468 | null |
2025-04-25 | STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting | Yunze Deng et.al. | 2504.18318 | null |
2025-04-25 | PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models | Michel Gokan Khan et.al. | 2504.18165 | link |
2025-04-24 | iVR-GS: Inverse Volume Rendering for Explorable Visualization via Editable 3D Gaussian Splatting | Kaiyuan Tang et.al. | 2504.17954 | link |
2025-04-23 | Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning | Mingxuan Cui et.al. | 2504.17815 | link |
2025-04-24 | CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos | Shucheng Gong et.al. | 2504.17728 | link |
2025-04-23 | Gaussian Splatting is an Effective Data Generator for 3D Object Detection | Farhad G. Zanjani et.al. | 2504.16740 | null |
2025-04-23 | PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation | Wenxuan Li et.al. | 2504.16693 | null |
2025-04-23 | HUG: Hierarchical Urban Gaussian Splatting with Block-Based Reconstruction | Zhongtao Wang et.al. | 2504.16606 | null |
2025-04-23 | ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration | Andrea Conti et.al. | 2504.16545 | null |
2025-04-21 | StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians | Cailin Zhuang et.al. | 2504.15281 | null |
2025-04-21 | Immersive Teleoperation Framework for Locomanipulation Tasks | Takuya Boehringer et.al. | 2504.15229 | null |
2025-04-21 | MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video | Minh-Quan Viet Bui et.al. | 2504.15122 | null |
2025-04-20 | IXGS-Intraoperative 3D Reconstruction from Sparse, Arbitrarily Posed Real X-rays | Sascha Jecklin et.al. | 2504.14699 | null |
2025-04-20 | NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation | Junyuan Fang et.al. | 2504.14638 | null |
2025-04-20 | VGNC: Reducing the Overfitting of Sparse-view 3DGS via Validation-guided Gaussian Number Control | Lifeng Lin et.al. | 2504.14548 | null |
2025-04-20 | Metamon-GS: Enhancing Representability with Variance-Guided Densification and Light Encoding | Junyan Su et.al. | 2504.14460 | null |
2025-04-23 | SEGA: Drivable 3D Gaussian Head Avatar from a Single Image | Chen Guo et.al. | 2504.14373 | null |
2025-04-21 | SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM | Samuel Cerezo et.al. | 2504.13713 | link |
2025-04-18 | Green Robotic Mixed Reality with Gaussian Splatting | Chenxuan Liu et.al. | 2504.13697 | null |
2025-04-18 | EG-Gaussian: Epipolar Geometry and Graph Network Enhanced 3D Gaussian Splatting | Beizhen Zhao et.al. | 2504.13540 | null |
2025-04-17 | Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering | Landon Dyken et.al. | 2504.13339 | null |
2025-04-17 | Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation | Sizhe Yang et.al. | 2504.13175 | null |
2025-04-18 | ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos | Zetong Zhang et.al. | 2504.13167 | null |
2025-04-17 | Digital Twin Generation from Visual Data: A Survey | Andrew Melnik et.al. | 2504.13159 | link |
2025-04-17 | Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint Graphs | Shaohui Dai et.al. | 2504.13153 | link |
2025-04-17 | CompGS++: Compressed Gaussian Splatting for Static and Dynamic Scene Representation | Xiangrui Liu et.al. | 2504.13022 | null |
2025-04-17 | GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration | Rendong Zhang et.al. | 2504.12999 | link |
2025-04-17 | Second-order Optimization of Gaussian Splats with Importance Sampling | Hamza Pehlivan et.al. | 2504.12905 | null |
2025-04-17 | AAA-Gaussians: Anti-Aliased and Artifact-Free 3D Gaussian Rendering | Michael Steiner et.al. | 2504.12811 | null |
2025-04-17 | CAGE-GS: High-fidelity Cage Based 3D Gaussian Splatting Deformation | Yifei Tong et.al. | 2504.12800 | null |
2025-04-17 | TSGS: Improving Gaussian Splatting for Transparent Surface Reconstruction via Normal and De-lighting Priors | Mingwei Li et.al. | 2504.12799 | null |
2025-04-16 | CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting | Wei Sun et.al. | 2504.11893 | null |
2025-04-16 | 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians | Zeming Wei et.al. | 2504.11218 | link |
2025-04-15 | Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation | Andrea Simonelli et.al. | 2504.11024 | null |
2025-04-15 | 3D Gabor Splatting: Reconstruction of High-frequency Surface Texture using Gabor Noise | Haato Watanabe et.al. | 2504.11003 | null |
2025-04-15 | GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR | Christophe Bolduc et.al. | 2504.10809 | null |
2025-04-14 | DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting | Zeren Jiang et.al. | 2504.10486 | link |
2025-04-15 | LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis | Hao Sun et.al. | 2504.10331 | null |
2025-04-14 | ESCT3D: Efficient and Selectively Controllable Text-Driven 3D Content Generation with Gaussian Splatting | Huiqi Wu et.al. | 2504.10316 | null |
2025-04-14 | EBAD-Gaussian: Event-driven Bundle Adjusted Deblur Gaussian Splatting | Yufei Deng et.al. | 2504.10012 | null |
2025-04-16 | GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting | Junlin Hao et.al. | 2504.10001 | null |
2025-04-14 | MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling | Yunpeng Tan et.al. | 2504.09878 | null |
2025-04-13 | TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting | Zhicong Wu et.al. | 2504.09588 | null |
2025-04-13 | DropoutGS: Dropping Out Gaussians for Better Sparse-view Rendering | Yexing Xu et.al. | 2504.09491 | null |
2025-04-12 | A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds | Jizong Peng et.al. | 2504.09129 | null |
2025-04-12 | BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting | Jeongwan On et.al. | 2504.09097 | null |
2025-04-11 | FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents | Xin Tan et.al. | 2504.08581 | null |
2025-04-11 | Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation | Bram Vanherle et.al. | 2504.08473 | link |
2025-04-11 | In-2-4D: Inbetweening from Two Single-View Images to 4D Generation | Sauradip Nag et.al. | 2504.08366 | null |
2025-04-10 | ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting | Junbang Liu et.al. | 2504.08100 | link |
2025-04-10 | InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians | Kefan Chen et.al. | 2504.07949 | null |
2025-04-10 | View-Dependent Uncertainty Estimation of 3D Gaussian Splatting | Chenyu Han et.al. | 2504.07370 | null |
2025-04-09 | Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting | Daiwei Zhang et.al. | 2504.06978 | null |
2025-04-09 | IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments | Can Zhang et.al. | 2504.06827 | null |
2025-04-09 | SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering | Hanxiao Sun et.al. | 2504.06815 | link |
2025-04-09 | GSta: Efficient Training Scheme with Siestaed Gaussians for Monocular 3D Scene Reconstruction | Anil Armagan et.al. | 2504.06716 | null |
2025-04-09 | Collision avoidance from monocular vision trained with novel view synthesis | Valentin Tordjman–Levavasseur et.al. | 2504.06651 | null |
2025-04-10 | Stochastic Ray Tracing of 3D Transparent Gaussians | Xin Sun et.al. | 2504.06598 | null |
2025-04-08 | Micro-splatting: Maximizing Isotropic Constraints for Refined Optimization in 3D Gaussian Splatting | Jee Won Lee et.al. | 2504.05740 | null |
2025-04-07 | View-Dependent Deformation Fields for 2D Editing of 3D Models | Martin El Mqirmi et.al. | 2504.05544 | null |
2025-04-07 | L3GS: Layered 3D Gaussian Splats for Efficient 3D Scene Delivery | Yi-Zhen Tsai et.al. | 2504.05517 | link |
2025-04-07 | Let it Snow! Animating Static Gaussian Scenes With Dynamic Weather Effects | Gal Fiebelman et.al. | 2504.05296 | null |
2025-04-07 | PanoDreamer: Consistent Text to 360-Degree Scene Generation | Zhexiao Xiong et.al. | 2504.05152 | null |
2025-04-07 | 3D Gaussian Particle Approximation of VDB Datasets: A Study for Scientific Visualization | Isha Sharma et.al. | 2504.04857 | null |
2025-04-07 | Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM | Zhicong Sun et.al. | 2504.04844 | link |
2025-04-07 | DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal | Wanzhou Liu et.al. | 2504.04679 | null |
2025-04-06 | Tool-as-Interface: Learning Robot Policies from Human Tool Usage through Imitation Learning | Haonan Chen et.al. | 2504.04612 | null |
2025-04-06 | Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models | Etienne Chassaing et.al. | 2504.04448 | null |
2025-04-05 | 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS | Zhisheng Huang et.al. | 2504.04294 | null |
2025-04-05 | Interpretable Single-View 3D Gaussian Splatting using Unsupervised Hierarchical Disentangled Representation Learning | Yuyang Zhang et.al. | 2504.04190 | null |
2025-04-04 | WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments | Jianhao Zheng et.al. | 2504.03886 | null |
2025-04-04 | HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration | Boyuan Wang et.al. | 2504.03536 | null |
2025-04-03 | Compressing 3D Gaussian Splatting by Noise-Substituted Vector Quantization | Haishan Wang et.al. | 2504.03059 | link |
2025-04-03 | MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM | Renwu Li et.al. | 2504.02437 | null |
2025-04-03 | ConsDreamer: Advancing Multi-View Consistency for Zero-Shot Text-to-3D Generation | Yuan Zhou et.al. | 2504.02316 | link |
2025-04-03 | Digital-twin imaging based on descattering Gaussian splatting | Suguru Shimomura et.al. | 2504.02278 | null |
2025-04-02 | UAVTwin: Neural Digital Twins for UAVs using Gaussian Splatting | Jaehoon Choi et.al. | 2504.02158 | null |
2025-04-02 | WorldPrompter: Traversable Text-to-Scene Generation | Zhaoyang Zhang et.al. | 2504.02045 | null |
2025-04-02 | Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis | Niluthpol Chowdhury Mithun et.al. | 2504.01960 | null |
2025-04-03 | Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting | Shu-Wei Lu et.al. | 2504.01957 | null |
2025-04-02 | BOGausS: Better Optimized Gaussian Splatting | Stéphane Pateux et.al. | 2504.01844 | null |
2025-04-02 | FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking | Ulas Gunes et.al. | 2504.01732 | null |
2025-04-02 | FlowR: Flowing from Sparse to Dense 3D Reconstructions | Tobias Fischer et.al. | 2504.01647 | null |
2025-04-02 | 3DBonsai: Structure-Aware Bonsai Modeling Using Conditioned 3D Gaussian Splatting | Hao Wu et.al. | 2504.01619 | null |
2025-04-02 | RealityAvatar: Towards Realistic Loose Clothing Modeling in Animatable 3D Gaussian Avatars | Yahui Li et.al. | 2504.01559 | null |
2025-04-02 | High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model | Yiyang Shen et.al. | 2504.01512 | null |
2025-04-02 | Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment | Ziteng Cui et.al. | 2504.01503 | link |
2025-04-02 | 3D Gaussian Inverse Rendering with Approximated Global Illumination | Zirui Wu et.al. | 2504.01358 | null |
2025-03-31 | Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views | Chong Bao et.al. | 2503.24382 | null |
2025-03-31 | ERUPT: Efficient Rendering with Unposed Patch Transformer | Maxim V. Shugaev et.al. | 2503.24374 | null |
2025-03-31 | StochasticSplats: Stochastic Rasterization for Sorting-Free 3D Gaussian Splatting | Shakiba Kheradmand et.al. | 2503.24366 | null |
2025-04-01 | Visual Acoustic Fields | Yuelei Li et.al. | 2503.24270 | null |
2025-03-31 | DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting | Seungjun Lee et.al. | 2503.24210 | null |
2025-03-31 | Learning 3D-Gaussian Simulators from RGB Videos | Mikel Zhobro et.al. | 2503.24009 | null |
2025-03-31 | ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image | Tianyi Gong et.al. | 2503.23881 | null |
2025-03-30 | Gaussian Blending Unit: An Edge GPU Plug-in for Real-Time Gaussian-Based Rendering in AR/VR | Zhifan Ye et.al. | 2503.23625 | null |
2025-03-30 | Enhancing 3D Gaussian Splatting Compression via Spatial Condition-based Prediction | Jingui Ma et.al. | 2503.23337 | null |
2025-03-30 | ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning | Zhenyang Liu et.al. | 2503.23297 | null |
2025-03-28 | TranSplat: Lighting-Consistent Cross-Scene Object Transfer with 3D Gaussian Splatting | Boyang et.al. | 2503.22676 | null |
2025-03-28 | Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis | Shuai Shen et.al. | 2503.22605 | null |
2025-03-28 | EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting | Xu Wang et.al. | 2503.22437 | link |
2025-03-28 | AH-GS: Augmented 3D Gaussian Splatting for High-Frequency Detail Representation | Chenyang Xu et.al. | 2503.22324 | null |
2025-03-28 | Follow Your Motion: A Generic Temporal Consistency Portrait Editing Framework with Trajectory Guidance | Haijie Yang et.al. | 2503.22225 | null |
2025-03-28 | ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting | Wenjie Liu et.al. | 2503.22218 | null |
2025-03-28 | Segment then Splat: A Unified Approach for 3D Open-Vocabulary Segmentation based on Gaussian Splatting | Yiren Lu et.al. | 2503.22204 | null |
2025-03-28 | Disentangled 4D Gaussian Splatting: Towards Faster and More Efficient Dynamic Scene Rendering | Hao Feng et.al. | 2503.22159 | null |
2025-03-27 | X $^{2}$ -Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction | Weihao Yu et.al. | 2503.21779 | null |
2025-03-27 | Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying | Hairong Yin et.al. | 2503.21767 | null |
2025-03-27 | RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting | Qiyu Dai et.al. | 2503.21442 | null |
2025-03-28 | LandMarkSystem Technical Report | Zhenxiang Ma et.al. | 2503.21364 | link |
2025-03-27 | Frequency-Aware Gaussian Splatting Decomposition | Yishai Lavi et.al. | 2503.21226 | null |
2025-03-27 | StyledStreets: Multi-style Street Simulator with Spatial and Temporal Consistency | Yuyin Chen et.al. | 2503.21104 | null |
2025-03-26 | PGC: Physics-Based Gaussian Cloth from a Single Pose | Michelle Guo et.al. | 2503.20779 | null |
2025-03-28 | Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields | Shijie Zhou et.al. | 2503.20776 | null |
2025-03-26 | TC-GS: Tri-plane based compression for 3D Gaussian Splatting | Taorui Wang et.al. | 2503.20221 | link |
2025-03-26 | EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis | Sheng Miao et.al. | 2503.20168 | null |
2025-03-25 | Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields | Navami Kairanda et.al. | 2503.19976 | null |
2025-03-26 | A Survey on Event-driven 3D Reconstruction: Development under Different Categories | Chuanzhi Xu et.al. | 2503.19753 | null |
2025-03-25 | High-Quality Spatial Reconstruction and Orthoimage Generation Using Efficient 2D Gaussian Splatting | Qian Wang et.al. | 2503.19703 | null |
2025-03-25 | GaussianUDF: Inferring Unsigned Distance Functions through 3D Gaussian Splatting | Shujuan Li et.al. | 2503.19458 | null |
2025-03-25 | SparseGS-W: Sparse-View 3D Gaussian Splatting in the Wild with Generative Priors | Yiqing Li et.al. | 2503.19452 | null |
2025-03-26 | COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting | Jiaxin Zhang et.al. | 2503.19443 | link |
2025-03-25 | From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting | Zhiwei Huang et.al. | 2503.19358 | null |
2025-03-25 | Divide-and-Conquer: Dual-Hierarchical Optimization for Semantic 4D Gaussian Spatting | Zhiying Yan et.al. | 2503.19332 | null |
2025-03-25 | MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection | Jee Won Lee et.al. | 2503.19330 | null |
2025-03-25 | HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting | Xinpeng Liu et.al. | 2503.19232 | link |
2025-03-24 | NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting | Yulong Zheng et.al. | 2503.18794 | null |
2025-03-24 | GS-Marker: Generalizable and Robust Watermarking for 3D Gaussian Splatting | Lijiang Li et.al. | 2503.18718 | null |
2025-03-24 | Hardware-Rasterized Ray-Based Gaussian Splatting | Samuel Rota Bulò et.al. | 2503.18682 | null |
2025-03-24 | LLGS: Unsupervised Gaussian Splatting for Image Enhancement and Reconstruction in Pure Dark Environment | Haoran Wang et.al. | 2503.18640 | null |
2025-03-25 | StableGS: A Floater-Free Framework for 3D Gaussian Splatting | Luchao Wang et.al. | 2503.18458 | null |
2025-03-24 | 4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video | Qiang Hu et.al. | 2503.18421 | null |
2025-03-24 | DashGaussian: Optimizing 3D Gaussian Splatting in 200 Seconds | Youyu Chen et.al. | 2503.18402 | null |
2025-03-24 | GI-SLAM: Gaussian-Inertial SLAM | Xulang Liu et.al. | 2503.18275 | null |
2025-03-23 | Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving | Junhao Ge et.al. | 2503.18108 | link |
2025-03-23 | PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding | Hongjia Zhai et.al. | 2503.18107 | null |
2025-03-21 | TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting | Jianchuan Chen et.al. | 2503.17032 | null |
2025-03-21 | Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting | Jinbo Yan et.al. | 2503.16979 | link |
2025-03-21 | DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery | Jiadong Tang et.al. | 2503.16964 | null |
2025-03-21 | Optimized Minimal 3D Gaussian Splatting | Joo Chan Lee et.al. | 2503.16924 | null |
2025-03-20 | SAGE: Semantic-Driven Adaptive Gaussian Splatting in Extended Reality | Chiara Schiavo et.al. | 2503.16747 | null |
2025-03-20 | 4D Gaussian Splatting SLAM | Yanyan Li et.al. | 2503.16710 | null |
2025-03-20 | GauRast: Enhancing GPU Triangle Rasterizers to Accelerate 3D Gaussian Splatting | Sixu Li et.al. | 2503.16681 | null |
2025-03-20 | 1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering | Yuheng Yuan et.al. | 2503.16422 | null |
2025-03-20 | M3: 3D-Spatial MultiModal Memory | Xueyan Zou et.al. | 2503.16413 | link |
2025-03-20 | Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images | Shengjun Zhang et.al. | 2503.16338 | null |
2025-03-20 | OccluGaussian: Occlusion-Aware Gaussian Splatting for Large Scene Reconstruction and Rendering | Shiyong Liu et.al. | 2503.16177 | null |
2025-03-20 | Enhancing Close-up Novel View Synthesis via Pseudo-labeling | Jiatong Xia et.al. | 2503.15908 | link |
2025-03-20 | VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling | Hyojun Go et.al. | 2503.15855 | null |
2025-03-20 | BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting | Yiren Lu et.al. | 2503.15835 | null |
2025-03-18 | HandSplat: Embedding-Driven Gaussian Splatting for High-Fidelity Hand Rendering | Yilan Dong et.al. | 2503.14736 | null |
2025-03-18 | SplatVoxel: History-Aware Novel View Streaming without Temporal Training | Yiming Wang et.al. | 2503.14698 | null |
2025-03-18 | Optimized 3D Gaussian Splatting using Coarse-to-Fine Image Frequency Modulation | Umar Farooq et.al. | 2503.14475 | null |
2025-03-18 | Improving Adaptive Density Control for 3D Gaussian Splatting | Glenn Grubert et.al. | 2503.14274 | link |
2025-03-18 | RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images | Junjin Xiao et.al. | 2503.14198 | link |
2025-03-18 | Lightweight Gradient-Aware Upscaling of 3D Gaussian Splatting Images | Simon Niedermayr et.al. | 2503.14171 | null |
2025-03-18 | Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting | Runsong Zhu et.al. | 2503.14029 | link |
2025-03-18 | Light4GS: Lightweight Compact 4D Gaussian Splatting Generation via Context Model | Mufan Liu et.al. | 2503.13948 | null |
2025-03-17 | Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors | Katja Schwarz et.al. | 2503.13272 | null |
2025-03-17 | DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction | Rui Wang et.al. | 2503.13176 | null |
2025-03-17 | Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization | Yiwei Xu et.al. | 2503.13086 | null |
2025-03-17 | CAT-3DGS Pro: A New Benchmark for Efficient 3DGS Compression | Yu-Ting Zhan et.al. | 2503.12862 | null |
2025-03-17 | CompMarkGS: Robust Watermarking for Compression 3D Gaussian Splatting | Sumin In et.al. | 2503.12836 | null |
2025-03-17 | AV-Surf: Surface-Enhanced Geometry-Aware Novel-View Acoustic Synthesis | Hadam Baek et.al. | 2503.12806 | null |
2025-03-16 | Deblur Gaussian Splatting SLAM | Francesco Girlanda et.al. | 2503.12572 | null |
2025-03-16 | MTGS: Multi-Traversal Gaussian Splatting | Tianyu Li et.al. | 2503.12552 | link |
2025-03-16 | SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs | Guibiao Liao et.al. | 2503.12535 | null |
2025-03-16 | VRsketch2Gaussian: 3D VR Sketch Guided 3D Object Generation with Gaussian Splatting | Songen Gu et.al. | 2503.12383 | null |
2025-03-14 | Advancing 3D Gaussian Splatting Editing with Complementary and Consensus Information | Xuanqi Zhang et.al. | 2503.11601 | null |
2025-03-14 | EgoSplat: Open-Vocabulary Egocentric Scene Understanding with Language Embedded 3D Gaussian Splatting | Di Li et.al. | 2503.11345 | null |
2025-03-14 | Uncertainty-Aware Normal-Guided Gaussian Splatting for Surface Reconstruction from Sparse Image Sequences | Zhen Tan et.al. | 2503.11172 | null |
2025-03-13 | RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors | Avinash Paliwal et.al. | 2503.10860 | link |
2025-03-13 | LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds | Lingteng Qiu et.al. | 2503.10625 | link |
2025-03-13 | MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction | Yingshuang Zou et.al. | 2503.10604 | null |
2025-03-13 | 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models | Wanhua Li et.al. | 2503.10437 | link |
2025-03-13 | VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames | Zhiqi Li et.al. | 2503.10286 | null |
2025-03-13 | ROODI: Reconstructing Occluded Objects with Denoising Inpainters | Yeonjin Chang et.al. | 2503.10256 | null |
2025-03-13 | GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction | Jianheng Liu et.al. | 2503.10170 | link |
2025-03-13 | 3D Student Splatting and Scooping | Jialin Zhu et.al. | 2503.10148 | link |
2025-03-13 | GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping | Jinfeng Liu et.al. | 2503.10143 | null |
2025-03-12 | Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation | Máté Tóth et.al. | 2503.09464 | null |
2025-03-12 | Online Language Splatting | Saimouli Katragadda et.al. | 2503.09447 | null |
2025-03-12 | Close-up-GS: Enhancing Close-Up View Synthesis in 3D Gaussian Splatting with Progressive Self-Training | Jiatong Xia et.al. | 2503.09396 | null |
2025-03-12 | GASPACHO: Gaussian Splatting for Controllable Humans and Objects | Aymen Mir et.al. | 2503.09342 | null |
2025-03-12 | SDD-4DGS: Static-Dynamic Aware Decoupling in Gaussian Splatting for 4D Scene Reconstruction | Dai Sun et.al. | 2503.09332 | null |
2025-03-12 | Motion Blender Gaussian Splatting for Dynamic Reconstruction | Xinyu Zhang et.al. | 2503.09040 | link |
2025-03-11 | PCGS: Progressive Compression of 3D Gaussian Splatting | Yihang Chen et.al. | 2503.08511 | link |
2025-03-11 | TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting | Fengyi Zhang et.al. | 2503.08485 | null |
2025-03-11 | Mitigating Ambiguities in 3D Classification with Gaussian Splatting | Ruiqi Zhang et.al. | 2503.08352 | null |
2025-03-11 | Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios | Zikang Yuan et.al. | 2503.08317 | null |
2025-03-11 | ELECTRA: A Symmetry-breaking Cartesian Network for Charge Density Prediction with Floating Orbitals | Jonas Elsborg et.al. | 2503.08305 | link |
2025-03-11 | HRAvatar: High-Quality and Relightable Gaussian Head Avatar | Dongbin Zhang et.al. | 2503.08224 | null |
2025-03-11 | S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction | Guangting Zheng et.al. | 2503.08217 | null |
2025-03-11 | Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming | Jiaxuan Zhu et.al. | 2503.08166 | null |
2025-03-11 | ArticulatedGS: Self-supervised Digital Twin Modeling of Articulated Objects using 3D Gaussian Splatting | Junfu Guo et.al. | 2503.08135 | null |
2025-03-11 | MVGSR: Multi-View Consistency Gaussian Splatting for Robust Surface Reconstruction | Chenfeng Hou et.al. | 2503.08093 | null |
2025-03-10 | SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting | Jiahui Zhang et.al. | 2503.07476 | null |
2025-03-10 | EigenGS Representation: From Eigenspace to Gaussian Image Space | Lo-Wei Tai et.al. | 2503.07446 | null |
2025-03-10 | All That Glitters Is Not Gold: Key-Secured 3D Secrets within 3D Gaussian Splatting | Yan Ren et.al. | 2503.07191 | link |
2025-03-10 | Multi-Modal 3D Mesh Reconstruction from Images and Text | Melvin Reka et.al. | 2503.07190 | null |
2025-03-10 | Frequency-Aware Density Control via Reparameterization for High-Quality Rendering of 3D Gaussian Splatting | Zhaojie Zeng et.al. | 2503.07000 | link |
2025-03-10 | DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation | Xiaoliang Ju et.al. | 2503.06900 | null |
2025-03-10 | ActiveInitSplat: How Active Image Selection Helps Gaussian Splatting | Konstantinos D. Polyzos et.al. | 2503.06859 | null |
2025-03-09 | Gaussian RBFNet: Gaussian Radial Basis Functions for Fast and Accurate Representation and Reconstruction of Neural Fields | Abdelaziz Bouzidi et.al. | 2503.06762 | null |
2025-03-09 | CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving | Rui Song et.al. | 2503.06744 | null |
2025-03-09 | D3DR: Lighting-Aware Object Insertion in Gaussian Splatting | Vsevolod Skorokhodov et.al. | 2503.06740 | null |
2025-03-07 | D2GV: Deformable 2D Gaussian Splatting for Video Representation in 400FPS | Mufan Liu et.al. | 2503.05600 | link |
2025-03-07 | Free Your Hands: Lightweight Relightable Turntable Capture Pipeline | Jiahui Fan et.al. | 2503.05511 | null |
2025-03-07 | LiDAR-enhanced 3D Gaussian Splatting Mapping | Jian Shen et.al. | 2503.05425 | null |
2025-03-07 | Self-Modeling Robots by Photographing | Kejun Hu et.al. | 2503.05398 | null |
2025-03-07 | CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images | Jungho Lee et.al. | 2503.05332 | link |
2025-03-07 | STGA: Selective-Training Gaussian Head Avatars | Hanzhi Guo et.al. | 2503.05196 | null |
2025-03-07 | Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects | Justin Yu et.al. | 2503.05189 | null |
2025-03-07 | MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions | Qingyuan Zhou et.al. | 2503.05182 | null |
2025-03-07 | SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting | Linqi Yang et.al. | 2503.05174 | null |
2025-03-07 | SeeLe: A Unified Acceleration Framework for Real-Time Gaussian Splatting | Xiaotong Huang et.al. | 2503.05168 | null |
2025-03-06 | GaussianVideo: Efficient Video Representation and Compression by Gaussian Splatting | Inseo Lee et.al. | 2503.04333 | null |
2025-03-06 | S2Gaussian: Sparse-View Super-Resolution 3D Gaussian Splatting | Yecong Wan et.al. | 2503.04314 | null |
2025-03-06 | Instrument-Splatting: Controllable Photorealistic Reconstruction of Surgical Instruments Using Gaussian Splatting | Shuojue Yang et.al. | 2503.04082 | null |
2025-03-06 | Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details | Yifei Gao et.al. | 2503.04037 | null |
2025-03-06 | GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding | Xihan Wang et.al. | 2503.04034 | null |
2025-03-06 | GRaD-Nav: Efficiently Learning Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics | Qianzhong Chen et.al. | 2503.03984 | null |
2025-03-05 | LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation | Qian Feng et.al. | 2503.03890 | null |
2025-03-05 | NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics | Kun Yang et.al. | 2503.03115 | null |
2025-03-04 | 2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting | Qipeng Yan et.al. | 2503.02452 | null |
2025-03-04 | DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting | Haoyuan Li et.al. | 2503.02223 | link |
2025-03-03 | Morpheus: Text-Driven 3D Gaussian Splat Shape and Color Stylization | Jamie Wynn et.al. | 2503.02009 | null |
2025-03-03 | Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models | Jay Zhangjie Wu et.al. | 2503.01774 | null |
2025-03-03 | OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding | Dianyi Yang et.al. | 2503.01646 | null |
2025-03-03 | LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training | Kaimin Liao et.al. | 2503.01199 | link |
2025-03-03 | FGS-SLAM: Fourier-based Gaussian Splatting for Real-time SLAM with Sparse and Dense Map Fusion | Yansong Xu et.al. | 2503.01109 | null |
2025-03-02 | Evolving High-Quality Rendering and Reconstruction in a Unified Framework with Contribution-Adaptive Regularization | You Shen et.al. | 2503.00881 | null |
2025-03-02 | Vid2Fluid: 3D Dynamic Fluid Assets from Single-View Videos with Generative Gaussian Splatting | Zhiwei Zhao et.al. | 2503.00868 | null |
2025-03-02 | PSRGS:Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery | BoCheng Li et.al. | 2503.00848 | null |
2025-03-03 | FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering | Jingqiu Zhou et.al. | 2502.21093 | null |
2025-02-28 | EndoPBR: Material and Lighting Estimation for Photorealistic Surgical Simulations via Physically-based Rendering | John J. Han et.al. | 2502.20669 | null |
2025-02-27 | ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting | Dexter Ong et.al. | 2502.20386 | null |
2025-02-27 | Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling | Hanyang Kong et.al. | 2502.20378 | null |
2025-02-27 | No Parameters, No Problem: 3D Gaussian Splatting without Camera Intrinsics and Extrinsics | Dongbo Shi et.al. | 2502.19800 | null |
2025-02-27 | Open-Vocabulary Semantic Part Segmentation of 3D Human | Keito Suzuki et.al. | 2502.19782 | null |
2025-02-26 | Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting | Yu Liu et.al. | 2502.19459 | link |
2025-02-26 | Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions | Muhammad Salman Ali et.al. | 2502.19457 | null |
2025-02-26 | Does 3D Gaussian Splatting Need Accurate Volumetric Rendering? | Adam Celarek et.al. | 2502.19318 | link |
2025-02-28 | OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation | Yunpeng Gao et.al. | 2502.18041 | null |
2025-02-27 | UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting | Haoyuan Li et.al. | 2502.17860 | null |
2025-02-24 | Laplace-Beltrami Operator for Gaussian Splatting | Hongyu Zhou et.al. | 2502.17531 | null |
2025-02-24 | Graph-Guided Scene Reconstruction from Images with 3D Gaussian Splatting | Chong Cheng et.al. | 2502.17377 | null |
2025-02-25 | GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow | Simon Boeder et.al. | 2502.17288 | null |
2025-02-24 | VR-Pipe: Streamlining Hardware Graphics Pipeline for Volume Rendering | Junseo Lee et.al. | 2502.17078 | null |
2025-02-23 | GS-TransUNet: Integrated 2D Gaussian Splatting and Transformer UNet for Accurate Skin Lesion Analysis | Anand Kumar et.al. | 2502.16748 | null |
2025-02-23 | Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration | Kim Jun-Seong et.al. | 2502.16652 | null |
2025-02-23 | Dragen3D: Multiview Geometry Consistent 3D Gaussian Generation with Drag-Based Control | Jinbo Yan et.al. | 2502.16475 | null |
2025-02-21 | RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes | Sicheng Yu et.al. | 2502.15633 | null |
2025-02-24 | DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation | Luzhou Ge et.al. | 2502.15309 | link |
2025-02-20 | GS-Cache: A GS-Cache Inference Framework for Large-scale Gaussian Splatting Models | Miao Tao et.al. | 2502.14938 | null |
2025-02-20 | Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2502.14931 | null |
2025-02-20 | CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting | Qilin Zhang et.al. | 2502.14684 | link |
2025-02-20 | OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving | Yedong Shen et.al. | 2502.14235 | null |
2025-02-19 | GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian | Bang Du et.al. | 2502.14129 | null |
2025-02-19 | Inter3D: A Benchmark and Strong Baseline for Human-Interactive 3D Object Reconstruction | Gan Chen et.al. | 2502.14004 | link |
2025-02-19 | 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments | Vincent Ress et.al. | 2502.13803 | null |
2025-02-18 | GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis | Pedro Martin et.al. | 2502.13196 | null |
2025-02-18 | RadSplatter: Extending 3D Gaussian Splatting to Radio Frequencies for Wireless Radiomap Extrapolation | Yiheng Wang et.al. | 2502.12686 | null |
2025-02-17 | PUGS: Zero-shot Physical Understanding with Gaussian Splatting | Yinghao Shuai et.al. | 2502.12231 | link |
2025-02-17 | 3D Gaussian Inpainting with Depth-Guided Cross-View Consistency | Sheng-Yu Huang et.al. | 2502.11801 | null |
2025-02-17 | Exploring the Versal AI Engine for 3D Gaussian Splatting | Kotaro Shimamura et.al. | 2502.11782 | null |
2025-02-17 | GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text | Gyumin Shim et.al. | 2502.11642 | null |
2025-02-16 | OMG: Opacity Matters in Material Modeling with Gaussian Splatting | Silong Yong et.al. | 2502.10988 | null |
2025-02-16 | GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting | Zelin Zhou et.al. | 2502.10975 | null |
2025-02-15 | E-3DGS: Event-Based Novel View Rendering of Large-Scale Scenes Using 3D Gaussian Splatting | Sohaib Zahid et.al. | 2502.10827 | null |
2025-02-13 | X-SG $^2$ S: Safe and Generalizable Gaussian Splatting with X-dimensional Watermarks | Zihang Cheng et.al. | 2502.10475 | null |
2025-02-13 | Self-Calibrating Gaussian Splatting for Large Field of View Reconstruction | Youming Deng et.al. | 2502.09563 | null |
2025-02-13 | DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior | Mingrui Li et.al. | 2502.09111 | null |
2025-02-13 | Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting | Lingting Zhu et.al. | 2502.09039 | link |
2025-02-12 | Interactive Holographic Visualization for 3D Facial Avatar | Tri Tung Nguyen Nguyen et.al. | 2502.08085 | null |
2025-02-11 | TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation | Jeongyun Kim et.al. | 2502.07840 | link |
2025-02-11 | MeshSplats: Mesh-Based Rendering with Gaussian Splatting Initialization | Rafał Tobiasz et.al. | 2502.07754 | link |
2025-02-11 | Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors | Lin-Zhuo Chen et.al. | 2502.07615 | null |
2025-02-10 | Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC | Siwei Meng et.al. | 2502.07007 | null |
2025-02-10 | SIREN: Semantic, Initialization-Free Registration of Multi-Robot Gaussian Splatting Maps | Ola Shorinwa et.al. | 2502.06519 | null |
2025-02-10 | Three-Dimensional MRI Reconstruction with Gaussian Representations: Tackling the Undersampling Problem | Tengya Peng et.al. | 2502.06510 | null |
2025-02-11 | Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform | Kyle Gao et.al. | 2502.05769 | null |
2025-02-09 | PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map | Yue Pan et.al. | 2502.05752 | link |
2025-02-08 | Vision-in-the-loop Simulation for Deep Monocular Pose Estimation of UAV in Ocean Environment | Maneesha Wickramasuriya et.al. | 2502.05409 | null |
2025-02-07 | AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting | Chung-Ho Wu et.al. | 2502.05176 | null |
2025-02-07 | GaussRender: Learning 3D Occupancy with Gaussian Rendering | Loick Chambon et.al. | 2502.05040 | link |
2025-02-07 | OccGS: Zero-shot 3D Occupancy Reconstruction with Semantic and Geometric-Aware Gaussian Splatting | Xiaoyu Zhou et.al. | 2502.04981 | null |
2025-02-07 | PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression | Feifei Li et.al. | 2502.04843 | null |
2025-02-07 | SC-OmniGS: Self-Calibrating Omnidirectional Gaussian Splatting | Huajian Huang et.al. | 2502.04734 | null |
2025-02-07 | High-Speed Dynamic 3D Imaging with Sensor Fusion Splatting | Zihao Zou et.al. | 2502.04630 | null |
2025-02-05 | GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM | Mingrui Li et.al. | 2502.03228 | null |
2025-02-05 | GP-GS: Gaussian Processes for Enhanced Gaussian Splatting | Zhihao Guo et.al. | 2502.02283 | link |
2025-02-04 | LAYOUTDREAMER: Physics-guided Layout for Text-to-3D Compositional Scene Generation | Yang Zhou et.al. | 2502.01949 | null |
2025-02-03 | UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping | Aashish Rai et.al. | 2502.01846 | null |
2025-02-03 | Scalable 3D Gaussian Splatting-Based RF Signal Spatial Propagation Modeling | Kang Yang et.al. | 2502.01826 | null |
2025-02-03 | VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion | Shaoting Zhu et.al. | 2502.01536 | null |
2025-02-03 | Radiant Foam: Real-Time Differentiable Ray Tracing | Shrisudhan Govindarajan et.al. | 2502.01157 | null |
2025-02-02 | EmoTalkingGaussian: Continuous Emotion-conditioned Talking Head Synthesis | Junuk Cha et.al. | 2502.00654 | null |
2025-01-31 | Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation | Rohan Chacko et.al. | 2502.00173 | null |
2025-01-31 | Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping | Yiming Huang et.al. | 2501.19319 | link |
2025-01-31 | RaySplats: Ray Tracing based Gaussian Splatting | Krzysztof Byrski et.al. | 2501.19196 | link |
2025-01-31 | JGHand: Joint-Driven Animatable Hand Avater via 3D Gaussian Splatting | Zhoutao Sun et.al. | 2501.19088 | null |
2025-01-30 | Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting | Yansong Qu et.al. | 2501.18672 | null |
2025-01-29 | 3D Reconstruction of Shoes for Augmented Reality | Pratik Shrestha et.al. | 2501.18643 | null |
2025-01-31 | VoD-3DGS: View-opacity-Dependent 3D Gaussian Splatting | Mateusz Nowak et.al. | 2501.17978 | null |
2025-01-29 | CrowdSplat: Exploring Gaussian Splatting For Crowd Rendering | Xiaohan Sun et.al. | 2501.17792 | link |
2025-01-29 | FeatureGS: Eigenvalue-Feature Optimization in 3D Gaussian Splatting for Geometrically Accurate and Artifact-Reduced Reconstruction | Miriam Jäger et.al. | 2501.17655 | null |
2025-01-28 | Evaluating CrowdSplat: Perceived Level of Detail for Gaussian Crowds | Xiaohan Sun et.al. | 2501.17085 | null |
2025-01-28 | DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation | Chenguo Lin et.al. | 2501.16764 | null |
2025-01-26 | GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting | Jiajun Dong et.al. | 2501.15619 | link |
2025-01-25 | Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos | Zhen-Hui Dong et.al. | 2501.15096 | null |
2025-01-25 | HuGDiffusion: Generalizable Single-Image Human Rendering via 3D Gaussian Diffusion | Yingzhi Tang et.al. | 2501.15008 | null |
2025-01-24 | Trick-GS: A Balanced Bag of Tricks for Efficient Gaussian Splatting | Anil Armagan et.al. | 2501.14534 | null |
2025-01-24 | Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video | Xiaohao Xu et.al. | 2501.14319 | link |
2025-01-24 | Dense-SfM: Structure from Motion with Dense Consistent Matching | JongMin Lee et.al. | 2501.14277 | null |
2025-01-24 | Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images | Yihui Li et.al. | 2501.14231 | null |
2025-01-24 | HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting | Javier Yu et.al. | 2501.14147 | null |
2025-01-23 | GoDe: Gaussians on Demand for Progressive Level of Detail and Scalable Compression | Francesco Di Sario et.al. | 2501.13558 | null |
2025-01-23 | MultiDreamer3D: Multi-concept 3D Customization with Concept-Aware Diffusion Guidance | Wooseok Song et.al. | 2501.13449 | null |
2025-01-23 | GeomGS: LiDAR-Guided Geometry-Aware Gaussian Splatting for Robot Localization | Jaewon Lee et.al. | 2501.13417 | null |
2025-01-23 | VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM | Gyuhyeon Pak et.al. | 2501.13402 | null |
2025-01-23 | Deblur-Avatar: Animatable Avatars from Motion-Blurred Monocular Videos | Xianrui Luo et.al. | 2501.13335 | null |
2025-01-22 | Sketch and Patch: Efficient 3D Gaussian Representation for Man-Made Scenes | Yuang Shi et.al. | 2501.13045 | null |
2025-01-21 | DARB-Splatting: Generalizing Splatting with Decaying Anisotropic Radial Basis Functions | Vishagar Arunan et.al. | 2501.12369 | null |
2025-01-22 | HAC++: Towards 100X Compression of 3D Gaussian Splatting | Yihang Chen et.al. | 2501.12255 | link |
2025-01-22 | GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting | Longan Wang et.al. | 2501.12060 | null |
2025-01-20 | See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic Regularization | Zongqi He et.al. | 2501.11508 | null |
2025-01-19 | RDG-GS: Relative Depth Guidance with Gaussian Splatting for Real-time Sparse-View 3D Rendering | Chenlu Zhan et.al. | 2501.11102 | null |
2025-01-18 | Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting | Jiaqi Lin et.al. | 2501.10788 | null |
2025-01-15 | BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation | Xiaolu Hou et.al. | 2501.10462 | link |
2025-01-20 | GSTAR: Gaussian Surface Tracking and Reconstruction | Chengwei Zheng et.al. | 2501.10283 | null |
2025-01-16 | Creating Virtual Environments with 3D Gaussian Splatting: A Comparative Study | Shi Qiu et.al. | 2501.09302 | null |
2025-01-15 | CityLoc: 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation | Qi Ma et.al. | 2501.08982 | null |
2025-01-15 | GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping | Sheng Hong et.al. | 2501.08672 | null |
2025-01-14 | 3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering | Meenakshi Krishnan et.al. | 2501.08370 | null |
2025-01-14 | VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes | Ke Wu et.al. | 2501.08286 | null |
2025-01-14 | Object-Centric 2D Gaussian Splatting: Background Removal and Occlusion-Aware Pruning for Compact Object Models | Marcel Rogge et.al. | 2501.08174 | null |
2025-01-13 | Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes | Yuhang Zhang et.al. | 2501.08072 | null |
2025-01-13 | UnCommon Objects in 3D | Xingchen Liu et.al. | 2501.07574 | link |
2025-01-13 | 3DGS-to-PC: Convert a 3D Gaussian Splatting Scene into a Dense Point Cloud or Mesh | Lewis A G Stuart et.al. | 2501.07478 | link |
2025-01-13 | RMAvatar: Photorealistic Human Avatar Reconstruction from Monocular Video Based on Rectified Mesh-embedded Gaussians | Sen Peng et.al. | 2501.07104 | null |
2025-01-14 | SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting | Yue Hu et.al. | 2501.07015 | null |
2025-01-12 | CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications | Xinyi Zheng et.al. | 2501.06927 | link |
2025-01-12 | Synthetic Prior for Few-Shot Drivable Head Avatar Inversion | Wojciech Zielonka et.al. | 2501.06903 | null |
2025-01-12 | ActiveGAMER: Active GAussian Mapping through Efficient Rendering | Liyan Chen et.al. | 2501.06897 | null |
2025-01-12 | Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution | Du Chen et.al. | 2501.06838 | link |
2025-01-12 | F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Consistent Gaussian Splatting | Yuxin Wang et.al. | 2501.06714 | null |
2025-01-11 | MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis | Hengyuan Zhang et.al. | 2501.06660 | null |
2025-01-10 | Locality-aware Gaussian Compression for Fast and High-quality Rendering | Seungjoo Shin et.al. | 2501.05757 | null |
2025-01-09 | Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation | Xuyi Meng et.al. | 2501.05427 | null |
2025-01-09 | Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance | Dimitrios Gerogiannis et.al. | 2501.05379 | null |
2025-01-09 | Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping | Wen Tianci et.al. | 2501.05242 | null |
2025-01-08 | GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting | Andrew Bond et.al. | 2501.04782 | null |
2025-01-08 | FatesGS: Fast and Accurate Sparse-View Surface Reconstruction using Gaussian Splatting with Depth-Feature Consistency | Han Huang et.al. | 2501.04628 | null |
2025-01-07 | ZDySS – Zero-Shot Dynamic Scene Stylization using Gaussian Splatting | Abhishek Saroha et.al. | 2501.03875 | null |
2025-01-07 | MoDec-GS: Global-to-Local Motion Decomposition and Temporal Interval Adjustment for Compact Dynamic 3D Gaussian Splatting | Sangwoon Kwak et.al. | 2501.03714 | null |
2025-01-07 | DehazeGS: Seeing Through Fog with 3D Gaussian Splatting | Jinze Yu et.al. | 2501.03659 | null |
2025-01-07 | ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting | Yifeng Yang et.al. | 2501.03605 | link |
2025-01-06 | Compression of 3D Gaussian Splatting with Optimized Feature Planes and Standard Video Codecs | Soonbin Lee et.al. | 2501.03399 | null |
2025-01-06 | Gaussian Masked Autoencoders | Jathushan Rajasegaran et.al. | 2501.03229 | null |
2025-01-06 | HOGSA: Bimanual Hand-Object Interaction Understanding with 3D Gaussian Splatting Based Data Augmentation | Wentian Qu et.al. | 2501.02845 | null |
2025-01-05 | GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking | Weikang Bian et.al. | 2501.02690 | null |
2025-01-03 | EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation | Siyuan Huang et.al. | 2501.01895 | null |
2025-01-03 | Cloth-Splatting: 3D Cloth State Estimation from RGB Supervision | Alberta Longhini et.al. | 2501.01715 | null |
2025-01-03 | CrossView-GS: Cross-view Gaussian Splatting For Large-scale Scene Reconstruction | Chenhao Zhang et.al. | 2501.01695 | null |
2025-01-03 | PG-SAG: Parallel Gaussian Splatting for Fine-Grained Large-Scale Urban Buildings Reconstruction via Semantic-Aware Grouping | Tengfei Wang et.al. | 2501.01677 | link |
2025-01-02 | Deformable Gaussian Splatting for Efficient and High-Fidelity Reconstruction of Surgical Scenes | Jiwei Shan et.al. | 2501.01101 | null |
2025-01-02 | EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy | Ao Gao et.al. | 2501.01003 | null |
2024-12-31 | Gaussian Building Mesh (GBM): Extract a Building’s 3D Mesh with Google Earth and Gaussian Splatting | Kyle Gao et.al. | 2501.00625 | null |
2024-12-31 | DreamDrive: Generative 4D Scene Modeling from Street View Images | Jiageng Mao et.al. | 2501.00601 | null |
2024-12-31 | PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM | Runnan Chen et.al. | 2501.00352 | null |
2024-12-31 | SG-Splatting: Accelerating 3D Gaussian Splatting with Spherical Gaussians | Yiwen Wang et.al. | 2501.00342 | null |
2024-12-30 | PERSE: Personalized 3D Generative Avatars from A Single Portrait | Hyunsoo Cha et.al. | 2412.21206 | null |
2024-12-30 | KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences | Keng-Wei Chang et.al. | 2412.20767 | null |
2024-12-30 | 4D Gaussian Splatting: Modeling Dynamic Scenes with Native 4D Primitives | Zeyu Yang et.al. | 2412.20720 | null |
2024-12-29 | MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks | Yifei Liu et.al. | 2412.20522 | link |
2024-12-28 | DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis | Kaijun Deng et.al. | 2412.20148 | link |
2024-12-28 | GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting | Atticus J. Zeller et.al. | 2412.20056 | link |
2024-12-27 | DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction | Kai Xu et.al. | 2412.19584 | null |
2024-12-27 | Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images | Xudong Cai et.al. | 2412.19518 | null |
2024-12-27 | Learning Radiance Fields from a Single Snapshot Compressive Image | Yunhao Li et.al. | 2412.19483 | null |
2024-12-26 | BeSplat – Gaussian Splatting from a Single Blurry Image and Event Stream | Gopi Raju Matta et.al. | 2412.19370 | link |
2024-12-26 | Reflective Gaussian Splatting | Yuxuan Yao et.al. | 2412.19282 | null |
2024-12-26 | Generating Editable Head Avatars with 3D Gaussian GANs | Guohao Li et.al. | 2412.19149 | link |
2024-12-26 | CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting | Siyu Jiao et.al. | 2412.19142 | null |
2024-12-26 | MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo | Byeonggwon Lee et.al. | 2412.19130 | null |
2024-12-25 | WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian Splatting | Chenghao Qian et.al. | 2412.18862 | link |
2024-12-25 | GSAVS: Gaussian Splatting-based Autonomous Vehicle Simulator | Rami Wilson et.al. | 2412.18816 | null |
2024-12-24 | Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation | Anselm Krainovic et.al. | 2412.18584 | null |
2024-12-24 | RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis | Yiling Yao et.al. | 2412.18380 | null |
2024-12-23 | FaceLift: Single Image to 3D Head with View Generation and GS-LRM | Weijie Lyu et.al. | 2412.17812 | null |
2024-12-23 | ActiveGS: Active Scene Reconstruction using Gaussian Splatting | Liren Jin et.al. | 2412.17769 | link |
2024-12-23 | GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance | Jingqiu Zhou et.al. | 2412.17715 | null |
2024-12-24 | LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding | Hao Li et.al. | 2412.17635 | null |
2024-12-23 | CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction | Yuanyuan Gao et.al. | 2412.17612 | null |
2024-12-23 | Exploring Dynamic Novel View Synthesis Technologies for Cinematography | Adrian Azzarelli et.al. | 2412.17532 | null |
2024-12-23 | Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling | Hao Gui et.al. | 2412.17378 | null |
2024-12-22 | GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs | Xingrui Wang et.al. | 2412.16932 | link |
2024-12-22 | GeoTexDensifier: Geometry-Texture-Aware Densification for High-Quality Photorealistic 3D Gaussian Splatting | Hanqing Jiang et.al. | 2412.16809 | null |
2024-12-21 | Topology-Aware 3D Gaussian Splatting: Leveraging Persistent Homology for Optimized Structural Integrity | Tianqi Shen et.al. | 2412.16619 | link |
2024-12-20 | CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images | Jungho Lee et.al. | 2412.16028 | null |
2024-12-20 | IRGS: Inter-Reflective Gaussian Splatting with 2D Gaussian Ray Tracing | Chun Gu et.al. | 2412.15867 | null |
2024-12-20 | AvatarPerfect: User-Assisted 3D Gaussian Splatting Avatar Refinement with Automatic Pose Suggestion | Jotaro Sakamiya et.al. | 2412.15609 | null |
2024-12-20 | EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene | Yixiong Huo et.al. | 2412.15550 | link |
2024-12-19 | LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction | Pou-Chun Kung et.al. | 2412.15447 | null |
2024-12-19 | SolidGS: Consolidating Gaussian Surfel Splatting for Sparse-View Surface Reconstruction | Zhuowen Shen et.al. | 2412.15400 | null |
2024-12-19 | SqueezeMe: Efficient Gaussian Avatars for VR | Shunsuke Saito et.al. | 2412.15171 | null |
2024-12-19 | Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination | Leonardo Barcellona et.al. | 2412.14957 | null |
2024-12-19 | GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting | Qianpu Sun et.al. | 2412.14579 | null |
2024-12-19 | Improving Geometry in Sparse-View 3DGS via Reprojection-based DoF Separation | Yongsung Kim et.al. | 2412.14568 | null |
2024-12-18 | GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians | Xiaobao Wei et.al. | 2412.13983 | link |
2024-12-18 | GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting | Yuning Peng et.al. | 2412.13654 | null |
2024-12-18 | 4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching | Fernando Amodeo et.al. | 2412.13639 | link |
2024-12-18 | Turbo-GS: Accelerating 3D Gaussian Fitting for High-Quality Radiance Fields | Tao Lu et.al. | 2412.13547 | null |
2024-12-18 | Vivar: A Generative AR System for Intuitive Multi-Modal Sensor Data Presentation | Yunqi Guo et.al. | 2412.13509 | null |
2024-12-17 | Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures | Guoxing Sun et.al. | 2412.13183 | null |
2024-12-17 | EOGS: Gaussian Splatting for Earth Observation | Luca Savant Aira et.al. | 2412.13047 | null |
2024-12-17 | 4DRGS: 4D Radiative Gaussian Splatting for Efficient 3D Vessel Reconstruction from Sparse-View Dynamic DSA Images | Zhentao Liu et.al. | 2412.12919 | link |
2024-12-17 | CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image | Wonseok Roh et.al. | 2412.12906 | null |
2024-12-17 | HyperGS: Hyperspectral 3D Gaussian Splatting | Christopher Thirgood et.al. | 2412.12849 | null |
2024-12-17 | Gaussian Billboards: Expressive 2D Gaussian Splatting with Textures | Sebastian Weiss et.al. | 2412.12734 | null |
2024-12-17 | 3DGUT: Enabling Distorted Cameras and Secondary Rays in Gaussian Splatting | Qi Wu et.al. | 2412.12507 | link |
2024-12-16 | PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Cheng Zhang et.al. | 2412.12096 | link |
2024-12-16 | Wonderland: Navigating 3D Scenes from a Single Image | Hanwen Liang et.al. | 2412.12091 | null |
2024-12-16 | GS-ProCams: Gaussian Splatting-based Projector-Camera Systems | Qingyue Deng et.al. | 2412.11762 | null |
2024-12-16 | Deformable Radial Kernel Splatting | Yi-Hua Huang et.al. | 2412.11752 | null |
2024-12-16 | SweepEvGS: Event-Based 3D Gaussian Splatting for Macro and Micro Radiance Field Rendering from a Single Sweep | Jingqian Wu et.al. | 2412.11579 | null |
2024-12-16 | EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting | Dong In Lee et.al. | 2412.11520 | null |
2024-12-14 | DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting | Luis Wiedmann et.al. | 2412.10972 | link |
2024-12-13 | SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians | Siyun Liang et.al. | 2412.10231 | null |
2024-12-13 | GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion | Jiapeng Tang et.al. | 2412.10209 | null |
2024-12-13 | TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views | Liang Zhao et.al. | 2412.10051 | link |
2024-12-13 | SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video | Jongmin Park et.al. | 2412.09982 | null |
2024-12-13 | RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting | Lizhi Bai et.al. | 2412.09868 | null |
2024-12-12 | MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction | Xiaohao Xu et.al. | 2412.09723 | link |
2024-12-12 | PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields | Sean Wu et.al. | 2412.09680 | link |
2024-12-12 | Feat2GS: Probing Visual Foundation Models with Gaussian Splatting | Yue Chen et.al. | 2412.09606 | null |
2024-12-12 | LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors | Yabo Chen et.al. | 2412.09597 | null |
2024-12-12 | FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction | Jiale Xu et.al. | 2412.09573 | null |
2024-12-12 | GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency | Dongyue Lu et.al. | 2412.09511 | link |
2024-12-12 | LIVE-GS: LLM Powers Interactive VR by Enhancing Gaussian Splatting | Haotian Mao et.al. | 2412.09176 | null |
2024-12-11 | SLGaussian: Fast Language Gaussian Splatting in Sparse Views | Kangjie Chen et.al. | 2412.08331 | null |
2024-12-11 | ProGDF: Progressive Gaussian Differential Field for Controllable and Flexible 3D Editing | Yian Zhao et.al. | 2412.08152 | null |
2024-12-10 | Diffusion-Based Attention Warping for Consistent 3D Scene Editing | Eyal Gomel et.al. | 2412.07984 | null |
2024-12-10 | GASP: Gaussian Avatars with Synthetic Priors | Jack Saunders et.al. | 2412.07739 | null |
2024-12-10 | Proc-GS: Procedural Building Generation for City Assembly with 3D Gaussians | Yixuan Li et.al. | 2412.07660 | null |
2024-12-10 | Faster and Better 3D Splatting via Group Training | Chengbo Wang et.al. | 2412.07608 | null |
2024-12-10 | ResGS: Residual Densification of 3D Gaussian for Efficient Detail Recovery | Yanzhe Lyu et.al. | 2412.07494 | null |
2024-12-10 | EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering | Toshiya Yura et.al. | 2412.07293 | null |
2024-12-09 | MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds | Zhenggang Tang et.al. | 2412.06974 | null |
2024-12-09 | Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video | Renlong Wu et.al. | 2412.06424 | link |
2024-12-09 | 4D Gaussian Splatting with Scale-aware Residual Field and Adaptive Optimization for Real-time Rendering of Temporally Complex Dynamic Scenes | Jinbo Yan et.al. | 2412.06299 | null |
2024-12-09 | Advancing Extended Reality with 3D Gaussian Splatting: Innovations and Prospects | Shi Qiu et.al. | 2412.06257 | null |
2024-12-09 | Splatter-360: Generalizable 360 $^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images | Zheng Chen et.al. | 2412.06250 | link |
2024-12-09 | Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction | Seungtae Nam et.al. | 2412.06234 | null |
2024-12-08 | Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation | Zipeng Qi et.al. | 2412.05969 | null |
2024-12-08 | GBR: Generative Bundle Refinement for High-fidelity Gaussian Splatting and Meshing | Jianing Zhang et.al. | 2412.05908 | null |
2024-12-07 | Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes | Saqib Javed et.al. | 2412.05700 | null |
2024-12-07 | WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking | Yuqi Tan et.al. | 2412.05695 | null |
2024-12-07 | Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View Synthesis | Diwen Wan et.al. | 2412.05570 | null |
2024-12-06 | Extrapolated Urban View Synthesis Benchmark | Xiangyu Han et.al. | 2412.05256 | link |
2024-12-06 | MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting | Peng Chen et.al. | 2412.04955 | link |
2024-12-06 | Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction | Jixuan Fan et.al. | 2412.04887 | link |
2024-12-06 | WRF-GS: Wireless Radiation Field Reconstruction with 3D Gaussian Splatting | Chaozheng Wen et.al. | 2412.04832 | link |
2024-12-06 | Pushing Rendering Boundaries: Hard Gaussian Splatting | Qingshan Xu et.al. | 2412.04826 | null |
2024-12-05 | Turbo3D: Ultra-fast Text-to-3D Generation | Hanzhe Hu et.al. | 2412.04470 | null |
2024-12-05 | QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos | Sharath Girish et.al. | 2412.04469 | null |
2024-12-05 | Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering | Cheng Sun et.al. | 2412.04459 | link |
2024-12-05 | Monocular Dynamic Gaussian Splatting is Fast and Brittle but Smooth Motion Helps | Yiqing Liang et.al. | 2412.04457 | null |
2024-12-05 | PBDyG: Position Based Dynamic Gaussians for Motion-Aware Clothed Human Avatars | Shota Sasaki et.al. | 2412.04433 | null |
2024-12-05 | Multi-View Pose-Agnostic Change Localization with Zero Labels | Chamuditha Jayanga Galappaththige et.al. | 2412.03911 | link |
2024-12-05 | DGNS: Deformable Gaussian Splatting and Dynamic Neural Surface for Monocular Dynamic 3D Reconstruction | Xuesong Li et.al. | 2412.03910 | link |
2024-12-05 | HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting | Jingyu Lin et.al. | 2412.03844 | link |
2024-12-04 | Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos | Hanxue Liang et.al. | 2412.03526 | null |
2024-12-04 | Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter | Hermes McGriff et.al. | 2412.03518 | null |
2024-12-04 | Urban4D: Semantic-Guided 4D Gaussian Splatting for Urban Scene Reconstruction | Ziwen Li et.al. | 2412.03473 | null |
2024-12-04 | 2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction | Wanting Zhang et.al. | 2412.03428 | null |
2024-12-04 | Volumetrically Consistent 3D Gaussian Rasterization | Chinmay Talegaonkar et.al. | 2412.03378 | link |
2024-12-04 | SGSST: Scaling Gaussian Splatting StyleTransfer | Bruno Galerne et.al. | 2412.03371 | link |
2024-12-04 | NeRF and Gaussian Splatting SLAM in the Wild | Fabian Schmidt et.al. | 2412.03263 | link |
2024-12-04 | Splats in Splats: Embedding Invisible 3D Watermark within Gaussian Splatting | Yijia Guo et.al. | 2412.03121 | null |
2024-12-04 | RoDyGS: Robust Dynamic Gaussian Splatting for Casual Videos | Yoonwoo Jeong et.al. | 2412.03077 | null |
2024-12-03 | Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3D Objects | Abdurrahman Zeybey et.al. | 2412.02803 | null |
2024-12-03 | AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction | Lingteng Qiu et.al. | 2412.02684 | null |
2024-12-03 | RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians | Qiankun Gao et.al. | 2412.02493 | link |
2024-12-03 | TimeWalker: Personalized Neural Space for Lifelong Head Avatars | Dongwei Pan et.al. | 2412.02421 | null |
2024-12-03 | GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos | Zhiyuan Chen et.al. | 2412.02267 | null |
2024-12-03 | Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance | Jing Zeng et.al. | 2412.02249 | null |
2024-12-03 | SparseLGS: Sparse View Language Embedded Gaussian Splatting | Jun Hu et.al. | 2412.02245 | null |
2024-12-03 | How to Use Diffusion Priors under Sparse Views? | Qisen Wang et.al. | 2412.02225 | link |
2024-12-03 | SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from Sparse Multi-View RGB Images | Junqiu Yu et.al. | 2412.02140 | null |
2024-12-03 | Gaussian Object Carver: Object-Compositional Gaussian Splatting with surfaces completion | Liu Liu et.al. | 2412.02075 | link |
2024-12-02 | Planar Gaussian Splatting | Farhad G. Zanjani et.al. | 2412.01931 | null |
2024-12-02 | GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting | Zixuan Chen et.al. | 2411.19895 | link |
2024-11-29 | DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering | Yihao Wang et.al. | 2411.19756 | null |
2024-11-29 | TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting | Bojun Xiong et.al. | 2411.19654 | link |
2024-11-29 | Tortho-Gaussian: Splatting True Digital Orthophoto Maps | Xin Wang et.al. | 2411.19594 | null |
2024-11-29 | Gaussian Splashing: Direct Volumetric Rendering Underwater | Nir Mualem et.al. | 2411.19588 | null |
2024-11-29 | Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Wenbo Zhang et.al. | 2411.19551 | link |
2024-12-02 | GausSurf: Geometry-Guided 3D Gaussian Splatting for Surface Reconstruction | Jiepeng Wang et.al. | 2411.19454 | null |
2024-11-29 | RF-3DGS: Wireless Channel Modeling with Radio Radiance Field and 3D Gaussian Splatting | Lihao Zhang et.al. | 2411.19420 | link |
2024-11-28 | SADG: Segment Any Dynamic Gaussian Without Object Trackers | Yun-Jin Li et.al. | 2411.19290 | link |
2024-11-28 | AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones | Xuqian Ren et.al. | 2411.19271 | null |
2024-11-27 | Textured Gaussians for Enhanced 3D Scene Appearance Modeling | Brian Chao et.al. | 2411.18625 | null |
2024-11-27 | PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image | Han Yan et.al. | 2411.18548 | null |
2024-11-27 | HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression | Lei Liu et.al. | 2411.18473 | null |
2024-11-27 | Neural Surface Priors for Editable Gaussian Splatting | Jakub Szymkowiak et.al. | 2411.18311 | link |
2024-11-27 | Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters | Zhiyang Guo et.al. | 2411.18197 | null |
2024-11-27 | SmileSplat: Generalizable Gaussian Splats for Unconstrained Sparse Images | Yanyan Li et.al. | 2411.18072 | null |
2024-11-27 | GLS: Geometry-aware 3D Language Gaussian Splatting | Jiaxiong Qiu et.al. | 2411.18066 | link |
2024-11-27 | HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction | Wei Zhang et.al. | 2411.17982 | link |
2024-11-26 | DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting | Christian Homeyer et.al. | 2411.17660 | link |
2024-11-26 | Distractor-free Generalizable 3D Gaussian Splatting | Yanqi Bao et.al. | 2411.17605 | link |
2024-11-26 | SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting | Gyeongjin Kang et.al. | 2411.17190 | null |
2024-11-26 | 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction | Woong Oh Cho et.al. | 2411.17044 | null |
2024-11-25 | G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs | Kunyi Li et.al. | 2411.16898 | null |
2024-11-25 | PreF3R: Pose-Free Feed-Forward 3D Gaussian Splatting from Variable-length Image Sequence | Zequn Chen et.al. | 2411.16877 | null |
2024-11-25 | SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving | Georg Hess et.al. | 2411.16816 | link |
2024-11-25 | SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis | Hyojun Go et.al. | 2411.16443 | link |
2024-11-25 | Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction | Ziyu Zhang et.al. | 2411.16392 | null |
2024-11-25 | Event-boosted Deformable 3D Gaussians for Fast Dynamic Scene Reconstruction | Wenhao Xu et.al. | 2411.16180 | null |
2024-11-25 | UnitedVLN: Generalizable Gaussian Splatting for Continuous Vision-Language Navigation | Guangzhao Dai et.al. | 2411.16053 | null |
2024-11-24 | PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments | Haoang Li et.al. | 2411.15800 | null |
2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
2024-11-24 | DynamicAvatars: Accurate Dynamic Facial Avatars Reconstruction and Precise Editing with Diffusion Models | Yangyang Qian et.al. | 2411.15732 | null |
2024-11-24 | GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision | Xu Baixin et.al. | 2411.15723 | link |
2024-11-23 | EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting | Xiaobao Wei et.al. | 2411.15582 | null |
2024-11-23 | SplatFlow: Self-Supervised Dynamic Gaussian Splatting in Neural Motion Flow Field for Autonomous Driving | Su Sun et.al. | 2411.15482 | null |
2024-11-22 | Neural 4D Evolution under Large Topological Changes from 2D Images | AmirHossein Naghi Razlighi et.al. | 2411.15018 | null |
2024-11-22 | 3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes | Jan Held et.al. | 2411.14974 | link |
2024-11-22 | Dynamics-Aware Gaussian Splatting Streaming Towards Fast On-the-Fly Training for 4D Reconstruction | Zhening Liu et.al. | 2411.14847 | null |
2024-11-22 | VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving | Haiming Zhang et.al. | 2411.14716 | null |
2024-11-21 | NexusSplats: Efficient 3D Gaussian Splatting in the Wild | Yuzhou Tang et.al. | 2411.14514 | null |
2024-11-21 | Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation | Zhuoman Liu et.al. | 2411.14423 | null |
2024-11-21 | Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation | Yuanhao Cai et.al. | 2411.14384 | null |
2024-11-21 | SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching | Arjun P S et.al. | 2411.14322 | link |
2024-11-20 | FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting | Ola Shorinwa et.al. | 2411.13753 | null |
2024-11-20 | Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization | Hao Ju et.al. | 2411.13610 | null |
2024-11-20 | Generating 3D-Consistent Videos from Unposed Internet Photos | Gene Chou et.al. | 2411.13549 | null |
2024-11-20 | GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting | Xiaobao Wei et.al. | 2411.12981 | null |
2024-11-19 | Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting | Haoyu Zhao et.al. | 2411.12789 | null |
2024-11-19 | Mini-Splatting2: Building 360 Scenes within Minutes via Aggressive Gaussian Densification | Guangchi Fang et.al. | 2411.12788 | null |
2024-11-19 | PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy | Joanna Kaleta et.al. | 2411.12510 | link |
2024-11-19 | SCIGS: 3D Gaussians Splatting from a Snapshot Compressive Image | Zixu Wang et.al. | 2411.12471 | null |
2024-11-20 | Beyond Gaussians: Fast and High-Fidelity 3D Splatting with Linear Kernels | Haodong Chen et.al. | 2411.12440 | null |
2024-11-19 | LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2411.12185 | null |
2024-11-19 | Sketch-guided Cage-based 3D Gaussian Splatting Deformation | Tianhao Xie et.al. | 2411.12168 | null |
2024-11-18 | FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting | Fangyu Wu et.al. | 2411.12089 | null |
2024-11-18 | TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction | DaDong Jiang et.al. | 2411.11941 | null |
2024-11-18 | DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes | Chensheng Peng et.al. | 2411.11921 | link |
2024-11-18 | RoboGSim: A Real2Sim2Real Robotic Gaussian Splatting Simulator | Xinhai Li et.al. | 2411.11839 | null |
2024-11-18 | GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views | Boyao Zhou et.al. | 2411.11363 | null |
2024-11-17 | VeGaS: Video Gaussian Splatting | Weronika Smolak-Dyżewska et.al. | 2411.11024 | link |
2024-11-17 | Direct and Explicit 3D Generation from a Single Image | Haoyu Wu et.al. | 2411.10947 | null |
2024-11-16 | DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment | Mangyu Kong et.al. | 2411.10722 | link |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-15 | USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting | Kang Chen et.al. | 2411.10504 | link |
2024-11-15 | Efficient Density Control for 3D Gaussian Splatting | Xiaobin Deng et.al. | 2411.10133 | link |
2024-11-15 | GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization | Yanhao Sun et.al. | 2411.10033 | null |
2024-11-15 | GGAvatar: Reconstructing Garment-Separated 3D Gaussian Splatting Avatars from Monocular Video | Jingxuan Chen et.al. | 2411.09952 | link |
2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
2024-11-14 | DyGASR: Dynamic Generalized Exponential Splatting with Surface Alignment for Accelerated 3D Mesh Reconstruction | Shengchao Zhao et.al. | 2411.09156 | null |
2024-11-13 | 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Mijeong Kim et.al. | 2411.08879 | null |
2024-11-13 | Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models | Chengdong Dong et.al. | 2411.08642 | null |
2024-11-13 | BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis | David Svitov et.al. | 2411.08508 | link |
2024-11-13 | Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model | Yutao Shen et.al. | 2411.08453 | null |
2024-11-13 | DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization | Yueming Xu et.al. | 2411.08373 | null |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-14 | Projecting Gaussian Ellipsoids While Avoiding Affine Projection Approximation | Han Qi et.al. | 2411.07579 | null |
2024-11-12 | GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting | Umangi Jain et.al. | 2411.07555 | null |
2024-11-12 | HiCoM: Hierarchical Coherent Motion for Streamable Dynamic Scene with 3D Gaussian Splatting | Qiankun Gao et.al. | 2411.07541 | link |
2024-11-12 | GUS-IR: Gaussian Splatting with Unified Shading for Inverse Rendering | Zhihao Liang et.al. | 2411.07478 | null |
2024-11-11 | A Hierarchical Compression Technique for 3D Gaussian Splatting Compression | He Huang et.al. | 2411.06976 | null |
2024-11-10 | Adaptive and Temporally Consistent Gaussian Surfels for Multi-view Dynamic Reconstruction | Decai Chen et.al. | 2411.06602 | null |
2024-11-12 | SplatFormer: Point Transformer for Robust 3D Gaussian Splatting | Yutong Chen et.al. | 2411.06390 | link |
2024-11-10 | Through the Curved Cover: Synthesizing Cover Aberrated Scenes with Refractive Field | Liuyue Xie et.al. | 2411.06365 | null |
2024-11-09 | AI-Driven Stylization of 3D Environments | Yuanbo Chen et.al. | 2411.06067 | null |
2024-11-09 | GaussianSpa: An “Optimizing-Sparsifying” Simplification Framework for Compact and High-Quality 3D Gaussian Splatting | Yangming Zhang et.al. | 2411.06019 | null |
2024-11-07 | ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing | Jun-Kun Chen et.al. | 2411.05006 | null |
2024-11-07 | MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views | Yuedong Chen et.al. | 2411.04924 | link |
2024-11-08 | GS2Pose: Two-stage 6D Object Pose Estimation Guided by Gaussian Splatting | Jilan Mei et.al. | 2411.03807 | null |
2024-11-06 | 3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object Rearrangement | Ziqi Lu et.al. | 2411.03706 | link |
2024-11-06 | Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Rui Peng et.al. | 2411.03637 | link |
2024-11-05 | Object and Contact Point Tracking in Demonstrations Using 3D Gaussian Splatting | Michael Büttner et.al. | 2411.03555 | null |
2024-11-05 | HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features | Arnab Dey et.al. | 2411.03086 | null |
2024-11-05 | LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting | Huibin Zhao et.al. | 2411.02703 | null |
2024-11-04 | Modeling Uncertainty in 3D Gaussian Splatting through Continuous Semantic Splatting | Joey Wilson et.al. | 2411.02547 | null |
2024-11-06 | SplatOverflow: Asynchronous Hardware Troubleshooting | Amritansh Kwatra et.al. | 2411.02332 | null |
2024-11-05 | FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Ruihong Yin et.al. | 2411.02229 | null |
2024-11-06 | GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes | Gaochao Song et.al. | 2411.01853 | null |
2024-11-02 | Real-Time Spatio-Temporal Reconstruction of Dynamic Endoscopic Scenes with 4D Gaussian Splatting | Fengze Li et.al. | 2411.01218 | null |
2024-11-01 | CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes | Yang Liu et.al. | 2411.00771 | null |
2024-11-01 | PCoTTA: Continual Test-Time Adaptation for Multi-Task Point Cloud Understanding | Jincen Jiang et.al. | 2411.00632 | null |
2024-10-31 | Aquatic-GS: A Hybrid 3D Representation for Underwater Scenes | Shaohua Liu et.al. | 2411.00239 | null |
2024-10-31 | Self-Ensembling Gaussian Splatting for Few-shot Novel View Synthesis | Chen Zhao et.al. | 2411.00144 | link |
2024-10-31 | No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images | Botao Ye et.al. | 2410.24207 | link |
2024-11-01 | GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering | Kai Ye et.al. | 2410.24204 | null |
2024-10-31 | GaussianMarker: Uncertainty-Aware Copyright Protection of 3D Gaussian Splatting | Xiufeng Huang et.al. | 2410.23718 | null |
2024-10-31 | GS-Blur: A 3D Scene-Based Dataset for Realistic Image Deblurring | Dongwoo Lee et.al. | 2410.23658 | link |
2024-10-30 | ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting | Muhammad Salman Ali et.al. | 2410.23213 | null |
2024-10-31 | Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis | Zhiyuan Min et.al. | 2410.22817 | null |
2024-10-30 | Geometry Cloak: Preventing TGS-based 3D Reconstruction from Copyrighted Images | Qi Song et.al. | 2410.22705 | null |
2024-10-29 | PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting | Sunghwan Hong et.al. | 2410.22128 | link |
2024-10-29 | FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives | Qizhi Chen et.al. | 2410.22070 | null |
2024-10-29 | ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting | Yuetao Li et.al. | 2410.21955 | link |
2024-10-28 | MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Yating Xu et.al. | 2410.21566 | link |
2024-10-28 | Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting | Jiawei Xu et.al. | 2410.20815 | null |
2024-10-28 | LoDAvatar: Hierarchical Embedding and Adaptive Levels of Detail with Gaussian Splatting for Enhanced Human Avatars | Xiaonuo Dongye et.al. | 2410.20789 | null |
2024-10-28 | CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians | Chongjian Ge et.al. | 2410.20723 | null |
2024-10-28 | ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Suyoung Lee et.al. | 2410.20686 | link |
2024-10-27 | Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering | Meng Wei et.al. | 2410.20593 | null |
2024-10-26 | Neural Fields in Robotics: A Survey | Muhammad Zubair Irshad et.al. | 2410.20220 | link |
2024-10-25 | DiffGS: Functional Gaussian Splatting Diffusion | Junsheng Zhou et.al. | 2410.19657 | null |
2024-10-25 | Robotic Learning in your Backyard: A Neural Simulator from Open Source Components | Liyou Zhou et.al. | 2410.19564 | link |
2024-10-25 | Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization | Weihang Liu et.al. | 2410.19483 | link |
2024-10-24 | 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation | Hansheng Chen et.al. | 2410.18974 | link |
2024-10-24 | Sort-free Gaussian Splatting via Weighted Sum Rendering | Qiqi Hou et.al. | 2410.18931 | null |
2024-10-24 | Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling | Mingtong Zhang et.al. | 2410.18912 | null |
2024-10-27 | Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis | Liang Han et.al. | 2410.18822 | null |
2024-10-23 | VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points | Linus Franke et.al. | 2410.17932 | null |
2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
2024-10-22 | AG-SLAM: Active Gaussian Splatting SLAM | Wen Jiang et.al. | 2410.17422 | null |
2024-10-22 | SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes | Cheng-De Fan et.al. | 2410.17249 | null |
2024-10-18 | GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting | Yusen Xie et.al. | 2410.17084 | null |
2024-10-22 | E-3DGS: Gaussian Splatting with Exposure and Motion Events | Xiaoting Yin et.al. | 2410.16995 | link |
2024-10-22 | Multi-Layer Gaussian Splatting for Immersive Anatomy Visualization | Constantin Kleinbeck et.al. | 2410.16978 | link |
2024-10-21 | 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors | Xi Liu et.al. | 2410.16266 | null |
2024-10-21 | MSGField: A Unified Scene Representation Integrating Motion, Semantics, and Geometry for Robotic Manipulation | Yu Sheng et.al. | 2410.15730 | null |
2024-10-22 | Fully Explicit Dynamic Gaussian Splatting | Junoh Lee et.al. | 2410.15629 | null |
2024-10-22 | EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting | Bohao Liao et.al. | 2410.15392 | null |
2024-10-18 | LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes | Juliette Marrie et.al. | 2410.14462 | null |
2024-10-18 | Neural Signed Distance Function Inference through Splatting 3D Gaussians Pulled on Zero-Level Set | Wenyuan Zhang et.al. | 2410.14189 | null |
2024-10-18 | DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction | Ange Lou et.al. | 2410.14169 | null |
2024-10-17 | DepthSplat: Connecting Gaussian Splatting and Depth | Haofei Xu et.al. | 2410.13862 | link |
2024-10-17 | Differentiable Robot Rendering | Ruoshi Liu et.al. | 2410.13851 | null |
2024-10-17 | MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes | Xinjie Zhang et.al. | 2410.13613 | null |
2024-10-17 | DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering | Jiahao Lu et.al. | 2410.13607 | link |
2024-10-17 | GlossyGS: Inverse Rendering of Glossy Objects with 3D Gaussian Splatting | Shuichang Lai et.al. | 2410.13349 | null |
2024-10-16 | Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats | Chen Ziwen et.al. | 2410.12781 | null |
2024-10-16 | 3D Gaussian Splatting in Robotics: A Survey | Siting Zhu et.al. | 2410.12262 | link |
2024-10-15 | SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection | Yizhe Liu et.al. | 2410.12080 | link |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-15 | GS^3: Efficient Relighting with Triple Gaussian Splatting | Zoubin Bi et.al. | 2410.11419 | link |
2024-10-15 | MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields | Yuru Xiao et.al. | 2410.11394 | null |
2024-10-15 | GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information | Wancai Zheng et.al. | 2410.11356 | null |
2024-10-15 | Scalable Indoor Novel-View Synthesis using Drone-Captured 360 Imagery with 3D Gaussian Splatting | Yuanbo Chen et.al. | 2410.11285 | null |
2024-10-14 | Few-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting | Raja Kumar et.al. | 2410.11080 | link |
2024-10-15 | 4-LEGS: 4D Language Embedded Gaussian Splatting | Gal Fiebelman et.al. | 2410.10719 | null |
2024-10-14 | 4DStyleGaussian: Zero-shot 4D Style Transfer with Gaussian Splatting | Wanlin Liang et.al. | 2410.10412 | null |
2024-10-13 | Gaussian Splatting Visual MPC for Granular Media Manipulation | Wei-Cheng Tseng et.al. | 2410.09740 | null |
2024-10-12 | Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors | Hritam Basak et.al. | 2410.09467 | null |
2024-10-11 | SurgicalGS: Dynamic 3D Gaussian Splatting for Accurate Robotic-Assisted Surgical Scene Reconstruction | Jialei Chen et.al. | 2410.09292 | null |
2024-10-11 | MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering | Jaehoon Choi et.al. | 2410.08941 | null |
2024-10-11 | Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars | Xuan Huang et.al. | 2410.08840 | link |
2024-10-11 | Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose Initialization | Christian Schmidt et.al. | 2410.08743 | link |
2024-10-10 | FusionSense: Bridging Common Sense, Vision, and Touch for Robust Sparse-View Reconstruction | Irving Fang et.al. | 2410.08282 | null |
2024-10-10 | Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics | Junyi Cao et.al. | 2410.08257 | null |
2024-10-10 | Poison-splat: Computation Cost Attack on 3D Gaussian Splatting | Jiahao Lu et.al. | 2410.08190 | link |
2024-10-10 | DifFRelight: Diffusion-Based Facial Performance Relighting | Mingming He et.al. | 2410.08188 | null |
2024-10-10 | Efficient Perspective-Correct 3D Gaussian Splatting Using Hybrid Transparency | Florian Hahlbohm et.al. | 2410.08129 | null |
2024-10-10 | IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Jian Huang et.al. | 2410.08107 | link |
2024-10-11 | Fast Feedforward 3D Gaussian Splatting Compression | Yihang Chen et.al. | 2410.08017 | link |
2024-10-10 | L-VITeX: Light-weight Visual Intuition for Terrain Exploration | Antar Mazumder et.al. | 2410.07872 | null |
2024-10-10 | MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting | Ruijie Zhu et.al. | 2410.07707 | link |
2024-10-10 | 3D Vision-Language Gaussian Splatting | Qucheng Peng et.al. | 2410.07577 | null |
2024-10-09 | DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation | Zhiqi Li et.al. | 2410.06756 | null |
2024-10-09 | ES-Gaussian: Gaussian Splatting Mapping via Error Space-Based Gaussian Completion | Lu Chen et.al. | 2410.06613 | null |
2024-10-09 | 3D Representation Methods: A Survey | Zhengren Wang et.al. | 2410.06475 | null |
2024-10-08 | HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction | Shengji Tang et.al. | 2410.06245 | null |
2024-10-10 | RelitLRM: Generative Relightable Radiance for Large Reconstruction Models | Tianyuan Zhang et.al. | 2410.06231 | null |
2024-10-08 | GSLoc: Visual Localization with 3D Gaussian Splatting | Kazii Botashev et.al. | 2410.06165 | null |
2024-10-08 | SplaTraj: Camera Trajectory Generation with Semantic Gaussian Splatting | Xinyi Liu et.al. | 2410.06014 | null |
2024-10-08 | Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters | Guoji Tian et.al. | 2410.05772 | null |
2024-10-07 | PH-Dropout: Prctical Epistemic Uncertainty Quantification for View Synthesis | Chuanhao Sun et.al. | 2410.05468 | link |
2024-10-07 | GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting | Yukang Cao et.al. | 2410.05259 | null |
2024-10-07 | LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting | Qifeng Chen et.al. | 2410.05111 | null |
2024-10-07 | DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects | Nidhi Mathihalli et.al. | 2410.05097 | link |
2024-10-07 | PhotoReg: Photometrically Registering 3D Gaussian Splatting Models | Ziwen Yuan et.al. | 2410.05044 | null |
2024-10-07 | 6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering | Zhongpai Gao et.al. | 2410.04974 | null |
2024-10-07 | Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting | Matthew Strong et.al. | 2410.04680 | link |
2024-10-06 | Mode-GS: Monocular Depth Guided Anchored 3D Gaussian Splatting for Robust Ground-View Scene Rendering | Yonghan Lee et.al. | 2410.04646 | null |
2024-10-06 | StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting | Xiao Cui et.al. | 2410.04354 | null |
2024-10-04 | Variational Bayes Gaussian Splatting | Toon Van de Maele et.al. | 2410.03592 | link |
2024-10-03 | Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats | Mingyang Xie et.al. | 2410.02764 | null |
2024-10-03 | GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering | Hongze Chen et.al. | 2410.02619 | null |
2024-10-03 | SuperGS: Super-Resolution 3D Gaussian Splatting via Latent Feature Field and Gradient-guided Splitting | Shiyun Xie et.al. | 2410.02571 | link |
2024-10-02 | MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis | Xiaobiao Du et.al. | 2410.02103 | link |
2024-10-03 | EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis | Alexander Mai et.al. | 2410.01804 | null |
2024-10-02 | 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Yang Cao et.al. | 2410.01647 | link |
2024-10-02 | Gaussian Splatting in Mirrors: Reflection-Aware Rendering via Virtual Camera Optimization | Zihan Wang et.al. | 2410.01614 | link |
2024-10-02 | GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians | Shuyi Jiang et.al. | 2410.01535 | null |
2024-10-02 | MiraGe: Editable 2D Images using Gaussian Splatting | Joanna Waczyńska et.al. | 2410.01521 | link |
2024-10-02 | UW-GS: Distractor-Aware 3D Gaussian Splatting for Enhanced Underwater Scene Reconstruction | Haoran Wang et.al. | 2410.01517 | link |
2024-10-02 | EVA-Gaussian: 3D Gaussian-based Real-time Human Novel View Synthesis under Diverse Camera Settings | Yingdong Hu et.al. | 2410.01425 | null |
2024-10-02 | Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection | Hongru Yan et.al. | 2410.01404 | null |
2024-10-02 | CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM | Dapeng Feng et.al. | 2410.00486 | link |
2024-10-01 | Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance | Hongchao Shu et.al. | 2410.00386 | null |
2024-09-30 | RL-GSBridge: 3D Gaussian Splatting Based Real2Sim2Real Method for Robotic Manipulation Learning | Yuxuan Wu et.al. | 2409.20291 | null |
2024-09-30 | Robust Gaussian Splatting SLAM by Leveraging Loop Closure | Zunjie Zhu et.al. | 2409.20111 | null |
2024-10-01 | RNG: Relightable Neural Gaussians | Jiahui Fan et.al. | 2409.19702 | null |
2024-09-28 | GS-EVT: Cross-Modal Event Camera Tracking based on Gaussian Splatting | Tao Liu et.al. | 2409.19228 | null |
2024-09-28 | 1st Place Solution to the 8th HANDS Workshop Challenge – ARCTIC Track: 3DGS-based Bimanual Category-agnostic Interaction Reconstruction | Jeongwan On et.al. | 2409.19215 | null |
2024-09-27 | Gaussian Heritage: 3D Digitization of Cultural Heritage with Integrated Object Segmentation | Mahtab Dahaghin et.al. | 2409.19039 | null |
2024-09-27 | Space-time 2D Gaussian Splatting for Accurate Surface Reconstruction under Complex Dynamic Scenes | Shuo Wang et.al. | 2409.18852 | link |
2024-09-26 | RT-GuIDE: Real-Time Gaussian splatting for Information-Driven Exploration | Yuezhan Tao et.al. | 2409.18122 | null |
2024-09-26 | Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot | Justin Yu et.al. | 2409.18108 | null |
2024-09-26 | WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians | Dmytro Kotovenko et.al. | 2409.17917 | null |
2024-09-26 | HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian Splatting | Zijun Xu et.al. | 2409.17624 | null |
2024-09-25 | SeaSplat: Representing Underwater Scenes with 3D Gaussian Splatting and a Physically Grounded Image Formation Model | Daniel Yang et.al. | 2409.17345 | null |
2024-09-25 | Disco4D: Disentangled 4D Human Generation and Animation from a Single Image | Hui En Pang et.al. | 2409.17280 | null |
2024-09-25 | Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM | Phu Pham et.al. | 2409.16944 | null |
2024-09-25 | Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model | Hongliang Zhong et.al. | 2409.16938 | link |
2024-09-25 | Let’s Make a Splan: Risk-Aware Trajectory Optimization in a Normalized Gaussian Splat | Jonathan Michaux et.al. | 2409.16915 | null |
2024-09-24 | GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Gennady Sidorov et.al. | 2409.16502 | link |
2024-09-24 | Frequency-based View Selection in Gaussian Splatting Reconstruction | Monica M. Q. Li et.al. | 2409.16470 | null |
2024-09-23 | Gaussian Déjà-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities | Peizhi Yan et.al. | 2409.16147 | link |
2024-09-24 | Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction and Rendering in Virtual Reality | Hannah Schieber et.al. | 2409.15959 | null |
2024-09-24 | Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB | Jae Yong Lee et.al. | 2409.15689 | null |
2024-09-23 | Human Hair Reconstruction with Strand-Aligned 3D Gaussians | Egor Zakharov et.al. | 2409.14778 | null |
2024-09-22 | MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views | Wangze Xu et.al. | 2409.14316 | null |
2024-09-21 | SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality | Hongjia Zhai et.al. | 2409.14067 | null |
2024-09-20 | Elite-EvGS: Learning Event-based 3D Gaussian Splatting by Distilling Event-to-Video Priors | Zixin Zhang et.al. | 2409.13392 | null |
2024-09-20 | 3D-GSW: 3D Gaussian Splatting Watermark for Protecting Copyrights in Radiance Fields | Youngdong Jang et.al. | 2409.13222 | null |
2024-09-19 | MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting | Yan Song Hu et.al. | 2409.13055 | null |
2024-09-19 | GStex: Per-Primitive Texturing of 2D Gaussian Splatting for Decoupled Appearance and Geometry Modeling | Victor Rong et.al. | 2409.12954 | link |
2024-09-18 | Vista3D: Unravel the 3D Darkside of a Single Image | Qiuhong Shen et.al. | 2409.12193 | link |
2024-09-18 | SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation | Mingze Sun et.al. | 2409.11682 | link |
2024-09-18 | Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks | Joji Joseph et.al. | 2409.11681 | link |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-17 | GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module | Yichen Zhang et.al. | 2409.11307 | null |
2024-09-17 | SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction | Marko Mihajlovic et.al. | 2409.11211 | null |
2024-09-17 | GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure | Ziheng Xu et.al. | 2409.10982 | null |
2024-09-16 | Phys3DGS: Physically-based 3D Gaussian Splatting for Inverse Rendering | Euntae Choi et.al. | 2409.10335 | null |
2024-09-16 | BEINGS: Bayesian Embodied Image-goal Navigation with Gaussian Splatting | Wugang Meng et.al. | 2409.10216 | link |
2024-09-16 | SplatSim: Zero-Shot Sim2Real Transfer of RGB Manipulation Policies Using Gaussian Splatting | Mohammad Nomaan Qureshi et.al. | 2409.10161 | null |
2024-09-16 | Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression | Yi-Hsin Li et.al. | 2409.10101 | null |
2024-09-16 | DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments | Mahmud A. Mohamad et.al. | 2409.10041 | link |
2024-09-15 | SAFER-Splat: A Control Barrier Function for Safe Navigation with Online Gaussian Splatting Maps | Timothy Chen et.al. | 2409.09868 | null |
2024-09-15 | MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation | Shuzhao Xie et.al. | 2409.09756 | null |
2024-09-14 | GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians | Dasong Gao et.al. | 2409.09295 | link |
2024-09-13 | A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis | Yohan Poirier-Ginter et.al. | 2409.08947 | null |
2024-09-13 | AdR-Gaussian: Accelerating Gaussian Splatting with Adaptive Radius | Xinzhe Wang et.al. | 2409.08669 | null |
2024-09-13 | Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints | Shan Chen et.al. | 2409.08613 | null |
2024-09-13 | CSS: Overcoming Pose and Scene Challenges in Crowd-Sourced 3D Gaussian Splatting | Runze Chen et.al. | 2409.08562 | null |
2024-09-12 | Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos | Yuheng Jiang et.al. | 2409.08353 | null |
2024-09-12 | FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally | Qiuhong Shen et.al. | 2409.08270 | link |
2024-09-12 | Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis | Qian Chen et.al. | 2409.08042 | link |
2024-09-12 | SwinGS: Sliding Window Gaussian Splatting for Volumetric Video Streaming with Arbitrary Length | Bangya Liu et.al. | 2409.07759 | null |
2024-09-11 | Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs | Sadra Safadoust et.al. | 2409.07456 | null |
2024-09-11 | Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models | Haibo Yang et.al. | 2409.07452 | link |
2024-09-11 | Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering | Dafei Qin et.al. | 2409.07441 | null |
2024-09-11 | Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks | Ruihan Xu et.al. | 2409.07245 | null |
2024-09-11 | ThermalGaussian: Thermal 3D Gaussian Splatting | Rongfeng Lu et.al. | 2409.07200 | link |
2024-09-10 | gsplat: An Open-Source Library for Gaussian Splatting | Vickie Ye et.al. | 2409.06765 | link |
2024-09-10 | GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction | Junyi Chen et.al. | 2409.06685 | null |
2024-09-10 | Sources of Uncertainty in 3D Scene Reconstruction | Marcus Klasson et.al. | 2409.06407 | link |
2024-09-09 | Online 3D reconstruction and dense tracking in endoscopic videos | Michel Hayoz et.al. | 2409.06037 | link |
2024-09-09 | GASP: Gaussian Splatting for Physic-Based Simulations | Piotr Borycki et.al. | 2409.05819 | link |
2024-09-09 | Lagrangian Hashing for Compressed Neural Field Representations | Shrisudhan Govindarajan et.al. | 2409.05334 | null |
2024-09-08 | DreamMapping: High-Fidelity Text-to-3D Generation via Variational Distribution Mapping | Zeyu Cai et.al. | 2409.05099 | null |
2024-09-08 | GS-PT: Exploiting 3D Gaussian Splatting for Comprehensive Point Cloud Understanding via Self-supervised Learning | Keyi Liu et.al. | 2409.04963 | null |
2024-09-11 | Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras | Zimu Liao et.al. | 2409.04751 | link |
2024-09-06 | GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers | Lorenza Prospero et.al. | 2409.04196 | link |
2024-09-06 | 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors | Yujun Huang et.al. | 2409.04013 | link |
2024-09-05 | LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors | Hanyang Yu et.al. | 2409.03456 | null |
2024-09-05 | Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction | Shen Chen et.al. | 2409.03213 | null |
2024-09-04 | Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Zhibin Liu et.al. | 2409.02851 | link |
2024-09-04 | Object Gaussian for Monocular 6D Pose Estimation from Sparse Views | Luqing Luo et.al. | 2409.02581 | null |
2024-09-04 | GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving | Huasong Han et.al. | 2409.02382 | null |
2024-09-03 | DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction | Jenny Seidenschwarz et.al. | 2409.02104 | null |
2024-09-03 | PRoGS: Progressive Rendering of Gaussian Splats | Brent Zoomers et.al. | 2409.01761 | null |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-09-02 | Free-DyGS: Camera-Pose-Free Scene Reconstruction based on Gaussian Splatting for Dynamic Surgical Videos | Qian Li et.al. | 2409.01003 | null |
2024-08-31 | 3D Gaussian Splatting for Large-scale 3D Surface Reconstruction from Aerial Images | YuanZheng Wu et.al. | 2409.00381 | null |
2024-08-31 | UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM | Mostafa Mansour et.al. | 2409.00362 | null |
2024-08-30 | OG-Mapping: Octree-based Structured 3D Gaussians for Online Dense Mapping | Meng Wang et.al. | 2408.17223 | null |
2024-08-30 | 2DGH: 2D Gaussian-Hermite Splatting for High-quality Rendering and Better Geometry Reconstruction | Ruihan Yu et.al. | 2408.16982 | null |
2024-08-29 | ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model | Fangfu Liu et.al. | 2408.16767 | null |
2024-08-29 | OmniRe: Omni Urban Scene Reconstruction | Ziyu Chen et.al. | 2408.16760 | null |
2024-08-28 | Towards Realistic Example-based Modeling via 3D Gaussian Stitching | Xinyu Gao et.al. | 2408.15708 | null |
2024-08-28 | G-Style: Stylized Gaussian Splatting | Áron Samuel Kovács et.al. | 2408.15695 | link |
2024-08-27 | Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Saining Zhang et.al. | 2408.15242 | link |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-08-27 | Robo-GS: A Physics Consistent Spatial-Temporal Model for Robotic Arm with Hybrid Representation | Haozhe Lou et.al. | 2408.14873 | null |
2024-08-27 | LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming | Yuang Shi et.al. | 2408.14823 | link |
2024-08-26 | Avatar Concept Slider: Manipulate Concepts In Your Human Avatar With Fine-grained Control | Yixuan He et.al. | 2408.13995 | null |
2024-08-26 | DynaSurfGS: Dynamic Surface Reconstruction with Planar-based Gaussian Splatting | Weiwei Cai et.al. | 2408.13972 | link |
2024-08-27 | Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs | Brandon Smart et.al. | 2408.13912 | null |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
2024-08-25 | SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting | Wenrui Li et.al. | 2408.13711 | link |
2024-08-23 | BiGS: Bidirectional Gaussian Primitives for Relightable 3D Gaussian Splatting | Zhenyuan Liu et.al. | 2408.13370 | null |
2024-08-23 | S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points | Bing He et.al. | 2408.13036 | link |
2024-08-23 | FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering | Yunji Seo et.al. | 2408.12894 | null |
2024-08-26 | GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion | Jiaxin Wei et.al. | 2408.12677 | link |
2024-08-22 | Subsurface Scattering for 3D Gaussian Splatting | Jan-Niklas Dihlmann et.al. | 2408.12282 | null |
2024-08-21 | Robust 3D Gaussian Splatting for Novel View Synthesis in Presence of Distractors | Paul Ungermann et.al. | 2408.11697 | link |
2024-08-22 | DeRainGS: Gaussian Splatting for Enhanced Scene Reconstruction in Rainy Environments | Shuhong Liu et.al. | 2408.11540 | null |
2024-08-21 | GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting | Wanshui Gan et.al. | 2408.11447 | link |
2024-08-21 | Pano2Room: Novel View Synthesis from a Single Indoor Panorama | Guo Pu et.al. | 2408.11413 | link |
2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | link |
2024-08-20 | ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining | Qi Ma et.al. | 2408.10906 | null |
2024-08-20 | DEGAS: Detailed Expressions on Full-Body Gaussian Avatars | Zhijing Shao et.al. | 2408.10588 | link |
2024-08-20 | LoopSplat: Loop Closure by Registering 3D Gaussian Splats | Liyuan Zhu et.al. | 2408.10154 | link |
2024-08-19 | Implicit Gaussian Splatting with Efficient Multi-Level Tri-Plane Representation | Minye Wu et.al. | 2408.10041 | null |
2024-08-19 | SG-GS: Photo-realistic Animatable Human Avatars with Semantically-Guided Gaussian Splatting | Haoyu Zhao et.al. | 2408.09665 | null |
2024-08-20 | CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning | Haoyu Zhao et.al. | 2408.09663 | null |
2024-08-20 | Gaussian in the Dark: Real-Time View Synthesis From Inconsistent Dark Images Using Gaussian Splatting | Sheng Ye et.al. | 2408.09130 | link |
2024-08-16 | Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS | Wei Sun et.al. | 2408.08723 | null |
2024-08-16 | GS-ID: Illumination Decomposition on Gaussian Splatting via Diffusion Prior and Parametric Light Source Optimization | Kang Du et.al. | 2408.08524 | link |
2024-08-15 | WaterSplatting: Fast Underwater 3D Scene Reconstruction Using Gaussian Splatting | Huapeng Li et.al. | 2408.08206 | null |
2024-08-19 | FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering | Guofeng Feng et.al. | 2408.07967 | link |
2024-08-14 | Progressive Radiance Distillation for Inverse Rendering with Gaussian Splatting | Keyang Ye et.al. | 2408.07595 | null |
2024-08-14 | 3D Gaussian Editing with A Single Image | Guan Luo et.al. | 2408.07540 | null |
2024-08-13 | SpectralGaussians: Semantic, spectral 3D Gaussian splatting for multi-spectral scene representation, visualization and analysis | Saptarshi Neil Sinha et.al. | 2408.06975 | null |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering | Jiameng Li et.al. | 2408.06286 | link |
2024-08-12 | Developing Smart MAVs for Autonomous Inspection in GPS-denied Constructions | Paoqiang Pan et.al. | 2408.06030 | null |
2024-08-12 | HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors | Xiaozheng Zheng et.al. | 2408.06019 | null |
2024-08-10 | Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis | Zhongche Qu et.al. | 2408.05635 | null |
2024-08-09 | DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-14 | Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction | Lingbei Meng et.al. | 2408.04831 | null |
2024-08-06 | LumiGauss: High-Fidelity Outdoor Relighting with 2D Gaussian Splatting | Joanna Kaleta et.al. | 2408.04474 | link |
2024-08-08 | A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery | Mengya Xu et.al. | 2408.04426 | link |
2024-08-08 | InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting | Xin-Yi Yu et.al. | 2408.04249 | null |
2024-08-07 | Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM | Yan Song Hu et.al. | 2408.03825 | null |
2024-08-07 | Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Joo Chan Lee et.al. | 2408.03822 | null |
2024-08-07 | 3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting | Zhe Jun Tang et.al. | 2408.03753 | link |
2024-08-07 | PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting | Yijia Guo et.al. | 2408.03538 | null |
2024-08-02 | A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness | Lutao Jiang et.al. | 2408.01269 | null |
2024-08-02 | Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion | Ke Li et.al. | 2408.01225 | link |
2024-08-07 | IG-SLAM: Instant Gaussian SLAM | F. Aykut Sarikamis et.al. | 2408.01126 | null |
2024-08-01 | LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting | Zhenyu Bao et.al. | 2408.00254 | null |
2024-07-31 | Localized Gaussian Splatting Editing with Contextual Awareness | Hanyuan Xiao et.al. | 2408.00083 | null |
2024-07-31 | Expressive Whole-Body 3D Gaussian Avatar | Gyeongsik Moon et.al. | 2407.21686 | null |
2024-07-30 | SceneTeller: Language-to-3D Scene Generation | Başak Melis Öcal et.al. | 2407.20727 | null |
2024-07-29 | Registering Neural 4D Gaussians for Endoscopic Surgery | Yiming Huang et.al. | 2407.20213 | null |
2024-07-29 | Radiance Fields for Robotic Teleoperation | Maximum Wilder-Smith et.al. | 2407.20194 | link |
2024-07-26 | ScalingGaussian: Enhancing 3D Content Creation with Generative Gaussian Splatting | Shen Chen et.al. | 2407.19035 | null |
2024-07-25 | GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution | Jintong Hu et.al. | 2407.18046 | null |
2024-07-24 | 3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities | Yanqi Bao et.al. | 2407.17418 | link |
2024-07-29 | DHGS: Decoupled Hybrid Gaussian Splatting for Driving Scene | Xi Shi et.al. | 2407.16600 | null |
2024-07-23 | HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images | Shreyas Singh et.al. | 2407.16503 | link |
2024-07-23 | Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance | Jiyeop Kim et.al. | 2407.16173 | null |
2024-07-22 | 6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model | Matteo Bortolon et.al. | 2407.15484 | null |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-21 | HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions | Haiyang Zhou et.al. | 2407.15187 | null |
2024-07-20 | Realistic Surgical Image Dataset Generation Based On 3D Gaussian Splatting | Tianle Zeng et.al. | 2407.14846 | null |
2024-07-19 | A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Qi Yang et.al. | 2407.14197 | link |
2024-07-19 | GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation | Florian Chabot et.al. | 2407.14108 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-20 | Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation | Zongrui Li et.al. | 2407.13584 | link |
2024-07-18 | EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting | Yuchen Weng et.al. | 2407.13520 | null |
2024-07-17 | Generalizable Human Gaussians for Sparse View Synthesis | Youngjoong Kwon et.al. | 2407.12777 | link |
2024-07-17 | Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections | Congrong Xu et.al. | 2407.12306 | null |
2024-07-16 | MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification | Zhuoxiao Li et.al. | 2407.11840 | null |
2024-07-16 | Click-Gaussian: Interactive Segmentation to Any 3D Gaussians | Seokhun Choi et.al. | 2407.11793 | null |
2024-07-16 | SlingBAG: Sliding ball adaptive growth algorithm with differentiable radiation enables super-efficient iterative 3D photoacoustic image reconstruction | Shuang Li et.al. | 2407.11781 | link |
2024-07-16 | I $^2$ -SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM | Gwangtak Bae et.al. | 2407.11347 | null |
2024-07-16 | Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering | Jingqian Wu et.al. | 2407.11343 | null |
2024-07-16 | Gaussian Splatting LK | Liuyue Xie et.al. | 2407.11309 | null |
2024-07-15 | iHuman: Instant Animatable Digital Humans From Monocular Videos | Pramish Paudel et.al. | 2407.11174 | link |
2024-07-15 | Scaling 3D Reasoning with LMMs to Large Robot Mission Environments Using Datagraphs | W. J. Meijer et.al. | 2407.10743 | null |
2024-07-15 | Interactive Rendering of Relightable and Animatable Gaussian Avatars | Youyi Zhan et.al. | 2407.10707 | link |
2024-07-16 | RecGS: Removing Water Caustic with Recurrent Gaussian Splatting | Tianyi Zhang et.al. | 2407.10318 | null |
2024-07-14 | 3DEgo: 3D Editing on the Go! | Umar Khalid et.al. | 2407.10102 | null |
2024-07-14 | SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion | Jiyuan Zhang et.al. | 2407.10062 | null |
2024-07-13 | Textured-GS: Gaussian Splatting with Spatially Defined Color and Opacity | Zhentao Huang et.al. | 2407.09733 | link |
2024-07-12 | StyleSplat: 3D Object Style Transfer with Gaussian Splatting | Sahil Jain et.al. | 2407.09473 | null |
2024-07-11 | WildGaussians: 3D Gaussian Splatting in the Wild | Jonas Kulhanek et.al. | 2407.08447 | link |
2024-07-11 | Survey on Fundamental Deep Learning 3D Reconstruction Techniques | Yonge Bai et.al. | 2407.08137 | null |
2024-07-10 | MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition | Aggelina Chatziagapi et.al. | 2407.07284 | null |
2024-07-09 | Reference-based Controllable Scene Stylization with Gaussian Splatting | Yiqun Mei et.al. | 2407.07220 | null |
2024-07-10 | 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes | Nicolas Moenne-Loccoz et.al. | 2407.07090 | null |
2024-07-07 | PICA: Physics-Integrated Clothed Avatar | Bo Peng et.al. | 2407.05324 | null |
2024-07-07 | GaussReg: Fast 3D Registration with Gaussian Splatting | Jiahao Chang et.al. | 2407.05254 | null |
2024-07-06 | SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction | Weixing Xie et.al. | 2407.05023 | link |
2024-07-05 | Gaussian Eigen Models for Human Heads | Wojciech Zielonka et.al. | 2407.04545 | null |
2024-07-12 | Segment Any 4D Gaussians | Shengxiang Ji et.al. | 2407.04504 | null |
2024-07-10 | GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction | Yuxuan Mu et.al. | 2407.04237 | null |
2024-07-04 | CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images | Junghe Lee et.al. | 2407.03923 | null |
2024-07-04 | PFGS: High Fidelity Point Cloud Rendering via Feature Splatting | Jiaxu Wang et.al. | 2407.03857 | link |
2024-07-04 | SpikeGS: Reconstruct 3D scene via fast-moving bio-inspired sensors | Yijia Guo et.al. | 2407.03771 | null |
2024-07-04 | VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors | Sungwon Hwang et.al. | 2407.02945 | link |
2024-07-03 | Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jiaxin Guo et.al. | 2407.02918 | link |
2024-07-04 | AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction | Mustafa Khan et.al. | 2407.02598 | null |
2024-07-02 | TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation | Chaofan Luo et.al. | 2407.02034 | null |
2024-07-01 | DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction | Yujin Ham et.al. | 2407.01761 | null |
2024-07-01 | GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting | Chenxin Li et.al. | 2407.01301 | null |
2024-07-01 | EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting | Chenxin Li et.al. | 2407.01029 | null |
2024-07-02 | RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering | Weikai Lin et.al. | 2407.00435 | link |
2024-06-29 | OccFusion: Rendering Occluded Humans with Generative Diffusion Priors | Adam Sun et.al. | 2407.00316 | null |
2024-06-28 | SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting | Sara Sabour et.al. | 2406.20055 | null |
2024-06-28 | EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting | Daiwei Zhang et.al. | 2406.19811 | null |
2024-06-27 | Lightweight Predictive 3D Gaussian Splats | Junli Cao et.al. | 2406.19434 | link |
2024-06-26 | Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos | Colton Stearns et.al. | 2406.18717 | link |
2024-06-26 | On Scaling Up 3D Gaussian Splatting Training | Hexu Zhao et.al. | 2406.18533 | link |
2024-06-26 | GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality | Taoran Yi et.al. | 2406.18462 | null |
2024-06-26 | Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning | Muhammad Salman Ali et.al. | 2406.18214 | link |
2024-06-26 | GS-Octree: Octree-based 3D Gaussian Splatting for Robust Object-level 3D Reconstruction Under Strong Lighting | Jiaze Li et.al. | 2406.18199 | null |
2024-06-26 | VDG: Vision-Only Dynamic Gaussian for Driving Simulation | Hao Li et.al. | 2406.18198 | null |
2024-06-25 | NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods | Jonas Kulhanek et.al. | 2406.17345 | null |
2024-06-24 | Reducing the Memory Footprint of 3D Gaussian Splatting | Panagiotis Papantonakis et.al. | 2406.17074 | null |
2024-06-24 | From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking | Xiaohao Xu et.al. | 2406.16850 | link |
2024-06-24 | ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians | Yufei Liu et.al. | 2406.16815 | null |
2024-06-23 | LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical Scene Reconstruction | Hengyu Liu et.al. | 2406.16073 | link |
2024-06-23 | Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | Yangdi Lu et.al. | 2406.15982 | null |
2024-06-21 | Taming 3DGS: High-Quality Radiance Fields with Limited Resources | Saswat Subhajyoti Mallick et.al. | 2406.15643 | link |
2024-06-21 | Gaussian Splatting to Real World Flight Navigation Transfer with Liquid Networks | Alex Quach et.al. | 2406.15149 | null |
2024-06-21 | E2GS: Event Enhanced Gaussian Splatting | Hiroyuki Deguchi et.al. | 2406.14978 | link |
2024-06-18 | Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models | Paul Henderson et.al. | 2406.13099 | null |
2024-06-18 | HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors | Panwang Pan et.al. | 2406.12459 | link |
2024-06-17 | A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets | Bernhard Kerbl et.al. | 2406.12080 | null |
2024-06-17 | RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians | Bingling Li et.al. | 2406.11836 | null |
2024-06-18 | Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting | Junha Hyung et.al. | 2406.11672 | null |
2024-06-16 | Physically Embodied Gaussian Splatting: A Realtime Correctable World Model for Robotics | Jad Abou-Chakra et.al. | 2406.10788 | null |
2024-06-14 | Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections | Jiacong Xu et.al. | 2406.10373 | null |
2024-06-14 | L4GM: Large 4D Gaussian Reconstruction Model | Jiawei Ren et.al. | 2406.10324 | null |
2024-06-14 | PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Alex Hanson et.al. | 2406.10219 | link |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-14 | GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion | Trapoom Ukarapol et.al. | 2406.09850 | link |
2024-06-14 | Unified Gaussian Primitives for Scene Representation and Rendering | Yang Zhou et.al. | 2406.09733 | null |
2024-06-13 | Modeling Ambient Scene Dynamics for Free-view Synthesis | Meng-Li Shih et.al. | 2406.09395 | null |
2024-06-13 | GGHead: Fast and Generalizable 3D Gaussian Heads | Tobias Kirschstein et.al. | 2406.09377 | null |
2024-06-14 | AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis | Swapnil Bhosale et.al. | 2406.08920 | null |
2024-06-13 | Gaussian-Forest: Hierarchical-Hybrid 3D Gaussian Splatting for Compressed Scene Modeling | Fengyi Zhang et.al. | 2406.08759 | null |
2024-06-12 | ICE-G: Image Conditional Editing of 3D Gaussian Splats | Vishnu Jaganathan et.al. | 2406.08488 | null |
2024-06-12 | Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models | Yuxuan Xue et.al. | 2406.08475 | null |
2024-06-12 | From Chaos to Clarity: 3DGS in the Dark | Zhihao Li et.al. | 2406.08300 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499 | null |
2024-06-11 | Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field | Chao Wang et.al. | 2406.07329 | null |
2024-06-10 | GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation | Haozhe Xie et.al. | 2406.06526 | link |
2024-06-10 | PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction | Danpeng Chen et.al. | 2406.06521 | null |
2024-06-10 | MVGamba: Unify 3D Content Generation as State Space Sequence Modeling | Xuanyu Yi et.al. | 2406.06367 | link |
2024-06-10 | Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis | Xin Jin et.al. | 2406.06216 | link |
2024-06-09 | RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering | Rui Zhang et.al. | 2406.05852 | null |
2024-06-09 | VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface Reconstruction | Hanlin Chen et.al. | 2406.05774 | null |
2024-06-06 | Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image | Stanislaw Szymanowicz et.al. | 2406.04343 | link |
2024-06-06 | A Survey on 3D Human Avatar Modeling – From Reconstruction to Generation | Ruihe Wang et.al. | 2406.04253 | null |
2024-06-06 | Localized Gaussian Point Management | Haosen Yang et.al. | 2406.04251 | null |
2024-06-06 | Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Diwen Wan et.al. | 2406.03697 | link |
2024-06-05 | Event3DGS: Event-based 3D Gaussian Splatting for Fast Egomotion | Tianyi Xiong et.al. | 2406.02972 | null |
2024-06-05 | Adversarial Generation of Hierarchical Gaussians for 3D Generative Model | Sangeek Hyun et.al. | 2406.02968 | link |
2024-06-04 | 3D-HGS: 3D Half-Gaussian Splatting | Haolin Li et.al. | 2406.02720 | link |
2024-06-06 | Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting | Inkyu Shin et.al. | 2406.02541 | null |
2024-06-04 | SatSplatYOLO: 3D Gaussian Splatting-based Virtual Object Detection Ensembles for Satellite Feature Recognition | Van Minh Nguyen et.al. | 2406.02533 | null |
2024-06-04 | DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering | Zhongpai Gao et.al. | 2406.02518 | null |
2024-06-04 | WE-GS: An In-the-wild Efficient 3D Gaussian Representation for Unconstrained Photo Collections | Yuze Wang et.al. | 2406.02407 | null |
2024-06-04 | Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning | Jiaxu Wang et.al. | 2406.02370 | null |
2024-06-04 | OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding | Yanmin Wu et.al. | 2406.02058 | null |
2024-06-04 | FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping | Yuzhou Ji et.al. | 2406.01916 | null |
2024-06-03 | Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting | Shaojie Ma et.al. | 2406.01593 | null |
2024-06-03 | Tetrahedron Splatting for 3D Generation | Chun Gu et.al. | 2406.01579 | link |
2024-06-03 | DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors | Tianyu Huang et.al. | 2406.01476 | link |
2024-05-31 | ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model | Yufei Wang et.al. | 2405.20721 | link |
2024-05-31 | R $^2$ -Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction | Ruyi Zha et.al. | 2405.20693 | link |
2024-05-30 | $\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving | Nan Huang et.al. | 2405.20323 | link |
2024-06-03 | A Pixel Is Worth More Than One 3D Gaussians in Single-View 3D Reconstruction | Jianghao Shen et.al. | 2405.20310 | null |
2024-05-29 | EvaGaussians: Event Stream Assisted Gaussian Splatting from Blurry Images | Wangbo Yu et.al. | 2405.20224 | null |
2024-05-30 | Object-centric Reconstruction and Tracking of Dynamic Unknown Objects using 3D Gaussian Splatting | Kuldeep R Barad et.al. | 2405.20104 | null |
2024-06-04 | PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting | Qiaowei Miao et.al. | 2405.19957 | link |
2024-05-30 | GaussianRoom: Improving 3D Gaussian Splatting with SDF Guidance and Monocular Cues for Indoor Scene Reconstruction | Haodong Xiang et.al. | 2405.19671 | null |
2024-05-30 | Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian | Wei Sun et.al. | 2405.19657 | null |
2024-05-30 | TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM | Peifeng Jiang et.al. | 2405.19614 | null |
2024-05-29 | NPGA: Neural Parametric Gaussian Avatars | Simon Giebenhain et.al. | 2405.19331 | null |
2024-05-29 | LP-3DGS: Learning to Prune 3D Gaussian Splatting | Zhaoliang Zhang et.al. | 2405.18784 | link |
2024-05-28 | GFlow: Recovering 4D World from Monocular Video | Shizun Wang et.al. | 2405.18426 | null |
2024-05-28 | 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting | Qihang Zhang et.al. | 2405.18424 | null |
2024-05-28 | 3D StreetUnveiler with Semantic-Aware 2DGS | Jingwei Xu et.al. | 2405.18416 | null |
2024-05-28 | NegGS: Negative Gaussian Splatting | Artur Kasymov et.al. | 2405.18163 | link |
2024-05-28 | A Grid-Free Fluid Solver based on Gaussian Spatial Representation | Jingrui Xing et.al. | 2405.18133 | null |
2024-05-28 | EG4D: Explicit Generation of 4D Object without Score Distillation | Qi Sun et.al. | 2405.18132 | link |
2024-05-28 | RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields | Mihnea-Bogdan Jurca et.al. | 2405.18033 | link |
2024-05-28 | FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | Yunsong Wang et.al. | 2405.17958 | link |
2024-05-28 | A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction | Bin Zhang et.al. | 2405.17891 | null |
2024-05-29 | HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction | Haoyu Zhao et.al. | 2405.17872 | link |
2024-05-27 | MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds | Jiahui Lei et.al. | 2405.17421 | link |
2024-05-27 | DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Refocusing,Defocus Rendering and Blur Removal | Yujie Wang et.al. | 2405.17351 | null |
2024-05-27 | Memorize What Matters: Emergent Scene Decomposition from Multitraverse | Yiming Li et.al. | 2405.17187 | link |
2024-05-27 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-27 | SA-GS: Semantic-Aware Gaussian Splatting for Large Scene Reconstruction with Geometry Constrain | Butian Xiong et.al. | 2405.16923 | null |
2024-05-28 | PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting | Zipeng Wang et.al. | 2405.16829 | null |
2024-05-26 | Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models | Hanwen Liang et.al. | 2405.16645 | null |
2024-05-26 | Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians | Erik Sandström et.al. | 2405.16544 | link |
2024-05-24 | Feature Splatting for Better Novel View Synthesis with Low Overlap | T. Berriel Martins et.al. | 2405.15518 | link |
2024-05-24 | GSDeformer: Direct Cage-based Deformation for 3D Gaussian Splatting | Jiajun Huang et.al. | 2405.15491 | null |
2024-05-24 | DisC-GS: Discontinuity-aware Gaussian Splatting | Haoxuan Qu et.al. | 2405.15196 | null |
2024-05-24 | HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting | Yuanhao Cai et.al. | 2405.15125 | link |
2024-05-24 | GS-Hider: Hiding Messages into 3D Gaussian Splatting | Xuanyu Zhang et.al. | 2405.15118 | null |
2024-05-23 | EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting | Jiaxu Wang et.al. | 2405.14959 | link |
2024-05-23 | Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras | Hanzhang Tu et.al. | 2405.14866 | null |
2024-05-23 | MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes | Ruiyuan Gao et.al. | 2405.14475 | null |
2024-05-23 | TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing | Teng Xu et.al. | 2405.14455 | null |
2024-05-24 | RoGS: Large Scale Road Surface Reconstruction based on 2D Gaussian Splatting | Zhiheng Feng et.al. | 2405.14342 | link |
2024-05-23 | D-MiSo: Editing Dynamic 3D Scenes using Multi-Gaussians Soup | Joanna Waczyńska et.al. | 2405.14276 | link |
2024-05-22 | DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus | Yu Chen et.al. | 2405.13943 | link |
2024-05-22 | Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances | Licheng Shen et.al. | 2405.13694 | null |
2024-05-21 | MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video | Hongsheng Wang et.al. | 2405.12806 | null |
2024-05-21 | LAGA: Layered 3D Avatar Generation and Customization via Gaussian Splatting | Jia Gong et.al. | 2405.12663 | null |
2024-05-21 | Gaussian Control with Hierarchical Semantic Graphs in 3D Human Recovery | Hongsheng Wang et.al. | 2405.12477 | null |
2024-05-20 | GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details | Boqian Li et.al. | 2405.12420 | link |
2024-05-20 | AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field | Rong Liu et.al. | 2405.12369 | link |
2024-05-20 | Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo | Tianqi Liu et.al. | 2405.12218 | link |
2024-05-20 | Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents | Guanlin Wu et.al. | 2405.12155 | null |
2024-05-20 | CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization | Jiawei Zhang et.al. | 2405.12110 | link |
2024-05-21 | Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping | Tianhao Wu et.al. | 2405.12069 | null |
2024-05-20 | MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections | Jiayue Liu et.al. | 2405.11921 | null |
2024-05-18 | Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching | Xingyu Miao et.al. | 2405.11252 | link |
2024-05-18 | MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Xinli Guo et.al. | 2405.11129 | link |
2024-05-17 | Photorealistic 3D Urban Scene Reconstruction and Point Cloud Extraction using Google Earth Imagery and Gaussian Splatting | Kyle Gao et.al. | 2405.11021 | null |
2024-05-17 | ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation | Pengzhi Li et.al. | 2405.10508 | null |
2024-05-16 | GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction | Rui Jin et.al. | 2405.10142 | null |
2024-05-15 | From NeRFs to Gaussian Splats, and Back | Siming He et.al. | 2405.09717 | link |
2024-05-13 | GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting | Haodong Chen et.al. | 2405.07472 | null |
2024-05-11 | Direct Learning of Mesh and Appearance via 3D Gaussian Splatting | Ancheng Lin et.al. | 2405.06945 | null |
2024-05-10 | OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation | Jinwei Lin et.al. | 2405.06547 | link |
2024-05-10 | I3DGS: Improve 3D Gaussian Splatting from Multiple Dimensions | Jinwei Lin et.al. | 2405.06408 | null |
2024-05-10 | MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization | Pengcheng Zhu et.al. | 2405.06241 | null |
2024-05-09 | DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation | Sitian Shen et.al. | 2405.05800 | null |
2024-05-09 | FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting | Yikun Ma et.al. | 2405.05768 | null |
2024-05-09 | NGM-SLAM: Gaussian Splatting SLAM with Radiance Field Submap | Mingrui Li et.al. | 2405.05702 | null |
Stereo Matching
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-15 | Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation | Zhen Xu et.al. | 2507.11540 | null |
2025-07-15 | Uniting the World by Dividing it: Federated Maps to Enable Spatial Applications | Sagar Bharadwaj et.al. | 2507.11437 | null |
2025-07-15 | Caveats about measuring carbon abundances in stars using the CH band | Pablo Santos-Peral et.al. | 2507.11351 | null |
2025-07-15 | MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network | Jianfei Jiang et.al. | 2507.11333 | null |
2025-07-15 | Fairness-Aware Grouping for Continuous Sensitive Variables: Application for Debiasing Face Analysis with respect to Skin Tone | Veronika Shilova et.al. | 2507.11247 | null |
2025-07-15 | Generative Click-through Rate Prediction with Applications to Search Advertising | Lingwei Kong et.al. | 2507.11246 | null |
2025-07-15 | MMOne: Representing Multiple Modalities in One Scene | Zhifeng Gu et.al. | 2507.11129 | null |
2025-07-15 | Urban delineation through the lens of commute networks: Leveraging graph embeddings to distinguish socioeconomic groups in cities | Devashish Khulbe et.al. | 2507.11057 | null |
2025-07-15 | Uncertainty Aware Mapping for Vision-Based Underwater Robots | Abhimanyu Bhowmik et.al. | 2507.10991 | null |
2025-07-15 | Terms and Conditions (Do Not) Apply: Understanding Exploitation Disparities in Design of Mobile-Based Financial Services | Lindah Kotut et.al. | 2507.10970 | null |
2025-07-14 | Cameras as Relative Positional Encoding | Ruilong Li et.al. | 2507.10496 | null |
2025-07-14 | Rows and Capabilities as Modal Effects | Wenhao Tang et.al. | 2507.10301 | null |
2025-07-14 | Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures | Xinlong Ding et.al. | 2507.10265 | null |
2025-07-14 | Is Micro-expression Ethnic Leaning? | Huai-Qian Khor et.al. | 2507.10209 | null |
2025-07-14 | Minimizing the Pretraining Gap: Domain-aligned Text-Based Person Retrieval | Shuyu Yang et.al. | 2507.10195 | null |
2025-07-14 | Simulating Biases for Interpretable Fairness in Offline and Online Classifiers | Ricardo Inácio et.al. | 2507.10154 | null |
2025-07-14 | Efficient RF Chain Selection for MIMO Integrated Sensing and Communications: A Greedy Approach | Subin Shin et.al. | 2507.09960 | null |
2025-07-13 | EventHunter: Dynamic Clustering and Ranking of Security Events from Hacker Forum Discussions | Yasir Ech-Chammakhy et.al. | 2507.09762 | null |
2025-07-13 | Pre-trained Under Noise: A Framework for Robust Bone Fracture Detection in Medical Imaging | Robby Hoover et.al. | 2507.09731 | null |
2025-07-13 | Electric Vehicle Public Charging Equity Considerations: A Systematic Review | Boyou Chen et.al. | 2507.09726 | null |
2025-07-11 | Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT | Wei Zhang et.al. | 2507.08448 | null |
2025-07-11 | PanMatch: Unleashing the Potential of Large Vision Models for Unified Matching Models | Yongjian Zhang et.al. | 2507.08400 | null |
2025-07-10 | Highly accurate simulations of asymmetric black-hole scattering and cross validation of effective-one-body models | Oliver Long et.al. | 2507.08071 | null |
2025-07-10 | Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions | Longfei Li et.al. | 2507.07978 | null |
2025-07-10 | On-Manifold Low-Thrust Maneuvering of Quasi-Periodic Orbits | Ian M. Down et.al. | 2507.07940 | null |
2025-07-10 | TRIX- Trading Adversarial Fairness via Mixed Adversarial Training | Tejaswini Medi et.al. | 2507.07768 | null |
2025-07-10 | Prime Power Residues and Blocking Sets | Bhawesh Mishra et.al. | 2507.07673 | null |
2025-07-10 | Bridging the gap in FER: addressing age bias in deep learning | F. Xavier Gaya-Morey et.al. | 2507.07638 | null |
2025-07-10 | Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects | Yuqi Cheng et.al. | 2507.07435 | null |
2025-07-09 | Dirty Data in the Newsroom: Comparing Data Preparation in Journalism and Data Science | Stephen Kasica et.al. | 2507.07238 | null |
2025-07-09 | Combining Pre-Trained Models for Enhanced Feature Representation in Reinforcement Learning | Elia Piccoli et.al. | 2507.07197 | null |
2025-07-09 | Correlations between Dust Extinction Features across All Wavelength Scales: From Diffuse Interstellar Bands to R(V) | Andrew K. Saydjari et.al. | 2507.07162 | null |
2025-07-09 | Hierarchical Feature Alignment for Gloss-Free Sign Language Translation | Sobhan Asasi et.al. | 2507.06732 | null |
2025-07-09 | Photometric Stereo using Gaussian Splatting and inverse rendering | Matéo Ducastel et.al. | 2507.06684 | null |
2025-07-09 | Transferable Parasitic Estimation via Graph Contrastive Learning and Label Rebalancing in AMS Circuits | Shan Shen et.al. | 2507.06535 | null |
2025-07-08 | Bridging Sequential Deep Operator Network and Video Diffusion: Residual Refinement of Spatio-Temporal PDE Solutions | Jaewan Park et.al. | 2507.06133 | null |
2025-07-08 | Discontinuity-aware Normal Integration for Generic Central Camera Models | Francesco Milano et.al. | 2507.06075 | null |
2025-07-08 | Bridging Perception and Language: A Systematic Benchmark for LVLMs’ Understanding of Amodal Completion Reports | Amane Watahiki et.al. | 2507.05799 | null |
2025-07-08 | Fairness-Aware Static and Dynamic Assortment Optimization: Optimal Selection with Balanced Market Share | Omar El Housni et.al. | 2507.05606 | null |
2025-07-08 | SingLoRA: Low Rank Adaptation Using a Single Matrix | David Bensaïd et.al. | 2507.05566 | null |
2025-07-07 | Incorporating Interventional Independence Improves Robustness against Interventional Distribution Shift | Gautam Sreekumar et.al. | 2507.05412 | null |
2025-07-07 | Feature Geometry for Stereo Sidescan and Forward-looking Sonar | Kalin Norman et.al. | 2507.05410 | null |
2025-07-07 | Parametric Object Coding in IVAS: Efficient Coding of Multiple Audio Objects at Low Bit Rates | Andrea Eichenseer et.al. | 2507.05409 | null |
2025-07-07 | Stereo Reproduction in the Presence of Sample Rate Offsets | Srikanth Korse et.al. | 2507.05402 | null |
2025-07-07 | Untangling Selberg from the Wilson spool: 1-loop determinants and trace formulae in (A)dS $_{3}$ | Samuel Haupfear et.al. | 2507.05358 | null |
2025-07-07 | Causal Impacts of Protected Bike Lanes on Cycling Behavior with Demographic Disparities | Marcel Moran et.al. | 2507.04936 | null |
2025-07-07 | Spatial and Semantic Embedding Integration for Stereo Sound Event Localization and Detection in Regular Videos | Davide Berghi et.al. | 2507.04845 | null |
2025-07-07 | Toward Valid Measurement Of (Un)fairness For Generative AI: A Proposal For Systematization Through The Lens Of Fair Equality of Chances | Kimberly Le Truong et.al. | 2507.04641 | null |
2025-07-07 | Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts | Yun Wang et.al. | 2507.04631 | null |
2025-07-07 | DisMS-TS: Eliminating Redundant Multi-Scale Features for Time Series Classification | Zhipeng Liu et.al. | 2507.04600 | null |
2025-07-06 | Thousand-Brains Systems: Sensorimotor Intelligence for Rapid, Robust Learning and Inference | Niels Leadholm et.al. | 2507.04494 | null |
2025-07-05 | Nested economies of scale in city mass | Kangning Huang et.al. | 2507.03960 | null |
2025-07-04 | Assessing the Viability of Wave Field Synthesis in VR-Based Cognitive Research | Benjamin Kahl et.al. | 2507.03797 | null |
2025-07-04 | Improving Social Determinants of Health Documentation in French EHRs Using Large Language Models | Adrien Bazoge et.al. | 2507.03433 | null |
2025-07-04 | CME activities on spotless days during descending phase of solar cycles 23 and 24 | Dipali Burud et.al. | 2507.03399 | null |
2025-07-02 | The Illusion of Fairness: Auditing Fairness Interventions with Audit Studies | Disa Sariola et.al. | 2507.02152 | null |
2025-07-02 | The Thin Line Between Comprehension and Persuasion in LLMs | Adrian de Wynter et.al. | 2507.01936 | null |
2025-07-02 | How Do Vision-Language Models Process Conflicting Information Across Modalities? | Tianze Hua et.al. | 2507.01790 | null |
2025-07-02 | RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather | Yuran Wang et.al. | 2507.01653 | null |
2025-07-02 | Adapting Language Models to Indonesian Local Languages: An Empirical Study of Language Transferability on Zero-Shot Settings | Rifki Afina Putri et.al. | 2507.01645 | null |
2025-07-02 | Two Cases of Non-Radial Filament Eruption and Associated CME Deflection | Kostadinka Koleva et.al. | 2507.01580 | null |
2025-07-02 | Penalizing Transparency? How AI Disclosure and Author Demographics Shape Human and AI Judgments About Writing | Inyoung Cheong et.al. | 2507.01418 | null |
2025-07-01 | Improving Stereo 3D Sound Event Localization and Detection: Perceptual Features, Stereo-specific Data Augmentation, and Distance Normalization | Jun-Wei Yeow et.al. | 2507.00874 | null |
2025-07-01 | Impact of temperature asymmetry and small fraction of static positive ions on the relaxed states of a relativistic hot pair plasma | Usman Shazad et.al. | 2507.00760 | null |
2025-07-01 | Renormalization group based implicit function approach to connecting orbits | Pengfei Guo et.al. | 2507.00749 | null |
2025-07-01 | Self-organization of earth’s inner magnetospheric multi-ion plasma | Usman Shazad et.al. | 2507.00734 | null |
2025-06-30 | Development of Hybrid Artificial Intelligence Training on Real and Synthetic Data: Benchmark on Two Mixed Training Strategies | Paul Wachter et.al. | 2506.24093 | null |
2025-06-30 | Simultaneous Super-Resolution of Spatial and Spectral Imaging with a Camera Array and Notch Filters | Peng Lin et.al. | 2506.24014 | null |
2025-06-30 | Statistical Modeling for Accurate Characterization of Doppler Effect in LEO-Terrestrial Networks | Islam M. Tanash et.al. | 2506.23817 | null |
2025-06-30 | AdFair-CLIP: Adversarial Fair Contrastive Language-Image Pre-training for Chest X-rays | Chenlang Yi et.al. | 2506.23467 | null |
2025-06-29 | Zero-disparity Distribution Synthesis: Fast Exact Calculation of Chi-Squared Statistic Distribution for Discrete Uniform Histograms | Nikola Banić et.al. | 2506.23416 | null |
2025-06-29 | Datasets for Fairness in Language Models: An In-Depth Survey | Jiale Zhang et.al. | 2506.23411 | null |
2025-06-29 | Modeling European Electricity Market Integration during turbulent times | Francesco Ravazzolo et.al. | 2506.23289 | null |
2025-06-29 | Event-based Stereo Visual-Inertial Odometry with Voxel Map | Zhaoxing Zhang et.al. | 2506.23078 | null |
2025-06-28 | Feature-Wise Mixing for Mitigating Contextual Bias in Predictive Supervised Learning | Yash Vardhan Tomar et.al. | 2506.23033 | null |
2025-06-28 | SPICE-HL3: Single-Photon, Inertial, and Stereo Camera dataset for Exploration of High-Latitude Lunar Landscapes | David Rodríguez-Martínez et.al. | 2506.22956 | null |
2025-06-27 | Towards Fair Rankings: Leveraging LLMs for Gender Bias Detection and Measurement | Maryam Mousavian et.al. | 2506.22372 | null |
2025-06-27 | NoticeLight: Embracing Socio-Technical Asymmetry through Tangible Peripheral Robotic Embodiment in Hybrid Collaboration | Marie Altmann et.al. | 2506.22125 | null |
2025-06-27 | Quantifying Institutional Gender Inequality in Contemporary Visual Art | Xindi Wang et.al. | 2506.22103 | null |
2025-06-27 | Seismic resolution enhancement via deep Learning with Knowledge Distillation and Domain Adaptation | Hanpeng Cai et.al. | 2506.22018 | null |
2025-06-27 | SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images | Naftaly Wambugu et.al. | 2506.21945 | null |
2025-06-26 | Counterfactual Voting Adjustment for Quality Assessment and Fairer Voting in Online Platforms with Helpfulness Evaluation | Chang Liu et.al. | 2506.21362 | null |
2025-06-26 | ToosiCubix: Monocular 3D Cuboid Labeling via Vehicle Part Annotations | Behrooz Nasihatkon et.al. | 2506.21358 | null |
2025-06-26 | ESMStereo: Enhanced ShuffleMixer Disparity Upsampling for Real-Time and Accurate Stereo Matching | Mahmoud Tahmasebi et.al. | 2506.21091 | null |
2025-06-26 | The Role of Cyclopean-Eye in Stereo Vision | Sherlon Almeida da Silva et.al. | 2506.20900 | null |
2025-06-25 | THIRDEYE: Cue-Aware Monocular Depth Estimation via Brain-Inspired Multi-Stage Fusion | Calin Teodor Ioan et.al. | 2506.20877 | null |
2025-06-25 | StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation | Haodong Li et.al. | 2506.20756 | null |
2025-06-25 | Don’t Hash Me Like That: Exposing and Mitigating Hash-Induced Unfairness in Local Differential Privacy | Berkay Kemal Balioglu et.al. | 2506.20290 | null |
2025-06-25 | Effects of flame macrostructures on the combustion dynamics of novel counter-rotating radial swirl injector in a model can combustor | SK Thirumalaikumaran et.al. | 2506.20138 | null |
2025-06-24 | Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation | Jun Wang et.al. | 2506.19774 | null |
2025-06-24 | Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders | Matyas Bohacek et.al. | 2506.19708 | null |
2025-06-24 | Recurrent Visual Feature Extraction and Stereo Attentions for CT Report Generation | Yuanhe Tian et.al. | 2506.19665 | null |
2025-06-24 | AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models | Zeyu Li et.al. | 2506.19505 | null |
2025-06-24 | MuBench: Assessment of Multilingual Capabilities of Large Language Models Across 61 Languages | Wenhan Han et.al. | 2506.19468 | null |
2025-06-24 | Online camera-pose-free stereo endoscopic tissue deformation recovery with tissue-invariant vision-biomechanics consistency | Jiahe Chen et.al. | 2506.19388 | null |
2025-06-23 | MOSCARD – Causal Reasoning and De-confounding for Multimodal Opportunistic Screening of Cardiovascular Adverse Events | Jialu Pi et.al. | 2506.19174 | null |
2025-06-23 | Identifying Causally-Robust Mediators of Health Disparities: A Review and Simulation Studies With Directed Acyclic Graphs | Soojin Park et.al. | 2506.19047 | null |
2025-06-23 | Simulation-Based Sensitivity Analysis in Optimal Treatment Regimes and Causal Decomposition with Individualized Interventions | Soojin Park et.al. | 2506.19010 | null |
2025-06-23 | Causal Decomposition Analysis with Synergistic Interventions: A Triply-Robust Machine Learning Approach to Addressing Multiple Dimensions of Social Disparities | Soojin Park et.al. | 2506.18994 | null |
2025-06-23 | Light of Normals: Unified Feature Representation for Universal Photometric Stereo | Hong Li et.al. | 2506.18882 | null |
2025-06-23 | Evaluating Multichannel Speech Enhancement Algorithms at the Phoneme Scale Across Genders | Nasser-Eddine Monir et.al. | 2506.18691 | null |
2025-06-23 | NOVA: Navigation via Object-Centric Visual Autonomy for High-Speed Target Tracking in Unstructured GPS-Denied Environments | Alessandro Saviolo et.al. | 2506.18689 | null |
2025-06-23 | Bias vs Bias – Dawn of Justice: A Fair Fight in Recommendation Systems | Tahsin Alamgir Kheya et.al. | 2506.18327 | null |
2025-06-22 | Mental Health Equity in LLMs: Leveraging Multi-Hop Question Answering to Detect Amplified and Silenced Perspectives | Batool Haider et.al. | 2506.18116 | null |
2025-06-22 | StereoTacTip: Vision-based Tactile Sensing with Biomimetic Skin-Marker Arrangements | Chenghua Lu et.al. | 2506.18040 | null |
2025-06-22 | Feedback Driven Multi Stereo Vision System for Real-Time Event Analysis | Mohamed Benkedadra et.al. | 2506.17910 | null |
2025-06-21 | In-Context Learning Strategies Emerge Rationally | Daniel Wurgaft et.al. | 2506.17859 | null |
2025-06-21 | Learning to Dock: A Simulation-based Study on Closing the Sim2Real Gap in Autonomous Underwater Docking | Kevin Chang et.al. | 2506.17823 | null |
2025-06-21 | Optimization-Free Patch Attack on Stereo Depth Estimation | Hangcheng Liu et.al. | 2506.17632 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186 | link |
2025-06-20 | Are Bias Evaluation Methods Biased ? | Lina Berrayana et.al. | 2506.17111 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110 | null |
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063 | null |
2025-06-20 | LunarLoc: Segment-Based Global Localization on the Moon | Annika Thomas et.al. | 2506.16940 | link |
2025-06-20 | DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches | Yun Xing et.al. | 2506.16690 | null |
2025-06-19 | External Evaluation of Discrimination Mitigation Efforts in Meta’s Ad Delivery | Basileal Imana et.al. | 2506.16560 | null |
2025-06-19 | PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking | Yan Zhou et.al. | 2506.16379 | null |
2025-06-19 | Heterotopic energy for Sobolev mappings | Antoine Detaille et.al. | 2506.16204 | null |
2025-06-19 | Solar Transient Recognition Using Deep Learning (STRUDL) for heliospheric imager data | Maike Bauer et.al. | 2506.16194 | null |
2025-06-18 | Mono-Modalizing Extremely Heterogeneous Multi-Modal Medical Image Registration | Kyobin Choo et.al. | 2506.15596 | null |
2025-06-18 | SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models’ Knowledge of Indian Culture | Arijit Maji et.al. | 2506.15355 | null |
2025-06-18 | Dissecting the gender divide: Authorship and acknowledgment in scientific publications | Keigo Kusumegi et.al. | 2506.15237 | null |
2025-06-18 | Transit for All: Mapping Equitable Bike2Subway Connection using Region Representation Learning | Min Namgung et.al. | 2506.15113 | null |
2025-06-18 | 3D Vision-tactile Reconstruction from Infrared and Visible Images for Robotic Fine-grained Tactile Perception | Yuankai Lin et.al. | 2506.15087 | null |
2025-06-17 | Time-Optimized Safe Navigation in Unstructured Environments through Learning Based Depth Completion | Jeffrey Mao et.al. | 2506.14975 | null |
2025-06-17 | Cost-Aware Routing for Efficient Text-To-Image Generation | Qinchan et.al. | 2506.14753 | null |
2025-06-17 | DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning | Kunal Swami et.al. | 2506.14709 | null |
2025-06-17 | One Size Fits None: Rethinking Fairness in Medical AI | Roland Roller et.al. | 2506.14400 | null |
2025-06-17 | Consensus Power Inequality: A Comparative Study of Blockchain Networks | Kamil Tylinski et.al. | 2506.14393 | null |
2025-06-16 | Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble | Zhiqi Wang et.al. | 2506.13972 | link |
2025-06-16 | Bias Delayed is Bias Denied? Assessing the Effect of Reporting Delays on Disparity Assessments | Jennah Gosciak et.al. | 2506.13735 | link |
2025-06-16 | Multiview Geometric Regularization of Gaussian Splatting for Accurate Radiance Fields | Jungeon Kim et.al. | 2506.13508 | null |
2025-06-16 | Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling | Wenmiao Gao et.al. | 2506.13455 | null |
2025-06-16 | Cloud-to-cloud velocity dispersions across a Local arm segment | Lixia Yuan et.al. | 2506.13424 | null |
2025-06-16 | DVP-MVS++: Synergize Depth-Normal-Edge and Harmonized Visibility Prior for Multi-View Stereo | Zhenlong Yuan et.al. | 2506.13215 | null |
2025-06-16 | Equitable Electronic Health Record Prediction with FAME: Fairness-Aware Multimodal Embedding | Nikkie Hooman et.al. | 2506.13104 | null |
2025-06-14 | Recent Advances and Future Directions in Literature-Based Discovery | Andrej Kastrin et.al. | 2506.12385 | null |
2025-06-14 | Path-specific effects for pulse-oximetry guided decisions in critical care | Kevin Zhang et.al. | 2506.12371 | null |
2025-06-16 | A Reference Model and Patterns for Production Event Data Enrichment | Mark van der Pas et.al. | 2506.11502 | null |
2025-06-16 | SemanticST: Spatially Informed Semantic Graph Learning for Clustering, Integration, and Scalable Analysis of Spatial Transcriptomics | Roxana Zahedi et.al. | 2506.11491 | link |
2025-06-13 | A Watermark for Auto-Regressive Image Generation Models | Yihan Wu et.al. | 2506.11371 | null |
2025-06-12 | Forbidden configurations for coherency | Victoria Gould et.al. | 2506.11321 | null |
2025-06-12 | Principled Approaches for Extending Neural Architectures to Function Spaces for Operator Learning | Julius Berner et.al. | 2506.10973 | link |
2025-06-12 | FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition | Jongsuk Kim et.al. | 2506.10747 | null |
2025-06-12 | Balancing Tails when Comparing Distributions: Comprehensive Equity Index (CEI) with Application to Bias Evaluation in Operational Face Biometrics | Imanol Solano et.al. | 2506.10564 | null |
2025-06-12 | EasyDRAM: An FPGA-based Infrastructure for Fast and Accurate End-to-End Evaluation of Emerging DRAM Techniques | Oğuzhan Canpolat et.al. | 2506.10441 | link |
2025-06-12 | Transcorrelated Theory for Transition Metal Atoms | Kristoffer Simula et.al. | 2506.10429 | null |
2025-06-12 | PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting | Lintao Xiang et.al. | 2506.10335 | null |
2025-06-12 | A Novel Feedforward Youla Parameterization Method for Avoiding Local Minima in Stereo Image Based Visual Servoing Control | Rongfei Li et.al. | 2506.10252 | null |
2025-06-10 | Down But Not Out: The Case of Long-Period Comet C/2021 O3 (Panstarrs) | David Jewitt. Jing Li et.al. | 2506.09263 | null |
2025-06-10 | Princeton365: A Diverse Dataset with Accurate Camera Pose | Karhan Kayan et.al. | 2506.09035 | null |
2025-06-10 | Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia | Katelyn Xiaoying Mei et.al. | 2506.08846 | link |
2025-06-11 | Towards Fair Representation: Clustering and Consensus | Diptarka Chakraborty et.al. | 2506.08673 | null |
2025-06-09 | Unmasking inequility: socio-economic determinants and gender disparities in Maharashtra and India’s health outcomes – Insights from NFHS-5 | Sharmishtha Raghuvanshi et.al. | 2506.08206 | null |
2025-06-09 | GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors | Wenlong Meng et.al. | 2506.08188 | null |
2025-06-09 | Balanced Area Deprivation Index (bADI): Enhancing social determinants of health indices to strengthen their association with healthcare clinical outcomes, utilization and costs | Mohammad Amin Morid et.al. | 2506.08131 | null |
2025-06-09 | Unraveling Ethereum’s Mempool: The Impact of Fee Fairness, Transaction Prioritization, and Consensus Efficiency | S M Mostaq Hossain et.al. | 2506.07988 | null |
2025-06-09 | LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement | Dimitris Panagopoulos et.al. | 2506.07915 | null |
2025-06-09 | Erbium-implanted WS2 flakes with room-temperature photon emission at telecom wavelengths | Guadalupe García-Arellano et.al. | 2506.07746 | null |
2025-06-09 | Federated In-Context Learning: Iterative Refinement for Improved Answer Quality | Ruhan Wang et.al. | 2506.07440 | null |
2025-06-09 | The impact of extracurricular education on socioeconomic mobility in Japan: an application of causal machine learning | Yang Qiang et.al. | 2506.07421 | null |
2025-06-08 | Analyzing Breast Cancer Survival Disparities by Race and Demographic Location: A Survival Analysis Approach | Ramisa Farha et.al. | 2506.07191 | null |
2025-06-08 | Optimal Transport Driven Asymmetric Image-to-Image Translation for Nuclei Segmentation of Histological Images | Suman Mahapatra et.al. | 2506.07023 | null |
2025-06-08 | End-to-End Probabilistic Framework for Learning with Hard Constraints | Utkarsh Utkarsh et.al. | 2506.07003 | null |
2025-06-07 | Spatial Disparities in Fire Shelter Accessibility: Capacity Challenges in the Palisades and Eaton Fires | Su Yeon Han et.al. | 2506.06803 | null |
2025-06-06 | Enhancing Situational Awareness in Underwater Robotics with Multi-modal Spatial Perception | Pushyami Kaveti et.al. | 2506.06476 | null |
2025-06-06 | PyGemini: Unified Software Development towards Maritime Autonomy Systems | Kjetil Vasstein et.al. | 2506.06262 | null |
2025-06-06 | Masked Language Models are Good Heterogeneous Graph Generalizers | Jinyu Yang et.al. | 2506.06157 | link |
2025-06-06 | SVD: Spatial Video Dataset | M. H. Izadimehr et.al. | 2506.06037 | null |
2025-06-06 | Restereo: Diffusion stereo video generation and restoration | Xingchang Huang et.al. | 2506.06023 | null |
2025-06-06 | Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning | Fan Yang et.al. | 2506.05997 | null |
2025-06-06 | A Culturally-Rich Romanian NLP Dataset from “Who Wants to Be a Millionaire?” Videos | Alexandru-Gabriel Ganea et.al. | 2506.05991 | null |
2025-06-06 | NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces | Pierluigi Zama Ramirez et.al. | 2506.05815 | null |
2025-06-06 | Efficient Online RFT with Plug-and-Play LLM Judges: Unlocking State-of-the-Art Performance | Rudransh Agnihotri et.al. | 2506.05748 | null |
2025-06-06 | Aerial Multi-View Stereo via Adaptive Depth Range Inference and Normal Cues | Yimei Liu et.al. | 2506.05655 | null |
2025-06-05 | Planets similar in size are often dissimilar in interior | E. Mamonova et.al. | 2506.05089 | link |
2025-06-05 | Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer | Filip Slezak et.al. | 2506.04908 | null |
2025-06-05 | Is It JUST Semantics? A Case Study of Discourse Particle Understanding in LLMs | William Sheffield et.al. | 2506.04534 | null |
2025-06-04 | The Latent Space Hypothesis: Toward Universal Medical Representation Learning | Salil Patel et.al. | 2506.04515 | null |
2025-06-04 | Edge interventions can mitigate demographic and prestige disparities in the Computer Science coauthorship network | Kate Barnes et.al. | 2506.04435 | link |
2025-06-04 | MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale | Ran Xu et.al. | 2506.04405 | null |
2025-06-06 | Enduring Disparities in the Workplace: A Pilot Study in the AI Community | Yunusa Simpa Abdulsalam et.al. | 2506.04305 | null |
2025-06-04 | Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation | Tianyu Huang et.al. | 2506.04225 | null |
2025-06-04 | Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness | Stephen R. Pfohl et.al. | 2506.04193 | link |
2025-06-04 | Lions and Muons: Optimization via Stochastic Frank-Wolfe | Maria-Eleni Sfyraki et.al. | 2506.04192 | null |
2025-06-04 | Multi-view Surface Reconstruction Using Normal and Reflectance Cues | Robin Bruneau et.al. | 2506.04115 | link |
2025-06-04 | When Fairness Isn’t Statistical: The Limits of Machine Learning in Evaluating Legal Reasoning | Claire Barale et.al. | 2506.03913 | null |
2025-06-04 | FedFACT: A Provable Framework for Controllable Group-Fairness Calibration in Federated Learning | Li Zhang et.al. | 2506.03777 | null |
2025-06-04 | Analyzing Pension Fund Mortality with Gaussian Processes in a Sub Population Framework | Eduardo F. L. de Melo et.al. | 2506.03584 | null |
2025-06-04 | Time-Domain Excitation of Complex Resonances | Asaf Farhi et.al. | 2506.03485 | null |
2025-06-03 | Targeted Forgetting of Image Subgroups in CLIP Models | Zeliang Zhang et.al. | 2506.03117 | null |
2025-06-03 | A Multi-Agent Framework for Mitigating Dialect Biases in Privacy Policy Question-Answering Systems | Đorđe Klisura et.al. | 2506.02998 | null |
2025-06-03 | Towards a Japanese Full-duplex Spoken Dialogue System | Atsumoto Ohashi et.al. | 2506.02979 | null |
2025-06-03 | TaxAgent: How Large Language Model Designs Fiscal Policy | Jizhou Wang et.al. | 2506.02838 | null |
2025-06-03 | HORUS: A Mixed Reality Interface for Managing Teams of Mobile Robots | Omotoye Shamsudeen Adekoya et.al. | 2506.02622 | null |
2025-06-03 | On the Language and Gender Biases in PSTN, VoIP and Neural Audio Codecs | Kemal Altwlkany et.al. | 2506.02545 | null |
2025-06-03 | Gender Inequality in English Textbooks Around the World: an NLP Approach | Tairan Liu et.al. | 2506.02425 | null |
2025-06-03 | Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology | Wenhao Tang et.al. | 2506.02408 | link |
2025-06-02 | ImpRAG: Retrieval-Augmented Generation with Implicit Queries | Wenzheng Zhang et.al. | 2506.02279 | null |
2025-06-02 | Tunable magnons in a dual-gated 2D antiferromagnet | Nele Stetzuhn et.al. | 2506.02185 | null |
2025-05-30 | Predicting the Past: Estimating Historical Appraisals with OCR and Machine Learning | Mihir Bhaskar et.al. | 2505.24676 | link |
2025-05-30 | Thermodynamic Signatures of Gaussian Entanglement Beyond Entropy | Beatriz Polo et.al. | 2505.24596 | null |
2025-05-30 | 50 years of spin glass theory | David Sherrington et.al. | 2505.24432 | null |
2025-05-30 | A Unified Scale Factor for the Cosmic Evolution -Motivated by Brane World Models- | Farzin Safarzadeh-Maleki et.al. | 2505.24420 | null |
2025-05-30 | Verifiable Weighted Secret Sharing | Kareem Shehata et.al. | 2505.24289 | null |
2025-05-30 | Evolution of Gas Velocity Dispersion in Discs from $z\sim8$ to $z\sim0.5$ | E. Wisnioski et.al. | 2505.24129 | null |
2025-05-30 | CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs | Ai Jian et.al. | 2505.24120 | null |
2025-05-29 | Estimation of Gender Wage Gap in the University of North Carolina System | Zihan Zhang et.al. | 2505.24078 | null |
2025-05-29 | Can Emotion Fool Anti-spoofing? | Aurosweta Mahapatra et.al. | 2505.23962 | null |
2025-05-29 | Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts | Xuweiyi Chen et.al. | 2505.23926 | null |
2025-05-29 | ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks | Akashah Shabbir et.al. | 2505.23752 | link |
2025-05-29 | Let’s Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM’s Math Capability | Ruida Wang et.al. | 2505.23703 | null |
2025-05-29 | Errors in Stereo Geometry Induce Distance Misperception | Raffles Xingqi Zhu et.al. | 2505.23685 | null |
2025-05-29 | Dual-Task Graph Neural Network for Joint Seizure Onset Zone Localization and Outcome Prediction using Stereo EEG | Syeda Abeera Amir et.al. | 2505.23669 | null |
2025-05-29 | PAN-Crafter: Learning Modality-Consistent Alignment for PAN-Sharpening | Jeonghyeok Do et.al. | 2505.23367 | null |
2025-05-29 | Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data | Lingkai Kong et.al. | 2505.23062 | null |
2025-05-29 | Diverse Prototypical Ensembles Improve Robustness to Subpopulation Shift | Minh Nguyen Nhat To et.al. | 2505.23027 | link |
2025-05-28 | Talent or Luck? Evaluating Attribution Bias in Large Language Models | Chahat Raj et.al. | 2505.22910 | link |
2025-05-28 | Permissioned LLMs: Enforcing Access Control in Large Language Models | Bargav Jayaraman et.al. | 2505.22860 | null |
2025-05-28 | Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese | Hanjia Lyu et.al. | 2505.22645 | link |
2025-05-28 | Overpartitions and Kaur, Rana, and Eyyunni’s mex sequences | Brian Hopkins et.al. | 2505.22588 | null |
2025-05-28 | Beyond Leaders and Laggards: A Typology of Renewable Energy Adoption Trajectories with Evidence from Off-Grid Communities | Roni Blushtein-Livnon et.al. | 2505.22456 | null |
2025-05-28 | MObyGaze: a film dataset of multimodal objectification densely annotated by experts | Julie Tores et.al. | 2505.22084 | null |
2025-05-28 | D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples | Zijing Hu et.al. | 2505.22002 | null |
2025-05-27 | From prosthetic memory to prosthetic denial: Auditing whether large language models are prone to mass atrocity denialism | Roberto Ulloa et.al. | 2505.21753 | null |
2025-05-27 | MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs | Raoyuan Zhao et.al. | 2505.21693 | link |
2025-05-27 | Data and Technology for Equitable Public Administration: Understanding City Government Employees’ Challenges and Needs | Angie Zhang et.al. | 2505.21682 | null |
2025-05-27 | ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models | Dingming Li et.al. | 2505.21500 | null |
2025-05-27 | Subgroups Matter for Robust Bias Mitigation | Anissa Alloula et.al. | 2505.21363 | link |
2025-05-27 | The Multilingual Divide and Its Impact on Global AI Safety | Aidan Peppin et.al. | 2505.21344 | null |
2025-05-27 | Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models | Zhaoqing Li et.al. | 2505.21237 | null |
2025-05-27 | Interpreting Social Bias in LVLMs via Information Flow Analysis and Multi-Round Dialogue Evaluation | Zhengyang Ji et.al. | 2505.21106 | null |
2025-05-27 | On VLMs for Diverse Tasks in Multimodal Meme Classification | Deepesh Gavit et.al. | 2505.20937 | null |
2025-05-28 | Stereo Radargrammetry Using Deep Learning from Airborne SAR Images | Tatsuya Sasayama et.al. | 2505.20876 | null |
2025-05-27 | Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties | Jiyoung Lee et.al. | 2505.20875 | null |
2025-05-27 | Aggregation Buffer: Revisiting DropEdge with a New Parameter Block | Dooho Lee et.al. | 2505.20840 | null |
2025-05-27 | TrustSkin: A Fairness Pipeline for Trustworthy Facial Affect Analysis Across Skin Tone | Ana M. Cabanas et.al. | 2505.20637 | null |
2025-05-26 | Spurious Privacy Leakage in Neural Networks | Chenxiang Zhang et.al. | 2505.20095 | null |
2025-05-26 | Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud | Natsuki Takama et.al. | 2505.19854 | null |
2025-05-26 | Deep learning based spatial aliasing reduction in beamforming for audio capture | Mateusz Guzik et.al. | 2505.19781 | null |
2025-05-26 | SACM: SEEG-Audio Contrastive Matching for Chinese Speech Decoding | Hongbin Wang et.al. | 2505.19652 | link |
2025-05-26 | Evaluating Robustness of Large Audio Language Models to Audio Injection: An Empirical Study | Guanyu Hou et.al. | 2505.19598 | null |
2025-05-26 | VTBench: Comprehensive Benchmark Suite Towards Real-World Virtual Try-on Models | Hu Xiaobin et.al. | 2505.19571 | link |
2025-05-26 | AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and Healthcare | Ying Xiao et.al. | 2505.19562 | link |
2025-05-26 | SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams | Zhuoheng Gao et.al. | 2505.19487 | null |
2025-05-25 | Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding | Shiyue Wang et.al. | 2505.19219 | null |
2025-05-25 | MMATH: A Multilingual Benchmark for Mathematical Reasoning | Wenyang Luo et.al. | 2505.19126 | link |
2025-05-23 | Frankentext: Stitching random text fragments into long-form narratives | Chau Minh Pham et.al. | 2505.18128 | link |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024 | null |
2025-05-23 | Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras | Masataka Kobayashi et.al. | 2505.17582 | null |
2025-05-23 | H2:Towards Efficient Large-Scale LLM Training on Hyper-Heterogeneous Cluster over 1,000 Chips | Ding Tang et.al. | 2505.17548 | null |
2025-05-23 | Learning Representational Disparities | Pavan Ravishankar et.al. | 2505.17533 | null |
2025-05-23 | Transparency and Proportionality in Post-Processing Algorithmic Bias Correction | Juliett Suárez Ferreira et.al. | 2505.17525 | null |
2025-05-23 | FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow | Haoyu Sun et.al. | 2505.17399 | link |
2025-05-23 | Pulse duration dependence of material response in ultrafast laser-induced surface-penetrating nanovoids in fused silica | Guodong Zhang et.al. | 2505.17385 | null |
2025-05-22 | Mitigate One, Skew Another? Tackling Intersectional Biases in Text-to-Image Models | Pushkar Shukla et.al. | 2505.17280 | null |
2025-05-22 | A Framework for Multi-View Multiple Object Tracking using Single-View Multi-Object Trackers on Fish Data | Chaim Chai Elchik et.al. | 2505.17201 | null |
2025-05-22 | NY Real Estate Racial Equity Analysis via Applied Machine Learning | Sanjana Chalavadi et.al. | 2505.16946 | null |
2025-05-22 | Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining | Shangquan Sun et.al. | 2505.16811 | null |
2025-05-22 | Optimising the decision threshold in a weighted voting system: The case of the IMF’s Board of Governors | Dóra Gréta Petróczy et.al. | 2505.16654 | null |
2025-05-22 | M2SVid: End-to-End Inpainting and Refinement for Monocular-to-Stereo Video Conversion | Nina Shvetsova et.al. | 2505.16565 | null |
2025-05-22 | Utilizing citation index and synthetic quality measure to compare Wikipedia languages across various topics | Włodzimierz Lewoniewski et.al. | 2505.16506 | null |
2025-05-22 | KoBALT: Korean Benchmark For Advanced Linguistic Tasks | Hyopil Shin et.al. | 2505.16125 | null |
2025-05-22 | Continually Self-Improving Language Models for Bariatric Surgery Question–Answering | Yash Kumar Atri et.al. | 2505.16102 | null |
2025-05-21 | In Silico Trials for Sex-Specific patient Inclusion Criteria in Cardiac Resynchronization Therapy: Advancing Precision in Heart Failure Treatment | Shuang Qian et.al. | 2505.15708 | null |
2025-05-21 | Kernel PCA for Out-of-Distribution Detection: Non-Linear Kernel Selections and Approximations | Kun Fang et.al. | 2505.15284 | link |
2025-05-20 | DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis | Prashanth Vijayaraghavan et.al. | 2505.14971 | null |
2025-05-20 | The Great Comets of 1843 and 1882 at Their Previous Return to Perihelion in the Twelfth Century: One Spectacular, the Other Dull | Zdenek Sekanina et.al. | 2505.14662 | null |
2025-05-20 | Early Diagnosis of Atrial Fibrillation Recurrence: A Large Tabular Model Approach with Structured and Unstructured Clinical Data | Ane G. Domingo-Aldama et.al. | 2505.14643 | null |
2025-05-21 | Mitigating Subgroup Disparities in Multi-Label Speech Emotion Recognition: A Pseudo-Labeling and Unsupervised Learning Approach | Yi-Cheng Lin et.al. | 2505.14449 | null |
2025-05-20 | MindVote: How LLMs Predict Human Decision-Making in Social Media Polls | Xutao Mao et.al. | 2505.14422 | null |
2025-05-20 | Diving into the Fusion of Monocular Priors for Generalized Stereo Matching | Chengtang Yao et.al. | 2505.14414 | link |
2025-05-20 | Accuracy and Fairness of Facial Recognition Technology in Low-Quality Police Images: An Experiment With Synthetic Faces | Maria Cuellar et.al. | 2505.14320 | null |
2025-05-20 | Breaking Language Barriers or Reinforcing Bias? A Study of Gender and Racial Disparities in Multilingual Contrastive Vision Language Models | Zahraa Al Sahili et.al. | 2505.14160 | null |
2025-05-20 | M3Depth: Wavelet-Enhanced Depth Estimation on Mars via Mutual Boosting of Dual-Modal Data | Junjie Li et.al. | 2505.14159 | null |
2025-05-20 | Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts | Xi Chen et.al. | 2505.14088 | null |
2025-05-20 | AppleGrowthVision: A large-scale stereo dataset for phenological analysis, fruit detection, and 3D reconstruction in apple orchards | Laura-Sophia von Hirschhausen et.al. | 2505.14029 | null |
2025-05-19 | The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation | David Stap et.al. | 2505.13090 | null |
2025-05-19 | Unifying concepts in information-theoretic time-series analysis | Annie G. Bryant et.al. | 2505.13080 | null |
2025-05-20 | 3D Visual Illusion Depth Estimation | Chengtang Yao et.al. | 2505.13061 | link |
2025-05-19 | Multi-Level Aware Preference Learning: Enhancing RLHF for Complex Multi-Instruction Tasks | Ruopei Sun et.al. | 2505.12845 | null |
2025-05-19 | On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding | Haoyuan Wu et.al. | 2505.12723 | null |
2025-05-19 | IA-MVS: Instance-Focused Adaptive Depth Sampling for Multi-View Stereo | Yinzhe Wang et.al. | 2505.12714 | null |
2025-05-19 | Rethinking Predictive Modeling for LLM Routing: When Simple kNN Beats Complex Learned Routers | Yang Li et.al. | 2505.12601 | null |
2025-05-18 | On long-duration storage, weather uncertainty and limited foresight | Felix Schmidt et.al. | 2505.12538 | link |
2025-05-18 | Depth Transfer: Learning to See Like a Simulator for Real-World Drone Navigation | Hang Yu et.al. | 2505.12428 | null |
2025-05-18 | Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents | Shuo Han et.al. | 2505.12204 | null |
2025-05-16 | SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision | Utsav Rai et.al. | 2505.11439 | null |
2025-05-16 | MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection | Shrutarv Awasthi et.al. | 2505.11282 | link |
2025-05-16 | Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization | Yanhao Jia et.al. | 2505.11217 | null |
2025-05-16 | A Cautionary Tale on Integrating Studies with Disparate Outcome Measures for Causal Inference | Harsh Parikh et.al. | 2505.11014 | null |
2025-05-16 | Patient-Specific Dynamic Digital-Physical Twin for Coronary Intervention Training: An Integrated Mixed Reality Approach | Shuo Wang et.al. | 2505.10902 | null |
2025-05-16 | From Embeddings to Accuracy: Comparing Foundation Models for Radiographic Classification | Xue Li et.al. | 2505.10823 | null |
2025-05-15 | TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation | Manthan Patel et.al. | 2505.10696 | null |
2025-05-15 | Artificial Intelligence Bias on English Language Learners in Automatic Scoring | Shuchen Guo et.al. | 2505.10643 | null |
2025-05-15 | Multi-contrast laser endoscopy for in vivo gastrointestinal imaging | Taylor L. Bobrow et.al. | 2505.10492 | null |
2025-05-15 | ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention | Jintian Shao et.al. | 2505.10222 | null |
2025-05-15 | VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality | Xuechang Tu et.al. | 2505.10144 | link |
2025-05-15 | Large-Scale Gaussian Splatting SLAM | Zhe Xin et.al. | 2505.09915 | null |
2025-05-14 | ZENN: A Thermodynamics-Inspired Computational Framework for Heterogeneous Data-Driven Modeling | Shun Wang et.al. | 2505.09851 | null |
2025-05-14 | Should I Stay or Should I Go Now? An Investigation into Gender Differences in the Impact of Switching Jobs on Earnings | Emily Winskill et.al. | 2505.09791 | null |
2025-05-14 | Enabling Group Fairness in Graph Unlearning via Bi-level Debiasing | Yezi Liu et.al. | 2505.09702 | null |
2025-05-14 | Fairness-aware Bayes optimal functional classification | Xiaoyu Hu et.al. | 2505.09471 | null |
2025-05-14 | RobustSpring: Benchmarking Robustness to Image Corruptions for Optical Flow, Scene Flow and Stereo | Jenny Schmalfuss et.al. | 2505.09368 | null |
2025-05-14 | Toward Fair Federated Learning under Demographic Disparities and Data Imbalance | Qiming Wu et.al. | 2505.09295 | link |
2025-05-14 | Signatures of asymmetry: Gravitational wave memory and the parity violation | Indranil Chakraborty et.al. | 2505.09096 | null |
2025-05-13 | Ages and metallicities of quiescent galaxies: confronting broadband ( $UVJ$ ) colours with stellar absorption lines | Chloe M. Cheng et.al. | 2505.08858 | null |
2025-05-13 | Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World | Yuran Wang et.al. | 2505.08607 | null |
2025-05-13 | BizChat: Scaffolding AI-Powered Business Planning for Small Business Owners Across Digital Skill Levels | Quentin Romero Lauro et.al. | 2505.08493 | null |
2025-05-13 | A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering | Chuanzhi Xu et.al. | 2505.08438 | null |
2025-05-13 | Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion | Anle Ke et.al. | 2505.08281 | link |
2025-05-13 | Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images | Ziteng Liu et.al. | 2505.08178 | null |
2025-05-14 | Fast Text-to-Audio Generation with Adversarial Post-Training | Zachary Novack et.al. | 2505.08175 | link |
2025-05-13 | MoKD: Multi-Task Optimization for Knowledge Distillation | Zeeshan Hayder et.al. | 2505.08170 | null |
2025-05-12 | Unequal Journeys to Food Markets: Continental-Scale Evidence from Open Data in Africa | Robert Benassai-Dalmau et.al. | 2505.07913 | link |
2025-05-12 | Disparity in sound speeds: implications for unitarity and effective potential in quantum field theory | Dmitry S. Ageev et.al. | 2505.07794 | null |
2025-05-12 | Higher-Order Convolution Improves Neural Predictivity in the Retina | Simone Azeglio et.al. | 2505.07620 | null |
2025-05-11 | Empirical Analysis of Asynchronous Federated Learning on Heterogeneous Devices: Efficiency, Fairness, and Privacy Trade-offs | Samaneh Mohammadi et.al. | 2505.07041 | null |
2025-05-11 | Enhancing Monocular Height Estimation via Sparse LiDAR-Guided Correction | Jian Song et.al. | 2505.06905 | null |
2025-05-11 | ContribChain: A Stress-Balanced Blockchain Sharding Protocol with Node Contribution Awareness | Xinpeng Huang et.al. | 2505.06899 | null |
2025-05-11 | Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies | Zhengmi Tang et.al. | 2505.06855 | null |
2025-05-11 | Feedback-enhanced distant entanglement of magnon and phonon modes with atomic ensembles in coupled cavities | Muhammad Awais Altaf et.al. | 2505.06838 | null |
2025-05-10 | Behind the Byline: A Large-Scale Study of Scientific Author Contributions | Itai Assraf et.al. | 2505.06721 | null |
2025-05-09 | Adaptive Wiping: Adaptive contact-rich manipulation through few-shot imitation learning with Force-Torque feedback and pre-trained object representations | Chikaha Tsuji et.al. | 2505.06451 | null |
2025-05-09 | 2D Quon Language: Unifying Framework for Cliffords, Matchgates, and Beyond | Byungmin Kang et.al. | 2505.06336 | null |
2025-05-09 | Who’s at Risk? Effects of Inflation on Unemployment Risk | Hie Joo Ahn et.al. | 2505.05757 | null |
2025-05-08 | Trends and Gender Disparities in Grades and Grade Penalties Among Bioscience and Health-Related Major Students Before, During, and After COVID-19 Remote Instruction | Alysa Malespina et.al. | 2505.05667 | null |
2025-05-07 | StereoINR: Cross-View Geometry Consistent Stereo Super Resolution with Implicit Neural Representation | Yi Liu et.al. | 2505.05509 | null |
2025-05-08 | Facets of Disparate Impact: Evaluating Legally Consistent Bias in Machine Learning | Jarren Briscoe et.al. | 2505.05471 | link |
2025-05-08 | Synthesis of innovation and obsolescence | Edward D. Lee et.al. | 2505.05182 | null |
2025-05-08 | DispBench: Benchmarking Disparity Estimation to Synthetic Corruptions | Shashank Agnihotri et.al. | 2505.05091 | link |
2025-05-08 | Learning Item Representations Directly from Multimodal Features for Effective Recommendation | Xin Zhou et.al. | 2505.04960 | link |
2025-05-08 | Enhancing Blockchain Cross Chain Interoperability: A Comprehensive Survey | Zhihong Deng et.al. | 2505.04934 | null |
2025-05-08 | Advanced 3D Imaging Approach to TSV/TGV Metrology and Inspection Using Only Optical Microscopy | Gugeong Sung et.al. | 2505.04913 | null |
2025-05-06 | Algorithmic Accountability in Small Data: Sample-Size-Induced Bias Within Classification Metrics | Jarren Briscoe et.al. | 2505.03992 | link |
2025-05-06 | Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach | Srecharan Selvam et.al. | 2505.03702 | null |
2025-05-06 | Blending 3D Geometry and Machine Learning for Multi-View Stereopsis | Vibhas Vats et.al. | 2505.03470 | link |
2025-05-06 | Domain Adversarial Training for Mitigating Gender Bias in Speech-based Mental Health Detection | June-Woo Kim et.al. | 2505.03359 | null |
2025-05-06 | The Impact of Large Language Models on K-12 Education in Rural India: A Thematic Analysis of Student Volunteer’s Perspectives | Harshita Goyal et.al. | 2505.03163 | null |
2025-05-06 | Towards Application-Specific Evaluation of Vision Models: Case Studies in Ecology and Biology | Alex Hoi Hang Chan et.al. | 2505.02825 | null |
2025-05-05 | Exceptional, but Separate: Precursors to Spontaneous Symmetry Breaking | Lewis Hill et.al. | 2505.02691 | null |
2025-05-05 | VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection | Hao Cheng et.al. | 2505.02331 | link |
2025-05-04 | SparSplat: Fast Multi-View Reconstruction with Generalizable 2D Gaussian Splatting | Shubhendu Jena et.al. | 2505.02175 | null |
2025-05-04 | Representation Learning of Limit Order Book: A Comprehensive Study and Benchmarking | Muyao Zhong et.al. | 2505.02139 | null |
2025-05-04 | Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents | Christian Schroeder de Witt et.al. | 2505.02077 | null |
2025-05-03 | Mitigating Group-Level Fairness Disparities in Federated Visual Language Models | Chaomeng Chen et.al. | 2505.01851 | null |
2025-05-03 | AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting | Junhao Shi et.al. | 2505.01799 | null |
2025-05-03 | T-REX: Vision-Based System for Autonomous Leaf Detection and Grasp Estimation | Srecharan Selvam et.al. | 2505.01654 | null |
2025-05-02 | Toward a Unified Theory of Catalysis | Frank Nelson Crespilho et.al. | 2505.01213 | null |
2025-05-02 | Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods | Mahdi Dhaini et.al. | 2505.01198 | link |
2025-05-02 | Enhancing MHD model accuracy and CME forecasting by constraining coronal plasma properties with Faraday rotation | Salvatore Mancuso et.al. | 2505.01080 | null |
2025-05-02 | Destructive Interference: Encoding Loss in the Overlap | Nik Aberle et.al. | 2505.00987 | null |
2025-05-01 | Quantum Modular Forms and Resurgence | Eleanor McSpirit et.al. | 2505.00799 | null |
2025-05-01 | HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection | Deanna Emery et.al. | 2505.00506 | null |
2025-04-30 | Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis | Michal Geyer et.al. | 2505.00135 | null |
2025-04-30 | Stereo X-ray tomography on deformed object tracking | Zhenduo Shang et.al. | 2505.00122 | null |
2025-04-30 | An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation | Yaming Ou et.al. | 2504.21826 | null |
2025-04-30 | Assessing Racial Disparities in Healthcare Expenditures Using Causal Path-Specific Effects | Xiaxian Ou et.al. | 2504.21688 | link |
2025-04-30 | Lights Out, Stress In: Assessing Stress Amidst Power and Energy Challenges in Bangladesh | Faisal Quaiyyum et.al. | 2504.21541 | null |
2025-04-30 | DGFNet: End-to-End Audio-Visual Source Separation Based on Dynamic Gating Fusion | Yinfeng Yu et.al. | 2504.21366 | null |
2025-04-30 | CMD: Constraining Multimodal Distribution for Domain Adaptation in Stereo Matching | Zhelun Shen et.al. | 2504.21302 | null |
2025-04-30 | LSTM+Geo with xgBoost Filtering: A Novel Approach for Race and Ethnicity Imputation with Reduced Bias | S. Chalavadi et.al. | 2504.21259 | null |
2025-04-29 | OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification | Shangyu Li et.al. | 2504.20964 | link |
2025-04-29 | Imaging on the Edge: Mapping Object Corners and Edges with Stereo X-ray Tomography | Zhenduo Shang et.al. | 2504.20892 | null |
2025-04-29 | Partitioned Memory Storage Inspired Few-Shot Class-Incremental learning | Renye Zhang et.al. | 2504.20797 | null |
2025-04-29 | The Anyonic Quantum Carnot Engine | H S Mani et.al. | 2504.20596 | null |
2025-04-29 | Mordell–Lang and disparate Selmer ranks of odd twists of some superelliptic curves over global function fields | Sun Woo Park et.al. | 2504.20594 | null |
2025-04-29 | Hetu v2: A General and Scalable Deep Learning System with Hierarchical and Heterogeneous Single Program Multiple Data Annotations | Haoyang Li et.al. | 2504.20490 | link |
2025-04-29 | The two-clock problem in population dynamics | Kaan Öcal et.al. | 2504.20388 | null |
2025-04-29 | Neural Stereo Video Compression with Hybrid Disparity Compensation | Shiyin Jiang et.al. | 2504.20383 | null |
2025-04-29 | Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views | Jiang Wu et.al. | 2504.20378 | link |
2025-04-28 | $\texttt{SAGE}$ : A Generic Framework for LLM Safety Evaluation | Madhur Jindal et.al. | 2504.19674 | link |
2025-04-27 | Mitigating Bias in Facial Recognition Systems: Centroid Fairness Loss Optimization | Jean-Rémy Conti et.al. | 2504.19370 | null |
2025-04-27 | Unscented Particle Filter for Visual-inertial Navigation using IMU and Landmark Measurements | Khashayar Ghanizadegan et.al. | 2504.19318 | null |
2025-04-27 | OPAL: Visibility-aware LiDAR-to-OpenStreetMap Place Recognition via Adaptive Radial Fusion | Shuhao Kang et.al. | 2504.19258 | null |
2025-04-26 | Minimum Cost Nowhere-zero Flows and Cut-balanced Orientations | Karthekeyan Chandrasekaran et.al. | 2504.18767 | null |
2025-04-25 | Fairness Is More Than Algorithms: Racial Disparities in Time-to-Recidivism | Jessy Xinyi Han et.al. | 2504.18629 | null |
2025-04-25 | Are We on the Same Page? Examining Developer Perception Alignment in Open Source Code Reviews | Yoseph Berhanu Alebachew et.al. | 2504.18407 | null |
2025-04-25 | Study on Real-Time Road Surface Reconstruction Using Stereo Vision | Deepak Ghimire et.al. | 2504.18112 | null |
2025-04-29 | Factorization Formula Connecting the Shape Functions of Heavy Meson in QCD and Heavy Quark Effective Theory | Wei Wang et.al. | 2504.18018 | null |
2025-04-24 | LLM Agent Swarm for Hypothesis-Driven Drug Discovery | Kevin Song et.al. | 2504.17967 | null |
2025-04-24 | Set Phasers to Stun: Beaming Power and Control to Mobile Robots with Laser Light | Charles J. Carver et.al. | 2504.17865 | null |
2025-04-24 | The Fourth Monocular Depth Estimation Challenge | Anton Obukhov et.al. | 2504.17787 | null |
2025-04-24 | Spectral Irradiance Variability in Lyman-Alpha Emission During Solar Flares | Luke Majury et.al. | 2504.17667 | null |
2025-04-24 | Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization | Guangyang Zeng et.al. | 2504.17410 | null |
2025-04-24 | StereoMamba: Real-time and Robust Intraoperative Stereo Disparity Estimation via Long-range Spatial Dependencies | Xu Wang et.al. | 2504.17401 | null |
2025-04-24 | Evaluating and Mitigating Bias in AI-Based Medical Text Generation | Xiuying Chen et.al. | 2504.17279 | null |
2025-04-23 | Structural roles and gender disparities in corruption networks | Arthur A. B. Pessa et.al. | 2504.17086 | null |
2025-04-23 | Procedural Dataset Generation for Zero-Shot Stereo Matching | David Yan et.al. | 2504.16930 | null |
2025-04-23 | An Accelerated Camera 3DMA Framework for Efficient Urban GNSS Multipath Estimation | Shiyao Lv et.al. | 2504.16906 | null |
2025-04-23 | A model of the heliocentric dust ring on Venus orbit | Ariane Courtot et.al. | 2504.16610 | null |
2025-04-23 | Tinkering Against Scaling | Bolun Zhang et.al. | 2504.16546 | null |
2025-04-22 | Long-term disparities in the recovery of urban mobility after COVID-19 in Latin America | Carmen Cabrera et.al. | 2504.15871 | null |
2025-04-22 | DERD-Net: Learning Depth from Event-based Ray Densities | Diego de Oliveira Hitzges et.al. | 2504.15863 | null |
2025-04-22 | Trustworthy Decentralized Autonomous Machines: A New Paradigm in Automation Economy | Fernando Castillo et.al. | 2504.15676 | null |
2025-04-22 | Multimodal Perception for Goal-oriented Navigation: A Survey | I-Tak Ieong et.al. | 2504.15643 | null |
2025-04-22 | Yet Another Diminishing Spark: Low-level Cyberattacks in the Israel-Gaza Conflict | Anh V. Vu et.al. | 2504.15592 | null |
2025-04-22 | The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks | Minghao Wu et.al. | 2504.15521 | null |
2025-04-21 | Real-Time Sentiment Insights from X Using VADER, DistilBERT, and Web-Scraped Data | Yanampally Abhiram Reddy et.al. | 2504.15448 | null |
2025-04-21 | MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video | Minh-Quan Viet Bui et.al. | 2504.15122 | null |
2025-04-21 | Robust and Real-time Surface Normal Estimation from Stereo Disparities using Affine Transformations | Csongor Csanad Kariko et.al. | 2504.15121 | null |
2025-04-21 | Sum-Rate Maximization for NOMA-Assisted Pinching-Antenna Systems | Ziwu Zhou et.al. | 2504.15006 | null |
2025-04-21 | Reliable Multi-Modal Object Re-Identification via Modality-Aware Graph Reasoning | Xixi Wan et.al. | 2504.14847 | null |
2025-04-21 | Aligning Beam with Imbalanced Multi-modality: A Generative Federated Learning Approach | Jiahui Liang et.al. | 2504.14835 | null |
2025-04-20 | Polynomial-Time Constant-Approximation for Fair Sum-of-Radii Clustering | Sina Bagheri Nezhad et.al. | 2504.14683 | null |
2025-04-20 | Regret-aware Re-ranking for Guaranteeing Two-sided Fairness and Accuracy in Recommender Systems | Xiaopeng Ye et.al. | 2504.14550 | null |
2025-04-20 | Anisotropic quark propagation and Zeeman effect in an external magnetic field | Minghui Ding et.al. | 2504.14504 | null |
2025-04-20 | sEEG-based Encoding for Sentence Retrieval: A Contrastive Learning Approach to Brain-Language Alignment | Yijun Liu et.al. | 2504.14468 | null |
2025-04-19 | Balancing Fairness and Performance in Healthcare AI: A Gradient Reconciliation Approach | Xiaoyang Wang et.al. | 2504.14388 | null |
2025-04-18 | Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion | Sandipan Dhar et.al. | 2504.13791 | null |
2025-04-18 | Predictors of Childhood Vaccination Uptake in England: An Explainable Machine Learning Analysis of Longitudinal Regional Data (2021-2024) | Amin Noroozi et.al. | 2504.13755 | null |
2025-04-18 | Divergent LLM Adoption and Heterogeneous Convergence Paths in Research Writing | Cong William Lin et.al. | 2504.13629 | null |
2025-04-18 | Open-Loop and Closed-Loop Strategies for Linear Quadratic Mean Field Games: The Direct Approach | Yong Liang et.al. | 2504.13496 | null |
2025-04-17 | Addressing the Minor-Embedding Problem in Quantum Annealing and Evaluating State-of-the-Art Algorithm Performance | Aitor Gómez-Tejedor et.al. | 2504.13376 | null |
2025-04-17 | Generalized Parton Distributions from Symbolic Regression | Anusha Reddy Singireddy et.al. | 2504.13289 | null |
2025-04-17 | Prospects for Detecting Signs of Life on Exoplanets in the JWST Era | Sara Seager et.al. | 2504.12946 | null |
2025-04-17 | Quantifying walkable accessibility to urban services: An application to Florence, Italy | Leonardo Boncinelli et.al. | 2504.12934 | null |
2025-04-17 | Unsupervised Cross-Domain 3D Human Pose Estimation via Pseudo-Label-Guided Global Transforms | Jingjing Liu et.al. | 2504.12699 | null |
2025-04-16 | Reinforcement Learning from Human Feedback | Nathan Lambert et.al. | 2504.12501 | link |
2025-04-16 | A Survey on Archetypal Analysis | Aleix Alcacer et.al. | 2504.12392 | null |
2025-04-16 | Regist3R: Incremental Registration with Stereo Foundation Model | Sidun Liu et.al. | 2504.12356 | null |
2025-04-16 | Towards Explainable Fusion and Balanced Learning in Multimodal Sentiment Analysis | Miaosen Luo et.al. | 2504.12151 | null |
2025-04-16 | Stochastic Quadrature Rules for Solving PDEs using Neural Networks | Jamie M. Taylor et.al. | 2504.11976 | link |
2025-04-16 | Benchmarking Mutual Information-based Loss Functions in Federated Learning | Sarang S et.al. | 2504.11877 | null |
2025-04-16 | Boosting Multi-View Stereo with Depth Foundation Model in the Absence of Real-World Labels | Jie Zhu et.al. | 2504.11845 | null |
2025-04-15 | Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models | Maria Teleki et.al. | 2504.11431 | link |
2025-04-15 | Breaking the TDD Flow for Over-the-Air Phase Synchronization in Distributed Antenna Systems | Khac-Hoang Ngo et.al. | 2504.11411 | null |
2025-04-15 | Towards global equity in political polarization research | Max Falkenberg et.al. | 2504.11090 | null |
2025-04-15 | Meta-learning For Few-Shot Time Series Crop Type Classification: A Benchmark On The EuroCropsML Dataset | Joana Reuss et.al. | 2504.11022 | null |
2025-04-15 | Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy | Botao Zhao et.al. | 2504.10819 | null |
2025-04-14 | FuzzSense: Towards A Modular Fuzzing Framework for Autonomous Driving Software | Andrew Roberts et.al. | 2504.10717 | null |
2025-04-14 | Emotion Alignment: Discovering the Gap Between Social Media and Real-World Sentiments in Persian Tweets and Images | Sina Elahimanesh et.al. | 2504.10662 | null |
2025-04-14 | Who Speaks for Ethics? How Demographics Shape Ethical Advocacy in Software Development | Lauren Olson et.al. | 2504.10276 | null |
2025-04-14 | Localized Cultural Knowledge is Conserved and Controllable in Large Language Models | Veniamin Veselovsky et.al. | 2504.10191 | null |
2025-04-14 | Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution | Yiwen Wang et.al. | 2504.09887 | link |
2025-04-14 | RAKG:Document-level Retrieval Augmented Knowledge Graph Construction | Hairong Zhang et.al. | 2504.09823 | link |
2025-04-13 | FastRSR: Efficient and Accurate Road Surface Reconstruction from Bird’s Eye View | Yuting Zhao et.al. | 2504.09535 | null |
2025-04-12 | “It’s not a representation of me”: Examining Accent Bias and Digital Exclusion in Synthetic AI Voice Services | Shira Michel et.al. | 2504.09346 | null |
2025-04-12 | CrossLink: A Decentralized Framework for Secure Cross-Chain Smart Contract Execution | Tahrim Hossain et.al. | 2504.09319 | link |
2025-04-12 | PathVLM-R1: A Reinforcement Learning-Driven Reasoning Model for Pathology Visual-Language Tasks | Jianyu Wu et.al. | 2504.09258 | null |
2025-04-15 | FairACE: Achieving Degree Fairness in Graph Neural Networks via Contrastive and Adversarial Group-Balanced Training | Jiaxin Liu et.al. | 2504.09210 | null |
2025-04-12 | Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence | Yuxu Lu et.al. | 2504.09197 | null |
2025-04-11 | Application of machine learning models to predict the relationship between air pollution, ecosystem degradation, and health disparities and lung cancer in Vietnam | Ngoc Hong Tran et.al. | 2504.08651 | null |
2025-04-11 | seeBias: A Comprehensive Tool for Assessing and Visualizing AI Fairness | Yilin Ning et.al. | 2504.08418 | link |
2025-04-10 | Adaptive Bounded Exploration and Intermediate Actions for Data Debiasing | Yifan Yang et.al. | 2504.08151 | link |
2025-04-10 | Experimental Analysis of Quadcopter Drone Hover Constraints for Localization Improvements | Uthman Olawoye et.al. | 2504.07843 | null |
2025-04-10 | FairEval: Evaluating Fairness in LLM-Based Recommendations with Personality Awareness | Chandan Kumar Sah et.al. | 2504.07801 | null |
2025-04-10 | MMLA: Multi-Environment, Multi-Species, Low-Altitude Aerial Footage Dataset | Jenna Kline et.al. | 2504.07744 | null |
2025-04-10 | Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation | Yanglin Huang et.al. | 2504.07691 | null |
2025-04-10 | Tuning chirality amplitude at ultrafast timescales | Hiroki Ueda et.al. | 2504.07599 | null |
2025-04-10 | Echoes of Disagreement: Measuring Disparity in Social Consensus | Marios Papachristou et.al. | 2504.07480 | link |
2025-04-10 | Continuity conditions weaker than lower semi-continuity | Jacob Westerhout et.al. | 2504.07451 | null |
2025-04-10 | ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement | Anning Hu et.al. | 2504.07418 | null |
2025-04-10 | FAIR-SIGHT: Fairness Assurance in Image Recognition via Simultaneous Conformal Thresholding and Dynamic Output Repair | Arya Fayyazi et.al. | 2504.07395 | null |
2025-04-09 | Universal neural wave functions for high-pressure hydrogen | David Linteau et.al. | 2504.07062 | null |
2025-04-09 | Identifying Key Challenges of Hardness-Based Resampling | Pawel Pukowski et.al. | 2504.07031 | null |
2025-04-09 | Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting | Daiwei Zhang et.al. | 2504.06978 | null |
2025-04-09 | Communicating complex statistical models to a public health audience: translating science into action with the FARSI approach | Mattia Stival et.al. | 2504.06787 | null |
2025-04-09 | A Novel Nonlinear Fertility Catastrophe Model Based on Thom’s Differential Equations of Morphogenesis | Rolando Gonzales Martinez et.al. | 2504.06668 | null |
2025-04-08 | Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring | José A. Pilartes-Congo et.al. | 2504.06464 | null |
2025-04-08 | Computing for Community-Based Economies: A Sociotechnical Ecosystem for Democratic, Egalitarian and Sustainable Futures | Kwame Porter Robinson et.al. | 2504.06114 | null |
2025-04-08 | Co-evolution of cooperation and resource allocation in the advantageous environment-based spatial multi-game using adaptive control | Chengbin Sun et.al. | 2504.06112 | null |
2025-04-08 | AI analysis of medical images at scale as a health disparities probe: a feasibility demonstration using chest radiographs | Heather M. Whitney et.al. | 2504.05990 | null |
2025-04-08 | Uncovering Fairness through Data Complexity as an Early Indicator | Juliett Suárez Ferreira et.al. | 2504.05923 | null |
2025-04-08 | Thermodynamic supercriticality and complex phase diagram for the AdS black hole | Zhen-Ming Xu et.al. | 2504.05708 | null |
2025-04-08 | Fairness in Machine Learning-based Hand Load Estimation: A Case Study on Load Carriage Tasks | Arafat Rahman et.al. | 2504.05610 | null |
2025-04-07 | Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation | Manvi Agarwal et.al. | 2504.05364 | null |
2025-04-07 | A BLE and UWB Beacon-Assist Framework for Multiuser Augmented Reality Synchronization Across Multiple Devices in Shared Environments | Maitree Hirunteeyakul et.al. | 2504.05293 | null |
2025-04-07 | CARE: Aligning Language Models for Regional Cultural Awareness | Geyang Guo et.al. | 2504.05154 | link |
2025-04-07 | Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and Semidensification | Yasuhiro Yao et.al. | 2504.05148 | link |
2025-04-07 | M-Prometheus: A Suite of Open Multilingual LLM Judges | José Pombal et.al. | 2504.04953 | link |
2025-04-07 | CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images | Cheng Chen et.al. | 2504.04753 | null |
2025-04-06 | eKalibr-Stereo: Continuous-Time Spatiotemporal Calibration for Event-Based Stereo Visual Systems | Shuolong Chen et.al. | 2504.04451 | link |
2025-04-05 | Exploration of Approaches for Robustness and Safety in a Low Code Open Environment for Factory Automation | Gustavo Quiros A. et.al. | 2504.04224 | null |
2025-04-05 | Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources | Zihao Li et.al. | 2504.04152 | null |
2025-04-05 | The Labor Market Incidence of New Technologies | Tianyu Fan et.al. | 2504.04047 | null |
2025-04-05 | Disparate Privacy Vulnerability: Targeted Attribute Inference Attacks and Defenses | Ehsanul Kabir et.al. | 2504.04033 | null |
2025-04-04 | SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding | Yimin Wei et.al. | 2504.03254 | link |
2025-04-03 | Bias in Large Language Models Across Clinical Applications: A Systematic Review | Thanathip Suenghataiphorn et.al. | 2504.02917 | null |
2025-04-03 | Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets | Chuning Zhu et.al. | 2504.02792 | null |
2025-04-03 | The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context | Nikhil Verma et.al. | 2504.02708 | null |
2025-04-02 | Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks | Ali Al-Kaswan et.al. | 2504.01850 | null |
2025-04-02 | SOLAQUA: SINTEF Ocean Large Aquaculture Robotics Dataset | Sveinung Johan Ohrem et.al. | 2504.01790 | null |
2025-04-02 | DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image | Jijun Xiang et.al. | 2504.01596 | link |
2025-04-02 | Hyperbolic Diffusion Recommender Model | Meng Yuan et.al. | 2504.01541 | null |
2025-04-02 | ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue | Thomas Pritchard et.al. | 2504.01261 | link |
2025-04-01 | Feature-Preserving Mesh Decimation for Normal Integration | Moritz Heep et.al. | 2504.00867 | null |
2025-04-01 | Bridging the Gap: Integrating Ethics and Environmental Sustainability in AI Research and Practice | Alexandra Sasha Luccioni et.al. | 2504.00797 | null |
2025-04-01 | Alleviating Performance Disparity in Adversarial Spatiotemporal Graph Learning Under Zero-Inflated Distribution | Songran Bai et.al. | 2504.00721 | null |
2025-04-01 | ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection | Xiaoxuan Zhu et.al. | 2504.00695 | link |
2025-04-01 | Using complex prompts to identify fine-grained biases in image generation through ChatGPT-4o | Marinus Ferreira et.al. | 2504.00388 | null |
2025-03-31 | Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views | Chong Bao et.al. | 2503.24382 | null |
2025-03-31 | BAR-Analytics: A Web-based Platform for Analyzing Information Spreading Barriers in News: Comparative Analysis Across Multiple Barriers and Events | Abdul Sittar et.al. | 2503.24220 | null |
2025-03-31 | Ride-Sourcing Vehicle Rebalancing with Service Accessibility Guarantees via Constrained Mean-Field Reinforcement Learning | Matej Jusup et.al. | 2503.24183 | link |
2025-03-31 | Is LLM the Silver Bullet to Low-Resource Languages Machine Translation? | Yewei Song et.al. | 2503.24102 | null |
2025-03-31 | Level the Level: Balancing Game Levels for Asymmetric Player Archetypes With Reinforcement Learning | Florian Rupp et.al. | 2503.24099 | link |
2025-03-31 | Multispacecraft Observations of the 2024 September 9 Backside Solar Eruption that Resulted in a Sustained Gamma Ray Emission Event | Nat Gopalswamy et.al. | 2503.23852 | null |
2025-03-31 | A PINN Methodology for Temperature Field Reconstruction in the PIV Measurement Plane: Case of Rayleigh-Bénard Convection | Marie-Christine Volk et.al. | 2503.23801 | null |
2025-03-31 | Consistency-aware Self-Training for Iterative-based Stereo Matching | Jingyi Zhou et.al. | 2503.23747 | null |
2025-03-31 | Detail-aware multi-view stereo network for depth estimation | Haitao Tian et.al. | 2503.23684 | null |
2025-03-30 | Third Harmonic Structure in an Interplanetary Type II Radio Burst and Other Energetic Phenomena During the 2024 September 14 Solar Eruption | Nat Gopalswamy et.al. | 2503.23584 | null |
2025-03-28 | Benchmarking Ultra-Low-Power $μ$ NPUs | Josh Millar et.al. | 2503.22567 | null |
2025-03-28 | A Causal Framework to Measure and Mitigate Non-binary Treatment Discrimination | Ayan Majumdar et.al. | 2503.22454 | link |
2025-03-28 | Scaling Laws of Scientific Discovery with AI and Robot Scientists | Pengsong Zhang et.al. | 2503.22444 | null |
2025-03-28 | MVSAnywhere: Zero-Shot Multi-View Stereo | Sergio Izquierdo et.al. | 2503.22430 | null |
2025-03-28 | Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion | Songsong Yu et.al. | 2503.22262 | null |
2025-03-28 | An Advanced Ensemble Deep Learning Framework for Stock Price Prediction Using VAE, Transformer, and LSTM Model | Anindya Sarkar et.al. | 2503.22192 | null |
2025-03-28 | Reflection on Code Contributor Demographics and Collaboration Patterns in the Rust Communit | Rohit Dandamudi et.al. | 2503.22066 | null |
2025-03-28 | Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges | Ukcheol Shin et.al. | 2503.22060 | link |
2025-03-27 | Improved Tomographic Reconstruction of 3D Global Coronal Density from STEREO/COR1 Observations | Tongjiang Wang et.al. | 2503.22041 | null |
2025-03-27 | The commutativity problem for effective varieties of formal series, and applications | Lorenzo Clemente et.al. | 2503.21697 | null |
2025-03-27 | Exploring the Energy Landscape of RBMs: Reciprocal Space Insights into Bosons, Hierarchical Learning and Symmetry Breaking | J. Quetzalcóatl Toledo-Marin et.al. | 2503.21536 | null |
2025-03-27 | ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo | Yuxi Hu et.al. | 2503.21525 | null |
2025-03-27 | Behavioral response to mobile phone evacuation alerts | Erick Elejalde et.al. | 2503.21497 | null |
2025-03-27 | GPU-Accelerated Charge-Equilibration for Shadow Molecular Dynamics in Python | Mehmet Cagri Kaymak et.al. | 2503.21176 | link |
2025-03-26 | Can Large Language Models Predict Associations Among Human Attitudes? | Ana Ma et.al. | 2503.21011 | null |
2025-03-26 | CH $_3$ OH as a User-Friendly Density Probe: Calibration and Beyond | A. Giannetti et.al. | 2503.20944 | null |
2025-03-26 | SaViD: Spectravista Aesthetic Vision Integration for Robust and Discerning 3D Object Detection in Challenging Environments | Tanmoy Dam et.al. | 2503.20614 | link |
2025-03-26 | Emergent properties and the multiscale characterization challenge in condensed matter, from crystals to complex materials: a Review | Elisabetta Nocerino et.al. | 2503.20266 | null |
2025-03-26 | Attention IoU: Examining Biases in CelebA using Attention Maps | Aaron Serianni et.al. | 2503.19846 | link |
2025-03-26 | A Survey on Event-driven 3D Reconstruction: Development under Different Categories | Chuanzhi Xu et.al. | 2503.19753 | null |
2025-03-25 | Fairness in Proof of Team Sprint (PoTS): Evaluating Reward Distribution Across Performance Levels | Naoki Yonezawa et.al. | 2503.19301 | null |
2025-03-25 | ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency | Yang Ren et.al. | 2503.19283 | link |
2025-03-24 | Information-Seeking Decision Strategies Mitigate Risk in Dynamic, Uncertain Environments | Nicholas W. Barendregt et.al. | 2503.19107 | link |
2025-03-25 | Learning to segment anatomy and lesions from disparately labeled sources in brain MRI | Meva Himmetoglu et.al. | 2503.18840 | null |
2025-03-24 | LeanStereo: A Leaner Backbone based Stereo Network | Rafia Rahim et.al. | 2503.18557 | link |
2025-03-24 | Distilling Stereo Networks for Performant and Efficient Leaner Networks | Rafia Rahim et.al. | 2503.18544 | link |
2025-03-24 | Natural Language Processing for Electronic Health Records in Scandinavian Languages: Norwegian, Swedish, and Danish | Ashenafi Zebene Woldaregay et.al. | 2503.18539 | null |
2025-03-24 | PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model | Junyuan Gao et.al. | 2503.18484 | link |
2025-03-24 | PS-EIP: Robust Photometric Stereo Based on Event Interval Profile | Kazuma Kitazawa et.al. | 2503.18341 | null |
2025-03-24 | Vision-Guided Loco-Manipulation with a Snake Robot | Adarsh Salagame et.al. | 2503.18308 | null |
2025-03-24 | RAU: Towards Regularized Alignment and Uniformity for Representation Learning in Recommendation | Xi Wu et.al. | 2503.18300 | null |
2025-03-24 | Fact-checking AI-generated news reports: Can LLMs catch their own lies? | Jiayi Yao et.al. | 2503.18293 | null |
2025-03-24 | GI-SLAM: Gaussian-Inertial SLAM | Xulang Liu et.al. | 2503.18275 | null |
2025-03-21 | Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors | Wonbong Jang et.al. | 2503.17316 | null |
2025-03-21 | Uncovering cooling center usage as an adaptation strategy for hurricane-blackout-heat compound hazards during Hurricane Beryl (2024) | Tianle Duan et.al. | 2503.17292 | null |
2025-03-21 | Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers | Gaojie Jin et.al. | 2503.17172 | null |
2025-03-21 | Exploring Few-Shot Object Detection on Blood Smear Images: A Case Study of Leukocytes and Schistocytes | Davide Antonio Mura et.al. | 2503.17107 | null |
2025-03-21 | TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting | Jianchuan Chen et.al. | 2503.17032 | null |
2025-03-21 | Exploring the Role of Women in Hugging Face Organizations | Maria Tubella Salinas et.al. | 2503.17000 | link |
2025-03-21 | DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery | Jiadong Tang et.al. | 2503.16964 | null |
2025-03-21 | A Flexible Fairness Framework with Surrogate Loss Reweighting for Addressing Sociodemographic Disparities | Wen Xu et.al. | 2503.16836 | null |
2025-03-20 | RESFL: An Uncertainty-Aware Framework for Responsible Federated Learning by Balancing Privacy, Fairness and Utility in Autonomous Vehicles | Dawood Wasif et.al. | 2503.16251 | null |
2025-03-20 | Variance-Aware Noisy Training: Hardening DNNs against Unstable Analog Computations | Xiao Wang et.al. | 2503.16183 | null |
2025-03-19 | Quantum entropy as a harbinger of factorizability | Henry Bloss et.al. | 2503.15603 | null |
2025-03-19 | Evaluating Bias in Retrieval-Augmented Medical Question-Answering Systems | Yuelyu Ji et.al. | 2503.15454 | null |
2025-03-19 | Beacon2Science: Enhancing STEREO/HI beacon data1 with machine learning for efficient CME tracking | Justin Le Louëdec et.al. | 2503.15288 | link |
2025-03-19 | EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds | Yuanchao Yue et.al. | 2503.15284 | link |
2025-03-19 | Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening | Zihan Cao et.al. | 2503.14975 | null |
2025-03-19 | Body-Hand Modality Expertized Networks with Cross-attention for Fine-grained Skeleton Action Recognition | Seungyeon Cho et.al. | 2503.14960 | null |
2025-03-19 | USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network | Joseph Emmanuel DL Dayo et.al. | 2503.14950 | null |
2025-03-18 | VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms | Seungwon Lim et.al. | 2503.14427 | link |
2025-03-18 | Exploring Disparity-Accuracy Trade-offs in Face Recognition Systems: The Role of Datasets, Architectures, and Loss Functions | Siddharth D Jaiswal et.al. | 2503.14138 | null |
2025-03-17 | SED-MVS: Segmentation-Driven and Edge-Aligned Deformation Multi-View Stereo with Depth Restoration and Occlusion Constraint | Zhenlong Yuan et.al. | 2503.13721 | null |
2025-03-17 | Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios | Iryna Repinetska et.al. | 2503.13710 | null |
2025-03-17 | A Circular Construction Product Ontology for End-of-Life Decision-Making | Kwabena Adu-Duodu et.al. | 2503.13708 | null |
2025-03-17 | Subgroup Performance of a Commercial Digital Breast Tomosynthesis Model for Breast Cancer Detection | Beatrice Brown-Mulry et.al. | 2503.13581 | null |
2025-03-17 | Scale Efficient Training for Large Datasets | Qing Zhou et.al. | 2503.13385 | link |
2025-03-17 | Financial Adviser Misconduct and Labor Market Penalties: Uncovering Racial Disparities in the Absence of Gender Gaps | Jun Honda et.al. | 2503.12837 | null |
2025-03-17 | Stereo Event-based, 6-DOF Pose Tracking for Uncooperative Spacecraft | Zibin Liu et.al. | 2503.12732 | link |
2025-03-17 | GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching | Feng Qiao et.al. | 2503.12720 | link |
2025-03-16 | A novel association and ranking approach identifies factors affecting educational outcomes of STEM majors | Kira Adaricheva et.al. | 2503.12321 | link |
2025-03-15 | Robust Isolation Forest using Soft Sparse Random Projection and Valley Emphasis Method | Hun Kang et.al. | 2503.12125 | null |
2025-03-18 | 3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene Reconstruction | Peizhen Zheng et.al. | 2503.12001 | link |
2025-03-14 | Black Older Adults’ Perception of Using Voice Assistants to Enact a Medical Recovery Curriculum | Andrea Green et.al. | 2503.11894 | null |
2025-03-14 | Bridging the LLM Accessibility Divide? Performance, Fairness, and Cost of Closed versus Open LLMs for Automated Essay Scoring | Kezia Oketch et.al. | 2503.11827 | null |
2025-03-14 | Thermodynamics of the Hubbard Model on the Bethe Lattice | Jia-Lin Chen et.al. | 2503.11598 | link |
2025-03-14 | TikZero: Zero-Shot Text-Guided Graphics Program Synthesis | Jonas Belouadi et.al. | 2503.11509 | link |
2025-03-14 | An automated geometric space curve approach for designing dynamically corrected gates | Evangelos Piliouras et.al. | 2503.11492 | link |
2025-03-14 | ARCAS: Adaptive Runtime System for Chiplet-Aware Scheduling | Alessandro Fogli et.al. | 2503.11460 | null |
2025-03-14 | AQUA-SLAM: Tightly-Coupled Underwater Acoustic-Visual-Inertial SLAM with Sensor Calibration | Shida Xu et.al. | 2503.11420 | link |
2025-03-14 | Exploring Competitive and Collusive Behaviors in Algorithmic Pricing with Deep Reinforcement Learning | Shidi Deng et.al. | 2503.11270 | null |
2025-03-14 | NF-SLAM: Effective, Normalizing Flow-supported Neural Field representations for object-level visual SLAM in automotive applications | Li Cui et.al. | 2503.11199 | null |
2025-03-14 | SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets | Hao Liu et.al. | 2503.11133 | null |
2025-03-14 | TigerLLM – A Family of Bangla Large Language Models | Nishat Raihan et.al. | 2503.10995 | link |
2025-03-13 | Design and Development of the MeCO Open-Source Autonomous Underwater Vehicle | David Widhalm et.al. | 2503.10928 | null |
2025-03-13 | Controlling the dynamical phase diagram of a spinor BEC using time-dependent potentials | Q. Guan et.al. | 2503.10563 | null |
2025-03-13 | Subgroup Performance Analysis in Hidden Stratifications | Alceu Bissoto et.al. | 2503.10382 | null |
2025-03-13 | Identifying Trustworthiness Challenges in Deep Learning Models for Continental-Scale Water Quality Prediction | Xiaobo Xia et.al. | 2503.09947 | null |
2025-03-12 | Approximately Counting and Sampling Hamiltonian Motifs in Sublinear Time | Talya Eden et.al. | 2503.09810 | null |
2025-03-12 | How good are deep learning methods for automated road safety analysis using video data? An experimental study | Qingwu Liu et.al. | 2503.09807 | null |
2025-03-12 | BiasConnect: Investigating Bias Interactions in Text-to-Image Models | Pushkar Shukla et.al. | 2503.09763 | null |
2025-03-12 | Resolving the Kagome Origin of the Strange Metallicity in Ni $_3$ In | Jean C. Souza et.al. | 2503.09704 | null |
2025-03-12 | Edge AI for Real-time Fetal Assessment in Rural Guatemala | Nasim Katebi et.al. | 2503.09659 | null |
2025-03-12 | IUP: Integrated and Programmable User Plane for Next-Generation Mobile Networks | Chieh-Chun Chen et.al. | 2503.09430 | null |
2025-03-12 | OpenVidVRD: Open-Vocabulary Video Visual Relation Detection via Prompt-Driven Semantic Space Alignment | Qi Liu et.al. | 2503.09416 | null |
2025-03-12 | GRU: Mitigating the Trade-off between Unlearning and Retention for Large Language Models | Yue Wang et.al. | 2503.09117 | null |
2025-03-12 | StratIncon Detector: Analyzing Strategy Inconsistencies Between Real-Time Strategy and Preferred Professional Strategy in MOBA Esports | Ruofei Ma et.al. | 2503.09060 | null |
2025-03-11 | BoundarEase: Fostering Constructive Community Engagement to Inform More Equitable Student Assignment Policies | Cassandra Overney et.al. | 2503.08543 | link |
2025-03-11 | Does excellence correspond to universal inequality level? Evidences from scholarly citations and Olympic medal data | Soumyajyoti Biswas et.al. | 2503.08480 | null |
2025-03-11 | SegDesicNet: Lightweight Semantic Segmentation in Remote Sensing with Geo-Coordinate Embeddings for Domain Adaptation | Sachin Verma et.al. | 2503.08290 | null |
2025-03-11 | CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning | Kaiqiang Xiong et.al. | 2503.08219 | null |
2025-03-10 | The Janus Face of Innovation: Global Disparities and Divergent Options | Nihat Mugurtay et.al. | 2503.07676 | null |
2025-03-10 | VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models | Jen-tse Huang et.al. | 2503.07575 | link |
2025-03-10 | OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation | Ding Zhong et.al. | 2503.07098 | null |
2025-03-10 | SDFA: Structure Aware Discriminative Feature Aggregation for Efficient Human Fall Detection in Video | Sania Zahan et.al. | 2503.07008 | null |
2025-03-10 | Kinetic model and numerical method for multispecies radiation hydrodynamic system with multiscale nonequilibrium transport | Mingyu Quan et.al. | 2503.06906 | null |
2025-03-09 | DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning | Chengxuan Qian et.al. | 2503.06456 | link |
2025-03-09 | Socioeconomic centers in cities worldwide | Shuai Pang et.al. | 2503.06445 | link |
2025-03-09 | Global physics-informed neural networks (GPINNs): from local point-wise constraint to global nodal association | Feng Chen et.al. | 2503.06403 | null |
2025-03-08 | Mitigating Blockchain extractable value (BEV) threats by Distributed Transaction Sequencing in Blockchains | Xiongfei Zhao et.al. | 2503.06279 | null |
2025-03-08 | Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations | Meng Wang et.al. | 2503.06222 | null |
2025-03-08 | Generation of Optimized Solidity Code for Machine Learning Models using LLMs | Nikumbh Sarthak Sham et.al. | 2503.06203 | null |
2025-03-07 | Stereo Any Video: Temporally Consistent Stereo Matching | Junpeng Jing et.al. | 2503.05549 | null |
2025-03-07 | Asteroid phase curves and phase coloring effect using the ATLAS survey data | Colazo Milagros et.al. | 2503.05412 | null |
2025-03-07 | Preparing Tetra-Digit Long-Range Entangled States via Unified Sequential Quantum Circuit | Yu-Tao Hu et.al. | 2503.05374 | null |
2025-03-07 | Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects | Justin Yu et.al. | 2503.05189 | null |
2025-03-07 | RocketEval: Efficient Automated LLM Evaluation via Grading Checklist | Tianjun Wei et.al. | 2503.05142 | link |
2025-03-06 | Addressing the Subsumption Thesis: A Formal Bridge between Microeconomics and Active Inference | Noe Kuhn et.al. | 2503.05048 | null |
2025-03-06 | MIDAS: Modeling Ground-Truth Distributions with Dark Knowledge for Domain Generalized Stereo Matching | Peng Xu et.al. | 2503.04376 | null |
2025-03-06 | Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English | Runtao Zhou et.al. | 2503.04099 | null |
2025-03-06 | Uncovering inequalities in new knowledge learning by large language models across different languages | Chenglong Wang et.al. | 2503.04064 | link |
2025-03-05 | Connecting the dots: Tracing the evolutionary pathway of Polar Ring Galaxies in the cases of NGC 3718, NGC 2685, and NGC 4262 | Krishna R. Akhil et.al. | 2503.03709 | null |
2025-03-05 | The Roles of Size, Packing, and Cohesion in the Emergence of Force Chains in Granular Packings | Ankit Shrivastava et.al. | 2503.03668 | null |
2025-03-05 | Improved FPT Approximation Algorithms for TSP | Jingyang Zhao et.al. | 2503.03642 | null |
2025-03-05 | Topo Goes Political: TDA-Based Controversy Detection in Imbalanced Reddit Political Data | Arvindh Arun et.al. | 2503.03500 | null |
2025-03-05 | BANet: Bilateral Aggregation Network for Mobile Stereo Matching | Gangwei Xu et.al. | 2503.03259 | link |
2025-03-05 | Transformer-Based Spatio-Temporal Association of Apple Fruitlets | Harry Freeman et.al. | 2503.03200 | null |
2025-03-04 | CADDI: An in-Class Activity Detection Dataset using IMU data from low-cost sensors | Luis Marquez-Carpintero et.al. | 2503.02853 | null |
2025-03-04 | Educational Assortative Mating and Household Income Inequality: Evidence from Brazil, Indonesia, Mexico, and South Africa | Ana Kujundzic et.al. | 2503.02713 | null |
2025-03-04 | XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification | Xiaoyu Zheng et.al. | 2503.02619 | null |
2025-03-04 | Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation | Dengke Zhang et.al. | 2503.02459 | link |
2025-03-04 | Tabby: Tabular Data Synthesis with Language Models | Sonia Cromp et.al. | 2503.02152 | null |
2025-03-03 | Building Machine Learning Challenges for Anomaly Detection in Science | Elizabeth G. Campolongo et.al. | 2503.02112 | null |
2025-03-03 | Understanding Urban-Rural Disparities in Mobility Inefficiency for Colombia, Mexico, and India | Nandini Iyer et.al. | 2503.01810 | link |
2025-03-03 | MUSt3R: Multi-view Network for Stereo 3D Reconstruction | Yohann Cabon et.al. | 2503.01661 | link |
2025-03-03 | Unmasking Implicit Bias: Evaluating Persona-Prompted LLM Responses in Power-Disparate Social Scenarios | Bryan Chen Zhengyu Tan et.al. | 2503.01532 | null |
2025-03-03 | RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation | Shu Pan et.al. | 2503.01434 | null |
2025-02-28 | Back to the Future Cyclopean Stereo: a human perception approach unifying deep and geometric constraints | Sherlon Almeida da Silva et.al. | 2502.21280 | null |
2025-02-28 | An LLM-based Delphi Study to Predict GenAI Evolution | Francesco Bertolotti et.al. | 2502.21092 | null |
2025-02-28 | Modelling the Spatially Varying Non-Linear Effects of Heat Exposure | Xinyi Chen et.al. | 2502.20745 | null |
2025-02-28 | Displaying Fear, Sadness, and Joy in Public: Schizophrenia Vloggers’ Video Narration of Emotion and Online Care-Seeking | Jiaying “Lizzy” Liu et.al. | 2502.20658 | null |
2025-02-28 | FedConv: A Learning-on-Model Paradigm for Heterogeneous Federated Clients | Leming Shen et.al. | 2502.20639 | link |
2025-02-27 | Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis | Jeffrey Yang Fan Chiang et.al. | 2502.20383 | null |
2025-02-27 | UniTok: A Unified Tokenizer for Visual Generation and Understanding | Chuofan Ma et.al. | 2502.20321 | link |
2025-02-27 | Educator Attention: How computational tools can systematically identify the distribution of a key resource for students | Qingyang Zhang et.al. | 2502.20135 | null |
2025-02-26 | Treatment Non-Adherence Bias in Clinical Machine Learning: A Real-World Study on Hypertension Medication | Zhongyuan Liang et.al. | 2502.19625 | null |
2025-02-26 | Do LLMs exhibit demographic parity in responses to queries about Human Rights? | Rafiya Javed et.al. | 2502.19463 | null |
2025-03-01 | GraphBridge: Towards Arbitrary Transfer Learning in GNNs | Li Ju et.al. | 2502.19252 | link |
2025-02-26 | Improving the quality of Web-mined Parallel Corpora of Low-Resource Languages using Debiasing Heuristics | Aloka Fernando et.al. | 2502.19074 | null |
2025-02-26 | The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training | Jinbo Wang et.al. | 2502.19002 | null |
2025-02-26 | Disparities in Magnetic Cloud Observations Between Two Spacecraft Having Small Radial and Angular Separations Near 1 AU | Anjali Agarwal et.al. | 2502.18919 | null |
2025-02-26 | M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance | Qingpei Guo et.al. | 2502.18778 | null |
2025-02-26 | Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance | Xueqing Peng et.al. | 2502.18772 | null |
2025-02-26 | Deep-Bench: Deep Learning Benchmark Dataset for Code Generation | Alireza Daghighfarsoodeh et.al. | 2502.18726 | null |
2025-02-25 | Expected Variational Inequalities | Brian Hu Zhang et.al. | 2502.18605 | null |
2025-02-25 | Exploring Gender Disparities in Automatic Speech Recognition Technology | Hend ElGhazaly et.al. | 2502.18434 | null |
2025-02-25 | A Kinetic Model of Solar Wind Acceleration Driven by Ambipolar Electric Potential and Velocity-Space Diffusion | Maximilien Péters de Bonhome et.al. | 2502.18132 | null |
2025-02-25 | PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching | Han Nie et.al. | 2502.18104 | link |
2025-02-25 | Assessing Large Language Models in Agentic Multilingual National Bias | Qianying Liu et.al. | 2502.17945 | null |
2025-02-25 | Escaping the Subprime Trap in Algorithmic Lending | Adam Bouyamourn et.al. | 2502.17816 | null |
2025-02-25 | Radial dependence of ion fluences in the 2023 July 17 SEP event from Parker Solar Probe to STEREO and ACE | G. D. Muro et.al. | 2502.17806 | null |
2025-02-25 | FinP: Fairness-in-Privacy in Federated Learning by Addressing Disparities in Privacy Risk | Tianyu Zhao et.al. | 2502.17748 | null |
2025-02-24 | Homophilic Effects on Economic Inequality: A Dynamic Network Agent-Based Model | Gustavo L. Kohlrausch et.al. | 2502.17705 | null |
2025-02-24 | $A$-Norm and $A$ -numerical Radius Inequalities for Sums Of Operators in semi-Hilbertian spaces | M. H. M. Rashid et.al. | 2502.17696 | null |
2025-02-24 | The DECADE cosmic shear project III: validation of analysis pipeline using spatially inhomogeneous data | D. Anbajagane et.al. | 2502.17676 | null |
2025-02-24 | Kandinsky Conformal Prediction: Beyond Class- and Covariate-Conditional Coverage | Konstantina Bairaktari et.al. | 2502.17264 | null |
2025-02-24 | Determinants of the Spousal Age Gap in India: Analysis of Indian Microdata | Praveen et.al. | 2502.17059 | null |
2025-02-24 | Achieving Fair PCA Using Joint Eigenvalue Decomposition | Vidhi Rathore et.al. | 2502.16933 | null |
2025-02-24 | PulseBat: A field-accessible dataset for second-life battery diagnostics from realistic histories using multidimensional rapid pulse test | Shengyu Tao et.al. | 2502.16848 | null |
2025-02-23 | Optical appearance of a boson star with soliton potential | Ke-Jian He et.al. | 2502.16623 | null |
2025-02-23 | Unmasking Societal Biases in Respiratory Support for ICU Patients through Social Determinants of Health | Mira Moukheiber et.al. | 2502.16477 | link |
2025-02-23 | Make Literature-Based Discovery Great Again through Reproducible Pipelines | Bojan Cestnik et.al. | 2502.16450 | link |
2025-02-23 | Facilitating Emergency Vehicle Passage in Congested Urban Areas Using Multi-agent Deep Reinforcement Learning | Haoran Su et.al. | 2502.16449 | null |
2025-02-22 | Semantic Gaussian Mixture Variational Autoencoder for Sequential Recommendation | Beibei Li et.al. | 2502.16140 | link |
2025-02-22 | A Trust-Aware and Cost-Optimized Blockchain Oracle Selection Model with Deep Reinforcement Learning | Hengyang Zhang et.al. | 2502.16133 | link |
2025-02-21 | MoMa: A Modular Deep Learning Framework for Material Property Prediction | Botian Wang et.al. | 2502.15483 | null |
2025-02-21 | UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction | Chenyu Li et.al. | 2502.15199 | null |
2025-02-21 | Graph-Based Deep Learning on Stereo EEG for Predicting Seizure Freedom in Epilepsy Patients | Artur Agaronyan et.al. | 2502.15198 | null |
2025-02-21 | TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba | Xiuwei Chen et.al. | 2502.15130 | null |
2025-02-20 | Electron Beam Propagation and Radio-Wave Scattering in the Inner Heliosphere using Five Spacecraft | Luis Alberto Cañizares et.al. | 2502.15067 | null |
2025-02-20 | Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion | Jiangyuan Liu et.al. | 2502.14616 | link |
2025-02-20 | OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images | Zhichao Zheng et.al. | 2502.14279 | null |
2025-02-20 | Asymmetric Co-Training for Source-Free Few-Shot Domain Adaptation | Gengxu Li et.al. | 2502.14214 | link |
2025-02-20 | Stereo Image Coding for Machines with Joint Visual Feature Compression | Dengchao Jin et.al. | 2502.14190 | null |
2025-02-19 | The NavINST Dataset for Multi-Sensor Autonomous Navigation | Paulo Ricardo Marques de Araujo et.al. | 2502.13863 | null |
2025-02-19 | CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement | Zheng Wu et.al. | 2502.13624 | null |
2025-02-18 | Two Tickets are Better than One: Fair and Accurate Hiring Under Strategic LLM Manipulations | Lee Cohen et.al. | 2502.13221 | null |
2025-02-18 | Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks | Markus J. Buehler et.al. | 2502.13025 | link |
2025-02-18 | Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version) | Tianyi Zhang et.al. | 2502.13017 | null |
2025-02-18 | High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion | Xiang Zhang et.al. | 2502.12752 | null |
2025-02-18 | Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection | Zijian Cao et.al. | 2502.12735 | null |
2025-02-18 | Simulated Bifurcation with High-dimensional Expansion for Traffic Signal Optimization on Real-world Networks | Shengda Zhao et.al. | 2502.12440 | null |
2025-02-17 | The impact of job stability on monetary poverty in Italy: causal small area estimation | Katarzyna Reluga et.al. | 2502.12376 | null |
2025-02-17 | Healthcare cost prediction for heterogeneous patient profiles using deep learning models with administrative claims data | Mohammad Amin Morid et.al. | 2502.12277 | null |
2025-02-17 | A versatile experimental method to measure the traction forces at interfaces | Yingwei Hou et.al. | 2502.12044 | null |
2025-02-17 | pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM | Luigi Freda et.al. | 2502.11955 | link |
2025-02-17 | BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages | Shamsuddeen Hassan Muhammad et.al. | 2502.11926 | link |
2025-02-17 | Weak solutions and sharp interface limit of the anisotropic Cahn-Hilliard equation with disparate mobility and inhomogeneous potential | Charles Elbar et.al. | 2502.11849 | null |
2025-02-17 | Text Classification in the LLM Era - Where do we stand? | Sowmya Vajjala et.al. | 2502.11830 | null |
2025-02-17 | Deep Neural Networks for Accurate Depth Estimation with Latent Space Features | Siddiqui Muhammad Yasir et.al. | 2502.11777 | null |
2025-02-17 | SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking | Zijian Wu et.al. | 2502.11534 | null |
2025-02-16 | Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation | Kunal Swami et.al. | 2502.11002 | null |
2025-02-15 | Do Deepfake Detectors Work in Reality? | Simiao Ren et.al. | 2502.10920 | null |
2025-02-15 | Mobile Robotic Multi-View Photometric Stereo | Suryansh Kumar et.al. | 2502.10842 | null |
2025-02-14 | Enhancing Multilingual LLM Pretraining with Model-Based Data Selection | Bettina Messmer et.al. | 2502.10361 | null |
2025-02-14 | Merging public elementary schools to reduce racial/ethnic segregation | Madison Landry et.al. | 2502.10193 | link |
2025-02-14 | Evaluating and Improving Graph-based Explanation Methods for Multi-Agent Coordination | Siva Kailas et.al. | 2502.09889 | null |
2025-02-13 | Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages | Shreyan Biswas et.al. | 2502.09532 | null |
2025-02-13 | SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest | Jack Erhardt et.al. | 2502.09528 | null |
2025-02-13 | Diffusion Models Through a Global Lens: Are They Culturally Inclusive? | Zahra Bayramli et.al. | 2502.08914 | null |
2025-02-13 | Uncovering Disparities in Rideshare Drivers Earning and Work Patterns: A Case Study of Chicago | Hy Dang et.al. | 2502.08893 | null |
2025-02-12 | Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors | Vishwanath Pratap Singh et.al. | 2502.08587 | null |
2025-02-12 | An entropy based comparative study of regional and seasonal distributions of particulate matter in Indian cities | Suchismita Banerjee et.al. | 2502.08491 | null |
2025-02-12 | Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision | Tianle Liu et.al. | 2502.08352 | null |
2025-02-12 | Emergent dimer-model topological order and quasi-particle excitations in liquid crystals: combinatorial vortex lattices | Cuiling Meng et.al. | 2502.08314 | null |
2025-02-12 | Unlocking Scaling Law in Industrial Recommendation Systems with a Three-step Paradigm based Large User Model | Bencheng Yan et.al. | 2502.08309 | null |
2025-02-12 | From Individual Experience to Collective Evidence: A Reporting-Based Framework for Identifying Systemic Harms | Jessica Dai et.al. | 2502.08166 | link |
2025-02-11 | Federated Self-supervised Domain Generalization for Label-efficient Polyp Segmentation | Xinyi Tan et.al. | 2502.07951 | null |
2025-02-11 | Small Area Estimation of Education Levels in Low- and Middle-Income Countries | Yunhan Wu et.al. | 2502.07946 | link |
2025-02-11 | PFedDST: Personalized Federated Learning with Decentralized Selection Training | Mengchen Fan et.al. | 2502.07750 | null |
2025-02-11 | A Nonparametric and Functional Wombling Methodology | Luke A. Barratt et.al. | 2502.07740 | null |
2025-02-11 | HGTUL: A Hypergraph-based Model For Trajectory User Linking | Fengjie Chang et.al. | 2502.07549 | null |
2025-02-11 | MoENAS: Mixture-of-Expert based Neural Architecture Search for jointly Accurate, Fair, and Robust Edge Deep Neural Networks | Lotfi Abdelkrim Mecharbat et.al. | 2502.07422 | null |
2025-02-11 | BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models | Xu Huang et.al. | 2502.07346 | link |
2025-02-11 | Music for All: Exploring Multicultural Representations in Music Generation Models (Camera Ready) | Atharva Mehta et.al. | 2502.07328 | link |
2025-02-11 | Does Training on Synthetic Data Make Models Less Robust? | Lingze Zhang et.al. | 2502.07164 | null |
2025-02-11 | Feature Importance Depends on Properties of the Data: Towards Choosing the Correct Explanations for Your Data and Decision Trees based Models | Célia Wafa Ayad et.al. | 2502.07153 | null |
2025-02-10 | Using Contextually Aligned Online Reviews to Measure LLMs’ Performance Disparities Across Language Varieties | Zixin Tang et.al. | 2502.07058 | null |
2025-02-10 | A Compiler for Operations on Relations with Bag Semantics | James Dong et.al. | 2502.06988 | null |
2025-02-10 | Beyond Literal Token Overlap: Token Alignability for Multilinguality | Katharina Hämmerl et.al. | 2502.06468 | null |
2025-02-10 | On the reason for the widespread energetic storm particle event of 13 March 2023 | N. Dresing et.al. | 2502.06332 | null |
2025-02-10 | The digital labour of artificial intelligence in Latin America: a comparison of Argentina, Brazil, and Venezuela | Paola Tubaro et.al. | 2502.06317 | null |
2025-02-08 | Knowledge is Power: Harnessing Large Language Models for Enhanced Cognitive Diagnosis | Zhiang Dong et.al. | 2502.05556 | null |
2025-02-07 | Point-Identifying Semiparametric Sample Selection Models with No Excluded Variable | Dongwoo Kim et.al. | 2502.05353 | null |
2025-02-07 | Differentiable Mobile Display Photometric Stereo | Gawoon Ban et.al. | 2502.05055 | null |
2025-02-07 | Unified Approaches in Self-Supervised Event Stream Modeling: Progress and Prospects | Levente Zólyomi et.al. | 2502.04899 | null |
2025-02-07 | Practical implementation of a chiral phononic crystal demonstrator with ultra-low frequency bandgap | Line Mardini et.al. | 2502.04775 | null |
2025-02-06 | Targeted Learning for Data Fairness | Alexander Asemota et.al. | 2502.04309 | null |
2025-02-06 | Online Learning of Counter Categories and Ratings in PvP Games | Chiu-Chou Lin et.al. | 2502.03998 | null |
2025-02-06 | Fairness Aware Reinforcement Learning via Proximal Policy Optimization | Gabriele La Malfa et.al. | 2502.03953 | null |
2025-02-05 | Large Teams Overshadow Individual Recognition | Lulin Yang et.al. | 2502.03623 | null |
2025-02-04 | How Inclusively do LMs Perceive Social and Moral Norms? | Michael Galarnyk et.al. | 2502.02696 | link |
2025-02-04 | Fairness in Survival Analysis: A Novel Conditional Mutual Information Augmentation Approach | Tianyang Xie et.al. | 2502.02567 | null |
2025-02-04 | Review of Demographic Bias in Face Recognition | Ketan Kotwal et.al. | 2502.02309 | null |
2025-02-04 | Ilargi: a GPU Compatible Factorized ML Model Training Framework | Wenbo Sun et.al. | 2502.01985 | null |
2025-02-03 | CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition | Martijn Bartelds et.al. | 2502.01777 | null |
2025-02-03 | Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool | Floris Holstege et.al. | 2502.01713 | null |
2025-02-03 | Comprehensive Modeling Approaches for Forecasting Bitcoin Transaction Fees: A Comparative Study | Jiangqin Ma et.al. | 2502.01029 | null |
2025-02-02 | Fruit Fly Classification (Diptera: Tephritidae) in Images, Applying Transfer Learning | Erick Andrew Bustamante Flores et.al. | 2502.00939 | null |
2025-02-02 | Psychometric-Based Evaluation for Theorem Proving with Large Language Models | Jianyu Zhang et.al. | 2502.00855 | null |
2025-02-01 | DeepUKF-VIN: Adaptively-tuned Deep Unscented Kalman Filter for 3D Visual-Inertial Navigation based on IMU-Vision-Net | Khashayar Ghanizadegan et.al. | 2502.00575 | null |
2025-02-01 | Evaluation of End-to-End Continuous Spanish Lipreading in Different Data Conditions | David Gimeno-Gómez et.al. | 2502.00464 | link |
2025-01-31 | Beyond checkmate: exploring the creative chokepoints in AI text | Nafis Irtiza Tripto et.al. | 2501.19301 | link |
2025-02-03 | DyPCL: Dynamic Phoneme-level Contrastive Learning for Dysarthric Speech Recognition | Wonjun Lee et.al. | 2501.19010 | null |
2025-01-31 | Examining the Impact of Income Inequality and Gender on School Completion in Malaysia: A Machine Learning Approach Utilizing Malaysia’s Public Sector Open Data | Muhammad Sukri Bin Ramli et.al. | 2501.18868 | null |
2025-01-31 | Systematic Uncertainties in the Measurement of Neutron lifetime Using Lunar Prospector Neutron Spectrometer | Akshatha Vydula et.al. | 2501.18831 | null |
2025-01-30 | Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion | Vitor Guizilini et.al. | 2501.18804 | null |
2025-01-30 | CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering | Yumeng Wang et.al. | 2501.18457 | null |
2025-01-30 | Surface Defect Identification using Bayesian Filtering on a 3D Mesh | Matteo Dalle Vedove et.al. | 2501.18315 | null |
2025-01-29 | From tools to thieves: Measuring and understanding public perceptions of AI through crowdsourced metaphors | Myra Cheng et.al. | 2501.18045 | null |
2025-01-29 | STGCN-LSTM for Olympic Medal Prediction: Dynamic Power Modeling and Causal Policy Optimization | Yiquan Wang et.al. | 2501.17711 | null |
2025-01-29 | Cross-Language Approach for Quranic QA | Islam Oshallah et.al. | 2501.17449 | null |
2025-01-29 | Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models | Yuxuan Li et.al. | 2501.17420 | null |
2025-01-28 | Stiff Transfer Learning for Physics-Informed Neural Networks | Emilien Seiler et.al. | 2501.17281 | null |
2025-01-28 | Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving | Evgenii Evstafev et.al. | 2501.17084 | null |
2025-01-28 | Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning | Xi Chen et.al. | 2501.16966 | null |
2025-01-28 | Hybrid Phenology Modeling for Predicting Temperature Effects on Tree Dormancy | Ron van Bree et.al. | 2501.16848 | link |
2025-01-28 | Strawberry Robotic Operation Interface: An Open-Source Device for Collecting Dexterous Manipulation Data in Robotic Strawberry Farming | Linsheng Hou et.al. | 2501.16717 | null |
2025-01-27 | BiFold: Bimanual Cloth Folding with Language Guidance | Oriol Barbany et.al. | 2501.16458 | null |
2025-01-27 | Will nanodust reappear in STEREO/WAVES data? | Nicole Meyer-Vernet et.al. | 2501.16133 | null |
2025-01-27 | SampleLLM: Optimizing Tabular Data Synthesis in Recommendations | Jingtong Gao et.al. | 2501.16125 | null |
2025-01-27 | Vienna Mosaic: Navigating Social Borders in a Melting Pot | Marc Sadurní et.al. | 2501.15920 | link |
2025-01-26 | Fuzzy-aware Loss for Source-free Domain Adaptation in Visual Emotion Recognition | Ying Zheng et.al. | 2501.15519 | null |
2025-01-26 | Dfilled: Repurposing Edge-Enhancing Diffusion for Guided DSM Void Filling | Daniel Panangian et.al. | 2501.15440 | null |
2025-01-26 | Evaluating Simple Debiasing Techniques in RoBERTa-based Hate Speech Detection Models | Diana Iftimie et.al. | 2501.15430 | null |
2025-01-26 | A General Approach to Relaxing Unconfoundedness | Matthew A. Masten et.al. | 2501.15400 | null |
2025-01-25 | Fairness in LLM-Generated Surveys | Andrés Abeliuk et.al. | 2501.15351 | null |
2025-01-25 | Fairness-aware Contextual Dynamic Pricing with Strategic Buyers | Pangpang Liu et.al. | 2501.15338 | null |
2025-01-25 | The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders? | Ayo Adedeji et.al. | 2501.15310 | null |
2025-01-24 | Fairness of Deep Ensembles: On the interplay between per-group task difficulty and under-representation | Estanislao Claucich et.al. | 2501.14551 | null |
2025-01-24 | SoK: What Makes Private Learning Unfair? | Kai Yao et.al. | 2501.14414 | null |
2025-01-22 | Synthetic CT image generation from CBCT: A Systematic Review | Alzahra Altalib et.al. | 2501.13972 | null |
2025-01-23 | Analysis of Indic Language Capabilities in LLMs | Aatman Vaidya et.al. | 2501.13912 | null |
2025-01-23 | You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain | Timothy Chase Jr et.al. | 2501.13725 | null |
2025-01-23 | Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers | Akshit Achara et.al. | 2501.13302 | link |
2025-01-22 | Flying shape and aerodynamics of a full-scale flexible Olympic windsurf sail | J. Zhang et.al. | 2501.13254 | null |
2025-01-22 | On the development of open geographical data infrastructures in Latin America: progress and challenges | Daniela Ballari et.al. | 2501.13235 | null |
2025-01-22 | Enhancing Multi-Attribute Fairness in Healthcare Predictive Modeling | Xiaoyang Wang et.al. | 2501.13219 | null |
2025-01-22 | Machine Learning Modeling for Multi-order Human Visual Motion Processing | Zitang Sun et.al. | 2501.12810 | link |
2025-01-22 | Exploring Wikipedia Gender Diversity Over Time $\unicode{x2013}$ The Wikipedia Gender Dashboard (WGD) | Yahya Yunus et.al. | 2501.12610 | null |
2025-01-23 | Academic Case Reports Lack Diversity: Assessing the Presence and Diversity of Sociodemographic and Behavioral Factors related to Post COVID-19 Condition | Juan Andres Medina Florez et.al. | 2501.12538 | null |
2025-01-21 | Decoherence of Schrödinger cat states in light of wave/particle duality | Th. K. Mavrogordatos et.al. | 2501.12328 | null |
2025-01-21 | Improving robot understanding using conversational AI: demonstration and feasibility study | Shikhar Kumar et.al. | 2501.12214 | null |
2025-01-21 | Towards autonomous photogrammetric forest inventory using a lightweight under-canopy robotic drone | Väinö Karjalainen et.al. | 2501.12073 | null |
2025-01-21 | Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging | Shuyi Hu et.al. | 2501.11884 | null |
2025-01-21 | FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients | Jiaqi Leng et.al. | 2501.11876 | link |
2025-01-20 | Are generative models fair? A study of racial bias in dermatological image generation | Miguel López-Pérez et.al. | 2501.11752 | null |
2025-01-20 | Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy | Saeid Asgari Taghanaki et.al. | 2501.11721 | link |
2025-01-20 | Multi-View Spectral Clustering for Graphs with Multiple View Structures | Yorgos Tsitsikas et.al. | 2501.11422 | link |
2025-01-20 | UniTrans: A Unified Vertical Federated Knowledge Transfer Framework for Enhancing Cross-Hospital Collaboration | Chung-ju Huang et.al. | 2501.11388 | link |
2025-01-20 | Mitigating Spatial Disparity in Urban Prediction Using Residual-Aware Spatiotemporal Graph Neural Networks: A Chicago Case Study | Dingyi Zhuang et.al. | 2501.11214 | null |
2025-01-17 | DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration | Huiyun Cao et.al. | 2501.10325 | null |
2025-01-17 | Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt | Qingcheng Zeng et.al. | 2501.09950 | null |
2025-01-17 | FoundationStereo: Zero-Shot Stereo Matching | Bowen Wen et.al. | 2501.09898 | link |
2025-01-16 | Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment | Maksim Filipenko et.al. | 2501.09490 | null |
2025-01-16 | DEFOM-Stereo: Depth Foundation Model Based Stereo Matching | Hualie Jiang et.al. | 2501.09466 | link |
2025-01-15 | TeV afterglow emission from a multi-component GRB jet using the kinetic approach | John P. Hope et.al. | 2501.09093 | null |
2025-01-15 | How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias | Tosin Fadahunsi et.al. | 2501.09014 | link |
2025-01-15 | StereoGen: High-quality Stereo Image Generation from a Single Image | Xianqi Wang et.al. | 2501.08654 | null |
2025-01-15 | MonSter: Marry Monodepth to Stereo Unleashes Power | Junda Cheng et.al. | 2501.08643 | link |
2025-01-15 | Image-to-Force Estimation for Soft Tissue Interaction in Robotic-Assisted Surgery Using Structured Light | Jiayin Wang et.al. | 2501.08593 | null |
2025-01-15 | Addressing Intersectionality, Explainability, and Ethics in AI-Driven Diagnostics: A Rebuttal and Call for Transdiciplinary Action | Myles Joshua Toledo Tan et.al. | 2501.08497 | null |
2025-01-16 | Navigating Gender Disparities in Communication Research Leadership: Academic Recognition, Career Development, and Compensation | Diego F. M. Oliveira et.al. | 2501.08401 | null |
2025-01-14 | TriMod Fusion for Multimodal Named Entity Recognition in Social Media | Mosab Alfaqeeh et.al. | 2501.08267 | null |
2025-01-13 | An Investigation of Experiences Engaging the Margins in Data-Centric Innovation | Gabriella Thompson et.al. | 2501.07690 | null |
2025-01-13 | Digital Twin for Smart Societies: A Catalyst for Inclusive and Accessible Healthcare | Joshit Mohanty et.al. | 2501.07570 | null |
2025-01-13 | TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models | Thales Sales Almeida et.al. | 2501.07482 | link |
2025-01-13 | PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations | Ting-Yu Dai et.al. | 2501.07447 | null |
2025-01-13 | Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion | Li Liang et.al. | 2501.07260 | link |
2025-01-13 | Depth and Image Fusion for Road Obstacle Detection Using Stereo Camera | Oleg Perezyabov et.al. | 2501.07245 | null |
2025-01-13 | Combined effect of incentives and coupling in multigames in two-layer networks | Luo-Luo Jiang et.al. | 2501.07193 | null |
2025-01-13 | Reducing Latency by Eliminating CSIT Feedback: FDD Downlink MIMO Precoding Without CSIT Feedback for Internet-of-Things Communications | Juntaek Han et.al. | 2501.07094 | null |
2025-01-12 | CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications | Xinyi Zheng et.al. | 2501.06927 | link |
2025-01-12 | Integrators at War: Mediating in AI-assisted Resort-to-Force Decisions | Dennis Müller et.al. | 2501.06861 | null |
2025-01-12 | Enabling Cardiac Monitoring using In-ear Ballistocardiogram on COTS Wireless Earbuds | Yongjian Fu et.al. | 2501.06744 | null |
2025-01-10 | A monthly sub-national Harmonized Food Insecurity Dataset for comprehensive analysis and predictive modeling | Machefer Mélissande et.al. | 2501.06076 | null |
2025-01-10 | “Cause” is Mechanistic Narrative within Scientific Domains: An Ordinary Language Philosophical Critique of “Causal Machine Learning” | Vyacheslav Kungurtsev et.al. | 2501.05844 | null |
2025-01-10 | An Efficient Dual ADMM for Huber Regression with Fused Lasso Penalty | Mengjiao Shi et.al. | 2501.05676 | null |
2025-01-10 | The Impact of Model Scaling on Seen and Unseen Language Performance | Rhitabrat Pokharel et.al. | 2501.05629 | null |
2025-01-09 | Datasheets for Healthcare AI: A Framework for Transparency and Bias Mitigation | Marjia Siddik et.al. | 2501.05617 | null |
2025-01-09 | Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping | Wen Tianci et.al. | 2501.05242 | null |
2025-01-09 | An Algorithmic Approach for Causal Health Equity: A Look at Race Differentials in Intensive Care Unit (ICU) Outcomes | Drago Plecko et.al. | 2501.05197 | null |
2025-01-09 | A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision | Ali Rohan et.al. | 2501.05147 | null |
2025-01-08 | Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations | Kirandeep Kaur et.al. | 2501.04762 | null |
2025-01-09 | Do Automated Fixes Truly Mitigate Smart Contract Exploits? | Sofia Bobadilla et.al. | 2501.04600 | link |
2025-01-08 | Towards Fair Class-wise Robustness: Class Optimal Distribution Adversarial Training | Hongxin Zhi et.al. | 2501.04527 | null |
2025-01-08 | Neighborhood Disparities in Smart City Service Adoption | Shahaf Donio et.al. | 2501.04363 | null |
2025-01-07 | MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives | Wisdom O. Ikezogwo et.al. | 2501.04184 | null |
2025-01-07 | Unifying restart accelerated gradient and proximal bundle methods | Jiaming Liang et.al. | 2501.04165 | null |
2025-01-07 | Spanish heat waves curb discretionary mobility and alter work behavior | Andrew Renninger et.al. | 2501.03978 | null |
2025-01-07 | Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware | Hegel Pedroza et.al. | 2501.03720 | null |
2025-01-06 | Solar Cycle Variation of Axial Orientations and Favorable Locations of Eruptive MFRs | Hong Xie et.al. | 2501.03346 | null |
2025-01-06 | CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation | Yuanhong Chen et.al. | 2501.02786 | null |
2025-01-05 | Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera | Yuliang Guo et.al. | 2501.02464 | link |
2025-01-05 | Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap | Hyunwoo Ko et.al. | 2501.02448 | null |
2025-01-05 | Unsupervised Search for Ethnic Minorities’ Medical Segmentation Training Set | Yixiao Chen et.al. | 2501.02442 | link |
2025-01-04 | The Integration of Blockchain and Artificial Intelligence for Secure Healthcare Systems | Umar Safdar et.al. | 2501.02169 | null |
2025-01-03 | How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data | Ishaan Maitra et.al. | 2501.02111 | link |
2025-01-03 | VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment | Wenyan Cong et.al. | 2501.01949 | link |
2025-01-03 | Exploring Equality: An Investigation into Custom Loss Functions for Fairness Definitions | Gordon Lee et.al. | 2501.01889 | null |
2025-01-03 | CycleFlow: Leveraging Cycle Consistency in Flow Matching for Speaker Style Adaptation | Ziqi Liang et.al. | 2501.01861 | null |
2025-01-03 | MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling | Simon Rouard et.al. | 2501.01757 | null |
2025-01-03 | The Essence of Contextual Understanding in Theory of Mind: A Study on Question Answering with Story Characters | Chulun Zhou et.al. | 2501.01705 | null |
2025-01-03 | CrossView-GS: Cross-view Gaussian Splatting For Large-scale Scene Reconstruction | Chenhao Zhang et.al. | 2501.01695 | null |
2025-01-03 | Equity Impacts of Public Transit Network Redesign with Shared Autonomous Mobility Services | Max T. M. Ng et.al. | 2501.01615 | null |
2025-01-02 | CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries | Shudong Liu et.al. | 2501.01282 | null |
2025-01-02 | TS-SatMVSNet: Slope Aware Height Estimation for Large-Scale Earth Terrain Multi-view Stereo | Song Zhang et.al. | 2501.01049 | null |
2025-01-02 | Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer | Ziyang Chen et.al. | 2501.01023 | link |
2025-01-01 | High-Probability Polynomial-Time Complexity of Restarted PDHG for Linear Programming | Zikai Xiong et.al. | 2501.00728 | null |
2024-12-31 | H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters | Pedram Fekri et.al. | 2501.00514 | null |
2024-12-31 | Who Gets Recommended? Investigating Gender, Race, and Country Disparities in Paper Recommendations from Large Language Models | Yifan Tian et.al. | 2501.00367 | null |
2024-12-31 | SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation | Shi-Feng Peng et.al. | 2501.00303 | link |
2024-12-30 | A Data-Centric Approach to Detecting and Mitigating Demographic Bias in Pediatric Mental Health Text: A Case Study in Anxiety Detection | Julia Ive et.al. | 2501.00129 | null |
2024-12-30 | What Makes for a Good Stereoscopic Image? | Netanel Y. Tamir et.al. | 2412.21127 | null |
2024-12-30 | Closing Speed Computation using Stereo Camera and Applications in Unsignalized T-Intersection | Gautam Kumar et.al. | 2412.20717 | null |
2024-12-30 | MarsSQE: Stereo Quality Enhancement for Martian Images Using Bi-level Cross-view Attention | Mai Xu et.al. | 2412.20685 | null |
2024-12-29 | Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control | Bingliang Li et.al. | 2412.20378 | null |
2024-12-29 | Impact of Data Distribution on Fairness Guarantees in Equitable Deep Learning | Yan Luo et.al. | 2412.20377 | link |
2024-12-29 | FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation | Yan Luo et.al. | 2412.20374 | link |
2024-12-29 | Dual-Level Precision Edges Guided Multi-View Stereo with Accurate Planarization | Kehua Chen et.al. | 2412.20328 | link |
2024-12-28 | The impact of China’s economic growth on poverty alleviation: From absolute to relative poverty | Yixun Kang et.al. | 2412.20176 | null |
2024-12-28 | Neutron star stability beyond the mass peak: assessing the role of out-of-equilibrium perturbations | Martin O. Canullan-Pascual et.al. | 2412.20133 | null |
2024-12-28 | Incentivizing supplemental math assignments and using AI-generated hints improve exam performance, especially for racially minoritized students | Yifan Lu et.al. | 2412.19961 | null |
2024-12-27 | Analysis of Premature Death Rates in Texas Counties: The Impact of Air Quality, Socioeconomic Factors, and COPD Prevalence | Richard Rich et.al. | 2412.19774 | null |
2024-12-27 | Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis | Jiaqi Wang et.al. | 2412.19654 | link |
2024-12-27 | Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference | Keke Zhang et.al. | 2412.19553 | null |
2024-12-27 | Is Your Text-to-Image Model Robust to Caption Noise? | Weichen Yu et.al. | 2412.19531 | null |
2024-12-27 | Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images | Xudong Cai et.al. | 2412.19518 | null |
2024-12-27 | Disparate Model Performance and Stability in Machine Learning Clinical Support for Diabetes and Heart Diseases | Ioannis Bilionis et.al. | 2412.19495 | null |
2024-12-27 | Effects of Reynolds number and spatial resolution on the pressure source terms in turbulent boundary layers | Aditya Agarwal et.al. | 2412.19474 | null |
2024-12-26 | MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo | Byeonggwon Lee et.al. | 2412.19130 | null |
2024-12-25 | Evaluating authorship disambiguation quality through anomaly analysis on researchers’ career transition | Huaxia Zhou et.al. | 2412.18757 | null |
2024-12-24 | Uncertainty Quantification in Stereo Matching | Wenxiao Cai et.al. | 2412.18703 | link |
2024-12-24 | Topological phases protected by projective PT symmetry in alkaline-earth-like atoms | Xiaofan Zhou et.al. | 2412.18494 | null |
2024-12-24 | scReader: Prompting Large Language Models to Interpret scRNA-seq Data | Cong Li et.al. | 2412.18156 | null |
2024-12-24 | Fundamental Limits in the Search for Less Discriminatory Algorithms – and How to Avoid Them | Benjamin Laufer et.al. | 2412.18138 | null |
2024-12-23 | Shifted Composition III: Local Error Framework for KL Divergence | Jason M. Altschuler et.al. | 2412.17997 | null |
2024-12-23 | A Multimodal Fusion Framework for Bridge Defect Detection with Cross-Verification | Ravi Datta Rachuri et.al. | 2412.17968 | null |
2024-12-23 | Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective | Xinmiao Yu et.al. | 2412.17787 | null |
2024-12-23 | Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings | Jérémie Sublime et.al. | 2412.17486 | null |
2024-12-24 | Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement | Hyeonjin Kim et.al. | 2412.17387 | link |
2024-12-22 | Fairness in Reinforcement Learning with Bisimulation Metrics | Sahand Rezaei-Shoshtari et.al. | 2412.17123 | null |
2024-12-22 | Differentially Private Random Block Coordinate Descent | Artavazd Maranjyan et.al. | 2412.17054 | null |
2024-12-22 | Lightweight Design and Optimization methods for DCNNs: Progress and Futures | Hanhua Long et.al. | 2412.16886 | null |
2024-12-21 | Does calibration mean what they say it means; or, the reference class problem rises again | Lily Hu et.al. | 2412.16769 | null |
2024-12-21 | ViM-Disparity: Bridging the Gap of Speed, Accuracy and Memory for Disparity Map Generation | Maheswar Bora et.al. | 2412.16745 | link |
2024-12-21 | LUCES-MV: A Multi-View Dataset for Near-Field Point Light Source Photometric Stereo | Fotios Logothetis et.al. | 2412.16737 | null |
2024-12-21 | A Unifying Family of Data-Adaptive Partitioning Algorithms | Guy B. Oldaker IV et.al. | 2412.16713 | null |
2024-12-20 | Climate Impact Assessment Requires Weighting: Introducing the Weighted Climate Dataset | Marco Gortan et.al. | 2412.15699 | null |
2024-12-20 | Gender Disparities in Contributions, Leadership, and Collaboration: An Exploratory Study on Software Systems Research | Shamse Tasnim Cynthia et.al. | 2412.15661 | null |
2024-12-20 | Radio filaments as Z-pinched Galactic center wind | Fan Zhang et.al. | 2412.15575 | null |
2024-12-20 | SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation | Ke Yan et.al. | 2412.15526 | link |
2024-12-19 | Uncertainty-Guided Cross Attention Ensemble Mean Teacher for Semi-supervised Medical Image Segmentation | Meghana Karri et.al. | 2412.15380 | null |
2024-12-19 | Tiled Diffusion | Or Madar et.al. | 2412.15185 | null |
2024-12-19 | Improving Geometry in Sparse-View 3DGS via Reprojection-based DoF Separation | Yongsung Kim et.al. | 2412.14568 | null |
2024-12-19 | Provincial allocation of China’s commercial building operational carbon towards carbon neutrality | Yanqiao Deng et.al. | 2412.14523 | null |
2024-12-19 | Who is Helping Whom? Student Concerns about AI- Teacher Collaboration in Higher Education Classrooms | Bingyi Han et.al. | 2412.14469 | null |
2024-12-19 | An Immersive Multi-Elevation Multi-Seasonal Dataset for 3D Reconstruction and Visualization | Xijun Liu et.al. | 2412.14418 | null |
2024-12-18 | I0T: Embedding Standardization Method Towards Zero Modality Gap | Na Min An et.al. | 2412.14384 | link |
2024-12-18 | Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs | David Restrepo et.al. | 2412.14304 | null |
2024-12-18 | What Has Been Overlooked in Contrastive Source-Free Domain Adaptation: Leveraging Source-Informed Latent Augmentation within Neighborhood Context | Jing Wang et.al. | 2412.14301 | link |
2024-12-18 | On Calibration in Multi-Distribution Learning | Rajeev Verma et.al. | 2412.14142 | null |
2024-12-18 | LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research | Tianyang Gu et.al. | 2412.14141 | null |
2024-12-18 | Performance Gap in Entity Knowledge Extraction Across Modalities in Vision Language Models | Ido Cohen et.al. | 2412.14133 | link |
2024-12-18 | Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation | Rémi Marsal et.al. | 2412.14103 | null |
2024-12-18 | Neural Combinatorial Optimization for Stochastic Flexible Job Shop Scheduling Problems | Igor G. Smit et.al. | 2412.14052 | link |
2024-12-18 | What If: Causal Analysis with Graph Databases | Amedeo Pachera et.al. | 2412.13965 | null |
2024-12-18 | MobiFuse: A High-Precision On-device Depth Perception System with Multi-Data Fusion | Jinrui Zhang et.al. | 2412.13848 | null |
2024-12-18 | A2H: A UI Converter from Android to HarmonyOS Platform | Chen Wang et.al. | 2412.13693 | link |
2024-12-18 | Soft Modes as a Predictive Framework for Low Dimensional Biological Systems across Scales | Christopher Joel Russo et.al. | 2412.13637 | null |
2024-12-18 | SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation | Kazuki Shimada et.al. | 2412.13462 | null |
2024-12-17 | C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System | Parker Addison et.al. | 2412.13163 | null |
2024-12-17 | Unlocking the Potential of Digital Pathology: Novel Baselines for Compression | Maximilian Fischer et.al. | 2412.13137 | null |
2024-12-17 | Queries, Representation & Detection: The Next 100 Model Fingerprinting Schemes | Augustin Godinot et.al. | 2412.13021 | link |
2024-12-17 | AoI in Context-Aware Hybrid Radio-Optical IoT Networks | Aymen Hamrouni et.al. | 2412.12914 | null |
2024-12-17 | ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation | Shiqi Huang et.al. | 2412.12798 | link |
2024-12-17 | Preference Robust Ordinal Priority Approach and its Satisficing Extension for Multi-Attribute Decision-Making with Incomplete Information | Renlong Wang et.al. | 2412.12690 | null |
2024-12-17 | SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing | Chen Chen et.al. | 2412.12685 | link |
2024-12-17 | DriveTester: A Unified Platform for Simulation-Based Autonomous Driving Testing | Mingfei Cheng et.al. | 2412.12656 | link |
2024-12-17 | PBVS 2024 Solution: Self-Supervised Learning and Sampling Strategies for SAR Classification in Extreme Long-Tail Distribution | Yuhyun Kim et.al. | 2412.12565 | null |
2024-12-17 | Beyond Data Quantity: Key Factors Driving Performance in Multilingual Language Models | Sina Bagheri Nezhad et.al. | 2412.12500 | link |
2024-12-16 | CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models | Felix Taubner et.al. | 2412.12093 | null |
2024-12-16 | IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations | Zhibing Li et.al. | 2412.12083 | null |
2024-12-16 | Hybrid quantum network for sensing in the acoustic frequency range | Valeriy Novikov et.al. | 2412.11824 | null |
2024-12-16 | Image Gradient-Aided Photometric Stereo Network | Kaixuan Wang et.al. | 2412.11650 | null |
2024-12-16 | DVP-MVS: Synergize Depth-Edge and Visibility Prior for Multi-View Stereo | Zhenlong Yuan et.al. | 2412.11578 | null |
2024-12-16 | RoMeO: Robust Metric Visual Odometry | Junda Cheng et.al. | 2412.11530 | null |
2024-12-16 | SpatialMe: Stereo Video Conversion Using Depth-Warping and Blend-Inpainting | Jiale Zhang et.al. | 2412.11512 | null |
2024-12-15 | On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning | Pengfei Fang et.al. | 2412.11017 | null |
2024-12-13 | EvalGIM: A Library for Evaluating Generative Image Models | Melissa Hall et.al. | 2412.10604 | link |
2024-12-13 | Enhancing Fine-Grained Vision-Language Pretraining with Negative Augmented Samples | Yeyuan Wang et.al. | 2412.10029 | null |
2024-12-13 | All-in-One: Transferring Vision Foundation Models into Stereo Matching | Jingyi Zhou et.al. | 2412.09912 | null |
2024-12-13 | OpenForge: Probabilistic Metadata Integration | Tianji Cong et.al. | 2412.09788 | link |
2024-12-12 | Egyptian fractions meet the Sierpinski triangle | Laura De Carli et.al. | 2412.09728 | null |
2024-12-12 | Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos | Linyi Jin et.al. | 2412.09621 | null |
2024-12-12 | Learned Compression for Compressed Learning | Dan Jacobellis et.al. | 2412.09405 | link |
2024-12-12 | T-SVG: Text-Driven Stereoscopic Video Generation | Qiao Jin et.al. | 2412.09323 | null |
2024-12-12 | Multimodal Sentiment Analysis based on Video and Audio Inputs | Antonio Fernandez et.al. | 2412.09317 | null |
2024-12-12 | Pinpoint Counterfactuals: Reducing social bias in foundation models via localized counterfactual generation | Kirill Sirotkin et.al. | 2412.09160 | null |
2024-12-12 | LV-CadeNet: Long View Feature Convolution-Attention Fusion Encoder-Decoder Network for Clinical MEG Spike Detection | Kuntao Xiao et.al. | 2412.08896 | null |
2024-12-11 | jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images | Andreas Koukounas et.al. | 2412.08802 | null |
2024-12-11 | TGOSPA Metric Parameters Selection and Evaluation for Visual Multi-object Tracking | Jan Krejčí et.al. | 2412.08321 | null |
2024-12-11 | Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation | Marta R. Costa-jussà et.al. | 2412.08279 | null |
2024-12-11 | Neural Observation Field Guided Hybrid Optimization of Camera Placement | Yihan Cao et.al. | 2412.08266 | link |
2024-12-11 | Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions | Mohammadmostafa Rostamkhani et.al. | 2412.08169 | link |
2024-12-11 | Rigid Communication Topologies: Impact on Stability, Safety, Energy Consumption, Passenger Comfort, and Robustness of Vehicular Platoons | Amir Zakerimanesh et.al. | 2412.08122 | null |
2024-12-11 | Multilingual LLMs Inherently Reward In-Language Time-Sensitive Semantic Alignment for Low-Resource Languages | Ashutosh Bajpai et.al. | 2412.08090 | link |
2024-12-10 | A large language model-based approach to quantifying the effects of social determinants in liver transplant decisions | Emily Robitschek et.al. | 2412.07924 | null |
2024-12-10 | ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer | Jinyi Hu et.al. | 2412.07720 | link |
2024-12-10 | Access to care improves EHR reliability and clinical risk prediction model performance | Anna Zink et.al. | 2412.07712 | null |
2024-12-10 | Stereo Hand-Object Reconstruction for Human-to-Robot Handover | Yik Lung Pang et.al. | 2412.07487 | null |
2024-12-10 | PRM: Photometric Stereo based Large Reconstruction Model | Wenhang Ge et.al. | 2412.07371 | null |
2024-12-10 | A Bayesian Mixture Model Approach to Examining Neighborhood Social Determinants of Health Disparities in Endometrial Cancer Care in Massachusetts | Carmen B. Rodriguez et.al. | 2412.07134 | null |
2024-12-10 | TT-MPD: Test Time Model Pruning and Distillation | Haihang Wu et.al. | 2412.07114 | null |
2024-12-09 | MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds | Zhenggang Tang et.al. | 2412.06974 | null |
2024-12-09 | Bridging the Divide: Reconsidering Softmax and Linear Attention | Dongchen Han et.al. | 2412.06590 | link |
2024-12-09 | Emerging Challenges in Molecular Paleontology: Misapplication of Environmental DNA Fragments and Misconception of Deamination as a Key Criterion for In Situ DNA Identification | Wan-Qian Zhao et.al. | 2412.06378 | null |
2024-12-09 | SeFENet: Robust Deep Homography Estimation via Semantic-Driven Feature Enhancement | Zeru Shi et.al. | 2412.06352 | null |
2024-12-08 | DECO: Life-Cycle Management of Enterprise-Grade Chatbots | Yiwen Zhu et.al. | 2412.06099 | null |
2024-12-08 | Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors | Alex Rich et.al. | 2412.05771 | null |
2024-12-07 | On the effective transfer of knowledge from English to Hindi Wikipedia | Paramita Das et.al. | 2412.05708 | link |
2024-12-07 | A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions | Ola Shorinwa et.al. | 2412.05563 | null |
2024-12-06 | Excitation spectrum of a double supersolid in a trapped dipolar Bose mixture | Daniel Scheiermann et.al. | 2412.05215 | null |
2024-12-06 | Automatic Tissue Differentiation in Parotidectomy using Hyperspectral Imaging | Eric L. Wisotzky et.al. | 2412.04879 | null |
2024-12-06 | Differentially Private Random Feature Model | Chunyang Liao et.al. | 2412.04785 | link |
2024-12-06 | Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs | Kun Wu et.al. | 2412.04747 | null |
2024-12-05 | From Models to Systems: A Comprehensive Fairness Framework for Compositional Recommender Systems | Brian Hsu et.al. | 2412.04655 | null |
2024-12-05 | Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail | Luca Bartolomei et.al. | 2412.04472 | link |
2024-12-05 | Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird’s-Eye-View via Uncertainty Measure | Saheli Hazra et.al. | 2412.04337 | null |
2024-12-05 | Complexity of Vector-valued Prediction: From Linear Models to Stochastic Convex Optimization | Matan Schliserman et.al. | 2412.04274 | null |
2024-12-05 | Relationships between Keywords and Strong Beats in Lyrical Music | Callie C. Liao et.al. | 2412.04202 | null |
2024-12-05 | Adult Glioma Segmentation in Sub-Saharan Africa using Transfer Learning on Stratified Finetuning Data | Abhijeet Parida et.al. | 2412.04111 | link |
2024-12-05 | Augmenting Minds or Automating Skills: The Differential Role of Human Capital in Generative AI’s Impact on Creative Tasks | Meiling Huang et.al. | 2412.03963 | null |
2024-12-05 | BEFL: Balancing Energy Consumption in Federated Learning for Mobile Edge IoT | Zehao Ju et.al. | 2412.03950 | link |
2024-12-05 | MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application | Hyesu Jang et.al. | 2412.03887 | null |
2024-12-05 | E-Commerce in Africa: Divergent Impacts on Rural and Urban Economies | Jaelyn S. Liang et.al. | 2412.03879 | null |
2024-12-05 | Un-evaluated Solutions May Be Valuable in Expensive Optimization | Hao Hao et.al. | 2412.03858 | null |
2024-12-04 | Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter | Hermes McGriff et.al. | 2412.03518 | null |
2024-12-04 | NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images | Lingen Li et.al. | 2412.03517 | null |
2024-12-04 | Data Fusion of Semantic and Depth Information in the Context of Object Detection | Md Abu Yusuf et.al. | 2412.03490 | null |
2024-12-04 | Exploring trends in audio mixes and masters: Insights from a dataset analysis | Angeliki Mourgela et.al. | 2412.03373 | null |
2024-12-04 | TASR: Timestep-Aware Diffusion Model for Image Super-Resolution | Qinwei Lin et.al. | 2412.03355 | link |
2024-12-04 | Social media and suicide: empirical evidence from the quasi-exogenous geographical adoption of Twitter | Alexis Du et.al. | 2412.03217 | null |
2024-12-04 | MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras | Huai Yu et.al. | 2412.03146 | link |
2024-12-03 | Quaternion-based Unscented Kalman Filter for 6-DoF Vision-based Inertial Navigation in GPS-denied Regions | Khashayar Ghanizadegan et.al. | 2412.02768 | null |
2024-12-03 | ROVER: A Multi-Season Dataset for Visual SLAM | Fabian Schmidt et.al. | 2412.02506 | link |
2024-12-03 | Single-Shot Metric Depth from Focused Plenoptic Cameras | Blanca Lasheras-Hernandez et.al. | 2412.02386 | null |
2024-12-03 | Dual Exposure Stereo for Extended Dynamic Range 3D Imaging | Juhyung Choi et.al. | 2412.02351 | null |
2024-12-03 | SparseLGS: Sparse View Language Embedded Gaussian Splatting | Jun Hu et.al. | 2412.02245 | null |
2024-12-03 | Crash Severity Risk Modeling Strategies under Data Imbalance | Abdullah Al Mamun et.al. | 2412.02094 | null |
2024-12-02 | Mutli-View 3D Reconstruction using Knowledge Distillation | Aditya Dutt et.al. | 2412.02039 | link |
2024-12-02 | A Shared Standard for Valid Measurement of Generative AI Systems’ Capabilities, Risks, and Impacts | Alexandra Chouldechova et.al. | 2412.01934 | null |
2024-12-02 | World-consistent Video Diffusion with Explicit 3D Modeling | Qihang Zhang et.al. | 2412.01821 | null |
2024-12-03 | FairML: A Julia Package for Fair Classification | Jan Pablo Burgard et.al. | 2412.01585 | link |
2024-12-02 | Second FRCSyn-onGoing: Winning Solutions and Post-Challenge Analysis to Improve Face Recognition with Synthetic Data | Ivan DeAndres-Tame et.al. | 2412.01383 | null |
2024-11-29 | Quantifying the synthetic and real domain gap in aerial scene understanding | Alina Marcu et.al. | 2411.19913 | null |
2024-11-29 | Privacy-Preserving Orthogonal Aggregation for Guaranteeing Gender Fairness in Federated Recommendation | Siqing Zhang et.al. | 2411.19678 | null |
2024-11-29 | Subjective and Objective Quality Assessment Methods of Stereoscopic Videos with Visibility Affecting Distortions | Sria Biswas et.al. | 2411.19522 | null |
2024-12-02 | GausSurf: Geometry-Guided 3D Gaussian Splatting for Surface Reconstruction | Jiepeng Wang et.al. | 2411.19454 | null |
2024-11-28 | Cross-Spectral Attention for Unsupervised RGB-IR Face Verification and Person Re-identification | Kshitij Nikhal et.al. | 2411.19215 | null |
2024-11-28 | Examining Multimodal Gender and Content Bias in ChatGPT-4o | Roberto Balestri et.al. | 2411.19140 | null |
2024-11-28 | Tracking Progress Towards Sustainable Development Goal 6 Using Satellite Imagery | Othmane Echchabi et.al. | 2411.19093 | null |
2024-11-28 | Study on the Influence of Embodied Avatars on Gait Parameters in Virtual Environments and Real World | Tianyi Zhou et.al. | 2411.18949 | null |
2024-11-27 | A Talent-infused Policy-gradient Approach to Efficient Co-Design of Morphology and Task Allocation Behavior of Multi-Robot Systems | Prajit KrisshnaKumar et.al. | 2411.18519 | null |
2024-11-27 | A comparison of extended object tracking with multi-modal sensors in indoor environment | Jiangtao Shuai et.al. | 2411.18476 | null |
2024-11-27 | When does a bridge become an aeroplane? | Tina A. Dardeno et.al. | 2411.18406 | null |
2024-11-27 | Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation | Mehdi Zayene et.al. | 2411.18335 | link |
2024-11-27 | Pixel-aligned RGB-NIR Stereo Imaging and Dataset for Robot Vision | Jinnyeong Kim et.al. | 2411.18025 | null |
2024-11-26 | Updating the constraint on the quantum collapse models via kilogram masses | Qi Dai et.al. | 2411.17588 | null |
2024-11-26 | Navigating Spatial Inequities in Freight Truck Crash Severity via Counterfactual Inference in Los Angeles | Yichen Wang et.al. | 2411.17554 | null |
2024-11-26 | Variational Quantum Simulation of the Fokker-Planck Equation applied to Quantum Radiation Reaction | Óscar Amaro et.al. | 2411.17517 | link |
2024-11-26 | Object-centric proto-symbolic behavioural reasoning from pixels | Ruben van Bergen et.al. | 2411.17438 | link |
2024-11-26 | Enhancing Imbalance Learning: A Novel Slack-Factor Fuzzy SVM Approach | M. Tanveer et.al. | 2411.17128 | link |
2024-11-26 | Multimodal Alignment and Fusion: A Survey | Songtao Li et.al. | 2411.17040 | null |
2024-11-24 | PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation | Ziyao Zeng et.al. | 2411.16750 | null |
2024-11-25 | Location-Based Service (LBS) Data Quality Metrics and Effects on Mobility Inference | Xinhua Wu et.al. | 2411.16595 | null |
2024-11-23 | IRSKG: Unified Intrusion Response System Knowledge Graph Ontology for Cyber Defense | Damodar Panigrahi et.al. | 2411.15672 | null |
2024-11-23 | Elucidating the nature of axial-vector charm-antibottom tetraquark states | U. Özdem et.al. | 2411.15508 | null |
2024-11-22 | Adaptive Group Robust Ensemble Knowledge Distillation | Patrik Kenfack et.al. | 2411.14984 | null |
2024-11-22 | A Benchmark Dataset for Collaborative SLAM in Service Environments | Harin Park et.al. | 2411.14775 | link |
2024-11-22 | FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification | Zhengrui Guo et.al. | 2411.14743 | link |
2024-11-22 | Boson-fermion universality of mesoscopic entanglement fluctuations in free systems | Cunzhong Lou et.al. | 2411.14687 | null |
2024-11-21 | Learning Fair Robustness via Domain Mixup | Meiyu Zhong et.al. | 2411.14424 | null |
2024-11-21 | InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation | Marziyeh Bamdad et.al. | 2411.14358 | link |
2024-11-21 | StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart | Jian Shi et.al. | 2411.14295 | link |
2024-11-21 | Why do language models perform worse for morphologically complex languages? | Catherine Arnett et.al. | 2411.14198 | link |
2024-11-21 | Compact Visual Data Representation for Green Multimedia – A Human Visual System Perspective | Peilin Chen et.al. | 2411.14135 | null |
2024-11-21 | Stereo Anything: Unifying Stereo Matching with Large-Scale Mixed Data | Xianda Guo et.al. | 2411.14053 | link |
2024-11-21 | XAgents: A Framework for Interpretable Rule-Based Multi-Agents Cooperation | Hailong Yang et.al. | 2411.13932 | null |
2024-11-20 | Predictive Insights into LGBTQ+ Minority Stress: A Transductive Exploration of Social Media Discourse | S. Chapagain et.al. | 2411.13534 | link |
2024-11-20 | Non-Perturbative Corrections to Charged Black Hole Evaporation | Vyshnav Mohan et.al. | 2411.13454 | null |
2024-11-20 | BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation | Umamaheswaran Raman Kumar et.al. | 2411.13251 | null |
2024-11-20 | Asymptotic-Preserving schemes for the Boltzmann mixture model with disparate mass | Zhen Hao et.al. | 2411.13240 | null |
2024-11-20 | Superpixel Cost Volume Excitation for Stereo Matching | Shanglong Liu et.al. | 2411.13105 | null |
2024-11-19 | MLDGG: Meta-Learning for Domain Generalization on Graphs | Qin Tian et.al. | 2411.12913 | null |
2024-11-19 | Towards Fairness in AI for Melanoma Detection: Systemic Review and Recommendations | Laura N Montoya et.al. | 2411.12846 | null |
2024-11-19 | Human-Robot Dialogue Annotation for Multi-Modal Common Ground | Claire Bonial et.al. | 2411.12829 | link |
2024-11-19 | Probing the Capacity of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction | Sonny George et.al. | 2411.12828 | link |
2024-11-19 | Multivariate and Online Transfer Learning with Uncertainty Quantification | Jimmy Hickey et.al. | 2411.12555 | null |
2024-11-19 | Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution | Yang Zou et.al. | 2411.12530 | link |
2024-11-19 | Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph | Ziyang Chen et.al. | 2411.12426 | link |
2024-11-19 | Cities beyond proximity | Dan Hill et.al. | 2411.12335 | null |
2024-11-19 | Neuro-3D: Towards 3D Visual Decoding from EEG Signals | Zhanqiang Guo et.al. | 2411.12248 | null |
2024-11-18 | MMBind: Unleashing the Potential of Distributed and Heterogeneous Data for Multimodal Learning in IoT | Xiaomin Ouyang et.al. | 2411.12126 | null |
2024-11-18 | Fair Distillation: Teaching Fairness from Biased Teachers in Medical Imaging | Milad Masroor et.al. | 2411.11939 | null |
2024-11-18 | SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input | Zhen Lv et.al. | 2411.11934 | null |
2024-11-18 | The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather | Markus Schön et.al. | 2411.11455 | null |
2024-11-18 | Causal Effect of Group Diversity on Redundancy and Coverage in Peer-Reviewing | Navita Goyal et.al. | 2411.11437 | null |
2024-11-17 | Label Sharing Incremental Learning Framework for Independent Multi-Label Segmentation Tasks | Deepa Anand et.al. | 2411.11105 | null |
2024-11-16 | BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment | Sizhe Wang et.al. | 2411.10914 | null |
2024-11-16 | DEAL: Decoupled Classifier with Adaptive Linear Modulation for Group Robust Early Diagnosis of MCI to AD Conversion | Donggyu Lee et.al. | 2411.10814 | null |
2024-11-16 | LTCXNet: Advancing Chest X-Ray Analysis with Solutions for Long-Tailed Multi-Label Classification and Fairness Challenges | Chin-Wei Huang et.al. | 2411.10746 | null |
2024-11-16 | A Wearable Gait Monitoring System for 17 Gait Parameters Based on Computer Vision | Jiangang Chen et.al. | 2411.10739 | null |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-15 | Debias-CLR: A Contrastive Learning Based Debiasing Method for Algorithmic Fairness in Healthcare Applications | Ankita Agarwal et.al. | 2411.10544 | null |
2024-11-15 | Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion | Haoran Wei et.al. | 2411.10369 | null |
2024-11-15 | Domain Adaptation-based Edge Computing for Cross-Conditions Fault Diagnosis | Yanzhi Wang et.al. | 2411.10340 | null |
2024-11-15 | Filament eruption deflection and associated CMEs | K. Koleva et.al. | 2411.10110 | null |
2024-11-15 | Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses | Yongfan Liu et.al. | 2411.10013 | link |
2024-11-15 | Assessing Response Disparities in California Wildland-Urban-Interface (WUI) Cities Using the Compartmental Model | Zihui Ma et.al. | 2411.09946 | null |
2024-11-14 | Propensity Score Matching: Should We Use It in Designing Observational Studies? | Fei Wan et.al. | 2411.09579 | null |
2024-11-14 | Everyone deserves their voice to be heard: Analyzing Predictive Gender Bias in ASR Models Applied to Dutch Speech Data | Rik Raes et.al. | 2411.09431 | null |
2024-11-14 | Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching | Yuran Wang et.al. | 2411.09151 | null |
2024-11-14 | Artificial Intelligence for Quantum Computing | Yuri Alexeev et.al. | 2411.09131 | null |
2024-11-13 | Fluoroformer: Scaling multiple instance learning to multiplexed images via attention-based channel fusion | Marc Harary et.al. | 2411.08975 | link |
2024-11-13 | Gendered Words and Grant Rates: A Textual Analysis of Disparate Outcomes in the Patent System | Deborah Gerhardt et.al. | 2411.08526 | null |
2024-11-13 | Anomalous Hall effect from inter-superlattice scattering in a noncollinear antiferromagnet | Lilia S. Xie et.al. | 2411.08381 | null |
2024-11-12 | Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset | Khaoula Chehbouni et.al. | 2411.08243 | null |
2024-11-12 | Detection asymmetry in solar energetic particle events | S. Dalla et.al. | 2411.08211 | null |
2024-11-12 | Estimating Variability in Hospital Charges: The Case of Cesarean Section | Anna Perfilyeva et.al. | 2411.08174 | null |
2024-11-11 | Identifying Differential Patient Care Through Inverse Intent Inference | Hyewon Jeong et.al. | 2411.07372 | null |
2024-11-11 | Targeting mediating mechanisms of social disparities with an interventional effects framework, applied to the gender pay gap in West Germany | Christiane Didden et.al. | 2411.07368 | null |
2024-11-11 | $SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation | Yinshuang Xu et.al. | 2411.07326 | null |
2024-11-11 | Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations | Kirti Bhagat et.al. | 2411.07320 | link |
2024-11-10 | Analysis of spatially clustered survival data with unobserved covariates using SBART | Durbadal Ghosh et.al. | 2411.06591 | null |
2024-11-10 | Image Segmentation from Shadow-Hints using Minimum Spanning Trees | Moritz Heep et.al. | 2411.06530 | null |
2024-11-10 | SymmeTac: Symmetric Color LED Driven Efficient Photometric Stereo Reconstruction Methods for Camera-based Tactile Sensors | Jieji Ren et.al. | 2411.06377 | link |
2024-11-08 | Characterizing Implementability of Global Protocols with Infinite States and Data | Elaine Li et.al. | 2411.05722 | null |
2024-11-08 | Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation | Peidong Liu et.al. | 2411.05472 | link |
2024-11-08 | From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$ -NeuS | Haoran Zhang et.al. | 2411.05362 | link |
2024-11-07 | Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks? | Jonathan Roberts et.al. | 2411.05000 | null |
2024-11-06 | Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation | Teppei Kurita et.al. | 2411.04714 | null |
2024-11-11 | The Multiple Dimensions of Spuriousness in Machine Learning | Samuel J. Bell et.al. | 2411.04696 | null |
2024-11-07 | Comparing Fairness of Generative Mobility Models | Daniel Wang et.al. | 2411.04453 | null |
2024-11-06 | Topology Bench: Systematic Graph Based Benchmarking for Core Optical Networks | Robin Matzner et.al. | 2411.04160 | null |
2024-11-06 | Optimizing Quantum Circuits, Fast and Slow | Amanda Xu et.al. | 2411.04104 | null |
2024-11-06 | These Maps Are Made by Propagation: Adapting Deep Stereo Networks to Road Scenarios with Decisive Disparity Diffusion | Chuang-Wei Liu et.al. | 2411.03717 | null |
2024-11-06 | Physical Layer Deception in OFDM Systems | Wenwen Chen et.al. | 2411.03677 | null |
2024-11-06 | Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions | Zihan Qin et.al. | 2411.03638 | null |
2024-11-05 | Exploring the Cybersecurity-Resilience Gap: An Analysis of Student Attitudes and Behaviors in Higher Education | Steve Goliath et.al. | 2411.03219 | null |
2024-11-05 | Gender Differences in Comparative Advantage Matches: Evidence from Linked Employer-Employee Data | Hugo Sant’Anna et.al. | 2411.03209 | null |
2024-11-04 | Designing and Evaluating Sampling Strategies for Multiple-Forecast Visualization (MFV) | Ruishi Zou et.al. | 2411.02576 | null |
2024-11-04 | Gravitational wave energy spectral density properties from BPASS Galactic binary population in the Milky Way galaxy | Petra Tang et.al. | 2411.02563 | null |
2024-11-04 | Neural optical flow for planar and stereo PIV | Andrew I. Masker et.al. | 2411.02373 | null |
2024-11-04 | Can Personalized Medicine Coexist with Health Equity? Examining the Cost Barrier and Ethical Implications | Kishi Kobe Yee Francisco et.al. | 2411.02307 | null |
2024-11-04 | Constructing Emergent U(1) Symmetries in the Gamma-prime $\left(\bf Γ^{\prime} \right)$ model | Sagar Ramchandani et.al. | 2411.02070 | null |
2024-11-04 | Typicalness-Aware Learning for Failure Detection | Yijun Liu et.al. | 2411.01981 | link |
2024-11-04 | A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding | Yitong Dong et.al. | 2411.01893 | null |
2024-11-03 | Mitigating Matching Biases Through Score Calibration | Mohammad Hossein Moslemi et.al. | 2411.01685 | link |
2024-11-03 | One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection | Zhenyu Wang et.al. | 2411.01584 | null |
2024-11-02 | Visual Fourier Prompt Tuning | Runjia Zeng et.al. | 2411.01327 | link |
2024-11-02 | On The Influence Of The Solar Wind On The Propagation Of Earth-impacting Coronal Mass Ejections | Sandeep Kumar et.al. | 2411.01165 | null |
2024-11-02 | Why Does the Cortex Have Such a Vast Storage Capacity? | Hui Wei et.al. | 2411.01164 | null |
2024-10-31 | Matchmaker: Self-Improving Large Language Model Programs for Schema Matching | Nabeel Seedat et.al. | 2410.24105 | null |
2024-10-31 | A Multi-Modal Approach for Face Anti-Spoofing in Non-Calibrated Systems using Disparity Maps | Ariel Larey et.al. | 2410.24031 | null |
2024-10-31 | Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts | Xiang Deng et.al. | 2410.23836 | null |
2024-10-30 | Enhancing Image Resolution: A Simulation Study and Sensitivity Analysis of System Parameters for Resourcesat-3S/3SA | Ankur Garg et.al. | 2410.23319 | null |
2024-10-30 | TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models | Ziyao Shangguan et.al. | 2410.23266 | link |
2024-10-30 | Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe | Songyu Xu et.al. | 2410.23154 | null |
2024-10-30 | FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training | Tejaswini Medi et.al. | 2410.23142 | null |
2024-10-30 | Decarbonisation of industry and the energy system: exploring mutual impacts and investment planning | Quentin Raillard-Cazanove et.al. | 2410.23025 | null |
2024-10-30 | Improving Musical Accompaniment Co-creation via Diffusion Transformers | Javier Nistal et.al. | 2410.23005 | null |
2024-10-30 | Knowledge Graph Based Visual Search Application | Pawandeep Kaur Betz et.al. | 2410.22846 | null |
2024-10-30 | Price Regulation, Technology and Provider Redistribution | Piyush Akimitsu et.al. | 2410.22616 | null |
2024-10-29 | FairSkin: Fair Diffusion for Skin Disease Image Generation | Ruichen Zhang et.al. | 2410.22551 | null |
2024-10-29 | From Silos to Systems: Process-Oriented Hazard Analysis for AI Systems | Shalaleh Rismani et.al. | 2410.22526 | null |
2024-10-29 | Multimodal Structure Preservation Learning | Chang Liu et.al. | 2410.22520 | null |
2024-10-29 | Relieving scale disparity in binary black hole simulations | Nikolas A. Wittek et.al. | 2410.22290 | null |
2024-10-29 | Complex-Phase Extensions of Szegedy Quantum Walk on Graphs | Sergio A. Ortega et.al. | 2410.22011 | null |
2024-10-29 | Photonic systolic array for all-optical matrix-matrix multiplication | Jungmin Kim et.al. | 2410.21671 | null |
2024-10-28 | Intersectional inequalities in social networks | Samuel Martin-Gutierez et.al. | 2410.21189 | link |
2024-10-28 | Revealing the core-periphery structure of cities | Federica Fanelli et.al. | 2410.21133 | null |
2024-10-28 | BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment | Mehdi Hosseinzadeh et.al. | 2410.20969 | null |
2024-10-28 | The Zeno’s Paradox of `Low-Resource’ Languages | Hellina Hailu Nigatu et.al. | 2410.20817 | null |
2024-10-28 | Faster WIND: Accelerating Iterative Best-of- $N$ Distillation for LLM Alignment | Tong Yang et.al. | 2410.20727 | null |
2024-10-28 | Physics-Free Spectrally Multiplexed Photometric Stereo under Unknown Spectral Composition | Satoshi Ikehata et.al. | 2410.20716 | link |
2024-10-27 | Language Models And A Second Opinion Use Case: The Pocket Professional | David Noever et.al. | 2410.20636 | null |
2024-10-27 | TabDiff: a Multi-Modal Diffusion Model for Tabular Data Generation | Juntong Shi et.al. | 2410.20626 | link |
2024-10-27 | Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks? | Xuan He et.al. | 2410.20533 | link |
2024-10-27 | A Navier-Stokes asymptotic preserving Direct Simulation Monte Carlo method for multi-species gas flows | Fei Fei et.al. | 2410.20322 | null |
2024-10-25 | DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems | Muhammad Zaeem Shahzad et.al. | 2410.19336 | null |
2024-10-24 | Self-organized homogenization of flow networks | Julien Bouvard et.al. | 2410.19089 | null |
2024-10-24 | Bridge-Coder: Unlocking LLMs’ Potential to Overcome Language Gaps in Low-Resource Code | Jipeng Zhang et.al. | 2410.18957 | null |
2024-10-27 | Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis | Liang Han et.al. | 2410.18822 | null |
2024-10-24 | Rigid Single-Slice-in-Volume registration via rotation-equivariant 2D/3D feature matching | Stefan Brandstätter et.al. | 2410.18683 | null |
2024-10-24 | A Cranial-Feature-Based Registration Scheme for Robotic Micromanipulation Using a Microscopic Stereo Camera System | Xiaofeng Lin et.al. | 2410.18630 | null |
2024-10-24 | Spatial-Temporal Search for Spiking Neural Networks | Kaiwei Che et.al. | 2410.18580 | null |
2024-10-24 | Estimating early coronal mass ejection propagation direction with DIRECD during the severe May 8 and follow-up June 8, 2024 events | Shantanu Jain et.al. | 2410.18549 | null |
2024-10-24 | Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction | Hongxin Peng et.al. | 2410.18433 | null |
2024-10-24 | Large Language Models Reflect the Ideology of their Creators | Maarten Buyl et.al. | 2410.18417 | link |
2024-10-23 | Pathological Rheology of Non-Stretching Entangled Polymers: Finite-Time Blow-Up Predictions | Vickie Chen et.al. | 2410.18306 | null |
2024-10-23 | Rethinking Positive Pairs in Contrastive Learning | Jiantao Wu et.al. | 2410.18200 | null |
2024-10-23 | Continual Learning on a Data Diet | Elif Ceren Gok Yildirim et.al. | 2410.17715 | link |
2024-10-23 | Role of the argon and helium bath gases on the structure of H2/O2 detonations | Farzane Zangene et.al. | 2410.17561 | null |
2024-10-22 | Characterizing Robocalls with Multiple Vantage Points | Sathvik Prasad et.al. | 2410.17361 | null |
2024-10-22 | FairLoRA: Unpacking Bias Mitigation in Vision Models with Fairness-Driven Low-Rank Adaptation | Rohan Sukumaran et.al. | 2410.17358 | null |
2024-10-22 | Dhoroni: Exploring Bengali Climate Change and Environmental Views with a Multi-Perspective News Dataset and Natural Language Processing | Azmine Toushik Wasi et.al. | 2410.17225 | link |
2024-10-22 | Arabic Dataset for LLM Safeguard Evaluation | Yasser Ashraf et.al. | 2410.17040 | link |
2024-10-22 | DENOASR: Debiasing ASRs through Selective Denoising | Anand Kumar Rai et.al. | 2410.16712 | null |
2024-10-21 | GReFEL: Geometry-Aware Reliable Facial Expression Learning under Bias and Imbalanced Data Distribution | Azmine Toushik Wasi et.al. | 2410.15927 | null |
2024-10-21 | Analysis of short-run and long-run marginal costs of generation in the power market | Shamim Homaei et.al. | 2410.15861 | null |
2024-10-20 | A hybrid origin for the Martian atmosphere | Kaveh Pahlevan et.al. | 2410.15508 | null |
2024-10-20 | Investigating the Impact of Age and Sex on Cataract Surgery Complications and Outcomes | Hadas Ben-Eli Yaacov Cnaany et.al. | 2410.15505 | null |
2024-10-20 | CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts | Malvina Nikandrou et.al. | 2410.15453 | link |
2024-10-20 | ActiveNeuS: Neural Signed Distance Fields for Active Stereo | Kazuto Ichimaru et.al. | 2410.15376 | null |
2024-10-19 | A Semidefinite Relaxation Approach for Fair Graph Clustering | Sina Baharlouei et.al. | 2410.15233 | link |
2024-10-19 | Smart-optimism. Uncovering the Resilience of Romanian City Halls in Online Service Delivery | Catalin Vrabie et.al. | 2410.15189 | null |
2024-10-19 | Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation | Seulbi Lee et.al. | 2410.14975 | null |
2024-10-18 | A Complexity-Based Theory of Compositionality | Eric Elmoznino et.al. | 2410.14817 | null |
2024-10-18 | Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum | Ryan Soh-Eun Shim et.al. | 2410.14589 | null |
2024-10-18 | Sim2real Cattle Joint Estimation in 3D point clouds | Okour Mohammad et.al. | 2410.14419 | null |
2024-10-18 | Coded Water-Filling for Multi-User Interference Cancellation | Yuan Li et.al. | 2410.14136 | null |
2024-10-17 | Auditing and Enforcing Conditional Fairness via Optimal Transport | Mohsen Ghassemi et.al. | 2410.14029 | null |
2024-10-17 | A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models | Qiaoyu Tang et.al. | 2410.13841 | null |
2024-10-17 | The Disparate Benefits of Deep Ensembles | Kajetan Schweighofer et.al. | 2410.13831 | link |
2024-10-18 | Aggregation Artifacts in Subjective Tasks Collapse Large Language Models’ Posteriors | Georgios Chochlakis et.al. | 2410.13776 | null |
2024-10-17 | Material Fingerprinting: Identifying and Predicting Perceptual Attributes of Material Appearance | Jiri Filip et.al. | 2410.13615 | null |
2024-10-17 | SAda-Net: A Self-Supervised Adaptive Stereo Estimation CNN For Remote Sensing Image Data | Dominik Hirner et.al. | 2410.13500 | link |
2024-10-17 | Inner ear morphology in wild versus laboratory house mice | Sabrina Renaud et.al. | 2410.13325 | null |
2024-10-17 | Perceptions of Discriminatory Decisions of Artificial Intelligence: Unpacking the Role of Individual Characteristics | Soojong Kim et.al. | 2410.13250 | null |
2024-10-16 | A Location Validation Technique to Mitigate GPS Spoofing Attacks in IEEE 802.11p based Fleet Operator’s Network of Electric Vehicles | Ankita Samaddar et.al. | 2410.13031 | null |
2024-10-16 | Stability properties for subgroups generated by return words | France Gheeraert et.al. | 2410.12534 | null |
2024-10-16 | Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention | Weixuan Wang et.al. | 2410.12462 | link |
2024-10-16 | Real-time Stereo-based 3D Object Detection for Streaming Perception | Changcai Li et.al. | 2410.12394 | link |
2024-10-16 | Pyramid-Driven Alignment: Pyramid Principle Guided Integration of Large Language Models and Knowledge Graphs | Lei Sun et.al. | 2410.12298 | null |
2024-10-15 | A Software Engineering Capstone Course Facilitated By GitHub Templates | Spencer Smith et.al. | 2410.12114 | null |
2024-10-15 | DAXA: Traversing the X-ray desert by Democratising Archival X-ray Astronomy | David J. Turner et.al. | 2410.11954 | link |
2024-10-15 | Adaptive Coordinators and Prompts on Heterogeneous Graphs for Cross-Domain Recommendations | Hengyu Zhang et.al. | 2410.11719 | null |
2024-10-15 | Multiple scales homogenisation of a porous viscoelastic material with rigid inclusions: application to lithium-ion battery electrodes | J. M. Foster et.al. | 2410.11699 | null |
2024-10-16 | Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture | Dabbrata Das et.al. | 2410.11610 | link |
2024-10-15 | Towards a Healthy AI Tradition: Lessons from Biology and Biomedical Science | Simon Kasif et.al. | 2410.11590 | null |
2024-10-15 | MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields | Yuru Xiao et.al. | 2410.11394 | null |
2024-10-15 | Improving Bias in Facial Attribute Classification: A Combined Impact of KL Divergence induced Loss Function and Dual Attention | Shweta Patel et.al. | 2410.11176 | null |
2024-10-14 | Solving the Transient Dyson Equation with Quasilinear Complexity via Matrix Compression | Baptiste Lamic et.al. | 2410.11057 | null |
2024-10-14 | Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation | Emmanouil Zaranis et.al. | 2410.10995 | link |
2024-10-14 | Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation | Peiwen Sun et.al. | 2410.10676 | null |
2024-10-14 | MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator | Taozhe Li et.al. | 2410.10669 | null |
2024-10-14 | Double Jeopardy and Climate Impact in the Use of Large Language Models: Socio-economic Disparities and Reduced Utility for Non-English Speakers | Aivin V. Solatorio et.al. | 2410.10665 | link |
2024-10-14 | Energetic Analysis of Emerging Quantum Communication Protocols | Raja Yehia et.al. | 2410.10661 | link |
2024-10-14 | Dual-Path Mechanism of Amino Acid Racemization Mediated by Quantum Mechanical Tunneling | Xinrui Yang et.al. | 2410.10544 | null |
2024-10-14 | Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world | Han Ling et.al. | 2410.10453 | link |
2024-10-14 | Minimum Tuning to Unlock Long Output from LLMs with High Quality Data as the Key | Yingda Chen et.al. | 2410.10210 | null |
2024-10-13 | Robust 3D Point Clouds Classification based on Declarative Defenders | Kaidong Li et.al. | 2410.09691 | link |
2024-10-12 | Scito2M: A 2 Million, 30-Year Cross-disciplinary Dataset for Temporal Scientometric Analysis | Yiqiao Jin et.al. | 2410.09510 | link |
2024-10-12 | Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors | Hritam Basak et.al. | 2410.09467 | null |
2024-10-11 | Efficient Multi-Object Tracking on Edge Devices via Reconstruction-Based Channel Pruning | Jan Müller et.al. | 2410.08769 | null |
2024-10-11 | No Tick-Size Too Small: A General Method for Modelling Small Tick Limit Order Books | Konark Jain et.al. | 2410.08744 | null |
2024-10-11 | Bio-inspired reconfigurable stereo vision for robotics using omnidirectional cameras | Suchang Chen et.al. | 2410.08691 | null |
2024-10-10 | PubMed knowledge graph 2.0: Connecting papers, patents, and clinical trials in biomedical science | Jian Xu et.al. | 2410.07969 | null |
2024-10-10 | Determining the Magnetic Field in the Galactic Plane from New Arecibo Pulsar Faraday Rotation Measurements | Alice P. Curtin et.al. | 2410.07967 | null |
2024-10-10 | A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways | Jing Su et.al. | 2410.07915 | null |
2024-10-10 | Multi-Scale Deformable Transformers for Student Learning Behavior Detection in Smart Classroom | Zhifeng Wang et.al. | 2410.07834 | null |
2024-10-09 | ACDC: Automated Creation of Digital Cousins for Robust Policy Learning | Tianyuan Dai et.al. | 2410.07408 | null |
2024-10-09 | Enhancing Performance of Point Cloud Completion Networks with Consistency Loss | Kevin Tirta Wijaya et.al. | 2410.07298 | null |
2024-10-09 | IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation | Xinchen Zhang et.al. | 2410.07171 | link |
2024-10-10 | Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology | Xiangyu Wang et.al. | 2410.07087 | null |
2024-10-09 | CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models | Zi Gong et.al. | 2410.06741 | link |
2024-10-09 | Analysis of different disparity estimation techniques on aerial stereo image datasets | Ishan Narayan et.al. | 2410.06711 | null |
2024-10-09 | Decomposing Relationship from 1-to-N into N 1-to-1 for Text-Video Retrieval | Jian Xiao et.al. | 2410.06618 | link |
2024-10-09 | The Sampling-Gaussian for stereo matching | Baiyu Pan et.al. | 2410.06527 | null |
2024-10-09 | OledFL: Unleashing the Potential of Decentralized Federated Learning via Opposite Lookahead Enhancement | Qinglun Li et.al. | 2410.06482 | null |
2024-10-08 | Skin Cancer Machine Learning Model Tone Bias | James Pope et.al. | 2410.06385 | null |
2024-10-08 | HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction | Shengji Tang et.al. | 2410.06245 | null |
2024-10-08 | BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way | Jiazi Bu et.al. | 2410.06241 | null |
2024-10-07 | Studying and Mitigating Biases in Sign Language Understanding Models | Katherine Atwell et.al. | 2410.05206 | null |
2024-10-07 | Enhancing Equity in Large Language Models for Medical Applications | Yuelyu Ji et.al. | 2410.05180 | link |
2024-10-07 | Presto! Distilling Steps and Layers for Accelerating Music Generation | Zachary Novack et.al. | 2410.05167 | null |
2024-10-07 | Correcting for Popularity Bias in Recommender Systems via Item Loss Equalization | Juno Prent et.al. | 2410.04830 | null |
2024-10-07 | The divide between us: Internet access among people with and without disabilities in the post-pandemic era | Edgar Pacheco et.al. | 2410.04825 | null |
2024-10-06 | Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives | Carolina Veiga et.al. | 2410.04318 | null |
2024-10-05 | Fast Object Detection with a Machine Learning Edge Device | Richard C. Rodriguez et.al. | 2410.04173 | null |
2024-10-05 | High-Speed Stereo Visual SLAM for Low-Powered Computing Devices | Ashish Kumar et.al. | 2410.04090 | link |
2024-10-05 | Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy | Pengcheng Chen et.al. | 2410.04041 | null |
2024-10-04 | Improving Arabic Multi-Label Emotion Classification using Stacked Embeddings and Hybrid Loss Function | Nisar Ahmed et.al. | 2410.03979 | link |
2024-10-04 | Noncollinear ferrielectricity and hydrogen-induced ferromagnetic polar half-metallicity in MnO $_3$ Cl | Xinyu Yang et.al. | 2410.03220 | null |
2024-10-03 | Q-SCALE: Quantum computing-based Sensor Calibration for Advanced Learning and Efficiency | Lorenzo Bergadano et.al. | 2410.02998 | null |
2024-10-03 | Individuation of 3D perceptual units from neurogeometry of binocular cells | Maria Virginia Bolelli et.al. | 2410.02870 | null |
2024-10-03 | Pseudo-Stereo Inputs: A Solution to the Occlusion Challenge in Self-Supervised Stereo Matching | Ruizhi Yang et.al. | 2410.02534 | link |
2024-10-03 | Cooperative Semantic Knowledge Base Update Policy for Multiple Semantic Communication Pairs | Shuling Li et.al. | 2410.02405 | null |
2024-10-03 | Extracting the Potential of Emerging Hardware Accelerators for Symmetric Eigenvalue Decomposition | Hansheng Wang et.al. | 2410.02170 | null |
2024-10-03 | Quantum Mutual Information in Time | James Fullwood et.al. | 2410.02137 | null |
2024-10-04 | C-MELT: Contrastive Enhanced Masked Auto-Encoders for ECG-Language Pre-Training | Manh Pham et.al. | 2410.02131 | link |
2024-10-02 | Unified space-time description of pulsed twin beams | Alessandra Gatti et.al. | 2410.01907 | null |
2024-10-02 | Conformal Prediction Sets Can Cause Disparate Impact | Jesse C. Cresswell et.al. | 2410.01888 | link |
2024-10-02 | A Novel Framework of Horizontal-Vertical Hybrid Federated Learning for EdgeIoT | Kai Li et.al. | 2410.01644 | null |
2024-10-02 | Fair Class-Incremental Learning using Sample Weighting | Jaeyoung Park et.al. | 2410.01324 | null |
2024-10-02 | SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network | Ahmed Tawfik Aboukhadra et.al. | 2410.01293 | null |
2024-10-02 | Unifying the Scope of Bridging Anaphora Types in English: Bridging Annotations in ARRAU and GUM | Lauren Levine et.al. | 2410.01170 | null |
2024-10-01 | M2P2: A Multi-Modal Passive Perception Dataset for Off-Road Mobility in Extreme Low-Light Conditions | Aniket Datar et.al. | 2410.01105 | null |
2024-10-01 | A catalog of multi-vantage point observations of type-II bursts: Statistics and correlations | Atul Mohan et.al. | 2410.00814 | null |
2024-10-01 | CME-associated type-IV radio bursts: The solar paradigm and the unique case of AD Leo | Atul Mohan et.al. | 2410.00787 | null |
2024-10-01 | What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study | Beatrice Savoldi et.al. | 2410.00545 | link |
2024-10-01 | Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration | Yida Lin et.al. | 2410.00503 | null |
2024-09-30 | ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning | Jian Shi et.al. | 2410.00262 | link |
2024-09-30 | Uni $^2$ Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection | Yubin Wang et.al. | 2409.20558 | null |
2024-09-30 | Match Stereo Videos via Bidirectional Alignment | Junpeng Jing et.al. | 2409.20283 | null |
2024-09-30 | Understanding How Psychological Distance Influences User Preferences in Conversational Versus Web Search | Yitian Yang et.al. | 2409.19982 | null |
2024-09-30 | Positive-Sum Fairness: Leveraging Demographic Attributes to Achieve Fair AI Outcomes Without Sacrificing Group Gains | Samia Belhadj et.al. | 2409.19940 | null |
2024-09-29 | Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems | Xuyang Wu et.al. | 2409.19804 | link |
2024-09-29 | Fast-Convergent and Communication-Alleviated Heterogeneous Hierarchical Federated Learning in Autonomous Driving | Wei-Bin Kou et.al. | 2409.19560 | null |
2024-09-29 | Transforming Scholarly Landscapes: Influence of Large Language Models on Academic Fields beyond Computer Science | Aniket Pramanick et.al. | 2409.19508 | link |
2024-09-29 | KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation | Soofiyan Atar et.al. | 2409.19490 | null |
2024-10-01 | Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models | Seongmin Lee et.al. | 2409.19382 | null |
2024-09-27 | Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping | Anthony A. Song et.al. | 2409.19153 | null |
2024-09-27 | LW2G: Learning Whether to Grow for Prompt-based Continual Learning | Qian Feng et.al. | 2409.18860 | link |
2024-09-27 | Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation | Chaomin Shen et.al. | 2409.18785 | null |
2024-09-27 | Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds | Hanbin Bae et.al. | 2409.18705 | null |
2024-09-27 | Analysis of commissioning data from SST-1M : A Prototype of Single-Mirror Small Size Telescope | Thomas Tavernier et.al. | 2409.18639 | null |
2024-09-27 | ChARLES: Change-Aware Recovery of Latent Evolution Semantics in Relational Data | Shiyi He et.al. | 2409.18386 | null |
2024-09-26 | Realistic Evaluation of Model Merging for Compositional Generalization | Derek Tam et.al. | 2409.18314 | link |
2024-09-26 | Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Kartik Garg et.al. | 2409.18049 | link |
2024-09-26 | LGFN: Lightweight Light Field Image Super-Resolution using Local Convolution Modulation and Global Attention Feature Extraction | Zhongxin Yu et.al. | 2409.17759 | null |
2024-09-26 | Efficient Bias Mitigation Without Privileged Information | Mateo Espinosa Zarlenga et.al. | 2409.17691 | null |
2024-09-26 | Event-based Stereo Depth Estimation: A Survey | Suman Ghosh et.al. | 2409.17680 | null |
2024-09-26 | Improving Fast Adversarial Training via Self-Knowledge Guidance | Chengze Jiang et.al. | 2409.17589 | null |
2024-09-26 | Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Integrating SGBM and Segmentation Models | Yida Lin et.al. | 2409.17526 | null |
2024-09-26 | Characteristics of Powerful Radio Galaxies | Chandra B. Singh et.al. | 2409.17514 | null |
2024-09-26 | Active Vision Might Be All You Need: Exploring Active Vision in Bimanual Robotic Manipulation | Ian Chuang et.al. | 2409.17435 | link |
2024-09-25 | NTIRE 2024 Challenge on Stereo Image Super-Resolution: Methods and Results | Longguang Wang et.al. | 2409.16947 | null |
2024-09-25 | The diverse star formation histories of early massive, quenched galaxies in modern galaxy formation simulations | Claudia del P. Lagos et.al. | 2409.16916 | link |
2024-09-25 | Pruning Multilingual Large Language Models for Multilingual Inference | Hwichan Kim et.al. | 2409.16911 | link |
2024-09-25 | An Adaptive Screen-Space Meshing Approach for Normal Integration | Moritz Heep et.al. | 2409.16907 | null |
2024-09-25 | GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning | Zhe-Rui Yang et.al. | 2409.16670 | link |
2024-09-25 | Task-driven SLAM Benchmarking | Yanwei Du et.al. | 2409.16573 | link |
2024-09-24 | Camera Calibration and Stereo via a Single Image of a Spherical Mirror | Nissim Barzilay et.al. | 2409.16386 | null |
2024-09-24 | Transient bubble rising in the presence of a surfactant at very low concentrations | D. Fernández-Martínez et.al. | 2409.16029 | null |
2024-09-24 | AutoCE: An Accurate and Efficient Model Advisor for Learned Cardinality Estimation | Jintao Zhang et.al. | 2409.16027 | null |
2024-09-24 | NER-Luxury: Named entity recognition for the fashion and luxury domain | Akim Mousterou et.al. | 2409.15804 | null |
2024-09-24 | Identified-and-Targeted: The First Early Evidence of the Privacy-Invasive Use of Browser Fingerprinting for Online Tracking | Zengrui Liu et.al. | 2409.15656 | null |
2024-09-23 | Rethinking Emotion Bias in Music via Frechet Audio Distance | Yuanchao Li et.al. | 2409.15545 | link |
2024-09-23 | Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras | Ming Li et.al. | 2409.14766 | null |
2024-09-23 | An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding | Wei-Bin Kou et.al. | 2409.14737 | null |
2024-09-22 | Exploring Multilingual Probing in Large Language Models: A Cross-Language Analysis | Daoyang Li et.al. | 2409.14459 | null |
2024-09-22 | Nonmodal stability analysis of the plane Poiseuille flow in a multilayer porous-fluid channel | Supriya Karmakar et.al. | 2409.14420 | null |
2024-09-22 | MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting | Chen Tessler et.al. | 2409.14393 | null |
2024-09-23 | Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping | Jaehyung Jung et.al. | 2409.12051 | null |
2024-09-18 | SymFace: Additional Facial Symmetry Loss for Deep Face Recognition | Pritesh Prakash et.al. | 2409.11816 | null |
2024-09-17 | A Pileup of Coronal Mass Ejections Produced the Largest Geomagnetic Storm in Two Decades | Ying D. Liu et.al. | 2409.11492 | null |
2024-09-17 | A generalized non-hourglass updated Lagrangian formulation for SPH solid dynamics | Shuaihao Zhang et.al. | 2409.11474 | null |
2024-09-17 | Connecting the Low to High Corona: Propagating Disturbances as Tracers of the Near-Sun Solar Wind | Nathalia Alzate et.al. | 2409.11352 | null |
2024-09-17 | The SST-1M imaging atmospheric Cherenkov telescope for gamma-ray astrophysics | C. Alispach et.al. | 2409.11310 | null |
2024-09-17 | SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models with Customisable Fairness Calibration | Xin Guan et.al. | 2409.11149 | link |
2024-09-17 | Optimal Investment under the Influence of Decision-changing Imitation | Huisheng Wang et.al. | 2409.10933 | null |
2024-09-16 | GPT takes the SAT: Tracing changes in Test Difficulty and Math Performance of Students | Vikram Krishnaveti et.al. | 2409.10750 | null |
2024-09-16 | Exploring 3D Face Reconstruction and Fusion Methods for Face Verification: A Case-Study in Video Surveillance | Simone Maurizio La Cava et.al. | 2409.10481 | null |
2024-09-16 | uniGasFoam: a particle-based OpenFOAM solver for multiscale rarefied gas flows | Nikos Vasileiadis et.al. | 2409.10288 | null |
2024-09-16 | SOLVR: Submap Oriented LiDAR-Visual Re-Localisation | Joshua Knights et.al. | 2409.10247 | null |
2024-09-16 | RF-GML: Reference-Free Generative Machine Listener | Arijit Biswas et.al. | 2409.10210 | null |
2024-09-16 | DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection | Kun Fang et.al. | 2409.10094 | null |
2024-09-16 | Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments | Wessel Ledder et.al. | 2409.10048 | link |
2024-09-15 | Estimating Wage Disparities Using Foundation Models | Keyon Vafa et.al. | 2409.09894 | null |
2024-09-15 | A Benchmark Dataset with Larger Context for Non-Factoid Question Answering over Islamic Text | Faiza Qamar et.al. | 2409.09844 | null |
2024-09-15 | Introducing DAIMYO: a first-time-right dynamic design architecture and its application to tail-sitter UAS development | Jolan Wauters et.al. | 2409.09820 | null |
2024-09-14 | An Augmentation-based Model Re-adaptation Framework for Robust Image Segmentation | Zheming Zuo et.al. | 2409.09530 | null |
2024-09-13 | ClearDepth: Enhanced Stereo Perception of Transparent Objects for Robotic Manipulation | Kaixin Bai et.al. | 2409.08926 | null |
2024-09-12 | The Impact of Large Language Models on Open-source Innovation: Evidence from GitHub Copilot | Doron Yeverechyahu et.al. | 2409.08379 | null |
2024-09-12 | Reducing Population-level Inequality Can Improve Demographic Group Fairness: a Twitter Case Study | Avijit Ghosh et.al. | 2409.08135 | null |
2024-09-12 | FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments | Devansh Dhrafani et.al. | 2409.07715 | null |
2024-09-12 | Modeling Information Narrative Detection and Evolution on Telegram during the Russia-Ukraine War | Patrick Gerard et.al. | 2409.07684 | null |
2024-09-11 | Unsupervised anomaly detection in spatio-temporal stream network sensor data | Edgar Santos-Fernandez et.al. | 2409.07667 | null |
2024-09-11 | Object Depth and Size Estimation using Stereo-vision and Integration with SLAM | Layth Hamad et.al. | 2409.07623 | null |
2024-09-11 | Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs | Sadra Safadoust et.al. | 2409.07456 | null |
2024-09-11 | StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos | Sijie Zhao et.al. | 2409.07447 | null |
2024-09-11 | Towards Fairer Health Recommendations: finding informative unbiased samples via Word Sense Disambiguation | Gavin Butts et.al. | 2409.07424 | null |
2024-09-11 | The microbiome science of composting and human excrement composting: a review | Jeff Meilander et.al. | 2409.07376 | null |
2024-09-11 | Constraining Genetic Symbolic Regression via Semantic Backpropagation | Maximilian Reissmann et.al. | 2409.07369 | link |
2024-09-11 | MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications | Praveen K Kanithi et.al. | 2409.07314 | null |
2024-09-11 | Learning Personalized Scoping for Graph Neural Networks under Heterophily | Gangda Deng et.al. | 2409.06998 | link |
2024-09-11 | Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention | Wenhao Zhao et.al. | 2409.06985 | null |
2024-09-10 | A Quality Diversity Approach to Automatically Generate Multi-Agent Path Finding Benchmark Maps | Cheng Qian et.al. | 2409.06888 | null |
2024-09-10 | Adversarial Attacks to Multi-Modal Models | Zhihao Dou et.al. | 2409.06793 | null |
2024-09-10 | Synchronization of wave-propelled capillary spinners | Jack-William Barotta et.al. | 2409.06652 | link |
2024-09-10 | Quantum-like approaches unveil the intrinsic limits of predictability in compartmental models | José Alejandro Rojas-Venegas et.al. | 2409.06438 | null |
2024-09-09 | LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo | Wei Zhi Tang et.al. | 2409.06104 | link |
2024-09-09 | Online 3D reconstruction and dense tracking in endoscopic videos | Michel Hayoz et.al. | 2409.06037 | link |
2024-09-09 | Dust-UV offsets in high-redshift galaxies in the Cosmic Dawn III simulation | Pierre Ocvirk et.al. | 2409.05946 | null |
2024-09-09 | The Influence of Task and Group Disparities over Users’ Attitudes Toward Using Large Language Models for Psychotherapy | Qihang He et.al. | 2409.05703 | null |
2024-09-09 | LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow | Hongyu Wen et.al. | 2409.05688 | null |
2024-09-09 | Adaptive Offloading and Enhancement for Low-Light Video Analytics on Mobile Devices | Yuanyi He et.al. | 2409.05297 | null |
2024-09-08 | PatchAlign:Fair and Accurate Skin Disease Image Classification by Alignment with Clinical Labels | Aayushman et.al. | 2409.04975 | link |
2024-09-10 | Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios | Zhiqiang Chen et.al. | 2409.04961 | link |
2024-09-08 | A Hetero-functional Graph Resilience Analysis for Convergent Systems-of-Systems | Amro M. Farid et.al. | 2409.04936 | null |
2024-09-06 | A Short Survey on Set-Based Aggregation Techniques for Single-Vector WSI Representation in Digital Pathology | S. Hemati et.al. | 2409.04615 | null |
2024-09-06 | AGR: Age Group fairness Reward for Bias Mitigation in LLMs | Shuirong Cao et.al. | 2409.04340 | null |
2024-09-06 | Calibration of Network Confidence for Unsupervised Domain Adaptation Using Estimated Accuracy | Coby Penso et.al. | 2409.04241 | link |
2024-09-06 | Confidence-Aware Document OCR Error Detection | Arthur Hemmer et.al. | 2409.04117 | null |
2024-09-06 | 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors | Yujun Huang et.al. | 2409.04013 | link |
2024-09-05 | An analysis of spectroscopic, seismological, astrometric, and photometric masses of pulsating white dwarf stars | Leila M. Calcaferro et.al. | 2409.03896 | null |
2024-09-05 | LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors | Hanyang Yu et.al. | 2409.03456 | null |
2024-09-05 | Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities | Wei Lu et.al. | 2409.03444 | link |
2024-09-04 | Fast algorithms to improve fair information access in networks | Dennis Robert Windham et.al. | 2409.03127 | link |
2024-09-04 | Incorporating dense metric depth into neural 3D representations for view synthesis and relighting | Arkadeep Narayan Chaudhury et.al. | 2409.03061 | null |
2024-09-04 | UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views | Jiaxin Guo et.al. | 2409.02917 | link |
2024-09-04 | MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling | Jihye Ahn et.al. | 2409.02846 | null |
2024-09-04 | Deep Learning Meets Satellite Images – An Evaluation on Handcrafted and Learning-based Features for Multi-date Satellite Stereo Images | Shuang Song et.al. | 2409.02825 | null |
2024-09-04 | Experimental Framework for Generating Reliable Ground Truth for Laryngeal Spatial Segmentation Tasks | Hamzeh Ghasemzadeh et.al. | 2409.02809 | null |
2024-09-04 | UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching | Soomin Kim et.al. | 2409.02545 | null |
2024-09-04 | Demographic parity in regression and classification within the unawareness framework | Vincent Divol et.al. | 2409.02471 | null |
2024-09-04 | Unified Framework with Consistency across Modalities for Human Activity Recognition | Tuyen Tran et.al. | 2409.02385 | link |
2024-09-03 | Collaboratively Learning Federated Models from Noisy Decentralized Data | Haoyuan Li et.al. | 2409.02189 | null |
2024-09-03 | Taming Randomness in Agent-Based Models using Common Random Numbers | Daniel J. Klein et.al. | 2409.02086 | link |
2024-09-03 | Observing Context Improves Disparity Estimation when Race is Unobserved | Kweku Kwegyir-Aggrey et.al. | 2409.01984 | null |
2024-08-30 | Semi-supervised permutation invariant particle-level anomaly detection | Gabriel Matos et.al. | 2408.17409 | link |
2024-08-30 | Fairness-Aware Estimation of Graphical Models | Zhuoping Zhou et.al. | 2408.17396 | link |
2024-08-30 | BioBricks.ai: A Versioned Data Registry for Life Sciences Data Assets | Yifan Gao et.al. | 2408.17320 | null |
2024-08-30 | Accelerating the discovery of steady-states of planetary interior dynamics with machine learning | Siddhant Agarwal et.al. | 2408.17298 | null |
2024-08-30 | A Generic and Automated Methodology to Simulate Melting Point | Fu-Zhi Dai et.al. | 2408.17270 | null |
2024-08-30 | Self-supervised learning for crystal property prediction via denoising | Alexander New et.al. | 2408.17255 | null |
2024-08-30 | EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs | Zhen Fan et.al. | 2408.17168 | null |
2024-08-30 | FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition | Chen Hu et.al. | 2408.17090 | link |
2024-08-29 | STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models | Koushik Srivatsan et.al. | 2408.16807 | link |
2024-08-30 | ARINC 429 Cyber-vulnerabilities and Voltage Data in a Hardware-in-the-Loop Simulator | Connor Trask et.al. | 2408.16714 | null |
2024-08-29 | Fibrations of algebras | Danel Ahman et.al. | 2408.16581 | null |
2024-08-29 | Spurfies: Sparse Surface Reconstruction using Local Geometry Priors | Kevin Raj et.al. | 2408.16544 | null |
2024-08-29 | Physical Similarity of Fluid Flow in Bimodal Porous Media: Part 1 – Basic Model and Solution Characteristics | Yuhe Wang et.al. | 2408.16434 | null |
2024-08-28 | Simulation and analysis of a high-k electron scale turbulence diagnostic for MAST-U | David C. Speirs et.al. | 2408.15807 | null |
2024-08-28 | Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions | Huachuan Qiu et.al. | 2408.15787 | link |
2024-08-30 | Addressing the challenges of loop detection in agricultural environments | Nicolás Soncini et.al. | 2408.15761 | link |
2024-08-28 | ES-PTAM: Event-based Stereo Parallel Tracking and Mapping | Suman Ghosh et.al. | 2408.15605 | link |
2024-08-27 | Regional emission dynamics across phases of the EU ETS | Marco Dueñas et.al. | 2408.15438 | null |
2024-08-27 | Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Saining Zhang et.al. | 2408.15242 | link |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-08-27 | Investigating Coverage Criteria in Large Language Models: An In-Depth Study Through Jailbreak Attacks | Shide Zhou et.al. | 2408.15207 | null |
2024-08-27 | Strategic Optimization and Challenges of Large Language Models in Object-Oriented Programming | Zinan Wang et.al. | 2408.14834 | null |
2024-08-26 | Towards Graph Prompt Learning: A Survey and Beyond | Qingqing Long et.al. | 2408.14520 | null |
2024-08-26 | Predictability and Causality in Spanish and English Natural Language Generation | Andrea Busto-Castiñeira et.al. | 2408.14283 | null |
2024-08-26 | Harnessing the Digital Revolution: A Comprehensive Review of mHealth Applications for Remote Monitoring in Transforming Healthcare Delivery | Avnish Singh Jat et.al. | 2408.14190 | null |
2024-08-26 | ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation | Ruohua Shi et.al. | 2408.14114 | null |
2024-08-26 | Bengali Sign Language Recognition through Hand Pose Estimation using Multi-Branch Spatial-Temporal Attention Model | Abu Saleh Musa Miah et.al. | 2408.14111 | null |
2024-08-26 | Fast Edge-Aware Occlusion Detection in the Context of Multispectral Camera Arrays | Frank Sippel et.al. | 2408.14050 | link |
2024-08-26 | More Pictures Say More: Visual Intersection Network for Open Set Object Detection | Bingcheng Dong et.al. | 2408.14032 | null |
2024-08-25 | Splatt3R: Zero-shot Gaussian Splatting from Uncalibarated Image Pairs | Brandon Smart et.al. | 2408.13912 | null |
2024-08-24 | Submodular Maximization Approaches for Equitable Client Selection in Federated Learning | Andrés Catalino Castillo Jiménez et.al. | 2408.13683 | null |
2024-08-24 | Outlier Detection Bias Busted: Understanding Sources of Algorithmic Bias through Data-centric Factors | Xueying Ding et.al. | 2408.13667 | null |
2024-08-23 | HEK-Omics: The promise of omics to optimize HEK293 for recombinant adeno-associated virus (rAAV) gene therapy manufacturing | Sai Guna Ranjan Gurazada et.al. | 2408.13374 | null |
2024-08-23 | Deep Learning at the Intersection: Certified Robustness as a Tool for 3D Vision | Gabriel Pérez S et.al. | 2408.13135 | null |
2024-08-23 | VCEMO: Multi-Modal Emotion Recognition for Chinese Voiceprints | Jinghua Tang et.al. | 2408.13019 | null |
2024-08-23 | Ada2I: Enhancing Modality Balance for Multimodal Conversational Emotion Recognition | Cam-Van Thi Nguyen et.al. | 2408.12895 | null |
2024-08-23 | Refining the isovector component of the Woods-Saxon potential | L. Xayavong et.al. | 2408.12794 | null |
2024-08-22 | Disentangled Structural and Featural Representation for Task-Agnostic Graph Valuation | Ali Falahati et.al. | 2408.12659 | null |
2024-08-22 | The Hybrid Hospital: Balancing On-Site and Remote Hospitalization | Noa Zychlinski et.al. | 2408.12431 | null |
2024-08-22 | Multi-Style Facial Sketch Synthesis through Masked Generative Modeling | Bowen Sun et.al. | 2408.12400 | null |
2024-08-22 | Aligning (Medical) LLMs for (Counterfactual) Fairness | Raphael Poulain et.al. | 2408.12055 | link |
2024-08-21 | Electrostatic Origins of the Dirichlet Principle | Steven Deckelman et.al. | 2408.12002 | null |
2024-08-21 | Time-Dependent Strategy for Improving Aortic Blood Flow Simulations with Boundary Control and Data Assimilation | Muhammad Adnan Anwar et.al. | 2408.11617 | null |
2024-08-21 | A Novel $δ$ -SBM-OPA Approach for Policy-Driven Analysis of Carbon Emission Efficiency under Uncertainty in the Chinese Industrial Sector | Shutian Cui et.al. | 2408.11600 | null |
2024-08-21 | GSTran: Joint Geometric and Semantic Coherence for Point Cloud Segmentation | Abiao Li et.al. | 2408.11558 | link |
2024-08-21 | Mutagenesis screen to map the functionals of parameters of Large Language Models | Yue Hu et.al. | 2408.11494 | link |
2024-08-20 | Quantum Inverse Contextual Vision Transformers (Q-ICVT): A New Frontier in 3D Object Detection for AVs | Sanjay Bhargav Dharavath et.al. | 2408.11207 | link |
2024-08-20 | SDI-Net: Toward Sufficient Dual-View Interaction for Low-light Stereo Image Enhancement | Linlin Hu et.al. | 2408.10934 | null |
2024-08-20 | A Noncontact Technique for Wave Measurement Based on Thermal Stereography and Deep Learning | Deyu Li et.al. | 2408.10670 | null |
2024-08-20 | Multi-view Hand Reconstruction with a Point-Embedded Transformer | Lixin Yang et.al. | 2408.10581 | link |
2024-08-19 | Customizing Language Models with Instance-wise LoRA for Sequential Recommendation | Xiaoyu Kong et.al. | 2408.10159 | link |
2024-08-19 | Envisioning Possibilities and Challenges of AI for Personalized Cancer Care | Elaine Kong et.al. | 2408.10108 | null |
2024-08-19 | ARMADA: Attribute-Based Multimodal Data Augmentation | Xiaomeng Jin et.al. | 2408.10086 | null |
2024-08-19 | Helical edge modes in a triangular Heisenberg antiferromagnet | Bastian Pradenas et.al. | 2408.10062 | null |
2024-08-19 | Bridging the Language Gap: Enhancing Multilingual Prompt-Based Code Generation in LLMs via Zero-Shot Cross-Lingual Transfer | Mingda Li et.al. | 2408.09701 | null |
2024-08-17 | Intuitive Human-Robot Interface: A 3-Dimensional Action Recognition and UAV Collaboration Framework | Akash Chaudhary et.al. | 2408.09232 | null |
2024-08-17 | TableBench: A Comprehensive and Complex Benchmark for Table Question Answering | Xianjie Wu et.al. | 2408.09174 | null |
2024-08-17 | GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation | Weiming Zhang et.al. | 2408.09115 | null |
2024-08-17 | Depth-guided Texture Diffusion for Image Semantic Segmentation | Wei Sun et.al. | 2408.09097 | null |
2024-08-17 | From Urban Clusters to Megaregions: Mapping Australia’s Evolving Urban Regions | M. K. M Ng et.al. | 2408.09054 | null |
2024-08-16 | An Empirical Examination of Balancing Strategy for Counterfactual Estimation on Time Series | Qiang Huang et.al. | 2408.08815 | null |
2024-08-16 | CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving | Shihan Peng et.al. | 2408.08500 | null |
2024-08-16 | Fishers Harvest Parallel Unlearning in Inherited Model Networks | Xiao Liu et.al. | 2408.08493 | null |
2024-08-15 | Comparing NASA Discovery and New Frontiers Class Mission Concepts for the Io Volcano Observer (IVO) | Christopher W. Hamilton et.al. | 2408.08334 | null |
2024-08-15 | Cluster Formations of Free and Congested Flows in Urban Road Networks | Yongsung Kwon et.al. | 2408.08122 | null |
2024-08-15 | Motif analysis and passing behavior in football passing networks | Ming-Xia Li et.al. | 2408.07927 | null |
2024-08-14 | Polarization dynamics: a study of individuals shifting between political communities on social media | Federico Albanese et.al. | 2408.07731 | null |
2024-08-14 | Hierarchical Working Memory and a New Magic Number | Weishun Zhong et.al. | 2408.07637 | null |
2024-08-14 | Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks | Liting Jiang et.al. | 2408.07613 | null |
2024-08-15 | DIffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution | Yuanbo Zhou et.al. | 2408.07516 | null |
2024-08-14 | M2L Translation Operators for Kernel Independent Fast Multipole Methods on Modern Architectures | Srinath Kailasa et.al. | 2408.07436 | null |
2024-08-14 | Unsupervised Stereo Matching Network For VHR Remote Sensing Images Based On Error Prediction | Liting Jiang et.al. | 2408.07419 | link |
2024-08-14 | MorphFader: Enabling Fine-grained Controllable Morphing with Text-to-Audio Models | Purnima Kamath et.al. | 2408.07260 | null |
2024-08-12 | Quantized Redshift and its significance for recent observations | Arindam Mal et.al. | 2408.07101 | null |
2024-08-13 | The News Comment Gap and Algorithmic Agenda Setting in Online Forums | Flora Böwing et.al. | 2408.07052 | link |
2024-08-13 | Quantifying the checkerboard problem to reduce numerical dissipation | Johannes Arend Hopman et.al. | 2408.06821 | null |
2024-08-12 | Observation of vortex stripes in UTe $_2$ | Y. F. Wang et.al. | 2408.06209 | null |
2024-08-12 | IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI | Yash Rampuria et.al. | 2408.06113 | null |
2024-08-12 | Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models | Haifan Gong et.al. | 2408.05985 | null |
2024-08-11 | Predictors and Socio-Demographic Disparities in STEM Degree Outcomes: A ten-year UK study using Hierarchical Logistic Regression | Andrew M. Low et.al. | 2408.05853 | null |
2024-08-10 | EV-MGDispNet: Motion-Guided Event-Based Stereo Disparity Estimation Network with Left-Right Consistency | Junjie Jiang et.al. | 2408.05452 | null |
2024-08-08 | LiDAR-Event Stereo Fusion with Hallucinations | Luca Bartolomei et.al. | 2408.04633 | link |
2024-08-08 | Charmed hypernuclei within density-dependent relativistic mean-field theory | Wei Yang et.al. | 2408.04527 | null |
2024-08-08 | A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery | Mengya Xu et.al. | 2408.04426 | link |
2024-08-07 | A Framework for Assessing Cumulative Exposure to Extreme Temperatures During Transit Trip | Huiying Fan et.al. | 2408.04081 | null |
2024-08-07 | A Comparison of Fireball Luminous Efficiency Models using Acoustic Records | Luke McFadden et.al. | 2408.04078 | null |
2024-08-07 | A Blockchain-based Reliable Federated Meta-learning for Metaverse: A Dual Game Framework | Emna Baccour et.al. | 2408.03694 | null |
2024-08-07 | TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization | Kien T. Pham et.al. | 2408.03637 | null |
2024-08-07 | Unlocking Exocentric Video-Language Data for Egocentric Video Representation Learning | Zi-Yi Dou et.al. | 2408.03567 | null |
2024-08-07 | D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods | Onkar Susladkar et.al. | 2408.03558 | link |
2024-08-07 | Opening the Black Box of 3D Reconstruction Error Analysis with VECTOR | Racquel Fygenson et.al. | 2408.03503 | link |
2024-08-06 | Transit Rider Heat Stress in Atlanta, GA under Current and Future Climate Scenarios | Huiying Fan et.al. | 2408.03457 | null |
2024-08-06 | Fusing Forces: Deep-Human-Guided Refinement of Segmentation Masks | Rafael Sterzinger et.al. | 2408.03304 | link |
2024-08-06 | Measuring interconnectedness of infectious diseases in funded and unfunded research: a temporal network analysis on bibliometric data 1995-2022 | Anbang Du et.al. | 2408.03140 | null |
2024-08-06 | Predictive Performance Test based on the Exhaustive Nested Cross-Validation for High-dimensional data | Iris Ivy Gauran et.al. | 2408.03138 | null |
2024-08-06 | Interoperability and Explicable AI-based Zero-Day Attacks Detection Process in Smart Community | Mohammad Sayduzzaman et.al. | 2408.02921 | null |
2024-08-05 | Phase Transitions in Anisotropic Turbulence | Adrian van Kan et.al. | 2408.02844 | null |
2024-08-05 | Gaussian Mixture based Evidential Learning for Stereo Matching | Weide Liu et.al. | 2408.02796 | null |
2024-08-04 | Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image | Xinlin Ren et.al. | 2408.02079 | link |
2024-08-04 | PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone | Xin Yang et.al. | 2408.02053 | null |
2024-08-03 | Are EU low-carbon structural funds efficient in reducing emissions? | Marco Dueñas et.al. | 2408.01782 | null |
2024-08-03 | MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas | Feng Qiao et.al. | 2408.01653 | null |
2024-08-06 | Three-dimensional Morphological Reconstruction of Millimeter-Scale Soft Continuum Robots based on Dual-Stereo-Vision | Tian-Ao Ren et.al. | 2408.01615 | null |
2024-08-02 | Decentralized Smoothing ADMM for Quantile Regression with Non-Convex Sparse Penalties | Reza Mirzaeifard et.al. | 2408.01307 | null |
2024-08-02 | The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models | Hannah Chen et.al. | 2408.01285 | null |
2024-08-01 | High-Impact Innovations and Hidden Gender Disparities in Inventor-Evaluator Networks | Tara Sowrirajan et.al. | 2408.00905 | null |
2024-08-01 | Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection | Ruiyang Zhang et.al. | 2408.00619 | link |
2024-07-31 | Machine Learning Boosted Entropy-Engineered Synthesis of stable Nanometric Solid Solution CuCo Alloys for Efficient Nitrate Reduction to Ammonia | Yao Hu et.al. | 2408.00142 | null |
2024-07-31 | A comparative study of radio signatures from winds and jets: Modelling synchrotron emission and polarization | Moun Meenakshi et.al. | 2408.00099 | null |
2024-07-31 | Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs | Shi Liu et.al. | 2407.21771 | null |
2024-07-31 | Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching | Pengjie Zhang et.al. | 2407.21735 | null |
2024-07-31 | Deep Learning-Based Longitudinal Prediction of Childhood Myopia Progression Using Fundus Image Sequences and Baseline Refraction Data | Mengtian Kang et.al. | 2407.21467 | null |
2024-07-31 | Modeling Urban Transport Choices: Incorporating Sociocultural Aspects | Kathleen Salazar-Serna et.al. | 2407.21307 | link |
2024-07-30 | Algorithm-Assisted Decision Making and Racial Disparities in Housing: A Study of the Allegheny Housing Assessment Tool | Lingwei Cheng et.al. | 2407.21209 | null |
2024-07-30 | Different behaviour of the gas-phase and stellar metallicity in the central part of MaNGA galaxies | I. A. Zinchenko et.al. | 2407.21160 | null |
2024-07-30 | Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings | Tianyi Zhang et.al. | 2407.20870 | null |
2024-07-30 | Planar network statistics for two-dimensional rupturing foams | Joseph Klobusicky et.al. | 2407.20858 | null |
2024-07-30 | Evaluating Fairness in Black-box Algorithmic Markets: A Case Study of Ride Sharing in Chicago | Yuhan Liu et.al. | 2407.20522 | null |
2024-07-29 | BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation | Kieran Saunders et.al. | 2407.20437 | null |
2024-07-29 | Solving QUBOs with a quantum-amenable branch and bound method | Thomas Häner et.al. | 2407.20185 | null |
2024-07-29 | Classification of Alzheimer’s Dementia vs. Healthy subjects by studying structural disparities in fMRI Time-Series of DMN | Sneha Noble et.al. | 2407.19990 | null |
2024-07-29 | Can I trust my anomaly detection system? A case study based on explainable AI | Muhammad Rashid et.al. | 2407.19951 | link |
2024-07-29 | Generalization bounds for regression and classification on adaptive covering input domains | Wen-Liang Hwang et.al. | 2407.19715 | null |
2024-07-29 | SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages | Wenxuan Zhang et.al. | 2407.19672 | link |
2024-07-29 | AI-Driven Healthcare: A Survey on Ensuring Fairness and Mitigating Bias | Sribala Vidyadhari Chinta et.al. | 2407.19655 | null |
2024-07-28 | On the Evaluation Consistency of Attribution-based Explanations | Jiarui Duan et.al. | 2407.19471 | null |
2024-07-27 | MSP-MVS: Multi-granularity Segmentation Prior Guided Multi-View Stereo | Zhenlong Yuan et.al. | 2407.19323 | null |
2024-07-27 | On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs | Nitay Calderon et.al. | 2407.19200 | null |
2024-07-27 | Assessing Spatial Disparities: A Bayesian Linear Regression Approach | Kyle Lin Wu et.al. | 2407.19171 | null |
2024-07-26 | PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis | Sohyeong Kim et.al. | 2407.18695 | null |
2024-07-26 | Direct observation of quantum vortex fractionalization in multiband superconductors | Yu Zheng et.al. | 2407.18610 | null |
2024-07-26 | Content-driven Magnitude-Derivative Spectrum Complementary Learning for Hyperspectral Image Classification | Huiyan Bai et.al. | 2407.18593 | null |
2024-07-25 | Unsupervised Training of Neural Cellular Automata on Edge Devices | John Kalkhof et.al. | 2407.18114 | link |
2024-07-25 | TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework | Guanfeng Tang et.al. | 2407.18038 | null |
2024-07-25 | Towards the Spectral bias Alleviation by Normalizations in Coordinate Networks | Zhicheng Cai et.al. | 2407.17834 | link |
2024-07-25 | Multi-modal Data Binding for Survival Analysis Modeling with Incomplete Data and Annotations | Linhao Qu et.al. | 2407.17726 | null |
2024-07-24 | Unveiling the structural content of NGC 6357 via kinematics and NIR variability | C. Ordenes-Huanca et.al. | 2407.17577 | null |
2024-07-24 | Gender disparities in the dissemination and acquisition of scientific knowledge | Chiara Zappalà et.al. | 2407.17441 | null |
2024-07-25 | Domain Generalized Recaptured Screen Image Identification Using SWIN Transformer | Preeti Mehta et.al. | 2407.17170 | null |
2024-07-23 | Balanced Multi-Relational Graph Clustering | Zhixiang Shen et.al. | 2407.16863 | link |
2024-07-24 | FCNR: Fast Compressive Neural Representation of Visualization Images | Yunfei Lu et.al. | 2407.16369 | link |
2024-07-23 | MHD activity induced coherent mode excitation in the edge plasma region of ADITYA-U Tokamak | Kaushlender Singh et.al. | 2407.16301 | null |
2024-07-23 | Representation Magnitude has a Liability to Privacy Vulnerability | Xingli Fang et.al. | 2407.16164 | link |
2024-07-22 | Inequalities in Computational Thinking Among Incoming Students in an STEM Chilean University | Felipe González-Pizarro et.al. | 2407.15833 | null |
2024-07-22 | Breaking the Global North Stereotype: A Global South-centric Benchmark Dataset for Auditing and Mitigating Biases in Facial Recognition Systems | Siddharth D Jaiswal et.al. | 2407.15810 | null |
2024-07-22 | Examining Inequality in Park Quality for Promoting Health Across 35 Global Cities | Linus W. Dietz et.al. | 2407.15770 | link |
2024-07-23 | Bidirectional skip-frame prediction for video anomaly detection with intra-domain disparity-driven attention | Jiahao Lyu et.al. | 2407.15424 | null |
2024-07-22 | Iterative approach to reconstructing neural disparity fields from light-field data | Ligen Shi et.al. | 2407.15380 | null |
2024-07-22 | Dissecting Multiplication in Transformers: Insights into LLMs | Luyu Qiu et.al. | 2407.15360 | link |
2024-07-22 | Efficient Multi-disparity Transformer for Light Field Image Super-resolution | Zeke Zexi Hu et.al. | 2407.15329 | null |
2024-07-19 | PolySinger: Singing-Voice to Singing-Voice Translation from English to Japanese | Silas Antonisen et.al. | 2407.14399 | null |
2024-07-19 | tidychangepoint: a unified framework for analyzing changepoint detection in univariate time series | Benjamin S. Baumer et.al. | 2407.14369 | null |
2024-07-19 | Stable Audio Open | Zach Evans et.al. | 2407.14358 | link |
2024-07-19 | SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization | Mae Younes et.al. | 2407.14257 | link |
2024-07-19 | Double-Shot 3D Shape Measurement with a Dual-Branch Network | Mingyang Lei et.al. | 2407.14198 | null |
2024-07-19 | Scale Disparity of Instances in Interactive Point Cloud Segmentation | Chenrui Han et.al. | 2407.14009 | null |
2024-07-19 | Reexamining Racial Disparities in Automatic Speech Recognition Performance: The Role of Confounding by Provenance | Changye Li et.al. | 2407.13982 | link |
2024-07-19 | The Group Robustness is in the Details: Revisiting Finetuning under Spurious Correlations | Tyler LaBonte et.al. | 2407.13957 | link |
2024-07-18 | Research on Tibetan Tourism Viewpoints information generation system based on LLM | Jinhu Qi et.al. | 2407.13561 | null |
2024-07-18 | CookAR: Affordance Augmentations in Wearable AR to Support Kitchen Tool Interactions for People with Low Vision | Jaewook Lee et.al. | 2407.13515 | link |
2024-07-18 | MIR laser CEP estimation using machine learning concepts in bulk high harmonic generation | Balázs Nagyillés et.al. | 2407.13512 | null |
2024-07-18 | From Words to Worlds: Compositionality for Cognitive Architectures | Ruchira Dhar et.al. | 2407.13419 | null |
2024-07-18 | Hybridization of terahertz phonons and magnons in disparate and spatially-separated material specimens | Marcin Białek et.al. | 2407.13305 | null |
2024-07-18 | FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection | Jianwei Zhao et.al. | 2407.13133 | null |
2024-07-17 | Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning | Minjae Cho et.al. | 2407.13006 | null |
2024-07-17 | Multi-Band Wi-Fi Neural Dynamic Fusion | Sorachi Kato et.al. | 2407.12937 | null |
2024-07-17 | Propagation of Interplanetary Shocks in the Heliosphere | Munkhjargal Lkhagvadorj et.al. | 2407.12689 | null |
2024-07-16 | Temporally Consistent Stereo Matching | Jiaxi Zeng et.al. | 2407.11950 | link |
2024-07-16 | Fairly Accurate: Optimizing Accuracy Parity in Fair Target-Group Detection | Soumyajit Gupta et.al. | 2407.11933 | null |
2024-07-16 | MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification | Zhuoxiao Li et.al. | 2407.11840 | null |
2024-07-16 | Robust Utility-Preserving Text Anonymization Based on Large Language Models | Tianyu Yang et.al. | 2407.11770 | link |
2024-07-16 | Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems | Jianzhu Huai et.al. | 2407.11705 | null |
2024-07-16 | Rethinking Fair Graph Neural Networks from Re-balancing | Zhixun Li et.al. | 2407.11624 | link |
2024-07-17 | QVD: Post-training Quantization for Video Diffusion Models | Shilong Tian et.al. | 2407.11585 | null |
2024-07-16 | Representation Bias in Political Sample Simulations with Large Language Models | Weihong Qi et.al. | 2407.11409 | null |
2024-07-16 | The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation | Muyang Qiu et.al. | 2407.11356 | link |
2024-07-15 | Benchmarking Vision Language Models for Cultural Understanding | Shravan Nayak et.al. | 2407.10920 | null |
2024-07-15 | Temporal Event Stereo via Joint Learning with Stereoscopic Flow | Hoonhee Cho et.al. | 2407.10831 | link |
2024-07-15 | Growth of Science: How long will the United States uphold its position? | Dipak Patra et.al. | 2407.10771 | null |
2024-07-15 | Socioeconomic factors of national representation in the global film festival circuit: skewed toward the large and wealthy, but small countries can beat the odds | Andres Karjus et.al. | 2407.10755 | null |
2024-07-15 | Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model | Zhening Liu et.al. | 2407.10632 | link |
2024-07-15 | Muon-induced collisional flavor instability in core-collapse supernova | Jiabao Liu et.al. | 2407.10604 | null |
2024-07-15 | A Unifying Approach to Product Constructions for Quantitative Temporal Inference | Kazuki Watanabe et.al. | 2407.10465 | null |
2024-07-14 | Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data | Tuo Feng et.al. | 2407.10200 | link |
2024-07-14 | Adaptive Model Predictive Control with Data-driven Error Model for Quadrupedal Locomotion | Xuanqi Zeng et.al. | 2407.10124 | null |
2024-07-13 | Characterizing Disparity Between Edge Models and High-Accuracy Base Models for Vision Tasks | Zhenyu Wang et.al. | 2407.10016 | null |
2024-07-12 | Self-organized multiscale structures in thermally relativistic electron-positron-ion plasmas | Usman Shazad et.al. | 2407.09440 | null |
2024-07-12 | Multi-Modal Dataset Creation for Federated~Learning with DICOM Structured Reports | Malte Tölle et.al. | 2407.09064 | null |
2024-07-12 | Tissue-Contrastive Semi-Masked Autoencoders for Segmentation Pretraining on Chest CT | Jie Zheng et.al. | 2407.08961 | null |
2024-07-11 | MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization | Orevaoghene Ahia et.al. | 2407.08818 | null |
2024-07-11 | Adaptive Smooth Non-Stationary Bandits | Joe Suk et.al. | 2407.08654 | link |
2024-07-11 | Multi-Group Proportional Representation | Alex Oesterling et.al. | 2407.08571 | link |
2024-07-11 | Vox Populi, Vox AI? Using Language Models to Estimate German Public Opinion | Leah von der Heyde et.al. | 2407.08563 | link |
2024-07-11 | Unveiling Disparities in Maternity Care: A Topic Modelling Approach to Analysing Maternity Incident Investigation Reports | Georgina Cosma et.al. | 2407.08328 | null |
2024-07-11 | DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing | Minghang Zhou et.al. | 2407.08132 | link |
2024-07-10 | Stretch your reach: Studying Self-Avatar and Controller Misalignment in Virtual Reality Interaction | Jose Luis Ponton et.al. | 2407.08011 | null |
2024-07-10 | A Survey on Deep Stereo Matching in the Twenties | Fabio Tosi et.al. | 2407.07816 | link |
2024-07-10 | Explicit inverse of symmetric, tridiagonal near Toeplitz matrices Part II: with weakly diagonally dominant Toeplitz | Bakytzhan Kurmanbek et.al. | 2407.07654 | null |
2024-07-10 | TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data | Siyi Du et.al. | 2407.07582 | link |
2024-07-10 | Causal Discovery-Driven Change Point Detection in Time Series | Shanyun Gao et.al. | 2407.07290 | null |
2024-07-09 | A Detailed Analysis of a Magnetic Island Observed by WISPR on Parker Solar Probe | Madison L. Ascione et.al. | 2407.07216 | null |
2024-07-09 | Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images | Chuanrui Zhang et.al. | 2407.06984 | null |
2024-07-09 | iASiS: Towards Heterogeneous Big Data Analysis for Personalized Medicine | Anastasia Krithara et.al. | 2407.06748 | null |
2024-07-09 | Computer vision tasks for intelligent aerospace missions: An overview | Huilin Chen et.al. | 2407.06513 | null |
2024-07-09 | LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration | Jiayi Liu et.al. | 2407.06512 | link |
2024-07-08 | Systematic time-coarse graining for driven quantum systems | Leon Bello et.al. | 2407.06068 | link |
2024-07-08 | CA-FedRC: Codebook Adaptation via Federated Reservoir Computing in 5G NR | Ziqiang Ye et.al. | 2407.05928 | null |
2024-07-08 | GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation | Chenxin Li et.al. | 2407.05540 | null |
2024-07-07 | GitHub Marketplace for Automation and Innovation in Software Production | SK Golam Saroar et.al. | 2407.05519 | null |
2024-07-07 | Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models | Nikhil Sharma et.al. | 2407.05502 | null |
2024-07-07 | CLIMB: A Benchmark of Clinical Bias in Large Language Models | Yubo Zhang et.al. | 2407.05250 | link |
2024-07-06 | SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention | Yunzhong Si et.al. | 2407.05128 | link |
2024-07-06 | Crowdsourced reviews reveal substantial disparities in public perceptions of parking | Lingyao Li et.al. | 2407.05104 | link |
2024-07-06 | SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions | Zaid A. El-Shair et.al. | 2407.04908 | null |
2024-07-05 | Balancing Operator’s Risk Averseness in Model Predictive Control of a Reservoir System | Ja-Ho Koo et.al. | 2407.04506 | null |
2024-07-04 | The SOHO LASCO CME Catalog – Version 2 | Nat Gopalswamy et.al. | 2407.04165 | null |
2024-07-04 | Behavioural gap assessment of human-vehicle interaction in real and virtual reality-based scenarios in autonomous driving | Sergio. Martín Serrano et.al. | 2407.04070 | null |
2024-07-04 | Adversarial Robustness of VAEs across Intersectional Subgroups | Chethan Krishnamurthy Ramanaik et.al. | 2407.03864 | link |
2024-07-04 | M $\mathbf5$ – A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks | Florian Schneider et.al. | 2407.03791 | null |
2024-07-04 | High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching | Gael Le Lan et.al. | 2407.03648 | null |
2024-07-04 | ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution | Yuanbo Zhou et.al. | 2407.03598 | link |
2024-07-03 | Probing Perfection: The Relentless Art of Meddling for Pulmonary Airway Segmentation from HRCT via a Human-AI Collaboration Based Active Learning Method | Shiyi Wang et.al. | 2407.03542 | null |
2024-07-03 | How Does Quantization Affect Multilingual LLMs? | Kelly Marchisio et.al. | 2407.03211 | null |
2024-07-03 | Stereo Risk: A Continuous Modeling Approach to Stereo Matching | Ce Liu et.al. | 2407.03152 | null |
2024-07-03 | Effective Heterogeneous Federated Learning via Efficient Hypernetwork-based Weight Generation | Yujin Shin et.al. | 2407.03086 | link |
2024-07-03 | Early-Stage Anomaly Detection: A Study of Model Performance on Complete vs. Partial Flows | Adrian Pekar et.al. | 2407.02856 | link |
2024-07-03 | A Pairwise DomMix Attentive Adversarial Network for Unsupervised Domain Adaptive Object Detection | Jie Shao et.al. | 2407.02835 | null |
2024-07-02 | Practical Guide for Causal Pathways and Sub-group Disparity Analysis | Farnaz Kohankhaki et.al. | 2407.02702 | null |
2024-07-02 | Domain Generalizable Knowledge Tracing via Concept Aggregation and Relation-Based Attention | Yuquan Xie et.al. | 2407.02547 | null |
2024-07-02 | QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices | Juntao Zhao et.al. | 2407.02327 | link |
2024-07-02 | Crossroads of Continents: Automated Artifact Extraction for Cultural Adaptation with Large Multimodal Models | Anjishnu Mukherjee et.al. | 2407.02067 | link |
2024-07-02 | Privacy Risks of General-Purpose AI Systems: A Foundation for Investigating Practitioner Perspectives | Stephen Meisenbacher et.al. | 2407.02027 | null |
2024-07-02 | Investigating the Effects of Large-Scale Pseudo-Stereo Data and Different Speech Foundation Model on Dialogue Generative Spoken Language Model | Yu-Kuan Fu et.al. | 2407.01911 | null |
2024-07-01 | Race and Privacy in Broadcast Police Communications | Pranav Narayanan Venkit et.al. | 2407.01817 | null |
2024-07-01 | Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation | Lianjie Guo et.al. | 2407.01292 | link |
2024-07-01 | OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos | Yassine Benzakour et.al. | 2407.01265 | null |
2024-07-01 | FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models | Ruinan Jin et.al. | 2407.00983 | link |
2024-06-30 | Learning System Dynamics without Forgetting | Xikun Zhang et.al. | 2407.00717 | link |
2024-06-30 | Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP | Ayush Ranjan et.al. | 2407.00592 | null |
2024-06-28 | LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation | Xianda Guo et.al. | 2406.19833 | link |
2024-06-28 | Galaxy Group Ellipticity Confirms a Younger Cosmos | Yu Rong et.al. | 2406.19612 | null |
2024-06-28 | What’s the Weight? Estimating Controlled Outcome Differences in Complex Surveys for Health Disparities Research | Stephen Salerno et.al. | 2406.19597 | link |
2024-06-27 | Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects | Orevaoghene Ahia et.al. | 2406.19564 | link |
2024-06-27 | Stereo Vision Based Robot for Remote Monitoring with VR Support | Mohamed Fazil M. S. et.al. | 2406.19498 | null |
2024-06-27 | STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning | Yanan Zhang et.al. | 2406.19362 | null |
2024-06-27 | Revealing Fine-Grained Values and Opinions in Large Language Models | Dustin Wright et.al. | 2406.19238 | link |
2024-06-27 | RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulaiton | Fanfan Liu et.al. | 2406.18977 | link |
2024-06-27 | From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased Decisions | Trenton Chang et.al. | 2406.18865 | link |
2024-06-27 | Retain, Blend, and Exchange: A Quality-aware Spatial-Stereo Fusion Approach for Event Stream Recognition | Lan Chen et.al. | 2406.18845 | link |
2024-06-26 | DoubleTake: Geometry Guided Depth Estimation | Mohamed Sayed et.al. | 2406.18387 | null |
2024-06-26 | An interactive framework for the evaluation and detection of stereoacuity threshold under ambient lighting | Kritika Lohia et.al. | 2406.18336 | null |
2024-06-26 | Molecular Diffusion Models with Virtual Receptors | Matan Halfon et.al. | 2406.18330 | null |
2024-06-28 | SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance | Caishuang Huang et.al. | 2406.18118 | link |
2024-06-25 | Evaluating Fairness in Large Vision-Language Models Across Diverse Demographic Attributes and Prompts | Xuyang Wu et.al. | 2406.17974 | link |
2024-06-25 | Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals | Kentaro Seki et.al. | 2406.17722 | link |
2024-06-25 | Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation | Xuming Zhang et.al. | 2406.17679 | null |
2024-06-25 | RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | Beck LaBash et.al. | 2406.16801 | link |
2024-06-24 | Addressing Polarization and Unfairness in Performative Prediction | Kun Jin et.al. | 2406.16756 | null |
2024-06-24 | Lone Pair Induced 1D Character and Weak Cation-anion Interactions: Two Ingredients for Low Thermal Conductivity in Mixed-anion Metal Chalcohalides | Xingchen Shen et.al. | 2406.16744 | null |
2024-06-24 | Effective Elastic Properties of Multilayer Graphene | Yun Hwangbo et.al. | 2406.16344 | null |
2024-06-23 | Thinking beyond Bias: Analyzing Multifaceted Impacts and Implications of AI on Gendered Labour | Satyam Mohla et.al. | 2406.16207 | null |
2024-06-23 | The Persistence of Contrarianism on Twitter: Mapping users’ sharing habits for the Ukraine war, COVID-19 vaccination, and the 2020 Midterm Elections | David Axelrod et.al. | 2406.16175 | null |
2024-06-23 | Comparison of methods for mediation analysis with multiple correlated mediators | Mary Appah et.al. | 2406.16174 | null |
2024-06-23 | Quantitative Global Carbon Inequality Network | Yanming Guo et.al. | 2406.16092 | null |
2024-06-23 | Learning Accurate and Enriched Features for Stereo Image Super-Resolution | Hu Gao et.al. | 2406.16001 | link |
2024-06-23 | Generalized Measures of Population Synchrony | Francis C. Motta et.al. | 2406.15987 | null |
2024-06-21 | Bug In the Code Stack: Can LLMs Find Bugs in Large Python Code Stacks | Hokyung Lee et.al. | 2406.15325 | link |
2024-06-21 | Time-Domain Signatures of Distinct Correlated Insulators in a Moiré Superlattice | Eric A. Arsenault et.al. | 2406.15067 | null |
2024-06-21 | 3D-Localization of Single Point-Like Gamma Sources with a Coded Aperture Camera | Tobias Meißner et.al. | 2406.15048 | null |
2024-06-21 | Trustworthy Enhanced Multi-view Multi-modal Alzheimer’s Disease Prediction with Brain-wide Imaging Transcriptomics Data | Shan Cong et.al. | 2406.14977 | link |
2024-06-21 | Direct Multi-Turn Preference Optimization for Language Agents | Wentao Shi et.al. | 2406.14868 | link |
2024-06-21 | Older and Wiser: The Marriage of Device Aging and Intellectual Property Protection of Deep Neural Networks | Ning Lin et.al. | 2406.14863 | null |
2024-06-21 | Non-Markovian Collective Emission of Giant emitters in the Zeno Regime | Qing-Yang Qiu et.al. | 2406.14811 | null |
2024-06-20 | 1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators? | Yue Huang et.al. | 2406.14721 | null |
2024-06-20 | Population Activity Recovery: Milestones Unfolding, Temporal Interdependencies, and Relationship with Physical and Social Vulnerability | Flavia Ioana Patrascu et.al. | 2406.14720 | null |
2024-06-20 | Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data | Johannes Treutlein et.al. | 2406.14546 | link |
2024-06-20 | Towards Truthful Multilingual Large Language Models: Benchmarking and Alignment Strategies | Weihao Liu et.al. | 2406.14434 | link |
2024-06-20 | Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services | David Hartmann et.al. | 2406.14154 | null |
2024-06-20 | Novae: An Important Source of Lithium in the Galaxy | Jun Gao et.al. | 2406.13986 | null |
2024-06-19 | Open Generative Large Language Models for Galician | Pablo Gamallo et.al. | 2406.13893 | null |
2024-06-19 | Leveraging Large Language Models to Measure Gender Bias in Gendered Languages | Erik Derner et.al. | 2406.13677 | null |
2024-06-19 | Transferable Tactile Transformers for Representation Learning Across Diverse Sensors and Tasks | Jialiang Zhao et.al. | 2406.13640 | null |
2024-06-19 | Formation of a Magnetic Cloud from the Merging of Two Successive Coronal Mass Ejections | Chong Chen et.al. | 2406.13603 | null |
2024-06-19 | MVSBoost: An Efficient Point Cloud-based 3D Reconstruction | Umair Haroon et.al. | 2406.13515 | null |
2024-06-19 | Toward Structure Fairness in Dynamic Graph Embedding: A Trend-aware Dual Debiasing Approach | Yicong Li et.al. | 2406.13201 | link |
2024-06-18 | Stealth edits for provably fixing or attacking large language models | Oliver J. Sutton et.al. | 2406.12670 | link |
2024-06-18 | An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation | Qin Li et.al. | 2406.12646 | null |
2024-06-18 | Restorer: Solving Multiple Image Restoration Tasks with One Set of Parameters | Jiawei Mao et.al. | 2406.12587 | link |
2024-06-18 | Rastall gravity: accretion disk image in radiation fields context and visual transformations compared to Reissner-Nordstrom black holes | Yu-Xiang Huang et.al. | 2406.12466 | null |
2024-06-18 | Status of Astronomy Education in India: A Baseline Survey | Moupiya Maji et.al. | 2406.12308 | null |
2024-06-17 | Slicing Through Bias: Explaining Performance Gaps in Medical Image Analysis using Slice Discovery Methods | Vincent Olesen et.al. | 2406.12142 | link |
2024-06-17 | The Benefits and Risks of Transductive Approaches for AI Fairness | Muhammed Razzak et.al. | 2406.12011 | null |
2024-06-17 | Decomposed evaluations of geographic disparities in text-to-image models | Abhishek Sureddy et.al. | 2406.11988 | null |
2024-06-17 | Be careful in multi-messenger inference of the Hubble constant: A path forward for robust inference | Michael Müller et.al. | 2406.11965 | null |
2024-06-17 | Personalized Federated Knowledge Graph Embedding with Client-Wise Relation Graph | Xiaoxiong Zhang et.al. | 2406.11943 | null |
2024-06-17 | P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models | Shuo Yang et.al. | 2406.11391 | null |
2024-06-17 | Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network | Frank Sippel et.al. | 2406.11284 | link |
2024-06-16 | Physics-Informed Deep Learning and Partial Transfer Learning for Bearing Fault Diagnosis in the Presence of Highly Missing Data | Mohammadreza Kavianpour et.al. | 2406.11023 | null |
2024-06-16 | Rectified Iterative Disparity for Stereo Matching | Weiqing Xiao et.al. | 2406.10943 | null |
2024-06-16 | Quantifying Generative Media Bias with a Corpus of Real-world and Generated News Articles | Filip Trhlik et.al. | 2406.10773 | null |
2024-06-15 | Trapping of isotropic droplets by disclinations in nematic liquid crystals controlled by surface anchoring and elastic constant disparity | Nilanthi P. Haputhanthrige et.al. | 2406.10684 | null |
2024-06-15 | Functional Clustering for Longitudinal Associations between County-Level Social Determinants of Health and Stroke Mortality in the US | Fangzhi Luo et.al. | 2406.10499 | null |
2024-06-15 | A Label is Worth a Thousand Images in Dataset Distillation | Tian Qin et.al. | 2406.10485 | link |
2024-06-14 | Consistency-diversity-realism Pareto fronts of conditional image generative models | Pietro Astolfi et.al. | 2406.10429 | null |
2024-06-14 | Gender Representation in TV and Radio: Automatic Information Extraction methods versus Manual Analyses | David Doukhan et.al. | 2406.10316 | null |
2024-06-14 | Carbon Monoxide Cooling in Radiative Transfer Modeling of Supernovae | Collin McLeod et.al. | 2406.10132 | null |
2024-06-14 | DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications | Li Li et.al. | 2406.10068 | link |
2024-06-14 | Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness | Maximilian Spliethöver et.al. | 2406.09977 | null |
2024-06-14 | OpenCapBench: A Benchmark to Bridge Pose Estimation and Biomechanics | Yoni Gozlan et.al. | 2406.09788 | null |
2024-06-14 | Cross-view geo-localization: a survey | Abhilash Durgam et.al. | 2406.09722 | null |
2024-06-14 | MoME: Mixture of Multimodal Experts for Cancer Survival Prediction | Conghao Xiong et.al. | 2406.09696 | link |
2024-06-13 | Strain rate controls alignment in growing bacterial monolayers | Blake Langeslay et.al. | 2406.09615 | null |
2024-06-13 | AOC: Analysis of Orthologous Collections – an application for the characterization of natural selection in protein-coding sequences | Alexander Lucaci et.al. | 2406.09522 | link |
2024-06-13 | You are what you eat? Feeding foundation models a regionally diverse food dataset of World Wide Dishes | Jabez Magomere et.al. | 2406.09496 | link |
2024-06-13 | Scale-Invariant Monocular Depth Estimation via SSI Depth | S. Mahdi H. Miangoleh et.al. | 2406.09374 | link |
2024-06-13 | Less Cybersickness, Please: Demystifying and Detecting Stereoscopic Visual Inconsistencies in VR Apps | Shuqing Li et.al. | 2406.09313 | null |
2024-06-13 | Python-based DSL for generating Verilog model of Synchronous Digital Circuits | Mandar Datar et.al. | 2406.09208 | link |
2024-06-13 | Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns | Kaavya Rekanar et.al. | 2406.09203 | null |
2024-06-13 | Fine-Grained Domain Generalization with Feature Structuralization | Wenlong Yu et.al. | 2406.09166 | link |
2024-06-13 | Mean Field Study of Superconductivity in the Square Lattice $t$-$J$ Model with Three-Site Hopping | Ke Yang et.al. | 2406.08780 | null |
2024-06-12 | On Strongly-equitable Social Welfare Orders Without the Axiom of Choice | Luke Serafin et.al. | 2406.08684 | null |
2024-06-12 | Conditional Similarity Triplets Enable Covariate-Informed Representations of Single-Cell Data | Chi-Jane Chen et.al. | 2406.08638 | link |
2024-06-12 | Unraveling Code-Mixing Patterns in Migration Discourse: Automated Detection and Analysis of Online Conversations on Reddit | Fedor Vitiugin et.al. | 2406.08633 | link |
2024-06-13 | Real2Code: Reconstruct Articulated Objects via Code Generation | Zhao Mandi et.al. | 2406.08474 | null |
2024-06-12 | Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models | Javier Nistal et.al. | 2406.08384 | null |
2024-06-12 | Chemistry3D: Robotic Interaction Benchmark for Chemistry Experiments | Shoujie Li et.al. | 2406.08160 | link |
2024-06-12 | Generalizable Disaster Damage Assessment via Change Detection with Vision Foundation Model | Kyeongjin Ahn et.al. | 2406.08020 | null |
2024-06-12 | Automatic detection of large-scale flux ropes and their geoeffectiveness with a machine learning approach | Sanchita Pal et.al. | 2406.07798 | null |
2024-06-11 | PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow | Joshua Tokarsky et.al. | 2406.07667 | null |
2024-06-11 | Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling | Denis Blessing et.al. | 2406.07423 | link |
2024-06-11 | NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images | Yufei Han et.al. | 2406.07111 | null |
2024-06-11 | The evolution of coronal shock wave properties and their relation with solar energetic particles | Manon Jarry et.al. | 2406.07058 | null |
2024-06-11 | Bridging Language Gaps in Audio-Text Retrieval | Zhiyong Yan et.al. | 2406.07012 | link |
2024-06-11 | HPC Alongside User-space Kubernetes | Vanessa Sochat et.al. | 2406.06995 | null |
2024-06-11 | Stepwise Regression and Pre-trained Edge for Robust Stereo Matching | Weiqing Xiao et.al. | 2406.06953 | link |
2024-06-10 | Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies | Alex DeWeese et.al. | 2406.06823 | null |
2024-06-10 | The Legal Duty to Search for Less Discriminatory Algorithms | Emily Black et.al. | 2406.06817 | null |
2024-06-10 | Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests | T. Tony Cai et.al. | 2406.06749 | null |
2024-06-10 | The largest metallicity difference in twin systems: high-precision abundance analysis of the benchmark pair Krios & Kronos | P. Miquelarena et.al. | 2406.06705 | null |
2024-06-10 | Annotation alignment: Comparing LLM and human annotations of conversational safety | Rajiv Movva et.al. | 2406.06369 | null |
2024-06-10 | Shoulders of Giants: A Look at the Degree and Utility of Openness in NLP Research | Surangika Ranathunga et.al. | 2406.06021 | null |
2024-06-10 | Computational and Statistical Guarantees for Tensor-on-Tensor Regression with Tensor Train Decomposition | Zhen Qin et.al. | 2406.06002 | null |
2024-06-10 | Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context | Jingru Jia et.al. | 2406.05972 | null |
2024-06-09 | Predictors of the Sense of Presence in an Immersive Audio Storytelling Experience, a Mixed Methods Study. PREPRINT | Isabelle Verhulst et.al. | 2406.05856 | null |
2024-06-09 | SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion | Bingsong Bai et.al. | 2406.05692 | null |
2024-06-09 | MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations | Hemant Yadav et.al. | 2406.05661 | null |
2024-06-09 | Do LLMs Exhibit Human-Like Reasoning? Evaluating Theory of Mind in LLMs for Open-Ended Responses | Maryam Amirizaniani et.al. | 2406.05659 | null |
2024-06-08 | I-SIRch: AI-Powered Concept Annotation Tool For Equitable Extraction And Analysis Of Safety Insights From Maternity Investigations | Mohit Kumar Singh et.al. | 2406.05505 | null |
2024-06-08 | M3GIA: A Cognition Inspired Multilingual and Multimodal General Intelligence Ability Benchmark | Wei Song et.al. | 2406.05343 | link |
2024-06-07 | ProMotion: Prototypes As Motion Learners | Yawen Lu et.al. | 2406.04999 | null |
2024-06-07 | On the social bias of speech self-supervised models | Yi-Cheng Lin et.al. | 2406.04997 | null |
2024-06-07 | UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection | Yuchao Wang et.al. | 2406.04647 | null |
2024-06-06 | Function and form of U.S. cities | Sandro M. Reia et.al. | 2406.04543 | null |
2024-06-06 | TexIm FAST: Text-to-Image Representation for Semantic Similarity Evaluation using Transformers | Wazib Ansar et.al. | 2406.04438 | null |
2024-06-06 | Stereo-Depth Fusion through Virtual Pattern Projection | Luca Bartolomei et.al. | 2406.04345 | link |
2024-06-06 | Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation | Honglei Zhang et.al. | 2406.03933 | link |
2024-06-06 | Knowledge Transfer, Knowledge Gaps, and Knowledge Silos in Citation Networks | Eoghan Cunningham et.al. | 2406.03921 | link |
2024-06-06 | Transductive Off-policy Proximal Policy Optimization | Yaozhong Gan et.al. | 2406.03894 | null |
2024-06-05 | Does the Sun have a Dark Disk? | Gustavo F. S. Alves et.al. | 2406.03607 | null |
2024-06-05 | Reconciling Heterogeneous Effects in Causal Inference | Audrey Chang et.al. | 2406.03575 | null |
2024-06-05 | MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization | Xiaobo Guo et.al. | 2406.03479 | null |
2024-06-05 | A Flexible Recursive Network for Video Stereo Matching Based on Residual Estimation | Youchen Zhao et.al. | 2406.03333 | link |
2024-06-05 | On the Maximal Local Disparity of Fairness-Aware Classifiers | Jinqiu Jin et.al. | 2406.03255 | link |
2024-06-05 | MMCL: Boosting Deformable DETR-Based Detectors with Multi-Class Min-Margin Contrastive Learning for Superior Prohibited Item Detection | Mingyuan Li et.al. | 2406.03176 | link |
2024-06-05 | Instructing Prompt-to-Prompt Generation for Zero-Shot Learning | Man Liu et.al. | 2406.03032 | null |
2024-06-05 | GraphAlign: Pretraining One Graph Neural Network on Multiple Graphs via Feature Alignment | Zhenyu Hou et.al. | 2406.02953 | null |
2024-06-04 | Building Socially-Equitable Public Models | Yejia Liu et.al. | 2406.02790 | link |
2024-06-04 | VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors | Markus Plack et.al. | 2406.02552 | null |
2024-06-04 | The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding | Kenneth Enevoldsen et.al. | 2406.02396 | link |
2024-06-04 | Layer-2 Arbitrage: An Empirical Analysis of Swap Dynamics and Price Disparities on Rollups | Krzysztof Gogol et.al. | 2406.02172 | null |
2024-06-04 | A Multipurpose Interface for Close- and Far-Proximity Control of Mobile Collaborative Robots | Hamidreza Raei et.al. | 2406.02171 | link |
2024-06-05 | CondTSF: One-line Plugin of Dataset Condensation for Time Series Forecasting | Jianrong Ding et.al. | 2406.02131 | link |
2024-06-04 | Timescale bridging in atomistic simulations of epoxy polymer mechanics using non-affine deformation theory | Vinay Vaibhav et.al. | 2406.02113 | null |
2024-06-03 | Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities | Golnoosh Farnadi et.al. | 2406.01757 | null |
2024-06-03 | Inverse design of photonic surfaces on Inconel via multi-fidelity machine learning ensemble framework and high throughput femtosecond laser processing | Luka Grbcic et.al. | 2406.01471 | null |
2024-06-03 | Structural Interventions and the Dynamics of Inequality | Aurora Zhang et.al. | 2406.01323 | null |
2024-06-03 | Bridging the Digital Divide: Mapping Internet Connectivity Evolution, Inequalities, and Resilience in six Brazilian Cities | Nicolò Gozzi et.al. | 2406.01113 | null |
2024-05-31 | Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF | Tengyang Xie et.al. | 2405.21046 | null |
2024-05-31 | GANcrop: A Contrastive Defense Against Backdoor Attacks in Federated Learning | Xiaoyun Gan et.al. | 2405.20727 | null |
2024-05-31 | Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation | Shuzhou Yang et.al. | 2405.20669 | link |
2024-05-31 | Weak-Form Inference for Hybrid Dynamical Systems in Ecology | Daniel Messenger et.al. | 2405.20591 | null |
2024-05-31 | The Point of View of a Sentiment: Towards Clinician Bias Detection in Psychiatric Notes | Alissa A. Valentine et.al. | 2405.20582 | null |
2024-05-30 | Impact of Connected and Automated Vehicles on Transport Injustices | Laura Martinez-Buelvas et.al. | 2405.20530 | null |
2024-05-30 | Bridging electronic and classical density-functional theory using universal machine-learned functional approximations | Michelle M. Kelley et.al. | 2405.20270 | null |
2024-05-30 | Object-centric Reconstruction and Tracking of Dynamic Unknown Objects using 3D Gaussian Splatting | Kuldeep R Barad et.al. | 2405.20104 | null |
2024-05-30 | Strategies to Counter Artificial Intelligence in Law Enforcement: Cross-Country Comparison of Citizens in Greece, Italy and Spain | Petra Saskia Bayerl et.al. | 2405.19970 | null |
2024-05-29 | X-ray and Radio campaign of the Z-source GX 340+0: discovery of X-ray polarization and its implications | Yash Bhargava et.al. | 2405.19324 | null |
2024-05-29 | Measuring and Mitigating Bias for Tabular Datasets with Multiple Protected Attributes | Manh Khoi Duong et.al. | 2405.19300 | link |
2024-05-29 | Mitigating Disparate Impact of Differential Privacy in Federated Learning through Robust Clustering | Saber Malekmohammadi et.al. | 2405.19272 | null |
2024-05-29 | MAGIC: Modular Auto-encoder for Generalisable Model Inversion with Bias Corrections | Yihang She et.al. | 2405.18953 | link |
2024-05-29 | UniPTS: A Unified Framework for Proficient Post-Training Sparsity | Jingjing Xie et.al. | 2405.18810 | link |
2024-05-28 | The Efficacy of the Connect America Fund in Addressing US Internet Access Inequities | Haarika Manda et.al. | 2405.18657 | null |
2024-05-28 | Aligning in a Compact Space: Contrastive Knowledge Distillation between Heterogeneous Architectures | Hongjun Wu et.al. | 2405.18524 | null |
2024-05-28 | Exploring the Evolution of Altruistic Punishment with a PDE Model of Cultural Multilevel Selection | Daniel B. Cooney et.al. | 2405.18419 | link |
2024-05-28 | A Calibration Tool for Refractive Underwater Vision | Felix Seegräber et.al. | 2405.18018 | null |
2024-05-28 | Cross-Context Backdoor Attacks against Graph Prompt Learning | Xiaoting Lyu et.al. | 2405.17984 | link |
2024-05-28 | FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | Yunsong Wang et.al. | 2405.17958 | link |
2024-05-28 | Boosting Protein Language Models with Negative Sample Mining | Yaoyao Xu et.al. | 2405.17902 | link |
2024-05-28 | Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection | Yingwen Wu et.al. | 2405.17816 | null |
2024-05-27 | A Two-sided Model for EV Market Dynamics and Policy Implications | Haoxuan Ma et.al. | 2405.17702 | null |
2024-05-27 | Unifying Perspectives: Plausible Counterfactual Explanations on Global, Group-wise, and Local Levels | Patryk Wielopolski et.al. | 2405.17642 | null |
2024-05-27 | MindMerger: Efficient Boosting LLM Reasoning in non-English Languages | Zixian Huang et.al. | 2405.17386 | link |
2024-05-27 | EF-Calib: Spatiotemporal Calibration of Event- and Frame-Based Cameras Using Continuous-Time Trajectories | Shaoan Wang et.al. | 2405.17278 | link |
2024-05-27 | Highly inhomogeneous interactions between background climate and urban warming across typical local climate zones in heatwave and non-heatwave days | Jing Kong et.al. | 2405.17213 | null |
2024-05-27 | SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing | Yong-Qiang Mao et.al. | 2405.17140 | null |
2024-05-27 | Multi-view Disparity Estimation Using a Novel Gradient Consistency Model | James L. Gray et.al. | 2405.17029 | null |
2024-05-27 | Blind Data Adaptation to tackle Covariate Shift in Operational Steganalysis | Rony Abecidan et.al. | 2405.16961 | null |
2024-05-27 | Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models | Fengfan Zhou et.al. | 2405.16940 | null |
2024-05-28 | PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting | Zipeng Wang et.al. | 2405.16829 | null |
2024-05-27 | Addressing Discretization-Induced Bias in Demographic Prediction | Evan Dong et.al. | 2405.16762 | link |
2024-05-26 | Demystify Mamba in Vision: A Linear Attention Perspective | Dongchen Han et.al. | 2405.16605 | link |
2024-05-24 | Synthetic high angular momentum spin dynamics in a microwave oscillator | Saswata Roy et.al. | 2405.15695 | null |
2024-05-24 | Digital finance, Bargaining Power and Gender Wage Gap | Qing Guo et.al. | 2405.15486 | null |
2024-05-24 | Mind the Gap: A Causal Perspective on Bias Amplification in Prediction & Decision-Making | Drago Plecko et.al. | 2405.15446 | null |
2024-05-24 | Fairness-Accuracy Trade-Offs: A Causal Perspective | Drago Plecko et.al. | 2405.15443 | link |
2024-05-23 | ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization | Han Song et.al. | 2405.15082 | null |
2024-05-23 | Modularity, Higher-Order Recombination, and New Venture Success | Likun Cao et.al. | 2405.15042 | null |
2024-05-23 | Federated Online Adaptation for Deep Stereo | Matteo Poggi et.al. | 2405.14873 | null |
2024-05-23 | An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models | Jiahao Sun et.al. | 2405.14870 | link |
2024-05-23 | Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras | Hanzhang Tu et.al. | 2405.14866 | null |
2024-05-23 | A Systematic and Formal Study of the Impact of Local Differential Privacy on Fairness: Preliminary Results | Karima Makhlouf et.al. | 2405.14725 | null |
2024-05-23 | Is the EJRA proportionate and therefore justified? A critical review of the EJRA policy at Cambridge | Oliver Linton et.al. | 2405.14611 | null |
2024-05-23 | Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks | Xingguang Jiang et.al. | 2405.14520 | null |
2024-05-22 | Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation | Mykhailo Uss et.al. | 2405.14024 | null |
2024-05-22 | CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models | Giada Pistilli et.al. | 2405.13974 | null |
2024-05-22 | Multi-Dataset Multi-Task Learning for COVID-19 Prognosis | Filippo Ruffini et.al. | 2405.13771 | null |
2024-05-22 | Knowledge-Driven Cross-Document Relation Extraction | Monika Jain et.al. | 2405.13546 | link |
Monocular Depth Estimation
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-15 | Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation | Zhen Xu et.al. | 2507.11540 | null |
2025-07-15 | MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network | Jianfei Jiang et.al. | 2507.11333 | null |
2025-07-15 | Uncertainty Aware Mapping for Vision-Based Underwater Robots | Abhimanyu Bhowmik et.al. | 2507.10991 | null |
2025-07-14 | Static or Temporal? Semantic Scene Simplification to Aid Wayfinding in Immersive Simulations of Bionic Vision | Justin M. Kasowski et.al. | 2507.10813 | null |
2025-07-14 | Cameras as Relative Positional Encoding | Ruilong Li et.al. | 2507.10496 | null |
2025-07-14 | Spatial Lifting for Dense Prediction | Mingzhi Xu et.al. | 2507.10222 | null |
2025-07-13 | Prompt2DEM: High-Resolution DEMs for Urban and Open Environments from Global Prompts Using a Monocular Foundation Model | Osher Rafaeli et.al. | 2507.09681 | null |
2025-07-11 | ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way | Rajarshi Roy et.al. | 2507.08679 | null |
2025-07-10 | An Embedded Real-time Object Alert System for Visually Impaired: A Monocular Depth Estimation based Approach through Computer Vision | Jareen Anjom et.al. | 2507.08165 | null |
2025-07-10 | Tree-Mamba: A Tree-Aware Mamba for Underwater Monocular Depth Estimation | Peixian Zhuang et.al. | 2507.07687 | null |
2025-07-10 | HOTA: Hierarchical Overlap-Tiling Aggregation for Large-Area 3D Flood Mapping | Wenfeng Jia et.al. | 2507.07585 | null |
2025-07-08 | LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures | Seungoh Han et.al. | 2507.06109 | null |
2025-07-14 | Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation | Quanzhu Niu et.al. | 2507.05948 | null |
2025-07-07 | The Generalization Ridge: Information Flow in Natural Language Generation | Ruidi Chang et.al. | 2507.05387 | null |
2025-07-10 | VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting | Juyi Lin et.al. | 2507.05116 | null |
2025-07-07 | Estimating Object Physical Properties from RGB-D Vision and Depth Robot Sensors Using Deep Learning | Ricardo Cardoso et.al. | 2507.05029 | null |
2025-07-06 | A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields | Aoxiang Fan et.al. | 2507.04408 | null |
2025-07-06 | High-Resolution Sustain Pedal Depth Estimation from Piano Audio Across Room Acoustics | Kun Fang et.al. | 2507.04230 | null |
2025-07-03 | From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images | Danrong Zhang et.al. | 2507.02781 | null |
2025-07-10 | Underwater Monocular Metric Depth Estimation: Real-World Benchmarks and Synthetic Fine-Tuning with Vision Foundation Models | Zijie Cai et.al. | 2507.02148 | null |
2025-07-02 | RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather | Yuran Wang et.al. | 2507.01653 | null |
2025-07-02 | Depth Anything at Any Condition | Boyuan Sun et.al. | 2507.01634 | null |
2025-07-02 | DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation | Yue-Jiang Dong et.al. | 2507.01603 | null |
2025-07-02 | Evaluating Robustness of Monocular Depth Estimation with Procedural Scene Perturbations | Jack Nugent et.al. | 2507.00981 | null |
2025-06-30 | SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures | Fengyi Jiang et.al. | 2507.00209 | null |
2025-06-30 | OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving | Mingqian Ji et.al. | 2506.23565 | null |
2025-06-26 | ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation | Shruti Bansal et.al. | 2506.20969 | null |
2025-06-25 | THIRDEYE: Cue-Aware Monocular Depth Estimation via Brain-Inspired Multi-Stage Fusion | Calin Teodor Ioan et.al. | 2506.20877 | null |
2025-06-30 | StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation | Haodong Li et.al. | 2506.20756 | null |
2025-06-24 | Look to Locate: Vision-Based Multisensory Navigation with 3-D Digital Maps for GNSS-Challenged Environments | Ola Elmaghraby et.al. | 2506.19827 | null |
2025-06-23 | SOF: Sorted Opacity Fields for Fast Unbounded Surface Reconstruction | Lukas Radl et.al. | 2506.19139 | null |
2025-06-23 | BulletGen: Improving 4D Reconstruction with Bullet-Time Generation | Denys Rozumnyi et.al. | 2506.18601 | null |
2025-06-21 | Optimization-Free Patch Attack on Stereo Depth Estimation | Hangcheng Liu et.al. | 2506.17632 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206 | null |
2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et.al. | 2506.17119 | link |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110 | null |
2025-06-20 | DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches | Yun Xing et.al. | 2506.16690 | null |
2025-06-19 | EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised Training | Liangjing Shao et.al. | 2506.16017 | link |
2025-06-18 | RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation | Xingrui Qin et.al. | 2506.15560 | null |
2025-06-17 | Time-Optimized Safe Navigation in Unstructured Environments through Learning Based Depth Completion | Jeffrey Mao et.al. | 2506.14975 | null |
2025-06-17 | DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning | Kunal Swami et.al. | 2506.14709 | null |
2025-06-16 | Test3R: Learning to Reconstruct 3D at Test Time | Yuheng Yuan et.al. | 2506.13750 | link |
2025-06-16 | Multiview Geometric Regularization of Gaussian Splatting for Accurate Radiance Fields | Jungeon Kim et.al. | 2506.13508 | null |
2025-06-17 | Self-Supervised Enhancement for Depth from a Lightweight ToF Sensor with Monocular Images | Laiyan Ding et.al. | 2506.13444 | link |
2025-06-16 | TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast | Beilei Cui et.al. | 2506.13387 | link |
2025-06-17 | 3D Hand Mesh-Guided AI-Generated Malformed Hand Refinement with Hand Pose Transformation via Diffusion Model | Chen-Bin Feng et.al. | 2506.12680 | null |
2025-06-12 | Leveraging 6DoF Pose Foundation Models For Mapping Marine Sediment Burial | Jerry Yan et.al. | 2506.10386 | link |
2025-06-11 | DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects | Guanghu Xie et.al. | 2506.09491 | null |
2025-06-11 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning | Tong Wang et.al. | 2506.09327 | null |
2025-06-10 | AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models | Zheda Mai et.al. | 2506.09082 | null |
2025-06-10 | One Patch to Rule Them All: Transforming Static Patches into Dynamic Attacks in the Physical World | Xingshuo Han et.al. | 2506.08482 | null |
2025-06-09 | Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence | Octave Mariotti et.al. | 2506.08220 | null |
2025-06-09 | Hidden in plain sight: VLMs overlook their visual representations | Stephanie Fu et.al. | 2506.08008 | null |
2025-06-09 | EgoM2P: Egocentric Multimodal Multitask Pretraining | Gen Li et.al. | 2506.07886 | null |
2025-06-09 | Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images | Yingping Liang et.al. | 2506.07740 | null |
2025-06-07 | Dark Channel-Assisted Depth-from-Defocus from a Single Image | Moushumi Medhi et.al. | 2506.06643 | null |
2025-06-06 | NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces | Pierluigi Zama Ramirez et.al. | 2506.05815 | null |
2025-06-06 | Advancement and Field Evaluation of a Dual-arm Apple Harvesting Robot | Keyi Zhu et.al. | 2506.05714 | null |
2025-06-06 | Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration | Fanhu Zeng et.al. | 2506.05709 | null |
2025-06-06 | Aerial Multi-View Stereo via Adaptive Depth Range Inference and Normal Cues | Yimei Liu et.al. | 2506.05655 | null |
2025-06-09 | Structure-Aware Radar-Camera Depth Estimation | Fuyi Zhang et.al. | 2506.05008 | null |
2025-06-05 | Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer | Filip Slezak et.al. | 2506.04908 | null |
2025-06-05 | Toward Better SSIM Loss for Unsupervised Monocular Depth Estimation | Yijun Cao et.al. | 2506.04758 | null |
2025-06-04 | JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting | Yang Xiao et.al. | 2506.03872 | null |
2025-06-04 | Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation | Joonkyung Kim et.al. | 2506.03834 | null |
2025-06-03 | ViT-Split: Unleashing the Power of Vision Foundation Models via Efficient Splitting Heads | Yifan Li et.al. | 2506.03433 | null |
2025-06-02 | E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models | Wenyan Cong et.al. | 2506.01933 | null |
2025-06-01 | Perceptual Inductive Bias Is What You Need Before Contrastive Learning | Tianqin Li et.al. | 2506.01201 | null |
2025-06-01 | Depth-Aware Scoring and Hierarchical Alignment for Multiple Object Tracking | Milad Khanchi et.al. | 2506.00774 | null |
2025-05-31 | XYZ-IBD: High-precision Bin-picking Dataset for Object 6D Pose Estimation Capturing Real-world Industrial Complexity | Junwen Huang et.al. | 2506.00599 | null |
2025-05-31 | Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline | Zhaoying Wang et.al. | 2506.00546 | null |
2025-05-31 | Improving Optical Flow and Stereo Depth Estimation by Leveraging Uncertainty-Based Learning Difficulties | Jisoo Jeong et.al. | 2506.00324 | null |
2025-05-30 | Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization | Qingyao Tian et.al. | 2505.24249 | null |
2025-05-29 | Ultrafast High-Flux Single-Photon LiDAR Simulator via Neural Mapping | Weijian Zhang et.al. | 2505.23992 | null |
2025-05-29 | Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation | Sanggyun Ma et.al. | 2505.23400 | null |
2025-05-29 | GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion | Gwanghyun Kim et.al. | 2505.23085 | null |
2025-05-28 | Depth to magnetic source estimation using TDX contour | Hammed Oyekan et.al. | 2505.22780 | null |
2025-05-28 | Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss | Wenjun Lu et.al. | 2505.22279 | null |
2025-05-27 | Object Concepts Emerge from Motion | Haoqian Liang et.al. | 2505.21635 | null |
2025-05-23 | EvidenceMoE: A Physics-Guided Mixture-of-Experts with Evidential Critics for Advancing Fluorescence Light Detection and Ranging in Scattering Media | Ismail Erbas et.al. | 2505.21532 | null |
2025-05-27 | Occlusion Boundary and Depth: Mutual Enhancement via Multi-Task Learning | Lintao Xu et.al. | 2505.21231 | null |
2025-05-27 | Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing | Dehao Wang et.al. | 2505.21049 | null |
2025-05-27 | Spatial RoboGrasp: Generalized Robotic Grasping Control Policy | Yiqi Huang et.al. | 2505.20814 | null |
2025-05-26 | SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams | Zhuoheng Gao et.al. | 2505.19487 | null |
2025-05-25 | From Single Images to Motion Policies via Video-Generation Environment Representations | Weiming Zhi et.al. | 2505.19306 | null |
2025-05-23 | Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues | Chinmay Talegaonkar et.al. | 2505.17358 | null |
2025-05-22 | MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation | Bohan Zhou et.al. | 2505.16602 | null |
2025-05-22 | BadDepth: Backdoor Attacks Against Monocular Depth Estimation in the Physical World | Ji Guo et.al. | 2505.16154 | null |
2025-05-21 | RadarRGBD A Multi-Sensor Fusion Dataset for Perception with RGB-D and mmWave Radar | Tieshuai Song et.al. | 2505.15860 | null |
2025-05-21 | MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models | Yifan Liu et.al. | 2505.15185 | link |
2025-05-20 | Diving into the Fusion of Monocular Priors for Generalized Stereo Matching | Chengtang Yao et.al. | 2505.14414 | link |
2025-05-20 | M3Depth: Wavelet-Enhanced Depth Estimation on Mars via Mutual Boosting of Dual-Modal Data | Junjie Li et.al. | 2505.14159 | null |
2025-05-20 | Multi-Label Stereo Matching for Transparent Scene Depth Estimation | Zhidan Liu et.al. | 2505.14008 | link |
2025-05-20 | Event-Driven Dynamic Scene Depth Completion | Zhiqiang Yan et.al. | 2505.13279 | null |
2025-05-19 | DB3D-L: Depth-aware BEV Feature Transformation for Accurate 3D Lane Detection | Yehao Liu et.al. | 2505.13266 | null |
2025-05-20 | 3D Visual Illusion Depth Estimation | Chengtang Yao et.al. | 2505.13061 | link |
2025-05-19 | IA-MVS: Instance-Focused Adaptive Depth Sampling for Multi-View Stereo | Yinzhe Wang et.al. | 2505.12714 | null |
2025-05-18 | Depth Transfer: Learning to See Like a Simulator for Real-World Drone Navigation | Hang Yu et.al. | 2505.12428 | null |
2025-05-18 | Always Clear Depth: Robust Monocular Depth Estimation under Adverse Weather | Kui Jiang et.al. | 2505.12199 | link |
2025-05-17 | SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations | Songchun Zhang et.al. | 2505.11992 | null |
2025-05-17 | MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos | Hongyi Zhou et.al. | 2505.11868 | null |
2025-05-16 | SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision | Utsav Rai et.al. | 2505.11439 | null |
2025-05-16 | Attention on the Sphere | Boris Bonev et.al. | 2505.11157 | link |
2025-05-15 | Depth Anything with Any Prior | Zehan Wang et.al. | 2505.10565 | null |
2025-05-15 | JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation | Tiancong Cheng et.al. | 2505.10057 | null |
2025-05-14 | Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis | Bingxin Ke et.al. | 2505.09358 | link |
2025-05-13 | Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World | Yuran Wang et.al. | 2505.08607 | null |
2025-05-13 | Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images | Ziteng Liu et.al. | 2505.08178 | null |
2025-05-12 | Some insights into depth estimators for location and scatter in the multivariate setting | Jorge G. Adrover et.al. | 2505.07383 | null |
2025-05-11 | Reinforcement Learning-Based Monocular Vision Approach for Autonomous UAV Landing | Tarik Houichime et.al. | 2505.06963 | null |
2025-05-10 | ElectricSight: 3D Hazard Monitoring for Power Lines Using Low-Cost Sensors | Xingchen Li et.al. | 2505.06573 | null |
2025-05-09 | Camera-Only Bird’s Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles | Anupkumar Bochare et.al. | 2505.06113 | null |
2025-05-09 | MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection | Zhihao Zhang et.al. | 2505.04594 | null |
2025-05-13 | Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach | Srecharan Selvam et.al. | 2505.03702 | null |
2025-05-06 | LiftFeat: 3D Geometry-Aware Local Feature Matching | Yepeng Liu et.al. | 2505.03422 | link |
2025-05-06 | VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery | Bojin Wu et.al. | 2505.02704 | link |
2025-05-05 | DELTA: Dense Depth from Events and LiDAR using Transformer’s Attention | Vincent Brebion et.al. | 2505.02593 | null |
2025-05-03 | PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth | Bu Jin et.al. | 2505.01729 | null |
2025-05-02 | LMDepth: Lightweight Mamba-based Monocular Depth Estimation for Real-World Deployment | Jiahuan Long et.al. | 2505.00980 | null |
2025-05-01 | JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers | Kwon Byung-Ki et.al. | 2505.00482 | link |
2025-04-30 | HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation | Haiyang Zhou et.al. | 2504.21650 | link |
2025-04-30 | eNCApsulate: NCA for Precision Diagnosis on Capsule Endoscopes | Henry John Krumb et.al. | 2504.21562 | null |
2025-04-29 | Real-Time Wayfinding Assistant for Blind and Low-Vision Users | Dabbrata Das et.al. | 2504.20976 | null |
2025-04-29 | Large-scale visual SLAM for in-the-wild videos | Shuo Sun et.al. | 2504.20496 | null |
2025-04-28 | MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion | Zador Pataki et.al. | 2504.20040 | link |
2025-04-28 | Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video | Hoang Chuong Nguyen et.al. | 2504.19819 | null |
2025-04-27 | Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection | Athul M. Mathew et.al. | 2504.19271 | null |
2025-04-26 | Depth as Points: Center Point-based Depth Estimation | Zhiheng Tu et.al. | 2504.18773 | null |
2025-04-25 | LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning | Rui Li et.al. | 2504.18424 | null |
2025-04-25 | Dense Geometry Supervision for Underwater Depth Estimation | Wenxiang Gua et.al. | 2504.18233 | null |
2025-04-25 | LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring | Raul David Dominguez Sanchez et.al. | 2504.18203 | null |
2025-04-24 | The Fourth Monocular Depth Estimation Challenge | Anton Obukhov et.al. | 2504.17787 | null |
2025-04-24 | Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images | Zebo Huang et.al. | 2504.17582 | null |
2025-04-24 | Invasion depth estimation of gastric cancer in early stage using circularly polarized light scattering: Phantom studies | Mike R. Maskey et.al. | 2504.17161 | null |
2025-04-23 | PPS-Ctrl: Controllable Sim-to-Real Translation for Colonoscopy Depth Estimation | Xinqi Xiong et.al. | 2504.17067 | null |
2025-04-23 | Helping Blind People Grasp: Enhancing a Tactile Bracelet with an Automated Hand Navigation System | Marcin Furtak et.al. | 2504.16502 | null |
2025-04-21 | MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation | Xingxing Zuo et.al. | 2504.16127 | null |
2025-04-22 | DERD-Net: Learning Depth from Event-based Ray Densities | Diego de Oliveira Hitzges et.al. | 2504.15863 | null |
2025-04-22 | VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation | Mingxia Zhan et.al. | 2504.15095 | null |
2025-04-21 | Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation | Chenjie Cao et.al. | 2504.14899 | link |
2025-04-20 | Seurat: From Moving Points to Depth | Seokju Cho et.al. | 2504.14687 | link |
2025-04-18 | Occlusion-Ordered Semantic Instance Segmentation | Soroosh Baselizadeh et.al. | 2504.14054 | null |
2025-04-18 | Enhancing Pothole Detection and Characterization: Integrated Segmentation and Depth Estimation in Road Anomaly Systems | Uthman Baroudi et.al. | 2504.13648 | null |
2025-04-17 | Perception Encoder: The best visual embeddings are not at the output of the network | Daniel Bolya et.al. | 2504.13181 | null |
2025-04-17 | TSGS: Improving Gaussian Splatting for Transparent Surface Reconstruction via Normal and De-lighting Priors | Mingwei Li et.al. | 2504.12799 | null |
2025-04-17 | Privacy-Preserving Operating Room Workflow Analysis using Digital Twins | Alejandra Perez et.al. | 2504.12552 | null |
2025-04-16 | Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image | Tao Wen et.al. | 2504.12103 | null |
2025-04-16 | TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion | Yiran Wang et.al. | 2504.11773 | null |
2025-04-16 | An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World | Xingwu Ji et.al. | 2504.11698 | link |
2025-04-15 | Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception | Ziqi Pang et.al. | 2504.11457 | link |
2025-04-16 | DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation | Soyoung Yoo et.al. | 2504.11347 | null |
2025-04-18 | Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting | Jiaxin Huang et.al. | 2504.11092 | null |
2025-04-13 | TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting | Zhicong Wu et.al. | 2504.09588 | null |
2025-04-12 | Text To 3D Object Generation For Scalable Room Assembly | Sonia Laguna et.al. | 2504.09328 | null |
2025-04-11 | Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation | Bram Vanherle et.al. | 2504.08473 | link |
2025-04-10 | Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction | Zeren Jiang et.al. | 2504.07961 | link |
2025-04-09 | FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution | Gene Chou et.al. | 2504.07093 | link |
2025-04-08 | POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction | Songyan Zhang et.al. | 2504.05692 | link |
2025-04-07 | Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and Semidensification | Yasuhiro Yao et.al. | 2504.05148 | link |
2025-04-04 | 3D Scene Understanding Through Local Random Access Sequence Modeling | Wanhee Lee et.al. | 2504.03875 | null |
2025-04-04 | RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation | Hanbo Bi et.al. | 2504.03166 | null |
2025-04-03 | All-day Depth Completion via Thermal-LiDAR Fusion | Janghyun Kim et.al. | 2504.02356 | null |
2025-04-02 | FreSca: Unveiling the Scaling Space in Diffusion Models | Chao Huang et.al. | 2504.02154 | null |
2025-04-02 | Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis | Niluthpol Chowdhury Mithun et.al. | 2504.01960 | null |
2025-04-03 | Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting | Shu-Wei Lu et.al. | 2504.01957 | null |
2025-04-02 | A novel gesture interaction control method for rehabilitation lower extremity exoskeleton | Shuang Qiu et.al. | 2504.01888 | null |
2025-04-02 | DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image | Jijun Xiang et.al. | 2504.01596 | link |
2025-04-01 | GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors | Tian-Xing Xu et.al. | 2504.01016 | null |
2025-04-01 | Monocular and Generalizable Gaussian Talking Head Animation | Shengjie Gong et.al. | 2504.00665 | null |
2025-03-31 | ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image | Tianyi Gong et.al. | 2503.23881 | null |
2025-03-31 | Detail-aware multi-view stereo network for depth estimation | Haitao Tian et.al. | 2503.23684 | null |
2025-03-30 | Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries | Wei Xu et.al. | 2503.23606 | null |
2025-03-30 | Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model | Jannik Endres et.al. | 2503.23502 | link |
2025-03-28 | SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations | Krispin Wandel et.al. | 2503.22462 | null |
2025-03-28 | EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting | Xu Wang et.al. | 2503.22437 | link |
2025-03-28 | MVSAnywhere: Zero-Shot Multi-View Stereo | Sergio Izquierdo et.al. | 2503.22430 | null |
2025-03-28 | One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images | Byeongjun Kwon et.al. | 2503.22351 | null |
2025-03-28 | Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces | Wonhyeok Choi et.al. | 2503.22209 | null |
2025-03-28 | Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges | Ukcheol Shin et.al. | 2503.22060 | link |
2025-03-27 | A Unified Image-Dense Annotation Generation Model for Underwater Scenes | Hongkai Lin et.al. | 2503.21771 | link |
2025-03-27 | ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo | Yuxi Hu et.al. | 2503.21525 | null |
2025-03-26 | Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors | Weilong Yan et.al. | 2503.20211 | link |
2025-03-26 | FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion | Pihai Sun et.al. | 2503.19739 | link |
2025-03-25 | Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving | Yusen Xie et.al. | 2503.19713 | link |
2025-03-25 | StableGS: A Floater-Free Framework for 3D Gaussian Splatting | Luchao Wang et.al. | 2503.18458 | null |
2025-03-24 | PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes | Xinhua Xu et.al. | 2503.18393 | null |
2025-03-24 | MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction | Wenyuan Zhang et.al. | 2503.18363 | null |
2025-03-23 | Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images | Yara AlaaEldin et.al. | 2503.17982 | link |
2025-03-21 | Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image | Jerred Chen et.al. | 2503.17358 | null |
2025-03-21 | Radar-Guided Polynomial Fitting for Metric Depth Estimation | Patrick Rim et.al. | 2503.17182 | null |
2025-03-21 | AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process | Junjie Hu et.al. | 2503.17029 | null |
2025-03-21 | Distilling Monocular Foundation Model for Fine-grained Depth Completion | Yingping Liang et.al. | 2503.16970 | null |
2025-03-20 | QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge | Xuan Shen et.al. | 2503.16709 | link |
2025-03-20 | A Recipe for Generating 3D Worlds From a Single Image | Katja Schwarz et.al. | 2503.16611 | null |
2025-03-20 | DreamTexture: Shape from Virtual Texture with Analysis by Augmentation | Ananta R. Bhattarai et.al. | 2503.16412 | null |
2025-03-20 | Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors | Tian Yi Lim et.al. | 2503.16275 | null |
2025-03-20 | Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras | Beilei Cui et.al. | 2503.15917 | null |
2025-03-20 | Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation | Jiyuan Wang et.al. | 2503.15905 | null |
2025-03-19 | TULIP: Towards Unified Language-Image Pretraining | Zineng Tang et.al. | 2503.15485 | null |
2025-03-19 | EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining | Boshen Xu et.al. | 2503.15470 | link |
2025-03-19 | USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network | Joseph Emmanuel DL Dayo et.al. | 2503.14950 | null |
2025-03-18 | Multi-view Reconstruction via SfM-guided Monocular Depth Estimation | Haoyu Guo et.al. | 2503.14483 | null |
2025-03-18 | DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers | Mert Bulent Sariyildiz et.al. | 2503.14405 | null |
2025-03-18 | 3D Densification for Multi-Map Monocular VSLAM in Endoscopy | X. Anadón et.al. | 2503.14346 | null |
2025-03-17 | MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models | Johannes Meier et.al. | 2503.13743 | null |
2025-03-17 | SED-MVS: Segmentation-Driven and Edge-Aligned Deformation Multi-View Stereo with Depth Restoration and Occlusion Constraint | Zhenlong Yuan et.al. | 2503.13721 | null |
2025-03-17 | Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios | Iryna Repinetska et.al. | 2503.13710 | null |
2025-03-19 | FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis | Luxi Chen et.al. | 2503.13265 | null |
2025-03-17 | MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs | Erik Daxberger et.al. | 2503.13111 | null |
2025-03-17 | TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image | Haoxiao Wang et.al. | 2503.12779 | null |
2025-03-16 | UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing | Tsu-Jui Fu et.al. | 2503.12652 | null |
2025-03-16 | Deblur Gaussian Splatting SLAM | Francesco Girlanda et.al. | 2503.12572 | null |
2025-03-16 | Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View | Xianzu Wu et.al. | 2503.12553 | link |
2025-03-14 | VGGT: Visual Geometry Grounded Transformer | Jianyuan Wang et.al. | 2503.11651 | link |
2025-03-14 | Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation | Hongyu Wen et.al. | 2503.11633 | null |
2025-03-14 | Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation | Fengchen He et.al. | 2503.11213 | link |
2025-03-13 | Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations | Xunzhi Zheng et.al. | 2503.10464 | null |
2025-03-15 | WonderVerse: Extendable 3D Scene Generation with Video Generative Models | Hao Feng et.al. | 2503.09160 | null |
2025-03-11 | Language-Depth Navigated Thermal and Visible Image Fusion | Jinchang Zhang et.al. | 2503.08676 | null |
2025-03-11 | CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning | Kaiqiang Xiong et.al. | 2503.08219 | null |
2025-03-10 | SIRE: SE(3) Intrinsic Rigidity Embeddings | Cameron Smith et.al. | 2503.07739 | null |
2025-03-10 | LBM: Latent Bridge Matching for Fast Image-to-Image Translation | Clément Chadebec et.al. | 2503.07535 | link |
2025-03-12 | Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion | Mona Sheikh Zeinoddin et.al. | 2503.07204 | null |
2025-03-11 | LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation | Quanjian Song et.al. | 2503.06508 | link |
2025-03-08 | Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity | Xiaohao Xu et.al. | 2503.06014 | link |
2025-03-07 | TomatoScanner: phenotyping tomato fruit based on only RGB image | Xiaobei Zhao et.al. | 2503.05568 | link |
2025-03-07 | Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects | Justin Yu et.al. | 2503.05189 | null |
2025-03-05 | RTFusion: A depth estimation network based on multimodal fusion in challenging scenarios | Zelin Meng et.al. | 2503.04821 | null |
2025-03-06 | A Novel Solution for Drone Photogrammetry with Low-overlap Aerial Images using Monocular Depth Estimation | Jiageng Zhong et.al. | 2503.04513 | null |
2025-03-08 | EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images | Rohit Menon et.al. | 2503.04441 | null |
2025-03-06 | H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision | Yunxiao Shi et.al. | 2503.04059 | null |
2025-03-05 | Task-Agnostic Attacks Against Vision Foundation Models | Brian Pulfer et.al. | 2503.03842 | link |
2025-03-05 | Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings | Xusheng Du et.al. | 2503.03068 | null |
2025-03-04 | RGBSQGrasp: Inferring Local Superquadric Primitives from Single RGB Image for Graspability-Aware Bin Picking | Yifeng Xu et.al. | 2503.02387 | null |
2025-03-03 | MUSt3R: Multi-view Network for Stereo 3D Reconstruction | Yohann Cabon et.al. | 2503.01661 | link |
2025-03-02 | Bridging Spectral-wise and Multi-spectral Depth Estimation via Geometry-guided Contrastive Learning | Ukcheol Shin et.al. | 2503.00793 | link |
2025-02-28 | EndoPBR: Material and Lighting Estimation for Photorealistic Surgical Simulations via Physically-based Rendering | John J. Han et.al. | 2502.20669 | null |
2025-02-27 | UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler | Luigi Piccinelli et.al. | 2502.20110 | link |
2025-02-26 | Stellar Models Also Limit Exoplanet Atmosphere Studies in Emission | Thomas J. Fauchez et.al. | 2502.19585 | null |
2025-02-26 | Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator | Xiankang He et.al. | 2502.19204 | link |
2025-02-26 | SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images | Yangfan Xu et.al. | 2502.18932 | null |
2025-02-19 | Physical Depth-aware Early Accident Anticipation: A Multi-dimensional Visual Feature Fusion Framework | Hongpu Huang et.al. | 2502.18496 | null |
2025-02-21 | RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes | Sicheng Yu et.al. | 2502.15633 | null |
2025-02-20 | CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting | Qilin Zhang et.al. | 2502.14684 | link |
2025-03-03 | Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion | Jiangyuan Liu et.al. | 2502.14616 | link |
2025-02-20 | Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining | Wonhyeok Choi et.al. | 2502.14573 | null |
2025-02-20 | OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images | Zhichao Zheng et.al. | 2502.14279 | null |
2025-02-18 | Pre-training Auto-regressive Robotic Models with 4D Representations | Dantong Niu et.al. | 2502.13142 | null |
2025-02-18 | SHADeS: Self-supervised Monocular Depth Estimation Through Non-Lambertian Image Decomposition | Rema Daher et.al. | 2502.12994 | link |
2025-02-17 | Deep Neural Networks for Accurate Depth Estimation with Latent Space Features | Siddiqui Muhammad Yasir et.al. | 2502.11777 | null |
2025-02-16 | Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation | Kunal Swami et.al. | 2502.11002 | null |
2025-02-14 | ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences | Liyuan Zhu et.al. | 2502.10377 | null |
2025-02-14 | RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control | Teng Li et.al. | 2502.10059 | null |
2025-02-13 | SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest | Jack Erhardt et.al. | 2502.09528 | null |
2025-02-17 | S $^2$ -Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation | Quantao Yang et.al. | 2502.09389 | null |
2025-02-13 | CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery | Chenghao Zhang et.al. | 2502.08902 | null |
2025-02-13 | Visual-based spatial audio generation system for multi-speaker environments | Xiaojing Liu et.al. | 2502.07538 | null |
2025-02-11 | Learning Inverse Laplacian Pyramid for Progressive Depth Completion | Kun Wang et.al. | 2502.07289 | null |
2025-02-10 | From Image to Video: An Empirical Study of Diffusion Representations | Pedro Vélez et.al. | 2502.07001 | null |
2025-02-09 | Revisiting Gradient-based Uncertainty for Monocular Depth Estimation | Julia Hornauer et.al. | 2502.05964 | null |
2025-02-09 | SphereFusion: Efficient Panorama Depth Estimation via Gated Fusion | Qingsong Yan et.al. | 2502.05859 | null |
2025-02-05 | MetaFE-DE: Learning Meta Feature Embedding for Depth Estimation from Monocular Endoscopic Images | Dawei Lu et.al. | 2502.03493 | null |
2025-02-04 | DOC-Depth: A novel approach for dense depth ground truth generation | Simon de Moreau et.al. | 2502.02144 | null |
2025-02-01 | Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding | Jingming Xia et.al. | 2502.01666 | null |
2025-02-01 | Exploring Representation-Aligned Latent Space for Better Generation | Wanghan Xu et.al. | 2502.00359 | null |
2025-02-01 | MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model | Jihyeok Kim et.al. | 2502.00315 | null |
2025-01-30 | Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion | Vitor Guizilini et.al. | 2501.18804 | null |
2025-01-25 | Snapshot Compressed Imaging Based Single-Measurement Computer Vision for Videos | Fengpu Pan et.al. | 2501.15122 | null |
2025-01-24 | Rethinking Encoder-Decoder Flow Through Shared Structures | Frederik Laboyrie et.al. | 2501.14535 | null |
2025-01-23 | IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models | Jiayi Lei et.al. | 2501.13920 | null |
2025-01-23 | PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments | Changhao Wang et.al. | 2501.13796 | null |
2025-01-22 | Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation | Akshay Krishnan et.al. | 2501.13087 | null |
2025-01-22 | Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks | Alessio Quercia et.al. | 2501.12824 | link |
2025-01-22 | Video Depth Anything: Consistent Depth Estimation for Super-Long Videos | Sili Chen et.al. | 2501.12375 | null |
2025-01-21 | Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging | Shuyi Hu et.al. | 2501.11884 | null |
2025-01-21 | Survey on Monocular Metric Depth Estimation | Jiuling Zhang et.al. | 2501.11841 | null |
2025-01-19 | RDG-GS: Relative Depth Guidance with Gaussian Splatting for Real-time Sparse-View 3D Rendering | Chenlu Zhan et.al. | 2501.11102 | null |
2025-01-15 | BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation | Xiaolu Hou et.al. | 2501.10462 | link |
2025-01-20 | Zero-Shot Monocular Scene Flow Estimation in the Wild | Yiqing Liang et.al. | 2501.10357 | null |
2025-01-17 | One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression | Keita Miwa et.al. | 2501.10064 | null |
2025-01-17 | Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Mohammed Salah et.al. | 2501.09994 | link |
2025-01-21 | FoundationStereo: Zero-Shot Stereo Matching | Bowen Wen et.al. | 2501.09898 | link |
2025-01-16 | DEFOM-Stereo: Depth Foundation Model Based Stereo Matching | Hualie Jiang et.al. | 2501.09466 | link |
2025-01-15 | StereoGen: High-quality Stereo Image Generation from a Single Image | Xianqi Wang et.al. | 2501.08654 | null |
2025-01-15 | MonSter: Marry Monodepth to Stereo Unleashes Power | Junda Cheng et.al. | 2501.08643 | link |
2025-01-14 | A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation | Steven Landgraf et.al. | 2501.08188 | null |
2025-01-14 | Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2 | Seamie Hayes et.al. | 2501.08118 | null |
2025-01-13 | Fixing the Scale and Shift in Monocular Depth For Camera Pose Estimation | Yaqing Ding et.al. | 2501.07742 | link |
2025-01-13 | Matching Free Depth Recovery from Structured Light | Zhuohang Yu et.al. | 2501.07113 | null |
2025-01-09 | Relative Pose Estimation through Affine Corrections of Monocular Depth Priors | Yifan Yu et.al. | 2501.05446 | link |
2025-01-09 | $DPF^*$ : improved Depth Potential Function for scale-invariant sulcal depth estimation | Maxime Dieudonné et.al. | 2501.05436 | link |
2025-01-09 | A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision | Ali Rohan et.al. | 2501.05147 | null |
2025-01-08 | FatesGS: Fast and Accurate Sparse-View Surface Reconstruction using Gaussian Splatting with Depth-Feature Consistency | Han Huang et.al. | 2501.04628 | null |
2025-01-08 | FrontierNet: Learning Visual Cues to Explore | Boyang Sun et.al. | 2501.04597 | link |
2025-01-07 | AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features | Ruochen Zhang et.al. | 2501.03700 | null |
2025-01-05 | DepthMaster: Taming Diffusion Models for Monocular Depth Estimation | Ziyang Song et.al. | 2501.02576 | link |
2025-01-05 | Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera | Yuliang Guo et.al. | 2501.02464 | link |
2025-01-03 | SafeAug: Safety-Critical Driving Data Augmentation from Naturalistic Datasets | Zhaobin Mo et.al. | 2501.02143 | null |
2025-01-03 | Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery | Baoru Huang et.al. | 2501.01752 | null |
2025-01-03 | IGAF: Incremental Guided Attention Fusion for Depth Super-Resolution | Athanasios Tragakis et.al. | 2501.01723 | null |
2024-12-31 | Tech Report: Divide and Conquer 3D Real-Time Reconstruction for Improved IGS | Yicheng Zhu et.al. | 2501.01465 | null |
2025-01-02 | TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions | Vriksha Srihari et.al. | 2501.01156 | null |
2025-01-02 | PatchRefiner V2: Fast and Lightweight Real-Domain High-Resolution Metric Depth Estimation | Zhenyu Li et.al. | 2501.01121 | null |
2024-12-30 | FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI | Zhengdong Li et.al. | 2412.20974 | null |
2024-12-29 | MetricDepth: Enhancing Monocular Depth Estimation with Deep Metric Learning | Chunpu Liu et.al. | 2412.20390 | null |
2024-12-28 | Multi-Modality Driven LoRA for Adverse Condition Depth Estimation | Guanglei Yang et.al. | 2412.20162 | null |
2024-12-28 | DepthMamba with Adaptive Fusion | Zelin Meng et.al. | 2412.19964 | null |
2024-12-26 | An End-to-End Depth-Based Pipeline for Selfie Image Rectification | Ahmed Alhawwary et.al. | 2412.19189 | null |
2024-12-26 | Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement | Qiude Zhang et.al. | 2412.19165 | null |
2024-12-26 | MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo | Byeonggwon Lee et.al. | 2412.19130 | null |
2024-12-26 | Learning Monocular Depth from Events via Egomotion Compensation | Haitao Meng et.al. | 2412.19067 | null |
2024-12-24 | RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis | Yiling Yao et.al. | 2412.18380 | null |
2024-12-23 | V $^2$ -SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy | Long Bai et.al. | 2412.17595 | null |
2024-12-22 | GeoTexDensifier: Geometry-Texture-Aware Densification for High-Quality Photorealistic 3D Gaussian Splatting | Hanqing Jiang et.al. | 2412.16809 | null |
2024-12-27 | LiRCDepth: Lightweight Radar-Camera Depth Estimation via Knowledge Distillation and Uncertainty Guidance | Huawei Sun et.al. | 2412.16380 | link |
2024-12-19 | Flowing from Words to Pixels: A Framework for Cross-Modality Evolution | Qihao Liu et.al. | 2412.15213 | null |
2024-12-19 | Scaling 4D Representations | João Carreira et.al. | 2412.15212 | null |
2024-12-18 | Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation | Rémi Marsal et.al. | 2412.14103 | null |
2024-12-18 | Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation | Haotong Lin et.al. | 2412.14015 | link |
2024-12-18 | Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion | Massimiliano Viola et.al. | 2412.13389 | null |
2024-12-18 | Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera | Zhengdi Yu et.al. | 2412.12861 | null |
2024-12-17 | PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts | Kun Guo et.al. | 2412.12460 | null |
2024-12-16 | V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations | Jin-Cheng Jhang et.al. | 2412.11412 | null |
2024-12-16 | Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video | Junkai Fan et.al. | 2412.11395 | null |
2024-12-15 | ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction | Yi Feng et.al. | 2412.11210 | link |
2024-12-14 | MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance | Wenjun Huang et.al. | 2412.10730 | null |
2024-12-12 | Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos | Linyi Jin et.al. | 2412.09621 | null |
2024-12-12 | T-SVG: Text-Driven Stereoscopic Video Generation | Qiao Jin et.al. | 2412.09323 | null |
2024-12-12 | Cross-View Completion Models are Zero-shot Correspondence Estimators | Honggyu An et.al. | 2412.09072 | null |
2024-12-11 | BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation | Shengze Wang et.al. | 2412.08640 | null |
2024-12-13 | Utilizing Multi-step Loss for Single Image Reflection Removal | Abdelrahman Elnenaey et.al. | 2412.08582 | link |
2024-12-11 | Combining Neural Fields and Deformation Models for Non-Rigid 3D Motion Reconstruction from Partial Data | Aymen Merrouche et.al. | 2412.08511 | null |
2024-12-11 | Dense Depth from Event Focal Stack | Kenta Horikawa et.al. | 2412.08120 | null |
2024-12-10 | Diffusion-Based Attention Warping for Consistent 3D Scene Editing | Eyal Gomel et.al. | 2412.07984 | null |
2024-12-10 | Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation | Kurt H. W. Stolle et.al. | 2412.07966 | null |
2024-12-09 | SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception | Yaniv Benny et.al. | 2412.06968 | null |
2024-12-09 | Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving | Xin Fei et.al. | 2412.06777 | link |
2024-12-09 | MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views | Antoine Guédon et.al. | 2412.06767 | null |
2024-12-09 | On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events | Jesse Hagenaars et.al. | 2412.06359 | null |
2024-12-09 | Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction | Dongxu Wei et.al. | 2412.06273 | null |
2024-12-09 | Event fields: Capturing light fields at high speed, resolution, and dynamic range | Ziyuan Qu et.al. | 2412.06191 | null |
2024-12-08 | GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion | Karlo Koledic et.al. | 2412.06080 | null |
2024-12-08 | Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors | Alex Rich et.al. | 2412.05771 | null |
2024-12-10 | TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action | Zixian Ma et.al. | 2412.05479 | link |
2024-12-06 | SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB Images | Jiahua Dong et.al. | 2412.05274 | null |
2024-12-06 | Penetrative rotating magnetoconvection subject to lateral variations in temperature gradients | Tirtharaj Barman et.al. | 2412.05235 | null |
2024-12-06 | PanoDreamer: 3D Panorama Synthesis from a Single Image | Avinash Paliwal et.al. | 2412.04827 | link |
2024-12-05 | LAA-Net: A Physical-prior-knowledge Based Network for Robust Nighttime Depth Estimation | Kebin Peng et.al. | 2412.04666 | null |
2024-12-05 | Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail | Luca Bartolomei et.al. | 2412.04472 | link |
2024-12-05 | MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos | Zhengqi Li et.al. | 2412.04463 | null |
2024-12-05 | MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction | Mithun Parab et.al. | 2412.03928 | null |
2024-12-04 | Perception Tokens Enhance Visual Reasoning in Multimodal Language Models | Mahtab Bigverdi et.al. | 2412.03548 | null |
2024-12-04 | Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter | Hermes McGriff et.al. | 2412.03518 | null |
2024-12-04 | 2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction | Wanting Zhang et.al. | 2412.03428 | null |
2024-12-04 | MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction | Gangjian Zhang et.al. | 2412.03103 | null |
2024-12-05 | Align3R: Aligned Monocular Depth Estimation for Dynamic Videos | Jiahao Lu et.al. | 2412.03079 | null |
2024-12-03 | Single-Shot Metric Depth from Focused Plenoptic Cameras | Blanca Lasheras-Hernandez et.al. | 2412.02386 | null |
2024-12-03 | Dual Exposure Stereo for Extended Dynamic Range 3D Imaging | Juhyung Choi et.al. | 2412.02351 | null |
2024-12-03 | Amodal Depth Anything: Amodal Depth Estimation in the Wild | Zhenyu Li et.al. | 2412.02336 | null |
2024-12-03 | GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos | Zhiyuan Chen et.al. | 2412.02267 | null |
2024-12-03 | FoveaSPAD: Exploiting Depth Priors for Adaptive and Efficient Single-Photon 3D Imaging | Justin Folden et.al. | 2412.02052 | null |
2024-12-02 | Mutli-View 3D Reconstruction using Knowledge Distillation | Aditya Dutt et.al. | 2412.02039 | link |
2024-12-02 | AVS-Net: Audio-Visual Scale Net for Self-supervised Monocular Metric Depth Estimation | Xiaohu Liu et.al. | 2412.01637 | null |
2024-12-02 | STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation | Sunghun Yang et.al. | 2412.01090 | null |
2024-12-01 | FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation | Yunpeng Bai et.al. | 2412.00671 | null |
2024-11-29 | SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection | Philipp Wolters et.al. | 2411.19860 | null |
2024-11-29 | MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications | Gasser Elazab et.al. | 2411.19717 | null |
2024-11-29 | Gaussian Splashing: Direct Volumetric Rendering Underwater | Nir Mualem et.al. | 2411.19588 | null |
2024-11-28 | Learning Surrogate Rainfall-driven Inundation Models with Few Data | Marzieh Alireza Mirhoseini et.al. | 2411.19323 | null |
2024-11-28 | AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones | Xuqian Ren et.al. | 2411.19271 | null |
2024-11-28 | Video Depth without Video Models | Bingxin Ke et.al. | 2411.19189 | null |
2024-11-28 | 360Recon: An Accurate Reconstruction Method Based on Depth Fusion from 360 Images | Zhongmiao Yan et.al. | 2411.19102 | null |
2024-11-27 | Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation | Mehdi Zayene et.al. | 2411.18335 | link |
2024-11-27 | GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation | Wenbo Cui et.al. | 2411.18276 | null |
2024-11-27 | SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation | Duc-Hai Pham et.al. | 2411.18229 | null |
2024-11-26 | Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation | Sudarshan Rajagopalan et.al. | 2411.17814 | null |
2024-11-26 | Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors | Ziang Xu et.al. | 2411.17790 | null |
2024-11-26 | DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting | Christian Homeyer et.al. | 2411.17660 | link |
2024-11-26 | Spatially Visual Perception for End-to-End Robotic Learning | Travis Davies et.al. | 2411.17458 | null |
2024-11-26 | DepthCues: Evaluating Monocular Depth Perception in Large Vision Models | Duolikun Danier et.al. | 2411.17385 | null |
2024-11-26 | Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration | Junyuan Deng et.al. | 2411.17240 | link |
2024-11-25 | G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs | Kunyi Li et.al. | 2411.16898 | null |
2024-11-24 | PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation | Ziyao Zeng et.al. | 2411.16750 | null |
2024-11-25 | Generative Omnimatte: Learning to Decompose Video into Layers | Yao-Chih Lee et.al. | 2411.16683 | null |
2024-11-25 | One Diffusion to Generate Them All | Duong H. Le et.al. | 2411.16318 | link |
2024-11-24 | Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors | Soumava Paul et.al. | 2411.15966 | null |
2024-11-21 | StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart | Jian Shi et.al. | 2411.14295 | link |
2024-11-20 | DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild | Weicai Ye et.al. | 2411.13291 | null |
2024-11-20 | OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging | Rajini Makam et.al. | 2411.13230 | link |
2024-11-15 | SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction | Yutao Tang et.al. | 2411.12592 | link |
2024-11-18 | Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park et.al. | 2411.11691 | null |
2024-11-18 | MGNiceNet: Unified Monocular Geometric Scene Understanding | Markus Schön et.al. | 2411.11466 | null |
2024-11-18 | The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather | Markus Schön et.al. | 2411.11455 | null |
2024-11-18 | GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views | Boyao Zhou et.al. | 2411.11363 | null |
2024-11-18 | Scalable Autoregressive Monocular Depth Estimation | Jinhong Wang et.al. | 2411.11361 | null |
2024-11-16 | MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation | Ansh Shah et.al. | 2411.10886 | link |
2024-11-19 | EVT: Efficient View Transformation for Multi-Modal 3D Object Detection | Yongjin Lee et.al. | 2411.10715 | null |
2024-11-15 | Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses | Yongfan Liu et.al. | 2411.10013 | link |
2024-11-14 | Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting | Yian Wang et.al. | 2411.09823 | null |
2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
2024-11-14 | Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching | Yuran Wang et.al. | 2411.09151 | null |
2024-11-13 | OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances | Youqi Liao et.al. | 2411.08665 | link |
2024-11-09 | Online Collision Risk Estimation via Monocular Depth-Aware Object Detectors and Fuzzy Inference | Brian Hsuan-Cheng Liao et.al. | 2411.08060 | null |
2024-11-13 | Scaling Properties of Diffusion Models for Perceptual Tasks | Rahul Ravishankar et.al. | 2411.08034 | null |
2024-11-11 | $SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation | Yinshuang Xu et.al. | 2411.07326 | null |
2024-11-08 | Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning | Quang Truong Nguyen et.al. | 2411.05344 | null |
2024-11-08 | SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection | Yun Zhao et.al. | 2411.05292 | null |
2024-11-07 | D $^3$ epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes | Siyu Chen et.al. | 2411.04826 | null |
2024-11-06 | Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation | Teppei Kurita et.al. | 2411.04714 | null |
2024-11-07 | Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation | Qingyao Tian et.al. | 2411.04404 | null |
2024-11-04 | PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes | Kebin Peng et.al. | 2411.04227 | null |
2024-11-06 | Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions | Zihan Qin et.al. | 2411.03638 | null |
2024-11-05 | Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor | Anish Bhattacharya et.al. | 2411.03303 | null |
2024-11-05 | Correlation of Object Detection Performance with Visual Saliency and Depth Estimation | Matthias Bartolo et.al. | 2411.02844 | link |
2024-11-05 | FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Ruihong Yin et.al. | 2411.02229 | null |
2024-11-05 | Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training | Yuanqi Yao et.al. | 2411.02149 | null |
2024-11-02 | MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction | Wang Zhao et.al. | 2411.01226 | link |
2024-11-01 | MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes | Sanghyun Byun et.al. | 2411.01048 | null |
2024-11-01 | On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR | Li Li et.al. | 2411.00600 | link |
2024-10-31 | Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving | Ce Zhou et.al. | 2411.00192 | null |
2024-10-31 | ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images | Timing Yang et.al. | 2410.24001 | link |
2024-10-30 | Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe | Songyu Xu et.al. | 2410.23154 | null |
2024-10-29 | Active Event Alignment for Monocular Distance Estimation | Nan Cai et.al. | 2410.22280 | null |
2024-10-29 | PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting | Sunghwan Hong et.al. | 2410.22128 | link |
2024-10-27 | Unlocking Comics: The AI4VA Dataset for Visual Understanding | Peter Grönquist et.al. | 2410.20459 | link |
2024-10-27 | Depth Attention for Robust RGB Tracking | Yu Liu et.al. | 2410.20395 | link |
2024-10-21 | YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning | Ranjan Sapkota et.al. | 2410.19846 | null |
2024-10-25 | MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors | Fanqi Pu et.al. | 2410.19590 | link |
2024-10-24 | Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction | Hongxin Peng et.al. | 2410.18433 | null |
2024-10-24 | Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images | Dong-Guw Lee et.al. | 2410.18340 | link |
2024-10-25 | UnCLe: Unsupervised Continual Learning of Depth Completion | Suchisrit Gangopadhyay et.al. | 2410.18074 | null |
2024-10-21 | TIPS: Text-Image Pretraining with Spatial Awareness | Kevis-Kokitsi Maninis et.al. | 2410.16512 | null |
2024-10-22 | DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain | Kun Wang et.al. | 2410.14980 | link |
2024-10-17 | DepthSplat: Connecting Gaussian Splatting and Depth | Haofei Xu et.al. | 2410.13862 | link |
2024-10-16 | DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning | Jiabao Wei et.al. | 2410.12501 | null |
2024-10-16 | Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture | Dabbrata Das et.al. | 2410.11610 | link |
2024-10-16 | CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction | Pranav Gupta et.al. | 2410.11211 | link |
2024-10-14 | Few-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting | Raja Kumar et.al. | 2410.11080 | link |
2024-10-14 | When Does Perceptual Alignment Benefit Vision Representations? | Shobhita Sundaram et.al. | 2410.10817 | null |
2024-10-14 | Depth Any Video with Scalable Synthetic Data | Honghui Yang et.al. | 2410.10815 | link |
2024-10-15 | Improved Depth Estimation of Bayesian Neural Networks | Bart van Erp et.al. | 2410.10395 | link |
2024-10-10 | Color-Guided Flying Pixel Correction in Depth Images | Ekamresh Vasudevan et.al. | 2410.08084 | link |
2024-10-09 | Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models | Ange Lou et.al. | 2410.07434 | null |
2024-10-09 | Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation | Runze Chen et.al. | 2410.06982 | null |
2024-10-09 | Analysis of different disparity estimation techniques on aerial stereo image datasets | Ishan Narayan et.al. | 2410.06711 | null |
2024-10-08 | Vision Transformer based Random Walk for Group Re-Identification | Guoqing Zhang et.al. | 2410.05808 | null |
2024-10-08 | CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality | Wenjie Chang et.al. | 2410.05735 | null |
2024-10-07 | PhotoReg: Photometrically Registering 3D Gaussian Splatting Models | Ziwen Yuan et.al. | 2410.05044 | null |
2024-10-06 | Mode-GS: Monocular Depth Guided Anchored 3D Gaussian Splatting for Robust Ground-View Scene Rendering | Yonghan Lee et.al. | 2410.04646 | null |
2024-10-10 | Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy | Pengcheng Chen et.al. | 2410.04041 | null |
2024-10-04 | Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering | Laura Fink et.al. | 2410.03861 | link |
2024-10-03 | DecTrain: Deciding When to Train a DNN Online | Zih-Sing Fu et.al. | 2410.02980 | null |
2024-10-03 | RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions | Ziyao Zeng et.al. | 2410.02924 | link |
2024-10-02 | Depth Pro: Sharp Monocular Metric Depth in Less Than a Second | Aleksei Bochkovskii et.al. | 2410.02073 | link |
2024-10-02 | Learning from the Giants: A Practical Approach to Underwater Depth and Surface Normals Estimation | Alzayat Saleh et.al. | 2410.02072 | null |
2024-10-02 | SinkSAM: A Monocular Depth-Guided SAM Framework for Automatic Sinkhole Segmentation | Osher Rafaeli et.al. | 2410.01473 | link |
2024-10-01 | Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation | Shuting Zhao et.al. | 2410.00979 | null |
2024-10-01 | Radar Meets Vision: Robustifying Monocular Metric Depth Prediction for Mobile Robotics | Marco Job et.al. | 2410.00736 | null |
2024-10-01 | Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration | Yida Lin et.al. | 2410.00503 | null |
2024-10-01 | Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance | Hongchao Shu et.al. | 2410.00386 | null |
2024-09-30 | CCDepth: A Lightweight Self-supervised Depth Estimation Network with Enhanced Interpretability | Xi Zhang et.al. | 2409.19933 | null |
2024-09-30 | EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth Prediction | Ivan Reyes-Amezcua et.al. | 2409.19930 | link |
2024-09-29 | fCOP: Focal Length Estimation from Category-level Object Priors | Xinyue Zhang et.al. | 2409.19641 | null |
2024-09-29 | KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation | Soofiyan Atar et.al. | 2409.19490 | null |
2024-09-27 | Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping | Anthony A. Song et.al. | 2409.19153 | null |
2024-09-26 | Self-supervised Monocular Depth Estimation with Large Kernel Attention | Xuezhi Xiang et.al. | 2409.17895 | null |
2024-09-26 | Self-Distilled Depth Refinement with Noisy Poisson Fusion | Jiaqi Li et.al. | 2409.17880 | link |
2024-09-27 | A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts | Aurel Pjetri et.al. | 2409.17851 | null |
2024-09-26 | Event-based Stereo Depth Estimation: A Survey | Suman Ghosh et.al. | 2409.17680 | null |
2024-09-26 | CAMOT: Camera Angle-aware Multi-Object Tracking | Felix Limanta et.al. | 2409.17533 | null |
2024-09-25 | Optical Lens Attack on Deep Learning Based Monocular Depth Estimation | Ce Zhou et.al. | 2409.17376 | null |
2024-09-25 | Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation | Richard D. Paul et.al. | 2409.17085 | null |
2024-09-25 | EventHDR: from Event to High-Speed HDR Videos and Beyond | Yunhao Zou et.al. | 2409.17029 | null |
2024-09-25 | 3DDX: Bone Surface Reconstruction from a Single Standard-Geometry Radiograph via Dual-Face Depth Estimation | Yi Gu et.al. | 2409.16702 | link |
2024-09-24 | MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling | Yifang Men et.al. | 2409.16160 | null |
2024-09-24 | Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data | An Wang et.al. | 2409.16063 | link |
2024-09-23 | FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera | Guoyang Zhao et.al. | 2409.15054 | link |
2024-09-23 | DepthART: Monocular Depth Estimation as Autoregressive Refinement Task | Bulat Gabdullin et.al. | 2409.15010 | null |
2024-09-23 | Generalizing monocular colonoscopy image depth estimation by uncertainty-based global and local fusion network | Sijia Du et.al. | 2409.15006 | null |
2024-09-23 | GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth | Aurélien Cecille et.al. | 2409.14850 | link |
2024-09-23 | Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras | Ming Li et.al. | 2409.14766 | null |
2024-09-25 | D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation | Songlin Wei et.al. | 2409.14365 | null |
2024-09-22 | MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views | Wangze Xu et.al. | 2409.14316 | null |
2024-09-21 | @Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology | Xin Jiang et.al. | 2409.14215 | null |
2024-09-18 | Panoptic-Depth Forecasting | Juana Valeria Hurtado et.al. | 2409.12008 | null |
2024-09-17 | Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Gonzalo Martin Garcia et.al. | 2409.11355 | link |
2024-09-15 | GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion | Vitor Guizilini et.al. | 2409.09896 | null |
2024-09-15 | Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation | Xiaolong Qian et.al. | 2409.09754 | link |
2024-09-13 | PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage | Denis Zavadski et.al. | 2409.09144 | link |
2024-09-23 | Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding | Rania Hossam et.al. | 2409.08695 | link |
2024-09-12 | Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor | Andrea Conti et.al. | 2409.08277 | null |
2024-09-12 | LED: Light Enhanced Depth Estimation at Night | Simon de Moreau et.al. | 2409.08031 | link |
2024-09-12 | Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes | Ming Li et.al. | 2409.07843 | null |
2024-09-12 | Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy | Bojian Li et.al. | 2409.07723 | null |
2024-09-12 | FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments | Devansh Dhrafani et.al. | 2409.07715 | null |
2024-09-10 | Deep Neural Networks: Multi-Classification and Universal Approximation | Martín Hernández et.al. | 2409.06555 | null |
2024-09-10 | EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation | Nischal Khanal et.al. | 2409.06183 | link |
2024-09-11 | EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels | Qingyao Tian et.al. | 2409.05442 | link |
2024-09-09 | Spontaneous magnetic field and disorder effects in BaPtAs_1-x_Sb_x_ with honeycomb network | T. Adachi et.al. | 2409.05266 | null |
2024-09-08 | TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs | Horatiu Florea et.al. | 2409.05142 | null |
2024-09-12 | Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective | Tim Bader et.al. | 2409.04086 | link |
2024-09-08 | Estimating Indoor Scene Depth Maps from Ultrasonic Echoes | Junpei Honma et.al. | 2409.03336 | null |
2024-09-04 | iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation | Hayeon Jo et.al. | 2409.02838 | null |
2024-09-02 | GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling | Huawei Sun et.al. | 2409.02720 | link |
2024-09-04 | Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects | Kyungmin Jo et.al. | 2409.02653 | null |
2024-09-04 | UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching | Soomin Kim et.al. | 2409.02545 | null |
2024-09-04 | SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction | Sumin Son et.al. | 2409.02513 | null |
2024-09-04 | Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation | Li Liu et.al. | 2409.02494 | link |
2024-09-04 | Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization | Cho-Ying Wu et.al. | 2409.02486 | null |
2024-09-04 | GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving | Huasong Han et.al. | 2409.02382 | null |
2024-09-03 | DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos | Wenbo Hu et.al. | 2409.02095 | link |
2024-09-02 | Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling | Haicheng Liao et.al. | 2409.01256 | null |
2024-08-30 | DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model | Mona Sheikh Zeinoddin et.al. | 2408.17433 | link |
2024-08-30 | Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method | Yuji Lin et.al. | 2408.17339 | link |
2024-08-30 | Synthetic Lunar Terrain: A Multimodal Open Dataset for Training and Evaluating Neuromorphic Vision Algorithms | Marcus Märtens et.al. | 2408.16971 | null |
2024-08-29 | EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More | Kanghao Chen et.al. | 2408.16254 | null |
2024-08-30 | Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective | Zhijie Shen et.al. | 2408.16227 | link |
2024-08-27 | Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack | Naufal Suryanto et.al. | 2408.14879 | link |
2024-08-26 | NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training | Albert Luginov et.al. | 2408.14177 | link |
2024-08-26 | Pixel-Aligned Multi-View Generation with Depth Guided Decoder | Zhenggang Tang et.al. | 2408.14016 | null |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
2024-08-25 | InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth | Cho-Ying Wu et.al. | 2408.13708 | null |
2024-08-25 | SeeBelow: Sub-dermal 3D Reconstruction of Tumors with Surgical Robotic Palpation and Tactile Exploration | Raghava Uppuluri et.al. | 2408.13699 | null |
2024-08-27 | Sapiens: Foundation for Human Vision Models | Rawal Khirodkar et.al. | 2408.12569 | null |
2024-08-21 | LiFCal: Online Light Field Camera Calibration via Bundle Adjustment | Aymeric Fleith et.al. | 2408.11682 | null |
2024-08-19 | Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video | Shuxian Wang et.al. | 2408.10153 | link |
2024-08-19 | SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition | Wiktor Mucha et.al. | 2408.10037 | link |
2024-08-19 | P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders | Xuechao Chen et.al. | 2408.10007 | link |
2024-08-14 | Enhanced Scale-aware Depth Estimation for Monocular Endoscopic Scenes with Geometric Modeling | Ruofeng Wei et.al. | 2408.07266 | null |
2024-08-12 | Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces | Junrui Zhang et.al. | 2408.06083 | null |
2024-08-08 | Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation | Daniele Rege Cambrin et.al. | 2408.04523 | link |
2024-08-08 | Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework | Subhasis Dasgupta et.al. | 2408.04360 | null |
2024-08-08 | Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform | Daniel Vargas et.al. | 2408.04195 | null |
2024-08-07 | Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach | Benedikt W. Hosp et.al. | 2408.03591 | null |
2024-08-06 | BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications | G. Manni et.al. | 2408.03078 | link |
2024-08-05 | Gaussian Mixture based Evidential Learning for Stereo Matching | Weide Liu et.al. | 2408.02796 | null |
2024-08-05 | Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining | Dongyang Liu et.al. | 2408.02657 | link |
2024-08-03 | MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas | Feng Qiao et.al. | 2408.01653 | null |
2024-08-02 | Self-Supervised Depth Estimation Based on Camera Models | Jinchang Zhang et.al. | 2408.01565 | null |
2024-08-01 | MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection | Youjia Fu et.al. | 2408.00438 | null |
2024-08-01 | High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior | Wencheng Han et.al. | 2408.00361 | null |
2024-08-01 | LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting | Zhenyu Bao et.al. | 2408.00254 | null |
2024-07-31 | Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching | Pengjie Zhang et.al. | 2407.21735 | null |
2024-07-29 | BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation | Kieran Saunders et.al. | 2407.20437 | null |
2024-07-29 | Analysis and Improvement of Rank-Ordered Mean Algorithm in Single-Photon LiDAR | William C. Yau et.al. | 2407.20399 | null |
2024-07-29 | Improving 2D Feature Representations by 3D-Aware Fine-Tuning | Yuanwen Yue et.al. | 2407.20229 | null |
2024-07-27 | Revisit Self-supervised Depth Estimation with Local Structure-from-Motion | Shengjie Zhu et.al. | 2407.19166 | null |
2024-07-27 | RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry | Shengjie Zhu et.al. | 2407.19154 | null |
2024-07-26 | HybridDepth: Robust Depth Fusion for Mobile AR by Leveraging Depth from Focus and Single-Image Priors | Ashkan Ganj et.al. | 2407.18443 | link |
2024-07-26 | Enhanced Depth Estimation and 3D Geometry Reconstruction using Bayesian Helmholtz Stereopsis with Belief Propagation | Razieh Azizi et.al. | 2407.18195 | null |
2024-07-25 | BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation | Xiang Zhang et.al. | 2407.17952 | null |
2024-07-25 | UMono: Physical Model Informed Hybrid CNN-Transformer Framework for Underwater Monocular Depth Estimation | Jian Wang et.al. | 2407.17838 | null |
2024-07-24 | DarSwin-Unet: Distortion Aware Encoder-Decoder Architecture | Akshaya Athwale et.al. | 2407.17328 | null |
2024-07-24 | Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches | Chenxing Zhao et.al. | 2407.17312 | null |
2024-07-23 | SINDER: Repairing the Singular Defects of DINOv2 | Haoqi Wang et.al. | 2407.16826 | link |
2024-07-23 | Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions | Fabio Tosi et.al. | 2407.16698 | link |
2024-07-23 | ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation | Zhenhua Wu et.al. | 2407.16508 | null |
2024-07-19 | Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation | Jinfeng Liu et.al. | 2407.14126 | link |
2024-07-18 | Unveiling the purely young star formation history of the SMC’s northeastern shell from colour-magnitude diagram fitting | Joanna D. Sakowska et.al. | 2407.13876 | null |
2024-07-18 | Many Perception Tasks are Highly Redundant Functions of their Input Data | Rahul Ramesh et.al. | 2407.13841 | null |
2024-07-18 | Shape of Motion: 4D Reconstruction from a Single Video | Qianqian Wang et.al. | 2407.13764 | null |
2024-07-18 | Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks | Antoni Kowalczuk et.al. | 2407.12588 | link |
2024-07-16 | Temporally Consistent Stereo Matching | Jiaxi Zeng et.al. | 2407.11950 | link |
2024-07-15 | IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation | Yuanhao Zhai et.al. | 2407.10937 | link |
2024-07-15 | OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection | Jinghua Hou et.al. | 2407.10753 | link |
2024-07-15 | Towards Scale-Aware Full Surround Monodepth with Transformers | Yuchen Yang et.al. | 2407.10406 | null |
2024-07-12 | ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion | Sungmin Woo et.al. | 2407.09303 | link |
2024-07-11 | ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation | Ruijie Zhu et.al. | 2407.08187 | link |
2024-07-10 | Controlling Space and Time with Diffusion Models | Daniel Watson et.al. | 2407.07860 | null |
2024-07-07 | SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning | Yi Feng et.al. | 2407.05283 | link |
2024-07-05 | A Physical Model-Guided Framework for Underwater Image Enhancement and Depth Estimation | Dazhao Du et.al. | 2407.04230 | link |
2024-07-04 | Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation | Laiyan Ding et.al. | 2407.04041 | link |
2024-07-02 | Parametric Modeling and Estimation of Photon Registrations for 3D Imaging | Weijian Zhang et.al. | 2407.02712 | null |
2024-07-02 | Depth-Aware Endoscopic Video Inpainting | Francis Xiatian Zhang et.al. | 2407.02675 | link |
2024-07-04 | Camera-LiDAR Cross-modality Gait Recognition | Wenxuan Guo et.al. | 2407.02038 | null |
2024-07-07 | CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation | Huawei Sun et.al. | 2407.00697 | link |
2024-06-28 | Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey | Uchitha Rajapaksha et.al. | 2406.19675 | null |
2024-06-27 | What Matters in Detecting AI-Generated Videos like Sora? | Chirui Chang et.al. | 2406.19568 | null |
2024-07-05 | 360 in the Wild: Dataset for Depth Prediction and View Synthesis | Kibaek Park et.al. | 2406.18898 | null |
2024-06-27 | Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach | Yuxiang Huang et.al. | 2406.18837 | null |
2024-06-26 | MultiDiff: Consistent Novel View Synthesis from a Single Image | Norman Müller et.al. | 2406.18524 | null |
2024-06-26 | DoubleTake: Geometry Guided Depth Estimation | Mohamed Sayed et.al. | 2406.18387 | null |
2024-06-25 | Depth-Guided Semi-Supervised Instance Segmentation | Xin Chen et.al. | 2406.17413 | null |
2024-06-20 | Uncertainty and Self-Supervision in Single-View Depth | Javier Rodriguez-Puigvert et.al. | 2406.14226 | null |
2024-06-19 | WaterMono: Teacher-Guided Anomaly Masking and Enhancement Boosting for Robust Underwater Self-Supervised Monocular Depth Estimation | Yilin Ding et.al. | 2406.13344 | link |
2024-06-18 | Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation | Ning-Hsu Wang et.al. | 2406.12849 | null |
2024-06-21 | GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models | Yongtao Ge et.al. | 2406.12671 | link |
2024-06-17 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-17 | MEDeA: Multi-view Efficient Depth Adjustment | Mikhail Artemyev et.al. | 2406.12048 | null |
2024-06-16 | Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry | Boris Chidlovskii et.al. | 2406.11019 | null |
2024-06-16 | 3D Gaze Tracking for Studying Collaborative Interactions in Mixed-Reality Environments | Eduardo Davalos et.al. | 2406.11003 | null |
2024-06-15 | GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR | Bharat Singh et.al. | 2406.10722 | null |
2024-06-14 | The BabyView dataset: High-resolution egocentric videos of infants’ and young children’s everyday experiences | Bria Long et.al. | 2406.10447 | null |
2024-06-14 | D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video | Moritz Kappel et.al. | 2406.10078 | null |
2024-06-14 | DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications | Li Li et.al. | 2406.10068 | link |
2024-06-14 | Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion | Runze Liu et.al. | 2406.09782 | null |
2024-06-13 | Depth Anything V2 | Lihe Yang et.al. | 2406.09414 | link |
2024-06-14 | WonderWorld: Interactive 3D Scene Generation from a Single Image | Hong-Xing Yu et.al. | 2406.09394 | null |
2024-06-13 | Scale-Invariant Monocular Depth Estimation via SSI Depth | S. Mahdi H. Miangoleh et.al. | 2406.09374 | link |
2024-06-13 | Multiple Prior Representation Learning for Self-Supervised Monocular Depth Estimation via Hybrid Transformer | Guodong Sun et.al. | 2406.08928 | link |
2024-06-13 | ToSA: Token Selective Attention for Efficient Vision Transformers | Manish Kumar Singh et.al. | 2406.08816 | null |
2024-06-11 | Back to the Color: Learning Depth to Specific Color Transformation for Unsupervised Depth Estimation | Yufan Zhu et.al. | 2406.07741 | link |
2024-06-11 | PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow | Joshua Tokarsky et.al. | 2406.07667 | null |
2024-06-11 | RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Zhechao Wang et.al. | 2406.07032 | null |
2024-06-10 | PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation | Zhenyu Li et.al. | 2406.06679 | null |
2024-06-10 | Visual-Inertial SLAM as Simple as A, B, VINS | Nathaniel Merrill et.al. | 2406.05969 | null |
2024-06-09 | Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks | Zhiyuan Cheng et.al. | 2406.05857 | link |
2024-06-09 | RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering | Rui Zhang et.al. | 2406.05852 | null |
2024-06-07 | Normal-guided Detail-Preserving Neural Implicit Functions for High-Fidelity 3D Surface Reconstruction | Aarya Patel et.al. | 2406.04861 | null |
2024-06-07 | UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection | Yuchao Wang et.al. | 2406.04647 | null |
2024-06-06 | MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation | Ionuţ Grigore et.al. | 2406.04532 | null |
2024-06-06 | Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image | Stanislaw Szymanowicz et.al. | 2406.04343 | link |
2024-06-06 | Neural Surface Reconstruction from Sparse Views Using Epipolar Geometry | Kaichen Zhou et.al. | 2406.04301 | null |
2024-06-04 | VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors | Markus Plack et.al. | 2406.02552 | null |
2024-06-03 | L-MAGIC: Language Model Assisted Generation of Images with Coherence | Zhipeng Cai et.al. | 2406.01843 | link |
2024-06-04 | Learning Temporally Consistent Video Depth from Video Diffusion Priors | Jiahao Shao et.al. | 2406.01493 | null |
2024-06-03 | Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry | Takayuki Kanai et.al. | 2406.00929 | null |
2024-06-01 | MoDGS: Dynamic Gaussian Splatting from Causually-captured Monocular Videos | Qingming Liu et.al. | 2406.00434 | null |
2024-05-30 | Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian | Wei Sun et.al. | 2405.19657 | null |
2024-05-28 | Hybrid Multi-Head Physics-informed Neural Network for Depth Estimation in Terahertz Imaging | Mingjun Xiang et.al. | 2405.18317 | null |
2024-05-27 | Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation | Amir El-Ghoussani et.al. | 2405.17704 | link |
2024-05-27 | Benchmarking and Improving Bird’s Eye View Perception Robustness in Autonomous Driving | Shaoyuan Xie et.al. | 2405.17426 | link |
2024-05-27 | All-day Depth Completion | Vadim Ezhov et.al. | 2405.17315 | null |
2024-05-27 | GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping | Junyoung Seo et.al. | 2405.17251 | link |
2024-05-27 | SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing | Yong-Qiang Mao et.al. | 2405.17140 | null |
2024-05-27 | DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge | Yifan Mao et.al. | 2405.17102 | null |
2024-05-27 | Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation | Steven Landgraf et.al. | 2405.17097 | null |
2024-05-27 | DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation | Mengtan Zhang et.al. | 2405.16960 | link |
2024-05-27 | ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection | Ziying Song et.al. | 2405.16873 | null |
2024-05-27 | Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations | Jingguo Liu et.al. | 2405.16858 | null |
2024-05-26 | Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians | Erik Sandström et.al. | 2405.16544 | link |
2024-05-24 | Transparent Object Depth Completion | Yifan Zhou et.al. | 2405.15299 | null |
2024-05-24 | MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method | Pan Liao et.al. | 2405.15176 | null |
2024-05-23 | EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting | Jiaxu Wang et.al. | 2405.14959 | link |
2024-05-23 | Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks | Xingguang Jiang et.al. | 2405.14520 | null |
2024-05-23 | MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes | Ruiyuan Gao et.al. | 2405.14475 | null |
2024-05-23 | Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning | Zhenyu Wei et.al. | 2405.14195 | null |
2024-05-21 | Cross-spectral Gated-RGB Stereo Depth Estimation | Samuel Brucker et.al. | 2405.12759 | null |
2024-05-20 | Depth Reconstruction with Neural Signed Distance Fields in Structured Light Systems | Rukun Qiao et.al. | 2405.12006 | null |
2024-05-20 | Depth Prompting for Sensor-Agnostic Depth Estimation | Jin-Hwi Park et.al. | 2405.11867 | null |
2024-05-19 | CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs | Zidong Cao et.al. | 2405.11564 | null |
2024-05-18 | Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models | Madhu Vankadari et.al. | 2405.11158 | link |
2024-05-17 | FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation | Fei Wang et.al. | 2405.10885 | link |
2024-05-17 | Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory | Jonas Kälble et.al. | 2405.10575 | link |
2024-05-16 | Towards Task-Compatible Compressible Representations | Anderson de Andrade et.al. | 2405.10244 | link |
2024-05-16 | KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment | Zhengxu Shi et.al. | 2405.09964 | null |
2024-05-14 | CLIP with Quality Captions: A Strong Pretraining for Vision Tasks | Pavan Kumar Anasosalu Vasu et.al. | 2405.08911 | null |
2024-05-14 | The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition | Lingdong Kong et.al. | 2405.08816 | null |
2024-05-14 | EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera | Beilei Cui et.al. | 2405.08672 | link |
2024-05-13 | SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling | Yijun Yuan et.al. | 2405.07847 | null |
2024-05-11 | TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization | Zhen Tan et.al. | 2405.07027 | link |
2024-05-11 | Learning Monocular Depth from Focus with Event Focal Stack | Chenxu Jiang et.al. | 2405.06944 | null |
Optical flow
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-14 | Well-posedness of an optical flow based optimal control formulation for image registration | Johannes Haubner et.al. | 2507.10188 | null |
2025-07-14 | Taming Modern Point Tracking for Speckle Tracking Echocardiography via Impartial Motion | Md Abulkalam Azad et.al. | 2507.10127 | null |
2025-07-11 | Taming generative video models for zero-shot optical flow extraction | Seungwoo Kim et.al. | 2507.09082 | null |
2025-07-11 | An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan | Mengyuan Liu et.al. | 2507.08690 | null |
2025-07-11 | PanMatch: Unleashing the Potential of Large Vision Models for Unified Matching Models | Yongjian Zhang et.al. | 2507.08400 | null |
2025-07-11 | MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion | Jihao Gu et.al. | 2507.08344 | null |
2025-07-10 | X-RAFT: Cross-Modal Non-Rigid Registration of Blue and White Light Neurosurgical Hyperspectral Images | Charlie Budd et.al. | 2507.07747 | null |
2025-07-09 | mmFlux: Crowd Flow Analytics with Commodity mmWave MIMO Radar | Anurag Pallaprolu et.al. | 2507.07331 | null |
2025-07-08 | Learning to Track Any Points from Human Motion | Inès Hyeonsu Kim et.al. | 2507.06233 | null |
2025-07-07 | MoDiT: Learning Highly Consistent 3D Motion Coefficients with Diffusion Transformer for Talking Head Generation | Yucheng Wang et.al. | 2507.05092 | null |
2025-07-07 | TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation | Zonglin Lyu et.al. | 2507.04984 | null |
2025-07-10 | MCFormer: A Multi-Cost-Volume Network and Comprehensive Benchmark for Particle Image Velocimetry | Zicheng Lin et.al. | 2507.04750 | null |
2025-07-06 | FB-Diff: Fourier Basis-guided Diffusion for Temporal Interpolation of 4D Medical Imaging | Xin You et.al. | 2507.04547 | null |
2025-07-03 | Flow-CDNet: A Novel Network for Detecting Both Slow and Fast Changes in Bitemporal Images | Haoxuan Li et.al. | 2507.02307 | null |
2025-07-01 | TRACE: Temporally Reliable Anatomically-Conditioned 3D CT Generation with Enhanced Efficiency | Minye Shao et.al. | 2507.00802 | null |
2025-07-01 | DIJE: Dense Image Jacobian Estimation for Robust Robotic Self-Recognition and Visual Servoing | Yasunori Toshimitsu et.al. | 2507.00446 | null |
2025-06-30 | C3VDv2 – Colonoscopy 3D video dataset with enhanced realism | Mayank V. Golhar et.al. | 2506.24074 | null |
2025-07-03 | PriOr-Flow: Enhancing Primitive Panoramic Optical Flow with Orthogonal View | Longliang Liu et.al. | 2506.23897 | null |
2025-06-30 | Proteus-ID: ID-Consistent and Motion-Coherent Video Customization | Guiyu Zhang et.al. | 2506.23729 | null |
2025-06-29 | MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation | Vladislav Bargatin et.al. | 2506.23151 | null |
2025-06-26 | WAFT: Warping-Alone Field Transforms for Optical Flow | Yihan Wang et.al. | 2506.21526 | null |
2025-06-26 | EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting | Taoyu Wu et.al. | 2506.21420 | null |
2025-06-25 | Feature Hallucination for Self-supervised Action Recognition | Lei Wang et.al. | 2506.20342 | null |
2025-06-24 | Online camera-pose-free stereo endoscopic tissue deformation recovery with tissue-invariant vision-biomechanics consistency | Jiahe Chen et.al. | 2506.19388 | null |
2025-06-23 | Flow-Aware Diffusion for Real-Time VR Restoration: Enhancing Spatiotemporal Coherence and Efficiency | Yitong Zhu et.al. | 2506.18786 | null |
2025-06-24 | Multimodal Fusion SLAM with Fourier Attention | Youjie Zhou et.al. | 2506.18204 | null |
2025-06-19 | EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised Training | Liangjing Shao et.al. | 2506.16017 | link |
2025-06-17 | MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution | Zhiwen Shao et.al. | 2506.14511 | link |
2025-06-21 | Inference-Time Gaze Refinement for Micro-Expression Recognition: Enhancing Event-Based Eye Tracking with Motion-Aware Post-Processing | Nuwan Bandara et.al. | 2506.12524 | link |
2025-06-13 | MambaVSR: Content-Aware Scanning State Space Model for Video Super-Resolution | Linfeng He et.al. | 2506.11768 | null |
2025-06-12 | Post-Training Quantization for Video Matting | Tianrui Zhu et.al. | 2506.10840 | null |
2025-06-10 | UFM: A Simple Path towards Unified Dense Correspondence with Flow | Yuchen Zhang et.al. | 2506.09278 | null |
2025-06-10 | Princeton365: A Diverse Dataset with Accurate Camera Pose | Karhan Kayan et.al. | 2506.09035 | null |
2025-06-09 | Spatio-Temporal State Space Model For Efficient Event-Based Optical Flow | Muhammad Ahmed Humais et.al. | 2506.07878 | link |
2025-06-09 | Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images | Yingping Liang et.al. | 2506.07740 | null |
2025-06-13 | Consistent Video Editing as Flow-Driven Image-to-Video Generation | Ge Wang et.al. | 2506.07713 | null |
2025-06-08 | AllTracker: Efficient Dense Point Tracking at High Resolution | Adam W. Harley et.al. | 2506.07310 | null |
2025-06-08 | GoTrack: Generic 6DoF Object Pose Refinement and Tracking | Van Nguyen Nguyen et.al. | 2506.07155 | null |
2025-06-07 | EV-LayerSegNet: Self-supervised Motion Segmentation using Event Cameras | Youssef Farah et.al. | 2506.06596 | null |
2025-06-06 | 3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model | Hongyan Zhi et.al. | 2506.06199 | link |
2025-06-06 | Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments | Mingrui Li et.al. | 2506.05965 | null |
2025-06-05 | DualX-VSR: Dual Axial Spatial $\times$ Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation | Shuo Cao et.al. | 2506.04830 | null |
2025-06-04 | JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting | Yang Xiao et.al. | 2506.03872 | null |
2025-06-04 | EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation | Daikun Liu et.al. | 2506.03512 | null |
2025-06-03 | Learning Optical Flow Field via Neural Ordinary Differential Equation | Leyla Mirvakhabova et.al. | 2506.03290 | null |
2025-06-03 | LinkTo-Anime: A 2D Animation Optical Flow Dataset from 3D Model Rendering | Xiaoyi Feng et.al. | 2506.02733 | null |
2025-06-03 | LumosFlow: Motion-Guided Long Video Generation | Jiahao Chen et.al. | 2506.02497 | null |
2025-06-02 | MS-RAFT-3D: A Multi-Scale Architecture for Recurrent Image-Based Scene Flow | Jakob Schmid et.al. | 2506.01443 | null |
2025-06-01 | MOOSE: Pay Attention to Temporal Dynamics for Video Understanding via Optical Flows | Hong Nguyen et.al. | 2506.01119 | null |
2025-05-31 | Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline | Zhaoying Wang et.al. | 2506.00546 | null |
2025-05-31 | Improving Optical Flow and Stereo Depth Estimation by Leveraging Uncertainty-Based Learning Difficulties | Jisoo Jeong et.al. | 2506.00324 | null |
2025-05-30 | Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction | Chenyou Fan et.al. | 2505.24156 | null |
2025-05-29 | Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing | Tongtong Su et.al. | 2505.23134 | link |
2025-05-27 | Object Concepts Emerge from Motion | Haoqian Liang et.al. | 2505.21635 | null |
2025-05-26 | A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking | Zixiang Zhao et.al. | 2505.19858 | null |
2025-05-23 | Brightness-Invariant Tracking Estimation in Tagged MRI | Zhangxing Bian et.al. | 2505.18365 | null |
2025-05-31 | CTRL-GS: Cascaded Temporal Residue Learning for 4D Gaussian Splatting | Karly Hou et.al. | 2505.18306 | null |
2025-05-23 | Real-time Traffic Accident Anticipation with Feature Reuse | Inpyo Song et.al. | 2505.17449 | null |
2025-05-22 | Efficient Correlation Volume Sampling for Ultra-High-Resolution Optical Flow Estimation | Karlis Martins Briedis et.al. | 2505.16942 | null |
2025-05-22 | V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation | Hanyue Lou et.al. | 2505.16797 | link |
2025-05-21 | SENSE – Sensor-Enhanced Neural Shear Stress Estimation for Quantitative Oilfilm Visualizations | Lennart Rohlfs et.al. | 2505.15697 | null |
2025-05-19 | RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers | Ahmet Berke Gokmen et.al. | 2505.13344 | null |
2025-05-19 | eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks | Jad Mansour et.al. | 2505.13309 | null |
2025-05-19 | FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching | Alp Eren Sari et.al. | 2505.13174 | null |
2025-05-19 | Just Dance with $π$ ! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection | Snehashis Majhi et.al. | 2505.13123 | null |
2025-05-17 | MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos | Hongyi Zhou et.al. | 2505.11868 | null |
2025-05-16 | Planar Velocity Estimation for Fast-Moving Mobile Robots Using Event-Based Optical Flow | Liam Boyle et.al. | 2505.11116 | null |
2025-05-15 | TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation | Manthan Patel et.al. | 2505.10696 | null |
2025-05-15 | A label-free sub-diffractive technique for 3D intracellular tomography using thermally induced convection currents | Jayesh Goswami et.al. | 2505.10112 | null |
2025-05-14 | FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling | Yue Wen et.al. | 2505.09406 | null |
2025-05-14 | RobustSpring: Benchmarking Robustness to Image Corruptions for Optical Flow, Scene Flow and Stereo | Jenny Schmalfuss et.al. | 2505.09368 | null |
2025-05-13 | Reinforcement Learning meets Masked Video Modeling : Trajectory-Guided Adaptive Token Selection | Ayush K. Rai et.al. | 2505.08561 | null |
2025-05-13 | TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection | Wenkui Yang et.al. | 2505.08437 | link |
2025-05-13 | EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation | Hanle Zheng et.al. | 2505.08235 | null |
2025-05-13 | Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images | Ziteng Liu et.al. | 2505.08178 | null |
2025-05-12 | Asynchronous Multi-Object Tracking with an Event Camera | Angus Apps et.al. | 2505.08126 | link |
2025-05-11 | MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception | Zhengye Zhang et.al. | 2505.07007 | link |
2025-05-13 | Detection of Moving Objects Using Self-motion Constraints on Optic Flow | Hope Lutwak et.al. | 2505.06686 | null |
2025-05-08 | Nonlinear Motion-Guided and Spatio-Temporal Aware Network for Unsupervised Event-Based Optical Flow | Zuntao Liu et.al. | 2505.05089 | null |
2025-05-08 | A Simple Detector with Frame Dynamics is a Strong Tracker | Chenxu Peng et.al. | 2505.04917 | link |
2025-05-06 | Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment | João Alves et.al. | 2505.03554 | link |
2025-05-06 | TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion | Haoyue Liu et.al. | 2505.03116 | null |
2025-05-04 | Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance | Yingkai Zhang et.al. | 2505.02109 | null |
2025-05-02 | Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation | Zhen Yao et.al. | 2505.01548 | link |
2025-04-30 | AnimalMotionCLIP: Embedding motion in CLIP for Animal Behavior Analysis | Enmin Zhong et.al. | 2505.00569 | null |
2025-04-29 | LPVIMO-SAM: Tightly-coupled LiDAR/Polarization Vision/Inertial/Magnetometer/Optical Flow Odometry via Smoothing and Mapping | Derui Shan et.al. | 2504.20380 | null |
2025-04-25 | RapidPIV: Full Flow-Field kHz PIV for Real-Time Display and Control | Scott A. Bollt et.al. | 2504.17987 | null |
2025-04-22 | Motion-Enhanced Nonlocal Similarity Implicit Neural Representation for Infrared Dim and Small Target Detection | Pei Liu et.al. | 2504.15665 | null |
2025-04-22 | DiTPainter: Efficient Video Inpainting with Diffusion Transformers | Xian Wu et.al. | 2504.15661 | null |
2025-04-21 | PIV-FlowDiffuser:Transfer-learning-based denoising diffusion models for PIV | Qianyu Zhu et.al. | 2504.14952 | link |
2025-04-21 | Multimodal Non-Semantic Feature Fusion for Predicting Segment Access Frequency in Lecture Archives | Ruozhu Sheng et.al. | 2504.14927 | null |
2025-04-20 | FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models | Kuanting Wu et.al. | 2504.14535 | null |
2025-04-18 | Neural Ganglion Sensors: Learning Task-specific Event Cameras Inspired by the Neural Circuit of the Human Retina | Haley M. So et.al. | 2504.13457 | null |
2025-04-18 | MicroFlow: Domain-Specific Optical Flow for Ground Deformation Estimation in Seismic Events | Juliette Bertrand et.al. | 2504.13452 | null |
2025-04-18 | Event-Enhanced Blurry Video Super-Resolution | Dachun Kai et.al. | 2504.13042 | link |
2025-04-17 | SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration | Xi Tong et.al. | 2504.12869 | null |
2025-04-17 | SAM-Based Building Change Detection with Distribution-Aware Fourier Adaptation and Edge-Constrained Warping | Yun-Cheng Li et.al. | 2504.12619 | null |
2025-04-14 | Perturbed State Space Feature Encoders for Optical Flow with Event Cameras | Gokul Raju Govinda Raju et.al. | 2504.10669 | null |
2025-04-15 | WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs | Nguyen Ngoc Dat et.al. | 2504.10165 | null |
2025-04-11 | Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review | Claudio Cimarelli et.al. | 2504.08588 | null |
2025-04-10 | Extending Visual Dynamics for Video-to-Music Generation | Xiaohao Liu et.al. | 2504.07594 | null |
2025-04-08 | Intrinsic Saliency Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation | Xiangyu Zheng et.al. | 2504.05904 | null |
2025-04-07 | Towards Efficient Real-Time Video Motion Transfer via Generative Time Series Modeling | Tasmiah Haque et.al. | 2504.05537 | null |
2025-04-06 | FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency | Shiyan Liu et.al. | 2504.04427 | null |
2025-04-05 | Simultaneous Motion And Noise Estimation with Event Cameras | Shintaro Shiba et.al. | 2504.04029 | null |
2025-04-04 | 3D Scene Understanding Through Local Random Access Sequence Modeling | Wanhee Lee et.al. | 2504.03875 | null |
2025-04-03 | L-LBVC: Long-Term Motion Estimation and Prediction for Learned Bi-Directional Video Compression | Yongqi Zhai et.al. | 2504.02560 | null |
2025-04-01 | Beyond Wide-Angle Images: Unsupervised Video Portrait Correction via Spatiotemporal Diffusion Adaptation | Wenbo Nie et.al. | 2504.00401 | null |
2025-04-01 | Hierarchical Flow Diffusion for Efficient Frame Interpolation | Yang Hai et.al. | 2504.00380 | null |
2025-03-31 | Easi3R: Estimating Disentangled Motion from DUSt3R Without Training | Xingyu Chen et.al. | 2503.24391 | link |
2025-04-03 | Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey | Haoyang Wang et.al. | 2503.22943 | null |
2025-03-28 | Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision | Rulin Zhou et.al. | 2503.22394 | null |
2025-03-28 | Segment Any Motion in Videos | Nan Huang et.al. | 2503.22268 | null |
2025-03-28 | Synergistic Bleeding Region and Point Detection in Surgical Videos | Jialun Pei et.al. | 2503.22174 | null |
2025-03-27 | VADMamba: Exploring State Space Models for Fast Video Anomaly Detection | Jiahao Lyu et.al. | 2503.21169 | link |
2025-03-27 | Can Video Diffusion Model Reconstruct 4D Geometry? | Jinjie Mai et.al. | 2503.21082 | null |
2025-03-25 | Burst Image Super-Resolution with Mamba | Ozan Unal et.al. | 2503.19634 | null |
2025-03-24 | NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting | Yulong Zheng et.al. | 2503.18794 | null |
2025-03-27 | MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion | Yikun Ma et.al. | 2503.17695 | null |
2025-03-21 | Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks | Bhishma Dedhia et.al. | 2503.17539 | null |
2025-03-21 | Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras | Shuang Guo et.al. | 2503.17262 | link |
2025-03-20 | 4D Gaussian Splatting SLAM | Yanyan Li et.al. | 2503.16710 | null |
2025-03-20 | EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation | Zihao Zhang et.al. | 2503.15831 | null |
2025-03-19 | DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework | Henrique Morimitsu et.al. | 2503.14880 | link |
2025-03-19 | Temporal-Consistent Video Restoration with Pre-trained Diffusion Models | Hengkang Wang et.al. | 2503.14863 | null |
2025-03-18 | GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics | Tingyang Xiao et.al. | 2503.14247 | link |
2025-03-17 | UCF-Crime-DVS: A Novel Event-Based Dataset for Video Anomaly Detection with Spiking Neural Networks | Yuanbin Qian et.al. | 2503.12905 | link |
2025-03-16 | ProbDiffFlow: An Efficient Learning-Free Framework for Probabilistic Single-Image Optical Flow Estimation | Mo Zhou et.al. | 2503.12348 | null |
2025-03-17 | EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation | Zengyu Wan et.al. | 2503.11371 | null |
2025-03-14 | FG-DFPN: Flow Guided Deformable Frame Prediction Network | M. Akın Yılmaz et.al. | 2503.11343 | link |
2025-03-14 | Zero-TIG: Temporal Consistency-Aware Zero-Shot Illumination-Guided Low-light Video Enhancement | Yini Li et.al. | 2503.11175 | link |
2025-03-14 | A High-Accuracy Alignment Approach for Solar Images of Different Wavelengths | Yun Wang et.al. | 2503.11035 | null |
2025-03-13 | Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations | Xunzhi Zheng et.al. | 2503.10464 | null |
2025-03-13 | Markerless Tracking-Based Registration for Medical Image Motion Correction | Luisa Neubig et.al. | 2503.10260 | null |
2025-03-13 | ST-FlowNet: An Efficient Spiking Neural Network for Event-Based Optical Flow Estimation | Hongze Sun et.al. | 2503.10195 | null |
2025-03-12 | Investigation of Frame Differences as Motion Cues for Video Object Segmentation | Sota Kawamura et.al. | 2503.09132 | null |
2025-03-11 | Feature Alignment with Equivariant Convolutions for Burst Image Super-Resolution | Xinyi Liu et.al. | 2503.08300 | null |
2025-03-10 | MambaFlow: A Mamba-Centric Architecture for End-to-End Optical Flow Estimation | Juntian Du et.al. | 2503.07046 | null |
2025-03-11 | Bridge Frame and Event: Common Spatiotemporal Fusion for High-Dynamic Scene Optical Flow | Hanyu Zhou et.al. | 2503.06992 | null |
2025-03-09 | Online Dense Point Tracking with Streaming Memory | Qiaole Dong et.al. | 2503.06471 | link |
2025-03-10 | VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control | Yuxuan Bian et.al. | 2503.05639 | link |
2025-03-07 | Stereo Any Video: Temporally Consistent Stereo Matching | Junpeng Jing et.al. | 2503.05549 | null |
2025-03-06 | Implicit Neural Representation for Video and Image Super-Resolution | Mary Aiyetigbo et.al. | 2503.04665 | null |
2025-03-09 | ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem | Yu-Hsi Chen et.al. | 2503.04500 | link |
2025-03-05 | Video Super-Resolution: All You Need is a Video Diffusion Model | Zhihao Zhan et.al. | 2503.03355 | null |
2025-03-05 | BAT: Learning Event-based Optical Flow with Bidirectional Adaptive Temporal Correlation | Gangwei Xu et.al. | 2503.03256 | null |
2025-03-05 | Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria | Asma A. Almutairi et.al. | 2503.03100 | null |
2025-03-04 | Anomaly detection in non-stationary videos using time-recursive differencing network based prediction | Gargi V. Pillai et.al. | 2503.02234 | null |
2025-03-03 | MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features | Chao Ye et.al. | 2503.01571 | link |
2025-03-03 | AI-Driven Relocation Tracking in Dynamic Kitchen Environments | Arash Nasr Esfahani et.al. | 2503.01547 | link |
2025-03-02 | Vid2Fluid: 3D Dynamic Fluid Assets from Single-View Videos with Generative Gaussian Splatting | Zhiwei Zhao et.al. | 2503.00868 | null |
2025-02-28 | EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration | Kuangyi Chen et.al. | 2503.00167 | link |
2025-02-21 | Peripheral Teleportation: A Rest Frame Design to Mitigate Cybersickness During Virtual Locomotion | Tongyu Nie et.al. | 2502.15227 | null |
2025-02-20 | Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance | Meng Wang et.al. | 2502.14520 | null |
2025-02-18 | L4P: Low-Level 4D Vision Perception Unified | Abhishek Badki et.al. | 2502.13078 | null |
2025-02-18 | Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection | Zijian Cao et.al. | 2502.12735 | null |
2025-02-17 | Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance | Jixiang Chen et.al. | 2502.11971 | null |
2025-02-17 | Stonefish: Supporting Machine Learning Research in Marine Robotics | Michele Grimaldi et.al. | 2502.11887 | link |
2025-02-15 | Super Resolution image reconstructs via total variation-based image deconvolution: a majorization-minimization approach | Mouhamad Chehaitly et.al. | 2502.10876 | null |
2025-02-15 | Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video | Runyang Feng et.al. | 2502.10616 | null |
2025-02-11 | A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision | Hao Ai et.al. | 2502.10444 | null |
2025-02-12 | FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis | Wonjoon Jin et.al. | 2502.08244 | null |
2025-02-11 | Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors | Lin-Zhuo Chen et.al. | 2502.07615 | null |
2025-02-18 | A Physical Coherence Benchmark for Evaluating Video Generation Models via Optical Flow-guided Frame Prediction | Yongfan Chen et.al. | 2502.05503 | link |
2025-02-05 | MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent | Xinyao Liao et.al. | 2502.03207 | null |
2025-02-03 | XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications | Shangjin Zhai et.al. | 2502.01297 | null |
2025-01-28 | Image Velocimetry using Direct Displacement Field estimation with Neural Networks for Fluids | Efraín Magaña et.al. | 2501.18641 | link |
2025-02-02 | REMOTE: Real-time Ego-motion Tracking for Various Endoscopes via Multimodal Visual Feature Learning | Liangjing Shao et.al. | 2501.18124 | null |
2025-01-28 | Improved Encoding for Overfitted Video Codecs | Thomas Leguay et.al. | 2501.16976 | null |
2025-01-28 | Assessing ultrasonic and optical flow velocimetry in a millifluidic device using oil-in-water emulsions as blood mimicking fluid | Estelle Lu et.al. | 2501.16959 | null |
2025-01-28 | Extending Information Bottleneck Attribution to Video Sequences | Veronika Solopova et.al. | 2501.16889 | link |
2025-02-04 | Event-Based Adaptive Koopman Framework for Optic Flow-Guided Landing on Moving Platforms | Bazeela Banday et.al. | 2501.16868 | null |
2025-01-23 | GC-ConsFlow: Leveraging Optical Flow Residuals and Global Context for Robust Deepfake Detection | Jiaxin Chen et.al. | 2501.13435 | null |
2025-01-22 | MONA: Moving Object Detection from Videos Shot by Dynamic Camera | Boxun Hu et.al. | 2501.13183 | null |
2025-01-22 | Machine Learning Modeling for Multi-order Human Visual Motion Processing | Zitang Sun et.al. | 2501.12810 | link |
2025-01-21 | Efficient Dynamic Image Reconstruction with motion estimation | Toluwani Okunola et.al. | 2501.12497 | null |
2025-01-21 | Learning segmentation from point trajectories | Laurynas Karazija et.al. | 2501.12392 | link |
2025-01-22 | Video Depth Anything: Consistent Depth Estimation for Super-Long Videos | Sili Chen et.al. | 2501.12375 | null |
2025-01-21 | VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models | Chaohao Xie et.al. | 2501.12267 | null |
2025-01-20 | Event-based vision for egomotion estimation using precise event timing | Hugh Greatorex et.al. | 2501.11554 | null |
2025-01-19 | BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution | Eunjin Kim et.al. | 2501.11043 | link |
2025-01-25 | Quadcopter Position Hold Function using Optical Flow in a Smartphone-based Flight Computer | Noel P. Caliston et.al. | 2501.10752 | null |
2025-01-18 | Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection | Yifang Xu et.al. | 2501.10692 | null |
2025-01-17 | DiffuEraser: A Diffusion Model for Video Inpainting | Xiaowen Li et.al. | 2501.10018 | link |
2025-01-16 | VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization | Zixun Fang et.al. | 2501.09499 | null |
2025-01-16 | Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise | Ryan Burgert et.al. | 2501.08331 | link |
2025-01-13 | Aligning First, Then Fusing: A Novel Weakly Supervised Multimodal Violence Detection Method | Wenping Jin et.al. | 2501.07496 | link |
2025-01-08 | Edit as You See: Image-guided Video Editing via Masked Motion Modeling | Zhi-Lin Huang et.al. | 2501.04325 | null |
2025-01-06 | TinySense: A Lighter Weight and More Power-efficient Avionics System for Flying Insect-scale Robots | Zhitao Yu et.al. | 2501.03416 | null |
2025-01-06 | ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking | Tingyang Zhang et.al. | 2501.03220 | null |
2025-01-05 | AHMSA-Net: Adaptive Hierarchical Multi-Scale Attention Network for Micro-Expression Recognition | Lijun Zhang et.al. | 2501.02539 | null |
2025-01-01 | Spatially-guided Temporal Aggregation for Robust Event-RGB Optical Flow Estimation | Qianang Zhou et.al. | 2501.00838 | null |
2025-01-05 | How Honeybees Perceive and Traverse Apertures | Timothy Jakobi et.al. | 2501.00646 | null |
2024-12-29 | Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition | Xiu-Feng Huang et.al. | 2412.20327 | link |
2024-12-28 | Enhancing Marine Debris Acoustic Monitoring by Optical Flow-Based Motion Vector Analysis | Xiaoteng Zhou et.al. | 2412.20085 | null |
2024-12-27 | Zero-shot Hazard Identification in Autonomous Driving: A Case Study on the COOOL Benchmark | Lukas Picek et.al. | 2412.19944 | null |
2024-12-27 | Generalized Uncertainty-Based Evidential Fusion with Hybrid Multi-Head Attention for Weak-Supervised Temporal Action Localization | Yuanpeng He et.al. | 2412.19418 | link |
2025-01-03 | Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry | Zhaoxing Zhang et.al. | 2412.16923 | link |
2024-12-20 | SOUS VIDE: Cooking Visual Drone Navigation Policies in a Gaussian Splatting Vacuum | JunEn Low et.al. | 2412.16346 | null |
2024-12-20 | MotiF: Making Text Count in Image Animation with Motion Focal Loss | Shijie Wang et.al. | 2412.16153 | null |
2024-12-18 | Dynamic semantic VSLAM with known and unknown objects | Sanghyoup Gu et.al. | 2412.14359 | null |
2024-12-18 | SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation | Tong Chen et.al. | 2412.14018 | null |
2024-12-17 | CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices | Andrei Znobishchev et.al. | 2412.13273 | null |
2024-12-17 | Complex extension of optical flow and its practical evaluation for undersampled dynamic MRI | Matthias J. Ehrhardt et.al. | 2412.12711 | null |
2024-12-17 | GG-SSMs: Graph-Generating State Space Models | Nikola Zubić et.al. | 2412.12423 | null |
2024-12-16 | Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising | Zikang Chen et.al. | 2412.11820 | link |
2024-12-16 | Exploring More from Multiple Gait Modalities for Human Identification | Dongyang Jin et.al. | 2412.11495 | link |
2024-12-16 | BiM-VFI: directional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions | Wonyong Seo et.al. | 2412.11365 | null |
2024-12-15 | Learning Normal Flow Directly From Event Neighborhoods | Dehao Yuan et.al. | 2412.11284 | link |
2024-12-13 | BatDeck – Ultra Low-power Ultrasonic Ego-velocity Estimation and Obstacle Avoidance on Nano-drones | Hanna Müller et.al. | 2412.10048 | null |
2024-12-12 | A Plug-and-Play Algorithm for 3D Video Super-Resolution of Single-Photon LiDAR data | Alice Ruget et.al. | 2412.09427 | null |
2024-12-12 | eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction | Jad Mansour et.al. | 2412.09209 | link |
2024-12-12 | ResFlow: Fine-tuning Residual Optical Flow for Event-based High Temporal Resolution Motion Estimation | Qianang Zhou et.al. | 2412.09105 | null |
2024-12-12 | Mojito: Motion Trajectory and Intensity Control for Video Generation | Xuehai He et.al. | 2412.08948 | null |
2024-12-12 | Labits: Layered Bidirectional Time Surfaces Representation for Event Camera-based Continuous Dense Trajectory Estimation | Zhongyang Zhang et.al. | 2412.08849 | null |
2024-12-11 | Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation | Zhigang Cen et.al. | 2412.08034 | null |
2024-12-10 | EvRepSL: Event-Stream Representation via Self-Supervised Learning for Event-Based Vision | Qiang Qu et.al. | 2412.07080 | link |
2024-12-09 | Local Attention Transformers for High-Detail Optical Flow Upsampling | Alexander Gielisse et.al. | 2412.06439 | null |
2024-12-08 | MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation | Shuwei Shi et.al. | 2412.05848 | null |
2024-12-05 | Deep Learning and Hybrid Approaches for Dynamic Scene Analysis, Object Detection and Motion Tracking | Shahran Rahman Alve et.al. | 2412.05331 | null |
2024-12-04 | Advancing Auto-Regressive Continuation for Video Frames | Ruibo Ming et.al. | 2412.03758 | null |
2024-12-03 | Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Hiroki Furuta et.al. | 2412.02617 | null |
2024-12-02 | STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation | Sunghun Yang et.al. | 2412.01090 | null |
2024-12-01 | Advanced Video Inpainting Using Optical Flow-Guided Efficient Diffusion | Bohai Gu et.al. | 2412.00857 | null |
2024-11-30 | A conditional Generative Adversarial network model for the Weather4Cast 2024 Challenge | Atharva Deshpande et.al. | 2412.00451 | null |
2024-11-30 | Hybrid Local-Global Context Learning for Neural Video Compression | Yongqi Zhai et.al. | 2412.00446 | null |
2024-11-27 | RoMo: Robust Motion Segmentation Improves Structure from Motion | Lily Goli et.al. | 2411.18650 | null |
2024-11-27 | ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching | Yangrui Dong et.al. | 2411.18174 | null |
2024-11-27 | An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition | Song-Jiang Lai et.al. | 2411.18002 | null |
2024-11-26 | Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors | Zhengfei Kuang et.al. | 2411.17249 | null |
2024-11-25 | Context-Aware Input Orchestration for Video Inpainting | Hoyoung Kim et.al. | 2411.16926 | null |
2024-11-22 | TSkips: Efficiency Through Explicit Temporal Delay Connections in Spiking Neural Networks | Prajna G. Malettira et.al. | 2411.16711 | null |
2024-11-24 | PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments | Haoang Li et.al. | 2411.15800 | null |
2024-11-23 | Optical-Flow Guided Prompt Optimization for Coherent Video Generation | Hyelin Nam et.al. | 2411.15540 | null |
2024-11-22 | Benchmarking the Robustness of Optical Flow Estimation to Corruptions | Zhonghua Yi et.al. | 2411.14865 | link |
2024-11-21 | EdgeFlowNet: 100FPS@1W Dense Optical Flow For Tiny Mobile Robots | Sai Ramana Kiran Pinnama Raju et.al. | 2411.14576 | null |
2024-11-21 | Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation | Zhuoman Liu et.al. | 2411.14423 | null |
2024-11-21 | Transforming Static Images Using Generative Models for Video Salient Object Detection | Suhwan Cho et.al. | 2411.13975 | link |
2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
2024-11-20 | DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild | Weicai Ye et.al. | 2411.13291 | null |
2024-11-20 | Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark | Bing Cao et.al. | 2411.13056 | null |
2024-11-16 | AnimateAnything: Consistent and Controllable Animation for Video Generation | Guojun Lei et.al. | 2411.10836 | null |
2024-11-15 | OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models | Mathis Koroglu et.al. | 2411.10501 | null |
2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
2024-11-14 | MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation | Jonas Serych et.al. | 2411.09551 | link |
2024-11-12 | DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection | Shawn Li et.al. | 2411.08227 | link |
2024-11-17 | Scaling Properties of Diffusion Models for Perceptual Tasks | Rahul Ravishankar et.al. | 2411.08034 | null |
2024-11-11 | Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters | Corwin Grant Jeon MacMillan et.al. | 2411.05225 | null |
2024-11-07 | Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera | Yu Hu et.al. | 2411.04413 | null |
2024-11-07 | AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation | Mingyu Sheng et.al. | 2411.03695 | link |
2024-11-04 | Neural optical flow for planar and stereo PIV | Andrew I. Masker et.al. | 2411.02373 | null |
2024-11-03 | Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation | Zhenbin Wang et.al. | 2411.01647 | null |
2024-11-03 | Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli | Matthias Tangemann et.al. | 2411.01505 | link |
2024-11-02 | Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks | Aarjav Kavathia et.al. | 2411.01348 | null |
2024-10-29 | Motion Graph Unleashed: A Novel Approach to Video Prediction | Yiqi Zhong et.al. | 2410.22288 | link |
2024-10-29 | FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives | Qizhi Chen et.al. | 2410.22070 | null |
2024-10-29 | Investigation of moving objects through atmospheric turbulence from a non-stationary platform | Nicholas Ferrante et.al. | 2410.21639 | null |
2024-10-27 | CloudCast – Total Cloud Cover Nowcasting with Machine Learning | Mikko Partio et.al. | 2410.21329 | link |
2024-10-28 | Enhancing Action Recognition by Leveraging the Hierarchical Structure of Actions and Textual Context | Manuel Benavent-Lledo et.al. | 2410.21275 | link |
2024-10-27 | BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events | Yijin Li et.al. | 2410.20451 | null |
2024-10-26 | UniVST: A Unified Framework for Training-free Localized Video Style Transfer | Quanjian Song et.al. | 2410.20084 | link |
2024-10-23 | Separating edges from microstructure in X-ray dark-field imaging: Evolving and devolving perspectives via the X-ray Fokker-Planck equation | Samantha J. Alloo et.al. | 2410.18317 | null |
2024-10-16 | Imagine2Servo: Intelligent Visual Servoing with Diffusion-Driven Goal Generation for Robotic Tasks | Pranjali Pathre et.al. | 2410.12432 | link |
2024-10-14 | Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world | Han Ling et.al. | 2410.10453 | link |
2024-10-12 | A Collaborative Team of UAV-Hexapod for an Autonomous Retrieval System in GNSS-Denied Maritime Environments | Seungwook Lee et.al. | 2410.09606 | null |
2024-10-12 | Robust Optical Flow Computation: A Higher-Order Differential Approach | Chanuka Algama et.al. | 2410.09563 | null |
2024-10-10 | MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting | Ruijie Zhu et.al. | 2410.07707 | link |
2024-10-09 | Z-upscaling: Optical Flow Guided Frame Interpolation for Isotropic Reconstruction of 3D EM Volumes | Fisseha A. Ferede et.al. | 2410.07043 | link |
2024-10-08 | Future frame prediction in chest cine MR imaging using the PCA respiratory motion model and dynamically trained recurrent neural networks | Michel Pohl et.al. | 2410.05882 | null |
2024-10-01 | Descriptor: Face Detection Dataset for Programmable Threshold-Based Sparse-Vision | Riadul Islam et.al. | 2410.00368 | link |
2024-10-08 | DressRecon: Freeform 4D Human Reconstruction from Monocular Video | Jeff Tan et.al. | 2409.20563 | null |
2024-10-06 | Visual collective behaviors on spherical robots | Diego Castro et.al. | 2409.20539 | null |
2024-09-26 | Subjective and Objective Quality-of-Experience Evaluation Study for Live Video Streaming | Zehao Zhu et.al. | 2409.17596 | null |
2024-09-26 | TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene | Sandika Biswas et.al. | 2409.17459 | link |
2024-09-25 | EventHDR: from Event to High-Speed HDR Videos and Beyond | Yunhao Zou et.al. | 2409.17029 | null |
2024-09-25 | Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation | Hanyu Zhou et.al. | 2409.17001 | null |
2024-09-25 | Pose-Guided Fine-Grained Sign Language Video Generation | Tongkai Shi et.al. | 2409.16709 | null |
2024-09-21 | BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow | EungGu Kang et.al. | 2409.15384 | link |
2024-09-23 | Skills Made to Order: Efficient Acquisition of Robot Cooking Skills Guided by Multiple Forms of Internet Data | Mrinal Verghese et.al. | 2409.15172 | null |
2024-09-22 | Secrets of Edge-Informed Contrast Maximization for Event-Based Vision | Pritam P. Karmokar et.al. | 2409.14611 | null |
2024-09-18 | Optical Flow Matters: an Empirical Comparative Study on Fusing Monocular Extracted Modalities for Better Steering | Fouad Makiyeh et.al. | 2409.12716 | null |
2024-09-16 | ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video | Han Ling et.al. | 2409.12202 | link |
2024-09-16 | Continual Learning of Conjugated Visual Representations through Higher-order Motion Flows | Simone Marullo et.al. | 2409.11441 | null |
2024-09-17 | Training Datasets Generation for Machine Learning: Application to Vision Based Navigation | Jérémy Lebreton et.al. | 2409.11383 | null |
2024-09-17 | Multimodal Attention-Enhanced Feature Fusion-based Weekly Supervised Anomaly Violence Detection | Yuta Kaneko et.al. | 2409.11223 | null |
2024-09-16 | SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning | Amogh Joshi et.al. | 2409.09990 | null |
2024-09-15 | Dynamic Layer Detection of a Thin Silk Cloth using DenseTact Optical Tactile Sensors | Ankush Kundan Dhawan et.al. | 2409.09849 | null |
2024-09-15 | Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings | Oriel Perl et.al. | 2409.09841 | null |
2024-09-13 | InstantDrag: Improving Interactivity in Drag-based Image Editing | Joonghyuk Shin et.al. | 2409.08857 | null |
2024-09-11 | Violence detection in videos using deep recurrent and convolutional neural networks | Abdarahmane Traoré et.al. | 2409.07581 | null |
2024-09-11 | Distance Measurement for UAVs in Deep Hazardous Tunnels | Vishal Choudhary et.al. | 2409.07160 | null |
2024-09-09 | LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow | Hongyu Wen et.al. | 2409.05688 | null |
2024-09-11 | Real-Time Human Action Recognition on Embedded Platforms | Ruiqi Wang et.al. | 2409.05662 | null |
2024-09-15 | HMAFlow: Learning More Accurate Optical Flow via Hierarchical Motion Field Alignment | Dianbo Ma et.al. | 2409.05531 | link |
2024-09-09 | FacialFlowNet: Advancing Facial Optical Flow Estimation with a Diverse Dataset and a Decomposed Model | Jianzhi Lu et.al. | 2409.05396 | link |
2024-09-06 | Hybrid Cost Volume for Memory-Efficient Optical Flow | Yang Zhao et.al. | 2409.04243 | link |
2024-09-06 | SDformerFlow: Spatiotemporal swin spikeformer for event-based optical flow estimation | Yi Tian et.al. | 2409.04082 | link |
2024-09-03 | DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos | Wenbo Hu et.al. | 2409.02095 | link |
2024-08-29 | FlowRetrieval: Flow-Guided Data Retrieval for Few-Shot Imitation Learning | Li-Heng Lin et.al. | 2408.16944 | null |
2024-08-29 | Estimating Dynamic Flow Features in Groups of Tracked Objects | Tanner D. Harms et.al. | 2408.16190 | null |
2024-08-28 | MMASD+: A Novel Dataset for Privacy-Preserving Behavior Analysis of Children with Autism Spectrum Disorder | Pavan Uttej Ravva et.al. | 2408.15077 | link |
2024-08-21 | Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars | Zhihao Lin et.al. | 2408.11582 | null |
2024-08-21 | SelfDRSC++: Self-Supervised Learning for Dual Reversed Rolling Shutter Correction | Wei Shang et.al. | 2408.11411 | link |
2024-09-02 | Video Diffusion Models are Strong Video Inpainter | Minhyeok Lee et.al. | 2408.11402 | null |
2024-08-20 | PooDLe: Pooled and dense self-supervised learning from naturalistic videos | Alex N. Wang et.al. | 2408.11208 | null |
2024-08-21 | NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices | Zhiyong Zhang et.al. | 2408.10161 | link |
2024-08-19 | Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data | Tao Yang et.al. | 2408.10119 | null |
2024-08-18 | Contactless seismocardiography via Gunnar-Farneback optical flow | Mohammad Muntasir Rahman et.al. | 2408.09512 | null |
2024-08-18 | OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare | Chen Long-fei et.al. | 2408.09409 | null |
2024-08-16 | CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving | Shihan Peng et.al. | 2408.08500 | null |
2024-08-15 | MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing | Chenjie Cao et.al. | 2408.08000 | null |
2024-08-12 | FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework | Lukas Meyer et.al. | 2408.06190 | link |
2024-08-12 | Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network | Kailai Sun et.al. | 2408.05877 | null |
2024-08-11 | Egocentric Vision Language Planning | Zhirui Fang et.al. | 2408.05802 | null |
2024-08-08 | KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance | Jingxian Lu et.al. | 2408.02912 | null |
2024-08-02 | NOLO: Navigate Only Look Once | Bohan Zhou et.al. | 2408.01384 | null |
2024-07-31 | RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining | Hongtao Wu et.al. | 2407.21773 | link |
2024-07-31 | Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching | Pengjie Zhang et.al. | 2407.21735 | null |
2024-07-30 | SpotFormer: Multi-Scale Spatio-Temporal Transformer for Facial Expression Spotting | Yicheng Deng et.al. | 2407.20799 | null |
2024-07-29 | Event-based Optical Flow on Neuromorphic Processor: ANN vs. SNN Comparison based on Activation Sparsification | Yingfu Xu et.al. | 2407.20421 | link |
2024-07-26 | Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations | Zipeng Wang et.al. | 2407.18500 | null |
2024-07-23 | Occlusion-Aware 3D Motion Interpretation for Abnormal Behavior Detection | Su Li et.al. | 2407.16788 | null |
2024-07-23 | SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging | Lingtong Kong et.al. | 2407.16308 | link |
2024-07-18 | Many Perception Tasks are Highly Redundant Functions of their Input Data | Rahul Ramesh et.al. | 2407.13841 | null |
2024-07-18 | Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain | Bach Nguyen Gia et.al. | 2407.13159 | link |
2024-07-17 | Fusion Flow-enhanced Graph Pooling Residual Networks for Unmanned Aerial Vehicles Surveillance in Day and Night Dual Visions | Alam Noor et.al. | 2407.12647 | null |
2024-07-16 | Improving Unsupervised Video Object Segmentation via Fake Flow Generation | Suhwan Cho et.al. | 2407.11714 | link |
2024-07-16 | ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment | Xinyi Wang et.al. | 2407.11496 | link |
2024-07-16 | Hybrid physics-AI outperforms numerical weather prediction for extreme precipitation nowcasting | Puja Das et.al. | 2407.11317 | null |
2024-07-15 | Temporal Event Stereo via Joint Learning with Stereoscopic Flow | Hoonhee Cho et.al. | 2407.10831 | link |
2024-07-15 | Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation | Friedhelm Hamann et.al. | 2407.10802 | link |
2024-07-14 | Research Experience of an Undergraduate Student in Computer Vision and Robotics | Ayush V. Gowda et.al. | 2407.10044 | null |
2024-07-13 | ScaleRAFT: Cross-Scale Recurrent All-Pairs Field Transforms for 3D Motion Estimation | Han Ling et.al. | 2407.09797 | link |
2024-07-11 | Generalizable Implicit Motion Modeling for Video Frame Interpolation | Zujin Guo et.al. | 2407.08680 | null |
2024-07-11 | Event-based vision on FPGAs – a survey | Tomasz Kryjak et.al. | 2407.08356 | null |
2024-07-10 | Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction | Yili Liu et.al. | 2407.07587 | null |
2024-07-05 | Unsupervised 4D Cardiac Motion Tracking with Spatiotemporal Optical Flow Networks | Long Teng et.al. | 2407.04663 | null |
2024-07-04 | CardioSpectrum: Comprehensive Myocardium Motion Analysis with 3D Deep Learning and Geometric Insights | Shahar Zuler et.al. | 2407.03794 | link |
2024-07-03 | Towards High Resolution Real-Time Optical Flow Particle Image Velocimetry | Juan Pimienta et.al. | 2407.03057 | null |
2024-07-03 | Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jiaxin Guo et.al. | 2407.02918 | link |
2024-07-01 | DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models | Chang-Han Yeh et.al. | 2407.01519 | link |
2024-07-01 | RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields | Haochen Jiang et.al. | 2407.01303 | link |
2024-06-27 | What Matters in Detecting AI-Generated Videos like Sora? | Chirui Chang et.al. | 2406.19568 | null |
2024-06-27 | A Universal Railway Obstacle Detection System based on Semi-supervised Segmentation And Optical Flow | Qiushi Guo et.al. | 2406.18908 | null |
2024-06-27 | Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach | Yuxiang Huang et.al. | 2406.18837 | null |
2024-06-25 | Disentangled Motion Modeling for Video Frame Interpolation | Jaihyun Lew et.al. | 2406.17256 | link |
2024-06-26 | Splatter a Video: Video Gaussian Representation for Versatile Processing | Yang-Tian Sun et.al. | 2406.13870 | null |
2024-06-19 | Low Latency Visual Inertial Odometry with On-Sensor Accelerated Optical Flow for Resource-Constrained UAVs | Jonas Kühne et.al. | 2406.13345 | null |
2024-06-17 | MEDeA: Multi-view Efficient Depth Adjustment | Mikhail Artemyev et.al. | 2406.12048 | null |
2024-06-13 | Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion | Linzhan Mou et.al. | 2406.09402 | null |
2024-06-11 | PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow | Joshua Tokarsky et.al. | 2406.07667 | null |
2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551 | link |
2024-06-07 | DVOS: Self-Supervised Dense-Pattern Video Object Segmentation | Keyhan Najafian et.al. | 2406.05131 | null |
2024-06-07 | Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior | Tanvir Mahmud et.al. | 2406.04873 | link |
2024-06-07 | Interplay between preconditioning and regularization for linear ill-posed problems solved by conjugate gradient. Application to optical flow estimation | Ahmed Chabib et.al. | 2406.04695 | null |
2024-06-04 | Neural Representations of Dynamic Visual Stimuli | Jacob Yeung et.al. | 2406.02659 | null |
2024-06-03 | DeNVeR: Deformable Neural Vessel Representations for Unsupervised Video Vessel Segmentation | Chun-Hung Wu et.al. | 2406.01591 | null |
2024-06-03 | Prototypical Transformer as Unified Motion Learners | Cheng Han et.al. | 2406.01559 | null |
2024-06-03 | Enhancing Dynamic CT Image Reconstruction with Neural Fields Through Explicit Motion Regularizers | Pablo Arratia et.al. | 2406.01299 | null |
2024-06-03 | Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting | Fang Li et.al. | 2406.01042 | link |
2024-06-03 | Synthetic Data Generation for 3D Myocardium Deformation Analysis | Shahar Zuler et.al. | 2406.01040 | link |
2024-05-30 | EMAG: Ego-motion Aware and Generalizable 2D Hand Forecasting from Egocentric Videos | Masashi Hatano et.al. | 2405.20030 | null |
2024-05-30 | May the Dance be with You: Dance Generation Framework for Non-Humanoids | Hyemin Ahn et.al. | 2405.19743 | null |
2024-05-28 | GFlow: Recovering 4D World from Monocular Video | Shizun Wang et.al. | 2405.18426 | null |
2024-05-28 | Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition | Muhammad Adi Nugroho et.al. | 2405.18012 | null |
2024-05-27 | DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation | Mengtan Zhang et.al. | 2405.16960 | link |
2024-05-27 | SCSim: A Realistic Spike Cameras Simulator | Liwen Hu et.al. | 2405.16790 | link |
2024-05-26 | Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition | Tong Shi et.al. | 2405.16701 | null |
2024-05-26 | Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception | Shuangpeng Han et.al. | 2405.16493 | link |
2024-05-24 | Time-Harmonic Optical Flow with Applications in Elastography | Oleh Melnyk et.al. | 2405.15507 | link |
2024-05-24 | Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features | Lichuan Ji et.al. | 2405.15343 | null |
2024-05-24 | Unsupervised Motion Segmentation for Neuromorphic Aerial Surveillance | Sami Arja et.al. | 2405.15209 | link |
2024-05-23 | SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow | Yihan Wang et.al. | 2405.14793 | link |
2024-05-23 | OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance | Shuheng Ge et.al. | 2405.14709 | null |
2024-05-23 | Neuroexplicit Diffusion Models for Inpainting of Optical Flow Fields | Tom Fischer et.al. | 2405.14599 | null |
2024-05-22 | MotionCraft: Physics-based Zero-Shot Video Generation | Luca Savant Aira et.al. | 2405.13557 | link |
2024-05-21 | Weakly supervised alignment and registration of MR-CT for cervical cancer radiotherapy | Jjahao Zhang et.al. | 2405.12850 | null |
2024-05-21 | Rethink Predicting the Optical Flow with the Kinetics Perspective | Yuhao Cheng et.al. | 2405.12512 | link |
2024-05-18 | GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition | Mallika Garg et.al. | 2405.11180 | link |
2024-05-17 | MicroBundlePillarTrack, A Python package for automated segmentation, tracking, and analysis of pillar deflection in cardiac microbundles | Hiba Kobeissi et.al. | 2405.11096 | link |
2024-05-16 | Physics-incorporated Graph Neural Network for Multivariate Time Series Imputation | Guojun Liang et.al. | 2405.10995 | link |
2024-05-15 | Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | Xuanchen Wang et.al. | 2405.09266 | null |
2024-05-11 | DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation | Volodymyr Fedynyak et.al. | 2405.08715 | null |
2024-05-14 | EchoTracker: Advancing Myocardial Point Tracking in Echocardiography | Md Abulkalam Azad et.al. | 2405.08587 | link |
2024-05-15 | Vector-Symbolic Architecture for Event-Based Optical Flow | Hongzhi You et.al. | 2405.08300 | null |
2024-05-12 | NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU | Yuhao Zhang et.al. | 2405.07392 | link |
2024-05-11 | Global Motion Understanding in Large-Scale Video Object Segmentation | Volodymyr Fedynyak et.al. | 2405.07031 | null |
2024-05-09 | A Survey on Backbones for Deep Video Action Recognition | Zixuan Tang et.al. | 2405.05584 | null |
2024-05-08 | Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection | Shengyang Sun et.al. | 2405.05130 | link |
2024-05-07 | Visually Guided Swarm Motion Coordination via Insect-inspired Small Target Motion Reactions | Md Arif Billah et.al. | 2405.04591 | null |
2024-05-06 | Diffeomorphic Template Registration for Atmospheric Turbulence Mitigation | Dong Lao et.al. | 2405.03662 | null |
Object Tracking
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-15 | CharaConsist: Fine-Grained Consistent Character Generation | Mengyu Wang et.al. | 2507.11533 | null |
2025-07-14 | Taming Modern Point Tracking for Speckle Tracking Echocardiography via Impartial Motion | Md Abulkalam Azad et.al. | 2507.10127 | null |
2025-07-14 | MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second | Chenguo Lin et.al. | 2507.10065 | null |
2025-07-14 | OpenHuman4D: Open-Vocabulary 4D Human Parsing | Keito Suzuki et.al. | 2507.09880 | null |
2025-07-12 | Online Long-term Point Tracking in the Foundation Model Era | Görkay Aydemir et.al. | 2507.09217 | null |
2025-07-12 | On the Fragility of Multimodal Perception to Temporal Misalignment in Autonomous Driving | Md Hasan Shahriar et.al. | 2507.09095 | null |
2025-07-11 | SAM2RL: Towards Reinforcement Learning Memory Control in Segment Anything Model 2 | Alen Adamyan et.al. | 2507.08548 | null |
2025-07-14 | HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking | Ruixiang Chen et.al. | 2507.07603 | null |
2025-07-10 | Temporal Unlearnable Examples: Preventing Personal Video Data from Unauthorized Exploitation by Object Tracking | Qiangqiang Wu et.al. | 2507.07483 | null |
2025-07-08 | When Trackers Date Fish: A Benchmark and Framework for Underwater Multiple Fish Tracking | Weiran Li et.al. | 2507.06400 | null |
2025-07-08 | Learning to Track Any Points from Human Motion | Inès Hyeonsu Kim et.al. | 2507.06233 | null |
2025-07-08 | Cooperative Mapping, Localization, and Beam Management via Multi-Modal SLAM in ISAC Systems | Hang Que et.al. | 2507.05718 | null |
2025-07-07 | Self-Supervised Real-Time Tracking of Military Vehicles in Low-FPS UAV Footage | Markiyan Kostiv et.al. | 2507.05229 | null |
2025-07-07 | Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking | Maria Damanaki et.al. | 2507.04762 | null |
2025-07-05 | Integrated Gaussian Processes for Robust and Adaptive Multi-Object Tracking | Fred Lydeard et.al. | 2507.04116 | null |
2025-07-03 | CrowdTrack: A Benchmark for Difficult Multiple Pedestrian Tracking in Real Scenarios | Teng Fu et.al. | 2507.02479 | null |
2025-07-03 | A Novel Tuning Method for Real-time Multiple-Object Tracking Utilizing Thermal Sensor with Complexity Motion Pattern | Duong Nguyen-Ngoc Tran et.al. | 2507.02408 | null |
2025-07-03 | PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection | Seokyeong Lee et.al. | 2507.02393 | null |
2025-07-02 | TrackingMiM: Efficient Mamba-in-Mamba Serialization for Real-time UAV Object Tracking | Bingxi Liu et.al. | 2507.01535 | null |
2025-07-04 | Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations | Shivansh Patel et.al. | 2507.00990 | null |
2025-07-01 | UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions | Siyuan Yao et.al. | 2507.00648 | null |
2025-06-30 | Visual and Memory Dual Adapter for Multi-Modal Object Tracking | Boyue Xu et.al. | 2506.23972 | null |
2025-06-30 | Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking | Shiao Wang et.al. | 2506.23783 | null |
2025-06-28 | Optimal Trajectory Planning for Space Object Tracking with Collision-Avoidance Constraints | Saif R. Kazi et.al. | 2506.22797 | null |
2025-06-27 | Improving Token-based Object Detection with Video | Abhineet Singh et.al. | 2506.22562 | null |
2025-07-01 | R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning | Biao Wang et.al. | 2506.21980 | null |
2025-06-26 | Linear and Second-order-cone Valid Inequalities for Problems with Storage | Juan M. Morales et.al. | 2506.21470 | null |
2025-06-24 | VideoPCDNet: Video Parsing and Prediction with Phase Correlation Networks | Noel José Rodrigues Vicente et.al. | 2506.19621 | null |
2025-06-24 | Trajectory Prediction in Dynamic Object Tracking: A Critical Study | Zhongping Dong et.al. | 2506.19341 | null |
2025-06-23 | Lightweight RGB-T Tracking with Mobile Vision Transformers | Mahdi Falaki et.al. | 2506.19154 | null |
2025-06-23 | USVTrack: USV-Based 4D Radar-Camera Tracking Dataset for Autonomous Driving in Inland Waterways | Shanliang Yao et.al. | 2506.18737 | null |
2025-06-23 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220 | link |
2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et.al. | 2506.17119 | link |
2025-06-19 | From Theory to Practice: Identifying the Optimal Approach for Offset Point Tracking in the Context of Agricultural Robotics | Stephane Ngnepiepaye Wembe et.al. | 2506.16143 | null |
2025-06-19 | KARL: Kalman-Filter Assisted Reinforcement Learner for Dynamic Object Tracking and Grasping | Kowndinya Boyalakuntla et.al. | 2506.15945 | null |
2025-06-18 | Probabilistic Trajectory GOSPA: A Metric for Uncertainty-Aware Multi-Object Tracking Performance Evaluation | Yuxuan Xia et.al. | 2506.15148 | null |
2025-06-17 | Projected integral control of impedance passive nonlinear systems | Nicolas Vanspranghe et.al. | 2506.14267 | null |
2025-06-16 | Deep Learning-Based Multi-Object Tracking: A Comprehensive Survey from Foundations to State-of-the-Art | Momir Adžemović et.al. | 2506.13457 | null |
2025-06-15 | Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors | Wen-Hsuan Chu et.al. | 2506.12716 | null |
2025-06-13 | Multiple Object Tracking in Video SAR: A Benchmark and Tracking Baseline | Haoxiang Chen et.al. | 2506.12105 | null |
2025-06-11 | Optimizing Cooperative Multi-Object Tracking using Graph Signal Processing | Maria Damanaki et.al. | 2506.09469 | null |
2025-06-10 | MOSE: A Novel Orchestration Framework for Stateful Microservice Migration at the Edge | Antonio Calagna et.al. | 2506.09159 | null |
2025-06-10 | MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning | Mohammadreza Salehi et.al. | 2506.08694 | link |
2025-06-09 | SAM2Auto: Auto Annotation Using FLASH | Arash Rocky et.al. | 2506.07850 | null |
2025-06-09 | DragNeXt: Rethinking Drag-Based Image Editing | Yuan Zhou et.al. | 2506.07611 | null |
2025-06-08 | AllTracker: Efficient Dense Point Tracking at High Resolution | Adam W. Harley et.al. | 2506.07310 | null |
2025-06-05 | FRAME: Pre-Training Video Feature Representations via Anticipation and Memory | Sethuraman TV et.al. | 2506.05543 | null |
2025-06-08 | Context Is Not Comprehension | Alex Pan et.al. | 2506.04907 | null |
2025-06-04 | Contour Errors: An Ego-Centric Metric for Reliable 3D Multi-Object Tracking | Sharang Kaul et.al. | 2506.04122 | null |
2025-06-03 | SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports | Dheeraj Khanna et.al. | 2506.03335 | null |
2025-06-03 | IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation | Yuanze Lin et.al. | 2506.03150 | null |
2025-06-03 | MVTD: A Benchmark Dataset for Maritime Visual Object Tracking | Ahsan Baidar Bakht et.al. | 2506.02866 | null |
2025-06-09 | E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models | Wenyan Cong et.al. | 2506.01933 | null |
2025-06-02 | UMA: Ultra-detailed Human Avatars via Multi-level Surface Alignment | Heming Zhu et.al. | 2506.01802 | null |
2025-06-02 | No Train Yet Gain: Towards Generic Multi-Object Tracking in Sports and Beyond | Tomasz Stanczyk et.al. | 2506.01373 | null |
2025-06-01 | Depth-Aware Scoring and Hierarchical Alignment for Multiple Object Tracking | Milad Khanchi et.al. | 2506.00774 | null |
2025-05-29 | Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping | Justin Lazarow et.al. | 2505.23756 | null |
2025-05-27 | SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation | Claudia Cuttano et.al. | 2505.21795 | link |
2025-05-27 | Fully Spiking Neural Networks for Unified Frame-Event Object Tracking | Jingjun Yang et.al. | 2505.20834 | null |
2025-05-26 | Video-based Direct Time Series Measurement of Along-Strike Slip on the Coseismic Surface Rupture During the 2025 Mw7.7 Myanmar Earthquake | Jianhao Gao et.al. | 2505.20494 | null |
2025-05-26 | ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking | Sijia Chen et.al. | 2505.20381 | link |
2025-05-28 | Progressive Scaling Visual Object Tracking | Jack Hong et.al. | 2505.19990 | null |
2025-05-24 | Distributed Expectation Propagation for Multi-Object Tracking over Sensor Networks | Qing Li et.al. | 2505.18795 | null |
2025-05-24 | FusionTrack: End-to-End Multi-Object Tracking in Arbitrary Multi-View Environment | Xiaohe Li et.al. | 2505.18727 | null |
2025-05-24 | EOTNet: Deep Memory Aided Bayesian Filter for Extended Object Tracking | Zhixing Wang et.al. | 2505.18684 | link |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111 | null |
2025-05-22 | A Framework for Multi-View Multiple Object Tracking using Single-View Multi-Object Trackers on Fish Data | Chaim Chai Elchik et.al. | 2505.17201 | null |
2025-05-22 | Temporal Object Captioning for Street Scene Videos from LiDAR Tracks | Vignesh Gopinathan et.al. | 2505.16594 | null |
2025-05-21 | Learning better representations for crowded pedestrians in offboard LiDAR-camera 3D tracking-by-detection | Shichao Li et.al. | 2505.16029 | link |
2025-05-21 | ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation | Tony Montes et.al. | 2505.15928 | link |
2025-05-19 | Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach | Shiao Wang et.al. | 2505.12903 | link |
2025-05-22 | LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking | Martha Teiko Teye et.al. | 2505.12753 | null |
2025-05-19 | Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking | Shiyu Xuan et.al. | 2505.12606 | null |
2025-05-20 | DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion Model | Siwei Xia et.al. | 2505.12427 | link |
2025-05-18 | DIMM: Decoupled Multi-hierarchy Kalman Filter for 3D Object Tracking | Jirong Zha et.al. | 2505.12340 | null |
2025-05-17 | GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity | Takuya Ikeda et.al. | 2505.11905 | null |
2025-05-12 | Asynchronous Multi-Object Tracking with an Event Camera | Angus Apps et.al. | 2505.08126 | link |
2025-05-12 | SAEN-BGS: Energy-Efficient Spiking AutoEncoder Network for Background Subtraction | Zhixuan Zhang et.al. | 2505.07336 | null |
2025-05-12 | Towards Accurate State Estimation: Kalman Filter Incorporating Motion Dynamics for 3D Multi-Object Tracking | Mohamed Nagy et.al. | 2505.07254 | null |
2025-05-09 | Hyperbolic and Elliptic Points Tracking Algorithm (HEPTA) in two-dimensional non-stationary velocity fields defined on a discrete grid | A. A. Udalov et.al. | 2505.05975 | null |
2025-05-09 | CGTrack: Cascade Gating Network with Hierarchical Feature Aggregation for UAV Tracking | Weihong Li et.al. | 2505.05936 | link |
2025-05-09 | You Are Your Best Teacher: Semi-Supervised Surgical Point Tracking with Cycle-Consistent Self-Distillation | Valay Bundele et.al. | 2505.05722 | null |
2025-05-08 | A Simple Detector with Frame Dynamics is a Strong Tracker | Chenxu Peng et.al. | 2505.04917 | link |
2025-05-11 | SMMT: Siamese Motion Mamba with Self-attention for Thermal Infrared Target Tracking | Shang Zhang et.al. | 2505.04088 | null |
2025-05-06 | Interactive Instance Annotation with Siamese Networks | Xiang Xu et.al. | 2505.03184 | null |
2025-05-06 | TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion | Haoyue Liu et.al. | 2505.03116 | null |
2025-05-02 | CAMELTrack: Context-Aware Multi-cue ExpLoitation for Online Multi-Object Tracking | Vladimir Somers et.al. | 2505.01257 | link |
2025-05-02 | Optimizing Indoor Farm Monitoring Efficiency Using UAV: Yield Estimation in a GNSS-Denied Cherry Tomato Greenhouse | Taewook Park et.al. | 2505.00995 | null |
2025-04-30 | MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection | Qiushi Yang et.al. | 2505.00739 | null |
2025-05-01 | A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic | Muhammad Imran Zaman et.al. | 2505.00534 | null |
2025-04-30 | Direct Motion Models for Assessing Generated Videos | Kelsey Allen et.al. | 2505.00209 | null |
2025-04-30 | Stereo X-ray tomography on deformed object tracking | Zhenduo Shang et.al. | 2505.00122 | null |
2025-04-30 | LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Marc Glocker et.al. | 2504.21716 | link |
2025-04-30 | Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction | Zihan Zhou et.al. | 2504.21692 | null |
2025-04-30 | Model-Free Two-Degree-of-Freedom PID Controller Design for Unknown LTI Systems | Taiga Kiyota et.al. | 2504.21341 | null |
2025-04-29 | The Mean of Multi-Object Trajectories | Tran Thien Dat Nguyen et.al. | 2504.20391 | null |
2025-04-28 | Improving trajectory continuity in drone-based crowd monitoring using a set of minimal-cost techniques and deep discriminative correlation filters | Bartosz Ptak et.al. | 2504.20234 | null |
2025-04-28 | A computer vision method to estimate ventilation rate of Atlantic salmon in sea fish farms | Lukas Folkman et.al. | 2504.19719 | null |
2025-04-25 | Decentralized Fusion of 3D Extended Object Tracking based on a B-Spline Shape Model | Longfei Han et.al. | 2504.18708 | null |
2025-04-25 | Multi-Sensor Fusion of Active and Passive Measurements for Extended Object Tracking | Hong Zhu et.al. | 2504.18301 | null |
2025-04-25 | PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models | Michel Gokan Khan et.al. | 2504.18165 | link |
2025-04-25 | S3MOT: Monocular 3D Object Tracking with Selective State Space Model | Zhuohao Yan et.al. | 2504.18068 | null |
2025-04-24 | Dynamic Camera Poses and Where to Find Them | Chris Rockwell et.al. | 2504.17788 | null |
2025-04-23 | PRaDA: Projective Radial Distortion Averaging | Daniil Sinitsyn et.al. | 2504.16499 | null |
2025-04-22 | SonarT165: A Large-scale Benchmark and STFTrack Framework for Acoustic Object Tracking | Yunfeng Li et.al. | 2504.15609 | link |
2025-04-20 | TAPIP3D: Tracking Any Point in Persistent 3D Geometry | Bowei Zhang et.al. | 2504.14717 | link |
2025-04-20 | Seurat: From Moving Points to Depth | Seokju Cho et.al. | 2504.14687 | link |
2025-04-19 | Adversarial Attack for RGB-Event based Visual Object Tracking | Qiang Chen et.al. | 2504.14423 | link |
2025-04-17 | St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World | Haiwen Feng et.al. | 2504.13152 | null |
2025-04-17 | Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving | Shumin Wang et.al. | 2504.12709 | null |
2025-04-16 | Robust Visual Servoing under Human Supervision for Assembly Tasks | Victor Nan Fernandez-Ayala et.al. | 2504.12506 | null |
2025-04-13 | Intelligent driving vehicle front multi-target tracking and detection based on YOLOv5 and point cloud 3D projection | Dayong Liu et.al. | 2504.11310 | null |
2025-04-15 | WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs | Nguyen Ngoc Dat et.al. | 2504.10165 | null |
2025-04-14 | LiteTracker: Leveraging Temporal Causality for Accurate Low-latency Tissue Tracking | Mert Asim Karaoglu et.al. | 2504.09904 | null |
2025-04-12 | PapMOT: Exploring Adversarial Patch Attack against Multiple Object Tracking | Jiahuan Long et.al. | 2504.09361 | null |
2025-04-12 | Text To 3D Object Generation For Scalable Room Assembly | Sonia Laguna et.al. | 2504.09328 | null |
2025-04-12 | ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking | Tzoulio Chamiti et.al. | 2504.09195 | null |
2025-04-10 | GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Lang Lin et.al. | 2504.07962 | null |
2025-04-09 | Multi-Object Tracking for Collision Avoidance Using Multiple Cameras in Open RAN Networks | Jordi Serra et.al. | 2504.07163 | null |
2025-04-13 | VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning | Xinhao Li et.al. | 2504.06958 | null |
2025-04-08 | POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction | Songyan Zhang et.al. | 2504.05692 | link |
2025-04-06 | SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation | Junjie Jiang et.al. | 2504.04519 | link |
2025-04-05 | Risk-Aware Robot Control in Dynamic Environments Using Belief Control Barrier Functions | Shaohang Han et.al. | 2504.04097 | link |
2025-04-04 | TQD-Track: Temporal Query Denoising for 3D Multi-Object Tracking | Shuxiao Ding et.al. | 2504.03258 | null |
2025-04-03 | Attention-Aware Multi-View Pedestrian Tracking | Reef Alturki et.al. | 2504.03047 | null |
2025-04-03 | Data-Driven Object Tracking: Integrating Modular Neural Networks into a Kalman Framework | Christian Alexander Holz et.al. | 2504.02519 | null |
2025-04-02 | Deep LG-Track: An Enhanced Localization-Confidence-Guided Multi-Object Tracker | Ting Meng et.al. | 2504.01457 | null |
2025-04-02 | COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking | Chunhui Zhang et.al. | 2504.01321 | link |
2025-04-01 | IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval | Bangwei Liu et.al. | 2504.00954 | null |
2025-03-31 | Point Tracking in Surgery–The 2024 Surgical Tattoos in Infrared (STIR) Challenge | Adam Schmidt et.al. | 2503.24306 | link |
2025-04-03 | Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey | Haoyang Wang et.al. | 2503.22943 | null |
2025-03-28 | Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision | Rulin Zhou et.al. | 2503.22394 | null |
2025-03-28 | Hyperspectral Adapter for Object Tracking based on Hyperspectral Video | Long Gao et.al. | 2503.22199 | null |
2025-03-25 | Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better | Zihang Lai et.al. | 2503.19904 | null |
2025-03-24 | TrackID3x3: A Dataset and Algorithm for Multi-Player Tracking with Identification and Pose Estimation in 3x3 Basketball Full-court Videos | Kazuhiro Yamada et.al. | 2503.18282 | link |
2025-03-22 | MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking | Haolin Qin et.al. | 2503.17699 | link |
2025-03-21 | Dynamic Attention Mechanism in Spatiotemporal Memory Networks for Object Tracking | Meng Zhou et.al. | 2503.16768 | null |
2025-03-20 | Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction | Edgar Sucar et.al. | 2503.16318 | null |
2025-03-19 | Toward Scalable, Flexible Scene Flow for Point Clouds | Kyle Vedder et.al. | 2503.15666 | null |
2025-03-17 | Real-Time Multi-Object Tracking using YOLOv8 and SORT on a SoC FPGA | Michal Danilowicz et.al. | 2503.13023 | null |
2025-03-17 | OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering | Guanhua Ding et.al. | 2503.12968 | null |
2025-03-17 | GIFT: Generated Indoor video frames for Texture-less point tracking | Jianzheng Huang et.al. | 2503.12944 | null |
2025-03-17 | UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Network | Siyuan Yao et.al. | 2503.12888 | link |
2025-03-16 | History-Aware Transformation of ReID Features for Multiple Object Tracking | Ruopeng Gao et.al. | 2503.12562 | link |
2025-03-15 | ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object | Zhe Shan et.al. | 2503.12006 | link |
2025-03-14 | VGGT: Visual Geometry Grounded Transformer | Jianyuan Wang et.al. | 2503.11651 | link |
2025-03-14 | Cognitive Disentanglement for Referring Multi-Object Tracking | Shaofeng Liang et.al. | 2503.11496 | null |
2025-03-13 | 3D Extended Object Tracking based on Extruded B-Spline Side View Profiles | Longfei Han et.al. | 2503.10730 | null |
2025-03-18 | OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer | Jinyang Li et.al. | 2503.10616 | link |
2025-03-13 | Low Complexity Point Tracking of the Myocardium in 2D Echocardiography | Artem Chernyshov et.al. | 2503.10431 | link |
2025-03-13 | Target-aware Bidirectional Fusion Transformer for Aerial Object Tracking | Xinglong Sun et.al. | 2503.09951 | null |
2025-03-12 | How good are deep learning methods for automated road safety analysis using video data? An experimental study | Qingwu Liu et.al. | 2503.09807 | null |
2025-03-11 | TrackOcc: Camera-based 4D Panoptic Occupancy Tracking | Zhuoguang Chen et.al. | 2503.08471 | link |
2025-03-11 | Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking | Yunhao Li et.al. | 2503.08145 | null |
2025-03-10 | SIRE: SE(3) Intrinsic Rigidity Embeddings | Cameron Smith et.al. | 2503.07739 | null |
2025-03-10 | CPAny: Couple With Any Encoder to Refer Multi-Object Tracking | Weize Li et.al. | 2503.07516 | null |
2025-03-09 | Online Dense Point Tracking with Streaming Memory | Qiaole Dong et.al. | 2503.06471 | link |
2025-03-06 | A Novel Control Strategy for Offset Points Tracking in the Context of Agricultural Robotics | Stephane Ngnepiepaye Wembe et.al. | 2503.05835 | null |
2025-03-06 | Omnidirectional Multi-Object Tracking | Kai Luo et.al. | 2503.04565 | link |
2025-03-09 | ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem | Yu-Hsi Chen et.al. | 2503.04500 | link |
2025-03-06 | A Modular Pipeline for 3D Object Tracking Using RGB Cameras | Lars Bredereke et.al. | 2503.04322 | link |
2025-03-03 | AI-Driven Relocation Tracking in Dynamic Kitchen Environments | Arash Nasr Esfahani et.al. | 2503.01547 | link |
2025-02-27 | MITracker: Multi-View Integration for Visual Object Tracking | Mengjie Xu et.al. | 2502.20111 | null |
2025-02-26 | Spectral-Enhanced Transformers: Leveraging Large-Scale Pretrained Models for Hyperspectral Object Tracking | Shaheer Mohamed et.al. | 2502.18748 | null |
2025-02-25 | UASTrack: A Unified Adaptive Selection Framework with Modality-Customization in Single Object Tracking | He Wang et.al. | 2502.18220 | null |
2025-02-26 | Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking | Peng Zhang et.al. | 2502.17822 | null |
2025-02-24 | V-HOP: Visuo-Haptic 6D Object Pose Tracking | Hongyu Li et.al. | 2502.17434 | null |
2025-02-24 | Enriching Physical-Virtual Interaction in AR Gaming by Tracking Identical Real Objects | Liuchuan Yu et.al. | 2502.17399 | link |
2025-02-24 | CRTrack: Low-Light Semi-Supervised Multi-object Tracking Based on Consistency Regularization | Zijing Zhao et.al. | 2502.16809 | null |
2025-02-23 | Benchmarking Online Object Trackers for Underwater Robot Position Locking Applications | Ali Safa et.al. | 2502.16569 | null |
2025-02-19 | A Training-Free Framework for Precise Mobile Manipulation of Small Everyday Objects | Arjun Gupta et.al. | 2502.13964 | null |
2025-02-19 | MEX: Memory-efficient Approach to Referring Multi-Object Tracking | Huu-Thien Tran et.al. | 2502.13875 | null |
2025-02-18 | Pre-training Auto-regressive Robotic Models with 4D Representations | Dantong Niu et.al. | 2502.13142 | null |
2025-02-13 | IMM-MOT: A Novel 3D Multi-object Tracking Framework with Interacting Multiple Model Filter | Xiaohong Liu et.al. | 2502.09672 | null |
2025-02-12 | Control Barrier Function-Based Quadratic Programming for SafeOperation of Tethered UAVs | Samuel O. Folorunsho et.al. | 2502.08129 | null |
2025-02-10 | Adaptive Perception for Unified Visual Multi-modal Object Tracking | Xiantao Hu et.al. | 2502.06583 | null |
2025-02-09 | Energy-Efficient Autonomous Aerial Navigation with Dynamic Vision Sensors: A Physics-Guided Neuromorphic Approach | Sourav Sanyal et.al. | 2502.05938 | null |
2025-02-08 | Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark | Shiao Wang et.al. | 2502.05574 | link |
2025-02-06 | OneTrack-M: A multitask approach to transformer-based MOT models | Luiz C. S. de Araujo et.al. | 2502.04478 | null |
2025-02-06 | RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology | Nhat-Tan Do et.al. | 2502.03760 | null |
2025-02-04 | Rethinking Vision Transformer for Object Centric Foundation Models | Manuel Traub et.al. | 2502.02763 | null |
2025-02-04 | INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy | Nastaran Darabi et.al. | 2502.01896 | null |
2025-02-03 | Bayesian Approximation-Based Trajectory Prediction and Tracking with 4D Radar | Dong-In Kim et.al. | 2502.01357 | null |
2025-02-03 | Solgenia – A Test Vessel Toward Energy-Efficient Autonomous Water Taxi Applications | Hannes Homburger et.al. | 2502.01207 | link |
2025-01-30 | Track-On: Transformer-based Online Point Tracking with Memory | Görkay Aydemir et.al. | 2501.18487 | link |
2025-01-28 | Overcoming Semantic Dilution in Transformer-Based Next Frame Prediction | Hy Nguyen et.al. | 2501.16753 | null |
2025-01-27 | Understanding Long Videos via LLM-Powered Entity Relation Graphs | Meng Chu et.al. | 2501.15953 | null |
2025-01-24 | MATCHA:Towards Matching Anything | Fei Xue et.al. | 2501.14945 | null |
2025-01-24 | Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection | Viktor Kozák et.al. | 2501.14587 | null |
2025-01-23 | CSAOT: Cooperative Multi-Agent System for Active Object Tracking | Hy Nguyen et.al. | 2501.13994 | null |
2025-01-23 | YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID | Iñaki Erregue et.al. | 2501.13710 | link |
2025-01-21 | Learning segmentation from point trajectories | Laurynas Karazija et.al. | 2501.12392 | link |
2025-01-22 | InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling | Yi Wang et.al. | 2501.12386 | link |
2025-01-21 | Exploring Temporally-Aware Features for Point Tracking | Inès Hyeonsu Kim et.al. | 2501.12218 | link |
2025-01-20 | PD-SORT: Occlusion-Robust Multi-Object Tracking Using Pseudo-Depth Cues | Yanchao Wang et.al. | 2501.11288 | link |
2025-01-17 | Spatio-temporal Graph Learning on Adaptive Mined Key Frames for High-performance Multi-Object Tracking | Futian Wang et.al. | 2501.10129 | null |
2025-01-13 | SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Varun Biyyala et.al. | 2501.07554 | link |
2025-01-13 | TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations | Daniel Steininger et.al. | 2501.07360 | link |
2025-01-13 | Robust Single Object Tracking in LiDAR Point Clouds under Adverse Weather Conditions | Xiantong Zhao et.al. | 2501.07133 | null |
2025-01-09 | An Empirical Study of Autoregressive Pre-training from Videos | Jathushan Rajasegaran et.al. | 2501.05453 | null |
2025-01-08 | Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs | Zeyi Huang et.al. | 2501.04336 | null |
2025-01-07 | Neuromorphic Optical Tracking and Imaging of Randomly Moving Targets through Strongly Scattering Media | Ning Zhang et.al. | 2501.03874 | null |
2025-01-06 | ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking | Tingyang Zhang et.al. | 2501.03220 | null |
2025-01-05 | GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking | Weikang Bian et.al. | 2501.02690 | null |
2025-01-05 | DeTrack: In-model Latent Denoising Learning for Visual Object Tracking | Xinyu Zhou et.al. | 2501.02467 | null |
2025-01-02 | HybridTrack: A Hybrid Approach for Robust Multi-Object Tracking | Leandro Di Bella et.al. | 2501.01275 | link |
2025-01-02 | Sensitivity of Room Impulse Responses in Changing Acoustic Environment | Karolina Prawda et.al. | 2501.01206 | null |
2025-01-01 | Less is More: Token Context-aware Learning for Object Tracking | Chenlong Xu et.al. | 2501.00758 | link |
2024-12-26 | SUTrack: Towards Simple and Unified Single Object Tracking | Xin Chen et.al. | 2412.19138 | link |
2024-12-23 | Cross-View Referring Multi-Object Tracking | Sijia Chen et.al. | 2412.17807 | link |
2024-12-20 | Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking | Xiantao Hu et.al. | 2412.15691 | link |
2024-12-19 | Scaling 4D Representations | João Carreira et.al. | 2412.15212 | null |
2024-12-18 | Joint Perception and Prediction for Autonomous Driving: A Survey | Lucas Dal’Col et.al. | 2412.14088 | link |
2024-12-18 | MambaLCT: Boosting Tracking via Long-term Context State Space Model | Xiaohai Li et.al. | 2412.13615 | link |
2024-12-17 | CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices | Andrei Znobishchev et.al. | 2412.13273 | null |
2024-12-17 | Tell Me What to Track: Infusing Robust Language Guidance for Enhanced Referring Multi-Object Tracking | Wenjun Huang et.al. | 2412.12561 | null |
2024-12-15 | Exploring Enhanced Contextual Information for Video-Level Object Tracking | Ben Kang et.al. | 2412.11023 | link |
2024-12-14 | Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos | Qingyu Xu et.al. | 2412.10861 | link |
2024-12-14 | Patch-level Sounding Object Tracking for Audio-Visual Question Answering | Zhangbin Li et.al. | 2412.10749 | null |
2024-12-12 | Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach | Kailas PS et.al. | 2412.10453 | null |
2024-12-13 | Visual Object Tracking across Diverse Data Modalities: A Review | Mengmeng Wang et.al. | 2412.09991 | null |
2024-12-12 | NormalFlow: Fast, Robust, and Accurate Contact-based Object 6DoF Pose Tracking with Vision-based Tactile Sensors | Hung-Jui Huang et.al. | 2412.09617 | link |
2024-12-12 | Temporal-Assisted Beamforming and Trajectory Prediction in Sensing-Enabled UAV Communications | Shengcai Zhou et.al. | 2412.09097 | null |
2024-12-11 | TGOSPA Metric Parameters Selection and Evaluation for Visual Multi-object Tracking | Jan Krejčí et.al. | 2412.08321 | null |
2024-12-11 | Post-Hoc MOTS: Exploring the Capabilities of Time-Symmetric Multi-Object Tracking | Gergely Szabó et.al. | 2412.08313 | null |
2024-12-11 | DTAA: A Detect, Track and Avoid Architecture for navigation in spaces with Multiple Velocity Objects | Samuel Nordström et.al. | 2412.08121 | null |
2024-12-10 | Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation | Kurt H. W. Stolle et.al. | 2412.07966 | null |
2024-12-10 | Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments | Muhayy Ud Din et.al. | 2412.07392 | null |
2024-12-10 | Optical Levitation of Arrays of Microspheres | Benjamin Siegel et.al. | 2412.07088 | null |
2024-12-09 | Microcontroller-Driven MPPT System for Enhanced Photovoltaic Efficiency: An Experimental Approach in Nepal | Diwakar Khadka et.al. | 2412.06956 | null |
2024-12-09 | Enhanced Multi-Object Tracking Using Pose-based Virtual Markers in 3x3 Basketball | Li Yin et.al. | 2412.06258 | null |
2024-12-10 | Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation | Hyeonho Jeong et.al. | 2412.06016 | null |
2024-12-07 | Street Gaussians without 3D Object Tracker | Ruida Zhang et.al. | 2412.05548 | null |
2024-12-06 | HOLa: HoloLens Object Labeling | Michael Schwimmbeck et.al. | 2412.04945 | link |
2024-12-06 | Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection | Khurram Azeem Hashmi et.al. | 2412.04915 | null |
2024-12-05 | EgoPoints: Advancing Point Tracking for Egocentric Videos | Ahmad Darkhalil et.al. | 2412.04592 | null |
2024-12-04 | Distillation of Diffusion Features for Semantic Correspondence | Frank Fundel et.al. | 2412.03512 | null |
2024-12-03 | MVCTrack: Boosting 3D Point Cloud Tracking via Multimodal-Guided Virtual Cues | Zhaofeng Hu et.al. | 2412.02734 | link |
2024-12-03 | GSOT3D: Towards Generic 3D Single Object Tracking in the Wild | Yifan Jiao et.al. | 2412.02129 | link |
2024-12-02 | 6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting | Yufeng Jin et.al. | 2412.01543 | null |
2024-12-02 | A2VIS: Amodal-Aware Approach to Video Instance Segmentation | Minh Tran et.al. | 2412.01147 | null |
2024-12-02 | Referring Video Object Segmentation via Language-aligned Track Selection | Seongchan Kim et.al. | 2412.01136 | link |
2024-12-02 | Eyes on the Road: State-of-the-Art Video Question Answering Models Assessment for Traffic Monitoring Tasks | Joseph Raj Vishal et.al. | 2412.01132 | link |
2024-12-02 | Object Tracking in a $360^o$ View: A Novel Perspective on Bridging the Gap to Biomedical Advancements | Mojtaba S. Fazli et.al. | 2412.01119 | null |
2024-12-02 | LiDAR SLAMMOT based on Confidence-guided Data Association | Susu Fang et.al. | 2412.01041 | null |
2024-12-01 | BEV-SUSHI: Multi-Target Multi-Camera 3D Detection and Tracking in Bird’s-Eye View | Yizhou Wang et.al. | 2412.00692 | null |
2024-11-29 | Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark | Joseph Heyward et.al. | 2411.19941 | null |
2024-11-28 | HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos | Prithviraj Banerjee et.al. | 2411.19167 | null |
2024-11-28 | Visual SLAMMOT Considering Multiple Motion Models | Peilin Tian et.al. | 2411.19134 | null |
2024-11-28 | CrossTracker: Robust Multi-modal 3D Multi-Object Tracking via Cross Correction | Lipeng Gu et.al. | 2411.18850 | null |
2024-11-27 | TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video | Jinyuan Qu et.al. | 2411.18671 | null |
2024-11-27 | A comparison of extended object tracking with multi-modal sensors in indoor environment | Jiangtao Shuai et.al. | 2411.18476 | null |
2024-11-27 | Efficient Dynamic LiDAR Odometry for Mobile Robots with Structured Point Clouds | Jonathan Lichtenfeld et.al. | 2411.18443 | link |
2024-11-26 | A Distractor-Aware Memory for Visual Object Tracking with SAM2 | Jovana Videnovic et.al. | 2411.17576 | link |
2024-11-24 | FastTrackTr:Towards Fast Multi-Object Tracking with Transformers | Pan Liao et.al. | 2411.15811 | null |
2024-11-23 | How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking | Xuchen Li et.al. | 2411.15600 | null |
2024-11-23 | MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking | Xinqi Liu et.al. | 2411.15459 | null |
2024-11-20 | Gaze2AOI: Open Source Deep-learning Based System for Automatic Area of Interest Annotation with Eye Tracking Data | Karolina Trajkovska et.al. | 2411.13346 | null |
2024-11-20 | Teaching VLMs to Localize Specific Objects from In-context Examples | Sivan Doveh et.al. | 2411.13317 | link |
2024-11-20 | DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild | Weicai Ye et.al. | 2411.13291 | null |
2024-11-24 | ClickTrack: Towards Real-time Interactive Single Object Tracking | Kuiran Wang et.al. | 2411.13183 | null |
2024-11-20 | Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity | Wassim El Ahmar et.al. | 2411.12943 | link |
2024-11-19 | Resolution Improvement in OFDM-based Joint Communication and Sensing through Combined Tracking and Interpolation | Charlotte Muth et.al. | 2411.12464 | null |
2024-11-18 | SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory | Cheng-Yen Yang et.al. | 2411.11922 | link |
2024-11-18 | Learning a Neural Association Network for Self-supervised Multi-Object Tracking | Shuai Li et.al. | 2411.11514 | null |
2024-11-15 | Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras | Ishrath Ahamed et.al. | 2411.10072 | null |
2024-11-21 | MOT FCG++: Enhanced Representation of Spatio-temporal Motion and Appearance Features | Yanzhao Fang et.al. | 2411.10028 | null |
2024-11-13 | Predictive Visuo-Tactile Interactive Perception Framework for Object Properties Inference | Anirvan Dutta et.al. | 2411.09020 | null |
2024-11-13 | 3D Multi-Object Tracking with Semi-Supervised GRU-Kalman Filter | Xiaoxiang Wang et.al. | 2411.08433 | null |
2024-11-13 | DEEGITS: Deep Learning based Framework for Measuring Heterogenous Traffic State in Challenging Traffic Scenarios | Muttahirul Islam et.al. | 2411.08335 | null |
2024-11-12 | GTA: Global Tracklet Association for Multi-Object Tracking in Sports | Jiacheng Sun et.al. | 2411.08216 | link |
2024-11-11 | BuckTales : A multi-UAV dataset for multi-object tracking and re-identification of wild antelopes | Hemal Naik et.al. | 2411.06896 | null |
2024-11-11 | HSTrack: Bootstrap End-to-End Multi-Camera 3D Multi-object Tracking with Hybrid Supervision | Shubo Lin et.al. | 2411.06780 | null |
2024-11-11 | Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs | Jia Syuen Lim et.al. | 2411.06702 | null |
2024-11-10 | PKF: Probabilistic Data Association Kalman Filter for Multi-Object Tracking | Hanwen Cao et.al. | 2411.06378 | link |
2024-11-09 | Multi-object Tracking by Detection and Query: an efficient end-to-end manner | Shukun Jia et.al. | 2411.06197 | null |
2024-11-08 | Agile UAV landing control on moving ship in adverse conditions | James Mordaunt et.al. | 2411.05445 | null |
2024-11-06 | Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving | Depanshu Sani et.al. | 2411.03702 | null |
2024-11-05 | Object and Contact Point Tracking in Demonstrations Using 3D Gaussian Splatting | Michael Büttner et.al. | 2411.03555 | null |
2024-11-04 | SIRA: Scalable Inter-frame Relation and Association for Radar Perception | Ryoma Yataka et.al. | 2411.02220 | null |
2024-11-04 | Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations | Thanh Nguyen Canh et.al. | 2411.01816 | null |
2024-11-04 | ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model | Yiming Sun et.al. | 2411.01756 | null |
2024-11-01 | HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices | Xiang Li et.al. | 2411.00608 | null |
2024-11-01 | Is Multiple Object Tracking a Matter of Specialization? | Gianluca Mancusi et.al. | 2411.00553 | null |
2024-10-31 | Extended Object Tracking and Classification based on Linear Splines | Matteo Tesori et.al. | 2410.24183 | null |
2024-10-30 | IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking | Run Luo et.al. | 2410.23907 | null |
2024-10-28 | Evaluating the Robustness of LiDAR Point Cloud Tracking Against Adversarial Attack | Shengjing Tian et.al. | 2410.20893 | null |
2024-10-27 | BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events | Yijin Li et.al. | 2410.20451 | null |
2024-10-27 | NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tracking | Yu Liu et.al. | 2410.20421 | link |
2024-10-27 | Depth Attention for Robust RGB Tracking | Yu Liu et.al. | 2410.20395 | link |
2024-10-26 | SFTrack: A Robust Scale and Motion Adaptive Algorithm for Tracking Small and Fast Moving Objects | InPyo Song et.al. | 2410.20079 | null |
2024-10-25 | A-MFST: Adaptive Multi-Flow Sparse Tracker for Real-Time Tissue Tracking Under Occlusion | Yuxin Chen et.al. | 2410.19996 | null |
2024-10-23 | ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting | Shaofei Cai et.al. | 2410.17856 | link |
2024-10-23 | Real-time Vehicle-to-Vehicle Communication Based Network Cooperative Control System through Distributed Database and Multimodal Perception: Demonstrated in Crossroads | Xinwen Zhu et.al. | 2410.17576 | link |
2024-10-23 | OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking | Haiji Liang et.al. | 2410.17534 | link |
2024-10-22 | MPT: A Large-scale Multi-Phytoplankton Tracking Benchmark | Yang Yu et.al. | 2410.16695 | link |
2024-10-19 | The Solution for Single Object Tracking Task of Perception Test Challenge 2024 | Zhiqiang Zhong et.al. | 2410.16329 | null |
2024-10-20 | TrackMe:A Simple and Effective Multiple Object Tracking Annotation Tool | Thinh Phan et.al. | 2410.15518 | link |
2024-10-20 | Multiset Combinatorial Gray Codes with Application to Proximity Sensor Networks | Chung Shue Chen et.al. | 2410.15428 | null |
2024-10-19 | 3D Multi-Object Tracking Employing MS-GLMB Filter for Autonomous Driving | Linh Van Ma et.al. | 2410.14977 | link |
2024-10-18 | Enhancing In-vehicle Multiple Object Tracking Systems with Embeddable Ising Machines | Kosuke Tatsumura et.al. | 2410.14093 | null |
2024-10-17 | Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation | Changcheng Xiao et.al. | 2410.13437 | null |
2024-10-17 | TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal | Yanpeng Jia et.al. | 2410.13240 | null |
2024-10-15 | CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos | Nikita Karaev et.al. | 2410.11831 | null |
2024-10-17 | UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles | Hui Ye et.al. | 2410.11125 | null |
2024-10-14 | Motion-guided small MAV detection in complex and non-planar scenes | Hanqing Guo et.al. | 2410.10527 | null |
2024-10-14 | SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments | Khaled Gabr et.al. | 2410.10409 | link |
2024-10-14 | DINTR: Tracking via Diffusion-based Interpolation | Pha Nguyen et.al. | 2410.10053 | null |
2024-10-11 | Enhanced Kalman with Adaptive Appearance Motion SORT for Grounded Generic Multiple Object Tracking | Duy Le Dinh Anh et.al. | 2410.09243 | null |
2024-10-11 | VideoSAM: Open-World Video Segmentation | Pinxue Guo et.al. | 2410.08781 | null |
2024-10-11 | Efficient Multi-Object Tracking on Edge Devices via Reconstruction-Based Channel Pruning | Jan Müller et.al. | 2410.08769 | null |
2024-10-11 | VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking | Zekun Qian et.al. | 2410.08529 | null |
2024-10-05 | ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments | Lorenzo Terenzi et.al. | 2410.04250 | null |
2024-10-04 | Combing Text-based and Drag-based Editing for Precise and Flexible Image Editing | Ziqi Jiang et.al. | 2410.03097 | null |
2024-10-03 | Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking | Fabian Herzog et.al. | 2410.02638 | link |
2024-10-09 | DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM | Xuchen Li et.al. | 2410.02492 | null |
2024-10-03 | Spiking Neural Network as Adaptive Event Stream Slicer | Jiahang Cao et.al. | 2410.02249 | link |
2024-10-10 | Tracking objects that change in appearance with phase synchrony | Sabine Muzellec et.al. | 2410.02094 | null |
2024-10-02 | Scene Flow as a Partial Differential Equation | Kyle Vedder et.al. | 2410.02031 | null |
2024-10-02 | Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking | Mattia Segu et.al. | 2410.01806 | null |
2024-10-02 | Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking | Ayesha Ishaq et.al. | 2410.01678 | link |
2024-09-29 | One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos | Zechen Bai et.al. | 2409.19603 | link |
2024-09-27 | Improving Visual Object Tracking through Visual Prompting | Shih-Fang Chen et.al. | 2409.18901 | link |
2024-09-30 | An Overview of Multi-Object Estimation via Labeled Random Finite Set | Ba-Ngu Vo et.al. | 2409.18531 | null |
2024-09-26 | BlinkTrack: Feature Tracking over 100 FPS via Events and Images | Yichen Shen et.al. | 2409.17981 | null |
2024-09-26 | General Compression Framework for Efficient Transformer Object Tracking | Lingyi Hong et.al. | 2409.17564 | null |
2024-09-26 | CAMOT: Camera Angle-aware Multi-Object Tracking | Felix Limanta et.al. | 2409.17533 | null |
2024-09-25 | Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Appearance Graphs | Mattia Segu et.al. | 2409.17221 | null |
2024-09-25 | Automated Surgical Skill Assessment in Endoscopic Pituitary Surgery using Real-time Instrument Tracking on a High-fidelity Bench-top Phantom | Adrito Das et.al. | 2409.17025 | null |
2024-09-25 | Towards Underwater Camouflaged Object Tracking: An Experimental Evaluation of SAM and SAM 2 | Chunhui Zhang et.al. | 2409.16902 | link |
2024-09-25 | Conditional Generative Denoiser for Nighttime UAV Tracking | Yucheng Wang et.al. | 2409.16834 | link |
2024-09-25 | Progressive Representation Learning for Real-Time UAV Tracking | Changhong Fu et.al. | 2409.16652 | link |
2024-09-25 | Enhancing Nighttime UAV Tracking with Light Distribution Suppression | Liangliang Yao et.al. | 2409.16631 | link |
2024-09-24 | Transformer based time series prediction of the maximum power point for solar photovoltaic cells | Palaash Agrawal et.al. | 2409.16342 | null |
2024-09-24 | Self-Supervised Any-Point Tracking by Contrastive Random Walks | Ayush Shrivastava et.al. | 2409.16288 | link |
2024-09-23 | MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving | Xiyang Wang et.al. | 2409.16149 | link |
2024-09-24 | CloudTrack: Scalable UAV Tracking with Cloud Semantics | Yannik Blei et.al. | 2409.16111 | link |
2024-09-22 | TrackNetV4: Enhancing Fast Sports Object Tracking with Motion Attention Maps | Arjun Raj et.al. | 2409.14543 | null |
2024-09-21 | Masks and Boxes: Combining the Best of Both Worlds for Multi-Object Tracking | Tomasz Stanczyk et.al. | 2409.14220 | null |
2024-09-21 | Foundation Models for Amodal Video Instance Segmentation in Automated Driving | Jasmin Breitenstein et.al. | 2409.14095 | link |
2024-09-18 | Tracking Any Point with Frame-Event Fusion Network at High Frame Rate | Jiaxiong Liu et.al. | 2409.11953 | null |
2024-09-18 | RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework | Xiaoyu Li et.al. | 2409.11749 | null |
2024-09-17 | SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking | Siyuan Li et.al. | 2409.11235 | link |
2024-09-17 | STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking | Jianbo Ma et.al. | 2409.11234 | link |
2024-09-17 | TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection | Philip Jacobson et.al. | 2409.10901 | null |
2024-09-15 | Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings | Oriel Perl et.al. | 2409.09841 | null |
2024-09-14 | Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown | Zimeng Fang et.al. | 2409.09293 | link |
2024-09-12 | FACT: Feature Adaptive Continual-learning Tracker for Multiple Object Tracking | Rongzihan Song et.al. | 2409.07904 | null |
2024-09-10 | When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking | Emirhan Bayar et.al. | 2409.06617 | link |
2024-09-09 | Leveraging Object Priors for Point Tracking | Bikram Boote et.al. | 2409.05786 | link |
2024-09-08 | RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network | Zhiwei Lin et.al. | 2409.04979 | null |
2024-09-06 | LITE: A Paradigm Shift in Multi-Object Tracking with Efficient ReID Feature Integration | Jumabek Alikhanov et.al. | 2409.04187 | link |
2024-09-05 | Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints | Keisuke Toida et.al. | 2409.03252 | null |
2024-09-04 | TP-GMOT: Tracking Generic Multiple Object by Textual Prompt with Motion-Appearance Cost (MAC) SORT | Duy Le Dinh Anh et.al. | 2409.02490 | link |
2024-09-03 | DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction | Jenny Seidenschwarz et.al. | 2409.02104 | null |
2024-09-01 | YOLOO: You Only Learn from Others Once | Lipeng Gu et.al. | 2409.00618 | null |
2024-09-10 | TrackSSM: A General Motion Predictor by State-Space Model | Bin Hu et.al. | 2409.00487 | link |
2024-08-31 | Fish Tracking Challenge 2024: A Multi-Object Tracking Competition with Sweetfish Schooling Data | Makoto M. Itoh et.al. | 2409.00339 | null |
2024-08-30 | UTrack: Multi-Object Tracking with Uncertain Detections | Edgardo Solano-Carrillo et.al. | 2408.17098 | link |
2024-08-29 | Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks | Sierra Bonilla et.al. | 2408.16445 | link |
2024-08-29 | Estimating Dynamic Flow Features in Groups of Tracked Objects | Tanner D. Harms et.al. | 2408.16190 | null |
2024-08-28 | ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model | Lifan Jiang et.al. | 2408.15548 | link |
2024-08-25 | Camouflaged_Object_Tracking__A_Benchmark | Xiaoyu Guo et.al. | 2408.13877 | link |
2024-08-24 | Can Visual Foundation Models Achieve Long-term Point Tracking? | Görkay Aydemir et.al. | 2408.13575 | null |
2024-08-23 | MCTR: Multi Camera Tracking Transformer | Alexandru Niculescu-Mizil et.al. | 2408.13243 | null |
2024-08-23 | BoostTrack++: using tracklet information to detect more objects in multiple object tracking | Vukašin Stanojević et.al. | 2408.13003 | link |
2024-08-22 | BankTweak: Adversarial Attack against Multi-Object Trackers by Manipulating Feature Banks | Woojin Shin et.al. | 2408.12727 | null |
2024-08-22 | BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking | Hanzheng Wang et.al. | 2408.12232 | null |
2024-08-21 | CHOTA: A Higher Order Accuracy Metric for Cell Tracking | Timo Kaiser et.al. | 2408.11571 | link |
2024-08-21 | Low-Light Object Tracking: A Benchmark | Pengzhi Zhong et.al. | 2408.11463 | link |
2024-08-20 | MambaEVT: Event Stream based Visual Object Tracking using State Space Model | Xiao Wang et.al. | 2408.10487 | link |
2024-08-17 | GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System | Shuo Wang et.al. | 2408.09191 | null |
2024-08-17 | MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model | Changcheng Xiao et.al. | 2408.09178 | null |
2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | null |
2024-08-14 | RTAT: A Robust Two-stage Association Tracker for Multi-Object Tracking | Song Guo et.al. | 2408.07344 | null |
2024-08-13 | Object Tracking Incorporating Transfer Learning into Unscented and Cubature Kalman Filters | Omar Alotaibi et.al. | 2408.07157 | null |
2024-08-12 | FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework | Lukas Meyer et.al. | 2408.06190 | link |
2024-08-11 | A Training-Free Framework for Video License Plate Tracking and Recognition with Only One-Shot | Haoxuan Ding et.al. | 2408.05729 | link |
2024-08-09 | Mesh-based Object Tracking for Dynamic Semantic 3D Scene Graphs via Ray Tracing | Lennart Niecksch et.al. | 2408.04979 | null |
2024-08-06 | Quantum Imaging Using Spatially Entangled Photon Pairs from a Nonlinear Metasurface | Jinyong Ma et.al. | 2408.02903 | null |
2024-08-05 | VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking | Yuxuan Lu et.al. | 2408.02263 | null |
2024-08-04 | 3D Single-object Tracking in Point Clouds with High Temporal Variation | Qiao Wu et.al. | 2408.02049 | null |
2024-08-03 | SiamMo: Siamese Motion-Centric 3D Object Tracking | Yuxiang Yang et.al. | 2408.01688 | link |
2024-08-02 | Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Yabin Zhu et.al. | 2408.00969 | link |
2024-08-05 | U2UData: A Large-scale Cooperative Perception Dataset for Swarm UAVs Autonomous Flight | Tongtong Feng et.al. | 2408.00606 | link |
2024-08-01 | A Batch Update Using Multiplicative Noise Modelling for Extended Object Tracking | Christian Gramsch et.al. | 2408.00417 | null |
2024-07-30 | Autogenic Language Embedding for Coherent Point Tracking | Zikai Song et.al. | 2407.20730 | link |
2024-07-30 | SharkTrack: an accurate, generalisable software for streamlining shark and ray underwater video analysis | Filippo Varini et.al. | 2407.20623 | null |
2024-07-29 | MEVDT: Multi-Modal Event-Based Vehicle Detection and Tracking Dataset | Zaid A. El Shair et.al. | 2407.20446 | null |
2024-07-28 | Progressive Domain Adaptation for Thermal Infrared Object Tracking | Qiao Li et.al. | 2407.19430 | null |
2024-07-25 | Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT | Niels G. Faber et.al. | 2407.18288 | link |
2024-07-20 | CORT: Class-Oriented Real-time Tracking for Embedded Systems | Edoardo Cittadini et.al. | 2407.17521 | null |
2024-07-23 | PlantTrack: Task-Driven Plant Keypoint Tracking with Zero-Shot Sim2Real Transfer | Samhita Marri et.al. | 2407.16829 | null |
2024-07-23 | Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos | Jiahe Liu et.al. | 2407.16124 | link |
2024-07-22 | Local All-Pair Correspondence for Point Tracking | Seokju Cho et.al. | 2407.15420 | link |
2024-07-21 | Multiple Object Detection and Tracking in Panoramic Videos for Cycling Safety Analysis | Jingwei Guo et.al. | 2407.15199 | link |
2024-07-19 | Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking | Yunfei Zhang et.al. | 2407.14086 | link |
2024-07-19 | OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking | Zekun Qian et.al. | 2407.14047 | null |
2024-07-18 | Boosting Online 3D Multi-Object Tracking through Camera-Radar Cross Check | Sheng-Yao Kuan et.al. | 2407.13937 | null |
2024-07-18 | Long-Term 3D Point Tracking By Cost Volume Fusion | Hung Nguyen et.al. | 2407.13337 | null |
2024-07-17 | Strawberry detection and counting based on YOLOv7 pruning and information based tracking algorithm | Shiyu Liu et.al. | 2407.12614 | null |
2024-07-15 | Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation | Friedhelm Hamann et.al. | 2407.10802 | link |
2024-07-15 | Effective Motion Modeling for UAV-platform Multiple Object Tracking with Re-Margin Loss | Mufeng Yao et.al. | 2407.10485 | link |
2024-07-16 | Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking | Lorenzo Vaquero et.al. | 2407.10151 | link |
2024-07-14 | Power System Architecture and Control for Green Hydrogen Production via Power Converter-less Photovoltaic-Electrolyser Integration | Aymeric Fabre et.al. | 2407.10075 | null |
2024-07-12 | DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects | Peng Wang et.al. | 2407.09051 | null |
2024-07-11 | Manipulating a Tetris-Inspired 3D Video Representation | Mihir Godbole et.al. | 2407.08885 | null |
2024-07-11 | Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets | Linh Van Ma et.al. | 2407.08872 | link |
2024-07-11 | CommRad: Context-Aware Sensing-Driven Millimeter-Wave Networks | Ish Kumar Jain et.al. | 2407.08817 | null |
2024-07-10 | Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors | Lei Cheng et.al. | 2407.08049 | null |
2024-07-10 | MSC-LIO: An MSCKF-Based LiDAR-Inertial Odometry with Same-Plane-Point Tracking | Tisheng Zhang et.al. | 2407.07589 | null |
2024-07-09 | Decomposition Betters Tracking Everything Everywhere | Rui Li et.al. | 2407.06531 | link |
2024-07-08 | GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images | Jon Crall et.al. | 2407.06337 | null |
2024-07-08 | TAPVid-3D: A Benchmark for Tracking Any Point in 3D | Skanda Koppula et.al. | 2407.05921 | link |
2024-07-07 | Addressing single object tracking in satellite imagery through prompt-engineered solutions | Athena Psalta et.al. | 2407.05518 | null |
2024-07-09 | P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds | Jiahao Nie et.al. | 2407.05238 | link |
2024-07-06 | VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking | Xuefeng Jiang et.al. | 2407.05017 | null |
2024-07-05 | TF-SASM: Training-free Spatial-aware Sparse Memory for Multi-object Tracking | Thuc Nguyen-Quang et.al. | 2407.04327 | null |
2024-07-08 | SSP-GNN: Learning to Track via Bilevel Optimization | Griffin Golias et.al. | 2407.04308 | null |
2024-07-05 | FeatureSORT: Essential Features for Effective Tracking | Hamidreza Hashempoor et.al. | 2407.04249 | null |
2024-07-04 | Attention Normalization Impacts Cardinality Generalization in Slot Attention | Markus Krimmel et.al. | 2407.04170 | link |
2024-07-04 | TrackPGD: A White-box Attack using Binary Masks against Robust Transformer Trackers | Fatemeh Nourilenjan Nokabadi et.al. | 2407.03946 | link |
2024-07-03 | Applying Extended Object Tracking for Self-Localization of Roadside Radar Sensors | Longfei Han et.al. | 2407.03084 | null |
2024-07-02 | FlowTrack: Point-level Flow Network for 3D Single Object Tracking | Shuo Li et.al. | 2407.01959 | null |
2024-07-02 | The Solution for the ICCV 2023 Perception Test Challenge 2023 – Task 6 – Grounded videoQA | Hailiang Zhang et.al. | 2407.01907 | null |
2024-06-30 | DroBoost: An Intelligent Score and Model Boosting Method for Drone Detection | Ogulcan Eryuksel et.al. | 2407.00830 | null |
2024-06-30 | Engineering an Efficient Object Tracker for Non-Linear Motion | Momir Adžemović et.al. | 2407.00738 | null |
2024-06-28 | PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators | Kuo-Hao Zeng et.al. | 2406.20083 | null |
2024-06-28 | eMoE-Tracker: Environmental MoE-based Transformer for Robust Event-guided Object Tracking | Yucheng Chen et.al. | 2406.20024 | null |
2024-06-28 | StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction | Jiaheng Zhuang et.al. | 2406.19844 | null |
2024-06-28 | Basketball-SORT: An Association Method for Complex Multi-object Occlusion Problems in Basketball Multi-object Tracking | Qingrui Hu et.al. | 2406.19655 | null |
2024-06-28 | Optimal Video Compression using Pixel Shift Tracking | Hitesh Saai Mananchery Panneerselvam et.al. | 2406.19630 | link |
2024-06-26 | Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos | Colton Stearns et.al. | 2406.18717 | link |
2024-06-26 | BiTrack: Bidirectional Offline 3D Multi-Object Tracking Using Camera-LiDAR Data | Kemiao Huang et.al. | 2406.18414 | link |
2024-06-24 | POPCat: Propagation of particles for complex annotation tasks | Adam Srebrnjak Yang et.al. | 2406.17183 | null |
2024-06-24 | A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking | Lorenzo Shaikewitz et.al. | 2406.16837 | link |
2024-06-24 | The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers | Abhi Kamboj et.al. | 2406.16784 | null |
2024-06-21 | LU2Net: A Lightweight Network for Real-time Underwater Image Enhancement | Haodong Yang et.al. | 2406.14973 | null |
2024-06-22 | Velocity Analysis of Moving Objects in Earth Observation Satellite Images Using Multi-Spectral Push Broom Scanning | Eric Keto et.al. | 2406.13710 | null |
2024-06-19 | Hierarchical IoU Tracking based on Interval | Yunhao Du et.al. | 2406.13271 | link |
2024-06-19 | Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models | Akchay Srivastava et.al. | 2406.13232 | null |
2024-06-17 | Deep HM-SORT: Enhancing Multi-Object Tracking in Sports with Deep Features, Harmonic Mean, and Expansion IOU | Matias Gran-Henriksen et.al. | 2406.12081 | null |
2024-06-17 | VideoVista: A Versatile Benchmark for Video Understanding and Reasoning | Yunxin Li et.al. | 2406.11303 | null |
2024-06-14 | Robust compressive tracking via online weighted multiple instance learning | Sandeep Singh Sengar et.al. | 2406.09914 | null |
2024-06-13 | Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking | Prithviraj Banerjee et.al. | 2406.09598 | null |
2024-06-12 | LaMOT: Language-Guided Multi-Object Tracking | Yunhao Li et.al. | 2406.08324 | link |
2024-06-12 | Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance | Yasod Ginige et.al. | 2406.08294 | null |
2024-06-11 | Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos | Duc Pham et.al. | 2406.07680 | null |
2024-06-11 | Haptic Repurposing with GenAI | Haoyu Wang et.al. | 2406.07228 | null |
2024-06-11 | UVIS: Unsupervised Video Instance Segmentation | Shuaiyi Huang et.al. | 2406.06908 | null |
2024-06-09 | ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05810 | null |
2024-06-09 | SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05800 | null |
2024-06-08 | Training-Free Robust Interactive Video Object Segmentation | Xiaoli Wei et.al. | 2406.05485 | null |
2024-06-07 | Bootstrapping Referring Multi-Object Tracking | Yani Zhang et.al. | 2406.05039 | link |
2024-06-07 | Multi-Granularity Language-Guided Multi-Object Tracking | Yuhao Li et.al. | 2406.04844 | link |
2024-06-06 | Matching Anything by Segmenting Anything | Siyuan Li et.al. | 2406.04221 | link |
2024-06-06 | ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints | Divij Handa et.al. | 2406.04046 | null |
2024-06-04 | UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking | Lijun Zhou et.al. | 2406.02147 | null |
2024-06-03 | Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers | Fatemeh Nourilenjan Nokabadi et.al. | 2406.01765 | link |
2024-06-03 | Prototypical Transformer as Unified Motion Learners | Cheng Han et.al. | 2406.01559 | null |
2024-06-03 | Convolutional Unscented Kalman Filter for Multi-Object Tracking with Outliers | Shiqi Liu et.al. | 2406.01380 | null |
2024-06-03 | Programmable Multi-input Buck-Boost Converter for Photovoltaics Arrays | Zhongting Tang et.al. | 2406.01193 | null |
2024-06-03 | Multi-Object Tracking based on Imaging Radar 3D Object Detection | Patrick Palmer et.al. | 2406.01011 | null |
2024-06-01 | Towards Generalizable Multi-Object Tracking | Zheng Qin et.al. | 2406.00429 | link |
2024-05-30 | WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark | Chunhui Zhang et.al. | 2405.19818 | link |
2024-05-29 | DGD: Dynamic 3D Gaussians Distillation | Isaac Labe et.al. | 2405.19321 | null |
2024-05-28 | Track Initialization and Re-Identification for~3D Multi-View Multi-Object Tracking | Linh Van Ma et.al. | 2405.18606 | link |
2024-05-28 | Reliable Object Tracking by Multimodal Hybrid Feature Extraction and Transformer-Based Fusion | Hongze Sun et.al. | 2405.17903 | link |
2024-05-28 | Towards a Generalist and Blind RGB-X Tracker | Yuedong Tan et.al. | 2405.17773 | link |
2024-06-03 | BaboonLand Dataset: Tracking Primates in the Wild and Automating Behaviour Recognition from Drone Videos | Isla Duporge et.al. | 2405.17698 | null |
2024-05-27 | Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association | Tingwei Liu et.al. | 2405.17323 | null |
2024-05-24 | ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking | Xudong Han et.al. | 2405.15755 | null |
2024-05-24 | Trackastra: Transformer-based cell tracking for live-cell microscopy | Benjamin Gallusser et.al. | 2405.15700 | link |
2024-05-24 | An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking | Pratyusha Musunuru et.al. | 2405.15137 | null |
2024-05-23 | Awesome Multi-modal Object Tracking | Chunhui Zhang et.al. | 2405.14200 | link |
2024-05-23 | Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning | Zhenyu Wei et.al. | 2405.14195 | null |
2024-05-23 | PuTR: A Pure Transformer for Decoupled and Online Multi-Object Tracking | Chongwei Liu et.al. | 2405.14119 | link |
2024-05-22 | Multi Player Tracking in Ice Hockey with Homographic Projections | Harish Prakash et.al. | 2405.13397 | null |
2024-05-20 | Building Temporal Kernels with Orthogonal Polynomials | Yan Ru Pei et.al. | 2405.12179 | link |
2024-05-20 | WiDRa – Enabling Millimeter-Level Differential Ranging Accuracy in Wi-Fi Using Carrier Phase | Vishnu V. Ratnam et.al. | 2405.12168 | null |
2024-05-20 | DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM | Xuchen Li et.al. | 2405.12139 | null |
2024-05-20 | A Vision on Open Science for the Evolution of Software Engineering Research and Practice | Edson OliveiraJr et.al. | 2405.12132 | null |
2024-05-20 | PATE: Proximity-Aware Time series anomaly Evaluation | Ramin Ghorbani et.al. | 2405.12096 | link |
2024-05-20 | SEMv3: A Fast and Robust Approach to Table Separation Line Detection | Chunxia Qin et.al. | 2405.11862 | link |
2024-05-20 | Online Learning Feedback Control Considering Hysteresis for Musculoskeletal Structures | Kento Kawaharazuka et.al. | 2405.11808 | null |
2024-05-20 | CDM-MPC: An Integrated Dynamic Planning and Control Framework for Bipedal Robots Jumping | Zhicheng He et.al. | 2405.11773 | null |
2024-05-19 | PBI: Position-Based Dynamics Handles Updated Lagrangian Inelasticity | Chang Yu et.al. | 2405.11694 | null |
2024-05-19 | Auto-Platoon : Freight by example | Tharun V. Puthanveettil et.al. | 2405.11659 | link |
2024-05-19 | Track Anything Rapter(TAR) | Tharun V. Puthanveettil et.al. | 2405.11655 | link |
2024-05-19 | RobMOT: Robust 3D Multi-Object Tracking by Observational Noise and State Estimation Drift Mitigation on LiDAR PointCloud | Mohamed Nagy et.al. | 2405.11536 | link |
2024-05-17 | Air Signing and Privacy-Preserving Signature Verification for Digital Documents | P. Sarveswarasarma et.al. | 2405.10868 | link |
2024-05-17 | Review on physical impedance models in perovskite solar cells | Rajat Kumar Goyal et.al. | 2405.10855 | null |
2024-05-17 | Model Predictive Contouring Control for Vehicle Obstacle Avoidance at the Limit of Handling Using Torque Vectoring | Alberto Bertipaglia et.al. | 2405.10847 | null |
2024-05-17 | Heterogeneity-Informed Meta-Parameter Learning for Spatiotemporal Time Series Forecasting | Zheng Dong et.al. | 2405.10800 | link |
2024-05-17 | Anomalous relaxation of coarsening foams with viscoelastic continuous phase | Chiara Guidolin et.al. | 2405.10657 | null |
2024-05-17 | Cyclical Weight Consolidation: Towards Solving Catastrophic Forgetting in Serial Federated Learning | Haoyue Song et.al. | 2405.10647 | null |
2024-05-17 | COMET: NFT Price Prediction with Wallet Profiling | Tianfu Wang et.al. | 2405.10640 | link |
2024-05-17 | Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track | Xiaoshuai Hao et.al. | 2405.10567 | null |
2024-05-17 | Dynamic Cluster Analysis to Detect and Track Novelty in Network Telescopes | Kai Huang et.al. | 2405.10545 | null |
2024-05-17 | Hawkes Models And Their Applications | Patrick J. Laub et.al. | 2405.10527 | null |
2024-05-16 | A Novel Bounding Box Regression Method for Single Object Tracking | Omar Abdelaziz et.al. | 2405.10444 | null |
2024-05-16 | Beyond Traditional Single Object Tracking: A Survey | Omar Abdelaziz et.al. | 2405.10439 | null |
2024-05-16 | Spatial Cognition: a Wave Hypothesis | Robert Worden et.al. | 2405.10112 | null |
2024-05-14 | Learning Correspondence for Deformable Objects | Priya Sundaresan et.al. | 2405.08996 | null |
2024-05-14 | ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association | Shuxiao Ding et.al. | 2405.08909 | link |
2024-05-14 | EchoTracker: Advancing Myocardial Point Tracking in Echocardiography | Md Abulkalam Azad et.al. | 2405.08587 | link |
Defocus
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-15 | Digital defocus aberration interference for automated optical microscopy | Haowen Zhou et.al. | 2507.10867 | null |
2025-07-01 | Efficient Depth- and Spatially-Varying Image Simulation for Defocus Deblur | Xinge Yang et.al. | 2507.00372 | null |
2025-07-09 | High-quality metalens enables minimally invasive CFB endoscopy | Ruixiang Song et.al. | 2506.21379 | null |
2025-06-26 | Quantitative structure determination from experimental four-dimensional scanning transmission electron microscopy via the scattering matrix | Emmanuel W. C. Terzoudis-Lumsden et.al. | 2506.21004 | null |
2025-06-22 | On the Particle Image Overlap in Single Camera Defocusing Approaches | Christian Sax et.al. | 2506.18170 | null |
2025-06-25 | Dark Channel-Assisted Depth-from-Defocus from a Single Image | Moushumi Medhi et.al. | 2506.06643 | null |
2025-05-29 | Dc-EEMF: Pushing depth-of-field limit of photoacoustic microscopy via decision-level constrained learning | Wangting Zhou et.al. | 2506.03181 | null |
2025-05-31 | Fovea Stacking: Imaging with Dynamic Localized Aberration Correction | Shi Mao et.al. | 2506.00716 | null |
2025-05-30 | High resolution up-conversion imaging in the 10 μm band under incoherent illumination | Zhao-Qi-Zhi Han et.al. | 2505.24367 | null |
2025-05-30 | Fourier ptychographic microscopy aided with transport of intensity equation for robust full phase spectrum reconstruction | Mikołaj Rogalski et.al. | 2505.24322 | null |
2025-07-02 | Real-Time Blind Defocus Deblurring for Earth Observation: The IMAGIN-e Mission Approach | Alejandro D. Mousist et.al. | 2505.22128 | null |
2025-05-27 | Any-to-Bokeh: One-Step Video Bokeh via Multi-Plane Image Guided Diffusion | Yang Yang et.al. | 2505.21593 | null |
2025-05-23 | Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues | Chinmay Talegaonkar et.al. | 2505.17358 | null |
2025-05-19 | Combinatorial Sample-and Back-Focal-Plane (BFP) Imaging. Pt. I: Instrument and acquisition parameters affecting BFP images and their analysis | Omer Shavit et.al. | 2505.13190 | null |
2025-05-12 | Apple’s Synthetic Defocus Noise Pattern: Characterization and Forensic Applications | David Vázquez-Padín et.al. | 2505.07380 | null |
2025-05-09 | Development of precession Lorentz transmission electron microscopy | Shunsuke Hayashi et.al. | 2505.05790 | null |
2025-05-07 | Image Restoration via Multi-domain Learning | Xingyu Jiang et.al. | 2505.05504 | link |
2025-05-08 | Differentiation of Distinct Single Atoms via Multi-Defocus Fusion Method | Yangfan Li et.al. | 2505.04078 | null |
2025-05-09 | Back-illumination interference tomography for imaging weak scattering in thick tissues | Gregory N. McKay et.al. | 2504.19278 | null |
2025-04-25 | Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models | Patrick Müller et.al. | 2504.18510 | null |
2025-04-24 | Surface morphology and thickness variation estimation of zeolites via electron ptychography | Enci Zhang et.al. | 2504.17501 | null |
2025-04-23 | Dual-Camera All-in-Focus Neural Radiance Fields | Xianrui Luo et.al. | 2504.16636 | null |
2025-04-15 | Focal Split: Untethered Snapshot Depth from Differential Defocus | Junjie Luo et.al. | 2504.11202 | null |
2025-04-15 | Three-dimensional neural network driving self-interference digital holography enables high-fidelity, non-scanning volumetric fluorescence microscopy | Tianlong Man et.al. | 2504.10769 | null |
2025-04-14 | Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials | Jingyun Yang et.al. | 2504.10281 | null |
2025-04-11 | Optical vortex trajectories as probes for wavefront aberrations | Aleksandra K. Korzeniewska et.al. | 2504.08643 | null |
2025-03-31 | InstructRestore: Region-Customized Image Restoration with Human Instructions | Shuaizheng Liu et.al. | 2503.24357 | link |
2025-03-30 | Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries | Wei Xu et.al. | 2503.23606 | null |
2025-03-26 | Spectrum from Defocus: Fast Spectral Imaging with Chromatic Focal Stack | M. Kerem Aydin et.al. | 2503.20184 | null |
2025-03-24 | MaSS13K: A Matting-level Semantic Segmentation Benchmark | Chenxi Xie et.al. | 2503.18364 | link |
2025-03-22 | Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration | Yawei Li et.al. | 2503.17825 | null |
2025-03-25 | Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures | Tim Seizinger et.al. | 2503.16067 | link |
2025-03-18 | The Power of Context: How Multimodality Improves Image Super-Resolution | Kangfu Mei et.al. | 2503.14503 | null |
2025-03-18 | Intra and Inter Parser-Prompted Transformers for Effective Image Restoration | Cong Wang et.al. | 2503.14037 | link |
2025-03-16 | Pathology Image Restoration via Mixture of Prompts | Jiangdong Cai et.al. | 2503.12399 | link |
2025-03-24 | Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models | Armando Fortes et.al. | 2503.08434 | null |
2025-03-12 | Free Your Hands: Lightweight Relightable Turntable Capture Pipeline | Jiahui Fan et.al. | 2503.05511 | null |
2025-03-03 | Blind Augmentation: Calibration-free Camera Distortion Model Estimation for Real-time Mixed-reality Consistency | Siddhant Prakash et.al. | 2503.01387 | link |
2025-03-13 | DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting | Liao Shen et.al. | 2503.00746 | null |
2025-01-24 | Linnik point spread functions, time-reversed logarithmic diffusion equations, and blind deconvolution of electron microscope imagery | Alfred S. Carasso et.al. | 2502.19420 | null |
2025-02-20 | Exploiting Deblurring Networks for Radiance Fields | Haeyun Choi et.al. | 2502.14454 | link |
2025-02-16 | Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation | Kunal Swami et.al. | 2502.11002 | null |
2025-02-11 | CodePhys: Robust Video-based Remote Physiological Measurement through Latent Codebook Querying | Shuyang Chu et.al. | 2502.07526 | null |
2025-02-10 | SparseFocus: Learning-based One-shot Autofocus for Microscopy with Sparse Content | Yongping Zhai et.al. | 2502.06452 | null |
2025-02-13 | Self-similar Features in Sub-secondary Breakup of a Droplet and Ligament Mediated Fragmentation under Extreme Conditions | Saini Jatin Rao et.al. | 2502.05976 | null |
2025-01-29 | Five-dimensional single-shot fluorescence imaging using a polarized Fourier light-field microscope | Oumeng Zhang et.al. | 2501.18047 | null |
2025-01-25 | Image formation theory of optical coherence tomography with optical aberrations and its application for computational aberration correction | Shuichi Makita et.al. | 2501.15011 | null |
2025-01-23 | Theoretical analysis of performance limitation of computational refocusing in optical coherence tomography | Yue Zhu et.al. | 2501.13874 | null |
2025-01-16 | SE-BSFV: Online Subspace Learning based Shadow Enhancement and Background Suppression for ViSAR under Complex Background | Shangqu Yan et.al. | 2501.09341 | null |
2025-02-23 | Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks | Shuang Cui et.al. | 2501.09052 | null |
2024-12-24 | Dissecting CLIP: Decomposition with a Schur Complement-based Approach | Azim Ospanov et.al. | 2412.18645 | link |
2024-12-20 | CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images | Jungho Lee et.al. | 2412.16028 | null |
2025-01-06 | LEDiff: Latent Exposure Diffusion for HDR Generation | Chao Wang et.al. | 2412.14456 | null |
2024-12-29 | AKiRa: Augmentation Kit on Rays for optical video generation | Xi Wang et.al. | 2412.14158 | null |
2024-12-17 | Strain engineering of magnetic anisotropy in the kagome magnet Fe3Sn2 | D. Kong et.al. | 2412.12684 | null |
2024-12-16 | Photoacoustic microscopy with meta-optics | Dorian S. H. Brandmüller et.al. | 2412.11733 | null |
2024-12-11 | Dense Depth from Event Focal Stack | Kenta Horikawa et.al. | 2412.08120 | null |
2024-11-15 | Resilient Stellarator Divertor Characteristics in the Helically Symmetric eXperiment | K. A. Garcia et.al. | 2411.10611 | null |
2024-10-18 | Variable Aperture Bokeh Rendering via Customized Focal Plane Guidance | Kang Chen et.al. | 2410.14400 | link |
2024-11-15 | Feature Extraction Reimagined: Achieving Superior Accuracy in Camera Calibration | Zezhun Shi et.al. | 2410.13371 | link |
2024-10-08 | First experimental study of multiple orientation muon tomography, with image optimization in sparse data environments | Jesus J. Valencia et.al. | 2410.07264 | null |
2024-10-02 | Recording dynamic facial micro-expressions with a multi-focus camera array | Lucas Kreiss et.al. | 2410.01973 | null |
2024-10-29 | EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis | Alexander Mai et.al. | 2410.01804 | null |
2024-10-02 | Frequency-Dependent F-Numbers Suppress Grating Lobes and Improve the Lateral Resolution in Line-by-Line Scanning | Martin F. Schiffner et.al. | 2410.01593 | null |
2024-10-02 | Estimating Atmospheric Wind Speeds From Gemini Planet Imager AO Telemetry | Zhenxi Du et.al. | 2410.01193 | null |
2024-09-28 | Extending Depth of Field for Varifocal Multiview Images | Zhilong Li et.al. | 2409.19220 | null |
2024-09-26 | PNR: Physics-informed Neural Representation for high-resolution LFM reconstruction | Jiayin Zhao et.al. | 2409.18223 | null |
2024-09-26 | Reblurring-Guided Single Image Defocus Deblurring: A Learning Framework with Misaligned Training Pairs | Xinya Shu et.al. | 2409.17792 | link |
2024-09-18 | Depth Estimation Based on 3D Gaussian Splatting Siamese Defocus | Jinchang Zhang et.al. | 2409.12323 | null |
2024-09-16 | Depth from Coupled Optical Differentiation | Junjie Luo et.al. | 2409.10725 | link |
2024-09-16 | Focus diverse phase retrieval test results on broadband continuous wavefront sensing in space telescope applications | Hyukmo Kang et.al. | 2409.10500 | null |
2024-09-15 | Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation | Xiaolong Qian et.al. | 2409.09754 | link |
2024-09-14 | Innovative schemes for Correlation Plenoptic Imaging | Gianlorenzo Massaro et.al. | 2409.09459 | null |
2024-09-14 | Plenoptic microscopy and photography from intensity correlations | Francesco V. Pepe et.al. | 2409.09456 | null |
2024-09-03 | F2former: When Fractional Fourier Meets Deep Wiener Deconvolution and Selective Frequency Transformer for Image Deblurring | Subhajit Paul et.al. | 2409.02056 | null |
2024-08-17 | Pupil-Adaptive 3D Holography Beyond Coherent Depth-of-Field | Yujie Wang et.al. | 2409.00028 | null |
2024-08-05 | Joint-Motion Mutual Learning for Pose Estimation in Videos | Sifan Wu et.al. | 2408.02285 | null |
2024-08-28 | Enhancing Quantitative Image Synthesis through Pretraining and Resolution Scaling for Bone Mineral Density Estimation from a Plain X-ray Image | Yi Gu et.al. | 2407.20495 | link |
2024-07-26 | 3D Orbital Angular Momentum Nonlinear Holography | Feiyang Shen et.al. | 2407.18696 | null |
2024-07-23 | HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images | Shreyas Singh et.al. | 2407.16503 | link |
2024-07-21 | A Novel Method to Improve Quality Surface Coverage in Multi-View Capture | Wei-Lun Huang et.al. | 2407.15883 | null |
2024-07-20 | A New Dataset and Framework for Real-World Blurred Images Super-Resolution | Rui Qin et.al. | 2407.14880 | link |
2024-07-15 | Automated high-resolution backscattered-electron imaging at macroscopic scale | Zhiyuan Lang et.al. | 2407.10628 | null |
2024-07-24 | Inverse-designed 3D laser nanoprinted phase masks to extend the depth of field of imaging systems | T. J. Sturges et.al. | 2407.08482 | null |
2024-07-11 | GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views | Vinayak Gupta et.al. | 2407.08221 | link |
2024-07-31 | Dynamic Neural Radiance Field From Defocused Monocular Video | Xianrui Luo et.al. | 2407.05586 | null |
2024-07-01 | Point-Spread Function of the Optics in Scanning Electron Microscopes | Surya Kamal et.al. | 2407.01439 | null |
2024-06-27 | Super-resolution imaging using super-oscillatory diffractive neural networks | Hang Chen et.al. | 2406.19126 | null |
2024-06-27 | The Space Coronagraph Optical Bench (SCoOB): 5. End-to-end simulations of polarization aberrations | Ramya M Anche et.al. | 2406.18886 | null |
2024-06-22 | Robust Ptychographic Reconstruction with an Out-of-Focus Electron Probe | Shoucong Ning et.al. | 2406.15879 | null |
2024-06-15 | fNeRF: High Quality Radiance Fields from Practical Cameras | Yi Hua et.al. | 2406.10633 | null |
2024-06-12 | Striving towards robust phase diversity on-sky: Implementing LIFT for VLT/MUSE-NFM | Arseniy Kuznetsov et.al. | 2406.08529 | link |
2024-06-21 | Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field | Chao Wang et.al. | 2406.07329 | null |
2024-06-06 | Single Exposure Quantitative Phase Imaging with a Conventional Microscope using Diffusion Models | Gabriel della Maggiora et.al. | 2406.04388 | null |
2024-06-03 | Improved Three-Dimensional Reconstructions in Electron Ptychography through Defocus Series Measurements | Marcel Schloz et.al. | 2406.01141 | null |
2024-06-02 | End-to-End Hybrid Refractive-Diffractive Lens Design with Differentiable Ray-Wave Model | Xinge Yang et.al. | 2406.00834 | null |
2024-06-10 | In vivo fundus imaging and computational refocusing with a diffuser-based fundus camera | Corey Simmerer et.al. | 2406.00122 | null |
2024-05-31 | Axial HoloTile: Extended Depth-of-Focus of Dynamic Holographic Light Projections | Andreas Erik Gejl Madsen et.al. | 2405.20997 | null |
2024-05-27 | DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Refocusing,Defocus Rendering and Blur Removal | Yujie Wang et.al. | 2405.17351 | null |
2024-05-20 | Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction | Aryan Garg et.al. | 2405.11823 | null |
2024-06-04 | Single-shot volumetric fluorescence imaging with neural fields | Oumeng Zhang et.al. | 2405.10463 | null |
2024-05-09 | Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft | Debabrata Pal et.al. | 2405.05574 | null |
2024-04-05 | Robust Gaussian Splatting | François Darmon et.al. | 2404.04211 | null |
2024-04-05 | Deep Phase Coded Image Prior | Nimrod Shabtay et.al. | 2404.03906 | null |
2024-04-02 | Multiple scattering suppression for in vivo optical coherence tomography measurement using B-scan-wise multi-focus averaging method | Yiqiang Zhu et.al. | 2404.01811 | null |
2024-03-29 | Depth from Defocus Technique for High Number Densities and Non-spherical Particles | Rixin Xua et.al. | 2403.20004 | null |
2024-04-01 | Video-Based Human Pose Regression via Decoupled Space-Time Aggregation | Jijie He et.al. | 2403.19926 | link |
2024-03-21 | Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data | Michael John Fanous et.al. | 2403.14324 | null |
2024-05-06 | Expected Impact of Glints from Space Debris in the LSST | J. Anthony Tyson et.al. | 2403.04942 | null |
2024-02-25 | Forward and inverse modeling of depth-of-field effects in background-oriented schlieren | Joseph P. Molnar et.al. | 2402.15954 | null |
2024-02-12 | Roll-to-roll tomographic volumetric additive manufacturing for continuous production of microstructures on long flexible substrates | Joseph Toombs et.al. | 2402.10955 | null |
2024-04-03 | Ptycho-endoscopy on a lensless ultrathin fiber bundle tip | Pengming Song et.al. | 2401.17213 | null |
2024-02-09 | Exploring one giga electronvolt cosmic gamma rays with a Cherenkov plenoscope capable of recording atmospheric light fields, Part 1: Optics | Sebastian Achim Mueller et.al. | 2401.16148 | null |
2024-01-29 | Light-field imaging from position-momentum correlations | Davide Giannella et.al. | 2401.16129 | null |
2024-01-25 | Single- and multi-layer micro-scale diffractive lens fabrication for fiber imaging probes with versatile depth-of-field | Fei He et.al. | 2401.14551 | null |