Index of code for CVPR 2024 papers
Click a "Not reproducible" label to see its error log. 😜
Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains
Jupyter Notebook
Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation
Python
Towards Co-Evaluation of Cameras, HDR, and Algorithms for Industrial-Grade 6DoF Pose Estimation
No code
Domain-Specific Block Selection and Paired-View Pseudo-Labeling for Online Test-Time Adaptation
Python
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications
Python
From Correspondences to Pose: Non-minimal Certifiably Optimal Relative Pose without Disambiguation
Python
SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction
Python
Not reproducible
SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects
Python
Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning
Python
Not reproducible
NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation
Python
Brain Decodes Deep Nets
Jupyter Notebook
InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion
Python
Not reproducible
CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion
Python
Deep-TROJ: An Inference Stage Trojan Insertion Algorithm through Efficient Weight Replacement Attack
Python
One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models
Python
Understanding Video Transformers via Universal Concept Discovery
Jupyter Notebook
StyLitGAN: Image-based Relighting via Latent Control
Jupyter Notebook
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers
Jupyter Notebook
Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving
Python
Not reproducible
HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding
JavaScript
PoNQ: a Neural QEM-based Mesh Representation
Jupyter Notebook
Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining
Python
Not reproducible
In2SET: Intra-Inter Similarity Exploiting Transformer for Dual-Camera Compressive Hyperspectral Imaging
Python
Not reproducible
From a Bird’s Eye View to See: Joint Camera and Subject Registration without the Camera Calibration
Python
DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models
Python
Not reproducible
Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding
Python
Describing Differences in Image Sets with Natural Language
Jupyter Notebook
Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration
Python
Not reproducible
One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
Python
Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
Python
Not reproducible
Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery
Python
Not reproducible
Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation
Python
Not reproducible
FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning
Python
Not reproducible
Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective
Python
Not reproducible
One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications
Python
Not reproducible
Collaborative Learning of Anomalies with Privacy (CLAP) for Unsupervised Video Anomaly Detection: A New Baseline
Jupyter Notebook
A noisy elephant in the room: Is your out-of-distribution detector robust to label noise?
Jupyter Notebook
Improved Implicit Neural Representation with Fourier Reparameterized Training
Python
Not reproducible
In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing
Python
Not reproducible
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Python
Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles
Python
Not reproducible
CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation
JavaScript
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking
Python
Improving Out-of-Distribution Generalization in Graphs via Hierarchical Semantic Environments
Python
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation
Python
Learning CNN on ViT: A Hybrid Model to Explicitly Class-specific Boundaries for Domain Adaptation
Python
Multiview Aerial Visual RECognition (MAVREC) Dataset: Can Multi-view Improve Aerial Visual Perception?
Jupyter Notebook
CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation
Python
Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains
Python
Not reproducible
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
Python
Not reproducible
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts
Python
Not reproducible
Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacements
Jupyter Notebook
LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion
Jupyter Notebook
Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers
Python
Not reproducible
HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation
Python
Not reproducible
Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates
Python
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding
Jupyter Notebook
Logit Standardization in Knowledge Distillation
Jupyter Notebook
RepViT: Revisiting Mobile CNN From ViT Perspective
Jupyter Notebook
Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning
Python
Not reproducible
Face2Diffusion for Fast and Editable Face Personalization
Jupyter Notebook
RankED: Addressing Imbalance and Uncertainty in Edge Detection Using Ranking-based Losses
Python
Not reproducible
AdaShift: Learning Discriminative Self-Gated Neural Feature Activation With an Adaptive Shift Factor
No code
RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly Detection
Python
Not reproducible
Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval
Python
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
JavaScript
Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition
Python
Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration
Python
Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification
Python
Not reproducible
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
JavaScript
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Python
Not reproducible
Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement
Python
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Python
Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation
Python
Not reproducible
Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households
Python
Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features
Python
Readout Guidance: Learning Control from Diffusion Features
Jupyter Notebook
Learning the 3D Fauna of the Web
JavaScript
Action Scene Graphs for Long-Form Understanding of Egocentric Videos
Jupyter Notebook
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation
JavaScript
Semantics, Distortion, and Style Matter: Towards Source-free UDA for Panoramic Segmentation
Python
Not reproducible
MULDE: Multiscale Log-Density Estimation via Denoising Score Matching for Video Anomaly Detection
Python
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Python
Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling
Python
COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction
Python
Not reproducible
OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition
Python
Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
Jupyter Notebook
DNGaussian: Optimizing Sparse-View 3D Gaussian Radiance Fields with Global-Local Depth Normalization
Python
SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors
Python
Not reproducible
Benchmarking Implicit Neural Representation and Geometric Rendering in Real-Time RGB-D SLAM
Python
Not reproducible
RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D
Python
Not reproducible
When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation
Python
Not reproducible
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
Python
Not reproducible
Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting
Python
3D-LFM: Lifting Foundation Model
Jupyter Notebook
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
No code
Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions
Jupyter Notebook
Mosaic-SDF for 3D Generative Models
JavaScript
Gaussian Shell Maps for Efficient 3D Human Generation
Jupyter Notebook
FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders
Python
Not reproducible
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
Jupyter Notebook
Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension
Python
Not reproducible
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Python
BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection
Python
Not reproducible
DYSON: Dynamic Feature Space Self-Organization for Online Task-Free Class Incremental Learning
Python
CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers
Python
Not reproducible
MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video
Python
Not reproducible
ZERO-IG: Zero-Shot Illumination-Guided Joint Denoising and Adaptive Enhancement for Low-Light Images
Python
Do Vision and Language Encoders Represent the World Similarly?
Jupyter Notebook