Accepted Camera Ready Papers
Proceeding Track
-
Nayana: A Foundation for Document-Centric Vision-Language Models via Multi-Task, Multimodal, and Multilingual Data Synthesis
- Open Review
-
Simulating Refractive Distortions and Weather-Induced Artifacts for Resource-Constrained Autonomous Perception
- Open Review
-
V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard?
- Open Review
-
FLD+: Data-efficient Evaluation Metric for Generative Models
- Open Review
-
WavePaint: Resource-efficient Token-mixer for Self-supervised Inpainting
- Open Review
Non-Proceeding Track
-
Neural Collapse Strikes Back: Lightweight Transfer of Large Vision Model for Cross-Scene Spectral Generalization
- Open Review
-
Multiscale Diagnostics of Visual Language Models
- Open Review
-
Vision-Language Models display a strong gender bias
- Open Review
-
Mudra-VLM: Adapting Vision-Language Models for Fine-Grained Bharatanatyam Mudra Recognition
- Open Review
-
Efficient Low-Resolution Chest X-Ray Diagnosis via Vision Transformers with Collaborative Distillation and Coreset Selection
- Open Review
-
Lightweight Vision Models for Remote Sensing in Low-Resource Settings
- Open Review
-
Agro-Consensus: Semantic Self-Consistency in Vision-Language Models for Crop Disease Management in Developing Countries
- Open Review
-
VeS: Teaching Pixels to Listen Without Supervision
- Open Review
-
Fine-Grained Action Quality Assessment in Sports Using Pose-Based Representations
- Open Review
-
Smart Eyes for Silent Threats: VLMs and In-Context Learning for THz Imaging
- Open Review
-
Dynamic Inter-Class Confusion-Aware Encoder for Audio-Visual Fusion in Human Activity Recognition
- Open Review
-
AI-Derived Structural Building Intelligence for Urban Resilience: An Application in Saint Vincent and the Grenadines
- Open Review
-
NUTS: Eddy-Robust Reconstruction of Surface Ocean Nutrients via Two-Scale Modeling
- Open Review