Accepted Camera-Ready Papers

Proceedings Track

  • Nayana: A Foundation for Document-Centric Vision-Language Models via Multi-Task, Multimodal, and Multilingual Data Synthesis - Open Review
  • Simulating Refractive Distortions and Weather-Induced Artifacts for Resource-Constrained Autonomous Perception - Open Review
  • V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard? - Open Review
  • FLD+: Data-efficient Evaluation Metric for Generative Models - Open Review
  • WavePaint: Resource-efficient Token-mixer for Self-supervised Inpainting - Open Review

Non-Proceedings Track

  • Neural Collapse Strikes Back: Lightweight Transfer of Large Vision Model for Cross-Scene Spectral Generalization - Open Review
  • Multiscale Diagnostics of Visual Language Models - Open Review
  • Vision-Language Models display a strong gender bias - Open Review
  • Mudra-VLM: Adapting Vision-Language Models for Fine-Grained Bharatanatyam Mudra Recognition - Open Review
  • Efficient Low-Resolution Chest X-Ray Diagnosis via Vision Transformers with Collaborative Distillation and Coreset Selection - Open Review
  • Lightweight Vision Models for Remote Sensing in Low-Resource Settings - Open Review
  • Agro-Consensus: Semantic Self-Consistency in Vision-Language Models for Crop Disease Management in Developing Countries - Open Review
  • VeS: Teaching Pixels to Listen Without Supervision - Open Review
  • Fine-Grained Action Quality Assessment in Sports Using Pose-Based Representations - Open Review
  • Smart Eyes for Silent Threats: VLMs and In-Context Learning for THz Imaging - Open Review
  • Dynamic Inter-Class Confusion-Aware Encoder for Audio-Visual Fusion in Human Activity Recognition - Open Review
  • AI-Derived Structural Building Intelligence for Urban Resilience: An Application in Saint Vincent and the Grenadines - Open Review
  • NUTS: Eddy-Robust Reconstruction of Surface Ocean Nutrients via Two-Scale Modeling - Open Review