Assessing Visual Hallucinations in Vision-Enabled Large Language Models
Recent advancements in vision-enabled large language models have prompted a renewed interest in evaluating their capabilities and limitations when interpreting complex visual data. The current research employs ImageNet-A, a dataset specifically designed with adversarially selected images that ...