Cape Town - 2026 ISMRM-ISMRT Annual Meeting and Exhibition
9 May 2026 – 14 May 2026 · Cape Town, South Africa
461-02-015 ISMRM Abstract

Image quality assessment of MR image enhancement with large vision-language models

Accepted
Caohui Duan 1, Dong Zhang2, Jianxing Hu1, Xiaonan Xu3, Youmin Li3, Xin Lou1
1Department of Radiology, The First Medical Center, Chinese PLA General Hospital, Beijing, China
2Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, Canada
3The Beian Hospital of Beidahuang Group, Heihe City, China
Presenting Author: Caohui Duan

Synopsis

Motivation:
Goals:
Approach:
Results:
Full abstract & presentation

The full text, figures, and any recorded presentation for this abstract are not shown here. Log in if you are a member or registered attendee with access.

Full abstracts, figures, and presentations for Cape Town - 2026 ISMRM-ISMRT Annual Meeting and Exhibition are available to registered attendees. This content becomes freely available to the public roughly two years after the meeting.

To request or purchase access, contact the ISMRM Central Office at info@ismrm.org.

Log in

References

1. Chen Z, Hu B, Niu C, et al. IQAGPT: computed tomography image quality assessment with vision-language and ChatGPT models. Vis Comput Ind Biomed Art. 2024;7(1): 20. https://doi.org/10.1186/s42492-024-00171-w. [doi]
2. Wu T, Ma K, Liang J, et al. A comprehensive study of multimodal large language models for image quality assessment. In: Proceedings of the European conference on computer vision (ECCV), 2024. pp. 143-160. https://doi.org/10.1007/978-3-031-72904-1_9. [doi]
3. OpenAI. Introducing gpt-5. https://openai.com. Accessed August 7 2025
4. Anthropic. Introducing Claude Sonnet 4.5. https://www.anthropic.com. Accessed September 30 2025.
5. Bai J, Bai S, Yang S, et al. Qwen-vl: A versatile vision-language model for understanding, localization, text reading, and beyond. arXiv preprint, 2023. arXiv:2308.12966.
6. Wu Z, Chen X, Pan Z, et al. Deepseek-vl2: Mixture-of-experts vision-language models for advanced multimodal understanding. arXiv preprint, 2024. arXiv:2412.10302.
7. Lin H, Figini M, D’Arco F, et al. Low-field magnetic resonance image enhancement via stochastic image quality transfer. Med Image Anal. 2023;87: 102807. https://doi.org/10.1016/j.media.2023.102807. [doi]
8. Saharia C, Ho J, Chan W, et al. Image super-resolution via iterative refinement. IEEE Trans Pattern Anal Mach Intell. 2022;45(4): 4713-4726. https://doi.org/10.1109/TPAMI.2022.3204461. [doi]
9. Isola P, Zhu J Y, Zhou T, et al. Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017. pp. 1125-1134. https://doi.org/10.1109/CVPR.2017.632 [doi]
10. Wang X, Yu K, Wu S, et al. Esrgan: Enhanced super-resolution generative adversarial networks. In: Proceedings of the European conference on computer vision (ECCV) workshops, 2018. pp. 63-79.
11. Han K, Xiang W. Inference-reconstruction variational autoencoder for light field image reconstruction. IEEE Trans Image Process. 2022;31: 5629-5644. https://doi.org/10.1109/TIP.2022.3197976. [doi]
12. Chen F, Taviani V, Malkiel I, et al. Variable-density single-shot fast spin-echo MRI with deep learning reconstruction by using variational networks. Radiology;2018: 289(2):366-373. https://doi.org/10.1148/radiol.2018180445 [doi]

Cite this abstract