461-02-015 ISMRM Abstract

Image quality assessment of MR image enhancement with large vision-language models

Primary: Analysis Methods - Image Enhancement

Secondary: Acquisition & Reconstruction - AI methods

461-02-015 · AI To Make Protocols, Plan, QC, and Correct Motion · Tuesday, 12 May, 9:15 AM–10:10 AM · Digital Posters Row B

Keywords: Large Language Models Image Quality Assessment Vision-language model Image enhacement

Accepted

Caohui Duan ¹, Dong Zhang², Jianxing Hu¹, Xiaonan Xu³, Youmin Li³, Xin Lou¹

¹Department of Radiology, The First Medical Center, Chinese PLA General Hospital, Beijing, China

²Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, Canada

³The Beian Hospital of Beidahuang Group, Heihe City, China

Presenting Author: Caohui Duan

Synopsis

Motivation:

Goals:

Approach:

Results:

Full abstract & presentation

The full text, figures, and any recorded presentation for this abstract are not shown here. Log in if you are a member or registered attendee with access.

Full abstracts, figures, and presentations for Cape Town - 2026 ISMRM-ISMRT Annual Meeting and Exhibition are available to registered attendees. This content becomes freely available to the public roughly two years after the meeting.

To request or purchase access, contact the ISMRM Central Office at info@ismrm.org.

References

1. Chen Z, Hu B, Niu C, et al. IQAGPT: computed tomography image quality assessment with vision-language and ChatGPT models. Vis Comput Ind Biomed Art. 2024;7(1): 20. https://doi.org/10.1186/s42492-024-00171-w. [doi]

2. Wu T, Ma K, Liang J, et al. A comprehensive study of multimodal large language models for image quality assessment. In: Proceedings of the European conference on computer vision (ECCV), 2024. pp. 143-160. https://doi.org/10.1007/978-3-031-72904-1_9. [doi]

3. OpenAI. Introducing gpt-5. https://openai.com. Accessed August 7 2025

4. Anthropic. Introducing Claude Sonnet 4.5. https://www.anthropic.com. Accessed September 30 2025.

5. Bai J, Bai S, Yang S, et al. Qwen-vl: A versatile vision-language model for understanding, localization, text reading, and beyond. arXiv preprint, 2023. arXiv:2308.12966.

6. Wu Z, Chen X, Pan Z, et al. Deepseek-vl2: Mixture-of-experts vision-language models for advanced multimodal understanding. arXiv preprint, 2024. arXiv:2412.10302.

7. Lin H, Figini M, D’Arco F, et al. Low-field magnetic resonance image enhancement via stochastic image quality transfer. Med Image Anal. 2023;87: 102807. https://doi.org/10.1016/j.media.2023.102807. [doi]

8. Saharia C, Ho J, Chan W, et al. Image super-resolution via iterative refinement. IEEE Trans Pattern Anal Mach Intell. 2022;45(4): 4713-4726. https://doi.org/10.1109/TPAMI.2022.3204461. [doi]

9. Isola P, Zhu J Y, Zhou T, et al. Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017. pp. 1125-1134. https://doi.org/10.1109/CVPR.2017.632 [doi]

10. Wang X, Yu K, Wu S, et al. Esrgan: Enhanced super-resolution generative adversarial networks. In: Proceedings of the European conference on computer vision (ECCV) workshops, 2018. pp. 63-79.

11. Han K, Xiang W. Inference-reconstruction variational autoencoder for light field image reconstruction. IEEE Trans Image Process. 2022;31: 5629-5644. https://doi.org/10.1109/TIP.2022.3197976. [doi]

12. Chen F, Taviani V, Malkiel I, et al. Variable-density single-shot fast spin-echo MRI with deep learning reconstruction by using variational networks. Radiology;2018: 289(2):366-373. https://doi.org/10.1148/radiol.2018180445 [doi]

Cite this abstract

http://echo.ismrm.org/p/ISMRM2026/461-02-015