Bigger Isn't Always Better: Towards a General Prior for Medical Image Reconstruction

doi:10.48550/arXiv.2501.07376

Bigger Isn't Always Better: Towards a General Prior for Medical Image Reconstruction

Diffusion model have been successfully applied to many inverse problems, including MRI and CT reconstruction. Researchers typically re-purpose models originally designed for unconditional sampling without modifications. Using two different posterior sampling algorithms, we show empirically that such large networks are not necessary. Our smallest model, effectively a ResNet, performs almost as good as an attention U-Net on in-distribution reconstruction, while being significantly more robust towards distribution shifts. Furthermore, we introduce models trained on natural images and demonstrate that they can be used in both MRI and CT reconstruction, out-performing model trained on medical images in out-of-distribution cases. As a result of our findings, we strongly caution against simply re-using very large networks and encourage researchers to adapt the model complexity to the respective task. Moreover, we argue that a key step towards a general diffusion-based prior is training on natural images.

Publication:

arXiv e-prints

Pub Date:

January 2025

DOI:

10.48550/arXiv.2501.07376

arXiv:

arXiv:2501.07376

Bibcode:

2025arXiv250107376G

Keywords:

Electrical Engineering and Systems Science - Image and Video Processing

E-Print:

To appear in the German Conference on Pattern Recognition proceedings. Code available at https://github.com/VLOGroup/bigger-isnt-always-better

ADS

Bigger Isn't Always Better: Towards a General Prior for Medical Image Reconstruction

Abstract