🔗 References¶

[1] Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021, July). Learning transferable visual models from natural language supervision. In International conference on machine learning (pp. 8748-8763). PMLR.
[2] Saab, K., Tu, T., Weng, W. H., Tanno, R., Stutz, D., Wulczyn, E., ... & Natarajan, V. (2024). Capabilities of gemini models in medicine. arXiv preprint arXiv:2404.18416.
[3] Xu, H., Usuyama, N., Bagga, J., Zhang, S., Rao, R., Naumann, T., ... & Poon, H. (2024). A whole-slide foundation model for digital pathology from real-world data. Nature, 1-8.
[4] Chen, R. J., Chen, C., Li, Y., Chen, T. Y., Trister, A. D., Krishnan, R. G., & Mahmood, F. (2022). Scaling vision transformers to gigapixel images via hierarchical self-supervised learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 16144-16155).
[5] Shaikovski, G., Casson, A., Severson, K., Zimmermann, E., Wang, Y. K., Kunz, J. D., ... & Fuchs, T. J. (2024). PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology.
[6] DeYoung, J., Beltagy, I., van Zuylen, M., Kuehl, B., & Wang, L. L. (2021). Ms2: Multi-document summarization of medical studies. arXiv preprint arXiv:2104.06486.
[7] Shor, J., Bi, R. A., Venugopalan, S., Ibara, S., Goldenberg, R., & Rivlin, E. (2023). Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings. arXiv preprint arXiv:2303.05737.