Challenge Overview - REport Generation in pathology using Pan-Asia Giga-pixel WSIs

REport Generation in pathology using Pan-Asia Giga-pixel WSIs¶

📢 REG2025 Challenge Concluded!¶

We are pleased to announce that the REG2025 Challenge has successfully concluded.
A total of 26 teams submitted their results, and the final scores can be viewed at the following link: Final Leaderboard

🔔 Please note that the winning teams will be officially confirmed after reproducibility verification of the submitted materials.
We have reached out via email to the top 13 teams on the leaderboard to request their submissions. If you are among the participants and did not receive the email, please contact us immediately.

📌 Submission deadline: August 27, 2025, 23:59 (KST)
⚠️ Failure to provide the required materials by the deadline may affect award eligibility.

ℹ️ Notice¶

📢 We have recently detected malware on the server and, as a precautionary measure, have taken it offline for security reasons. The server is currently being fully cleared, and all data is being reuploaded. We will restore access as soon as possible and will post an updated notice once the server is available again.
📢 All servers have been fully restored and are operational. Please review the updated information and download links on the Reg2025-TrainDataset page, as several changes have been made.
📢 In light of this situation, all challenge deadlines have been extended by one week. We sincerely apologize for any inconvenience this may cause and kindly ask all participants to review the updated schedule.
📢 Two servers will be temporarily rebooted for maintenance and data upload. The affected server information is listed on the Reg2025-Traindataset page. If you are currently downloading the training data, please check your server connection status again.
📢 Some TIFF files may display incorrectly due to a photometric tag issue. The corrected files have been re-uploaded to REG_test1_revised.
To fix this locally (if needed), run: tiffset -s 262 2 FILENAME.tiff.

🟢 You are allowed to use pre-trained weights without restriction.
🚫 The use of additional training datasets is strictly prohibited.
⚠️ Some TIFF files were found to be corrupted during the data cleaning process. Reconstructing and redistributing the dataset would take considerable time and could lead to fairness issues among participants. Therefore, we have decided to exclude these files from the challenge.

You can check the list of excluded files in the following CSV file:
corrupted_id.csv

⚠️ We have identified color distortion issues in several .tiff files. The corrected versions of these files have been uploaded to the /REG_train_revised directory on servers 104–107. Please re-download and use the revised files from this directory.
If re-downloading is inconvenient, you can repair the files locally with: tiffset -s 262 6 PIT_file_name.tiff

You can find the list of affected files in the revised_list.csv file available at the following CSV file:
revised_list.csv (Data on the server has been updated.)

🔔 Updates¶

📢 Now open for registration!
📢 Now the TrainDataset is available for download! [Reg2025-Traindataset]
📢 Now the Test1Dataset is available for download! [Reg2025-Testdataset]
📢 Now the Test2Dataset is available for download! [Reg2025-Testdataset]

Kindly refer to the submission instructions at the following link before making your submission:
https://reg2025.grand-challenge.org/evaluation/test-phase-2/submissions/create/

📢 Due to a network upgrade to improve FTP download speed, server access will be temporarily unavailable on May 26, 2025 (KST) from 09:00 to 10:00 AM. If you are downloading data via FTP, please resume the download after the maintenance period.
📢 ~~All servers are currently offline for security reasons and will be reopened after maintenance is complete.~~

⚠️ ~~The challenge schedule has been extended by one week; please refer to the updated Important Dates section.~~
⚠️ The challenge schedule has been extended by an additional week; please refer to the updated Important Dates section for the revised timeline.
✅ The Data Description section has been updated — you can now check the dataset overview and format details.
✅ The Debug Phase has been added — In this phase, you can check whether your submission follows the correct format.
✅ We have opened 3 additional servers for downloading the training dataset. If you experience download errors due to high traffic, we recommend using these servers. [Reg2025-Traindataset]
✅ You can evaluate the generated results by referring to the following GitHub repository: Evaluation code [Submission & Evaluation]

⭐ Important Dates:¶

Training data release: ~~13/05/2025~~ > 20/05/2025 ✅
Registration deadline: ~~27/06/2025~~ > 04/07/2025 ✅
Debug Phase opens: ~~27/06/2025~~ > 04/07/2025 ✅
Debug Phase deadline: ~~03/07/2025~~ > 10/07/2025 ✅
Test Phase 1 opens & Test Phase 1 data release: ~~27/06/2025~~ > ~~04/07/2025~~ > 11/07/2025 ✅
Test Phase 1 deadline: ~~18/07/2025~~ > ~~25/07/2025~~ > 01/08/2025 ✅
Test Phase 2 opens & Test Phase 2 data release: ~~19/07/2025~~ > ~~26/07/2025~~ > 02/08/2025 ✅
Test Phase 2 deadline: ~~09/08/2025~~ > ~~16/08/2025~~ > 23/08/2025 ✅
Announcement of winners: 09/09/2025

(All times are 10:00 AM, KST)

🔍 Challenge overview¶

Recent advances in vision-language foundation models have opened new possibilities in medical applications, particularly in image captioning, which generates textual descriptions from images. When applied to gigapixel-scale pathology images, this task demands advanced image analysis methods like slide-level feature extraction to process and interpret vast visual data. Automated pathology report generation, despite its complexities, has gained attention for its potential to address labor shortages, improve diagnostic accuracy, and enhance patient care. However, current evaluation methods relying on traditional NLP metrics such as BLEU, METEOR, and ROUGE are inadequate for the medical domain, where clinical relevance and content accuracy are paramount.

To address these limitations, this initiative focuses on:
1) evaluating report generation models with standardized datasets encompassing diverse pathological cases.
2) comparing generated reports with expert assessments to measure clinical alignment.
3) identifying and adopting evaluation metrics tailored to medical standards.
4) exploring the integration of generated reports into diagnostic workflows, informed by clinical feedback.

Our ultimate goal is to enhance the practicality and reliability of pathology report generation models by ensuring they produce clinically meaningful and high-quality content. Furthermore, this initiative aims to address the limitations of current AI models in reflecting racial and ethnic diversity by utilizing a broader dataset that includes both Pan-Asia and European data. The challenge dataset comprises approximately 10,500 cases collected from six medical centers across five countries—Korea, Japan, India, Turkey, and Germany—contributing to the development of multicultural and multiethnic medical AI technologies.