Add LG AI's EXAONEPath-CRC-MSI-Predictor model

Stephen Aylward · Stephen Aylward · commit fa651d88c7ae · 2025-05-31T11:46:49.000-04:00
Model is on Hugging Face.  This links to that model.
diff --git a/hf_models/README.md b/hf_models/README.md
@@ -17,6 +17,7 @@ These models must be accessed directly from Hugging Face using the `huggingface_
 | Model | Description | HF Repository |
 |-------|-------------|--------------|
 | exaonepath | EXAONEPath is a patch-level pathology pretrained model with 86 million parameters | [LGAI-EXAONE/EXAONEPath](https://huggingface.co/LGAI-EXAONE/EXAONEPath) |
+| exaonepath-crc-msi-predictor | MSI classification of CRC tumors using EXAONEPath 1.0.0 Patch-level Foundation Model for Pathology | [LGAI-EXAONE/EXAONEPath-CRC-MSI-Predictor](https://huggingface.co/LGAI-EXAONE/EXAONEPath-CRC-MSI-Predictor) |
 | llama3_vila_m3_3b | Lightweight medical vision language model that enhances VLMs with medical expert knowledge (3B parameters) | [MONAI/Llama3-VILA-M3-3B](https://huggingface.co/MONAI/Llama3-VILA-M3-3B) |
 | llama3_vila_m3_8b | Medical vision language model that utilizes domain-expert models to improve precision in medical imaging tasks (8B parameters) | [MONAI/Llama3-VILA-M3-8B](https://huggingface.co/MONAI/Llama3-VILA-M3-8B) |
 | llama3_vila_m3_13b | Enhanced medical vision language model with improved capabilities for various medical imaging tasks (13B parameters) | [MONAI/Llama3-VILA-M3-13B](https://huggingface.co/MONAI/Llama3-VILA-M3-13B) |
diff --git a/hf_models/exaonepath-crc-msi-predictor/LICENSE b/hf_models/exaonepath-crc-msi-predictor/LICENSE
@@ -0,0 +1,34 @@
+EXAONEPath AI Model License Agreement 1.0 - NC
+
+This EXAONEPath AI Model License Agreement (the "Agreement") is entered into by and between LG AI Research ("Licensor") and the individual or entity exercising the rights under this Agreement ("Licensee").
+
+1. Definitions
+   a. "Model" means the EXAONEPath AI Model, a machine learning model, including all associated weights, parameters, and other components.
+   b. "Commercial Use" means any use of the Model primarily intended for or directed toward commercial advantage or monetary compensation.
+
+2. License Grant
+   Subject to the terms and conditions of this Agreement, Licensor hereby grants to Licensee a worldwide, non-exclusive, non-transferable, non-sublicensable, royalty-free license to use, reproduce, and create derivative works of the Model for non-commercial purposes only.
+
+3. Restrictions
+   a. Commercial Use is not permitted under this license.
+   b. Licensee shall not use the Model in connection with any illegal, harmful, fraudulent, infringing, or offensive use.
+   c. Licensee shall not use the Model to create, train, or improve any foundation models.
+   d. Licensee shall not rent, lease, lend, sell, redistribute, or sublicense the Model.
+
+4. Disclaimer of Warranties
+   THE MODEL IS PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, AND NONINFRINGEMENT.
+
+5. Limitation of Liability
+   IN NO EVENT SHALL LICENSOR BE LIABLE FOR ANY CLAIM, DAMAGES, OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT, OR OTHERWISE, ARISING FROM, OUT OF, OR IN CONNECTION WITH THE MODEL OR THE USE OR OTHER DEALINGS IN THE MODEL.
+
+6. Attribution
+   Any use of the Model shall include appropriate attribution to LG AI Research and reference to the research paper: "EXAONEPath 1.0 Patch-level Foundation Model for Pathology" (https://arxiv.org/abs/2408.00380).
+
+7. Termination
+   This Agreement will terminate automatically if Licensee breaches any of its terms.
+
+8. Governing Law
+   This Agreement shall be governed by and construed in accordance with the laws of South Korea, without regard to its conflict of law provisions.
+
+9. Entire Agreement
+   This Agreement constitutes the entire agreement between the parties with respect to the use of the Model.
diff --git a/hf_models/exaonepath-crc-msi-predictor/README.md b/hf_models/exaonepath-crc-msi-predictor/README.md
@@ -0,0 +1,120 @@
+---
+license: other
+license_name: exaonepath
+license_link: LICENSE
+tags:
+- lg-ai
+- EXAONEPath-1.0
+- pathology
+- lg-ai
+---
+
+# EXAONEPath-CRC-MSI-Predictor
+
+## MSI classification of CRC tumors
+MSI classification of CRC tumors using EXAONEPath 1.0.0 Patch-level Foundation Model for Pathology.
+
+[[`Paper`](https://arxiv.org/abs/2408.00380)] [[`Model`](https://huggingface.co/LGAI-EXAONE/EXAONEPath-CRC-MSI-Predictor/tree/main)] [[`BibTeX`](#citation)]
+
+## Introduction
+This model serves as a reference for predicting MSI status using CRC (colorectal cancer) tumor images as input. When the model receives an H&E-stained whole slide image as input, it removes artifacts observed in the image and extracts only tissue-related objects. These objects are then reconstructed into a set of tiles with a size of 256 by 256 pixels at an mpp (micron per pixel) of 0.5.
+
+The tiles pass through the EXAONEPath v1.0 patch-level foundation model (https://huggingface.co/LGAI-EXAONE/EXAONEPath), which converts them into a set of features. These features are then integrated into a slide-level feature representation through an aggregator(see the figure below). Finally, a linear classifier predicts the MSI status (MSS or MSI-H/L).
+
+The model achieves an average performance of AUROC 0.93 on TCGA-COAD + TCGA-READ data and 0.84 on in-house data.
+
+
+## Quickstart
+
+### Summary
+
+1. Copy your WSI files in '''.svs''' format into the '''samples''' directory
+2. Run inference
+
+### 1. Hardware Requirements
+- NVIDIA GPU is required
+- Minimum 8GB GPU memory recommended
+- NVIDIA driver version >= 450.80.02 required
+
+### 2. Environment Setup
+Create and activate a virutal environment.
+```bash
+python -m venv venv
+source ./venv/bin/activate
+```
+
+Install huggingface_cli and download files
+```bash
+pip install "huggingface_hub[cli]"
+huggingface-cli download LGAI-EXAONE/EXAONEPath-CRC-MSI-Predictor --local-dir .
+```
+
+Install requirements
+```bash
+pip install -r requirements.txt
+```
+
+Verify pytorch with GPU support
+```bash
+python -c "import torch; print(torch.cuda.is_available())"
+```
+
+### 3. Data Preparation
+Copy your WSI files into the `samples` directory.
+
+The program accepts ```.svs``` formatted files.   This format is used,
+for example, for diagnostic slides as part of the TCGA-COAD project.  An
+image from that project is available at the following link:
+
+https://portal.gdc.cancer.gov/files/17cfcc8c-49a4-48ce-a5e1-4a3c582ce198
+
+Download that data, extract the svs file from that compressed tar file, and
+copy the svs file to the top level of the `samples` directory.
+
+### 4. Inference
+```bash
+python -m monai.bundle run inference --meta_file configs/metadata.json --config_file configs/inference.yaml
+```
+
+### 5. Run-time errors
+
+Particularly on Windows, if you receive the error 
+```
+RuntimeError: Failed to evaluate ConfigExpression:
+"$scripts.inference.infer(__local_refs['model'], __local_refs['input_files'])"
+```
+and references line 71 in the file "scripts/exaonepath.py"
+```
+    for count, patches in enumerate(patch_loader):
+```
+then you may have set the number of workers for the dataloader to 0.  This is
+accomplished by changing line 65 of "scripts/exaonepath.py" to
+```
+            num_workers=0,
+``` 
+and removing lines 66 and 67, such that lines 62-67 become
+```
+        patch_loader = DataLoader(
+            dataset=patch_dataset,
+            batch_size=feature_extractor_batch_size,
+            num_workers=0,
+            pin_memory=self.device.type == "cuda",
+        )
+```
+
+## License
+The model is licensed under [EXAONEPath AI Model License Agreement 1.0 - NC](./LICENSE)
+
+## Citation <a name="citation"></a>
+If you find EXAONEPath useful, please cite it using this BibTeX:
+```
+@article{yun2024exaonepath,
+  title={EXAONEPath 1.0 Patch-level Foundation Model for Pathology},
+  author={Yun, Juseung and Hu, Yi and Kim, Jinhyung and Jang, Jongseong and Lee, Soonyoung},
+  journal={arXiv preprint arXiv:2408.00380},
+  year={2024}
+}
+```
+
+## Contact
+LG AI Research Technical Support: <a href="mailto:contact_us1@lgresearch.ai">contact_us1@lgresearch.ai</a>
diff --git a/hf_models/exaonepath-crc-msi-predictor/metadata.json b/hf_models/exaonepath-crc-msi-predictor/metadata.json
@@ -0,0 +1,35 @@
+{
+    "schema": "https://github.com/Project-MONAI/MONAI-extra-test-data/releases/download/0.8.1/meta_schema_hf_20250321.json",
+    "version": "1.0.0",
+    "changelog": {
+        "1.0.0": "initial release of EXAONEPath CRC MSI Predictor"
+    },
+    "monai_version": "1.4.0",
+    "pytorch_version": "2.4.0",
+    "numpy_version": "1.24.4",
+    "required_packages_version": {
+        "torch": "2.4.0",
+        "torchvision": "0.15.0",
+        "torchstain": "1.3.0",
+        "pillow": "10.0.0",
+        "huggingface_hub": "0.24.2",
+        "transformers": "4.43.3"
+    },
+    "supported_apps": {
+        "exaonepath-crc-msi-predictor": ""
+    },
+    "name": "EXAONEPath-CRC-MSI-Predictor",
+    "task": "MSI classification of CRC tumors using EXAONEPath model",
+    "description": "MSI classification of CRC tumors using EXAONEPath - a patch-level foundation model for pathology.",
+    "authors": "LG AI Research",
+    "copyright": "LG AI Research",
+    "data_source": "LG AI Research",
+    "data_type": "WSI patches",
+    "image_classes": "RGB pathology image patches",
+    "huggingface_model_id": "LGAI-EXAONE/EXAONEPath-CRC-MSI-Predictor",
+    "huggingface_url": "https://huggingface.co/LGAI-EXAONE/EXAONEPath-CRC-MSI-Predictor",
+    "intended_use": "Research and clinical support for pathology image analysis",
+    "references": [
+        "Yun, Juseung, et al. 'EXAONEPath 1.0 Patch-level Foundation Model for Pathology', arXiv preprint arXiv:2408.00380 (2024)."
+    ]
+}