This repository provides the code to train the CXRMate-2 model.
Paper coming soon...
The model is available on Hugging Face Hub: https://huggingface.co/aehrc/cxrmate-2
Example usage:

```python
import transformers

alias = 'aehrc/cxrmate-2'
model = transformers.AutoModelForCausalLM.from_pretrained(alias, trust_remote_code=True).to(device='cuda')
model.eval()

generation_config = transformers.GenerationConfig.from_pretrained(alias, trust_remote_code=True)
processor = transformers.AutoProcessor.from_pretrained(alias, trust_remote_code=True)

url = 'https://prod-images-static.radiopaedia.org/images/220869/76052f7902246ff862f52f5d3cd9cd_big_gallery.jpg'
processed = processor(images=url)
processed = processed.to(device='cuda')

generated_ids = model.generate(**processed, generation_config=generation_config)
findings, impression = processor.split_and_decode_sections(generated_ids)
```

CXRMate-2 generated reports:
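For intuition, `split_and_decode_sections` separates the generated report into its findings and impression sections. A hypothetical stdlib sketch of that kind of post-processing on an already-decoded report string (the `Findings:`/`Impression:` markers here are illustrative assumptions, not the model's actual section tokens):

```python
import re

def split_report_sections(report: str) -> tuple[str, str]:
    """Split a decoded report of the form 'Findings: ... Impression: ...'
    into its two sections. Illustrative only; the real processor splits on
    the model's own section separators."""
    match = re.search(r'Findings:\s*(.*?)\s*Impression:\s*(.*)', report,
                      flags=re.DOTALL | re.IGNORECASE)
    if match is None:
        raise ValueError('report does not contain both sections')
    return match.group(1), match.group(2)

findings, impression = split_report_sections(
    'Findings: The lungs are clear. Impression: No acute cardiopulmonary abnormality.'
)
print(findings)    # The lungs are clear.
print(impression)  # No acute cardiopulmonary abnormality.
```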
The requirements for the environment are listed in `requirements.txt`.
Download the MIMIC-CXR-JPG dataset from https://physionet.org/content/mimic-cxr-jpg, e.g.:

```shell
wget -r -N -c -np --user <username> --ask-password https://physionet.org/files/mimic-cxr-jpg/2.1.0/
```

MIMIC-CXR-JPG does not include the radiology reports; these are instead distributed with MIMIC-CXR (the DICOM version of the dataset). To download the reports while skipping the DICOM files (which are very large), add `--reject dcm` to the `wget` command for https://physionet.org/content/mimic-cxr, e.g.:

```shell
wget -r -N -c -np --reject dcm --user <username> --ask-password https://physionet.org/files/mimic-cxr/2.0.0/
```

Note that you must be a credentialed user to access MIMIC-CXR and MIMIC-CXR-JPG.
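After downloading, the JPG files sit under `files/p<XX>/p<subject_id>/s<study_id>/<dicom_id>.jpg`, where the top-level grouping directory is `p` plus the first two digits of the subject ID. A small helper for composing these paths, assuming the `wget -r` mirror layout from the command above (verify both against your download):

```python
from pathlib import Path

def mimic_cxr_jpg_path(physionet_dir: str, subject_id: int, study_id: int, dicom_id: str) -> Path:
    """Compose the path to a MIMIC-CXR-JPG image, e.g.
    .../files/p10/p10000032/s50414267/<dicom_id>.jpg.
    Assumed layout; check it against your local copy."""
    subject = f'p{subject_id}'
    return (Path(physionet_dir) / 'physionet.org' / 'files' / 'mimic-cxr-jpg' / '2.1.0'
            / 'files' / subject[:3] / subject / f's{study_id}' / f'{dicom_id}.jpg')

path = mimic_cxr_jpg_path('/data', 10000032, 50414267,
                          '02aa804e-bde0afdd-112c0b34-7bc16630-4e384014')
print(path)
```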
CheXpert Plus can be downloaded from: https://aimi.stanford.edu/datasets/chexpert-plus.
The following scripts prepare each dataset into a Hugging Face `DatasetDict` saved to `database_dir`. Each script accepts `--database_dir` (default: `database`) and `--num_workers` (default: `4`):

```shell
python prepare_datasets/prepare_mimic_cxr_jpg.py --physionet_dir <physionet_dir>
python prepare_datasets/prepare_chexpert_plus.py --chexpert_plus_dir <chexpert_plus_dir>
python prepare_datasets/prepare_rexgradient.py
```

Note that `prepare_rexgradient.py` also downloads https://huggingface.co/datasets/rajpurkarlab/ReXGradient-160K.
First, train the SFT model:

```shell
accelerate launch utils.py -t cxrmate2 -c config/sft_public.yaml
```

Then, train the GRPO model:

```shell
accelerate launch utils.py -t cxrmate2 -c config/grpo_public.yaml
```