
Exploring Transfer Learning in Medical Image Segmentation using Vision-Language Models

Table of Contents

  • Installation
  • Usage
      • Pretrained Model Preparation
      • Dataset Preparation
      • Zero Shot Segmentation
      • Finetuning
  • Acknowledgement

Installation

To get started, create a Python environment (preferably v3.10) using either Conda or venv. This keeps dependencies isolated and ensures a consistent runtime environment.

  Conda:
  conda create --name your_env_name python=3.10
  conda activate your_env_name

OR

  venv:
  python -m venv your_env_name
  source your_env_name/bin/activate

Once your environment is active, install the required packages from requirements.txt using pip:

pip install -r requirements.txt
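As a quick sanity check (a minimal sketch; it assumes PyTorch is among the pinned requirements, which is typical for a segmentation codebase), you can confirm the interpreter version and that the deep-learning stack imports:

  python -c "import sys, torch; print(sys.version_info[:2], torch.__version__, torch.cuda.is_available())"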

Usage

Pretrained Model Preparation

Because the pretrained weights of BiomedCLIP and CLIPSeg are readily available on the Hugging Face Model Hub, you do not need to save them manually. The pretrained weights of CRIS, however, were extracted and saved in the pretrained/ folder. Please refer to the config file cris.yaml for more information.
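For reference, both Hub-hosted backbones can be pulled programmatically. The sketch below uses the public transformers and open_clip APIs with their commonly published checkpoint names; the exact checkpoints referenced in this repo's configs may differ:

  # Minimal sketch: fetch the Hub-hosted backbones directly.
  # The checkpoint names are the publicly documented ones and are an
  # assumption about what this repo's configs actually point to.
  from transformers import CLIPSegProcessor, CLIPSegForImageSegmentation
  import open_clip

  # CLIPSeg via Hugging Face transformers
  clipseg_processor = CLIPSegProcessor.from_pretrained("CIDAS/clipseg-rd64-refined")
  clipseg_model = CLIPSegForImageSegmentation.from_pretrained("CIDAS/clipseg-rd64-refined")

  # BiomedCLIP via open_clip's Hugging Face Hub integration
  biomedclip, preprocess = open_clip.create_model_from_pretrained(
      "hf-hub:microsoft/BiomedCLIP-PubMedBERT_256-vit_base_patch16_224"
  )
  tokenizer = open_clip.get_tokenizer(
      "hf-hub:microsoft/BiomedCLIP-PubMedBERT_256-vit_base_patch16_224"
  )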

Dataset Preparation

Before running any experiments, ensure that the provided datasets are correctly placed within the data/ folder at the root of the project. The directory structure of the data/ folder should look like this:

data/
│
├── bkai_polyp/
│   ├── anns/
│   │   ├── test.json
│   │   ├── train.json
│   │   └── val.json
│   ├── images/
│   └── masks/
│
├── [other dataset folders...]
│
└── kvasir_polyp/
    ├── anns/
    │   ├── test.json
    │   ├── train.json
    │   └── val.json
    ├── images/
    └── masks/

Each dataset folder (bkai_polyp, busi, camus, etc.) contains three sub-directories: anns, images, and masks. The anns directory holds the prompt files (train.json, val.json, test.json), while images and masks hold the input images and the corresponding target masks, respectively.
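To catch layout mistakes before a run, a small script like the following (a hypothetical helper, not part of the repo) can verify that every dataset folder matches the structure above:

  # Hypothetical sanity-check script: walks data/ and reports anything
  # that deviates from the expected anns/images/masks layout.
  from pathlib import Path

  DATA_ROOT = Path("data")
  SPLITS = ("train.json", "val.json", "test.json")

  for dataset in sorted(p for p in DATA_ROOT.iterdir() if p.is_dir()):
      missing = []
      for split in SPLITS:
          if not (dataset / "anns" / split).is_file():
              missing.append(f"anns/{split}")
      for sub in ("images", "masks"):
          if not (dataset / sub).is_dir():
              missing.append(f"{sub}/")
      status = "ok" if not missing else "missing: " + ", ".join(missing)
      print(f"{dataset.name}: {status}")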

Zero Shot Segmentation

To perform zero-shot segmentation, you can use the provided script. Open a terminal and navigate to the project directory, then execute the following command:

python scripts/zss.py

This script runs the complete zero-shot segmentation pipeline on the prepared datasets.
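Conceptually, zero-shot segmentation means prompting a pretrained vision-language model with an image and a text description of the target, with no task-specific training. The standalone sketch below shows the idea with CLIPSeg via transformers; the image path and prompt are illustrative, and scripts/zss.py is driven by the repo's own configs rather than this code:

  import torch
  from PIL import Image
  from transformers import CLIPSegProcessor, CLIPSegForImageSegmentation

  processor = CLIPSegProcessor.from_pretrained("CIDAS/clipseg-rd64-refined")
  model = CLIPSegForImageSegmentation.from_pretrained("CIDAS/clipseg-rd64-refined").eval()

  # Illustrative inputs: any RGB image plus a text prompt naming the target.
  image = Image.open("data/kvasir_polyp/images/example.jpg").convert("RGB")  # hypothetical path
  inputs = processor(text=["a polyp"], images=[image], return_tensors="pt")

  with torch.no_grad():
      logits = model(**inputs).logits  # low-resolution segmentation logits

  mask = logits.sigmoid() > 0.5  # binary mask at the model's output resolution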

Finetuning

If you need to run fine-tuning for your model, you can do so using the following script:

python scripts/finetune.py

This script starts the fine-tuning process, which adapts the pretrained models to a specific dataset. For inference, update the default configs (such as ckpt_path, models, etc.) in scripts/inference.py to compute the evaluation metrics or to generate the output masks at the original resolution.
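Producing masks at the original resolution is essentially an upsampling step on the model's low-resolution logits. A hypothetical helper (not part of scripts/inference.py) could look like this:

  import torch
  import torch.nn.functional as F

  def to_original_resolution(logits: torch.Tensor, height: int, width: int) -> torch.Tensor:
      """Upsample (N, 1, h, w) logits to the source image size and binarize."""
      upsampled = F.interpolate(
          logits, size=(height, width), mode="bilinear", align_corners=False
      )
      return (upsampled.sigmoid() > 0.5).to(torch.uint8)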

Acknowledgement

We thank the authors of Lightning-Hydra-Template for providing a flexible framework for running multiple experiments while tracking hyperparameters.
