Watermelon

Introduction

This project is based on crf0409/watermelon_eval, reimplemented with PyTorch under CC-BY-NC-SA 4.0 License. You can get the original dataset from the link above.

File Structure

example-main.py: Original main script from @crf0409 with Tensorflow.
clean.py: Run it to clean the original dataset. You may have to modify the path in the script.
preprocess.py: Preprocess the dataset for training and inference.
train.py: Train the model.

Usage

Preparation

Download the dataset from crf0409/watermelon_eval, which provides links to IEEE DataPort and Baidu Netdisk. Unzip and copy to the repository root (rename the folder to datasets is recommended).

(Recommended) Create a virtual environment and install the dependencies:

pip install -r requirements.txt

Clean the Dataset

Run clean.py to clean the original dataset (you may have to modify the path in the script).

python clean.py

The cleaned dataset will be saved in the cleaned folder by default, with the structure:

cleaned
├── {sweetness label}
│   ├── {id}
│   │   ├── {id}.wav
│   │   └── {id}.jpg
│   └── ...
└── ...

Do Preprocessing

Run preprocess_file.py to avoid duplicated preprocessing is useful to accelerate training.

python preprocess_file.py --data_dir /path/to/cleaned --save_dir /path/to/processed

Preprocessing includes:

Read the dataset from disk.
Audio: Choose left channel, resample to 16 kHz, cut/pad to 3 seconds, and convert to Mel spectrogram.
Image: Resize to 1080x1080, normalize, and prepare for ResNet-50.
Make audio-image-label pairs, then save them to disk.

and will generate processed folder with {id}.pt files in the root directory by default.

Train (In Progress)

Run train.py to train the model.

python train.py

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
clean.py		clean.py
example-main.py		example-main.py
infer.py		infer.py
preproces_file.py		preproces_file.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Watermelon

Introduction

File Structure

Usage

Preparation

Clean the Dataset

Do Preprocessing

Train (In Progress)

Web App Inference with Gradio

About

Releases

Packages

Languages

License

leostudiooo/watermelon

Folders and files

Latest commit

History

Repository files navigation

Watermelon

Introduction

File Structure

Usage

Preparation

Clean the Dataset

Do Preprocessing

Train (In Progress)

Web App Inference with Gradio

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages