This repository has been archived by the owner on Feb 26, 2024. It is now read-only.

add scripts for dataset generation #5

Open
vpozdnyakov opened this issue Dec 15, 2021 · 1 comment
Labels
documentation Improvements or additions to documentation

Comments

@vpozdnyakov

Hi, it would be great to add some examples or comments on how to generate the datasets. For example:

To generate the training set, run the following (estimated time: 100 minutes):

git clone https://github.com/jvkersch/pyconcorde.git
cd pyconcorde && pip install -e .
python 'data/tsp/generate_tsp.py' --min_nodes 20 --max_nodes 50 --num_samples 128000 --batch_size 128 --filename 'data/tsp/tsp20-50_train_concorde.txt'
@chaitjo added the documentation label on Dec 20, 2021
@chaitjo
Owner

chaitjo commented Dec 20, 2021

Hi @vpozdnyakov, yes, that will work: the script will generate 128000 TSP instances, each with between 20 and 50 nodes.

I should add some information regarding batch sizes. Thank you for the tip.

In essence, since we are not using GNN libraries like DGL/PyG, we have to ensure that all graphs in a mini-batch have the same size. In this case, we enforce that during dataset creation itself.
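The batching constraint described above can be illustrated with a minimal sketch. This is not the repository's `generate_tsp.py`; the function name `generate_batches`, its `seed` parameter, and the choice to draw one node count per batch are assumptions made purely for illustration. It also only samples random 2D coordinates and does not call Concorde to solve the instances.

```python
import random


def generate_batches(min_nodes, max_nodes, num_samples, batch_size, seed=None):
    """Yield mini-batches of random 2D point sets (hypothetical helper).

    Every instance inside one batch has the same number of nodes, so a batch
    can later be stacked into a dense tensor without padding.
    """
    if num_samples % batch_size != 0:
        # Assumption for this sketch: only whole batches are produced.
        raise ValueError("num_samples must be a multiple of batch_size")
    rng = random.Random(seed)
    for _ in range(num_samples // batch_size):
        # Draw the graph size once per batch (inclusive on both ends).
        num_nodes = rng.randint(min_nodes, max_nodes)
        yield [
            [(rng.random(), rng.random()) for _ in range(num_nodes)]
            for _ in range(batch_size)
        ]


if __name__ == "__main__":
    # Small run for illustration; the command in this thread uses
    # 128000 samples with batch size 128, i.e. 1000 batches.
    for batch in generate_batches(20, 50, num_samples=512, batch_size=128, seed=0):
        sizes = {len(instance) for instance in batch}
        print(len(batch), "instances, node counts:", sizes)
```

Under this scheme, graph sizes vary between batches but never within one, which is why the batch size has to be known when the dataset is generated.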

I will add more documentation.


2 participants