Reinforcement learning driven TabuCol (RLTCol)

This repository contains the code for the RLTCol algorithm, a hybrid heuristic for the graph coloring problem that combines local search with reinforcement learning (RL). The code was written as part of our bachelor thesis, "Reinforcement learning for improved local search, applied to the graph coloring problem", at the KTH Royal Institute of Technology in Stockholm, Sweden.

The RLTCol algorithm alternates between the local search algorithm TabuCol and an RL agent, with each component passing its solution to the other. The paper can be read here.
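The alternation described above can be illustrated with a small, self-contained Python sketch. It is not the code in this repository: the tabu search is a simplified pure-Python version of TabuCol, the fixed tabu tenure is an arbitrary choice, and `random_agent` is a hypothetical stand-in for the trained RL policy. The function names, parameters, and the assumption that vertices are numbered from 0 are all made up for the illustration.

```python
import random
from typing import Callable, Dict, List, Set

Graph = Dict[int, Set[int]]   # vertex -> set of neighbours, vertices are 0..n-1
Coloring = List[int]          # coloring[v] is the color of vertex v


def conflicts(adj: Graph, col: Coloring) -> int:
    """Number of edges whose two endpoints share a color."""
    return sum(1 for u in adj for v in adj[u] if u < v and col[u] == col[v])


def conflicting_vertices(adj: Graph, col: Coloring) -> List[int]:
    """Vertices that share a color with at least one neighbour."""
    return [u for u in adj if any(col[u] == col[v] for v in adj[u])]


def tabu_search(adj: Graph, col: Coloring, k: int, steps: int,
                rng: random.Random) -> Coloring:
    """Simplified TabuCol: each step recolors one conflicting vertex
    with the best non-tabu color and returns the best coloring seen."""
    col = list(col)
    best, best_cost = list(col), conflicts(adj, col)
    tabu: Dict[tuple, int] = {}  # (vertex, color) -> first step the move is allowed
    for step in range(steps):
        if best_cost == 0:
            break
        candidates = []
        for u in conflicting_vertices(adj, col):
            same = sum(col[v] == col[u] for v in adj[u])
            for c in range(k):
                if c == col[u] or tabu.get((u, c), 0) > step:
                    continue
                # Change in the conflict count if u is recolored to c.
                delta = sum(col[v] == c for v in adj[u]) - same
                candidates.append((delta, u, c))
        if not candidates:
            continue  # every move is tabu; wait for entries to expire
        best_delta = min(d for d, _, _ in candidates)
        _, u, c = rng.choice([m for m in candidates if m[0] == best_delta])
        # Forbid moving u back to its old color for a few steps (illustrative tenure).
        tabu[(u, col[u])] = step + 1 + rng.randint(0, 9)
        col[u] = c
        cost = conflicts(adj, col)
        if cost < best_cost:
            best, best_cost = list(col), cost
    return best


def random_agent(adj: Graph, col: Coloring, k: int, rng: random.Random) -> Coloring:
    """Hypothetical placeholder for the RL policy: recolor one conflicting vertex at random."""
    col = list(col)
    bad = conflicting_vertices(adj, col)
    if bad:
        col[rng.choice(bad)] = rng.randrange(k)
    return col


def alternate(adj: Graph, k: int, rounds: int, tabu_steps: int,
              agent_step: Callable[[Graph, Coloring, int, random.Random], Coloring],
              seed: int = 0) -> Coloring:
    """Alternate local search and the agent, handing the solution back and forth."""
    rng = random.Random(seed)
    col = [rng.randrange(k) for _ in range(len(adj))]
    best = list(col)
    for _ in range(rounds):
        col = tabu_search(adj, col, k, tabu_steps, rng)
        if conflicts(adj, col) < conflicts(adj, best):
            best = list(col)
        if conflicts(adj, best) == 0:
            break
        col = agent_step(adj, col, k, rng)  # agent modifies the solution, search resumes
    return best
```

Calling `alternate(adj, k=3, rounds=5, tabu_steps=50, agent_step=random_agent)` on a small adjacency dictionary returns the best coloring found. In the actual project, the agent step would be a learned policy rather than a random move.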

Code

The RL agent is implemented in Python using the Tianshou library. TabuCol is implemented in Rust, using maturin to interface with Python. The code is written for Python 3.10.

Requirements

Python 3.10
A working Rust installation, see here for instructions.

Installation

Create a virtual environment and install the required Python packages:

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Then, build the Rust code:

cd src/tabucol && maturin develop --release && cd -

Usage

Input graphs must be DIMACS text files. The graphs used in the paper can be found here. Graphs distributed in the compressed format can be decompressed using the translator found on the same page.
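For readers unfamiliar with the format, the following sketch reads a DIMACS text graph in Python. It assumes the common layout with `c` comment lines, one `p edge <vertices> <edges>` problem line, and `e <u> <v>` edge lines using 1-based vertex numbers. The helper names `parse_dimacs` and `count_conflicts` are hypothetical and are not part of this repository, whose own loader may differ.

```python
from typing import Dict, List, Set, Tuple


def parse_dimacs(text: str) -> Tuple[int, Dict[int, Set[int]]]:
    """Parse DIMACS graph text into (vertex_count, adjacency sets).

    Vertices are converted from 1-based to 0-based numbering.
    """
    n = 0
    adj: Dict[int, Set[int]] = {}
    for raw in text.splitlines():
        parts = raw.split()
        if not parts or parts[0] == "c":
            continue  # skip blank lines and comments
        if parts[0] == "p":
            # Problem line: "p edge <vertices> <edges>"
            n = int(parts[2])
            for v in range(n):
                adj.setdefault(v, set())
        elif parts[0] == "e":
            # Edge line: "e <u> <v>"
            u, v = int(parts[1]) - 1, int(parts[2]) - 1
            if u != v:  # ignore self-loops
                adj.setdefault(u, set()).add(v)
                adj.setdefault(v, set()).add(u)
    return n, adj


def count_conflicts(adj: Dict[int, Set[int]], coloring: List[int]) -> int:
    """Count edges whose endpoints have the same color (0 means a legal coloring)."""
    return sum(1 for u in adj for v in adj[u]
               if u < v and coloring[u] == coloring[v])
```

To load a file, pass its contents to the parser, for example `parse_dimacs(open(path).read())`, where `path` points to a decompressed DIMACS text file.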

The source code for the RLTCol algorithm is located in the src directory. The TabuCol implementation in Rust is located in the src/tabucol directory.

Training the RL agent

The RL agent can be trained using the trainer.py script. The script takes a number of arguments, which can be found by running python trainer.py --help.

The script will save the trained policy to the file specified by the output parameter. The policy can then be used to run the RLTCol algorithm.

Running the RLTCol algorithm

The RLTCol algorithm can be run using the runner.py script. The script takes a number of arguments, which can be found by running python runner.py --help.

To run multiple jobs in parallel and/or in sequence, use the batch_runner.py script. The script takes a number of arguments, which can be found by running python batch_runner.py --help. It runs the RLTCol algorithm and saves the results to individual files in the directory specified by the output_dir parameter. The results can then be summarized using the result_summarizer.py script.

Credits

The paper was written by Adrian Salamon ([email protected]) and Klara Sandström ([email protected]), supervised by Stefano Markidis.
