This repository contains code for multitask algorithmic reasoning experiments on the CLRS benchmark and on text-based graph tasks. We propose branching neural networks for multitask algorithmic reasoning, which divide the algorithms into separate branches of the network. The branching structure can be applied on top of base models, including GNNs and LLMs with low-rank adapters.
The repository is organized into several main components:
- clrs_experiments: GNN-based experiments on CLRS-30 benchmark
- text-graph-tasks: LLM-based experiments on text-encoded graph reasoning tasks
- gnn_experiments: Additional GNN experiments for multitask learning
This directory contains code for running GNN-based experiments on the CLRS-30 benchmark, focusing on multitask algorithmic reasoning with different graph neural network architectures.
The CLRS-30 benchmark covers 30 classical algorithms across several categories; these experiments focus on graph algorithms, including: bfs, dfs, topological_sort, articulation_points, bridges, strongly_connected_components, mst_kruskal, mst_prim, dijkstra, bellman_ford, dag_shortest_paths, floyd_warshall
- Create a conda environment:

```bash
conda create -n clrs python=3.10
conda activate clrs
```

- Install dependencies:

```bash
cd clrs_experiments
pip install -e .
```

Train a single algorithm (e.g., Dijkstra):

```bash
python -m clrs.examples.run --algorithms dijkstra
```

Train multiple algorithms:

```bash
python -m clrs.examples.run --algorithms "bfs" "dfs" "dijkstra"
```

Specify the processor type and model parameters:

```bash
python -m clrs.examples.run \
--algorithms "bfs" "dfs" \
--processor_type "edge_t" \
--num_layers 5 \
--hidden_size 192 \
--use_projection \
--projection_dim 16
```

Train a branching multitask network:

```bash
CUDA_VISIBLE_DEVICES=$CUDA_DEVICE python -m clrs.examples.run \
--algorithms "bfs","dfs","topological_sort","articulation_points","bridges","strongly_connected_components","mst_kruskal","mst_prim","dijkstra","bellman_ford",'dag_shortest_paths',"floyd_warshall"\
--use_branching_structure --branching_structure_dir "tree_structure" --processor_type branching_edge_t \
--num_layers 5 \
--runs 3 \
--train_steps 10000
```

Available processor types:
- gat: Graph Attention Network
- edge_t: Edge Transformer
- mpnn: Message Passing Neural Network
- pgn: Pointer Graph Network
- branching_edge_t: Branching Edge Transformer
- branching_mpnn: Branching MPNN
- branching_gat, branching_gatv2: Branching GAT and GATv2 networks
Key scripts:

- clrs/examples/run.py: Main training script for CLRS algorithms
- train_sampled_tasks.py: Multi-task training with random algorithm subsets
- branchnn_search.py: Branching neural network search for task grouping
- clustering.py: Clustering algorithms based on task similarity

Key hyperparameters (see clrs/examples/run.py and clrs/branchnn_search.py for the full list):

- --batch_size: Training batch size (default: 4)
- --train_steps: Number of training iterations (default: 10000)
- --learning_rate: Learning rate (default: 2.5e-4)
- --hidden_size: Hidden dimension size (default: 192)
- --num_layers: Number of network layers (default: 3)
- --processor_type: Type of GNN processor
- --hint_mode: How to use hints (encoded_decoded, decoded_only, or none)
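As a concrete illustration, these hyperparameters can all be set explicitly on a single run. The command below is a sketch that simply spells out the documented defaults; it is not a recommended configuration.

```bash
# Sketch: a run.py invocation with the documented hyperparameters spelled out.
# Values mirror the defaults listed above and are illustrative only.
python -m clrs.examples.run \
  --algorithms "bfs" "dfs" "dijkstra" \
  --processor_type mpnn \
  --batch_size 4 \
  --train_steps 10000 \
  --learning_rate 2.5e-4 \
  --hidden_size 192 \
  --num_layers 3 \
  --hint_mode encoded_decoded
```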
This directory contains code for training LLMs on text-encoded graph reasoning tasks, including CLRS text tasks, GraphWiz, and GraphQA benchmarks.
Create a conda environment:

```bash
conda env create -f text-graph-tasks/environment.yml
conda activate llama-env
```

Or manually:

```bash
conda create -n llama-env python=3.10
conda activate llama-env
cd text-graph-tasks
pip install -r requirements.txt
```

Tasks in text versions of the CLRS benchmark:
- Graph algorithms: bfs, dfs, topological_sort, articulation_points, bridges, strongly_connected_components
- Shortest path: dijkstra, bellman_ford, dag_shortest_paths, floyd_warshall
- Minimum spanning tree: mst_kruskal, mst_prim
GraphWiz Tasks:
connectivity, bipartite, cycle, flow, hamilton, shortest, substructure, topology, triangle
GraphQA Tasks:
edge_existence, node_degree, node_count, edge_count, connected_nodes, cycle_check, disconnected_nodes, reachability, shortest_path, maximum_flow, triangle_counting, node_classification

- Follow the instructions in graph_tasks to generate task data.
- For Text-CLRS datasets, use train_clrs_text.py.
- For GraphWiz datasets, use train_graphwiz.py.
- For GraphQA datasets, use train_graphqa.py.
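A minimal single-task run might look like the following. This is a usage sketch: the flag names are taken from the common parameters listed further below, and the values (model, batch size, epochs) are illustrative rather than defaults.

```bash
# Sketch: LoRA fine-tuning on one text-CLRS task, run from text-graph-tasks/.
# Flag names come from the "Common parameters" list below; values are illustrative.
python train_clrs_text.py \
  --task_names bfs \
  --model_key meta-llama/Meta-Llama-3-8B \
  --devices 0 \
  --batch_size 8 \
  --max_epochs 10 \
  --use_lora
```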
The codebase supports various LLM architectures:
- LLaMA family: meta-llama/Llama-2-7b-hf, meta-llama/Llama-2-13b-hf, meta-llama/Meta-Llama-3-8B, meta-llama/Meta-Llama-3-1B
- Mistral: mistralai/Mistral-7B-v0.1
- Qwen: Qwen/Qwen-7B, Qwen/Qwen2-7B
Common parameters for training scripts:
- --task_names: List of tasks to train on
- --model_key: HuggingFace model identifier
- --devices: GPU device IDs to use
- --batch_size: Training batch size
- --max_epochs: Maximum training epochs
- --learning_rate: Learning rate (default: 1e-5)
- --max_length: Maximum sequence length
- --train_multitask: Enable multi-task training
- --use_lora: Use LoRA fine-tuning
- --use_qlora: Use QLoRA (4-bit quantization)
- --train_adapter: Use adapter-based training
- --load_branching_config: Load branching structure from file
- --task_branching_config_dir: Directory for branching structure files
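For instance, multitask LoRA fine-tuning with a precomputed branching structure could be launched as sketched below. Only the flags listed above are used; the exact list syntax for --task_names and --devices, as well as the branching-config directory path, are assumptions to check against the scripts.

```bash
# Sketch: multitask LoRA fine-tuning with a branching structure.
# Flags come from the list above; list syntax, values, and the config
# directory path are illustrative assumptions.
python train_clrs_text.py \
  --task_names bfs dfs dijkstra bellman_ford \
  --model_key meta-llama/Meta-Llama-3-8B \
  --devices 0 1 \
  --train_multitask \
  --use_lora \
  --load_branching_config \
  --task_branching_config_dir branching_configs/
```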
Both projects require:
- Python 3.10+
- CUDA-compatible GPU (recommended)
- Sufficient GPU memory (8GB+ for smaller models, 24GB+ for 7B+ LLMs)
If you find this repository useful or use it in your research, please cite our work with the following BibTeX entry.
@article{li2026efficiently,
title={Efficiently Learning Branching Networks for Multitask Algorithmic Reasoning},
author={Li, Dongyue and Zhang, Zhenshuo and Duan, Minxuan and Dobriban, Edgar and Zhang, Hongyang R.},
journal={SIGKDD Conference on Knowledge Discovery and Data Mining},
year={2026}
}