Evaluate

This section describes how different categories of algorithms are evaluated in RLSolver Contest 2025.

Evaluation Categories

We evaluate submitted methods according to their type:

Distribution-wise Reinforcement Learning Methods
- These are RL-based methods that train on distributions of graph instances and are capable of generalizing to unseen instances.
- Evaluation metrics include:
  - Training Time: Total time required to train the agent across training instances.
  - Inference Time: Average time to infer a solution for a single test instance.
  - Objective Value: The primary optimization target, e.g., Maxcut value, tour length, etc.
Conventional Methods
- These include classical heuristics, local search, greedy algorithms, and solvers like Gurobi or CPLEX.
- Evaluation metrics include:
  - Running Time: Total time taken to solve a test instance (no training phase).
  - Objective Value: The final objective value obtained by the algorithm.

Evaluation Criteria Summary

Method Type	Time Metric	Optimization Metric
Distribution-wise RL	Training + Inference	Objective Value
Conventional (non-RL)	Running Time only	Objective Value

Notes

Objective Value is defined by the problem instance (e.g., Maxcut, Set Cover, Knapsack). Higher is better unless otherwise specified.
Inference Time is measured on GPU (if applicable), averaged across all test instances.
All methods will be evaluated on a standardized server with fixed computational limits.
Final rankings may use a weighted combination of time and objective metrics, depending on the track.