Evaluate
This section describes how different categories of algorithms are evaluated in RLSolver Contest 2025.
Evaluation Categories
We evaluate submitted methods according to their type:
Distribution-wise Reinforcement Learning Methods
These are RL-based methods that train on distributions of graph instances and are capable of generalizing to unseen instances.
Evaluation metrics include:
Training Time: Total time required to train the agent across training instances.
Inference Time: Average time to infer a solution for a single test instance.
Objective Value: The primary optimization target, e.g., Maxcut value, tour length, etc.
Conventional Methods
These include classical heuristics, local search, greedy algorithms, and solvers like Gurobi or CPLEX.
Evaluation metrics include:
Running Time: Total time taken to solve a test instance (no training phase).
Objective Value: The final objective value obtained by the algorithm.
Evaluation Criteria Summary
Method Type |
Time Metric |
Optimization Metric |
|---|---|---|
Distribution-wise RL |
Training + Inference |
Objective Value |
Conventional (non-RL) |
Running Time only |
Objective Value |
Notes
Objective Value is defined by the problem instance (e.g., Maxcut, Set Cover, Knapsack). Higher is better unless otherwise specified.
Inference Time is measured on GPU (if applicable), averaged across all test instances.
All methods will be evaluated on a standardized server with fixed computational limits.
Final rankings may use a weighted combination of time and objective metrics, depending on the track.