MLEvolve

MLEvolve

#1 on MLE-bench in 12 Hours, Now Open Source

MLEvolve is an automated machine learning engineering system for Kaggle-style competitions. It combines progressive MCGS search, multi-agent collaboration, and experience-driven memory to form a full loop from planning to coding, validation, and iterative optimization.

🚀 Open Source: InternScience / MLEvolve
Rank
#1
All (Any Medal %)
61.33 ± 1.33
Runtime Budget
12 Hours
High Split
42.22 ± 2.22

Main Idea

In long-horizon automation tasks, a system should not stop at writing one solution. It needs to continuously search, validate, and refine. MLEvolve turns Plan → Build → Evaluate → Evolve into a repeatable optimization loop so agents can approach better solutions under limited budgets.

Plan Start from strategy design before implementation.
Build Use multi-mode code generation for full builds and local patches.
Evolve Iterate from feedback so solutions improve over time.

Core Innovations

Results and Impact

MLEvolve ranks first on MLE-bench with an Any Medal rate of 61.33% using only a 12-hour runtime budget. It also serves as a key optimization engine in InternAgent 1.5 for longer-horizon scientific discovery workflows.

Low
80.30 ± 1.52
Medium
57.89 ± 1.52
High
42.22 ± 2.22
All
61.33 ± 1.33