Benchmark

ScarfBench evaluates agentic transformation of Java applications across Jakarta EE, Quarkus, and Spring.

The suite combines focused examples and whole applications to measure migration quality, framework idiomaticity, and behavioral parity. Every conversion is manually implemented and developer-verified, with tests to confirm post-migration behavior.

Apps: 102
Layers: 6
Frameworks: 3
Tests: 1,331