Can an AI coding agent build the same framework-neutral spec on different stacks? External-oracle-graded benchmark. First: Spring Boot vs Tiko DI.