layton-eval is an AI eval benchmark for divergent, out-of-the-box and non-linear reasoning
benchmarking benchmark ai evaluation eval reasoning vlm llm divergent-thinking reasoning-language-models out-of-the-box-thinking non-linear-reasoning
-
Updated
Feb 5, 2026 - JavaScript