Game Agent
vlmario
by yucheon6000
VLMario is an automated evaluation benchmark for AI-generated Super Mario Bros. levels. It orchestrates a three-stage evaluation pipeline:
1. Simulation & Validation: It runs an A* agent using the Mario-AI-Framework to verify level playability and records the gameplay as video artifacts.
2. VLM-based Assessment: It uses a Vision-Language Model (Gemini) to analyze the gameplay videos, scoring the levels across eight qualitative dimensions: Composition, Probability, Completeness, Aesthetics, Originality, Fairness, Fun, and Difficulty.
3. Aggregation: It aggregates these scores to produce a final, multi-dimensional performance metric for the map generator.
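The description does not say how the aggregation stage combines scores, so the following is only a minimal sketch under stated assumptions: each level receives one numeric score per dimension, and the generator's metric is the per-dimension mean across its levels. The function name `aggregate_scores` and the input shape are hypothetical and are not taken from the VLMario code.

```python
from statistics import mean

# The eight dimensions named in the description.
DIMENSIONS = [
    "Composition", "Probability", "Completeness", "Aesthetics",
    "Originality", "Fairness", "Fun", "Difficulty",
]


def aggregate_scores(level_scores):
    """Average each dimension over all evaluated levels.

    level_scores: list of dicts, one per level, mapping a dimension
    name to a numeric score (hypothetical input shape).
    Returns one dict with the mean score per dimension.
    Assumption: a plain mean; the real benchmark may weight or
    normalize scores differently.
    """
    if not level_scores:
        raise ValueError("at least one scored level is required")
    return {
        dim: mean(scores[dim] for scores in level_scores)
        for dim in DIMENSIONS
    }
```

Keeping the result as one value per dimension, rather than collapsing it to a single number, matches the description of the final metric as multi-dimensional.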