Show HN: Meadow Mind – a 7B diffusion LLM plays Gym games with zero training
Zero training, second-level reactions (~400ms). A language-rule decision mind on a local 7B diffusion LM. - Hey-Meadow/meadow-mind
Opening excerpt (first ~120 words) tap to expand
Meadow Mind Zero training. Second-level reactions (~400 ms). A language-rule decision mind: write the policy as one sentence, describe the state as one sentence, and a local 7B model makes a real decision every ~0.4 s. No RL, no reward engineering, no gradients, no samples. 🌐 Demo site: meadow-mind.pages.dev (中文) · English · 繁體中文 README pip install meadow-mind # weights auto-download on first use from meadow_mind import MeadowMind, tasks mind = MeadowMind() # loads once, runs on-device task = tasks.mountaincar() mind.check(task) # sanity gate: decision-table exam action, info = mind.decide(task, obs) # obs in, env action out (~0.4s) Results All on official Gymnasium environments, untouched physics, zero training.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at GitHub.