About PIEFACE

PIEFACE (Personalized Interactive Environment For Automata Combination Exploration) is a controllable, verifiable sandbox for training and evaluating reasoning agents. It supports multiple agent backends alongside human-in-the-loop play in the same environment. PIEFACE is ideal for debugging policies, collecting human feedback, and running alignment experiments in symbolic reasoning domains.

While our current demo focuses on a specific domain from theoretical computer science, the PIEFACE platform can support any symbolic reasoning task with discrete, maskable actions and a ground-truth verifier.

What Are Gadget Reductions?

In complexity theory, gadgets are modular components used in reductions—like logic gates for computational hardness proofs. These gadgets help encode constraints in problems such as Sokoban, PushPush, or block-sliding puzzles.

Common gadgets include:

AP2T: Anti-Parallel 2-Toggle
C2T: Crossing 2-Toggle
L2T: Locking 2-Toggle
NWT: Noncrossing-Wire Toggle
And others used in PSPACE-hardness proofs

Purpose of PIEFACE

PIEFACE began as a visual debugger for RL-discovered gadget simulations, allowing researchers to step through traces, inspect gadget states, and interact with known constructions. It has since evolved into a flexible environment where users can swap between agents mid-trace, replay the same scenario with different policies, or take over manually — enabling head-to-head comparisons, interactive debugging, and personalized agent training.

Features

Step-by-step visualization of agent decisions and traces
Switch between multiple trained agents or human control at any point
Support for all 4-location gadget types
Live RL inference in-browser, with accept/deny human feedback interface
Submit custom actions/steps to explore novel gadget combinations
NEW: User preferences directly train a reward model for personalization and measurable improvements in proof search

Background Reading

Contact

Interested in collaborating or exploring new domains in PIEFACE?

Reach out via LinkedIn or email me at zburton [at] mit [dot] edu.

← Back to PIEFACE