AI-Enabled Decision Support for EVA Operations

Onboard, explainable AI systems for human spaceflight operations under communication delay and limited ground support.

Future human spaceflight missions will operate under conditions where real-time support from Earth is no longer possible. For Mars missions, communication delays of up to 20 minutes remove the Mission Control feedback loop entirely, requiring crews to operate with significantly greater autonomy.

Our research develops AI-enabled decision support systems that replicate and extend the capabilities of Mission Control within the spacecraft or spacesuit. These systems integrate domain knowledge, physical models, and multi-agent reasoning to support safe, explainable decision-making during extravehicular activity (EVA).

Approach

We develop a multi-agent architecture that decomposes EVA decision-making into specialized roles, mirroring the structure of Mission Control. Each agent is responsible for a specific function such as planning, systems monitoring, physiology, or operations, and operates within defined constraints and authority.

The system integrates three core components:

  • Retrieval: access to validated procedures, flight rules, and mission documentation with full traceability
  • Simulation: deterministic models of physiology, environment, and vehicle systems to evaluate feasibility
  • Agentic reasoning: coordinated multi-agent workflows that produce structured, explainable decisions
System diagram of client-side authentication flow with mobile and web apps, tokens

Multi-agent architecture for onboard EVA decision support, replicating Mission Control functions through distributed reasoning and tool integration.

This architecture is implemented in the RASAGE (Retrieval and Simulation Augmented Guidance Engine) framework, enabling real-time onboard decision support under communication delay and computational constraints.

Validation

We evaluate the system using both historical mission reconstruction and structured benchmarking.

Reconstruction of Apollo 14 EVA operations demonstrates that a multi-agent architecture can replicate key Mission Control functions while maintaining traceability to source documentation.

Dark analytics dashboard screenshot: map with heat spots, query pane, and time-series charts

Prototype EVA decision-support interface showing integrated timeline, navigation, and physiological monitoring during Apollo 14 reconstruction.

To assess foundation model capability in this domain, we developed EVA-Bench, a scenario-based benchmark comprising over 600 tasks grounded in NASA EVA operations. Results show strong performance in procedural knowledge retrieval but significant degradation in multi-step planning and contingency handling.

Two-page infographic with colorful flowcharts and text boxes about contingency management

EVA-Bench scenario examples illustrating structured evaluation of procedural reasoning, anomaly response, and safety-critical decision-making.

Key Contributions
  • A multi-agent framework for Earth-independent EVA decision support
  • Integration of physiological and operational models for safety-critical reasoning
  • A domain-specific benchmark for evaluating AI systems in spaceflight operations
  • Demonstration of low-hallucination, evidence-backed reasoning in mission scenarios
Impact

This work supports a transition from Earth-dependent mission operations to autonomous onboard decision support. The resulting architecture enables safer EVA execution, reduced crew cognitive load, and scalable operations for future lunar and Mars missions.