PSAM 18 - Abstract Status

Abstract RO282 · Full Paper + Presentation

Generative Artificial Intelligence Nuclear Evaluation (GAINE): A Structured Decision Framework to Support Large Language Model Evaluations in Nuclear Power Applications

Authors

Primary: Ronald Laurids Boring, Idaho National Laboratory · ronald.boring@inl.gov

As large language models (LLMs) are deployed in new applications, particularly in safety-critical domains like nuclear power, they must be evaluated with scrutable methods. This paper introduces the Generative Artificial Intelligence Nuclear Evaluation (GAINE) structured decision framework for applying verification, validation, and benchmarking across the LLM lifecycle of design, development, deployment, and monitoring. Each of these evaluation types and phases has both system and human methods, and this paper identifies when human-in-the-loop evaluations of LLMs are necessary. Combined across evaluation phases, methods, and systems, GAINE helps ensure the appropriateness and completeness of LLM evaluations.

✅ Status: The abstract has been accepted!

✅ Paper Status: Accepted
