Nuclear Science and Engineering / Volume 196 / Number 8 / August 2022 / Pages 911-926
Technical Paper / dx.doi.org/10.1080/00295639.2022.2043542
Articles are hosted by Taylor and Francis Online.
This work addresses how analysts of a high-valued system (e.g., nuclear reactor, aircraft turbine designs) can extract findable, accessible, interoperable, and reusable scientific data for public dissemination to artificial intelligence and machine-learning (AI/ML) researchers in a manner that cannot be reverse-engineered, potentially compromising sensitive or proprietary information. State-of-the-art methods address this problem through data masking techniques, which allow access to a subset of the information while obfuscating private and potentially identifying information (e.g., personally identifying medical data). These methods are unsuitable for industrial engineering processes, where AI/ML tools need explicit access to all the data available to draw the best inference about the system to help optimize its performance and identify its vulnerabilities, etc. Our novel deceptive infusion of data paradigm provides a solution to this conundrum by developing a mathematical approach capable of concealing the identity of the system while providing full access to all the features employed by AI/ML tools to ensure their optimal performance.