Nuclear energy plays an important role in ensuring the energy security of many countries. When designing and operating complex technological objects (CTOs) such as nuclear power plants (NPPs), it is critical to take their technical characteristics into account to ensure safe operation. This research is motivated by the need for a methodology that speeds up the identification of target information in scientific publications for nuclear industry enterprises. The lack of scientific papers describing the use of language models to extract the characteristics of complex technological objects underscores the need for this research. In this paper, an NPP is chosen as an example of such an object. For a series of experiments on identifying the technical characteristics of a CTO, a list of 35 parameters making up an NPP profile was compiled, and a dataset of 60 scientific publications containing information about the Ling Ao NPP was assembled. A program was developed that processes these publications by loading each article into a language model, issuing queries, and collecting the responses, which are then compiled into a profile of the complex technological object. The results show that the proposed technique enables programmatic processing of scientific publications to compile an NPP profile.
Matveeva et al. studied this question.
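The processing pipeline outlined in the abstract (load an article, query a language model once per profile parameter, collect the answers into a profile) might be sketched as follows. This is only an illustration under stated assumptions, not the authors' program: the parameter names, the prompt wording, the `compile_profile` and `build_prompt` helpers, and the `toy_model` stand-in are all hypothetical, and the abstract does not say which language model or query format was used.

```python
from typing import Callable, Dict, List

# Hypothetical subset of the 35 profile parameters; the actual list
# is not given in the abstract.
PARAMETERS: List[str] = [
    "reactor type",
    "number of units",
    "net electrical capacity",
]


def build_prompt(article_text: str, parameter: str) -> str:
    """Build one query for one parameter (the wording is an assumption)."""
    return (
        "Using only the article below, state the value of the parameter "
        f"'{parameter}' for the nuclear power plant. "
        "Answer 'unknown' if the article does not say.\n\n"
        f"ARTICLE:\n{article_text}"
    )


def compile_profile(
    articles: List[str],
    parameters: List[str],
    ask_model: Callable[[str], str],
) -> Dict[str, List[str]]:
    """Query the model for every (article, parameter) pair.

    Returns a mapping from parameter name to the distinct non-'unknown'
    answers, in the order they were first seen.
    """
    profile: Dict[str, List[str]] = {p: [] for p in parameters}
    for article in articles:
        for parameter in parameters:
            answer = ask_model(build_prompt(article, parameter)).strip()
            if not answer or answer.lower() == "unknown":
                continue
            if answer not in profile[parameter]:
                profile[parameter].append(answer)
    return profile


def toy_model(prompt: str) -> str:
    """Stand-in for a real language model: a keyword lookup, demo only."""
    if "'reactor type'" in prompt and "PWR" in prompt:
        return "PWR"
    return "unknown"


if __name__ == "__main__":
    # Placeholder article texts, not taken from the dataset.
    demo_articles = [
        "The plant operates PWR units.",
        "No relevant details here.",
    ]
    for name, values in compile_profile(demo_articles, PARAMETERS, toy_model).items():
        print(f"{name}: {values}")
```

Passing the model as a plain callable keeps the sketch independent of any particular model API; a real client would replace `toy_model`, and conflicting answers from different publications would still need to be reconciled before the profile is final.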