Abstract:In view of the lack of a relatively simple and effective method for the classification of nuclear power plant unit capacity factor, based on the data of the performance indicators of the World Association of Nuclear Operators (WANO) in the first to eighth National Reports to the Convention on Nuclear Safety of the People's Republic of China, put a unit capacity factor classification method of the random forest model (random forest, RF), constructs the optimal random forest classification model by estimating the tree of the decision tree of the random forest model and the minimum number of samples required for internal node subdivision. The rapid and fine classification of the capability factor is of great significance for qualitatively grasping the power generation status of my country's nuclear power plants and the status of the units in the industry in the ninth national report. At the same time, Logistic regression which solves two classifications is selected as a comparative test. The test results show that the overall accuracy of the RF classification algorithm reaches 77.27%, and the Kappa coefficient is 0.705 3, which reaches the standard range of high consistency test, which is significantly higher than the 51.14% and 0.110 1 of Logistic regression. It shows the advantages of good classification effect, high accuracy and stable performance, which can effectively improve the classification accuracy of unit capacity factors.