Abstract: As a representative of the new generation of artificial intelligence technology, ChatGPT relies on large-scale data refinement, analysis, and learning to draw intelligent conclusions and make trend predictions. Specifically, as a large-scale natural language processing model, ChatGPT involves four types of data in its training and operation: pre-training data, manually labeled data, capture data, and human-computer interaction data. Different types of data face different legal risks and challenges in the application process. Because data is the basic raw material of AI and algorithm models, it is of urgent practical significance to carry out risk analysis, legal response, and technical regulation from the perspective of data. In this regard, China can consider conforming to industrial development, balancing security and efficiency, clarifying the boundaries of obligations, improving regulations, improving the legislative system, and forming a targeted system of standards with a cautious attitude that keeps pace with technological development trends, thus seizing the opportunity to "overtake" in the era of the digital economy with the help of AI technology.