Substation Operation Sequence Inference Model Based on Deep Reinforcement Learning
Abstract
:1. Introduction
- (1)
- Based on the natural similarity between the power network and the graph network, the main wiring model of the substation is designed using the knowledge base of Neo4j graph. The model can respond to operation and update the state of the main wiring in real time, providing an interactive environment for deep reinforcement learning.
- (2)
- A task-state perception module is designed to identify the working state of the device from the knowledge network and obtain action space for the deep reinforcement learning model to make decisions.
- (3)
- The inference model is constructed using the deep reinforcement learning model, and the reward function and penalty function oriented to reverse switching operation are designed according to the general rules of switching operation so as to deduce the sequence of operations.
2. Design of Substation Operation Ticket Inference Model
- (1)
- The main wiring module converts the main wiring into the graph knowledge model, which is driven by the action module, receives the operation from the action module, changes the topological state of the main wiring, updates the state of the device, and sends the real-time state of the main wiring and the device to the task-state awareness module.
- (2)
- Task-state awareness module: Firstly, the operation task is analyzed, and the task device is obtained. Then, starting from the task device, the action space and associated device required by the operation are searched according to the real-time state sent by the main wiring module, and the action space and associated device state are sent to the operation sequence module for reasoning.
- (3)
- The operation sequence reasoning module firstly evaluates the state of associated devices through action-state evaluation, then obtains reasonable operations from the action space through deep reinforcement learning, and finally gradually forms a complete and correct operation sequence.
- (4)
- The action module mainly updates the knowledge base model of substation diagram with the operation action of the reasoning model, including node attributes and relations.
3. Diagram Model of the Main Wiring
3.1. Modeling
- (1)
- Entity. The reverse operation is to set the working state of the task equipment as the target state by operating the switch group. The switch group is set as an operable entity. The switch group includes the circuit breaker, isolation switch and grounding brake. Other devices such as busbars, transformers, incoming cables, and outgoing cables are set to inoperable entities. Introducing an “endpoint” entity ensures the topological integrity of the main wiring.
- (2)
- Entity attributes. Entity attributes describe the characteristics of entities in the graph model, including static attributes and dynamic attributes. Static attributes include device type, voltage level, operable or unavailable. Dynamic attributes include the following: on/off, off/on, maintenance/cold standby/hot standby/operation/transition/bad state.
- (3)
- The maintenance/cold standby/hot standby/operation/transition/bad state attribute of the entity is related to the shutdown state of the connected switch group. If the entity is a switchgear, its running state is determined by the state of the switching group in which it is located.
- (i)
- Operation σ1: The circuit breaker and the isolation switch are closed, and the grounding switch is open.
- (ii)
- Hot standby σ2: The circuit breaker and ground switch are open, and the isolation switch is closed.
- (iii)
- Cold standby σ3: The circuit breaker, isolation switch and grounding brake are all open.
- (iv)
- Maintenance σ4: The circuit breaker and isolation switch are open, and the grounding switch is closed.
- (v)
- Transition σ5: It is in between four states. For example, the circuit breakers are open, but the isolation switch is not closed, or the circuit breaker and the isolation switch are all closed, but the grounding switch is not fully closed.
- (vi)
- Bad state σ6: This state includes the following: (a) the isolation switch in the switch group operates but the circuit breaker does not operate; (b) repeated action of the same switch; (c) when the load is powered off during the operation when a power recovery path exists; (d) when two or more switch groups are in transition state at the same time.
- (4)
- Relationships. The relationship indicates the connection between the devices. The value can be connected or disconnected.
3.2. Rules for Model Updates
4. Task—State Awareness Module
4.1. Get Task Space
- (1)
- Starting from the target device, search the power supply path connected to the non-switching device, and record the device set in the path.
- (2)
- Select circuit breakers and isolation switches in the path, and record the switchgear in the same path as a switch group.
- (3)
- Arrange the isolation switchgear in the switch group in order from near to far from the load.
- (4)
- Through the circuit breaker device, search the ground switch device with its common endpoint, and record the switch group of the circuit breaker.
- (5)
- If the task device is a non-switching device, search for the ground tool switch that is directly related to the task device and refer to it as the ground tool switch group of the task device.
4.2. Get the Transfer Space
- (1)
- Whether there is a path for restoring power supply to the non-switching devices supplied through the path of the task device, that is, whether power devices such as a live bus or an incoming line can be retrieved from the path.
- (2)
- Whether the path of the task device is the only power supply path of the load device.
- (1)
- Starting from the non-switching devices affected by the device, the attributes of the device nodes and the topology of the graph are used to search for the path through which devices such as live bus can be retrieved.
- (2)
- Pick the path. Select the transfer path with circuit breaker to ensure the safe operation of restored power supply. As a priority, select the path on the same side of the power supply device to ensure the stability of the power supply quality. Finally, record the device collection.
- (3)
- Pick out the circuit breakers and isolation switchgear in the path and record them as a switch group.
- (4)
- In contrast to the tripping space, the isolation switchgear in the switching group is arranged in order from far to near the load.
- (5)
- Through the circuit breaker device, search the ground switch device with its common endpoint, and record the switch group of the circuit breaker.
4.3. Identifying the Status of the Target Device
5. Action Module
- (1)
- After the action of the switching device, change the opening and closing attributes of the action device itself. Before the action, “close” is changed to “divide”, and before the action, “divide” is changed to “close”.
- (2)
- When the switch device is finished, the properties of the relationship between it change. For example, when the switch device is switched from close to open, the relationship changes from “connected” to “disconnected”.
- (3)
- After the updating of relation and attribute, each entity in the substation model is retrieved through “Cypher” statement to judge whether the entity in the model can retrieve the incoming line entity only through the “connected” relation. If it can, the live attribute is updated to “On-load”; otherwise, it is updated to “No-load”.
6. Operation Sequence Inference Module
6.1. Deep Reinforcement Learning Module
6.1.1. Deep Q Network
- (1)
- S represents the set of all environmental states in the decision-making process, and represents the perceived state of the agent in the environment during time t. S is the close/divide state set of the action space switch group.
- (2)
- A is the set of all executable actions of the agent, and represents the actions taken by the agent in time t. In order to simplify the size of the motion space, the size of A is the size of the motion space. The action content is to select the switch device from the action space for the on–off state attribute transformation.
- (3)
- R represents the reward function, and R (,) represents the immediate reward that the agent gets by performing ction a in state .
- (4)
- is the policy set of the agent, representing the mapping from state space S to action space A.
6.1.2. Dueling Deep Q Network
6.2. Procedure Real-Time Evaluation Module
7. Numerical Example Verificatione
- (1)
- The equipment and steps of three maintenance operations are different, but the accuracy of 100% is maintained.
- (2)
- The three maintenance are connected to the transfer channel first, to ensure that the load will not outage, and then maintenance. For example, during the maintenance of QF3, firstly power the line with QFP through steps 1–6, and then overhaul QF3 through steps 7–11. To repair T1, first use steps 1–5 to supply power to WBI, and then use steps 6–15 to repair T1. To overhaul the WBI, first power the line with QFP through steps 1–6, and then overhaul the WBI through steps 7–17.
- (3)
- The operation of the circuit breaker and disconnecting switch complies with the rule that the circuit breaker is first on and then off, and the busbar disconnecting switch is first on and then off to ensure safe operation.
8. Conclusions
- (1)
- Using Neo4j to build a graph structure model containing the main wiring structure and status information, and using its own path search tool, can improve the search efficiency of information such as device connection relationship, device status, transfer path, outage path, load distance, etc.
- (2)
- The Neo4j diagram structure model is used to search the task state space, and then the required environment space and action space of DuelingDQN are constructed according to the task space, which can reduce the required resources of DuelingDQN and improve the computational efficiency of DuelingDQN.
- (3)
- The DuelingDQN model and Neo4j model interact in real time, and the operation rules of “five defense” and isolation switch are used to automatically complete the reasoning of operation sequence, avoiding the problem of rule redundancy in the expert system, ensuring the universality of reasoning, and giving play to the decision-making ability of reinforcing learning to eliminate human intervention.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Li, Y.; Zhang, Q.; Zheng, D.; Xu, P.; Wu, B. On operation ticket expert system for a substation using object oriented technique. In Proceedings of the 2008 27th Chinese Control Conference, Kunming, China, 16–18 July 2008; pp. 2–5. [Google Scholar]
- Zhang, F.; Guo, B. Discussion on the combination of intelligent operation ticket system and programmed operation system in substation. Electr. Power Autom. Equip. 2010, 30, 117–120. [Google Scholar]
- Chen, K.; Zhang, B.; Wu, W.; Sun, H. An intelligent checking system for power system operation tickets. In Proceedings of the 2011 4th International Conference on Electric Utility Deregulation and Restructuring and Power Technologies (DRPT), Weihai, China, 6–9 July 2011; pp. 757–762. [Google Scholar]
- Jiang, C.; Zhou, H.; Deng, Q. A new system of operation ticket for generation and misoperation prevention in smart distribution network. In Proceedings of the 2011 International Conference on Electric Information and Control Engineering, Wuhan, China, 15–17 April 2011; pp. 60–64. [Google Scholar]
- Xu, J.; Zhao, J.; Rao, M.; Shi, J.; Ren, Y. Design of Regional Power Grid Dispatching Operation Ticket System based on SCADA System. Power Syst. Prot. Control. 2010, 38, 104–107, 112. [Google Scholar]
- Zhang, X.; Cheng, X.-T.; Zhao, D.-M. Power Grid Operation Ticket Rule Extraction based on Rough Set. Power Grid Technol. 2014, 38, 1600–1605. [Google Scholar]
- Luo, P.; Xia, W.; Lin, J.-K.; Zheng, W.-H. A New Method of Automatic invoicing Based on Rule Tree. Electr. Power 2011, 44, 79–84. [Google Scholar]
- Dong, S.-C.; Cheng, J.; Peng, B.; Chen, X. Automatic Generation method of Graph-Library-Rule-operation ticket based on Interval Model. Autom. Electr. Power Syst. 2015, 39, 84–89. [Google Scholar]
- Zhang, D.; Yang, J.; Huang, J.; Wang, W. General method of Automatic Generation of operation ticket task. Autom. Electr. Power Syst. 2018, 42, 169–174. [Google Scholar]
- Wang, R.; Li, C.; Fu, W.; Tang, G. Deep Learning Method Based on Gated Recurrent Unit and Variational Mode Decomposition for Short-Term Wind Power Interval Prediction. IEEE Trans. Neural Netw. Learn. Syst. 2020, 31, 3814–3827. [Google Scholar] [CrossRef] [PubMed]
- Fang, P.; Fu, W.; Wang, K.; Xiong, D.; Zhang, K. A compositive architecture coupling outlier correction, EWT, nonlinear Volterra multi-model fusion with multi-objective optimization for short-term wind speed forecasting. Appl. Energy 2022, 307, 118191. [Google Scholar] [CrossRef]
- Ren, S.; Lv, X.; Sun, K.; Xu, Z.; Long, T. Chinese Operation Ticket Automatic Verification Method Based on CNN-BiGRU-Attention in Intelligent Substations. In Proceedings of the 2022 IEEE International Conference on High Voltage Engineering and Applications (ICHVE), Chongqing, China, 25–29 September 2022; pp. 1–4. [Google Scholar]
- Liu, T.; Li, S.; Gu, X.; Wang, T.; Lu, P.; Cao, X.; Yang, X.; Wang, W.; Lv, H.; Feng, C. Historical Similar Ticket Matching and Extraction used for Power Grid Maintenance Work Ticket Decision Making. In Proceedings of the 2019 3rd International Conference on Data Science and Business Analytics (ICDSBA), Istanbul, Turkey, 11–12 October 2019; pp. 302–307. [Google Scholar]
- Zhang, Z.; Zhang, D.; Qiu, R.C. Deep reinforcement learning for power system applications: An overview. CSEE J. Power Energy Syst. 2019, 6, 213–225. [Google Scholar]
- Wei, T.; Chu, X.; Yang, D.; Ma, H. Power balance control of RES integrated power system by deep reinforcement learning with optimized utilization rate of renewable energy. Energy Rep. 2022, 8, 544–553. [Google Scholar] [CrossRef]
- Damjanović, I.; Pavić, I.; Puljiz, M.; Brcic, M. Deep Reinforcement Learning-Based Approach for Autonomous Power Flow Control Using Only Topology Changes. Energies 2022, 15, 6920. [Google Scholar] [CrossRef]
- Zhang, Y.; Ai, Q.; Li, Z. Intelligent demand response resource trading using deep reinforcement learning. CSEE J. Power Energy Syst. 2021, 1–10. [Google Scholar] [CrossRef]
- Elbattah, M.; Roushdy, M.; Aref, M.; Salem, A.B.M. Large-scale ontology storage and query using graph database-oriented approach: The case of Freebase. In Proceedings of the 2015 IEEE Seventh International Conference on Intelligent Computing and Information Systems (ICICIS), Cairo, Egypt, 12–14 December 2015; pp. 39–43. [Google Scholar]
- Jiang, W.; Wang, M.; Chen, J.; Liu, J.; Pu, S.; Xu, Z. Reliability calculation of Distribution network based on Neo4j graph database. Autom. Electr. Power Syst. 2022, 46, 104–111. [Google Scholar]
- Shi, P.; Fan, G.; Li, S.; Kou, D. Big data storage technology for smart distribution grid based on neo4j graph database. In Proceedings of the 2021 IEEE 4th International Conference on Electronics Technology (ICET), Chengdu, China, 7–10 May 2021; pp. 441–445. [Google Scholar]
The Optimal Operation Sequence when QF3 Exits the Running Overhaul | |
---|---|
Step 1 | Open grounding switch QSP17/QSP37 |
Step 2 | Open grounding switch QSP37/QSP17 |
Step 3 | Close disconnecting switch QS1 |
Step 4 | Close disconnecting switch QS3 |
Step 5 | Close disconnecting switch QSp1 |
Step 6 | Close breaker QFP |
Step 7 | Open breaker QF3 |
Step 8 | Open disconnecting switch QS32 |
Step 9 | Open disconnecting switch QS31 |
Step 10 | Close grounding switch QS317/QS327 |
Step 11 | Close grounding switch QS327/QS117 |
The Optimal Operation Sequence when T1 Exits the Running Overhaul | |
---|---|
Step 1 | Open grounding switch QSD17/QSD27 |
Step 2 | Open grounding switch QSD27/QSD17 |
Step 3 | Close disconnecting switch QSD2 |
Step 4 | Close disconnecting switch QSD1 |
Step 5 | Close breaker QFD |
Step 6 | Open breaker QF2/QF1 |
Step 7 | Open breaker QF1/QF2 |
Step 8 | Open disconnecting switch QS12/QS22 |
Step 9 | Open disconnecting switch QS11/QS21 |
Step 10 | Open disconnecting switch QS22/QS12 |
Step 11 | Open disconnecting switch QS21/QS11 |
Step 12 | Close grounding switch QS127/QS117/QS227/QS217 |
Step 13 | Close grounding switch QS117/QS127/QS217/QS227 |
Step 14 | Close grounding switch QS227/QS217/QS127/QS117 |
Step 15 | Close grounding switch QS217/QS227/QS117/QS127 |
The Optimal Operation Sequence when WBI Exits the Running Overhaul | |
---|---|
Step 1 | Open grounding switch QSP17/QSP27 |
Step 2 | Open grounding switch QSP27/QSP17 |
Step 3 | Close disconnecting switch QS2 |
Step 4 | Close disconnecting switch QS3 |
Step 5 | Close disconnecting switch QSp1 |
Step 6 | Close breaker QFP |
Step 7 | Open breaker QF3/QF1 |
Step 8 | Open breaker QF1/QF3 |
Step 9 | Open disconnecting switch QS12/QS32 |
Step 10 | Open disconnecting switch QS11/QS31 |
Step 11 | Open disconnecting switch QS32/QS12 |
Step 12 | Open disconnecting switch QS31/QS11 |
Step 13 | Close grounding switch QS227/QS217/QS327/QS317 |
Step 14 | Close grounding switch QS217/QS227/QS317/QS327 |
Step 15 | Close grounding switch QS327/QS317/QS227/QS217 |
Step 16 | Close grounding switch QS317/QS327/QS217/QS227 |
Step 17 | Close grounding switch QS17 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chen, T.; Li, H.; Cao, Y.; Zhang, Z. Substation Operation Sequence Inference Model Based on Deep Reinforcement Learning. Appl. Sci. 2023, 13, 7360. https://0-doi-org.brum.beds.ac.uk/10.3390/app13137360
Chen T, Li H, Cao Y, Zhang Z. Substation Operation Sequence Inference Model Based on Deep Reinforcement Learning. Applied Sciences. 2023; 13(13):7360. https://0-doi-org.brum.beds.ac.uk/10.3390/app13137360
Chicago/Turabian StyleChen, Tie, Hongxin Li, Ying Cao, and Zhifan Zhang. 2023. "Substation Operation Sequence Inference Model Based on Deep Reinforcement Learning" Applied Sciences 13, no. 13: 7360. https://0-doi-org.brum.beds.ac.uk/10.3390/app13137360