Article, 2024

Multi-agent fuzzy Q-learning-based PEM fuel cell air-feed system control

International Journal of Hydrogen Energy, ISSN 0360-3199, 1879-3487, Volume 75, Pages 354-362, 10.1016/j.ijhydene.2024.02.129

Contributors

Yildirim, Burak 0000-0002-2118-4297 (Corresponding author) [1] Gheisarnejad, Meysam 0000-0003-1841-8053 [2] Özdemir, Mahmut Temel 0000-0002-5795-2550 [3] Khooban, Mohammad Hassan [4]

Affiliations

  1. [1] Bingöl University
  2. [NORA names: Turkey; Asia, Middle East; OECD];
  3. [2] University of Quebec in Montreal
  4. [NORA names: Canada; America, North; OECD];
  5. [3] Fırat University
  6. [NORA names: Turkey; Asia, Middle East; OECD];
  7. [4] Aarhus University
  8. [NORA names: AU Aarhus University; University; Denmark; Europe, EU; Nordic; OECD]

Abstract

In this study, a novel ultra-local model (ULM) control structure using multi-agent system fuzzy Q learning (MAS-FQL) is proposed for the air-feed system of a polymer electrolyte membrane fuel cell (PEMFC). The primary aim of the control goal is to optimize the net power output of the fuel cell while also preventing oxygen starvation. This is achieved by effectively managing the oxygen excess ratio to maintain it at its optimal value, particularly during rapid load fluctuations. In this study, a new advanced control structure for PEMFCs is first presented to effectively manage the oxygen excess rate in the PEMFC system. This work uses an ULM technique in conjunction with an extended state observer (ESO) to effectively manage the control-related concerns connected with the PEMFC. Furthermore, the inclusion of the MAS-FQL has been used to dynamically manage the gains of the ULM controller in an online adaptive manner. The analysis findings demonstrate that the controller exhibits robustness and has satisfactory performance when subjected to load fluctuations. Across all scenario assessments, the proposed controller consistently exhibits an improvement in oxygen excess ratio regulation of more than 31.32% compared to the proportional integral derivative (PID) controller, more than 17.51% compared to the model-free sliding mode control (SMC) controller, and more than 11.40% compared to the fuzzy PID controller across different performance criteria.

Keywords

PID controller, Q-learning, adaptive manner, air-feed system, analysis, analysis findings, assessment, cells, concerns, conjunction, control, control goal, control structure, criteria, derivatives, electrolyte membrane fuel cells, excess rate, excess ratio, extended state observer, findings, fluctuations, fuel, fuel cells, fuzzy PID controller, fuzzy Q-learning, gain, goal, improvement, inclusion, integral derivative, load, load fluctuations, manner, membrane fuel cells, mode control, model, model-free sliding mode controller, net power output, observations, optimal value, output, oxygen, oxygen excess ratio, oxygen starvation, performance, performance criteria, polymer, polymer electrolyte membrane fuel cell system, polymer electrolyte membrane fuel cells, power output, prevent oxygen starvation, proportional integral derivative, rate, ratio, ratio regulation, regulation, robustness, satisfactory performance, sliding mode control, starvation, state observer, structure, study, system, technique, ultra-local model, ultra-local model control, values

Funders

  • Scientific and Technological Research Council of Turkey

Data Provider: Digital Science