Article, 2024

Data-driven diagnosis of high temperature PEM fuel cells based on the electrochemical impedance spectroscopy: Robustness improvement and evaluation

Journal of Energy Chemistry, ISSN 2095-4956, Volume 96, Pages 544-558, 10.1016/j.jechem.2024.05.014

Contributors

Yu, Dan [1] Li, Xingjun (Corresponding author) [1] [2] Araya, Samuel Simon 0000-0001-9294-0793 [3] Sahlin, Simon Lennart 0000-0002-3337-1874 [1] Liso, Vincenzo 0000-0002-7597-3849 [1]

Affiliations

  1. [1] Aalborg University
  2. [NORA names: AAU Aalborg University; University; Denmark; Europe, EU; Nordic; OECD];
  3. [2] The University of Texas at Austin
  4. [NORA names: United States; America, North; OECD];
  5. [3] Luxembourg Institute of Science and Technology
  6. [NORA names: Luxembourg; Europe, EU; OECD]

Abstract

Utilizing machine learning techniques for data-driven diagnosis of high temperature PEM fuel cells is beneficial and meaningful to the system durability. Nevertheless, ensuring the robustness of diagnosis remains a critical and challenging task in real application. To enhance the robustness of diagnosis and achieve a more thorough evaluation of diagnostic performance, a robust diagnostic procedure based on electrochemical impedance spectroscopy (EIS) and a new method for evaluation of the diagnosis robustness was proposed and investigated in this work. To improve the diagnosis robustness: (1) the degradation mechanism of different faults in the high temperature PEM fuel cell was first analyzed via the distribution of relaxation time of EIS to determine the equivalent circuit model (ECM) with better interpretability, simplicity and accuracy; (2) the feature extraction was implemented on the identified parameters of the ECM and extra attention was paid to distinguishing between the long-term normal degradation and other faults; (3) a Siamese Network was adopted to get features with higher robustness in a new embedding. The diagnosis was conducted using 6 classic classification algorithms—support vector machine (SVM), K-nearest neighbor (KNN), logistic regression (LR), decision tree (DT), random forest (RF), and Naive Bayes employing a dataset comprising a total of 1935 collected EIS. To evaluate the robustness of trained models: (1) different levels of errors were added to the features for performance evaluation; (2) a robustness coefficient (Roubust_C) was defined for a quantified and explicit evaluation of the diagnosis robustness. The diagnostic models employing the proposed feature extraction method can not only achieve the higher performance of around 100% but also higher robustness for diagnosis models. Despite the initial performance being similar, the KNN demonstrated a superior robustness after feature selection and re-embedding by triplet-loss method, which suggests the necessity of robustness evaluation for the machine learning models and the effectiveness of the defined robustness coefficient. This work hopes to give new insights to the robust diagnosis of high temperature PEM fuel cells and more comprehensive performance evaluation of the data-driven method for diagnostic application.

Keywords

Bay, ECM, Naive Bayes, PEM fuel cell, SVM, Siamese, Siamese network, accuracy, applications, attention, cells, classification, classification algorithms—support vector machine, coefficient, comprehensive performance evaluation, data-driven diagnosis, data-driven method, dataset, decision, decision tree, degradation, degradation mechanism, diagnosis, diagnosis model, diagnosis robustness, diagnostic applications, diagnostic model, diagnostic performance, diagnostic procedures, distribution, distribution of relaxation times, durability, effect, electrochemical impedance spectroscopy, embedding, error, evaluation, evaluation of diagnostic performance, extraction, extraction method, fault, feature extraction, features, forest, fuel cells, high performance, high temperature PEM fuel cell, identified parameters, impedance spectroscopy, improvement, initial performance, interpretation, k-nearest, k-nearest neighbor, learning models, learning techniques, level of error, levels, logistic regression, machine, machine learning models, machine learning techniques, mechanism, method, model, neighboring, network, normal degradation, parameters, performance, performance evaluation, procedure, random forest, regression, relaxation time, robust diagnosis, robust evaluation, robust improvement, robustness, robustness coefficient, robustness of diagnosis, robustness of trained models, selection, simplicity, spectroscopy, superior robustness, system, system durability, task, technique, time, trees, triplet loss method, vector machine

Funders

  • China Scholarship Council

Data Provider: Digital Science