open access publication

Preprint, 2024

ListPred: A predictive ML tool for virulence potential and disinfectant tolerance in Listeria monocytogenes

bioRxiv, Page 2024.01.29.577690, 10.1101/2024.01.29.577690

Contributors

Gmeiner, Alexander 0000-0002-1787-8368 (Corresponding author) [1] Ivanova, Mirena 0000-0002-0077-5207 [1] Kaas, Rolf Sommer 0000-0002-5050-8668 [1] Xiao, Yinghua [2] Otani, Saria 0000-0002-2538-8086 [1] Leekitcharoenphon, Pimlapas 0000-0002-5674-0142 [1]

Affiliations

  1. [1] Technical University of Denmark
  2. [NORA names: DTU Technical University of Denmark; University; Denmark; Europe, EU; Nordic; OECD];
  3. [2] Arla Foods (Denmark)
  4. [NORA names: Arla Foods; Private Research; Denmark; Europe, EU; Nordic; OECD]

Abstract

ABSTRACT Despite current surveillance and sanitation strategies, foodborne pathogens continue to threaten the food industry and public health. Whole genome sequencing (WGS) has reached an unprecedented resolution to analyse and compare pathogenic bacterial isolates. The increased resolution significantly enhances the possibility of tracing transmission routes and contamination sources of foodborne pathogens. In addition, machine learning (ML) on WGS data has shown promising applications for predicting important microbial traits such as virulence, growth potential, and resistance to antimicrobials. Many regulatory agencies have already adapted WGS and ML methods. However, the food industry hasn’t followed a similarly enthusiastic implementation. Some possible reasons for this might be the lack of computational resources and limited expertise to analyse WGS and ML data and interpret the results. Here, we present ListPred, a ML tool to analyse WGS data of Listeria monocytogenes , a very concerning foodborne pathogen. ListPred is able to predict two important bacterial traits, namely virulence potential and disinfectant tolerance, and only requires limited computational resources and practically no bioinformatic expertise, which is essential for a broad application in the food industry. AUTHOR SUMMARY The contamination of food products with microbes such as pathogenic bacteria is a big concern for the food industry. The rapid detection, characterisation and eradication of microbial contaminants are of utmost importance to ensure safe food products. Fortunately, strict food safety regulations and stringent cleaning protocols prevent the transmission of harmful bacteria to humans. Individual bacteria of the same species can have varying abilities to resist cleaning agents or infect a host, meaning that some pathogen isolates might be more dangerous than others. Novel techniques such as genome sequencing and machine learning can help to determine such differences in individual bacteria. Unfortunately, these techniques require a lot of computational power and expertise that is limited in the food industry. This is why we developed an easy-to-use software tool called ListPred that can be used with few computational requirements and little expertise. ListPred helps food companies to answer two essential questions: how dangerous are Listeria monocytogenes pathogens, and how to get rid of them most efficiently?

Keywords

Abstract, Author summary, Listeria, Listeria monocytogenes, Listeria monocytogenes pathogenicity, ML data, ML methods, ML tools, ability, agencies, agents, antimicrobials, applications, authors, bacteria, bacterial isolates, bacterial traits, bioinformatics expertise, characterisation, cleaning, cleaning agents, cleaning protocols, companies, computational power, computational requirements, computational resources, contamination, contamination of food products, contamination sources, data, detection, differences, disinfection tolerance, enthusiastic implementation, eradication, expertise, food, food companies, food industry, food products, food safety regulations, foodborne pathogens, genome, genome sequence, growth, growth potential, health, host, humans, implementation, increased resolution, individual bacteria, industry, isolates, lack, lack of computational resources, learning, machine, machine learning, method, microbes, microbial contamination, microbial traits, monocytogenes, novel techniques, pathogenic bacteria, pathogenic bacterial isolates, pathogenic isolates, pathogens, potential, power, production, protocol, public health, regulation, regulatory agencies, requirements, resistance, resolution, resources, results, route, safety regulations, sanitation, sanitation strategies, sequence, software, software tools, source of foodborne pathogens, species, strategies, summary, surveillance, technique, tolerance, tools, traits, transmission, transmission routes, virulence, virulence potential, whole-genome sequencing, whole-genome sequencing data

Data Provider: Digital Science