Environmental Impacts of Machine Learning Applications in Protein Science.

Loïc Lannelongue; Michael Inouye
Abstract
Computing tools and machine learning models play an increasingly important role in biology and are now an essential part of discoveries in protein science. The growing energy needs of modern algorithms have raised concerns in the computational science community in light of the climate emergency. In this work, we summarize the different ways in which protein science can negatively impact the environment and we present the carbon footprint of some popular protein algorithms: molecular simulations, inference of protein-protein interactions, and protein structure prediction. We show that large deep learning models such as AlphaFold and ESMFold can have carbon footprints reaching over 100 tonnes of COe in some cases. The magnitude of these impacts highlights the importance of monitoring and mitigating them, and we list actions scientists can take to achieve more sustainable protein computational science.2
Journal COLD SPRING HARBOR PERSPECTIVES IN BIOLOGY
ISSN 1943-0264
Published 01 Dec 2023
Volume 15
Issue 12
Pages
DOI 10.1101/cshperspect.a041473
Type Journal Article | Review
Sponsorship