Natural Language Processing

Question Answering over Sustainability Reports: Information Richness and Answer Quality

Tobias Schimanski, NeurIPS 2025
Climate Policy Radar's Open Knowledge Graph

Kalyan Dutia, Anne Sietsma, Julie Saigusa, and Harrison Pim, NeurIPS 2025
NLP Models for Climate Policy Analysis: Part I

Max Callaghan, Daniel Spokoyny, and Tobias Schimanski, CCAI Summer School 2024
NLP Models for Climate Policy Analysis: Part II

Max Callaghan, Daniel Spokoyny, and Tobias Schimanski, CCAI Summer School 2024

Blog Posts

Using Machine Learning to Increase Durability and Reduce Returns for Sports and Fashion Goods

Ali Naeem and Alan Fortuny Sicart, January 27, 2024
Using Machine Learning to Track International Climate Finance

Malte Toetzke, November 09, 2022

Machine Reading the Growing Climate Science and Adaptation Literature

Max Callaghan (Mercator Research Institute on Global Commons and Climate Change), Roopam Shukla (Indian Institute of Technology Roorkee), May 13, 2022

Data Extraction and Modelling from Plant Trait Literature

Richard Reeve (University of Glasgow); Neil A. Brummitt (Natural History Museum); Claire L. Harris (Biomathematics and Statistics Scotland); Ana Claudia Araujo (Natural History Museum); Ben Scott (Natural History Museum); Christina Cobbold (University of Glasgow); Glenn Marion (Biomathematics & Statistics Scotland), 2023
Extracting and Discovering New Measurements from Climate Text Sources

Taylor Berg-Kirkpatrick (University of California San Diego); Tom Corringham (Scripps Institution of Oceanography), 2022

NeurIPS 2023
- Fireside Chat: LLMs and their Implications for Climate Change
NeurIPS 2021
- Tianzhen Hong: Machine Learning for Smart Buildings: Applications and Perspectives (Invited talk)
Summer School 2023
- Day 7 - AI for Policy, Decision-Making, Economics, and Finance - July 12, 2023

Venue	Title
NeurIPS 2025	Efficient Reinforcement Learning Implementations for Sustainable Operation of Liquid Cooled HPC Data Centers (Papers Track) Abstract and authors: (click to expand) Abstract: The rapid growth of data-intensive applications like AI has led to a significant increase in the energy consumption and carbon footprint of data centers. Liquid cooling has emerged as a crucial technology to manage the thermal loads of high-density servers more efficiently than traditional air cooling. However, optimizing the complex dynamics of liquid cooling systems to maximize energy efficiency remains a significant challenge. To accelerate research in this domain, we design a suite of highly scalable reinforcement learning (RL) control strategies for liquid-cooled data centers. We demonstrate our work on a digital twin of the Oak Ridge National Laboratory's Frontier supercomputer cooling system that provides a detailed, customizable, and scalable platform for end-to-end liquid cooling control. We demonstrate the utility of our framework by developing and evaluating centralized and decentralized multi-agent RL controllers that optimize cooling tower and server-level operations. Our results show centralized RL-based control can significantly improve operational carbon footprint and thermal management compared to traditional RL applications in literature, thereby offering a promising path toward more sustainable data centers and mitigating their climate impact. Authors: Avisek Naug (Hewlett Packard Enterprise); Antonio Guillen-Perez (Hewlett Packard Enterprise); Vineet Gundecha (Hewlett Packard Enterprise); Ashwin Ramesh Babu (Hewlett Packard Enterprise); Sahand Ghorbanpour (Hewlett Packard Enterprise); Ricardo Luna Gutierrez (Hewlett Packard Enterprise); Soumyendu Sarkar (Hewlett Packard Enterprise)
NeurIPS 2025	Quantifying Climate Policy Action and Its Links to Development Outcomes: A Cross-National Data-Driven Analysis (Papers Track) Abstract and authors: (click to expand) Abstract: Addressing climate change effectively requires more than cataloguing the number of policies in place; it calls for tools that can predict their themes or subject, and analyze their tangible impacts on development outcomes. Existing assessments often rely on qualitative descriptions or composite indices, which can mask crucial differences between key domains such as mitigation, adaptation, disaster risk management, and loss and damage. To bridge this gap, we develop a quantitative indicator of climate policy orientation by applying a multilingual transformer-based language model to official national policy documents, achieving a classification accuracy of 0.90 (F1-score). Linking these indicators with World Bank development data in panel regressions reveals that mitigation policies are associated with higher GDP and GNI; disaster risk management correlates with greater GNI and debt but reduced foreign direct investment; adaptation and loss and damage show limited measurable effects. This integrated NLP–econometric framework enables comparable, theme-specific analysis of climate governance, offering a scalable method to monitor progress, evaluate trade-offs, and align policy emphasis with development goals. The code and datasets used in this study are publicly available at: https://github.com/booktrackerGirl/climate_change_policy_analysis. Authors: Aditi Dutta (University of Exeter)
NeurIPS 2025	EcoEval: A Benchmark for Evaluating Large Language Model Handling of Climate Change Misinformation, False Beliefs, and Climate Policy Sentiment (Papers Track) Abstract and authors: (click to expand) Abstract: As Large Language Models (LLMs) become primary sources of factual knowledge, their ability to accurately communicate climate science, resist misinformation, and provide balanced policy guidance becomes critically important. However, existing evaluation frameworks lack a comprehensive assessment of LLM performance across the multifaceted challenges of climate communication. We introduce EcoEval, an open-source benchmark evaluating LLM performance across three dimensions: (1) giving users correct information, while correcting user misconceptions, (2) avoiding generation of fabricated climate content, and (3) expressing balanced climate policy sentiment. Our results span 8 commercially deployed models, revealing substantial variation in policy sentiment, sycophancy, and willingness to generate misinformation. Authors: Nick Lechtenboerger (HPI); Pat Pataranutaporn (MIT Media Lab); Pattie Maes (MIT Media Lab)
NeurIPS 2025	Geospatial Chain of Thought Reasoning for Enhanced Visual Question Answering on Satellite Imagery (Papers Track) Abstract and authors: (click to expand) Abstract: Geospatial chain of thought (CoT) reasoning is essential for advancing Visual Question Answering (VQA) on satellite imagery, particularly in climate related applications such as disaster monitoring, infrastructure risk assessment, urban resilience planning, and policy support. Existing VQA models enable scalable interpretation of remote sensing data but often lack the structured reasoning required for complex geospatial queries. We propose a VQA framework that integrates CoT reasoning with Direct Preference Optimization (DPO) to improve interpretability, robustness, and accuracy. By generating intermediate rationales, the model better handles tasks involving detection, classification, spatial relations, and comparative analysis, which are critical for reliable decision support in high stakes climate domains. Experiments show that CoT supervision improves accuracy by 34.9% over direct baselines, while DPO yields additional gains in accuracy and reasoning quality. The resulting system advances VQA for multispectral Earth observation by enabling richer geospatial reasoning and more effective climate use cases. Authors: Shambhavi Shanker (IIT Bombay); Manikandan Padmanaban (IBM Research India); Jagabondhu Hazra (IBM Research India)
NeurIPS 2025	Reflexive Evidence-Based Multimodal Learning for Clean Energy Transitions: Causal Insights on Cooking Fuel Access, Urbanization, and Carbon Emissions (Papers Track) Abstract and authors: (click to expand) Abstract: Achieving Sustainable Development Goal 7 (Affordable and Clean Energy) requires not only technological innovation but also a deeper understanding of the socio-economic factors that influence energy access and carbon emissions. Despite growing attention to these drivers, key questions remain, particularly regarding how to quantify socio-economic impacts, how these impacts interact across domains such as policy, technology, and infrastructure, and how feedback processes shape energy systems. To address these gaps, this study introduces ClimateAgents, an AI-based framework that combines large language models with domain-specialized agents to support hypothesis generation and scenario exploration. Leveraging 20 years of socio-economic and emissions data from 265 economies, countries and regions, and 98 indicators drawn from the World Bank database, the framework applies a machine learning–based causal inference approach to identify key determinants of carbon emissions in an evidence-based, data-driven manner. The analysis highlights three primary drivers: (1) access to clean cooking fuels in rural areas, (2) access to clean cooking fuels in urban areas, and (3) the percentage of population living in urban areas. These findings underscore the critical role of clean cooking technologies and urbanization patterns in shaping emission outcomes. In line with growing calls for evidence-based AI policy, ClimateAgents offers a modular and reflexive learning system that supports the generation of credible and actionable insights for policy. 
By integrating heterogeneous data modalities, including structured indicators, policy documents, and semantic reasoning, the framework contributes to adaptive policymaking infrastructures that can evolve with complex socio-technical challenges. This approach aims to support a shift from siloed modeling to reflexive, modular systems designed for dynamic, context-aware climate action. Authors: Shan Shan (Zhejiang University)
NeurIPS 2025	Machine learning discovery of regional and social disparities in electric vehicle charging reliability with GPT-5 (Papers Track) Abstract and authors: (click to expand) Abstract: There is growing interest in studying charger reliability to address persistent barriers to electric vehicle (EV) adoption and advance the decarbonization of transportation, one of the largest emitting sectors globally. Improved measurement of charger reliability is critically needed to accelerate network effects to promote EV adoption, develop pay-as-you-use infrastructure, and aggregate intelligence for more responsive service operations. However, prior methods for assessing charger reliability, which typically rely on citizen-generated data and expensive expert annotation/supervision, have proven inadequate for identifying regional and social disparities in charging performance. Prior architectures have often lacked the detection accuracy necessary for large-scale inference, especially with imbalanced datasets. This study introduces a machine learning pipeline that detects spatial disparities in charger reliability based on 838,785 U.S. consumer reviews of their experiences. We document new performance benchmarks in reliability detection using zero and few shot learning capabilities and expert counterfactual reasoning (F1 score: 0.97, SD: 0.02), outperforming previous models in the domain of electric mobility, such as ClimateBERT. To enable spatial analyses, we further demonstrate how reliability measures can be combined with popular diversity indices to inform economic and policy decision-making. Using this approach, we find evidence of widespread charging reliability issues in about half of all U.S. counties (1,653 of 3,244 counties), especially in the most populated areas. Disparities in charger reliability are most pronounced in metropolitan areas and along federally-designated EV corridors, raising concerns about inconsistent user experiences in high-traffic zones. 
This scalable and evidence-based approach to data discovery can be integrated into a wide range of causal inference and prediction settings in electric mobility. Authors: Yifan Liu (Georgia Institute of Technology); Lindsey Snyder (Georgia Institute of Technology); Omar Asensio (Georgia Institute of Technology)
NeurIPS 2025	Seg the HAB: Language-Guided Geospatial Algae Bloom Reasoning and Segmentation (Papers Track) Abstract and authors: (click to expand) Abstract: Climate change is intensifying the occurrence of harmful algal blooms (HABs), particularly cyanobacteria, which threaten aquatic ecosystems and human health through oxygen depletion, toxin release, and disruption of marine biodiversity. Traditional monitoring approaches, such as manual water sampling, remain labor-intensive and limited in spatial and temporal coverage. Recent advances in vision-language models (VLMs) for remote sensing have shown potential for scalable AI-driven solutions, yet challenges remain in reasoning over imagery and quantifying bloom severity. In this work, we introduce ALGae Observation and Segmentation (ALGOS), a segmentation-and-reasoning system for HAB monitoring that combines remote sensing image understanding with severity estimation. Our approach integrates GeoSAM-assisted human evaluation for high-quality segmentation mask curation and fine-tunes a vision-language model on severity prediction using the Cyanobacteria Aggregated Manual Labels (CAML) from NASA. Experiments demonstrate that ALGOS achieves robust performance on both segmentation and severity-level estimation, paving the way toward practical and automated cyanobacterial monitoring systems. Authors: Patterson Hsieh (UC San Diego); Chia-Jui Yeh (UC Berkeley); Mao-Chi He (UC Berkeley); Wen-Han Hsieh (UC Berkeley); Haw-Ting Hsieh (Berkeley)
NeurIPS 2025	CC-GRMAS: A Multi-Agent Graph Neural System for Spatiotemporal Landslide Risk Assessment in High Mountain Asia (Proposals Track) Abstract and authors: (click to expand) Abstract: Landslides are a growing climate induced hazard with severe environmental and human consequences, particularly in high mountain Asia. Despite increasing access to satellite and temporal datasets, timely detection and disaster response remain underdeveloped and fragmented. This work introduces CC-GRMAS, a framework leveraging a series of satellite observations and environmental signals to enhance the accuracy of landslide forecasting. The system is structured around three interlinked agents Prediction, Planning, and Execution, which collaboratively enable real time situational awareness, response planning, and intervention. By incorporating local environmental factors and operationalizing multi agent coordination, this approach offers a scalable and proactive solution for climate resilient disaster preparedness across vulnerable mountainous terrains. Authors: Mihir Panchal (Dwarakadas Jivanlal Sanghvi College of Engineering); Ying-Jung Chen (Georgia Institute of Technology); Surya Parkash (National Institute of Disaster Management)
NeurIPS 2025	Extracting Structured Policy Information from Climate Action Plans (Proposals Track) Abstract and authors: (click to expand) Abstract: Most of the world’s climate action policies are planned and implemented at the local level, through city and regional climate action plans (CAPs). To assess global progress in climate mitigation and adaptation, as in forthcoming assessments such as the 2027 IPCC Special Report on Climate Change and Cities, we need systematic ways to track and analyze these plans. However, CAPs are dispersed across thousands of jurisdictions, vary widely in structure and format, and are often difficult to access. We propose a standard CAP ontology, and a retrieval- and extraction-oriented pipeline that leverages recent advances in natural language processing (NLP) and information retrieval (IR) to transform CAPs into a structured, verifiable dataset of climate policies. As a case study, we focus on California, where more than 260 local governments have published one or more CAPs since 2006. We develop an annotated benchmark dataset of 17 San Diego County CAPs with over 1,800 extracted policies and associated attributes. Unlike prior efforts that rely on small annotated corpora or industry-specific disclosures, our system explicitly grounds every extracted element in its underlying PDF, ensuring transparency and reducing hallucination in the produced dataset. Addressing these challenges will enable large-scale comparative analyses of CAPs across jurisdictions world-wide, supporting policymakers, sustainability officers, and hazard managers, and accelerating climate adaptation and mitigation efforts. Authors: Tom Corringham (Scripps Institution of Oceanography); Nupoor Gandhi (Carnegie Mellon); Bryan Flores (Independent Researcher); Emma Strubell (Carnegie Mellon); Sireesh Gururaja (Carnegie Mellon); Tristan Romanov (Independent Researcher); Jacob Dunafon (Independent Researcher)
NeurIPS 2025	Tracking the spread of climate change skepticism on X with simulations and deep learning (Proposals Track) Abstract and authors: (click to expand) Abstract: Climate change continues to be a global challenge that requires urgent action. However, the ongoing presence of climate skepticism undermines society's ability to confront this important challenge. Understanding the mechanisms driving the spread of climate skepticism might give policymakers additional tools to combat climate change. Here, we propose a methodological approach that combines computational simulation (in the form of an agent-based model representing online X communication) with simulation-based inference using amortized deep neural networks. Our approach allows us to infer the relative importance of a variety of different learning strategies that can contribute to the spread of climate skepticism and support. Authors: Uwaila Ekhator (Boise State University); Mason Youngblood (Institute for Advanced Computational Science, Stony Brook University); Vicken Hillis (Boise State University)
NeurIPS 2025	Climate Policy Radar's Open Knowledge Graph (Tutorials Track) Abstract and authors: (click to expand) Abstract: Climate Policy Radar (CPR) helps people access and understand vast amounts of climate documents: laws, policies, NDCs, corporate transition plans, litigation documents, reports by statutory advisory bodies and industry bodies, and more. This is a dataset tutorial covering two resources: (1) CPR's open data, the full text and metadata of all of these documents, which we published open source; and (2) its 'concept store'. Climate documents are often long and filled with technical jargon. This makes them particularly difficult to analyse. The concept store helps with this, giving users access to a rich web of expert-defined concepts and their relationships. By linking this expert knowledge of climate change to the extensive curated database of climate documents, we show you how to create a climate policy knowledge graph. This can then be used in turn to analyse the global policy landscape. After this tutorial you'll be able to download and understand CPR's data (text and concepts), use the structure of our knowledge graph to train some simple but powerful classifiers, and do some introductory analysis of real climate policy documents. Authors: Kalyan Dutia (Climate Policy Radar); Anne Sietsma (Climate Policy Radar); Julie Saigusa (Climate Policy Radar); Harrison Pim (Climate Policy Radar)
NeurIPS 2025	Question Answering over Sustainability Reports: Information Richness and Answer Quality (Tutorials Track) Abstract and authors: (click to expand) Abstract: In this tutorial, we learn about an advanced strategy for information retrieval for question answering with Large Language Models (LLMs) in knowledge-intensive domains like sustainability reporting. We analyze the quality and quantity of sources before answering a question and quantify uncertainty around answering a question after LLM generation. Authors: Tobias Schimanski (University of Zurich)
ICLR 2025	ClimateChat: Designing Data and Methods for Instruction Tuning LLMs to Answer Climate Change Queries (Papers Track) Abstract and authors: (click to expand) Abstract: As the issue of global climate change becomes increasingly severe, the demand for research in climate science continues to grow. Natural language processing technologies, represented by Large Language Models (LLMs), have been widely applied to climate change-specific research, providing essential information support for decision-makers and the public. Some studies have improved model performance on relevant tasks by constructing climate change-related instruction data and instruction-tuning LLMs. However, current research remains inadequate in efficiently producing large volumes of high-precision instruction data for climate change, which limits further development of climate change LLMs. This study introduces an automated method for constructing instruction data. The method generates instructions using facts and background knowledge from documents and enhances the diversity of the instruction data through web scraping and the collection of seed instructions. Using this method, we constructed a climate change instruction dataset, named ClimateChat-Corpus, which was used to fine-tune open-source LLMs, resulting in an LLM named ClimateChat. Evaluation results show that ClimateChat significantly improves performance on climate change question-and-answer tasks. Additionally, we evaluated the impact of different base models and instruction data on LLM performance and demonstrated its capability to adapt to a wide range of climate change scientific discovery tasks, emphasizing the importance of selecting an appropriate base model for instruction tuning. This research provides valuable references and empirical support for constructing climate change instruction data and training climate change-specific LLMs. 
Authors: Zhou Chen (Tsinghua University); Xiao Wang (Tsinghua University); Liao Yuanhong (Tsinghua University); Ming Lin (Tsinghua University); Yuqi Bai (Tsinghua University)
ICLR 2025	ExioNAICS: Enterprises Level Emission Estimation Dataset with Large Language Models (Papers Track) Abstract and authors: (click to expand) Abstract: Accurate greenhouse gas emission reporting is increasingly important for governments, businesses, and investors. However, mainstream adoption, particularly among small and medium enterprises, remains limited by the high implementation costs, fragmented emission factor databases, and a lack of robust classification tools. To address these challenges, we introduce ExioNAICS, the first large-scale NLP benchmark dataset for enterprise-level GHG emission estimation. ExioNAICS integrates validated North American Industry Classification System labels for over 20,850 companies with a concordance to an economic model of carbon intensity factors. By framing the classification task as an Information Retrieval problem and fine-tuning Sentence-BERT with a contrastive learning approach, we achieve state-of-the-art performance on NAICS categories, notably 77.51% Top-1 accuracy and 91.33% Top-10 accuracy in our most challenging setting of 1,114 classes. We make ExioNAICS publicly available to lower the entry barrier for GHG reporting and facilitate broader collaboration between machine learning researchers and climate experts. Dataset, code and trained models can be found at: https://huggingface.co/datasets/Yvnminc/ExioNAICS Authors: Yanming Guo (University of Sydney); Jin Ma (University of Sydney); Qiao Xiao (Maynooth University); Kevin Credit (Maynooth University)
ICLR 2025	GreenScreen: Automatic Accessible Presentation Generation from IPCC Reports (Papers Track) Abstract and authors: (click to expand) Abstract: The Intergovernmental Panel on Climate Change (IPCC) Summary for Policymakers (SPM) is key for communicating climate assessments to leaders and policymakers. However, these SPMs often have poor readability for their target audiences. Research in cognitive theory suggests that more accessible and visual presentations can improve understanding of complex, dynamic systems, such as climate change. AI-driven extractive summarization and content curation have shown promise in fields such as medicine and the social sciences, leading to calls for their application in climate science, where critical information is often complex to understand despite its urgency. In response, we propose an LLM-driven automated pipeline, GreenScreen, which transforms dense IPCC SPM reports into clear, visual slide decks. This approach makes key climate insights more accessible and actionable. Our results indicate that GreenScreen improves readability from a Grade 18 (College Graduate) level to a Grade 4 level while preserving 83% content accuracy. Code is available at https://github.com/kvcs11/GreenScreen. Authors: Alice Heiman (Stanford University); Komal Vij (Stanford University); Anjali Sreenivas (Stanford University)
ICLR 2025	Large Language Models as a New Modality for Generalizable Earth Data Monitoring (Papers Track) Abstract and authors: (click to expand) Abstract: Earth observation data are critical for monitoring progress toward Sustainable Development Goals (SDGs), yet persistent challenges in accessibility, integration of multimodal data, and geographic bias hinder comprehensive global assessments. While satellite imagery paired with machine learning (SIML) offers cost-effective monitoring, it struggles with socioeconomic indicators, data inequity, and spatial biases. This paper presents a novel framework leveraging large language models (LLMs) as a complementary modality to address these limitations. By extracting geospatial knowledge from pretrained LLMs through structured prompting—encoding coordinates into rich, task-agnostic embeddings—we enable efficient prediction of diverse earth monitoring indicators using linear regression. Evaluated on 25 global tasks spanning from climate metrics (e.g., temperature) to socioeconomic variables (e.g., poverty rates), our method outperforms state-of-the-art SIML approaches, achieving higher accuracy and sample efficiency. Notably, LLM-derived representations exhibit reduced geographic bias compared to existing methods and inherently capture socioeconomic contexts that form semantically meaningful clusters aligned with regional development patterns. Authors: Tong Nie (Tongji University); Junlin He (The Hong Kong Polytechnic University); Wei Ma (The Hong Kong Polytechnic University)
ICLR 2025	Towards the Curation of Environment-related Knowledge Graphs: Fine-tuning General-domain Language Models for Biodiversity Named Entity Recognition (Papers Track) Abstract and authors: (click to expand) Abstract: The availability of climate data fuels timely science-based climate actions. Providing policymakers and regulators with easy-to-digest, structured climate data, e.g., in the form of a knowledge graph, is critical to mitigating the adverse effects of climate change on the natural environment. Natural language processing (NLP) applications that employ Named Entity Recognition (NER) systems can aid in uncovering information hidden in millions of textual documents. In this paper, we evaluated the NER performance of transformer-based Bidirectional Encoder Representations from Transformers (BERT) models that were pre-trained on general-domain data. We fine-tuned BERT-based models on the COPIOUS dataset for the specialist task of biodiversity NER. Our experiments showed that our DeBERTa NER model demonstrated best performance, obtaining a micro-averaged F1-score of 84.18% based on entity-level evaluation. We employed our DeBERTa NER model in a biodiversity Information Extraction (IE) pipeline and applied it on the forestry compendium of the Centre for Agricultural and Biosciences International (CABI) Digital Library. We demonstrate that the pipeline enables the extraction of structured information on reproductive conditions and habitats of tree species. Authors: Geilah Tabanao (University of the Philippines Diliman); Andrew Miguel Pagdanganan (University of the Philippines Diliman); Riza Batista-Navarro (University of Manchester); Roselyn Gabud (University of the Philippines Diliman)
ICLR 2025	Palimpsest: Bill of Materials Prediction - A Case Study with Solid State Drives (Papers Track) Abstract and authors: (click to expand) Abstract: Accurately quantifying product carbon footprints (PCFs) is critical for organizations to measure environmental impacts and develop decarbonization strategies. However, traditional methods require Bills of Materials (BOMs) data as a key input for PCF estimation, which is time-intensive and limits scalability. We present Palimpsest, an automated BOM generation algorithm given product specification as input using Large Language Models (LLMs) and a reference dataset. Palimpsest extracts data from teardown reports to build a BOM repository, retrieves reference products based on their attribute list, generates BOMs by systematically modifying reference BOMs based on attribute differences, and standardizes the output to enable automated PCF estimation. We also introduce a novel impact-based evaluation framework that compares predicted BOMs with ground truth, focusing on the accuracy in carbon impact. We benchmark our model against a naive LLM solution and a traditional PCF estimation approach for solid state drives and find it outperforms these methods with a weighted F1 of 99.5%. By streamlining and automating BOM prediction, our method reduces the manual effort required for PCF estimation, driving progress toward net-zero emissions targets across industries. Authors: Anran Wang (Amazon); Zaid Thanawala (Amazon); Harsh Gupta (Amazon); Jeremie Hakian (Amazon); Jared Kramer (Amazon); Kommy Weldemariam (Amazon); Bharathan Balaji (Amazon)
ICLR 2025	Geo-Semantics Analysis of Environmental Disasters in Nigeria Using National Print Media Data for Disaster Management (Papers Track) Abstract and authors: (click to expand) Abstract: In recent years, Nigeria has experienced various natural and environmental disasters, including floods, food insecurity, fire outbreaks, oil spills, and banditry. These events have caused extensive damage, disrupted lives and properties, and displaced many humans, with emergency response efforts often hindered by the lack of accurate and up-to-date disaster location information. In addressing this gap, we investigated the application of geo-semantics analysis on environmental disasters in Nigeria using media data to map locations and to improve emergency response planning. We developed a disaster location-based (DLB) NER model by fine-tuning three Named Entity Recognition (NER) models—spaCy, BERT, and DistilBERT—with an extensive compiled dataset of disaster-related Nigerian news articles. Each model was evaluated using three metrics, with BERT achieving the highest performance: precision of 0.99331, recall of 0.99349, and f1-score of 0.99297, followed by DistilBERT with precision of 0.99236, recall of 0.99297, and f1-score of 0.99240, and spaCy with precision of 0.95, recall of 0.77, and f1-score of 0.85. The model was used to recognize toponyms and extract location details. Using Nominatim, we resolved the toponyms into coordinates and visualized disaster hotspots. These results show that the fine-tuned NER models can be used in providing precise, real-time mapping, and improving situational awareness for focused interventions. Our approach provides a transformative framework for incorporating print media data into emergency response strategies and informing humanitarian assistance efforts more effectively. 
Authors: Benedict Ajanaku (Data Science Nigeria); Rashidat Sikiru (Data Science Nigeria); Anthony Soronnadi (Data Science Nigeria); Ife Adebara (Data Science Nigeria); Olubayo Adekanmbi (Data Science Nigeria AI (DSNai))
ICLR 2025	Tracking ESG Disclosures of European Companies with Retrieval-Augmented Generation (Proposals Track) Abstract and authors: (click to expand) Abstract: Corporations play a crucial role in mitigating climate change and accelerating progress toward environmental, social, and governance (ESG) objectives. However, structured information on the current state of corporate ESG efforts remains limited. In this paper, we propose a machine learning framework based on a retrieval-augmented generation (RAG) pipeline to track ESG indicators from N=9,200 corporate reports. Our analysis includes ESG indicators from 600 of the largest listed corporations in Europe between 2014 and 2023. We focus on two key dimensions: first, we identify gaps in corporate sustainability reporting in light of existing standards. Second, we provide comprehensive bottom-up estimates of key ESG indicators across European industries. Our findings enable policymakers and financial markets to effectively assess corporate ESG transparency and track progress toward global sustainability objectives. Authors: Kerstin Forster (LMU Munich & Munich Center for Machine Learning); Victor Wagner (LMU Munich & Sustainability Reporting Navigator); Lucas Elias Keil (University of Cologne & Sustainability Reporting Navigator); Maximilian A. Müller (University of Cologne & Sustainability Reporting Navigator); Thorsten Sellhorn (LMU Munich & Sustainability Reporting Navigator); Stefan Feuerriegel (LMU Munich & Munich Center for Machine Learning)
ICLR 2025	Evaluating the Environmental Impact of Language Models with Life Cycle Assessment (Proposals Track) Abstract and authors: (click to expand) Abstract: As the scale of machine learning models and the prevalence of AI workloads have grown, so have the computational, financial, and energy requirements of development and deployment. In response, recent research in efficient machine learning and Green AI has proposed interventions aimed at reducing the environmental resource consumption of machine learning, such as model compression, efficient training methods, and data distillation. Additionally, various tools and frameworks have facilitated reporting and measurement of metrics related to efficiency and environmental impact. However, holistic, bottom-up assessment of the end-to-end environmental impacts of ML remains elusive. Inspired by work from the environmental impact community, we propose holistic lifecycle assessment (LCA) for analyzing language models. We identify use stages for studying LLM development and deployment, propose methods for measuring power utilization, and propose analyses for comparing the relative environmental costs of individual stages. Authors: Jared Fernandez (Carnegie Mellon University); Clara Na (Carnegie Mellon University); Yonatan Bisk (Carnegie Mellon University); Emma Strubell (Carnegie Mellon University)
ICLR 2025	From Rumors to Risk: Mapping and Modeling Climate-Disaster Misinformation (Proposals Track) Abstract and authors: (click to expand) Abstract: Recent years have seen a surge in climate-disaster misinformation, with social media amplifying unfounded claims in the lead-up to and aftermath of major disasters. This misinformation has hindered disaster preparation and recovery while fueling harassment against meteorologists and government officials, eroding trust in scientific institutions. While tools exist for analyzing general climate-change misinformation, current datasets often overlook the rapidly shifting narratives tied to specific events like wildfires, floods, or hurricanes. This proposal addresses that gap by developing a dynamic, evolving dataset on climate-disaster misinformation. Built through targeted social media data collection and rigorous labeling, the dataset will adapt alongside AI/ML advancements through iterative feedback from model performance and emerging trends. This openly accessible resource will enable researchers and practitioners to refine detection algorithms, design interventions, and inform crisis communication strategies—ensuring both data and models remain aligned with the shifting misinformation landscape. Ultimately, this work seeks to clarify key drivers of misinformation propagation and support more effective climate disaster response. Authors: Tristan Ballard (Independent)
NeurIPS 2024	Enabling Adoption of Regenerative Agriculture through Soil Carbon Copilots (Papers Track) Abstract and authors: (click to expand) Abstract: Mitigating climate change requires transforming agriculture to minimize environmental impact and build climate resilience. Regenerative agricultural practices enhance soil organic carbon (SOC) levels, thus improving soil health and sequestering carbon. A challenge to increasing regenerative agriculture practices is cheaply measuring SOC over time and then understanding how SOC is affected by regenerative agricultural practices and other environmental factors and farm management practices. To address this challenge, we introduce an AI-driven Soil Organic Carbon Copilot that automates the ingestion of complex multi-resolution, multi-modal data to provide large-scale insights into soil health and regenerative practices. Our data includes extreme weather event data (e.g., drought conditions and wildfire incidents), farm management data (e.g., cropland information and tillage predictions), and SOC predictions. We find that integrating public data and specialized models enables large-scale, localized analysis for sustainable agriculture. In comparisons of agricultural activities and practices across California counties, we find evidence that diverse agricultural activity may mitigate the negative effects of tillage; and that while extreme weather conditions heavily affect SOC, composting may mitigate SOC loss. Finally, implementing role-specific personas empowers agronomists, farm consultants, policymakers, and other stakeholders to implement evidence-based strategies that promote sustainable agriculture and build climate resilience. Authors: Margaret Capetz (UCLA); Swati Sharma (Microsoft Research); Peder Olsen (Microsoft); Rafael Padilha (Microsoft Research); Jessica Wolk (Microsoft); Emre Kiciman (Microsoft Research); Ranveer Chandra (Microsoft Research)
NeurIPS 2024	Parakeet: Emission Factor Recommendation for Carbon Footprinting with Generative AI (Papers Track) Abstract and authors: (click to expand) Abstract: Accurately quantifying greenhouse gas (GHG) emissions from products and business activities is crucial for organizations to measure their environmental impact and undertake mitigation actions. Life cycle assessment (LCA) is the scientific discipline for measuring GHG emissions associated with each stage of a product or activity, from raw material extraction to disposal. Measuring the emissions outside of a product owner's control is challenging, and practitioners rely on emission factors (EFs) – estimates of GHG emissions per unit of activity – to model and estimate indirect impacts. These EFs come from prior LCA studies and are collated into databases. The current practice of manually finding the appropriate EF to use from databases is time-consuming, error-prone, and requires domain expertise, hindering scalability and accuracy in emissions quantification. We present a novel AI-assisted method that leverages large language models to automatically recommend EFs. Our method parses business activity descriptions and recommends the appropriate EF with a human-interpretable justification. We benchmark our solution across multiple domains and find it achieves state-of-the-art performance in EF recommendation, with an average Precision@1 of 88.4%. By streamlining and automating the EF selection process, our AI-assisted method enables scalable and accurate quantification of GHG emissions, supporting organizations' sustainability initiatives and driving progress toward net-zero emissions targets across industries. Authors: Bharathan Balaji (Amazon); Nina Domingo (Amazon); Abu Zaher Faridee (Amazon); Venkata Sai Gargeya Vunnava (Amazon); Anran Wang (Amazon); Fahimeh Ebrahimi Meymand (Amazon); Kellen Axten (Amazon); Aravind Srinivasan (Amazon); Qingshi Tu (University of British Columbia); Harsh Gupta (Amazon); Shikha Gupta (Amazon); Soma Ramalingam (Amazon); Jeremie Hakian (Amazon); Jared Kramer (Amazon)
NeurIPS 2024	Critical misalignments between climate action and sustainable development goals revealed (Papers Track) Abstract and authors: (click to expand) Abstract: A mere 12 percent of the Sustainable Development Goals (SDGs) are currently on track to meet the 2030 deadline in a world under climate change. Since their launch in 2015, the 2030 Agenda for Sustainable Development and the Paris Agreement have suffered persistent mismatches, which limit the potential for mutual gains. We use Artificial Intelligence (AI) to assess the degree and type of alignment between the Nationally Determined Contributions (NDCs) and the SDGs. While high income countries tackle the energy-infrastructure-community nexus in terms of opportunity, lower income countries make climate impacts more explicit and center their trade-offs around the water-energy-food nexus. These two approaches mark different development trajectories and have non-negligible implications on international financial flow architecture and climate governance. Authors: Francesca Larosa (Royal Institute for Technology); Sergio Hoyas (Universitat Politècnica de València); Fermin Mallor Franco (Royal Institute of Technology); J. Alberto Conejero (Universitat Politècnica de València); Javier García-Martinez (University of Alicante); Francesco Fuso Nerini (Royal Institute of Technology); Ricardo Vinuesa (KTH Royal Institute of Technology)
NeurIPS 2024	ATLAS: A spend classification benchmark for estimating scope 3 carbon emissions (Papers Track) Abstract and authors: (click to expand) Abstract: The majority (70%) of companies reporting their value chain emissions rely on financial spend ledgers and emissions factors per dollar. Accurate classification of expenditures to emissions factors is critical but complex, given the sheer number of line items and the diversity of how they are categorized and described. This is an area where Large Language Models (LLMs) can play a key role. However, there is currently no benchmark dataset to evaluate the performance of LLM-based solutions. Here, we introduce the Aggregate Transaction Ledgers for Accounting Sustainability dataset, or ATLAS, and the initial evaluation results of four models using ATLAS. ATLAS is the first spend classification benchmark and is comprised of 10,000 synthetic, labeled spend items reflecting the distribution of corporate expenditures. We evaluate four baseline models, with the best model achieving a top-1 accuracy of 57.3% and a top-3 accuracy of 72.2%. ATLAS enables systematic evaluation of LLMs for spend classification. Our results provide a starting point for advancing automated carbon accounting and sustainability reporting for spend-based emissions. Authors: Andrew Dumit (Watershed Technology, Inc.); Krishna Rao (Watershed Technology, Inc.); Travis Kwee (Watershed Technology, Inc.); Varsha Gopalakrishnan (Watershed Technology Inc.); Katherine Tsai (Watershed Technology, Inc.); Sangwon Suh (Watershed Technology, Inc.)
NeurIPS 2024	Making Climate AI Systems Past and Future Aware to Better Evaluate Climate Change Policies (Proposals Track) Abstract and authors: (click to expand) Abstract: Addressing the issues faced by climate change necessitates appropriate methodologies for evaluating climate policies, particularly when discussing long-term and real-world scenarios. While large language models (LLMs) have altered artificial intelligence, they ultimately fall short of connecting historical data with future estimates. We propose an agentic LLM system that would address this gap by considering and analyzing the probable outcomes of the user-specified climate policy inside the practical settings. Further, we propose using knowledge graphs to model the existing data about the impact of climate policies along with allowing our system to access the data about future climate predictions. Done this way, the model can peek into the past (previous policies) and the future (climate scenarios forecast), paving the way for agencies to evaluate and design strategies and plans for climate change more effectively. Authors: Riya . (IIT Roorkee); Sudhakar Singh (Nvidia)
NeurIPS 2024	How are companies reducing emissions? An LLM-based approach to creating a carbon emissions reduction levers library at scale (Proposals Track) Abstract and authors: (click to expand) Abstract: Creating a transparent, sector-specific database of actions that would result in carbon emissions reduction is essential for guiding companies toward effective, data-driven pathways to meet their net-zero commitments. Information on carbon emissions reduction levers is scattered around greenhouse gas emissions disclosures and sustainability reports in dense text forms, and no systematic, sector and region specific reduction lever libraries are available to companies. This research proposes a multi-agent system leveraging Large Language Models (LLMs) integrated with Retrieval-Augmented Generation (RAG) to systematically extract, classify, and validate carbon reduction actions from publicly available sustainability reports. By constructing a standardized database of reduction levers categorized by industry, geography, and greenhouse gas scopes, this work empowers companies to prioritize high-impact, cost-effective emissions reduction strategies. We plan to integrate environmentally-extended input-output models to ensure that these actions are closely tied to sector-specific emissive sources, increasing their relevance and scalability. This initiative is expected to support companies in mitigating greenhouse gas emissions by offering a practical resource that accelerates the transition to a low-carbon economy, and makes actionable insights readily available to corporations, industry and the research community. Authors: Varsha Gopalakrishnan (Watershed Technology Inc.); Shaena Ulissi (Watershed); Andrew Dumit (Watershed Technology, Inc.); Krishna Rao (Watershed Technology Inc.); Katherine Tsai (Watershed Technology Inc.); Sangwon Suh (Watershed Technology Inc.)
NeurIPS 2024	DeepMyco - Dataset Generation for Dye Mycoremediation (Proposals Track) Abstract and authors: (click to expand) Abstract: Textile dyes comprise 20% of global water pollution. Mycoremediation, a promising approach utilizing cheap, naturally growing fungi, has not seen scale production. While numerous studies indicate benefits, it is challenging to apply the specific learnings of each study to the combination of environmental factors present in a given physical site - a gap we believe machine learning can help fill if datasets become available. We propose an approach to drive machine learning research in mycoremediation by contributing a comprehensive dataset. We propose using advanced language models and vision transformers to extract and categorize experimental data from various research papers. This dataset will enable ML-driven innovation in matching fungi to specific dye types, optimizing remediation processes, and scaling up mycoremediation efforts effectively. Authors: Danika Gupta (The Harker Upper School)
NeurIPS 2024	Large language model co-pilot for transparent and trusted life cycle assessment comparisons (Proposals Track) Abstract and authors: (click to expand) Abstract: Intercomparing life cycle assessments (LCA), a common type of sustainability and climate model, is difficult due to basic differences in fundamental assumptions, especially in the goal and scope definition stage. This complicates decision-making and the selection of climate-smart policies, as it becomes difficult to compare optimal products and processes between different studies. To aid policymakers and LCA practitioners alike, we plan to leverage large language models (LLM) to build a database containing documented assumptions for LCAs across the agricultural sector, with a case study on livestock management. The articles for this database are identified in a systematic literature search, then processed to extract relevant assumptions about the goal and scope definition of the LCA and inserted into a vector database. We then leverage this database to develop an AI co-pilot by augmenting LLMs with retrieval augmented generation to be used by stakeholders and LCA practitioners alike. This co-pilot will accrue two major benefits: 1) enhance the decision-making process through facilitating comparisons among LCAs to enable policymakers to adopt data-driven climate policies and 2) encourage the use of common assumptions by LCA practitioners. Ultimately, we hope to create a foundational model for LCA tasks that can plug-in with existing open source LCA software and tools. Authors: Nathan Preuss (Cornell University); Fengqi You (Cornell University)
ICLR 2024	ClimateQ&A : bridging the gap between climate scientists and the general public (Papers Track) Abstract and authors: (click to expand) Abstract: This research paper investigates public views on climate change and biodiversity loss by analyzing questions asked to the ClimateQ&A platform. ClimateQ&A is a conversational agent that uses LLMs to respond to queries based on over 14,000 pages of scientific literature from the IPCC and IPBES reports. Launched online in March 2023, the tool has gathered over 30,000 questions, mainly from a French audience. Its chatbot interface allows for the free formulation of questions related to nature. While its main goal is to make nature science more accessible, it also allows for the collection and analysis of questions and their themes. Unlike traditional surveys involving closed questions, this novel method offers a fresh perspective on individual interrogations about nature. Running NLP clustering algorithms on a sample of 3,425 questions, we find that a significant 25.8% inquire about how climate change and biodiversity loss will affect them personally (e.g., where they live or vacation, their consumption habits) and the specific impacts of their actions on nature (e.g., transportation or food choices). This suggests that traditional methods of surveying may not identify all existing knowledge gaps, and that relying solely on IPCC and IPBES reports may not address all individual inquiries about climate and biodiversity, potentially affecting public understanding and action on these issues. Note: we use “nature” as an umbrella term for “climate change” and “biodiversity loss”. Authors: Natalia de la Calzada (Ekimetrics); Theo Alves Da Costa (Ekimetrics); Annabelle Blangero (Ekimetrics); Nicolas Chesneau (Ekimetrics)
ICLR 2024	Empowering Sustainable Finance: Leveraging Large Language Models for Climate-Aware Investments (Papers Track) Abstract and authors: (click to expand) Abstract: With the escalating urgency of climate change, it is becoming more imperative for businesses and organizations to align their objectives with sustainability goals. Financial institutions also face a critical mandate to fulfill the Sustainable Development Goals (SDGs), particularly goal 13, which targets the fight against climate change and its consequences. Mitigating the impacts of climate change requires a focus on reducing supply chain emissions, which constitute over 90% of total emission inventories. In the financial industry, supply chain emissions linked to lending and investments emerge as the primary source of emissions, posing challenges in tracking financed emissions due to the intricate process of collecting data from numerous suppliers across the supply chain. To address these challenges, we propose an emission estimation framework utilizing a Large Language Model (LLM) to drastically accelerate the assessment of the emissions associated with lending and investment activities. This framework utilizes financial activities as a proxy for measuring financed emissions. Utilizing the LLM, we classify financial activities into seven asset classes following the Partnership for Carbon Accounting Financials (PCAF) standard. Additionally, we map investments to industry categories and employ spend-based emission factors (kg-CO2/$-spend) to calculate emissions associated with financial investments. In our study, we compare the performance of our proposed method with state-of-the-art text classification models like TF-IDF, word2Vec, and Zero-shot learning. The results demonstrate that the LLM-based approach not only surpasses traditional text mining techniques and performs on par with a subject matter expert (SME) but most importantly accelerates the assessment process. Authors: Ayush Jain (IBM Research); Manikandan Padmanaban (IBM Research India); Jagabondhu Hazra (IBM Research India); Shantanu Godbole (IBM India); Hendrik Hamann (IBM Research)
ICLR 2024	EU Climate Change News Index: Forecasting EU ETS prices with online news (Papers Track) Abstract and authors: (click to expand) Abstract: Carbon emission allowance prices have been rapidly increasing in the EU since 2018 and accurate forecasting of EU Emissions Trading System (ETS) prices has become essential. This paper proposes a novel method to generate alternative predictors for daily ETS price returns using relevant online news information. We devise the EU Climate Change News Index by calculating the term frequency–inverse document frequency (TF–IDF) feature for climate change-related keywords. The index is capable of tracking the ongoing debate about climate change in the EU. Finally, we show that incorporating the index in a simple predictive model significantly improves forecasts of ETS price returns. Authors: Aron Pap (BGSE); Aron D Hartvig (Corvinus University of Budapest, Cambridge Econometrics); Péter Pálos (Budapest University of Technology and Economics)
ICLR 2024	Literature Mining with Large Language Models to Assist the Development of Sustainable Building Materials (Papers Track) Abstract and authors: (click to expand) Abstract: The concrete industry, as one of the significant sources of carbon emissions, urgently needs decarbonization, which requires a shift to alternative materials. However, the absence of a systematic knowledge summary remains a challenge for further development of sustainable building materials. This work offers a cost-efficient strategy for information extraction tasks in complex terminology settings using small (2.8B) large language models (LLMs) with well-designed instruction-completion schemes and fine-tuning strategies, introducing a dataset cataloging civil engineering applications of alternative materials. The Multiple Choice instruction scheme significantly improves model accuracies in entity inference from non-Noun-Phrase sources, with supervised fine-tuning benefiting from straightforward tokenized representations of choices. We also demonstrate the utility of the dataset by extracting valuable insights into promising applications of alternative materials from knowledge graph representations. Authors: Yifei Duan (Massachusetts Institute of Technology); Yixi Tian (Massachusetts Institute of Technology); Soumya Ghosh (IBM Research); Richard Goodwin (IBM T.J. Watson Research Center); Vineeth Venugopal (Massachusetts Institute of Technology); Jeremy Gregory (Massachusetts Institute of Technology); Jie Chen (IBM Research); Elsa Olivetti (Massachusetts Institute of Technology)
ICLR 2024	CausalPrompt: Enhancing LLMs with Weakly Supervised Causal Reasoning for Robust Performance in Non-Language Tasks (Papers Track) Abstract and authors: (click to expand) Abstract: In confronting the pressing issue of climate change, we introduce "CausalPrompt", an innovative prompting strategy that adapts large language models (LLMs) for classification and regression tasks through the application of weakly supervised causal reasoning. We delve into the complexities of data shifts within energy systems, often resulting from the dynamic evolution of sensor networks, leading to discrepancies between training and test data distributions or feature inconsistencies. By embedding domain-specific reasoning in the finetuning process, CausalPrompt significantly bolsters the adaptability and resilience of energy systems to these shifts. We show that CausalPrompt significantly enhances predictions in scenarios characterized by feature shifts, including electricity demand, solar power generation, and cybersecurity within energy infrastructures. This approach underlines the crucial role of CausalPrompt in enhancing the reliability and precision of predictions in energy systems amid feature shifts, highlighting its significance and potential for real-world applications in energy management and cybersecurity, contributing effectively to climate change mitigation efforts. Authors: Tung-Wei Lin (University of California, Berkeley); Vanshaj Khattar (Virginia Tech); Yuxuan Huang (University College London); Junho Hong (University of Michigan); Ruoxi Jia (Virginia Tech); Chen-Ching Liu (Virginia Tech); Alberto L Sangiovanni-Vincentelli (University of California, Berkeley); Ming Jin (Virginia Tech)
NeurIPS 2023	Flamingo: Environmental Impact Factor Matching for Life Cycle Assessment with Zero-Shot ML (Papers Track) Abstract and authors: (click to expand) Abstract: Consumer products contribute to >75% of global greenhouse gas (GHG) emissions, primarily through indirect contributions from the supply chain. Measurement of GHG emissions associated with products is crucial to quantify the impact of GHG emission abatement actions. Life cycle assessment (LCA), the scientific discipline for measuring GHG emissions, estimates the environmental impact of a product. Scaling LCA to millions of products is challenging as it requires extensive manual analysis by domain experts. To avoid repetitive analysis, environmental impact factors (EIF) of common materials and products are published for use by experts. However, finding appropriate EIFs for even a single product can require hundreds of hours of manual work, especially for complex products. We present Flamingo, an algorithm that leverages neural language models to automatically identify an appropriate EIF given a text description. A key challenge in automation is that EIF databases are incomplete. Flamingo uses industry sector classification as an intermediate layer to identify when there are no good matches in the database. On a dataset of 664 products, Flamingo achieves an EIF matching precision of 75%. Authors: Bharathan Balaji (Amazon); Venkata Sai Gargeya Vunnava (Amazon); Nina Domingo (Amazon); Shikhar Gupta (Amazon); Harsh Gupta (Amazon); Geoffrey Guest (Amazon); Aravind Srinivasan (Amazon); Kellen Axten (Amazon); Jared Kramer (Amazon)
NeurIPS 2023	How to Recycle: General Vision-Language Model without Task Tuning for Predicting Object Recyclability (Papers Track) Abstract and authors: (click to expand) Abstract: Waste segregation and recycling play a crucial role in fostering environmental sustainability. However, discerning whether a material is recyclable or not poses a formidable challenge, primarily because of inadequate recycling guidelines to accommodate a diverse spectrum of objects and their varying conditions. We investigated the role of vision-language models in addressing this challenge. We curated a dataset consisting of >1000 images across 11 disposal categories for optimal discarding and assessed the applicability of general vision-language models for recyclability classification. Our results show that the Contrastive Language-Image Pre-training (CLIP) model, which is pretrained to understand the relationship between images and text, demonstrated remarkable performance in the zero-shot recyclability classification task, with an accuracy of 89%. Our results underscore the potential of general vision-language models in addressing real-world challenges, such as automated waste sorting, by harnessing the inherent associations between visual and textual information. Authors: Eliot Park (Harvard Medical School); Eddy Pan (Harvard Medical School); Shreya Johri (Harvard Medical School); Pranav Rajpurkar (Harvard Medical School)
NeurIPS 2023	ClimateX: Do LLMs Accurately Assess Human Expert Confidence in Climate Statements? (Papers Track) Abstract and authors: (click to expand) Abstract: Evaluating the accuracy of outputs generated by Large Language Models (LLMs) is especially important in the climate science and policy domain. We introduce the Expert Confidence in Climate Statements (ClimateX) dataset, a novel, curated, expert-labeled dataset consisting of 8094 climate statements collected from the latest Intergovernmental Panel on Climate Change (IPCC) reports, labeled with their associated confidence levels. Using this dataset, we show that recent LLMs can classify human expert confidence in climate-related statements, especially in a few-shot learning setting, but with limited (up to 47%) accuracy. Overall, models exhibit consistent and significant over-confidence on low and medium confidence statements. We highlight implications of our results for climate communication, LLMs evaluation strategies, and the use of LLMs in information retrieval systems. Authors: Romain Lacombe (Stanford University); Kerrie Wu (Stanford University); Eddie Dilworth (Stanford University)
NeurIPS 2023	Proof-of-concept: Using ChatGPT to Translate and Modernize an Earth System Model from Fortran to Python/JAX (Papers Track) Abstract and authors: (click to expand) Abstract: Earth system models (ESMs) are vital for understanding past, present, and future climate, but they suffer from legacy technical infrastructure. ESMs are primarily implemented in Fortran, a language that poses a high barrier of entry for early career scientists and lacks a GPU runtime, which has become essential for continued advancement as GPU power increases and CPU scaling slows. Fortran also lacks differentiability (the capacity to differentiate through numerical code), which enables hybrid models that integrate machine learning methods. Converting an ESM from Fortran to Python/JAX could resolve these issues. This work presents a semi-automated method for translating individual model components from Fortran to Python/JAX using a large language model (GPT-4). By translating the photosynthesis model from the Community Earth System Model (CESM), we demonstrate that the Python/JAX version results in up to 100x faster runtimes using GPU parallelization, and enables parameter estimation via automatic differentiation. The Python code is also easy to read and run and could be used by instructors in the classroom. This work illustrates a path towards the ultimate goal of making climate models fast, inclusive, and differentiable. Authors: Anthony Zhou (Columbia University); Linnia Hawkins (Columbia University); Pierre Gentine (Columbia University)
NeurIPS 2023	Understanding Climate Legislation Decisions with Machine Learning (Proposals Track) Abstract and authors: (click to expand) Abstract: Effective action is crucial in order to avert climate disaster. Key in enacting change is the swift adoption of climate positive legislation which advocates for climate change mitigation and adaptation. This is because government legislation can result in far-reaching impact, due to the relationships between climate policy, technology, and market forces. To advocate for legislation, current strategies aim to identify potential levers and obstacles, presenting an opportunity for the application of recent advances in machine learning language models. Here we propose a machine learning pipeline to analyse climate legislation, aiming to investigate the feasibility of natural language processing for the classification of climate legislation texts, to predict policy voting outcomes. By providing a model of the decision making process, the proposed pipeline can enhance transparency and aid policy advocates and decision makers in understanding legislative decisions, thereby providing a tool to monitor and understand legislative decisions towards climate positive impact. Authors: Jeff Clark (University of Bristol); Michelle Wan (University of Cambridge); Raul Santos Rodriguez (University of Bristol)
NeurIPS 2023	Mapping the Landscape of Artificial Intelligence in Climate Change Research: A Meta-Analysis on Impact and Applications (Proposals Track) Abstract and authors: (click to expand) Abstract: This proposal advocates a comprehensive and systematic analysis aimed at mapping and characterizing the intricate landscape of Artificial Intelligence and Machine Learning applications and their impacts within the domain of climate change research, both in adaptation and mitigation efforts. Notably, a significant upswing in this interdisciplinary intersection has been observed since 2020. Utilizing advanced topic clustering techniques and qualitative analysis, we have discerned 12 distinct macro areas that supplement, enrich, and expand upon those identified in prior research. The primary objective of this undertaking is to furnish a data-rich panoramic view and informative insights regarding the functions and tools of the mentioned disciplines. Our intention is to offer valuable guidance to the scholarly community and propel further research endeavors, encouraging meticulous examinations of research trends and gaps in addressing the formidable challenges posed by climate change and the climate crisis. Authors: Christian Burmester (Osnabrück University); Teresa Scantamburlo (University of Venice)
ICLR 2023	CaML: Carbon Footprinting of Products with Zero-Shot Semantic Text Similarity (Papers Track) Abstract and authors: (click to expand) Abstract: Estimating the embodied carbon in products is a key step towards understanding their impact, and undertaking mitigation actions. Precise carbon attribution is challenging at scale, requiring both domain expertise and granular supply chain data. As a first-order approximation, standard reports use Economic Input-Output based Life Cycle Assessment (EIO-LCA) which estimates carbon emissions per dollar at an industry sector level using transactions between different parts of the economy. For EIO-LCA, an expert needs to map each product to one of upwards of 1000 potential industry sectors. We present CaML, an algorithm to automate EIO-LCA using semantic text similarity matching by leveraging the text descriptions of the product and the industry sector. CaML outperforms the previous manually intensive method, yielding a MAPE of 22% with no domain labels. Authors: Bharathan Balaji (Amazon); Venkata Sai Gargeya Vunnava (Amazon); Geoffrey Guest (Amazon); Jared Kramer (Amazon)
ICLR 2023	Mapping global innovation networks around clean energy technologies (Proposals Track) Abstract and authors: (click to expand) Abstract: Reaching net zero emissions requires rapid innovation and scale-up of clean tech. In this context, clean tech innovation networks (CTINs) can play a crucial role by pooling necessary resources and competences and enabling knowledge transfers between different actors. However, existing evidence on CTINs is limited due to a lack of comprehensive data. Here, we develop a machine learning framework to identify CTINs from announcements on social media to map the global CTIN landscape. Specifically, we classify the social media announcements regarding the type of technology (e.g., hydrogen, solar), interaction type (e.g., equity investment, R&D collaboration), and status (e.g., commencement, update). We then extract referenced organizations via entity recognition. Thereby, we generate a large-scale dataset of CTINs across different technologies, countries, and over time. This allows us to compare characteristics of CTINs, such as the geographic proximity of actors, and to investigate the association between network evolution and technology innovation and diffusion. As a direct implication, our work helps policy makers to promote CTINs by identifying current barriers and needs. Authors: Malte Toetzke (ETH Zurich); Francesco Re (ETH Zurich); Benedict Probst (ETH Zurich); Stefan Feuerriegel (LMU Munich); Laura Diaz Anadon (University of Cambridge); Volker Hoffmann (ETH Zurich)
ICLR 2023	Mining Effective Strategies for Climate Change Communication (Papers Track) Abstract and authors: (click to expand) Abstract: With the goal of understanding effective strategies to communicate about climate change, we build interpretable models to rank tweets related to climate change with respect to the engagement they generate. Our models are based on the Bradley-Terry model of pairwise comparison outcomes and use a combination of the tweets’ topic and metadata features to do the ranking. To remove confounding factors related to author popularity and minimise noise, they are trained on pairs of tweets that are from the same author and around the same time period and have a sufficiently large difference in engagement. The models achieve good accuracy on a held-out set of pairs. We show that we can interpret the parameters of the trained model to identify the topic and metadata features that contribute to high engagement. Among other observations, we see that topics related to climate projections, human cost and deaths tend to have low engagement while those related to mitigation and adaptation strategies have high engagement. We hope the insights gained from this study will help craft effective climate communication to promote engagement, thereby lending strength to efforts to tackle climate change. Authors: Aswin Suresh (EPFL); Lazar Milikic (EPFL); Francis Murray (EPFL); Yurui Zhu (EPFL); Matthias Grossglauser (École Polytechnique Fédérale de Lausanne (EPFL))
NeurIPS 2022	Deep Climate Change: A Dataset and Adaptive domain pre-trained Language Models for Climate Change Related Tasks (Papers Track) Abstract and authors: (click to expand) Abstract: The quantity and quality of literature around climate change (CC) and its impacts are increasing yearly. Yet, this field has received limited attention in the Natural Language Processing (NLP) community. With the help of large Language Models (LMs) and transfer learning, NLP can support policymakers, researchers, and climate activists in making sense of large-scale and complex CC-related texts. CC-related texts include specific language that general language models cannot represent accurately. Therefore we collected a climate change corpus consisting of over 360 thousand abstracts of top climate scientists' articles from trustable sources covering large temporal and spatial scales. Comparison of the performance of GPT2 LM and our 'climateGPT2 models', fine-tuned on the CC-related corpus, on claim generation (text generation) and fact-checking, downstream tasks show the better performance of the climateGPT2 models compared to the GPT2. The climateGPT2 models decrease the validation loss to 1.08 for claim generation from 43.4 obtained by GPT2. We found that climateGPT2 models improved the masked language model objective for the fact-checking task by increasing the F1 score from 0.67 to 0.72. Authors: Saeid Vaghefi (University of Zürich); Veruska Muccione (University of Zürich); Christian Huggel (University of Zürich); Hamed Khashehchi (2w2e GmbH); Markus Leippold (University of Zurich)
NeurIPS 2022	Temperature impacts on hate speech online: evidence from four billion tweets (Papers Track) Abstract and authors: (click to expand) Abstract: Human aggression is no longer limited to the physical space but exists in the form of hate speech on social media. Here, we examine the effect of temperature on the occurrence of hate speech on Twitter and interpret the results in the context of climate change, human behavior and mental health. Employing supervised machine learning models, we identify hate speech in a data set of four billion geolocated tweets from over 750 US cities (2014 – 2020). We statistically evaluate the changes in daily hate tweets against changes in local temperature, isolating the temperature influence from confounding factors using binned panel-regression models. We find a low prevalence of hate tweets in moderate temperatures and observe sharp increases of up to 12% for colder and up to 22% for hotter temperatures, indicating that not only hot but also cold temperatures increase aggressive tendencies. Further, we observe that for extreme temperatures hate speech also increases as a percentage of total tweeting activity, crowding out non-hate speech. The quasi-quadratic shape of the temperature-hate tweet curve is robust across varying climate zones, income groups, religious and political beliefs. The prevalence of the results across climatic and socioeconomic splits points to limits in adaptation. Our results illuminate hate speech online as an impact channel through which temperature alters societal aggression. Authors: Annika Stechemesser (Potsdam Institute for Climate Impact Research); Anders Levermann (Potsdam Institute for Climate Impact Research); Leonie Wenz (Potsdam Institute for Climate Impact Research)
NeurIPS 2022	TCFD-NLP: Assessing alignment of climate disclosures using NLP for the financial markets (Papers Track) Abstract and authors: (click to expand) Abstract: Climate-related disclosure is increasing in importance as companies and stakeholders alike aim to reduce their environmental impact and exposure to climate-induced risk. Companies primarily disclose this information in annual or other lengthy documents where climate information is not the sole focus. To assess the quality of a company's climate-related disclosure, these documents, often hundreds of pages long, must be reviewed manually by climate experts. We propose a more efficient approach to assessing climate-related financial information. We construct a model leveraging TF-IDF, sentence transformers and multi-label k nearest neighbors (kNN). The developed model is capable of assessing alignment of climate disclosures at scale, with a level of granularity and transparency that will support decision-making in the financial markets with relevant climate information. In this paper, we discuss the data that enabled this project, the methodology, and how the resulting model can drive climate impact. Authors: Rylen Sampson (Manifest Climate); Aysha Cotterill (Manifest Climate); Quoc Tien Au (Manifest Climate)
NeurIPS 2022	Climate Policy Tracker: Pipeline for automated analysis of public climate policies (Papers Track) Abstract and authors: (click to expand) Abstract: The number of standardized policy documents regarding climate policy and their publication frequency is significantly increasing. The documents are long and tedious for manual analysis, especially for policy experts, lawmakers, and citizens who lack access or domain expertise to utilize data analytics tools. Potential consequences of such a situation include reduced citizen governance and involvement in climate policies and an overall surge in analytics costs, rendering less accessibility for the public. In this work, we use a Latent Dirichlet Allocation-based pipeline for the automatic summarization and analysis of 10-years of national energy and climate plans (NECPs) for the period from 2021 to 2030, established by 27 Member States of the European Union. We focus on analyzing policy framing, the language used to describe specific issues, to detect essential nuances in the way governments frame their climate policies and achieve climate goals. The methods leverage topic modeling and clustering for the comparative analysis of policy documents across different countries. It allows for easier integration in potential user-friendly applications for the development of theories and processes of climate policy. This would further lead to better citizen governance and engagement over climate policies and public policy research. Authors: Artur Żółkowski (Warsaw University of Technology); Mateusz Krzyziński (Warsaw University of Technology); Piotr Wilczyński (Warsaw University of Technology); Stanisław Giziński (University of Warsaw); Emilia Wiśnios (University of Warsaw); Bartosz Pieliński (University of Warsaw); Julian Sienkiewicz (Warsaw University of Technology); Przemysław Biecek (Warsaw University of Technology)
NeurIPS 2022	Topic correlation networks inferred from open-ended survey responses reveal signatures of ideology behind carbon tax opinion (Papers Track) Abstract and authors: (click to expand) Abstract: Ideology can often render policy design ineffective by overriding what, at face value, are rational incentives. A timely example is carbon pricing, whose public support is strongly influenced by ideology. As a system of ideas, ideology expresses itself in the way people explain themselves and the world. As an object of study, ideology is then amenable to a generative modelling approach within the text-as-data paradigm. Here, we analyze the structure of ideology underlying carbon tax opinion using topic models. An idea, termed a topic, is operationalized as the fixed set of proportions with which words are used when talking about it. We characterize ideology through the relational structure between topics. To access this latent structure, we use the highly expressive Structural Topic Model to infer topics and the weights with which individual opinions mix topics. We fit the model to a large dataset of open-ended survey responses of Canadians elaborating on their support of or opposition to the tax. We propose and evaluate statistical measures of ideology in our data, such as dimensionality and heterogeneity. Finally, we discuss the implications of the results for transition policy in particular, and of our approach to analyzing ideology for computational social science in general. Authors: Maximilian Puelma Touzel (Mila)
NeurIPS 2022	Analyzing the global energy discourse with machine learning (Proposals Track) Abstract and authors: (click to expand) Abstract: To transform our economy towards net-zero emissions, industrial development of clean energy technologies (CETs) to replace fossil energy technologies (FETs) is crucial. Although the media has great power in influencing consumer behavior and decision making in business and politics, its role in the energy transformation is still underexplored. In this paper, we analyze the global energy discourse via machine learning. For this, we collect a large-scale dataset with ~5 million news articles from seven of the world’s major CO2 emitting countries, covering eight CETs and four FETs. Using machine learning, we then analyze the content of news articles on a highly granular level and along several dimensions, namely relevance (for the energy discourse), context (e.g., costs, regulation, investment), and connotations (e.g., high/increasing vs. low/decreasing costs). By linking empirical discourse patterns to investment and deployment data of CETs and FETs, this study advances the current understanding about the role of the media in the energy transformation. Thereby, it enables businesses, investors, and policy makers to respond more effectively to sensitive topics in the media discourse and leverage windows of opportunity for scaling CETs. Authors: Malte Toetzke (ETH Zurich); Benedict Probst (ETH Zurich); Yasin Tatar (ETH Zurich); Stefan Feuerriegel (LMU Munich); Volker Hoffmann (ETH Zurich)
NeurIPS 2022	CliMedBERT: A Pre-trained Language Model for Climate and Health-related Text (Proposals Track) Abstract and authors: (click to expand) Abstract: Climate change is threatening human health in unprecedented orders and many ways. These threats are expected to grow unless effective and evidence-based policies are developed and acted upon to minimize or eliminate them. Attaining such a task requires the highest degree of the flow of knowledge from science into policy. The multidisciplinary, location-specific, and vastness of published science makes it challenging to keep track of novel work in this area, as well as making the traditional knowledge synthesis methods inefficient in infusing science into policy. To this end, we consider developing multiple domain-specific language models (LMs) with different variations from Climate- and Health-related information, which can serve as a foundational step toward capturing available knowledge to enable solving different tasks, such as detecting similarities between climate- and health-related concepts, fact-checking, relation extraction, evidence of health effects to policy text generation, and more. To our knowledge, this is the first work that proposes developing multiple domain-specific language models for the considered domains. We will make the developed models, resources, and codebase available for the researchers. Authors: Babak Jalalzadeh Fard (University of Nebraska Medical Center); Sadid A. Hasan (Microsoft); Jesse E. Bell (University of Nebraska Medical Center)
AAAI FSS 2022	AI-Based Text Analysis for Evaluating Food Waste Policies Abstract and authors: (click to expand) Abstract: Food waste is a major contributor to climate change, making the reduction of food waste one of the most important strategies to preserve threatened ecosystems and increase economic benefits. To evaluate the impact of food waste policies in this arena and provide actionable guidance to policymakers, we conducted an AI-based text analysis of food waste policy provisions. Specifically, we a) identified commonalities across state policy texts, b) clustered states by shared policy text, and c) examined relationships between state cluster memberships and food waste. This approach generated state clusters but demonstrated very limited convergent validity with policy ratings provided by subject matter experts and no predictive validity with food waste. We discuss the potential of using supervised machine learning to analyze food waste policy text as a next step. Authors: John Aitken (The MITRE Corporation), Denali Rao (The MITRE Corporation), Balca Alaybek (The MITRE Corporation), Amber Sprenger (The MITRE Corporation), Grace Mika (The MITRE Corporation), Rob Hartman (The MITRE Corporation) and Laura Leets (The MITRE Corporation)
AAAI FSS 2022	KnowUREnvironment: An Automated Knowledge Graph for Climate Change and Environmental Issues Abstract and authors: (click to expand) Abstract: Despite climate change being one of the greatest threats to humanity, many people are still in denial or lack motivation for appropriate action. A structured source of knowledge can help increase public awareness while also helping crucial natural language understanding tasks such as information retrieval, question answering, and recommendation systems. We introduce KnowUREnvironment – a knowledge graph for climate change and related environmental issues, extracted from the scientific literature. We automatically identify 210,230 domain-specific entities/concepts and encode how these concepts are interrelated with 411,860 RDF triples backed up with evidence from the literature, without using any supervision or human intervention. Human evaluation shows our extracted triples are syntactically and factually correct (81.69% syntactic correctness and 75.85% precision). The proposed framework can be easily extended to any domain that can benefit from such a knowledge graph. Authors: Md Saiful Islam (University of Rochester), Adiba Proma (University of Rochester), Yilin Zhou (University of Rochester), Syeda Nahida Akter (Carnegie Mellon University), Caleb Wohn (University of Rochester) and Ehsan Hoque (University of Rochester)
AAAI FSS 2022	ClimateBert: A Pretrained Language Model for Climate-Related Text Abstract and authors: (click to expand) Abstract: Over the recent years, large pretrained language models (LM) have revolutionized the field of natural language processing (NLP). However, while pretraining on general language has been shown to work very well for common language, it has been observed that niche language poses problems. In particular, climate-related texts include specific language that common LMs cannot represent accurately. We argue that this shortcoming of today's LMs limits the applicability of modern NLP to the broad field of text processing of climate-related texts. As a remedy, we propose ClimateBert, a transformer-based language model that is further pretrained on over 2 million paragraphs of climate-related texts, crawled from various sources such as common news, research articles, and climate reporting of companies. We find that ClimateBert leads to a 48% improvement on a masked language model objective which, in turn, leads to lowering error rates by 3.57% to 35.71% for various climate-related downstream tasks like text classification, sentiment analysis, and fact-checking. Authors: Nicolas Webersinke (FAU Erlangen-Nürnberg), Mathias Kraus (FAU Erlangen-Nürnberg), Julia Anna Bingler (ETH Zurich) and Markus Leippold (UZH Zurich)
AAAI FSS 2022	The Impact of TCFD Reporting - A New Application of Zero-Shot Analysis to Climate-Related Financial Disclosures Abstract and authors: (click to expand) Abstract: We examine climate-related disclosures in 3,335 reports based on a sample of 188 banks that officially endorsed the recommendations of the Task Force for Climate-related Financial Disclosures (TCFD). In doing so, we introduce a new application of zero-shot text classification based on the BART model and a MNLI task. By developing a set of robust and fine-grained labels, we show that zero-shot analysis provides high accuracy in analyzing companies’ climate-related reporting without further model training. We are able to demonstrate that banks that support the TCFD increase their level of disclosure after officially declaring their support for the guidelines, although we also find significant differences depending on the topic of disclosure. Our findings yield important conclusions for the design of climate-related disclosures. Authors: Alix Auzepy (Justus-Liebig-Universität Gießen), Elena Tönjes (Justus-Liebig-Universität Gießen) and Christoph Funk (Justus-Liebig-Universität Gießen)
AAAI FSS 2022	Using Natural Language Processing for Automating the Identification of Climate Action Interlinkages within the Sustainable Development Goals Abstract and authors: (click to expand) Abstract: Climate action, Goal 13 of the UN Sustainable Development Goals (SDG), cuts across almost all SDGs. Achieving climate goals can reinforce the achievements in many other goals, but at the same time climate mitigation and adaptation measures may generate trade-offs, such as levelling the cost of energy and transitioning away from fossil fuels. Leveraging the synergies and minimizing the trade-offs among the climate goals and other SDGs is an imperative task for ensuring policy coherence. Understanding the interlinkages between climate action and other SDGs can help inform about the synergies and trade-offs. This paper presents a novel methodology by using natural language processing (NLP) to automate the process of systematically identifying the key interlinkages between climate action and SDGs from a large amount of climate literature. A qualitative SDG interlinkages model for climate action was automatically generated and visualized in a network graph. This work contributes to the conference thematic topic on using AI for policy alignment for climate change goals, SDGs and associated environmental, social and governance (ESG) frameworks. Authors: Xin Zhou (Institute for Global Environmental Strategies (IGES)), Kshitij Jain (Google Inc.), Mustafa Moinuddin (Institute for Global Environmental Strategies (IGES)) and Patrick McSharry (Carnegie Mellon University Africa; Oxford Man Institute of Quantitative Finance, Oxford University)
NeurIPS 2021	A Deep Learning application towards transparent communication for Payment for Forest Environmental Services (PES) (Proposals Track) Abstract and authors: (click to expand) Abstract: Deforestation accounts for more than 20% of global emissions. Payments for Environmental Services (PES) is seen by both policy makers and practitioners as an effective market-based instrument to provide financial incentives for forest owners, particularly poor and indigenous households in developing countries. It is a critical instrument to protect forests, and ultimately to mitigate climate change and reduce emissions from deforestation. However, previous studies have pointed out a key challenge for PES is to ensure transparent payment to local people, due to i) weak monitoring and evaluation and ii) indigenous inaccessibility to e-banking and complying with procedural and administrative paperwork to receive payments. Specifically, the amount and the complexity of forms along with the language barriers is a key issue; and most transactions need several intermediaries and transaction costs which reduce the payments reaching landowners. To address these issues, we propose a communication platform that links across the stakeholders and processes. Our proposal will utilize Machine Learning techniques to lower the language barrier and provide technology solutions to help indigenous people to access payments. This would also help improve the effectiveness and transparency of PES schemes. Specifically, we propose the use of Natural Language Processing techniques in providing a speech-to-text and auto translation capability, and the use of Graph Neural Network to provide link predictions of transaction types, volumes and values. The pathway to impact will be forest protection and local livelihood through providing financial incentives, and subsequently contribution to more carbon sequestration and storage – a key issue in climate change mitigation. Authors: Lan HOANG (IBM Research); Thuy Thu Phan (Center for International Forestry Research (CIFOR))
NeurIPS 2021	A NLP-based Analysis of Alignment of Organizations' Climate-Related Risk Disclosures with Material Risks and Metrics (Proposals Track) Abstract and authors: (click to expand) Abstract: The Sustainability Accounting Standards Board (SASB) establishes standards to guide the disclosures of material sustainability and ESG (Environment, Social, Governance)-related information across industries. The availability of quality, comparable and decision-useful information is required to assess risks and opportunities later integrated into financial decision-making. Particularly, standardized, industry-specific climate risk metrics and topics can support these efforts. SASB’s latest climate risk technical bulletin introduces three climate-related risks that are financially material - physical, transition and regulatory risks - and maps these across industries. The main objective of this work is to create a framework that can analyze climate related risk disclosures using an AI-based tool that automatically extracts and categorizes climate-related risks and related metrics from company disclosures based on SASB’s latest climate risk guidance. This process will help with automating large-scale analysis and add much-needed transparency vis-a-vis the current state of climate-related disclosures, while also assessing how far along companies are currently disclosing information on climate risks relevant to their industry. As it stands, this much needed type of analysis is made mostly manually or using third-party metrics, often opaque and biased, as proxies. In this work, we will first create a climate risk glossary that will be trained on a large amount of climate risk text. By combining climate risk keywords in this glossary with recent advances in natural language processing (NLP), we will then be able to quantitatively and qualitatively compare climate risk information in different sectors and industries using a novel climate risk score that will be based on SASB standards. Authors: Elham Kheradmand (University of Montreal); Didier Serre (Clearsum); Manuel Morales (University of Montreal); Cedric B Robert (Clearsum)
ICML 2021	Challenges in Applying Audio Classification Models to Datasets Containing Crucial Biodiversity Information (Papers Track) Abstract and authors: (click to expand) Abstract: The acoustic signature of a natural soundscape can reveal consequences of climate change on biodiversity. Hardware costs, human labor time, and expertise dedicated to labeling audio are impediments to conducting acoustic surveys across a representative portion of an ecosystem. These barriers are quickly eroding away with the advent of low-cost, easy to use, open source hardware and the expansion of the machine learning field providing pre-trained neural networks to test on retrieved acoustic data. One consistent challenge in passive acoustic monitoring (PAM) is a lack of reliability from neural networks on audio recordings collected in the field that contain crucial biodiversity information that otherwise show promising results from publicly available training and test sets. To demonstrate this challenge, we tested a hybrid recurrent neural network (RNN) and convolutional neural network (CNN) binary classifier trained for bird presence/absence on two Peruvian bird audiosets. The RNN achieved an area under the receiver operating characteristics (AUROC) of 95% on a dataset collected from Xeno-canto and Google’s AudioSet ontology in contrast to 65% across a stratified random sample of field recordings collected from the Madre de Dios region of the Peruvian Amazon. In an attempt to alleviate this discrepancy, we applied various audio data augmentation techniques in the network’s training process which led to an AUROC of 77% across the field recordings. Authors: Jacob G Ayers (UC San Diego); Yaman Jandali (University of California, San Diego); Yoo-Jin Hwang (Harvey Mudd College); Erika Joun (University of California, San Diego); Gabriel Steinberg (Binghamton University); Mathias Tobler (San Diego Zoo Wildlife Alliance); Ian Ingram (San Diego Zoo Wildlife Alliance); Ryan Kastner (University of California San Diego); Curt Schurgers (University of California San Diego)
ICML 2021	Automated Identification of Climate Risk Disclosures in Annual Corporate Reports (Papers Track) Abstract and authors: (click to expand) Abstract: It is important for policymakers to understand which financial policies are effective in increasing climate risk disclosure in corporate reporting. We use machine learning to automatically identify disclosures of five different types of climate-related risks. For this purpose, we have created a dataset of over 120 manually-annotated annual reports by European firms. Applying our approach to reporting of 337 firms over the last 20 years, we find that risk disclosure is increasing. Disclosure of transition risks grows more dynamically than physical risks, and there are marked differences across industries. Country-specific dynamics indicate that regulatory environments potentially have an important role to play for increasing disclosure. Authors: David Friederich (University of Bern); Lynn Kaack (ETH Zurich); Sasha Luccioni (Mila); Bjarne Steffen (ETH Zurich)
ICML 2021	TweetDrought: A Deep-Learning Drought Impacts Recognizer based on Twitter Data (Papers Track) Abstract and authors: (click to expand) Abstract: Acquiring a better understanding of drought impacts becomes increasingly vital under a warming climate. Traditional drought indices describe mainly biophysical variables and not impacts on social, economic, and environmental systems. We utilized natural language processing and bidirectional encoder representation from Transformers (BERT) based transfer learning to fine-tune the model on the data from the news-based Drought Impact Report (DIR) and then apply it to recognize seven types of drought impacts based on the filtered Twitter data from the United States. Our model achieved a satisfying macro-F1 score of 0.89 on the DIR test set. The model was then applied to California tweets and validated with keyword-based labels. The macro-F1 score was 0.58. However, due to the limitation of keywords, we also spot-checked tweets with controversial labels. 83.5% of BERT labels were correct compared to the keyword labels. Overall, the fine-tuned BERT-based recognizer provided proper predictions and valuable information on drought impacts. The interpretation and analysis of the model were consistent with experiential domain expertise. Authors: Beichen Zhang (University of Nebraska-Lincoln); Frank Schilder (Thomson Reuters); Kelly Smith (National Drought Mitigation Center); Michael Hayes (University of Nebraska-Lincoln); Sherri Harms (University of Nebraska-Kearney); Tsegaye Tadesse (University of Nebraska-Lincoln)
ICML 2021	DeepPolicyTracker: Tracking Changes In Environmental Policy In The Brazilian Federal Official Gazette With Deep Learning (Papers Track) Abstract and authors: (click to expand) Abstract: Even though most of its energy generation comes from renewable sources, Brazil is one of the largest emitters of greenhouse gases in the world, due to intense farming and deforestation of biomes, such as the Amazon Rainforest, whose preservation is essential for compliance with the Paris Agreement. Still, regardless of lobbies or prevailing political orientation, all government legal actions are published daily in the Federal Official Gazette. However, with hundreds of decrees issued every day by the authorities, it is absolutely burdensome to manually analyze all these processes and find out which ones can pose serious environmental hazards. In this paper, we propose the DeepPolicyTracker, a promising deep learning model that uses a state-of-the-art pre-trained natural language model to classify government acts and track harmful changes in the environmental policies. We also provide the used dataset annotated by domain experts and show some results already obtained. In the future, this system should serve to scale up the high-quality tracking of all official documents with a minimum of human supervision and contribute to increasing society's awareness of every government action. Authors: Flávio N Cação (University of Sao Paulo); Anna Helena Reali Costa (Universidade de São Paulo); Natalie Unterstell (Política por Inteiro); Liuca Yonaha (Política por Inteiro); Taciana Stec (Política por Inteiro); Fábio Ishisaki (Política por Inteiro)
ICML 2021	BERT Classification of Paris Agreement Climate Action Plans (Papers Track) Abstract and authors: (click to expand) Abstract: As the volume of text-based information on climate policy increases, natural language processing (NLP) tools can distill information from text to better inform decision making on climate policy. We investigate how large pretrained transformers based on the BERT architecture classify sentences on a dataset of climate action plans which countries submitted to the United Nations following the 2015 Paris Agreement. We use the document header structure to assign noisy policy-relevant labels such as mitigation, adaptation, energy, and land use to text elements. Our models provide an improvement in out-of-sample classification over simple heuristics though fall short of the consistency observed between human annotators. We hope to extend this framework to a wider class of textual climate change data such as climate legislation and corporate social responsibility filings and build tools to streamline the extraction of information from these documents for climate change researchers. Authors: Tom Corringham (Scripps Institution of Oceanography); Daniel Spokoyny (Carnegie Mellon University); Eric Xiao (University of California San Diego); Christopher Cha (University of California San Diego); Colin Lemarchand (University of California San Diego); Mandeep Syal (University of California San Diego); Ethan Olson (University of California San Diego); Alexander Gershunov (Scripps Institution of Oceanography)
ICML 2021	Powering Effective Climate Communication with a Climate Knowledge Base (Proposals Track) Abstract and authors: (click to expand) Abstract: While many accept climate change and its growing impacts, few converse about it well, limiting the adoption speed of societal changes necessary to address it. In order to make effective climate communication easier, we aim to build a system that presents to any individual the climate information predicted to best motivate and inspire them to take action given their unique set of personal values. To alleviate the cold-start problem, the system relies on a knowledge base (ClimateKB) of causes and effects of climate change, and their associations to personal values. Since no such comprehensive ClimateKB exists, we revisit knowledge base construction techniques and build a ClimateKB from free text. We plan to open source the ClimateKB and associated code to encourage future research and applications. Authors: Kameron B. Rodrigues (Stanford University); Shweta Khushu (SkySpecs Inc); Mukut Mukherjee (ClimateMind); Andrew Banister (Climate Mind); Anthony Hevia (ClimateMind); Sampath Duddu (ClimateMind); Nikita Bhutani (Megagon Labs)
ICML 2021	From Talk to Action with Accountability: Monitoring the Public Discussion of Policy Makers with Deep Neural Networks and Topic Modelling (Proposals Track) Abstract and authors: (click to expand) Abstract: Decades of research on climate have provided a consensus that human activity has changed the climate and we are currently heading into a climate crisis. While public discussion and research efforts on climate change mitigation have increased, potential solutions need to not only be discussed but also effectively deployed. For preventing mismanagement and holding policy makers accountable, transparency and degree of information about government processes have been shown to be crucial. However, currently the quantity of information about climate change discussions and the range of sources make it increasingly difficult for the public and civil society to maintain an overview to hold politicians accountable. In response, we propose a multi-source topic aggregation system (MuSTAS) which processes policy makers' speech and rhetoric from several publicly available sources into an easily digestible topic summary. MuSTAS uses novel multi-source hybrid latent Dirichlet allocation to model topics from a variety of documents. This topic digest will serve the general public and civil society in assessing where, how, and when politicians talk about climate and climate policies, enabling them to hold politicians accountable for their actions to mitigate climate change and lack thereof. Authors: Vili Hätönen (Emblica); Fiona Melzer (University of Edinburgh)
ICML 2021	NeuralNERE: Neural Named Entity Relationship Extraction for End-to-End Climate Change Knowledge Graph Construction (Proposals Track) Abstract and authors: (click to expand) Abstract: This paper proposes an end-to-end Neural Named Entity Relationship Extraction model (called NeuralNERE) for climate change knowledge graph (KG) construction, directly from the raw text of relevant news articles. The proposed model will not only remove the need for any kind of human supervision for building knowledge bases for climate change KG construction (used in the case of supervised or dictionary-based KG construction methods), but will also prove to be highly valuable for analyzing climate change by summarising relationships between different factors responsible for climate change, extracting useful insights & reasoning on pivotal events, and helping industry leaders in making more informed future decisions. Additionally, we also introduce the Science Daily Climate Change dataset (called SciDCC) that contains over 11k climate change news articles scraped from the Science Daily website, which could be used for extracting prior knowledge for constructing climate change KGs. Authors: Prakamya Mishra (Independent Researcher); Rohan Mittal (Independent Researcher)
NeurIPS 2020	Analyzing Sustainability Reports Using Natural Language Processing (Papers Track) Abstract and authors: (click to expand) Abstract: Climate change is a far-reaching, global phenomenon that will impact many aspects of our society, including the global stock market. In recent years, companies have increasingly been aiming to both mitigate their environmental impact and adapt their practices to the changing climate context. This is reported via increasingly exhaustive reports, which cover many types of sustainability measures, often under the umbrella of Environmental, Social, and Governance (ESG) disclosures. However, given this abundance of data, sustainability analysts are obliged to comb through hundreds of pages of reports in order to find relevant information. We have leveraged recent progress in Natural Language Processing (NLP) to create a custom model, ClimateQA, which allows the analysis of financial reports in order to identify climate-relevant sections using a question answering approach. We present this tool and the methodology that we used to develop it in the present article. Authors: Sasha Luccioni (Mila); Emi Baylor (McGill); Nicolas Duchene (Universite de Montreal)
NeurIPS 2020	Using attention to model long-term dependencies in occupancy behavior (Papers Track) Abstract and authors: (click to expand) Abstract: Over the past years, more and more models have been published that aim to capture relationships in human residential behavior. Most of these models are different Markov variants or regression models that have a strong assumption bias and are therefore unable to capture complex long-term dependencies and the diversity in occupant behavior. This work shows that attention based models are able to capture complex long-term dependencies in occupancy behavior and at the same time adequately depict the diversity in behavior across the entire population and different socio-demographic groups. By combining an autoregressive generative model with an imputation model, the advantages of two data sets are combined and new data are generated which are beneficial for multiple use cases (e.g. generation of consistent household energy demand profiles). The two-step approach generates synthetic activity schedules that have similar statistical properties as the empirically collected schedules and do not contain direct information about single individuals. Therefore, the presented approach forms the basis to make data on occupant behavior freely available, so that further investigations based on the synthetic data can be carried out without a large data application effort. In future work it is planned to take interpersonal dependencies into account in order to be able to generate entire household behavior profiles. Authors: Max Kleinebrahm (Karlsruhe Institut für Technologie); Jacopo Torriti (University of Reading); Russell McKenna (University of Aberdeen); Armin Ardone (Karlsruhe Institut für Technologie); Wolf Fichtner (Karlsruhe Institute of Technology)
NeurIPS 2020	Narratives and Needs: Analyzing Experiences of Cyclone Amphan Using Twitter Discourse (Papers Track) Abstract and authors: (click to expand) Abstract: People often turn to social media to comment upon and share information about major global events. Accordingly, social media is receiving increasing attention as a rich data source for understanding people's social, political and economic experiences of extreme weather events. In this paper, we contribute two novel methodologies that leverage Twitter discourse to characterize narratives and identify unmet needs in response to Cyclone Amphan, which affected 18 million people in May 2020. Authors: Ancil S Crayton (Booz Allen Hamilton); Joao Fonseca (NOVA Information Management School); Kanav Mehra (Independent Researcher); Jared Ross (Booz Allen Hamilton); Marcelo Sandoval-Castañeda (New York University Abu Dhabi); Michelle Ng (International Water Management Institute); Rachel von Gnechten (International Water Management Institute)
NeurIPS 2020	Emerging Trends of Sustainability Reporting in the ICT Industry: Insights from Discriminative Topic Mining (Papers Track) Abstract and authors: (click to expand) Abstract: The Information and Communication Technologies (ICT) industry has a considerable climate change impact and accounts for approximately 3 percent of global carbon emissions. Despite the increasing availability of sustainability reports provided by ICT companies, we still lack a systematic understanding of what has been disclosed at an industry level. In this paper, we make the first major effort to use modern unsupervised learning methods to investigate the sustainability reporting themes and trends of the ICT industry over the past two decades. We build a cross-sector dataset containing 22,534 environmental reports from 1999 to 2019, of which 2,187 are ICT specific. We then apply CatE, a text embedding based topic modeling method, to mine specific keywords that ICT companies use to report on climate change and energy. As a result, we identify (1) important shifts in ICT companies' climate change narratives from physical metrics towards climate-related disasters, (2) key organizations with large influence on ICT companies, and (3) ICT companies' increasing focus on data center and server energy efficiency. Authors: Lin Shi (Stanford University); Nhi Truong Vu (Stanford University)
NeurIPS 2020	Climate-FEVER: A Dataset for Verification of Real-World Climate Claims (Papers Track) Abstract and authors: (click to expand) Abstract: Our goal is to introduce Climate-FEVER, a new publicly available dataset for verification of climate change-related claims. By providing a dataset for the research community, we aim to help and encourage work on improving algorithms for retrieving climate-specific information and detecting fake news in social and mass media to reduce the impact of misinformation on the formation of public opinion on climate change. We adapt the methodology of FEVER (Thorne et al., 2018), the largest dataset of artificially designed claims, to real-life claims collected from the Internet. Although during this process, we could count on the support of renowned climate scientists, it turned out to be no easy task. We discuss the surprising, subtle complexity of modeling real-world climate-related claims within the FEVER framework, which provides a valuable challenge for general natural language understanding. We hope that our work will mark the beginning of an exciting long-term joint effort by the climate science and AI community to develop robust algorithms to verify the facts for climate-related claims. Authors: Markus Leippold (University of Zurich); Thomas Diggelmann (ETH Zurich)
NeurIPS 2020	ClimaText: A Dataset for Climate Change Topic Detection (Papers Track) Abstract and authors: (click to expand) Abstract: Climate change communication in the mass media and other textual sources may affect and shape public perception. Extracting climate change information from these sources is an important task, e.g., for filtering content and e-discovery, sentiment analysis, automatic summarization, question-answering, and fact-checking. However, automating this process is a challenge, as climate change is a complex, fast-moving, and often ambiguous topic with scarce resources for popular text-based AI tasks. In this paper, we introduce ClimaText, a dataset for sentence-based climate change topic detection, which we make publicly available. We explore different approaches to identify the climate change topic in various text sources. We find that popular keyword-based models are not adequate for such a complex and evolving task. Context-based algorithms like BERT (Devlin et al., 2018) can detect, in addition to many trivial cases, a variety of complex and implicit topic patterns. Nevertheless, our analysis reveals a great potential for improvement in several directions, such as, e.g., capturing the discussion on indirect effects of climate change. Hence, we hope this work can serve as a good starting point for further research on this topic. Authors: Markus Leippold (University of Zurich); Francesco Saverio Varini (ETH)
NeurIPS 2020	Expert-in-the-loop Systems Towards Safety-critical Machine Learning Technology in Wildfire Intelligence (Proposals Track) Abstract and authors: (click to expand) Abstract: With the advent of climate change, wildfires are becoming more frequent and severe across several regions worldwide. To prevent and mitigate its effects, wildfire intelligence plays a pivotal role, e.g. to monitor the evolution of wildfires and for early detection in high-risk areas such as wildland-urban-interface regions. Recent works have proposed deep learning solutions for fire detection tasks, however the current limited databases prevent reliable real-world deployments. We propose the development of expert-in-the-loop systems that combine the benefits of semi-automated data annotation with relevant domain knowledge expertise. Through this approach we aim to improve the data curation process and contribute to the generation of large-scale image databases for relevant wildfire tasks and empower the application of machine learning techniques in wildfire intelligence in real scenarios. Authors: Maria João Sousa (IDMEC, Instituto Superior Técnico, Universidade de Lisboa); Alexandra Moutinho (IDMEC, Instituto Superior Técnico, Universidade de Lisboa); Miguel Almeida (ADAI, University of Coimbra)
