Skip navigation
Library
Project

Developing an AI-powered tool for data extraction from texts

How can artificial intelligence (AI) be used to support the data extraction stage of systematic evidence synthesis and policy analyses? That is explored in this project where an AI-powered and expert-validated process is developed and tested.

Active project

2024

SEI champions systematic evidence syntheses (including systematic maps and reviews) to support evidence-informed decision-making. A comprehensive synthesis of available research is invaluable to understand complex societal challenges, for example related to the environment and development. It can however be quite costly and time consuming if the synthesis involves a large body of research.

In this project, we are developing and testing an AI-powered and expert-validated process for data extraction from texts. The aim is to enhance the efficiency and comprehensiveness of systematic evidence syntheses, policy analyses and similar methodologies that apply text analysis. If proven reliable and sufficiently precise, this AI tool could improve decision-support capabilities by efficiently providing more accurate evidence.  

The proposed approach applies Large Language Models (LLMs) with semantic search to extract information buried in documents. This makes it possible to identify answers to questions through meaning (rather than relying on keyword search). The AI-driven semantic coding is then combined with human expert validation.

In this collaboration between SEI and the KTH Royal Institute of Technology in Stockholm, the AI-powered data extraction process is piloted on a review of how well Water, Sanitation and Hygiene (WASH) policies integrate climate policy objectives, and vice versa. To ensure diverse perspectives and domain-specific knowledge, a pool of subject experts from various SEI centres will validate the AI outputs.

The project is a collaboration between SEI experts in evidence synthesis methods, WASH and climate and KTH’s Climate Action Centre. It is co-lead by Biljana Macura (Senior Research Fellow at Stockholm Environment Institute, SEI) and Haluk Akay (Postdoc at KTH Climate Action Centre).

Biljana Macura
Biljana Macura

Senior Research Fellow and Team Lead

SEI Headquarters

Daniel Ddiba
Daniel Ddiba

Research Fellow

SEI Headquarters

Carla Liera
Carla Liera

Research Associate

SEI Headquarters

Nhilce N. Esquivel
Nhilce N. Esquivel

Research Associate

SEI Headquarters

Adriana Soto
Adriana Soto Trujillo

Research Associate

SEI Headquarters

Vivian Ribeiro
Vivian Ribeiro

Senior Data Scientist

SEI Headquarters

Lutta Alphayo
Alphayo Lutta

Research Fellow

SEI Africa

Kevin Hicks

Senior Research Fellow

SEI York

Chris Malley

Senior Research Fellow

SEI York

Camilo Andrés González

Research Associate

SEI Latin America

Maria Sköld
Maria Sköld

Senior Communications and Impact Officer

Communications

SEI Headquarters

Carla Liera
Carla Liera

Research Associate

SEI Headquarters

The project is a collaboration between SEI and the Climate Action Centre at KTH Royal Institute of Technology in Stockholm. It is jointly funded by the SEI’s core funding and the Digital Futures Postdoc Fellowship Data-Driven Design for Climate Action.

Design and development by Soapbox.