Skip to main navigation Skip to main content Skip to page footer

Working Paper

Forecasting Bilateral Refugee Flows with High-dimensional Data and Machine Learning Techniques

Authors

  • Boss
  • K.
  • Gröger
  • A.
  • Heidland
  • T.
  • Krüger
  • F.
  • Zheng
  • C.

Publication Date

JEL Classification

C53 C55 F22

Key Words

Asylum Seekers

Forced Migration

Forecasting

Google Trends

Machine Learning

Migration

Refugee Flows

Related Topics

Migration

Globalization

European Union & Euro

Emerging Markets & Developing Countries

Europe

We develop monthly refugee flow forecasting models for 150 origin countries to the EU27, using machine learning and high-dimensional data, including digital trace data from Google Trends. Comparing different models and forecasting horizons and validating them out-of-sample, we find that an ensemble forecast combining Random Forest and Extreme Gradient Boosting algorithms consistently outperforms for forecast horizons between 3 to 12 months. For large refugee flow corridors, this holds in a parsimonious model exclusively based on Google Trends variables, which has the advantage of close-to-real-time availability. We provide practical recommendations about how our approach can enable ahead-of-period refugee forecasting applications.

Kiel Institute Expert

  • Prof. Dr. Tobias Heidland
    Research Director

More Publications

Subject Dossiers

  • View over cargo ship deck with containers

    International Trade

  • Colorful flags of European countires in front of an official EU building.

    Tension within the European Union

  • People demonstrating against war in the Ukraine

    War against Ukraine

Research Center

  • International Development