Skip to main content

Learning to Land Anywhere: Transferable Generative Models for Aircraft Trajectories

Paper ID

SIDs-2025-090

Conference

SESAR Innovation Days

Year

2025

Theme

Trajectory prediction and management

Project Name

SESAR 3 ER1 project SynthAIR

Keywords:

Air traffic management; embeddings; variational autoencoders; operational analytics; trajectory clustering; outlier detection; synthetic data; generative models

Authors

Olav Finne Præsteng Larsen, Massimiliano Ruocco, Michail Spitieris, Abdulmajid Murad and Martina Ragosta

DOI

https://doi.org/10.61009/SID.2025.1.41

Project Number

101114847

Abstract

Access to trajectory data is a key requirement for developing and validating Air Traffic Management (ATM) solutions, yet many secondary and regional airports face severe data scarcity. This limits the applicability of machine learning methods and the ability to perform large-scale simulations or “what- if” analyses. In this paper, we investigate whether generative models trained on data-rich airports can be efficiently adapted to data-scarce airports using transfer learning. We adapt state- of-the-art diffusion- and flow-matching–based architectures to the aviation domain and evaluate their transferability between Zurich (source) and Dublin (target) landing trajectory datasets. Models are pretrained on Zurich and fine-tuned on Dublin with varying amounts of local data, ranging from 0% to 100%. Results show that diffusion-based models achieve competitive performance with as little as 5% of the Dublin data and reach baseline-level performance around 20%, consistently outper- forming models trained from scratch across metrics and visual inspections. Latent Flow Matching and Latent Diffusion models also benefit from pretraining, though with more variable gains, while Flow Matching models show weaker generalization. Despite challenges in capturing rare trajectory patterns, these findings demonstrate the potential of transfer learning to substantially reduce data requirements for trajectory generation in ATM, enabling realistic synthetic data generation even in environments with limited historical records.