1. Barcelona’s approach procedures to RWY 24R
The ORCI project focuses on the West Parallel Runways configuration at Barcelona Airport, which is the most used configuration, with landing operations conducted on RWY 24R.
Arrivals to this runway are managed through RNAV1 instrument approach procedures known as transitions, which are specifically designed to sequence inbound traffic at high-density airports. An RNAV transition is a published procedure consisting of: 1 initial segment, 1 outbound leg and 1 inbound leg, which links a STAR (from an IAF or some earlier point) to a point from which it is possible to accomplish the final approach segment of an approach procedure to the ILS or equivalent approach.

2. 𝗗𝗮𝘁𝗮 𝗖𝗹𝗲𝗮𝗻𝗶𝗻𝗴 / Trajectory Validation
The ORCI project focuses on landing operations on runway RWY 24R and does not consider operations on other runways. In addition, only flights that perform the transition maneuver and whose trajectories are reasonably similar to the published procedure are included.
To prepare the data for training, it is necessary to carefully clean and select the flight trajectories. Although the trombone procedure is clearly defined, it is not always followed in practice. For example, during periods of low traffic, the trombone may be skipped and aircraft may be guided directly to the initial fix (TEBLA). For this reason, only those flights that closely follow the defined approach procedure have been selected, ensuring that the training data contains representative trajectories. The first two images show trajectories where the transition is not used, while the third image shows trajectories that follow the transition as published.



3. Creating synthetic flights.
The main objective of ORCI is to predict the spacing between two consecutive aircraft at the point where the follower intercepts the localizer. For this purpose, each aircraft is paired with the one immediately ahead, designating the first to land as the preceding aircraft and the second as the follower. For subsequent pairs, the same aircraft may act as a follower in one pair and as the preceding aircraft in the next, depending on its position in the landing sequence.
Since the tool is developed using real traffic data, the range of situations it is exposed to is limited to those that occur in normal operations. However, for ORCI to provide reliable predictions, it must also be able to handle a wider variety of situations, including rare or extreme cases that are not present in historical data.
To overcome this limitation, synthetic training samples are created by shifting the timing of the leading aircraft in each valid flight pair forward or backward by a few seconds, while keeping the following aircraft unchanged and preserving the original 3D trajectories. This allows a single real flight pair to generate multiple scenarios with different spacing conditions.
This process makes it possible to cover a full range of situations, including:
- Real operational situations
- Excessive spacing situations
- Unrealistic situations where the following aircraft overtakes or collides with the leading aircraft
By expanding the original dataset from around 3,200 real flights to approximately 55,000 samples, the tool is exposed to separation values ranging roughly from −2 to 8 NM, enabling it to learn from both tighter and wider spacing scenarios than those observed in real operations.


4. Machine Learning Models:
The dataset includes information for both the follower and the preceding aircraft at the theoretical moment when the turn instruction is commanded. Some columns contain data directly extracted from the flight information at that instant; other columns are derived features. Finally, the dataset includes the target column.
| Raw Flight variables | Calculated variables | Prediction Field |
| Geographical coordinates for both aircraft GS for both aircraft FL for both aircraft Wake turbulence follower | HDG deviation from LOC of the preceding aircraft Distance to THR of the follower aircraft Relative distance projected in LOC Median-centered GS for both aircraft Median-centered FL for both aircraft | Inter-distance when the follower aircraft intercepts LOC |
The training and test sets were split by allocating three weeks, one of each month to the test dataset and the remaining weeks to the training set, resulting in an approximate training and test ratio of 75%–25%. This temporal separation was chosen to prevent synthetic samples derived from the same flight from appearing in both datasets.
The geometry of the “trombone” structure means that spacing evolution is strongly affected by how advanced the leading aircraft is within its turning maneuver. For this reason, two separate linear regression models were trained: one applicable when the preceding aircraft is approximately in the first half of the turn, and another when it is in the second half. This distinction allows the predictive model to better capture the nonlinear geometric effects that occur during the turn, improving local accuracy while maintaining overall model simplicity and interpretability.
ML Results
The development of the ORCI prediction tool prioritized explainable machine-learning models, particularly linear regression, which offers a good balance between transparency and performance. During the exploratory phase, more complex algorithms such as Random Forests, gradient boosting methods and neural networks were also evaluated, but they did not provide significant improvements in predictive accuracy.
An initial linear regression model was first trained to predict the spacing between aircraft. In this context, prediction error is defined as the difference between predicted and actual spacing. Positive errors correspond to situations where aircraft end up closer than predicted, which represents the least desirable outcome from a safety perspective.
To mitigate this effect, a second iteration of the model was developed using a weighted linear regression. Higher weights were assigned to cases with positive errors, allowing the model to focus on these critical situations and reduce the likelihood of non-conservative predictions.
| Model /Metric | MAE | MSE | R2 | p10 | p50 | p90 | ||
| Conventional LR | Model 1 | Train | 0.380 | 0.248 | 0.934 | -0.581 | 0.003 | 0.603 |
| Test | 0.400 | 0.276 | 0.927 | -0.625 | 0.001 | 0.596 | ||
| Model 2 | Train | 0.367 | 0.228 | 0.967 | -0.555 | -0.001 | 0.588 | |
| Test | 0.382 | 0.239 | 0.963 | -0.607 | -0.035 | 0.566 | ||
| Weighted LR | Model 1 | Train | 0.439 | 0.319 | 0.916 | -0.856 | -0.242 | 0.369 |
| Test | 0.462 | 0.356 | 0.905 | -0.88 | -0.250 | 0.356 | ||
| Model 2 | Train | 0.421 | 0.289 | 0.959 | -0.807 | -0.228 | 0.373 | |
| Test | 0.445 | 0.317 | 0.951 | -0.858 | -0.263 | 0.355 | ||


Conventional linear regression models show better global performance, with lower MAE values (around 0.37–0.40 NM) and higher R² (up to 0.96), indicating strong overall accuracy and consistent behaviour between training and test datasets.
Weighted linear regression introduces a moderate increase in MAE (around 0.42–0.46 NM) and a slight reduction in R², reflecting the expected trade-off due to the applied weighting strategy. However, this is accompanied by a significant improvement in the distribution of critical errors.
In particular, the upper percentile (p90) is substantially reduced from approximately 0.56–0.60 in the conventional models to around 0.35–0.37 in the weighted models, indicating a strong reduction in large positive errors. At the same time, the distribution shifts towards more negative values (p50 around -0.24 to -0.26), showing a more conservative behaviour with increased overestimation of spacing.
Overall, the results show a clear trade-off: conventional models optimize global accuracy, while weighted models prioritize operational safety by reducing the most critical error cases.
Follow ORCI on LinkedIn to stay up to date with the latest news.


