Documente Academic
Documente Profesional
Documente Cultură
Abstract
International flights arrive and depart from the United States daily at an increasing rate. These flights are monitored
and booked by airlines and the Transportation Security Authorities (TSA). A good forecast for international flights
and travelers arriving and departing from the US may help to plan for immigration arrival proceedings, security
screenings, monitoring, help to plan government budget spending on TSA processes as well as help to estimate
income for the tourism industry among others. This study presents an analysis and forecast assessment of
international flights and determines the best model for forecasting international flights. The study examines the
seasonality of the data, finds the best model and finally makes a forecast prediction for the 2012 year. We then
illustrate the model chosen for this forecast. In executing this study, we look at numerous modeling alternatives;
time series regression, winter’s exponential smoothing, decomposition method (automated and spread sheet),
ARIMA, combination model with ARIMA and dynamic regression. The study concludes that dynamic regression is
the best forecasting model with its unique handling capabilities of interventions and seasonality, and presents
recommendations.
Keywords
Forecast, seasonality, prediction, interventions
1. Introduction
International flights arrive and depart the United States daily. These flights are monitored and booked in and out at
all times. Arrivals to the United States by port-of-entry are tracked on a monthly basis [1]. We highlight summary of
statistics international travel reports by departments in charge. The U.S. Department of Commerce has arrivals data
on more than 40 U.S. ports-of-entry from all world regions [2]. In 2011, 62.7 million international visitors traveled
to the U.S., generating $153 billion in receipts and a $43 billion trade surplus [3]. According to the Office of Travel
and Tourism Industries, 2010 was a very good year for the travel and tourism industry; The United States welcomed
60 million international visitors, a record level of visitors to the United States, 5 million more than the year before
[4]. In 2010, the top inbound markets continued to be Canada and Mexico, both of which were up in arrivals along
with eight of the nine overseas regional markets. Asia, South America, Oceania and the Middle East experienced the
strongest growth in 2010, due in part to record level non-resident visits to the United States from Brazil, South
Korea, Australia, China, India and Colombia [4]. The US International Trade Administration reported in their
analysis that the top three ports of entry (New York JFK, Miami and Los Angeles) accounted for 39 percent of all
overseas arrivals, one and one-half percentage points less than last year [2]. A monthly independent forecast and
prediction of arrivals for these airports will be a great benefit to the management. Developing forecasts of aviation
demand and activity levels continues to be challenging as the aviation industry evolves and prior relationships
change [5]. A good forecast for international flights arrival and departure from the USA will help to estimate income
for the tourism industry, plan for immigration arrival proceedings, security screenings and monitoring among other
things. This study presents analysis and determines the best forecasting model for predicting international flights in
the USA.
Atuahene, Acosta-Amado, Sawhney, Appiah-Kubi and Upreti
We now briefly review literature on aviation forecasting. Accurate prediction of international flight arrivals and
departure is critical and fundamental to airport security, play an integral role in the assessment of future facility
needs at airports [6] as well as tourism planning among others. Using data from a major North American cruise
company, Xiaodong et al (2011) applied a variety of (24) forecasting methods, which are divided into three
categories (non-pickup methods, classical pickup (CP) methods and advanced pickup (AP) methods), to generate
forecasts of final bookings for the cruises that have not yet departed at a particular reading point. The study then
used a two-stage framework to test alternative forecasting methods and compared their performance [7].
Weatherford et al. (2003) presented the first published research paper on the technique of neural network forecasting
as applied to the airline industry, which was compared with the traditional forecasting techniques (moving averages,
exponential smoothing, regression, etc.) [8]. A paper presentation by Littlewood (2005) described the early work on
applying mathematical models to the development of revenue management in the airline industry. The study focused
on describing passenger forecasting and revenue control methods, and introduced the idea of maximizing the
revenue received on a particular flight, rather than maximizing the number of passengers carried [9]. Keith and
Leyton (2007) applied statistical, probabilistic forecasts and categorical forecasts methods in the forecasts of low
ceiling and/or reduced visibility and their corresponding impact on forecast value for flights arriving at three major
airports in the United States [10]. In a similar study, Rudack and Ghirardelli (2010) applied categorical and
probabilistic forecasts to conduct a comparative verification of localized aviation model output statistics program
(LAMP) and numerical weather prediction (NWP) model forecasts of ceiling height and visibility [11]. Chmielecki
and Raftery (2011) applied the Bayesian model averaging to probabilistic visibility forecasting using a predictive
PDF that is a mixture of discrete point mass and beta distribution components to develop three approaches to
developing predictive PDFs for visibility [12].
Researchers have noted that the main factors that significantly influence flight arrivals at both international and local
airports are weather conditions, terrorist attacks, spike in oil prices and recession to mention a few [7, 8]. Other
natural conditions like tornadoes, earthquakes and diseases like the H1N1 flu in the cold or winter season, etc may
also influence arrivals and departures. Several approaches and techniques have been used to forecast US travel
demand, commercial aviation forecasts and assumptions, aerospace forecast, etc. Conventionally, they were based
on time series, regression, and econometric approaches with iterative processes [13]. Among the approaches are
those of the FAA which they are confident that the forecasts accurately predict future aviation demands, however
they report that due to the large uncertainty of the operating environment the variance around the forecasts is wider
than in prior years [5]. Different forecasting approaches to the aviation industry (income, load, passenger
enplanement, etc can be found in literature. Commercial aviation forecasts reported thus far are considered
unconstrained in that they assume there will be sufficient infrastructure to handle the projected levels of activity [14,
15]. The FAA’s economic forecasts developed by Global Insight, Inc. do not assume further contractions of the
industry through bankruptcy, consolidation, or liquidation [5, 16]. On the other hand, we present a variety of
forecasting models that accommodates interventions and uncertainties and compare to select the best model.
2. Objective
The objective of this study is to analyze international flights arriving and departing from the US, from 2000 to 2011.
The data was collected by month for 12 years beginning 2000 and going through the next to the last month of
December 2011. The last 12 months of 2011 is used as a holdout sample. The study examines the seasonality of the
data; find the best forecasting model for international flights, etc. Finally, we attempt to make forecasts for 2012 and
compare it with the FAA 2012 data reported. We then illustrate the model chosen for this forecast. In executing this
study, the projects looks at numerous modeling alternatives; time series regression, winter’s exponential smoothing,
decomposition (automated and spread sheet), ARIMA, combination model with ARIMA and dynamic regression.
Finally, we conduct a forecast assessment and selection of the best forecasting model and then presents
recommendations. The forecasts and predictions we present are based on historical passenger statistics from the
United States Immigration and Naturalization Services and also from the Bureau of Transportation Statistics. The
data was obtained from the Bureau of Transportation Statistics T-100 Segment data section [17].
3. Methodology
130000.0 130000.0
Variables
International Box Plot
130000.0
115000.0
110000.0
International
International
100000.0 110000.0
International
90000.0
85000.0 90000.0
In treating outlier we used statistical process control (SPC) methodology. The ARIMA model with regular (1,0,0)
and seasonal (1,0,1) gives white noise , significant coefficients and no high parameter estimates, so was chosen for
the outlier analysis on the original data. This delineated 2 observations (2001, Month 9: September, and 2009 Month
5: May) as possible outliers. A normality test on the residuals of the untouched data with Shapiro-Wilk W test,
Kolmogorov-Smirnov test, D'Agostino Skewness test, D'Agostino Kurtosis test and D'Agostino Omnibus test did
not reject normality and showed that the data was normal. We also performed a white noise test which yielded p-
values =0, indicating that there is no white noise at alpha of 0.05 level.
Before using the various methodologies for the forecast analysis, an intervention analysis and identification was
performed using the SAS software to identify the possibility of interventions in the historical data. Based on the
analysis, the data was suspected of having interventions. We deal with the intervention in the dynamic forecast
section of this study. The first intervention was identified as a major change point that occurred early in the
series.There was a likely second intervention later and possibly a third in the series. In this case dynamic linear
regression may help with such situations. The intervention seems to have occurred in the year 2001 and in the month
of October which extends through November 2001 and January 2002. These interventions were investigated and we
found out that it may have to do with (9/11) after the bombing of the world trade center. At this point, international
flights were halted and flights from some countries restricted and limited for a period of time. The possibly second
intervention which occurred through September 2008 and Oct 2008 could be as a result of the number of things that
occurred through this period across the United States. Among these were hurricane Gustav on September 1st and
hurricane Ike, around September 13th with their effects spreading through the month of October. Another possible
one seems to have occurred around September 2009 which may be as a result of September just being a bad month
for travel since it’s the end of summer and also tail end of the hurricane seasons. Another reason which is not full
proof may be linked to International flights that are likely to drop every year during the month of September as a
result of the effect of 9/11.
The coefficients ,…, measure the effect of each predictor after taking account of the effect of all other predic-
tors in the model [18, 19]. With this approach, we considered numerous modeling alternatives using the NCSS
software. The models explored are; time series regression, Additive with trend only (T), additive with trend and
seasonality (T+S), Additive with trend and seasonality plus the interaction of trend and seasonality (T+S+T*S), and
Robust additive with trend and seasonality. Also, we explored Multiplicative with trend only (T), Multiplicative
with trend and seasonality (T+S), Multiplicative with trend and seasonality plus the interaction of trend and
seasonality (T+S+T*S) and Robust Multiplicative with trend and seasonality. Upon comparing all these models, the
additive with trend and seasonality (T+S) was found to be the best time series regression model. The ANOVA test
on the overall model as well as overall months yielded a p-value=0 which indicates that there is a difference in
month adjusting for trend. Also trend adjusting for seasonality was significant (with p-value of 0.00). The regression
equation identified that the p-value for the intercept is significant, but not for months 1, 3, 4, 5, 6. The regression
coefficient analysis gave an intercept of 84705.6818, and trend for each additional month increase by 228.7885
adjusting for trend and seasonality. The Durbin-Watson test for serial correlation rejects Ho. In the residual
diagnostic analysis section of the results, observations 25 and 26 seemed higher than the others and were suspicious
to be possible outliers. These are the months right after the September 2001, 9/11 intervention. The R-squared value
for the model is 0.7637 which shows that 76 % of the variation in international flights is explained by trend and
seasonality. The press R-squared value is 0.7089. There was no normality in the residuals; hence the assumptions
are not satisfied. Also some problems with the serial correlations were identified. The variance inflation factors were
acceptable since they were less than 5 and hence no co-linearity problems present.
References
1. Office of Travel and tourism industries, 2011, available at
http://www.tinet.ita.doc.gov/research/programs/i94/index.html
2. Office of Travel and tourism industries, Monthly tourism statistics, 2012. Available at
http://tinet.ita.doc.gov/view/m-2012-I-001/index.html
3. International Trade Administration, New Report Forecasts Continued Growth in International Travel to the
United States During Next Five Years, 2000. available at http://tinet.ita.doc.gov/view/f-2000-99-
001/forecast/Forecast_Summary.pdf
4. Office of Travel and Tourism Industries, 2010, “International Visitation to the United States: A statistical
Summary of US visitation (2010)”. Available at
http://tinet.ita.doc.gov/outreachpages/download_data_table/2010_Visitation_Report.pdf
5. FAA Aerospace Forecast, Fiscal Years 2011-2031, 2011, Available at
http://www.faa.gov/about/office_org/headquarters_offices/apl/aviation_forecasts/aerospace_forecasts/2011
-2031/media/2011%20Forecast%20Doc.pdf
6. Quad City International Airport, Master plan update, 2012. Available at http://qciaairportmasterplan.com/
7. Xiaodong, S., Gauri D. K., and Webster S., 2011, Forecasting for cruise line revenue management, Journal
of Revenue & Pricing Management 10, 306-324.
8. Weatherford, L. R., Gentry T. W. and Wilamowski B., 2003, Neural network forecasting for airlines: A
comparative analysis, Journal of Revenue & Pricing Management, 1, 319–331.
9. Littlewood, K., 2005, Special Issue Papers: Forecasting and control of passenger bookings, Journal of
Revenue & Pricing Management, 4, 111–123.
10. Ross, k., and Leyton S.M., 2007, An Experiment to Measure the Value of Statistical Probability Forecasts
for Airports, Wea. Forecasting, 22, 928–935.
11. Rudack, D. E., and Ghirardelli, J. E., 2010, A Comparative Verification of Localized Aviation Model
Output Statistics Program (LAMP) and Numerical Weather Prediction (NWP) Model Forecasts of Ceiling
Height and Visibility, Wea. Forecasting, 25, 1161–1178.
12. Chmielecki, R. M., and Raftery A. E., 2011, Probabilistic Visibility Forecasting Using Bayesian Model
Averaging, Mon. Wea. Rev., 139, 1626–1636.
13. National Bureau of Economic Research. Available at http://www.nber.org/
14. FAA Aerospace Forecast Fiscal Years 2012-2032, 2012. Available at:
http://www.faa.gov/about/office_org/headquarters_offices/apl/aviation_forecasts/aerospace_forecasts/2012
-2032/media/FAA%20Aerospace%20Forecasts%20FY%202012-2032.pdf
15. Field, C., 2005, Aviation Activity Forecasts, Master plan Update, Jacksonville Aviation Authority (Chapter
3). Available at:
http://www.flyjacksonville.com/pdfs/VQQMP/Chapter%203,%20Aviation%20Activity%20Forecasts.pdf
16. FAA Aerospace Forecast, Fiscal years 2009-2025, 2009, U.S. Department of Transportation Federal
Aviation Administration Aviation Policy and Plans. Available at
http://www.faa.gov/data_research/aviation/aerospace_forecasts/2009-
2025/media/2009%20Forecast%20Doc.pdf
17. Research and Innovative Technology Administration, Bureau of Transportation Statistics, 2012. Available
at http://www.transtats.bts.gov/Data_Elements.aspx?Data=1
18. DeLurgio, S.A., 1998, “Forecasting Principles and Applications”, 1st Edition, Graphic Creations, Richard D
Irwin; 1st edition.
19. Montgomery, D.C., Jennings, C.L., and Kulahci, M., 2008, “Introduction to Time Series Analysis and
Forecasting,” John Wiley & Sons, Inc.
Atuahene, Acosta-Amado, Sawhney, Appiah-Kubi and Upreti
20. Makridakis S. G., Wheelwright S. C., and Hyndman R. J., 1997, “Forecasting: Methods and Applications”,
Wiley; 3 edition.