Documente Academic
Documente Profesional
Documente Cultură
11.23.18
Perspectives
Beyond
Black-Scholes:
A New Option
for Options
Pricing
By Balazs Mezofi and Kristof Szabo
WorldQuant, LLC
1700 East Putnam Ave.
Third Floor
Old Greenwich, CT 06870
www.weareworldquant.com
WorldQuant Beyond Black-Scholes: A New Option for Options Pricing
Perspectives 11.23.18
IN 1973, FISCHER BLACK, MYRON SCHOLES AND ROBERT MERTON an assumed risk premium similar to the valuation of investment
published their now-well-known options pricing formula, projects. In corporate finance, one of the most frequently used
which would have a significant influence on the development of models for the valuation of companies is the discounted cash flow
quantitative finance.1 In their model (typically known as Black- model (DCF), which calculates the present value of a company
Scholes), the value of an option depends on the future volatility of as the sum of its discounted future cash flows. The discount
a stock rather than on its expected return. Their pricing formula rate is based on the perceived risk of investing capital in that
was a theory-driven model based on the assumption that stock company. The revolutionary idea behind Black-Scholes was
prices follow geometric Brownian motion. Considering that the that it is not necessary to use the risk premium when valuing
Chicago Board Options Exchange (CBOE) opened in 1973, the an option, as the stock price already contains this information.
floppy disk had been invented just two years earlier and IBM was In 1997, the Royal Swedish Academy of Sciences awarded the
still eight years away from introducing its first PC (which had two Nobel Prize in economic sciences to Merton and Scholes for their
floppy drives), using a data-driven approach based on real-life groundbreaking work. (Black didn’t share in the prize. He died in
options prices would have been quite complicated at the time for 1995, and Nobel Prizes are not awarded posthumously.)
Black, Scholes and Merton. Although their solution is remarkable,
If all option prices are available in the market, Black-Scholes
it is unable to reproduce some empirical findings. One of the
can be used to calculate the so-called implied volatility based on
biggest flaws of Black-Scholes is the mismatch between the
option prices, as all the other variables of the formula are known.
model volatility of the underlying option and the observed volatility
Based on Black-Scholes, the implied volatility should be the same
from the market (the so-called implied volatility surface).
for all strike prices of the option, but in practice researchers found
Today investors have a choice. We have more computational that the implied volatility for options is not constant. Instead, it is
power in our mobile phones than state-of-the-art computers had skewed or smile-shaped.
in the 1970s, and the available data is growing exponentially. As
Researchers are actively seeking models that are able to price
a result, we can use a different, data-driven approach for options
options in a way that can reproduce the empirically observed
pricing. In this article, we present a solution for options pricing
implied volatility surface. One popular solution is the Heston
based on an empirical method using neural networks. The main
model, in which the volatility of the underlying asset is determined
advantage of machine learning methods such as neural networks,
using another stochastic process. The model, named after
compared with model-driven approaches, is that they are able to
University of Maryland mathematician Steven Heston, is able to
reproduce most of the empirical characteristics of options prices.
reproduce many empirical findings — including implied volatility
— but not all of them, so financial engineers have used different
Introduction to Options Pricing advanced underlying processes to come up with solutions to
With the financial derivatives known as options, the buyer pays generate empirical findings. As the pricing models evolved, the
a price to the seller to purchase a right to buy or sell a financial following difficulties arose:
instrument at a specified price at a specified point in the future.
Options can be useful tools for many financial applications,
including risk management, trading and management
compensation. Not surprisingly, creating reliable pricing models
The revolutionary idea
for options has been an active research area in academia. behind Black-Scholes was
that it is not necessary to
One of the most important results of this research was the
Black-Scholes formula, which gives the price of an option use the risk premium when
based on multiple input parameters, such as the price of the valuing an option, as the
underlying stock, the market’s risk-free interest rate, the time
until the option expiration date, the strike price of the contract
stock price already contains
and the volatility of the underlying stock. Before Black-Scholes, this information.
practitioners used pricing models based on the put-call parity or
• The underlying price dynamics got more complex mathematically parallel. One of their most interesting properties is duality in
and became more general — for example, using Lévy calculation speed: Although the training can be quite time-
processes instead of Brownian motions. consuming, once the process is finished and the approximation of
• The pricing of options became more resource intensive. the function is ready, the prediction is extremely fast.
Though the Black-Scholes model has a closed-form solution
for pricing European call options, today people usually use Neural Networks
more computationally intensive Monte Carlo methods to
price them. The essential concept of neural networks is to model the behavior
of the human brain and create a mathematical formulation of that
• It takes deeper technical knowledge to understand and use the
brain to extract information from the input data. The basic unit
pricing models.
of a neural network is a perceptron, which mimics the behavior
Applying machine learning methods to options pricing addresses of a neuron and was invented by American psychologist Frank
most of these problems. There are different algorithms that are Rosenblatt in 1957.4 But the potential of neural networks was not
able to approximate a function based on the function’s inputs and unleashed until 1986, when David Rumelhart, Geoffrey Hinton
outputs if the number of data points is sufficiently large. If we see and Ronald Williams published their influential paper on the
the option as a function between the contracted terms (inputs) backpropagation algorithm, which showed a way to train artificial
and the premium of the option (output), we can simply ignore all neurons.5 After this discovery, many types of neural networks
of the financial questions related to options or stock markets. were built, including the multilayer perceptron (MLP), which is the
Later we will see how adding some financial knowledge back into focus of this article.
the model can help improve the accuracy of the results, but on the
basic level no finance-related information is needed. The MLP is made up of layers of perceptrons, each of which
has an input: the sum of the output of the perceptrons from the
One of these approximation techniques uses artificial neural previous layer multiplied by their weights; it can be different for
networks, which have a number of useful properties. For each perceptron. The perceptrons use a nonlinear activation
example, some members of artificial neural networks are function (like the S-shaped sigmoid function) to transform the input
universal approximators — meaning that if the sample is large signals into output signals and send these signals into the next
enough and the algorithm is complex enough, then the function layer. The first layer (the input layer) is unique; perceptrons in this
that the network learned will be close enough to the real one for layer have just an output, which is the input data. The last layer (the
any practical purpose, as showed by George Cybenko (1989)2 and output layer) is unique in the sense that in regression problems it
Kurt Hornik, Maxwell B. Stinchcombe and Halbert White (1989).3 usually consists of a single perceptron. Any layers between these
Artificial neural networks are suitable for large databases because two layers are usually called hidden layers. For an MLP with one
the calculations can be done easily on multiple computers in hidden layer, the visualization is as follows in Figure 1.
Information Processing
Weight Weight
1
Activation
Σ
function Activation
x Σ Output
Activation function
Σ
function
x
Figure 1
Figure 1 can be written mathematically between the hidden layer between the model price and the observed price on the market) to
and the input layer as: reach good out-of-sample performance.
into too many parts, the chance of overfitting increases and the
When the dataset is small, model’s precision on out-of-sample results is reduced.9
choosing the best way to The world has come a long way since Black, Scholes and Merton
measure a function can published their seminal papers on options pricing in 1973. The
exponential growth in computational power and data, particularly
increase the precision over the past decade, has allowed researchers to apply machine
further. learning techniques to price derivatives with a precision unforeseen
in the ’70s and ’80s. Back then, options pricing was driven mainly
by theoretical models based on the foundation of stochastic
calculus. In this article, we provide an alternative method that uses
useful if the size of the dataset is small and you are more exposed machine learning, in particular neural networks, to price options
to overfitting problems. with a data-driven approach. We believe that this approach could
be a valuable addition to the tool set of financial engineers and may
While there is disagreement about which function researchers replace traditional methods in many application areas. ◀
should try to predict, there is a second debate about whether the
dataset should be split into subsets based on various qualities.
Balazs Mezofi is a Quantitative Researcher at WorldQuant, LLC,
Malliaris and Salchenberger argue that in-the-money and out-of-
and has an MSc in Actuarial and Financial Mathematics from
the-money options should be split into different datasets. From
Corvinus University in Budapest.
a practical point of view, this approach can be useful because the
magnitude of the option premiums can be very different in the two Kristof Szabo is a Senior Quantitative Researcher at WorldQuant,
groups. Sovan Mitra, a senior lecturer in mathematical sciences LLC, and has an MSc in Actuarial and Financial Mathematics from
at the University of Liverpool, contends that if the data are split Eötvös Loránd University in Budapest.
ENDNOTES
1. Stephen M. Schaefer. “Robert Merton, Myron Scholes and 6. Mary Malliaris and Linda M. Salchenberger. “A Neural
the Development of Derivatives Pricing.” Scandinavian Network Model for Estimating Option Prices.” Applied
Journal of Economics 100, no. 2 (1998): 425-445. Intelligence 3, no. 3 (1993): 193-206.
2. George Cybenko. “Approximation by Superpositions of 7. James M. Hutchinson, Andrew W. Lo and Tomaso Poggio.
a Sigmoidal Function.” Mathematics of Control, Signals and “A Nonparametric Approach to Pricing and Hedging Derivative
Systems 2, no. 4 (1989): 303-314. Securities Via Learning Networks.” Journal of Finance 49,
no. 3 (1994): 851-889.
3. Kurt Hornik, Maxwell B. Stinchcombe and Halbert White.
“Multilayer Feedforward Networks Are Universal 8. Mary Malliaris and Linda Salchenberger. “Using Neural
Approximators.” Neural Networks 2, no. 5 (1989): 359-366. Networks to Forecast the S&P 100 Implied Volatility.
”Neurocomputing 10, no. 2 (1996): 183-195.
4. Frank Rosenblatt. “The Perceptron: A Probabilistic Model for
Information Storage and Organization in the Brain.” 9. Sovan K. Mitra. “An Option Pricing Model That Combines
Psychological Review 65, no. 6 (1958): 386-408. Neural Network Approach and Black Scholes Formula.”
Global Journal of Computer Science and Technology 12,
5. David E. Rumelhart, Geoffrey E. Hinton and Ronald J. Williams. no. 4 (2012).
“Learning Representations by Back Propagating Errors.”
Nature 323, no. 6088 (1986): 533-536.
Thought Leadership articles are prepared by and are the property of WorldQuant, LLC, and are being made available for informational and educational
purposes only. This article is not intended to relate to any specific investment strategy or product, nor does this article constitute investment advice or convey
an offer to sell, or the solicitation of an offer to buy, any securities or other financial products. In addition, the information contained in any article is not
intended to provide, and should not be relied upon for, investment, accounting, legal or tax advice. WorldQuant makes no warranties or representations,
express or implied, regarding the accuracy or adequacy of any information, and you accept all risks in relying on such information. The views expressed
herein are solely those of WorldQuant as of the date of this article and are subject to change without notice. No assurances can be given that any aims,
assumptions, expectations and/or goals described in this article will be realized or that the activities described in the article did or will continue at all or in
the same manner as they were conducted during the period covered by this article. WorldQuant does not undertake to advise you of any changes in the
views expressed herein. WorldQuant and its affiliates are involved in a wide range of securities trading and investment activities, and may have a significant
financial interest in one or more securities or financial products discussed in the articles.