CLICK HERE TO DOWNLOAD THE PDF
Florian Bourgey and Julien Guyon introduce a novel discrete-time-continuous-time exact calibration method. They build an S&P 500/VIX jointly calibrated discrete-time model, which they later extend to continuous time by martingale interpolation. The benefit of this technique is that both steps can be made much faster than the known methods that calibrate a continuous-time model directly
In previous work, Guyon (2020, 2024) showed how to build a nonparametric discrete-time arbitrage-free model that perfectly matches market data on Standard & Poor’s 500 index value (SPX) futures, SPX options, Chicago Board Options Exchange Volatility Index (VIX) futures and VIX options. The probability distribution is built by minimising the relative entropy with respect to a reference probability measure, and this Schrödinger problem is solved numerically using an extended Sinkhorn algorithm. This provided the first exact solution to this joint calibration problem, a difficult problem (especially for short maturities) that had eluded quants for many years. Jointly calibrating to SPX and VIX futures and options is important to prevent arbitrage and ensure accurate pricing of liquid hedging instruments; calibrating to VIX derivatives means incorporating market information on SPX forward volatilities. Figure 1 showcases the additional information contributed by the VIX by quantifying how model-free bounds for SPX path-dependent payoffs tighten when the prices of VIX futures and VIX options are included. The extra information is also seen from table A, where we compare the prices of several options in models that are all calibrated to SPX smiles but not all calibrated to VIX futures and smiles. Table A shows that to avoid mispricing some payoffs (particularly forward-starting payoffs, which are sensitive to forward volatilities) it is important that the model also fits VIX futures and options, even when the payoff depends only on SPX prices.
The aim of this paper is twofold:
to speed up the construction of the discrete-time model by turning to Newton-type methods for solving the Schrödinger system (in the next section, we show numerically that a mixed Newton–Sinkhorn method and an implied Newton method converge much faster than the Sinkhorn algorithm); and
to quickly build a continuous-time extension of the model, allowing the pricing of options depending on SPX values at any date , while ensuring calibration to the market smiles of SPX and VIX.
The first step draws inspiration from De March (2018), who explored the entropic approximation for discrete-time multi-dimensional martingale optimal transport, excluding VIX, using Newton’s method. Our continuous-time construction in the second step may appear similar to the Bass local volatility of Conze & Henry-Labordère (2022) but it is fundamentally different. First, unlike the construction by Conze & Henry-Labordère (2022), our purely forward Markov functional construction does not require the solution of a fixed-point problem; it is thus much faster. This is because we first build an arbitrage-free multi-marginal discrete-time model consistent with market data. Second, our continuous-time interpolated model fits not only SPX option prices but also VIX market data.
By following both steps, we thus quickly (in less than a minute) build a continuous-time model that is, by construction, exactly calibrated to SPX and VIX smiles and futures. By contrast, other known continuous-time exact solutions to the joint calibration problem (Guo et al 2022; Guyon 2022a) are more involved and demand significantly more computation time. Approximate parametric continuous-time solutions, including those based on rough or rough-like path-dependent volatility models, classical stochastic volatility models or signature-based models, are also costly in terms of computation time; we refer the reader to the extended online version of this article (Bourgey & Guyon 2022) for references. The main benefit of our novel discrete-time-continuous-time calibration method is that both steps are much faster than the known methods that directly calibrate a continuous-time model.
A natural practical application of our continuous-time model is the pricing and hedging of structured products by exotics desks (see table A). With our model, the pricing and hedging of structured products on the SPX indeed take into account all the information given by SPX smiles (the risk-neutral distributions of future SPX values) as well as all the information brought by VIX futures and VIX smiles (the risk-neutral distributions of some future SPX implied volatilities). Once the model has been calibrated (this takes less than a minute per VIX expiry), it is straightforward to implement and use, as it is a Markov functional model that involves simulating only one Brownian motion, along with the VIX at VIX future expiries. The model can also be used for computing reserves and other valuation adjustments and for assessing model risk.
Fast exact joint calibration in discrete time
Setting and notation
Let denote a VIX future maturity and set , where days. For simplicity, assume zero interest rates, repos and dividends. We take as given the full market smiles of the SPX index at and (ie, the full continuum of SPX call prices of maturity for any and all strikes ) as well as the full market smile of the VIX index at , (ie, the full continuum of VIX call prices for all strikes ). For , we use the shorthand notation . We use the term forward-starting log contract (FSLC) to denote the financial derivative that pays at . From the VIX definition (substituting the strip of out-of-the-money options with the log contract for simplicity), the price of the FSLC at is .
Table A: Prices of various options in the Dupire local volatility model, local two-factor Bergomi model and our continuous-time model. [We choose the same parameters as Guyon (2022b, table 4) for the local two-factor Bergomi, jointly calibrated to the term-structures of SPX ATM skew and VIX implied volatility: , , , , , , . We define , ]
Our continuous- | |||
---|---|---|---|
Price | LV | LV + Bergomi 2F | time model |
For each maturity , , the absence of static SPX arbitrage is equivalent to the existence of a risk-neutral measure , , such that the price of any vanilla option written on is the expectation of the payoff under . Similarly, by the absence of static VIX arbitrage, there exists a risk-neutral measure such that the price of any vanilla option written on is the expectation of the payoff under .
In the absence of dynamic SPX arbitrage (or calendar arbitrage), and are of convex order (ie, for any convex function ), even if we allow trading in the FSLC at . By the absence of arbitrage, the price of at time is the initial SPX spot value (ie, ). Furthermore, , where is the value at time of the VIX future maturing at . Finally, in order for the log contracts and the VIX squared to have finite prices, in the remainder of this section we assume the following.
Assumption 1
For any , the given marginals , and satisfy , and , .
Let and define the strictly convex function . For a probability distribution on , we denote the associated cumulative distribution by (ie, for every ). Let (respectively, ) denote the set of all probability measures on (respectively, ).
Let be the set of all measurable functions , , satisfying for , and , bounded. We use the shorthand notation:
to denote the profit-and-losses (P&Ls) from delta-hedging at time in the SPX and the log-contract, respectively. Finally, let denote the set of all VIX-constrained martingale probability measures:
Entropy minimisation
Solving the joint calibration problem is equivalent to building a probability measure . In the absence of joint SPX/VIX arbitrage, there may exist an infinite number of models within the convex set of jointly calibrated models. To build a specific, meaningful jointly calibrated model, in the spirit of Avellaneda et al (1997), Guyon (2020, 2024) suggests a minimum entropy approach: choose a reasonable (though not jointly calibrated) model on , possibly one derived from a model already in use at the financial institution, and build the probability measure that is closest to in the entropic sense (ie, has minimum relative entropy with respect to ):
(M) |
From Guyon (2024), if the minimum entropy problem is finite, there exists a unique minimiser of the form:
where , if they exist, are maximisers (called Schrödinger potentials) of the dual problem:
(P) |
Additionally, in any case, . Note that (P) is an unconstrained concave maximisation problem. Both problems are dual to each other; following a terminology proposed by Dupire, (M) is a measure problem while (P) is a portfolio problem.
As, in practice, only a finite number of SPX and VIX vanilla options are available for trading, we consider vanilla payoffs , and that are linear combinations of finitely many call options, along with one position in the bond, one position in and one position in the VIX futures. Therefore, we consider a market data set composed of call options on , and , which we denote by , and with respective strikes , and , and we build a model of the form:
where is an element of the set of all such that , , , and are bounded measurable functions of . The measure is then a consistent, arbitrage-free model that is jointly calibrated to the market prices of SPX/VIX futures and options if and only if solves the so-called -Schrödinger system:
(1) |
The first equation states that is a probability measure, while the others state that it belongs to the set of probability measures satisfying:
Remark 1
The value of the prior measure is chosen by the modeller. Examples include a lognormal prior:
where and is the distribution of:
with ;11 1 Assuming that is lognormally conditioned on and that is financially natural, which may not be the best choice in practice. Indeed, in this case, for , so the expectations in (1) may not be well defined. In practice, we avoid those integrability issues by working with a finite-support approximation of stemming from the Gaussian quadrature approximation of the expectations (see remark 2). or the independent prior (product measure) .
Remark 2
To evaluate the expectations arising in (1), we use a Gauss–Legendre quadrature when integrating with respect to (respectively, ) with grid (respectively, ). For , in the case of the lognormal prior, we use a Gauss–Hermite quadrature with knots
Solving the Schrödinger system (1)
The Sinkhorn algorithm
The classical method for solving Schrödinger systems is the Sinkhorn algorithm, an iterative method that sequentially solves the individual equations in the Schrödinger system (here, (1)), which converge to the optimiser of the whole system. This algorithm has recently gained popularity in machine learning, where it is used to quickly compute Wasserstein distances and more generally to solve optimal transport problems, via a small entropic penalty. It has also been applied to martingale optimal transport problems (De March 2018) and, in particular, in quantitative finance to quickly build arbitrage-free smiles (De March & Henry-Labordère 2019). In Guyon (2024) the Sinkhorn algorithm was extended to accommodate the martingality and consistency constraints in (1), and shown to converge toward a jointly calibrated model. However, the convergence was somewhat slow (see the ‘Comparison of the different algorithms’ section below). In the following sections, we present two faster alternatives for solving (1) numerically; both rely on solving the portfolio problem (P).
The Newton–Sinkhorn algorithm
Observe that if we define the concave function:
solving the -Schrödinger system is equivalent to cancelling the gradient of . Hence, to solve the system and build , one can directly solve the portfolio problem:
(2) |
which is the ‘finitely many payoffs’ version of (P). To this end, we suggest the following Newton–Sinkhorn algorithm. Each iteration involves a Newton step followed by a Sinkhorn step.
Newton step. Starting from an initial guess , we first solve for every iteration and the portfolio problem (2):
(3) |
where:
Since the Hessian of is known in closed form, this step is extremely fast. We solve (3) using the function scipy.optimize.minimize (method="trust-exact") from the scipy library.
Sinkhorn step. Then, for all , we jointly solve for and the two-dimensional nonlinear system:
(4) | ||||
(5) |
where is the optimal vector from the previous step, (3), and for all :
This is the Sinkhorn step, where the last two equations of (1) are both solved.
We use the Levenberg–Marquardt algorithm to solve (4) and (5) via scipy.optimize.root(method="lm"). A full Newton algorithm with parameterised deltas has also been considered in the extended online version of this article (Bourgey & Guyon 2022).
The implied Newton algorithm
Inspired by De March (2018), we observe that:
where solves the two-dimensional nonlinear system:
That is, for each , we first optimise over and , and then we optimise over . Note that the inner optimisation depends on only through .
Figure 2: Comparison of the performance of the Newton–Sinkhorn, implied Newton (with a 10-iteration pure Sinkhorn warm-start) and Sinkhorn algorithms. We plot the common logarithm of the calibration error as a function of the computational time as of April 20, 2020, with days.
As with , the gradient and Hessian of are known in closed form. In fact, has the same gradient and Hessian as , except for the terms involving differentiation with respect to , whose exact computation is detailed in Bourgey & Guyon (2022).
Comparison of the different algorithms
All the numerical tests were performed on a MacBook Pro laptop with a 2.6 GHz six-core Intel Core i7 processor and 32 GB memory using Python 3.9.6.
To compare the various algorithms, we plot the logarithm (in base 10) of the calibration error for the SPX smiles at and the VIX smile at as a function of the computational time as of April 20, 2020 ( days); see figure 2. Other calibration dates in 2018 and 2022 are considered by Bourgey & Guyon (2022). The calibration error is computed as the sum of the absolute relative errors of the three futures, the mean of the three smiles and the total mass of , which must be a probability measure.
We choose the lognormal prior for the reference measure (see remark 1) and we take knots for the integration with respect to and , and knots for the integration with respect to . We set , and , with for the lowest and highest values of the quadrature grids of and .
We compare the calibration speed of the Sinkhorn (S), Newton–Sinkhorn (NS) and implied Newton (IN) algorithms. For IN, as suggested by De March (2018, section 7.1), we initialise the algorithm with 10 iterations of a (pure) S algorithm (warm-start procedure). A warm-start initialisation of NS yielded no improvement.
Figure 3: Futures and smiles of , , after 35 seconds in the discrete-time model calibrated with the implied Newton algorithm versus market smiles as of April 20, 2020, with days. (a) Smile of SPX, days. (b) Smile of VIX, days. (c) Smile of SPX, days.
Figure 4: SPX smiles at and and VIX smile at (market, discrete-time model computed using the implied Newton algorithm and continuous-time extension), as of August 1, 2018. (a) Smile of SPX, days. (b) Smile of VIX, days. (c) Smile of SPX, days.
In figure 2, we observe that IN is the fastest, and S the slowest. In figure 3, we plot the futures and smiles for , and obtained with the IN algorithm after 60 seconds as well as the market smiles: the fits are perfect. The martingality and VIX constraints (reported in Bourgey & Guyon (2022)) are also perfectly satisfied. Similar plots are obtained for the other two calibration dates by Bourgey & Guyon (2022).
We also tested the three methods when the reference measure is the product measure (independent prior). The numerical results are reported in Bourgey & Guyon (2022). We observe that the three methods seem to be less stable with the independent prior and typically require a higher number of nodes; we chose . With the independent prior, IN needs more steps to converge.
Continuous-time extension
We now extend the discrete-time model (and the probability space) to build a continuous-time model for . The model will also include the VIX at , . That is, we build a probability on representing the distribution of . Here, plays the role of a discrete-time stochastic volatility, representing the stochastic volatility anticipated at for the period, but our model involves no continuous-time stochastic volatility process. Similarly to that of Conze & Henry-Labordère (2022), our model is computationally efficient, as it only requires simulating one Brownian motion (and ).
The key advantage of our construction compared with the Bass local volatility of Conze & Henry-Labordère (2022) is that it starts directly from a joint discrete distribution of , making the continuous-time interpolation a purely forward construction with no need to solve a fixed-point problem. Moreover, it includes a stochastic volatility component, , for the calibration to VIX futures and VIX smiles in addition to SPX smiles. As a result, in contrast with Conze & Henry-Labordère (2022) and in line with the path-dependency observed in financial markets (Guyon & Lekeufack 2023), our model is path-dependent: the SPX dynamics after depend on both and .
Step 1: simulation of
We want to be a -martingale and to have distribution under . To achieve this, one possible choice is to use a Markov functional model , , where is a -Brownian motion and satisfies the heat equation:
Further, define the filtration:
In such a case, is an -martingale, and the terminal condition is determined via quantiles so that has distribution , ie, for all , we set:
The solution to the heat equation is explicit and given by:
(6) |
where is the convolution operator and where, for , is the heat kernel.
Step 2: simulation of given
At this stage, we have simulated such that is an -martingale and has distribution under . Now, we simulate given under as follows: the distribution of given under is assumed to depend only on and is taken equal to the distribution of given under . Since has distribution under both and , this means that the distribution of is the same under and ; in particular, has the distribution under .
Step 3: simulation of given
. In this last step, we build dynamics for conditional on such that is an -martingale starting from , and has distribution . We use once again a Markov functional construction. Given and , we consider:
for all and where is the distribution of given and under . Then, given , we define for :
where for every :
(7) |
It is easy to check that is an -martingale starting from . Moreover, the distribution of given is the same under and . As the distribution of is the same under and , we conclude that the distribution of is the same under and . In particular , and have distributions , and under . We have thus built a model on such that (a) , and have distributions , and under ; (b) is an -martingale; and (c) is the VIX at , since by construction:
Remark 3 (Extension to several VIX maturities)
Note that this approach can easily be iterated on intervals . For example, after step 3, we will have generated . Then, we just need to generate and repeat the same procedure. Here, we disregard the Wednesday/Friday issue and make the approximation that the VIX future maturities are exactly 30 days apart.
The numerical implementation of the continuous-time extension is easy; for details we refer the reader to Bourgey & Guyon (2022). The resulting smiles (market, discrete and continuous time) for the SPX at and and that of the VIX at are displayed in figure 4; they were computed by Monte Carlo simulation with paths.
Pricing
Our continuous-time model can be used to price path-dependent options on the SPX with the guarantee that the model exactly matches the SPX smiles at and and the VIX future and VIX smile at , thus taking into account information about the forward volatility at that is not included in the SPX smiles. As a pricing exercise, we compare it with models commonly used by practitioners: the Dupire local volatility model (Dupire 1994) and the local stochastic version of the two-factor Bergomi model (Bergomi 2005); those models are calibrated to the full SPX implied volatility surface but not to VIX smiles. We consider spot-starting, forward-starting and mixed versions of lookback and Asian options. The prices are reported in table A along with their 95% confidence interval. We used Monte Carlo paths and a trapezoidal rule to approximate the time integral for the Asian option. Note that for forward-starting options the price in our jointly calibrated model is always larger than in the other two models: ignoring the VIX information leads to underpricing these forward-starting payoffs.
Conclusion
In this article, we have:
improved model-free bounds on SPX options by incorporating VIX options data;
built the minimum-entropy jointly calibrated discrete-time model very fast using an implied Newton method;
seamlessly extended this discrete-time model to continuous time in a purely forward fashion, using Markov functionals.
Thus, we have established a swift process for creating an arbitrage-free continuous-time model for SPX that accurately calibrates to SPX smiles, VIX futures and VIX smiles. Such a model can be used for pricing and hedging exotic options, computing reserves or valuation adjustments, and assessing model risk. Our main methodological contribution is that we first build a jointly calibrated discrete-time model (where the are the calibrated maturities) that is later extended to continuous time using an arbitrage-free martingale time interpolation. Since the discrete-time model can be exactly calibrated much faster than continuous-time models, and since extremely fast extrapolations exist, this novel approach seems to be a promising new avenue for calibrating models.
Florian Bourgey is a quantitative researcher at Bloomberg in New York, while Julien Guyon is a professor of applied mathematics at École des Ponts ParisTech in Paris. They appreciate the valuable feedback from the anonymous referees.
Email: fbourgey@bloomberg.net, julien.guyon@enpc.fr.
References
- Avellaneda M, C Friedman, R Holmes and D Samperi, 1997
Calibrating volatility surfaces via relative-entropy minimization
Applied Mathematical Finance 4(1), pages 37–64 - Bergomi L, 2005
Smile dynamics II
Risk October, pages 67–73 - Bourgey F and J Guyon, 2022
Fast exact joint S&P 500/VIX smile calibration in discrete and continuous time
Preprint, SSRN 4315084 - Conze A and P Henry-Labordère, 2022
A new fast local volatility model
Risk April, http://www.risk.net/7944906 - De March H, 2018
Entropic approximation for multi-dimensional martingale optimal transport
Preprint, arXiv:1812.11104 - De March H and P Henry-Labordère, 2019
Building arbitrage-free implied volatility: Sinkhorn’s algorithm and variants
Preprint, SSRN 3326486 - Dupire B, 1994
Pricing with a smile
Risk January, pages 18–20 - Guo I, G Loeper, J Obłój and S Wang, 2022
Joint modeling and calibration of SPX and VIX by optimal transport
SIAM Journal on Financial Mathematics 13(1), pages 1–31 - Guyon J, 2020
The joint S&P 500/VIX smile calibration puzzle solved
Risk April, http://www.risk.net/7518926 - Guyon J, 2022a
Dispersion-constrained martingale Schrödinger bridges: joint entropic calibration of stochastic volatility models to S&P 500 and VIX smiles
Preprint, SSRN 3853237 - Guyon J, 2022b
The VIX future in Bergomi models: fast approximation formulas and joint calibration with S&P 500 skew
SIAM Journal on Financial Mathematics 13(4), pages 1,418–1,485 - Guyon J, 2024
Dispersion-constrained martingale Schrödinger problems and the exact joint S&P 500/VIX smile calibration puzzle
Finance and Stochastics 28(1), pages 27–79 - Guyon J and J Lekeufack, 2023
Volatility is (mostly) path-dependent
Quantitative Finance 23(9), pages 1,221–1,258 - P Henry-Labordère, 2017
Model-Free Hedging: A Martingale Optimal Transport Viewpoint
Chapman and Hall/CRC
Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.
To access these options, along with all other subscription benefits, please contact info@risk.net or view our subscription options here: http://subscriptions.risk.net/subscribe
You are currently unable to print this content. Please contact info@risk.net to find out more.
You are currently unable to copy this content. Please contact info@risk.net to find out more.
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@risk.net
Copyright Infopro Digital Limited. All rights reserved.
You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.
If you would like to purchase additional rights please email info@risk.net
More on Cutting Edge
Choosing trading strategies using importance sampling
The sampling technique is more efficient than A-B testing at comparing decision rules
A comparison of FX fixing methodologies
FX fixing outcomes are mostly driven by length of calculation window
Quantum cognition machine learning: financial forecasting
A new paradigm for training machine learning algorithms based on quantum cognition is presented
Backtesting correlated quantities
A technique to decorrelate samples and reach higher discriminatory power is presented
A hard exit threshold strategy for market-makers
A closed-form solution to derive optimal stop-loss and profit-taking levels is presented
Pricing share buy-backs: an alternative to optimal control
A new method applies optimised heuristic strategies to maximise share buy-back contracts’ value
CVA sensitivities, hedging and risk
A probabilistic machine learning approach to CVA calculations is proposed
Podcast: Alvaro Cartea on collusion within trading algos
Oxford-Man Institute director worries ML-based trading could have anti-competitive effects