1. Introduction
The “law of small numbers” is the name given to the well documented empirical regularity that people tend to overinfer from small samples in Tversky and Kahneman (1971). This post discusses a few of the results from Rabin (2002) which applies the law of small numbers to the beliefs of stock market traders. This paper is particularly nice because it captures this behavioral bias and its many interesting implications using only a small tweak to a simple Bayesian learning problem.
This post contains two parts: First, in Section
, I characterize the biased beliefs of a trader who is suffering from the law of small numbers. For brevity, I refer to this traders as Bob in the text below. Then, in Section
, I show how returns in a market populated by Bobs would display excess volatility.
2. The Core Idea
First, I define our hero’s problem. Suppose that Bob watches a sequence of signals
for
. The signal Bob sees
each period is an iid draw from a binomial distribution with intensity,
:
(1) ![Rendered by QuickLaTeX.com \begin{align*} \mathtt{Pr}[s_t=a] &= \theta, \qquad \theta \in [0,1] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-a841a9fec322db0765a911f919955904_l3.png)
There are a finite number of possible
‘s and Bob doesn’t know which
governs the stream of signals he observes. Let
denote the set of all rates that could occur with positive probability,
, so that
. Bob’s challenge is to infer which
is governing the string of signals he is observing.
Next, I define Bob’s inference strategy in light of his bias due to the law of small numbers. Suppose that he has correct beliefs about the distribution of
‘s and is fully Bayesian; however, he believes that there is some positive integer
such that signals are drawn without replacement from an urn containing
signals of
and
signals of
. Finally, so that the game does not end after
periods, Bob thinks that this urn is refilled every two draws. Thus, while odd and even draws are correlated, pairs of draws are iid.
In order for this inference strategy to be well defined, it has to be the case that Bob believes there is some
such that there are at least two
and
signals that can be drawn at each point in time. Thus, there exists
such that:
(2) 
implying that
. Let
represents Bob’s posterior beliefs about the probability of each
governing his string of signals after a history of signals
given that he is a type-
sufferer of the law of small numbers. As a clarifying example, note that
beliefs represent the beliefs of a fully rational agent. In the text below, I will can this fully rational agent Alice for concreteness.
With his problem and inference strategy in place, I now prove two results characterizing Bob’s beliefs. I first compute Bob’s beliefs immediately after seeing either
or
for a signal
on an odd period:
Proposition: For all
,
and
:
(3)
so that both
and
are increasing in
.
Proof: The expressions for
and
follow immediately from Bayes’ rule as, for example:
(4)
The fact that
is increasing in
follows from a Markov clever rewriting:
(5)
follows from the fact that Bob believes the signals are drawn from an urn
signals deep without replacement where one signal has already been removed. Since
is independent of
and
is increasing in
then
is increasing in
. The result for
follows from symmetry.
There are two interesting features of this result. First, note that Bob’s beliefs are identical to an agent with proper Bayesian beliefs in the first period. Second, because he believes that the signals are draw from an urn without replacement, Bob underestimates drawing two
‘s in a row or two
‘s in a row in a manner that decreases in the size of the urn.
Next, I characterize Bob’s posterior beliefs about two different
‘s given an extreme set of signals:
Proposition: Let
be a history of
signals and let
be a history of
signals. For all
and
such that
, both
and
are strictly decreasing in
.
Proof: For even
, note that:
(6)
Thus, this ratio is decreasing if and only if
. Extending the argument to odd values of
only changes the counting convention and symmetry yields the same result for
.
This proposition implies that, following an extreme sequence of signals, Bob overinfers that he is facing an extreme rate. Intuitively, if Bob thinks that the signals are drawn from an urn without replacement, then he is too surprised when he sees extreme signals because once a signal of
has been drawn in an odd period he believes that same signal cannot be drawn again in the following even period.
3. Excess Volatility
I now apply this reasoning to the behavior of returns in a market populated by Bobs. First, I describe the assets. Consider a market with countably infinitely many stocks indexed by
. Each month, every stock realizes either a positive or negative return denoted by
for positive returns or
for negative returns which is drawn iid from a binomial distribution with parameter
. Thus, in this market, positive returns for stock
today do not in fact predict positive returns tomorrow or vice versa. Suppose that a fraction
of the stocks have
, a fraction
of the stocks have
and the remaining fraction
of the stocks have
.
Next, I describe the trading strategy of the Bobs which I index with
. Let
denote the list of stocks not chosen by Bob
from month
up to but not including month
. Each Bob then adheres to the following trading strategy:
- At month
, Bob
picks one stock
at random and holds onto one share for the next four months,
. - Then, in month
, Bob
sells this share and picks a new stock at random at random from
. He buys a shares and holds onto it for the next four months,
. - Then, in month
, Bob
sells this share and picks a new stock at random at random from
. He buys a shares and holds onto it for the next four months,
. - And so on

Thus, via the law of large numbers, each stock will have the same number of Bobs holding is at each point in time with exactly
of the Bobs exchanging the stock for another each period.
I now consider the average beliefs of traders in a market populated by Bobs who suffer from the law of large numbers. First, I compute the probability that Bayesian traders and traders suffering from the law of small numbers believe that stock
‘s return parameter is
after observing different strings of returns in the left two columns of the table below. Then, I compute the probability that these two types of traders beliefs that the next return will be
given these previous return realizations in the right two columns of the same table.
Consistent with the second proposition in Section
above, note that the Bobs overestimate the probability that an asset’s returns are generated by the parameter
following a string of positive returns. Next, in the table below, I conclude by computing the average belief about the probability that
among both Bayesian traders (i.e., Alices) and traders suffering from the law of small numbers (i.e., Bobs) computed over the four groups of traders who have seen no signals, one signal, two signals and three signals for asset
respectively. Again, this table reveals that for extremely positive return histories, the Bobs overinfer the probability of
and thus
; however, for more balanced histories the Bobs underestimate the probability that
relative to the Bayesian Alices.
Thus, if all traders were Bobs, they would overreact to strings of positive returns and generate excess volatility.

and
are increasing in
and
follow immediately from Bayes’ rule as, for example:![Rendered by QuickLaTeX.com \begin{align*} \pi_1^N(\theta|s_1=a) &= \frac{\mathtt{Pr}[s_1 = a|\theta] \cdot \mathtt{Pr}[\theta]}{\mathtt{Pr}[s_1 = a]} \\ &= \frac{\theta \cdot \pi(\theta)}{\sum_{\theta' \in \Theta} \theta' \cdot \pi(\theta')} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-993618374145bae728c45119a1c382d7_l3.png)

follows from the fact that Bob believes the signals are drawn from an urn
is increasing in
is increasing in
follows from symmetry.
be a history of
be a history of
and
such that
, both
and
are strictly decreasing in 
underlying assets (e.g., mortgages) and
derivative assets (e.g., CDOs) where:
mortgages which each payout
defined as:
is an iid random variable such that
with probability
and
with probability
while
is Gaussian white noise with distribution
. For simplicity, I set
and
in the analysis below.
for each
while the buyer does not.
in cash to
in mortgages with
.
mortgages from the seller, he would be worried that the seller would put all of the bad mortgages in this collection so that the buyer would receive a collection of mortgages worth
rather than a “fair value” of
. In such a world, the buyer would never purchase mortgages from the seller and the seller would be forced to hold onto all of the mortgages which he values at only
due to his liquidity concerns. This loss of
is the core of the asymmetric information problem.
and (b) holds onto the junior tranche. After agreeing to accept the first
since the white noise terms ![Rendered by QuickLaTeX.com \begin{align*} (1/2) \cdot \sum_{i=1}^I x_i &= I/2 = \mathtt{E} \left[ \sum_{i=1}^I y_i \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-f49af96221ac292eee4732ffadbcaf94_l3.png)
with probability
of the mortgages in the entire pool. Each of these
where
if no more than
of the underlying mortgages default and
otherwise with:

of the
mortgages have a payout of
.
denote the lemons cost to the buyer if the seller knows a set
:![Rendered by QuickLaTeX.com \begin{align*} \kappa_j(L) &= \mathtt{E}\left[ V_j \right] - \underset{\mathcal{L} \subseteq \mathcal{I}}{\mathtt{min}} \left\{ \mathtt{E}\left[ \ V_j \ \middle| \ x_i = 0, \ \forall i \in \mathcal{L} \ \right] \right\} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-3f61423b10803e43ee431118ffb6aec2_l3.png)
has units of dollars.
so that the lemons cost would have been
.
and
, over the bipartite graph above:
random mortgages for every CDO.
CDOs (i.e., boobytrapped CDOs) and a number of lemons
from the total population of mortgages and
and
, then there is no polynomial time algorithm to distinguish between the distributions
of the
Notation):
is little-
as
approaches
” and write
as
to mean that:
of the assets in each CDO to affect payout as the expected number of defaults would be
while the standard deviation would be
. Thus, the lemons cost for a boobytrapped CDO will be
where
is the increase in the probability that a CDO does not payout due to the seller’s boobytrap and
denotes the standard normal distribution. The seller then chooses the number of CDOs to boobytrap, 
is concave for
and convex if
, the optimal choice of
or
for some small constant
.
given this optimization program:
and
, the seller will earn a utility:![Rendered by QuickLaTeX.com \begin{align*} U &\geq \left(1 - 2 \cdot \mathtt{N}[-\phi/(2 \cdot \sqrt{D})] - o(1) \right) \cdot B \cdot \bar{v} \approx L \cdot \sqrt{I/J} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-bc9d36535f0a2a63dc5b2bf086ef5e13_l3.png)
, let
denote the total number of defaulted mortgages. Since the CDO is boobytrapped
can be computed as: ![Rendered by QuickLaTeX.com \begin{align*} \mathtt{E}[s_j] = \frac{D - \theta}{2} + \theta \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-72709bac801dbc1073683be3b9e98ed1_l3.png)
![Rendered by QuickLaTeX.com \begin{align*} \mathtt{Pr}\left[ \ s_j \geq \left(\frac{D + \phi}{2}\right) \ \right] &= 1 - \mathtt{N}\left[-\frac{\phi}{2 \cdot \sqrt{D}}\right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-5f4f4f6cb4e26fe1be49a5f7516383d1_l3.png)

, plus the proportion of mortgages in the non-boobytrapped CDOs that default,
. The right hand side is the number of defaults per securitized mortgage times the average number of times each mortgage is securitized into a CDO. Thus, by solving for 
can be computed as:![Rendered by QuickLaTeX.com \begin{align*} \mathtt{N}\left[ - 3 - \frac{B \cdot \theta}{2 \cdot (J - B) \cdot D} \right] &= \mathtt{N}\left[ - \frac{\phi}{2 \cdot \sqrt{D}} \right] - \left. \frac{d\mathtt{N}}{dx} \right|_{x = -3} \cdot \left( \frac{B \cdot \theta}{2 \cdot (J - B) \cdot D} \right) \\ &= \mathtt{N}\left[ - \frac{\phi}{2 \cdot \sqrt{D}} \right] - O \left( \frac{B \cdot \theta}{2 \cdot (J - B) \cdot D} \right) \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-4878e917f2b92a24735fd62fb8fe07fd_l3.png)
which is roughly
smaller than the case with no boobytrapping.
denote Bill’s value function in units of
given an action 
, the current US GDP,
, the recent increase in the cost of break pads,
, and the completion of the new
. Likewise, the vector
denotes how much weight the Bill should place on each of the
are in units that convert decision factors into units of
. So, for example,
would have units of
while
would have units of
.
is a constant with units of
which balances the equation.
has units of
denote his choice of how many cars to produce where
for most
:
but
then Bill will adjust the number of cars he produces in response to a change in the US GDP but not in response to a change in the price of break pads. Thus, Bill’s choice of how many cars to produce
.
which is closer to the optimal choice
as he would suffer a quadratic loss from deviating from this optimal strategy defined by the function
below, but no compensating “cognitive” gain from not having to think about how the construction of a bridge in Minneapolis should affect his production decision:![Rendered by QuickLaTeX.com \begin{align*} L(m,\mu) &= \mathtt{E} \left[ \ V(a(\mu;x); \mu,x) - V(a(m;x); \mu,x) \ \right] \\ &= - \frac{\gamma}{2} \cdot \mathtt{E} \left[ \ \left( \sum_{n=1}^N \left( m_n - \mu_n \right) \cdot x_n \right)^2 \ \right] \\ &= \frac{\gamma}{2} \cdot \sum_{n=1}^N \left( m_n - \mu_n \right)^2 \cdot 1_{\scriptscriptstyle \mathtt{dim}[x_n]}^2 \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-b2c7f37d4c1b7945b5060f55b3472422_l3.png)
is a place holder which helps balance the units in the equation. One method for incorporating cognitive costs into Bill’s decision problem would be to charge him
units of utility per
unit of emphasis on each decision factor. So, for example, if
and Bill increased his production by
in GDP growth, then Bill would pay a cognitive cost of
where
which I denote
:![Rendered by QuickLaTeX.com \begin{align*} L^{\scriptscriptstyle \mathtt{BR}}(m,\mu;\alpha) &= \min_{m \in \mathcal{R}^N} \left\{ \ \frac{\gamma }{2} \cdot \sum_{n=N} (m_n - \mu_n)^2 \cdot 1_{\scriptscriptstyle \mathtt{dim}[x_n]}^2 + \kappa \cdot \sum_{n=1}^N \left( |m_n| \cdot 1_{\scriptscriptstyle \frac{\mathtt{dim}[x_n]}{\mathtt{cars}}} \right)^\alpha \ \right\} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-6c023b61dd62c2b95d56527a7b7cdc45_l3.png)
is constant with units
in order to balance the equation.
. Below, I give
with respect to an arbitrarily chosen dimension
for simplicity we the optimality condition:
:
whenever
.
curve is exactly equal to the slope of the dashed black line.
with the convention that
, then I would want to again set:

in
in the figure below. We can see for the blue solid line representing the
case, an incremental increase in
where
is just marginally greater than
where
the increase in the weighting factor
.
![Rendered by QuickLaTeX.com \begin{align*} 0 &= \partial_{m_n} L^{\scriptscriptstyle \mathtt{BR}}(m,\mu;1) \\ &= - (m_n - \mu_n) - \kappa \cdot \mathtt{sgn} [m_n] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-7e11f869715d416214a723970f380f19_l3.png)
denotes the sign operator which returns the sign of a real constant. Thus, we have 
will be ignored by the decision maker.
being the only power coefficient which delivered both sparsity and a tractable solution; however, this result depended on Bill’s utility gain to increasing his weighting
being linear. e.g., look at the black dashed line in the
and action
. For instance, in Bill’s problem above, this would be like allowing his value function
to be non-quadratic and then approximating his problem as quadratic around a reference
implying that his default decision is to ignore all variables except for last period’s demand and the current value of GDP and number of cars to produce
.
and
as:
. I then use these
which captures how much information is lost via the quadratic approximation:![Rendered by QuickLaTeX.com \begin{align*} \Lambda &= - \mathtt{E} \left[ V_{a,m} V_{a,a}^{-1} V_{a,m} \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-8601e840c81bd3471ec83e91da61a973_l3.png)
used in GMM to differentially interpret the error terms from each of the equations a la
term in the weighting matrix 
captures the cognitive cost of a model and is defined as:
plays the same role as
in the motivating example and simply controls the units and scale of the agent’s choice of representations. In general,
will not be problem specific in any material sense.
:
, he might look at the facts
described by the preferences below:![Rendered by QuickLaTeX.com \begin{align*} \mathtt{E}\left[ \ \sum_{t=0}^\infty \beta^t \cdot \frac{c_t^{1 - \gamma}}{1-\gamma} \ \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-a05f09627d32681282aaa60ba2fe9c67_l3.png)
where
for simplicity. Then, I can write this representative agent’s problem recursively as follows:![Rendered by QuickLaTeX.com \begin{align*} \overline{V}(w_t) &= U(c_t) + \beta \cdot \mathtt{E}_t \left[ \ V\left( (1 + \bar{r}) \cdot (w_t -c_t) + \bar{y} \right) \ \right] \\ w_{t+1} &= (1 + \bar{r}) \cdot (w_t - c_t) + \bar{y} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-730a0cf0bb7e807235e94d1c991a4876_l3.png)
is the agent’s value function given wealth level
and
is the constant endowment rate. In this world, it is easy to derive that the optimal consumption choice is given by:![Rendered by QuickLaTeX.com \begin{align*} \ln \bar{c}_t &= \ln \left[ \bar{r} \cdot w_t + \bar{y} \right] - \bar{r} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-3e3caefb37b719b65d5f3e4e15930d0a_l3.png)
and
where the idiosyncratic terms
and
evolve according to
processes below with mean
and
:
with ![Rendered by QuickLaTeX.com \begin{align*} V(w_t;r_t,y_t) &= u(c_t) + \beta \cdot \mathtt{E}_t \left[ \ V\left( (1 + \bar{r} + \hat{r}_t) \cdot (w -c) + \bar{y} + \hat{y}_t;r_{t+1},y_{t+1} \right) \ \right] \\ w_{t+1} &= (1 + \bar{r} + \hat{r}_t) \cdot (w_t - c_t) + \bar{y} + \hat{y}_t \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-7c011ba190062fc9c92182d1a3e7e67d_l3.png)
.

and
are given by the rule:
for
as it would have the interpretation that the agent would set
whenever taking the interest rate or endowment level into accound would change the standard deviation of log consumption by
. Thus, an sparsity seeking boundedly rational representative agent has a particular rule in mind when figuring out which higher order Taylor terms to ignore.
) tells us that a (log) stock price follows a semi-martingale if and only if there is no arbitrage where a semi-martingale is defined as follows:
defined on the filtered probability space
is called a semi-martingale if it can be decomposed:
is a local martingale and
is an adapted process with locally bounded variation.
![Rendered by QuickLaTeX.com \begin{align*} X_t &= X_0 + \underbrace{\int_0^t b_s \cdot ds}_{\text{``Drift''}} + \underbrace{\int_0^t \sigma_s \cdot dW_s}_{\text{``Brownian''}} + \underbrace{\int_0^t \int_{|x| \leq \epsilon} x \cdot (\mu - \nu)[ds,dx]}_{\text{\tiny ``Jump''}} + \underbrace{\int_0^t \int_{|x| > \epsilon} x \cdot \mu[ds,dx]}_{\text{\Large ``Jump''}} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-374cc0c03bf207af9655b7ffae6ef5f6_l3.png)
which marks the cutoff between large and small jumps is arbitrary, but must be fixed. In this paper, the authors develop a suite of statistical tools based on spectrographic analysis in order to examine a time series properties of high frequency stock data over the time interval
and determine whether or not the data has:
over the interval
seconds. For, notational convenience, the authors write the change in
to observation 
. For instance, if
then we’d have
.
defined below to decompose movements in 
. As
the smaller
the larger
the power variation estimator
the estimator is tuned to pick up jumps in
. Truncating large increments will eliminate large jumps from the series.
and
. For instance, if
, then the power variation will diverge to
for a particular power level
the power variation with converge to a finite value or to 

exists in the stochastic process
the power variation of
since only the size of the jumps not their frequency will matter. This intuition yields a test statistic
:

defined below with 

operator computes the nearest integer less than or equal to its argument. So for instance,
.
in units of
‘s at time
‘s whose evolution under the risk neutral measure is described by the equations below:
is the risk free rate in units of
,
is the instantaneous variance of the stock price in units of
and
are constants parameterizing the volatility of the variance of the stock price where
and 
be the payout of a European option on this stock with strike price
(in
as follows:![Rendered by QuickLaTeX.com \begin{align*} P(X,V;t) &= \mathtt{E} \left[ \ \phi(X_T) \ \middle| \ X_t, V_t \ \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-a78d3d0d03a60e88d21deeaa54b19741_l3.png)
there is no closed form solution for
by computing the difference between the infinitesimal generators of
where
.
be a time homogeneous Ito diffusion in
. The infinitesimal generator
of
is defined as:![Rendered by QuickLaTeX.com \begin{align*} \mathtt{A}[f(y)] &= \lim_{t \searrow 0} \left\{ \frac{\mathtt{E}[f(Y_t)] - f(y)}{t} \right\} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-212ca5eedc9ac967514d2d6213381754_l3.png)
into the future. The theorem below gives a method for computing the infinitesimal generator.
with a compact support then: ![Rendered by QuickLaTeX.com \begin{align*} \mathtt{A}[f(y)] &= \sum_{n=1}^N m_n(y) \cdot \partial_n[f(y)] + \frac{1}{2} \cdot \sum_{n=1}^N \sum_{n'=1}^N \left( s_n(y) s_{n'}(y)^{\top} \right) \cdot \partial_{n,n'}[f(y)] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-0579da7fdb5c34ee59d59893fbce10da_l3.png)
![Rendered by QuickLaTeX.com \begin{align*} 0 &= \overset{\mathtt{\scriptscriptstyle BS}}{\mathtt{A}}[\overset{\mathtt{\scriptscriptstyle BS}}{P}(X;t)] - r \cdot \overset{\mathtt{\scriptscriptstyle BS}}{P}(X;t) \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-7ce80403c6bf15d39487e06d9863b10a_l3.png)

as follows:
. Plugging this differential equation into the formula above yield:
![Rendered by QuickLaTeX.com \begin{align*} 0 &= \mathtt{A}[P(X,V;t)] - r \cdot P(X,V;t) \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-017cf3e7a03da9f1983c701c23bc809a_l3.png)

and defining
for shorthand yields an expression for
as:![Rendered by QuickLaTeX.com \begin{align*} \mathtt{d}f(y) &= \partial_t [f(y)] \cdot \mathtt{d}t + \partial_x [f(y)] \cdot \mathtt{d}X + \partial_v [f(y)] \cdot \mathtt{d}V \\ &\qquad \qquad + \frac{1}{2} \cdot \left\{ \ \partial_{x,x} [f(y)] \cdot (\mathtt{d}X)^2 + \partial_{x,v} [f(y)] \cdot (\mathtt{d}X \cdot \mathtt{d}V) + \partial_{v,v}[f(y)] \cdot (\mathtt{d}V)^2 \ \right\} \\ &= \partial_t [f(y)] \cdot \mathtt{d}t + \partial_x [f(y)] \cdot \left\{ r \cdot X \cdot \mathtt{d}t + V^{\scriptscriptstyle \frac{1}{2}} \cdot X \cdot \mathtt{d}B_x \right\} \\ &\qquad + \partial_v [f(y)] \cdot \left\{ \kappa \cdot (\alpha - V) \cdot \mathtt{d}t + \omega \cdot |V|^\xi \cdot \mathtt{d}B_v \right\} \\ &\qquad \qquad + \frac{1}{2} \cdot \left\{ \ \partial_{x,x} [f(y)] \cdot \left( V \cdot X^2 \right) \cdot \mathtt{d}t \right. \\ &\qquad \qquad \qquad \left. + \partial_{x,v} [f(y)] \cdot \left( \rho \cdot \omega \cdot V^{\xi + {\scriptscriptstyle \frac{1}{2}}} \cdot X \right) \cdot \mathtt{d}t \right. \\ &\qquad \qquad \qquad \qquad \left. + \partial _{v,v} [f(y)] \cdot \left( \omega^2 \cdot V^{2 \cdot \xi} \right) \cdot \mathtt{d}t \ \right\} \\ &= \Big\{ \ \partial_t [f(y)] + \partial_x [f(y)] \cdot r \cdot X + \partial_v [f(y)] \cdot \kappa \cdot (\alpha - V) \\ &\qquad \qquad + \frac{1}{2} \cdot \left( \partial_{x,x} [f(y)] \cdot \left( V \cdot X^2 \right) \right. \\ &\qquad \qquad \qquad \left. + \partial_{x,v} [f(y)] \cdot \left( \rho \cdot \omega \cdot V^{\xi + {\scriptscriptstyle \frac{1}{2}}} \cdot X \right) \right. \\ &\qquad \qquad \qquad \qquad \left. + \partial _{v,v} [f(y)] \cdot \left( \omega^2 \cdot V^{2 \cdot \xi} \right) \right) \ \Big\} \cdot \mathtt{d}t \\ &\qquad \qquad + \left\{ \partial_x [f(y)] \cdot V^{\scriptscriptstyle \frac{1}{2}} \cdot X \right\} \cdot \mathtt{d}B_{x,t} \\ &\qquad \qquad \qquad + \left\{ \partial_v [f(y)] \cdot \omega \cdot |V|^\xi \right\} \cdot \mathtt{d}B_{v,t} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-3eb871e380685c7fe76273afd9ad5e7d_l3.png)
which given
yields the expression:![Rendered by QuickLaTeX.com \begin{align*} \delta(X,V,\sigma;t) &= \mathtt{A}\left[ P(X,V;t) - \overset{\mathtt{\scriptscriptstyle BS}}{P}(X;t) \right] - r \cdot \left(P(X,V;t) - \overset{\mathtt{\scriptscriptstyle BS}}{P}(X;t) \right) \\ &= \frac{(\sigma^2 - V_t) \cdot X_t^2}{2} \cdot \partial_{x,x}[ \overset{\mathtt{\scriptscriptstyle BS}}{P}(X;t)] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-3239af90338f171a08a7dd93d8df13e8_l3.png)
for all
has the natural interpretation of the hedging cost incurred by a trader continuously hedging the risk of the European option with stochastic volatility using the incorrect constant volatility model. Thus, we can write the true price as the Black and Scholes price plus an error correction term: ![Rendered by QuickLaTeX.com \begin{align*} P(X,V;t) &= \overset{\mathtt{\scriptscriptstyle BS}}{P}(X;t) + \mathtt{E}\left[ \ \int_t^T e^{-r \cdot (u-t)} \cdot \delta(X,V;u) \cdot \mathtt{d}u \ \middle| \ X_t, V_t \ \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-8164f09b6a2b8686506a51dc8ae8807f_l3.png)

![Rendered by QuickLaTeX.com \begin{align*} \delta_n(X,V;t) &= \mathtt{A}[\delta_{n-1}(X,V;t)] - r \cdot \delta_{n-1}(X,V;t) \\ \delta_0(X,V;t) &= \frac{(\sigma^2 - V_t) \cdot X_t^2}{2} \cdot \partial_{x,x}[ \overset{\mathtt{\scriptscriptstyle BS}}{P}(X;t)] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-94ba84d7aec75442424e83a183d0479d_l3.png)

![Rendered by QuickLaTeX.com \begin{align*} P_0(X,V;t) &= \overset{\mathtt{\scriptscriptstyle BS}}{P}(X;t) + \frac{(\sigma^2 - V_t) \cdot X_t^2}{2} \cdot \left( \frac{N(d_1)}{X \cdot \sigma} \right) \\ P_1(X,V;t) &= P_0(X,V;t) + \frac{1}{2} \cdot \left[ \frac{\kappa \cdot (\alpha - V_t) \cdot X_t^2}{2} \cdot \left( \frac{N(d_1)}{X \cdot \sigma} \right) + \omega \cdot \rho \cdot V_t^{\xi + {\scriptscriptstyle \frac{1}{2}}} \cdot X_t \cdot \left\{ X_t \cdot \right\} \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-bfd2c7b475243c674b237be9eab69b82_l3.png)
to kill off the
term in my numerical analysis. Below I replicate Figure
and plot the mispricing as a fraction of the Black and Scholes (1973) price with
for
. The strike price is
, the time to maturity is
,
,
,
and
.
iterations at
price points between
and
.
Geoffrey Taylor used the theorem to work out the energy payload released by the 1945 
, which is some unknown function of
.
fundamental dimensions labeled
. Thus, I can define a dimension operator which gives the dimensions of an arbitrary variable
and write its output as:![Rendered by QuickLaTeX.com \begin{align*} \mathtt{dim}[z] &= \prod_{m=1}^{M} c_m^{a_m} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-301820102a65da59da5b9b73b24019eb_l3.png)
is measuring pressure on the surface of a table, I could write
where
,
,
and
.![Rendered by QuickLaTeX.com \begin{align*} \mathtt{dim}[z] &= 1 \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-bdc2521191c6564c3bf8a38ea56ab8ec_l3.png)
as a function of dimensionless quantities. First, I define
as an
matrix of dimensional exponents for
and
vector of dimensional exponents for 
solutions to the system of equations
. Define
matrix of solutions.
:
with
fundamental dimensions can be restated as:
,
is an unknown function and
are dimensionless parameters constructed from the physical parameters
using equations of the form below:
with variables:
: Time elapsed since the explosion took place,
: Initial pressure.
and
with fundamental dimensions of length
, mass 
, we have only 
as:
is in units of distance yielding the 
unknowns:
:
and tuning
, I can compute ![Rendered by QuickLaTeX.com \begin{align*} \pi &= \delta \cdot \left[ \frac{\epsilon \cdot \tau^2}{\rho} \right]^{-1/5} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-9a8f5f9350b1d0849063950119a56b2d_l3.png)
and an unknown function of the dimensionless quantity
:![Rendered by QuickLaTeX.com \begin{align*} \delta &= \left[ \frac{\epsilon \cdot \tau^2}{\rho} \right]^{1/5} \cdot g \left( \pi_1 \right) \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-fb291bc669a7adf9f8d8af5934fd5601_l3.png)
where
yields a formulation where
with scaling constant 
, the
plot of
‘s in the plot below:
at a random date
. There is an informed trader and a stream of uninformed traders who arrive with Poisson intensity
. The model end date
denote the net position of the noise traders up to but not including date
denote the net position of the informed up to but not including date
so that
generates a
which represents the market maker’s information set.
. Let
and
denote the bid and ask prices at time ![Rendered by QuickLaTeX.com \begin{align*} a_t &= \mathbb{E} \left[ \ v \ \middle| \ \mathcal{F}_{t-}^y, \ dy_t = 1 \ \right] \\ b_t &= \mathbb{E} \left[ \ v \ \middle| \ \mathcal{F}_{t-}^y, \ dy_t = -1 \ \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-8a95021aefdabff55ab532509a468273_l3.png)
be the left limit of the price
at time
. The informed trader chooses a trading strategy
in order to maximize his end of game wealth at random date
and let
.![Rendered by QuickLaTeX.com \begin{align*} w &= \max_{\{dx_t\}_{t \leq \tau}} \left\{ \mathbb{E} \left[ \ \int_0^\tau \left( v - a_t \right) \cdot dx_t^+ + \int_0^\tau \left( b_t - v \right) \cdot dx_t^- \ \middle| \ v \ \right] \right\} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-95b3f533c03894b98fe31e3685ce3fb9_l3.png)
![Rendered by QuickLaTeX.com \begin{align*} \infty > \max_{\{dx_t\}_{t \leq \tau}} \left\{ \mathbb{E} \left[ \ \int_0^\tau \left( v - a_t \right) \cdot dx_t^+ \ \middle| \ v = 1 \ \right] \right\} \\ \infty > \max_{\{dx_t\}_{t \leq \tau}} \left\{ \mathbb{E} \left[ \ \int_0^\tau \left( b_t - v \right) \cdot dx_t^- \ \middle| \ v = 0 \ \right] \right\} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-1f40c9a1e2fdb2d18709b3514416b03a_l3.png)
, an equilibrium consists of a pair of bid and ask prices,

![Rendered by QuickLaTeX.com \begin{align*} 0 &= \mathbb{E} \left[ \ x_t^+ - v \cdot \int_0^t \theta_{H,B} \left( p_{s-} \right) \cdot ds - \left( 1 - v \right) \cdot \int_0^t \theta_{L,B} \left( p_{s-}\right) \cdot ds \ \middle| \ \mathcal{F}_{t-}^y \ \right] \\ 0 &= \mathbb{E} \left[ \ x_t^- - v \cdot \int_0^t \theta_{H,S} \left( p_{s-} \right) \cdot ds - \left( 1 - v \right) \cdot \int_0^t \theta_{L,S} \left( p_{s-}\right) \cdot ds \ \middle| \ \mathcal{F}_{t-}^y \ \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-57fe3886bd6d1a5da7e9b88606231f00_l3.png)
and
subscripts denote the realized value and trade directions for the informed traders. So, for example,
denotes the trading intensity at some time 
and
denote the value functions of the high and low type informed traders respectively.
for all
with
and
since:![Rendered by QuickLaTeX.com \begin{align*} \mathbb{E} \left[ \ v \ \middle| \ \mathcal{F}_{t-}^y, \ dy_t = 1 \ \right] \geq \mathbb{E} \left[ \ v \ \middle| \ \mathcal{F}_{t-}^y \ \right] \geq \mathbb{E} \left[ \ v \ \middle| \ \mathcal{F}_{t-}^y, \ dy_t = -1 \ \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-d7f4c383a065102eb5e64f8cdb5dde5e_l3.png)
and
. What’s more,
and
are absorbing points for
while
.
or from
so we can think about the stochastic process
as composed of a deterministic drift component
and
and
.![Rendered by QuickLaTeX.com \begin{align*} \mathbb{E}\left[ dp \right] &= \mu(p) \cdot dt + \lambda_a \cdot \left\{ a(p) - p \right\} \cdot dt + \lambda_b \cdot \left\{ b(p) - p \right\} \cdot dt \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-eefa0de62a74cc1218cd11af65838e6f_l3.png)
and
can be computed using Bayes’ rule.
and
from above yields an expression for the price change that is purely in terms of the trading intensities and the price.![Rendered by QuickLaTeX.com \begin{align*} \mathbb{E}\left[dp\right] &= \mu(p) \cdot dt + p \cdot \left( 1 - p \right) \cdot \left( \theta_{H,B}(p) + \theta_{H,S}(p) - \theta_{L,B}(p) - \theta_{L,S}(p) \right) \cdot dt \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-c44af83ee8a9d4725cf26fbcf95132d1_l3.png)
.


. There are
, the value change due to an uninformed trader placing a buy order with probability 
conditions pins down the equilibrium.
is a non-increasing function of
is a non-decreasing function of
, then the trading strategies are optimal for all
and
for simplicity. I compute the value functions
denote the vector of 
and
denote the vector of value function levels over each point in the price grid
. I use the teletype style
denotes the level of the value function at price point
after 
and
.
which I define in Step 5 below is sufficiently small. The estimation strategy uses the fixed point problem in Equation (13) to compute 

and
at each point.


and
linearly. Let
and let
be the closest price level to
.
using Equation (9).
and
.
and
in terms of only prices.

and ensure that Equation (14) is satisfied. If the high type informed traders want to sell at price
.![Rendered by QuickLaTeX.com \begin{align*} &\mathtt{if} \Big[ w_H(p_n;\mathtt{i}) < \left( b(p_n;\mathtt{i}) - 1 \right) + w_H \left( b(p_n;\mathtt{i});\mathtt{i}\right) \Big] \ \{ \\ &\qquad \qquad \mu(p_n;\mathtt{i}) = \left( 1 + \alpha \right) \cdot \mu(p_n;\mathtt{i}) \\ &\} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-817d64dba1bc6fb4f6bc88fe4e9b6049_l3.png)
![Rendered by QuickLaTeX.com \begin{align*} &\mathtt{if} \Big[ w_L(p_n;\mathtt{i}) < -a(p_n;\mathtt{i}) + w_L \left( a(p_n;\mathtt{i});\mathtt{i} \right) \Big] \ \{ \\ &\qquad \qquad \mu(p_n;\mathtt{i}) = \left( 1 - \alpha \right) \cdot \mu(p_n;\mathtt{i}) \\ &\} \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-4bedac5f575c02d4e72fed25c5e0a333_l3.png)
by adding
times the between trade indifference error from Equation (15).


month pair. I restrict my attention to the time period from January 1988 to December 2010 to focus on the period of time over which the Fama and French (1988) industry classification scheme would have been widely known. I keep only actively traded firms listed on the NYSE, NASDAQ and AMEX exchanges. I require that the firm reports a non-missing price, return, share count and SIC code for a given month. I also remove any observations which lack valid data in the previous month. This leaves me with
total firm
firms. The figure below plots the total number of firms in the dataset each month.
firm industry classification used by
different clusters and one for the Moskowitz and Grinblatt (1999) scheme with
industry from the Fama and French (1988) scheme:
and
tags denote the initial and ending SIC codes for each subindustry. The Moskowitz and Grinblatt (1999) scheme is less complex. There is a simple start and stop date for each of the
industries except for Candy and Soda, Coal, Non-Metalic and Industrial Mining, Pharmaceutical Products, Precious Metals and Trading display a single peaked pattern indicating that the number of firms in each industry dramatically expanded around 2000.
where
is the
.
.
litres of milk.
with
denoting the set of all assets. Agents get (possibly time dependent) utility from holding different combinations of the
agents indexed by
with the set of all agents denoted by
. Every agent has an inventory of products in the quantity
for asset
denoting the vector of holdings for agent
. Let
denote the set of all possible states with
.
denote an asset specific positive scalar constant. Then, by scale invariance I mean that the economy should be unchanged if for every agent
, we multiply the agent’s holdings of asset
:
, no actual real outcomes should be changed. This restriction will imply that all essential functions will be homogeneous of degree
denote the beliefs of agent
with entries denoted by
which denotes the number of units of good
denote the set of assets for which agent
matrix. Thus, 

of scalar constants used to renormalize the asset units leaves the equilibrium allocations unchanged.

as defined below:
with
as agent
and
for each asset
where
for brevity. Also, suppose that between period
, an agent
, then I will abbreviate the corresponding change in happiness as:
![Rendered by QuickLaTeX.com \begin{align*} U_n \left( X_n \right) &= \phi \cdot \ln \left[ \Psi_n^{\top} X_n \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-795238dd2ab595c6d31aa62fe5aec79a_l3.png)
is a
vector of free parameters. For instance, consider the utility specification below which equates agent
:![Rendered by QuickLaTeX.com \begin{align*} U_n &= \ln \left[ \sum_{a \in \mathcal{A}} \left( x_{n,a} \cdot m_{n,a:1} \right)^{\phi} \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-2ff816706c666703e920b000aae2e0cc_l3.png)
is an arbitrary economic operator, then we have that:
which removes asset holdings but increases utility,
which recombines asset holdings at a net surplus,
which exchanges asset holdings between agents, or…
which removes asset holdings but does not compensate agents with a utility boost.
terminology used to denote changes in satisfaction due to changes in holdings to be operator specific. In particular, for an arbitrary economic operator
as:![Rendered by QuickLaTeX.com \begin{align*} \Delta_{\mathbb{H}} V_n &= \mathbb{E} \left[ U \left( \mathbb{H} X_n \right) - U \left( X_n \right) \mid \mathcal{I}_{\alpha} \right] \end{align*}](http://www.alexchinco.com/wp-content/ql-cache/quicklatex.com-c5fc91b0bb6d2e34f2ecb3f918902297_l3.png)
matrix of allocations
as well as a
symmetric matrix of exchange rates
with a unit diagonal representing
unique elements such that given the exchange rate matrix 

is given by:
is a agent dependent positive constant.
:
. Then, an equilibrium would represent a fixed point such that the following 
around its true fixed point yields first and second order terms:
in its mirror image yields the equations above.
pairwise exchange rates
such that given each exchange rate, we have that:

exchange as a result of a bargaining process
is given by:

