07 — Kim, Korajczyk & Neuhierl (2020)

Arbitrage Portfolios

Review of Financial Studies 34(6), 2813–2856 · Kei Matsumae · 2026-05-15

What this paper does

Constructs arbitrage portfolios directly from a panel of characteristics — long high-predicted-return stocks, short low — and tests whether they earn returns unexplained by standard risk factors.
Complementary to CRW: sidesteps the structural factor model and goes directly to portfolio formation. Less elegant, more pragmatic.
Uses 50+ characteristics in a regularised cross-sectional regression.
Arbitrage portfolios earn annualised alpha 8–15% vs FF5, robust across decades, survives moderate transaction costs.
Triangulation for CRW. Two methodologically different estimators, same conclusion: characteristic-driven mispricing exists at exploitable scale.

1. Why this paper exists

CRW (2023) tests $\alpha(\cdot) = 0$ in a fully structural factor model. KKN asks the simpler, more practical question: if I take a vector of stock characteristics and build the optimal portfolio, does it earn alpha after standard factor controls?

Motivation: many anomaly studies look at one characteristic at a time. KKN's portfolio captures the joint information from all characteristics in a single instrument. Testing one portfolio for alpha is more powerful than testing each characteristic individually.

2. The model

Posit a predictive relationship:

$$ \mathbb{E}[r_{i, t+1} \mid z_{it}] \;=\; z_{it}'\,\theta. $$

The arbitrage portfolio has weights:

$$ w_{it} \;\propto\; z_{it}'\,\theta - \overline{z_t'\theta}, $$

i.e. tilted long toward stocks whose characteristics predict high returns, short stocks whose characteristics predict low returns, demeaned each month for dollar-neutrality.

Portfolio return: $R^{\text{arb}}_{t+1} = \sum_i w_{it}\, r_{i, t+1}$.

3. Estimating $\theta$

High-dimensional cross-sectional regression. KKN use regularised regression:

Lasso, ridge, or elastic-net of $r_{i, t+1}$ on $z_{it}$, pooled across $t$.
Optionally a kernel-localised version.
Out-of-sample portfolios from rolling re-estimation (avoid look-ahead).

Even with $K \approx 50$ characteristics and $T \approx 600$ months, $\hat\theta$ is informative enough to build economically large portfolios.

4. Empirical findings

Sample: US individual stocks, ~50 characteristics, 1965–2014.

Test	Result
Raw arbitrage portfolio Sharpe	1.5–2.5
FF5-adjusted alpha (Sharpe of residuals)	~1.5
Annualised alpha vs FF5	8–15%
Survives transaction costs (30 bps)	Yes
Robust across decades	Yes, declining magnitude post-2000
Stable across sub-universes (size, sector)	Mostly yes

Claim: too large and too stable to be a missing-risk-factor story of plausible content. Either (a) genuine mispricing, or (b) a peculiar yet-uncaptured risk factor. KKN lean toward (a).

5. Comparison with CRW

	KKN (2020)	CRW (2023)
What it estimates	Predictive coefficient $\theta$ + portfolio weights	Full factor model: $\alpha(\cdot), \beta(\cdot), f_t$
Functional form	Linear in $z$ (or kernel)	Nonparametric via sieves
Test for mispricing	TS alpha of arbitrage portfolio	Wald test of $\alpha(\cdot)=0$ in structural model
Asymptotics	Standard large-$N$-large-$T$	Fixed-$T$ via bootstrap
Empirical conclusion	Mispricing exists, exploitable	Mispricing exists, Sharpe > 3 on pure-$\alpha$
Strength	Direct, intuitive, implementable	Structural, decomposes risk vs alpha cleanly
Weakness	No clean attribution of alpha source	More machinery; harder to explain

Triangulation value. Two methodologically different procedures, same economic-scale finding of mispricing — much stronger evidence than either alone.

6. Connection to other papers in this series

Freyberger-Neuhierl-Weber (2020) — provides characteristic panel; KKN extends by going directly to portfolio formation.
Kelly-Pruitt-Su (2019) IPCA — opposite conclusion (no mispricing) on similar data. KKN directly counters IPCA's interpretation.
CRW (2023) — independent confirmation of mispricing-exists conclusion.

7. What this gives the AOF model

A simpler initial implementation path. Before building the full CRW factor model, AOF can build a KKN-style arbitrage portfolio in ~50 lines of Python (Lasso + monthly rebalance). If this earns substantial after-cost Sharpe, you have early validation before investing in the full CRW machinery.
An out-of-sample reality check. Once CRW is running, compute the pure-$\alpha$ portfolio from CRW's $\hat\alpha(\cdot)$ AND the KKN arbitrage portfolio on the same out-of-sample window. Returns should be highly correlated (~0.6–0.8 in our prior). If uncorrelated, something's wrong with one estimator.

For Japan: KKN-style is much easier to run on JP first than full CRW — tolerates missing characteristics (Lasso drops them), no basis-design decision.

8. Reading next

CRW (2023) — the full structural treatment.
Back to learning base index for the synthesis.

← 学習ベース index · 8 本中 7 本目

07 — Kim, Korajczyk & Neuhierl (2020)

アービトラージ・ポートフォリオ

Review of Financial Studies 34(6), 2813–2856 · 松前景一郎 · 2026-05-15

論文の要点

特性パネルから直接アービトラージポートフォリオを構築 — 予測リターンの高い銘柄をロング、低い銘柄をショート — 標準的リスクファクターでは説明できないリターンを得るか検定。
CRW に対して補完的：構造的ファクターモデルを迂回しポートフォリオ形成へ直行。エレガントさは劣るが実務的。
50 超の特性を正則化クロスセクション回帰で使用。
アービトラージポートフォリオは FF5 比で年率 8〜15% のアルファ、数十年にわたりロバスト、moderate な取引コスト下でも生存。
CRW の三角検証。方法論的に異なる 2 つの推定量が同じ結論：特性駆動のミスプライシングは exploit 可能な規模で存在。

1. なぜこの論文が必要か

CRW (2023) はフル構造ファクターモデルで $\alpha(\cdot) = 0$ を検定。KKN はよりシンプルかつ実務的な問いに答える：銘柄特性ベクトルから最適ポートフォリオを構築したとき、それは標準的ファクター控除後にアルファを得るか？

動機：多くのアノマリー研究は特性を 1 つずつ見る。KKN のポートフォリオは全特性の同時情報を単一のインストルメントに集約。1 つのポートフォリオを検定するほうが、個々の特性を検定するより検出力が高い。

2. モデル

予測関係を仮定：

$$ \mathbb{E}[r_{i, t+1} \mid z_{it}] \;=\; z_{it}'\,\theta. $$

アービトラージポートフォリオのウェイト：

$$ w_{it} \;\propto\; z_{it}'\,\theta - \overline{z_t'\theta}, $$

特性が高リターンを予測する銘柄に正、低リターンを予測する銘柄に負に傾斜。各月で平均除去してドルニュートラル。

ポートフォリオリターン：$R^{\text{arb}}_{t+1} = \sum_i w_{it}\, r_{i, t+1}$。

3. $\theta$ の推定

高次元クロスセクション回帰問題。KKN は正則化回帰を用いる：

$r_{i, t+1}$ を $z_{it}$ に Lasso・リッジ・エラスティックネット回帰（$t$ で pool）。
選択肢としてカーネル局所化バージョン。
ローリング再推定で out-of-sample ポートフォリオ構築（look-ahead 回避）。

$K \approx 50$ 特性、$T \approx 600$ 月でも、$\hat\theta$ は経済的に大きなポートフォリオを構築するに足る情報を持つ。

4. 実証結果

サンプル：米国個別株、約 50 特性、1965–2014。

検定	結果
生のアービトラージポートフォリオのシャープ	1.5–2.5
FF5 調整後アルファ（残差のシャープ）	~1.5
FF5 比年率アルファ	8–15%
取引コスト 30bps で生存	はい
数十年で頑健	はい、2000 年以降は大きさが低下
サブユニバース（規模・業種）で安定	概ねはい

主張：規模・安定性の両面で、もっともらしい内容の欠落リスクファクターでは説明しきれない。(a) 純粋なミスプライシング、または (b) いまだ捉えられていない奇妙なリスクファクター。KKN は (a) に傾く。

5. CRW との比較

	KKN (2020)	CRW (2023)
推定対象	予測係数 $\theta$ ＋ポートフォリオウェイト	フルファクターモデル：$\alpha(\cdot), \beta(\cdot), f_t$
関数形	$z$ について線形（またはカーネル）	篩によるノンパラメトリック
ミスプライシング検定	アービトラージポートフォリオの TS アルファ	構造モデルの $\alpha(\cdot)=0$ Wald 検定
漸近	標準的大 $N$・大 $T$	ブートストラップで固定 $T$
実証結論	ミスプライシング存在、exploit 可能	同左、純 $\alpha$ でシャープ > 3
強み	直接的・直感的・実装容易	構造的、リスク vs アルファをクリーンに分解
弱み	アルファの出所のクリーンな帰属がない	機械装置が大きい、説明が難しい

三角検証の価値。方法論的に異なる 2 つの手続きが、同じ経済的規模でミスプライシングを発見 — どちらか単独より証拠は遥かに強い。

6. 本シリーズ内での位置づけ

Freyberger-Neuhierl-Weber (2020) — 特性パネルを提供；KKN はポートフォリオ形成へ直接進む。
Kelly-Pruitt-Su (2019) IPCA — 類似データで逆の結論（ミスプライシングなし）。KKN は IPCA の解釈に直接反論。
CRW (2023) — ミスプライシング存在の独立した確認。

7. AOF モデルへの貢献

シンプルな初期実装パス。フル CRW ファクターモデルを構築する前に、AOF チームは Python 約 50 行で KKN 流アービトラージポートフォリオを構築可能（Lasso ＋月次リバランス）。これが取引コスト後に顕著なシャープを稼げば、CRW 機械装置に投資する前にテーゼの早期検証が得られる。
Out-of-sample のリアリティチェック。CRW 稼働後、同じ OOS 窓で CRW の $\hat\alpha(\cdot)$ 由来の純 $\alpha$ ポートフォリオと KKN アービトラージポートフォリオの両方を計算。リターンは強相関のはず（事前見立て 0.6〜0.8）。無相関ならどちらかの推定量がおかしい。

日本：KKN 流推定はフル CRW モデルより日本で先に実装するのがはるかに容易 — 特性欠損に寛容（Lasso が落とす）、基底設計の意思決定不要。

8. 次に読むべきもの

CRW (2023) — フル構造処理。
学習ベース index に戻る（統合論へ）。