← Learning base index · 06 of 8

06 — Kelly, Pruitt & Su (2019) — IPCA

Characteristics Are Covariances: A Unified Model of Risk and Return

Journal of Financial Economics 134(3), 501–524 · Kei Matsumae · 2026-05-15

What this paper does

IPCA = Instrumented Principal Component Analysis. The "instruments" are firm characteristics.
Model: $r_{i,t+1} = \alpha_{it} + \beta_{it}' f_{t+1} + \varepsilon_{i,t+1}$, with $\alpha_{it} = z_{it}'\Gamma_\alpha$ and $\beta_{it} = z_{it}'\Gamma_\beta$ — linear in characteristics.
Estimator: alternating least squares for $\Gamma_\alpha, \Gamma_\beta, f$ jointly.
Claim: 5-factor IPCA explains the cross-section of US individual stocks vastly better than FF5.
Headline thesis — "characteristics are covariances" — characteristics' predictive power comes through time-varying loadings, not through mispricing.
Direct benchmark for CRW (2023). CRW competes on the same data; relaxes linearity and reaches opposite conclusion.

1. Why this paper exists

By 2017 hundreds of characteristics had predictive power documented. Two questions:

Can they be organised into a low-dimensional factor structure?
Are characteristic-return relationships risk (time-varying betas) or mispricing (non-zero alphas)?

KPS say yes to (1) and decisively answer "risk" to (2) — characteristics are covariances.

2. The model

$$ r_{i, t+1} \;=\; \alpha_{it} \;+\; \beta_{it}'\, f_{t+1} \;+\; \varepsilon_{i, t+1}, $$

with the identifying linear restrictions:

$$ \alpha_{it} = z_{it}'\,\Gamma_\alpha, \qquad \beta_{it} = z_{it}'\,\Gamma_\beta. $$

Symbol	Meaning
$z_{it}$	$L \times 1$ characteristics (lagged)
$\Gamma_\alpha$	$L \times 1$ characteristic-to-alpha mapping
$\Gamma_\beta$	$L \times K$ characteristic-to-loading mapping
$f_{t+1}$	$K \times 1$ latent factors
$K$	number of factors (KPS advocate 5)

Two specifications. Restricted IPCA-R imposes $\Gamma_\alpha = 0$ (no mispricing). Unrestricted IPCA-U lets $\Gamma_\alpha \neq 0$. The Wald test of $\Gamma_\alpha = 0$ is KPS's "mispricing test".

3. The estimator — alternating least squares

The model is non-convex in $(\Gamma_\alpha, \Gamma_\beta, f)$ jointly but bi-linear: given any two, OLS gives the third in closed form.

3.1 Step 1 — Given $\Gamma_\alpha, \Gamma_\beta$, estimate $f_{t+1}$

$$ \hat f_{t+1} \;=\; \bigl(\Gamma_\beta'\, Z_t' Z_t\, \Gamma_\beta\bigr)^{-1} \Gamma_\beta'\, Z_t'\, (R_{t+1} - Z_t \Gamma_\alpha). $$

3.2 Step 2 — Given $f$, estimate $\Gamma_\alpha, \Gamma_\beta$

$$ [\hat\Gamma_\alpha,\, \hat\Gamma_\beta] \;=\; \arg\min \sum_{i,t} \bigl( r_{i,t+1} - z_{it}'\Gamma_\alpha - z_{it}'\Gamma_\beta\, f_{t+1}\bigr)^2. $$

Iterate to convergence; provably finds a stationary point of the joint MSE.

4. What IPCA is really doing

Define characteristic-managed portfolios $x_{t+1} = Z_t' R_{t+1} / N$:

$$ x_{t+1} \;=\; (Z_t' Z_t / N) \Gamma_\alpha \;+\; (Z_t' Z_t / N) \Gamma_\beta \, f_{t+1} \;+\; (Z_t' \varepsilon_{t+1} / N). $$

So IPCA is really PCA on the time series of $L$ characteristic-managed portfolios. Similar in spirit to CRW's regressed-PCA — but IPCA uses the original characteristics $Z_t$ as the projection basis, while CRW uses a flexible basis $\phi(z_t)$.

5. Empirical findings

Sample: same 36-char × 12,813 stock × Sep-1962 to May-2014 monthly panel as FNW and CRW.

Specification	Total $R^2$	MV Sharpe	Conclusion
FF-5 (pre-specified)	low	moderate	Inferior to IPCA
IPCA-R, $K=5$ (no alpha)	~17%	high	Strong fit
IPCA-U, $K=5$ (with alpha)	~17.8%	high	Marginally better than R
IPCA-R vs IPCA-U Wald	—	—	Cannot reject $\Gamma_\alpha = 0$ at conventional levels

Headline claim: IPCA-R fit is so good that residual alpha improvement is small. So characteristics' expected-return role is captured by time-varying loadings on 5 latent factors, not mispricing.

The 5 IPCA factors plot economically — loosely market, size, value-like, momentum-like, quality-like — but are not exactly FF factors; they are the empirically-best-fit latent versions.

6. The CRW critique

CRW (2023) point out three issues with IPCA's "no mispricing" claim:

Linearity. $\alpha = z'\Gamma_\alpha$ forces a linear shape on mispricing. If true $\alpha(z)$ is nonlinear, the linear restriction can absorb genuine alpha into spurious betas. CRW's nonparametric specification reveals nonzero alphas.
Joint objective. IPCA's TS+XS joint MSE lets factors absorb characteristics' average-return pattern. CRW extracts factors explaining comovement only, leaving cross-sectional patterns for $\alpha$.
K selection. KPS pre-commit to $K=5$. CRW's data-driven eigenvalue-ratio selector picks $K=1$ or $2$ on the same data.

CRW empirical conclusion is the opposite: characteristics carry both risk loadings and non-zero mispricing; pure-$\alpha$ portfolios earn Sharpe > 3.

7. Connection to other papers in this series

Connor-Linton (2007) — IPCA's conceptual ancestor; KPS modernise with linear form and time-varying $z$.
Freyberger-Neuhierl-Weber (2020) — provides IPCA's 36-char panel.
Bai-Ng (2002), Bai (2003) — IPCA's asymptotic theory builds on these.
CRW (2023) — direct critique and alternative.

8. What this gives the AOF model

The pre-existing benchmark. Any AOF factor model has to be compared against IPCA on the same data. First AOF deliverable: "regressed-PCA gives X, IPCA gives Y; here's why and what the differences mean".
Reference for alternating-least-squares machinery. Even if AOF runs regressed-PCA, the IPCA codebase is useful as a robustness check (test for $\alpha = 0$ under linearity).
A cautionary tale. Pre-committing to a specific $K$ (FF style) can flip empirical conclusions about mispricing. The AOF pipeline should always select $K$ from data and report both R and U fits.

Implementation: ~150 lines of Python. Public reference implementations exist (Bryan Kelly's GitHub). For AOF, run IPCA in parallel with regressed-PCA as a "reality check" panel.

9. Reading next

Kim, Korajczyk & Neuhierl (2020) — alternative semiparametric estimator that also disputes the IPCA "no mispricing" conclusion.
CRW (2023) — the full critique and replacement.

← 学習ベース index · 8 本中 6 本目

06 — Kelly, Pruitt & Su (2019) — IPCA

特性は共分散である：リスクとリターンの統一モデル

Journal of Financial Economics 134(3), 501–524 · 松前景一郎 · 2026-05-15

論文の要点

IPCA = Instrumented PCA。「インストルメント」は企業特性。
モデル：$r_{i,t+1} = \alpha_{it} + \beta_{it}' f_{t+1} + \varepsilon_{i,t+1}$、$\alpha_{it} = z_{it}'\Gamma_\alpha$、$\beta_{it} = z_{it}'\Gamma_\beta$ — 特性について線形。
推定量：$\Gamma_\alpha, \Gamma_\beta, f$ について交互最小二乗。
主張：5 ファクター IPCA が米国個別株のクロスセクションを FF5 よりはるかに良く説明。
主題 — 「特性は共分散である」 — 特性のリターン予測力は時間変動するローディングを通じてであり、ミスプライシングではない。
CRW (2023) の直接の比較対象。同じデータで競争、線形性を緩めて逆の結論に到達。

1. なぜこの論文が必要か

2017 年時点で数百の特性が予測力を文書化されていた。2 つの問い：

これらは低次元のファクター構造に整理可能か？
特性とリターンの関係はリスク（時間変動ベータ）かミスプライシング（非ゼロアルファ）か？

KPS は (1) に「然り」、(2) に断固「リスク」と答える — 特性は共分散である。

2. モデル

$$ r_{i, t+1} \;=\; \alpha_{it} \;+\; \beta_{it}'\, f_{t+1} \;+\; \varepsilon_{i, t+1}, $$

識別のための線形制約：

$$ \alpha_{it} = z_{it}'\,\Gamma_\alpha, \qquad \beta_{it} = z_{it}'\,\Gamma_\beta. $$

記号	意味
$z_{it}$	$L \times 1$ 特性（ラグ付き）
$\Gamma_\alpha$	$L \times 1$ 特性→アルファ写像
$\Gamma_\beta$	$L \times K$ 特性→ローディング写像
$f_{t+1}$	$K \times 1$ 潜在ファクター
$K$	ファクター数（KPS は 5 を主張）

2 つの仕様。制約付き（IPCA-R）は $\Gamma_\alpha = 0$ を課す（ミスプライシングなし）。制約なし（IPCA-U）は $\Gamma_\alpha \neq 0$ を許容。$\Gamma_\alpha = 0$ の Wald 検定が KPS の「ミスプライシング検定」。

3. 推定量 — 交互最小二乗

$(\Gamma_\alpha, \Gamma_\beta, f)$ について同時には非凸だが双線形：2 つ与えれば、3 つ目は OLS で閉形式。

3.1 第 1 段階 — $\Gamma_\alpha, \Gamma_\beta$ 既知で $f_{t+1}$ を推定

$$ \hat f_{t+1} \;=\; \bigl(\Gamma_\beta'\, Z_t' Z_t\, \Gamma_\beta\bigr)^{-1} \Gamma_\beta'\, Z_t'\, (R_{t+1} - Z_t \Gamma_\alpha). $$

3.2 第 2 段階 — $f$ 既知で $\Gamma_\alpha, \Gamma_\beta$ を推定

$$ [\hat\Gamma_\alpha,\, \hat\Gamma_\beta] \;=\; \arg\min \sum_{i,t} \bigl( r_{i,t+1} - z_{it}'\Gamma_\alpha - z_{it}'\Gamma_\beta\, f_{t+1}\bigr)^2. $$

収束まで反復。同時 MSE の定常点に証明可能に到達。

4. IPCA が本当にやっていること

特性マネージドポートフォリオ $x_{t+1} = Z_t' R_{t+1} / N$ を定義：

$$ x_{t+1} \;=\; (Z_t' Z_t / N) \Gamma_\alpha \;+\; (Z_t' Z_t / N) \Gamma_\beta \, f_{t+1} \;+\; (Z_t' \varepsilon_{t+1} / N). $$

つまり IPCA は実質的に $L$ 個の特性マネージドポートフォリオの時系列に対する PCA。CRW の regressed-PCA と精神的に類似 — ただし IPCA は原特性 $Z_t$ を射影基底とするのに対し、CRW は柔軟な基底 $\phi(z_t)$ を用いる。

5. 実証結果

サンプル：FNW・CRW と同じ 36 特性 × 12,813 銘柄 × 1962 年 9 月〜2014 年 5 月、月次。

仕様	Total $R^2$	MV シャープ	結論
FF-5（事前指定）	低	中	IPCA 劣後
IPCA-R, $K=5$（アルファなし）	~17%	高	強いフィット
IPCA-U, $K=5$（アルファあり）	~17.8%	高	R より僅かに良
R vs U の Wald	—	—	$\Gamma_\alpha = 0$ を慣例的水準で棄却できず

主要主張：IPCA-R のフィットが十分良いため、残余のアルファ寄与は小さい。したがって特性の期待リターンへの寄与は5 潜在ファクターへの時間変動ローディングで捉えられ、ミスプライシングではない。

5 つの IPCA ファクターは経済的に解釈可能 — 大まかにマーケット、規模、バリュー的、モメンタム的、クオリティ的 — だが FF ファクターと正確には一致しない、実証的に最適化された潜在版。

6. CRW の批判

CRW (2023) は IPCA の「ミスプライシングなし」主張に 3 つの問題を指摘：

線形性。$\alpha = z'\Gamma_\alpha$ はミスプライシングを線形に強制。真の $\alpha(z)$ が非線形ならば、線形制約が genuine α を spurious β に吸収する可能性。CRW のノンパラ仕様は非ゼロアルファを露呈する。
同時目的関数。IPCA の TS+XS 同時 MSE はファクターが特性の平均リターンパターンを吸収するのを許す。CRW は共動のみを説明するファクターを抽出し、クロスセクションパターンを $\alpha$ に残す。
K 選択。KPS は $K=5$ にコミット。CRW のデータ駆動固有値比セレクターは同じデータで $K=1$ or $2$ を選ぶ。

CRW の実証結論は逆：特性はリスクローディングと非ゼロミスプライシングの両方を担う；純 $\alpha$ ポートフォリオはシャープ > 3 を稼ぐ。

7. 本シリーズ内での位置づけ

Connor-Linton (2007) — IPCA の概念的祖先。KPS が線形・時間変動 $z$ で近代化。
Freyberger-Neuhierl-Weber (2020) — IPCA の 36 特性パネルを提供。
Bai-Ng (2002), Bai (2003) — IPCA の漸近論はこれらに基づく。
CRW (2023) — 直接の批判と代替。

8. AOF モデルへの貢献

既存ベンチマーク。あらゆる AOF ファクターモデルは同じデータで IPCA と比較されるべき。AOF の最初の deliverable：「regressed-PCA はこれを得る、IPCA はこれを得る、その理由と差の意味」。
交互最小二乗の参照実装。AOF が regressed-PCA を採用しても、IPCA コードベースは線形制約下での $\alpha = 0$ ロバストネスチェックとして有用。
戒めとしての事例。特定の $K$ への事前コミット（FF 流）は実証結論を大きく変えうる。AOF パイプラインは常に $K$ をデータから選択し、R と U の両仕様を報告する。

実装：Python 約 150 行。公開参照実装あり（Bryan Kelly の GitHub）。AOF では regressed-PCA と並走させて「現実チェック」パネルとする。

9. 次に読むべきもの

Kim, Korajczyk & Neuhierl (2020) — IPCA の「ミスプライシングなし」結論に異議を唱える別のセミパラメトリック推定量。
CRW (2023) — 完全な批判と代替。