← Learning base index · 05 of 8

05 — Freyberger, Neuhierl & Weber (2020)

Dissecting Characteristics Nonparametrically

Review of Financial Studies 33(5), 2326–2377 · Kei Matsumae · 2026-05-15

What this paper does

Answers two questions simultaneously: (i) of the many firm characteristics that predict returns in isolation, how many carry independent information once you account for all the others? (ii) Are those relationships linear?
Method: spline basis per characteristic + adaptive group LASSO for joint selection.
Sample: 36 characteristics × ~5,000 individual US stocks × 1963–2015 monthly.
Finding 1: ~15 of 36 characteristics survive joint selection.
Finding 2: Only 7 survive when restricted to stocks above the 20% NYSE size cutoff.
Source of the 36-characteristic panel that Kelly et al. (2019) and CRW (2023) both use.
For AOF: tells you which characteristics deserve basis dimensions.

1. Why this paper exists

By 2015 the academic literature had documented 300+ "anomaly" characteristics. Two problems:

Joint significance. Most studied one at a time; correlated with each other (size with momentum, with B/M, with profitability). Which survive a joint test?
Functional form. Almost all studies impose linearity (or coarse portfolio sorts). What if the true relationship is nonlinear?

FNW propose a single procedure handling both jointly.

2. The model

$$ r_{it+1} \;=\; \sum_{j=1}^{M} m_j(C_{ijt}) \;+\; \varepsilon_{it+1}, $$

$C_{ijt}$ is characteristic $j$ for stock $i$ at $t$ (rank-transformed to $[0, 1]$).
$m_j(\cdot)$ is an unknown smooth function capturing how characteristic $j$ predicts next-month return.
Many $m_j(\cdot)$ are expected to be identically zero — the selection part.

This is a return-prediction model, not a factor model. FNW does not estimate loadings or latent factors. It's a single cross-sectional regression with characteristic-driven means. Use FNW as input to selecting relevant characteristics; use IPCA or CRW for the factor structure.

3. The estimator — adaptive group LASSO with splines

For each characteristic $j$, approximate $m_j(\cdot)$ by a B-spline with $L$ knots:

$$ m_j(c) \;\approx\; \sum_{\ell=1}^{L} \theta_{j\ell} \, p_\ell(c) \;=\; p(c)' \theta_j. $$

$\theta_j$ is a group: either all components zero (excluded) or some non-zero (included).

Estimator:

$$ \min_{\{\theta_j\}} \;\sum_{i, t} \Bigl( r_{it+1} - \sum_j p(C_{ijt})' \theta_j \Bigr)^2 \;+\; \lambda \sum_j w_j \, \|\theta_j\|_2. $$

The group penalty $\|\theta_j\|_2$ sets entire groups to zero. The adaptive weights $w_j$ (from an initial unpenalised fit) ensure consistent selection.

4. The 36 characteristics

Eleven market/return-based, twenty-five fundamentals-based. Examples:

Market: market cap (LME), turnover, beta, idio vol, r2-1, r12-2, r12-7, r36-13, closeness to 52-week high, bid-ask spread, SUV.
Fundamentals: total assets, B/M, profitability, ROA, ROE, investment, accruals, Tobin's Q, free CF, leverage, asset turnover, cash holdings, D&A, R&D, SG&A, etc.

This is the panel that Kelly et al. (2019) inherit and CRW (2023) build on.

5. Empirical findings

5.1 All stocks above $5 price (full sample)

15 characteristics survive. Most influential (approximate order):

Market cap (LME) — strongly nonlinear
Closeness to 52-week high
SUV
Lagged turnover
r12-2 momentum
r2-1 reversal
r12-7 intermediate momentum
Idio vol
Free cash flow
Bid-ask spread

Surprisingly dropped: most balance-sheet ratios, including value (B/M), beta, ROE, accruals, investment. Look significant in linear single-variable analyses but drop out once you control for the survivors with flexible functional form.

5.2 Large-cap subsample (top 80% by NYSE size)

Only 7 characteristics survive: size, momentum, SUV, lagged turnover, idio vol, profitability, intermediate momentum.

5.3 Functional form

Most surviving characteristics show clear nonlinearity — sometimes a jump at extreme deciles with a flat middle, sometimes a smooth concave curve. Linear Fama-MacBeth slopes miss this.

6. Connection to other papers in this series

flowchart LR CL["Connor & Linton (2007)
3 chars, kernel"] FNW["FNW (2020)
36 chars, splines + group LASSO"] KPS["Kelly-Pruitt-Su (2019)
uses 36-char panel + IPCA"] CRW["CRW (2023)
uses 36-char panel + regressed-PCA"] CL --> FNW FNW --> KPS FNW --> CRW style FNW fill:#fff6e3,stroke:#b8651e

Connor & Linton (2007) used 3 characteristics and kernel smoothing. FNW scales to 36 with splines + group LASSO, much faster, but doesn't extract factors.
Kelly-Pruitt-Su (2019) IPCA — reuses the same 36-char panel; runs the IPCA factor model on it.
Chen-Roussanov-Wang (2023) — same 36-char panel, regressed-PCA estimator, lets $\beta(\cdot)$ be nonparametric in characteristics (more flexible than FNW's $m_j$).

FNW vs. CRW: FNW is a return-prediction model ($E[r|z]$); CRW is a factor model (separates risk loading from mispricing). Complementary — FNW tells you which characteristics deserve modelling; CRW tells you whether their predictive content is risk or alpha.

7. What this gives the AOF model

FNW feeds the AOF pipeline at three points:

AOF design step	FNW contribution
Choice of $z$ vector	The 36-char list itself. Don't reinvent; this is the consensus academic panel.
Selecting which $z_j$ to keep	The 15-char "matters jointly" shortlist for small-cap-friendly; the 7-char list for large-cap-only. Saves basis dimensions, reduces overfitting risk.
Basis design	Empirical nonlinearity profile. Strongly nonlinear chars (size, 52-week high) deserve more knots; smooth-linear chars (momentum) use $(1, z)$.

For Japan: re-run adaptive group LASSO on JP 36-char panel as a separate research step. Selected JP shortlist will differ — likely momentum and size hold up; value (B/M) and profitability may be stronger in JP than US.

8. Reading next

Kelly-Pruitt-Su (2019) — IPCA companion on the same panel.
CRW (2023) — full factor-model treatment.

← 学習ベース index · 8 本中 5 本目

05 — Freyberger, Neuhierl & Weber (2020)

特性のノンパラメトリック分解

Review of Financial Studies 33(5), 2326–2377 · 松前景一郎 · 2026-05-15

論文の要点

2 つの問いに同時回答：(i) 単独でリターンを予測する多数の企業特性のうち、他をすべて控除した上で独立な情報を持つのは何個か？ (ii) その関係は線形か？
手法：特性ごとにスプライン基底＋同時選択のためのadaptive group LASSO。
サンプル：36 特性 × 約 5,000 米国個別株 × 1963–2015 月次。
結果 1：36 中 15 特性が同時選択で生き残る。
結果 2：NYSE 上位 80% に限定すると7 特性のみ。
Kelly et al. (2019) と CRW (2023) が共に用いる 36 特性パネルの源。
AOF 用途：どの特性に基底次元を割くべきかを示す。

1. なぜこの論文が必要か

2015 年までに学術文献では 300 超の「アノマリー」特性が文書化されていた。2 つの問題：

同時有意性。多くは単独で研究されている。相互に相関（規模はモメンタムと、B/M と、収益性と相関）。同時検定で残るのは？
関数形。ほぼすべての研究が線形を仮定（または粗いポートフォリオソート）。真の関係が非線形ならどうなる？

FNW は両方を同時に扱う単一手続きを提案。

2. モデル

$$ r_{it+1} \;=\; \sum_{j=1}^{M} m_j(C_{ijt}) \;+\; \varepsilon_{it+1}, $$

$C_{ijt}$ は時点 $t$ の銘柄 $i$ の特性 $j$（$[0, 1]$ にランク変換）。
$m_j(\cdot)$ は特性 $j$ が翌月リターンをどう予測するかの未知の滑らかな関数。
多くの $m_j(\cdot)$ は恒等的に 0と予想される — 選択の対象。

これはリターン予測モデルであって、ファクターモデルではない。FNW はローディングや潜在ファクターを推定しない。特性駆動の平均を持つ単一のクロスセクション回帰。FNW は「どの特性が関連するか」の選択入力に用い、ファクター構造には IPCA や CRW を使う。

3. 推定量 — スプライン付き adaptive group LASSO

各特性 $j$ について、$L$ 節点の B スプラインで $m_j(\cdot)$ を近似：

$$ m_j(c) \;\approx\; \sum_{\ell=1}^{L} \theta_{j\ell} \, p_\ell(c) \;=\; p(c)' \theta_j. $$

$\theta_j$ はグループ：全成分 0（除外）か、一部非ゼロ（採用）。

推定量：

$$ \min_{\{\theta_j\}} \;\sum_{i, t} \Bigl( r_{it+1} - \sum_j p(C_{ijt})' \theta_j \Bigr)^2 \;+\; \lambda \sum_j w_j \, \|\theta_j\|_2. $$

グループペナルティ $\|\theta_j\|_2$ がグループ全体を 0 にする。adaptive weights $w_j$（初期非ペナルティ推定から得る）は一致選択を保証。

4. 36 特性

11 種が市場・リターンベース、25 種がファンダメンタルベース。例：

市場：時価総額 (LME)、回転率、ベータ、特異ボラ、r2-1、r12-2、r12-7、r36-13、52 週高値との近接度、ビッドアスクスプレッド、SUV。
ファンダメンタル：総資産、B/M、収益性、ROA、ROE、投資、accruals、Tobin Q、フリー CF、レバレッジ、資産回転率、現金保有、D&A、R&D、SG&A など。

これが Kelly et al. (2019) が継承し、CRW (2023) が構築するパネル。

5. 実証結果

5.1 全銘柄（5 ドル超）フルサンプル

15 特性が生き残る。影響度上位（おおむね順）：

時価総額（LME） — 強い非線形
52 週高値との近接度
SUV
ラグ回転率
r12-2 モメンタム
r2-1 リバーサル
r12-7 中期モメンタム
特異ボラ
フリーキャッシュフロー
ビッドアスクスプレッド

意外に脱落：多くのバランスシート系比率 — バリュー (B/M)、ベータ、ROE、accruals、投資。線形・単一変量分析では有意に見えるが、柔軟な関数形で他の生存者を控除すると消える。

5.2 大型株サブサンプル（NYSE 上位 80%）

7 特性のみ生存：規模、モメンタム、SUV、ラグ回転率、特異ボラ、収益性、中期モメンタム。

5.3 関数形

生存特性のほとんどが明確な非線形性を示す — 極端な十分位でジャンプして中央が平坦なケース、滑らかな凹曲線のケースなど。線形 Fama-MacBeth 傾きはこれを取り逃がす。

6. 本シリーズ内での位置づけ

flowchart LR CL["Connor & Linton (2007)
3 特性、カーネル"] FNW["FNW (2020)
36 特性、スプライン＋group LASSO"] KPS["Kelly-Pruitt-Su (2019)
36 特性パネル＋IPCA"] CRW["CRW (2023)
36 特性パネル＋regressed-PCA"] CL --> FNW FNW --> KPS FNW --> CRW style FNW fill:#fff6e3,stroke:#b8651e

Connor & Linton (2007) は 3 特性＋カーネル平滑化。FNW はスプライン＋group LASSO で 36 特性にスケール、はるかに高速だがファクターは抽出しない。
Kelly-Pruitt-Su (2019) IPCA — 同じ 36 特性パネルに IPCA ファクターモデルを走らせる。
Chen-Roussanov-Wang (2023) — 同じ 36 特性パネル、regressed-PCA、$\beta(\cdot)$ を特性についてノンパラメトリックに（FNW の $m_j$ より柔軟）。

FNW vs. CRW：FNW はリターン予測モデル（$E[r|z]$）、CRW はファクターモデル（リスクローディングとミスプライシングを分離）。補完関係 — FNW は「どの特性をモデルに入れる価値があるか」、CRW は「その予測成分はリスクかアルファか」を答える。

7. AOF モデルへの貢献

FNW は AOF パイプラインの 3 か所に寄与：

AOF 設計ステップ	FNW の貢献
$z$ ベクトルの選択	36 特性リストそのもの。再発明しない；これが学術コンセンサスのパネル。
$z_j$ の絞り込み	「同時に意味を持つ」15 特性ショートリスト（小型株対応版）、または 7 特性リスト（大型株専用）。基底次元を節約、過学習リスク低減。
基底設計	実証的非線形プロファイル。強い非線形（規模、52 週高値）にはより多くの節点、滑らかな線形効果（モメンタム）には $(1, z)$ で十分。

日本：JP の 36 特性パネルに対し adaptive group LASSO を別研究ステップとして再実行。JP のショートリストは米国とは異なる — モメンタムと規模は維持、バリュー (B/M) と収益性は JP の方が強い可能性。

8. 次に読むべきもの

Kelly-Pruitt-Su (2019) — 同じパネルに対する IPCA。
CRW (2023) — ファクターモデル本格処理。