A self-contained learning base covering the 7 papers that lead up to Chen, Roussanov & Wang (2023) and feed directly into the AOF quant model. Each paper has a bilingual lecture note + PDF (where freely available). Reading order is the order that builds understanding fastest, from statistical foundations to the headline paper.
Dependency map
flowchart TB
BN["01 Bai & Ng (2002)
K-selection"]
B03["02 Bai (2003)
Inference for PCA"]
ON["03 Onatski (2010)
Eigenvalue-ratio"]
CL["04 Connor-Linton (2007)
semiparametric loadings"]
FNW["05 FNW (2020)
36-char panel + LASSO"]
KPS["06 Kelly-Pruitt-Su (2019)
IPCA (benchmark)"]
KKN["07 Kim-Korajczyk-Neuhierl (2020)
arbitrage portfolios"]
CRW["08 Chen-Roussanov-Wang (2023)
regressed-PCA — AOF target"]
AOF["Synthesis
AOF Quant Model Blueprint"]
BN --> B03
BN --> ON
CL --> KPS
FNW --> KPS
FNW --> KKN
KPS --> CRW
KKN --> CRW
ON --> CRW
B03 --> CRW
CRW --> AOF
style CRW fill:#fbe9c4,stroke:#b8651e,stroke-width:2px
style AOF fill:#f4eaf6,stroke:#7a3e87,stroke-width:2px
Papers
01
Bai & Ng (2002)
"Determining the Number of Factors in Approximate Factor Models"
Foundation: how to choose $K$. Six penalised information criteria. Default selector when $N$ and $T$ are large.
02
Bai (2003)
"Inferential Theory for Factor Models of Large Dimensions"
Inference layer. Asymptotic distributions for PCA factor and loading estimates. Analytical SEs when $T$ is long.
03
Onatski (2010)
"Determining the Number of Factors from Empirical Distribution of Eigenvalues"
Eigenvalue-ratio test. Robust to weak factors. Direct ancestor of CRW's $\hat K$ selector.
04
Connor & Linton (2007)
"Semiparametric Characteristic-Based Factor Model"
Conceptual seed. First semiparametric factor model in finance — loadings as functions of characteristics.
05
Freyberger, Neuhierl & Weber (2020)
"Dissecting Characteristics Nonparametrically"
The data panel. 36 firm characteristics × ~12k US stocks. Adaptive group LASSO identifies which 15 (full-sample) survive joint test.
06
Kelly, Pruitt & Su (2019) — IPCA
"Characteristics Are Covariances"
The direct benchmark. Linear in characteristics, joint TS+XS objective, claims no mispricing with 5 factors. The model AOF must beat.
07
Kim, Korajczyk & Neuhierl (2020)
"Arbitrage Portfolios"
Triangulation. Independent methodology, same conclusion: mispricing exists at exploitable scale. Useful starter implementation.
08
Chen, Roussanov & Wang (2023) — CRW
"Semiparametric Conditional Factor Models"
The headline paper. Regressed-PCA: 2-step estimator with nonparametric $\alpha, \beta$, fixed-$T$ asymptotics, Sharpe-3 arbitrage. AOF target.
99
Synthesis — AOF Quant Model Blueprint
how all 7 papers compose into the AOF model
Concrete model architecture. Each design choice cites the paper that justifies it.
At-a-glance
| # | Paper | Year | Year-1 contribution to AOF model |
| 01 | Bai & Ng | 2002 | $K$ selection (large-panel default) |
| 02 | Bai | 2003 | analytical SE for factor estimates |
| 03 | Onatski | 2010 | $K$ selection (weak-factor / small-$T$) |
| 04 | Connor & Linton | 2007 | conceptual frame: loadings = functions of characteristics |
| 05 | FNW | 2020 | 36-char list + which to keep |
| 06 | KPS — IPCA | 2019 | benchmark to beat |
| 07 | KKN | 2020 | starter portfolio + reality check |
| 08 | CRW | 2023 | target model — full implementation |
How to use this
- Read 01 → 02 → 03 in sequence to nail the statistical foundations.
- Read 04 → 05 → 06 → 07 to understand the finance-specific literature.
- Read 08 last — by this point, the contribution structure is crystal clear.
- Then the synthesis note for the actual AOF build plan.
Each note is ~2,000–5,000 words, bilingual (English / Japanese), with MathJax equations, Mermaid diagrams, and "What this gives AOF" callouts. Designed to be read in any sequence — but the order above is fastest.
Chen, Roussanov & Wang (2023) に至り、AOF クオンツモデルに直接フィードする 7 本の論文を網羅する自己完結型の学習ベース。各論文に日英対訳の講義ノート+PDF(自由入手可能な場合)。読書順は出版順ではなく、統計的基礎から本命論文へと最短距離で理解を構築する順序。
依存マップ
flowchart TB
BN["01 Bai & Ng (2002)
K 選択"]
B03["02 Bai (2003)
PCA の推測理論"]
ON["03 Onatski (2010)
固有値比"]
CL["04 Connor-Linton (2007)
セミパラメトリックローディング"]
FNW["05 FNW (2020)
36 特性パネル+LASSO"]
KPS["06 Kelly-Pruitt-Su (2019)
IPCA(ベンチマーク)"]
KKN["07 Kim-Korajczyk-Neuhierl (2020)
アービトラージポートフォリオ"]
CRW["08 Chen-Roussanov-Wang (2023)
regressed-PCA — AOF 目標"]
AOF["統合論
AOF クオンツモデル設計図"]
BN --> B03
BN --> ON
CL --> KPS
FNW --> KPS
FNW --> KKN
KPS --> CRW
KKN --> CRW
ON --> CRW
B03 --> CRW
CRW --> AOF
style CRW fill:#fbe9c4,stroke:#b8651e,stroke-width:2px
style AOF fill:#f4eaf6,stroke:#7a3e87,stroke-width:2px
論文一覧
一覧表
| # | 論文 | 年 | AOF モデルへの初年度貢献 |
| 01 | Bai & Ng | 2002 | $K$ 選択(大パネルデフォルト) |
| 02 | Bai | 2003 | ファクター推定値の解析的 SE |
| 03 | Onatski | 2010 | $K$ 選択(弱因子・短 $T$) |
| 04 | Connor & Linton | 2007 | 概念フレーム:ローディング = 特性の関数 |
| 05 | FNW | 2020 | 36 特性リスト+採択指針 |
| 06 | KPS — IPCA | 2019 | 超えるべきベンチマーク |
| 07 | KKN | 2020 | 初期ポートフォリオ+リアリティチェック |
| 08 | CRW | 2023 | 目標モデル — 完全実装 |
使い方
- 01 → 02 → 03 を順に読み、統計的基礎を固める。
- 04 → 05 → 06 → 07 を読み、ファイナンス固有の文献を理解。
- 08 を最後に読む — この時点で貢献構造が極めて明確。
- その後、統合論で実際の AOF 構築プランへ。
各ノートは約 2,000〜5,000 語、日英対訳、MathJax 数式、Mermaid 図、「AOF への貢献」コールアウト付き。任意の順序で読めるよう設計しているが、上記順序が最速。