Part 17 / 16
Appendix: Reference Material
17.1 Notation summary
| Symbol | Dimensions | Meaning | First used |
|---|---|---|---|
| scalar | number of assets | Ch. 1 | |
| scalar | number of factors | Ch. 1 | |
| scalar | number of time periods | Ch. 1 | |
| asset returns over one period | Ch. 2 | ||
| factor exposure (loading) matrix | Ch. 2 | ||
| factor returns over one period | Ch. 2 | ||
| specific (idiosyncratic) returns | Ch. 2 | ||
| factor covariance matrix | Ch. 2 | ||
| diag. | specific variance matrix, | Ch. 2 | |
| asset covariance, | Ch. 2 | ||
| weights: generic, portfolio, benchmark, active () | Ch. 2, Ch. 9 | ||
| factor exposures of a portfolio, | Ch. 2, Ch. 9 | ||
| scalar | portfolio volatility, tracking error | Ch. 2 | |
| scalar | raw descriptor value, stock , factor | Ch. 3 | |
| scalar | standardization location (cap-weighted) and scale (equal-weighted) | Ch. 3 | |
| scalar | least-squares objective, (weighted) sum of squared residuals minimized to get | Ch. 6 | |
| diag. | regression weights (convention: ) | Ch. 6 | |
| (or more rows) | constraint matrix, | Ch. 6 | |
| restriction matrix, feasible | Ch. 6 | ||
| pure factor portfolio matrix, | Ch. 6, Ch. 7 | ||
| OLS projection (“hat”) matrix, | §17.2 | ||
| general specific-return covariance in GLS; reduces to when diagonal | §17.2 | ||
| scalar | time-series beta to a factor; portfolio beta to a hedge instrument | Ch. 1, Ch. 12 | |
| scalar / | Fama–MacBeth risk premium; Lagrange multiplier in the characteristic-portfolio derivation | Ch. 7 | |
| varies | hedge notionals, characteristic portfolio weights | Ch. 7, Ch. 12 | |
| scalar | risk aversion (Ch. 11), EWMA decay (Ch. 8), context disambiguates | Ch. 8, Ch. 11 | |
| scalar | bias statistic, | Ch. 8, Ch. 14 | |
| scalar | marginal contribution , contribution | Ch. 9 |
Conventions: returns are arithmetic and in decimal unless a table is marked %. Risk numbers are annualized unless noted. “Exposure” always means a column-standardized or dummy loading per Chapter 3.
17.2 Refresher: the least-squares family in one place
OLS: . Fitted values with the projection (“hat”) matrix : symmetric, idempotent (), projecting onto the column space of . Residuals are orthogonal to that space: .
WLS: With positive diagonal : . Equivalent to OLS on , . Best linear unbiased when (Aitken/GLS). The convention approximates this for equities (Ch. 6).
GLS: General : . In the factor-model context (diagonal), so GLS = WLS with .
Covariance algebra used throughout: For conformable constant matrices and random vectors : ; when . These two lines, applied to , are the derivation of .
EWMA: Weights on lag . Half-life . Recursive update (normalized form). Effective sample size for large .
Eigendecomposition/PCA: Symmetric , orthonormal, diagonal with for PSD. PCA: factor ‘s exposures = , factor variance = . The rank- truncation is the best rank- approximation in Frobenius norm (Eckart–Young). Marchenko–Pastur noise edge for aspect ratio : .
17.3 Derivation collection
D1 - Constrained WLS (Ch. 6): Minimize s.t. ( independent rows). Restriction form: pick () whose columns span the null space of (so and any feasible ). Substitute: is unconstrained in , and by the WLS formula with design : , . Lagrangian form: . Stationarity gives the bordered system . Same solution, and the multiplier prices the constraint.
D2 - Pure factor portfolios, (Ch. 7): Unconstrained: . Row of is a portfolio with exposure vector = row of = : unit own-factor, zero others. Constrained: with . Then for all , identity on the feasible subspace. Style rows are exactly pure, market/industry rows carry the constraint’s structure (Ch. 7 table).
D3 - Euler risk decomposition (Ch. 9): is positively homogeneous of degree 1. Euler’s theorem for homogeneous functions: . Directly: , so . ∎ Factor-space version: with (factor block), contributions sum to .
D4 - Characteristic portfolio (Ch. 7): s.t. . Lagrangian . Stationarity . The constraint fixes : , with minimized variance . The GLS pure portfolio for factor solves the same problem with the added constraints of zero exposure to the other factors. Stacking those constraints and applying D1’s bordered system shows the GLS () regression rows solve it: estimation efficiency = portfolio efficiency.
D5 - Woodbury identity for (Ch. 11): For with invertible: . Verify by multiplication: . Factor from the middle term: , so the middle term collapses to , cancelling. ∎ Cost: vs. .
D6 - Minimum-variance hedge ratio (Ch. 12): . Minimize over : . Multi-instrument: , the population regression coefficients of the portfolio on the instruments. Through the model, and .
17.4 Glossary
- Active return/risk: portfolio minus benchmark return. Volatility thereof (tracking error).
- Alpha: expected return not explained by factor exposures. In construction, the forecast vector fed to an optimizer.
- Bias statistic: std of realized returns standardized by forecast volatility. The calibration score of a risk model.
- Characteristic portfolio: minimum-variance portfolio with unit exposure to a characteristic: .
- Coverage universe: all assets the model assigns exposures and risk to (cf. estimation universe).
- Descriptor: a raw measurable per-stock quantity (B/P, 12-1 return) before standardization. Factors blend one or more.
- Estimation universe: the curated asset set on which factor returns are estimated.
- Exposure (loading): a stock’s sensitivity to a factor: dummy (industry/country) or standardized z-score (style).
- Factor-mimicking/pure factor portfolio: the long–short portfolio (row of ) whose return is the estimated factor return. Unit own-exposure, zero other-exposure.
- Factor return: per-period payoff to unit exposure of a factor, estimated by cross-sectional regression.
- Half-life: lag at which an EWMA weight halves. The responsiveness dial of covariance estimation.
- Idiosyncratic/specific risk: return variance unique to a stock. Diagonal of . Diversifiable.
- Information coefficient (IC): cross-sectional correlation between a signal and forward returns. In alpha research, the number that matters is the IC of the factor-residualized signal.
- Linked assets: multiple listings of one issuer (ADR, share classes) sharing factor exposures and correlated specifics.
- Pure factor portfolio: see factor-mimicking portfolio.
- Restriction matrix: basis of a constraint’s null space, converting constrained to unconstrained regression.
- Style factor: continuous characteristic-based factor (value, momentum, size, quality…).
- Tracking error: annualized volatility of active return.
- VIF (variance inflation factor): of one exposure regressed on the others. The redundancy gauge for candidate factors.
- Winsorization: clipping extreme descriptor values before standardization.
- Z-score: standardized exposure: (descriptor − cap-weighted mean) / equal-weighted std.
17.5 The mini example: complete dataset
Everything below reproduces every number in Chapters 2–15 (NumPy, deterministic, no randomness). The full script is reproduced on its own page: Mini Example Source Code.
Universe, descriptors, month-1 data, portfolios:
| Stock | Industry | Cap $bn | B/P | Mom (12-1) | Spec vol (ann) | (%) | |
|---|---|---|---|---|---|---|---|
| AXIOM | Tech | 150 | 0.15 | +0.32 | 18% | +4.2 | 0.10 |
| BINARY | Tech | 80 | 0.25 | +0.18 | 22% | +2.8 | 0.08 |
| CIPHER | Tech | 40 | 0.45 | −0.05 | 30% | +0.5 | 0.10 |
| DIGIT | Tech | 10 | 0.60 | +0.40 | 38% | +6.0 | 0.03 |
| EVERGREEN | Fin | 120 | 0.85 | +0.06 | 16% | +0.8 | 0.22 |
| FIDELIS | Fin | 60 | 0.95 | −0.02 | 20% | −0.6 | 0.14 |
| GUARDIAN | Fin | 20 | 1.10 | −0.12 | 28% | −1.8 | 0.06 |
| HARVEST | Cons | 90 | 0.40 | +0.10 | 17% | +1.2 | 0.15 |
| INDIGO | Cons | 30 | 0.55 | +0.02 | 26% | +2.0 | 0.08 |
| JUNIPER | Cons | 15 | 0.70 | −0.08 | 32% | −0.5 | 0.04 |
Benchmark = cap weights (total cap 615). SIZE descriptor = ln(cap). Style standardization: cap-weighted mean, equal-weighted std (Ch. 3. Resulting tabulated there). Regression weights , normalized. Constraint: cap-weighted industry factor returns sum to zero, industry cap weights (0.4553, 0.3252, 0.2195).
Factor covariance (annualized): Volatilities: MKT 16%, TECH 9%, FIN 7%, CONS 5%, VALUE 4%, MOM 6%, SIZE 4%. Correlations:
| MKT | TECH | FIN | CONS | VALUE | MOM | SIZE | |
|---|---|---|---|---|---|---|---|
| MKT | 1 | 0.10 | −0.05 | −0.10 | −0.20 | 0.05 | 0.15 |
| TECH | 1 | −0.40 | −0.30 | −0.35 | 0.30 | 0.05 | |
| FIN | 1 | −0.10 | 0.40 | −0.15 | 0.00 | ||
| CONS | 1 | 0.05 | −0.05 | −0.05 | |||
| VALUE | 1 | −0.45 | 0.10 | ||||
| MOM | 1 | 0.05 | |||||
| SIZE | 1 |
(Symmetric, eigenvalues all positive: PSD verified in the script.)
Stipulated later-month factor returns (Ch. 10. CONS set by the constraint): = (−2.0, −1.5, +1.0, +1.63, +1.2, −0.8, +0.5)%. = (+3.0, +0.8, −0.5, −0.919, −0.6, +1.0, −0.3)%. Active specific returns months 2–3: +0.30%, −0.10%. Constant exposures assumed across the quarter.
Hedge instruments (Ch. 12): index future = benchmark exposures (1, 0.4553, 0.3252, 0.2195, 0, 0, 0). Small-cap future = (1.05, 0.35, 0.30, 0.35, 0.10, −0.05, −1.20). Both specific-risk-free.
Alpha-research candidate (Ch. 13): raw descriptor = book-to-price − 0.30·(12-1 momentum) + a fixed per-stock idiosyncratic term, standardized to exposure . Regressing on gives the spanning results below.
Key computed results (cross-chapter checkpoints): month-1 factor returns = (1.821, 0.768, −1.282, 0.306, 0.548, 1.962, 0.046)%. Portfolio/benchmark/active vols 17.55% / 18.14% / 5.42%. Active exposures (0, −0.145, 0.095, 0.051, 0.385, −0.332, −0.275). Quarter attribution VALUE +0.46% MOM −0.72% specific +0.24% total −0.102%. Optimization TE 5.42->4.65% at 25.2% turnover. Hedge (−0.759, −0.229) giving 17.55->8.14%. Alpha-research candidate: corr with VALUE/MOM/SIZE +0.82/−0.44/−0.50, spanned fraction 0.884, IC vs −0.49 raw / −0.13 residualized, signal long–short −0.63% = factor −0.46% + specific −0.17%.
17.6 Annotated bibliography
Foundations
- Sharpe (1964), “Capital Asset Prices,” JF: the one-factor beginning. Beta as the first exposure.
- Ross (1976), “The Arbitrage Theory of Capital Asset Pricing,” JET: multi-factor pricing without specifying the factors. The license under which all factor models operate.
- Fama & MacBeth (1973), “Risk, Return, and Equilibrium,” JPE: the two-pass cross-sectional methodology (Ch. 7). Still the standard premia test.
- Chen, Roll & Ross (1986), “Economic Forces and the Stock Market,” JB: the canonical macroeconomic factor model (Ch. 4).
- Fama & French (1992, 1993) JF/JFE; (2015) JFE; Carhart (1997) JF: the style-factor canon: size and value, the three-factor model, profitability/investment, momentum. The sorted-portfolio construction of Ch. 7.
- Rosenberg (1974), “Extra-Market Components of Covariance,” JFQA: the founding paper of the fundamental cross-sectional architecture this primer centers on.
Books
- Grinold & Kahn, Active Portfolio Management (2nd ed.): the practitioner bible: characteristic portfolios, the fundamental law, IR-based thinking. The source of Ch. 7’s optimization view and Ch. 11’s alpha discipline.
- Connor, Goldberg & Korajczyk, Portfolio Risk Analysis: the most rigorous book-length treatment of factor risk models per se, all three families.
- Qian, Hua & Sorensen, Quantitative Equity Portfolio Management: cross-sectional modeling and construction with worked detail.
- Litterman et al., Modern Investment Management: risk decomposition and budgeting culture (Goldman’s quantitative tradition).
Methods
- Ledoit & Wolf (2003, 2004): shrinkage covariance estimation (Ch. 8): “Honey, I Shrunk the Sample Covariance Matrix.”
- Newey & West (1987), Econometrica: autocorrelation-consistent covariance. The horizon-scaling fix of Ch. 8.
- Shanken (1992), RFS: errors-in-variables correction for Fama–MacBeth (Ch. 7).
- Menchero (2000s, various): multi-period attribution linking (Ch. 10), Carino (1999): the log-linking algorithm.
- Black & Litterman (1992), FAJ: equilibrium-anchored expected returns (Ch. 11).
- Michaud (1989), “The Markowitz Optimization Enigma,” FAJ: error maximization named and shamed (Ch. 11).
- Harvey, Liu & Zhu (2016), RFS, “…and the Cross-Section of Expected Returns”: the factor zoo’s multiple-testing reckoning (Ch. 16). Hou, Xue & Zhang (2020), RFS: the replication audit.
- Kelly, Pruitt & Su (2019), JFE: IPCA, Gu, Kelly & Xiu (2020), RFS: ML asset pricing (Ch. 16 directions).
Practitioner references
- MSCI Barra model handbooks (USE4, GEM3 and successors): full disclosure of a production fundamental model: descriptor recipes, estimation universes, regression weights, specific-risk blending. The single best way to see every choice in Chapters 3–8 made concretely, with parameters.
- Axioma/SimCorp research papers: practical treatments of alpha alignment (Ch. 11), statistical-vs-fundamental hybrids (Ch. 4), and bias-statistic methodology (Ch. 14).
- Menchero, Orr & Wang (MSCI), “The Barra US Equity Model (USE4)” research notes: readable bridge between the handbooks and the academic literature.