AMIS Seminar | Professor Doug Schroeder

Econometric Analysis of Accounting Choice

Summer 2022, Spring 2023

Instructor Information
Instructor: Professor D. Schroeder
Office: 424 Fisher Hall
Email: schroeder.9@osu.edu
Phone: 614-292-6427

Potential topics (we won’t find time for all but most have been addressed over the past five years):

Identification of causal effects:

Structural causal models (SCM) and DAGs (directed acyclic graphs)
- Pearl, Glymour, and Jewell, 2016, Causal inference in statistics: A primer, Wiley
Back-door confounding, Simpson’s paradox and nonparametric identification (Bayes theorem and do-calculus)
- Ralphs technology
- Pearl 1995, Biometrika, “Causal diagrams for empirical research”
back-door adjustment
- Ralph’s back-door adjustment
front-door adjustment
- Ralph’s front-door adjustment
Berkson’s paradox or collider bias
- Ralph’s kitchen sink fallacy
Instrumental variables and projections
- Ralph’s instrumental variables
Cyclical models, equilibrium and instrumental variables
- Ralph’s structural queries
- Haavelmo, 1943, Econometrica, The statistical implications of a system of simultaneous equations.
Linear models and path coefficients
- Ralph’s path coefficients
DAG construction and diagnostic checking
- Ralph’s DAG
Sample selection bias and adjustment
- Ralph’s technology selection
- notes on sample selection
Mediation
- notes on mediation
- Pearl, 2012, Prevention Science, The causal mediation formula – A guide to the assessment of pathways and mechanisms.
Data missing at random and not at random
- SCM graphs and missing data
- Mohan and Pearl, 2020, Journal of the American Statistical Association, “Graphical models for processing missing data.”
External validity, meta analysis (combining evidence from multiple studies)
- Ralph’s transportability
- notes on SCM and meta-analysis
- Pearl and Bareinboim, 2014, Statistical Science, “External validity: From do-calculus to transportability across populations”
Necessary, sufficient and necessary & sufficient causal effects (non-point, partial identification)
- notes on causes of effects
- Tian and Pearl, 2000, “Probabilities of causation: bounds and identification.”
- Mueller, Li, and Pearl, 2021, “Causes of effects: Learning individual responses from population data.”

Experimental design and inference:

Method of moments
Clustering, fixed effects (within cluster variation), and standard errors
- notes on SCM and fixed effects
- notes on clustering, fixed effects, random coefficients, and standard errors
SCM with diff-in-diff design
Proxy variables
- SCM and z-identification
- SCM and proxy variables
SCM and inverse-propensity score weighting
- notes on SCM and inverse-propensity score weighting
- notes on treatment effects and inverse-propensity score weighting
Entropy balanced causal effects
- notes on entropy balancing
- Hainmueller, 2012, Political Analysis, Entropy balancing for causal effects.

Other topics of interest to participants

treatment effects and regression discontinuity designs
treatment effects and control or replacement functions
treatment effects and matching methods
dynamic treatment effects
synthetic controls
…

Archived material:

Focus:

Explore issues of (broad) interest raised in colloquium, papers, or formative ideas from a causal perspective.

Describe the general problem.
Identify a (causal) research question and the specific effects or quantities of interest.
Explain your identification strategy for the effects of interest.
Formulate an identification-consistent research design (possibly including a causal graph).
Discuss interpretation of potential results.

Closely attend to the role of causal relations, probability assignment, and counterfactuals in devising an identification strategy and data analysis.

Background reading:

Pearl, Glymour, and Jewell, 2016, Causal inference in statistics: A primer, Wiley.

Angrist and Pischke, 2015, Mastering Metrics, Princeton University Press.

Angrist and Pischke, 2009, Mostly Harmless Econometrics, , Princeton University Press.

Wooldridge, 2002, Econometric analysis of cross section and panel data, The MIT Press.

Davidson and MacKinnon, 2004, Econometric theory and methods, Oxford University Press.

Cameron and Trivedi, 2005, Microeconometrics: Methods and applications, Cambridge University Press.

Strang, Introduction to linear algebra or Linear algebra and its applications.

Tentative outline for summer and spring:

session	topic	examples	background reading
	prediction (statistical)
1 summer	inferring transactions	Ralph’s structure; Ralph’s accounting information	Arya, Felligham, Glover, Schroeder, and Strang, 2000, Contemporary Accounting Research, “inferring transactions from financial statements”; notes on updating from financial statements
2 summer	accruals as statistics	Ralph’s optimal accruals; Ralph’s GLS; Ralph’s Bayesian Accruals	Arya, Fellingham, and Schroeder, 2002, European Accounting Review, “Depreciation in a model of probabilistic investment “; Demski, Fellingham, Lin, and Schroeder, 2007, “On the role of acccruals,” working paper
	causality (structure and evidence)
3 spring	Simpson’s paradox & causality – intervention & counterfactuals	Ralph’s Technology	Pearl, Glymour, and Jewell, 2016, Causal inference in statistics: A primer, Wiley
4	do-calculus	Ralph’s back-door adjustment	Pearl et al Primer; Pearl 1995, Biometrika, “Causal diagrams for empirical research”
5	do-calculus	Ralph’s front door adjustment	Pearl et al Primer; Pearl, Biometrika
6	linear models & instruments	Ralph’s instrumental variables	Pearl et al Primer notes on SCM and fixed effects SCM, random coefficients, fixed effects, and clustering SCM and z-identification SCM and proxy variables
7	cyclical models & instruments	Ralph’s structural queries	Pearl et al Primer
8	Berkson’s paradox; DAG testing	Ralph’s “Kitchen Sink” Fallacy; Ralph’s covariate selection	Pearl et al Primer
9	linear models & DAG testing	Ralph’s Path Coefficients, Ralph’s DAG	Pearl et al Primer
10	sampling selection bias	Ralph’s Technology Selection	notes on Sampling selection parametric sample selection
11,12	external validity & missing data	Ralph’s Transportability	Pearl and Bareinboim, 2014, Statistical Science, “External validity: From do-calculus to transportability across populations” SCM and meta-analysis Mohan and Pearl, 2020, Journal of the American Statistical Association, “Graphical models for processing missing data.” SCM graphs and missing data (examples 1, 2, and 10 from Mohan and Pearl [2020])
13,14	ETT, counterfactuals & partial identification	examples in SCM with diff-in-diff design	notes on SCM with diff-in-diff design notes on causes of effects Tian and Pearl, 2000, “Probabilities of causation: bounds and identification.” Mueller, Li, and Pearl, 2021, “Causes of effects: Learning individual responses from population data.”
15	treatment effects & counterfactuals	examples in Inverse propensity score weighting & SCM	notes on SCM and inverse propensity score weighting notes on ignorable treatment and inverse propensity-score weighting
16	maxent; treatment effects & counterfactuals	Ralph’s probability assignment; examples in entropy balanced causal effects	ch 4 Maximum entropy distributions; notes on entropy balanced causal effects; Hainmueller, 2012, Political Analysis, Entropy balancing for causal effects.

Papers:

Greenland, Pearl, and Robins, 1999, Epidemiology, Causal diagrams for epidemiology research.

Haavelmo, 1943, Econometrica, The statistical implications of a system of simultaneous equations.

Pearl, 2012, Prevention Science, The causal mediation formula – A guide to the assessment of pathways and mechanisms.

notes on mediation

Demerjian et al, forthcoming, Contemporary Accounting Research, Income smoothing and the usefulness of earnings for monitoring in debt contracting.

Bonsall, Green, and Muller, 2018, The Accounting Review, Are credit ratings more rigorous for widely covered firms?

Genser, Teles, Baretto, and Fischer, 2015, Environmental Health, Within- and between-group regression for improving the robustness of causal claims in cross-sectional analysis.

Petersen, 2009, Review of Financial Studies, Estimating standard errors in finance panel data sets: Comparing approaches.

Gow, Ormazabal, and Taylor, 2010, The Accounting Review, Correcting for cross-sectional and time-series dependence in accounting research.

Thompson, 2011, Journal of Financial Economics, Simple formulas for standard errors that cluster by both firm and time.

Wooldridge, 2011, Labour Lectures EIEF, Cluster samples and clustering.

Abadie, Athey, Imbens, Wooldridge, 2017, NBER, When should you adjust standard errors for clustering?

Abadie, Athey, Imbens, Wooldridge, 2020, Econometrica, Sampling-based versus design-based uncertainty in regression analysis.

Arellano, 1987, Oxford Bulletin of Economics and Statistics, Computing robust standard errors for within-groups estimators.

Arellano, 2011, Cluster-robust standard errors.

Heckman, Ichimura, Todd, 1997, Review of Economic Studies, Matching as an econometric evaluation estimator: Evidence from evaluating a job training programme.

Heckman, Ichimura, Smith, Todd, 1998, Econometrica, Characterizing selection bias using experimental data.

Bareinboim, Tian, and Pearl, 2014, AAAI Conference on Artificial Intelligence, Recovering from selection bias in causal and statistical inference.

notes on method of moments

notes on preference estimation via MSM

Duffie and Singleton 1993, Econometrica, Simulated moments estimation of Markov models of asset prices.

Kim, 2019, working paper, Spillover of financial reporting on public firm’s corporation’s investment.

Liang, 2019, working paper, How much value does accounting precision enhance?

Bareinboim and Pearl, 2016, Proceedings of the National Academy of Science, Causal inference and the data-fusion problem. (paper provides a nice summary of recent developments)

Knox, Lowe, and Mummolo, 2020, American Political Science Review, Administrative records mask racially biased policing.

Knox, Lowe, and Mummolo, 2020, on-line appendix, Administrative records mask racially biased policing.

Fryer, 2019, Journal of Political Economy, An empirical analysis of racial differences in policy use of force.

Potential project:

Identify a topic of interest.
Apply the above outline to the subject.
Simulate data and apply your research design.
If you have archival data, subject these data to your vetted design and interpret the results; otherwise, interpret the results for the simulated data.

“No successful mechanical algorithm for discovering causal or structural models has yet been produced, and it is unlikely that one will ever be found. At the same time, it is unlikely that the quest for a mechanical algorithm for determining causality from data will ever be abandoned. The tension between the use of tacit knowledge and formal algorithmic methods is likely to be a permanent feature of empirical research in economics. It arises because in most empirical studies there is always more knowledge about a problem being studied than appears in the sampling distributions of the measured variables being analyzed or in well-specified Bayesian priors. The best empirical work in economics uses economic theory as a framework for integrating all of the available evidence, tacit and algorithmic, to tell a convincing story.”

– James Heckman, Quarterly Journal of Economics 115, no. 1 Feb. 2000, p. 46.

Objectives:

The objectives are (i) to build a foundation for meaningful discussions, (ii) to complement econometrics (and statistics) courses (in my view, coursework provides an opportunity to survey what’s out there – this is an opportunity to develop a deeper understanding), and (iii) to explore implications for accounting.

Monograph: Accounting and causal effects: Econometric challenges (can be found below)
Work in progress: Accounting, managerial experimentation and causal effects (can be found below)
ASSETS (additional notes, some are repeated below, follow link on right side of page)
Bayesian Networks

Complementary Bayesian network exercises:

Pearl’s criteria for complete, consistent econometric treatment of structural causal modeling (SCM)

A. Applicability of Econometric Models

The author presents example problems that require causal reasoning.
The author presents example problems that require prediction alone.

B. Interpreting Model Parameters

The author states that each structural equation in the econometric model is meant to convey a causal relationship.
The author defines beta to be causal and the equality, betaX = E [Y|X], is NOT generally valid.
The author does NOT define the error term, in general, to be equal to the difference between E[Y|X] and Y.
The author interprets the error term as omitted variables that (together with X) determine Y.
The author states that each structural equation in the econometric model is meant to capture a ceteris paribus or “everything else held fixed” relationship.
The author does NOT assume that exogeneity of X is inherent to the model.

C. Distinguishing E [Y | X] and E [Y | do (X)]

The author makes clear the difference in the assumptions needed for answering causal as opposed to predictive problems?
The author uses separate notation for E [Y | X ] and E [Y | do (X )].
The author uses separate notation for the slope of the line associated with E [Y | X] and that associated with E [Y | do (X)].

Pearl’s fundamental structural causal modeling (SCM) questions

other papers and notes:

Gow, Larcker, and Reiss, “Causal inference in accounting research,” April 2015.
Pearl, Glymour, and Jewell, Causal Inference in Statistics: A Primer, Wiley, 2016. (and references therein)
Pearl and Mackenzie, The Book of Why: The New Science of Cause and Effect, Basic Books, 2018. (and references)
Angrist and Pischke, Mostly Harmless Econometrics: An Empiricist’s Companion, Princeton University Press, 2009.
Angrist and Pischke, Mastering Metrics: A Path from Cause to Effect, Princeton University Press, 2015.
Judea Pearl’s home page for numerous causal inference and Bayesian network papers
AAA 2012 slides: Accounting and causal effects
Imbens and Wooldridge’s NBER econometric minicourse
Fellingham monograph: Accounting as an information science
Ron Dye’s AAA survey of accounting theory
Ross, 2011, The Recovery Theorem.
Ross, 2015, “The recovery theorem,” Journal of Finance, April 2015, 615-648.
Barro, 2006, “Rare disasters and asset prices in the twentieth century, Quarterly Journal of Economics, 823-866.
Dybvig, Ross,2003 Arbitrage, state prices and portfolio theory, Handbook of the Economics of Finance.
Kelly, J. 1956“A new interpretation of information rate,” The Bell System Technical Journal, July, 917-926.
Information Synergy part 1: the Kelly-Ross theorem
Information Synergy part 2: Accounting and other information
Information Synergy part 3: belief updating
Li, Chen, 2012. “Mutual monitoring within the management team: A structural modeling investigation,” Carnegie-Mellon working paper.
Margiotta, MIller, 2000 “Managerial compensation and the cost of moral hazard,” International Economic Review 41 (3), 669-719.
Gayle, MIller 2009, “Has moral hazard become a more important factor in managerial compensation?” American Economic Review 99 (5), 1740-1769.
Gayle, Miller, 2012, “Identifying and testing models of managerial compensation,” Carnegie-Mellon working paper.

Outline:

Below is a list of potential topics for seminar. We surely will not cover them all (some will naturally get combined in the course of our discussions). It goes without saying, this is an ambitious, time and energy-taxing endeavor, but this seminar is purely optional (it’s an opportunity not an obligation). If you decide to participate, I would like you to choose a topic and tentative date for discussion. I believe you’ll get more out of seminar if you actively participate, therefore my plan for this summer is to give you some latitude in leading the discussion. Don’t think you have to exhaust the topic – that’s not possible, many of these topics warrant a course of their own. Do not expect closure.

There is some coverage of each of these issues in my monograph, Accounting and causal effects: Econometric challenges. But don’t think you have to limit yourself to these pages. The monograph is posted below along with some supplemental material developed (and still under development) subsequent to completion of the monograph along with some sample R programs.

As you know, linear algebra is an indispensable tool for these explorations. Amongst the supplemental materials is an appendix (not listed with the topics below) which includes a brief survey of some important concepts from linear algebra along with some examples.

Make use of the best tools at your disposal (spreadsheets, Mathematica, Matlab, etc.). I find R, the open source statistical software, to be invaluable especially for computationally intensive work like McMC simulation. Since it’s open source, it evolves rapidly and many of the contributors have fabulous programming skills that make R extraordinarily efficient. I plan to post a few sample programs to, hopefully, ease the learning curve but expect to discover your own tricks for gaining efficiency (sadly, not my forte).

Treatment effects

Treatment effects (a special case of causal effects) receive a substantial amount of attention in the topics below as well as in the monograph. Some key features (illustrated in the Tuebingen-style and supplementary pages examples) are

omitted, correlated variables (personally, I devote much of my efforts to this problem, in this case it leads to potential selection bias)
counterfactuals& inferring population-level parameters (this distinguishes the treatment effect problem from, say, a run of the mill endogenous regressor problem and is a source of controversy among scientists; see Dawid)
common support (how confident are you extrapolating outside the relevant range?)
there are numerous definitions of treatment effects (average treatment effect, average treatment effect on the treated, average treatment effect on the untreated, local average treatment effect, marginal treatment effect, policy-relevant treatment effect, ambiguity of treatment effects identified by linear IV, etc.) not all of them are accessible without full support and they are not all equally well suited to the problem at hand
choice of instruments is even more challenging than usual and greater care is required in interpreting the results
higher explanatory power (in either the selection or outcome equations) does not necessarily lead to a well-identified model

Topics:

Linear models (OLS, GLS, FWL, etc.)
- Equilibrium earnings management (chapters 2 & 3)
Fixed effects & differences-in-differences and random effects & random coefficients
Linear instrumental variables (IV)
Discrete choice models
Nonlinear regression
Maximum likelihood estimation (James-Stein shrinkage estimators)
Nonparametric & semiparametric regression
Bootstrapping & posterior simulation
McMC (Markov chain Monte Carlo) simulation (chapters 7 & 12 and supplemental materials Bayesian notes)
Strategic choice models
Treatment effects (chapter 8 Tuebingen-style examples)
Ignorable treatment effects (chapter 9 Tuebingen-style examples)
- Identification strategies: exogenous dummy variable regression, nonparametric regression, propensity score & propensity score matching, control functions, regression discontinuity design, etc. (mean conditional independence (ignorability) vis-a-vis conditional independence (strong ignorability))
- Asset revaluation regulation (limitations of outcome measures for inferring welfare effects; chapters 2& 9)
Nonignorable (IV) treatment effects (chapter 10 Tuebingen-style examples)
- Generalized Roy model interpretation
- Identification strategies: endogenous dummy variable IV regression, propensity score IV, ordinate control function IV, inverse Mills control function IV, LATE and linear IV, etc.
- IV control functions & projections examples (ANOVA, ANCOVA to control functions examples in supplemental materials projections notes)
- Continuous treatment & (correlated) random coefficients
- Regulated report precision (nonignorable versus ignorable identification strategy effectiveness; chapters 2 & 10)
Marginal treatment effects
- Identification and LIV (local instrumental variables)
- Regulated report precision (apparent nonnormality & marginal treatment effects; chapters 2 & 11)
Bayesian treatment effects
- Identification via McMC data augmentation
- Identification & McMC IV (restricted) data augmentation (supplemental materials Bayesian notes)
- Regulated report precision (marginal & average treatment effects as well as policy-relevant treatment effects; chapters 2 & 12)
Bayesian networks
Informed priors & Bayesian data analysis (probability as logic)
- Maximum entropy probability assignment & Jaynes’ widget problem
- Conjugate families (supplemental materials Bayesian notes)
- Inverting financial statements (to recover transactions)
- Smooth accruals (valuation and performance evaluation implications)
- Earnings management (stochastic& selective manipulation)
Strategic disclosure (some papers are listed at the bottom of the page)

Ross’ recovery theorem

Kelly’s long-run wealth maximizing investment strategy
Quantum production, synergy, and (product market) strategic disclosure
Other topics of interest to you

Monograph: Accounting and causal effects: Econometric challenges

table of contents for Accounting and causal effects: Econometric challenges

ch 1 Introduction
ch 2 Accounting choice
ch 3 Linear models
- including OLS, GLS, FWL, and IV estimators
ch 4 Loss functions & estimation
- including MLE, James-Stein shrinkage estimators, and nonlinear regression
ch 5 Discrete choice models
- employed as propensity score in treatment effect analysis
ch 6 Nonparametric regression
- employed with Heckman’s MTE
ch 7 Repeated-sampling inference
- including McMC Bayesian analysis
ch 8 Overview of endogeneity
ch 9 Treatments effects: ignorability
ch 10 Treatment effects: IV
ch 11 Marginal treatment effects
ch 12 Bayesian treatment effects
ch 13 Informed priors
- including widget example
- including inferring transactions from financial statements
Appendix: asymptotic theory
- including convergence in probability, convergence in distribution, and rates of convergence
References
Index
Errata (corrections for the 2010 Springer printed version)

Supplemental materials: (most of this is work in progress – expect updates)

Probability as logic
Accounting, managerial experimentation and causal effects
- ch 1 Introduction
  - stewardship and the search for a better “mouse trap”
- ch 2 Classical linear models
  - ANOVA, ANCOVA, linear regression
  - double residual regression (FWL)
- ch 3 Classical causal effect strategies
  - ignorable treatment
  - limited common support
  - regression discontinuity designs
  - synthetic controls
  - dynamic treatment effects
  - LATE& 2SLS (linear) IV
  - control functions
  - continuous treatment effects
  - Differenece-in-difference causal effect strategies
  - Propensity-score matched regression strategies
  - Saturated propensity-score matched regression
  - Quantile treatment effect strategies
  - Causal effects In controlled experiments
  - Fixed effects identification of causal effects
  - Entropy balanced causal effects
- ch 4 Maximum entropy distributions
  - Bayes’ theorem & consistent reasoning
  - maximum entropy probability assignment
- ch 5 Loss functions
- ch 6 Conjugate families
- ch 7 Bayesian simulation
  - posterior simulation & conjugate families
  - independent& conditional posterior simulation
  - Markov chain Monte Carlo (McMC) simulation
  - irreducibility, time reversibility, & stationarity
  - Gibbs sampler
  - Metropolis-Hastings (MH) algorithm
  - data augmented sampler for Probit
  - random walk MH logit
  - uniform data augmented Gibbs sampler for logit
- ch 8 Bayesian regression
  - ANOVA, ANCOVA, regression
- ch 9 Bayesian causal effect strategies
  - data augmented Gibbs sampler for treatment effects
  - data augmented IV restricted Gibbs sampler for treatment effects
- ch 10 Bayesian treatment effects without joint distribution of outcomes
  - Chib’s ATE and LATE
  - extensions (via bounding) to ATT and ATUT
- ch 11 Partial identification and missing data
  - Missing outcomes
    - quantiles
    - expected values (point and partial identification)
    - inference
    - respecting stochastic dominance
    - refutability
    - missing covariates and outcomes
    - missing at random
    - missing by choice
    - stochastic dominance and treatment effects
  - Selection problem
    - various instrumental variable strategies (missing at random, statistical independence, means missing at random, mean independence)
    - monotone instrumental variable strategies (mean monotonicity, exogenous treatment selection, monotone treatment selection, monotone treatment response, MM-MTR, MTR-MTS)
    - quantile treatment effects (point and partial identification)
  - Extrapolation and the mixing problem
  - Appendix: bounds on spreads
- appendix
  - linear algebra basics
  - fundamental theorem of linear algebra
    - subspaces
    - exactly, under-, and over-identified
  - matrix decomposition
    - LU factorization
    - Cholesky decomposition
    - singular value decomposition (SVD) & pseudo inverse
    - spectral decomposition
  - Gram-Schmidt orthogonalization
  - some determinant identities (& LU factorization)
  - iterated expectations
  - multivariate normal theory
  - generalized least squares (GLS)
  - two stage least squares IV (2SLS-IV)
  - seemingly unrelated regression (SUR)
  - maximum likelihood estimation of discrete choice models
  - quantum information
  - common distributions

R Statistical Computing Package

R is a freely available, open-source version of S/S-plus.
Follow this link: R statistical computing package

R samples programs

Tutorials:

R tutorial
http://en.wikibooks.org/wiki/R_Programming/
find your favorite on the web

source files for data augmented Gibbs sampler bivariate probit:

you will need to add bayesm library from Cran website if it’s not including in your base program

Gibbs probit.R
source files for Metropolis-Hastings bivariate logit (self-contained code and code utilizing MCMCpack library which you may need to add)
MH logit.R

source files for data augmented Gibbs sampler bivariate selection analysis of treatment effects:

you will need to add MASS library from Cran website if it’s not including in your base program

Another Bayesian McMC strategy for identifying treatment effects (perhaps, with panel data) can be found in Chib’s papers posted below:

Strategic disclosure papers include:

Shin, 1994, “News management and the value of firms,” RAND Journal of Economics, 25(1), 58-71.
Shin, 2003, “Disclosures and asset returns,” Econometrica, 71(1), 105-133.
Shin, 2006, “Disclosure risk and price drift,” Journal of Accounting Research, 44(2), 351-379.
Arya, Mittendorf. forthcoming, “The interaction between corporate tax structure and disclosure policy,” Annals of Finance.
Arya Mittendorf, 2010, “Input markets and the strategic organization of the firm,” Foundations and Trends in Accounting, 5(1), 1-97.