Citation Reference

Purpose: Lightweight citation index for verifying and deduplicating citations accumulated during pipeline execution. Used by the orchestrator for citation management and by the report-writer and data-verifier for completeness checking. Audience: Orchestrator, report-writer, data-verifier When to read: Orchestrator consults on-demand when managing STATE.md citations. Report-writer loads at Stage 11 for verification. Data-verifier loads at Stage 12.

Citation Philosophy

Three questions determine whether something warrants citation in a DAAF report:

Did this method or tool directly produce an analytical result? (e.g., pyfixest ran the regression, not polars which just loaded the data)
Would a different methodological choice here have changed the findings? (e.g., choosing Callaway-Sant'Anna over TWFE for staggered DiD)
Does the creator deserve credit for enabling this specific work? (e.g., a survey analysis library whose weighting implementation you relied on)

If yes to any — include with rationale. If no to all — omit.

Parsimony principle: A report with 5 well-justified citations is better than one with 30 perfunctory ones. Every citation should earn its place.

Inclusion Thresholds

Software & Tools

Cite when the library's analytical functionality drives a result. Do NOT cite for routine data wrangling (e.g., using polars to load a parquet file).

Library	Canonical Citation	Cite When	Do NOT Cite When
DAAF	Kim, B.H. (2026). DAAF: Data Analyst Augmentation Framework (Version 2.0.0) [Computer software]. https://github.com/DAAF-Contribution-Community/daaf	Always (pre-populated in STATE.md)	—
pyfixest	Berge, L., Butts, K., & McDermott, G. (2026). pyfixest: Fast high-dimensional fixed effects estimation [Computer software]. Based on fixest (R).	Used for regression estimation or DiD	Only imported but not used for estimation
statsmodels	Seabold, S. & Perktold, J. (2010). "Statsmodels: Econometric and Statistical Modeling with Python." Proceedings of the 9th Python in Science Conference.	Used for GLM, time series, or statistical testing	Only used for post-estimation diagnostics supporting another library's estimation
linearmodels	Sheppard, K. linearmodels: Econometric models for panel, IV, and system regression [Computer software]. https://bashtage.github.io/linearmodels/	Used for panel, IV/GMM, or system estimation	—
scikit-learn	Pedregosa, F. et al. (2011). "Scikit-learn: Machine Learning in Python." Journal of Machine Learning Research, 12, 2825-2830.	Used for ML models, clustering, or dimensionality reduction	Only used for a single preprocessing step
geopandas	Jordahl, K. et al. geopandas: Python tools for geographic data [Computer software]. https://geopandas.org/	Used for spatial operations, joins, or mapping	Only used to read a shapefile for reference
PySAL	Rey, S.J. et al. (2022). "The PySAL Ecosystem of Open-Source Python Packages for the Analysis of Spatial Data." Geographical Analysis, 54(3), 467-487.	Used for spatial weights, autocorrelation, or spatial regression	—
svy	Diallo, M.S. svy: Python package for complex survey sampling and analysis [Computer software]. (Formerly samplics.)	Used for survey-weighted estimation	—
polars	Vink, R. et al. Polars: Blazingly fast DataFrames [Computer software]. https://pola.rs/	Core data processing engine for the analysis	Only used for trivial file I/O
plotnine	Kibirige, H. et al. plotnine: Grammar of graphics for Python [Computer software]. https://plotnine.org/	Primary visualization library producing report figures	Only used for a quick exploratory plot
plotly	Plotly Technologies Inc. Plotly: Interactive graphing library [Computer software]. https://plotly.com/	Primary visualization library producing report figures	Only used for a quick exploratory plot
marimo	marimo team. marimo: Reactive Python notebook [Computer software]. https://marimo.io/	Always (analysis notebook is a marimo notebook)	—

Methodological References

Cite the primary citation per method — the one paper you would cite in a journal article, not a comprehensive bibliography.

Causal Inference

Method	Primary Citation	Cite When
DiD (staggered, Callaway-Sant'Anna)	Callaway, B. & Sant'Anna, P.H.C. (2021). "Difference-in-Differences with Multiple Time Periods." Journal of Econometrics, 225(2), 200-230.	Callaway-Sant'Anna estimator is the primary identification strategy
DiD (TWFE heterogeneity concerns)	Goodman-Bacon, A. (2021). "Difference-in-Differences with Variation in Treatment Timing." Journal of Econometrics, 225(2), 254-277.	Bacon decomposition or TWFE bias discussion
DiD (did2s)	Gardner, J. (2022). "Two-Stage Differences in Differences." arXiv:2207.05943.	did2s estimator used
DiD (Sun-Abraham)	Sun, L. & Abraham, S. (2021). "Estimating Dynamic Treatment Effects in Event Studies with Heterogeneous Treatment Effects." Journal of Econometrics, 225(2), 175-199.	Interaction-weighted estimator used
DiD (imputation)	Borusyak, K., Jaravel, X., & Spiess, J. (2024). "Revisiting Event-Study Designs: Robust and Efficient Estimation." Review of Economic Studies, 91(6), 3253-3285.	Imputation estimator used
DiD (doubly robust)	Sant'Anna, P.H.C. & Zhao, J. (2020). "Doubly Robust Difference-in-Differences Estimators." Journal of Econometrics, 219(1), 101-122.	Doubly robust DiD estimation used
DiD (TWFE negative weights)	de Chaisemartin, C. & D'Haultfoeuille, X. (2020). "Two-Way Fixed Effects Estimators with Heterogeneous Treatment Effects." American Economic Review, 110(9), 2964-2996.	TWFE decomposition or negative weight analysis performed
DiD (sensitivity analysis)	Rambachan, A. & Roth, J. (2023). "A More Credible Approach to Parallel Trends." Review of Economic Studies, 90(5), 2555-2591.	Sensitivity analysis for parallel trends violations performed
DiD (practitioner's guide)	Baker, A., Callaway, B., Cunningham, S., Goodman-Bacon, A., & Sant'Anna, P.H.C. (Forthcoming). "Difference-in-Differences Designs: A Practitioner's Guide." Journal of Economic Literature. arXiv:2503.13323.	DiD is the primary identification strategy (comprehensive methodology reference)
IV (weak instruments)	Staiger, D. & Stock, J.H. (1997). "Instrumental Variables Regression with Weak Instruments." Econometrica, 65(3), 557-586.	IV estimation with first-stage F-test
RD design	Cattaneo, M.D., Idrobo, N., & Titiunik, R. (2020). A Practical Introduction to Regression Discontinuity Designs. Cambridge University Press.	RD is primary identification strategy
Synthetic control	Abadie, A., Diamond, A., & Hainmueller, J. (2010). "Synthetic Control Methods for Comparative Case Studies." Journal of the American Statistical Association, 105(490), 493-505.	Synthetic control method used
Propensity score	Rosenbaum, P.R. & Rubin, D.B. (1983). "The Central Role of the Propensity Score in Observational Studies for Causal Effects." Biometrika, 70(1), 41-55.	Propensity score matching or weighting used

Survey Analysis

Method	Primary Citation	Cite When
Complex survey design	Heeringa, S.G., West, B.T., & Berglund, P.A. (2017). Applied Survey Data Analysis (2nd ed.). CRC Press.	Survey-weighted analysis with stratification/clustering
Weighting decisions	Solon, G., Haider, S.J., & Wooldridge, J.M. (2015). "What Are We Weighting For?" Journal of Human Resources, 50(2), 301-316.	Explicit weighting methodology discussion

Machine Learning

Method	Primary Citation	Cite When
Prediction vs explanation	Shmueli, G. (2010). "To Explain or to Predict?" Statistical Science, 25(3), 289-310.	ML methods used for prediction in social science context
Causal forests	Wager, S. & Athey, S. (2018). "Estimation and Inference of Heterogeneous Treatment Effects Using Random Forests." Journal of the American Statistical Association, 113(523), 1228-1242.	Heterogeneous treatment effect estimation
SHAP	Lundberg, S.M. & Lee, S.-I. (2017). "A Unified Approach to Interpreting Model Predictions." NeurIPS.	SHAP values used for model interpretation

Geospatial

Method	Primary Citation	Cite When
Spatial autocorrelation	Anselin, L. (1995). "Local Indicators of Spatial Association--LISA." Geographical Analysis, 27(2), 93-115.	Moran's I or LISA computed

Decomposition

Method	Primary Citation	Cite When
Oaxaca-Blinder	Blinder, A.S. (1973). "Wage Discrimination: Reduced Form and Structural Estimates." Journal of Human Resources, 8(4), 436-455; Oaxaca, R. (1973). "Male-Female Wage Differentials in Urban Labor Markets." International Economic Review, 14(3), 693-709.	Oaxaca-Blinder decomposition performed

Clustered Standard Errors

Method	Primary Citation	Cite When
Clustering guidance	Abadie, A., Athey, S., Imbens, G.W., & Wooldridge, J.M. (2023). "When Should You Adjust Standard Errors for Clustering?" Quarterly Journal of Economics, 138(1), 1-35.	Clustered SEs used with explicit justification for clustering level

Reporting Standards

Standard	Citation	Cite When
GUIDE-LLM	Feuerriegel, S. et al. (2026). "Generative AI Models in Science: Risks and Opportunities -- The GUIDE-LLM Checklist."	AI disclosure section present (always in DAAF reports)
Do No Harm Guide	Schwabish, J. & Feng, A. (2021). Do No Harm Guide: Applying Equity Awareness in Data Visualization. Urban Institute.	Equity-sensitive visualizations or race/ethnicity data
Causal language	Haber, N.A. et al. (2022). "Causal and Associational Language in Observational Health Research." American Journal of Epidemiology, 191(12), 2020-2028.	Causal claims made or explicitly hedged

Accumulation Protocol

After each Stage 6 script: Orchestrator extracts data source citation from research-executor output and appends to STATE.md > Citations Accumulated > Data Sources.
After each Stage 7-8 script: Orchestrator extracts method and software citations from research-executor output and appends to STATE.md > Citations Accumulated > Methodological References and/or Software & Tools. Deduplicate by checking if the citation already exists in STATE.md.
At project setup: Orchestrator pre-populates the DAAF, marimo, and GUIDE-LLM citations in STATE.md (these are always present).
At Stage 11: Report-writer reads STATE.md > Citations Accumulated as the primary source for the report's References section. Consults this file for verification if needed.
At Stage 12: Data-verifier checks that all accumulated citations appear in the report and that no uncited methods or tools are present.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Citation Reference

Citation Philosophy

Inclusion Thresholds

Software & Tools

Methodological References

Causal Inference

Survey Analysis

Machine Learning

Geospatial

Decomposition

Clustered Standard Errors

Reporting Standards

Accumulation Protocol

FilesExpand file tree

CITATION_REFERENCE.md

Latest commit

History

CITATION_REFERENCE.md

File metadata and controls

Citation Reference

Citation Philosophy

Inclusion Thresholds

Software & Tools

Methodological References

Causal Inference

Survey Analysis

Machine Learning

Geospatial

Decomposition

Clustered Standard Errors

Reporting Standards

Accumulation Protocol