This course is now roughly 10 13 years old (first run: Fall 2010). The pretty dirty repo history shows it. It's time to think of version 2.0, although version 1.0 never got its release tag (release tags did not even exist when we started).
And perhaps even more importantly, but (perhaps, even) more time-consumingly:
Additional do-files
My many TODO files from 2017, (especially) 2018, 2019, 2020 have suggestions of extra do-files to create — shorter ones, ones that cover extra stuff beyond the scope of the course (e.g. merging, panel data).
I also have some very short "demo" do-files that I use in the first hour, as recaps of the previous session + introduction to the second hour of the current one.
Use that as an opportunity to…
Bonus do-files (which will move out some stuff from the main ones, and will cover some intermediate/advanced topics):
xtra01-pca -- plot a map + demo PCA (see below)
xtra02-merge -- download additional data from online + merge
xtra03-svy -- survey weights: WVS 99-04
xtra04-bootstrap -- survey weights: NHIS 2017 (repeat?) + bootstrap
xtra05-export -- export descriptive stats with estout
xtra06-tests -- survey weights: ESS 2008 (repeat?) + other association tests with ranks
xtra07-ts -- QOG time series with (extract of) 2023 edition? (serial correlation)
xtra08-panels -- robust and clustered SEs, fixed and random effects with QOG time series
xtra09-export -- export regression results with estout
xtra10-logit -- AUC/ROC, predicted probabilities, ordinal logit, multinomial (?)
xtra11-mfx -- marginal effects, bootstrap (already there at end, remove)
xtra12-count -- survey weights: GSS + neg binomial, count, Poisson etc.?
PCA example:
pca popgrowth-safewater
scoreplot, ms(i) mlab(country)
// note: tried using `kountry` to convert country names, failed so far
loadingplot
// demo arch effect, no strong 2nd dimension
pca lexp-safewater
scoreplot
Leaves out:
- MCA
- quantile regression, L1 (lasso), L2 (ridge)
- bootstrapped SEs in models
- Bayesian models
- multilevel models
Beyond teaching
I once considered publishing the Stata Guide, but publishing a Stata Guide, even though some publishers would take it, sounds bizarre in 2021. R is the current standard, with Julia and Python probably coming next or along.
- At least look at LeanPub, like Roger D. Peng
- Ask PSIA or the Presses de Sciences Po about it
- Go for Sage, like some kind of updated, no-menus Mehmetoglu and Jakobsen?
This course is now roughly
1013 years old (first run: Fall 2010). The pretty dirty repo history shows it. It's time to think of version 2.0, although version 1.0 never got its release tag (release tags did not even exist when we started).version nversion 13to 'freeze' some commands, e.g.table,margins?setup, especially unpublished ones; see also svyplot #26lobbying.dta(Baumgartner),ebm2009(Eurobarometer)And perhaps even more importantly, but (perhaps, even) more time-consumingly:
Additional do-files
My many
TODOfiles from 2017, (especially) 2018, 2019, 2020 have suggestions of extra do-files to create — shorter ones, ones that cover extra stuff beyond the scope of the course (e.g. merging, panel data).I also have some very short "demo" do-files that I use in the first hour, as recaps of the previous session + introduction to the second hour of the current one.
Use that as an opportunity to…
estoutproperly? (both for "Table 1" and regression tables)week01,week02…week12for obsessive neatness?havetoo much work: streamline theweek0*-recapdo-files with just the essentialsweek**onesxtra01toxtra12-- one 'bonus' do-file per week (see below)Bonus do-files (which will move out some stuff from the main ones, and will cover some intermediate/advanced topics):
xtra01-pca-- plot a map + demo PCA (see below)xtra02-merge-- download additional data from online + mergextra03-svy-- survey weights: WVS 99-04xtra04-bootstrap-- survey weights: NHIS 2017 (repeat?) + bootstrapxtra05-export-- export descriptive stats withestoutxtra06-tests-- survey weights: ESS 2008 (repeat?) + other association tests with ranksxtra07-ts-- QOG time series with (extract of) 2023 edition? (serial correlation)xtra08-panels-- robust and clustered SEs, fixed and random effects with QOG time seriesxtra09-export-- export regression results withestoutxtra10-logit-- AUC/ROC, predicted probabilities, ordinal logit, multinomial (?)xtra11-mfx-- marginal effects,bootstrap(already there at end, remove)xtra12-count-- survey weights: GSS + neg binomial, count, Poisson etc.?PCA example:
Leaves out:
Beyond teaching
I once considered publishing the Stata Guide, but publishing a Stata Guide, even though some publishers would take it, sounds bizarre in 2021. R is the current standard, with Julia and Python probably coming next or along.