Thu 2024-Dec-05

US Political Parties and the Budget Deficit

Tagged: MathInTheNews / Politics / Statistics

Previously, we looked at the party control of the 3 US government branches and their effect on the economy. We unaccountably left out the budget deficit, so let’s have a look at that: Republicans are commonly thought to be good for the budget, but are they really?

The Budget Data

We’re really just redoing the analysis that previously told us Republicans were bad for unemployment (U6) and inflation (CPI-U). ^[1]

Now let’s look at the budget deficit. We’ll look at the deficit itself, as well as the deficit as a percent of GDP, to account for economic growth. As before, we’ll look at the era from 1977 to the present, since before 1977 US politics (especially Republican politics) looked quite different.
St. Louis Fed: US government deficit 1977-2023, in millions, annually reported on Sep-30

Budget surplus/deficit: Here we used the time series FYFSD, from the FRED databases of the St. Louis Fed. ^[2] It reports the annual federal government’s budget deficit (negative) or surplus (positive), in millions of dollars. (Yes, there were some years of budget surplus under Clinton in the late 90s.) It looks as shown here, reported on Sept 30 of each year since that’s the federal fiscal year-end. The collected data run 1977 - 2023.

As before, we combine 2 years by adding together the deficits, to get a composite figure for each congressional term. Then we compare this to the previous congressional term, and annotate a term as “Better” if the deficit was a less negative number, or “Worse” if the deficit went further negative. This gives us an idea of whether the policies of that term moved the budget needle or not, in raw dollar terms.

Since these figures are (a) over short time spans and (b) relative to the previous term, we don’t need to correct for inflation.

St. Louis Fed: US government deficit 1977-2023, as % of GDP, annually reported on Jan-01

Budget surplus/deficit as a % of GDP: But even if we don’t need to correct for inflation, it may make sense to correct for the size of the GDP, which does, after all, grow! We should be less worried about a deficit if the economy is dramatically larger. There is also a counter-cyclical effect, where the deficit grows in times of recession due to falls in income and hence government tax revenue.

To do this, we used the time series FYFSGDA188S, also from the FRED databases of the St. Louis Fed. ^[3] It takes the annual deficit data in the FYFSD series, and combines them with the GDP series GDPA ^[4], in the following way:
$\mbox{FYFSGDA188S} = \frac{\mbox{FYFSD} / 1000}{\mbox{GDPA}} \times 100$
The factor of 1000 is because FYFSD is reported in millions, while GDPA is reported in billions and those are a factor of 1000 different. The final factor of 100 is to report the ratio as a percentage. (We did not do this calculation ourselves, but accepted the FRED database’s version of it. No need to check the Fed’s arithmetic!)

The collected data here also run 1977 - 2023.

Then, when combining years into a figure for a 2-year congressional term, we add the percentages. Thus for a congressional term, the max would be a 200% figure.

All these datasets were loaded, transformed into Better/Worse binaries, and joined with the partisanship data from the previous analysis by a new R script analogous to the previous one. ^[5]

The composite dataset it constructs, consisting of years, congressional terms, partisan ship of House/Senate/Presidency, deficit data, and binarization to Better/Worse, is also available as a tab-separated text file for inspection. ^[6] It is also shown here in all its glory.

(NB: “All its glory” is, of course, self-directed sarcasm. There are only 23 data points, so not enough to do cross-validation and L1 LASSO regularization, which are kind of the bare minimum. This analysis is exploratory, at best.)

(NB: Also, the indicator variables are “NA” for 1977, since they are based on a change with respect to the previous year. We’ve elected here to chop of the data at 1977 since before that we’re into Nixon and a different world. We could, perhaps, have included 1976 just for purposes of the first year’s difference, but we didn’t do that.)

Analyses, or Why Jumping to Conclusions is Silly & Dangerous

As before, we performed a logistic regression using both the deficit and deficit as a percent of GDP, predicting from the partisanship of the 3 branches of the federal government:

$\log{\frac{\Pr(\mbox{Worse})}{\Pr(\mbox{Better})}} = \beta_1 \mbox{PartyPresident} + \beta_2 \mbox{PartyHouse} + \beta_3 \mbox{PartySenate} + \alpha$

Here, the R regression software glm() took the D level of each variable as the base, and the R level as the contrast with respect to that baseline. What this means is that a positive regression coefficient will mean that Republicans in that branch are generally bad; a negative regression coefficient will mean Democrats in that branch are generally bad.

Also as before, we did not attempt to include interaction terms to account for when one party has more than 1 branch under its control. The power of a trifecta is certainly real, but that ends up being an 8-parameter model which we simply cannot fit with only 23 data points!

Since we don’t have the data to crossvalidate, we’ll again be satisfied with:

We’ll assess the statistical significance of the fits with the Akaike Information Criterion (AIC). This is a penalized log-likelihood measure, in which models with more parameters are penalized and thus have to achieve better log likelihoods. The preferred model has the smaller AIC. This will permit us to compare the 4- and 8-parameter models above on an even basis. It is also as close as we can come to fighting over-fitting, without enough data to power LASSO.

It will not, however, tell us a significance cutoff threshold. That’s ok, we’re not crossvalidating due to lack of data anyway.
To assess the strength of the model, we’ll run each regression model on its own training data (A cardinal sin! Or at least an ordinal sin?). That will give us a probability of “Worse” for each congressional term. We’ll sort by that probability, and then do a Mann-Whitney rank test to see if the Worse/Better outcomes are statistically significantly sorted.

That is to say, a significant Mann-Whitney $p$ -value will mean at least we can predict well on our own training set. If we can’t even do that, the model is nonsense. If we can, it might mean something, but we don’t have the data to crossvalidate.

The Deficit Itself

Here’s the result of the logistic regression for the deficit:

Coefficients:
                 Estimate Std. Error z value Pr(>|z|)  
(Intercept)      0.396021   0.941318   0.421   0.6740  
PartyPresidentR  2.577359   1.337714   1.927   0.0540 .  
PartyHouseR     -2.570177   1.475261  -1.742   0.0815 .   
PartySenateR    -0.009574   1.540517  -0.006   0.9950  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

(Dispersion parameter for binomial family taken to be 1)

    Null deviance: 29.767  on 21  degrees of freedom
Residual deviance: 17.846  on 18  degrees of freedom
AIC: 25.846

Number of Fisher Scoring iterations: 5

This says a Republican president is bad news, but a Democratic House is also bad news. Both of those are just marginally below statistical significance with $t$ -test $p$ -values of 5.4% and 8.1%, respectively. The Senate is more or less irrelevant.

Does it predict much of anything? Let’s use the model to compute $\hat{p}$ , the model’s probability that things will get worse. Sort by that, and then do a Mann-Whitney test to see if Better/Worse are enriched at the top/bottom of the list.

The result looks like this, with Mann-Whitney very significant at $p \sim 0.0045$ :

       Years PartyPresident PartyHouse PartySenate DepVar      pHat
1995-1996              D          R           R Better 0.1012211
1997-1998              D          R           R Better 0.1012211
1999-2000              D          R           R Better 0.1012211
2015-2016              D          R           R Better 0.1012211
2011-2012              D          R           D Better 0.1020954
2013-2014              D          R           D Better 0.1020954
2003-2004              R          R           R  Worse 0.5971560
2005-2006              R          R           R Better 0.5971560
2017-2018              R          R           R  Worse 0.5971560
1979-1980              D          D           D  Worse 0.5977312
1993-1994              D          D           D Better 0.5977312
2009-2010              D          D           D  Worse 0.5977312
2021-2022              D          D           D  Worse 0.5977312
2001-2002              R          R           D  Worse 0.5994569
1981-1982              R          D           R  Worse 0.9509119
1983-1984              R          D           R  Worse 0.9509119
1985-1986              R          D           R  Worse 0.9509119
2019-2020              R          D           R  Worse 0.9509119
1987-1988              R          D           D Better 0.9513569
1989-1990              R          D           D  Worse 0.9513569
1991-1992              R          D           D  Worse 0.9513569
2007-2008              R          D           D  Worse 0.9513569

Note that, as we interpreted above, the R values are enriched at the bottom (Worse) end of the presidency column, but the D values are at the bottom of the House column. The Senate column is mixed.

Conclusion: We have exploratory (non-crossvalidated, non-regularized, underpowered) evidence that Republicans are bad in the Presidency, but Democrats are bad in the House, and that the Senate doesn’t matter.

The Deficit as a % of GDP

Now let’s repeat that analysis using the deficit as a % of the GDP as the variable being predicted:

Coefficients:
                Estimate Std. Error z value Pr(>|z|)  
(Intercept)     -2.01936    1.22494  -1.649   0.0992 .
PartyPresidentR  2.80601    1.28211   2.189   0.0286 *
PartyHouseR     -0.33920    1.17607  -0.288   0.7730  
PartySenateR     0.04027    1.16236   0.035   0.9724  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

(Dispersion parameter for binomial family taken to be 1)

    Null deviance: 29.767  on 21  degrees of freedom
Residual deviance: 21.690  on 18  degrees of freedom
AIC: 29.69

Number of Fisher Scoring iterations: 4

Again the presidency is significant, with Republicans there being bad, and statistically significant with $p \sim 2.9\%$ . However, neither the House nor the Senate are statistically significant ( $p \sim 77\%$ and $p \sim 97\%$ , respectively).

Doing the same trick as above, making predictions via the logistic regression $\hat{p}$ , sorting, and doing Mann-Whitney we also get a statistically significant prediction at $p \sim 0.015$ :

       Years PartyPresident PartyHouse PartySenate DepVar       pHat
2011-2012              D          R           D Better 0.08638853
2013-2014              D          R           D Better 0.08638853
1995-1996              D          R           R Better 0.08962004
1997-1998              D          R           R Better 0.08962004
1999-2000              D          R           R Better 0.08962004
2015-2016              D          R           R Better 0.08962004
1979-1980              D          D           D Better 0.11718570
1993-1994              D          D           D Better 0.11718570
2009-2010              D          D           D  Worse 0.11718570
2021-2022              D          D           D Better 0.11718570
2001-2002              R          R           D  Worse 0.61003387
2003-2004              R          R           R  Worse 0.61956964
2005-2006              R          R           R Better 0.61956964
2017-2018              R          R           R  Worse 0.61956964
1987-1988              R          D           D Better 0.68711157
1989-1990              R          D           D  Worse 0.68711157
1991-1992              R          D           D  Worse 0.68711157
2007-2008              R          D           D Better 0.68711157
1981-1982              R          D           R  Worse 0.69570273
1983-1984              R          D           R  Worse 0.69570273
1985-1986              R          D           R Better 0.69570273
2019-2020              R          D           R  Worse 0.69570273

Consistent with our interpretation from the regression coefficients, the Presidency column is enriched with R’s at the bottom/Worse end. The House and Senate are mixed.

Conclusion: We have exploratory (non-crossvalidated, non-regularized, underpowered) evidence that Republicans are bad in the Presidency, and that the House and Senate don’t matter much.

Summary

The fit of the raw deficit is slightly better quality (lower AIC), and slightly more Mann-Whitney significant in predicting. On the other hand, the deficit/GDP model includes important information about the growth of the economy over the period studied. Also, the Presidency regression coefficient when using GDP to scale the deficit is statistically significant, whereas none of the coefficients are when just using just the deficit.

In either case, they agree that Republican Presidents are bad. They differ as to whether Democrats in the House matter or not. Neither model cares about the Senate.

The Weekend Conclusion

Republican presidents are bad news for the deficit, whether by itself or as a percent of GDP.
Other branches and whether Democrats occupy them are arguable either way, at least according to this rather small dataset that couldn’t be cross-validated or LASSO regularized.

(Ceterum censeo, Trump incarcerandam esse.)

Notes & References

1: Weekend Editor, “US Political Parties and the Economy”, Some Weekend Reading blog, 2024-Oct-28. We will be re-using the script from this post for this analysis. ↩

2: St. Louis Fed Staff, “Federal Surplus or Deficit (FYFSD)”, FRED databases, US Federal Reserve Bank of St Louis, data from 1977 – 2023 retrieved 2024-Dec-04. A local copy is archived here for peer review purposes. ↩

3: St. Louis Fed Staff, “Federal Surplus or Deficit - as Percent of Gross Domestic Product (FYFSGDA188S)”, FRED databases, US Federal Reserve Bank of St Louis, data from 1977 – 2023 retrieved 2024-Dec-04. A local copy is archived here for peer review purposes. ↩

4: St. Louis Fed Staff, “Gross Domestic Product”, US Federal Reserve Bank of St Louis, data from 1977 – 2023 retrieved 2024-Dec-04. ↩

5: Weekend Editor, “R script to analyze partisanship of government branches and budget deficit”, Some Weekend Reading blog, 2024-Dec-05.

There is also available a text transcript of running this script, so you can check that it says what I say it says.↩

6: Weekend Editor, “Consolidated Congressional Term Dataset of Partisanship of Federal Government Branches and Economic Results, 1977-2023”, Some Weekend Reading blog, 2024-Dec-05. ↩

Published Thu 2024-Dec-05

Gestae Commentaria

Comments for this post are closed pending repair of the comment system, but the Email/Twitter/Mastodon icons at page-top always work.

Reserved for future page visit counter.
Site last built:	Thu 2025-Apr-03 14:26 UTC