Predicting College Closures: Now with Machine Learning

Small, rural, private schools stand out to me as the most likely to show up on lists of closed colleges. This summer I discussed a 2020 paper by Robert Kelchen that identified additional predictors using traditional regression:

sharp declines in enrollment and total revenue, that were reasonably strong predictors of closure. Poor performances on federal accountability measures, such as the cohort default rate, financial responsibility metric, and being placed on the most stringent level of Heightened Cash Monitoring

Kelchen just released a Philly Fed working paper (joint with Dubravka Ritter and Doug Webber) that uses machine learning and new data sources to identify more predictors of college closures:

The current monitoring solution to predicting the financial distress and closure of institutions — at least at the federal level — is to provide straightforward and intuitive financial performance metrics that are correlated with closure. These federal performance metrics represent helpful but suboptimal measures for purposes of predicting closures for two reasons: data availability and predictive accuracy. We document a high degree of missing data among colleges that eventually close, show that this is a key impediment to identifying institutions at risk of closure, and also show how modern machine learning algorithms can provide a concrete solution to this problem.

The paper also provides a great overview of the state of higher ed. The sector is currently quite large:

The American postsecondary education system today consists of approximately 6,000 colleges and universities that receive federal financial aid under Title IV of the federal Higher Education Act…. American higher education directly produces approximately $700 billion in expenditures, enrolls nearly 25 million students, and has approximately 3 million employees

Falling demand from the demographic cliff is causing prices to fall, in addition to closures:

Between the early 1970s and mid-2010s, listed real tuition and fee rates more than tripled at public and private nonprofit colleges, as strong demand for higher education allowed colleges to continue increasing their prices. But since 2018, tuition increases have consistently been below the rate of inflation

Most college revenue comes from tuition or from state support of public schools; gifts and grants are highly concentrated:

Research funding is distributed across a larger group of institutions, although the vast majority of dollars flows to the 146 institutions that are designated as Research I universities in the Carnegie classifications…. Just 136 colleges or university systems in the United States had endowments of more than $1 billion in fiscal year 2023, but they account for more than 80 percent of all endowment assets in American higher education. Going further, five institutions held 25 percent of all endowment assets, and 25 institutions held half of all assets

Now lets get to closures. As I thought, size matters:

most institutions that close are somewhat smaller than average, with the median closed school enrolling a student body of about 1,389 full-time equivalent students several years prior to closure

As does being private, especially private for-profit (states won’t bail you out when you lose money):

As do trends:

variables measuring ratios of financial metrics and those measuring changes in covariates are generally more important than those measuring the level of those covariates

When they throw hundreds of variables into a machine learning model, it can predict most closures with relatively few false positives, though no one variable stands out much (FRC is Financial Responsibility Composite):

My impression is that the easiest red flag to check for regular people who don’t want to dig into financials is “is total enrollment under 2000 and falling at a private school”.

They predict that the coming Demographic Cliff (the falling number of new 18-year-olds each year) will lead to many more closures, though nothing like the “half of all colleges” you sometimes hear:

The full paper is available ungated here. I’ll close by reiterating my advice from the last post: would-be students, staff, and faculty should do some basic research to protect themselves as they consider enrolling or accepting a job at a college. College employees would also do well to save money and keep their resumes ready; some of these closures are so sudden that employees find out they are out of a job effective immediately and no paycheck is coming next month.

College Major, Marriage, and Children

The American Community Survey began in 2000, and started asking about college majors in 2009, surveying over 3 million Americans per year. This has allowed all sorts of excellent research on how majors affect things like career prospects and income, like this chart from my PhD advisor Doug Webber:

See here for the interactive version of this image

But the ACS asks about all sorts of other outcomes, many of which have yet to be connected to college major. As far as I can tell this was true of marriage and children, though I haven’t searched exhaustively. I say “was true” because a student in my Economics Senior Capstone class at Providence College, Hannah Farrell, has now looked into it.

The overall answer is that those who finished college are much more likely to be married, and somewhat more likely to have children, than those with no college degree. But what if we regress the 39 broad major categories from the ACS (along with controls for age, sex, family income, and unemployment status) on marriage and children? Here’s what Hannah found:

Every major except “military technologies” is significantly more likely than non-college-grads to be married. The smallest effects are from pre-law, ethnic studies, and library science, which are about 7pp more likely to be married than non-grads. The largest effects are from agriculture, theology, and nuclear technology majors, each about 18pp more likely to be married.

For children the story is more mixed; library science majors have 0.18 fewer children on average than non-college-graduates, while many majors have no significant effect (communications, education, math, fine arts). Most majors have more significantly more children than non-college graduates, with the biggest effect coming from Theology and Construction (0.3 more children than non-grads).

In this categorization the ACS lumps lots of majors together, so that economics is classified as “Social Sciences”. When using the more detailed variable that separates it out, Hannah finds that economics majors are 9pp more likely than non-grads to be married, but don’t have significantly more children.

I love teaching the Capstone because I get to learn from the original empirical research the students do. In a typical class one or two students write a paper good enough that it could be published in an academic journal with a bit of polishing, and this was one of them. But its also amazing how many insights remain undiscovered even in heavily-used public datasets like the ACS. We’ve also just started to get good data on specific colleges, see this post on which schools’ graduates are the most and least likely to be married.