National Survey of Children’s Health Backup

The NSCH is the latest casualty of the new administration taking down major datasets from government websites. Between Archive.org and what I had downloaded for old projects, I was able to get all the 2016-2023 topical NSCH files and post them on an Open Science Foundation page.

I took this as a chance to improve the data- the government previously only made the topical Public Use Files available in SAS and Stata formats one year at a time, so I added a merged version for all available years in both Stata and Excel formats.

I hope and expect that the National Survey Children’s Health will be back up at official websites soon. But I expect that other datasets will be taken down permanently, so now is the time to download what you think you might need and add it to your data hoard– especially if you want anything from the Department of Education.

Triumph of the Data Hoarders

Several major datasets produced by the federal government went offline this week. Some, like the Behavioral Risk Factor Surveillance Survey and the American Community Survey, are now back online; probably most others will soon join them. But some datasets that the current administration considers too DEI-inflected could stay down indefinitely.

This serves as a reminder of the value of redundancy- keeping datasets on multiple sites as well as in local storage. Because you never really know when one site will go down- whether due to ideological changes, mistakes, natural disasters, or key personnel moving on.

External hard drives are an affordable option for anyone who wants to build up their own local data hoard going forward. The Open Science Foundation site allows you to upload datasets up to 50 GB to share publicly; that’s how I’ve been sharing cleaned-up versions of the BRFSS, state-levle NSDUH, National Health Expenditure Accounts, Statistics of US Business, and more. If you have a dataset that isn’t online anywhere, or one that you’ve cleaned or improved to the point it is better than the versions currently online, I encourage you to post it on OSF.

If you are currently looking for a federal dataset that got taken down, some good places to check are IPUMS, NBER, Archive.org, or my data page. PolicyMap has posted some of the federal datasets that seem particularly likely to stay down; if you know of other pages hosting federal datasets that have been taken down, please share them in the comments.

The Big Ideas

Do I really think that the things I write about here and in my papers are the most important things in the world? No. Like most academics, I tend to emphasize the issues where I think I bring a unique perspective, rather than most important issues. But if you don’t realize this, you might get the impression that I think the things I normally talk about are the most important, rather than simply the most neglected and tractable / publishable. I don’t work on the most important issues because I see no good way for me to attack them- but if you do see a way, that is where you should focus. So what are the big issues of the 2020’s?

I see two issues that stand out above the many other important events of the day:

  • Artificial Intelligence: At minimum, the most important new technology in a generation; has the potential to bring about either utopia or dystopia. Do you have ideas for how to nudge it one way or another?
  • Rise of China: From extreme poverty to the world’s manufacturing powerhouse in two generations. What lessons should other countries learn from this for their own economic policy? How can we head off a world war and/or Chinese hegemony?

Focusing a bit more on economics, I see two perennial issues where there could be new opportunities to solve vital old questions:

  • Economic Development: We still don’t have a definitive answer to Adam Smith’s founding question of economics- why are some countries rich while other countries are poor, and how can the poor countries become rich? I think economic freedom is still an underrated answer, but even if you agree, the question remains of how to advance freedom in the face of entrenched interests who benefit from the status quo.
  • Robust Prediction: How can we make economics into something resembling a real science, one where predictions that include decimal places don’t deserve to be laughed at? Can you find a way to determine how much external validity an experiment has? Or how to use machine learning to get at causality? Or at least push existing empirical research to be more replicable?

I’ve added these points to my ideas page, since all this was inspired by me talking through the ideas on the page with my students and realizing how small and narrow they all seemed. Yes, small and narrow ideas are currently easier to publish in economics, but there is more to research and life than easy publications.

Forecasting 2025

WSJ’s survey of economists reports that inflation expectations for 2025 were around 2% before the election, but are closer to 3% now. Their economists expect GDP growth slowing to 2%, unemployment ticking up slightly but staying in the low 4% range, with no recession. The basic message that 2025 will be a typical year for the US macroeconomy, but with inflation being slightly elevated, perhaps due to tariffs.

Kalshi has a lot of good markets up that give more detailed predictions for 2025:

For those who hope for DOGE to eliminate trillions in waste, or those who fear brutal austerity, the message from markets is that the huge deficits will continue, with the federal debt likely climbing to over $38 trillion by the end of the year. This is one reason markets see a 40% chance that the US credit rating gets downgraded this year.

While the US has only a 22% chance of a recession, China is currently at 48%, Britain at 80%, and Germany at 91%. The Fed probably cuts rates twice to around 4.0%.

Will wage growth keep pace with inflation? It’s a tossup. Corporate tax cuts are also a tossup. The top individual rate probably won’t fall below it’s current 37%.

If you want to make your own predictions for the year, but don’t want to risk money betting on Kalshi, there are several forecasting contests open that offer prizes with no risk:

ACX Forecasting Contest: $10,000 prize pool, 36 questions, must submit predictions by Jan 31st

Bridgewater Forecasting Contest: $25,000 prize pool, half of prizes are reserved for undergraduates. Register now to make predictions between Feb 3rd and March 31st. Doing well could get you a job interview at Bridgewater.

The Little Book of Common Sense Investing

John Bogle, the founder of Vanguard, wrote a short book in 2006 that explains his investment philosophy. I can sum it up at much less than book length: the best investment advice for almost everyone is to buy and hold a diversified, low-fee fund that tracks an index like the S&P 500.

Of course, a strategy that is simple to state may still take time to understand and effort to stick to. So the book helps to build intuition for why this strategy makes sense. I think Bogle makes his case well, though the book is getting a bit dated- the charts and examples end in 2006, and he sets up mutual funds as the big foil, when today it might be high-fee index funds or picking your own stocks.

The silver lining of any dated investing book is that we can check up on how its predictions have fared. In chapter 8, Bogle compared the performance of the 355 equity mutual funds that existed in 1970 to that of the S&P over the 1970-2006 period. He notes that 223 of the funds had gone out of business by 2006, and even most of the surviving funds underperformed the S&P. But he identifies 3 funds that outperformed the S&P significantly (over 2% per year) on a sustained basis (consistently good performance, not just high returns at the beginning when they were small): Davis New York Venture, Fidelity Contrafund, and Franklin Mutual Shares. But how have they done since the book came out?

It is a huge victory for the S&P (in blue). Franklin Mutual Shares is basically flat over the past 20 years, while Davis New York Fund actually lost money. Fidelity Contrafund returned a respectable 281% (about 7% per year), and matched the S&P as recently as 2020. But as of 2025 the S&P is the clear winner, up 411% in 20 years (over 8% per year). Score one for Bogle.

But I still have to wonder if there is a way to beat the S&P- and I think one of Bogle’s warnings is really an idea in disguise. He warns repeatedly about “performance chasing”:

But whatever returns each sector ETF may earn, the investors in those very ETFs will likely, if not certainly, earn returns that fall well behind them. There is abundant evidence that the most popular sector funds of the day are those that have recently enjoyed the most spectacular recent performance, and that such “after-the-fact” popularity is a recipe for unsuccessful investing.

The claim is that investors pile into funds that did well over the past 1-3 years, but these funds subsequently underperform. But if this is true, could you succeed by reversing the strategy, buying into the unpopular sectors that have recently underperformed? I’ve been wondering about this, though I have yet to try seriously backtesting the idea. I was surprised to see Mr. Index Fund himself support such attempts to beat the market toward the end of his book:

Building an investment portfolio can be exciting…. If you crave excitement, I would encourage you to do exactly that. Life is short. If you want to enjoy the fun, enjoy! But not with one penny more than 5 percent of your investment assets.

He goes on to say that even for the fun 5% of the portfolio he still doesn’t recommend hedge funds, commodity funds, or closet indexers. But go ahead and try buying individual stocks, or actively managed mutual funds “if they are run buy managers who own their own firms, who follow distinctive philosophies, and who invest for the long term, without benchmark hugging.”

New Website

Don’t worry, EWED is in the same place as always, but my personal website is moving.

Temple University has generously hosted my site long after my 2014 graduation. But next week they are moving to a more typical policy where alumni lose access to online university resources like web-hosting, email, and library datasets starting one year after graduation.

My new personal website is at jamesbaileyecon.com. Unless you just trying to learn more about me or my research, I think the big draws are the pages where I share cleaned-up datasets and ideas for research papers.

2024 in Books

Quick thoughts on what I read in 2024- though note that none of these were published in 2024, since almost all the best stuff is older. First some econ books I reviewed here this year:

Rockonomics– “Alan Kreuger’s 2019 book on the economics of popular music…. a well-written mix of economic theory, data, and interviews with well-known musicians, by an author who clearly loves music.”

We’ve Got You Covered– “Liran Einav and Amy Finkelstein are easily two of the best health economists of their generation.… while I don’t agree with all of their policy proposals, the book makes for an engaging, accurate, and easily readable introduction to the current US health care system.”

The Psychology of Money– “Morgan Housel’s Psychology of Money is not much like other personal finance books…. The book is not only pleasant to read, but at least for me exerts a calming effect I definitely do not normally associate with the finance genre, as if the subtext of ‘just be chill, be patient, follow the plan and everything will be alright’ is continually seeping into my brain.”

One Up on Wall Street– “Peter Lynch was one of the most successful investors of the 1970’s and 1980’s as the head of the Fidelity Magellan Fund. In 1989 he explained how he did it and why he thought retail investors could succeed with the same strategies”

Leave Me Alone and I’ll Make You Rich– “a 2020 book by Dierdre McCloskey and Art Carden…. attempts to sum up McCloskey’s trilogy of huge books on the ‘Bourgeois Virtues‘ in one short, relatively easy to read book”

Non-fiction I didn’t previously mention here:

The Simple Path to Wealth (JL Collins, 2016): the book is indeed simple, and its advice is indeed likely to leave you fairly wealthy in terms of money. One sentence summarizes it well: save a large portion of your income and invest it in VTSAX, and perhaps VBTLX. Easy to read, a bit like reading a series of blog posts, which is how much of the material originated. Good introduction to the lean-FIRE type mentality. But the book, like that mentality, is too frugal and debt-averse for my taste, and I say that as someone much more frugal and debt-averse than the average American.

The Great Reversal: How America Gave Up on Free Markets: Thomas Philippon argues that markets have been growing less competitive in America because of weakening antitrust enforcement, and that this has harmed consumers and productivity. He acknowledges that over-regulation can also harm competition, but clearly thinks antitrust is much more important; I think otherwise and didn’t find the book convincing. He sets European markets as an example for what America should aspire to, which means the book has aged poorly since its 2019 publication. It still of course has some value, and I may do a full review at some point.

The Storm Before the Storm: The Beginning of the End of the Roman Republic (Mike Duncan, 2017): Non-fiction but more exciting than most novels. A story of obvious importance to those who worry about modern republics teetering, but fresh compared to the much more famous events around Julius and Augustus Caesar and the ‘official’ fall of the Republic. Though arguably the Republic fell in the 80s BC, not the 40s- the book explains that Rome was taken over three times in this era by armies seeking political change.

Self-Help Is Like a Vaccine: Essays on Living Better: Nice collection of Brian Caplan blog posts on the subject.

Fiction:

Ivanhoe (Walter Scott, 1819): A particularly medieval telling of the Robin Hood tale, with a focus on the nobility and knights of England at that time. Chivalric romance, trial by combat, storming a castle. Highs are high but it needed an editor, could be cut by at least 1/3 without losing anything.

Kim (Rudyard Kipling, 1901): Three books in one, all excellent: a coming of age story, a spy thriller, and a portrait of the many different types of people and religions to be found in India around 1900. All wrapped together with beautiful English prose that makes heavy use of Indian loan words.

Final Thoughts:

Obviously I’m not Tyler Cowen reading a book a day, unless you count the kids books I read to my 1-year-old. But overall 2024 was a good year, better than I realized before I put this post together. Partly I credit the 1-year-old who wants to take my phone and computer but doesn’t mind when I have a book in my hands.

What I Learned from Erwin Blackstone

I’m told that Professor Erwin Blackstone died earlier this year, but I haven’t been able to find anything like an obituary online; consider this a personal memorial.

I knew Dr. Blackstone first as the professor of my Industrial Organization class at Temple University, where he taught since 1976. He was a model of how to take students seriously and treat them respectfully; he always called on us as “Mr./Ms. Last Name” and thought carefully about our questions.

Of course I learned all sorts of particular things about IO, especially US antitrust law and history- from Judge Learned Hand and baseball’s antitrust exemption to current merger guidelines and cases. I would later ask Dr. Blackstone to join my thesis committee, where he would heavily mark up my papers with comments and critiques.

He was a key part of how I was able to become a health economist despite the fact that Temple lacked a true health economist on the tenure-track economics faculty while I was there (as opposed to IO or labor economists who did some health). Blackstone’s coauthor Joseph Fuhr– a true health economist who also had Blackstone on the committee of his 1980 dissertation- came part-time to teach graduate health economics. Blackstone and Fuhr worked together to write the health economics field exam I took.

Finally, I learned from Blackstone by reading his papers. While he wrote many on health economics, my personal favorite was his work with Andrew Buck and Simon Hakim on foster care and adoption. It convincingly demonstrated the problems of having one fixed price in an area that most people don’t think about as a “price” at all- adoption fees. Having one fairly high fee for all children means the few seen as most desirable by adopting parents (typically younger, whiter, healthier) get adopted quickly, while those seen as less desirable by would-be adoptive parents linger in foster care for years. Like much of his work, it pairs a simple economic insight with a rich explanation of the relevant institutional details.

Academics hope to live on through our work- through our writing and the people we taught. Having taught many thousands of students at Cornell, Dartmouth, and Temple over 55 years, served on dozens of dissertation committees, and published over 50 papers and several books, I expect that it will be a long, long time before Erwin Blackstone is forgotten.

Source: Academic Tree. Charles Franklin Dunbar founded the Quarterly Journal of Economics in 1886.

WSJ: Nothing Important Happened in China, India, or AI This Year

I normally like the Wall Street Journal; it is the only news page I check directly on a regular basis, rather than just following links from social media. But their “Biggest News Stories of 2024” roundup makes me wonder if they are overly parochial. When I try to zoom out and think of the very biggest stories of the past five to ten years, three of the absolute top would be the rapid rise of China and India, together with the astonishing growth in artificial intelligence capabilities.

All three of those major stories continued to play out this year, along with all sorts of other things happening in the two most populous countries in the world, and all the ways existing AI capabilities are beginning to be integrated into our businesses, research, and lives. But the Wall Street Journal thinks that none of this is important enough to be mentioned in their 100+ “Biggest Stories”.

To be fair, China and AI do show up indirectly. AI is driving the 4 (!) stories on NVIDIA’s soaring stock price, and China shows up in stories about spying on the US, hacking the US, and the US potentially forcing a sale of TikTok. But there are zero stories regarding anything that happened within the borders of China, and zero that let you know that AI is good for anything besides NVIDIA’s stock price.

Plus of course, zero stories that let you know that India- now the world’s most populous country, where over one out of every six people alive resides- even exists.

AI’s take on India’s Prime Minister using AI

This isn’t just an America-centric bias on WSJ’s part, since there is lots of foreign coverage in their roundup; indeed the Middle East probably gets more than its fair share thanks to “if it bleeds, it leads”. For some reason they just missed the biggest countries. They also seem to have a blind spot for science and technology; they don’t mention a single scientific discovery, and only had two technology stories, on SpaceX catching a rocket and doing the first private spacewalk.

The SpaceX stories at least are genuinely important- the sort of thing that might show up in a history book in 50+ years, along with some of the stories on U.S. politics and the Russia-Ukraine war, but unlike most of the trivialities reported.

I welcome your pointers to better takes on what was important in 2024, or on what you consider to be the best news source today.

Predicting College Closures: Now with Machine Learning

Small, rural, private schools stand out to me as the most likely to show up on lists of closed colleges. This summer I discussed a 2020 paper by Robert Kelchen that identified additional predictors using traditional regression:

sharp declines in enrollment and total revenue, that were reasonably strong predictors of closure. Poor performances on federal accountability measures, such as the cohort default rate, financial responsibility metric, and being placed on the most stringent level of Heightened Cash Monitoring

Kelchen just released a Philly Fed working paper (joint with Dubravka Ritter and Doug Webber) that uses machine learning and new data sources to identify more predictors of college closures:

The current monitoring solution to predicting the financial distress and closure of institutions — at least at the federal level — is to provide straightforward and intuitive financial performance metrics that are correlated with closure. These federal performance metrics represent helpful but suboptimal measures for purposes of predicting closures for two reasons: data availability and predictive accuracy. We document a high degree of missing data among colleges that eventually close, show that this is a key impediment to identifying institutions at risk of closure, and also show how modern machine learning algorithms can provide a concrete solution to this problem.

The paper also provides a great overview of the state of higher ed. The sector is currently quite large:

The American postsecondary education system today consists of approximately 6,000 colleges and universities that receive federal financial aid under Title IV of the federal Higher Education Act…. American higher education directly produces approximately $700 billion in expenditures, enrolls nearly 25 million students, and has approximately 3 million employees

Falling demand from the demographic cliff is causing prices to fall, in addition to closures:

Between the early 1970s and mid-2010s, listed real tuition and fee rates more than tripled at public and private nonprofit colleges, as strong demand for higher education allowed colleges to continue increasing their prices. But since 2018, tuition increases have consistently been below the rate of inflation

Most college revenue comes from tuition or from state support of public schools; gifts and grants are highly concentrated:

Research funding is distributed across a larger group of institutions, although the vast majority of dollars flows to the 146 institutions that are designated as Research I universities in the Carnegie classifications…. Just 136 colleges or university systems in the United States had endowments of more than $1 billion in fiscal year 2023, but they account for more than 80 percent of all endowment assets in American higher education. Going further, five institutions held 25 percent of all endowment assets, and 25 institutions held half of all assets

Now lets get to closures. As I thought, size matters:

most institutions that close are somewhat smaller than average, with the median closed school enrolling a student body of about 1,389 full-time equivalent students several years prior to closure

As does being private, especially private for-profit (states won’t bail you out when you lose money):

As do trends:

variables measuring ratios of financial metrics and those measuring changes in covariates are generally more important than those measuring the level of those covariates

When they throw hundreds of variables into a machine learning model, it can predict most closures with relatively few false positives, though no one variable stands out much (FRC is Financial Responsibility Composite):

My impression is that the easiest red flag to check for regular people who don’t want to dig into financials is “is total enrollment under 2000 and falling at a private school”.

They predict that the coming Demographic Cliff (the falling number of new 18-year-olds each year) will lead to many more closures, though nothing like the “half of all colleges” you sometimes hear:

The full paper is available ungated here. I’ll close by reiterating my advice from the last post: would-be students, staff, and faculty should do some basic research to protect themselves as they consider enrolling or accepting a job at a college. College employees would also do well to save money and keep their resumes ready; some of these closures are so sudden that employees find out they are out of a job effective immediately and no paycheck is coming next month.