Economic history as it’s happening is alway relative

This is the chart that I’ve been thinking about today.

The US government has been able to borrow on the cheap for most of it’s existence, with the exception of 70s and 80s when stagflation put the clamp down. Treasury rates are soaring right now…or at least, it feels that way because for most of my adult life the United States has been viewed as arguably the safest borrower in history. What follows are in some ways the only two questions that matter for the US economy. Is the US government a reliable institution? Is economic growth going to keep pace with inflation? The answer to each question (and their subcomponents) is, of course, unknown, but the market seems to think the net of that question is going in the wrong direction.

That said, for all of the neverending parade of (sometimes unintential) nostalgia that seems to pollute the discourse, wow, 1975-1985 was not exactly macroeconomically “aspirational”.

Will AI kill the research paper?

Will AI kill the research paper?. I don’t know, probably not. But I do know that what has constituted a research paper has changed many times before and will change many times again.

Before the the 1940’s, economics research papers were largely prose. Analytic in nature, sure, but prose. Some graphs, maybe a box. A little math, but math largely for the sake of demonstrating logical relationships. Then Samuelson hit, reframed economics as thermodynamics and differential calculus. What was previously a research paper was was now a polemic, a monograph at best. Thought experiments were out, high theory was in.

This era of high theory flourished in the 70s, the math changed, and at some point computers arrived with the possibility of data sufficiently rich and numerous you couldn’t just plot all of the observations in Figure 1. That data couldn’t stand on its own, though. To be a credible publication you really needed to bundle your analysis with some theory that generated testable predictions. Pure theory papers gave way to an era of applied imperialism as economic models found themselves applied to every quantified context under the social scientific sun.

Causal identification became a thing of interest, and we got really good at telling stories again. Specifically, stories about instrumental variables. You needed a story to convince anyone, but we told so many that some folks started to notice that these stories were often pretty weak. That, in part, turned up the heat on a credibility revolution that was already in swing, which meant now you needed even better data and you needed to defend you identification strategy to the death. What was a paper before was now an embarassment you should probably consider retracting (nb: no one retracted anything, but that doesn’t mean people were suggesting it behind their backs).

Which kept rolling in data set after data set until we woke up one day and realized you either need to go out in the world and create your own actual experiment (nothing quasi- about it) or you needed to cultivate access to better…no, better…no, the very best-est, most detailed and granular administrative data ever, preferably a universe if possible. Data so perfect as to allow for contributions unassailable in their legitimacy. Do you have friends at the Danish Census? If you want tenure you should probably start flirting with someone at the Danish Census.

So a paper was a paper. Until it wasn’t a paper anymore. Until that wasn’t a paper anymore. Until that wasn’t a paper. The Recursive Dundee Theory of Research*, if you will. They all met the criteria of a contribution, until they didn’t.

So what does this mean for AI and research papers now? Well, if we look to thermodynamics in the 40s and cheap computing power in the 90’s for analogues, then I’d say it’s going to reshape the criteria for a contribution in no small part because it lowers the cost of mediocrity. Mediocre analysis will no doubt persist, but it will shift over into blog posts and journals no one ackowledges as legitimate. Do remember, please, that mediocrity is a relative concept. The quality of blog posts and publications in scam journals will likely massively improve as what can be accomplished in an afternoon’s work is radically increased. Don’t worry, I have no intention of improving beyond my current warm bath of blogging unremarkableness, but others will likely cave in to the pressure.

What about the papers in top journals, though? The papers Tyler is presumably talking about. Will AI kill those economic research papers? Probably not, but it will likely improve it significantly. Why? For the same reason that Michael Kremer says that technology and quality of life improve with the size of the human population. More people means more ideas, and there is nothing more important to economic growth than the sheer number of ideas. And no, I do not mean ideas generated by AI’s. I mean the raw number of researchers with the capacity to make major contributions is increasing dramatically because we’re all getting research assistants. We’re all getting copy editors. We’re all getting support. That’s how AI is going to change the research paper: by giving more ideas the support they need to reach the light of publication. The bar is going to get higher for the same reason that the level of sports improve as you widen the geography they pull from. There’s someone at a directional state school who didn’t get the placement they deserved out of grad school. Sure they have to teach a 3-3 load, but they’re licking their chops right now because they don’t need an army of grad assistants. Summer is here and they’ve got everything they need to make a contribution.

Or I don’t know. Maybe AI will do all of our thinking in 50 years. Forecasting technology beyond 5 years is like forecasting weather beyond 5 days: I can’t do it and neither can you.**

*Apologies to Justin Wolfers and all my Aussie friends for a bit of cultural appropriation. I promise to put some Vegemite on toast while enjoying a flat white and explaining Aussie Rules Football to a friend within 90 days.

**Except for Neal Stephenson. That guy’s the Warren Buffet of Sci Fi forecasting. Maybe he’s the one in a billion person actually experiencing one in a billion level luck, but that doesn’t make it any less impressive.

On implicit numéraire

Just a quick thought today. When we, economist or otherwise, talk about the opportunity cost of time, the most common default is an individual’s expected wage. This ends up becoming a sort of implicit numéraire, a unit of measurement and exchange that captures value of an individual’s time.

Now, to be clear, this is a gross reduction of the complexity of opportunity cost and decision-making, but such reductionism is a necessity when observing the world on a day to day basis. People are generally, I hope, aware of this reductionism, but also understand that cognitive tractability is a necessity for getting through life. That also means, however, that there is no shortage of traps. If you reduce decision-making to a single variable equation, you can get yourself in a lot of trouble picking the wrong variable.

Which brings me back to expected wage as a single variable numeraire revealing the opportunity cost of time. Sure, such a simple model is a great way for understanding why high income CEO’s outsource and delegate so many of their “life maintenance” tasks while I, for example, do not. That same logic, however, can be a trap when looking at decision making at the other end of the income distribution. Why wouldn’t someone making minimum wage leave work to pick up their sick kid from school or bail their cousin out of jail? Their forgone wages, their opportunity cost of time, is relatively low, right?

Actually, no, it isn’t. In fact their opportunity cost of time is exceptionally high, it’s just that you’re using the wrong numeraire. The opportunity cost of time isn’t the wages foregone, but rather the additional risk that they are taking on. It is quite common for individuals to lack the precautionary savings necessary to maintain solvency and housing stability during a dip in earnings or unexpected job loss. Nobody likes asking their boss if they can leave work for two hours on no notice when they can’t afford to risk losing an extra shift, let alone their job. The opportunity cost of their time is best measured in the marginal probability of household economic catastrophe rather the explicit wages gained or lost.

A lot of economic decision-making is easy to make sense of when you get your single-variable numeraire right, but that is easier said than done. A good rule of thumb: if someone else’s decision-making looks grossly irrational to you, you probably aren’t using the right variable.

Are Americans Thriving Under Trump? No, According to the Cost of Thriving Index

The Cost of Thriving Index from Oren Cass’s American Compass is an attempt to calculate how well US families are doing financially, but without using traditional inflation adjustments to income. Instead, Cass and crew have chosen 5 categories of goods and services, and tracked those over time relative to median earnings for men ages 25 and older (in the baseline model — it can also be applied to different categories of workers).

Scott Winship and I wrote a detailed critique of the COTI, which I summarized in a previous blog post. Our critique comes from several angles, including correcting several major errors in COTI, as well as arguing that standard inflation adjustments to median income are superior to this new approach.

Based on our critique, I don’t think COTI is a very good measure of how well US families are doing financially. But the COT Index still has many fans. And Cass seems to think Trump is in large part pursuing many policies that should help out US workers and families, such as Trump’s tariff policies. Thus, it will be useful to see if Trump’s policies are leading to American workers “thriving” in the first year of Trump’s presidency.

Unfortunately, even using Cass’s preferred approach, Americans don’t appear to be thriving under Trump.

Continue reading

Our glorious future is tech troubleshooting in space

Having enjoyed the quotes from our brave astronauts about software troubles, I wrote for Econlog:

Tech Troubleshooting in Space (EconLog)

Click to learn the story of email quote and why it went viral. With all due respect to Christina Koch, I think I’m the first woman in history to paraphrase The Notorious B.I.G. at Econlog.

Are we complaining? Tech has made our lives better. With only a few exceptions, everyone in the country chooses to have TVs and smartphones.

Digital tools like email save me time over what I can only imagine used to be sending paper memos or something. Did people have owls or pigeons or what? But some of that saved time goes to fighting new problems of evil people in cyberspace. Someone (Tyler?) points out that the “better angels of our nature” argument doesn’t look quite as rosy if you consider of all the digital criminality.

I do not know whom to credit for this banger: “Man is born free and everywhere he has to 2-factor authenticate.”

I had to do my annual mandatory employee Cyber Security training session this week. I don’t get paid extra to do this. It’s just work on top of my job. It’s estimated to take 40 minutes to complete. (I powered through in under 15 minutes.) We are obviously living in the future with iPads that translate foreign languages for refugee kids in real time and all, but it would feel more glorious if I could stop these phishing trainings.

If quantum/AI means the end of privacy and cheap tech connectivity, then what will that mean for productivity? To send a secure message to someone, we might need to go back to owl post. Get ready for mandatory annual owl training.

A Canticle for Aadam Jacobs

For the talk of the future of generating art, let’s not forget the task of remembering the art we’ve already made. Behold: more than 10,000 cassette recorded concerts, from as far back as 1984, recorded in community centers, church basements, taverns, all-ages clubs, and hundreds of other unsung “venue” owners who let then (and often always) unknown bands play shows for a a couple dozen attendees, all in the hopes that door money and beverages might keep the owner out of the red on a random weeknight while.

I have a couple bootlegs from concerts I attended, but it never occurred to me that I might get to listen to a 1995 Blonde Redhead show at The Empty Bottle or The Blow Pops playing 1991 show at a Milwaukee spot I’ve never heard of. These shows have always had an ephemeral quality to them, existing far more in the stories of those who claimed to be there that night than the actual direct artistic footprint.

But maybe not. Maybe the internet can and does, in fact, remember. Because while there is a lot to be absorbed from the finished product, but there is often so much more learn from the imperfect and unpolished early stages. A band before they slowed down or ventured beyond their first 3 chords, a writer still stuck in the first person, a disseratation chapter still haunted by the writing of the insecure graduate student we all were. The awkard phases when an artist (or artists) are still finding their voice. Perhaps, more than ever, we need to remember the importance of not skipping over the embarassing, exhausting, and, yes, often futile work at the beginning and middle. There are more shortcuts than ever to making a thing, but no shortcut to becoming the version of yourself that can make the thing that only you can make.

Education is a core US export

While there is no shortage of examples of willful ignorance and outright lying in politics, the idea that blocking foreign students from attending US universities is anything other than disastrous to US students is positively enraging. The real curiousity here is whether the value of a US degree has yet dipped below the full tuition price tags that foreign students almost always pay. Beyond the billions in tuition received and tuition subsidies indirectly consumed, I couldn’t even begin to put a price on the cultural power accrued from being the global center for higher education for the last century. This administration’s capacity to find new and innovative ways to tear down US institutions is unrivaled and beyond even the grandest dreams of our most optimistic enemies.

The actual AI problem in academic economics

There is a steady flow of takes on the impact of AI on academic economics research, whether its the example of someone writing an ostensibly legitimate, if somewhat trite, research paper with only a few hours effort to the implication that there is already no need to continue writing papers as the AIs are already better at at. Oh, what shall all the candlemakers do now that the sun has risen?

I think the idea that AI has already rendered the research paper an obselete endeavor is very wrong, almost to the point of negligence. It both vastly underestimates the quality of the median contribution provided in the 80 to 100 or so best journals and vastly overestimates the reliablity of current AI attempts at research on the margin. Putting such concerns aside for the moment, it’s still worth pondering how we can extrapolate from current AI as a tool for status quo research to forecast if it might reshape labor as an input 5 or 10 years from now. That’s far enough away that it borders on futurism and, more importantly, the kind of forecasting that I shy away from. Feel free to tell me in the comments where we are headed.

At this moment, however, we already are in the middle of a far more subtle disruption in academic research that I haven’t seen anyone write about yet. The quiet, but pronounced, uptake of AI tools in the writing of referee reports for academic journals. If you’ve submitted papers for review in the last 18 months, dollars to donuts you’ve received a referee report that has been lengthy, well-organized, with an unusual number of bullet points and headers discussing your paper, summarizing it’s contributions, and offering suggestions that on their face seem reasonable but upon a moment’s reflection are quickly realized to be entirely vapid by someone familiar with the structure of the data and relevant literature.

There is something uniquely frustrating about working on a research project for 3 to 5 years only to have judgement passed down on the basis, at least in part, of a review written by ChatGPT that is not just wrong but, well, kind of stupid. I’ve already personally had to deal with having a paper refereed via ChatGPT, rejected, and then, thanks to it being internalized by ChatGPT into their text base, it being reconfigured into a citaton hallucination cited by other papers that, to maxmize comedy, replaced 3 (including me) of the 4 authors with other (nicer? better looking?) economists. What’s most frustrating, however, is that this is hitting economics journals that do not seem to have any plan in place to deal with it. Not to suggest this is an easy problem to solve (not remotely), but it certainly should not be coming as a surprise to anyone. Let’s look at the facts:

  1. Academic economists are almost universally overcommitted.
  2. Journal referees are, for the most part, unpaid for their time.
  3. As the number of quality articles produced and submitted to journals has increased, so has the strain on the entire editorial process, including review writing.
  4. The only thing holding it together at all has been reputational incentives (i.e. nobody wants a bad reputation with the editors that are going to consider your future work) and a disciplinary sense of “civic duty”. Reputation is, of course, the load bearing mechanism here.
  5. A technology was introduced that, at the very least, pantomimes the review process well enough that it can produce a low quality fascimile of a review that, with a few sentences tossed in at the beginning and a short separate letter written directly the editor by the reviewer, can allow a task that used to take a 0.5 to 1.5 work days can now be crossed off your to-do list in less than an hour.

Is it really that hard to see what’s coming? Of course academic economists are going to be tempted to ask ChatGPT to write a review for them. There are almost no direct rewards for writing good reviews, while the costs are significant. Evaluating a genuinely new and distinct piece of research that has never been done before is hard work and takes significant time.

Now, how this is playing out across the body of journals is an open question. Here’s my best educated guess:

At the top journals, reputational concerns are the strongest, but so is the opportunity cost of everyone’s time and the competition for limited article space. Referees might not have the courage to outsource the actual decision to ChatGPT, but they’ll be awfully tempted to offload as much of the grunt work as they can. If I were an editor at a top 10-15 journal, I would expect a growing number of reports from referees who read the paper quickly (<15 minutes), then made a decision to recommend acceptance or rejection based on 1) if they knew any of the authors, 2) whether the content is a complement or a substitute for their own research, 3) whether they had seen the paper presented in person and was well-received, 4) the general bundle of status associated with the authors and the subject, and 5) whether they liked the paper (you can, in fact, have a strong opinion on paper you’ve looked at for 15 minutes. We’re all guitly of it). Having arrived at their positive or negative assessment, they then outsource the actual first draft of the the review to ChatGPT, with the instruction to write a positive or negative review. Now, given the strong reputational considerations that any credible reviewer at a top journal should have, I expect there to then be significant rewriting of the review, including that addition of the reviewer’s preferried economist gripes about identification, whether the results generalize, etc, giving an otherwise generic report some more bespoke vibes. This isn’t the real recommendation anyway, that’s the letter to the editor that goes unseen by the research authors. I don’t think most referees will have the brass to outsource that.

That’s probably not great, especially for young authors trying to break into a field. But honestly, none of those problems are new. If anything it takes a very old problem (i.e. overcommitted economist at top school asks his or her student to write a referee report rejecting an article for them) and just tweaks it slightly (i.e. overcommitted economist at top school asks ChatGPT to write a referee report rejecting an article for them, freeing up a PhD student to get back to work cleaning and analyzing their data for them). Not optimal, but hey, what is?

The real problem, I am sad to say, is the next tier down. The field journals. The second-tier general journals. The oddball and heterodox journals. The journals that used to struggle to get enough good submissions and now struggle to find anyone to referee for them. What used to be a trickle is now a deluge of higher quality research. That deluge, however, comes from authors who also constitute a referee pool that is far busier than they were before and without the same resources that come with appointment as top institutions.

I promise you, from experience, keeping a significant research agenda going during my salad days when I was teaching a 3-3 load was not easy. What happens when the 71st ranked journal that you might submit an article to one day sends a seemingly acceptable, if mediocre and slightly banal article to review? Are you really going to give it a precious work day? Or are you going to give it a once over, ask chatGPT to review it, and then give a recommendation based on a 5 minute skim? I want believe that I would never associated my professional reputation with a half-assed review, but that’s easier to say on this side of the R1 tenure fence.

Now’s the part where I smugly tell you the obvious solution and call it a night. As is often the case, however, I don’t have one. Not one that anyone is going to like, at least. Because, the only solution I have is precisely the suggestion that got Jerry Maguire fired. We could simply publish and write fewer papers. If we write fewer papers, we can review fewer papers. If we review fewer papers we can pay people to review them. If we can pay people to review them, we can hold them to higher quality standards. Editors can review the reviews. Every now and then someone suggests we get rid of anonymous reviewers, but I worry that anonymity is load bearing when it comes to the quality standards that are in many ways the hallmark of modern economics. I don’t think we can give up on quality. Quality is our comparative advantage. So maybe its time we let go of quantity. If your dean says you’ve written some good and important article, but there aren’t enough lines on your vitae, then what they’re really saying is that they don’t want research faculty, they want AI middlemen.

Don’t be an AI middleman.

The theory of the firm remains unfinished

Why do firms exist? Transaction costs. Specialization. Returns to scale. Risk pooling. Reputation. Institutional capital. Is that everything? Probably not.

It wasn’t that long ago we were talking about the prevalence of zero marginal productivity employees within firms. Perhaps we should add low (zero?) marginal productivity employers to our list of considerations.

Graduate students rejoice, there remains more work to be done.