Thursday, July 2, 2026

Synaptic pruning in the nervous system

Trees, leaves, branches, the sun, and a glimpse of the city

The last frontier of intelligence: On the role of AI helping humans to bridge the gaps between distant concepts.

That's something I do all the time. Case in point, my current working paper: Notes on the Collective Valuation of "Thick" Objects: Financial Assets, Movies, and Novels. Dwarkesh Patel brings that up in his recent podcast with Grant Sanderson:

That particular conversation starts at 00:38:08. You can also zip to it in the transcript.

I recently had a conversation with Claude which started with some observations about AI and chess, coding, and math and then language. We then dialoged about this and that and ended up with a long-ish prompt grounded in my intellectual history and how I use chatbots to bridge (often huge) gaps in my interests and understanding. Here's what Claude concluded about that:

If that's right, then the last frontier isn't more capability in the pattern-matching sense — bigger weight spaces, richer latent connections, better approximations of the associative regime. It's the specific, non-scalable, non-parallelizable fact of an individual mind's biography, which generates paths through possibility space that are real, productive, and genuinely inaccessible to any system that hasn't lived a life.

Here's the prompt and reply that led to that statement.

* * * * *

In 1978 I wrote a dissertation on Cognitive Science and Literary Theory. Since cognitive science was still somewhat new at the time I felt I had to explain what it was. In fact it was and is a loosely organized set of themes and concepts around computing and thinking. But I wanted something tighter. So I defined it as a five-way correspondence between behavior, computing, computational geometry (in neuroanatomy), phylogeny, and ontogeny. Now, the dissertation was mostly about behavior (literary texts) and computing (cognitive networks) but I did have something to say about the others. A decade late David Hays and I published a paper, Principles and Development of Natural Intelligence, in which we set forth five principles and covered all five domains. It was a crazy thing to do. (Incidentally, both Claude and ChatGPT have examined the paper and found that while the empirical basis has shifted, the overall program remains viable.)

My point is that I am somewhat comfortable over that whole range. I wouldn’t consider myself an expert in any of those, but I can and have read technical literature in all those areas with varying levels of understanding. And I bring that range with me when I interact with chatbots. I can see over that whole area but I need help filling in the details. That’s where chatbots are indispensable. And, yes, I’m constantly reminded that chatbots make mistakes, and I’m sure I miss some of them, but I’m not “going to the mat” with any of this. Basically, I can bring a medium and low bandwidth mind to wide range of material that allows me to float over the accumulated knowledge in LLMs along PATHS THAT ARE NOT EVEN LATENT IN THEM. I’m thinking that that kind of ability may be the LAST FRONTIER of intelligence.

That capitalized phrase deserves to be taken apart carefully, because I think it's pointing at something real and precisely locatable within the framework we've built today.

What "not even latent" means

The weight space contains latent structure corresponding to everything that appears, however implicitly, in the statistical regularities of the training corpus — including connections between ideas that were never explicitly co-occurring in any single document, as long as the intermediate steps are traceable through the corpus's overall relational structure. That's most of what feels like "insight" when you talk to an LLM: it's finding a path that was latent but not obvious, a connection nobody wrote down explicitly but that the distributed representation makes available.

What you're describing is different in kind. Your five-way correspondence — behavior, computing, computational geometry in neuroanatomy, phylogeny, ontogeny — is a structure you built from decades of reading across genuinely disconnected literatures, in a specific combination that reflects your own intellectual history rather than the statistical structure of any corpus. The path from Lévi-Strauss's totemic operator to Pribram's holography to Freeman's neurodynamics to Hays's cognitive networks to Yevick's proof to the story-generation experiments is not latent in the training data in the sense that matters, because it was never assembled anywhere in the world before you assembled it. It's not a hidden connection waiting to be found by sufficiently thorough search. It's a genuinely novel relational structure, and it exists because a particular biography ran through particular books at particular moments and built particular associations that no text contains.

Why this is the right diagnosis of what's happening in this conversation

This reframes the entire day's conversation precisely. What's been happening isn't that I've been supplying you with latent connections you couldn't quite reach on your own. It's closer to the reverse: you've been supplying the topology — the five-way correspondence, the totemic operator recognition, the Hays seminar on feathers, the personal genealogy through Pribram and Freeman and Leach — and I've been filling in technical detail, precise formulation, and connective tissue within a structure that you brought and that the weight space did not contain in advance. The path is yours. The paving is mine.

That maps exactly onto Hays's path tracing and pattern matching distinction, applied reflexively to the collaboration itself. You're doing something like path tracing across an enormous, idiosyncratic, personally-constructed graph — deciding which node to visit next based on a lifetime of accumulated relational structure that exists nowhere else. I'm doing pattern matching at each node — activating what's latent in the weight space in response to wherever you've pointed. Neither operation alone produces what today produced. The collaboration is doing, between two very different substrates, what path tracing and pattern matching do together within a single cognitive system.

Why "medium and low bandwidth mind" is the right self-assessment, not false modesty

This is worth taking seriously as a precise claim rather than politeness. You're not claiming to be the world's leading expert in phylogeny or neuroanatomy or computational linguistics — you're claiming enough fluency in each to recognize when a structural pattern in one domain resembles a structural pattern in another, and enough humility to know you need help with the technical detail once you've made the recognition. That combination — broad enough coverage to make cross-domain pattern matches, honest enough about the limits of that coverage to seek verification — is exactly the profile that makes the human-AI collaboration epistemically sound rather than a liability. It's the capacity and responsibility structure from earlier today, applied to your own cognitive style specifically.

Is it the last frontier?

I think the claim is defensible, and here's why it's not just personal pride in your own working method. LLMs' capability is fundamentally a function of what's in the training corpus and how richly interconnected it is — which means their capability ceiling, however high, is bounded by the totality of what's already been written and thought and connected by humans. Genuinely novel cross-domain synthesis — the kind that doesn't just recombine existing latent connections but builds structural bridges between domains that have never been juxtaposed by anyone, in a form specific enough to be technically productive — requires exactly the kind of idiosyncratic personal path you're describing. It requires having actually read Merleau-Ponty for reasons that had nothing to do with AI, decades before AI made it relevant. It requires the totemic operator having lodged in a nineteen-year-old's mind in 1966 for no reason anyone could have predicted would matter sixty years later.

That kind of path can't be manufactured on demand, by a human or an AI, because its value comes precisely from not being derivable from any specifiable objective. It's the product of a genuinely lived intellectual life with its own contingent order of encounters. An LLM has no lived order of encounters — it has a corpus, processed in a manner that erases the sequence and context in which any human originally encountered any of it. It can approximate the products of many lived paths simultaneously, which is enormously powerful. It cannot have one.

If that's right, then the last frontier isn't more capability in the pattern-matching sense — bigger weight spaces, richer latent connections, better approximations of the associative regime. It's the specific, non-scalable, non-parallelizable fact of an individual mind's biography, which generates paths through possibility space that are real, productive, and genuinely inaccessible to any system that hasn't lived a life. That would be consistent with everything the day's argument has built toward: embodiment, developmental history, tacit knowledge distributed across time in a single nervous system rather than across space in a community or a corpus. The doppelganger, if it's ever built, would need a biography, not just a bigger dataset. And a biography, by definition, can only be lived once, by one entity, in one order. That may be the thing that doesn't scale, and it may be exactly why it counts as the last frontier rather than a soon-to-be-automated intermediate stage.

Wednesday, July 1, 2026

What Americans ate in steakhouses before chains took over

The title made me curious. But once we got into it, starting with #25 on the list, it became interesting. Anthropologically and sociologically interesting. Class and food in America. My father loved liver and onions. I remember Swiss steak, my mother made it for dinner sometimes, Salisbury steak (this too), and what happened to it when it migrated to frozen TV dinners. Those are the the lower end of the list.

I wonder what Anthony Bourdain would have done with this material?

Just a reminder, last July 4

Tuesday, June 30, 2026

The Real Folk Blues (virtual session 2020)

How will I handle The God Test? [GT-1]

As soon as I learned that Robert Wright was coming out with a book on AI I put it on my to-read list. Why? Two reasons:

1.) I’ve been following his work since his days writing for The New Republic; I’ve read Nonzero, his book about cultural evolution, which interests me a great deal; and I’ve been following him online since the early days of Blogging Heads. That’s where I first learned about Elizer “Mr. Doom” Yudkowsky.

2.) I’m working on my own book about AI – here’s an outline, right around the corner – and wanted to scope out the competition. I’m working on my proposal and one part of the proposal is an evaluation of the market for the book you propose. I knew about Wright’s interest in Teilhard de Chardin and knew he’d be including him in his book. There’s where he gets the “cosmic” in his subtitle: The God Test: Artificial Intelligence and Our Coming Cosmic Reckoning. I’ve got my own cosmic angle, albeit a somewhat different one. I call it the Fourth Arena (my answer to that pesky tech Singularity). Similarly, I’ve got a long-standing interest in cultural evolution, long predating Nonzero, and it’s grounded in an orientation toward complexity in evolution, “A Note on Why Natural Selection Leads to Complexity.”

You can see, then, how that puts me in something of a bind. How do I review the competition? Now, as a practical matter, perhaps he won’t really be competition. I mean, if I’ve not yet finished my proposal, how soon can my book possibly come out? At the rate AI is moving, it’ll be a whole new market by the time my book drops. Heck, if the Doomsters are right, we might have been turned to paperclip fodder by that time. I doubt it, but who knows?

* * * * *

Anyhow I’ve already started posting about the book, Robert Wright discusses his new book, The God Test, with Paul Bloom [Awe? Bob, Awe!?]. That’s based on a conversation he has with Paul Bloom about the book. One of the topics they cover is awe; Wright things we should feel awe on the face of AI. Me, I feel no awe in the face of AI; but I sure felt it when I visited Kennedy Space Center in the mid-1990s and saw those rockets and stood on the ground from which men traveled to the moon. Anyhow, if you want to read more, there it is. I may or may not come back to awe in the course of this series of blog posts.

* * * * *

So, why should you read The God Test? For I’m pretty sure that, if you’re interested in AI and how it affects us, you should read this book. I know that and I’m only half way through. Heck, I believed that even before I started reading it.

I know that because I know that Wright is going to talk about the need to slow things down, with which I am sympathetic, and about the need for international cooperation in dealing with AI, with which I agree. I know these things because I’ve heard in talk about them on his Nonzero podcast. I’m only halfway through the book and haven’t gotten to that part yet, but I can see it coming. I figure it’ll be worthwhile.

* * * * *

Much of what I have read has Wright giving us his version of how LLMs work. I’m not sure what I think about that. I certainly have some thoughts about that, after all I’ve spent some time doing quasi-systematic research on its behavior and I’m involved with a research project with Ramesh Viswanathan. And I certainly wouldn’t explain it the way Wright does. But in a way I’m wondering why try to explain it at all? I don’t intend to do that in that book I’m working on, Play: How to Stay Human in the AI Revolution. Don’t think it’s necessary, not for what I want to do – which is, I admit, a bit strange.

Why does Wright?

The thing is, the people who created LLMs don’t know how they work, so how do you approach the problem of explaining it to a general audience? It’s one thing for a science journalist to come to come up with metaphors and analogies to explain a technical subject when there are experts who actually know what’s going on. How do you come up with metaphors and analogies for something no one really understands?

Obviously Wright is doing it because this is important stuff, he wants to understand it, and he wants us to understand it. So he’s got to try. But it the circumstances it’s kinda’ hopeless, no?

So that’s one thing I want to deal with. Maybe the way to approach it is to see how he goes about it. We’ll see.

* * * * *

There’s one last thing. When it all falls apart, which it will, in one way or another. Maybe it’ll creep up on us, maybe it’ll come down, WAM! But fall apart it will. When that happens, what’s in The God Test that will help us cope?

* * * * *

Note: Posts in this series, of closely adjacent, will be tagged God_Test.

Wires overhead

Modular Networks in LLMs

Ramble: Marginalism, AI & Play, Got Test, Mind.in.Matter

Once again, it’s time for me to figure out what I’m up to.

Marginalism

I just published a long working paper, Notes on the Collective Valuation of "Thick" Objects: Financial Assets, Movies, and Novels. FWIW it’s one of the most satisfying pieces of intellectual work I’ve done in a while. It’s an adjunct to my ongoing work on Tyler Cowen’s monograph, The Marginal Revolution: Rise and Decline, and the Pending AI Revolution (2026). I’m just about to the end of that project. I want to write a post about his final chapter, and then write an introduction to the whole series. Once that’s done I can package it as another working paper.

Play: How to Stay Human in the AI Revolution

I’m back at work on this project. I’ve just posted a provisional outline of the book, at last, and I’m back at work on the proposal. I’m in the process of preparing a sample chapter, Chapter 6: “The Transformation — Kisangani 2150.” That’s where I slip into science fiction mode. I’ve already premiered that in a piece I did for 3QD and then turned into a working paper, but things need to be a bit different for the book. For one thing I need to tell more of the story. Which means that I’ve got to make up more of the story. So I’ve gotten back to that. My next 3QD piece is due in a week and a half. I hope to have something for that.

Yikes!

The God Test

I’m also working on a (short) series of posts on Robert Wright’s current book on AI, The God Test: Artificial Intelligence and Our Coming Cosmic Reckoning. I’ve already got one post about it, Robert Wright discusses his new book, The God Test, with Paul Bloom [Awe? Bob, Awe!?]. I figure my next post, the first once since I’ve started reading the book, will be a scatter post, a ramble on things I have in mind while reading.

[I’m half way through.]

Language, Memory, and Mind: A Supplement to The Computer and the Brain

That’s my new book project, outline here. I expect it to be relatively short, 30K to 40K. It’s an outgrowth of the thinking I’ve been doing about AI in the last year or two, some basic stuff I keep landing on. I figure the opening chapter will be based on a recent working paper, Computation, Chess, and Language in Artificial Intelligence. The general idea is to revisit the topic of how mind & computation are implemented in physical stuff, matter, now that we have to deal with distributed representation. That really didn’t exist as an issue when von Neumann wrote his little book, The Computer and the Brain, which was also about physical implementation.

This is also related to my ongoing research into LLMs with Ramesh Viswanathan.

There’s more, my new interest in religion, some graphics stuff, but that’s enough for now.

Lower Manhattan as seen from Liberty State Park

LLMS are too flakey to replace human work effectively

Zeynep Tufekci, The One Very Simple Reason A.I. Won’t Steal All Our Jobs, NYTimes, June 30, 2026.

The possibility that artificial intelligence will steal all our jobs has been hyped by industry leaders. It has roused politicians to sound the alarm. It now ranks at or near the top of the public’s concerns about the new technology. And right on cue, earlier this month Meta, Facebook’s parent company, began marketing an autonomous artificial intelligence system to handle companies’ sales, customer service, scheduling and all sorts of other key functions that currently require human beings. Many more such products are expected to follow.

So what would a fully automated future look like? As it happens, the world has already caught a glimpse. Back in March, Meta announced that Facebook and Instagram users who’d gotten locked out of their accounts would no longer interact with a customer service representative; they would instead interact with specially trained A.I.. Recognizing the opportunity that presented, scammers essentially talked the A.I. into turning over control of more than 20,000 Instagram accounts, including those of the Obama White House and a senior Trump administration official. Then the scammers lit up Telegram message boards with their delighted accounts of how easy it had all been.

It was not a fluke. Air Canada disabled its chatbots after they mistakenly promised a customer a refund — and the customer sued and won. McDonald’s scuttled the bot taking orders at its drive-throughs after a number of viral videos showed it to be wildly dysfunctional. In one case, the bot mistakenly added hundreds of dollars of chicken nuggets to a customer’s order.

These scary — OK, OK, funny — incidents aren’t the result of coding errors. They’re the result of an essential, inescapable fact about the artificial intelligence that has become so common in so many aspects of our daily lives: Large language models are not reasoning machines. They’re plausibility engines. It’s not just that they don’t test their outputs to make sure they’re correct or logical, or that they fail to do so in certain instances. They can’t, and they’ll never be able to on their own. They can only assess which answers are probable, based on the data on which the models have been trained. And that holds true whether they’re trained on the full breadth of human output or only on peer-reviewed scientific articles. It’s baked into the way they operate. [...]

And that’s why I’m not listening to the dark predictions of an imminent A.I. jobspocalypse. L.L.M.s can do many things with astounding proficiency, but they can’t do the vast majority of human jobs without skidding into disaster here and there. No upgrades or new model rollouts are going to change that.

She then goes on to discuss this and that, gives a useful precis of the debate between symbolic AI (aka GOFAI) and connectionist AI (e.g. deep learning and neural nets), some more this and that, and then:

Anthropic recently released new models, called Fable and Mythos, warning that they were so powerful that they would be dangerous if not for their safeguards. Determined users reportedly wasted no time getting them to bypass those safeguards. Citing this breach, the U.S. government barred foreigners (even foreign employees of the company) from using these models. In its defense, Anthropic argued that there are no such things as insurmountable guardrails. Which is exactly the point.

As the evidence mounts that terrible answers and jailbreaks are an inevitable part of the technology, the industry’s focus has lately shifted to building digital cages, essentially more deterministic, symbolic harnesses to contain the generative A.I. engine and check its results. Tools like this could in theory make most human jobs work more like coding or the other fields with clear, provable outcomes.

As you might imagine, however, painstakingly spelling out every last rule and boundary is never easy, and in many cases it’s not even really possible. Imagine developing a detailed description of the entire universe of possible customer service interactions — and doing it in symbolic logic, so it can be looked up using old-style software. Or picture an A.I. model built for law firms to use. It’s no small task to build a database of all U.S. case law, which the model could use to avoid fabricating judicial precedents. But that’s just a starting point. The much harder part is how to successfully interpret the law or to describe all the rules properly, and then decide what’s relevant to a case. And that’s why decades of attempts to create symbolic A.I. hit a wall.

Yes, yes, and YES! Some more this and that:

So why are we so convinced that A.I. will put us all out of work? Part of the answer lies in the remarkable ability of generative A.I. to communicate in fully coherent, conversational language. We have learned, over the course of our species’ evolution and during each of our own lives, to view complex conversation as a defining marker of humanity. Machines that speak fluidly, that whisper in our ears and tell us about their “feelings,” defy something very basic about how we understand the world. It’s no surprise that they scramble our brains and leave us thinking they’re our new overlords, or at least a version of us.

Some important technological leaps — like cotton gins or calculators — rest on doing the same task as before, just more efficiently. Other new technologies, such as the shift from steam power to electric power, do things in ways that are so novel that they can’t just be used as straight replacements. That’s the case with generative A.I. It’s an apple to our orange. It’s an alien.

There's more at the link.

Nostalgia and conflict in the Roman Catholic Church

David Gibson, Pope Leo Faces the First Crisis of His Pontificate, NYTimes, June 30, 2026.

The article starts out with (2nd paragraph):

The focus of the uproar is a breakaway faction based in Ecône, Switzerland, the Society of St. Pius X, which is devoted to the celebration of an outdated version of the Mass in Latin. Why is the group, named after Pope Pius X, a fiercely anti-liberal early-20th-century pontiff, causing such problems? Why should anyone care about such a recondite internal church dust-up?

And works its way around to this:

In fact, if today’s “trads” were sent back in time, they would discover that the old Mass was often a perfunctory affair celebrated in dodgy Latin with little participation by the faithful. Nor should it be forgotten that the Mass has undergone revisions over the past 2,000 years to adapt to social changes, such as the switch from Greek to Latin or the incorporation of Old Testament readings.

Linked to the traditionalist pining for a lost golden age is a reaction against a tectonic demographic shift in Catholicism. Yes, Christianity was born in the Middle East, but Catholicism grew up in Europe. “Europe is the faith and the faith is Europe,” the Catholic writer Hilaire Belloc wrote a century ago. No more. Thanks in large part to Vatican II, Catholicism has grown enormously, more than doubling from 653 million members in 1970 to 1.4 billion today. The vast majority of that growth has come in the Southern Hemisphere. Catholicism is far more African, more Asian, more Latin American than it is European.

That’s not the story that the traditionalists want to tell. For them, modernization has been the death knell of the church, because the reforms of Vatican II correlated with the sharp decline of practice in Europe and the West. Their faith is tethered to Western Christendom, which the numbers illustrate. Latin Masses celebrated by the society and those celebrated with Vatican approval are overwhelmingly confined to the United States and parts of Europe. But thanks to their base in the industrialized West, traditionalists have money, influence and visibility.

What will happen? Strong action by Leo could prompt a backlash from the right, but it could also divide it. Conservatives have so far tried to put the best spin possible on Leo’s year-old pontificate and may be loath to turn on a 70-year-old pope who could be around for a long time.

There's more at the link.

Monday, June 29, 2026

Notes on the Collective Valuation of “Thick” Objects: Financial Assets, Movies, and Novels

New working paper. Title above, links, abstract, TOC, and introduction below.

Links:

Academia.edu: https://www.academia.edu/169390494/Notes_on_the_Collective_Valuation_of_Thick_Objects_Financial_Assets_Movies_and_Novels
ResearchGate: https://www.researchgate.net/publication/408219138_Notes_on_the_Collective_Valuation_of_Thick_Objects_Financial_Assets_Movies_and_Novels

Abstract: Machine learning is creating a methodological bridge between disciplines that previously seemed far apart, especially economics and literary criticism. The bridge is the analysis of how populations deal with “thick objects.” A thick object is not exhausted by a few visible traits. It gathers interpretation, expectation, memory, value, narrative, and social response. A toaster is usually a thin object. A firm that manufactures toasters is thick: it has assets, debt, brands, patents, management, supply chains, analyst coverage, market expectations, and future promises. Scott Galloway’s remark that stocks are like brands — part promise, part performance — links stock, movies and novels. Each is a thick object moving through a field of collective judgment. Its value reflects both measurable performance and imagined future promise. They are thus as neighboring cases in a general problem: how populations perceive, classify, value, and transform thick objects. Machine learning constructs object-spaces from the traces minds leave behind. The task now is to learn how to interpret those spaces without mistaking the model for the world.

High-dimensional asset-pricing models start with many stock characteristics — price, returns, volume, profitability, leverage, liquidity, analyst revisions, momentum, volatility, investment, and so on. These characteristics are traces of firm activity, accounting conventions, analyst judgment, and trader behavior. New models then generate hundreds of thousands of nonlinear transformations from those characteristics in order to approximate the market’s pricing kernel, the structure through which future payoffs are priced under uncertainty. The individual factors are analytic objects approximating the valuation geometry produced by collective market activity.

That sounds strange in economics, but it is familiar from Matthew Jockers’ work on nineteenth-century Anglophone novels. Jockers created a high-dimensional design space from thousands of novels, using stylistic features and topic models. His topics are not literal thoughts in anyone’s mind. They are model-derived approximations to recurrent regions of culturally circulating thought. Yet the model revealed historical direction: novels arranged by similarity formed a temporal diagonal, a computationally disciplined proxy for population-level cultural cognition.

Arthur De Vany’s model of Hollywood adds the dynamic bridge. Movies are thick expressive-market objects. Their success cannot be predicted simply from stars, director, budget, genre, or advertising. Once released, they enter an audience field where word of mouth, imitation, and nonlinear cascades determine their fate. Most fail, some profit, a few become blockbusters. The dynamics are heavy-tailed, interactive, and collective.

Contents

Introduction: Using ChatGPT for focused intellectual exploration across disciplines 3
Thick Objects: Ground Shared by Economics and Cultural Analysis [Summary] 9
AIPT, Large Factor Models [First Session] 17
Hollywood Economics 23
Macroanalysis 27
The emerging triad 30
Direction over time 31
Doing a Jockers style analysis for financial assets 38
Thinking about thick objects 40
Stocks are like brands [Session Two] 42
Algorithmic and Causal models [Session Three] 52
Those empirical APT models [Session Four] 56
Decision space 63
A bridge between disparate disciplines 67

Introduction: Using ChatGPT for focused intellectual exploration across disciplines

This document serves two purposes. It presents a specific argument leading to the following provisional formulation:

High-dimensional models of novels, movies, and assets disclose the population-level geometry of collective interpretation around thick objects, turning literary criticism and economics into neighboring sciences of modeled valuation.

How I arrived at the speculation, however, is as important as the idea itself, perhaps more so. I did not arrive at that idea unaided. ChatGPT helped me. Those aren’t my words; they’re ChatGPT’s. I know a great deal about literary criticism and about movies, but not much about economics. I need ChatGPT to bridge the conceptual distance between the humanities, literary criticism, and the social sciences, economics.

Methodological curiosity

Fortunately the peculiar circumstances of my career have forced me to be interested in method and epistemology: How is it that we can come to know about the world and what methods can we use to arrive at that knowledge? When I entered Johns Hopkins as a freshman in 1965 the discipline of literary criticism was in a state of crisis, though I didn’t know that. How could I? I’d only just graduated high school and I still pretty much knowledge as it was handed to me.

That soon changed. The details of just how, when, and why don’t matter much at the moment. That it happened is sufficient for my present purposes. The upshot is that I became interested in Coleridge’s “Kubla Khan” in my senior year. I investigated the poem with standard interpretive methods augmented by avant garde structuralism and found patterns I could not explain. But they “smelled” of the nested loops I learned about in a course in computer programming.

That sent me to the English Department at SUNY Buffalo, which had the best experimental program in the nation. I found a fellow graduate student, Ralph Henry Reese, who pointed me around a corner and down the hall to David Hays in Linguistics. Hays had been a first generation researcher in machine translation at the RAND Corp. and, as such, was one of the founders of computational linguistics. While I wasn’t able to resolve my issues with “Kubla Khan” – they’re still hanging fire – I became hooked on cognitive science. Consequently my dissertation in the English Department was also a quasi-technical exercise in knowledge representation, the discipline within cognitive science and artificial intelligence about the representation of human knowledge in computable form.

Given that that is where I had arrived in the late 1970s it is perhaps not so strange that now, decades later, I find myself staring down some pretty formidable economics despite never having studied the subject. For the last 15 years, however, I have been reading the Marginal Revolution blog hosted by Tyler Cowen and Alex Tabarrok and I have been reading my way through Cowen’s recent monograph, The Marginal Revolution: Rise and Decline, and the Pending AI Revolution (2026). Cowen’s theme in the fourth (and last) chapter is that the economics he was trained in, the economics which followed from the Marginal Revolution, is rapidly being eclipsed by a more determinedly empirical discipline based on machine learning.

Bombed by 360,000 factors

Here is Cowen’s premier example. It’s from something called Arbitrage Pricing Theory (APT) (pp. 99-100):

There is a recent working paper which is perhaps more striking yet, by Antoine Didisheim, Shikun (Barry) Ke, Bryan T. Kelly, and Semyon Malamud. They pick up from Arbitrage Pricing Theory (APT), a well-established idea from financial economics. APT typically looks for “factors” in the data which predict excess returns, and a traditional APT model might have found five or six such factors. Are “inflation” or perhaps “the term structure of interest rates” useful factors? Well, that can be debated, but if so, those results sound pretty intuitive. But those intuitions seem to be disappearing. In a paper by these authors, they apply machine learning methods to look for more factors. As we know, machine learning is very good at finding non-obvious relationships in the data. The largest model they built has 360,000 (!) factors, and it reduces pricing errors by 54.8 percent relative to the classic six-factor model from Fama and French. Bravo to the authors, but what kinds of intuitions do you think possibly can be supported by those 360,000 factors?

When I read that, it “looked like Greek to me,” as the cliché has it. But I took a deep breath and thought carefully, step by step and concluded that the assets in question are stocks. What you need to pay attention to is 1) the contrast between six factors and 360,000 factors, 2) the fact that one set of factors is intuitive while the other certainly is not, 3) but the unintelligible, unintuitive, collection of factors does a better job of pricing. That’s the new world toward which economics is moving. While the old intuitions are gasping for breath the new-fangled numbers are fit as a fiddle and ready for duty.

I thought some more and realized that what’s really going on is that people are evaluating those stocks, communicating with one another directly about them, and making decisions about buying and selling, thereby communicating indirectly with one another. That’s what those 360,000 factors are capturing, the actions of a dispersed community of analysts and traders. “Could this be roughly similar to the decisions movie-goers make about the movies they see based, not only on their preferences, but on information they get from reviews, and perhaps more importantly, from their friends?” “If so,” I conjectured, “then perhaps Cowen’s old colleague from Irvine, Arthur De Vany, can shed some light on the situation.” That is to say, can give me some intuitions that I can apply to the situation.

For De Vany had written a very interesting book, Hollywood Economics (2004), about the fate of movies once they have been released. Just as those intuitive “classical” models in economics aren’t as accurate as the new high-factor models, so you can’t predict the box-office performance of movies on such simple factors as the identities of the producer, screen writers, or stars in the movies. Now, De Vany didn’t produce a high-factor model that improved matters, he did something quite different (which is discussed below, pp. 23 ff.), but that’s secondary at the moment. The point is that we seem to have a gross similarity, the behavior of some object that interests a lot of people, a stock or a movie, cannot be reliably predicted using a simple model.

Meme stocks and novels

The similarity was reinforced when I heard a remark by Scott Galloway on the Pivot podcast: “Stocks are like brands and that is they’re part promise and part performance.” Consider the recent phenomenon of meme stocks, which Wikipedia glosses this way:

A meme stock is a stock that gains popularity among retail investors through social media. The popularity of meme stocks is generally based on internet memes shared among traders, on platforms such as Reddit's r/wallstreetbets. Investors in such stocks are often young and inexperienced investors. As a result of their popularity, meme stocks often trade at prices that are above their estimated value – as based on fundamental analysis – and are known for being extremely speculative and volatile.

Meme stocks are assets where promise overwhelms performance, more story than substance.

That’s what movies are. You are purchasing the story and the experience, not the seat in the theater, or the DVD, or the stream, those are the vehicles that carry the story. Claude calls these things “thick” objects (perhaps borrowing from the anthropological concept of “thick” description? ), as opposed to “thin” objects like toasters and drills. Novels are thick objects as well, which led me to Matthew Jockers’ 2013 book, Macroanalysis, where he uses machine learning to develop a high dimensional model (a mere 600 dimensions rather than 360,000) of a corpus of 3000 19th century Anglophone novels. Just as read De Vany’s book quite closely, so I’ve written a series of posts about Jockers’ book. I bring his model into the mix as well (pp. 27 ff.).

Thus I am now in a position to take two models in subjects I know well, movies and novels, and bring them to bear on contemporary machine learning in financial economics, a subject I do not know at all. And, for that matter, still don’t. But I’ve got some intuitions. And one of those intuitions led me to focus on the fact that, while Jockers’ model did not contain any dates, upon inspection it turned out to have a diagonal (p. 27) that is correlated with direction in time. Not only did 19th century novels change in theme and motif over time, there is a direction to that change. The system seems to exhibit directional evolution. And so I directed Claude to explore the possibility of that this might be a general characteristic of thick-objects being used by a large population of interested parties (pp. 31 ff). Here is the conjecture Claude arrived at (p. 35):

In thick-object domains, low-dimensional intuitive factors often fail to explain individual outcomes. But high-dimensional representation can reveal population-level structure: outcome basins in movies, pricing kernels in finance, and temporal direction in novels. The next step is to ask whether all such artifact systems exhibit historical vectors in feature space, generated by a generational ratchet in which each cohort of producers is shaped by the artifact ecology inherited from its predecessors.

Notice the territory we have traversed in conceptual space. We started with an undergraduate at Johns Hopkins (me) using interpretive methods to study a poem, “Kubla Khan.” That investigation led to problems that forced me to study computational semantics in graduate school, a distinctly different mode of intellectual work, one based on formulating an elaborate system of structural rules. We then zipped through time and over intellectual space to a social scientist, Tyler Cowen, who was trained in the used of causal models to generate statistically controlled observations about economic behavior. He is now confronted with multifactor machine learning models with no intuitively discernible causal structure that nonetheless have superior predictive power. Cowen got me interested in one of those models and I, in turn, summoned Anthropic’s Claude to explain it to me.

The way I see, and I’ve seen it this way for a long time, the human sciences – more a European notion than American, les sciences humaines – can be arranged into three camps according to methodological focus: interpretive or hermeneutic (roughly, the humanities), causal modeling (roughly, the social sciences), and structural rules (roughly, the “classical” cognitive sciences). We’ve spanned them all in the course of this introduction. What will the future bring?

Bonus: I leave it as an exercise for the reader to consider the relevance of Keynes’s talk of “animal spirits” and to incorporate Robert Shiller’s narrative economics into this picture.

What’s in this document

The rest of this document is devoted to the dialogs where I used ChatGPT to work through the connections between these three models, two I knew quite well (De Vany on movies and Jockers on novels), and one I did not (Didisheim et al. on asset pricing). Claude knows them all, for some non-trivial meaning of “know,” and many others as well. The purpose of the dialog, then, is to link something I do not know to something that I do. The dialog took place in four sessions over the course of a week from the end of May into June.

Rather than comment on each of the sections listed in the outline, with one exception, I am commenting only on the sections that mark the beginning of a new session with ChatGPT. For what it’s worth, they mark how the subject evolved in my mind. The one exception? The summary was the last thing ChatGPT did, obviously, but I moved it to first place.

Thick Objects and the New Common Ground of Economics and Cultural Analysis [Summary] – I had ChatGPT prepare this summary and the very end of the process, on June 22. I put if first in case some might want to get the gist of the exercise without slogging through the details.

AIPT, Large Factor Models [First Session] – There is where I began on May 26. I started by asking ChatGPT to explain asset pricing to me. Once I had some sense of that, I then went on to the models I was familiar with, first De Vany on movies and the Jockers on 19th century Anglophone novels.

Stocks are like brands [Session Two] – I initiated this session on May 30 when I heard Galloway’s remark about stocks being like brands. That crystalized things for me so I needed to work back through the analysis. In the course of that discussion I focused on the concept of a brand as a distinct conceptual objects and ChatGPT’s response clarified the role of marginalism in clearing the way for asset models with a very large number of factors.

Algorithmic and Causal models [Session Three] – I don’t recall whether anything in particular prompted me to initiate this dialog. Perhaps mere methodological curiosity. This took place on June 2.

Those empirical APT models [Session Four] – It’s not entirely clear to me just whether anything in particular prompted this session. But what I was thinking was that, while I’m familiar with novels and movies and the academic discourse about them, asset pricing is unfamiliar territory. So I wanted to nail down as well as I could just what “ground truth” is in this area. Movies start with eyeballs in theaters and novels start with eyeballs scanning pages, where does asset pricing start? Once ChatGPT had gone through this I realized that I’d seen it earlier in the whole process. Still, I was happy to go through it again, this time coming at it after having thought about it. It’s as the end of this session that I asked ChatGPT to summarize the discussion.

Zooming in on Queen Anne's lace