Talk of âsubagentsâ is getting pretty popular in certain circles. Given your delightful use of âstiltsââwhich I take to be a pretty elegant metaphor for the way that an oft-used frame of âfriends as supportsâ really works from the insideâI think it might be valuable for us to untangle some of the other metaphors that this intersects with. Iâm hoping this will make future discussion a little easier.
As the title of this letter suggests, there are four terms I want to gesture towards: subagents, supports, stilts, and shims. The first two are âin the airâ. The third is yours. The last is mine. Letâs take them in turn.
First, thereâs subagents. I take the subagents view to be (basically) rooted in some variation on the following general structure of insights:
- We tend to think of individual humans as unified/coherent âagentsâ. This is an imperfect model (or âleaky abstractionâ if you prefer).
- In reality, most individual humans are (to at least some extent) fragmented/incoherent in the sense that they exhibit (and usually report having) internally conflicting desires, motivations, and needs. As a result, most individual humans take not-totally-coherent actions. Itâs possible to âlet yourself get the better of your better selfâ.
- Given (2), a better model is that a given individual human is âcomposed of [sub]agentsâ, where each of those subagents has its own (more coherent) wants and needs.
- The action of an individual human arises out of (some kind of) process of negotiation between that individual humanâs subagents.
There are various approaches which make radically different claims on top of this basic structure. If you donât mind a bit of woo with your crumpets, there are theraputic frameworks such as âInternal Family Systemsâ (IFS) which suggest techniques by which âyouâ can access, name, and stage productive conversations between your subagents. If instead youâre more into âtrying desperately to make actual progress towards being able to guarantee robustly safe & aligned AI before itâs too lateâ, then you can use this subagent model as a way into more general work on the possibility of ensuring that any truly powerful systems we build are both âinnerâ and âouterâ aligned in various important senses of those terms. And then, of course, as always, thereâs a universe of stuff that builds on that trauma book, and points out the extent to which symptoms of incoherence & conflict between these âsubagentsââincluding, perhaps, âakrasiaââoverlap with certain kinds of trauma symptoms.
What are the consequences of letting go of the brain as a machine, and favouring the brain as a parliament? By introducing politics into our internal decision making models?
I take discussions of subagents, and the negotiated resolution of disagreements between subagents, to be at least in part discussions of that.
Second, thereâs talk of (social) supports. In my mind, this kind of discourse shades all the way from vanilla âself-careâ to social justice âsolidarityâ to anarchist infrastructures of âcommunityâ. Whereas the universe of âsubagentsâ discourse focuses on a rethinking of individual agents, my sense is that the âsocial supportâ discourse is instead rooted a desire to reorient outwards, towards a line of claims thatâs something like:
- We think of ourselves as atomic, self-sufficient individuals.
- In reality, weâre all intensely embedded in networks of relations to other individuals.
- Given (2), we should consciously build and maintain strong, positive relations to other individuals so that we âsupportâ and âbe supportedâ by them when bad things happen.
- Most people donât do (3) enough.
This is all fine and good. I bounce off most self-care-is-solidarity-and-community talk, as you know. But this is all fine. (Honestly, I dream of one day stumbling onto a peer-reviewed paper that suggests âcommunityâ is in fact unimportant to self-reported wellbeing, or that strong/positive social ties have absolutely no impact on any measured physiological stress markers. At least then the little Welsh troll at the base of my skull that whispers phrases like âfile draw effectâ would shut up.)
Third, thereâs your addition: stilts. I see this as something like âsupports, but with awareness of subagentsâ. I take the thing youâre saying (indirectly) to be something like:
- If you introspect, you notice that neither âindividual identityâ nor âinterpersonal relationsâ are particularly well-defined concepts.
- In practice, any serious attempt to be a coherent agent in the world entails some amount of âbeing able to rely on interpersonal relationships to hold the-you-that-you-are upâ, and some amount of ârecognising that those interpersonal relationships also alter, direct, and transform the-you-that-you-are (and the direction âyouâ go in)â.
- There are inherent tensions in (2). In particular: the people at the other end of the interpersonal relationships are themselves constantly shifting.
- âBalancingâ in light of (3) is less of an identifiable âstate of affairsâ and more of a âconstant processâ.
Iâm reminded of the famous line from Clifford Geertz, in âThick Descriptionâ:
The concept of culture I espouse, and whose utility the essays below attempt to demonstrate, is essentially a semiotic one. Believing, with Max Weber, that man is an animal suspended in webs of significance he himself has spun, I take culture to be those webs, and the analysis of it to be therefore not an experimental science in search of law but an interpretive one in search of meaning. It is explication I am after, construing social expressions on their surface enigmatical. But this pronouncement, a doctrine in a clause, demands itself some explication. (Geertz 5)
In the stilts account, you come to understand (and tame?) the âsolipsistic selfâ which (inevitably?) intrudes by tracing the webs of significance in which you find yourself suspended. In a coarse-grained, first-pass, quick-and-dirty sort of way, self is world. In a more careful analysis, âselfâ and âworldâ are phenomenologically suspect terms. Insofar as thereâs a habit to be developed, here, or a reflex to be overcome, itâs something like âlearning to stop unconsiously replacing the phrase âto help understand the world around meâ with the phrase âto help understand myselfââ. Here, relations are just too entangled with selves ââsupportsâ too entangled with âsubagentsââfor us to have any hope of explicating one without simultaneous explication/construal of the other. No sitting in a darkened cave, alone, and hoping for deep understanding that will generalise. Be pragmatic.
This brings us, finally, to a new metaphor I want to introduce: shims.
In general, a âshimâ is a small, thin, tapered or wedged piece of material. A thing you use to close gaps, modify spaces, make things level, or provide a smoother interface. A shim is that little piece of metal that you slide underneath the foot of your lathe when you install it so that you can be sure that itâs perfectly level. A shim is the bit of wood you use to align a gap between two large chunks of timber when youâre building the frame of a cottage wall. In a pinch, when youâre picking a padlock, a shim is the thin piece of a Red Bull can that you cut with your Leatherman and then slide into the space between the shackle and the lock body to bypass the catch mechanism. And in computing, âshimsâ are libraries that intercept calls to one API and transparently reformulates the call (or redirects it entirely) so that it can be handled by a different API. Writing a âshimâ is the thing you do when you need older code to be compatible with newer code; as Axel Rauschmayer puts it, âa shim is a library that brings a new API to an older environment, using only the means of that environmentâ.Itâs solid material as lubricant, as glue, as improvised spacer.
In this metaphor, I claim, humans are often trying to create (or discover, or repair, or modify, or share) shims of various sorts. At the level of individual humans interacting only with themselves, âshimsâ are the habits we form (and masts we decide to bind ourselves to) so that we can fill some of the gaps between our subagents and reduce the damage they would otherwise do to each other when they rattle about. And at the level of the âinterpersonalâ, shims are the things we use to make our relationships more manageable and robustly positive-sum (or our stilts more easy to balance on). They can be rules, or habits, or psychological technologies, or physical technologies. The key is that a metaphorical shim, here, is
- small,
- often transparent once installed, and
- either (a) used to fill an otherwise-damaging gap between two systems, or (b) used to create a useful gap where otherwise a damaging, friction-filled interface would exist.
When I decide to answer the phone if and only if I reflexively smile when I see the name of the contact thatâs calling me, Iâm (unilaterally) acting to increase the energy I put into the vibrantly positive-sum relationships in my life. That blanket decision rule is a shim.
When Zvi describes having lost 100 pounds using Timeless Decision Theory, I claim thatâin the moment of actually internalising the insight that âsticking to the rules Iâd decided to follow meant I would stick to rules Iâd decided to followââZvi was (in effect) wedging a generalised shim into the spaces between some of the-subagents-that-exist-within-the-human-agent-we-designate-as-Zvi.
And when you and I commit to writing these letters to one another, in public, as a continuation of conversations that began in private over a decade ago, this blog is a shim.