2 Comments

"Recursive self-improvement is picking up steam." This is not recursive self-improvement. This is instead trying to make an LLM self-consistent (when applied to generate feedback for itself) or to make two LLMs mutually consistent (when applied to generate feedback for each other). The effect is similar to message passing in probabilistic reasoning: we are trying to get the various parts of the network to agree with each other about the generated outputs. This will not lead to a "takeoff".

Expand full comment
author

good point. This was a sloppy use of terminology to refer to a less direct positive feedback loop. Thanks for pointing this out!

Expand full comment