Just as the community adopted the term “hallucination” to describe additive errors, we must now codify its far more insidious counterpart: semantic ablation.

Semantic ablation is the algorithmic erosion of high-entropy information. Technically, it is not a “bug” but a structural byproduct of greedy decoding and RLHF (reinforcement learning from human feedback).

During “refinement,” the model gravitates toward the dense center of its token distribution, discarding the “tail” – the rare, precise, and complex tokens – in order to maximize likelihood. Developers have exacerbated this through aggressive “safety” and “helpfulness” tuning, which deliberately penalizes unconventional linguistic friction. It is a silent, unauthorized amputation of intent, where the pursuit of low-perplexity output results in the total destruction of unique signal.
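A minimal sketch of that dynamic, using a toy next-token distribution (the vocabulary and probabilities below are invented for illustration, not taken from any real model): greedy decoding always returns the mode, and low-temperature sampling plus top-p truncation squeeze the tail tokens out of contention entirely.

```python
import numpy as np

# Toy next-token distribution: a few generic continuations dominate,
# while the rare, precise tokens sit in the low-probability tail.
# Vocabulary and logits are invented for illustration only.
vocab = ["good", "nice", "effective", "lapidary", "Romanesque"]
logits = np.array([3.0, 2.5, 2.0, 0.2, 0.1])

def softmax(x, temperature=1.0):
    z = x / temperature
    z = z - z.max()          # subtract max for numerical stability
    p = np.exp(z)
    return p / p.sum()

p = softmax(logits)
print(dict(zip(vocab, p.round(3))))          # the tail is small but nonzero

# Greedy decoding: always take the argmax, so the tail never surfaces at all.
print("greedy pick:", vocab[int(np.argmax(p))])

# Low temperature sharpens the distribution toward its mode...
p_cold = softmax(logits, temperature=0.3)
# ...and top-p (nucleus) truncation then discards what little tail remains.
order = np.argsort(p_cold)[::-1]             # tokens sorted by probability
cumulative = np.cumsum(p_cold[order])
cutoff = int(np.searchsorted(cumulative, 0.9)) + 1
print("tokens that survive top-p=0.9:", [vocab[i] for i in order[:cutoff]])
```

Run it and the rare tokens never survive the cut; that is the mechanism the paragraph above is pointing at.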

When an author uses AI for “polishing” a draft, they are not seeing improvement; they are witnessing semantic ablation. The AI identifies high-entropy clusters – the precise points where unique insights and “blood” reside – and systematically replaces them with the most probable, generic token sequences. What began as a jagged, precise Romanesque structure of stone is eroded into a polished, Baroque plastic shell: it looks “clean” to the casual eye, but its structural integrity – its “ciccia,” its flesh – has been ablated in favor of a hollow, frictionless aesthetic.
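A toy way to make that concrete, with made-up per-word probabilities standing in for whatever a real model would assign in context: score each word by its surprisal (negative log probability). The rare, load-bearing word carries most of the sentence's information, and it is exactly the word that a perplexity-minimizing rewrite is rewarded for swapping out.

```python
import math

# Hypothetical per-word probabilities a language model might assign in context.
# All numbers are placeholders chosen to illustrate the argument, nothing more.
draft    = {"The": 0.20, "argument": 0.05, "is": 0.30, "lapidary": 0.0005}
polished = {"The": 0.20, "argument": 0.05, "is": 0.30, "clear":    0.08}

def surprisal_bits(p):
    """Information content of a word: rarer words carry more bits."""
    return -math.log2(p)

for label, words in (("draft", draft), ("polished", polished)):
    per_word = {w: round(surprisal_bits(p), 1) for w, p in words.items()}
    print(label, per_word, "total:", round(sum(per_word.values()), 1), "bits")

# "lapidary" alone contributes ~11 bits; its generic replacement "clear" ~3.6.
# The "polish" lowers total surprisal (perplexity) precisely by deleting
# the highest-information word: that is the ablation.
```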

  • kbal@fedia.io · 2 hours ago

    That’s a fine illustration of the problem, whatever it’s properly called.

    Having paused to search the web, I find that “ablation,” according to Wikipedia, has been a term used in AI since 1974. arXiv.org has a recent paper talking specifically about “semantic ablation,” a phrase it uses to describe an operation that deliberately removes semantic information from an LLM’s representation of a sentence, in an attempt to see what purely syntactic information is left over afterwards, or something like that.
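
    For anyone curious what an operation like that might look like mechanically, here is a toy sketch (not the paper's actual method, just the general idea of projecting a chosen component out of a sentence embedding; the vectors are random placeholders):

    ```python
    import numpy as np

    # A made-up sentence embedding and a made-up "semantic" direction;
    # neither comes from a real model, they only illustrate the operation.
    rng = np.random.default_rng(0)
    embedding = rng.normal(size=8)
    semantic_direction = rng.normal(size=8)
    semantic_direction /= np.linalg.norm(semantic_direction)

    # "Ablate" the semantic component by projecting it out,
    # keeping only what is orthogonal to that direction.
    ablated = embedding - np.dot(embedding, semantic_direction) * semantic_direction

    print("semantic component before:", round(float(np.dot(embedding, semantic_direction)), 3))
    print("semantic component after: ", round(float(np.dot(ablated, semantic_direction)), 3))  # ~0
    ```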

    • apparia@discuss.tchncs.de · 1 hour ago (edited)

      Interesting, thanks for doing the research!

      As an extreme non-expert, I would say “deliberate removal of a part of a model in order to study the structure of that model” is a somewhat different concept from “intrinsic and inexorable averaging of language by LLM tools as they currently exist”, but they may well involve similar mechanisms, and that may be what the OP is referencing; I don’t know enough of the technical side to say.

      That paper looks pretty interesting in itself; other issues aside, LLMs are really fascinating in the way they build (statistical) representations of language.