Shared functional specialization in transformer-based language models and the human brain

Abstract Humans use complex linguistic structures to transmit ideas to one another. The brain is thought to deploy specialized computations to process these structures. Recently, a new class of artificial neural networks based on the Transformer architecture has revolutionized the field of language modeling, attracting attention from neuroscientists seeking to understand the neurobiology of languagein silico. Transformers integrate information across words via multiple layers of structured circuit computations, forming increasingly contextualized representations of linguistic content. Prior work has focused on the internal representations (the “embeddings”) generated by these circuits. In this paper, we instead analyze the circuit computations directly: we deconstruct these computations into functionally-specialized “transformations” to provide a complementary window onto linguistic computations in the human brain. Using functional MRI data acquired while participants listened to naturalistic spoken stories, we first verify that the transformations account for considerable variance in brain activity across the cortical language network. We then demonstrate that the emergent syntactic computations performed by individual, functionally-specialized “attention heads” differentially predict brain activity in specific cortical regions. These heads fall along gradients corresponding to different layers, contextual distances, and syntactic dependencies in a low-dimensional cortical space. Our findings indicate that large language models and the cortical language network may converge on similar trends of functional specialization for processing natural language..

Medienart:

Preprint

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

bioRxiv.org - (2023) vom: 25. Juli Zur Gesamtaufnahme - year:2023

Sprache:

Englisch

Beteiligte Personen:

Kumar, Sreejan [VerfasserIn]
Sumers, Theodore R. [VerfasserIn]
Yamakoshi, Takateru [VerfasserIn]
Goldstein, Ariel [VerfasserIn]
Hasson, Uri [VerfasserIn]
Norman, Kenneth A. [VerfasserIn]
Griffiths, Thomas L. [VerfasserIn]
Hawkins, Robert D. [VerfasserIn]
Nastase, Samuel A. [VerfasserIn]

Links:

Volltext [kostenfrei]

Themen:

570
Biology

doi:

10.1101/2022.06.08.495348

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

XBI037508490