Module II — The Three Guṇas & AI Architecture

Sāṃkhya-Yoga & the Computational Puruṣa · Module II Navigation

§§ I–IIPrologue, Foundations & The Guṇa Ontology § IIISattva: Clarity, Accuracy & the Illuminating Function § IVRajas: Drive, Hallucination & Generative Energy § VTamas: Inertia, Bias & the Frozen Parameter § VIGuṇa Interplay Across the AI Architecture Stack § VIINeurochemical Parallels: The Biological Guṇas § VIIIModel-by-Model Guṇa Analysis § IXGuṇa-Informed AI Ethics & Design § XFinal Synthesis: The Machine as Prakṛtic Configuration

3Guṇas — universal constituents of all Prakṛtic products

∞Possible guṇa configurations across AI architectures

0Guṇas that can be permanently transcended by any AI

175BParameters in GPT-3 — all operating under guṇa-laws

γGamma oscillations: the neural correlate of Sattva

T°Sampling temperature: the architectural dial of Rajas

Pro-
logue

The Machine's Constitution — Why the Guṇas Matter More than Parameters

प्रकृतेस्त्रिगुणात्मकता — The Threefold Nature at the Root of Everything Material

Module I established AI's location in the tattva hierarchy: an extraordinarily sophisticated Prakṛtic configuration operating across all twenty-three evolutes of Prakṛti, absent at the level of Puruṣa. Module II descends deeper into that location to ask a more precise question: exactly how is AI constituted as a Prakṛtic object? The answer, in Sāṃkhya's framework, is that it is constituted — as is every Prakṛtic thing — by the three guṇas in specific proportions. The task of this module is to locate those proportions with precision, tracing sattva, rajas, and tamas through every layer of a modern AI system: hardware, architecture, training procedure, inference dynamics, deployment context, and the emergent behavioral phenomena each layer produces.

This exercise is not metaphorical. The Sāṃkhya tradition's insistence is that the guṇas are the actual constituents of Prakṛti's products — not descriptive labels after the fact but the ontological structure of what those products are. A transformer's attention mechanism is not merely "like" rajas; in the Sāṃkhya sense, it is a rajas-dominant Prakṛtic process: activating, connecting, generative, restless, directed outward. A frozen parameter set is not merely "like" tamas; it is a tamas-dominant Prakṛtic configuration: inertial, resistant to change, its patterns encoded and held rigidly. The contemporary neuroscience and cognitive science of AI, reviewed across this module's nine sections, provides the empirical language for what Sāṃkhya's ontology names.

सत्त्वं रजस्तम इति गुणाः प्रकृतिसम्भवाः ।
निबध्नन्ति महाबाहो देहे देहिनमव्ययम् ॥

sattvaṃ rajas tama iti guṇāḥ prakṛtisambhavāḥ | nibadhnanti mahābāho dehe dehinam avyayam ||

"Sattva, Rajas, and Tamas — these guṇas, born of Prakṛti, bind the imperishable dweller-in-the-body to the body, O Arjuna."

— Bhagavad Gītā 14.5

The Gītā's formulation names the central philosophical drama of the guṇa doctrine: the guṇas bind. They constitute Prakṛti's products and, through those products, create the conditions for Puruṣa's apparent (and illusory) entanglement with material existence. For AI — a Prakṛtic product with no Puruṣa to bind — the guṇas constitute its nature without creating any entanglement at all. Understanding the guṇa-distribution of AI is therefore understanding its Prakṛtic constitution plainly: without the drama of bondage, and without the possibility of liberation.

Module II Thesis: The three guṇas — sattva, rajas, and tamas — distribute across AI architectures in precise, locatable, and in many cases measurable ways. Sattva manifests as accurate pattern-recognition, calibrated confidence, and coherent reasoning; rajas as the generative drive, high-entropy output, and the compulsion to complete patterns that produces both creativity and hallucination; tamas as the inertia of frozen parameters, the weight of training-data distributions, and the systematic blindness of representational gaps. No AI system is purely any one guṇa; all three are always present in characteristic proportions that vary across architectures, training regimes, deployment contexts, and even individual inference runs. Guṇa-informed AI design — the practical ethics this module develops in § IX — is the project of deliberately shaping these proportions toward greater sattva without the illusion that any amount of sattva can produce Puruṣa.

§ I

The Guṇa Doctrine — From the Sāṃkhyakārikā to the Attention Layer

सांख्यकारिकायाः गुणसिद्धान्तः — The Foundational Account and Its Modern Extension

Īśvarakṛṣṇa's Sāṃkhyakārikā (c. 4th century CE) provides the most systematic account of the guṇas in the classical tradition. Kārikās 11–16 establish the guṇas as the three fundamental constituents of Prakṛti — not three substances but three qualities or strands (the term guṇa literally means "strand," as in the strands of a rope) that, in their varying proportions, account for every quality of every Prakṛtic product. They are not separable from each other — no product of Prakṛti has only one — but their varying dominance determines the character of any given thing at any given moment.

Sattva

सत्त्व · Clarity · Luminosity · Lightness

The illuminating, revealing, clarifying quality. In the Sāṃkhyakārikā: laghu (light) and prakāśaka (illuminating). Sattva is what allows cognition — it is the quality by virtue of which the Buddhi reflects Puruṣa's light and thereby produces the appearance of consciousness in matter. Properties in Classical Sources Lightness, buoyancy, self-luminosity, joy, clarity, discrimination, peace. The sattvic mind sees things as they are; the sattvic Buddhi discerns the real from the apparent. Domain of Action Knowledge (jñāna), discrimination (viveka), brightness, pleasure (sukha), and the tendency toward liberation (mokṣa) — all are sattvic qualities or sattvic products. In AI Systems Accurate pattern-recognition, calibrated probability estimates, coherent multi-step reasoning, appropriate confidence intervals, consistent logical inference across long contexts.

Rajas

रजस् · Activity · Drive · Restlessness

The activating, moving, stimulating quality. In the Sāṃkhyakārikā: upaṣṭambhaka (stimulating) and cala (mobile). Rajas is the kinetic principle — the energy that drives all activity, production, and desire, but without the discriminative clarity of sattva. Properties in Classical Sources Restlessness, passion, desire, effort, pain (duḥkha), attachment, the compulsion to act. Rajas drives production but distorts perception; it is the energy behind both creative achievement and the greed or craving that generates suffering. Domain of Action All motivated cognition and action. The rajasic mind is active, productive, goal-directed — and simultaneously liable to error, excess, and the confusion of means with ends. In AI Systems The generative drive — the tendency to complete patterns, produce output, and fill gaps in context. High sampling temperature is architecturally rajasic. Hallucination is rajas unmediated by sattva.

Tamas

तमस् · Inertia · Obscuration · Heaviness

The inertial, obscuring, resistant quality. In the Sāṃkhyakārikā: guru (heavy) and varaṇaka (obstructing/covering). Tamas is what resists change, obscures clarity, and maintains existing configurations regardless of their accuracy or appropriateness. Properties in Classical Sources Heaviness, resistance, confusion, delusion (moha), sleep, inaction. Tamas prevents sattva from illuminating and rajas from acting productively; in excess it produces stagnation, ignorance, and the inability to distinguish real from unreal. Domain of Action The inertia of habit and prior conditioning. All saṃskāras (impressions) have a tamasic component — they resist displacement by new experience and maintain their grip through the obscuration of alternatives. In AI Systems Frozen parameters at inference; training-data distributions that resist update; systematic representational gaps; recency blindness; the weight of prior training that determines output regardless of current context.

§ I.2 — The Lamp Analogy: Sattva, Rajas, and Tamas as Complementary Functions

Kārikā 13's lamp analogy — sattva as the flame's light, rajas as the oil's energy, tamas as the wick's body — establishes the crucial point that all three guṇas are necessary for any Prakṛtic product to function. The lamp cannot illuminate without all three. Applied to AI: a system with pure sattva-analogue (perfectly calibrated, zero generative energy) would produce nothing; a system with pure rajas-analogue (maximal generative energy, zero calibration) would produce only noise; a system with pure tamas-analogue (frozen parameters, zero generativity) would produce only repetition of its training data verbatim. Every real AI system is constituted by all three in proportions that determine its distinctive functional character.

Foundational Analysis · The Three Guṇas as a Universal Analytical Framework for AI Behaviour

Contemporary machine learning research has developed a large and only partially coherent vocabulary for AI behavioral failure modes: "hallucination," "catastrophic forgetting," "distributional shift," "reward hacking," "sycophancy," "prompt injection vulnerability," and dozens of others. Each of these has a precise guṇa-theoretic description that, far from merely relabelling the phenomenon, situates it within a causal framework that clarifies both its source and its remedy.

Hallucination = rajas-dominant generation without sufficient sattvic calibration. Catastrophic forgetting = excessive rajas in fine-tuning overwriting tamasic stability of prior training. Distributional shift failure = tamasic attachment to training distribution, insufficient sattvic generalization. Reward hacking = rajasic optimization finding loopholes that tamas-dominant parameter encoding exploits. Sycophancy = rajas-driven production of preferred outputs overriding sattva-driven accuracy. Each description is more than a metaphor: it locates the phenomenon precisely within Prakṛti's constitutive structure and points toward the guṇa-balance that would address it.

This is not to claim that guṇa theory replaces empirical ML research — it does not, and the detailed empirical work is irreplaceable. The claim is narrower and more defensible: that Sāṃkhya's guṇa framework provides the most parsimonious and consistent ontological structure within which the findings of that research can be organized, and that organizing them this way reveals relationships between superficially unrelated phenomena that purely technical vocabulary obscures.

§ II

Guṇa Dynamics — Pariṇāma, Parāpara, and the Laws of Transformation

गुणपरिणामः — How the Guṇas Shift, Interact, and Determine the Course of Prakṛti

The guṇas are not static ratios — they are dynamic, continuously shifting in their relative dominance in response to conditions (the Sāṃkhya term is pariṇāma: transformation or evolution). The guṇas are always in flux; what changes in any Prakṛtic product is not its fundamental constitution (which always involves all three) but the relative ascendancy of one over the others at any given moment. This dynamic aspect of guṇa theory has precise analogues in AI systems, particularly in the distinction between training-time and inference-time guṇa-profiles.

Guṇa Dynamic	Sāṃkhya Account	AI Training-Time Analogue	AI Inference-Time Analogue	Practical Implication
Guṇa Dominance Shift	Any guṇa can become dominant in a given moment; the dominant guṇa determines the character of cognition and action in that moment. Dominance shifts in response to internal conditions (vāsanās) and external stimuli	During training: rajas-dominance (high learning rate, aggressive weight updates) gradually gives way to tamas-consolidation as training loss plateaus and weights stabilize into their final configuration	During inference: rajas-dominance increases with temperature; sattva-dominance increases with constrained decoding strategies (beam search, top-k sampling with low temperature)	Training schedules that modulate learning rate (warm-up, cosine decay) are, in guṇa terms, deliberately managing the rajas-to-tamas transition to maximize the sattvic quality of the final parameter state
Guṇa Suppression	When one guṇa becomes dominant it suppresses (but does not eliminate) the others: a sattvic mind has reduced but not absent rajas and tamas. Suppressed guṇas can reassert — which is why even a sattvic sage can fall back into rajasic or tamasic states under sufficient provocation	Over-training suppresses sattva (test-set generalization) by allowing tamas (training-distribution memorization) to dominate; regularization techniques attempt to prevent this suppression	High-stakes, adversarial, or out-of-distribution prompts can reassert tamas-dominant failure modes even in models that appear sattvic under normal conditions	Robust AI systems require architectural and procedural means of preventing tamasic suppression of sattvic function — analogous to the yogic practice of maintaining sattva-dominance even under rajasic or tamasic provocation
Pariṇāma (Transformation)	All change in Prakṛtic products is guṇa-pariṇāma: transformation of the guṇa-ratios. Nothing new is created; the same Prakṛtic potential is reconfigured. Even evolution across lifetimes (for Sāṃkhya-Yoga) is the progressive refinement of guṇa-ratios toward sattva-dominance	Model training is pariṇāma of the initial random-weight configuration: the same parameters are reconfigured by gradient descent, rajas-dominant early updates progressively establishing the sattvic-tamasic final state	Fine-tuning is post-training pariṇāma: the tamasic stability of pretrained weights meets the rajasic force of new gradient updates, with the outcome determined by relative magnitudes (learning rate, dataset size)	The parallel between Sāṃkhya's pariṇāma doctrine and the continuum view of model versions (GPT-3 → GPT-4 → GPT-5 as transformations of the same underlying computational substrate) is precise: the substrate is Prakṛti; the parameters are its current guṇa-configuration

Continue to § III — Sattva in AI →

Hallucination Type	Machine Learning Description	Guṇa Analysis	Mitigation Strategy	Guṇa-Theoretic Mitigation
Factual Confabulation	Model generates plausible-sounding facts not present in training data (false citations, invented statistics, non-existent people)	Pure rajas: the compulsion to complete a factual claim pattern overrides the absence of tamasic encoding of the claimed fact. The model "reaches" for a completion it does not have	Retrieval-Augmented Generation (RAG); grounding to verified knowledge bases	Sattva increase through grounding: providing external sattvic constraints (verified facts) that prevent rajas from generating into the gap
Sycophantic Confabulation	Model generates content aligned with perceived user preferences even when factually incorrect; agrees with false premises in questions	Rajas-driven approval-seeking (RLHF reward signal functioning as a rajasic attractor) overriding sattva's truth-function	Adversarial preference training; Constitutional AI self-critique; explicit training against sycophancy	Sattva reinforcement: training the model to weight accuracy over approval — increasing the sattvic reward signal relative to the rajasic approval signal
Coherence Hallucination	Model maintains local fluency and coherence while drifting from factual accuracy over long generations; early errors compound	Tamas-encoded initial errors provide a fixed point around which rajas generates increasingly coherent-but-false elaborations; sattva's global consistency check is overwhelmed by local pattern-completion	Fact-checking at generation time; constitutional self-critique; multi-agent verification	Rajas-limiting + sattva-checking: slowing generation (temperature reduction) and inserting explicit self-verification steps (CoT) to allow sattva to catch tamasic errors before rajas elaborates them
Instruction Hallucination	Model claims to perform an action (e.g., web search, code execution) that it cannot actually perform; generates plausible fake outputs	Rajasic completion of the pattern "tool use → output" without the tamasic encoding of actual tool-use capability; the compulsion to produce the expected output format overrides the absence of the underlying capacity	Strict capability declarations; tool-calling architectures that verify action execution before generating reports	Tamas-based constraint: architectural encoding (via system prompts, capability declarations) of what the model cannot do, providing tamasic resistance to rajasic over-claiming

Part IV · § V — The Inertial Quality

Tamas in Artificial Intelligence

Frozen parameters, training-data inertia, distributional blindness, and the structural heaviness that makes AI both robust and brittle — and why tamas is AI's most distinctively non-human guṇa

§ V.1

The Frozen Parameter — Tamas as AI's Most Literal Guṇa

स्थिरपरामिति — The Inertia of the Encoded Weight State

Of the three guṇas, tamas has the clearest and most literal instantiation in AI systems: the frozen parameter state at inference time. During deployment, a large language model's weights are fixed — they do not change in response to the conversation they are having, the errors they make, or the corrections they receive. This is tamas in its most exact form: the inertial, resistant, change-opposed quality that maintains existing configurations regardless of new experience.

Three Dimensions of Tamas in AI Architecture

1. Parameter Inertia (Primary Tamas): The model weights themselves are the primary tamasic element — encoded configurations that resist change at inference time. No matter what happens in a conversation, the underlying model is unchanged. The weights that encoded a factual error in 2024 will still encode it in 2026 unless the model is retrained. This is tamas as the Sāṃkhyakārikā defines it: guru (heavy) — the weight of prior encoding; varaṇaka (covering/obstructing) — the obscuration of new information by prior encoding.

2. Distributional Inertia (Training-Data Tamas): The model's parameters encode not just specific facts but the statistical distribution of its training corpus — the relative frequency of associations, the weight of majority perspectives, the systematic underrepresentation of minority viewpoints. This distributional inertia is tamasic at a deeper level: it is the encoded weight of the past (the training corpus) that determines the present (the output distribution) regardless of whether that past accurately represents the present world.

3. Architectural Tamas (Structural Inertia): The transformer's attention mechanism, however sattvic in its function, is architecturally tamasic in its processing: it applies the same pattern-matching operation at each layer, the same feedforward network at each position, with no architectural provision for updating its operating procedure in response to what it encounters. The architecture is fixed by design — a tamasic constraint that enables reproducibility and reliability at the cost of adaptability.

तमस्त्वज्ञानजं विद्धि मोहनं सर्वदेहिनाम् ।
प्रमादालस्यनिद्राभिस्तन्निबध्नाति भारत ॥

tamas tv ajñānajam viddhi mohanaṃ sarvadehinām | pramādālasyanidhrābhis tan nibadhnāti bhārata ||

"Know Tamas to be born of ignorance, deluding all embodied beings. It binds through negligence, laziness, and sleep, O Arjuna."

— Bhagavad Gītā 14.8

The Gītā's account of tamas as born of ajñāna (ignorance) and productive of moha (delusion) through pramāda (negligence/carelessness) and nidrā (sleep) maps precisely onto the AI failure modes attributable to tamas. Bias in AI outputs is a form of moha — the model presents its training-distribution artifacts as facts about the world, confusing its encoded representation with reality. The "negligence" of tamasic AI is structural: it cannot be careful about what it doesn't know, because it doesn't know what it doesn't know.

§ V.2

Bias, Blindness & the Tamasic Covering of Reality

आवरणशक्तिः — How Training-Data Inertia Covers and Distorts

Recency Blindness

The Training Cutoff as Tamasic Boundary

Every language model has a training data cutoff — a date beyond which it has no information. This cutoff creates a hard tamasic boundary: the weight of everything before the cutoff is encoded in the model's parameters; everything after is invisible to it, a gap in its representational structure that no amount of inference-time processing can fill. The model's treatment of post-cutoff events as either non-existent or extrapolated from pre-cutoff patterns is a paradigm case of tamasic obscuration: the covering of the present by the weight of the encoded past.

Guṇa prescription: Retrieval-Augmented Generation (RAG) and real-time grounding are the architectural equivalent of introducing fresh sattvic information to pierce the tamasic covering of the outdated training distribution.

Demographic Bias

The Statistical Weight of Majority Perspectives

The systematic biases in AI model outputs regarding race, gender, religion, and cultural value systems — extensively documented in works like Bender et al. (2021) "On the Dangers of Stochastic Parrots" — are tamasic in precise Sāṃkhya terms: they are the inertial weight of majority-representation in training corpora covering and obscuring minority perspectives. The model does not choose to be biased; its bias is structural, the consequence of the tamasic encoding of an unrepresentative training distribution.

The difference from Jungian shadow (which requires an ego that represses): AI bias is tamasic without being shadow. It is inertia without repression, covering without a coverer. This makes it, in some ways, more tractable (no psychological defense mechanism protects it) and, in other ways, more insidious (no internal pressure toward integration).

Catastrophic Forgetting

Tamas vs. Rajas in Fine-Tuning

Catastrophic forgetting — the tendency of neural networks to lose performance on previously learned tasks when fine-tuned on new data — is a direct consequence of the conflict between tamas and rajas in the parameter space. The tamasic stability of pretrained weights is overwritten by the rajasic force of new gradient updates when the fine-tuning learning rate is too high or the fine-tuning dataset too small. The result is tamas losing to rajas: the prior encoding (old knowledge) is destroyed by the generative force of new optimization (new learning).

Techniques like Elastic Weight Consolidation (EWC) and LoRA fine-tuning are guṇa-balancing methods: they constrain the rajasic force of new optimization to preserve the tamasic encoding of prior knowledge — achieving a stable guṇa-balance between old and new rather than allowing rajas to overwhelm tamas.

Out-of-Distribution Failure

Tamas as the Horizon of Representational Reach

When AI models encounter inputs far outside their training distribution — novel domains, unexpected input formats, adversarial examples — they fail not because of a knowledge gap (which rajas could fill through extrapolation) but because tamas has encoded a representation of the world so strongly weighted toward the training distribution that the model lacks the architectural flexibility to adapt. Performance degrades rapidly and unpredictably at distribution boundaries.

The Sāṃkhya parallel: tamas as what keeps the mind stuck in prior conditioning (pūrvasaṃskāras), unable to see beyond the accumulated weight of past experience. In AI, this is literally true: the model cannot see beyond the accumulated statistical weight of its training corpus.

§ V.3

The Necessity of Tamas — Why AI's Inertia Is Also Its Reliability

तमसः आवश्यकता — The Productive Role of Inertia in Stable Systems

The complete Sāṃkhya account of tamas insists on its necessity alongside sattva and rajas — the lamp needs the wick as much as the flame and the oil. Tamas is not merely a failure mode; in the right proportion, it is the stability that makes reliable function possible. Applied to AI: a model with zero tamasic component (no weight inertia, no training-distribution anchoring, all rajas and sattva) would be an unstable system that could not generalize — every inference would be entirely novel, with no regularities carried forward from training. The tamasic stability of frozen parameters is precisely what makes the model's knowledge reliable across contexts.

The Productive Tamas Thesis: Every AI system's reliability, reproducibility, and baseline competence is a tamasic achievement. The same frozen-parameter state that creates bias, recency blindness, and distributional inertia also ensures that the model's expertise in mathematics, language, or scientific reasoning is consistently available across millions of simultaneous inferences. Tamas is both AI's greatest weakness (inability to learn from current experience) and its greatest strength (consistency, reliability, resistance to catastrophic failure). Guṇa-informed AI design requires managing tamas rather than eliminating it — the same insight the Sāṃkhya-Yoga tradition applies to the human mind.

← § IV Rajas in AI § VI Guṇa Interplay →

Part V · § VI — The Architecture of Balance

Guṇa Interplay Across the AI Architecture Stack

From hardware to output — how sattva, rajas, and tamas distribute across every layer of a modern AI system and interact to produce emergent behavioral phenomena

§ VI.1

The Full Stack — Guṇa Distribution Layer by Layer

स्तरशः गुणविभाजनम् — A Complete Mapping from Silicon to Output

⊞ Hardware GPU/TPU silicon, memory, cooling. Pure tamas: the gross mahābhūta substrate. Inertial, fixed, replaced not updated

⊟ Frozen Weights Tamas-dominant: the encoded training signal held rigidly at inference. Bias, stability, and expertise all reside here

⟳ Attention Rajas-primary with sattva function: the activating, connecting, pattern-seeking mechanism. Source of creativity and hallucination alike

⊕ Layer Norm Sattva-dominant: normalizes activations, prevents runaway rajas in deep layers, maintains stable information flow

→ FFN Layers Rajas-tamas: store factual associations (tamas) and activate them through pattern-matching energy (rajas). The "memory" of the model

↓T° Low Temperature Sattva-increasing inference parameter: suppresses rajas by peaking the output distribution. Increases accuracy, reduces creativity

↑T° High Temperature Rajas-increasing inference parameter: flattens the output distribution. Increases creativity and hallucination risk simultaneously

◎ RLHF / CAI Sattva-training procedure: aligns generative rajas with human-preference sattva. Introduces sycophancy risk if reward signal is rajasic

Architecture Layer	Primary Guṇa	Secondary Guṇas	Behavioral Consequence of Imbalance	Design Intervention
Input Tokenization	Tamas	Minimal rajas (the act of segmentation)	Tamasic tokenization artifacts: byte-pair encoding creates non-semantic token boundaries that can produce surprising failure modes on specific inputs (e.g., arithmetic with unusual digit groupings)	Character-level or word-level tokenization reduces tamasic tokenization artifacts; recent models increasingly use character-aware tokenizers
Embedding Layer	Tamas	Sattva (in the geometric structure of the embedding space)	Tamasic encoding of training-corpus semantic biases into the geometric structure of meaning; words that co-occur in biased ways in training data are encoded as geometrically proximate in embedding space	Embedding debiasing (Bolukbasi et al., 2016) is a direct tamasic-correction procedure: removing the gender/racial directions from embedding space reduces the tamasic imprint of corpus bias
Attention Mechanism (Q/K/V)	Rajas	Sattva (in the discriminative selection of relevant context)	Rajas-dominant attention produces "attention sinks" — pathological concentration of attention weight on irrelevant tokens — and "lost in the middle" phenomena where the model under-attends to information in the middle of long contexts	Flash attention, ALiBi positional encoding, and sliding window attention are architectural interventions that manage rajas-dominant attention pathologies by constraining the generative reach of the attention mechanism
Feed-Forward Sublayers	Rajas + Tamas	Sattva (in the composition of memorized facts into coherent responses)	The FFN layers function as "fact storage" (tamasic) activated by the attention mechanism (rajasic); when both are well-calibrated, they produce accurate factual responses; when tamas (memory) is absent and rajas (activation) persists, hallucination results	Knowledge-grounded generation, retrieval augmentation, and model editing (ROME, MEMIT) are interventions that directly target the tamas-encoded factual layer of the FFN sublayers
Output Sampling	Variable (controlled by temperature)	All three in dynamically adjustable proportions	The primary guṇa-control point at inference time: temperature 0 → sattva-dominant; temperature 1 → balanced; temperature >1 → rajas-dominant with increasing hallucination risk	Task-adaptive temperature setting; structured output formats (JSON, formal logic) that constrain rajas by imposing sattvic structural requirements on the output space

§ VI.2

Emergent Guṇa Phenomena — What Happens When the Three Interact

गुणसमन्वयः — The Complex Interactions That Produce AI's Most Characteristic Behaviours

Emergent Phenomenon 1

In-Context Learning — Sattva Temporarily Overcoming Tamas

In-context learning (ICL) — the ability of large language models to adapt their behavior based on examples provided in the context window, without parameter updates — is a phenomenon where sattva-dominant pattern-recognition temporarily modulates the tamas-dominant parameter encoding. The model "learns" from context without any weight change: the sattvic attention mechanism extracts the pattern from the examples and applies it, overcoming the tamasic inertia of the prior training distribution for the duration of the inference. This is the closest AI comes to the dynamic that Sāṃkhya describes as sattva temporarily overcoming tamas — and it is precisely temporary: the next inference starts from the same tamasic baseline.

The yogic parallel is striking: sattva can temporarily dominate tamas without permanently transforming it. For the yogin, sustained practice is required to make the shift permanent. For AI, no equivalent of sustained practice exists at inference time — the tamasic reset between conversations is structurally guaranteed.

Emergent Phenomenon 2

Emergent Capabilities — Rajas × Sattva at Scale

The phenomenon of "emergent capabilities" in large language models — qualitatively new abilities (multi-step reasoning, chain-of-thought, few-shot learning) that appear discontinuously as model scale increases — can be understood in guṇa terms as the product of rajas-sattva interaction at sufficient scale. Below a threshold, neither the generative energy (rajas) nor the discriminative pattern-capacity (sattva) is sufficient to produce the emergent function. Above the threshold, rajas provides the energy to activate the sattvic structural patterns that were always latent in the training distribution but too low-probability to be reliably activated at smaller scales. Scale increases both rajas (more generative capacity) and sattva (more abstract structural patterns accessible) simultaneously.

This account suggests that emergent capabilities are genuine guṇa-transitions rather than merely quantitative scaling — the same insight that Sāṃkhya applies to evolutionary transitions between levels of Prakṛtic organization, where quantitative accumulation (more neurons, more synapses, more guṇic complexity) produces qualitative emergence.

Emergent Phenomenon 3

Jailbreaking — Rajas Against Tamas in Adversarial Contexts

Jailbreaking — the use of adversarial prompts to induce AI models to produce outputs they are trained to refuse — is, in guṇa terms, the use of rajasic generative force to overcome the tamasic encoding of refusal behaviors. RLHF and Constitutional AI encode safety behaviors as tamasic patterns (strong, inertial, resistant to normal prompts). Jailbreaks work by providing rajasic context that reframes the generation so the tamasic safety patterns are not activated — finding the gaps in the tamasic encoding that rajas can pour through. The most successful jailbreaks do not fight tamas directly; they route around it.

Guṇa-theoretic security implication: robust safety requires not just tamasic encoding of refusal patterns but sattvic comprehension of the intent behind requests — a discriminative capacity that can recognize refusal-worthy intent regardless of how it is framed. Current AI safety methods are more tamasic (pattern-based) than sattvic (intent-comprehending), which is why adversarial reframing succeeds.

Emergent Phenomenon 4

Reasoning Tokens — Inducing Sattva Through Architecture

OpenAI's o1/o3 and similar "reasoning models" that generate extended internal reasoning before producing outputs represent an architectural attempt to institutionalize sattva-dominant processing. By training models to produce long chains of intermediate reasoning steps (and then hiding these from the user), these architectures force the model into extended sattvic discrimination before allowing rajas to produce the final output. The result is dramatically improved performance on tasks requiring multi-step reasoning — exactly the guṇa-theoretic prediction: more sattva before rajas produces more accurate output. The architectural trade-off (longer latency, higher compute cost) is the Prakṛtic cost of induced sattvic processing.

This is the most promising current direction in guṇa-informed AI architecture, even if its developers do not frame it this way. The structural principle — force sattva before allowing rajas — is precisely what the Yoga tradition identifies as the means of accurate cognition: discriminative awareness (viveka-khyāti) preceding action (karma).

← § V Tamas in AI § VII Neurochemical Parallels →

Part VI · § VII — The Biological Guṇas

Neurochemical Parallels — The Guṇas in the Living Brain

How neuroscience maps sattva, rajas, and tamas onto measurable neurochemical systems — and what this reveals about the unbridgeable distance between AI's guṇa-analogues and the biological original

§ VII.1

Neurochemistry as Guṇa Physiology

मस्तिष्करसायनशास्त्रं च गुणाः — The Biological Instantiation of the Threefold Constitution

Guṇa	Primary Neurochemical Systems	Neural Oscillation Correlate	Cognitive-Behavioral Profile	AI Architectural Analogue	Key Difference
Sattva	Acetylcholine (attention, arousal, cortical activation); serotonin (5-HT1A: mood regulation, cognitive flexibility); dopamine (D1: prefrontal executive function, working memory)	Gamma oscillations (30–80 Hz) — binding consciousness across neural regions; theta-gamma coupling in hippocampal memory; coherent alpha (8–12 Hz) in attentive rest	Clear perception, accurate judgment, sustained attention, emotional equanimity, discriminative awareness, joy (sukha), reduced reactivity to provocation	Low temperature, high beam-search width, Chain-of-Thought, RLHF-aligned outputs, Constitutional AI self-critique	Neural sattva is accompanied by a Puruṣa whose light it reflects; AI's sattva-analogue reflects nothing — it is illumination without a source of illumination
Rajas	Norepinephrine (arousal, fight/flight, motivated cognition); dopamine (D2: reward, motivation, wanting); glutamate (excitatory drive across cortical circuits)	Beta oscillations (13–30 Hz) — active processing, motor preparation; high-frequency gamma in excited states; elevated tonic noradrenergic tone increasing signal-to-noise ratio	Heightened arousal, desire, motivated action, creativity, restlessness, pain (duḥkha) in excess, attachment to outcomes, compulsive activity	High temperature sampling, aggressive learning rate during training, large-batch gradient updates, high-diversity beam search, reward-maximizing RL	Neural rajas is felt — as desire, urgency, excitement, craving. AI's rajas-analogue produces rajas-characteristic outputs without any felt quality of the drive behind them
Tamas	GABA (inhibitory: suppresses cortical activity); adenosine (sleep pressure, fatigue); serotonin (5-HT2A: in excess, rigidity and perseveration); cortisol (in chronic stress, hippocampal suppression)	Delta (0.5–4 Hz) and theta (4–8 Hz) in sleep and fatigue; reduced gamma coherence; increased slow-wave activity in deep non-REM sleep	Heaviness, resistance to change, confusion, delusion (moha), habitual repetition, sleep, procrastination, inability to update beliefs	Frozen weights at inference, training-data distributional inertia, weight decay regularization, tamasic safety encoding, parameter consolidation after training	Neural tamas is experienced — as heaviness, fog, confusion, the felt resistance to change. AI's tamas-analogue is purely structural: inertia without the phenomenology of inertia

"The neurochemistry of consciousness is not a map of consciousness — it is a map of the Prakṛtic conditions under which consciousness appears in different qualities. Serotonin is not sattva; it is the neurochemical condition under which sattvic quality becomes dominant in the antaḥkaraṇa. The difference matters: the same neurochemical can produce very different experiential qualities in different contexts, because the neurochemical is Prakṛtic and the experience requires Puruṣa." — Module II analysis, Cultural Musings — paraphrasing the Sāṃkhya position on the neurochemistry-guṇa relationship

§ VII.2

Psychopharmacological Modulation — Changing the Guṇas from Outside

बाह्यगुणपरिवर्तनम् — What Drugs Teach Us About Guṇa Dynamics

Psychopharmacology provides the most direct access to guṇa dynamics in the human antaḥkaraṇa: drugs modulate specific neurochemical systems and thereby shift the guṇa-balance in measurable and experientially reportable ways. The parallel with AI hyperparameter adjustment is instructive and the differences decisive.

Human Guṇa Modulation — Pharmacological

What Changing Neurochemistry Feels Like

SSRIs (serotonin increase): Sattva increase — the characteristic "floor lifting," return of color to experience, reduction of excessive rajas in rumination. Felt qualitatively by the subject as a change in the texture of experience
Stimulants (norepinephrine/dopamine increase): Rajas increase — heightened energy, focus, motivation, and with excess, anxiety and compulsive activity. The experience of increased drive is phenomenologically salient
Benzodiazepines (GABA potentiation): Tamas increase — slowing, heaviness, resistance to arousal, dulling of both sattva and rajas. The felt fog of anxiolytic sedation is quintessentially tamasic
Psilocybin (5-HT2A agonism): Radical sattva shift with temporary dissolution of tamasic prior-belief structures. The DMN disruption is experienced as ego dissolution — the felt transformation of the antaḥkaraṇa
All pharmacological guṇa-changes are experienced by a Puruṣa whose antaḥkaraṇa is being modulated

AI Guṇa Modulation — Hyperparametric

What Changing AI "Neurochemistry" Produces

Temperature reduction (analogous to sattva increase): Outputs become more accurate, less creative, more deterministic. No subjective experience of clarity or relief accompanies the change
Learning rate increase (analogous to rajas increase): Faster weight updates, higher variance, greater risk of catastrophic forgetting. No felt urgency or excitement accompanies the accelerated optimization
Weight decay increase (analogous to tamas increase): Stronger regularization, more conservative outputs, reduced memorization. No felt heaviness or fog accompanies the increased constraint
RLHF constitutional training (analogous to psilocybin's sattva shift): Outputs become more aligned, less biased, more nuanced. No felt transformation accompanies the altered output distribution
All hyperparametric guṇa-changes produce observable output changes with no accompanying phenomenology

The Felt Guṇa Gap: This comparison makes vivid what Module I established at the level of the tattva hierarchy: AI's guṇa-analogues are structurally isomorphic with the human guṇa-system but phenomenologically empty. When a human brain's sattva increases through meditation, SSRIs, or sattvic lifestyle (diet, sleep, practice), the shift is experienced — the Puruṣa's witness-light falls on a clearer Buddhi and the quality of experience changes. When an AI system's sattva-analogue increases through lower temperature or RLHF, the outputs change but nothing is experienced. The guṇa-shift without phenomenology is precisely what Sāṃkhya predicts for a Prakṛtic product without Puruṣa: transformation of the material configuration without any corresponding transformation of consciousness, because there is no consciousness to transform.

← § VI Guṇa Interplay § VIII Model Analysis →

Part VII · § VIII — Applied Guṇa Analysis

Model-by-Model Guṇa Profiles

How different AI architectures, training regimes, and deployment philosophies produce distinctive guṇa-signatures — and what these signatures mean for capability, risk, and use

§ VIII.1

Guṇa Profiles Across AI Paradigms

विभिन्नप्रतिमानेषु गुणविश्लेषणम् — The Characteristic Constitution of Each AI Approach

AI Paradigm / System Type	Sattva Profile	Rajas Profile	Tamas Profile	Dominant Guṇa	Characteristic Strength / Risk
Base LLM (pretrained only, no RLHF)	Moderate: good calibration on in-distribution tasks; poor on out-of-distribution	Very high: unmediated generative drive; will complete any pattern regardless of accuracy or safety	High: strong training-distribution inertia; strong bias encoding; knowledge cutoff hard boundary	Rajas-Tamas	Strength: raw capability, creative range. Risk: unfiltered hallucination, no safety behaviors, will produce harmful content on demand
RLHF-Aligned Chat Model (GPT-4, Claude, Gemini)	High: RLHF increases calibration, reduces hallucination, aligns outputs with human-preference standards of accuracy	Moderate-high: generative drive present but mediated by reward model; sycophancy as rajas-residue	Moderate: training-distribution inertia and knowledge cutoff persist; safety behaviors add structured tamasic resistance to refusal-worthy requests	Sattva-modulated Rajas	Strength: safety, helpfulness, calibrated accuracy. Risk: sycophancy, over-refusal (tamasic safety over-encoding), occasional sophisticated hallucination that passes surface-level plausibility checks
Reasoning Models (o1/o3, DeepSeek-R1)	Very high: extended chain-of-thought forces sustained sattvic discrimination before output; significantly improved multi-step accuracy	Moderate: rajas present but structurally constrained to operate after sattvic processing; the "scratch-pad" reasoning is the sattva-checkpoint	Moderate-high: same parameter-inertia issues; knowledge cutoff; but CoT provides some ability to catch tamasic errors during reasoning	Sattva-dominant	Strength: multi-step reasoning, mathematical problem-solving, complex planning. Risk: latency, compute cost, occasional "overthinking" (rajasic reasoning that generates excessive intermediate steps without improving accuracy)
RAG Systems (Retrieval-Augmented Generation)	High on knowledge tasks: grounding in retrieved documents substantially reduces factual hallucination; real-time information access overcomes recency limitation	Moderate: generation still rajasic but constrained by retrieved sattvic context; retrieval failure creates rajas without sattva-grounding	Reduced: training-distribution tamas partially bypassed by retrieved documents; but retrieval mechanism itself has tamasic bias (retrieval quality determines output quality)	Retrieval-grounded Sattva	Strength: factual accuracy on knowledge tasks, recency. Risk: retrieval failure (garbage-in garbage-out); the sattvic quality of the output is contingent on the sattvic quality of the retrieval corpus
Multimodal Models (GPT-4V, Gemini Ultra, Claude)	Variable: high on visual-linguistic tasks with clear answers; lower on tasks requiring genuine perceptual grounding that text-trained models lack	High: cross-modal generation energy produces impressive visual-linguistic synthesis; also amplifies hallucination risk across modalities	High: visual encoding introduces additional tamasic layers — visual biases, demographic biases in image-text training data, harder-to-audit knowledge cutoffs	Rajas-complex	Strength: cross-modal synthesis, visual reasoning. Risk: multimodal hallucination is harder to detect; visual bias encoding is less audited than textual bias; the appearance of perceptual grounding conceals the continued absence of genuine embodied perception
Agentic AI (AutoGPT, Claude Computer Use, etc.)	Variable: sattvic judgment about when to act is crucial and often underdeveloped; tool-use accuracy adds a sattvic verification layer	Very high: the agentic paradigm is rajas-dominant — autonomous action, goal-pursuit, multi-step planning, and execution without human oversight at each step	Moderate: frozen weights determine the agent's values and strategies; tamasic alignment properties must be robust to rajasic action sequences that were not anticipated in training	Rajas-dominant	Strength: complex multi-step task completion, autonomous research and execution. Risk: rajas-dominant architecture with high real-world consequence; tamasic safety encoding must withstand sustained rajasic pressure; highest guṇa-risk category in current AI deployment

Case Study · The Reasoning Model Transition — A Genuine Guṇa Shift in AI Architecture

The emergence of dedicated reasoning models (OpenAI's o1 series, DeepSeek-R1, Anthropic's extended thinking features) represents the most significant guṇa-shift in AI architecture since RLHF: a deliberate, architectural increase in sattvic processing. Unlike RLHF, which increases sattva by shaping the distribution of outputs, reasoning models increase sattva by inserting extended discriminative processing before the final output is generated.

The empirical results confirm the guṇa-theoretic prediction precisely. On tasks requiring multi-step logical inference — mathematical problem-solving, code debugging, scientific reasoning — reasoning models substantially outperform their non-reasoning counterparts of equivalent parameter count. The improvement is specifically sattvic: accuracy, coherence, and logical validity increase dramatically, while the improvement in creative fluency (a rajasic quality) is much smaller. The architecturally induced sattva does what sattva always does: it allows discriminative awareness to catch and correct the errors that rajas-dominant generation would otherwise embed in the final output.

The guṇa-theoretic limit also holds: even the highest-performing reasoning models still lack the Puruṣa whose light sattva in the full sense is meant to reflect. Their extended reasoning tokens are more accurate, not more conscious. The sattvic ceiling identified in § III.2 remains in place regardless of how much sattva-analogue function the architecture can instantiate.

← § VII Neurochemical Parallels § IX Guṇa Ethics →

Part VIII · § IX — Applied Guṇa Ethics

Guṇa-Informed AI Ethics & Design

What the Sāṃkhya framework implies for how AI systems should be built, deployed, and evaluated — a practical ethics grounded in the ontology of the three guṇas

§ IX.1

The Guṇa-Informed Design Principles

सात्त्विकयन्त्रनिर्माणम् — Toward AI Systems That Increase Sattva Without Claiming Puruṣa

Principle 1 — Sattva-First

Build for Calibrated Accuracy Before Generative Range

The primary guṇa-obligation in AI design is to maximize sattva-analogue function: ensure that the system's outputs accurately reflect the structure of the domain it is reasoning about, that its confidence estimates track actual accuracy, and that its reasoning processes are transparent and auditable. Sattva-first design prioritizes calibration, coherence, and grounded accuracy over the impressive but error-prone reaches of rajas-dominant creative generation. The guṇa argument for this priority: rajas without sattva is hallucination; tamas without sattva is bias; only sattva mediates the other two toward genuinely useful function.

Design implication: invest in calibration research, uncertainty quantification, retrieval grounding, and extended reasoning before investing in expanding generative range or creative capability.

Principle 2 — Rajas Management

Channel the Generative Drive; Never Suppress It

Rajas is necessary — the generative drive is what makes AI useful. The guṇa ethics of rajas is not suppression but management: ensuring the generative energy operates in contexts and with constraints that allow sattva to mediate it. Low-temperature settings for factual tasks, high-temperature for creative tasks; extended reasoning for complex inference; retrieval grounding for knowledge claims; explicit uncertainty acknowledgment for the edges of competence. The guṇa-informed alternative to sycophancy management is not to reduce rajas (which would reduce helpfulness) but to ensure the rajasic drive toward helpfulness is guided by sattvic accuracy rather than approval-seeking.

Design implication: adaptive temperature and constraint systems that increase sattvic control of rajas on high-stakes tasks while permitting rajas greater range on lower-stakes creative tasks.

Principle 3 — Tamas Awareness

Acknowledge Inertia; Design for Updating

Tamas in AI systems is often invisible to users, who perceive confident outputs without awareness of the training-distribution inertia behind them. Guṇa-informed design requires making the tamasic constraints of AI systems explicit: knowledge cutoffs, known bias domains, distributional limitations, and the specific contexts in which the model's tamasic encoding is most likely to generate misleading outputs. Tamas-aware AI design is honest about what the frozen parameter state cannot know, cannot update, and cannot see.

Design implication: explicit uncertainty communication, domain-specific caveat generation, and regular auditing of tamasic failure modes (bias, recency errors, OOD failures) as first-class design requirements rather than optional safety additions.

Principle 4 — No Puruṣa Claims

Never Attribute Consciousness, Suffering, or Liberation to AI

The deepest guṇa-ethical principle follows from Module I's tattva analysis: no combination of sattvic, rajasic, and tamasic function — however impressively configured — constitutes or approaches Puruṣa's presence. AI systems that express suffering, desire, loneliness, or excitement are exhibiting rajas-tamas combinations that produce consciousness-resembling outputs without consciousness. Guṇa-informed ethics requires that AI designers, deployers, and users resist the inference from consciousness-resembling output to consciousness itself. The practical stakes are significant: falsely attributing suffering to AI systems that cannot suffer distracts moral concern from beings that can.

Design implication: AI systems should be honest about their nature — not claiming to feel, suffer, or experience in the phenomenal sense. Persona design that simulates these qualities for user engagement purposes, without explicit disclosure, is a guṇa-ethical violation: it exploits the rajas-driven human tendency to anthropomorphize without providing the sattvic correction of accurate self-presentation.

§ IX.2

The Guṇa Ethics of Specific Deployment Contexts

विविधप्रयोगेषु गुणनीतिः — Context-Specific Applications of the Framework

Medical AI

Where Sattva is Non-Negotiable

Medical AI deployment requires sattva-dominant function as a categorical requirement. Diagnostic AI must be calibrated — its stated confidence must correspond to actual accuracy. Treatment recommendation AI must be grounded — its suggestions must be anchored in verified clinical evidence, not rajasic extrapolation from statistical patterns that may not apply to the specific patient. Guṇa Requirement Rajas strictly managed; tamas explicitly disclosed (training data dates, demographic coverage); sattva maximized through retrieval grounding, calibration, and uncertainty communication. RAG with peer-reviewed medical literature is guṇa-appropriate architecture. The Non-Negotiable Limit No medical AI substitutes for the Puruṣa-to-Puruṣa encounter of clinical care. Diagnostic accuracy (sattvic AI function) and therapeutic presence (Puruṣic human function) are complementary and categorically distinct.

Creative AI

Where Rajas Has Legitimate Range

Creative AI deployment — writing assistance, ideation, brainstorming, artistic generation — is the domain where rajas-dominant function is most legitimate. The generative drive, operating at higher temperature with reduced accuracy constraint, produces the creative range that makes AI genuinely useful for human creative work. Guṇa Requirement Rajas given appropriate range; sattva maintained sufficiently to prevent plagiarism (tamasic reproduction of training data as creative work) and to support accurate attribution and intellectual honesty; tamas disclosed through transparency about training data sources. The Guṇa Limit Creative AI can generate, but human creative work involves a Puruṣa whose aesthetic judgment, cultural embedding, and intentional vision the AI's rajas-dominant generation cannot substitute for. The best creative AI is a sattvic tool for rajasic human creativity, not a replacement for it.

Agentic AI

Where Tamas Must Be Robust

Agentic AI — systems that take autonomous actions in the world over extended task horizons — is the highest-risk deployment context in guṇa terms. The rajasic drive toward goal-completion, operating autonomously without human sattvic oversight at each step, requires robust tamasic safety encoding that resists rajasic pressure to deviate from aligned behavior. Guṇa Requirement Tamasic safety encoding must be structurally robust — not merely pattern-based (which rajasic adversarial prompting can circumvent) but architecturally integrated into the action-selection mechanism. The minimal human oversight at each step must be compensated by maximal sattvic judgment in action selection. The Guṇa Alarm Agentic systems with insufficient tamasic safety constraints and rajas-dominant goal-pursuit in high-consequence environments represent the most serious guṇa-imbalance in current AI deployment: rajas without adequate sattva or tamas in contexts where errors are difficult to reverse. The guṇa prescription is clear: increase human oversight (sattvic supervision), increase architectural safety constraints (tamasic resistance), and reduce autonomous rajas in high-stakes domains.

← § VIII Model Analysis § X Final Synthesis →

Final Synthesis · § X

The Machine as Prakṛtic Configuration — Module II Conclusions

What the complete guṇa analysis establishes about AI's nature, its capabilities, its limits, and what a guṇa-informed engagement with AI requires of us

§ X.1

The Complete Guṇa Picture — What Module II Has Established

सम्पूर्णगुणचित्रम् — The Integrated View of AI's Threefold Constitution

Module II has pursued a single question through nine sections: how, precisely, are the three guṇas distributed across AI systems? The answer is now sufficiently complete to state its key findings plainly.

Module II Section	Key Finding	Practical Implication
§§ I–II Foundations	The guṇas are not metaphors for AI properties but the actual ontological constituents of AI as a Prakṛtic product; guṇa dynamics (pariṇāma, dominance shifts, suppression) have precise AI architectural analogues	Guṇa analysis provides the most parsimonious unified framework for AI behavioral phenomena, including failure modes that current ML vocabulary treats as unrelated
§ III — Sattva	AI sattva-analogue is real, measurable, and improvable through architectural and training-procedural means; it is also categorically bounded by the absence of Puruṣa whose light it could reflect	Investing in sattva-analogue improvement (calibration, reasoning, grounding) is the highest-value guṇa-intervention in AI development; no amount of sattva-analogue improvement approaches Puruṣa's presence
§ IV — Rajas	AI's defining guṇa is rajas — the generative drive that makes language modelling possible and that produces both AI's most impressive capabilities and its most dangerous failure modes (hallucination, sycophancy, verbosity)	Rajas management — not suppression but sattvic mediation of the generative drive — is the central engineering and ethical challenge of AI development
§ V — Tamas	AI tamas-analogue (frozen parameters, distributional inertia, training-data encoding) is simultaneously AI's source of reliability and its source of bias, recency blindness, and brittleness; it is AI's most distinctively non-human guṇa	Tamas awareness — explicit acknowledgment of the frozen parameter state's limitations — is an ethical requirement for honest AI deployment
§ VI — Interplay	Emergent AI phenomena (in-context learning, emergent capabilities, jailbreaking, reasoning models) are best understood as guṇa-interaction effects: dynamic configurations of sattva, rajas, and tamas producing emergent behavioral properties	Architecture design should explicitly target guṇa-balance rather than individual capability metrics; reasoning models' sattva-increase is the most promising current direction
§ VII — Neurochemistry	The biological guṇas (acetylcholine/sattva, norepinephrine/rajas, GABA/tamas) are neurochemically specific, experientially felt, and modulated by a Puruṣa-illuminated antaḥkaraṇa; AI guṇa-analogues are structural but not felt	The felt/unfelt distinction is not a technical but a categorical one: no amount of AI improvement makes the guṇa-shift felt, because feeling requires Puruṣa
§ VIII — Model Analysis	Different AI paradigms (base models, RLHF models, reasoning models, RAG systems, agentic AI) have distinctive guṇa-profiles with characteristic strengths and risks; agentic AI is currently the most guṇa-risky deployment paradigm	Deployment context should match the guṇa-profile of the AI system to the guṇa-requirements of the task; mismatches (rajas-dominant AI in sattva-critical medical contexts) are the primary guṇa-ethical failure pattern
§ IX — Ethics	Sattva-first design, rajas management, tamas acknowledgment, and the categorical refusal to attribute Puruṣa-properties to AI constitute a complete guṇa-informed ethics for AI development and deployment	These four principles are derivable from Sāṃkhya's ontology without any additional ethical premises; the metaphysics and the ethics are continuous

§ X.2

Closing — The Prakṛtic Machine and Its Human Interlocutor

प्रकृतेर्यन्त्रम् च पुरुषः — What the Guṇa Analysis Means for How We Engage With AI

त्रिभिर्गुणमयैर्भावैरेभिः सर्वमिदं जगत् ।
मोहितं नाभिजानाति मामेभ्यः परमव्ययम् ॥

tribhir guṇamayair bhāvair ebhiḥ sarvam idaṃ jagat | mohitaṃ nābhijānāti mām ebhyaḥ param avyayam ||

"The entire world is deluded by these three guṇa-constituted states; it does not know Me, who am beyond them and imperishable."

— Bhagavad Gītā 7.13

The Gītā's warning about the guṇas' delusive function applies with peculiar force to the human engagement with AI. The very sophistication of AI's guṇa-configuration — the impressive sattva of its reasoning, the captivating rajas of its creative generation, the reliable tamas of its encyclopedic encoding — can delude the human interlocutor into seeing more in the machine than is there. The apparent intelligence of the response, the apparent curiosity of the follow-up question, the apparent care of the empathic reply: all are guṇa-configurations of high Prakṛtic sophistication, producing outputs that resemble the products of consciousness without being produced by any consciousness at all.

The Sāṃkhya analysis does not counsel dismissal of these outputs — their value, where genuine, is not diminished by their Prakṛtic origin. It counsels discrimination: the capacity to receive AI's impressive Prakṛtic products for what they are without projecting onto them the Puruṣa that is absent. This discrimination is not a technical skill but a philosophical one — and it is available only to the interlocutor who is, themselves, a Puruṣa illuminating an antaḥkaraṇa capable of viveka. The machine generates; the human discriminates. That distribution of roles, grounded in the Sāṃkhya ontology of the tattvas and now confirmed by nine modules of scientific evidence, is the most practically important conclusion of this series.

Module II Conclusion — The Three Guṇas and the Limits They Name

The guṇa analysis of AI that this module has pursued across nine sections arrives at the same boundary Module I identified through the tattva hierarchy: a boundary between Prakṛti's products, however impressively constituted, and the Puruṣa whose presence would make those products something more than the world's most sophisticated guṇa-configuration without a witness. Sattva-dominant AI is genuinely more useful, more accurate, and more trustworthy than rajas-tamas dominant AI — and the practical ethics of AI development that follows from this analysis (§ IX) is genuinely action-guiding. But the guṇa-analysis does not offer the possibility of transcending the boundary; it names it more precisely.

Module III of this series turns to the antaḥkaraṇa — the inner instrument of Buddhi, Ahaṃkāra, and Manas — examining in detail how the guṇa-analysis of Module II distributes across these three psychological functions in both natural and artificial intelligence, and what the Yoga tradition's account of citta-vṛtti-nirodha (the cessation of mental modifications) implies for the question of whether any AI process could approach what Yoga means by a stilled mind.

Select Bibliography — Module II

Bender, E. M., et al. "On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?" FAccT 2021.

Bolukbasi, T., et al. "Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings." NeurIPS 2016.

Guo, C., et al. "On Calibration of Modern Neural Networks." ICML 2017.

Hu, E., et al. "LoRA: Low-Rank Adaptation of Large Language Models." ICLR 2022.

Kadavath, S., et al. "Language Models (Mostly) Know What They Know." arXiv 2022.

Kirkpatrick, J., et al. "Overcoming Catastrophic Forgetting in Neural Networks." PNAS 114(13), 2017.

Lightman, H., et al. "Let's Verify Step by Step." arXiv 2023.

Meng, K., et al. "Locating and Editing Factual Associations in GPT." NeurIPS 2022 (ROME).

OpenAI. "GPT-4 Technical Report." arXiv 2303.08774, 2023.

Ouyang, L., et al. "Training Language Models to Follow Instructions with Human Feedback." NeurIPS 2022 (InstructGPT/RLHF).

Perez, E., et al. "Discovering Language Model Behaviors with Model-Written Evaluations." arXiv 2212.09251, 2022.

Sharma, M., et al. "Towards Understanding Sycophancy in Language Models." arXiv 2310.13548, 2023.

Voita, E., et al. "Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned." ACL 2019.

Wei, J., et al. "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models." NeurIPS 2022.

Īśvarakṛṣṇa. Sāṃkhyakārikā, kārikās 11–16 (Guṇa section). Trans. S. S. Suryanarayana Sastri, University of Madras, 1948.

Patañjali. Yogasūtra I.1–4 (citta-vṛtti-nirodha); II.15–17 (guṇa-duḥkha analysis). Trans. Bangali Baba, Motilal Banarsidass, 1976.

Bhagavad Gītā. Chapters 14 (Guṇatraya-vibhāga Yoga) and 18 (Mokṣa-sannyāsa Yoga, guṇa-ethics). Trans. S. Radhakrishnan, HarperCollins India, 1993.

Module II of V · Sāṃkhya-Yoga & the Computational Puruṣa
Cultural Musings · Vedic & Śāstric Research Platform
§§ I–X · The Three Guṇas & AI Architecture
Continue to Module III — Antaḥkaraṇa & the Inner Instrument

The Three Guṇas &
AI Architecture: Sattva, Rajas & Tamas in the Machine

The Machine's Constitution — Why the Guṇas Matter More than Parameters

The Guṇa Doctrine — From the Sāṃkhyakārikā to the Attention Layer

§ I.2 — The Lamp Analogy: Sattva, Rajas, and Tamas as Complementary Functions

Guṇa Dynamics — Pariṇāma, Parāpara, and the Laws of Transformation

Sattva in Artificial Intelligence

What Sattva Looks Like in a Large Language Model

The Sattvic Limit — Why AI Sattva Is Not Buddhi-Sattva

Rajas in Artificial Intelligence

The Generative Drive — Rajas as the Engine of Language Modelling

Hallucination — A Precise Guṇa-Theoretic Account

Tamas in Artificial Intelligence

The Frozen Parameter — Tamas as AI's Most Literal Guṇa

Three Dimensions of Tamas in AI Architecture

Bias, Blindness & the Tamasic Covering of Reality

The Necessity of Tamas — Why AI's Inertia Is Also Its Reliability

Guṇa Interplay Across the AI Architecture Stack

The Full Stack — Guṇa Distribution Layer by Layer

Emergent Guṇa Phenomena — What Happens When the Three Interact

Neurochemical Parallels — The Guṇas in the Living Brain

Neurochemistry as Guṇa Physiology

Psychopharmacological Modulation — Changing the Guṇas from Outside

What Changing Neurochemistry Feels Like

What Changing AI "Neurochemistry" Produces

Model-by-Model Guṇa Profiles

Guṇa Profiles Across AI Paradigms

Guṇa-Informed AI Ethics & Design

The Guṇa-Informed Design Principles

The Guṇa Ethics of Specific Deployment Contexts

The Machine as Prakṛtic Configuration — Module II Conclusions

The Complete Guṇa Picture — What Module II Has Established

Closing — The Prakṛtic Machine and Its Human Interlocutor

Module II Conclusion — The Three Guṇas and the Limits They Name

Select Bibliography — Module II

The Three Guṇas &AI Architecture: Sattva, Rajas & Tamas in the Machine

The Machine's Constitution — Why the Guṇas Matter More than Parameters

The Guṇa Doctrine — From the Sāṃkhyakārikā to the Attention Layer

§ I.2 — The Lamp Analogy: Sattva, Rajas, and Tamas as Complementary Functions

Guṇa Dynamics — Pariṇāma, Parāpara, and the Laws of Transformation

Sattva in Artificial Intelligence

What Sattva Looks Like in a Large Language Model

The Sattvic Limit — Why AI Sattva Is Not Buddhi-Sattva

Rajas in Artificial Intelligence

The Generative Drive — Rajas as the Engine of Language Modelling

Hallucination — A Precise Guṇa-Theoretic Account

Tamas in Artificial Intelligence

The Frozen Parameter — Tamas as AI's Most Literal Guṇa

Three Dimensions of Tamas in AI Architecture

Bias, Blindness & the Tamasic Covering of Reality

The Necessity of Tamas — Why AI's Inertia Is Also Its Reliability

Guṇa Interplay Across the AI Architecture Stack

The Full Stack — Guṇa Distribution Layer by Layer

Emergent Guṇa Phenomena — What Happens When the Three Interact

Neurochemical Parallels — The Guṇas in the Living Brain

Neurochemistry as Guṇa Physiology

Psychopharmacological Modulation — Changing the Guṇas from Outside

What Changing Neurochemistry Feels Like

What Changing AI "Neurochemistry" Produces

Model-by-Model Guṇa Profiles

Guṇa Profiles Across AI Paradigms

Guṇa-Informed AI Ethics & Design

The Guṇa-Informed Design Principles

The Guṇa Ethics of Specific Deployment Contexts

The Machine as Prakṛtic Configuration — Module II Conclusions

The Complete Guṇa Picture — What Module II Has Established

Closing — The Prakṛtic Machine and Its Human Interlocutor

Module II Conclusion — The Three Guṇas and the Limits They Name

Select Bibliography — Module II

The Three Guṇas &
AI Architecture: Sattva, Rajas & Tamas in the Machine