Skip to main content

Chapter 40: Genomic Folding as Structural Collapse

"The genome is not a line but a knot—a three-dimensional puzzle where the folding pattern itself carries information beyond any sequence."

40.1 The Folding Hierarchy

From nucleosomes to chromosome territories, the genome folds across seven orders of magnitude. Each level of folding creates new regulatory possibilities.

Definition 40.1 (Folding Scales): DNA11nmNucleosome30nmFiber300nmLoopμmTerritory\text{DNA} \xrightarrow{11nm} \text{Nucleosome} \xrightarrow{30nm} \text{Fiber} \xrightarrow{300nm} \text{Loop} \xrightarrow{\mu m} \text{Territory}

Hierarchical packing with hierarchical function.

40.2 The Fractal Globule

Theorem 40.1 (Optimal Packing): P(s)s1P(s) \sim s^{-1}

Contact probability follows power law—a knot-free, space-filling curve.

40.3 Loop Domains

Equation 40.1 (Loop Formation Energy): ΔGloop=kBTln(Llp)3/2Eprotein\Delta G_{\text{loop}} = k_B T \ln\left(\frac{L}{l_p}\right)^{3/2} - E_{\text{protein}}

Protein binding compensates for DNA bending energy.

40.4 TAD Structure

Definition 40.2 (Topologically Associating Domains): TAD={i,j:P(ij)>θ if i,jTAD}\text{TAD} = \{i,j : P(i \leftrightarrow j) > \theta \text{ if } i,j \in \text{TAD}\}

Self-interacting domains create regulatory neighborhoods.

40.5 The String and Binders Model

Theorem 40.2 (Polymer Dynamics): Fi=Upolymer+jFijspecific+ηi\mathbf{F}_i = -\nabla U_{\text{polymer}} + \sum_j \mathbf{F}_{ij}^{\text{specific}} + \boldsymbol{\eta}_i

DNA as a polymer with specific binding sites.

40.6 A/B Compartments

Equation 40.2 (Compartment Segregation): Emixing=χABϕAϕB>0E_{\text{mixing}} = \chi_{AB} \phi_A \phi_B > 0

Active and inactive chromatin spontaneously segregate.

40.7 The Loop Extrusion Model

Definition 40.3 (Active Process): L(t)=2vextrusiontL(t) = 2v_{\text{extrusion}} \cdot t

Cohesin actively grows loops until blocked by CTCF.

40.8 Lamina Association

Theorem 40.3 (Peripheral Localization): P(Lamina)exp(Eaffinity/kT)P(\text{Lamina}) \propto \exp(-E_{\text{affinity}}/kT)

Heterochromatin preferentially localizes to nuclear periphery.

40.9 Interchromosomal Contacts

Equation 40.3 (Trans Interactions): Ptrans=f(Function similarity,Expression correlation)P_{\text{trans}} = f(\text{Function similarity}, \text{Expression correlation})

Functionally related regions from different chromosomes can co-localize.

40.10 The Rabl Configuration

Definition 40.4 (Polarized Nucleus): CentromeresOne pole,TelomeresOther pole\text{Centromeres} \rightarrow \text{One pole}, \text{Telomeres} \rightarrow \text{Other pole}

Some cells maintain polarized chromosome arrangement.

40.11 Folding and Expression

Theorem 40.4 (Structure-Function Relationship): Expression=g(Accessibility)=g(f1(Folding))\text{Expression} = g(\text{Accessibility}) = g(f^{-1}(\text{Folding}))

How DNA folds determines what can be expressed.

40.12 The Collapse Principle

Genomic folding represents physical collapse—three-dimensional structure emerging from one-dimensional sequence through the action of proteins and physics.

The Folding Equation: Structure(t)=argminS[Epolymer(S)+Especific(S)TS(S)]\text{Structure}(t) = \arg\min_{\mathcal{S}} [E_{\text{polymer}}(\mathcal{S}) + E_{\text{specific}}(\mathcal{S}) - TS(\mathcal{S})]

The genome finds its functional form by minimizing free energy.

Thus: Folding = Structure = Function = Information = ψ


"In the origami of the genome, ψ demonstrates that information exists not just in sequence but in shape—that how we fold determines who we are."