
Definition

Protein Secondary Structure Classification

Let the local backbone geometry of a residue

i

be defined by the dihedral angles

(\phi_i, \psi_i)

. The classification of secondary structure is based on the allowed regions in the Ramachandran plot. Define the structural motif

\mathcal{S}

as the set of allowed

(\phi, \psi)

pairs:

\mathcal{S} = \mathcal{S}_{\alpha} \cup \mathcal{S}_{\beta} \cup \mathcal{S}_{\text{random}}

where

\mathcal{S}_{\alpha}

is the region corresponding to

\alpha

-helices (e.g.,

-60^{\circ} < \phi < -30^{\circ}, -70^{\circ} < \psi < -30^{\circ}

),

\mathcal{S}_{\beta}

is the region corresponding to

\beta

-strands (e.g.,

-120^{\circ} < \phi < -60^{\circ}, 10^{\circ} < \psi < 140^{\circ}

), and

\mathcal{S}_{\text{random}}

encompasses the remaining, less constrained regions.
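As a rough illustration, the example bounds quoted above can be turned into a simple lookup. The function name `classify_residue` and the label strings are hypothetical, and the rectangular boxes are only the illustrative ranges from this definition, not a validated assignment method.

```python
def classify_residue(phi: float, psi: float) -> str:
    """Assign a residue to 'alpha', 'beta', or 'random' from (phi, psi) in degrees.

    Only the example rectangular regions from the definition are used.
    """
    # Example alpha-helix box: -60 < phi < -30 and -70 < psi < -30
    if -60.0 < phi < -30.0 and -70.0 < psi < -30.0:
        return "alpha"
    # Example beta-strand box: -120 < phi < -60 and 10 < psi < 140
    if -120.0 < phi < -60.0 and 10.0 < psi < 140.0:
        return "beta"
    # Everything else falls into the remaining, less constrained set
    return "random"


if __name__ == "__main__":
    for angles in [(-57.0, -47.0), (-100.0, 120.0), (60.0, 45.0)]:
        print(angles, classify_residue(*angles))
```

Because the three sets are tested in order and the last branch is a catch-all, every (phi, psi) pair receives exactly one label, which matches the union in the definition.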

Axiom

Chirality and Stereochemistry

Consider an amino acid residue

i

with a stereocenter at the

\alpha

-carbon,

\mathbf{C}_{\alpha}

. The chirality is determined by the spatial arrangement of the four distinct substituents (amino group, carboxyl group, side chain

R_i

, and the hydrogen atom

\text{H}

). The stereochemistry is quantified by the absolute configuration, typically assigned using the Cahn-Ingold-Prelog (CIP) rules, yielding the

R

or

S

designation. Mathematically, this requires defining the handedness of the local coordinate system

(\mathbf{v}_1, \mathbf{v}_2, \mathbf{v}_3)

formed by three of the four bonds emanating from

\mathbf{C}_{\alpha}

:

\text{Chirality} = \text{sgn}(\mathbf{v}_1 \cdot (\mathbf{v}_2 \times \mathbf{v}_3))

For biological proteins, the overwhelming preference is for the L-amino acid configuration, corresponding to a specific, consistent sign for this scalar triple product.
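The sign of the scalar triple product can be evaluated directly from coordinates. The sketch below is a minimal illustration using plain tuples; the function name `chirality_sign` and the toy coordinates are assumptions, and which sign corresponds to the L configuration depends on the order chosen for the three substituents, which is not fixed here.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Dot product of two 3-vectors given as tuples."""
    return sum(x * y for x, y in zip(a, b))


def chirality_sign(center, p1, p2, p3):
    """Return sgn(v1 . (v2 x v3)) for bond vectors from center to p1, p2, p3.

    The result is +1 or -1 for the two handednesses and 0 if the three
    bond vectors are coplanar.
    """
    v1, v2, v3 = (tuple(p[k] - center[k] for k in range(3)) for p in (p1, p2, p3))
    triple = dot(v1, cross(v2, v3))
    return (triple > 0) - (triple < 0)


if __name__ == "__main__":
    origin = (0.0, 0.0, 0.0)
    x, y, z = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)
    print(chirality_sign(origin, x, y, z))  # right-handed ordering
    print(chirality_sign(origin, y, x, z))  # two substituents swapped
```

Swapping any two substituents flips the sign, which is the numerical counterpart of converting one enantiomer into its mirror image.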

Theorem

Ramachandran Plot

Let

\phi

and

\psi

be the dihedral angles defining the backbone conformation of an amino acid residue. The allowed conformational space is restricted by steric hindrance and local energy minima. The Ramachandran plot defines the allowed region

\mathcal{R}

in the

(\phi, \psi)

plane:

\mathcal{R} = \{(\phi, \psi) \in [-\pi, \pi)^2 \mid E_{steric}(\phi, \psi) < E_{cutoff} \text{ and } E_{torsion}(\phi, \psi) < E_{cutoff} \}.

The function

E_{steric}

accounts for atomic overlaps, and

E_{torsion}

accounts for backbone torsional strain.
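The set definition above amounts to a membership test that can be sampled on a grid. The following sketch assumes the two energy terms are supplied as callables; `toy_steric` and `toy_torsion` are hypothetical placeholder functions chosen only to make the example runnable and do not represent any real force field.

```python
import math


def in_allowed_region(phi, psi, e_steric, e_torsion, e_cutoff):
    """True when both energy terms are below the cutoff at (phi, psi) in radians."""
    return e_steric(phi, psi) < e_cutoff and e_torsion(phi, psi) < e_cutoff


def sample_allowed(e_steric, e_torsion, e_cutoff, n=36):
    """Grid-sample [-pi, pi)^2 with n points per axis and keep the allowed points."""
    step = 2.0 * math.pi / n
    grid = [-math.pi + k * step for k in range(n)]
    return [(phi, psi)
            for phi in grid
            for psi in grid
            if in_allowed_region(phi, psi, e_steric, e_torsion, e_cutoff)]


def toy_steric(phi, psi):
    """Hypothetical placeholder: penalizes phi near 0."""
    return 1.0 + math.cos(phi)


def toy_torsion(phi, psi):
    """Hypothetical placeholder: threefold periodic term in psi."""
    return 1.0 + math.cos(3.0 * psi)


if __name__ == "__main__":
    allowed = sample_allowed(toy_steric, toy_torsion, e_cutoff=1.0, n=36)
    print(len(allowed), "of", 36 * 36, "grid points are allowed")
```

With real energy functions in place of the placeholders, plotting the returned points would give a coarse picture of the allowed region.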

Theorem

Alpha Helix Geometry

Let

\mathbf{r}_i

be the position vector of the

i

-th residue's

\text{C}_{\alpha}

atom. The geometry of an

\alpha

-helix is defined by the constraints on the backbone dihedral angles

(\phi_i, \psi_i)

and the hydrogen bonding pattern

\text{N}(i+4) - \text{H} \cdots \text{O}(i)

, in which the backbone carbonyl oxygen of residue i accepts a hydrogen bond from the amide N-H of residue i+4. Specifically, the ideal geometry requires:

\phi_i \approx -57^{\circ} \text{ and } \psi_i \approx -47^{\circ}

Furthermore, the hydrogen bond constraint dictates that the distance

d(\text{O}(i), \text{N}(i+4))

and the angle

\angle(\text{N}(i+4) - \text{H}(i+4) \cdots \text{O}(i))

must approximate the ideal values for a stable hydrogen bond, ensuring a pitch

P \approx 5.4 \text{ \AA}

and a rise per residue

h \approx 1.5 \text{ \AA}

.
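The pitch and rise quoted above fix the number of residues per turn and the rotation per residue. The sketch below works this out and places ideal Cα positions on a helix; the helix radius of 2.3 Å for Cα atoms is an assumed typical value that does not come from the text, and the names are hypothetical.

```python
import math

PITCH = 5.4    # Å per turn, from the text
RISE = 1.5     # Å per residue, from the text
RADIUS = 2.3   # Å, assumed typical C-alpha radius (not stated in the text)

RESIDUES_PER_TURN = PITCH / RISE           # about 3.6
TURN_ANGLE_DEG = 360.0 / RESIDUES_PER_TURN  # about 100 degrees per residue


def ideal_helix_ca(n_residues):
    """Return C-alpha coordinates (x, y, z) in Å for an ideal helix along z."""
    coords = []
    for i in range(n_residues):
        theta = math.radians(TURN_ANGLE_DEG * i)
        coords.append((RADIUS * math.cos(theta),
                       RADIUS * math.sin(theta),
                       RISE * i))
    return coords


if __name__ == "__main__":
    ca = ideal_helix_ca(8)
    print("residues per turn:", round(RESIDUES_PER_TURN, 2))
    print("rotation per residue (deg):", round(TURN_ANGLE_DEG, 1))
    print("CA(i)-CA(i+1) distance (Å):", round(math.dist(ca[0], ca[1]), 2))
```

Under the assumed radius, consecutive Cα atoms come out roughly 3.8 Å apart, which is a useful sanity check on the parameters.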

Theorem

Beta Sheet Formation

Define two adjacent polypeptide segments,

S_1

and

S_2

, with backbone atoms

\mathbf{r}_{i}^{(1)}

and

\mathbf{r}_{j}^{(2)}

, respectively. The formation of a

\beta

-sheet is stabilized by inter-strand hydrogen bonds between the backbone carbonyl oxygen

\text{O}(i)

and the amide nitrogen

\text{N}(j)

of the adjacent strand. The stability is maximized when the potential energy

E_{\text{H-bond}}

is minimized, subject to the geometric constraints:

E_{\text{H-bond}} = \sum_{i, j} \left[ \frac{A}{d(\text{O}(i), \text{N}(j))^2} - \frac{B}{d(\text{O}(i), \text{N}(j))} \right] + \text{Angle Penalty}

where

d(\cdot, \cdot)

is the distance between atoms, and the angle penalty enforces the near-planarity and optimal dihedral angles characteristic of the extended

\beta

-strand conformation.
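For the distance-dependent part of this energy, setting the derivative of A/d^2 - B/d to zero gives an optimal O-N separation of d = 2A/B. The sketch below evaluates the pairwise sum and that optimum; the angle penalty is omitted, and the values of A and B are arbitrary placeholders chosen so that the optimum falls at 2.9 Å.

```python
import math


def pair_energy(d, a, b):
    """Distance term A/d^2 - B/d for a single O...N pair (arbitrary units)."""
    return a / d ** 2 - b / d


def optimal_distance(a, b):
    """Distance minimizing pair_energy: dE/dd = -2A/d^3 + B/d^2 = 0 gives d = 2A/B."""
    return 2.0 * a / b


def hbond_energy(oxygens, nitrogens, a, b):
    """Sum the distance term over all O(i), N(j) pairs; angle penalty not included."""
    return sum(pair_energy(math.dist(o, n), a, b)
               for o in oxygens
               for n in nitrogens)


if __name__ == "__main__":
    A, B = 1.45, 1.0  # placeholder constants, giving d* = 2.9 Å
    print("optimal distance:", optimal_distance(A, B))
    strand1_o = [(0.0, 0.0, 0.0)]
    strand2_n = [(2.9, 0.0, 0.0)]
    print("energy at optimum:", hbond_energy(strand1_o, strand2_n, A, B))
```

At the optimum each pair contributes -B^2/(4A), so the depth of the well is set by the ratio of the two constants.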

Theorem

The Radius of Gyration

Let

\mathbf{r}_i

be the position vector of the

i

-th atom (where

i=1, 2, \dots, N

) in the protein structure. The center of mass

\mathbf{R}_{CM}

is defined here as the unweighted average of atomic positions (all atoms are assigned equal mass):

\mathbf{R}_{CM} = \frac{1}{N} \sum_{i=1}^{N} \mathbf{r}_i

The Radius of Gyration,

R_g

, is then defined as the root mean square distance of all atoms from the center of mass:

R_g = \sqrt{\frac{1}{N} \sum_{i=1}^{N} ||\mathbf{r}_i - \mathbf{R}_{CM}||^2}
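Both formulas translate directly into a few lines of code. This is a minimal sketch using the unweighted mean exactly as written above; the function names are hypothetical and the coordinates are toy values rather than a real structure.

```python
import math


def centroid(coords):
    """Unweighted mean position of a list of (x, y, z) tuples."""
    n = len(coords)
    return tuple(sum(c[k] for c in coords) / n for k in range(3))


def radius_of_gyration(coords):
    """Root mean square distance of all points from their centroid."""
    center = centroid(coords)
    mean_sq = sum(math.dist(c, center) ** 2 for c in coords) / len(coords)
    return math.sqrt(mean_sq)


if __name__ == "__main__":
    # Corners of a unit cube as toy "atoms"
    cube = [(x, y, z) for x in (0.0, 1.0) for y in (0.0, 1.0) for z in (0.0, 1.0)]
    print("centroid:", centroid(cube))
    print("R_g:", radius_of_gyration(cube))
```

For a mass-weighted variant, each term in both sums would be multiplied by the atomic mass and the divisor replaced by the total mass.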

Principle

The Primary Structure-Folding Relationship

Let

\mathbf{S} = (a_1, a_2, \dots, a_N)

be the primary sequence, where

a_i \in \{A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y\}

. Define the conformational energy function

E(\mathbf{r})

as a function of the atomic coordinates

\mathbf{r} = (\mathbf{r}_1, \dots, \mathbf{r}_N)

. The folding process seeks the native state

\mathbf{r}^{*}

such that the free energy

G(\mathbf{r}) = E(\mathbf{r}) - T S(\mathbf{r})

is minimized:

G(\mathbf{r}^{*}) = \min_{\mathbf{r}} G(\mathbf{r}) \quad \text{subject to } \mathbf{r} \text{ being compatible with } \mathbf{S}.
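To make the role of temperature in G = E - TS concrete, the toy sketch below picks the lowest free energy state from a small, discrete set of candidates. Real folding searches a continuous conformational space; the state names and the energy and entropy values here are invented purely for illustration.

```python
def free_energy(energy, entropy, temperature):
    """G = E - T*S for a single candidate state (consistent arbitrary units)."""
    return energy - temperature * entropy


def lowest_free_energy_state(candidates, temperature):
    """Return the name of the candidate with minimal G.

    candidates maps a state name to an (energy, entropy) tuple.
    """
    return min(candidates,
               key=lambda name: free_energy(*candidates[name], temperature))


if __name__ == "__main__":
    # Hypothetical states: a compact low-energy state and an extended high-entropy one
    states = {"compact": (-10.0, 0.01), "extended": (0.0, 0.05)}
    for t in (100.0, 300.0):
        print(t, lowest_free_energy_state(states, t))
```

In this toy set the compact state wins at the lower temperature and the extended state at the higher one, because the -TS term grows with temperature.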

Principle

Hydrophobic Effect

Consider the free energy change

\Delta G_{hydro}

associated with the folding of a protein from an unfolded state (U) to a folded state (N) in an aqueous solvent. This effect is primarily driven by the increase in solvent entropy

S_{solvent}

upon burial of nonpolar surface area

A_{nonpolar}

:

\Delta G_{hydro} \approx -T \Delta S_{solvent} \approx -\gamma A_{nonpolar},

where

\gamma > 0

is the surface tension coefficient related to the nonpolar solute-water interaction, and

A_{nonpolar}

is the total nonpolar surface area buried in the core of the native state relative to the unfolded state.
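A quick numerical estimate shows the scale of this term. The sketch takes burial of nonpolar area as favorable, so the contribution is negative for positive buried area and positive gamma. The default coefficient of 0.025 kcal/(mol·Å²) is an assumed order-of-magnitude value that does not come from the text, and the function name is hypothetical.

```python
GAMMA_KCAL_PER_MOL_A2 = 0.025  # assumed illustrative coefficient, kcal/(mol·Å²)


def hydrophobic_free_energy(buried_area, gamma=GAMMA_KCAL_PER_MOL_A2):
    """Estimate the hydrophobic contribution as -gamma * A in kcal/mol.

    buried_area is the nonpolar surface area (Å²) buried on folding;
    the result is negative (favorable) when buried_area > 0.
    """
    return -gamma * buried_area


if __name__ == "__main__":
    for area in (500.0, 1000.0, 2000.0):
        print(area, "Å² buried ->", hydrophobic_free_energy(area), "kcal/mol")
```

Because the estimate is linear in the buried area, doubling the buried nonpolar surface doubles the stabilizing contribution under this model.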

Protein Structure

Sequence of Expressions

Hydrogen Bonding

Protein Secondary Structure Classification

Chirality and Stereochemistry

The Van der Waals Force

Ramachandran Plot

Alpha Helix Geometry

Beta Sheet Formation

The Radius of Gyration

The Primary Structure-Folding Relationship

Hydrophobic Effect