Keng
This page is used by the judges to evaluate your team for the medal criterion or award listed below.
Homology Modeling Overview
As mentioned in the project description, our goal is to link five genes from the taxol biosynthesis pathway. In order to better understand the behavior of the proteins that we have isolated from the taxol biosynthesis pathway, we are using homology modeling to learn about active site architecture and catalytic functions. Homology modeling is based on the observation that related protein structures tend to have similar 3-D structures and functions. During homology modeling, 1 or more template proteins are used to identify structurally conserved regions, and to predict structurally variable regions that often include mutations from an already known structure. Through homology modeling, we can learn a lot about details about the protein such as active site architecture, ligand binding, and etc. Usually, when the target sequence is 30-50% similar (30%-50% identical amino acids) to the template sequence, they will share 80%+ shared 3-D structures. During the modeling process, we will be looking for template sequences with 30%+ sequence identity as good models for the target sequence.
Tools
PyMol: a software modeling application that allows users to view the 3-d structure of any proteins, including secondary, tertiary, and quaternary structures and the molecular interactions between side chains. It's the main tool that we use to visualize active site, and terminals of proteins.
ModWeb: a web-based database with all protein templates present and target sequences are matched with reliable models of template proteins.
Literature: previously published scholarly articles on research of proteins that include reliable template sequences for our specific model. Many mutations and active site construction has already been discovered in these articles.
BAPT/DBAT
Template proteins: 5KJV/5KJS
Organism: A. thaliana
Belongs to BAHD family of acyltransferases (Members can be identified by sequence homology & universally conserved HXXXD & DFGWG motifs)
General features
2 quasi-symmetric N-terminal (1-171 & 374-394) & C-terminal (223-373 & 395-431) domains, connected by a long loop (172-222)
- Each domain has beta sheet core flanked by alpha helices with similar spatial arrangement
- The active site is at the interface of the domains
Structural features
Tau-nitrogen of His153, H-bond with shikimate 5-O of rho-coumaroylshikimate
- A base that deprotonates 5-hydroxyl of the acyl acceptor substrate
- An oxyanion hole that stabilizes negative charge on the tetrahedral intermediate as a leaving group to produce ester product rho-coumaroylshikimate
Arg 356, salt-bridge with carboxyl of shikimate moiety
Distinctive active site conformational states in HCTs
Apo & Holo AtHCT structures have active site conformational changes
Catalytic His153 (switchlike conformational shift)
- Imidazole side chain stabilized in a 180o rotation relative to apo conformation
- In rho-coumaryol-CoA-bound AtHCT, His153 adopts side chain rotation ~90o to apo state
- In presence of rho-coumaroylshikimate, side chain stabilized inside active site
- Likely mediated by electrostatic attraction between positively charged guanidinium side chain of arginine side chain & negatively charged carboxyl group of shikimate
- Shift inward upon binding of various ligands, causing active site to shrink
Universally Conserved Residues
An arginine handle dictates acyl acceptor specificity in HCT
- Universal conservation of Arg356 & surrounding residues (354-362)
More roles of Arg 356
Positively charged Arg356 side chain & negatively charged shikimate electrostatic attraction facilitates binding of acyl acceptor substrate to active site
Orients substrate’s functional groups properly relative to enzyme’s catalytic machinery
Contributes to catalytic mechanism
- Salt bridge formation between arginine side chain and shikimate carboxyl confers acyl acceptor substrate binding affinity
- Arg356 handle orients shikimate in a catalytically productive conformation in HCT’s active site, increase fraction of productive encounters
- Mutating Arg356 resulted in a total loss of native enzymatic activity, mutating Arg356 to negatively charged residues increased specificity toward certain positive non-native substrates
badA
Template protein: 4RLQ
Organism: Rhodopseudomonas palustris (bacteria)
Class: PFAM00501 (ATP-dependent adenylation enzymes)
Plants and bacteria employ aroyl CoA thioesters for biosynthesis of specialized metabolites
Mechanism: ATP-dependent CoA ligases use 2 half-reactions to catalyze thioesterification
General Features
BadA has C-terminal/N-terminal domains joined by a flexible hinge
- badA always in conformation primed for thioesterification
- Nonconserved active site Lys427, present in badA active site in thiolation conformation, not required for adenylation but necessary for thioesterification
- N-terminal (1-434), C-terminal (435-522) domains fold analogously to the benzoate CoA ligase BCLM (2v7b)
- N-terminal contains benzoate binding site
- C-terminal domain contacts other edge of benzoate binding pocket, positions benzoate into active site through charged interaction between carboxylate of benzoate and Lys427
Features of active site
Largely hydrophobic
Para- carbon of BzO neighbors carbonyl of Leu332 along peptide backbone
2 meta- carbons of BzO point toward His333 & Ala227
Si- & re- faces of BzO positioned between backbone amide bonds, comprising Gly327, Ser328, Thr329 on 1 face, Tyr on other face
When Bz-AMP forms, Lys427 moves from interaction with carboxylate of substrate to several polar contacts with Bz-AMP & peptide backbone
Indicates Lys427 is needed for thiolation reaction
Rational mutation of the BadA active site
4 residues (Ala227, Leu332, His 333, Ile334) surround the phenyl ring of BzO in the active site
The residue positioned near a given carbon of BzO depends on whether BzO is in carboxylate-bound orientation or rotated Bz-AMP-bound orientation
Targeted mutations of Ala227Gly, Leu332Ala, His333Ala, Ile334Ala should show relative importance of 2 orientations
BadA structures & Homology
Enzymes in this family typically fold in larger N-terminal domain (400-550 residues) & a smaller C-terminal domain (~110 residues), active site lies in between
N-terminal domain contains residues that bind carboxylate substrate
C-terminal residues coordinate ribose & phosphate groups
Phe226 residue in badA offset by ~72o away from BzO, opening CoA binding channel
Catalytically Important Lysine Residues
Uses C-terminal lysine (Lys427/512 for BadA) to orient BzO substrate
Lys427 coordinates BzO in the active site & Lys512 is far from active site and solvent-exposed in C-domain
Lys427 of BadA makes 4 polar contacts with Bz-AMP, 1 with benzoyl oxygen, 3 with AMP moiety, and 2 to Gly303 & Gly430 (these contacts anchor Bz-AMP)
Predict Lys427Ala-badA mutant would show the second thioesterification of badA
- When badA assumes the adenylation conformation, Lys572 enters the active site