Membrane Protein Structure Determination: Methods and Protocols (Methods in Molecular Biology, Vol. 654) [1st Edition.] 1607617617, 9781607617617, 9781607617624

Membrane proteins, representing nearly 40% of all proteins, are key components of cells involved in many cellular proces

465 39 9MB

English Pages 473 Year 2010

Report DMCA / Copyright

DOWNLOAD PDF FILE

Table of contents :
Cover......Page 1
Methods in Molecular Biology, Vol. 654......Page 2
Membrane Protein StructureDetermination......Page 4
ISBN 9781607617617......Page 5
Preface......Page 6
Contents......Page 8
Contributors......Page 10
Part I: Membrane Protein Purification......Page 15
1. Introduction......Page 17
2. Materials......Page 19
3.1. Detergent Concentration Determination......Page 20
3.2. Detergent Removal Using Bio-Beads......Page 23
3.3. Proteoliposomes Formation......Page 25
3.4. Ligand Binding......Page 27
4. Notes......Page 30
References......Page 31
1. Introduction......Page 33
2.2. Buffer Compositions......Page 34
3.2. The 9-Histidine Tag: Mutagenesis, Plasmids, and Strains......Page 35
3.5. AAH6 Purification......Page 36
3.7. Protein Concentration......Page 37
3.9. Purification of bAncp......Page 38
3.10.1. Practical Aspects......Page 39
3.10.2. The Genetic Approach......Page 40
References......Page 41
1. Introduction......Page 43
2.1. Expression......Page 44
2.2. Purification......Page 45
2.3. Characterization......Page 46
3.2.2. Large-Scale Production in a 5-L Bioreactor......Page 47
3.2.3. Production of Labelled Protein for NMR Studies......Page 48
3.3.1. Solubilization of Inclusion Bodies......Page 49
3.3.3. Small-Scale Purification, Manual Chromatographic Procedure......Page 50
3.3.4. General Considerations on IMAC Purification......Page 52
3.4.1. Protein Composition and Oligomeric State......Page 54
3.4.2. Optimal Detergent Concentration......Page 55
4. Conclusions......Page 56
5. Notes......Page 57
Acknowledgment......Page 58
References......Page 59
1. Introduction......Page 61
2.2. Molecular Biology......Page 63
2.3. Biochemistry......Page 64
3.1.1.2. Obtaining Mutant Strains by Host Selection......Page 67
3.1.1.3. ABCG2 Overexpression Using a Selected Strain......Page 69
3.1.2.1. Insect Cell Transfection......Page 70
3.1.2.4. Baculovirus Amplification......Page 71
3.2.1. Inverted Membrane Vesicles from the Selected Bacteria Strain......Page 72
3.2.3.1. Fluorescent-Substrates Transport Measurements......Page 73
3.2.3.2. ATPase Activity Measurements......Page 74
3.3.3. Purification of ABCG2 Overexpressed in High-Five Cells......Page 76
3.4.1. Quantification of Secondary Structure......Page 78
3.4.2. Ligand-Induced Quenching of Tryptophan Intrinsic Fluorescence......Page 80
3.5.1. Differential Solubilization of the Two ABCG2 Forms by Various Detergents......Page 81
3.5.2. Conversion of Lower into Upper Form of ABCG2 During SDS–PAGE in Presence of 9 M Urea......Page 83
3.5.3. Effects of Translation and Transcription Inhibitors on ABCG2 Expression in High-Five cells......Page 84
4. Notes......Page 85
Acknowledgments......Page 87
References......Page 88
Part II: X-Ray Crystallography......Page 91
1. Introduction......Page 93
2. Materials......Page 94
3.1. Crystallization by the Vapor Diffusion Method......Page 95
3.2. Practical Approach to lipidic cubic phase (LCP) Crystallization......Page 97
3.3. Example of Bacteriorhodopsin (BR)......Page 98
3.4.1. Crystallization in the Sponge Phase......Page 100
3.4.2. Crystallization from Vesicles......Page 102
3.4.3. Crystallization from Bicelles......Page 103
3.4.4. On the Way to a General Method......Page 104
3.5.2. Crystallography......Page 105
4. Notes......Page 106
References......Page 114
1. Introduction......Page 119
2.1. General Strategy......Page 120
2.2. 3D Crystallization......Page 121
2.4. Description of the Structure......Page 122
2.5. Oligomerization, Organization in the Membrane, 2D Crystals......Page 124
3.1. What Is Suggested by the Structure?......Page 125
3.2. Molecular Dynamics Simulations on AAC......Page 126
3.3. Comparison and Differences with Other Transporters......Page 127
4. The Mitochondrial Carrier Family......Page 128
5. Conclusion......Page 129
References......Page 130
1. Introduction......Page 133
2. Use of Structural Analogues to Stabilize Intermediary Transport Steps......Page 136
3. Phosphorylation of Ca2+-ATPase by ATP......Page 138
4. Membrane Binding of Ca2+......Page 143
5. The Trans-location Step ((Ca2)E1P to E2P Transition)......Page 144
6. The Mechanism of E2-P Dephosphorylation......Page 147
7. The Dephosphorylated E2 and E1 States......Page 148
8. Concluding Comments......Page 150
References......Page 151
1. Introduction......Page 155
2.2.1. Protein Labeling......Page 158
2.2.2. Sample Setup......Page 159
3.1. Reconstitution of Protein in LCP......Page 160
3.1.2. Mixing Lipid with Protein......Page 161
3.2. LCP-FRAP Crystallization Pre-screening Assay......Page 162
3.2.1. Protein Labeling......Page 163
3.2.3. FRAP Data Collection......Page 164
3.2.4. Data Analysis......Page 166
3.3. Crystallization in Lipidic Mesophases......Page 167
3.3.1. Assembling Glass Sandwich Plates......Page 168
3.3.3. Crystal Detection......Page 169
3.3.4. Crystal Optimization......Page 170
3.3.5.2. Harvesting......Page 172
3.4.1. Alignment of Invisible Crystals with the Minibeam......Page 173
3.4.2. Crystallographic Data Collection......Page 174
5. Notes......Page 175
Acknowledgements......Page 180
References......Page 181
Part III: Electron Microscopy......Page 184
1. Introduction......Page 185
2.2. Products......Page 190
3.1. Ultrastructural Cytology......Page 191
3.3. Negative Staining......Page 192
3.5. Gold Immunolabelling on Ultra-thin London Resin White Sections......Page 193
3.6. Gold Immunolabelling on Isolated Membranes......Page 194
3.8. Cryo-Electron Microscopy......Page 195
3.9. Image Processing......Page 196
4. Conclusions......Page 197
5. Notes......Page 198
References......Page 199
1. Introduction......Page 201
2.1. Analysis of Lipid......Page 202
2.4. Solid Carbon Films......Page 203
3. Methods......Page 204
3.1. Analysis of Lipid Species Copurifying with Membrane Proteins......Page 205
3.2. 2D Crystallization Using Biobeads......Page 206
3.3. 2D Crystallization by Detergent Removal......Page 207
3.4. Preparation of Solid Carbon Film for Electron Microscopy......Page 208
3.5. Preparation of Holey Carbon Films......Page 210
3.6. Preparation of Negatively Stained Samples......Page 211
3.7. Screening of 2D Crystallization Trials by Electron Microscopy......Page 212
4. Conclusion......Page 215
5. Notes......Page 216
References......Page 219
1. Introduction......Page 221
2. Materials......Page 223
3.2. Membrane Protein Reconstitution......Page 224
3.4. Tomogram Generation from Cryo-Tilt Series......Page 225
3.5. Single Particle Analysis......Page 228
4. Notes......Page 232
References......Page 233
1. Introduction......Page 235
2.2. Sample Preparation for Electron Microscopy......Page 236
3.1. Sample Preparation......Page 237
3.2. Images Acquisition......Page 238
3.3.1. Image Alignment......Page 240
3.3.2. Tomograms Reconstructions......Page 242
3.4. Conclusions......Page 245
4. Notes......Page 247
References......Page 249
1. Introduction......Page 251
2. Materials......Page 253
3.1. Flexible Fitting of Atomic Structures into an EM Map of an Isolated Protein......Page 254
3.2. Analysis of the Fitting Procedure as a Conformational Transition......Page 258
3.3. Analysis of the Fitted Structures......Page 261
3.4. Localization of Intramolecular Decavanadate......Page 264
3.5. Localization of Intermolecular Decavanadate After Tubular Reconstruction......Page 266
4. Conclusions......Page 267
5. Notes......Page 269
References......Page 270
Part IV: Nuclear Magnetic Resonance......Page 274
1. Introduction......Page 275
2.1. Over-expression......Page 277
2.2. Tag......Page 278
3.1.1.1. Choice of Detergent/Phopholipids......Page 279
3.1.1.2. Labelling......Page 280
3.1.2.2. Backbone Chemical Shifts Assignments......Page 281
3.1.2.3. Long Range Restraints......Page 283
3.1.2.4. Residual Dipolar Couplings (RDCs)......Page 284
3.1.2.6. Toward MP Dynamics and Mechanisms......Page 285
3.2. Solid-State NMR of Membrane Proteins......Page 286
3.2.1.1. Aligned Samples......Page 288
3.2.1.2. Resonance Assignments and Structure Determination......Page 289
3.2.2.1. Sample Preparation......Page 290
3.2.2.2. Resonance Assignments......Page 291
4. Conclusion and Perspectives......Page 292
References......Page 293
1. Introduction......Page 297
3.1. Secondary Structures of Fragments of Membrane Proteins Corresponding to Single Loops or to Single Transmembrane Helices......Page 300
3.1.1. Transmembrane Helical Fragments......Page 301
3.1.2. Fragments Corresponding to Loops of Membrane Proteins......Page 302
3.1.3. General Features of Membrane Protein Structure Accessible from Studies of Membrane Protein Fragments......Page 305
3.2. Two-Helix Fragments......Page 306
3.3. Full Scan of Secondary Structure Using Membrane Protein Fragments......Page 307
3.4. Three-Dimensional Structures from Protein Fragments......Page 309
3.5. NMR Studies on Intact Membrane Proteins......Page 311
References......Page 312
1. Introduction......Page 317
2.1.3. Purification of PLN......Page 319
2.4.1. Solution-State Samples for Titration......Page 320
3.1.2. PLN Expression......Page 321
3.1.3. PLN Purification......Page 322
3.3.2. ATPase Activity Measurements......Page 323
3.3.3. 31P NMR Activity Assays......Page 324
3.4.2. Kd Measurements by NMR......Page 325
3.4.3. Preparation of Oriented Samples and Solid-State NMR Spectroscopy......Page 326
3.4.4. Structure Determination of Monomeric PLN Using a Hybrid Solution and Solid-State NMR Method......Page 327
3.5.1. Allosteric Model of SERCA Regulation by PLN......Page 328
3.5.2. Rational Design of PLN Mutants as Possible Candidates for Gene Therapy......Page 329
4. Notes......Page 330
References......Page 331
1. Introduction......Page 335
2.1. Cells Culture and Isotope Labelling......Page 336
2.3. Detergent Screening and NMR Sample Preparation......Page 337
3.1.1. Uniform (2H,13C,15N) Labelling......Page 338
3.1.3. Fractional 13C (10%) labelling......Page 339
3.2. In Vitro Refolding from E. coli Inclusion Bodies......Page 340
3.3. Detergent Screening and NMR Sample Preparation......Page 341
3.4. Solution-State NMR Experiments for Resonance Assignments and Structure Determination......Page 342
3.4.2. Measurement of Backbone HN–HN NOEs Using a 4D 15N,15N Separated NOESY-TROSY Experiment......Page 343
3.4.3. Selective Methyl Protonation Approach to Structure Determination......Page 346
3.5. Structure Calculation......Page 349
4. Notes......Page 351
References......Page 352
1. Introduction......Page 355
2.1. Samples, Preparation Tools and Measuring Cells......Page 356
2.5. NMR Instrumentation for HR-MAS 1H-NMR......Page 357
3. Methods......Page 358
3.1. Determination of Lipid Phase Nature (Lamellar (Liposomes), Oriented-Bicelle, Isotropic) Using Wide Line 31P-NMR......Page 361
3.3. Determination of Lipid Dynamics in Non-oriented Multilamellar Bilayers (Gel, Fluid, Liquid-Ordered) Using Wide Line 2H......Page 364
3.4. Determination of Lipid Molecular Structure in a Membrane Environment Using 1H HR-MAS (Fig. 3)......Page 366
3.5. Determination of Lipid Internal Dynamics in Oriented Bilayers Using 2H-NMR Spin-Lattice Relaxation Time Measurements......Page 369
4. Notes......Page 371
References......Page 372
Part V: Molecular Modelling......Page 375
1. Introduction......Page 377
3.1.1. With Available Known Homologues......Page 379
3.1.2. Without Known Homologues......Page 382
3.2.1.2. Helices Conformation and Relative Positioning......Page 384
3.2.2.1. Molecular Dynamics Simulations and Related Approaches......Page 386
3.2.2.2. Normal Mode Analysis......Page 388
3.3. Conclusion......Page 389
4. Notes......Page 390
References......Page 396
1. Introduction......Page 401
3. Methods......Page 402
3.1. Sequence Analyses: Multiple Sequence Alignment and Bibliographic Researches......Page 405
3.2. Of Helices and Transmembrane Segments......Page 407
3.3. Comparative Modeling and Other Approaches......Page 408
3.5. Helix–Helix Interactions......Page 411
3.7. Of Loops and Mutants......Page 412
4. Notes......Page 413
Acknowledgments......Page 414
References......Page 415
1. Introduction......Page 417
2.1. Computers......Page 418
2.3. Software......Page 419
2.6. Simulation Parameters......Page 420
3.1. General Considerations on the Comparison of MD to Experiments......Page 422
3.2. Comparison of MD to Biochemical Data......Page 423
3.3. Comparison of MD to Biophysical Data......Page 425
3.4. One Guided Example: Comparison of MD Simulations to 2H NMR Data on a Transmembrane Peptide......Page 427
4. Notes......Page 431
References......Page 433
1.1. Membranes and Membrane Proteins......Page 437
2.1. Molecular Modeling......Page 438
2.2. Quantum Mechanics......Page 439
2.4. Limitations of Computer Simulations......Page 441
2.5. Applications......Page 442
2.5.2.1. G Protein-Coupled Receptors......Page 443
25.2.2. Transport Across the Membrane: Transporters and Channels......Page 444
2.5.2.4. Channels......Page 445
2.5.2.5. Ion Channels......Page 446
2.5.3.2. Protein Aggregation and Protein-Membrane Self-Assembly......Page 447
2.5.4. Beyond the Microsecond......Page 449
Acknowledgments......Page 450
References......Page 451
1. Introduction......Page 455
3.1. Mapping of CCK1R Binding Site: Synergy Between Laboratory and In Silico Experiments......Page 457
3.2. Local GPCR Drug Design......Page 462
3.3. Global GPCRs Drug Design......Page 464
References......Page 466
Index......Page 469
Recommend Papers

Membrane Protein Structure Determination: Methods and Protocols (Methods in Molecular Biology, Vol. 654) [1st Edition.]
 1607617617, 9781607617617, 9781607617624

  • 0 0 0
  • Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up
File loading please wait...
Citation preview

METHODS

IN

MOLECULAR BIOLOGY™

Series Editor John M. Walker School of Life Sciences University of Hertfordshire Hatfield, Hertfordshire, AL10 9AB, UK

For other titles published in this series, go to www.springer.com/series/7651

as

Membrane Protein Structure Determination Methods and Protocols

Edited by

Jean-Jacques Lacapère INSERM U773/CRB3, Université Paris Diderot – Paris 7, Paris, France

Editor Jean-Jacques Lacapère INSERM U773/CRB3 Université Paris Diderot – Paris 7 Paris France [email protected]

ISSN 1064-3745 e-ISSN 1940-6029 ISBN 978-1-60761-761-7 e-ISBN 978-1-60761-762-4 DOI 10.1007/978-1-60761-762-4 Springer New York Dordrecht Heidelberg London Library of Congress Control Number: 2010930685 © Springer Science+Business Media, LLC 2010 All rights reserved. This work may not be translated or copied in whole or in part without the written permission of the publisher (Humana Press, c/o Springer Science+Business Media, LLC, 233 Spring Street, New York, NY 10013, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden. The use in this publication of trade names, trademarks, service marks, and similar terms, even if they are not identified as such, is not to be taken as an expression of opinion as to whether or not they are subject to proprietary rights. While the advice and information in this book are believed to be true and accurate at the date of going to press, neither the authors nor the editors nor the publisher can accept any legal responsibility for any errors or omissions that may be made. The publisher makes no warranty, express or implied, with respect to the material contained herein. Cover illustration: Membrane protein determination starts from extraction-purification and reaches atomic structure by crystal formation and X-ray diffraction or electron microscopy analysis, or nuclear magnetic resonance studies combined or not with molecular modelling. Printed on acid-free paper Humana Press is part of Springer Science+Business Media (www.springer.com)

Preface Membrane proteins represent almost 40% of all proteins, but only a small number of their structures have been determined. Alone or associated with other proteins, membrane proteins play several roles in the cells. They are involved in signal transduction, ion exchanges, transport of metabolites, molecules or proteins. Cellular communications are controlled or regulated by membrane proteins. Indeed, they are involved in communications between cells, outside/inside cell exchanges, cytosolic traffic among different organelles as well as cytosol/organelles exchanges. Only a few functional classes of membrane proteins have been structurally characterized and mostly are transporters working alone. Membrane proteins are difficult to study mostly because they are often poorly abundant and thus difficult to purify in amounts compatible with structural studies. Heterologous overexpression of recombinant membrane protein is a strategy that has permitted the study of several membrane protein structures at an atomic level. However, membrane proteins are located in an hydrophobic environment such as the cellular bilayers, and their functions often involve hydrophilic contacts with lipids, resulting in the paradox that membrane proteins need lipids to work but they also need detergent addition to be purified. When proteins are associated in complexes in a functional way, their stabilization is often difficult in purification protocols and requires numerous trial and error steps. Determination of a structure is a crucial step but never solves the functional question. Indeed, activation or inactivation of membrane proteins involves numerous factors such as ligand binding, phosphorylation of specific residues, and posttranslational modifications. From a pharmacological point of view, ligands induce or block a functional response that may involve either a single protein or a cascade of several proteins mixing membranous and soluble ones. This assembly of proteins can form stable or dynamic interacting complexes. In the cellular environment, these complexes are probably quite easy to form if one considers on one hand the protein concentrations inside the cytosol or the membranes and on the other hand the relative proximity of organelles in these cells. A fundamental aim of structural biology is to move from understanding structure and dynamics to controlling molecular function. This book describes major techniques used in the field of membrane protein structure determination. It is divided into five sections describing different techniques used to solve atomic structure either from purified membrane proteins or in silico. It also describes techniques that permit the capture of atomic scale pictures of membrane proteins in their lipid and protein environment to make “movies” from different instant pictures that will describe membrane protein functioning. It presents techniques scaling up from atomic to molecular that will render protein complexes in membrane of organelles and cells. The first section presents various strategies to purify membrane proteins since getting pure and homogenous material is a significant hurdle. Chapter 1 describes some techniques to characterize membrane protein preparations such as detergent content. Chapter 2 is devoted to the specific case of the adenosine nucleotide transporter (ANT), which a natural abundance in the inner membrane of mitochondria has permitted its three-dimensional structure determination, whereas structure–function relationships have been studied using

v

vi

Preface

mutants over expressed in yeast. Chapter 3 focuses on the importance of overexpression when membrane proteins are not naturally abundant. The specific case of a bacterial expression system used for a small mitochondrial membrane protein, the translocase TSPO is presented. Chapter 4 highlights the difficulties encountered for large membrane proteins overexpression; it describes a different expression system used for the specific case of an ABC transporter. The second section presents various strategies to get three-dimensional crystals and solve the structure by X-ray diffraction. Chapter 5 describes the various steps of membrane protein crystallography in two different approaches that are vapor diffusion and lipidic phases. Chapter 6 discusses the gain for a membrane protein family to solve the atomic structure of one of its members. Chapter 7 analyzes what can be learned about the function of a single protein from its various atomic structures through the example of the sarcoplasmic calcium pump (SERCA-ATPase). Chapter 8 presents recent progress in the study of a membrane protein with high potential as a pharmaceutical target, the G protein-coupled receptor (GPCR) family. The third section presents the various possibilities to gain structural information for a membrane protein using electron microscopy observations. Chapter 9 uses the insect aquaporin AQPcic to go from its characterization in situ to its homotetrameric structure of purified protein reconstituted in membrane. Chapter 10 describes two-dimensional crystal formation and basic electron microscopy image analysis of membrane proteins. Chapter 11 presents a specific combination of cryo-electron tomography and single particle analysis of membrane protein embedded in stacked lipid bilayers. Chapter 12 describes, step-by-step, the process of electron tomography of mitochondria containing numerous membrane proteins. Chapter 13 is devoted to molecular modeling processes that permit to reach atomic structure of membrane protein conformation, combining its electron microscope derived map and atomic structure from a different conformation. The fourth section presents recent advances in nuclear magnetic resonance (NMR) to study membrane proteins and lipids. Chapter 14 goes through the various strategies that are available to solve atomic structure or protein–protein and protein–ligand interactions using different NMR approaches. Chapter 15 is devoted to the analysis of what can be learned from the structure of membrane protein fragments in regard to the overall protein. Chapter 16 used the peculiar example of the phospholamban to show by NMR analysis the structural dynamic of regulation of a membrane protein by a smaller interacting membrane protein. Chapter 17 describes step-by-step detergent solubilized membrane protein structure determination by solution-state NMR. Chapter 18 presents how solid-state NMR is a powerful tool to study lipid structure and dynamics in a membrane environment. The fifth section presents molecular modeling strategies that can be used either to get membrane protein structures or to move from atomic structure to dynamic understanding of a molecular functioning mechanism. Chapter 19 goes through the various possibilities to build and to analyze membrane protein models. Chapter 20 describes step by step how to build a three-dimensional model of a membrane protein. Chapter 21 presents molecular dynamics of membrane peptides and proteins in their lipid environment. Chapter 22 further describes membrane protein dynamics, presenting increasing time scale ranging from femtoseconds to seconds. Chapter 23 shows synergy between experimental data and computational modeling to delineate the ligand binding pocket of a GPCR, a step toward a rational for drug design. Paris, France

Jean-Jacques Lacapère

Contents Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Contributors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

PART I

MEMBRANE PROTEIN PURIFICATION

1 Characterization of Membrane Protein Preparations: Measurement of Detergent Content and Ligand Binding After Proteoliposomes Reconstitution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Mariano A. Ostuni, Soria Iatmanen, David Teboul, Jean-Claude Robert, and Jean-Jacques Lacapère 2 Native Membrane Proteins vs. Yeast Recombinant: An Example: The Mitochondrial ADP/ATP Carrier . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bertrand Arnou, Cécile Dahout-Gonzalez, Ludovic Pelosi, Guy J.-M. Lauquin, Gérard Brandolin, and Véronique Trézéguet 3 Bacterial Overexpressed Membrane Proteins: An Example: The TSPO . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Jean-Claude Robert and Jean-Jacques Lacapère 4 Insect Cell Versus Bacterial Overexpressed Membrane Proteins: An Example, the Human ABCG2 Transporter . . . . . . . . . . . . . . . . . . . . . . . . . . . Alexandre Pozza, José M. Pérez-Victoria, and Attilio Di Pietro

PART II

v ix

3

19

29

47

X-RAY CRYSTALLOGRAPHY

5 Crystallography of Membrane Proteins: From Crystallization to Structure . . . . . . 79 Aurélien Deniaud, Ekaterina Moiseeva, Valentin Gordeliy, and Eva Pebay-Peyroula 6 Structural Approaches of the Mitochondrial Carrier Family . . . . . . . . . . . . . . . . . . 105 Hugues Nury, Iulia Blesneac, Stephanie Ravaud, and Eva Pebay-Peyroula 7 What Can Be Learned About the Function of a Single Protein from Its Various X-Ray Structures: The Example of the Sarcoplasmic Calcium Pump . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119 Jesper Vuust Møller, Claus Olesen, Anne-Marie Lund Winther, and Poul Nissen 8 Recent Progress in the Structure Determination of GPCRs, a Membrane Protein Family with High Potential as Pharmaceutical Targets . . . . . 141 Vadim Cherezov, Enrique Abola, and Raymond C. Stevens

PART III

ELECTRON MICROSCOPY

9 Observation of Membrane Proteins In Situ: AQPcic, the Insect Aquaporin Example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171 Daniel Thomas and Annie Cavalier

vii

viii

Contents

10 Two-Dimensional Crystallization of Integral Membrane Proteins for Electron Crystallography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . David L. Stokes, William J. Rice, Minghui Hu, Changki Kim, and Iban Ubarretxena-Belandia 11 Structure Determination of Membrane Protein by Both Cryo-Electron Tomography and Single Particle Analysis . . . . . . . . . . . . . . . . . . . . Sylvain Trépout, Jean-Christophe Taveau, and Olivier Lambert 12 Electron Microscope Tomography of Native Membranes . . . . . . . . . . . . . . . . . . . Gabriel Péranzi, Cedric Messaoudi, Leeyah Issop, and Jean-Jacques Lacapère 13 From Electron Microscopy Maps to Atomic Structures Using Normal Mode-Based Fitting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Konrad Hinsen, Edward Beaumont, Bertrand Fournier, and Jean-Jacques Lacapère

PART IV

207 221

237

NUCLEAR MAGNETIC RESONANCE

14 Determination of Membrane Protein Structures Using Solution and Solid-State NMR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Pierre Montaville and Nadège Jamin 15 Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure of Membrane Proteins . . . . . . . . . . . . . . . . . . . . . . . . . . . . Philip L. Yeagle and Arlene D. Albert 16 What Can We Learn from a Small Regulatory Membrane Protein? . . . . . . . . . . . . Gianluigi Veglia, Kim N. Ha, Lei Shi, Raffaello Verardi, and Nathaniel J. Traaseth 17 Solution-State NMR Spectroscopy of Membrane Proteins in Detergent Micelles: Structure of the Klebsiella pneumoniae Outer Membrane Protein A, KpOmpA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Marie Renault, Olivier Saurel, Pascal Demange, Valérie Reat, and Alain Milon 18 NMR Spectroscopy of Lipid Bilayers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Axelle Grélard, Cécile Loudet, Anna Diller, and Erick J. Dufourc

PART V

187

261

283 303

321

341

MOLECULAR MODELLING

19 Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Catherine Etchebest and Gaelle Debret 20 3D Structural Models of Transmembrane Proteins . . . . . . . . . . . . . . . . . . . . . . . . Alexandre G. de Brevern 21 Molecular Dynamics of Membrane Peptides and Proteins: Principles and Comparison to Experimental Data . . . . . . . . . . . . . . . . . . . . . . . . . Patrick F.J. Fuchs 22 Membrane Protein Dynamics from Femtoseconds to Seconds . . . . . . . . . . . . . . . . Christian Kandt and Luca Monticelli 23 The Family of G Protein-Coupled Receptors: An Example of Membrane Proteins . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Irina G. Tikhonova and Daniel Fourmy

363 387

403 423

441

Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 455

Contributors ENRIQUE ABOLA • Department of Molecular Biology, The Scripps Research Institute, La Jolla CA, USA ARLENE D. ALBERT • Department of Molecular & Cell Biology, University of Connecticut, Storrs, CT, USA BERTRAND ARNOU • Laboratoire de Physiologie Moléculaire et Cellulaire, Institut de Biochimie et Génétique Cellulaires-UMR 5095, CNRS-Université Bordeaux 2, Bordeaux, France; CEA, iBiTecS (Institut de Biologie et Technologies de Saclay), CNRS, URA 2096, Gif-sur-Yvette, France EDWARD BEAUMONT • INSERM U773, Centre de Recherche Biomédicale Bichat-Beaujon (CRB3), Faculté de Médecine X, Bichat, Université Paris 7, Paris, France IULIA BLESNEAC • Institut de Biologie Structurale, CEA-CNRS and Université Joseph Fourier, Grenoble, France GÉRARD BRANDOLIN • Laboratoire de Biochimie et Biophysique des Systèmes Intégrés (BBSI), Institut de Recherches en Technologies et Sciences du Vivant (iRTSV), UMR 5092 CNRS-CEA-Université Joseph Fourier, Grenoble, France ALEXANDRE G. DE BREVERN • INSERM UMR-S 665, Dynamique des Structures et Interactions des Macromolécules Biologiques (DSIMB), Institut National de Transfusion Sanguine (INTS), Université Paris Diderot - Paris 7, Paris, France ANNIE CAVALIER • CNRS, Interactions Cellulaires et Moléculaires, UMR 6026, Université de Rennes 1, Rennes, France VADIM CHEREZOV • Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA CÉCILE DAHOUT-GONZALEZ • Laboratoire de Biochimie et Biophysique des Systèmes Intégrés (BBSI), Institut de Recherches en Technologies et Sciences du Vivant (iRTSV), UMR 5092 CNRS-CEA-Université Joseph Fourier, Grenoble, France GAELLE DEBRET • Service d’Ingénierie MOléculaire des PROtéines (SIMOPRO), IbiTec-S, DSV, CEA, CE-Saclay, Gif-sur-Yvette, France PASCAL DEMANGE • UPS, Institut de Pharmacologie et de Biologie Structurale, Université de Toulouse, CNRS, UMR 5089, Toulouse, France AURÉLIEN DENIAUD • European Molecular Biology Laboratory, Grenoble Outstation, B.P. 181, Grenoble, France ATTILIO DI PIETRO • Institut de Biologie et Chimie des Protéines, UMR5086 CNRS-Université Lyon 1 et IFR128 BioSciences Gerland, Lyon, France ANNA DILLER • Chimie et Biologie des Membranes et des Nanoobjets (CBMN), UMR5248, CNRS - Université Bordeaux - ENITAB, IECB, Pessac, France ERICK J. DUFOURC • Chimie et Biologie des Membranes et des Nanoobjets (CBMN), UMR5248, CNRS - Université Bordeaux - ENITAB, IECB, Pessac, France

ix

x

Contributors

CATHERINE ETCHEBEST • INSERM UMR-S 665, Equipe Dynamique des Structures et des Interactions des Macromolécules Biologiques (DSIMB), Institut National de Transfusion Sanguine (INTS), Université Paris Diderot - Paris 7, Paris, France DANIEL FOURMY • INSERM, Institut National de la Santé et de la Recherche Médicale, Université de Toulouse 3, Toulouse, France BERTRAND FOURNIER • Laboratoire Léon Brillouin (CEA-CNRS), CEA Saclay, Gif-sur-Yvette, France PATRICK F.J. FUCHS • Equipe de Bioinformatique Génomique et Moléculaire, INSERM UMR-S726, Institut National de Transfusion Sanguine, Université Paris Diderot – Paris 7, Paris, France VALENTIN GORDELIY • Institut de Biologie Structurale, CEA-CNRS and Université Joseph Fourier, Grenoble, France AXELLE GRÉLARD • Chimie et Biologie des Membranes et des Nanoobjets (CBMN), UMR5248, CNRS - Université Bordeaux - ENITAB, IECB, Pessac, France KIM N. HA • Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN, USA KONRAD HINSEN • Centre de Biophysique Moléculaire (CNRS), Orléans, France; Division Expériences, Synchrotron SOLEIL, Saint Aubin, Gif-sur-Yvette, France; Laboratoire Léon Brillouin (CEA-CNRS), CEA Saclay, Gif-sur-Yvette, France MINGHUI HU • New York Structural Biology Center, New York, NY, USA SORIA IATMANEN • INSERM U773, Centre de Recherche Biomédicale Bichat-Beaujon (CRB3), Faculté de Médecine X, Bichat, Université Paris 7, Paris, France LEEYAH ISSOP • INSERM U773, Centre de Recherche Biomédicale Bichat-Beaujon (CRB3), Faculté de Médecine X, Bichat, Université Paris 7, Paris, France NADÈGE JAMIN • CEA, iBiTecs, URA 2096, SB2SM, Gif-sur-Yvette, France CHRISTIAN KANDT • Computational Structural Biology, Chair of Life Science Informatics B-IT, Life & Medical Sciences (LIMES) Center, Rheinische Friedrich-Wilhelms-University Bonn, Bonn, Germany CHANGKI KIM • New York Structural Biology Center, New York, NY, USA JEAN-JACQUES LACAPÈRE • INSERM U773, Centre de Recherche Biomédicale Bichat-Beaujon (CRB3), Faculté de Médecine X, Bichat, Université Paris 7, Paris, France OLIVIER LAMBERT • CBMN UMR-CNRS 5248, University of Bordeaux, Talence, France GUY J.-M. LAUQUIN • Laboratoire de Physiologie Moléculaire et Cellulaire, Institut de Biochimie et Génétique Cellulaires-UMR 5095, CNRS-Université Bordeaux 2, Bordeaux, France CÉCILE LOUDET • IECB, UMS 3033, CNRS - Université Bordeaux - ENITAB, IECB, Pessac, France CEDRIC MESSAOUDI • INSERM U759, Imagerie intégrative, Orsay, France; Laboratoire Raymond Latarjet, Centre Universitaire d’Orsay, Institut Curie, Centre de Recherche, Orsay, France ALAIN MILON • UPS, Institut de Pharmacologie et de Biologie Structurale, Université de Toulouse, CNRS, UMR 5089, Toulouse, France EKATERINA MOISEEVA • Institut de Biologie Structurale, CEA-CNRS and Université Joseph Fourier, Grenoble, France

Contributors

xi

JESPER VUUST MØLLER • Centre for Membrane Pumps in Cells and Disease – PUMPKIN, Danish National Research Foundation, Copenhagen, Denmark; Department of Physiology and Biophysics, University of Aarhus, Aarhus, Denmark PIERRE MONTAVILLE • CEA, iBiTecs, URA 2096, SB2SM, Gif-sur-Yvette, France LUCA MONTICELLI • UMR-S665, DSIMB, INSERM, 6, rue Alexandre Cabanel, 75015, Paris, France POUL NISSEN • Centre for Membrane Pumps in Cells and Disease – PUMPKIN, Danish National Research Foundation, Copenhagen, Denmark; Department of Molecular Biology, University of Aarhus, Aarhus, Denmark HUGUES NURY • Institut Pasteur, Unit if Structural Dynamics of Macromolecules, CNRS, URA 2185, Paris, France CLAUS OLESEN • Centre for Membrane Pumps in Cells and Disease – PUMPKIN, Danish National Research Foundation, Copenhagen, Denmark; Department of Physiology and Biophysics, University of Aarhus, Aarhus, Denmark MARIANO A. OSTUNI • INSERM U773, Centre de Recherche Biomédicale Bichat-Beaujon (CRB3), Faculté de Médecine X, Bichat, Université Paris 7, Paris, France EVA PEBAY-PEYROULA • Institut de Biologie Structurale, CEA-CNRS and Université Joseph Fourier, Grenoble, France LUDOVIC PELOSI • Laboratoire de Biochimie et Biophysique des Systèmes Intégrés (BBSI), Institut de Recherches en Technologies et Sciences du Vivant (iRTSV), UMR 5092 CNRS-CEA-Université Joseph Fourier, Grenoble, France GABRIEL PÉRANZI • INSERM U773, Centre de Recherche Biomédicale Bichat-Beaujon (CRB3), Faculté de Médecine X, Bichat, Université Paris 7, Paris, France JOSÉ M. PÉREZ VICTORIA • Instituto de Parasitología y Biomedicina “López-Neyra”, CSIC Parque Tecnológico de Ciencias de la Salud, Armilla, Granada, Spain ALEXANDRE POZZA • Institut de Biologie et Chimie des Protéines, UMR5086 CNRS-Université Lyon 1 et IFR128 BioSciences Gerland, Lyon, France STEPHANIE RAVAUD • Institut de Biologie Structurale, CEA-CNRS and Université Joseph Fourier, Grenoble, France VALÉRIE REAT • UPS, Institut de Pharmacologie et de Biologie Structurale, Université de Toulouse, CNRS, UMR 5089, Toulouse, France MARIE RENAULT • UPS, Institut de Pharmacologie et de Biologie Structurale, Université de Toulouse, CNRS, UMR 5089, Toulouse, France WILLIAM J. RICE • New York Structural Biology Center, New York, NY, USA JEAN-CLAUDE ROBERT • INSERM U773, Centre de Recherche Biomédicale Bichat-Beaujon (CRB3), Faculté de Médecine X, Bichat, Université Paris 7, Paris, France OLIVIER SAUREL • UPS, Institut de Pharmacologie et de Biologie Structurale, Université de Toulouse, CNRS, UMR 5089, Toulouse, France LEI SHI • Department of Chemistry, University of Minnesota, Minneapolis, MN, USA RAYMOND C. STEVENS • Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA DAVID L. STOKES • Skirball Institute of Biomolecular Medicine, New York University

xii

Contributors

School of Medicine, New York, NY, USA; New York Structural Biology Center, New York, NY, USA JEAN-CHRISTOPHE TAVEAU • CBMN UMR-CNRS 5248, University of Bordeaux, Talence, France DAVID TEBOUL • INSERM U773, Centre de Recherche Biomédicale Bichat-Beaujon (CRB3), Faculté de Médecine X, Bichat, Université Paris 7, Paris, France DANIEL THOMAS • CNRS, Interactions Cellulaires et Moléculaires, UMR 6026, Université de Rennes 1, Rennes, France IRINA G. TIKHONOVA • INSERM, Institut National de la Santé et de la Recherche Médicale, Université de Toulouse 3, Toulouse, France NATHANIEL J. TRAASETH • Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN, USA SYLVAIN TRÉPOUT • CBMN UMR-CNRS 5248, |University of Bordeaux, Talence, France VÉRONIQUE TRÉZÉGUET • Laboratoire de Physiologie Moléculaire et Cellulaire, Institut de Biochimie et Génétique Cellulaires-UMR 5095, CNRS-Université Bordeaux 2, Bordeaux, France IBAN UBARRETXENA • Department of Structural and Chemical Biology, Mt. Sinai School of Medicine, New York, NY, USA GIANLUIGI VEGLIA • Department of Chemistry and Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN, USA RAFFAELLO VERARDI • Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN, USA ANNE-MARIE LUND WINTHER • Centre for Membrane Pumps in Cells and Disease – PUMPKIN, Danish National Research Foundation, Copenhagen, Denmark; Department of Molecular Biology, University of Aarhus, Aarhus, Denmark PHILIP L. YEAGLE • Office of the Dean of Arts & Sciences, Rutgers University, Newark, NJ, USA

as

Part I Membrane Protein Purification

as

Chapter 1 Characterization of Membrane Protein Preparations: Measurement of Detergent Content and Ligand Binding After Proteoliposomes Reconstitution Mariano A. Ostuni, Soria Iatmanen, David Teboul, Jean-Claude Robert, and Jean-Jacques Lacapère Abstract The study of membrane proteins is a difficult task due to their natural embedding in hydrophobic environment made by lipids. Solubilization and purification from native membranes or overexpressed system involves the use of detergent to make them soluble while maintaining their structural and functional properties. The choice of detergent is governed not only by their ability to reach these goals, but also by their compatibility with biochemical and structural studies. A different detergent can be used during purification, and characterization of the detergent amounts present in each purification step is crucial. To address this point, we developed a colorimetric method to measure detergent content in different preparations. We analyzed detergent present in the collected fractions from the purification of the recombinant membrane translocator protein (RecTSPO). We followed detergent removal during the reconstitution of RecTSPO in liposomes and observed by electron microscopy the formation of proteoliposomes. We addressed the RecTSPO functionality by testing its ability to bind high affinity drug ligand [3H]PK 11195. We described the different parameters that should be controlled in order to optimize the measurement of this ligand binding using a filtration procedure. These protocols are useful to characterize functionality and detergent content of membrane protein, both key factors for further structural studies. Key words: Detergent, DPC, Mitochondrial membrane protein, Mitochondrial TSPO, PBR, Purification, Reconstitution, SDS

1. Introduction The quantity and distribution of naturally expressed membrane protein is a limiting step for functional and structural studies, which usually require important amounts of highly purified and concentrated protein (1). Along the past decades, various Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_1, © Springer Science+Business Media, LLC 2010

3

4

Ostuni et al.

strategies have been developed to either purify or overexpress membrane proteins. These strategies include the isolation and enrichment of naturally abundant proteins from their original tissues or the heterologous overexpression production systems and purification protocols (2). In this latter case, several vectors, constructions, and expression systems, including eukaryotes organism and cells, prokaryotes, and acellular systems (see further chapters), have been developed to optimize both expression and purification yields. Although there are exceptional cases of naturally abundant proteins, which can be highly enriched in a few steps (e.g., sarco– endoplasmic reticulum calcium ATPase (SERCA) from rabbit fast skeletal muscle or adenine nucleotide transporter (ANT) from heart liver mitochondria), membrane protein production protocols often include numerous purification steps mostly in the presence of a detergent (3). The choice of the detergent is complex since, on one hand, the membrane protein should be solubilized but remaining functional, (4) and on the other hand, the detergent should be compatible with biochemical and structural studies. For instance, noncharged detergents are a requisite for ion desorption experiments (MALDI-TOF), and deuterated detergents are needed for nuclear magnetic resonance (NMR) studies. Furthermore, depending on structural studies performed, additional steps might be needed in order to eliminate, diminish, or replace the detergent used to purify the membrane protein. The other major component of a membrane protein preparation is the lipid environment. Lipids could be present from extraction, or added or even exchanged, depending on the purification process. Characterization of the various components present in the final membrane protein preparation is required to assure not only the reproductibleness of experiments, but also the knowledge of the structural and the functional states of the protein. Indeed, any additional step in purification protocols implies the possibility of introducing an unexpected structural modification that may lead to misfolding and/or loss of function. This characterization is important to avoid the loss of time and money used in the analysis of improper samples and it may include several measurements: (1) the assessment of protein purity on gels and the characterization of contaminant proteins as well as the polymeric state of the protein of interest; (2) the analysis of the secondary and tertiary folding not only by spectroscopic (circular dichroism (CD)) techniques, but also by other biophysical techniques (5); (3) the determination of the protein, lipid, and detergent ratios that can be gained from the individual measurement of each component; and (4) the functionality of the purified membrane protein. This can be determined in detergent/protein complexes and/or in reconstituted proteoliposomes.

Characterization of Membrane Protein Preparations: Measurement of Detergent Content

5

Protein concentration determination may be difficult due to the presence of detergent that perturbs classical colorimetric measurements. However, protein concentration could be easily addressed using a nondestructive method based on the absorbance (in the aromatic region at 280 nm), in the presence of the detergent, and the calculated extinction coefficient (6). Several protocols of lipid extraction, purification, and characterization have been described from thin layer chromatography (7) to more recent mass spectroscopy (8), and they usually enable good lipid content determination. Detergent content measurements are accurately determined when radioactive forms exist. Some other approaches, such as the use of specific electrodes (9) or thin layer chromatography (10), have been described over the last decades to measure detergent content. We present here methods to address this latter point in different specific cases such as the measurement of detergent present in the membrane protein preparation and the following of detergent removal during membrane protein reconstitution in liposomes. The formation of proteoliposomes is a simple method to assess the protein functionality when coupled to ligand binding for a receptor or transport for a translocator. We illustrate the use of these methods taking advantage of the specific case of recombinant TranSlocator PrOtein (RecTSPO). This membrane protein previously named peripheral-type benzodiazepine receptor (PBR) is a transmembrane protein mostly located in mitochondria (11), whose expression and purification will be discussed in another chapter.

2. Materials 1. Absorption measurements: DO and absorption spectra were measured using an UV-300 Unicam UV-visible spectrometer (ThermoFisher Scientific, Courtaboeuf, France). 2. Phosphate buffer saline (PBS): 50 mM H2NaPO4; 150 mM NaCl; pH: 7.6. 3. Bio-Rad Protein assay kit and Bio-Beads SM2 (25–50 mesh) were purchased by Bio-Rad (Marne la Coquette, France). 4. Lipid solution: A mixture of dimyristoylphosphatidylcholine (DMPC) and dimyristoyl phosphatidylethanolamine (DMPE) (Avanti Polar Lipids, Alabaster, AL, USA) at a ratio of 9 to 1 weight/weight were mixed in buffer solution and sonicated in a water bath FB 15049 (Fischer Scientific, Illkirch, France) to remove multilamellar vesicles. 5. Detergents used: Sodium dodecyl sulfate (SDS, Sigma, SaintQuentin Fallavier, France) and dodecylphosphocholine (DPC, COGER, Paris, France).

6

Ostuni et al.

6. Protein used: RecTSPO (mouse recombinant TSPO) was expressed in E. Coli Bl21 bacteria (Invitrogen, Paisley, UK), extracted from inclusion bodies with SDS, purified by affinity chromatography on superflow Ni-NTA resin (Qiagen SA, Courtaboeuf, France), and protein concentration was determined from absorption spectra using extinction coefficient of 3.88/(mg/ml)–1 cm–1. 7. Radioactivity: Labeled PK 11195 ([3H]PK 11195) was purchased by New England Nuclear, NEN Life Science Products (Boston, MA, USA). Whatman Filters GF/C and PK 11195 were purchased by Sigma-Aldrich (Saint Quentin Fallavier, France). BSC liquid scintillation cocktail (GE Healthcare Europe, Saclay, France) and Wallac 1409 liquid scintillation counter (Perkin Elmer, Les Ulis, France). 8. Electron microscopy: JEOL 1200EX Transmission Electron microscope equipped with LaB6 filament operated at 120 kV and a sample holder (JEOL EM-SQH10). Cupper grids (400 mesh, Delta Microscopies, Labege, France) were covered with carbon film and negatively stained with 2% uranyl acetate (Sigma-Aldrich, Saint Quentin Fallavier, France).

3. Methods One of the key components of membrane protein purification is detergent, since its presence makes water soluble a protein naturally embedded in lipid bilayers. During purification process from either naturally abundant membrane protein as well as from overexpression systems, amount of detergent is usually only characterized from the initial detergent concentration in the used solutions. However, centrifugation, affinity column purification or dialysis steps can differentially concentrate detergent and proteins. Thus, quantification of detergent content of membrane protein preparation before structural or functional analysis is crucial (see Chapter 3 by Robert and Lacapere in this book). 3.1. Detergent Concentration Determination

It has been described for a long time that the presence of detergent modifies or impairs the colorimetric dosage of proteins by Bradford-based protein assay kits (Bio-Rad). We take advantage of the detergent interference to directly measure the effect of detergent addition to this colorimetric dye. 1. Standard calibration of sodium dodecyl sulfate (SDS) and dodecylphosphocholine (DPC) in the absence of protein: the Bio-Rad dye reagent was prepared by fivefold dilution in distilled water. 1 ml of diluted reagent was distributed into test tubes, and 0–300 µg of SDS or DPC were added.

Characterization of Membrane Protein Preparations: Measurement of Detergent Content

7

Absorption spectra were recorded (from 250 to 800 nm) for each SDS or DPC containing solutions (Figs. 1a and 2a). The spectrophotometer saturation level was around OD value as great as 3 (see left side of the spectra). The spectra exhibit a shift of the peak, and the maximal change of absorbance induced by SDS and DPC was observed at 650 and 600 nm, respectively. Thus, OD for the various concentrations of SDS and DPC was measured at 650 and 600 nm for SDS and a

b 3

d 2

c 1

b

Optical Density

Optical Density

3

d 2

c b 1

m a

a 0 200 300 400 500

0 200 300 400 500

600 700 800

Wavelength, nm

d

2

SDS, mg/ mL

c

1,5

e

2,5

25 20 15 10 5 0

1 0,5 0 0

100 200 300

400 500 600

SDS, µg / ml

RecTSPO, mg/mL

Optical Density at 650 nm

3

600 700 800

Wavelength, nm

20 15 10 5 0 0

5

10

Elution volume, mL

Fig. 1. Effect of SDS on the light absorption of the Bradford-based Bio-Rad dye. Panel (a) shows the effect of the addition of different quantities of SDS (traces a, b, c, and d correspond to 0, 75, 125, and 200 µg/ml SDS, respectively) on the Bio-Rad dye absorption spectra. Several peaks could be observed, but largest differences in optical density due to SDS were obtained at 650 nm. Panel (b) shows the effect of methanol on the dye spectrum in the presence of different quantities of SDS (trace a, dye solution alone; trace m, in the presence of 3% methanol; traces b, c, d, and e, in the presence of methanol and 75, 125, and 200 µg/ml SDS, respectively). Panel (c) shows plotted optical density obtained at 650 nm without (filled circles) or with 3 and 7.5% added methanol (open triangles and diamonds, respectively). Solid and dashed lines depict the linear portion of the titration curves (100–300 and 200–500 µg/ml). Panels (d) and (e) show chromatograms of RecTSPO purification in the presence of SDS. Detergent contents in the eluted fractions (closed triangles in (d)) were measured by precipitating the protein and then mixing an aliquot with Bio-Rad dye to get detergent concentration in the supernatant. Protein contents in the eluted fractions (opened circles in (e)) were measured by recording absorption spectra of each fraction and calculating protein concentration using extinction coefficient of 3.88/(mg/ml)–1 cm–1 at 280 nm.

8

Ostuni et al.

a

b 3 Optical Density

Optical Density

3

d

2

c 1

b

2

d c

1

m/b

a 0 200 300 400 500 600 700 800

0 200 300 400 500 600 700 800

Wavelenght, nm

Wavelenght, nm

d

3

Optical Density at 450 nm

Optical Density at 600 nm

c

a

2,5 2 1,5 1 0,5 0

1,6 1,4 1,2 1 0,8 0,6

0

100

200

DPC, µg/ml

300

0

100

200

300

DPC, µg/ml

Fig. 2. Effect of DPC on the light absorption of the Bradford-based Bio-Rad dye. Panel (a) shows the effect of the addition of different quantities of DPC (traces a, b, c, and d correspond to 0, 75, 125, and 200 µg/ml DPC, respectively) on the Bio-Rad dye absorption spectra. Several peaks could be observed but largest differences in optical density were obtained at 600 nm. Panel (b) shows the effect of the presence of methanol on the addition of different quantities of DPC (trace a, dye solution alone; trace m, in the presence of methanol; traces b, c, d, and e, in the presence of methanol and 75, 125, and 200 µg/ml DPC, respectively). Panel (c) shows plotted optical density obtained at 600 nm without (filled circles) or with (open triangles) methanol. Solid and dotted lines depict the linear portion of the titration curve in the absence and in the presence of methanol, respectively. Panel (d) shows plotted OD obtained at 450 nm without (filled circles) or with (open triangles) methanol. Solid line depicts the linear portion of the titration curve (100–300 µg/ml).

DPC, respectively, and data were plotted versus standard concentrations (Figs. 1c and 2c). Both curves gained in the presence of SDS and DPC exhibit a linear interval permitting accurate dosages of detergent in the range of 50–200 and 100–250 µg/ml for SDS and DPC, respectively (see Note 1). 2. Assay for protein containing samples: Bio-Rad dye reagent is protein sensitive; thus, to avoid cross-reaction with detergent staining, protein can be precipitated in methanol. One volume of protein samples was added to three volumes of methanol in a 1.5-ml tube and strongly mixed. A white precipitate

Characterization of Membrane Protein Preparations: Measurement of Detergent Content

9

is formed and centrifuged for 5 min at 200 × g. The amounts of detergents present in the supernatant can be determined using standard curves performed in the presence of methanol, since small amount of methanol (3%, accordingly to final concentration in the spectrometer cell) has a small effect on Bio-Rad dye reagent absorption spectrum (Figs. 1b and 2b). Absorption spectra were recorded (from 250 to 800 nm) for each SDS and DPC containing solutions in the presence of 3% methanol. MeOH shifts the calibration curve of SDS and DPC when looking absorption at 600 nm (Figs. 1c and 2c), but the effect is smaller for SDS than for DPC. Indeed, a clear effect is observed for 7.5% MeOH on the calibration curve of SDS, whereas only 3% has a significant effect on DPC calibration curve (Figs. 1c and 2c). Methanol also induced a small decrease at 450 nm of absorption spectrum of Bio-Rad dye reagent. DPC calibration curves performed at this absorption wavelength (Fig. 2d) shows that methanol has no effect in the linear part of the curve, for 100–250 mg of DPC (see Note 1). 3. Application to membrane protein purification: RecTSPO is overexpressed in E. coli cells (see Chapter 3 by Robert and Lacapere in this book), extracted from inclusion bodies with SDS and purified by immobilized metal ion affinity chromatography in the presence of detergent. SDS-solubilized RecTSPO is loaded into Ni-NTA resin, column is washed and protein eluted with imidazole and SDS containing buffer. Collected fractions were analyzed for detergent and protein content and chromatogram drawn (Fig. 1d, e). An aliquot of each fraction was mixed with methanol (see above) and detergent content of the supernatant determined using calibration curves (see Note 2). An absorption spectrum of each fraction (diluted if needed) was recorded and protein contents determined using extinction coefficient of 3.88/(mg/ml)–1 cm–1 at 280 nm. 3.2. Detergent Removal Using Bio-Beads

Bio-Beads are macroporous polystyrene beads used in hydrophobic interaction chromatography (12). They have a high surface area for adsorbing organics of molecular weight less than 2,000 from aqueous solutions. The Bio-Beads are useful for the adsorption of nonpolar substances or surface active agents such as detergents. Kinetics of detergent removal by theses Bio-Beads are more or less specific for each detergent (13–15). The above described technique can be used to follow detergent removal from a solution by measuring detergent content of aliquots. 1. Bio-Beads preparation: Dry SM2 Bio-Beads were extensively washed before use, firstly with methanol (three times) to eliminate impurities and polystyrene dust and then with water (four times) to hydrate them.

10

Ostuni et al.

2. Kinetics of detergent removal: 10–15 mg detergents were added to 5 ml solution of PBS. Solution was gently stirred and kept at room temperature (25°C). At various time intervals, weighted amount of wet Bio-Beads were added (see Note 3). Aliquots of 40 µl were removed at several time points and their detergent content was determined by mixing in 1 ml containing Bio-Rad dye solution (1:5 dilution), which OD was measured at the detergent corresponding wavelength (Fig. 3a, b) (see Note 2). 3. Calculation of binding capacity of Bio-Beads: Capacity was defined as the total amount of detergent that was removed by 1 g of Bio-Beads. The amount of detergent removed by each

a

b 10 75 mg

14

Total DPC in solution, mg

Total SDS in solution, mg

16 25 mg

12 10 8 6 4 2 0

9

100 mg

8

25 mg

7 6 5 4 3 2 1 0

0

30 60 90 120 150 180 210 240

0

30

60

Time, min

c

d

8

120

150

180

8 7

DPC removed, mg

7 SDS removed, mg

90

Time, min

6 5 4 3 2 1

6 5 4 3 2 1

0

0 0

25

50

75

Bio-Beads, mg

100

0

25

50

75

100

Bio-Beads, mg

Fig. 3. Detergent absorption by Bio-Beads. Panel (a) shows the time course of SDS removal by the repeated addition (signaled by arrows) of 25 (closed squares) or 75 (closed triangles) mg of BB in 6 ml solution containing ~11–13 mg total SDS. Panel (b), kinetics of DPC elimination in the presence of two different amounts of Bio-Beads (closed squares and triangles for 25 and 100 mg, respectively). Panels (c) and (d) show the quantity of SDS and DPC absorbed as function of amount of wet Bio-Beads added. A binding capacity of 70 mg SDS or DPC per g of Bio-Beads was calculated from respective slopes.

Characterization of Membrane Protein Preparations: Measurement of Detergent Content

11

addition of Bio-Beads was measured and data plotted (Fig. 3c, d for SDS and DPC, respectively). The slope of the curves permitted to calculate a value of 70–80 mg SDS or DPC removed by gram of Bio-Beads (in agreement with previously published results for SDS (15). 3.3. Proteoliposomes Formation

Protein incorporation into liposomes is a powerful tool to investigate both functional and structural aspects of membrane proteins. Several strategies have been developed to achieve functional proteoliposomes including the use of organic solvent, mechanical protocols and the use of detergents (16). The latest is the most frequently chosen strategy as detergents are usually present in isolation and purification protocols. Standard procedures to incorporate detergent solubilized membrane protein in lipid bilayer involves different steps: (1) mixing of membrane protein solubilized in detergent with lipids to form a ternary complex, (2) exchange of detergent surrounding protein by lipid detergent, and (3) bilayers formation, mostly vesicle formation. Detergent removal can be followed by the above described method (Fig. 4a, d) and vesicle formation by measuring light scattering changes (Fig. 4b, e). Finally, proteoliposomes formation as well as the evolution in size and shape of the objects present in the solution can be assessed by electron microscopy (Fig. 4c, f). 1. A mixture of DMPC/DMPE (9/1 w/w) lipids at a final concentration of 0.5–1 mg/ml is added to a 6-ml solution of PBS. 2. Membrane protein solubilized in detergent at a final protein concentration of 0.1–0.2 mg/ml is added to the lipid containing solution. 3. OD at 550 nm of the solution is measured before and after the addition of protein. The presence of detergent from the purified membrane protein solution is usually enough to solubilize the lipids and is clearly observed by an important decrease in OD. At this wavelength, most proteins do not absorb and the size of the detergent, protein, lipids complexes are too small to induce large light scattering. 4. Detergent content of the starting solution: Aliquots of 40 µl was removed and mixed in 1 ml containing Bio-Rad dye solution (1:5 dilution). OD was measured at the detergent corresponding wavelength (Fig. 3a, b) and detergent content calculated from calibration curves (Figs. 1c and 2c). 5. For negative staining, 5 µl of the reconstitution samples were applied to carbon-coated grids, blotted, and stained with 1% uranyl acetate and the specimens were observed with an electron microscope (Fig. 4c, f).

12

Ostuni et al.

a

d 2

DPC, mg/ml

SDS mg/ml

1,5

1,0

0,5

0,0

1

0 0

30

60

90

120 150

0

60

Time, min

180

e 0,7 0,6 0,5 0,4

a b

c

d

0,3 0,2 0,1 0,0

0

60 120 180 240 Over night Time, min

c

0,7

Optical Density at 550 nm

b Optical Density at 550 nm

120

Time, min

0,6 0,5 0,4

b a

0,3 0,2

d

0,1

c

0

0 30 60 90 120150 Over night Time, min

f a

b

a

b

c

d

c

d

200 nm

200 nm

Fig. 4. Formation and characterization of RecTSPO containing proteoliposomes. Reconstitution process was followed by measuring detergent removal and vesicle formations from a solution containing lipids, RecTSPO and detergent (panels (a–c) and (d–f) for RecTSPO purified in SDS or DPC, respectively). Panel (a), time course of SDS removal by a single addition (signaled by arrows) of large amount (closed circles) or repeated addition of small amounts (closed triangles) of Bio-Beads. Panel (b), time course of light scattering changes upon SDS removal by a single (closed circles) or repeated addition of small amounts (closed triangles) of Bio-Beads. Panel (c), electron micrographs of negatively stained samples taken at different time of SDS elimination (0, 30, 90 min, and over night, as indicated in panel (b)). Panel (d), time course of DPC removal by a single addition (signaled by arrows) of large (closed squares) or small amount (closed circles) of Bio-Beads. Panel (e), time course of light scattering changes upon DPC removal by a single addition of large (closed squares) or small amounts (closed circles) of Bio-Beads. Panel (f), electron micrographs of negatively stained samples taken at different time of DPC elimination (0, 120 and 30, 90 min from large and small Bio-Beads additions, as indicated in panel (e)).

Characterization of Membrane Protein Preparations: Measurement of Detergent Content

13

6. Detergent present in the solution is removed by addition of Bio-Beads. The quantity of added Bio-Beads regulates the rate of vesicle formation by controlling the amount of detergent removed (compare Fig. 4a–c for SDS and 4d–f for DPC). Addition of Bio-Beads mass close to the binding capacity can be chosen to get a fast reconstitution, whereas addition of a smaller mass generates a slower reconstitution process. 7. Formation of proteoliposomes is basically correlated to detergent removal, but several remarks should be done. (1) The kinetics of vesicle formation is not exactly parallel with that of the detergent removal as revealed by the difference observed between the measurements of detergent residual (Fig. 4a, d for SDS and DPC, respectively) and the changes in OD550 (Fig. 4b, e for SDS and DPC, respectively). (2) The final OD550 reached after detergent removal correlates the size of the vesicles formed, but the size of the vesicles seems to vary with the type of detergent used to solubilize the protein (see Note 4). Comparison of Fig. 4c, f shows that vesicles are smaller when SDS was the solubilizing detergent. (3) In some cases, vesicles tend to fuse leading to an increase of OD550 well correlated with the observation of vesicle aggregation. 3.4. Ligand Binding

Ligand binding is a useful method to characterize functional state of a membrane protein. Several techniques can be used among which incubation of proteoliposomes in the presence of radioactive ligands is a very sensitive one. Separation of bound and free ligand can be obtained by filtration of proteoliposomes. In this case, the choice of the filter is crucial. In the specific case of TSPO, whose high affinity drug ligands are hydrophobic compounds, preliminary studies have to be performed to optimize the ligand binding experiments. 1. Choice of the filter: They should retain the proteoliposomes and minimize the radioactive background (mostly due to filter dead volume and nonspecific binding on the filter). Different composition and pore size filters are commercially available, and Table 1 shows the characteristics obtained with TSPO proteoliposomes and its high affinity drug ligand, PK 11195. Proteoliposome retention was performed by filtering a solution and measuring the TSPO intrinsic fluorescence of the filtrate compared to the proteoliposmes solution. The radioactive background was calculated by measuring retained radioactivity in the filters in the absence of proteoliposomes. Table 1 shows that the best filters to measure [3H]PK 11195 binding to RecTSPO are the Whatman GF/C and that background can be reduced by washing the filters with buffer.

14

Ostuni et al.

Table 1 Different filters tested for measuring radioactive ligand binding to TSPO Filter

Proteoliposomes retention (% of total)

Background radioactivity (% of total) Nonwashed

Washed

Whatman GF/B

74 ± 13

29.4

7.7

Whatman GF/C

62 ± 12

24.1

2.6

Whatman GF/F

60 ± 15

42.9

14.3

Millipore

86 ± 10

94.3

70.5

For proteoliposome retention by the various filters, 1 ml of a solution containing 0.5–10 µg of RecTSPO was filtered and the protein content of the filtrate (i.e., nonretained on the filter) was measured by intrinsic fluorescence of the RecTSPO using a calibration curve. Results are expressed as percentage of total protein filtered. For ligand retention by the various filters, 0.3 ml of a solution containing cold and [3H] PK 11195 was filtered under the same condition of binding experiments, filters were washed or not two times with 4 ml cold PBS, and radioactivity trapped in the filters was measured in a liquid scintillation counter. Retained radioactivity was expressed as a percentage of total sample radioactivity. Based on the results showed in this table, the GF/C grade glass microfiber filters were chosen to perform RecTSPO binding studies

2. Experimental conditions: The signal-to-background ratio can be optimized adjusting protein concentration and incubation time. Figure 5a shows that signal-to-background ratio increases linearly with RecTSPO amounts. For drug ligands, such as PK 11195, the affinity for RecTSPO is in the nanomolar range; thus, the optimal protein concentration used in the experimental condition of ligand binding should be lower to measure affinity constant. Figure 5b shows the incubation time is also important for such hydrophobic ligand such as PK 11195. Indeed, the binding of PK 11195 to the RecTSPO is fast, but the PK 11195 bound and free decreases with time. The optimal signal-to-background ratio is observed for 5–15-min incubation (see Note 5). 3. Direct ligand binding experiments: Proteoliposomes were incubated in the presence of increasing concentration of [3H] PK 11195 and filtered. For each ligand concentration, the specific binding was calculated as the difference between total binding and nonspecific binding (measured in the presence of a large excess of cold PK 11195). Figure 5c shows that [3H] PK 11195 bound increases with the total PK 11195 concentration and reaches a plateau corresponding to the saturation

Characterization of Membrane Protein Preparations: Measurement of Detergent Content

a

b

5000

8000 6000

3000

C.P.M.

C.P.M.

4000

15

2000

4000 2000

1000 0

0 0

1 2 TSPO, (µg)

0

10

20 30 40 Time, (min)

50

60

d 120 [3H]PK 11195 bound, (%)

[3H]PK 11195 bound, (nmol/mg)

c

3

40 30 20 10 0

0

10 20 PK 11195, (nM)

100 80 60 40 20 0 0,01

1 100 PK 11195, (nM)

10000

Fig. 5. Control of protein functionality by ligand-binding experiments. Panel (a) shows the ligand binding as function of reconstituted RecTSPO quantities incubated for 15 min at 25°C. Circles indicate the radioactivity due to total [3H]PK 11195 binding, whereas triangles show the nonspecific binding to filters and membranes in the presence of a large excess of nonradioactive PK 11195. Panel (b) shows the time course of ligand binding using 2 ng of reconstituted RecTSPO incubated at 25°C. Circles and triangles indicate the radioactivity due to total PK 11195 bound and nonspecific binding, respectively. Panel (c) shows the ligand-binding saturation curve obtained incubating 2 ng of reconstituted RecTSPO for 15 min at 25°C. Binding of [3H]PK 11195 is expressed as nmol of bound ligand per mg of RecTSPO. Panel (d) shows [3H]PK 11195 binding of 2 µg of RecTSPO incubated for 30 min at 25°C in the presence of raising concentration of cold PK 11195.

of the RecTSPO binding sites. The binding constants (affinity constant, Kd, and maximal binding value, Bmax) can be obtained by fitting the saturation curve with the following equation Y = (Bmax × S)/(Kd + S), giving a Kd of 6.5 ± 0.5 nM and a Bmax of 43 ± 8 nmol/mg protein. 4. Competition binding experiments: Proteoliposomes were incubated in the presence of a constant concentration of [3H] PK 11195 and rising concentration of cold PK 11195. The various solutions were filtered and the specific binding was calculated as the difference between total binding and nonspecific binding (the asymptotic value observed in the presence of a 1,000-fold excess of cold PK 11195). Figure 5d

16

Ostuni et al.

shows that [3H]PK 11195 bound decreases with increasing concentration of cold PK 11195. The inhibition constant (IC50) can be obtained by fitting the curve with the following equation Y = Max−(Max × S)/(Kd + S), giving an IC50 of 10 ± 2 nM. 3.5. Conclusions

The method presented here permits to reproducibly measure the amount of detergent present in various conditions during the processes leading from the purification, the functional characterization and the structure determination of a membrane protein. It has been characterized for detergent used to purify RecTSPO in SDS or DPC for the reconstitution of this membrane protein in liposomes and the further measurements of ligand binding. The results gained from this method permit to know the detergent at the different steps of sample preparations for functional and structural studies. It permits not only to follow complete detergent removal to form proteoliposomes used for ligand binding experiments but also to control the protein–detergent ratio, which is an important factor in the crystallization process.

4. Notes 1. It should be mentioned that calibration curves adjust to sigmoid function with threshold and saturation levels. The linear part corresponds to a narrow range of detergent concentrations but with an important change in OD giving a very sensitive method in this range of concentration (Figs. 1 and 2). The effect of methanol is small, but calibration curve has to be done taking into account the methanol content in the aliquot from the precipitate membrane protein. Moreover, volume added has to be as small as possible (10–40 µl in 1 ml) to reduce dilution effect. 2. When detergent content was too low to be in the linear part of the calibration curve, a first addition of well-characterized detergent solution was performed to reach the linear region and then an aliquot of the sample is added. The detergent concentration in the sample is finally calculated by difference. 3. Bio-Beads are prepared in aqueous solution, and thus, weighting needs water removal. However, beads also need to be maintained wet to keep their detergent absorption capacity. Real amounts of weighted Bio-Beads might vary from one operator to another since water present with the “wet” Bio-Beads might vary. The best is that each operator performs its own calibration curves.

Characterization of Membrane Protein Preparations: Measurement of Detergent Content

17

4. We have no clear explanation for the reasons leading to the formation of different sizes of vesicles when starting from RecTSPO purified in SDS or in DPC. It might be due to different types of interactions between detergent and proteins that either favor or delay lipid–proteins interactions. Different compositions of ternary complexes (protein–lipid–detergent) might be generated as function of the type of used detergent. This might lead to larger or smaller vesicles related to the lipid amounts around the protein. 5. Incubation time should not be extended too long since free PK 11195 diminishes with time. This might be due to nonspecific binding of hydrophobic PK 11195 to the glass tube surface.

Acknowledgment The authors would like to thank Professor V. Papadopoulos for the generous gift of TSPO plasmids, G. Péranzi and A. Letort for their contribution in preliminary experiments. This work was supported by CNRS (Centre National de la Recherche Scientifique) and ANR (Agence National pour la Recherche) Grant 06-Blan0190-01 to JJL. References 1. Lacapere J-J, Pebay-Peyroula E, Neumann J-M, Etchebest C (2007) Determining membrane protein structures: still a challenge! Trends Biochem Sci 32:259–270 2. Grisshammer R, Tate C (2003) Preface: overexpression of integral membrane proteins. Biochim Biophys Acta 1610: 1 3. Garavito RM, Ferguson-Miller S (2001) Detergents as tools in membrane biochemistry. J Biol Chem 276:32403–32406 4. LeMaire M, Champeil P, Møller JV (2000) Interaction of membrane proteins and lipids with solubilizing detergents. Biochim Biophys Acta 1508:86–111 5. Pebay-Peyroula E (ed) (2008) Biophysical analysis of membrane protein. Investigating structure and function. Wiley-VCH, Weinheim 6. Gasteiger E, Gattiker A, Hoogland C, Ivanyi I, Appel RD, Bairoch A (2003) ExPASy: the proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res 31:3784–3788 7. Fleischer S, Rouser G, Fleischer B, Casu A, Kritchevski G (1967) Lipid composition of

8.

9.

10.

11.

mitochondria from bovin heart, liver, and kidney. J Lipid Res 8:170–180 Schiller J, Arnhold J, Benard S, Müller M, Reichl S, Arnold K (1999) Lipid analysis by matrix-assisted laser desorption and ionization mass spectroscopy: a methodological approach. Anal Biochem 267:46–56 Mokus M, Kragh-Hansen U, Letellier P, le Maire M, Møller JV (1998) Construction and use of a detergent-sensitive electrode to measure dodecyl sulfate activity and binding. Anal Biochem 264:34–40 Eriks LR, Mayor JA, Kaplan RS (2003) A strategy for identification and quantification of detergents frequently used in the purification of membrane proteins. Anal Biochem 323:234–241 Papadopoulos V, Baraldi M, Guilarte TR, Knudsen TB, Lacapere J-J, Lindemann P, Noremberg MD, Nutt D, Weizman A, Zhang M-R, Gavish M (2006) Translocator protein (18 kDa): new nomenclature for the peripheral-type benzodiazepine receptor based on

18

Ostuni et al.

its structure and molecular function. Trends Pharmacol Sci 27:402–409 12. Holloway PW (1973) A simple procedure for removal of triton X-100 from proteins samples. Anal Biochem 53:304–340 13. Rigaud J-L, Mosser G, Lacapere J-J, Olofson A, Levy D, Ranck J-L (1998) Bio-Beads: an efficient strategy for two-dimensional crystallization of membrane proteins. J Struct Biol 118:226–235 14. Lacapere J-J, Stokes DL, Olofsson A, Rigaud J-L (1998) Two-dimensional crystallization

of Ca-ATPase by detergent removal. Biophys J 75:1319–1329 15. Lacapere J-J, Delavoie F, Li H, Péranzi G, Maccario J, Papadopulos V, Vidic B (2001) Structural and functional study of reconstituted peripheral benzodiazepine receptor. Biochem Biophys Res Commun 284:536–541 16. Rigaud J-L, Pitard B, Levy D (1995) Reconstitution of membrane proteins into liposomes: application to energy-transducing membrane proteins. Biochim Biophys Acta 1231:223–246

Chapter 2 Native Membrane Proteins vs. Yeast Recombinant: An Example: The Mitochondrial ADP/ATP Carrier Bertrand Arnou, Cécile Dahout-Gonzalez, Ludovic Pelosi, Guy J.-M. Lauquin, Gérard Brandolin, and Véronique Trézéguet Abstract The mitochondrial ADP/ATP carrier (Ancp) has long been a paradigm for studies of the mitochondrial carrier family due to, among other properties, its natural abundance and the existence of specific inhibitors, namely, carboxyatractyloside (CATR) and bongkrekic acid (BA), which lock the carrier under distinct and stable conformations. Bovine Anc1p isolated in complex with CATR in the presence of an aminoxyde detergent (LAPAO) was crystallized and its 3D structure determined. It is the first mitochondrial carrier structure resolved at high resolution (2.2 Å, as reported by Pebay-Peyroula et al. (Nature 426:39–44, 2003)). Analyses revealed a monomer while most of the biochemical studies led to hypothesize Ancp functions as a dimer. To address the structural organization issue, we engineered a mutant of the yeast Ancp that corresponds to a covalent homodimer in view of 3D structure determination. We compare in this chapter the purification yield and quality of the chimera tagged either with six histidines at its C-ter end or nine histidines at its N-ter. We show that, as expected, length and position of the tag are important criteria for qualitative purification. We also discuss the advantages and drawbacks of purifying Ancp either from a natural source or from engineered yeast cells. Key words: Mitochondrial ADP/ATP carrier, Mitochondrial carrier, High yield purification

1. Introduction Major metabolic pathways occur within mitochondria, which is the primary site of ATP synthesis. It is isolated from the cytosol by two membranes (outer and inner) that are therefore important sites for the regulation of metabolic functions. The mitochondrial carrier family (MCF) members are integral membrane proteins that transport various metabolites through the mitochondrial inner membrane. Among them, the mitochondrial ADP/ATP carrier (Ancp) was the first to be identified around 40 years ago. Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_2, © Springer Science+Business Media, LLC 2010

19

20

Arnou et al.

It plays a key role in the energetic cell metabolism because it exchanges ATP and ADP, respectively product and substrate of the mitochondrial ATP synthase. Ancp is the most abundant among MCF members (up to 10% of mitochondrial proteins in beef heart mitochondria) and has to cope with high nucleotide amounts in order to fulfill cell energetic requirements. It can be purified pretty easily in one or two steps from beef mitochondria in high amount and in crystallization compatible quality. However, the high-resolution structure of the beef Anc1p in complex with its inhibitor carboxyatractyloside (CATR) evidenced a monomeric organization (1). This was contradictory to many previously published results about different MCF members, of which Ancp shares functional and structural features and was shown to be organized as dimers (for a review see (2)). We hypothesized that protein preparation, and more precisely, the protein concentration step necessary to crystallization trials was responsible for Ancp dimer dissociation. Therefore, we used an engineered Ancp mutant (3) that forces a dimeric organization of the carrier throughout the purification and crystallization processes; we named it AA in this chapter. However, we previously showed that it was necessary to tag the yeast Ancp to get it highly purified (4). Consequently, the chimera was tagged with either six histidines at its C-ter (AAH6) or nine histidines at its N-ter (H9AA).

2. Materials 2.1. Chemicals

1. n-dodecyl-b-D-maltoside (DDM) was purchased from Anatrace. [14C]DDM was kindly provided by Marc Le Maire (CEA, CNRS, Université Paris-Sud 11, France). 2. Atractyloside (ATR) and carboxyatractyloside (CATR) were obtained from Sigma. 3. The nickel-nitrilotriacetic acid agarose matrix (Ni-NTA) for metal affinity chromatography is from Qiagen. It is provided as a 1:1 (vol/vol) suspension in 50% ethanol (vol/vol). 4. 3-Laurylamido-N,N¢-dimethylpropylaminoxide (LAPAO) was synthesized as described in (5). It can now be purchased from Anatrace. 5. Hydroxyapatite Bio-Gel® was purchased from Bio-Rad. 6. Thrombin was from Sigma and used as recommended by the supplier.

2.2. Buffer Compositions

1. Buffer A: 150 mM NaPi pH 7.3, 10% (w/v) glycerol, 0.1% (w/v) DDM.

Native Membrane Proteins vs. Yeast Recombinant

21

2. Buffer B: 100 mM Na2SO4, 10 mM Tris–HCl pH 7.3 and 1 mM Na2-EDTA pH 8.0. 3. Buffer C: 33% (v/v) glycerol, 6 mM MgSO4, 150 mM NaPi pH 7.3. 4. Buffer D: 150 mM NaPi pH 7.3, 10% (w/v) glycerol, 0.05% (w/v) DDM, 40 mM imidazole pH 8.0. 5. Buffer E: 10 mM MOPS-NaOH pH 6.8, 150 mM NaCl, 10% (w/v) glycerol, 0.05% (w/v) DDM. 6. Buffer F: 500 mM imidazole pH 8.0, 10 mM MOPS-NaOH pH 6.8, 150 mM NaCl, 10% glycerol, 0.05% (w/v) DDM. 7. Buffer G: 10 mM MOPS-NaOH pH 6.8, 150 mM NaCl, 10% glycerol, 0.05% (w/v) DDM. 8. Buffer H: 100 mM NaCl, 10 mM Tris–HCl pH 7.4, 1 mM Na2-EDTA, 0.05% LAPAO (w/v). 9. Buffer I: 500 mM NaCl, 10 mM Tris–HCl pH 7.4, 1 mM Na2-EDTA. 10. Buffer J: 10 mM Tris–HCl pH 7.4, 1 mM Na2-EDTA, 0.05% LAPAO.

3. Methods 3.1. The 6-Histidine Tag: Mutagenesis, Plasmids, and Strains

3.2. The 9-Histidine Tag: Mutagenesis, Plasmids, and Strains

1. Construction of the gene encoding Anc2p tagged with six histidines at its C-ter (AH6) is described in (4). 2. To construct the gene coding for the covalent tandem dimer of Anc2p tagged with six histidines at its C-ter (AAH6), the wild type ScANC2 gene was amplified with the following primers: 5¢-GAATTCGGATCCATGTCTTCCAACGCCC AAGTCAAAA-3¢ and 5¢-CTGGATCCTTTGAACTTCTTA CCAAACAAGATC-3¢ to introduce BamHI sites (underlined) on each side of the ORF and remove the stop codon. The fragment was then subcloned into the unique BamHI site of the pAH6 plasmid (4), which contains the gene encoding for AH6. After control of the orientation of the fragment, the 3¢ ScANC2 terminator region (amplified with two primers containing each an XbaI site) was introduced. The resulting (PstI-NotI) fragment containing the promoter, the AAH6 encoding gene and the terminator was used for its integration into the JL1-3D2 strain (6) by homologous recombination at the ANC2 locus. The resulting strain was named JL-AAH6. 1. The His9(ANC2)2 gene was obtained by site directed mutagenesis using the Transformer™ Site-directed Mutagenesis Kit (CLONTECH Laboratories). The protein produced from

22

Arnou et al.

this gene corresponds to a covalent tandem homodimer of Anc2p (3) tagged with nine histidines at its N-ter. 2. The mutagenesis target plasmid was obtained from KSDIM5¢3¢ (3) that was digested first by SalI and NotI to shorten the ScANC2 3¢ noncoding region. The resulting plasmid was named KSDIM5¢3¢DSN. 3. The mutagenic primer 5¢catacatataagcaaatacaattgccATG GGT CAC CAT CAC CAC CAT CAC CAT CAC CAC TCT TCA GGT TTA GTT CCT AGA GGT TCT TCC AAC GCC CAA G3¢ was designed (1) to replace the single EcoRI site of KSDIM5¢3¢DSN with a MfeI site (lower case, underlined) to select for mutagenesis events; (2) to introduce a cluster of nine histidines (capital, bold) at the N-terminus of the covalent tandem dimer of Anc2p (capital, italic); and (3) to introduce the recognition and digestion sites of thrombin (capital, underlined). Successful mutagenesis was assessed by DNA sequencing. The resulting plasmid was named KSH9DIM. 4. The His9ANC2 gene was obtained by digestion of KSH9DIM by HindIII and re-ligation of the biggest fragment to obtain a single copy of the ScANC2 gene. The resulting plasmid was named KSH9MON. 5. The KpnI-SacI fragment containing either the His9ANC2 or the His9(ANC2)2 gene was used for its integration into the JL1-3D2 strain (6) by homologous recombination at the ANC2 locus. The resulting strain was named JL-H9A or JL-H9AA, and the produced protein is H9A or H9AA. 3.3. Isolation of Mitochondria from Yeast

1. Protocol and materials used to perform mitochondria isolation are described in (6). They are frozen in small beads in liquid nitrogen and stored at −70°C prior to protein purification (see Note 1). 2. In the case of H9AA and AAH6, a cocktail of protease inhibitors (1 µg/mL pepstatin A, 1 µg/mL leupeptin, 1 µg/mL antipain, 5 µg/mL aprotinin, and 1 mM Na2-EDTA, final concentrations) is added to the buffers during mitochondria isolation to prevent protein degradation (7).

3.4. Isolation of Mitochondria from Bovine Heart

3.5. AAH6 Purification

Bovine mitochondria were isolated from heart muscle by differential centrifugation as described by (8). They were suspended in 0.27 M sucrose, 2 mM Tris–HCl pH 7.4 and stored in liquid nitrogen (see Note 1). 1. The protocol used to purify AH6 and described in (4) is applied to purify AAH6. It consists of three steps: hydroxyapatite Bio-Gel® chromatography, immobilized metal ion chromatography, and removal of imidazole by dialysis or AcA202 chromatography.

Native Membrane Proteins vs. Yeast Recombinant

3.6. H9A and H9AA Purifications

23

1. The Ni-NTA resin (Qiagen) (1 ml resin for 5 mg total protein) is washed twice with 5 vol H2O then equilibrated with Buffer A (2 × 3 resin volumes). 2. When necessary, isolated mitochondria (100 mg) are incubated in the presence of 600 nmol CATR for 15 min at 4°C. 3. Membrane proteins are solubilized with 1% (w/v) or 3% (w/v) DDM in Buffer B for 15 min at 4°C supplemented with an antiprotease cocktail as described in (7) in the case of H9AA. During this step the final protein concentration is 10 mg/mL. 4. The lysate is centrifuged at 24,000 × g for 10 min at 4°C and the supernatant is supplemented with Buffer C (0,5 vol for 1 vol of supernatant) and 20 mM imidazole. It is loaded onto the equilibrated Ni-NTA resin using a batch procedure and incubated for 1 h at 4°C in a rotating apparatus. 5. The flow-through fraction is removed by a 10-min centrifugation at 4,000 × g and 4°C. 6. The resin is then resuspended with two volumes of Buffer D and poured into a column. It is thereafter washed with two volumes of the same buffer. The two eluted fractions (four resin volumes) are pooled for further analyses. 7. The resin is washed with five volumes of Buffer E. The eluate is usually collected in two fractions: one corresponds to the first 4.5 × resin volumes and the other to the last 0.5 × resin volume. 8. The protein is eluted with three resin volumes of Buffer F. Elution of the protein is followed by absorbance measurement at 280 nm. 9. Imidazole is removed by an Ultrogel® AcA 202 size exclusion chromatography (four resin volumes for one sample volume). The eluant is Buffer G. Elution of the protein is followed by absorbancy measurement at 280 nm. Both H9A and H9AA proteins are obtained highly purified as shown in Fig. 1.

3.7. Protein Concentration

1. The protein concentration after Ultrogel® AcA 202 chromatography is far from being sufficient for crystallization trials. The fraction of interest is loaded on Amicon® Centriprep® YM30 units (Millipore). The molecular weight cutoff is 30,000 Da. The units are centrifuged at 4°C and 900 × g to reduce ten times the fraction volume. 2. Detergent is concentrated at the same time and this may lead to protein denaturation. Thus, detergent is removed using by Bio-Beads® SM2. 3. Bio-Beads are activated as described in (10).

24

Arnou et al.

Fig. 1. Analyses of purified H9A and H9AA. Ten microliters of the imidazole elution fraction (lanes 1 and 2 : H9A; lanes 3 and 4: H9AA) is analyzed by SDS-PAGE (12.5%). The gels are divided in two parts, one is Coomassie blue stained (lanes 1 and 3 ) and the other is transferred onto a nitrocellulose membrane (lanes 2 and 4 ) and immunostained with an antibody directed against a peptide corresponding to the last 14 amino acids of ScAnc2p as described in (17).

4. The amount of Bio-Beads necessary to remove 90% of the detergent is calculated considering that 100 mg (dry weight) of Bio-Beads after activation can bind 1.8 mg of DDM after 3 h at 4°C, as determined using [14C]DDM. Therefore, the sample should contain at the end of this step about 0.05% DDM corresponding approximately to the initial DDM concentration. 5. The concentrated protein fraction is incubated with the appropriate Bio-Beads amount for 3 h at 4°C under mild shaking (see Note 2). Bio-Beads are removed by three successive centrifugations (3 min, 3,000 × g). 6. Steps 4 and 5 are repeated until the Ancp concentration reaches around 10 mg/mL. Protein concentration is determined by absorbance measurement at 280 nm. The molar extinction coefficients are calculated from the H9A and H9AA tryptophanyl and tyrosyl residue content (11): 35,870/M cm for H9A and 71,740/M cm for H9AA. 7. Concentrated H9A and H9AA can be stored at 4°C but preferably are immediately submitted to crystallization trials or to tag removal process. 3.8. Removal of the 9-Histidine Tag

1. Enzyme units (0–20) of thrombin are added to 1 mg of tagged H9AA in 1 ml and left over for 20 h at 4°C with mild shaking. Under such conditions, the histidine tag is almost completely removed for a thrombin to protein ratio of 20 U/mg (Fig. 2).

3.9. Purification of bAncp

1. Hydroxyapatite is suspended in an ice-cold Buffer H and washed with the same buffer according to the supplier’s instructions. 2. Mitochondria (50 mg of mitochondrial proteins) are unfrozen and incubated in Buffer I (4 ml final volume) with 25 µM CATR for 10–15 min at 0°C.

Native Membrane Proteins vs. Yeast Recombinant

25

Fig. 2. Polyhistidine tag removal. 0, 5, 10, or 20 enzymatic units of thrombin are added to 1 mg of tagged H9AA in 1 ml and left over for 20 h at 4°C. The samples are analyzed by SDS-PAGE (12.5%), and the protein is revealed by Coomassie blue staining.

3. Membrane proteins are solubilized by addition of 1 mL 10% (w/v) LAPAO and stirring of the mixture. After standing on ice for 10 min, the mitochondrial lysate is centrifuged at 20,000 × g for 10 min. 4. The supernatant is layered on a hydroxyapatite Bio-Gel® column (2.5 cm diameter, 25 mL settled gel) and the elution is performed using Buffer H and monitored online by UV absorbance at 280 nm. 5. The pass-through fraction is collected and concentrated to approximately 10 mL by pressure dialysis on an Amicon YM30 membrane. It is then subjected to a gel-exclusion chromatography on Ultrogel AcA202 resin (BioSepra) to remove small solutes such as Ca2+ ions, phosphate and nucleotides from the bAncp preparation. In addition, this step allows the protein preparation to be placed under appropriate conditions with respect to salt concentration and pH; whereas equivalent to dialysis, it is considerably shorter and in this way is probably less damaging. In most cases, the chromatography is carried out in a column (2.5 cm diameter) containing 40 ml of settled gel equilibrated in Buffer J, supplemented with either 5 or 100 mM NaCl. 6. The protein fraction containing the purified BAncp·CATR complex is treated with moist activated (see above) Bio-Beads (Bio-Rad, 100 mg per mg protein) for 2 h at 4°C to remove excess detergent (see Note 2). It is then filtered on a 0.45-mm nitrocellulose filter and concentrated to approximately 10 mg/mL on a Centricon YM 30 device (Amicon). In this preparation, the protein is exclusively present as the Anc1p isoform as assessed from in-gel proteolysis combined to mass spectrometry analysis. 3.10. Comments on Use of Native bAncp vs. Recombinant ScAncp 3.10.1. Practical Aspects

Experiments carried out with the bovine ADP/ATP carrier obviously benefit from the abundance and easiness to obtain the amount of biological material required for biochemical investigations. This is first illustrated by the fact that 5–10 g of mitochondrial proteins can be routinely extracted within a few hours from one beef heart. This amount corresponds to almost 0.5–1 g of bAncp. In addition, purification of bAncp is undoubtedly facilitated by the easiness of mitochondria solubilization with detergents

26

Arnou et al.

and by the high efficiency of adsorption chromatography on hydroxyapatite (12). This allows recovery of pure carrier in a single step with a 90–100% yield. Yeast mitochondria are isolated to a lower yield and operation extends over a longer time if considering 1–2 days required for cell growth. Approximately 10–20 mg of mitochondrial proteins, corresponding to 0.5–1 mg of ScAnc2p depending on the nature of strains, are extracted from 1 l of culture. Attempts to isolate ScAnc2p by chromatography on hydroxyapatite led to preparations containing other mitochondrial proteins such as the phosphate carrier and VDAC in various amounts, depending on the detergent nature. Therefore, it was necessary to engineer polyhistidine-tagged forms of ScAnc2p that we purified by IMAC. 3.10.2. The Genetic Approach

Yet, yeasts offer the undeniable advantage of genetic approaches to investigate structure–function relationships of the ADP/ATP carrier essentially because (1) they contain endogenous Ancps; (2) their genetics is well known; and (3) they are well suited for the rapid screening of the functional state of carrier mutants due to their ability to grow under either fermentative or respiratory conditions. Site-directed mutagenesis has been used to locate strategic amino acid residues expected to play a role in the transport mechanism. Mutated forms of Anc2p able to sustain the growth of yeast on nonfermentable carbon sources were characterized with respect to ADP/ATP transport in isolated mitochondria or in proteoliposomes. Other approaches consisted in removing/introducing appropriate residues in ScAncp without impairing its full transport activity for probing ligand-induced conformational changes of the isolated carrier or for assessing the topography of the membrane embedded carrier. This is illustrated, for example by the use of tryptophanyl mutants using fluorometric approaches (13,14) or that of cysteinyl mutants for chemical labeling experiments (reviewed in (2)). Genetic handling of yeasts was used with the purpose of functional and/or structural explorations to engineer chimeras in which ScAncp was linked to itself (3,15), to the phosphate carrier (7) or to cytochrome c (16), and also for the heterologous expression of the human Ancps to understand the role of pathogenic point mutations (6). Yeast will undoubtedly afford the means to isolate an Anc2p mutant stabilized in the BA conformation which so far could not be crystallized from isolated beef Ancp due to the difficulties to handle a stable bAncp·BA complex. BA refers to bongkrekic acid, which is the other specific Ancp inhibitor. It is recognized that CATR and BA, the binding of which to Ancp is mutually exclusive, stabilize the ADP/ADP carrier in two distinct conformations. This led to the conclusion that Ancp adopts at least two

Native Membrane Proteins vs. Yeast Recombinant

27

different conformations in the membrane, which are probably involved in nucleotide transport. Therefore understanding this process at the molecular level involves deciphering the 3D structure of the ScAnc2p BA complex in comparison with the 3D structure of the Ancp CATR complex.

4. Notes 1. It is recommended to store the frozen mitochondria as 50–100 µL beads. This facilitates the withdrawing of the wanted amount of material prior to each experiment. Beads are made immediately after isolation of mitochondria by dropping the suspension from a pipette into a small volume of liquid nitrogen. They are then handled with a spatula or with forceps. 2. The mixture is gently stirred in a tube rotating horizontally to prevent extensive bead break up. References 1. Pebay-Peyroula E, Dahout-Gonzalez C, Kahn R, Trézéguet V, Lauquin GJ-M, Brandolin G (2003) Structure of mitochondrial ADP/ATP carrier in complex with carboxyatractyloside. Nature 426:39–44 2. Nury H, Dahout-Gonzalez C, Trézéguet V, Lauquin GJ-M, Brandolin G, Pebay-Peyroula E (2006) Relations between structure and function of the mitochondrial ADP/ATP carrier. Annu Rev Biochem 75:713–741 3. Trézéguet V, Le Saux A, David C, Gourdet C, Fiore C, Dianoux A, Brandolin G, Lauquin GJ-M (2000) A covalent tandem dimer of the mitochondrial ADP/ATP carrier is functional in vivo. Biochim Biophys Acta 1757:81–93 4. Fiore C, Trézéguet V, Roux P, Le Saux A, Noël F, Schwimmer C, Arlot D, Dianoux A-C, Lauquin GJ-M, Brandolin G (2000) Purification of histidine-tagged mitochondrial ADP/ATP carrier: influence of the conformational states of the C-terminal region. Protein Expr Purif 19:57–65 5. Brandolin G, Doussiere J, Gulik A, GulikKrywicki T, Lauquin GJM, Vignais PV (1980) Kinetic, binding and ultrastructural properties of the beef heart adenine nucleotide carrier protein after incorporation into phospholipid vesicles. Biochim Biophys Acta 592:592–614 6. De Marcos Lousa C, Trézéguet V, Dianoux A-C, Brandolin G, Lauquin GJ-M (2002)

7.

8.

9.

10. 11.

12.

The human mitochondrial ADP/ATP carriers: kinetic properties and biogenesis of wild type and mutant proteins in the yeast S. cerevisiae. Biochemistry 41:14412–14420 Postis V, De Marcos Lousa C, Arnou B, Lauquin GJ-M, Trézéguet V (2005) Subunits of the yeast mitochondrial ADP/ATP carrier: cooperation within the dimer. Biochemistry 44:14732–14740 Smith AL (1967) Preparation, properties and conditions for assay of mitochondria, slaughterhouse material, small-scale. Methods Enzymol 10:81–86 Fiore C, Trézéguet V, Le Saux A, Roux P, Schwimmer C, Dianoux A-C, Noël F, Lauquin GJ-M, Brandolin G, Vignais PV (1998) The mitochondrial ADP/ATP carrier: structural, physiological and pathological aspects. Biochimie 80:137–150 Holloway PW (1973) A simple procedure for removal of Triton X-100 from protein samples. Anal Biochem 53:304–308 Pace CN, Vajdos F, Fee L, Grimsley G, Gray T (1995) How to measure and predict the molar absorption coefficient of a protein. Protein Sci 4:2411–2423 Riccio P, Aquila H, Klingenberg M (1975) Purification of the carboxyatractylate binding protein from mitochondria. FEBS Lett 56:133–138

28

Arnou et al.

13. Le Saux A, Roux P, Trézéguet V, Fiore C, Schwimmer C, Dianoux A-C, Vignais PV, Brandolin G, Lauquin GJ-M (1996) Conformational changes of the yeast mitochondrial adenosine diphosphate/adenosine triphosphate carrier studied through its intrinsic fluorescence. 1. Tryptophanyl residues of the carrier can be mutated without impairing protein activity. Biochemistry 35:16116–16124 14. Roux P, Le Saux A, Trézéguet V, Fiore C, Schwimmer C, Dianoux AC, Vignais PV, Lauquin GJ-M, Brandolin G (1996) Conformational changes of the yeast mitochondrial adenosine diphosphate/adenosine triphosphate carrier studied through its intrinsic fluorescence. 2. Assignment of tryptophanyl residues of the carrier to the responses

to specific ligands. Biochemistry 35: 16125–16131 15. Hatanaka T, Hashimoto M, Majima E, Shinohara Y, Terada H (1999) Functional expression of the tandem-repeated homodimer of the mitochondrial ADP/ATP carrier in Saccharomyces cerevisiae. Biochem Biophys Res Commun 262:726–730 16. Dassa EP, Dahout-Gonzalez C, Dianoux AC, Brandolin G (2005) Functional characterization and purification of a Saccharomyces cerevisiae ADP/ATP carrier-iso 1 cytochrome c fusion protein. Protein Expr Purif 40:358–369 17. Marty I, Brandolin G, Gagnon J, Brasseur R, Vignais PV (1992) Topography of the membrane-bound ADP/ATP carrier assessed by enzymatic proteolysis. Biochemistry 31:4058–4065

Chapter 3 Bacterial Overexpressed Membrane Proteins: An Example: The TSPO Jean-Claude Robert and Jean-Jacques Lacapère Abstract The mitochondrial membrane TranSlocator PrOtein (TSPO) is a 18-kDa transmembrane protein involved in various mitochondrial functions, among which the best characterised is cholesterol transport and steroid formation. Determination of its structure would be an important step to understand the mechanism of transport and its regulation. Purification from native membranes is difficult in respect with amounts of homogeneous purified proteins needed for biophysical, structural, and functional studies. Efficient heterologous overexpression in bacterial system, purification on affinity column, and biochemical characterisation has been successfully developed. Large-scale production of detergent-solubilized TSPO has been obtained with fermentation coupled to fast protein liquid chromatography procedure. Small-scale production at lower cost for isotopically labelled recombinant TSPO and/or detergent is also presented. Key words: Peripheral-type benzodiazepine receptor (PBR), Expression vector, E. coli bacteria, Inclusion body, Ni-NTA resin, NMR

1. Introduction Structural studies of membrane proteins require large amounts of purified and concentrated proteins (1). However, most membrane proteins are not naturally abundant and overexpression is one way that has been developed to overcome such difficulty. Bacteria, such as Escherichia coli (E. coli) cells, are the preferred host for recombinant protein expression because they are rather easy to genetically manipulate and expression is fast, typically producing protein in a single day or less than 24 h. A large number of vectors with different fusion tags for purification have been developed over the last decades (2). Histidine fusion tag is one of

Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_3, © Springer Science+Business Media, LLC 2010

29

30

Robert and Lacapère

the most commonly used since it often enables easy purification on NTA-Ni affinity columns. TranSlocator PrOtein (TSPO), previously named peripheraltype benzodiazepines receptor (PBR) is a transmembrane protein mostly located in mitochondria and initially discovered as a class of binding site for benzodiazepine distinct from the GABAa receptors from the central nervous system (3). TSPO expression is too low to permit easy purification from native membranes. cDNA from mouse was cloned, inserted into vectors, and E. coli cells were transformed with these vectors (4, 5). A small fraction of TSPO was found in bacterial membrane, and the major fraction was detected in inclusion bodies. Different fusion tags for purification were tested (4, 5), good results were obtained with six histidines fusion tag added in N-terminal position of the TSPO. Theoretically, the fusion tag placed in C-terminal position permits to remove non-fully expressed proteins, but in the case of TSPO, the C-terminal domain plays an important role in cholesterol binding (6) and transport (5) and thus is affected by the presence of a tag. The present chapter describes production and purification of recombinant TSPO (RecTSPO). The first part is a comparison of RecTSPO production in an incubator with Luria–Bertani (LB) broth or minimum medium (M9) complemented with isotopes for nuclear magnetic resonance (NMR) studies and in a fermentor with LB broth. The second part describes purification on affinity column. We developed two protocols with Ni-NTA resin, large-scale purification with fast protein liquid chromatography (FPLC) procedure and small-scale purification with “manual chromatography”. Advantages and disadvantages are discussed in the light of the target, that is, the structural approach used. Exchange of detergent on Ni-NTA resin is presented, since deuterated detergents are needed for NMR studies, whereas nonionic detergents are requested for mass spectroscopy analysis with MALDITOF. The last part presents protocols for characterisation of detergent-purified RecTSPO, for instance, to find optimal conditions of detergent/protein ratios, which appeared very useful for structural studies and reconstitution of membrane proteins into liposomes for functional studies.

2. Materials 2.1. Expression

1. cDNA from mouse and human TSPO was inserted in pET15b vector and E. coli bacteria strain BL21(DE3) were transformed with the TSPO cDNA containing plasmid (Novagen, VWR, Fontenay sous bois, France).

Bacterial Overexpressed Membrane Proteins: An Example: The TSPO

31

2. Bacteria were cultured either in LB broth medium (Sigma, Saint-Quentin Fallavier, France) or minimum medium M9 (7). Basic M9 medium: 2 g KH2PO4, 8 g Na2HPO4, 0.5 g NaCl, and 0.5 g MgCl2·6H2O in a final volume of 1 L and adjusted to pH 7.2. Enriched M9 medium: basic M9 medium supplemented with 10-mL oligoelement stock solution and 1-mL solution A. Oligoelement stock solution: 0.5 g EDTA, 83 mg FeCl3, 8.4 mg ZnCl2, 1.3 mg CuCl2 2H2O, 1 mg CoCl2 6H2O, and 1 mg H3BO3 in a final volume of 100 mL. Solution A: 50 mg CaCl2, 1 mg biotine, 1 mg thiamine, and 50 mg ampicillin in 1 mL water. M9 “plus” H2O or M9 “plus” D2O: Enriched M9 medium supplemented with isotopes (D-Glucose 13C6, D-Glucose 13C6 2 H7, (15NH4)2SO4, D2O, Euriso-Tope, Gif sur Yvette, France). 3. Incubation was performed in 0.5–2.0-L flasks and 1.8–2.0-L Fernbachs (D. Dutscher, Brumath, France) in a swiss minitron INFORS HT incubator (INFORS sarl, Massy, France). 4. Fermentation was performed in a 7.5-L bioreactor BioFlo 110 (New Brunswick Scientific, Paris, France). Temperature, pH, dissolved O2, and stirring were controlled and set at 37°C, 7.0 ± 0.1, 20% and 300–1,200 rpm, respectively. 2.2. Purification

1. Bacteria lysis was performed at 4°C in a sonicator (PG 1509) equipped with a 3/16″ tip (MSE, Pocklington, UK). 2. Inclusion bodies from bacteria were collected in 0.5-L bottle with by centrifugation at 5,000 × g, 4°C for 15 min (Beckman J21, Gagny, France). 3. Elimination of long DNA fragments from solubilized inclusion bodies was performed by addition of benzonase (Novagen, VWR, Fontenay sous bois, France). 4. “Manual” purification was performed on 1.2-mL superflow Ni-NTA resin (Qiagen SA, Courtaboeuf, France) loaded in 12-mL Poly-Prep columns (Bio-Rad, Marne la Coquette, France). 5. Large-scale purification was performed with 1-mL His Trap columns (GE Healthcare SA, Orsay, France) and fast protein liquid chromatography (FPLC) equipment (GE Healthcare SA, Orsay, France). 6. Buffer A: 150 mM NaCl and 50 mM Hepes-Na pH 7.8. 7. Detergents used during solubilization and purification processes: Sodium dodecyl sulphate (SDS, Sigma, Saint-Quentin Fallavier, France), N-lauroylsarcosine sodium salt (Sarkosyl), dodecylmaltopyranoside (DDM), dodecylphosphocholine

32

Robert and Lacapère

(DPC), 1,2-dihexanoyl-sn-glycero-3-phosphocholine (DHPC C6), and 1,2 diheptanoyl-sn-glycero-3-phosphocholine (DHPC C7) (COGER, Paris, France). Isotopically labelled dodecylphosphocholine D38 (Euriso-Top, Gif sur Yvette, France). 8. Buffer B: Buffer A supplemented with 1% (w/v) SDS. 9. Buffer C: Buffer B supplemented with 5 mM imidazole. 10. Buffer D: Buffer B supplemented with 250 mM imidazole. 11. Buffer E: Buffer A supplemented with 2.5% (w/v) DPC and 5 mM imidazole. 12. Buffer F: Buffer A supplemented with 1.0% (w/v) DPC and 5 mM imidazole. 13. Buffer G: Buffer A supplemented with 1.0% (w/v) DPC and 250 mM imidazole. 2.3. Characterization

1. Absorption spectra were recorded on UNICAM UV 300 (Fisher Scientific BIOBLOK, Illkirch, France). Extinction coefficient for RecTSPO was calculated using full sequence composition (tag sequence and mRecTSPO or hRcTSPO amino acids) and the ProtParamTools of ExPASy (8). Calculated extinction coefficients are 3.88 and 4.1 (mg/mL)−1 cm−1 for mRecTSPO and hRcTSPO, respectively. 2. Protein concentration was determined using Bio-Rad protein assay and DC protein assay in the presence of detergents (BioRad, Marne la Coquette, France). Bovin serum albumin (Fraction V, Sigma, Saint-Quentin Fallavier, France) was used as a standard. 3. Protein composition was analysed with SDS-polyacrylamide gel electrophoresis (SDS-PAGE) using Miniprotean II electrophoretic materials and power supply (Bio-Rad, Marne la Coquette, France). Stock solution (Sigma, Saint-Quentin Fallavier, France) of 30% aqueous solution of acrylamide– bisacrylamide (ratio 37.5/1) was used to prepare 12.5% gels. Reducing buffer was prepared as described in the Bio-Rad instruction manual. Protein electrophoresis SDS-PAGE standards were broad range (Bio-Rad, Marne la Coquette, France) and Seablue Plus 2 pre-stained standards (Invitrogen, CergyPontoise, France). Gels were stained with EZBlue staining solution (Sigma, Saint-Quentin Fallavier, France). Gels were dried on gel dryer model 583 (Bio-Rad, Saint-Quentin Fallavier France). 4. Detergent concentration was determined using modified colorimetric assays (Bio-Rad, Marne la Coquette, France) as described in Chapter 1 (this volume).

Bacterial Overexpressed Membrane Proteins: An Example: The TSPO

33

3. Methods 3.1. Transfection of Bacteria with TSPO Containing Plasmid

1. Plasmid containing mTSPO and hTSPO cDNA was sequenced to verify insertion into pET15b vector between Nde1 and BamH1 restriction sites, just following His tag and Thrombin cleavage site structurally constituent of pET15b vector. T7 promotor was used as initiator for sequencing. 2. Competent E. coli bacteria strain BL21(DE3) were transformed by plasmid as previously described (7). 3. Transformed E. coli BL21(DE3) cells were inoculated into 10-mL sterile LB broth (25 g/L) supplemented with ampicillin (50 mg/L) to select bacteria containing plasmid. 4. Clones were selected on LB/agarose plates containing ampicillin. 5. We observed that the level of RecTSPO expression decreases with time, thus transformation of E. coli bacteria has to be repeated to maintain a high level of protein expression.

3.2. Expression of TSPO

3.2.1. Small-Scale Production in a 200 rpm Shaking Incubator

Bacterial cells were grown at 37°C in a medium supplemented with ampicillin in an incubator for small-scale production or in a bioreactor for large-scale production. 1. Pre-culture: Transformed E. coli BL21(DE3) cells were inoculated into 10 mL sterile LB broth (25 g/L) supplemented with 50 µg/ mL ampicillin. After 8 h incubation, pre-culture was repeated twice by inoculating 0.5 mL of bacteria into 10 mL fresh LB supplemented with ampicillin. We observed that these pre-cultures increased RecTSPO productions and particularly in the case of mutants. 2. Culture was initiated by transferring the 10 mL pre-culture into 500 mL LB supplemented with ampicillin either in a 2-L flask or 1.8-L Fernbach. Incubation was continued until optical density (OD at 600 nm) reaching a value of 0.7. At this point, 1 mM Isopropyl-1-Thio Beta-D-galactopyranoside (IPTG) was added to induce recombinant protein production (see Note 1). 3. Bacterial cells were harvested by centrifugation usually after 5 h shaking.

3.2.2. Large-Scale Production in a 5-L Bioreactor

1. Three successive pre-cultures were performed. 2. 100 mL of transformed E. coli BL21(DE3) cells were transferred into a bioreactor filled with 5 L LB supplemented with ampicillin.

34

Robert and Lacapère

3. Growing parameters were controlled by computer acting on bioreactor probes and pumps. Temperature was maintained at 37°C by a mix of heating and freezing by circulating water. Stirring was set at 800 rpm. pH was stabilised at 7.1 ± 0.1 by two pumps adding either 1 N NaOH or 1 M phosphoric acid. 4. At 0.7 OD, 1 mM IPTG was added to produce proteins. After 3 h , a mixture of 200 mL glucose 6.25% and ammonium sulphate 1.5% was infused overnight via a peristaltic pump at 0.25 mL per minute of the E. coli BL21(DE3) cells (see Note 2) 5. At the end of the production, when OD at 600 nm reached a value of 8.5, bacterial cells were harvested by centrifugation. Clearly it appeared that fermentation with addition of D-Glucose and ammonium sulphate increased production of mTSPO (see Table 1 and Note 3). 3.2.3. Production of Labelled Protein for NMR Studies

For structural studies by NMR, efficient productions of uniformly 15 N, 13C, and 2H-labelled proteins are needed. This is obtained by adding labelled nutriments to bacteria cultured in a minimal medium (M9) (see Note 4). In order to reduce the cost of such production, we used an efficient method for labelling of recombinant proteins previously described (9) with our medium composition (see Note 5).

Table 1 Yields of mRecTSPO and hRecTSPO produced per litre of broth using the different protocols Method

Broth

Expression mRecTSPO hRecTSPO Volume (L) Final OD time (H) (mg/L) (mg/L)

Incubation

LB only

0.5

1.8

5

22



Incubation

LB only

0.13

2.1

4



17

Fermentation LB only

2.5

2.5

18

82



Fermentation LB complemented with glucose and ammonium sulphate

5.3

8.5

18

195



Fermentation LB complemented with glucose and ammonium sulphate

2

8.3

18



22

Bacterial cells were cultured either by incubation in Fernbach in a thermostated chamber or by fermentation with regulated pH, temperature, and pO2

Bacterial Overexpressed Membrane Proteins: An Example: The TSPO

35

1. Preparation of M9 media (7). M9 minimum medium was prepared and sterilised. Stock solution of oligoelement, solution A and nutriments were prepared with autoclaved H2O and injected through sterile filters (0.22 µm) into sterilised M9 medium. 2. Production of isotopic labelled RecTSPO. After three successive pre-cultures of transformed E. coli BL21(DE3) cells in 20 mL (each) LB/ampicillin medium, cells were transferred into two Fernbachs (1.8 L) and grown at 37°C in 0.5 L LB/ampicillin per Fernbach, shaken at 200 rpm in the incubator. When OD (at 600 nm) reaches a value of 0.7, cells were centrifuged for 20 min at 3,300 × g and 4°C. Cells in the pellet were washed with 120 mL enriched M9 H2O solution and recentrifuged. Cells were resuspended in 250 mL of the appropriate medium (enriched M9 H2O supplemented with glucose and ammonium sulphate or M9 “plus” H2O or M9 “plus” D2O) and incubated at 37°C in a small Fernbach (450 mL) shaken at 200 rpm to allow the recovery of growth and clearance of unlabelled metabolites. Proteins expression was induced after 1 h by addition of 1 mM IPTG. After 18 h, bacterial cells were harvested by centrifugation. 3.3. Purification of Mouse and Human RecTSPO by Immobilised Metal Ion Affinity Chromatography (IMAC)

3.3.1. Solubilization of Inclusion Bodies

Most of RecTSPO was produced in the inclusion bodies of E. coli BL21(DE3) cells. Generally, recombinant membrane proteins can be removed from inclusion bodies either with denaturing agents and the released proteins are then refolded by gradual removal of the denaturing reagents by dilution or dialysis in the presence of detergents, or inclusion bodies can be directly solubilized in SDS or Sarkosyl detergent (7, 10–12). TSPO is a membrane protein that needs detergent to be maintained solubilized in buffers; thus, SDS was used to solubilize TSPO containing inclusion bodies (5, 13) (see Note 6). 1. The bacterial cells were harvested, at the end of each TSPO’s production, by centrifugation at 5,000 × g and 4°C for 15 min, the supernatant was discarded into sodium hypochloride and the pellets resuspended in buffer A (without protease inhibitors). Washed bacterial cells were centrifuged (same conditions as above), the supernatant discarded and the pellets kept at −30°C. The content of one pellet depends on the production conditions and corresponds to roughly 50–100 mL LB biomass (except for specific production for NMR samples). 2. Bacterial lysates were obtained from one (or several) pellet(s) resuspended by adding 20 mL of buffer A, chilled at 0–1°C (for one pellet) and sonicated at 14 µm for 1 min in a water iced bath.

36

Robert and Lacapère

3. Inclusion bodies were collected by centrifugation (5,000 × g, 4°C, 20 min). 4. Inclusion bodies were solubilized at room temperature by adding 20 mL of buffer B to the pellet. The tube was vigorously vortexed (2 min), agitated on a roller (20 min), and centrifuged at 15,000 × g for 1 h at 20°C to avoid precipitation of SDS. Supernatant containing solubilized inclusion bodies was reserved at room temperature before purification. In order to fluidify the solution, 2 µL of benzonase (25 units per µL) was added 10 min before purification. 3.3.2. Large-Scale Purification, Fast Protein Liquid Chromatography (FPLC) Procedure

1. 200 mL of (1%SDS) solubilized inclusion bodies from mRecTSPO production gained by fermentation were injected on 1-mL HisTrap (Ni-NTA) column pre-equilibrated with buffer B. 2. Loading and purification on ÄKTA optimised purification protein system was analysed following changes in OD and protein content of collected fractions (Fig. 1). The first upward step observed in the chromatogram corresponds to the pass through of unbound proteins. The second upward step corresponds to the saturation of the column (corresponding to the binding capacity of 40 mg protein per mL of resin for such column). The third downward step corresponds to the washing of the column. The fourth peak step was induced by the linear imidazole gradient and corresponds to elution of mRecTSPO.

3.3.3. Small-Scale Purification, Manual Chromatographic Procedure

1. Preparation of the column: Empty Poly-Prep column was filled with 1.2 mL (named now one column volume, 1 V) of Ni-NTA resin. Resin was successively washed (by gravity) with 5 V of water, 5 V of 0.1 M ethylenediaminetetraacetic acid solution adjusted to pH 8.0, 5 V of water, 5 V of 0.1 M nickel sulphate solution, 5 V of water, 5 V of buffer A, and 5 V of buffer B. We noticed that the columns can be prepared “in advance” since the resin did not dry when no effluent flew away. 2. Standard purification: 10–60 mL of solubilized proteins were layered on the column with a peristaltic pump (worked at 0.3 mL per minute). Column was washed with 4–5 V of buffer B and collected fractions were named Wn. Protein was eluted with imidazole containing buffer C and collected fractions were named En. Fraction volume of 1 V or 0.25 V was collected in Eppendorf tubes of 1.5 mL for washing and elution fractions, respectively. 3. Purification with exchange of detergent (see Fig. 2): column was loaded as described above, but detergent was exchanged

Bacterial Overexpressed Membrane Proteins: An Example: The TSPO

37 100

3000

OD at 300nm (m AU)

80 70

2000

60 50

1500

40 1000

30 20

500

Imidazole gradient

90 2500

10 0

0 0

2

4

6

8

10

12

14

16

25

35

Fraction Number

116 kDa 97.4 kDa 66 kDa 45 kDa 31 kDa

mRecTSPO

21.5 kDa 14.5 kDa

L fractions

4

W MW FT E 14 17 27 28 29

Fig. 1. Purification of mRecTSPO in SDS. FPLC profile (top panel ) and SDS-PAGE of fractions (bottom panel). Bacterial inclusion bodies were solubilized in SDS (10 mg/mL SDS), loaded on 1 mL His Trap column at a flow rate of 1 mL/min. The chromatogram shows OD at 300 nm since OD at 280 nm is higher and goes rapidly out of scale. It can be divided in four parts (arrows correspond to fractions analysed by SDS-PAGE). Part 1 (fractions 1–6) shows an increase of OD and corresponds to proteins that are not fixed on the column. Part 2 (fractions 4–15) is characterised by a further increase of OD, attributed to the saturation of the column. Part 3 (fraction 16) shows a reduction of the OD and is correlated to the washing of the column (buffer A with 50 mg/mL SDS). Part 4 (fractions 16–35) corresponding to the elution step induced by flow of the imidazole gradient (0–250 mM). A narrow but concentrated peak of mRecTSPO is observed clearly observed. Volume of fractions is 10, 4 and 2 mL for fractions 1–15, 16 and 17–36, respectively. The silver stained SDS-Page (12.5% polyacrylamide) shows that loaded material (lane 1) contains mRecTSPO (arrow on the left ) and numerous other proteins. Flow through fractions (4 and 14 in lanes 2 and 3, respectively) show that no mRecTSPO was present in the first part of the load, whereas significant amount is observed in the second part suggesting an over load of the column. Almost no proteins are detected during the washing step (fraction 17, lane 4). Purified mRecTSPO is clearly observed within the elution peak (fractions 27–29 in lanes 5–7, respectively). Lane 8 shows molecular weight (MW) of broad range from Bio-Rad. The major spot in the gel (lane 5 corresponding to fraction 27) contains 200 ng pure mRecTSPO. The major peak in the chromatogram (fraction 27) contains 20 mg/mL mRecTSPO.

Robert and Lacapère

mRecTSPO ( ) and DPC ( ), mg/mL

38

W1

30

W2

D1

D2

E

25 20 15 10 5 0 0

5 10 Elution volume, mL

15

Fig. 2. Purification of mRecTSPO in DPC. Bacterial inclusion bodies solubilized in SDS (10 mg/mL) were loaded on manually prepared 1.2 mL of Ni-NTA (Qiagen) in 12 mL Poly-Prep column. The chromatogram presents three main steps, washing of the column (W1 and W2), detergent exchange on the column (D1 and D2) and elution of protein (E). Volume of fraction during the washing and detergent exchange steps is 1.4 mL, whereas collected fraction during the elution step is 0.3 mL. The chromatogram was drawn by recording absorption spectra of each fraction, measuring the OD at 280 nm and calculating protein concentration (black diamonds) using extinction coefficient of 3.88 (mg/mL)−1 cm−1. The washing step is initially done (W1) with flowing through loading buffer (buffer B containing SDS) followed by (W2) low imidazole buffer (buffer C) to remove impurities such as protein weakly attached to the resin. The detergent exchange step is performed on protein attached on the column by flowing through high DPC (D1) containing buffer (buffer E) followed by low DPC (D2) containing buffer (buffer F). mRecTSPO is eluted (E) in the presence of high concentration of imidazole (buffer G). Detergent content (opened square) in the eluted fraction was measured using colorimetric method (see Chapter 1, this volume). Detergent concentration profile shows a peak similar to that observed for mRecTSPO, suggesting that the protein is surrounded by detergent molecules. For instance the maximum protein concentration peak is 14 mg/ mL with a DPC concentration of 25 mg/mL, giving detergent protein ratio of 1.7 w/w (i.e. 106 mol DPC per mol mRecTSPO).

within the column during the washing steps. Resin was washed and eluted successively by 2 V buffer B (W1), 2 V buffer C (W2), 3 V buffer E (D1), and 3 V buffer F (D2). Protein was eluted with 2 V of buffer G (E). 4. Purification of samples for NMR studies: The above described protocol allowed changing nature of detergents and their concentrations during washing step with small volume addition. Deuterated DPC is therefore added just before elution (see Note 7). 3.3.4. General Considerations on IMAC Purification

1. The IMAC instruction manual (from Qiagen) gives some specific considerations on its use, in particular for compatibility of reagents and detergent with Ni-NTA matrices. In our

Bacterial Overexpressed Membrane Proteins: An Example: The TSPO

39

Table 2 Compatible and incompatible detergents for recovery of mRecTSPO sPDC

SDS

DDM

Sarkosyl

DPC

insPDC

C12E8

NOG

Triton X-100

DHPC C6 or C7

CHAPS

Detergents forming soluble and insoluble protein detergent complexes (sPDC and insPDC) were characterised by their ability to exchange or not with SDS in the affinity column loaded with mRecTSPO. Sodium dodecyl sulphate (SDS), dodecylmaltopyranoside (DDM), N-Lauroylsarcosine sodium salt (sarkosyl), dodecylphosphocholine (DPC), octaethylene glycol monododecylether (C12E8), N-octyl b D-glucopyranoside (NOG), Triton X-100, 1,2 dihexanoylsn-glycero-3-phosphocholine (DHPC C6), 1,2 diheptanoyl-sn-glycero-3-phosphocholine (DHPC C7), and 3-[(-cholamidopropyl)dimethylammonio]-1-propane sulphonate (CHAPS) were tested at a concentration of 50 mg/mL

hands, no troubleshooting with SDS and sarkosyl up to 5% appeared when used with Ni-NTA resin. 2. The choice of detergent to purify a membrane protein remains an empirical “decision” or a trials and errors procedure. We successfully solubilized mRecTSPO in SDS, exchanged it with DPC and other detergents (see Table 2). However, some other detergents were unable to exchange SDS (see Table 2). Baneres et al. (11) tested different detergents to purify G-protein coupled receptor (GPCR) overexpressed in E. coli. Their succeeding protocol involves GPCR removing from inclusion bodies with urea and refolding within the column in the presence of lauryldimethylamine oxide (LDAO) detergent. Bane et al. (12) attempt to overexpress different human GPCR in E. coli, and NK1 receptor was the only one giving good expression level. Extraction of proteins, mostly localised in inclusion bodies, was only possible with ionic detergent and purification was obtained with DPC. Columbus et al. (14) overexpressed several a-helical membrane proteins from Thermotoga maritima in E. coli. Proteins were extracted from membrane with DDM, fixed on affinity column and detergent exchanged. They screened a broad range of detergent with different physical and chemical characteristics, evaluated various biophysical properties of the solubilized proteins, and emphasised on the need to characterise protein–detergent complex in order to pursue structural studies. 3. The recovery yields after purification depends strongly on the quantity of protein immobilised in the column. We observed a linear relationship between amounts of total proteins deposited on the column and RecTSPO recovered. Figure 3 shows a summary of the various purification of mRecTSPO performed in the presence of different detergents using manual chromatography procedure.

40

Robert and Lacapère

Recovered mRecTSPO, mg

16 14 12 10 8 6 4 2 0 0

25

50

75

100

125

150

Layered Proteins, mg

Fig. 3. Linear relationship between recovered purified mRecTSPO from the 1.2 mL Ni-NTA resin and total proteins deposited on the column. mRecTSPO was isolated in SDS (closed squares), DPC (opened triangles), Sarkosyl (opened circles) or DDM (closed diamonds).

3.4. Characterisation of Purified Membrane Protein 3.4.1. Protein Composition and Oligomeric State

Analysis of samples along the various purification steps (Fig. 1) as well as at the end of the purification process often reveals not only the purity of the sample but also the possible oligomeric state of the protein (Fig. 4). For such purpose, different gel coloration (Coomassie-based and silver stain) and reducing conditions can be used. 1. Prepare a 0.75-mm thick, 12.5% gel following the manufacturer’s instructions based on Laemli method (15). Make the resolving (lower) 12.5% gels the day before of use and let it stand at 4°C to obtain reproducible results. Prepare the staking gel 2 h before sample migration. 2. Prepare samples in reductive or non-reductive conditions with or without 650 mM mercaptoethanol and in mild reductive conditions with variable concentrations of dithiothreitol (DTT) from 60 µM to 180 mM. To avoid protein aggregation, samples should not be heated. 3. All wells were filled with identical volumes of the buffer in which samples were diluted. The recommended voltage conditions for optimal resolution with minimal thermal band distortion were 200 V at constant voltage setting. Separations ran over less than 50 min. 4. Two methods of staining were used, a “single” step Coomassie blue-based one with the EZBlue gel staining reagent and the second one with silver nitrate staining (16). The gels were fixed in 50% ethanol/10% acetic acid for 30 min and they were slowly rehydrated by incubation in 5% ethanol/1% acetic acid for 15 min, followed by three times 5 min wash in distilled water.

Bacterial Overexpressed Membrane Proteins: An Example: The TSPO

41

Fig. 4. SDS-PAGE profile of hRecTSPO in various reducing conditions. Each lane was loaded with 7.5 µL of a solution made of 2.5 µL SDS non-reducing buffer and 5 µL of SDS solubilized hRecTSPO (4.5 µg total amounts) in the presence of variable concentrations of DTT. None (lane 2), 60 µM (lane 3), 180 µM (lane 4), 600 µM (lane 5), 1.8 mM (lane 6), 6 mM (lane 7), 18 mM (lane 8), 60 mM (lane 9) and 180 mM (lane 10). hRecTSPO is at a final concentration of 30 µM and DTT over protein molar ratio ranged thus from 2 to 6,000. Lane 11 contained 650 mM mercaptoethanol, concentration used in Laemli reducing conditions. Lanes 1 and 12 correspond to Invitrogen protein standards in non-reductive and reductive conditions, respectively. Gel was made with 12.5% polyacrylamide and stained with Ezblue.

For EZBlue method, staining intensity generally reached a maximum within 45 min to 1-h reagent incubation, after gels were rinsed with distilled water to minimise the background. For silver staining method, fixed and rehydrated gels were incubated in freshly prepared sodium thiosulphate (20%) for 1 min and then rinsed in distilled water three times for 30 s. Then, gels were stained by 20-min incubation in freshly prepared silver nitrate solution (100 mg silver nitrate and 37.5 µL saturated formaldehyde for 50-mL water solution) and then washed in distilled water two times for 30 s. Staining was revealed by incubation in developer solution (3% sodium carbonate, 0.4% sodium thiosulphate and 25-µL saturated formaldehyde for 50-mL water solution) as long as needed (1–2 min). Finally, reaction was stopped in 5% acetic acid for at least 5 min and gels were kept in distilled water. 3.4.2. Optimal Detergent Concentration

Detergent can be exchanged during the purification process using manual chromatography procedure (see Subheading 3, Fig. 2). The concentration needed to get full protein solubilization can be determined experimentally by repeating purification in the presence of various detergent concentrations in the washing and elution buffers. Figure 5 shows the total recovery of mRecTSPO as a function of DPC concentration. At low DPC concentrations, some proteins remained stacked on the Ni-NTA resin and were collected by elution in the presence of high SDS concentration.

Robert and Lacapère

mRecTSPO recovered, mg

42

10 8 6 4 2 0 0

1

2

3

4

5

6

7

8

9

10 11

DPC, mg/ mL

Fig. 5. Retrieval of purified mRecTSPO from affinity column as a function of added DPC concentrations during washing steps (D2). Protocol was identical to that presented in Fig. 3.2. mRecTSPO eluted in DPC (round black symbols). Resting mRecTSPO stacked in the column is eluted with buffer A supplemented with 50 mg/mL SDS (square white symbols). Total amount of mRecTSPO eluted in both DPC and SDS (triangular white symbols) is almost constant and close to 9 mg.

We suggested that a minimal concentration of 5 mg/mL DPC was necessary to elute a stable mRecTSPO detergent complex, which did not precipitate. It suggests that when DPC/mRecTSPO ratio was too low, protein tends to precipitate. Using the method described in a previous chapter, we measured the detergent content of each fraction during the purification process (Fig. 2). The DPC concentration started at a value of 10 mg/mL corresponding to the content of the elution buffer G. Then DPC concentrations increases similarly to the mRecTSPO elution profile. This shows that mRecTSPO forms a complex with DPC and the calculated DPC/mRecTSPO ratio was 1.6 (w/w) for the elution peak. This ratio was higher for the following fractions.

4. Conclusions Optimising bacterial E. coli cultures permit us to overexpress and purify high amounts of both mRecTSPO and hRecTSPO. Both proteins are pure and appear as a single band in SDS-PAGE in reducing conditions. However, in non-reductive conditions mRecTSPO, which has no cystein in its amino acid sequence, still appeared as a 20-kDa single band (Fig. 1), whereas polymers are observed for hRecTSPO (Fig. 4). This is consistent with the presence of two cysteins in the hTSPO amino acid sequence that could form disulphide bridges inducing the formation of polymers. Increasing the concentration of reducing agent induced

Bacterial Overexpressed Membrane Proteins: An Example: The TSPO

43

progressive polymers disappearance (Fig. 4). Furthermore, careful analysis of the gel profile reveals that apparent molecular weight of hRecTSPO monomers increases with DTT concentrations (compare lanes 1–7, in Fig. 4). This suggests that cysteins of hRecTSPO can form an intramolecular disulfide bridge. The initial monomer with S–S bond has a more compact structure run as a low molecular weight protein, whereas final monomer without S–S bridge has a more opened structure running as higher molecular weight protein. Along with the change in reduction conditions, the presence of the two forms of the monomers can coexist (see lane 5 in Fig. 4). This demonstrates that electrophoresis in reducing, non-reducing and partial reductive conditions could give information on protein conformations.

5. Notes 1. Final optical density (OD at 600 nm) reached for cultures in Fernbachs was higher than in flasks, probably because surface contact between LB medium and air was larger in Fernbachs than in flasks. 2. Many media and feeding solutions are described in the literature (17), but we choose to add only essential nutriments (glucose and ammonium sulphate) to LB. 3. For unknown reasons, we observed that amount and concentration of hTSPO were always lower than mTSPO whatever the production process used (see Table 1). We noticed that hTSPO was produced in polymeric forms in the inclusion bodies. 4. Production of labelled protein for NMR studies was performed step by step. A first enriched M9 medium solution was made in H2O with non-labelled ammonium sulphate and glucose. Then, labelled nutriments 15N and 13C were substituted sequentially to the medium in order to follow any changes in bacterial growth and recombinant protein production (see Table 3). Finally, a medium made with inclusion of labelled nutriments in D2O (M9D2O) was used to get fully labelled protein. 5. Marley et al. (9) described a protocol where cells were first grown in LB medium (since it favours protein expression yields), then concentrated and grown in a smaller volume of minimal M9 medium supplemented with labelled isotopes. They studied the effect of M9 over LB ratio (from 0.125 to 1), and suggested that the best conditions taking into account cost and protein production was the 0.25 ratio that we used in the present work (see Table 3). Using our enriched M9 media, the results show that averaged amount of recovered

44

Robert and Lacapère

Table 3 Yields of mRecTSPO produced per litre of broth in different isotopic conditions Isotope

None

15

N

15

N, 13C, deuterated H2O

15

N, 13C, 2H, deuterated H2O

Ratio M9/LB

0.25

0.25

0.25

0.25

Number of productions

1

3

2

6

mRecTSPO per litre 14.2 of LB (mg/L)

17 ± 1

14 ± 1

6.3 ± 0.4

Columns show non-isotopic M9 medium (second column), 15N-enriched M9 medium (third column), 15N, 13C and deuterated H2O-enriched M9 medium (third column), and 15N, 13C, 2H and deuterated H2O-enriched M9 medium (fourth column). 13C, 2H were gained from labelled D-Glucose and 15N from ammonium sulphate

mRecTSPO by LB litre (15 mg/L) is close to that obtained in sole LB broth (22 mg/L) using a different protocol (see Table 1). We do not have explanation for the reduction observed using deuterated D-Glucose. 6. Solubilization of integral membrane proteins (such as TSPO) from inclusion bodies in SDS seems to be less drastic than high dilution processes used for renaturation after urea and guanidine hydrochloride treatments. Such treatment greatly diluted proteins and thus needs further concentration steps often difficult with detergent-solubilized membrane proteins. 7. Purification procedure has to be adapted to needs of functional and structural studies. Batch procedures are often used by molecular biologist and permit to get easily small amounts of diluted purified protein (use of large washing, elution volumes compared to resin volume). Manual chromatography might be “old fashion” but reduces strongly void volumes of tubes, connections and pumps present in FPLC or HPLC apparatus. It permits collection of small concentrated elution volumes. If the advantages of FPLC are numerous (automatisation, control of parameters, use of pre-packed columns), it needs important volumes of mobile phases (from 50 to 500 mL), very expensive when using deuterated detergent and water.

Acknowledgment The authors would like to thank Professor V. Papadopoulos for the generous gift of TSPO plasmids. They would gratefully thank M.A. Ostuni for his help in the critical reading of this chapter. This work was supported by CNRS (Centre National de la Recherche Scientifique) and ANR (Agence National pour la Recherche) Grant 06-Blan-0190-01 to JJL.

Bacterial Overexpressed Membrane Proteins: An Example: The TSPO

45

References 1. Lacapere J-J, Pebay-Peyroula E, Neumann J-M, Etchebest C (2007) Determining membrane protein structures: still a challenge! Trends Biochem Sci 32:259–270 2. Peti W, Page R (2007) Strategies to maximise heterologous protein expression in Escherichia coli with minimal cost. Protein Expr Purif 51:1–10 3. Papadopoulos V, Baraldi M, Guilarte TR, Knudsen TB, Lacapere J-J, Lindemann P, Noremberg MD, Nutt D, Weizman A, Zhang M-R, Gavish M (2006) Translocator protein (18 kDa): New nomenclature for the peripheral-type benzodiazepine receptor based on its structure and molecular function. Trends Pharmacol Sci 27:402–409 4. Garnier M, Dimchev AB, Boujrad N, Price JM, Musto NA, Papadopoulos V (1994) In vitro reconstitution of a functional peripheral-type benzodiazepine receptor from mouse Leydig tumor cells. Mol Pharmacol 5:201–211 5. Li H, Papadopoulos V (1998) Peripheric-type benzodiazepine receptor function in cholesterol transport. Identification of a putative cholesterol recognition/interaction aminoacid sequence and consensus pattern. Endocrynology 139:4991–4997 6. Jamin N, Neumann J-M, Ostuni MA, Kim NV, Yao ZX, Murail S, Robert J-C, Giatzakis C, Papadopoulos V, Lacapere J-J (2005) Characterization of the cholesterol recognition amino acid consensus sequence of the peripheral-type benzodiazepine receptor. Mol Endocrinol 19:588–594 7. Sambrook J, Fritsch EF, Maniatis M (1989) Molecular cloning: a laboratory manual, 2nd edn. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY 8. Gasteiger E, Gattiker A, Hoogland C, Ivanyi I, Appel RD, Bairoch A (2003) ExPASy: The proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res 31:3784–3788

9. Marley J, Lu M, Bracken C (2001) A method for efficient isotopic labeling of recombinant proteins. J Biomol NMR 20:71–75 10. Charbonnier F, Köhler T, Pechère JC, Ducruix A (2001) Overexpression, refolding, and purification of the Histidine-tagged outer membrane efflux protein OprM of Pseudomonas aeruginosa. Protein Expr Purif 23:121–127 11. Baneres J-L, Martin A, Hullot P, Girard J-P, Rossi J-C, Parello J (2003) Structure-based analysis of GPCR function: conformational adaptation of both agonist and receptor upon leukotrienes B4 binding to recombinant BLT1. J Mol Biol 329:801–814 12. Bane SE, Velasquez JE, Robinson AK (2007) Expression and purification of milligram level of inactive G-protein coupled receptors in E. Coli. Protein Expr Purif 52:348–355 13. Lacapere J-J, Delavoie F, Li H, Péranzi G, Maccario J, Papadopulos V, Vidic B (2001) Structural and fonctional study of reconstituted peripheral benzodiazepine receptor. Biochem Biophys Res Commun 284: 536–541 14. Columbus L, Lipfert J, Klock H, Millett I, Doniach S, Lesley SA (2006) Expression, purification, and characterization of Thermotoga maritima membrane proteins for structure determination. Protein Sci 15:1–15 15. Laemli UK (1970) Cleavage of structural proteins during the assembly of the head of acteriophage T4. Nature 227:680–685 16. Yan JX, Wait R, Berkelman T, Harry RA, Westbrook JA, Wheeler CH, Dunn MJ (2000) A modified silver staining protocol for visualisation of proteins compatible with matrixassisted laser desorption/ionizatioin and electrospray ionization-mass spectroscopy. Electrophoresis 21:3666–3672 17. Riesenberg D, Gunthke R (1999) High cell density cultivation of microorganisms. Appl Microbiol Biotechnol 51:422–430

as

Chapter 4 Insect Cell Versus Bacterial Overexpressed Membrane Proteins: An Example, the Human ABCG2 Transporter Alexandre Pozza, José M. Pérez-Victoria, and Attilio Di Pietro Abstract The multidrug resistance phenotype of cancer cells has been often related to overexpression of plasma membrane ATP-binding cassette transporters, which are able to efflux many types of drug by using the energy of ATP hydrolysis. ABCG2 is a half-transporter recently involved. Its purification would help to understand the mechanism of both transport and its inhibition. Biophysical, structural, and functional studies are consuming great amounts of homogeneous purified proteins and require efficient overexpression systems. Heterologous overexpression of human membrane proteins is actually a challenge because these proteins are toxic for the host, and both translation and chaperone systems of the host are not well adapted to the biosynthesis of human proteins. Overexpression of ABCG2 has been assayed in both bacterial and insect cell/baculovirus systems. Although it was highly overexpressed in bacterial system, neither transport nor ATPase activity was found within inverted membrane vesicles. By contrast, insect cells/baculovirus system produces a low amount of protein, a part of which is active. Key words: ABCG2, ATPase activity, Baculovirus/insect cells, Biosynthesis and activity inhibitors, Drug transport, Heterologous membrane protein overexpression, Selected bacteria strain

1. Introduction ABCG2 is a human ABC (“ATP-binding cassette”) half-transporter involved in the multidrug resistance phenotype of cancer cells (1–3). This membrane protein has been heterologously overexpressed in Lactoccocus lactis bacteria (4), Pichia pastoris yeast (5), and both Sf9 and High-Five baculovirus-infected insect cells (6–9) as well as in Xenopus laevis (10). The insect cells/baculovirus system has been used despite significant criticisms. Indeed, the insect cell membranes contain a very low amount of cholesterol, which has been recently shown to modulate ABCG2 activity (11, 12). For instance, purified ABCG2 has been obtained only Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_4, © Springer Science+Business Media, LLC 2010

47

48

Pozza et al.

from High-Five insect cells (8, 9). Other human ABC transporters, also involved in the multidrug resistance phenotype, have been overexpressed in heterologous systems. P-glycoprotein has been overexpressed in Escherichia coli bacteria (13–15), P. pastoris yeast (16), baculovirus/insect cell system (17), and also in X. laevis (18). Multidrug resistance protein 1 has also been obtained in P. pastoris yeast (19) and baculovirus/insect cells system (20). The heterologous overexpression of human membrane proteins is quite difficult for several reasons. First, the lipid composition of host membranes is different from human cells (21), and specific protein–lipid interactions have a significant impact on the correct insertion, folding, structural integrity, and optimal functionality (22). Second, transcription, translation, and chaperone machinery are not fully adapted to human protein biosynthesis, and the quality control system of integral membrane proteins is less efficient than in human cells; for example, an inactive ABCG2 mutant was addressed to the cell surface membrane in insect Sf9 cells but not in human HEK-293 cells where it was hardly expressed, underglycosyled and localized inside the ER (23). Finally, some motifs, such as ATPase domains and transmembrane domains, may be toxic for the host (24) and induce growth suppression. It is worthwhile mentioning that functional overexpression of some membrane proteins in either bacteria, yeast, or insect cells is not possible as described by Tate et al. for the rat serotonin transporter (25) where only mammalian cells could produce a functional protein. In our case, ABCG2 overexpression has been tried in both bacterial and baculovirus/insect cell systems. In E. coli, different combinations of bacteria strains, expression plasmid, and growth conditions were assayed. Under the best conditions, a high amount of protein was produced, but the growth was stopped upon induction, thus limiting the bacteria biomass. To overcome this problem, we isolated a toxicity mutant from BL21 (DE3) as described by Miroux and Walker (26). Although the selected strain grew upon induction and produced ABCG2 as well as parental strain, neither ATPase activity nor transport has been detected within inverted membrane vesicles. This result was contradictory because an ABCG2-mediated transport of ethidium bromide has been detected by fluorescence on intact bacteria. Moreover, bacteria-overexpressed ABCG2 could only be solubilized by SDS. The insect cell/baculovirus system allowed obtaining functional ABCG2 within both whole cells and inverted membrane vesicles. Transport activity was monitored on whole cells by flow cytometry with rhodamine 123 as a substrate and on inverted membrane vesicles by fluorescence with Hoechst33342. Insect cell-overexpressed ABCG2 was partially solubilized by mild detergents, allowing its purification through nickel chromatography.

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

49

Purified ABCG2 exhibited a high vanadate-sensitive ATPase activity and was able to bind a number of substrates and inhibitors with high affinity. However, additional studies indicated that the insect cell/baculovirus system was not ideally adapted since ABCG2 was produced as two forms with different migration in SDS–PAGE; a number of evidences suggested that only the upper form was active. The homogeneity of any heterologously overexpressed protein should be checked prior to biochemical, biophysical, and especially structural studies. The transport activity within either inverted membrane vesicles or whole cells is not sufficient to guarantee that all expressed protein is indeed fully functional.

2. Materials 2.1. Host Strain and Culture Medium

1. Bacterial strains: E. coli strains BL21(DE3), BL21 (DE3) pLysS, Origami B(DE3), and Rosetta(DE3) (Novagen). The C41(DE3) and C43(DE3) strains were kindly given by Dr. J.E. Walker (MRC Cambridge, UK). 2. Bacterial medium: LB medium (10 g tryptone, 5 g yeast extract, 10 g NaCl, qsp 1 l) or 2TY medium (16 g tryptone, 10 g yeast extract, 5 g NaCl, qsp 1 l), both available from Bio101. 3. Insect cell strains: Sf9 and High-Five strains were kindly given by Dr. D. Hulmes from our Institute. 4. Insect cells medium: BacVector (Novagen) medium where 10% FCS were added, and High-Five cells in EXPRESS FIVE SFM supplemented with l-glutamine. Drug and chemical chaperone treatments were performed in 2× Grace medium upon twofold dilution with versole water (Aggettant, Lyon).

2.2. Molecular Biology

1. Expression plasmids: pET21b(+) and pTriEx-4-Neo were purchased from Novagen. The pcDNA3 plasmid containing R482T-ABCG2 cDNA was kindly provided by Dr. D. D. Ross (University of Baltimore, MD). 2. Site-directed mutagenesis: The R482T-ABCG2 cDNA was mutated to obtain R482-ABCG2 cDNA by site-directed mutagenesis using a “Quick-Change Site-directed mutagenesis” kit (Stratagene, La Jolla, CA). The primers used were: 5¢-CTGTTATCTGATTTATTACCCATGAGGATGTTA CCAAGTATTATATTTACC-3¢ and 5 ¢ - G G TA A ATATA ATA C T T G G TA A C AT C C T C AT GGGTAATAAATCAGATAACAG-3¢.

50

Pozza et al.

3. Baculovirus generation: Baculovirus were generated using BacVector Triple Cut-3000 transfection kit according to the manufacturer’s instructions (Novagen, VWR, Fontenay-sousBois, France). PtriEx-4-Neo was used as recombinant transfer plasmid and resuspended in 10 mM Tris–HCl, 0.1 mM EDTA, pH 8. All molecular biology experiments were performed with aerosol-barrier pipette tips to avoid contaminations. 2.3. Biochemistry

1. Chemical compounds: Most chemical were purchased from Sigma-Aldrich (Saint-Quentin Fallavier, France). Detergents: Fos-choline 16, Anatrace, Maumee, USA; Tris(2-carboxyethyl) phosphine (TCEP), Perbio Brebières, France; 3-((3-cholamidopropyl) dimethylammonio]-1-propanesulfonate (CHAPS), Euromedex, Souffelweyersheim, France; n-Dodecyl b dmaltopyranoside, Alexis Biochemicals, Axxora, Villeurbanne, France. Other compounds: Pheophorbide a (Frontier Scientific, Logan, USA), Ko 143 (kindly provided by Dr. A. H. Schinkel, The Netherlands Cancer Institute of Amsterdam, The Netherlands), 6-prenylchrysin, and other flavonoids (synthesized by Dr. A. Boumendjel, University of Grenoble, France), GF120918 kindly given by GlaxoSmithKline (Madrid, Spain), and perfluoro-octanoic acid (PFO) from Fluorochem, Derbyshire, UK. 2. Protein purification and concentration: Ni–NTA agarose (Qiagen, Courtaboeuf, France). Purified protein was concentrated using a centrifugal concentrator with a molecular weight cut-off of 30 kDa (Millipore, UK). Imidazole was removed using desalting column (Econo-Pac 10 DG, Bio-Rad). 3. Bacterial solubilization buffer: 0.5% SDS, 0.1 M KPO4, 15% glycerol, 0.1 M NaCl, 1 mM DTT. 4. Cell solubilization buffer: 50 mM HEPES/NaOH, pH 8, 18 mM CHAPS, 0.5 M NaCl, 10 mM imidazole, 20% glycerol. 5. Baculovirus/insect cell solubilization buffer A: 50 mM HEPES, pH 8, 20% glycerol, 0.3 M NaCl (with CHAPS or fos-choline 16), 5 mM TCEP, 10 µl/ml protease inhibitors. 6. Baculovirus/insect cell solubilization buffer B: 2% SDS, 50 mM HEPES, pH 8, 5 mM TCEP, and 10 µg/ml protease inhibitors. 7. Washing buffer A: 0.05% SDS, 0.1 M KPO4, 15% glycerol, 0.1 M NaCl, 1 mM DTT. 8. Washing buffer B: 50 mM HEPES/NaOH, pH 8, 18 mM CHAPS, 0.5 M NaCl, 30 mM imidazole, and 20% glycerol.

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

51

9. Elution buffer A: 0.05% SDS, 0.1 M KPO4, 15% glycerol, 0.1 M NaCl, 1 mM DTT, 0.2 M imidazole. 10. Elution buffer B: 50 mM HEPES/NaOH, pH 8, 18 mM CHAPS, 0.5 M NaCl, 250 mM imidazole, and 20% glycerol. 11. Dialysis buffer: 0.05% SDS, 0.1 M KPO4, 15% glycerol, 0.1 M NaCl, 1 mM DTT. 12. Fluorescence: Photon Technology International Quanta Master I spectrofluorimeter. Samples were loaded into a Suprasil quartz cuvette with an optical path of 5 mm (Hellma). 13. Circular dichroism: CD6 Jobin-Yvon dichrograph using quartz Suprasil cuvettes (Hellma) with an optical path of 2 mm. Protein concentration was determined with a visible/ UV DU640 Beckman spectrophotometer. 14. Dynamic light scattering: Zetaiser Nano S from Malvern Instruments (Worcestershire, UK). Buffer viscosity was measured with an Uddelohde viscosimeter from Schott-Geräte, and the refractometric coefficient with an ABBE refractometer. A Suprasil quartz cuvette with a 5-mm optical path was used to monitor particle size. 15. Cell lysis: SLM AMINCO French press. Insect cells were broken by successive passages through a 25 G × 5/8″ needle. 16. Cell lysis buffer: 1% SDS, 100 mM NaCl, 20% glycerol, 100 mM TCEP, 10 µl/ml protease inhibitors. 17. SDS–Polyacrylamide gel electrophoresis (SDS–PAGE) and western blotting: Mini-protean III from Bio-Rad. 18. Separating gel: 8% acrylamide, 0.21% bisacrylamide, 0.320 M Tris–HCl, pH 8.8, 0.1% SDS, 0.1% ammonium persulfate, 0.1% TEMED. 19. Staking gel: 5% acrylamide, 0.13% bisacrylamide, 0.130 M Tris–HCl, pH 6.8, 0.1% SDS, 0.1% ammonium persulfate, 0.1% TEMED. 20. Running buffer: 25 mM Tris–HCl, 192 mM glycine, 0.1% SDS. 21. Sample buffer: 62.5 mM Tris–HCl, pH 6.8, 10% glycerol, 2% SDS, 0.00625% bromophenol blue, and 0.1 M b-mercaptoethanol (added extemporaneously). 22. SDS–PAGE migration: Followed using prestained molecular weight markers (broad range, from Bio-Rad). 23. SDS–PAGE gels stain: 0.1% coomassie blue R 250 (w/v), 40% methanol (v/v), 10% acetic acid from 15 to 30 min. The SDS–PAGE gels were destained using 40% methanol, 10% acetic acid.

52

Pozza et al.

24. Electrotransfer membranes: Nitrocellulose (Bio-Rad). 25. Electrotransfer: Bio-Rad Trans-Blot SD electroblotter and a PowerPac 200 generator (for 60 min at 90 mA). 26. Transfer buffer: 25 mM Tris–HCl, pH 7.4, glycine 0.7 M. 27. Blocking buffer: 1% nonfat-dry milk (w/v), 0.1% Tween 20 (v/v), 10 mM Tris–Borate, 150 mM NaCl, pH 7.4). The blocking was performed for 1 h at room temperature. 28. Primary antibody buffer: Same composition that blocking buffer supplemented with monoclonal antibody anti-ABCG2 BXP21 (Alexis Biochemicals) diluted at 1:250. Primary antibody binding was performed for 1 h at room temperature. 29. Secondary antibody: Goat anti-mouse IgG (H + L)-alkaline phosphatase (AP) conjugate (Bio-Rad), used at 1:2,300 dilution for 1 h at room temperature. 30. Immunorevelation: “AP color Development Kit” from BioRad. Between each step, three washings for 5 min were performed with 10 ml of blocking buffer without dry milk. 31. Buffer A: 50 mM Tris–HCl, pH 8, 5 mM MgCl2, 1 mM DTT, 1 µg/ml DNase/RNase. 32. Buffer B: 20 mM Tris–HCl, pH 8, 1.5 mM EDTA. 33. Buffer C: 50 mM Tris–HCl, pH 8, 0.3 M sucrose, 1 mM EDTA. 34. Buffer D: 10 mM HEPES, pH 8, 10 mM NaCl. 35. Buffer E: 10 mM HEPES, pH 8, 50 mM NaCl, 0.3 M mannitol. 36. Transport buffer: 50 mM HEPES, pH 8, 2 mM MgCl2, 8.5 mM NaCl, 20 µg/ml pyruvate kinase, 4 mM phosphoenolpyruvate. 37. Reaction buffer (for ATPase activity measurements): 167 mM HEPES, pH 8, 5 mM DTT, 13.3 mM phosphoenol pyruvate, 200 µg/ml pyruvate kinase, 33.3 mM sodium azide, 1.7 mM EGTA, 3.3 mM ouabaine. 38. Stopping solution: 0.5% ammonium molybdate, 6% SDS, 3% ascorbic acid, and 0.5% HCl. 39. Revelation solution: 2% sodium citrate, 2% sodium arsenate, 2% acetic acid. 40. Circular dichroism detergent buffer: 10 mM NaPO4, pH 8, 2% glycerol, 1 mM DTT, 0.05% detergent (either SDS for ABCG2 R482T from bacteria or dodecylmaltoside for the protein from insect cells). 41. Fluorescence detergent buffer: 50 mM HEPES/NaOH, pH 8, 18 mM CHAPS, 0.5 M NaCl, and 20% glycerol.

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

53

3. Methods 3.1. cDNA Integration in Host Cells and Protein Overexpression

1. E. coli strains were transformed with pET21b(+)/ABCG2 R482T using a thermal-shock procedure. The transformed bacteria were plated onto an LB plate containing 100 µg/ml ampicillin and incubated overnight at 37°C (see Note 1).

3.1.1. Bacteria Transformation and ABCG2 Expression

2. A single colony was picked and inoculated into a 500-ml erlen containing 100 ml of LB supplemented with 100 µg/ml ampicillin. The preculture was incubated overnight at 37°C with shaking at 140 rpm.

3.1.1.1. Bacterial Transformation and Culture

3. Next morning (see Note 2), the preculture was diluted at OD600 nm = 0.1 into a 5-l flask containing 1 l of prewarmed 2YT, and the culture was performed at 37°C with shaking at 140 rpm. 4. When OD600 nm reached about 0.6, IPTG was added to a final concentration of 1 mM and bacteria were incubated with shaking. After induction, the growth stopped with BL21(DE3), Origami B(DE3) and Rosetta(DE3) strains, and no expression was observed with BL21(DE3)pLysS. Under these conditions, the biomass was low, and other parameters were therefore tested. First, several induction temperatures (20, 25, 30, and 37°C) and inducer concentrations (0.025, 0.05, 0.1, and 1 mM) were checked but without success. Second, induction was tried at high OD600 nm (1.5 to 3): a high biomass was obtained but the growth was stopped. Poor-culture medium, such as LB, has been used but growth arrest was still observed. Growth arrest was also observed with the Rosetta (DE3) strain, excluding codon bias as a possible cause. Since ABCG2 is composed of two toxic motifs known to induce growth suppression upon induction (25), a mutant of the BL21(DE3) strain that was still able to grow after induction has been selected. Using pTriEx in bacteria gave even worse results than pET21, and protein expression was assayed with either N-terminal or C-terminal hexahistidine tag, or without tag.

3.1.1.2. Obtaining Mutant Strains by Host Selection

The mutant strain was selected as described by Miroux and Walker (26) with minor modifications. In our case, bacteria should be taken just before the recovery growth. 1. A single colony was inoculated into 5 ml of 2YT medium supplemented with 100 µg/ml ampicillin. The culture was incubated at 30°C with shaking at 140 rpm. 2. When OD600 nm = 0.8, the culture was inducted by 1 mM IPTG.

54

Pozza et al.

3. Cell growth was monitored by measuring absorbance at 600 nm every hour. 4. Upon induction cell growth stopped, and just before the recovery growth (4 h in our case) the cells were collected. 5. The cells were immediately diluted with 2YT medium according to serial dilutions between 1:10 and 1:10,000. 6. The different dilutions were then plated onto 2YT agar plates containing 1 mM IPTG and 100 µg/ml ampicillin. 7. After overnight incubation at 37°C, two cell populations were present: large colonies represented mutants having lost the ability to express ABCG2 but keeping ampicillin resistance, whereas small colonies corresponded to mutants still able to express ABCG2 and ampicillin resistant. These last mutants should be isolated. 8. Protein expression and growth kinetics were monitored on several mutants by western blot. The mutant with highest expression level, and without toxic effects, was selected and named PV6. 9. The selected PV6 mutant strain was cured (see Note 3) of the plasmid using the following procedure. They were maintained in exponential phase in LB medium containing 1 mM IPTG, and every day, serial dilutions were plated onto LB agar plates containing either 1 mM IPTG or 100 µg/ml ampicillin or both. The plasmid was lost, when the absence of colony on LB agar plate containing ampicillin was concomitant with a lot of colonies observed on LB agar plate containing IPTG taking the size of the colony into account. This procedure took about 1 month and was checked monitoring the absence of plasmidic DNA. 10. Finally, cured PV6 mutants were transformed with pET21b(+)/ABCG2 R482T, and protein expression and growth kinetics were monitored. This step was very important to check that the mutation is indeed in the bacteria and not in the plasmid. This procedure allowed obtaining a selected strain producing ABCG2 and growing as fast as noninduced selected and parental strains (Fig. 1a, b). In contrast to PV6, C41(DE3) and C43(DE3) failed to express ABCG2, showing that the mutant host must be adapted to the protein to be expressed. On the contrary, PV6 was able to produce high level of active BmrA, under similar conditions as with C41. Electron microscopy analysis showed that ABCG2 was not located in intracellular membranes of PV6, as described by Arechaga et al. (27) with C41(DE3) and C43(DE3) strains (see Note 4).

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

55

Fig. 1. ABCG2 overexpression in bacterial and insect cells/baculovirus systems. (a) Time course of ABCG2 expression in the parental and selected PV6 strain. After induction by 1 mM IPTG, when OD600 nm reached 0.5, the ABCG2 expression was assessed at the times indicated by SDS–PAGE gel, stained with coomassie blue and revealed by western blot with the BXP-21 antibody. (b) Kinetics of growth of parental or selected strain upon induction. When OD600 nm reached 0.5, the parental and selected strains were either induced or not, then OD600 nm was measured every hour during 5 h. (■), BL21(DE3)/pET21a(+) ABCG2 R482T; (§), BL21(DE3)/pET21a(+) ABCG2 R482T induced; (▲), PV6/pET21a(+) ABCG2 R482T; ( ), PV6/pET21a(+) ABCG2 R482T induced. (c) Time course of ABCG2 expression in Sf9 and High-Five insect cells. Indicated times represent the number of hours after cell infection. Insect cells were infected with an M.O.I of 20 for 2 h and cultured in T25 flasks. 10 µg of protein were loaded on each lane. (d) Comparative overexpression of ABCG2 in High-Five using different culture methods. Lane 1, ABCG2 overexpressed in roller bottle; lane 2, ABCG2 overexpressed in spinner; lane 3, ABCG2 overexpressed in flask. The insect cells were infected with an M.O.I of 20 for 2 h then cultured in the different culture systems. The roller bottle and spinner cultures were conducted at 20 rpm. The insect cells were harvested 72-h postinfection, and 10 µg of protein were loaded on each lane.

3.1.1.3. ABCG2 Overexpression Using a Selected Strain

1. A single PV6 colony was inoculated into a 500-ml erlen containing 100 ml of LB supplemented with 100 µg/ml ampicillin. The preculture was incubated overnight at 37°C with shaking at 140 rpm. 2. Next morning, the preculture was diluted at OD600 nm = 0.1 into a 5-l flask containing 1 l of prewarmed 2YT and the culture was performed at 30°C with shaking at 140 rpm. 3. When OD600 nm = 0.8, IPTG was added to a final concentration of 1 mM and the bacteria were incubated under shaking. 4. After 5-h postinduction (2 < OD < 2.5), the bacteria were harvested. Functional assays have been tried on intact bacteria by transport measurement of fluorescent compounds, but unfortunately without success because Gram-negative bacteria are

56

Pozza et al.

poorly permeable. The TolC and TolR genes have therefore been inactivated (see Note 5). Mutant bacteria were more permeable and an ethidium bromide transport inhibitable by specific inhibitors was observed, but no ABCG2-dependent multidrug resistance has ever been observed. We tried to select a host mutant by spreading over drug-containing plates but also without success. ABCG2 overexpression using autoinduction method (28) has not been tried, although it constitutes an effective method to overproduce toxic proteins. 3.1.2. Baculovirus Generation and ABCG2 Overexpression

1. Seed 0.5 × 106 exponentially growing Sf9 insect cells in each well of a 6-well plate, then incubate at 27°C for 30 min. Use only the cultures that contain nearly 95% healthy cells.

3.1.2.1. Insect Cell Transfection

2. In a sterile 6-ml polystyrene tube, combine 0.1 µg of the BacVector-3000 Triple Cut Virus DNA, 0.5 µg pTriEx-4Neo/ABCG2, and BacVector Insect cell Medium qsp 25 µl. 3. In a separate sterile 6-ml polystyrene tube, add 20 µl of nuclease-free water and 5 µl of insect juice transfection reagent. 4. Immediately, add the entire DNA/medium mixture into the latter tube, mix gently, and incubate at room temperature for 15 min. 5. The DNA/medium/transfection reagent mixture was dilued at 1:10, 1:50, and 1:250 with BacVector medium. 6. The cell monolayer was washed twice, with 2 ml of the BacVector medium. 7. Immediately, 100 µl of each dilution was slowly loaded onto the center of the monolayer. 8. Incubate the plates at room temperature for 1 h. To keep the cells moist and to increase the contact of the DNA/liposome complex with the cells, the plates should be submitted to gentle rocking every 20–30 min. 9. During this period, the agarose overlay was prepared. Sterile BacPlaque agarose at 3% was completely melted using a microwave oven. Next, place the bottle in a 37°C water bath and add two volumes of prewarmed BacVector medium (at 37°C) containing 5% FCS. The agarose was kept at 37°C until use. 10. Two milliliters of the agarose was added into each well. The agarose was slowly poured at the side of the plate. Keep the plates at room temperature for 20 min, until the agarose has solidified. 11. One milliliter of BacVector medium containing 5% FCS was then added by gently pipetting the medium onto the middle of the well. 12. The plates were incubated at 27°C for 4–5 days.

Insect Cell Versus Bacterial Overexpressed Membrane Proteins 3.1.2.2. Plaque Identification and Isolation

57

1. The plaques appeared cloudy against a more transparent background. Several well-isolated plaques were picked up using a sterile glass pasteur pipette (see Note 6). 2. The plug was transferred into 1 ml of BacVector Medium in the sterile screw-cap vial. The plug was incubated overnight at 4°C to allow virus to diffuse into the medium. 3. The isolated virus were diluted at 1:10, 1:100, and 1:1,000, and 200 µl of each dilution was then used to infect cell monolayer as described in step 7 in Subheading “Insect Cell Transfection.” 4. Repeat the steps 8–12 described in Subheading “Insect Cell Transfection.” This procedure should be performed twice to purify each virus.

3.1.2.3. Baculovirus Titration

1. Make several dilutions of purified virus from 1:10 to 1:107 in BacVector medium. 2. Seed 0.5 × 106 exponentially growing Sf9 insect cells in each well of the 6-well plate, then incubate at 27°C for 30 min. Use only the cultures that contains nearly 95% healthy cells. 3. The medium was carefully aspirated, just before the addition of 200 µl of each virus dilution to the center of the well. 4. The plates were incubated for 1 h at room temperature and submitted to gentle rocking every 20 min. 5. During this period, the agarose overlay was prepared. See step 9 in Subheading “Insect Cell Transfection.” 6. Repeat the steps 10–12 in Subheading “Insect Cell Transfection.” 7. The plaques were visualized and counted.

3.1.2.4. Baculovirus Amplification

1. Seed one T25 flask with 2.5 × 106 exponentially growing Sf 9 cells. Incubate at 27°C for 30 min. 2. Remove the supernatant and add 0.3 ml of virus suspension to the cell. Then, the flasks were incubated for 1 h at room temperature and submitted to gentle rocking every 20–30 min. 3. After infection, 5 ml of medium was added and the flasks were incubated at 27°C until a cytopathic effect was observed. 4. The supernatant was harvested, constituting the seed-stock virus. 5. Seed one T75 flask with 20 × 106 exponentially growing Sf 9 cells. Incubate at 27°C for 30 min. 6. Remove the supernatant and add 0.5 ml of seed-stock virus suspension to the cells. Then, the flasks were incubated for 1 h at room temperature, with two gentle rocking. 7. After infection, 10 ml of medium was added and the flasks were incubated at 27°C until a cytopathic effect was observed.

58

Pozza et al.

8. The supernatant was harvested, constituting the master-stock virus. 9. Titrate the master-stock virus: see Subheading “Baculovirus Titration.” 10. Seed one T225 flask with 60 × 106 exponentially growing Sf 9 cells. Incubate at 27°C for 30 min. 11. Remove the supernatant and infect the cells with master-stock virus at a multiplicity of infection equal to 0.1. After incubation for 1 h at room temperature, the flasks were gently rocked, twofolds. 12. After infection, 30 ml of medium were added, and the flasks were incubated at 27°C until a cytopathic effect was observed (4–5 days). 13. The supernatant was harvested after centrifugation at 1,000 × g for 10 min, constituting the high-stock virus. The viral stocks should be stored in the dark at 4°C (see Note 7). 3.1.2.5. ABCG2 Overexpression in Insect Cells

The highest ABCG2 overexpression in flasks is detailed here. Roller bottles and spinners (see Note 8) have also been used, but the expression level was low (Fig. 1d). Moreover, two insect cell strains have been tried: Sf9 and High-Five. The ABCG2 expression level was greater with the High-Five strain (Fig. 1c). 1. Seed on T225 flask with 30 × 106 exponentially growing Sf9 or High-Five cells. Incubate at 37°C for 24 h. 2. Just before infection, the medium was removed. The insect cells were infected at a multiplicity of 20 for 1 h at room temperature with high-stock virus. 3. After infection, 30 ml of medium was added, and the flasks were incubated at 27°C. 4. The infected insect cells were harvested 72-h postinfection. ABCG2 functionality in intact cells has been checked by flow cytometry with rhodamine 6G as the substrate. ABCG2 expressed in insect cell was indeed able to efflux rhodamine 6G, an activity that could be abolished by specific inhibitors. 1. The bacteria was harvested by centrifugation at 7,500 × g for 10 min at 4°C, and then washed twice with PBS.

3.2. Preparation of Inverted Membrane Vesicles and Functional Assays

2. Next, the pellets were resuspended in buffer A at a final concentration of 1 mg protein/ml.

3.2.1. Inverted Membrane Vesicles from the Selected Bacteria Strain

3. The suspension was passed twofold through the French press at 18,000 psi, and 10 mM EDTA was immediately added.

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

59

4. Unlysed bacteria and cellular debris were removed by centrifugation at 15,000 × g for 30 min at 4°C. 5. Next, the supernatant was centrifuged at 100,000 × g for 1 h. 6. The pellets were washed by buffer B and centrifuged at 100,000 × g for 1 h at 4°C. 7. The pellets were resuspended in buffer C at a final protein concentration of 20 mg protein/ml. 8. The inverted membrane vesicles were homogenized with an Elvehjem potter. 9. Inverted membrane vesicles were quickly frozen and stored in liquid nitrogen. 10. The overexpression level was checked using an SDS–PAGE gel (Fig. 3a). ABCG2 expressed in bacteria seemed to be located within inverted membrane vesicles, although we have not checked if the overexpressed protein might form “proteo-lipidic aggregates” as described by Montigny et al. (29). Indeed, these aggregates were present in the cytosolic fraction and co-sedimented with the membrane fraction. 3.2.2. Inverted Membrane Vesicles from Insect Cells

1. After centrifugation at 500 × g for 15 min, the cells were washed twice with PBS without Ca2+ and Mg2+. 2. The cell contents of T225 flasks were suspended into 1 ml of lysis buffer D for 1 h at 4°C, next lysed with a syringe by 20 passages through a 25G × 5/8″ needle. 3. Cellular debris and unlysed cells were pelleted by centrifugation at 500 × g for 15 min, and discarded. 4. The supernatant was diluted into buffer E, prior to ultracentrifugation at 100,000 × g for 1 h. 5. The pellets were then suspended into the same buffer including 10% glycerol, by passing through the 25G × 5/8″ needle. 6. Finally, membrane fractions were quickly frozen and stored in liquid nitrogen. 7. The expression level was checked using western blot (Fig. 3d, lane 2).

3.2.3. Functional Assays on Inverted Membrane Vesicles 3.2.3.1. FluorescentSubstrates Transport Measurements

Hoechst33342 is a fluorescent compound, the quantum yield of which is increased in hydrophobic surroundings as plasmatic membranes. When ATP/Mg2+ is added, ABCG2 within inverted membrane vesicles translocates the Hoechst33342 toward the aqueous compartment, and the Hoechst33342 fluorescence decreases. The rate of Hoechst33342 transport must be greater

60

Pozza et al.

than its rate of diffusion to make the transport visible. These conditions are hardly obtained. 1. Four hundred microgram of membrane protein were diluted into transport buffer in either presence or absence of the BCRP-specific inhibitor Ko143. 2. The mixture was incubated for 1 min at 37°C under magnetic stirring. 3. Then, 0.1 µM Hoechst33342 was added, and the fluorescence followed until equilibrium. Fluorescence was read at 457 nm upon excitation at 355 nm, and the bandwidths were set at 4 nm. 4. Finally, 5 mM ATP was added, and the fluorescence was measured during several minutes (Fig. 2a). No Ko143-inhibitable transport activity was detectable on inverted membrane vesicles from bacteria, whereas a transport activity was indeed observed from insect cells. 3.2.3.2. ATPase Activity Measurements

The ATPase assay was based on a colorimetric ascorbic acid/ ammonium molybdate assay for measuring the release of inorganic phosphate as described by Doige et al. (30) with minor modifications. 1. The ABCG2 effectors were prepared as tenfold concentrated solutions in the presence of 10% DMSO. Inverted membrane vesicles were diluted at 1 µg/µl. All solutions were prepared before using and prewarmed at 37°C. 2. The reaction buffer (30 µl), ABCG2 effector (10 µl), and inverted membrane vesicles (10 µl) were added together and homogenized. 3. The reaction was started by adding 50 µl of 10 mM ATP/20 mM MgCl2. 4. After 40 min at 37°C, the reaction was stopped by 100 µl of stopping solution. 5. Five minutes later, 100 µl of revelation solution was added. 6. After 20 min, absorbance was read at 750 nm. The basal ATPase activity was determined on inverted membrane vesicles from control insect cells expressing b-galactosidase or inactive K86M ABCG2. The amount of released inorganic phosphate was expressed in nanomoles of phosphate per minute (per milligram of total protein). The basal ATPase activity of ABCG2 was modulated by drug substrates or inhibitors, and it depended on a mutation located at position 482 (Fig. 2b). No significant ABCG2-mediated ATPase activity was measured on inverted membrane vesicles from bacteria.

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

61

Fig. 2. Functional assay on inverted membrane vesicles from bacteria, and insect cells infected with baculovirus. (a) Hoechst33342 transport on inverted membrane vesicles from bacteria and insect cells containing ABCG2. 1,000 µg of inverted membrane vesicles from bacteria or 400 µg of inverted membrane vesicles from insect cells were diluted in 2 ml of 50 mM HEPES, pH 8, 2 mM MgCl2, 8.5 mM NaCl, 20 µg/ml pyruvate kinase, and 4 mM phosphoenolpyruvate. After 1 min, Hoechst33342 was added at 0.5 µM, then when fluorescence was stable, 5 mM ATP was added and the fluorescence was measured during 1,000 s. —, inverted membrane vesicles containing ABCG2 from bacteria; – –, inverted membrane vesicles containing ABCG2 from bacteria + 1 µM Ko 143; - - -, inverted membrane vesicles containing ABCG2 from insect cells; …, inverted membrane vesicles containing ABCG2 from insect cells + 1 µM Ko 143. (b) Effects of substrates and inhibitors on the ATPase activity of ABCG2 within the inverted membrane vesicles from insect cells. The ATPase activity of R482 (black columns) or R482T (gray columns) ABCG2 overexpressed in Sf9 was measured with 2.5 µg of protein, 5 mM ATP, and 20 mM MgCl2 at 37°C for 40 min.

62

Pozza et al.

3.3. Solubilization and Purification of ABCG2 3.3.1. ABCG2 Solubilization, See Subheading 3.5.1 3.3.2. Purification of ABCG2 Overexpressed in Selected or BL21 (DE3) Strain

ABCG2 expressed in bacteria was not solubilized by zwiterionic and mild detergents, in contrast to a little fraction of the protein expressed in insect cells. The reasons for such a difference will be discussed below. 1. Inverted membrane vesicles from bacteria were treated with bacterial solubilization buffer at a final protein concentration of 2 mg/ml for 60 min with gentle shaking at 30°C. 2. The sample was centrifuged at 100,000 × g for 1 h at 30°C. 3. The supernatant containing solubilized proteins was immediately diluted tenfold (see Note 9) and applied onto a Ni–NTA agarose resin, which was pre-equilibrated with the bacterial solubilization buffer containing only 0.05% SDS for 16 h at room temperature with gentle shaking. 4. The Ni–NTA agarose resin (478 µl/mg of total protein) with bound protein was washed with two-column volumes of washing buffer A. The protein was eluted with three-column volumes of elution buffer A. 5. Imidazole was removed by two successive dialyses (against 100 volumes of the dialysis buffer). 6. Protein purity was assessed on SDS–PAGE gel (Fig. 3b). Further studies by electronic microscopy indicate that ABCG2 was oligomerized after purification. Some optimizations, such as changing the detergent to DDM, have been tried, but no improvement has been obtained. Purified ABCG2 appeared as soluble aggregates. Neither ATPase activity nor drug binding has ever been observed, except for 6-prenylchrysin.

3.3.3. Purification of ABCG2 Overexpressed in High-Five Cells

1. Inverted membrane vesicles from High-Five cells infected by a baculovirus vector encoding the R482 or R482T transporter were treated with cell solubilization buffer at a final protein concentration of 2 mg/ml, for 30 min with gente shaking at 4°C. 2. The sample was then centrifuged at 15,000 × g for 30 min at 4°C. 3. The supernatant containing solubilized proteins was immediately applied onto a Ni–NTA agarose resin, which was preequilibrated in the cells solubilization buffer, for 2 h at 4°C with gente shaking. 4. The Ni–NTA agarose resin (80 µl/mg of total protein) (see Note 10) with bound protein was washed with ten-column volumes of washing buffer B. 5. The protein was eluted with two-column volumes of elution buffer B.

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

63

Fig. 3. ABCG2 purification from inverted membrane vesicles. (a) SDS–PAGE gel of inverted membrane vesicles from bacteria. The bacteria were lysed using French press, and 10 µg of protein was deposited by lane. (b) Purification of ABCG2 from inverted membrane vesicles of bacteria. ABCG2 was purified here from PV6. (c, d) SDS–PAGE gel and western blot, respectively, with BXP-21 antibody of the different steps of ABCG2 purification from inverted membrane vesicles of High-Five insect cells. Lane 1: prestained molecular weight markers; lane 2 : inverted membrane vesicles of High-Five cells infected with the baculovirus vector encoding either R482 or R482T ABCG2; lane 3 : supernatant after solubilization; lane 4: supernatant after centrifugation (15,000 × g, 30 min); lane 5 : detergent-insoluble pellet after centrifugation; lane 6 : supernatant after binding for 2 h; lane 7: Ni–NTA agarose gel after binding for 2 h; lane 8 : washing; lane 9 : Ni–NTA agarose gel after washing; lane 10 : elution; lane 11: Ni–NTA agarose gel after elution; lane 12 : after imidazole removal by gel filtration.

6. Imidazole was removed by gel filtration (Econo-Pac 10DG Columns) (see Note 11), and the purified protein was quickly frozen and stored in liquid nitrogen. 7. Protein concentration was determined before freezing with Bio-Rad protein assay using bovine serum albumin as a standard. 8. The absence of oligomerization of purified ABCG2 in detergent solution was monitored by dynamic light scattering, either before freezing or after freezing and thawing. All steps were performed in the presence of 1,4-dithio-DL-threitol (5 mM) (see Note 12) and protease inhibitors (10 µl cocktail/ml buffer). 9. Each step during the purification was controlled by western blot and SDS–PAGE (Fig. 3c, d) (see Note 13). Some detergents have been tried to solubilize and purify ABCG2, but none was satisfactory, except for CHAPS. Although ABCG2

64

Pozza et al.

purified in CHAPS exhibited a high vanadate-sensitive ATPase activity and was able to bind many substrates and inhibitors with high affinity, its stability was limited once purified; it showed a great propensity to form high molecular weight oligomers displaying vanadate-sensitive ATPase activity. An exhaustive study of the protein stability by dynamic light scattering has indicated that, in order to get ABCG2 stable at 0.1 mg/ml, the buffer must contain at least 0.3 M NaCl, 20% glycerol, and 15 mM CHAPS. The presence of these adjuvants is a great problem for additional steps of purification (gel filtration, ionic chromatography) and functional tests such as reconstitution into liposomes. Indeed, glycerol may induce membrane interdigitation, resulting in loss of protein activity (31). ABCG2 reconstitutions into liposomes have been tried, but without success (see Note 14). 3.4. Functional Assays on Purified ABCG2 3.4.1. Quantification of Secondary Structure

1. The protein concentration was precisely determined by absorbance at 280 nm using the molecular extinction coefficient calculated by the ProtParam tool software from protein sequence. 2. The measurements were performed with the circular dichroism detergent buffer solution and the spectra was scanned from 185–190 to 250 nm, with 0.2-nm step and 1-s data acquisition time, at 25°C under constant nitrogen flushing. The baseline was recorded under the same conditions with the detergent buffer solution. Each spectrum was the average of ten measurements. 3. Spectra were then subtracted from the contribution of detergent buffer solution, smoothed and normalized to 0 at 250 nm.

Fig. 4. Functional assay on purified ABCG2. (a) and (b) Circular dichroism spectra of purified ABCG2 from bacteria and insect cells, respectively. Measurements were performed under the following conditions: 10 mM NaPi, pH 8, 2% glycerol, 1 mM DTT, 0.05% detergent (SDS for 0.5 µM ABCG2 R482T from bacteria or dodecylmaltoside for 0.24 µM ABCG2 R482T from insect cells), and spectra was scanned from 185–190 to 250 nm with 0.2-nm step and 1-s data acquisition time at 25°C. (c) Binding of 6-prenylchrysin on purified ABCG2 R482T from bacteria or insect cells. The measurements were performed under the following conditions: 0.5 µM ABCG2 R482T, 0.05% SDS, 0.1 M KPi, 15% glycerol, 0.1 M NaCl, 1 mM DTT at 25°C for ABCG2 from bacteria (■), and 0.6 µM ABCG2 R482T, 50 mM HEPES/NaOH, pH 8, 18 mM CHAPS, 0.5 M NaCl, 20% glycerol ABCG2 for insect cell-expressed transporter (§). The fluorescence emission spectra were recorded from 310 to 370 nm at 25°C upon excitation at 295 nm. Curves were fitted with SigmaPlot with an on-site model. For bacteria ABCG2, apparent Kd = 0.96 ± 0.10 µM, Bmax = 86.40 ± 4.0; for insect cell ABCG2 apparent Kd = 0.67 ± 0.010 µM, Bmax = 33.84 ± 1.9. (d) Inhibition by vanadate of the ATPase activity of purified ABCG2 from insect cells. The inhibition of ATPase activity of R482 (squares) and R482T (triangles) transporter was measured in the presence of 5 mM MgATP, 0.5 µg protein, 50 µg azolectin, and a range of orthovanade concentrations from 0.001 to 10 mM at 37°C for 40 min. (e) Lineweaver–Burk plots of ATP hydrolysis by the purified transporter from insect cells. The ATPase activity of either R482 (squares) or R482T (triangles) purified ABCG2 was measured with 0.5 µg protein in the presence of 50 µg azolectin at 37°C for 40 min. (f) Summary of the apparent KD1 values obtained by drug-induced quenching of ABCG2 intrinsic fluorescence. Measurements were performed under the following conditions: 0.045 µg/µl ABCG2 (either R482 or R482T), 50 mM HEPES/NaOH, pH 8, 18 mM CHAPS, 0.5 M NaCl, 20% glycerol. The fluorescence emission spectra were recorded from 310 to 370 nm at 25°C upon excitation at 295 nm. Experimental data were fitted according to a two-site model, as described by Doppenschmitt et al. (32).

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

65

4. Finally, the arbitrary CD units were converted to values of mean residue ellipticity (degree/cm/dmol) (Fig. 4a, b, see Note 15), and these values were used for prediction of secondary structure content using different deconvolution programs available on Dichrowed site. The K2d deconvolution was used to compare ABCG2 expressed in the two expression systems.

66

Pozza et al.

Bacterial ABCG2 was constituted of 37% alpha helix, 18% beta sheet, and 45% random coil, whereas ABCG2 expressed in insect cell were composed of 64% alpha helix, 11% beta sheet, and 24% random coil. It clearly appears that purified ABCG2 from bacteria displays a greater content of random coil and a lower content of alpha helix in comparison to the same protein from insect cells. 3.4.2. Ligand-Induced Quenching of Tryptophan Intrinsic Fluorescence

The fluorescence spectroscopy is a good technique to monitor ligands binding on a purified protein. Some controls, however, need to be performed before the experiments: the ligands must not be fluorescent and not have a high absorbance around the excitation wavelength. 1. The purified protein was quickly thawed at 37°C. 2. Next, the purified protein was diluted at 0.045 µg/µl in the fluorescence detergent buffer. The same buffer was used for both the blank and the inner-filter control with N-acetyltryptophanamide (NATA). The inner-filter control is indeed necessary to correct the fluorescence attenuation produced by ligand absorption. The concentration of NATA was chosen to give a similar area as the emission fluorescence spectrum area of the purified protein. 3. A magnetic stirrer bar was added into the three cuvettes containing Protein, Blank, and NATA, which were left under magnetic stirring for 1 h at room temperature. This step was necessary to obtain a stable fluorescence spectrum. 4. Then, the fluorescence was read from 310 to 370 nm upon excitation at 295 nm four times for each cuvette to take into account any potential photo-induced fluorescence quenching. 5. The effector additions were done alternatively: after each fluorescence reading, the effector was added to the cuvette so that the time interval between all measurements was constant. 6. The blank spectra were subtracted from both the protein and NATA spectra. The area under spectra was used to calculate the percentage of fluorescence in comparison to the fluorescence value without drug addition for both protein and NATA. Then, the corrected fluorescence was obtained by dividing the protein fluorescence percentage by the NATA fluorescence percentage. Finally, the corrected values were used to determine the Kd values using Grafit program or to determine apparent Kd values using SigmaPlot program after curve fitting (Fig. 4c). Drug substrates and inhibitors showed biphasic quenching curves that were fitted with a two-site model as described by Doppenschmitt et al. (32). The apparent Kd values were not modified by the R482T point mutation (Fig. 4f), suggesting that the R482T point mutation does not act on binding but on subsequent step to transport.

Insect Cell Versus Bacterial Overexpressed Membrane Proteins 3.4.3. ATPase Assay on Purified ABCG2

67

The measurement was performed as described in Subheading “Fluorescent-Substrates Transport Measurements,” with minor modifications. The inhibitors (sodium azide, EGTA, ouabain) of contaminant ATPase was removed in the reaction buffer. The purified protein was prepared as follows: 1. Purified protein was quickly thawed at 37°C. 2. Immediately, azolectine multilamellar vesicles (see Note 16) were added to give a final lipid-to-protein ratio of 100 (w/w), and the mixture was incubated at 25°C for 30 min. 3. Then, the mixture was incubated at 4°C for 30 min before using. 4. 0.5 µg of purified protein was used for ATPase assay. The basal amount of phosphate was determined with the lipid– detergent buffer. The amount of released inorganic phosphate was expressed in nmoles of Pi per minute (per milligram of ABCG2). The ATPase activity of purified ABCG2 was modified by the R482T point mutation, which both increased Vmax and decreased Km (ATP/Mg2+) (Fig. 4e). Moreover, the ATPase activity of purified ABCG2 was inhibited by vanadate but less efficiently for the single-point mutant (Fig. 4d).

3.5. Limitations of the Baculovirus/ Insect Cell System to Overexpress ABCG2

Two forms of ABCG2 were clearly observed on western blot revealed with either the specific BCRP monoclonal antibody BXP-21 or the anti-histidine monoclonal antibody (see Note 17). These two forms were characterized by an apparent difference in molecular weight of about 2.8 kDa in SDS–PAGE but concomitant biosynthesis. Some experiments have been investigated to understand the origin of such a difference.

3.5.1. Differential Solubilization of the Two ABCG2 Forms by Various Detergents

The compared ability of detergents to solubilize ABCG2 was checked with 3-[(3-cholamidopropyl)-dimethylammonio]-1propanesulfonic acid (CHAPS), PFO, fos-choline 16 (FC-16), and sodium dodecylsulfate (SDS). 1. Membranes were solubilized at 2 mg/ml in Baculovirus/ insect cell solubilization buffer A for 30 min at either 4°C (with CHAPS or fos-choline 16) or room temperature (with SDS or PFO). 2. Then, soluble and insoluble fractions were separated by ultracentrifugation at 100,000 × g for 60 min, at either 4°C (for CHAPS and fos-choline 16) or 25°C (for SDS and PFO). 3. The insoluble fractions were solubilized baculovirus/insect cell solubilization buffer B and the suspension volume used was the same as the centrifuged one. 4. The solubilization was monitored by western blot as revealed with the specific BCRP monoclonal antibody BXP-21 (Fig. 5a).

68

Pozza et al.

Fig. 5. Limitations of the baculovirus/insect cell system to overexpress ABCG2. (a) Differential solubilization of the two ABCG2 forms by various detergents. Solubilization was performed at a constant protein concentration of 2 mg/ml and variable detergent concentrations, corresponding to the different detergent-to-protein ratio (DPR) values, as indicated, in a final volume of 500 µl. After 30 min with gentle shaking, soluble (supernatant) and insoluble (pellet) fractions were separated by ultracentrifugation at 10,000 × g for 60 min. Insoluble fractions were solubilized with a volume equal to the centrifuged volume. After addition of Laemmli sample buffer and heating for 30 min at 37°C, 10 µl of sample were loaded on 8% polyacrylamide gels. S soluble fraction; I insoluble fraction. Western blots were revealed with BXP-21. (b) Conversion of lower to upper form of ABCG2 during SDS–PAGE in presence of 9 M urea. Lane 1, upper form purified at low DPR with fos-choline 16; lanes 2–7, increasing amounts, as indicated, of purified ABCG2 with SDS. The 8% polyacrylamide gel was used 4 h after polymerization, run at 75 V, and stained with coomassie blue. (c) Effects of translation and transcription inhibitors on ABCG2 expression in High-Five cells. After 13-h postinfection, inhibitors were added to the medium at the final indicated concentration. The cell contents of T25 flasks were harvested 12 h later, washed twice with PBS, and lysed with 500 µl of lysis buffer; 10 µl of cell total extracts were loaded on 8% polyacrylamide gel, and the western blot was revealed with BXP-21.

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

69

The upper form was partially solubilized by CHAPS and other mild detergents, whereas the lower form was solubilized only by SDS, PFO, and fos-choline 16 at a high detergent-toprotein ratio. The reasons for such differences in solubility were not known. Two hypotheses are then possible, based on either a local difference in membrane lipid composition or, more likely, a protein posttranslational modification or conformational change. Three evidences indicate that the upper form has undergone a posttraductional modification. First, ABCG2 glycosylation was monitored using periodic acid-Schiff specific staining for polyacrylamide gels according to Kapitany’s procedure (33), and no mature glycosylation was experimentally observed on both upper and lower forms although a glycosylation core could not be excluded. Second, the tunicamycin treatment known to both inhibit protein glycosylation and induce protein misfolding favored the lower form. The two last arguments suggest that ABCG2 indeed has a glycosylation core. Finally, the lower form was converted into the upper form during SDS–PAGE in the presence of 9 M urea. 3.5.2. Conversion of Lower into Upper Form of ABCG2 During SDS–PAGE in Presence of 9 M Urea

Urea at 9 M produced two types of effect on purified ABCG2 during electrophoresis: first, it was able to disturb SDS binding to protein in a protein-dependent way (34); second, it altered ABCG2 structure since the maximal wavelength of tryptophanintrinsic fluorescence emission was slightly red-shifted from 335 to 336 nm. Others chaotropic agents are known to be more efficient in altering ABCG2 structure but their use is not compatible with SDS–PAGE. In the presence of 8 M guanidinium chloride or 6 M guanidinium thiocyanate, the maximal wavelength of tryptophan-intrinsic fluorescence emission was red-shifted by 2 or 11 nm, respectively. 1. The SDS gels, either with or without 9 M urea, were used 4 h after polymerization. 2. A range of SDS-purified ABCG2 (see Note 18), from 1 to 14 µg, was loaded on SDS polyacrylamide gels with or without 9 M urea. A control was performed with the upper band (6 µg). 3. Electrophoresis was run at 75 V until the 50-kDa marker nearly went out of the gel. 4. Then, the SDS gels were stained with coomassie blue (Fig. 5b). The results showed that the lower form was converted into the upper one, indicating that the lower and upper forms might indeed be two conformations of the same protein. However, the difference in conformation seemed to be very subtle.

70

Pozza et al.

Since urea is known to produce limited denaturing effects on hydrophobic cores, the differential effects produced here on ABCG2 conformation might likely concern the cytosolic nucleotide-binding domain. 3.5.3. Effects of Translation and Transcription Inhibitors on ABCG2 Expression in High-Five cells

The addition of inhibitors of either translation or transcription, by decreasing the rate of protein synthesis, might allow a better maturation by the molecular chaperone system. 1. After infection by baculovirus encoding ABCG2, at a multiplicity of 20 for 2 h, the virus suspension was removed, and 5 ml of Grace medium was immediately added. The cells were incubated at 27°C. 2. After 13-h postinfection (see Note 19), the drugs were added into the medium at various concentrations: cycloheximide 1–20 µg/ml, mitoxantrone 2 µg/ml, hygromycin D 400 µg/ ml, emitine 5 µg/ml, anisomycin 2 µg/ml, and actinomycin D 0.1–1 µg/ml. 3. The cells were harvested 12 h later and washed twice with phosphate-buffered saline. 4. The cells from T25 flasks were lysed with 500 µl of cells lysis buffer. 5. Cell lysate was heated at 37°C for 30 min. 6. The drug effects on ABCG2 overexpression were monitored by western blot using the specific BCRP monoclonal antibody BXP-21 (Fig. 5c). The addition of inhibitors of either translation or transcription, favored the upper form; it also seemed to diminish the higher molecular weight smear, possibly due to SDS-resistant ABCG2 aggregates or oxidized forms. Similar results have been observed when the insect cells were cultured in the presence of chemical chaperones (see Note 20) (e.g., glycerol, mannitol, trehalose). Pharmacological chaperones, such as mitoxantrone, did not improve the expression pattern of ABCG2. Chemical chaperones and inhibitors of either translation or transcription favored the upper form, which could indicate that the formation of the lower form is caused by a too high rate of protein synthesis during insect cell infection, compared with the normal rate of maturation fully controlled by molecular chaperones. The comparison of ABCG2 ATPase activity within inverted membrane vesicles obtained after treatment with or without cycloheximide indicates that the ATPase activity was essentially associated to the upper form and confirms that the upper form is probably active and mature while the lower form appears inactive and immature. These last studies indicate that the insect cells/baculovirus system is not ideally adapted to

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

71

overexpress ABCG2, although part of the protein is active and a significant improvement can be provided by addition of agents reducing the rate of protein synthesis.

4. Notes 1. It is not recommended to use glycerol stock because BL21 strains are able to transform plasmids containing genes encoding toxic proteins. Use fresh colonies after transformation to avoid such a problem. Moreover, ABCG2 overexpression was better with fresh transformed bacteria. 2. The preculture is very critical because the bacteria in stationary phase are able to lose plasmids. It is recommended to check the gene integrity by sequencing for validating this step the first time. 3. Do not use chemical compounds such as acridine orange, which is highly mutagenic. 4. The intracellular membranes have a lipid composition and a lipid-to-protein ratio that differs from that of the cytoplasmic membranes. This difference in density modifies the sedimentation pattern. 5. TolC is a porine that brings together both internal and external membranes. TolR is a protein that contributes to external membrane stability. These mutations were performed by Dr. J.-C. Lazzaroni from UMR5122 CNRS/INSA/UCBL1 at Villeurbanne (France). They are known to increase cell permeability and to sensitize E. coli to many different drugs. 6. Avoid neutral red coloration because this colorant is mutagenic. 7. The medium may be supplemented with 10% CFS, which allows a better conservation of the viral titer. 8. The culture conditions in both spinners and roller bottles might stress the insect cells, which are subdued to shear and gravity stress. 9. ABCG2 expressed in bacteria contains an N-terminal hexahistidine tag located. This chimeric protein has low affinity for Ni–NTA agarose, and a concentration of SDS higher than 0.15% inhibits the protein from binding to Ni–NTA and releases the bound protein during washing. 10. The ratio between Ni–NTA and total protein should be optimized; if it is too low, the bound protein will aggregate on the gel.

72

Pozza et al.

11. Desalting column allows removing other undesirable small molecules, such as divalent cations and residual DNA, that may induce protein aggregation or interfere in further functional tests. Purified ABCG2 can be concentrated with centrifugal concentrator, but about half of the purified protein was lost by unspecific binding to the concentrator surface, in spite of many trials to avoid this problem. Moreover, the ABCG2-specific activity after concentration was drastically decreased. 12. Dithiothreitol could be replaced by TCEP that is more stable and does not reduce Ni2+ ions. 13. The purification of overexpressed His-tagged-protein in insect cells is very difficult for many reasons. Some insect cell proteins are rich in histidine residues, which increase the presence of contaminants. The protein overproduction induces no specific protein–protein interaction, which increased the co-elued contaminants. Two steps are critical to avoid this problem: the protein solubilization and the washing. These phenomena explain the low reproducibility of protein purity among different preparations of inverted membrane vesicles from different ABCG2 productions. 14. Solubilized ABCG2 exhibits a lower vanadate-sensitive ATPase activity than ABCG2 within membrane and is also not subject to drug stimulation. The ABCG2 reconstitution to recover these enzymologic properties failed, probably, because the buffer composition was not adapted to both maintain ABCG2 dispersed and allow a good interaction between lipid/detergent micelles and lipid/detergent/protein complex. The research of the adapted buffer requires the determination of the second osmotic virial coefficient. This work is very time- and material consuming. 15. The CD spectra of ABCG2 purified from bacteria have a great background noise from 185 to 200 nm, in spite of the corrections and smoothing, which explains why only K2d deconvolution was used to determine the percentage of secondary structure. 16. Azolectine multilamellar vesicles were prepared using the following procedure: first, azolectine was dissolved at 1 mg/ml in chloroform/methanol (90/10, v/v) in the presence of 1% d,l-a-tocopherol into a round-bottomed flask. Then, the solvent was removed by rotary evaporation. The residual traces of solvent were removed with a vacuum pump for 4 h. Azolectine was rehydrated at a final concentration of 20 mg/ ml for 1 h at 37°C with the following buffer: 50 mM HEPES, pH 8, 10 mM NaCl. 17. The hexahistidine tag is difficult to reveal by western blot in cell total extracts or after solubilization by different detergents. To overcome this problem, protein extracts were solubilized

Insect Cell Versus Bacterial Overexpressed Membrane Proteins

73

at 1 or 0.5 µg/ml in sample buffer (SDS-to-protein ratio between 40,000 and 20,000) for 30 min at 37°C. Then, the sample was run at low voltage (about 75 V). 18. Insect cell ABCG2 purified in SDS was used to avoid detergent effects and detergent exchange, which would disturb the electrophoretic migration. ABCG2 was purified using the protocol described in Subheading 3.3.2 with minor modifications. The buffer is composed of 20% glycerol, 0.3% SDS, 50 mM HEPES, pH 8, and 5 mM TCEP. Protein purification was achieved at room temperature. 19. The inhibitors either of translation or transcription were added 13-h postinfection to reduce their toxicity. Moreover, ABCG2 synthesis started 13-h postinfection with our baculovirus. When the inhibitors were added either 24-h postinfection or later, then the two forms were produced. 20. When chemical chaperones were used, they were added immediately after infection.

Acknowledgments We warmly thank Drs. A.H. Schinkel (Amsterdam, The Netherlands) for providing Ko143, J.-C. Lazzaroni (University of Lyon 1) for constructing the TolR- and TolC- mutants, A. Boumendjel (Grenoble, France) for synthesizing 6-prenylchrysin and other flavonoid derivatives, O. Coulombel and R. Haser (Lyon, France) for giving access to refractometer and viscosimeter, and to dynamic light scattering apparatus, respectively, and J.-L. Rigaud and D. Levy for advices on membrane protein reconstitution into liposomes and TEM analysis of purified protein by negative staining. CD experiments were performed on the platform “Production et Analyse des Proteines” of the IFR 128 BioSciences Gerland-Lyon Sud. We also thank F. Penin for advice on membrane protein biochemistry, E. Vaganay on PAS staining, R. Monserret on circular dichroïsm, and A. Chaboud and I. Grosjean on cell culture. This work was supported by grants from the Association pour la Recherche sur le Cancer (ARC 3519 and 3942), the Ligue Nationale contre le Cancer (comités du Rhône, Drôme et Savoie), the Région Rhône-Alpes (thématique prioritaire Cancer), French–Spanish PAI-Picasso interministerial agreements (SAF-2003-04200-CO2-01), and the Agence Nationale de la Recherche (ANR-06-BLAN-0420). A.P. is a recipient of doctoral fellowships from the Ligue Nationale contre le Cancer (Comité du Rhône) and the Association pour la Recherche sur le Cancer, J.M.P.-V. of a Marie-Curie postdoctoral fellowship from the European Commission (HMPF-CF-2001-01244).

74

Pozza et al.

References 1. Allikmets R, Schriml LM, Hutchinson A, Romano-Spica V, Dean M (1998) A human placenta-specific ATP-binding cassette gene (ABCP) on chromosome 4q22 that is involved in multidrug resistance. Cancer Res 58:5337–5339 2. Doyle LA, Yang W, Abruzzo LV, Krogmann T, Gao Y, Rishi AK, Ross DD (1998) A multidrug resistance transporter from human MCF-7 breast cancer cells. Proc Natl Acad Sci U S A 95:15665–15670 3. Miyake K, Mickley L, Litman T, Zhan Z, Robey R, Cristensen B, Brangi M, Greenberger L, Dean M, Fojo T, Bates SE (1999) Molecular cloning of cDNAs which are highly overexpressed in mitoxantrone-resistant cells: demonstration of homology to ABC transport genes. Cancer Res 59:8–13 4. Janvilisri T, Venter H, Shahi S, Reuter G, Balakrishnan L, van Veen HW (2003) Sterol transport by the human breast cancer resistance protein (ABCG2) expressed in Lactococcus lactis. J Biol Chem 278:20645–20651 5. Mao Q, Conseil G, Gupta A, Cole SP, Unadkat JD (2004) Functional expression of the human breast cancer resistance protein in Pichia pastoris. Biochem Biophys Res Commun 320:730–737 6. Ozvegy C, Litman T, Szakacs G, Nagy Z, Bates S, Varadi A, Sarkadi B (2001) Functional characterization of the human multidrug transporter, ABCG2, expressed in insect cells. Biochem Biophys Res Commun 285:111–117 7. Ahmed-Belkacem A, Pozza A, MunozMartinez F, Bates SE, Castanys S, Gamarro F, Perez-Victoria JM, Di Pietro A (2005) Flavonoid structure activity studies identify 6-prenylchrysin and tectochrysin as potent and specific inhibitors of breast cancer resistance protein ABCG2. Cancer Res 65:4852–4860 8. Pozza A, Perez-Victoria JM, Sardo A, AhmedBelkacem A, Di Pietro A (2006) Purification of breast cancer resistance protein ABCG2 and role of arginine-482. Cell Mol Life Sci 63:1912–1922 9. McDevitt CA, Collins RF, Conway M, Modok S, Storm J, Kerr ID, Ford RC, Callaghan R (2006) Purification and 3D structural analysis of oligomeric human multidrug transporters ABCG2. Structure 14:1623–1632 10. Nakanishi T, Doyle LA, Hassel B, Wei Y, Bauer KS, Wu S, Pumplin DW, Fang HD, Ross DD (2003) Functional characterisation of human breast cancer resistance protein (BCRP, ABCG2) expressed in the oocytes of Xenopus laevis. Mol Pharmacol 64:1452–1462

11. Telbisz A, Muller M, Ozvegy-Laczka C, Homolya T, Szente L, Varadi A, Sarkadi B (2007) Membrane cholesterol selectively modulates the activity of the human ABCG2 multidrug transporter. Biochim Biophys Acta 1768:2698–2713 12. Pal A, Méhn D, Molnar E, Gedey S, Meszaros P, Nagy T, Glavinas H, Janaky T, von Richter O, Bathori G, Szente L, Krajcsi P (2007) Cholesterol potentiates ABCG2 activity in a heterologous expression system: improved in vitro model to study function of human ABCG2. J Pharmacol Exp Ther 321:1085–1094 13. Achard-Joris M, Bourdineaud JP (2006) Heterologous expression of bacterial and human multidrug resistance proteins protect Escherichia coli against mercury and zinc contamination. Biometals 19:695–704 14. El-Masry EM, Abou-Donia MB (2003) Reversal of P-glycoprotein expressed in Escherichia coli leaky mutant by ascorbic acid. Life Sci 73:981–991 15. George AM, Davey MW, Mir AA (1996) Functional expression of the human MDR1 gene in Escherichia coli. Arch Biochem Biophys 333:66–74 16. Beaudet L, Urbatsch IL, Gros P (1998) Mutations in the nucleotide-binding sites of P-glycoprotein that affect substrate specificity modulate substrate-induced adenosine triphosphatase activity. Biochemistry 37:9073–9082 17. Sarkadi B, Price EM, Boucher RC, Germann UA, Scarborough G (1992) Expression of the human multidrug resistance cDNA in insect cells generates a high activity drug-stimulated membrane ATPase. J Biol Chem 267:4854–4858 18. Nelson JA, Dutt A, Allen LH, Wright DA (1995) Functional expression of the renal organic cation transporter and P-glycoprotein in Xenopus laevis oocytes. Cancer Chemother Pharmacol 37:187–189 19. Cai J, Daoud R, Georges E, Gros P (2001) Functional expression of multidrug resistance protein 1 in Pichia pastoris. Biochemistry 40:8307–8316 20. Ren XQ, Furukawa T, Chen ZH, Okumura H, Aoki S, Sumizawa T, Tani A, Komatsu M, Mei XD, Akiyama SI (2000) Functional comparison between YCF1 and MRP1 expressed in Sf21 insect cells. Biochem Biophys Res Commun 270:608–615 21. Opekarova M, Tanner W (2003) Specific lipid requirement of membrane proteins – a putative bottleneck in heterologous expression. Biochim Biophys Acta 1610:11–22

Insect Cell Versus Bacterial Overexpressed Membrane Proteins 22. Hunt C (2005) Specific protein–lipid interactions in membrane proteins. Biochem Soc Trans 33:938–942 23. Polgar O, Robey RW, Bates SE (2008) ABCG2: structure, function and role in drug response. Expert Opin Drug Metab Toxicol 4:1–15 24. Inoue S, Sano H, Ohta M (2000) Growth suppression of Escherichia coli by induction of expression of mammalian genes with transmembrane or ATPase domains. Biochem Biophys Res Commun 268:553–561 25. Tate CG, Haase J, Baker C, Boorsma M, Francesca Magnani F, Vallis Y, Williams DC (2003) Comparison of seven different heterologous protein expression systems for the production of the serotonin transporter. Biochim Biophys Acta 1610:141–153 26. Miroux B, Walker E (1996) Over-production of proteins in Escherichia coli: mutant hosts that allow synthesis of some membrane proteins and globular proteins at high levels. J Mol Biol 260:289–298 27. Arechaga I, Miroux B, Karrasch S, Huijbregts R, de Kruijff B, Runswick MJ, Walker JE (2000) Characterisation of new intracellular membranes in Escherichia coli accompanying large scale over-production of the b subunit

28. 29.

30.

31. 32.

33. 34.

75

of F(1)F(0) ATP synthase. FEBS Lett 482: 215–219 Studier FW (2005) Protein production by auto-induction in high-density shaking cultures. Protein Expr Purif 41:207–234 Montigny C, Penin F, Lethias C, Falson P (2004) Overcoming the toxicity of membrane peptide expression in bacteria by upstream insertion of Asp-Pro sequence. Biochim Biophys Acta 1660:53–65 Doige CA, Yu X, Sharom FJ (1992) ATPase activity of partially purified P-glycoprotein from multidrug-resistant Chinese hamster ovary cells. Biochim Biophys Acta 1109:149–160 Lu JZ, Huang F, Chen JW (1999) The behaviors of Ca2+-ATPase embedded in interdigitated bilayer. J Biochem 126:302–306 Doppenschmitt S, Spahn-Langguth H, Regardh CG, Langguth P (1998) Radioligandbinding assay employing P glycoprotein-overexpressing cells: testing drug affinities to the secretory intestinal multidrug transporter. Pharm Res 15:1001–1006 Kapitanay RA, Zebrowski EJ (1973) A high resolution PAS stain for polyacrylamide gel electrophoresis. Anal Biochem 56:361–369 Schägger H (2006) Tricine–SDS-PAGE. Nat Protoc 1:16–23

as

Part II X-Ray Crystallography

as

Chapter 5 Crystallography of Membrane Proteins: From Crystallization to Structure Aurélien Deniaud, Ekaterina Moiseeva, Valentin Gordeliy, and Eva Pebay-Peyroula Abstract Although crystallographic studies of membrane proteins have progressed in the last 5 years, the field still remains challenging with several severe bottlenecks. The chapter focuses on the crystallization and describes two approaches, the classical vapor diffusion method and the more recent use of lipidic phases. General aspects on the crystallization principles as well as more practical details are given. In a more synthetic way, the chapter also addresses how structures are solved by X-ray crystallography, and highlights aspects that are specific to membrane proteins. Key words: Crystallization, Crystallography, Detergent, Diffraction, Lipid

1. Introduction Membrane proteins (MP) are the main functional units of membranes and represent roughly one-third of the proteins encoded in the genome. It has become clear in recent years that the study of membranes at the molecular level is of great importance not only in the deciphering of all cellular processes, but also in the understanding of the alterations leading to abnormal transformed cells and the action of the drugs. Indeed, 70% of drugs target membrane proteins. Although the number of membrane protein structures deposited since 1985, date of the first membrane protein structure (1), increases drastically it does not yet reach the rate achieved for soluble proteins (2). Currently, the RCSB Protein Data Bank contains more than 50,000 structures, among which less than 500 are Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_5, © Springer Science+Business Media, LLC 2010

79

80

Deniaud et al.

structures of membrane proteins (3). Major efforts undertaken in several laboratories in the last years as well as high-throughput crystallography offer a hope of correcting this imbalance. Nevertheless for the large-scale membrane protein structural biology to realize its full promise, significant challenges must be overcome. Two major bottlenecks await the experimentalist along the process toward the structure, the production of pure, stable and functional protein solubilized in amphiphiles, and the growth of well diffracting crystals. Most of the structural studies need to extract the proteins from the membrane using detergents. The matter is that the natural environment of these proteins is a lipid bilayer, and although detergents shield the hydrophobic surface of the protein from the solvent, they do not fully mimic membranes. Crystallization of a membrane protein should be undertaken only if sufficient biochemical and biophysical characterizations have been made. Solving the structure of a membrane protein by X-ray crystallography can encounter specific difficulties due to the detergent present in the crystals. This step is not defined as a major bottleneck but can prove to be difficult. This chapter describes the general strategy for solving membrane protein structures from the crystallization to the structure. In addition to the “classical” crystallization, it presents also alternate ways exploiting lipidic phases. Most of the crystallography is not specific to MPs, and only the main lines are indicated with references to other textbooks (4). The chapter focuses on aspects that are specific to MPs. All the steps described herein in a linear way, will not necessarily be followed sequentially. MPs very often crystallize with poor diffraction quality. In such cases, the procedure of getting good diffracting crystals and solving the structure will be an iterative process in which one has to cycle between the initial step of preparing the protein in a pure and soluble form for the crystallization, the various crystallization approaches, and the diffraction experiments.

2. Materials 1. Detergents: The chemical purity of detergents that are necessary for the crystallization is more important than for purification. In particular, it has a strong influence on the phase diagram. Most companies have several purity levels; the highest purity level is advisable (for example, high-purity ANAGRADE from ANATRACE®). 2. Lipids: The requirement for the purity of lipids is different when lipids are added as additives to the crystallization and when used for the crystallization in lipidic phases. In the latter case, high purity is needed in order to obtain the appropriate phases.

Crystallography of Membrane Proteins

81

As an example, monoolein (MO) for the cubic phase is purchased from NU-Chek Prep, Elysian, MN. 3. Storage of detergents and lipids: Detergents and lipids have to be stored frozen and dry. When large quantities are bought, they should be aliquoted. Flush away the air in each aliquot with a nitrogen flux before freezing the sample at −20°C. Repeat this each time the sample will be frozen again. When warming lipids or detergent at room temperature, keep the samples sealed in order to avoid water absorption. Store prepared detergent solutions at 4°C in the dark. 4. Proteins: Most membrane proteins are very unstable once they are solubilized in the presence of detergent. Each protein is different. Some protein solutions can be frozen. In general, set up crystallization straight after purification.

3. Methods 3.1. Crystallization by the Vapor Diffusion Method

After extracting the protein from the lipidic membrane and after purification, the membrane protein remains in solution surrounded by detergent molecules. The crystallization can then be achieved by adding a precipitant agent in a similar way as for soluble proteins (5). A series of precipitants and additives have then to be screened in order to identify the appropriate conditions. Although the process is quite similar to soluble proteins, several major differences make it more difficult. First, the solution is intrinsically heterogeneous due to the fact that it will contain protein–detergent complexes (PDC), detergent micelles, and individual detergent molecules (Fig. 1). If lipids are copurified with the protein, the solution is even more heterogeneous and the size of PDCs will be difficult to control. Extensive efforts to select the solubilization conditions (choice of detergent, concentration, ligands) are required to enhance the success rate of the crystallization using the vapor diffusion methods (Fig. 2). This initial step should not be overlooked and has to be monitored experimentally. Several methods used to characterize the protein–detergent complexes prior to crystallization exist and have to be chosen accordingly. Each membrane protein has a specific behavior also dependent on the detergent used for its purification. The steps listed below are indicative. 1. Start the crystallization experiments with well-defined pure proteins (SDS-gel, mass spectrometry, if possible N-terminal sequencing) solubilized in neutral or zwitter-ionic detergents known to be useable for crystallization with appropriate purity (see Note 1). Perform functional assays when possible (can also be ligand binding) (see Note 2).

82

Deniaud et al.

Fig. 1. From the membrane to the crystal. The solubilization and crystallization steps are represented schematically and highlight the role of the detergents.

Fig. 2. The vapor diffusion method. From the initial condition, vapor diffuses from the drop in order to equilibrate the precipitant concentrations in the reservoir and in the drop. In the final condition, the protein and precipitant concentrations of the drop are increased.

2. Concentrate the protein to 5–10 mg/mL by using molecular mass concentrators adapted to the protein (see Note 3). 3. Characterize the protein–detergent complexes before and after concentration, in size (size-exclusion chromatography, light scattering, analytical ultracentrifugation) and composition (content in detergent: radioactive labeled detergent or ATR-IR, content in lipids: Thin-layer chromatography, phosphate dosage, or mass spectrometry) (see Note 4). 4. Test the stability of the protein solubilized in detergent, i.e., by following its behavior in a size-exclusion chromatography

Crystallography of Membrane Proteins

83

after purification, and one or several days after. Identify and analyze proteolytic fragments as a function of time. Compare the stability of the protein with and without inhibitors or ligands (see Note 5). 5. Set up crystallization with commercial screens or PEG-screens with the vapor diffusion method (see Note 6). When a nanodrop robot is available, make 100 or 200 nL drops composed of ½ volume of protein solution and ½ volume of reservoir; the reservoir has a volume of 100 mL. Hanging drops are preferable to sitting drops (easier to observe). 6. From 1 or 2 days to 1 month after their setup, the drops should be observed regularly (see Note 7). 7. Conditions leading to crystalline precipitates, or to phase separation with precipitates have to be repeated and refined (see Note 8). Depending on the number of conditions, the drops will be done with the robot or manually. In the latter case, drops of typically 2 mL (1 mL protein solution + 1 mL reservoir) are equilibrated against reservoirs of 500 mL. The scale-up from 200 nL to 2 mL modifies the kinetics driving to the equilibrium, and therefore, the crystallization conditions are not always directly transposable from the small volume to the larger one. 8. Explore additives including lipids and detergents (see Note 9). 9. Exchange the detergent to another detergent during purification, and explore crystallization conditions starting with large screens (see Note 10). For this, bind the protein to an affinity column and wash with several column volumes of buffer containing the replacement detergent. 10. Modify the protein by cleaving floppy N- or C-termini (i.e., additional Tags), or increasing the protein surface accessible to protein–protein interactions by crystallizing the protein in complex with an antibody fragment (see Note 11). 11. Test the crystals for diffraction quality as soon as possible. Crystals can be flash-frozen or tested directly from the crystallization setups (see Note 12). 3.2. Practical Approach to lipidic cubic phase (LCP) Crystallization

In order to understand the mechanisms of function of membrane proteins, their structures have to be determined up to high resolution. A formidable obstacle in this way is a lack of well-ordered three-dimensional crystals, which is largely the result of the difficulties in handling membrane proteins. In addition, even if crystals are obtained, they are not necessarily of sufficient quality for X-ray crystallography means. From the low number of successes despite the efforts, “standard” crystallization from detergent solutions may not be an optimal way to crystallize membrane proteins.

84

Deniaud et al.

Fig. 3. Schematic representation of in meso crystallization process.

In 1996, a new alternative crystallization method, employing lipidic cubic phases (see Note 13), was developed by Rosenbusch and Landau to improve this situation (6). A fundamental difference between methods of standard crystallization and crystallization in the lipidic cubic phase is that in the latter, the solubilized protein is incorporated back in the native lipid bilayer and the crystallization is induced by addition of dry salt as precipitant agent (Fig. 3). Liquid crystalline systems formed by lipids in aqueous media can exist as infinite periodic minimal surfaces, which have a zero mean curvature and a periodicity in all three dimensions characterized by a cubic lattice (7). Luzzati first described the bicontinuous liquid crystal cubic phase (8) and later its geometric structure was supplied by (9). The system consists of two compartments: a continuous curved lipid bilayer forming a three-dimensional well-ordered structure, interwoven with continuous aqueous channels. The phase is very viscous, isotropic, and optically transparent. Bicontinuous cubic phases are ubiquitous in lipid–water mixtures. Cubic membranes are found in cell life (10), and they are used in food industry (11) as well as for drug delivery (12). 3.3. Example of Bacteriorhodopsin (BR)

Practical aspects of crystallization in the lipidic cubic phase look very simple and an example – crystallization of BR (see Note 14 for other examples) – can be described as the following procedure (13): 1. Weight into the PCR tube (200 mL) approximately 5 mg of dry MO, incubate tubes with MO at 40°C, and spin the lipid down for 10 min at 13,000 × g at room temperature.

Crystallography of Membrane Proteins

85

2. To begin with the crystallization processes, keep MO at 40°C during an additional 20 min to gain the isotropic fluid lipidic phase and then let the lipid phase cool to room temperature. If the MO phase becomes nontransparent at room temperature, repeat the melting procedure. Mix 1 mL of prepared 10 mg/mL BR solution comprising about 1.2 w/w% of n-octyl-b-D-glucopyranoside (OG) with 1 mg of MO. To homogenize lipid/protein mixture as well as to gain the cubic phase, centrifuge the PCR tubes with the sample at 9,200 RCF for at least 1 h at 22°C (rotating tubes within the rotor every 15 min by 90°). Incubate the samples during the day in the dark at 22°C. 3. To induce the crystallization, add a precipitant – a ground powder of KH2PO4 mixed with Na2HPO4 (95/5 w/w) with a final concentration of the salt mixture 1–2.5 M (pH 5.6). Repeat homogenizing centrifugation of samples as described in the previous item. Leave the crystallization batch in the dark at 22°C. The first BR microcrystals (10–20 mm in diameter) usually appear within 1 weak after induction of crystallization (Fig. 4) (see Note 15 for a more detailed crystallization mechanism). The described protocol of crystallization is quite close to the original one provided by Rosenbusch and Landau (6). 4. Harvest crystals directly from LCP by mechanical manipulation using microtools during observation with high resolution light microscopy, and flash-freeze a single crystal surrounded by some remaining lipidic phase for cryoprotection. Alternatively, add lipase to the lipidic phase to digest the monoolein at room temperature during several hours or days (14), and fish the crystal from the fluid phase (see Note 16).

Fig. 4. A single BR crystal and BR crystallization probe which is placed in a PCR tube.

86

Deniaud et al.

3.4. Perspectives for the Crystallization in Lipidic Phases 3.4.1. Crystallization in the Sponge Phase

The sponge phase (L3-phase) is the liquid analogue of the lipidic cubic phase with the reduced bending rigidity of membranes and without long-range order. The ordered cubic phase structure is perturbed by thermally excited shape fluctuations of membranes. The transformation of the cubic to the sponge phase may be induced by adding a solvent such as dimethyl sulfoxide, propylene glycol, polyethyleneglycol (Mw » 400), 2-methyl-2,4-pentanediol (MPD), or Jeffamine M600 to a lipid/water system (15). The diameter of aqueous pores in the monoolein cubic phase is relatively narrow (ca. 5 nm) compared to that of the sponge phase (10–15 nm) (16). The size of the pores of the L3-phase allows one to incorporate membrane proteins with large hydrophilic parts and let them diffuse freely within the plane of the membrane surface (15). A schematic representation of L3-phase is shown in Fig. 5. Among the literature regarding membrane proteins crystallization in the sponge phase, there was a report on the appearance of the sponge phase proved by “visual inspection, small-angle X-ray scattering and NMR spectroscopy” (17). Crystals of the reaction center from Rhodobacter sphaeroides were grown in the L3-phase by a conventional hanging-drop experiment, and were harvested directly without the addition of lipase or cryoprotectant, and the structure was refined to 2.2 Å resolution. In contrast to the earlier lipidic cubic phase reaction center structure (18), the mobile ubiquinone could be built and refined. In these experiments, the components similar to those of cubic phase crystallization (structural resolution 2.35 Å) (18) were used: the MO/membrane protein/detergent/buffer. The only additional component was a small amphiphilic molecule 1,2,3-heptanetriol or Jeffamine M600. In another work (19), crystals of the light harvesting II complex

Fig. 5. A schematic representation of the sponge phase.

Crystallography of Membrane Proteins

87

with structural 2.45 Å resolution were obtained by the sponge phase approach. In this study, the additives used were KSCN, butanediol, pentaerythritol propoxylate (PPO), t-butanol, Jeffamine, and 2-methyl-2,4-pentanediol (MPD). A 2.0 Å structure was available for the light harvesting II complex. It was obtained using vapor diffusion-grown crystals of the detergentsolubilized complex (20). The liquid properties of the sponge phase at room temperature can be used directly in hanging- or sitting-drop vapor-diffusion crystallization by commercially available robots. Recently, a sponge phase sparse matrix crystallization screen consisting of 48 different conditions was designed, and crystals for 8 proteins out of 11 tested were obtained (21). Is the “sponge phase” approach better than crystallization in the cubic phase? Unlike the former method, this one has not led to a breakthrough in structural biology of membrane protein. There was no structure of a new membrane protein or a principal improvement in structural resolution achieved by this method. In addition, the use of small additives to obtain a sponge phase can sometimes be harmful for a membrane protein. We have tested the additives described in (19) for BR crystallization. The results of this experiment presented in Table 1 show that indeed, most of the additives are harmful for BR, and the approach does not allow one to obtain BR crystals of the same quality as in the LCP (Moiseeva, personal communication). Does it mean that the sponge phase approach does not have the same (or higher)

Table 1 Crystallization of BR in swollen lipidic L3 mesophases. Probes with the stable protein are shown in dark boxes. In the other cases, the proteins were denaturated or were considerably stressed

88

Deniaud et al.

power as the LCP method? No, for some of the proteins it works well and a further development of this new method is possible. As in the case of the previous approach, unfortunately, there is lack of information about the behavior of in meso system in the course of crystallization. Again, such information could be obtained by a complimentary use of neutron and X-ray scattering. 3.4.2. Crystallization from Vesicles

As is known, bacteriorhodopsin of Halobacterium salinarum forms a hexagonal two-dimensional lattice within the purple membrane. Another crystallization approach (22, 23) exposed the observation that purple membranes treated with the neutral detergent under certain conditions lead to the creation of spherical protein clusters (~50 nm in diameter). Using a standard vapor diffusion method for crystallization from BR vesicles with a high protein/lipid ratio, birefringent hexagonal crystals diffracting X-rays beyond 2.5 Å resolution were obtained (22–24). This new crystal belongs to the space group P622 with unit cell dimensions of a = b = 104.7 Å and c = 114.1 Å. The highest announced structural resolution achieved by this method is 2.3 Å. The scheme of such an experiment is illustrated in Fig. 6. A number of experiments have been carried out in our laboratories to optimize the

Fig. 6. Crystallization from vesicles. (a) Schematic representation of crystallization process from vesicles. (b) The BR crystals grown from vesicles.

Crystallography of Membrane Proteins

89

method (Golubev et al., personal communication). It has been shown that one can obtain different types of BR crystals (Fig. 6); however, at present, there is no other membrane protein crystallized by this method. Nevertheless, it is not yet clear whether this approach is limited to some specific cases, like BR, or has a more general application. Unfortunately, as in two other cases mentioned above, this system of crystallization has not been sufficiently characterized. 3.4.3. Crystallization from Bicelles

Another approach is the crystallization from bicelles, which was first applied to obtain high quality BR crystals (25, 26). Bicelles are a liquid crystal phase consisting of disc-shaped lipid-rich bilayer particles formed from mixtures of dimyristoylphosphatidylcholine (DMPC) with certain detergents. The most employed detergents usually are either dihexanoylphosphatidylcholine (DHPC) or zwitterionic bile salt derivative, CHAPSO. The bicelles sizes at a 1:3 DMPC/DHPC molar ratio are: the bilayer thickness – 40 Å and the diameter – 400 Å. The 1ipid:detergent ratios present in the bicellar systems are quite high compared to other micellar systems, which have previously been used as model membranes in biophysical or biochemical studies (27, 28). Moreover, the bicellar systems are unusual in that bicelles can be magnetically oriented to yield oriented solid state NMR spectra of high quality (29). It is interesting to mention that investigation of bicelle formation is of great biological importance as well. It may help to understand better one of the fundamental processes in the cell: formation and transformation of biomembranes. The procedure of crystallization of membrane proteins from bicelles can be described as follows. The first step is preparation of bicelles. Then, solubilized protein is mixed with bicelles. It is believed that at this stage, the proteins are reconstituted into bicelles. These steps are schematically illustrated in Fig. 7. The last step is crystallization of the protein by a standard vapor diffusion method. BR crystals grown at room temperature are essentially identical to previously obtained at 37°C twinned crystals: space group P21 (2.0 Å resolution) with unit cell dimensions of a = 44.7 Å, b = 108.7 Å, c = 55.8 Å, ß = 113.6°. The other room-temperature crystals are untwinned and belong to space group C2221 (2.2 Å resolution) with the following unit cell dimensions: a = 44.7 Å, b = 102.5 Å, c = 128.2 Å. By this method, crystals of the human b2-adrenergic G-protein-coupled receptor were obtained (30). The structure was solved to 3.5/3.7 Å resolution, which is considerably lower than what was obtained by protein crystallization in the cubic phase (31). However, taking into account the long standing attempts to crystallize a GPCR, one has to accept that such a limited resolution

90

Deniaud et al.

Fig. 7. Crystallization from bicelles. (a) The bicelle components: DMPC, DHPC, CHAPSO. (b) Schematic representation of the bicelles: CHAPSO–DMPC, DHPC–DMPC and DHPC–DMPC bicelle with reconstituted integral membrane protein. (c) The BR crystals grown by this method.

is still a considerable success of the method under discussion. Very recently, the murine voltage dependent anion channel (mVDAC) was also crystallized in lipidic bicelles (32). The 2.3 Å resolution structure reveals a high-resolution presentation of membrane protein architecture. Also, the position and structural features of the voltage-sensing N-terminal segment were clearly assigned unlike the recent published NMR structure of human VDAC1 solubilized in detergent micelles (33). 3.4.4. On the Way to a General Method

Almost half of the small amount of known MP structures is determined at relatively low resolutions (with resolution 3 Å and lower), and only a small number of high-resolution structures are available (34). In contrast, for the RCSB (3) as a whole, ~4% of the entries are at 3 Å resolution or worse. The majority of membrane proteins with determined structures were crystallized in a nonmembrane environment by a standard method of vapor diffusion in the presence of detergents (Fig. 8) (35). Nevertheless, new approaches – crystallization from lipid bilayers (better to say “from membranes”) – have led to a breakthrough in the field of structural biology of membrane proteins. The important role of native lipidic environment in the stability and function of membrane proteins is known. Recent progress in crystallization, employing the most promising biomimetic environments: bicelle and lipidic cubic phases, has given hope to gain the three-dimensional structure of many other membrane proteins. Also a rational design of the crystallization experiment is not possible without deep knowledge of the used systems. A complimentary approach to the studies of complex fluids used for crystallization is of great importance for the development of more general and efficient methods of membrane protein crystallization. Physics with such powerful experimental methods as neutron and X-ray scattering as well as theoretical tools will play a key role in further

Crystallography of Membrane Proteins

91

Fig. 8. Statistics of membrane protein crystal structures determined by X-ray diffraction versus crystallization method used to obtain the crystals. Note: Count is high because methods, as listed, are not mutually exclusive (35).

development of crystallization methods. Success would have a high impact on our understanding of molecular mechanisms of the function of the living matter. 3.5. From Crystals to Diffraction Patterns 3.5.1. Heavy Atom Derivatives

3.5.2. Crystallography

Most of the membrane proteins solved are new structures and cannot be solved by molecular replacement. When expressed in Escherichia coli, the proteins can be labeled with Selenomethionines, and the structure is then solved using anomalous dispersion. Still several membrane proteins will need heavy atom derivatives to be solved. Binding heavy atom derivatives to MPs at specific locations may encounter two problems. First, polar protein surfaces are by nature limited, and therefore polar compounds have a more restricted number of putative binding sites. Second, hydrophobic compounds will be trapped within hydrophobic tails of detergent molecules. The search for heavy atom derivatives can be therefore more tedious than for a soluble protein. As soon as high quality diffraction data as well as phase information are available, solving the structure of a membrane protein is very similar to a soluble protein. It has to be noted that because of the difficulty in MP crystallization, whenever micro-crystals are available, these should be tested for their diffraction if possible on a micro-focus synchrotron radiation beam-line (or at least on a

92

Deniaud et al.

beam-line that allows to collimate the beam down to the size of the crystal). In some cases, structures can be solved from microcrystals. Several developments go along this direction (36). Finally, once the protein model is built, extra densities corresponding to lipids or detergents are detected in the electron density maps. They can be modeled from 2Fo-Fc or Fo-Fc electron density maps. Refining their positions and temperature factors will depend on the resolution.

4. Notes 1. Most of the crystallizations achieved so far imply only a limited number of detergent families (37): alkyl-dimethylamineoxides (CnDAO), with n ~ 12, alkyl-glucosides (CnG) with n ~ 8, alkyl-maltosides (CnM) with n ~ 12 or alkyl-oligoethylene glycol-monoethers (CnEm) i.e., C12E8, with small variations on the length of the alkyl chains. Detergents with short chains are not well suitable for membrane protein stability. However, the use of longer detergents can lead to steric hindrance during the crystal formation. It is admitted that ionic detergents would prevent protein–protein interactions by repelling the PDCs, therefore only nonionic detergents are appropriate for crystallization. The best detergent for the solubilization is not necessarily the best for crystallization and vice-versa. In that case, the detergent has to be exchanged during or after the purification procedure, during affinity chromatography, size exclusion chromatography, or by dialysis. However, depending on the critical micellar concentrations, the time necessary for the dialysis might be incompatible with protein stability. 2. The detergent has to maintain the protein in solution in a functional state or more precisely, in a conformation that is relevant to the function. Functional assays are not always possible in PDCs, in particular for transport processes. If ligands are known (substrates or analogs, cofactors, inhibitors), then binding activities can be assayed; they will provide partial indications on the “quality” of the protein. The success of this initial step also depends upon the knowledge of the physicalchemistry of the detergent used, for review (38). 3. The choice and the concentration of the detergent during the purification steps prior crystallization is well discussed in (39). During protein concentration, the detergent is usually concentrated. Removing excess detergent can be essential (40). 4. Several biophysical methods will help to characterize the folding. Thermal fluorescence scans adapted to membrane

Crystallography of Membrane Proteins

93

protein will provide information on the stability (41). Circular dichroism will provide information on the secondary structure. Infrared spectroscopy using ATR-IR will show the secondary structure as well as the quantity of detergent (42). All these methods can be particularly efficient for relative measurements when comparing the properties of PDCs with different detergents. Same measurements can also be performed in the presence of lipids or any other ligand to identify conditions under which the protein has a proper folding and is more stable. It might also be useful to identify and quantify endogenous lipids bound to the protein. The size, the shape of the PDCs, their content in protein and the oligomerization state of the protein, amounts of detergent and lipid molecules and their identification, the stability of the PDCs, have to be characterized as best as possible and can be studied by several biochemical and biophysical methods described in the chapters of (43). Size exclusion chromatography coupled to UV absorption, refractive index, static and dynamic light scattering is well adapted to such measurements (44) and can be complemented by analytical ultracentrifugation (45). 5. The dynamics of proteins in PDCs might be different from the native membrane, and the proteins will therefore be able to adopt a large variety of conformations (not necessarily relevant for their functions), making precise molecular contacts for crystal growth difficult. Stabilizing one single conformation in the solution will therefore enhance the success rate in crystal growth. This was achieved by the mutation of a single residue for LacY (46), or by co-crystallization with an inhibitor for the ADP/ATP carrier (47). The detergent concentration can also be critical. 6. Several precipitants will favor the phase separation and protein precipitation. If enough protein is available, all the commercial screens can be tested using a nanodrop robot. Even with 100 nL drops, 80 mL are sufficient to screen more than 600 conditions. However, with membrane proteins, it is not expected to have protein crystals from the first screens. They will only indicate conditions where phase separations and interesting precipitates occur, which have then to be refined. Alternately, instead of screening blindly any available conditions, screens can be restricted to fewer precipitants favorable to many membrane proteins such as polyethylene glycols (PEGs) in the presence of various salts and other additives. For this strategy, grid screens with PEGs of different molecular masses (i.e., PEG400, PEG4000, PEG8000) or similar molecules (Jeffamine or PEGMME) and various salts at 0.1 or 0.2 M can be set up for different pH values. Figure 9 shows a few examples of initial hits obtained with AcrB.

94

Deniaud et al.

Fig. 9. Initial hits obtained for AcrB in the presence of different PEGs. Small crystals are identified in (a) and (c). Higher PEG concentrations lead to phase separations as seen in (b) and (d).

7. A large part of the crystallization experiments consists in observing the crystallization setups. In contrast to many soluble proteins, even with a large number of conditions, protein crystals should not be expected from initial screens. The difficulty is then to recognize crystalline precipitates or small angular objects as possible early events in the crystallization process. Large amounts of detergent combined with salts will lead to detergent crystals. Dyes coloring protein crystals don’t easily diffuse in membrane protein crystals which therefore remain uncolored as salt crystals do. Observing crystals under UV light is a more efficient way of detecting the molecular nature of small promising objects. 8. Figures 10a, b show two typical phase diagrams of nonionic detergents. They highlight the existence of the consolution boundary, which separates the micellar phase from a domain where two immiscible phases coexist. One is detergent enriched, the other is detergent depleted. Micelle–micelle interactions will increase while approaching the consolution boundary from the micellar phase. In the presence of proteins, the same effect will occur and increasing PDC interactions will thus favor protein–protein interactions. When reaching the phase separation,

Crystallography of Membrane Proteins

95

Fig. 10. Typical nonionic detergent phase diagrams. The detergent concentration versus temperature is shown in (a) and (b) for two different nonionic detergents. The diagrams highlight different phases separated by the critical micellar concentration (CMC) and the consolution boundary. (c) represents the detergent concentration as a function of the precipitant showing the influence of the precipitant on the CMC and the consolution boundary.

the protein will concentrate in detergent-enriched phase as illustrated with colored proteins in Fig. 11b. Very often, crystallization is obtained across or near the consolution boundary (Fig. 11c–e). This boundary will also be influenced by the precipitant itself (Fig. 10c). The phase diagram of the protein versus the precipitant (Fig. 12) indicates that the

96

Deniaud et al.

Fig. 11. A bacterial photosynthetic complex crystallized in the presence of PEG. (a)–(c) show the effect of three PEG concentrations. At 8% (a), the protein solution remains clear. At 15% (b), a phase separation occurs, and the protein is concentrated in one of the phase (as seen by the dark droplets containing the colored protein). At 10% (c), the conditions are close to the consolution boundary and crystals have appeared. (d) is a close-up of a condition similar to (b), small crystals have formed using all the protein in their surroundings. (e) is a close-up view of (c).

Fig. 12. Phase diagram of protein concentration versus precipitant. The diagram is the standard diagram valid for all types of proteins. For membrane proteins, this diagram superimposes with the detergent diagrams.

Crystallography of Membrane Proteins

97

protein will go from a soluble forward to a metastable zone above the solubility curve, in which increasing protein–protein interactions might lead to the nucleation of a crystal (5). Closer to the solubility curve, nucleation will be weaker and the growth of crystals will be favored. The protein phase diagram has to be combined with the phase diagram of the detergent. The general strategy for the crystallization setups is to search for precipitants that induce phase separations with crystalline precipitates in one of the phases. In a second step, the precipitant should be reduced to approach the consolution boundary without crossing it. This can be a smoother way and produces often fewer and larger crystals (Fig. 11c, e). It has to be noticed that phase diagrams of a detergent might be perturbed by various parameters such as its chemical purity, or the ratio of different isomers. Crystallization will thus be influenced by these parameters (48). 9. When refining the conditions, additives have to be considered. In particular, some additives are known to modify the micelle sizes and therefore will influence the size of the detergent belt in the PDC. This can be important in the crystal packing. Indeed, heptane-triol, known to shrink the radius of detergent micelles, was successfully used as an additive to improve the diffraction quality of the reaction center crystals probably by adapting the size of the detergent belt to the cavities in the crystal left over by protein–protein contacts (49). With the same idea, detergents can also be screened (48, 50). Detergents that are successful as additives might then be tested as main detergent in the PDC. The temperature will drastically influence the phase diagrams of the protein and of the detergent and is therefore an important parameter to explore. 10. Because of the high degree of flexibility of detergent molecules surrounding the protein in the PDCs, crystal contacts have to be made through protein–protein interactions to ensure crystals of high diffraction quality. These contacts are in most cases restricted to extramembrane domains of the protein, and the number of possible contacts is thus much lower than for a soluble protein. The organization of the detergent in membrane protein crystals was highlighted by low-resolution neutron diffraction for the reaction center (49) and for the OmpF porin (51). These studies have shown that the bulk volume occupied by the detergent around the proteins has to fit perfectly in the cavities of the crystal left over by protein–protein contacts. To improve the crystal contacts, the size and shapes of PDCs can be modified.

98

Deniaud et al.

11. Crystallizing the protein in complex with an antibody fragment increases the size of the extramembrane domain of the protein and favors the crystal contacts. The method was first illustrated for a cytochrome c oxidase, which was crystallized with Fv fragments (52). Fab fragments were also used as shown for the KcsA K+ channel (53). Several other examples were reviewed by Hunte et al. (54). More recently, a similar approach based on ankyrin repeat proteins (DARPins) was proposed (55). Modifications on the protein itself can also improve the crystallization. Truncations of N- or C-termini or the presence or absence of Tags can influence the quality of crystal packing (50). 12. As any protein crystal, the crystal is in equilibrium with the drop in which it grew. The so-called “mother liquor” is difficult to reproduce and becomes even more difficult to substitute for MPs because it contains also detergent and sometimes lipids in amounts that are difficult to determine precisely. Therefore, transferring the crystals to a solution containing cryoprotectant or soaking the crystals in solution containing heavy-atom derivatives can be difficult. In most cases, crystals have to be flash-frozen at 100 K for the diffraction experiments. It is usual to test several cryoprotectants with various procedures (time, concentration, intermediate transfer solutions). If in all cases the diffraction is poor, it is worthwhile testing the crystal quality prior to flash-cooling and even prior to any handling. It is now feasible to test directly the diffraction quality of crystals in the crystallization drop by setting the whole crystallization plate in the X-ray beam (56). Such setups are now available on a few synchrotron beam lines (i.e., BM30AFIP at ESRF, Grenoble). 13. The matrix most widely used for crystallization is composed of unsaturated monoglyceride monoolein and water. The temperature–composition phase diagram of the monoolein in water is shown in Fig. 13. At lower water content, two lamellar (Lc and La) phases exist (57). At hydration, over 20% MO spontaneously forms bicontinuous cubic phases of different symmetries (Ia3d and Pn3m), and the latter are stable in the presence of excess of water. Bicontinuous lipidic cubic phases are quite “resistant” to changes in temperature. The cubic phase is a gel-like material with high viscosity and stability. 14. The crystallization in the lipidic cubic phase environment has already enabled one to obtain crystal structures for a number of proteins such as bacteriorhodopsin (58–60), halorhodopsin (61), sensory rhodopsins (62–65), and the first complex of two membrane proteins: sensory rhodopsin II with transducer protein HtrII (66, 67), bacterial photosynthetic

Crystallography of Membrane Proteins

99

Fig. 13. A phase diagram of monooleoyl/water system (redrawn from (19)).

reaction centers (18, 68, 69), the cobalamin transporter BtuB (70), and the human b2-adrenergic G-protein-coupled receptor (31, 71, 72), which were the first high resolution structures of the first human “ligand” binding GPCRs. 15. Over the past several years, the behavior of lipid/water phases has been investigated under different conditions, in particular, the phase diagram of monoglyceride/detergent/water system in water (73). Further, there is a theoretical attempt to understand the mechanism of membrane protein crystal growth in the lipidic cubic phase (74). Theoretical calculations showed that the elastic energy of the deformation of the curved bilayer due to embedded proteins may be the driving force of crystallization. Unfortunately, this theory is not complete, and in addition, it is not yet known whether the model of crystallization system used for the calculation is correct. There are several possible reasons for efficiency of the method, and they have been discussed in the literature. However, the first question is whether the system is really in this phase. In fact, the probe for crystallization is different from the pure MO/water system. Indeed, it contains salts in the buffer and, what is most important, a certain amount of the detergent. A priori it was not evident that the phase diagram shown in Fig. 13 somehow describes the lipidic/membrane protein/ detergent/salt/water system as well. Therefore, the detailed kinetics of a proteolipidic cubic phase was studied by neutron scattering in the course of crystallization of the membrane protein bacteriorhodopsin. It was shown, that the initiation of crystallization process by salt addition leads to a dramatic decrease of the lattice constant, but no phase transition takes place. The cubic phase of Pn3m symmetry is observed during the entire crystallization process. No other phases are present in

100

Deniaud et al.

a macroscopic amount (75). Unfortunately, this does not exclude the presence of a small amount of other than the cubic phase (75). There is evidence of the presence of a lamellar phase around the growing protein crystals, but there is no definite proof of that (76). Further experimental and theoretical studies are required to elucidate the mechanism of crystallization. First of all, a very detailed experimental characterization of the system for crystallization via neutron and X-ray scattering techniques is necessary. 16. For X-ray data collection and also for other applications, single crystals grown in lipidic matrix need to be released from the highly viscous and sticky environment, placed onto nylon cryo-loops, and dumped into liquid nitrogen. There are several techniques for crystal mounting. Crystals could be directly removed from LCP by mechanical manipulation. In this case, the lipidic surrounding is mechanically withdrawn using microtools during observation with high resolution light microscopy. A single crystal with the rest of lipidic phase may be directly cryocooled as lipid served as cryoprotectant. Alternative fishing procedure is based on lipolysis of cubic liquid crystal dispersions of monoolein. The protein could be freed from the surrounding lipidic matrix by digesting the monoolein with lipase at room temperature during several hours or days (14). Lipase breaks ester bonds between glycerol and fatty acids of monoacylglycerols thereby embedded crystals are spontaneously released from the lipidic cubic phase and can easily be mounted from the aqueous phase or in oleic acid droplets by cryoloops. In the third technique, the cubic phase, which itself is membranous, may be solubilized by detergent (59). In the case of BR, the lipidic matrix with embedded crystals is transferred into 3 M Na/Na–Pi buffer (pH 5.6) containing 0.1% OG by microspatula and incubated during several days before crystal harvesting. It should be mentioned that in all these three cases, all manipulations with the crystals should be gentle since the crystals are fragile and very sensitive to environmental changes. For instance, dehydration quickly leads to crystal quality degradation. References 1. Deisenhofer J, Epp O, Miki K, Huber R, Michel H (1985) Structure of the protein subunits in the photosynthetic reaction centre of Rhodopseudomonas viridis at 3 Å resolution. Nature 318:618–624 2. White SH (2004) The progress of membrane protein structure determination. Protein Sci 13:1948–1949

3. http://www.rcsb.org/pdb/ 4. Drenth J (1994) Principles of protein X-ray crystallography. Springer, New York 5. Reiss-Husson F, Picot D (1999) Crystallization of membrane proteins. In: Ducruix A, Giégé R (eds) Crystallization of nucleic acids and proteins. A practical approach. Oxford University Press, Oxford

Crystallography of Membrane Proteins 6. Landau EM, Rosenbusch JP (1996) Lipidic cubic phases: a novel concept for the crystallization of membrane proteins. Proc Natl Acad Sci U S A 93:14532–14535 7. Mariani P, Luzzati V, Delacroix H (1988) Cubic phases of lipid-containing systems. Structure analysis and biological implications. J Mol Biol 204:165–189 8. Luzzati V, Tardieu A, Gulik-Krzywicki T, Rivas E, Reiss-Husson F (1968) Structure of the cubic phases of lipid-water systems. Nature 220:485–488 9. Scriven L (1976) Equilibrium bicontinuous structure. Nature 263:123–125 10. Landh T (1995) From entangled membranes to eclectic morphologies: cubic membranes as subcellular space organizers. FEBS Lett 369:13–17 11. Fontell K (1990) Cubic phases in surfactant and surfactant-like lipid systems. Colloid Polym Sci 268:264–285 12. Ericsson B. Eriksson P, Löfroth J, Engström S (1991) Cubic phases as delivery systems for peptide drugs. ACS symposium series 469:251–265 13. Gordeliy VI, Schlesinger R, Efremov R, Buldt G, Heberle J (2003) Crystallization in lipidic cubic phases: a case study with bacteriorhodops, vol 228, Methods in molecular biology. Humana Press, Clifton, NJ, pp 305–316 14. Nollert P, Landau EM (1998) Enzymic release of crystals from lipidic cubic phases. Biochem Soc Trans 26:709–713 15. Engström S, Alfons K, Rasmusson M, LjusbergWahren H (1998) Solvent-induced sponge (L3) phases in the solvent–monoolein–water system. Prog Colloid Polym Sci 108:93–98 16. Ridell A, Ekelund K, Evertsson H, Engström S (2003) On the water content of the solvent/monoolein/water sponge (L3) phase. Colloids Surf A Physicochem Eng Asp 228:17–24 17. Wadsten P, Wohri AB, Snijder A, Katona G, Gardiner AT, Cogdell RJ, Neutze R, Engstrom S (2006) Lipidic sponge phase crystallization of membrane proteins. J Mol Biol 364:44–53 18. Katona G, Andreasson U, Landau EM, Andreasson LE, Neutze R (2003) Lipidic cubic phase crystal structure of the photosynthetic reaction centre from Rhodobacter sphaeroides at 2.35 A resolution. J Mol Biol 331:681–692 19. Cherezov V, Clogston J, Papiz MZ, Caffrey M (2006) Room to move: crystallizing membrane proteins in swollen lipidic mesophases. J Mol Biol 357:1605–1618

101

20. Papiz MZ, Prince SM, Howard T, Cogdell RJ, Isaacs NW (2003) The structure and thermal motion of the B800-850 LH2 complex from Rps. acidophila at 2.0A resolution and 100K: new structural features and functionally relevant motions. J Mol Biol 326:1523–1538 21. Wohri AB, Johansson LC, WadstenHindrichsen P, Wahlgren WY, Fischer G, Horsefield R, Katona G, Nyblom M, Oberg F, Young G, Cogdell RJ, Fraser NJ, Engstrom S, Neutze R (2008) A lipidic-sponge phase screen for membrane protein crystallization. Structure 16:1003–1009 22. Kouyama T, Yamamoto M, Kamiya N, Iwasaki H, Ueki T, Sakurai I (1994) Polyhedral assembly of a membrane protein in its three-dimensional crystal. J Mol Biol 236:990–994 23. Takeda K, Sato H, Hino T, Kono M, Fukuda K, Sakurai I, Okada T, Kouyama T (1998) A novel three-dimensional crystal of bacteriorhodopsin obtained by successive fusion of the vesicular assemblies. J Mol Biol 283:463–474 24. Denkov ND, Yoshimura H, Kouyama T, Walz J, Nagayama K (1998) Electron cryomicroscopy of bacteriorhodopsin vesicles: mechanism of vesicle formation. Biophys J 74:1409–1420 25. Faham S, Bowie JU (2002) Bicelle crystallization: a new method for crystallizing membrane proteins yields a monomeric bacteriorhodopsin structure. J Mol Biol 316:1–6 26. Faham S, Boulting GL, Massey EA, Yohannan S, Yang D, Bowie JU (2005) Crystallization of bacteriorhodopsin from bicelle formulations at room temperature. Protein Sci 14: 836–840 27. Sanders CR, Schwonek JP (1992) Characterization of magnetically orientable bilayers in mixtures of dihexanoylphosphatidylcholine and dimyristoylphosphatidylcholine by solid-state NMR. Biochemistry 31:8898–8905 28. Sanders CR, Prestegard JH (1990) Magnetically orientable phospholipid bilayers containing small amounts of a bile salt analogue, CHAPSO. Biophys J 58:447–460 29. Sanders CR 2nd, Landis GC (1995) Reconstitution of membrane proteins into lipid-rich bilayered mixed micelles for NMR studies. Biochemistry 34:4030–4040 30. Rasmussen SG, Choi HJ, Rosenbaum DM, Kobilka TS, Thian FS, Edwards PC, Burghammer M, Ratnala VR, Sanishvili R, Fischetti RF, Schertler GF, Weis WI, Kobilka BK (2007) Crystal structure of the human beta2 adrenergic G-protein-coupled receptor. Nature 450:383–387

102

Deniaud et al.

31. Cherezov V, Rosenbaum DM, Hanson MA, Rasmussen SG, Thian FS, Kobilka TS, Choi HJ, Kuhn P, Weis WI, Kobilka BK, Stevens RC (2007) High-resolution crystal structure of an engineered human beta2adrenergic G protein-coupled receptor. Science 318:1258–1265 32. Ujwal R, Cascio D, Colletier JP, Faham S, Zhang J, Toro L, Ping P, Abramson J (2008) The crystal structure of mouse VDAC1 at 2.3 A resolution reveals mechanistic insights into metabolite gating. Proc Natl Acad Sci U S A 105:17742–17747 33. Hiller S, Garces RG, Malia TJ, Orekhov VY, Colombini M, Wagner G (2008) Solution structure of the integral human membrane protein VDAC-1 in detergent micelles. Science 321:1206–1210 34. Ivetac A, Sansom MS (2008) Molecular dynamics simulations and membrane protein structure quality. Eur Biophys J 37:403–409 35. http://www.mpdb.ul.ie/ 36. Riekel C, Burghammer M, Schertler G (2005) Protein crystallography microdiffraction. Curr Opin Struct Biol 15:556–562 37. http://www.mpibp-frankfur t.mpg.de/ michel/public/memprotstruct.html 38. Garavito RM, Ferguson-Miller S (2001) Detergents as tools in membrane biochemistry. J Biol Chem 276:32403–32406 39. Wiener MC (2004) A pedestrian guide to membrane protein crystallization. Methods 34:364–372 40. Dahout-Gonzalez C, Brandolin G, PebayPeyroula E (2003) Crystallization of the bovine ADP/ATP carrier is critically dependent upon the detergent-to-protein ratio. Acta Crystallogr A 59:2353–2355 41. Alexandrov AI, Mileni M, Chien EY, Hanson MA, Stevens RC (2008) Microscale fluorescent thermal stability assay for membrane proteins. Structure 16:351–359 42. daCosta CJ, Baenziger JE (2003) A rapid method for assessing lipid:protein and detergent:protein ratios in membrane-protein crystallization. Acta Crystallogr A 59:77–83 43. Pebay-Peyroula E (2007) Biophysical analysis of membrane proteins. Investigating structure and function. Wiley, United States 44. Slotboom DJ, Duurkens RH, Olieman K, Erkens GB (2008) Static light scattering to characterize membrane proteins in detergent solution. Methods 46:73–82 45. le Maire M, Arnou B, Olesen C, Georgin D, Ebel C, Moller JV (2008) Gel chromatography and analytical ultracentrifugation to determine

46.

47.

48.

49.

50.

51.

52.

53.

54. 55.

56.

57.

the extent of detergent binding and aggregation, and Stokes radius of membrane proteins using sarcoplasmic reticulum Ca2+-ATPase as an example. Nat Protoc 3:1782–1795 Abramson J, Smirnova I, Kasho V, Verner G, Kaback HR, Iwata S (2003) Structure and mechanism of the lactose permease of Escherichia coli. Science 301:610–615 Pebay-Peyroula E, Dahout-Gonzalez C, Kahn R, Trezeguet V, Lauquin GJ, Brandolin G (2003) Structure of mitochondrial ADP/ATP carrier in complex with carboxyatractyloside. Nature 426:39–44 Screpanti E, Padan E, Rimon A, Michel H, Hunte C (2006) Crucial steps in the structure determination of the Na+/H+ antiporter NhaA in its native conformation. J Mol Biol 362:192–202 Roth M, Lewit-Bentley A, Michel H, Deisenhofer J, Huber R, Oesterhelt D (1989) Detergent structure in crystals of a bacterial photosynthetic reaction center. Nature 340:659–662 Lemieux MJ, Reithmeier RA, Wang DN (2002) Importance of detergent and phospholipid in the crystallization of the human erythrocyte anion-exchanger membrane domain. J Struct Biol 137:322–332 Pebay-Peyroula E, Garavito RM, Rosenbusch JP, Zulauf M, Timmins PA (1995) Detergent structure in tetragonal crystals of OmpF porin. Structure 3:1051–1059 Iwata S, Ostermeier C, Ludwig B, Michel H (1995) Structure at 2.8 A resolution of cytochrome c oxidase from Paracoccus denitrificans. Nature 376:660–669 Zhou Y, Morais-Cabral JH, Kaufman A, MacKinnon R (2001) Chemistry of ion coordination and hydration revealed by a K+ channel-Fab complex at 2.0 A resolution. Nature 414:43–48 Hunte C, Michel H (2002) Crystallisation of membrane proteins mediated by antibody fragments. Curr Opin Struct Biol 12:503–508 Steiner D, Forrer P, Pluckthun A (2008) Efficient selection of DARPins with subnanomolar affinities using SRP phage display. J Mol Biol 382:1211–1227 Jacquamet L, Ohana J, Joly J, Borel F, Pirocchi M, Charrault P, Bertoni A, Israel-Gouy P, Carpentier P, Kozielski F, Blot D, Ferrer JL (2004) Automated analysis of vapor diffusion crystallization drops with an X-ray beam. Structure 12:1219–1225 Briggs J, Ching H, Caffrey M (1996) The temperature–composition phase diagram and mesophase structure characterization of

Crystallography of Membrane Proteins

58.

59.

60.

61.

62.

63.

64.

65.

66.

67.

the monoolein/water system. J Phys II 6:723–751 Belrhali H, Nollert P, Royant A, Menzel C, Rosenbusch JP, Landau EM, Pebay-Peyroula E (1999) Protein, lipid and water organization in bacteriorhodopsin crystals: a molecular view of the purple membrane at 1.9 A resolution. Structure 7:909–917 Luecke H, Schobert B, Richter HT, Cartailler JP, Lanyi JK (1999) Structure of bacteriorhodopsin at 1.55 A resolution. J Mol Biol 291:899–911 Pebay-Peyroula E, Rummel G, Rosenbusch JP, Landau EM (1997) X-ray structure of bacteriorhodopsin at 2.5 angstroms from microcrystals grown in lipidic cubic phases. Science 277:1676–1681 Kolbe M, Besir H, Essen LO, Oesterhelt D (2000) Structure of the light-driven chloride pump halorhodopsin at 1.8 A resolution. Science 288:1390–1396 Luecke H, Schobert B, Lanyi JK, Spudich EN, Spudich JL (2001) Crystal structure of sensory rhodopsin II at 2.4 angstroms: insights into color tuning and transducer interaction. Science 293:1499–1503 Royant A, Nollert P, Edman K, Neutze R, Landau EM, Pebay-Peyroula E, Navarro J (2001) X-ray structure of sensory rhodopsin II at 2.1-A resolution. Proc Natl Acad Sci U S A 98:10131–10136 Vogeley L, Sineshchekov OA, Trivedi VD, Sasaki J, Spudich JL, Luecke H (2004) Anabaena sensory rhodopsin: a photochromic color sensor at 2.0 A. Science 306:1390–1393 Reshetnyak AA, Borshchevskiy VI, Klare J, Moiseeva ES, Engelhardt M, Buldt G, Gordeliy VI (2008) Comparative analysis of Sensory rhodopsin II structures in complex with a transducer and without it. J Surf Invest 2:894–899 Gordeliy VI, Labahn J, Moukhametzianov R, Efremov R, Granzin J, Schlesinger R, Buldt G, Savopol T, Scheidig AJ, Klare JP, Engelhard M (2002) Molecular basis of transmembrane signalling by sensory rhodopsin II-transducer complex. Nature 419:484–487 Moukhametzianov R, Klare JP, Efremov R, Baeken C, Goppner A, Labahn J, Engelhard M,

68.

69.

70.

71.

72.

73.

74. 75.

76.

103

Buldt G, Gordeliy VI (2006) Development of the signal in sensory rhodopsin and its transfer to the cognate transducer. Nature 440:115–119 Chiu ML, Nollert P, Loewen MC, Belrhali H, Pebay-Peyroula E, Rosenbusch JP, Landau EM (2000) Crystallization in cubo: general applicability to membrane proteins. Acta Crystallogr A 56:781–784 Katona G, Snijder A, Gourdon P, Andreasson U, Hansson O, Andreasson LE, Neutze R (2005) Conformational regulation of charge recombination reactions in a photosynthetic bacterial reaction center. Nat Struct Mol Biol 12:630–631 Misquitta LV, Misquitta Y, Cherezov V, Slattery O, Mohan JM, Hart D, Zhalnina M, Cramer WA, Caffrey M (2004) Membrane protein crystallization in lipidic mesophases with tailored bilayers. Structure 12:2113–2124 Rosenbaum DM, Cherezov V, Hanson MA, Rasmussen SG, Thian FS, Kobilka TS, Choi HJ, Yao XJ, Weis WI, Stevens RC, Kobilka BK (2007) GPCR engineering yields high-resolution structural insights into beta2-adrenergic receptor function. Science 318:1266–1273 Jaakola VP, Griffith MT, Hanson MA, Cherezov V, Chien EY, Lane JR, Ijzerman AP, Stevens RC (2008) The 2.6 angstrom crystal structure of a human A2A adenosine receptor bound to an antagonist. Science 322:1211–1217 Persson G, Edlund H, Lindblom G (2004) Phase behaviour of the 1-monooleoylracglycerol/n-octyl-b-d-glucoside/water system. Prog Colloid Polym Sci 123:36–39 Grabe M, Neu J, Oster G, Nollert P (2003) Protein interactions and membrane geometry. Biophys J 84:854–868 Efremov R, Shiryaeva GN, Islamov A, Kuklin A, Yaguzhinsky L, Fragneto-Cusani G, Bueldt G, Gordeliy VI (2005) SANS investigations of the lipidic cubic phase behaviour in course of bacteriorhodopsin crystallization. J Cryst Growth 275:1453–1459 Cherezov V, Caffrey M (2007) Membrane protein crystallization in lipidic mesophases. A mechanism study using X-ray microdiffraction. Faraday Discuss 136:195–212 (discussion 3–29)

as

Chapter 6 Structural Approaches of the Mitochondrial Carrier Family Hugues Nury, Iulia Blesneac, Stephanie Ravaud, and Eva Pebay-Peyroula Abstract The transport of solutes across the inner mitochondrial membrane is highly selective and necessitates membrane proteins mainly from the mitochondrial carrier family (MCF). These carriers are required for the transport of a variety of metabolites implicated in all the important processes occurring within the mitochondrial matrix. Due to its high abundance, the ADP/ATP carrier (AAC) is the member of the family that was studied most. It is the first mitochondrial carrier for which a high-resolution X-ray structure is known. The carrier was crystallized in the presence of a strong inhibitor, the carboxyatractyloside (CATR). The structure gives an insight not only into the overall fold of mitochondrial carriers in general but also into atomic details of the AAC in a conformation that is open toward the intermembrane space (IMS). Molecular dynamics simulations indicate the first events occurring to the carrier after the binding of ADP. A careful analysis of the primary sequences of all the carriers in light with the structure highlights properties of the protein that are related to the substrate. Key words: Membrane protein, Transport, Inner mitochondrial membrane, ADP and ATP transport

1. Introduction Mitochondria are organelles in which major metabolic cycles, Krebs cycle, degradation of fatty acids, and others take place. They also provide the environment to produce ATP, the main fuel for cellular processes. The molecular complexes of the respiratory chain located in the inner membrane create a proton gradient that is further used by the ATP-synthase, a membrane machinery, which catalyzes the production of ATP from ADP and inorganic phosphate. The structures of several components of the respiratory chain, including domains of the ATP-synthase (1, 2), are now known at quasi-atomic resolution and have shed light Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_6, © Springer Science+Business Media, LLC 2010

105

106

Nury et al.

Fig. 1. The MCF motif. The characteristic MCF motif is found within each repeat of 100 amino acids present three-times in MCF carriers. The limits of the helices determined in the AAC structure are superimposed on the consensus MCF sequence. It spans over the second part of an odd-numbered helix (Hn) to the first part of the following evennumbered helix (Hn+1). “a” represents an aromatic amino acid.

into major mechanisms, such as the rotary mechanism of ATP-synthase. The production of ATP as well as the function of all the major cycles is strongly connected to the import and export of metabolites between the cytosol and the mitochondrial matrix. The metabolites are small molecules with a large variety of size and chemical characteristics. They are able to cross the outer membrane relatively easily via a porin-like membrane protein, VDAC, which structure has been solved in 2008 by NMR (3) and X-ray crystallography (4). The transport through VDAC is rather unspecific and mainly limited by the size of the substrate with a slight preference for anionic molecules. On the contrary, the inner membrane is impermeable, and molecules are only transported through very specific membrane carriers. Many of these carriers belong to the MCF characterized by a specific motif of amino acids (Fig. 1) first identified by Walker et al. (5) and further described in (6). More than 40 carriers were described in human. They are encoded by the SLC25 gene family, and several are related to diseases as reviewed in (7). An overview of plant mitochondrial carriers is presented in (8). The ADP/ATP carrier (AAC) is the most abundant – it represents 10% of the proteins in bovine heart mitochondria – and was therefore studied extensively in the last 30 years (9). It serves as a model for other MCF carriers. This chapter first details the structural studies of the bovine AAC, discusses possible implications for the transport mechanism, and in particular, the import of ADP, and finally discusses general implications for the whole MCF.

2. Methods to Solve the Structure of Bovine AAC 2.1. General Strategy

The 3D structure of the bovine AAC was solved by X-ray crystallography (10). The same year, Kunji et al. published a 2D projection map obtained by electron microscopy from 2D crystals (11). Transporters often undergo large conformational changes in order to shuttle molecules from one side of the membrane to the other. As a consequence, in the absence of ligands, they display large conformational heterogeneities in solution. In addition, detergent molecules that surround the protein after solubilization

Structural Approaches of the Mitochondrial Carrier Family

107

do not provide the same environment as the native lipidic membrane, and large conformational changes are not necessarily well controlled and might favor the loss of the tertiary structure. It is therefore of interest to find the appropriate conditions to stabilize a single conformation and block the transport process. This can be achieved by several ways: binding a ligand that is not transported, crystallizing a single point mutant that has no transport activities as done for the lactose permease (12), or crystallizing at a pH in which the transport is blocked as exemplified with the Na/H antiporter (13). All these strategies necessitate a good knowledge of the biochemical and functional properties of the transporter. Inhibitors of AAC were well described since the 1980s. Two families of molecules are known, atractylosides (ATR) and carboxyatractyloside (CATR) on one hand, and bongkrekic acid (BA) and iso-bongkrekic acid on the other. The first is known to block the entrance of ADP from the intermembrane space (IMS) (14), the second prevents the export of ATP from the matrix (15). They were described to induce the extreme conformations adopted by the carrier during the transport process. These inhibitors are therefore adequate tools for structural characterization of AAC. 2.2. 3D Crystallization

AAC was extracted and purified from bovine heart mitochondria with a protocol derived from (16). Importantly, mitochondria have to be incubated in the presence of CATR in order to stabilize the carrier from the first step of extraction (17). Because of the constraints related to the purification of a nontagged protein, many lipid and detergent molecules are copurified with the carrier and cannot be washed away easily. As a consequence, the protein solution after concentration to 10 mg/mL for the crystallization contains about 20 g of detergent per gram of protein, a much larger amount than usual. In particular, the very large amount of detergent was incompatible with the lipid cubic phase approach (18) and prevented crystallization in the standard vapor diffusion method. Excess detergent was therefore removed using polystyrene beads (moist activated SM2 Biobeads, Bio-Rad) known to adsorb detergent molecules at their surface. Biobeads are commonly used for 2D crystallization (19). In the presence of lipids, they are added to proteins solubilized in detergent. The adsorption of detergents drives the incorporation of proteins into the liposomes. For AAC, the amount of removed detergent as a function of time could be followed by radioactive labeled detergent (17). Although the crystallization was achieved by vapor diffusion, the presence of endogenous lipids probably favored the crystallization in a similar way as for 2D crystallization. Partially removing the detergent forces the protein to interact with lipids, and the presence of precipitants favors protein–protein interactions. As a result, the crystals can be described as ordered stacks of 2D crystals.

108

Nury et al.

2.3. Structure Determination

Two crystal forms were necessary to solve the structure. The first crystal form had a P 21212 symmetry. Most of these crystals contained a defect, that is, a few protein layers (perpendicular to the c-axis) were displaced along the b-axis by approximately 2 Å (17). However, a diffraction data set to 2.2 Å could be collected from a perfect crystal and was used for the refinement of the molecular model. The high rate of defects prevented the search for heavy atom derivatives necessary for phase determination. Therefore, another crystal form was needed. Because the crystallization trials had highlighted the strong influence of various salts, the salt concentration in the buffer used for the final purification step was reduced (5 mM NaCl instead of 100 mM). New crystallization set-ups were made, and a C 2221 crystal form was identified. Despite anisotropic diffraction, this form could be used for finding heavy atom derivatives that allowed the phases to be determined to a resolution of 3 Å. Three mercury derivatives were used, and they interacted with the four cysteins of AAC with different occupancies. A first model was constructed from the experimental electron density and transposed by molecular replacement to the 2.2 Å data of the primitive crystal form. The model was progressively completed and refined on the basis of 2Fo-Fc electron density maps combined with the experimental MIR maps (translated and rotated so that they would match with the position of the carrier in the P 21212 cell). The refinement was then conducted by alternate cycles of manual modifications using the program O (20) and energy minimization or simulated annealing using CNS (21). After complete reconstruction of the molecule with the exception of a little number of disordered amino acids in the N- and C-termini, the location of the inhibitor was clearly visible from 2Fo-Fc, and Fo-Fc electron density maps in the center of the large cavity formed by the six tilted and partially kinked transmembrane (TM) helices. In the final electron density, a few lipid molecules could be modeled. By preparing the protein without any salt in the final purification step, a slightly different unit cell in the C2221 symmetry was obtained. The refinement of this structure showed three cardiolipins organized around the TM helices on the matrix-side leaflet of the membrane (22).

2.4. Description of the Structure

As predicted from the primary sequence analysis, the carrier has six helices that span over the membrane, connected by short loops on the side of the IMS. The overall fold is shown in Fig. 2. On the matrix side, the amino acids connecting the TM helices do not form long unstructured loops as depicted in previous published articles but are partially structured in small amphipathic helices as predicted in (23). The TM helices are all six tilted with respect to the axis that is perpendicular to the membrane, and the three odd-numbered ones are kinked by more than 30° at the level of the conserved prolines of the MCF motifs. A cavity largely open

Structural Approaches of the Mitochondrial Carrier Family

109

Fig. 2. Ribbon representation of AAC. Panels (a), (b) and (c) view the bovine ADP/ATP carrier from the intermembrane space, the matrix and the side, respectively. The N- and C-termini are labeled in panel (c). Panels (a) and (b) indicate the transmembrane and the short amphipathic helices, respectively.

toward the IMS and penetrating deeply toward the matrix side is formed by the tilts and kinks of the TM helices. The three oddnumbered helices are longer than initially predicted. It can also be noted that all the TM helices are largely surrounded by solvent in the central cavity that penetrates deeply into the carrier (Fig. 3). This is particularly the case for H2, which has a low hydrophobic score and was not easily predictable as a TM helix from the primary sequence. The structure solved to 2.2 Å revealed several interesting aspects (10). The inhibitor, CATR, cocrystallized with the carrier, is located in the cavity with its diterpene moiety oriented toward the matrix and the two sulfate groups pointing toward the IMS. Almost all the chemical groups of CATR interact with AAC via hydrogen bonds or van der Waals contacts, except the primary alcohol group located on the sugar moiety that points toward the cavity. The tight interaction between AAC and CATR is consistent with the nanomolar affinity of the inhibitor for AAC. Both carboxylates of CATR interact with the carrier, one through a direct electrostatic interaction, the other via a water molecule,

110

Nury et al.

Fig. 3. The cavity and the binding site of CATR. The figure shows a longitudinal section through the cavity, which is largely open toward the intermembrane space. The surface of the protein is represented. The inhibitor, CATR, is depicted in dots filling the van der Waals volumes of CATR atoms.

thus explaining the slightly lower affinity for ATR, which has only one carboxylate group. The most interesting findings are the locations of the MCF residues. The first halves of the MCF motifs are located on the odd-numbered helices, the two first charged amino acids following the proline interact via salt bridges and connect the helices two by two as shown in Fig. 4. These salt bridges were predicted from mutant studies (24). The kinks at the level of the signature prolines bring the charged residues in close vicinity and allow the formation of salt bridges, thus forcing the closure of AAC toward the matrix. 2.5. Oligomerization, Organization in the Membrane, 2D Crystals

The structure solved by X-ray crystallography demonstrated that a monomeric form can exist as a stable entity, and that it contains a cavity, the size of which is compatible with nucleotide binding. With a few structural rearrangements, it is plausible that the transport occurs through this single monomer. The C 2221 crystal form highlights protein–protein contacts mediated by cardiolipins (22). However, the contacts are rather weak and could be induced by the crystallization. At the same time, a 2D projection map of the yeast AAC3 was determined by electron microscopy (11). The carriers located in the projection maps are compatible with the X-ray structure and are organized in the 2D layer without obvious dimerization. The carrier solubilized in detergent was analyzed by several functional, biochemical, and biophysical approaches, most of them showing a monomer (25–27). All these findings seem to

Structural Approaches of the Mitochondrial Carrier Family

111

Fig. 4. Salt bridges formed by MCF residues. The residues of the first part of the MCF are located on helices H1, H3 and H5. Prolines are responsible for the sharp kinks of the odd-numbered helices, acidic and basic residues form salt bridges that connect these three helices two by two.

contradict previous evidences for dimerization. Whether the carrier forms dimers in the membrane and whether its oligomerization is necessary for the function are still open questions. It is highly probable that local high concentration of AAC in the mitochondrial membrane favors close packing that could account for an efficient transport. The 2D crystallization was done in the presence of ATR and the carrier has therefore the same conformation as in the 3D crystals. Even though the resolution of 2D crystals is much lower than 3D crystals, 2D crystals have the advantage of being formed by a monolayer of proteins that are accessible from both sides making the exchange of inhibitors by soaking possible.

3. The Transport Mechanism 3.1. What Is Suggested by the Structure?

The current structure is that of AAC in a single conformation. It represents most probably the conformation able to attract and bind ADP from the IMS or at least is close to this ADP conformation.

112

Nury et al.

Although the structure is not sufficient to fully understand the transport mechanism, it highlights the location of residues that are known to be important for the function of AAC. It also serves as a basis for molecular dynamic simulations. It was known from the amino acid sequence analysis that AACs from all species contain a highly conserved sequence in the third MCF motif, RRRMMM, where the first and last arginines belong to the MCF motif. In addition, this sequence is highly specific and is not found in any other carriers (and even any other protein). Therefore, the RRRMMM signature has to be related to the function. The structure revealed that the sequence spans over the 10 Å from the bottom of the cavity to almost the matrix side. R234 and R235 both point toward the cavity and interact with the inhibitor. R236 is oriented toward the matrix, not far from the surface, and its side chain is implicated in a salt bridge with E264. In the current known conformation, R234 and R235 are positioned to interact with an ADP that would enter from IMS. R236 is buried, but even a small conformational change would allow this residue to be accessible from the matrix so that it could possibly interact with ATP. Because the carrier transports negatively charged molecules, and before the structure was known, many of the conserved basic residues were mutated in order to test their functional relevance (28–30). The structure reveals that several of them are located in the cavity and form basic patches that line the cavity from the entrance to the bottom. In parallel, three tyrosines form an intriguing “ladder” from the entrance of the cavity to the bottom and were postulated to guide the nucleotide ring (31). The mutation of two of these tyrosines is known to prevent AAC to function (32). The location of the MCF residues suggests that the kinks could be modulated by the existence, or nonexistence of the salt bridges, while the second part of the motif would act as a hinge that could follow the movement induced by a kink modification of the odd-numbered helices. 3.2. Molecular Dynamics Simulations on AAC

Since the structure of AAC was published, several groups have performed molecular dynamics simulations on the AAC. The first approach aimed at following conformational changes occurring to the carrier in the absence of CATR (33). These authors compared the trajectories in the presence and in the absence of CATR and found that a number of salt bridges close to the matrix side are rather unstable in the absence of CATR. Their studies also highlighted that odd-numbered helices can undergo wobble movements around the conserved prolines, whereas even-numbered helices can undergo a face rotation. The rotation of these helices is partially consistent with experimental data that evidenced a difference in their accessibility measured between the states blocked by CATR and BA, respectively (34). Although these simulations

Structural Approaches of the Mitochondrial Carrier Family

113

were performed on very short time scales (10 ns), such observations could indicate the first events of conformational changes that occur during the transport. Similar calculations but with longer time scales (4 × 20 ns) also hinted to a possible hinge role of the conserved prolines (35). Longer MD simulations up to 0.3 ms (36) and 0.53 ms (37) in the presence of ADP highlighted a strong electrostatic funnel that drives ADP3− very fast into the cavity. Both studies showed the binding site of ADP at the bottom of the cavity with its phosphates pointing toward the matrix side and interacting with the residues of the conserved salt bridges. The binding of ADP could therefore perturb the salt bridges formed by the conserved MCF residues and induce the first events leading to the opening toward the matrix side. The three salt bridges are not equally involved. In particular MCF residues from helix 3 are not involved in ADP binding, compared to MCF residues in helices 1 and 5 (37). It has to be noted that MCF residues from helix 3 are the less conserved among all the AACs. 3.3. Comparison and Differences with Other Transporters

The structures of several membrane transporters are available in the Protein Data Base and listed in (38). Interestingly, many transporters have more than ten transmembrane helices. The transport pathway for the substrates is usually located in the middle of the protein surrounded by an inner shell of TM helices, whereas the other helices constitute an outer shell. These transporters adopt a more compact form, and conformational changes during the transport process mainly occur within the central part of the protein. The outer layer of TMs could dampen the movements. In AAC, within its half facing the IMS, the six TM helices form a thin protein-layer, which separates the polar cavity from the lipidic membrane. Therefore, each single helix is in contact with the solvent on one side and with lipids on the other. The helical bundle is quite dynamic on the side of the IMS as shown by normal mode calculations and by the experimental temperature factors of the N- and C-termini and cytoplasmic loops. The structure and dynamics of AAC could explain the extreme fragility of the carrier outside the lipidic bilayer. An excess of detergent can interfere with helix–helix interactions and destroy the 3D structure, unless it is stabilized by a strongly bound inhibitor. Individual cardiolipin molecules were shown to be still bound to AAC after purification from the mitochondrial membrane. These lipids formed by four acyl chains (Fig. 5a) are abundant in the mitochondrial membrane. Three of them were identified in the crystal structure (Fig. 5b, c). Their tight interaction with AAC residues belonging to the second half of the MCF motif is noticeable. If a hinge movement around these residues occurs, then the lipids could control the movement, and follow the helices in order to avoid hydrophobic mismatch provoked by twists or kink modifications of the helices.

114

Nury et al.

Fig. 5. Interactions of cardiolipins and AAC. (a) Stick representation of a cardiolipin molecule. (b, c) The three cardiolipins interacting with AAC are represented as van der Waals spheres in black, and residues from the second half of the MCF motif in grey, the backbone of AAC is depicted in white. (b) is viewed from the side and highlights the large surface of AAC covered by the cardiolipins. (c) is viewed from the matrix and represents the interactions between cardiolipins and the residues from the second half of the MCF motif.

4. The Mitochondrial Carrier Family

The structure of AAC is still the only structure of a MCF carrier. Despite the relative low sequence identities (maximum of 20%), the presence of the three MCF motifs most probably constrains the structures so that the overall fold of each carrier should be similar. Indeed, Fig. 6 shows the volume occupied by the MCF motif residues. The first half of the three MCF motifs including the prolines forming the kinks and the salt bridges that stabilize the kinked helices, form a compact core of the carrier (represented in dark gray in Fig. 6). The second part of the three MCF motifs (in light grey) surrounds this core volume. The conservation of the MCF motifs and their location in the core part of the structure as seen from AAC, most probably shapes all of these carriers in a similar way. However, the substrates transported differ drastically by their size and chemical nature. Several homology models of other MCF members were built on the basis of AAC. Because the sequence identities are less than 20%, their significance as such is questionable but the models guide the choice of mutants in functional studies as shown for the ornithine/citrulline carrier (39), or for the oxoglutarate carrier (40). Residues that are highly conserved among AACs are concentrated within the cavity (41). This is consistent with the high selectivity of the carriers for their substrates. Although the overall

Structural Approaches of the Mitochondrial Carrier Family

115

Fig. 6. Location of the MCF motif residues. The residues that are characteristic of the MCF motif are represented as van der Waals spheres. The first part of the motif is depicted in dark grey, the second half in light grey, the surface of the carrier is in white. Panel (a) is viewed from the side, panel (b) from the matrix.

backbone of AAC (and most probably of all other MCF carriers) displays a threefold pseudo-symmetry, the deviation from the symmetry by the side chains present in the cavity, is certainly related to the binding properties of non-symmetrical substrates. This idea was exploited by the group of Kunji. After a first sequence analysis coupled to homology modeling from which they postulated a common substrate binding site for MCFs (42), Kunji et al. compared the three repeats within each MCF member and scored the deviations from a threefold symmetry within these repeats. The representation of the deviations on the 3D structures highlights clusters of high deviation within the cavity postulated to be related to substrate binding (43).

5. Conclusion The knowledge of the biochemical behavior of AAC purified from bovine heart mitochondria gained over two decades, together with the development and expertise in crystallization of membrane proteins, made it possible to obtain crystals of high diffracting quality. The X-ray structure of AAC determined in the presence of the inhibitor CATR gave a first insight in ADP/ATP transport. It opened the way to molecular dynamics and new experiments. However, a complete understanding of the transport mechanism and its selectivity awaits further experimental results, that is, structures of other conformations and other MCF carriers. In particular the AAC structure in the presence of BA would be a major step forward.

116

Nury et al.

References 1. Abrahams JP, Leslie AG, Lutter R, Walker JE (1994) Structure at 2.8 A resolution of F1-ATPase from bovine heart mitochondria. Nature 370:621–628 2. Stock D, Leslie AG, Walker JE (1999) Molecular architecture of the rotary motor in ATP synthase. Science 286:1700–1705 3. Hiller S, Garces RG, Malia TJ, Orekhov VY, Colombini M, Wagner G (2008) Solution structure of the integral human membrane protein VDAC-1 in detergent micelles. Science 321:1206–1210 4. Ujwal R, Cascio D, Colletier JP, Faham S, Zhang J, Toro L, Ping P, Abramson J (2008) The crystal structure of mouse VDAC1 at 2.3 A resolution reveals mechanistic insights into metabolite gating. Proc Natl Acad Sci U S A 105:17742–17747 5. Walker JE, Runswick MJ (1993) The mitochondrial transport protein superfamily. J Bioenerg Biomembr 25:435–446 6. Jezek P, Jezek J (2003) Sequence anatomy of mitochondrial anion carriers. FEBS Lett 534:15–25 7. Palmieri F (2008) Diseases caused by defects of mitochondrial carriers: a review. Biochim Biophys Acta 1777:564–578 8. Laloi M (1999) Plant mitochondrial carriers: an overview. Cell Mol Life Sci 56:918–944 9. Klingenberg M (2008) The ADP and ATP transport in mitochondria and its carrier. Biochim Biophys Acta 1778:1978–2021 10. Pebay-Peyroula E, Dahout-Gonzalez C, Kahn R, Trezeguet V, Lauquin GJ, Brandolin G (2003) Structure of mitochondrial ADP/ATP carrier in complex with carboxyatractyloside. Nature 426:39–44 11. Kunji ER, Harding M (2003) Projection structure of the atractyloside-inhibited mitochondrial ADP/ATP carrier of Saccharomyces cerevisiae. J Biol Chem 278:36985–36988 12. Abramson J, Smirnova I, Kasho V, Verner G, Kaback HR, Iwata S (2003) Structure and mechanism of the lactose permease of Escherichia coli. Science 301:610–615 13. Hunte C, Screpanti E, Venturi M, Rimon A, Padan E, Michel H (2005) Structure of a Na/H antiporter and insights into mechanism of action and regulation by pH. Nature 435:1197–1202 14. Block MR, Lauguin GJ, Vignais PV (1981) Atractyloside and bongkrekic acid sites in the mitochondrial ADP/ATP carrier protein. An appraisal of their unicity by chemical modifications. FEBS Lett 131:213–218

15. Henderson PJ, Lardy HA (1970) Bongkrekic acid. An inhibitor of the adenine nucleotide translocase of mitochondria. J Biol Chem 245:1319–1326 16. Kramer R, Aquila H, Klingenberg M (1977) Isolation of the unliganded adenosine 5’-diphosphate, adenosine 5’-triphosphate carrier-linked binding protein and incorporation into the membranes of liposomes. Biochemistry 16:4949–4953 17. Dahout-Gonzalez C, Brandolin G, PebayPeyroula E (2003) Crystallization of the bovine ADP/ATP carrier is critically dependent upon the detergent-to-protein ratio. Acta Crystallogr 59:2353–2355 18. Landau EM, Rosenbusch JP (1996) Lipidic cubic phases: a novel concept for the crystallization of membrane proteins. Proc Natl Acad Sci U S A 93:14532–14535 19. Lacapere JJ, Stokes DL, Olofsson A, Rigaud JL (1998) Two-dimensional crystallization of Ca-ATPase by detergent removal. Biophys J 75:1319–1329 20. Jones TA, Zou JY, Cowan SW, Kjeldgaard M (1991) Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr A 47:110–119 21. Brunger AT, Adams PD, Clore GM, DeLano WL, Gros P, Grosse-Kunstleve RW, Jiang JS, Kuszewski J, Nilges M, Pannu NS, Read RJ, Rice LM, Simonson T, Warren GL (1998) Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr 54:905–921 22. Nury H, Dahout-Gonzalez C, Trezeguet V, Lauquin G, Brandolin G, Pebay-Peyroula E (2005) Structural basis for lipid-mediated interactions between mitochondrial ADP/ ATP carrier monomers. FEBS Lett 579:6031–6036 23. Panneels V, Schussler U, Costagliola S, Sinning I (2003) Choline head groups stabilize the matrix loop regions of the ATP/ADP carrier ScAAC2. Biochem Biophy Res Commun 300:65–74 24. Nelson DR, Felix CM, Swanson JM (1998) Highly conserved charge-pair networks in the mitochondrial carrier family. J Mol Biol 277:285–308 25. Bamber L, Harding M, Monne M, Slotboom DJ, Kunji ER (2007) The yeast mitochondrial ADP/ATP carrier functions as a monomer in mitochondrial membranes. Proc Natl Acad Sci U S A 104:10830–10834

Structural Approaches of the Mitochondrial Carrier Family 26. Bamber L, Slotboom DJ, Kunji ER (2007) Yeast mitochondrial ADP/ATP carriers are monomeric in detergents as demonstrated by differential affinity purification. J Mol Biol 371:388–395 27. Nury H, Manon F, Arnou B, le Maire M, Pebay-Peyroula E, Ebel C (2008) Mitochondrial bovine ADP/ATP carrier in detergent is predominantly monomeric but also forms multimeric species. Biochemistry 47:12319–12331 28. Nelson DR, Lawson JE, Klingenberg M, Douglas MG (1993) Site-directed mutagenesis of the yeast mitochondrial ADP/ATP translocator. Six arginines and one lysine are essential. J Mol Biol 230:1159–1170 29. Muller V, Heidkamper D, Nelson DR, Klingenberg M (1997) Mutagenesis of some positive and negative residues occurring in repeat triad residues in the ADP/ATP carrier from yeast. Biochemistry 36:16008–16018 30. Heimpel S, Basset G, Odoy S, Klingenberg M (2001) Expression of the mitochondrial ADP/ATP carrier in Escherichia coli. Renaturation, reconstitution, and the effect of mutations on 10 positive residues. J Biol Chem 276:11499–11506 31. Pebay-Peyroula E, Brandolin G (2004) Nucleotide exchange in mitochondria: insight at a molecular level. Curr Opin Struct Biol 14:420–425 32. David C, Arnou B, Sanchez JF, Pelosi L, Brandolin G, Lauquin GJ, Trezeguet V (2008) Two residues of a conserved aromatic ladder of the mitochondrial ADP/ATP carrier are crucial to nucleotide transport. Biochemistry 47:13223–13231 33. Falconi M, Chillemi G, Di Marino D, D’Annessa I, Morozzo della Rocca B, Palmieri L, Desideri A (2006) Structural dynamics of the mitochondrial ADP/ATP carrier revealed by molecular dynamics simulation studies. Proteins 65:681–691 34. Kihira Y, Iwahashi A, Majima E, Terada H, Shinohara Y (2004) Twisting of the second transmembrane alpha-helix of the mitochon-

35.

36.

37.

38. 39.

40.

41.

42.

43.

117

drial ADP/ATP carrier during the transition between two carrier conformational states. Biochemistry 43:15204–15209 Johnston JM, Khalid S, Sansom MS (2008) Conformational dynamics of the mitochondrial ADP/ATP carrier: a simulation study. Mol Membr Biol 25:506–517 Wang Y, Tajkhorshid E (2008) Electrostatic funneling of substrate in mitochondrial inner membrane carriers. Proc Natl Acad Sci U S A 105:9598–9603 Dehez F, Pebay-Peyroula E, Chipot C (2008) Binding of ADP in the mitochondrial ADP/ ATP carrier is driven by an electrostatic funnel. J Am Chem Soc 130:12725–12733 http://blanco.biomol.uci.edu/Membrane_ Proteins_xtal.html Tonazzi A, Giangregorio N, Palmieri F, Indiveri C (2005) Relationships of Cysteine and Lysine residues with the substrate binding site of the mitochondrial ornithine/citrulline carrier: an inhibition kinetic approach combined with the analysis of the homology structural model. Biochim Biophys Acta 1718:53–60 Cappello AR, Miniero DV, Curcio R, Ludovico A, Daddabbo L, Stipani I, Robinson AJ, Kunji ER, Palmieri F (2007) Functional and structural role of amino acid residues in the odd-numbered transmembrane alphahelices of the bovine mitochondrial oxoglutarate carrier. J Mol Biol 369:400–412 Nury H, Dahout-Gonzalez C, Trezeguet V, Lauquin GJ, Brandolin G, Pebay-Peyroula E (2006) Relations between structure and function of the mitochondrial ADP/ATP carrier. Annu Rev Biochem 75:713–741 Robinson AJ, Kunji ER (2006) Mitochondrial carriers in the cytoplasmic state have a common substrate binding site. Proc Natl Acad Sci U S A 103:2617–2622 Robinson AJ, Overy C, Kunji ER (2008) The mechanism of transport by mitochondrial carriers based on analysis of symmetry. Proc Natl Acad Sci U S A 105:17766–17771

as

Chapter 7 What Can Be Learned About the Function of a Single Protein from Its Various X-Ray Structures: The Example of the Sarcoplasmic Calcium Pump Jesper Vuust Møller, Claus Olesen, Anne-Marie Lund Winther, and Poul Nissen Abstract Improvements in the handling of membrane proteins for crystallization, combined with better synchrotron sources for X-ray diffraction analysis, are leading to clarification of the structural details of an ever increasing number of membrane transporters and receptors. Here we describe how this development has resulted in the elucidation at atomic resolution of a large number of structures of the sarcoplasmic Ca2+ATPase (SERCA1a) present in skeletal muscle. The structures corresponding to the various intermediary states have been obtained after stabilization with structural analogues of ATP and of metal fluorides as mimicks of inorganic phosphate. From these results it is possible, in accordance with previous biochemical and molecular biology data, to give a detailed structural description of both ATP hydrolysis and Ca2+ transport through the membrane, to serve as the starting point for a fuller understanding of the pump mechanism and, in future studies, on the regulatory role of this ubiquitous intracellular Ca2+-ATPase in cellular Ca2+ metabolism in normal and pathological conditions. Key words: X-ray diffraction, Sarcoplasmic reticulum, Ca2+-ATPase, Ca2+ transport, Metal fluorides, P-type ATPases

1. Introduction Studies on the calcium pump present in skeletal muscle (SERCA1a) can be traced back to the demonstration by Marsh (1) of a socalled relaxation factor, remaining as a particulate fraction in the supernatant of a muscle homogenate after removal of the fibrils by low-speed centrifugation. The fraction had the remarkable property of temporarily inducing relaxation by a process that could be attributed to the removal of Ca2+ from the suspension medium Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_7, © Springer Science+Business Media, LLC 2010

119

120

Møller et al.

both when added to a contractile actomyosin preparation and to an ion permeabilized muscle fibre (2, 3). Further studies carried out by a number of investigators in the 1960s identified the factor as the sarcoplasmic reticulum (SR), an organelle central to the removal and release of Ca2+ during the relaxation/contraction cycle. The characteristics of the Ca2+ uptake process and its ATP dependence were described by Ebashi and Lipmann (4). Rather than being due to binding it was demonstrated by Hasselbach and Makinose that Ca2+ is accumulated inside the SR by an ATP requiring transport in isolated membrane preparations (5). The relation of the Ca2+-transporting ATPase to that of Na+,K+ATPase became clear with the realization that both pumps utilize a common acid-stable aspartylphosphorylated intermediate to perform active transport (6). The important physiological function of the sarcoplasmic 2+ Ca -ATPase in the relaxation/contraction cycle and easy access to purification in large yield, soon made this enzyme a favourite for fundamental studies on the nature of active transport mechanisms. In the following decades (1970–2000), hundreds, even thousands of papers were devoted to functional and structural studies of the enzyme. Concurrent with developments in the study of other P-type ATPases, in particular of Na+,K+-ATPase this has led to the formulation of different reaction schemes. Among these, the scheme originally proposed for Ca2+-ATPase by de Meis and Vianna (7) has gained general acceptance. Figure 1 shows a present day version of the scheme on which (8) the functional interpretation of our structural data are based. In the initial reaction a Ca2+-dependent phosphorylation by ATP leads to the formation of a high energy Ca2E1~P intermediate,

Fig. 1. Reaction scheme of the sarcoplasmic Ca2+-ATPase, showing binding of Ca2+ to the E1 conformations on the cytosolic side (upper part of the scheme), and release of Ca2+, in exchange with protons, to the E2P conformation on the lower part of the scheme. , transition states, with occluded Ca2+ and H+ shown in brackets; (K+) refers to the steps accelerated by binding of K+ to a modulatory binding site near the P-domain.

What Can Be Learned About the Function of a Single Protein

121

phosphorylated at Asp 351. This is followed by a conformational change of the protein to an E2P state during which two Ca2+ ions, bound inside the membrane, are released towards the SR lumen. This occurs in exchange for 2–3 H+ to form an (Hn)E2-P intermediate (9–11) in which form protons in a backward transport reaction are transferred from the luminal to the cytoplasmic side of the SR membrane. This translocation is accompanied by dephosphorylation of the protein to form an E2 conformation with the intramembranous binding sites oriented towards the cytoplasmic side. In between the E1/E1P and E2P/E2 forms the protein is present in transition states, marked by stars with occluded Ca2+ and H+, shown in brackets, barred from contact with both the cytosolic and intralumenal medium. For the kinetic analysis more elaborate and branched reaction schemes than that shown in Fig. 1 have been formulated (e.g., refs. 8, 12, 13) that take into account other factors such as the effect of stepwise association of Ca2+ with the Ca2+-ATPase, the modulatory effect of ATP, the effect of protein–protein interactions (8) and of K+ on the E2 reactions (14, 15), and a number of other environmental variables like pH, inhibition by intracellularly accumulated Ca2+ and Ca2+/Ca2+ exchange (16, 17). However, with respect to ATP the intracellular concentrations in the physiological situation are sufficiently high that, as indicated in the scheme, all intermediates (with the exception of the product of the ATP phosphorylation step, ([Ca2]E1~P-ADP) can be regarded as having ATP bound in a modulatory mode, irrespective of nucleotide level. Thus although a high intracellular ATP level undoubtedly is important for transport efficiency, even severe physiological changes in ATP concentration can be expected to occur without regulatory consequences. A similar statement can be made concerning the effect of intracellular K+ on the potassium-sensitive E2 reactions, since the intracellular level of K+ by far exceeds the dissociation constant for the binding of this cation to the ATPase. Regardless of the inclusion of these and other kinetic aspects, it has to be stated that reaction schemes are by themselves not of much help to explain how transport across a biological membrane occurs. In the present case, it is clear that efficient structurally based mechanisms must exist for the enzyme to be capable of maintaining a precise 2:1 Ca2+/ATP transport stoichiometry over a wide range of cytosolic Ca2+ concentrations and to be able to accumulate Ca2+ against very high (up to 104:1) concentration gradients (18). The Ca2+-ATPase was found early on by electron microscopy of SR vesicles to contain both a large globular and cytosolic exposed head as well as an intramembranous docked part (19–21). Structural aspects such as studied by analytical ultracentrifugation and SAXS on detergent-solubilized Ca2+-ATPase (22, 23), neutron scattering of stacked membranes (24), and structural analysis

122

Møller et al.

of vanadate-stabilized 2D crystals (25–27) confirmed that the 112 kDa Ca2+-ATPase protein has an elongated shape, with a large cytoplasmic head and a narrow membrane inserted tail. Continued studies with 2D-crystals by cryo EM, combined with a theoretical analysis of the amino acid sequence (28, 29) led to a more detailed, but tentative description of the ATPase in terms of three cytoplasmic domains and ten transmembrane helices. Interestingly, studies with fluorescent probes suggested that the phosphorylation site, located in one of the cytoplasmic domains, is at least 50 Å removed from the intramembranous Ca2+ binding sites (30–32). Evidently, chemical energy derived from hydrolysis of ATP had to be mediated from the phosphorylation domain to the intramembranous cation-binding sites via the intervening stalk segment to energize Ca2+ translocation (33). But how this occurred remained a matter of speculation, although at the same time, following the elucidation of the DNA-based amino acid sequence (34, 35), great strides were made by site directed mutagenesis to pinpoint the role of particular residues, resulting in, for example identification of the intramembranous Ca2+ liganding residues (36) and of residues critical for the conformational changes accompanying transport (37). At the turn of the twentieth century a new era was introduced with the publication of the 3D structure of Ca2+-ATPase in the Ca2E1 conformation at atomic (2.6 Å) resolution by Toyoshima et al. (38). The relative ease with which this membrane protein has since then been found to form crystals, has resulted in a multitude of structures at atomic resolution. These, in effect, provide a library of structures of the ATPase in various conformational states with which the transport mechanism can be analyzed in sufficient detail to provide plausible models for the transport mechanism.

2. Use of Structural Analogues to Stabilize Intermediary Transport Steps

Since the essence of protein X-ray crystallography is to deduce 3D structure from the diffraction pattern provided by the ordered arrangement of many unit cells in the crystal, the technique leaves little room for studying the dynamic aspects, apart from those associated with localized vibrational modes of small amplitude (as indicated to some extent by the refined atomic B-factor). But if a series of well-defined and functionally relevant forms of the same protein is available, it becomes possible from these “snapshots” to piece together the main structural changes that the protein undergoes during its functional cycle. The SERCA1a protein is a good case in point. At present (end of year 2010) 29 structures have been deposited in the Brookhaven Protein Data Bank, representative of

What Can Be Learned About the Function of a Single Protein

123

structures of the ATPase during the various stages of transport and in combination with specific ligands and inhibitors. However, it should be noted that to obtain structures of intermediary states, requires these to be stabilized before crystallization. In principle, this can be achieved in one of two ways: (1) by the use of structural analogues, in particular of transition state analogues, that stabilize the protein in the desired intermediary state, and (2) stabilization by the use of mutated protein. For SERCA1a both approaches have been used, but by far most structures have been obtained from nonmutated Ca2+-ATPase prepared from SR of skeletal muscle. Classical P-type ATPase transition state analogues are Cr ATP (39–42) and vanadate (43) that stabilize the Ca2+-ATPase in conformations corresponding to the(Ca2) E1~P and (Hn)E2-P and transition states, with occluded Ca2+ and protons, respectively. However, the use of these compounds has never led to the formation of diffracting crystals. Instead it has turned out that fluoride-metal complexes of Al3+, Be2+, and Mg2+ can be regarded as structural analogues of phosphate (44–48) and used for crystallization of Ca2+-ATPase in a similar way as they have been used for crystallization of other proteins, for example transducin (49, 50), myosin ATPase (51), G-proteins (52), and phosphotransferases (53). Figure 2 shows how tetrahedral BeF3−, ligated to oxygen-containing ligands, mimicks phosphate covalently bound to the carboxyl group of Asp 351 in the E2P (ground state) of Ca2+-ATPase. While tetragonal-bipyrimidal AlF4− with a planar configuration of the four F− surrounding the central Al3+ functions, and two apical oxygen-liganding groups, can mimick the phosphate of the [Ca2]E1~P-ADP and the (Hn) E2-P transition states. Finally, that MgF42− can be used to mimick the inorganic phosphate bound to the enzyme in the E2·Pi state. a

b

c

Fig. 2. (a) Tetrahedral conformation of MgF42−, forming with ATPase a noncovalent intermediate, mimicking the E2·Pi product state. (b) Tetragonal bipyrimidal conformation of AlF4− forming a transition state structure by interaction with Asp 351. (c) Tetrahedral conformation of E2BeF3−, forming with ATPase a covalent intermediate, similar to the E2P ground state.

124

Møller et al.

We prepare crystals of these forms by the hanging drop technique after solubilization by C12E8 of Ca2+-ATPase, isolated from rabbit skeletal muscle, in the presence of metallofluoride. For crystallization we use polyethylene glycol as a precipitant, together with sucrose or glycerol and additives as conformation stabilizing agents (54). Optimization of crystallization is performed with screens where we vary the salt and buffer composition. To obtain E1 conformations of Ca2+-ATPase usually require the presence of a rather high (10 mM) Ca2+ concentrations, while stabilization of E2 usually is performed in the presence of a specific inhibitor, such as thapsigargin. There is also an obligatory requirement for phospholipids which in the present set-up is derived from the endogenous phospholipids present in the C12E8 solubilized Ca2+-ATPase membranes. The crystals obtained are usually plate like, formed by the stacking of planar (bilayer type) layers of protein–phospholipid–detergent complex. Mutant forms of rabbit SERCA1a, with a biotinylation acceptor domain (BAD) tag fused to the C-terminal end of Ca2+-ATPase through a thrombin-sensitive linker, are prepared by heterologous expression in yeast according to established procedures (55, 56). Purification from yeast membranes is performed by solubilization with dodecylmaltoside (DDM) and affinity chromatography, from which the Ca2+-ATPase is released from the column without the tag by proteolytic cleavage with thrombin. This is followed by an HPLC size exclusion chromatography in which DDM is exchanged by C12E8, and finally relipidated with DOPC. Crystallization by the hanging drop technique then proceeds in the same way as for the Ca2+-ATPase prepared from sarcoplasmic-derived Ca2+-ATPase and similar crystal forms are obtained (55).

3. Phosphorylation of Ca2+-ATPase by ATP

The overall structure of SR Ca 2+-ATPase, as illustrated in Fig. 3 of the Ca2E1-AMPPCP structure, is constituted by (1) an intramembranous domain with ten transmembrane helices (M1– M10) that harbour the two Ca2+-binding sites, (2) three cytoplasmic domains, and (3) a connecting stalk region. The cytoplasmic domains comprise the nucleotide binding N-domain, the phosphorylation P-domain with the Asp 351 phosphorylation site, and the actuator (A) domain, which plays a key role in the E1/E2 conformational switches, leading to Ca2+/H+ exchange. The connecting stalk region is formed by the cytosolic extensions of M1–M6 that together with the A-domain is involved in the conformational changes by which bound Ca2+ induces phosphorylation by ATP. Phosphorylation in turn gives rise to the changes that result in Ca2+ translocation and H+ exchange.

What Can Be Learned About the Function of a Single Protein

125

Fig. 3. The structure of Ca2+-ATPase in the Ca2E1~P-AMPPCP conformation, shown in a graphical representation with the N-domain in red, P-domain in blue, and A-domain in yellow. The transmembrane helices and their cytosolic stalk extensions are coloured as follows: M1–M2 purple, M3–M4 green, and M5–M10 in orange. Ligands and central amino acid residues are shown in black. Notice the binding of nucleotide by the N-domain, with the g-phosphate close to Asp 351in the P-domain, while the 181TGES184 dephosphorylation motif is drawn away from the phosphorylation region, together with the A-domain that is located in a notch on the backside between the N- and P-domain.

Structures relevant for different stages of the ATP phosphorylation reaction are depicted in Fig. 4. We start in Fig. 4a by showing the interaction of the protonated E2 intermediate with modulatory-bound ATP (represented by the (H n) E2-AMPPCP structure). This ATPase form via the transitional states depicted in Fig. 4b, c is converted to the phosphorylated [Ca2+]E1~P ADP intermediate (Fig. 4d). During these transitions the adenosyl moiety of ATP remains bound to the N-domain in a preformed notch, stabilized by p–p stacking with Phe 487 and by interactions with Lys 515, Glu 482, Glu 442, and a number of other, mainly hydrophobic, amino acid residues. The ribose-a,bphosphate moiety of the bound nucleotide is localized at the mouth of the binding site, where it is stabilized in a characteristic bent conformation by interaction with Arg 560 and a Mg2+ ion that in the E2 conformation, via a coordinated water molecule, interacts with Glu 439. The g-phosphate protruding from the N-domain interacts with a conserved TGD motif, located at the periphery of the P-domain (57). In this position, the g-phosphate is about 9 Å displaced from the phosphorylation site by the A-domain, out of reach for reaction with the carboxylate group of Asp 351, which is located in a notch in the central

126

Møller et al.

Fig. 4. Structural events taking place at the phosphorylation site during the Ca2+dependent phosphorylation of Ca2+-ATPase by ATP. (a) The structure of E2-AMPPCP(TG), corresponding to the E2·ATP conformation, showing binding of nucleotide to the N-domain (in red) and interaction with the Gly residue of the 625TGD627 motif of the P-domain (in blue), 9 Å removed from Asp 351 (the phosphorylation site). (b) The structure of the Ca2E1-AMPPCP, corresponding to the Ca2E1·ATP intermediate. The figure shows the approach of the g-phosphate of the nucleotide to Asp 351 that occurs as the result of Ca2+ binding in the transmembrane domain and conversion of the ATPase to the E1 conformation. (c) The structure of [Ca2]E1-ADP-AlF4−, representative of the [Ca2] E1~P-ADP transition state, with AlF4− as a mimick of g-phosphate. (d) Structure of [Ca2] E1~P·AMPPN, representative of the [Ca2]E1~P·ADP conformation. Nucleotide is coloured in CPK, green spacefilling refer to bound divalent cation, with 2 Mg2+ bound in c, and one Ca2+ in b replacing the physiologically bound Mg2+ (see text).

part of the P-domain. In the next step, following binding of 2 Ca2+ inside the membraneous domain, there is an E2 → E1 transition which primarily is the result of displacement of the A-domain from its interactions with the phosphorylation site as will be discussed later. This paves the way for the N-domain with bound ATP, by bending of the long S5 helix and the hinge region attaching the N-domain to the P-domain, to move in close contact with the P-domain to form the Ca2E1-ATP and [Ca2]E1-PADP transition state intermediates. These states are in Fig. 4b, c represented by the Ca2E1-AMPPCP and [Ca2]E1-ADP-AlF4− crystalline structures. Considering first the latter structure where

What Can Be Learned About the Function of a Single Protein

127

AlF4− mimicks g-phosphate, the central Al3+ is found to be linearly and equidistantly (~2 Å) interposed between the b-phosphate of ADP and the carboxyl group of Asp 351. This is in accordance with AlF4− mimicking the intermediate formed in the transfer of g-phosphate from ATP to the accepting Asp 351 carboxylate group, according to an SN-2 associative nucleophilic reaction scheme. From this structure the transition state, with unfavourable electrostatic interactions between the negatively charged nucleotide and Asp 351 carboxylate, is seen to be stabilized by the positive charge imparted by the closely positioned Lys 684 residue, and by Mg2+ that coordinates with both the Asp 351 carboxylate and the g-phosphate group, as well as with the carbonyl group of main chain Thr 353 and the Asp 703 carboxylate group. Furthermore, the side chains of Thr 353 and Thr 625 stabilize the b- and g-phosphate as they arrange for phosphorylation of Asp 351.The structure of the environment for the phosphate transfer reaction is in good agreement with deductions previously drawn from studies involving mutation of residues critical for the ATP phosphorylation reaction (58, 59). But in addition to the Mg2+ coordinated with the phosphorylation site in the [Ca2] E1-AlF4−ADP structure, a second Mg2+ ion is also present, bound to the a,b-phosphate of ADP. It can be noted that the second bound Mg2+ ion interacts with the same part of AMPPCP as in the (Hn)E2AMPPCP form, but with different ATPase residues due to movements of the cytosolic domains associated with the E2 → E1 transition. The functional role of the second bound Mg2+ can be seen as an aid to additionally lower the activation energy required for the transfer of the phosphoanhydride bond in ATP to form the aspartylphosphate bond. with Asp 351 and, possibly, also as an aid to the subsequent product removal of nucleotide from the ATPase as MgADP (60). The structure of Ca2E1 with bound AMPPCP (Fig. 4b) is strikingly similar to that of the transition state. However, the g-phosphate of the nucleotide with a slightly longer distance from carboxylate group (3 Å) is in a position where it has not entered into covalent interactions with Asp 351. Furthermore, the Mg2+ bound at the second site is missing and, in addition, Site I is in fact occupied by a Ca2+ ion instead of Mg2+ (61). This is suggested by the fact that the density remains in the absence of Mg2+ in the solvent and on the basis of anomalous difference density at this site. Furthermore, a better correspondence of the B-factor was obtained when calculated on the assumption of Ca2+ binding (61). The replacement of Mg2+ for Ca2+ can be rationalized on the basis of the high (10 mM) Ca2+ concentration present in the crystallization medium Ca2+ that is required for producing crystals with good diffracting properties.

128

Møller et al.

Despite the similarities of the ATPase with bound nucleotide and the transition state structure there are distinct differences in the biochemical properties of the complexes in the native − membraneous states: while the [Ca2]E1-ADP-AlF4 form stably 2+ occludes Ca , this is not the case for ATPase with bound − AMPPCP which also, in contrast to the ADP-AlF4 form, is susceptible to proteolytic degradation and sulfhydryl modification (60). Thus, while the ATPase with bound MgAMPPCP appears to be present in the native membranes in a dynamically fluctuat− ing state, this is not the case for the [Ca2]E1-ADP-AlF4 intermediate. However, the difference between the two forms is much less pronounced when examined in the membraneous state at the same high concentration of Ca2+ as required for crystallization of the E1 forms of Ca2+-ATPase (62). Thus, it appears likely that the lack of difference between the two crystalline forms, at least partly, is attributable to the presence of Ca2+ at the phosphorylation site which we consider to stabilize the ATPase with AMPPCP in a more stable structure, reflecting perhaps the kinetic evidence for the formation of a presteady state complex of ATPase with MgATP, before phosphorylation (63). Of further interest for understanding of the phosphorylation reaction we found in studies on the D351A mutant (which does not hydrolyze ATP) that ATP is bound with the same characteristic bend of the triphosphate moiety as AMPPCP. The presence of a carbon atom bridging the b- and g-phosphate in AMPPCP thus does not affect the conformation of bound ATP. The binding region was indistinguishable from that of the wild type, except for the replacement of Asp 351 with alanine. Furthermore, in agreement with previous studies (64, 65) we confirmed that the mutant binds ATPase with a considerably higher affinity than the wild type, consistent with the abolition in the mutant of opposing negative electrostatic interactions imparted by the negatively charged nucleotide phosphate and phosphorylation site in the wild type ATPase (56). Finally, Fig. 4d shows the structure of the fully phosphorylated Ca2+-ATPase (66). This was obtained by the use of the slowly hydrolyzed nucleotide analogue, AMPPNP, which stabilized the Ca2+-ATPase in the Ca2E1~P-AMPPN nucleotide bound form. The formation of a covalent phosphoaspartyl bond was documented by mass spectrometry, consistent with the electron density shifting towards the Asp 351 side chain in this structure. The remaining part of the nucleotide (AMPPN) is retracted from the aspartylphosphate, but remains enclosed by the P- and N-domain, with only slight changes in the topology of the amino acid residues at the phosphorylation site, indicating that the structure corresponds to that of the ADP-sensitive phosphorylated form of the ATPase ([Ca2]E1~P-ADP) in the reaction scheme in Fig. 1.

What Can Be Learned About the Function of a Single Protein

129

4. Membrane Binding of Ca2+ Concomitant with the E1P transitions described above the two Ca2+ ions, serving as the substrate for transport, are bound inside the membrane domain at sites, commonly designated as Sites I and II, where they are surrounded by a number of intramembranous amino acid residues that form a cagelike network, involving many hydrogen bonds around them (67). Within this network, as can be seen from Fig. 5 the Ca2+ binding is stabilized in tetragonal-bipyrimidal form by coordination with oxygen-containing groups. At site I the equatorial coordination comprises the side chains of Asn 768 (M5), Glu 908 (M8), Thr 799 (M6), and a water molecule, and these groups are adjoined by the carboxylate groups of Glu 771 (M5) and Asp 800 (M6) in the apical positions to complete the bipyrimidal form (67). At Site II Ca2+ is nested inside M4, interacting with main chain Val 304, Ala 305, Ile 307, and with carboxyl Glu 309 by bidentate linkage. Furthermore, Site II is stabilized by the side chains of Asp 800 and Asn 796 in M6 that together with Val 304 and Ile 307 are assembled as a tetragon around the central Ca2+ ion. As shown by the figure the two Ca2+-binding sites are located side by side, linked by the central Asp 800 residue, and with their tetragonal planes tilted with respect to each other and to the plane of the membrane. With both Ca2+-binding sites occupied and the presence of four ligating carboxylate groups an electroneutral cavity is formed. It is important to realize that the cage is not a preformed structure,

Fig. 5. Coordination of the membrane-bound Ca2+ by carboxylic groups and amino acid – side chain and backbone carbonyl groups in the [Ca2]E1-ADP-AlF4 (1T5T) conformation. 2+ At Site I, Ca is hexa-coordinated by N768, E771, E908, and a H2O molecule in equatorial position, and by D800 at proximal- and E771 distal apical positions. At Site II, Ca2+ is hepta-coordinated at V304, D800, N796, and I307, with A305 at distal- and E309 at proximal apical positions, and in the latter case with bidentate coordination.

130

Møller et al.

but the result of the interaction of the ATPase with both Ca2+ ions. It thus differs significantly from the structure of Ca2+-ATPase in the E2 state, from which it is formed by a number of both translational and rotational changes of the Ca2+-associated transmembrane helices to fix the liganding groups in accurate position for optimal interaction with the bound Ca2+. The formation of the Ca2+ cage is the background for the above-described events in the cytosolic domains, leading to phosphorylation of the ATPase by ATP. These are initiated by the displacement of the A-domain which instead of being locked to the P-domain close to the phosphorylation site, by an upward and counterclockwise rotation, becomes bound in a crevice formed between the upper part of the P- and N-domain (60, 68). The altered location of the A-domain exerts a traction and elevation of the M1/M2 helices that blocks a putative cytosolic N-terminal entrance for Ca2+ which presumably is an important background for the occlusion of the bound Ca2+ ions associated with the formation of [Ca2]E1~P-ADP (60). In conclusion, the phosphoryltransfer from ATP to Asp 351 takes place in an environment that is formed between the N- and P-domain in the E1 state. In the absence of nucleotide there is no interaction between the N- and P-domain, as indicated by the open conformation of the Ca2E1 form. Thus, the close association between the N- and P-domain is critically dependent on the binding of nucleotide that glues the N- and P-domain together, and once this has occurred the stage is set for the catalytic transfer of energy-rich phosphate from ATP to Asp 351 and the ensuing reactions leading to active transport.

5. The Translocation Step ((Ca2)E1P to E2P Transition)

In all structures published until recently, the distal (luminally) oriented part of the ten transmembrane segments in the structures were seen to present a formidable hydrophobic barrier which in our view made it difficult to argue for any specific mechanism by which bound Ca2+ is translocated through the membrane (18). On this background, it was of interest when we recently succeeded in solving a structure which qualifies as a candidate for the conformation that the Ca2+-ATPase assumes immediately after the two Ca2+ ions occluded inside the membrane in the [Ca2]E1~P-ADP state have been delivered into the SR lumen (66). It is a structure stabilized by BeF3− which, according to biochemical and spectroscopic data (48), after covalent interaction with the phosphorylation site functions as a structural analogue of phosphate in the E2P ground state. The preparation of E2–BeF3− used to obtain this structure included the presence of EGTA and

What Can Be Learned About the Function of a Single Protein

131

Fig. 6. The mechanism of Ca2+ translocation through the membrane. (a) E2P ground – state as represented by the E2–BeF 3 structure which suggests that release of 2+ the membrane bound Ca occurs through a wide water-filled channel arising from the spreading out of M1–M2 (green) and M3–M4 (purple) membrane segments from the remainder of the membraneous domain (M5–M10, in orange ). (b) Changes in the disposition of the Ca2+ coordinating residues, resulting in exposure of E771, N796, and E309 towards the luminal channel during the E1P to E2P transition. In the E1P and E2P state, Ca2+ coordinating residues are shown as yellow and purple sticks, respectively, based on superimposition of the C-terminal membrane domain with the Ca2+ occluded E1P (1T5T) structure. For further explanation, see text.

a high Mg2+ concentration (50 mM) during crystallization, but absence of any inhibitor like thapsigargin that would be expected to destabilize the formation of the E2P ground state complex (48, 69). Surprisingly, the structure of the E2–BeF3− complex, solved at 2.65 Å resolution, showed that the compact organization of the membranous domain had become changed into a trilobed structure (Fig. 6a) as a result of lateral and rotational movement of the M1/M2 and M3/M4 pairs, relative to the M5–M10 complex which remained intact (66). This resulted in the formation of a funnel-shaped luminal opening, approx 15 Å broad at the luminal base and tapering into approx 4 Å at the apex where it is lined by three of the Ca2+ ligating residues in the E1 structures (Glu 309, Asn 796, and Glu 771). As shown in Fig. 6b this configuration has become possible, mainly because M4 with Glu 309 has moved towards Glu 771, and because the carboxamide group of Asn 796 has flipped sidewards by counterclockwise rotation of M6. The positions of Glu 908 and Asn 768 are essentially unaltered, while Thr 799 and Asp 800 as a result of the M6 rotation have moved away from their Ca2+ ligating position, leaving room for the bound Ca2+ to move to the distal position where the new constellation of Glu 309, Asn 796, and Glu 771 can be conceived of as constituting the basis for binding of Ca2+ with low affinity. In this connection, it is of interest that exactly the same residues on the basis of mutation studies were pointed out as candidates for the luminal proton exchange with two to three

132

Møller et al.

protons during the reaction cycle (70, 71). Thus luminal exposure of these residues can be considered to result in the release of Ca2+ in exchange for luminal protons. To what extent protons have been bound in our structure remains an open question, since one Mg2+ is present, coordinated with the carboxylate groups of Glu 309 and Glu 59 in M1. Two somewhat different structures of E2–BeF3− were concomitantly published by Toyoshima et al. (72), one with thapsigargin bound (at 2.4 Å resolution) and one in the absence of thapsigargin (3.8 Å resolution). Overall, these two structures are much alike and both are without the evidence for the luminal opening, created by dissociation of the intramembranous segments that is a characteristic of our structure. Instead, on the basis of a detailed analysis of the movements of the intramembranous N-terminal segments (M1–M4), Toyoshima et al. propose that in the absence of thapsigargin a narrow slit between these transmembrane helices is formed through which the bound Ca2+ can escape to the lumen. We may characterize this as a gating model for Ca2+ translocation, similar to what has earlier been proposed by the Japanese group as an extrapolation from the compact organizations of the E2Pi product state (73). Whereas in our model luminal Ca2+ release is the result of more extensive conformational changes of the intramembranous domain, leading to direct exposure of Ca2+ to the luminal space. What is the background for these different models? Fundamentally, the two E2–BeF3− structures without bound thapsigargin, E2–BeF3−Olesen and E2–BeF3−Toyoshima, differ with respect to the movements that the A-domain performs during the E1P to E2P transition. While the rotation in E2–BeF3−Toyoshiama is incomplete (90°), compared to the E2 → Ca2E1 transition (108°), it is 120° in E2–BeF3−Olesen, as the result of a movement where the A domain wedges into the N-domain, providing a tightly fitting seal on top of the phosphorylation site. This is a hyperextended state compared with that observed in the Ca2E1 → E2 transition which readily accounts for the luminal opening as being the result of a drag exerted by this movement on the N-terminal transmembrane segments via the A-M2 and A-M3 linkers. Accordingly, the luminally open state in our model represents a strained conformation of the ATPase where the A-domain via the A-M1, A-M2, and A-M3 linkers with the membrane exerts a pull on the M1/M2 and M3/M4 membrane segments decisive for their moving apart from the C-terminal transmembrane segments. In accordance with this view proteolytic cleavage of the A-M3 (74) and A-M2 (75) linkers block the processing of [Ca2]E1~P intermediate. Furthermore, in mutagenesis experiments the length of A-M1 has been found to be very critical, shortening by one or two residues preventing the [Ca2]E1~P → E2P transition (76), whereas lengthening the linker by two glycine residues stabilizes a Ca2+ occluded E2P form of the ATPase (77).

What Can Be Learned About the Function of a Single Protein

6. The Mechanism of E2-P Dephosphorylation

133

Ultimately, Ca2+ transport must be powered by events taking place in the P-domain as the result of the formation of the “energyrich” and thermodynamically unstable aspartylphosphate intermediate. In all E2 conformations published so far the P-domain is seen to have an increased tilt towards the membrane, compared to the E1 conformations. This, together with the liberation of ADP at the [Ca2]E1~P → Ca2E2P transition presumably are decisive factors for permitting the A-domain to move from its elevated position on top of the P-domain in the [Ca2]E1~P-ADP structure to engage in tight contact with the phosphorylation site. The movement of the A-domain seems to occur in two steps, the first one as discussed above leading to opening of the intramembraneous domain to allow passage of Ca2+ from the binding sites to the luminal space. In the second stage the exposed TGES loop of the A-domain is turned from an outward orientation and becomes docked into the phoshorylation site (Fig. 7). This is accompanied by closure of the luminal gate, with protons from the luminal space that have entered into contact with the membrane to neutralize the surplus of negative charge created by the carboxylate groups after the release of Ca2+ (78). The structure of the ATPase in this transition state is best portrayed by the structures of the E2–AlF4− complex that have been published both in the presence (72, 78) and absence (66, 72) of thapsigargin. In these structures, the

Fig. 7. Interaction of the A-domain with the phosphorylation sites in the E2P states. − (a) The E2P ground state, represented by the E2–BeF3 structure. Here the TGES motif is close to the phosphorylation site, but the catalytically active Glu 183 and Thr 181 are turned away from Asp 351. (b) In the (Hn)E2P conformation, as represented by the (Hn) E2–AlF4− structure the carboxyl group of E183 and the hydroxyl group of T181, via a bound water molecule, are lined up for hydrolysis of the aspartylphospho anhydride by an SN-2 associative mechanism.

134

Møller et al.

carbonyl group of Thr 181 and carboxyl group of Glu 183 are seen via an intervening water molecule to form a linear complex with the AlF4− that is coordinated with the carboxyl group of Asp 351. The complex very much resembles that formed with ADP and AlF4− mimicking the ATP phosphorylation reaction, except that now it is the TGES loop of the A domain that via the coordinated water molecule reacts with Asp 351 in lieu of ADP (Fig. 7). From this arrangement we conclude that in the physiological situation dephosphorylation of E2-P occurs by base catalysis as the result of proton extraction from the bound water molecule, in complete accordance with mutational data on the TGES loop related to the dephosphorylation in the E2-P state (79, 80), while luminal protons are required at the ion binding sites in the membrane for the dephosphorylation site to become activated (78). Other mutational data of amino acid residues related to the A-domain that have been shown to lead to enzymatic inactivation suggest that the necessary E1–E2 rotation and interaction between the A- and P-domain, required for docking of the TGES loop is dependent on the formation of two hydrophobic clusters where one is formed from five residues centred around Tyr 122 at the lower end of the A-domain (81). The other hydrophobic cluster is centred on Val 200, located on the top of the A-domain, which interacts with Val 705 and Val 726 of the P-domain (82). The docking of the TGES loop into the phosphorylation site is probably also associated with the formation of a helical bundle by the adjoinment of three of the C-terminal helices of the P-domain and two helical segments, linking the A-domain to the M2 and M3 transmembrane helices (the A-M2 and A-M3 linkers). At the base of this bundle there is a monovalent cation (K+) binding site that stabilizes the association of A-M3 with the P-domain and probably facilitates the dephosphorylation of E2P by the TGES loop, thereby providing a structural explanation for the accelerating effect of K+ on E2-P dephosphorylation known for a number of years (14). Furthermore, the transition state is probably stabilized by hydrogen bond formation between Ser 186 and Glu 439 and by other interactions between the A- and N-domain. After E2-P hydrolysis these interactions weaken, leaving room for the liberated phosphate to escape through a tunnel formed between the A- and N-domain and for modulatory bound nucleotide to interact with the TGD motif at the periphery of the P-domain as illustrated in Fig. 4a (57).

7. The Dephosphorylated E2 and E1 States

After E2-P dephosphorylation, we have almost completed our tour through the Ca2+ transport cycle. It only remains for the Ca2+-ATPase to unload the bound protons towards the cytosolic

What Can Be Learned About the Function of a Single Protein

135

side in exchange for Ca2+. Crystal structures representative of these E2 forms are available, stabilized by thapsigargin (83) or other inhibitors of Ca2+ membrane binding (67, 84). Overall, these structures are very similar to those representing the E2-P transition or E2·Pi product states, except that the A-domain appears to be in a more relaxed state with the TGES loop retracted from its docking interaction with the phosphorylation site (78). However, there is the caveat associated with these crystal structures that according to extensive functional evidence thapsigargin stabilizes the ATPase in an occluded and functionally slightly different (Hn)E2 state (85, 86), and thus the representation of the membrane region probably differs somewhat from that of the genuine E2 form which readily reacts with Ca2+. Despite that the affinity of the thapsigargin stabilized ATPase for nucleotide is reduced (57), there is spectroscopic evidence that the cytosolic domains of Ca2+-ATPase, as opposed to the membrane domain, are only minimally affected by thapsigargin binding (85). This is the reason why we have felt justified to use the (Hn)E2-AMPPCP (Tg) form as the starting point in the description of the ATP phosphorylation of ATPase from (Hn)E2-P·ATP in Fig 7.4a. With respect to the membraneous domain, the internal position of the carboxylic groups in the E2 structures is indeed a good indication of their proton occluded state, since their removal would result in the formation of charged carboxylate groups inside the membrane, an event that would be highly energy requiring. On the other hand, the lability of the “naked” E2 conformations, in the absence of ligand stabilizers, both in the membraneous and detergent-solubilized (87) state would seem to be in accord with the view that E2 forms cover a rather wide dynamic range of slightly different conformations (86). There is thus good reason to consider that E2 represents a labile state in the membrane where some of the carboxylic groups could be spending part of the time close to the cytosolic space to enable the cytosolic release of bound protons in exchange for Ca2+. The intramembranous binding of cytoplasmic Ca2+ to the dephosphorylated enzyme in exchange for protons evokes the profound changes in the structure of the Ca2+-ATPase molecule that we ordinarily refer to as the E2 → Ca2E1 transition (83). How are they brought about? Ultimately, the formation of the Ca2E1 and [Ca2]E1~P states must be the result of changes in the position of the intramembraneous cation liganding residues to fix them in a position optimal for coordination with Ca2+. This involves a reversal of the changes that we saw was associated with the [Ca2]E1~P to E2P transition (Fig. 6b), i.e. there is a clockwise rotation of M6 to bring the three liganding residues of M6 (Asp 800, Asn 796, and Thr 799) in position for binding of Ca2+ at sites I and II. In addition, to adapt the ATPase for interaction with Ca2+ at site II there is an upward displacement of Glu 309,

136

Møller et al.

accompanied by a ~20° inclination of the cytosolic extensions of M4 (57). The latter changes are decisive, because they probably lead to disruption of the interactions between the A-domain and the C-terminal part of the P-domain as well as that they are required for maintaining the integrity of the hydrophobic cluster centred on Tyr 122 involved in holding the A- and P-domain firmly together in the E2 state.1 As a result, the A-domain then becomes free to move away from the position it occupies close to the phosphorylation site in the E2 conformation. Other and presumably weaker interactions between the N- and top of the A-domain are also disrupted, allowing the P-domain and N-domain to move towards an upright position, resulting in the well-known open structure of the Ca2E1 where there is minimal interaction between the cytosolic domains (38). In the physiological situation these events are modified by the binding of ATP which induces the N-domain, via its hinge attachment to the P-domain and by bending of the S5 helix, to close down like a lid on the P-domain to form the pre-steady complex leading to phosphorylation as explained in the beginning of our review (Fig. 4b).

8. Concluding Comments The recent explosion of X-ray structures of the SERCA Ca 2+-ATPase at atomic resolutions, corresponding to the various functional intermediates, has made it possible to give a description of the whole transport cycle at the structural level. The new structures by and large confirm and extend conclusions reached from previous biochemical and biophysical studies as well as investigations on Ca2+-ATPase mutants. Although some aspects are still missing such as the detailed mechanism for the passage of the transported Ca2+ ions and protons through the membrane, the role of modulatory bound ATP etc., this is, of course, a pleasing state of affairs. However, at the more fundamental level we still need to understand not only how, but why transport occurs – what are the forces behind the complementary movements of the Ca2+ATPase domains and polypeptide chains required for the transport processes, and how is ATP hydrolysis in the cytosolic part of the molecule coupled to transport in the membraneous domain?

Note that in the originally published structures of E2–AlF4 − (Tg) (78) and E2·MgF42− (73) with thapsigargin bound the carboxyl group of Glu 309 was modelled towards the cytosolic space rather than towards the interior of the membranous domain as found through later and improved refinement. However, this does not exclude the possibility that an outward orientation of carboxyl 309 could be a feature of the genuine E2 structure, in agreement with previous suggestions (83, 88) that Glu 309 serves as a gating residue in the release of protons and binding of Ca2+.

1

What Can Be Learned About the Function of a Single Protein

137

Single-molecule studies will be of significant importance in this regard. In addition, there are still many challenges ahead concerning the regulation and structures of various Ca2+-ATPase isoforms, and the use of Ca2+-ATPase as a drug target in connection with the treatment of human diseases.

References 1. Marsh BB (1951) A factor modifying muscle fibre synaeresis. Nature 167:1065–1066 2. Bendall JR (1952) Effect of the Marsh factor on the shortening of muscle fibre models in the presence of adenosine triphosphate. Nature 170:1058–1060 3. Bendall JR (1953) Further observations on a factor (The ‘Marsh’ factor) effecting relaxation of ATP-shortened muscle-fibre models and the effect of Ca and Mg ions upon it. J Physiol 121:232–254 4. Ebashi S, Lipmann F (1962) Adenosine triphosphate-linked concentration of calcium ions in a particulate fraction of rabbit muscle. J Cell Biol 14:389–400 5. Hasselbach W, Makinose M (1961) Die Calciumpumpe der “Erschlaffungsgrana” des Muskels und ihre Abhängigheit von der ATPspaltung. Biochem Z 333:518–528 6. Tonomura Y (1972) Muscle proteins, muscle contraction, and cation transport. Univ Tokyo Press, Tokyo 7. de Meis L, Vianna AL (1979) Energy interconversion by the Ca2+-dependent ATPase of the sarcoplasmic reticulum. Annu Rev Biochem 48:275–292 8. Mahaney JE, Thomas DD, Froehlich JP (2004) The time-dependent distribution of phosphorylated intermediates in native sarcoplasmic reticulum Ca2+-ATPase from skeletal muscle is not compatible with a linear kinetic model. Biochemistry 43:4400–4416 9. Levy D, Seigneuret M, Bluzat A, Rigaud JL (1990) Evidence for proton countertransport by the sarcoplasmic reticulum Ca2+-ATPase during calcium transport in reconstituted proteoliposomes with low ionic permeability. J Biol Chem 265:19524–19534 10. Cornelius F, Moller JV (1991) Electrogenic pump current of sarcoplasmic reticulum Ca2+ATPase reconstituted at high lipid/protein ratio. FEBS Lett 284:46–50 11. Yu X, Carroll S, Rigaud JL, Inesi G (1993) H+ countertransport and electrogenicity of the sarcoplasmic reticulum Ca2+ pump in reconstituted proteoliposomes. Biophys J 64:1232–1242

12. Lee AG, East JM (2001) What the structure of a calcium pump tells us about its mechanism. Biochem J 356:665–683 13. Mintz E, Guillain F (1997) Ca2+ transport by the sarcoplasmic reticulum ATPase. Biochim Biophys Acta 1318:52–70 14. Shigekawa M, Pearl LJ (1976) Activation of calcium transport in skeletal muscle sarcoplasmic reticulum by monovalent cations. J Biol Chem 251:6947–6952 15. Shigekawa M, Dougherty JP (1978) Reaction mechanism of Ca2+-dependent ATP hydrolysis by skeletal muscle sarcoplasmic reticulum in the absence of added alkali metal salts. II. Kinetic properties of the phosphoenzyme formed at the steady state in high Mg2+ and low Ca2+ concentrations. J Biol Chem 253:1451–1457 16. Gerdes U, Moller JV (1983) The Ca2+ permeability of sarcoplasmic reticulum vesicles. II. Ca2+ efflux in the energized state of the calcium pump. Biochim Biophys Acta 734:191–200 17. Smith WS, Broadbridge R, East JM, Lee AG (2002) Sarcolipin uncouples hydrolysis of ATP from accumulation of Ca2+ by the Ca2+ATPase of skeletal-muscle sarcoplasmic reticulum. Biochem J 361:277–286 18. Møller JV, Nissen P, Sørensen TL-M (2006) X-ray crystallographic structures of sarcoplasmic reticulm Ca2+-ATPases at the atomic level. In: Grisshammer R, Buchanan SK (eds) Structural biology of membrane proteins. RSC Publishing, Cambridge, UK, 288–306 19. Deamer DW, Baskin RJ (1969) Ultrastructure of sarcoplasmic reticulum preparations. J Cell Biol 42:296–307 20. Hardwicke PMD, Green NM (1974) The effect of delipidation on the adenosine triphosphatase of sarcoplasmic reticulum. Electron microscopy and physical properties. Eur J Biochem 42:183–193 21. Hasselbach W, Elfvin LG (1967) Structural and chemical asymmetry of the calciumtransporting membranes of the sarcotubular system as revealed by electron microscopy. J Ultrastruct Res 17:598–622

138

Møller et al.

22. le Maire M, Jorgensen KE, Roigaard-Petersen H, Moller JV (1976) Properties of deoxycholate solubilized sarcoplasmic reticulum Ca2+-ATPase. Biochemistry 15:5805–5812 23. le Maire M, Moller JV, Tardieu A (1981) Shape and thermodynamic parameters of a Ca2+-dependent ATPase. A solution x-ray scattering and sedimentation equilibrium study. J Mol Biol 150:273–296 24. Herbette L, DeFoor P, Fleischer S, Pascolini D, Scarpa A, Blasie JK (1985) The separate profile structures of the functional calcium pump protein and the phospholipid bilayer within isolated sarcoplasmic reticulum membranes determined by X-ray and neutron diffraction. Biochim Biophys Acta 817:103–122 25. Taylor KA, Dux L, Martonosi A (1986) Three-dimensional reconstruction of negatively stained crystals of the Ca2+-ATPase from muscle sarcoplasmic reticulum. J Mol Biol 187:417–427 26. Stokes DL, Green NM (1990) Threedimensional crystals of CaATPase from sarcoplasmic reticulum. Symmetry and molecular packing. Biophys J 57:1–14 27. Stokes DL, Green NM (1990) Structure of CaATPase: electron microscopy of frozenhydrated crystals at 6 A resolution in projection. J Mol Biol 213:529–538 28. Toyoshima C, Sasabe H, Stokes DL (1993) Three-dimensional cryo-electron microscopy of the calcium ion pump in the sarcoplasmic reticulum membrane. Nature 362:467–471 29. Zhang P, Toyoshima C, Yonekura K, Green NM, Stokes DL (1998) Structure of the calcium pump from sarcoplasmic reticulum at 8-A resolution. Nature 392:835–839 30. Gutierrez-Merino C, Munkonge F, Mata AM, East JM, Levinson BL, Napier RM, Lee AG (1987) The position of the ATP binding site on the (Ca2+ + Mg2+)-ATPase. Biochim Biophys Acta 897:207–216 31. Bigelow DJ, Inesi G (1992) Contributions of chemical derivatization and spectroscopic studies to the characterization of the Ca2+ transport ATPase of sarcoplasmic reticulum. Biochim Biophys Acta 1113:323–338 32. Baker KJ, East JM, Lee AG (1994) Localization of the hinge region of the Ca2+-ATPase of sarcoplasmic reticulum using resonance energy transfer. Biochim Biophys Acta 1192: 53–60 33. Moller JV, Juul B, le Maire M (1996) Structural organization, ion transport, and energy transduction of P-type ATPases. Biochim Biophys Acta 1286:1–51 34. MacLennan DH, Brandl CJ, Korczak B, Green NM (1985) Amino-acid sequence of a

35.

36.

37.

38.

39.

40.

41.

42.

43. 44.

45.

46.

47.

Ca2+ + Mg2+-dependent ATPase from rabbit muscle sarcoplasmic reticulum, deduced from its complementary DNA sequence. Nature 316:696–700 Brandl CJ, Green NM, Korczak B, MacLennan DH (1986) Two Ca2+ ATPase genes: homologies and mechanistic implications of deduced amino acid sequences. Cell 44:597–607 Clarke DM, Loo TW, Inesi G, MacLennan DH (1989) Location of high affinity Ca2+binding sites within the predicted transmembrane domain of the sarcoplasmic reticulum Ca2+-ATPase. Nature 339:476–478 Andersen JP (1995) Dissection of the functional domains of the sarcoplasmic reticulum Ca2+-ATPase by site-directed mutagenesis. Biosci Rep 15:243–261 Toyoshima C, Nakasako M, Nomura H, Ogawa H (2000) Crystal structure of the calcium pump of sarcoplasmic reticulum at 2.6 A resolution. Nature 405:647–655 Serpersu EH, Kirch U, Schoner W (1982) Demonstration of a stable occluded form of Ca2+ by the use of the chromium complex of ATP in the Ca2+-ATPase of sarcoplasmic reticulum. Eur J Biochem 122:347–354 Coan C, Scales DJ, Murphy AJ (1986) Oligovanadate binding to sarcoplasmic reticulum ATPase. Evidence for substrate analogue behavior. J Biol Chem 261:10394–10403 Vilsen B, Andersen JP (1992) Interdependence of Ca2+ occlusion sites in the unphosphorylated sarcoplasmic reticulum Ca2+-ATPase complex with CrATP. J Biol Chem 267:3539–3550 Cantley LC Jr, Cantley LG, Josephson L (1978) A characterization of vanadate interactions with the (Na, K)-ATPase. Mechanistic and regulatory implications. J Biol Chem 253:7361–7368 Pick U (1982) The interaction of vanadate ions with the Ca-ATPase from sarcoplasmic reticulum. J Biol Chem 257:6111–6119 Murphy AJ, Coll RJ (1992) Fluoride is a slow, tight-binding inhibitor of the calcium ATPase of sarcoplasmic reticulum. J Biol Chem 267:5229–5235 Murphy AJ, Coll RJ (1992) Fluoride binding to the calcium ATPase of sarcoplasmic reticulum converts its transport sites to a low affinity, lumen-facing form. J Biol Chem 267:16990–16994 Murphy AJ, Coll RJ (1993) Formation of a stable inactive complex of the sarcoplasmic reticulum calcium ATPase with magnesium, beryllium, and fluoride. J Biol Chem 268:23307–23310 Troullier A, Girardet JL, Dupont Y (1992) Fluoroaluminate complexes are bifunctional

What Can Be Learned About the Function of a Single Protein

48.

49.

50.

51.

52.

53.

54. 55.

56.

57.

analogues of phosphate in sarcoplasmic reticulum Ca2+-ATPase. J Biol Chem 267: 22821–22829 Danko S, Yamasaki K, Daiho T, Suzuki H (2004) Distinct natures of beryllium fluoridebound, aluminum fluoride-bound, and magnesium fluoride-bound stable analogues of an ADP-insensitive phosphoenzyme intermediate of sarcoplasmic reticulum Ca2+-ATPase: changes in catalytic and transport sites during phosphoenzyme hydrolysis. J Biol Chem 279:14991–14998 Chabre M (1990) Aluminofluoride and beryllofluoride complexes: a new phosphate analogs in enzymology. Trends Biochem Sci 15:6–10 Antonny B, Chabre M (1992) Characterization of the aluminum and beryllium fluoride species which activate transducin. Analysis of the binding and dissociation kinetics. J Biol Chem 267:6710–6718 Park S, Ajtai K, Burghardt TP (1999) Inhibition of myosin ATPase by metal fluoride complexes. Biochim Biophys Acta 1430: 127–140 Sondek J, Lambright DG, Noel JP, Hamm HE, Sigler PB (1994) GTPase mechanism of Gproteins from the 1.7-A crystal structure of transducin alpha-GDP-AlF4-. Nature 372: 276–279 Wang W, Cho HS, Kim R, Jancarik J, Yokota H, Nguyen HH, Grigoriev IV, Wemmer DE, Kim SH (2002) Structural characterization of the reaction pathway in phosphoserine phosphatase: crystallographic “snapshots” of intermediate states. J Mol Biol 319:421–431 Sorensen TL, Olesen C, Jensen AM, Moller JV, Nissen P (2006) Crystals of sarcoplasmic reticulum Ca2+-ATPase. J Biotechnol 124:704–716 Jidenko M, Nielsen RC, Sorensen TL, Moller JV, le Maire M, Nissen P, Jaxel C (2005) Crystallization of a mammalian membrane protein overexpressed in Saccharomyces cerevisiae. Proc Natl Acad Sci USA 102: 11687–11691 Marchand A, Lund Winther AM, Holm PJ, Olesen C, Montigny C, Arnou B, Champeil P, Clausen JD, Vilsen B, Andersen JP, Nissen P, Jaxel C, Moller JV, le Maire M (2008) Crystal structure of D351A and P312A mutant forms of the mammalian sarcoplasmic reticulum Ca2+-ATPase reveals key events in phosphorylation and Ca2+ release. J Biol Chem 283:14867–14882 Jensen AM, Sorensen TL, Olesen C, Moller JV, Nissen P (2006) Modulatory and catalytic modes of ATP binding by the calcium pump. EMBO J 25:2305–2314

139

58. Clausen JD, McIntosh DB, Woolley DG, Andersen JP (2001) Importance of Thr-353 of the conserved phosphorylation loop of the sarcoplasmic reticulum Ca2+-ATPase in MgATP binding and catalytic activity. J Biol Chem 276:35741–35750 59. McIntosh DB, Clausen JD, Woolley DG, MacLennan DH, Vilsen B, Andersen JP (2004) Roles of conserved P domain residues and Mg2+ in ATP binding in the ground and Ca2+-activated states of sarcoplasmic reticulum Ca2+-ATPase. J Biol Chem 279:32515–32523 60. Sorensen TL, Moller JV, Nissen P (2004) Phosphoryl transfer and calcium ion occlusion in the calcium pump. Science 304:1672–1675 61. Picard M, Jensen AM, Sorensen TL, Champeil P, Moller JV, Nissen P (2007) Ca2+versus Mg2+ coordination at the nucleotide-binding site of the sarcoplasmic reticulum Ca2+-ATPase. J Mol Biol 368:1–7 62. Picard M, Toyoshima C, Champeil P (2005) The average conformation at micromolar (Ca2+) of Ca2+-Atpase with bound nucleotide differs from that adopted with the transition state analog ADP.AlFx or with AMPPCP under crystallization conditions at millimolar (Ca2+). J Biol Chem 280:18745–18754 63. Pickart CM, Jencks WP (1982) Slow dissociation of ATP from the calcium ATPase. J Biol Chem 257:5319–5322 64. Pedersen PA, Rasmussen JH, Jorgensen PL (1996) Consequences of mutations to the phosphorylation site of the alpha-subunit of Na, K-ATPase for ATP binding and E1-E2 conformational equilibrium. Biochemistry 35:16085–16093 65. McIntosh DB, Woolley DG, MacLennan DH, Vilsen B, Andersen JP (1999) Interaction of nucleotides with Asp(351) and the conserved phosphorylation loop of sarcoplasmic reticulum Ca2+-ATPase. J Biol Chem 274:25227–25236 66. Olesen C, Picard M, Winther AM, Gyrup C, Morth JP, Oxvig C, Moller JV, Nissen P (2007) The structural basis of calcium transport by the calcium pump. Nature 450:1036–1042 67. Obara K, Miyashita N, Xu C, Toyoshima I, Sugita Y, Inesi G, Toyoshima C (2005) Structural role of countertransport revealed in Ca2+ pump crystal structure in the absence of Ca2+. Proc Natl Acad Sci USA 102:14489–14496 68. Toyoshima C, Mizutani T (2004) Crystal structure of the calcium pump with a bound ATP analogue. Nature 430:529–535 69. Seekoe T, Peall S, McIntosh DB (2001) Thapsigargin and dimethyl sulfoxide activate medium P(i)HOH oxygen exchange catalyzed by sarcoplasmic reticulum Ca2+-ATPase. J Biol Chem 276:46737–46744

140

Møller et al.

70. Andersen JP, Vilsen B (1995) Structure-function relationships of cation translocation by Ca2+and Na+, K+-ATPases studied by site-directed mutagenesis. FEBS Lett 359:101–106 71. Vilsen B, Andersen JP (1998) Mutation to the glutamate in the fourth membrane segment of Na+, K+-ATPase and Ca2+-ATPase affects cation binding from both sides of the membrane and destabilizes the occluded enzyme forms. Biochemistry 37:10961–10971 72. Toyoshima C, Norimatsu Y, Iwasawa S, Tsuda T, Ogawa H (2007) How processing of aspartylphosphate is coupled to lumenal gating of the ion pathway in the calcium pump. Proc Natl Acad Sci USA 104:19831–19836 73. Toyoshima C, Nomura H, Tsuda T (2004) Lumenal gating mechanism revealed in calcium pump crystal structures with phosphate analogues. Nature 432:361–368 74. Moller JV, Lenoir G, Marchand C, Montigny C, le Maire M, Toyoshima C, Juul BS, Champeil P (2002) Calcium transport by sarcoplasmic reticulum Ca2+-ATPase. Role of the A domain and its C-terminal link with the transmembrane region. J Biol Chem 277:38647–38659 75. Lenoir G, Picard M, Gauron C, Montigny C, Le Marechal P, Falson P, Le Maire M, Moller JV, Champeil P (2004) Functional properties of sarcoplasmic reticulum Ca2+-ATPase after proteolytic cleavage at Leu119-Lys120, close to the A-domain. J Biol Chem 279:9156–9166 76. Daiho T, Yamasaki K, Wang G, Danko S, Iizuka H, Suzuki H (2003) Deletions of any single residues in Glu40-Ser48 loop connecting a domain and the first transmembrane helix of sarcoplasmic reticulum Ca2+-ATPase result in almost complete inhibition of conformational transition and hydrolysis of phosphoenzyme intermediate. J Biol Chem 278:39197–39204 77. Daiho T, Yamasaki K, Danko S, Suzuki H (2007) Critical role of Glu40-Ser48 loop linking actuator domain and 1st transmembrane helix of Ca2+-ATPase in Ca2+ deocclusion and release from ADP-insensitive phosphoenzyme. J Biol Chem 282:34429–34447 78. Olesen C, Sorensen TL, Nielsen RC, Moller JV, Nissen P (2004) Dephosphorylation of the calcium pump coupled to counterion occlusion. Science 306:2251–2255 79. Clausen JD, Vilsen B, McIntosh DB, Einholm AP, Andersen JP (2004) Glutamate-183 in

80.

81.

82.

83. 84.

85.

86.

87.

88.

the conserved TGES motif of domain A of sarcoplasmic reticulum Ca2+-ATPase assists in catalysis of E2/E2P partial reactions. Proc Natl Acad Sci USA 101:2776–2781 Anthonisen AN, Clausen JD, Andersen JP (2006) Mutational analysis of the conserved TGES loop of sarcoplasmic reticulum Ca2+ATPase. J Biol Chem 281:31572–31582 Yamasaki K, Wang G, Daiho T, Danko S, Suzuki H (2008) Roles of Tyr122-hydrophobic cluster and K+ binding in Ca2+ -releasing process of ADP-insensitive phosphoenzyme of sarcoplasmic reticulum Ca2+ -ATPase. J Biol Chem 283:29144–29155 Wang G, Yamasaki K, Daiho T, Suzuki H (2005) Critical hydrophobic interactions between phosphorylation and actuator domains of Ca2+-ATPase for hydrolysis of phosphorylated intermediate. J Biol Chem 280:26508–26516 Toyoshima C, Nomura H (2002) Structural changes in the calcium pump accompanying the dissociation of calcium. Nature 418:605–611 Takahashi M, Kondou Y, Toyoshima C (2007) Interdomain communication in calcium pump as revealed in the crystal structures with transmembrane inhibitors. Proc Natl Acad Sci USA 104:5800–5805 Montigny C, Picard M, Lenoir G, Gauron C, Toyoshima C, Champeil P (2007) Inhibitors bound to Ca2+-free sarcoplasmic reticulum Ca2+-ATPase lock its transmembrane region but not necessarily its cytosolic region, revealing the flexibility of the loops connecting transmembrane and cytosolic domains. Biochemistry 46:15162–15174 Inesi G, Lewis D, Toyoshima C, Hirata A, de Meis L (2008) Conformational fluctuations of the Ca2+-ATPase in the native membrane environment. Effects of pH, temperature, catalytic substrates, and thapsigargin. J Biol Chem 283:1189–1196 Møller JV, le Maire M, Andersen JP (1986) Uses of non-ionic bile salt detergent in the study of membrane proteins. In: Watts A, de Pont JJHHM (eds) Progress in protein-lipid interactions. Elsevier, Amsterdam Inesi G, Ma H, Lewis D, Xu C (2004) Ca2+ occlusion and gating function of Glu309 in the ADP-fluoroaluminate analog of the Ca2+ATPase phosphoenzyme intermediate. J Biol Chem 279:31629–31637

Chapter 8 Recent Progress in the Structure Determination of GPCRs, a Membrane Protein Family with High Potential as Pharmaceutical Targets Vadim Cherezov, Enrique Abola, and Raymond C. Stevens Abstract G protein-coupled receptors (GPCRs) constitute a highly diverse and ubiquitous family of integral membrane proteins, transmitting signals inside the cells in response to an assortment of disparate extracellular stimuli. Their strategic location on the cell surface and their involvement in crucial cellular and physiological processes turn these receptors into highly important pharmaceutical targets. Recent technological developments aimed at stabilization and crystallization of these receptors have led to significant breakthroughs in GPCR structure determination efforts. One of the successful approaches involved receptor stabilization with the help of a fusion partner combined with crystallization in lipidic cubic phase (LCP). The success of using an LCP matrix for crystallization is generally attributed to the creation of a more native, membrane-like stabilizing environment for GPCRs just prior to nucleation and to the formation of type I crystal lattices, thus generating highly ordered and strongly diffracting crystals. Here we describe protocols for reconstituting purified GPCRs in LCP, performing pre-crystallization assays, setting up crystallization trials in manual mode, detecting crystallization hits, optimizing crystallization conditions, harvesting, and collecting crystallographic data. The protocols provide a sensible framework for approaching crystallization of stabilized GPCRs in LCP, however, as in any crystallization experiment, extensive screening and optimization of crystallization conditions as well as optimization of protein construct and purification steps are required. The process remains risky and these protocols do not necessarily guarantee success. Key words: GPCR, Lipidic cubic phase, Membrane proteins, Fluorescence recovery after photobleaching, Crystallization, Crystallography, Minibeam

1. Introduction G protein-coupled receptors (GPCRs) comprise the largest family of integral membrane proteins in the human genome, with ~800 GPCR family members identified (1). These receptors communicate Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_8, © Springer Science+Business Media, LLC 2010

141

142

Cherezov, Abola, and Stevens

signals across the plasma membrane in response to a variety of extracellular stimuli, ranging from photons, ions, and small molecules, to peptides and proteins. The signals are amplified and transmitted to downstream effectors inside the cells primarily through coupling to heterotrimeric guanidine nucleotide binding proteins (G proteins). Upon activation, receptors trigger complex cascades of reactions controlling crucial physiological and cellular processes. As such, GPCRs are implicated in multitude of diseases, making them important pharmaceutical targets. In fact, about 40% of drugs on the market act on the G protein-coupled receptors (2). GPCR family members share common architecture of a seven transmembrane a-helical bundle (3). It remains intriguing how such a relatively simple scaffold has evolved to selectively bind to thousands of diverse ligands transmitting signals to dozens of different effectors. The primary questions in the GPCR field include understanding mechanisms of signal transduction and aspects of ligand selectivity and specificity. These and other essential questions can only be answered when a variety of biophysical and biochemical data are combined with high-resolution structures of family members stabilized in different functional states. Despite the great importance of GPCRs and the immense worldwide efforts aimed at obtaining their structures, very little progress has been achieved until recently. Indeed, until late 2007, a high resolution structure of only one highly atypical GPCR member, visual rhodopsin, was known (4). Rhodopsin is unusual in that it is highly abundant in the eye’s retina and contains a covalently bound ligand, 11-cis retinal, keeping the receptor conformationally stable in a fully inactive state. However, all other receptors are expressed in miniscule amounts in native tissues and activated by diffusible ligands. For structural studies, it has therefore become necessary that these proteins be heterologously overexpressed, solubilized, and purified. Human integral membrane proteins, including GPCRs, are notoriously difficult to express and they are highly unstable when solubilized by detergents. What makes GPCRs even more challenging is their observed conformational heterogeneity highlighting dynamic equilibrium between multiple functional states (5), obviously vital for signal transduction; however, a formidable roadblock for structural studies. For example, most GPCRs possess a certain level of basal activity in the apo, or unliganded state. Ligands that increase activity are called agonists, those that bind to the receptor and block it without changing its activity are called antagonists, and ligands that decrease the activity below basal are called inverse agonists. Typically, there is a whole range of intermediate states between completely inactive and fully active receptors, and ligands that induce or stabilize them are called partial

Recent Progress in the Structure Determination of GPCRs

143

inverse agonists and partial agonists. In general, receptors are more stable and less conformationally heterogeneous in the inactive states and become progressively less stable and more heterogeneous as they are converted into more active states. Thus, it is now generally accepted that in order to crystallize a GPCR, it has to be stabilized in a single conformational state. An explosion of technological breakthroughs in the past few years have made it possible to stabilize the conformation and determine the high-resolution structures of several members of the GPCR family. One of the successful approaches involved developing a monoclonal antibody toward a structural epitope on the third intracellular loop of the human b2-adrenergic receptor (b2AR) (6). The antibody/receptor complex was crystallized in bicelles, and the structure was determined at a moderate resolution of 3.4–3.7 Å (7). In the second approach, a highly flexible intracellular loop between helices 5 and 6 of the human b2AR was replaced by a compact and stable protein, lysozyme from T4 bacteriophage (T4L) (8). The chimeric receptor (b2AR-T4L) was crystallized in lipidic cubic phase (LCP) and the structure solved to a resolution of 2.4 Å (9). The same approach was recently used to obtain the structure of the human adenosine A2A receptor at 2.6 Å resolution (10). Finally, the third approach toward receptor stabilization involved a systematic mutagenesis strategy (11, 12). This strategy helped to identify six mutations in turkey b1-adrenergic receptor that increased its melting temperature in detergent solution by 21 °C. The thermally stabilized receptor was crystallized by a traditional vapor diffusion technique and the structure was solved to 2.8 Å (13). All of these studies were done on receptors with bound ligands, typically inverse agonists or antagonists, locking them in a stable conformation. Additional stabilization was achieved by crystallizing b2AR/Fab complex in bicelles (7), and both T4L fusion receptors, the b2AR-T4L and A2AR-T4L, in the lipidic cubic phase (9, 10, 14). Comparison of these new structures highlights important details related to the plasticity of the GPCR 7TM core and the involvement of the extracellular loops in the ligand binding (15, 16). In addition, the new structures provided superior templates for homology-based modeling of other GPCRs and for performing virtual ligand screening and structure based drug design (17). While new approaches have enabled GPCR structure determination, more remains to be done to address crucial questions centering on mechanisms of signal transduction upon receptor activation. To gain insights into these mechanisms, it is essential to obtain a high-resolution structure of an activated receptor with an agonist bound. Recently, a crystal structure of the bovine opsin bound to a transducin fragment shed some light on the receptor activation mechanism (18). However, the atypical mode of the

144

Cherezov, Abola, and Stevens

rhodopsin activation and the lack of an agonist in the structure prevent generalizations applicable to other GPCRs responding to diffusible ligands. It is conceivable that a similar approach of stabilizing an activated receptor by a G protein, Ga subunit, or by a fragment of Ga will yield a high resolution structure in the near future. Two of the best resolution GPCR structures to date, not including rhodopsin, have been obtained using crystallization in LCP, also referred to as in meso crystallization (19). The success of in meso crystallization can be attributed primarily to two factors: stabilization of highly flexible receptors in a membrane-like LCP environment and formation of type I crystal lattice leading to highly ordered and strongly diffracting crystals. Here we describe methods related to reconstitution of purified receptors in LCP, performing pre-crystallization assays, setting up crystallization trials, harvesting crystals, and acquiring crystallographic data. We recommend the reader to get familiar with more general and detailed protocols for in meso crystallization recently published (20). In this chapter, the emphasis is on working with highly unstable GPCRs, which require pre-crystallization analysis of multiple constructs, extensive crystallization optimization, and crystallographic data collection on multiple microcrystals using a highly collimated synchrotron beam.

2. Materials 1. Monoolein (1-oleoyl-rac-glycerol) (Sigma or Nu-Chek Prep).

2.1. Reconstitution of Protein in LCP

2. Additive lipids (Sigma or Avanti Polar Lipids).

2.1.1. Lipid Mixing

3. Chloroform (HPLC grade, Sigma).

2.1.2. Mixing Lipid with Protein

1. Purified protein solution (see Note 1). 2. Two 100 mL gas-tight removable needle syringes without a needle (Hamilton, cat.# 81065). 3. 25 or 50 mL gas-tight syringe with a flat tip (point style 3) removable needle (gauge 26s) (Hamilton, cat.# 80265). 4. Syringe coupler made of two gauge 26 removable needles (Hamilton, cat.# 7768-02) and two removable needle (RN) nuts (Hamilton, cat.# 30902) as shown in Fig. 1 and described in (21).

2.2. LCP-FRAP Crystallization Pre-screening Assays 2.2.1. Protein Labeling

1. Purified protein solution at 1–5 mg/mL concentration. 2. Stock solution of 5 mg/mL Cy3 monofunctional N-hydroxysuccinimide ester (Cy3 NHS ester) (GE Healthcare)

Recent Progress in the Structure Determination of GPCRs

145

Fig.1. (a) Attaching the syringe coupler to a gas-tight syringe. After the coupler is attached, the syringe is filled with lipid. Then a second syringe, filled with protein solution, is attached to the other end of the coupler. (b) Section through the syringe coupler.

in dimethylformamide (DMF) (Sigma). Can be stored at −20 °C for up to 2 weeks (see Note 2). 3. Labeling buffer: 50 mM Hepes pH 7.2, 150 mM NaCl, 0.05%w/v n-dodecyl-b-d-maltopyranoside (DDM), 0.01%w/v cholesterol hemisuccinate (CHS), ligand. 4. Wash buffer: 50 mM Hepes pH 7.5, 150 mM NaCl, 0.05%w/v DDM, 0.01%w/v CHS, 20 mM imidazole, ligand. 5. Elution buffer: 50 mM Hepes pH 7.5, 150 mM NaCl, 0.05%w/v DDM, 0.01%w/v CHS, 200 mM imidazole, ligand. 6. Desalting column PD-10 (GE Healthcare). 7. Ni Sepharose resin (GE Healthcare). 8. Empty 2 mL gravity-flow columns (Thermo Scientific). 2.2.2. Sample Setup

1. Glass slides, 127.8 × 85.5 mm, 1 mm thick (Erie Scientific) (see Note 3). 2. Cover slips, 112 × 77 mm, 0.2 mm thick (Erie Scientific) (see Note 3). 3. Perforated double stick spacer (3M 9492MP double stick tape, 60 mm thick, cut to 112 × 77 mm, with 96 punched 7 mm in diameter holes making 12 columns and 8 rows with 9 mm distance between the centers of the adjacent holes) (Saunders Corp.) (see Note 3). 4. Screening solutions (see Note 4).

146

Cherezov, Abola, and Stevens

2.2.3. FRAP Data Collection

1. FRAP station consisting of a Zeiss AxioImager A1 fluorescent microscope with an EC-Plan 10× objective (NA = 0.3), an HBO 100 epi-illumination, and a Cy3 fluorescence filter set (excitation at 543 nm with 22 nm bandwidth, emission at 605 nm with 60 nm bandwidth); a tunable dye cell (set at 551 nm) MicroPoint laser system (Photonic Instruments); a CoolSnap HQ2, 14 Bit, cooled (−30 °C) CCD (1,392 × 1,040 pixels, 6.45 µm/pixel) monochrome FireWire camera (Photometrics); an XYZ automated microscope stage MS-2000 and an automated shutter with controller (Applied Scientific Instrumentation). All the FRAP station hardware is controlled by an ImagePro Advance Microscopy program suite (Media Cybernetics) (see Note 5).

2.3. Crystallization in Lipidic Mesophases

1. Protein reconstituted in LCP at 10–30 mg/mL final concentration prepared as described in Subheading 3.1. 2. 10 mL gas-tight removable needle syringes without a needle (Hamilton, cat.# 80065). 3. Repetitive syringe dispenser (Hamilton, cat.# 83700) (see Note 6). 4. Short (0.375-in.), flat-tipped needle (point style 3, gauge 26, Hamilton, cat.# 7804-03). 5. Glass slides, 127.8 × 85.5 mm, 1 mm thick (Erie Scientific) (see Note 3). 6. Cover slips, 112 × 77 mm, 0.2 mm thick (Erie Scientific) (see Note 3). 7. Perforated double stick spacer (3M 9500PC double stick tape, 140 mm thick, cut to 112 × 77 mm, with 96 punched 5 mm in diameter holes making 12 columns and 8 rows with 9 mm distance between the centers of the adjacent holes) (Saunders Corp.) (see Note 3). 8. Crystallization screens (see Notes 7 and 8). 9. MiteGen MicroMounts (see Note 9).

3. Methods 3.1. Reconstitution of Protein in LCP

Protocols for pre-crystallization assays and crystallization trials described in this chapter begin with reconstitution of membrane proteins in the lipid bilayer of LCP. The reconstitution is achieved spontaneously upon mechanical mixing of a purified protein in detergent solution with special lipids or lipid mixtures prepared as described in Subheading 3.1.1.

Recent Progress in the Structure Determination of GPCRs 3.1.1. Lipid Mixing

147

The procedure described here can be used to supplement monoolein or any other LCP host lipid with lipophilic additives. Such additives can modulate properties of LCP or can preferentially interact with proteins reconstituted in LCP and therefore are useful for optimization of crystallization conditions. Cholesterol was found to be the best lipid additive significantly improving the b2AR-T4L crystal size (9, 14). Cholesterol was also used for crystallization of adenosine A2A receptor (10). Compatibility of monoolein-based LCP with several lipid-like molecules was reported in (22). 1. Weigh a small amount (few mg) of an additive lipid in a small (1–2 mL) amber glass vial with a Teflon-lined cap. 2. Add an appropriate amount of monoolein to obtain required concentration of the additive lipid (see Note 10). 3. Dissolve the lipids in ~200–400 mL of chloroform. 4. Evaporate bulk of the solvent using a gentle stream of dry, filtered nitrogen, keeping the vial warm (~37 °C) to prevent the lipid from freezing. 5. Remove the last traces of chloroform under a vacuum ( 4)

272

39

Total

612

487

Sequential (|i − j| = 1) Medium range (3 < |i − j| < 5)

The large amount of long range NOEs observed for b-barrel proteins are sufficient for defining their 3D fold, whereas the large amount of medium range NOEs distance restraints allow a precise structure of the secondary structure elements for a-helical membrane proteins, and only a few long range NOE restraints are required to obtain the 3D fold of the protein

In the case of membrane protein oligomer systems, distinction between intra and inter monomeric NOE correlations is required to define the proper quaternary structure of the system. This distinction is obtained by differential subunit labelling protocols (25). A second set of long range distance restraints may be obtained by the use of the paramagnetic relaxation enhancement (PRE) effect. The covalent linkage of an unpaired electron containing tag to a specific location within the protein result in a dramatic sp increase of the transverse relaxation rate ( R2 ) of the neighbouring nuclear spin according to the inter-spin distance ( R2sp a r6). Comparing peak intensities and linewidths in TROSY spectra of spin labelled and corresponding diamagnetic proteins provide distance restraints up to 30 Å. This method has been used to refine the NMR structure of OmpA (20) and to determine the overall fold of the a-helical membrane protein Mistic (19). In order to delineate the protein parts embedded within the micelle and the regions of the MP which are solvent exposed, the combined use of spin labelled detergent or lipophilic paramagnetic probes and soluble paramagnetic compounds has been successfully employed (26, 27). 3.1.2.4. Residual Dipolar Couplings (RDCs)

The orientational constraints deduced from the measurement of RDC are widely used to refine structures of soluble proteins by NMR. In order to measure RDCs, several media can be chosen to induce the weak alignment of the protein sample such as the filamentous phage Pf1, bicelles, and compressed or stretched polyacrylamide gels. For MP-containing micelles characterized by high detergent concentrations, only the latter method is suitable

Determination of Membrane Protein Structures Using Solution and Solid-State NMR

271

and has been applied to measure RDCs for OmpA (20) and phospholamban (28). In order to increase the dataset of RDCs, these gels may be positively or negatively charged to induce distinct alignments of the protein (29). HN, C¢Ca, and NC¢ dipolar couplings may be obtained from IPAP-HSQC and TROSY HNCO experiments. Although not tried yet for membrane proteins, partial protein sample alignment could also be achieved using the covalent attachment of a paramagnetic ion chelating tag. This tagging method is similar to the procedure employed for measuring PREs and could be an alternative to the polyacrylamide gels approach (30). Moreover, different alignments could be achieved varying the lanthanide ion, and therefore, more RDCs could be measured. 3.1.2.5. New Methodological Approaches

C direct detection has been recently proposed as an alternative to the classical more sensitive 1H detection based NMR experiments (31). The slower transverse relaxation rate of 13C, especially upon deuteration labelling, compared to 1H makes the detection of this nucleus very attractive for large biomolecular systems. A whole set of 2D and 3D protonless experiments have been designed to allow backbone and side chain chemical shifts assignments. Identification of 13C resonances within each residue is obtained by 2D/3D CACO, CBCACO, CCCO, and CGCB experiments using various homonuclear decoupling schemes (IPAP, S3E, DIPAP). CON-IPAP, CBCACON-IPAP, and CCCON-IPAP experiments correlate the 13C resonances to the 15 N backbone resonance, and CANCO-IPAP experiments correlate resonances from sequential residues. In the case of Smr solubilised in bicelles containing solution, the combined use of the 2D CACO and TROSY modified HNCA and HNCO experiments yields to more than 80% of COCA correlations assignments (15). One approach to deal with signal overlap is to increase the dimensionality of the NMR spectra without reduction of signal and resolution. Such an approach has been recently proposed and applied to assign the resonances of subunit c of E. coli F1F0 ATP synthase (32). This method, based on the phase sensitive joint sampling of indirect spectral dimensions, allows recording high dimensionality NMR spectra without reduction of resolution compared to their conventional 3D counterparts and within less measurement time. Application to a-helical membrane proteins NMR studies seems particularly suitable considering extensive signal overlap as well as the generally time limited stability of membrane protein NMR samples.

3.1.2.6. Toward MP Dynamics and Mechanisms

One of the main advantages of solution NMR studies of proteins is the ability of probing dynamics for a wide timescale range (typically from pico to milliseconds) at the atomic level. Two recent

13

272

Montaville and Jamin

examples in the literature illustrate how NMR spectroscopy can provide a description of key functional dynamics processes at atomic level. A detailed NMR work performed on the integral membrane enzyme PagP in a detergent compatible with its function, provided a high resolution structure of PagP and established that this protein is in equilibrium between two conformational states (33–35). Chemical shift data indicate that the main structural and dynamics changes between these states involved residues surrounding the catalytic site. It has been proposed that the R state (the more flexible state) may facilitate the ligand entry in the b-barrel core and the more rigid T state may be important for catalysis. Moreover, this enzyme has been further characterized as the relative population of R and T states as well as the exchange rate between R and T and the equilibrium constant were quantified at different temperatures using NMR data. The functional implications of this detailed structural and dynamics description of PagP remain to be investigated. A NMR study of the KcsA potassium channel in DPC micelles at pH 4 (open state) and pH 7 (closed state) provided hints concerning the role of local dynamics in the channel function (22). The NMR signals of residues located in two spatially distinct but functionally important regions of the protein (the selectivity filter and the C-terminal part of TM2) were monitored at different pH values (between 4 and 7). The structural changes observed at different pH values are correlated with the pH dependent open probability profile of the channel measured by functional tests. Occupancy and transition rates between both conformations for residues belonging to the selectivity filter and the C-terminal part of TM2 were measured and appeared to be different at pH 4 and pH 7. Combining these local dynamics data provided a qualitative model of KcsA function in agreement with its simultaneous gating and ion selectivity properties. 3.2. Solid-State NMR of Membrane Proteins

Two different approaches yield high-resolution solid-state NMR spectra. Oriented-sample ssNMR exploits the orientationdependence of the anisotropic interactions. The unique orientation simplifies the spectra and the magnitude and orientation of chemical shift and dipolar coupling tensors provide structural information. Rather than exploiting sample orientation, magic-angle spinning (MAS) NMR averages orientation-dependent interactions through fast sample rotation at the magic angle (54.7°) relative to the external magnetic field while selectively recoupling the interactions needed to obtain structural information during specific periods of the experiment. Structures of small helical MPs have already been obtained using oriented ssNMR (see Table 2). MAS NMR has been applied with success for the determination

18

25

25

36

50

a-Receptor M6

AchR M2 channel

Influenza M2 channel

HIV Vpu channel

fd coat protein

2 (39)

1 (18)

1 (25)

1 (15)

2 (14)

No. of helicesa

POPC/POPG

DOPC/DOPG

DMPC

DMPC

DMPC

Lipid bilayerb

8/80

3.5/75

20/38.5

2/40

4/50

4

5

10

? d

10

400

700

b

a

1MZT

1PJE

1NYJ

1CEK

450 and 500 300 and 400

1PJD

PDB code

700

Number of residues in helices is in parenthesis All samples were aligned on glass plates c Apart from uniformly 15N-labelled sample, selectively 15N-labelled samples were used for both the assignment and structure determination d The number of selectively labelled samples used for the assignment and structure determination is not published

No. of a.a.

Protein

No. of selectively Protein/lipid mg/ 15N-labelled Magnetic field mg samplesc strength (MHz)

Table 2 Small helical MPs which structure have been determined using solid-state NMR

Determination of Membrane Protein Structures Using Solution and Solid-State NMR 273

274

Montaville and Jamin

of 3D structure of soluble protein prepared as microcrystalline sample, and its application to MP is foreseen in a near future. 3.2.1. High Resolution Solid-State of Static Aligned Sample

This method provides very precise orientational constraints for structure determination in membrane bilayers as well as the orientation of the MP in the membrane. All the structures of small membrane peptides/proteins in the PDB determined using ssNMR have been obtained using this method. The largest MP structure is that of fd coat protein, composed of two transmembrane domains. However, its application to larger MP is challenging due to the difficulties to obtain uniform aligned MPs and resonance overlaps (3, 36).

3.2.1.1. Aligned Samples

Sample alignment is a prerequisite of this method and can be very challenging especially for MPs (3, 4). It can be obtained either by mechanical alignment of bilayers on glass plates or by magnetic alignment of bicelles. Mechanical alignment of lipid bilayers. The first step is to prepare proteoliposomes via either organic solvent (for TM peptides) or detergent mediated (for TM peptides and MPs) reconstitution (see Chapter 18 for an example). The choice of organic solvent, detergent, lipids as well as the molar ratio protein to lipid will depend on the peptide/MP and previous synthesis and purification steps. The second step is the preparation of the oriented bilayers on glass plates. Aliquots of a solution of proteoliposomes is spread onto thin glass plates (30–50) and slowly evaporated. The glass plates are stacked, inserted into a sample cell, and are then rehydrated by incubation in a chamber where the humidity is controlled by a saturated salt solution at a temperature above the gel to liquid crystalline phase transition temperature. Oriented bilayers then form after equilibrating the sample in this chamber, and care should be taken to maintain sample hydration during the NMR experiment. Bicelles. Protein-containing bicelles for which the long chain to short-chain phospholipid ratio is between 2.8 and 6.5 will spontaneously align in a magnetic field at temperatures above the gelto-liquid crystalline phase transition temperature of the long-chain phospholipids such that the bilayer normal is perpendicular to the magnetic field. Protocols for preparing stable protein-containing bicelles samples that yield high-resolution ssNMR spectra have recently been published in details (37). The spectral resolution can be improved by using bicelles that align with their bilayer normal parallel to the magnetic field as a result of an increase by a factor two of the spectral range of the chemical shift. This can be obtained with conventional bicelles samples by the addition of lanthanide ions or as demonstrated recently with bicelles containing a lipid with a biphenyl group in one of the acyl chains mixed with DHPC (38).

Determination of Membrane Protein Structures Using Solution and Solid-State NMR

275

The direction and extent of the alignment of the bilayer and the formation of bicelles can be verified with 31P. Protein alignment is checked with a 1D 15N ssNMR spectrum. This spectrum should display significant resolution with identifiable peaks and no evidence of underlying “powder pattern” intensity which would indicate the presence of unoriented material (37). The main advantage of mechanically over magnetically alignment is the choice of lipid composition. Disadvantages of planar bilayers are the difficulties to change easily the environment, to maintain the sample hydration in the presence of high decoupling power, and the necessity to adapt the RF coil to the geometry of the sample. The main disadvantages of bicelles are the limited choice of lipid combinations and the temperature range defined by its lipid composition. 3.2.1.2. Resonance Assignments and Structure Determination

ssNMR on uniaxially oriented samples enables direct correlation between the resonance frequencies of the protein backbone nuclei and the orientation of the peptide plane to which these nuclei belong relative to the bilayer normal. The PISEMA (polarization inversion spin exchange at the magic angle) spectra correlate 15N chemical shift of the backbone amides and 15N–1H dipolar couplings. These spectra exhibit characteristic wheel-like patterns of resonances, PISA wheels (polarity index slant angle), that reflect both the protein structure and orientation in the membrane (3, 4). The analysis of these spectra has been first developed for small helical proteins and was recently applied to a b-barrel MP (39). For uniform helices, the resonances within the PISEMA spectra form PISA wheels with 3.6 resonances per turn and the shape and position of these PISA wheels depend on the helix geometry and its orientation relative to the membrane. PISA wheels can be complemented by dipolar waves, which reflect the periodic wave-like variations of the magnitude of 15N–1H dipolar couplings as a function of residue number. These dipolar waves provide the number of residues in the helix as well as its tilt and rotation angle and are useful to identify helical kinks or curvature. The strategy developed on small MPs relies on the spectra of one uniformly 15N labelled sample and several selectively 15N labelled samples combined with the symmetry properties of the PISA wheel to enable simultaneous resonance assignment and determination of orientational constraints (40). In the PISEMA spectra, resonances from transmembrane helix, from helix parallel to the bilayer plane and from loop and terminal regions without regular secondary structure are in distinct parts of the spectra (Fig. 2). An initial model of the MP with the different helices and their orientation relative to the bilayer plane is generated from pattern recognition of the resonances in the different regions of the PISEMA spectrum of a uniformly 15N-labelled sample. This model is then further refined by rotating separately each

276

Montaville and Jamin

Fig. 2. Ideal PISA wheels representation of an ideal a-helix tilted at 30° (a) and 90° (b) relative to the direction of the applied magnetic field B0.

helix around its axis until the resonances patterns of the calculated PISA wheel match the resonances of the experimental spectra of selectively labelled samples. This protocol yields the rotation of the helices and the assignments of the resonances. In the next step, the 15N chemical shift and 15N–1H dipolar frequencies are measured in the PISEMA spectra and are used to determine the polar angles that describe the peptide plane orientation in the magnetic field and provide input for structure calculation. For more complex small MPs, each resonance must be assigned before the analysis of the dipolar waves and the structure calculation (41). Other strategies have been developed such as fitting the spectrum to a model structure assuming a constant peptide plane geometry (4, 42). Current developments include the design of tripleresonance experiments for resolution of the overlap of resonances and for measuring additional orientational constraints (43). 3.2.2. High Resolution MAS NMR

Although no MP 3D structure have been determined using this technique, sample preparation and assignments strategies have been published and are briefly summarized below (44, 45).

3.2.2.1. Sample Preparation

MPs can be studied as microcrystal or proteoliposomes. Microcrystal may be obtained from precipitates of solubilized MPs in detergent micelles or bicelles. There is no general procedure for optimal sample preparation for ssNMR and varied precipitation conditions are generally tested to obtain the highest spectral quality (signal-tonoise ratio, resolution) as assessed by 2D 13C–13C homonuclear spectra. Typical narrow line widths of ~0.5–0.7 ppm for 13C and ~0.7–5 ppm for 15N are expected to give high resolution spectra.

Determination of Membrane Protein Structures Using Solution and Solid-State NMR

277

Detergent-purified membrane proteins can also be reconstituted into lipid bilayers to form proteoliposomes or 2D crystals. The sample is then packed into a rotor (about 40 ml standard volume). Due to the low sensitivity of the heteronuclear 3D NMR experiments, large quantity of pure and conformationally homogeneous protein sample are needed. In the case of proteoliposomes, the difficulty of getting a large quantity of proteins into the rotor is linked to the presence of lipids as typical 1:100 molar protein/ lipid ratios are used to avoid protein aggregation. The use of crystalline samples overcomes this concentration problem but at the detriment of the bilayer environment. Finally, the sample should be stable for several days due to the necessity to record several 2D and 3D experiments and compatible with sample heating that can result from strong proton decoupling during acquisition and sample spinning at high speed (~10–20 kHz). It has been noticed that 2D crystals seem to yield better resolved lines than proteoliposomes (46). 13 C and 15N signals are detected in ssNMR as 1H signals give large line widths (several kilohertz) at the standard spinning rates used. Therefore, 15N and 13C labelled protein samples should be prepared. Apart from fully uniformly (13C, 15N)-labeled proteins samples, different labelling strategies have been proposed for both signal assignments and structure calculation. In order to reduce spectral crowding, either amino acid-specific and/or reverse labelling can be used. Reverse 13C and 15N-labelling is obtained by adding amino acids of natural 13C and 15N abundance to the expression medium and therefore, the resulting spectra are devoid of 13C and 15N signals from these amino acids. Other specific labelling schemes have been developed in combination with specific NMR techniques. In particular, in order to avoid the strong couplings between directly bonded spins that would interfere with the weak coupling yielding distance information, reduced 13C-block-labelling schemes are used (47). This reduced labelling or spin dilution can be obtained by using [1, 3-13C]-glycerol and [2-13C]-glycerol as the sole nutrient carbon source in bacteria medium and results in an alternating labelling scheme where carbon sites are labelled but not in adjacent positions. In combination with this labelling scheme, long-range distances constraints are obtained using a broad-band recoupling method like the proton-driven spin diffusion (PDSD) mixing scheme (48). 3.2.2.2. Resonance Assignments

The full sequence-specific assignment of membrane protein by ssNMR remains a difficult challenge. It concerns 13C and 15N resonances since the proton resonances are not sufficiently well-resolved. The strategy relies on well dispersed resonances in the 15N dimension and on 13C or 15N heteronuclei correlations, employing NCACX and NCOCX type experiments. NCACX experiments correlate 15 N, CA with 13C of the side chain while NCOCX experiments correlate 15N, CO, and 13C of the backbone/side chains (49).

278

Montaville and Jamin

However, these two experiments are generally not sufficient for uniformly labelled proteins especially for proteins that contain a high number of hydrophobic residues in helical conformation due to the degeneracy of 13C and 15N chemical shifts. For samples containing large quantity of protein, two additional experiments can be recorded, CAN(CO)CX and CON(CA)CX 3D which provide unambiguous interresidue 13C–13C correlations (44). In case of low resolution, dispersion, and sensitivity of NCACX and NCOCX spectra, the assignments would mainly rely on homonuclear carbon correlations and specific amino acid labelling schemes. Similar to solution NMR, the assignment protocol involved first the sequential assignment which is composed of the following steps: spin system identification, linking the spin systems and amino acid type identification. 3.2.2.3. Distance Restraints and Structure

Structure-determination protocols are currently developed for and with small soluble proteins in micro-crystalline state. As a matter of fact, five 3D structures of soluble proteins in microcrystalline state have been determined using MAS (50). Two different strategies have been proposed. The first one is based on the estimation of a large number of 13C–13C distances using protondriven spin-diffusion (PDSD) experiments performed on reduced 13 C-block-labelled samples (51). It necessitates extensive labelling strategies, removing one-bond couplings by partial labelling and has been used for two proteins (spectrin SH3 domain, 62 residues and ubiquitin, 76 residues). The second strategy involves the indirect measurement of proton distances using proton mediated 13C/15N detected experiments performed on a uniformly 13 C/15N labelled sample (13C,13C-encoded 1H,1H mixing, i.e., CHHC and 15C,13C-encoded 1H,1H mixing, i.e., NHHC experiments) (52). This strategy has the advantage that only one sample is required but the experiments are not very sensitive and large amount of protein is necessary. In addition to distance constraints, empirical chemical shifts restraints (torsion angle likelihood obtained from shift and sequence similarity (TALOS)) are combined to improve the structure calculations.

4. Conclusion and Perspectives The success of both solution and ssNMR applied to the determination of MP 3D structure is first highly correlated to the improved protocols for obtaining high amount of homogeneous and functional protein in a membrane mimic environment which yield high spectral quality.

Determination of Membrane Protein Structures Using Solution and Solid-State NMR

279

Membrane protein solution NMR is still a challenging field despite the increased releasing rate of structures published in the last years. The field of large soluble protein structural investigations by NMR mainly contributed to the early successful studies of membrane proteins. The use of specific approaches such as paramagnetic reagents or specific labelling of hydrophobic residues predominantly found in the transmembrane domains provided essential tools for structural determinations of membrane proteins. Recent methodological developments in the field of large protein systems such as 13C direct detection include nowadays MPs as one of their main applications. Solution NMR has already addressed various aspects of membrane protein function such as protein–protein interactions (53), protein–detergent interactions (13, 26), drug screening (54). The ability to study MPs in bicelles (the most native like environment still compatible with solution NMR), will open the application of this technique to more specific aspects such as MP–lipids interactions, MP–MP interactions or membrane protein folding. Recent advances in ssNMR have provided 3D structures of small MPs in phospholipids bilayers. Besides 3D structures, ssNMR already provides essential structural and dynamics data on large MPs (55, 56), as well as more specific information related to MP–protein interactions (57), seven TM helices GPCR topology (58), MP–MP interaction (59, 60), and orientation of MPs in the bilayer (39). Increasing resolution and sensitivity of ssNMR technique is now essential for studying at atomic level more complex functional MPs systems in their native environment. Thanks to recent developments, both solid-state and solution state NMR are essential tools that, when used in combination with other experimental structural techniques such as X-ray crystallography, electron microscopy, AFM, and EPR, provide a powerful set of structural techniques for unprecedented detailed description of membrane proteins systems. References 1. Vold RR, Prosser RS (1996) Magnetically oriented phospholipid bilayered micelles for structural studies of polypeptides. Does the ideal bicelle exist? J Magn Res B 113: 267–271 2. Tugarinov V, Hwang PM, Kay LE (2004) Nuclear magnetic resonance spectroscopy of high-molecular-weight proteins. Annu Rev Biochem 73:107–146 3. Page RC, Li C, Hu J, Gao FP, Cross TA (2007) Lipid bilayers: an essential environment for the understanding of membrane proteins. Magn Reson Chem 45:S2–S11

4. De Angelis AA, Jones DH, Grant CV, Park SH, Mesleh MF, Opella SJ (2005) NMR experiments on aligned samples of membrane proteins. Methods Enzymol 394:350–382 5. Watts A, Straus SK, Grage SL, Kamihira M, Lam YH, Zhao X (2004) Membrane protein structure determination using solid-state NMR Methods. Mol Biol 278:403–473 6. Nielsen N, Malmendal A, Vosegaard T (2004) Techniques and applications of NMR to membrane proteins. Mol Membr Biol 21:129–141 7. Gordon E, Horsefield R, Swarts HG, de Pont JJ, Neutze R, Snijder A (2008) Effective

280

8.

9.

10.

11.

12.

13.

14. 15.

16.

Montaville and Jamin high-throughput overproduction of membrane proteins in Escherichia coli. Protein Expr Purif 62:1–8 Tyler RC, Sreenath HK, Singh S, Aceti DJ, Bingman CA, Markley JL, Fox BG (2005) Auto-induction medium for the production of [U-15N)- and [U-13C, U-15N)-labeled proteins for NMR screening and structure determination. Protein Expr Purif 40:268–278 Koglin A, Klammt C, Trbovic N, Schwarz D, Schneider B, Schafer B, Lohr F, Bernhard F, Dotsch V (2006) Combination of cell-free expression and NMR spectroscopy as a new approach for structural investigation of membrane proteins. Magn Reson Chem 44: S17–S23 Reckel S, Sobhanifar S, Schneider B, Junge F, Schwarz D, Durst F, Lohr F, Guntert P, Bernhard F, Dotsch V (2008) Transmembrane segment enhanced labeling as a tool for the back bone assignment of alpha-helical membrane proteins. Proc Natl Acad Sci U S A 105:8262–8267 Krueger-Koplin RD, Sorgen PL, KruegerKoplin ST, Rivera-Torres IO, Cahill SM, Hicks DB, Grinius L, Krulwich TA, Girvin ME (2004) An evaluation of detergents for NMR structural studies of membrane proteins. J Biomol NMR 28:43–57 Page RC, Moore JD, Nguyen HB, Sharma M, Chase R, Gao FP, Mobley CK, Sanders CR, Ma L, Sonnichsen FD, Lee S, Howell SC, Opella SJ, Cross TA (2006) Comprehensive evaluation of solution nuclear magnetic resonance spectroscopy sample preparation for helical integral membrane proteins. J Struct Funct Genomics 7:51–64 Zhang Q, Horst R, Geralt M, Ma X, Hong WX, Finn MG, Stevens RC, Wuthrich K (2008) Microscale NMR screening of new detergents for membrane protein structural biology. J Am Chem Soc 130:7357–7363 Sanders CR, Sonnichsen F (2006) Solution NMR of membrane proteins: practice and challenges. Magn Reson Chem 44:S24–S40 Poget SF, Girvin ME (2007) Solution NMR of membrane proteins in bilayer mimics: small is beautiful, but sometimes bigger is better. Biochim Biophys Acta 1768:3098–3106 Bocharov EV, Pustovalova YE, Pavlov KV, Volynsky PE, Goncharuk MV, Ermolyuk YS, Karpunin DV, Schulga AA, Kirpichnikov MP, Efremov RG, Maslennikov IV, Arseniev AS (2007) Unique dimeric structure of BNip3 transmembrane domain suggests membrane permeabilization as a cell death trigger. J Biol Chem 282:16256–16266

17. Shih SC, Stoica I, Goto NK (2008) Investigation of the utility of selective methyl protonation for determination of membrane protein structures. J Biomol NMR 42:49–58 18. Zhou Y, Cierpicki T, Flores Jimenez RH, Lukasik SM, Ellena JF, Cafiso DS, Kadokura H, Beckwith J, Bushweller JH (2008) NMR solution structure of the integral membrane enzyme DsbB: functional insights into DsbBcatalyzed disulfide bond formation. Mol Cell 31:896–908 19. Roosild TP, Greenwald J, Vega M, Castronovo S, Riek R, Choe S (2005) NMR structure of Mistic, a membrane-integrating protein for membrane protein expression. Science 307:1317–1321 20. Cierpicki T, Liang B, Tamm LK, Bushweller JH (2006) Increasing the accuracy of solution NMR structures of membrane proteins by application of residual dipolar couplings. High-resolution structure of outer membrane protein. A J Am Chem Soc 128:6947–6951 21. Fernandez C, Wider G (2003) TROSY in NMR studies of the structure and function of large biological macromolecules. Curr Opin Struct Biol 13:570–580 22. Baker KA, Tzitzilonis C, Kwiatkowski W, Choe S, Riek R (2007) Conformational dynamics of the KcsA potassium channel governs gating properties. Nat Struct Mol Biol 14:1089–1095 23. Tugarinov V, Muhandiram R, Ayed A, Kay LE (2002) Four-dimensional NMR spectroscopy of a 723-residue protein: chemical shift assignments and secondary structure of malate synthase g. J Am Chem Soc 124: 10025–10035 24. Fernandez C, Hilty C, Wider G, Guntert P, Wuthrich K (2004) NMR structure of the integral membrane protein OmpX. J Mol Biol 336:1211–1221 25. Oxenoid K, Chou JJ (2005) The structure of phospholamban pentamer reveals a channellike architecture in membranes. Proc Natl Acad Sci U S A 102:10870–10875 26. Hilty C, Wider G, Fernandez C, Wuthrich K (2004) Membrane protein–lipid interactions in mixed micelles studied by NMR spectroscopy with the use of paramagnetic reagents. Chembiochem 5:467–473 27. Beel AJ, Mobley CK, Kim HJ, Tian F, Hadziselimovic A, Jap B, Prestegard JH, Sanders CR (2008) Structural studies of the transmembrane C-terminal domain of the amyloid precursor protein (APP): does APP function as a cholesterol sensor? Biochemistry 47:9428–9446

Determination of Membrane Protein Structures Using Solution and Solid-State NMR 28. Mascioni A, Eggimann BL, Veglia G (2004) Determination of helical membrane protein topology using residual dipolar couplings and exhaustive search algorithm: application to phospholamban. Chem Phys Lipids 132:133–144 29. Cierpicki T, Bushweller JH (2004) Charged gels as orienting media for measurement of residual dipolar couplings in soluble and integral membrane proteins. J Am Chem Soc 126:16259–16266 30. Kamen DE, Cahill SM, Girvin ME (2007) Multiple alignment of membrane proteins for measuring residual dipolar couplings using lanthanide ions bound to a small metal chelator. J Am Chem Soc 129:1846–1847 31. Bermel W, Bertini I, Felli IC, Lee YM, Luchinat C, Pierattelli R (2006) Protonless NMR experiments for sequence-specific assignment of backbone nuclei in unfolded proteins. J Am Chem Soc 128: 3918–3919 32. Zhang Q, Atreya HS, Kamen DE, Girvin ME, Szyperski T (2008) GFT projection NMR based resonance assignment of membrane proteins: application to subunit C of E. coli F(1)F(0) ATP synthase in LPPG micelles. J Biomol NMR 40:157–163 33. Hwang PM, Choy WY, Lo EI, Chen L, Forman-Kay JD, Raetz CR, Prive GG, Bishop RE, Kay LE (2002) Solution structure and dynamics of the outer membrane enzyme PagP by NMR. Proc Natl Acad Sci U S A 99:13560–13565 34. Hwang PM, Bishop RE, Kay LE (2004) The integral membrane enzyme PagP alternates between two dynamically distinct states. Proc Natl Acad Sci U S A 101:9618–9623 35. Hwang PM, Kay LE (2005) Solution structure and dynamics of integral membrane proteins by NMR: a case study involving the enzyme PagP. Methods Enzymol 394: 335–350 36. Li C, Gao P, Qin H, Chase R, Gor’kov PL, Brey WW, Cross TA (2007) Uniformly aligned full-length membrane proteins in liquid crystalline bilayers for structural characterization. J Am Chem Soc 129:5304–5305 37. De Angelis AA, Opella SJ (2007) Bicelle samples for solid-state NMR of membrane proteins. Nat Protoc 2:2332–2338 38. Park SH, Loudet C, Marassi FM, Dufourc EJ, Opella SJ (2008) Solid-state NMR spectroscopy of a membrane protein in biphenyl phospholipid bicelles with the bilayer normal parallel to the magnetic field. J Magn Reson 193:133–138

281

39. Mahalakshmi R, Marassi FM (2008) Orientation of the Escherichia coli outer membrane protein OmpX in phospholipid bilayer membranes determined by solid-state NMR. Biochemistry 47:6531–6538 40. Marassi FM, Opella SJ (2003) Simultaneous assignment and structure determination of a membrane protein from NMR orientational restraints. Protein Sci 12:403–411 41. De Angelis AA, Howell SC, Nevzorov AA, Opella SJ (2006) Structure determination of a membrane protein with two trans-membrane helices in aligned phospholipid bicelles by solid-state NMR spectroscopy. J Am Chem Soc 128:12256–12267 42. Achuthan S, Asbury T, Hu J, Bertram R, Cross TA, Quine JR (2008) Continuity conditions and torsion angles from ssNMR orientational restraints. J Magn Reson 191:24–30 43. Sinha N, Grant CV, Park SH, Brown JM, Opella SJ (2007) Triple resonance experiments for aligned sample solid-state NMR of (13)C and (15)N labeled proteins. J Magn Reson 186:51–64 44. Li Y, Berthold DA, Gennis RB, Rienstra CM (2008) Chemical shift assignment of the transmembrane helices of DsbB, a 20-kDa integral membrane enzyme, by 3D magicangle spinning NMR spectroscopy. Protein Sci 17:199–204 45. Huang L, McDermott AE (2008) Partial sitespecific assignment of a uniformly (13)C, (15) N enriched membrane protein, light-harvesting complex 1 (LH1), by solid state NMR. Biochim Biophys Acta 1777:1098–1108 46. Lorch M, Fahem S, Kaiser C, Weber I, Mason AJ, Bowie JU, Glaubitz C (2005) How to prepare membrane proteins for solid-state NMR: a case study on the alpha-helical integral membrane protein diacylglycerol kinase from E. coli. Chembiochem 6:1693–1700 47. Pauli J, van Rossum B, Forster H, de Groot HJ, Oschkinat H (2000) Sample optimization and identification of signal patterns of amino acid side chains in 2D RFDR spectra of the alpha-spectrin SH3 domain. J Magn Reson 143:411–416 48. Manolikas T, Herrmann T, Meier BH (2008) Protein structure determination from 13C spin-diffusion solid-state NMR spectroscopy. J Am Chem Soc 130:3959–3966 49. Pauli J, Baldus M, van Rossum B, de Groot H, Oschkinat H (2001) Backbone and sidechain 13C and 15N signal assignments of the alpha-spectrin SH3 domain by magic angle spinning solid-state NMR at 17.6 Tesla. Chembiochem 2:272–281

282

Montaville and Jamin

50. http://www.drorlist.com/nmr/SPNMR. html 51. Castellani F, van Rossum B, Diehl A, Schubert M, Rehbein K, Oschkinat H (2002) Structure of a protein determined by solid-state magicangle-spinning NMR spectroscopy. Nature 420:98–102 52. Lange A, Becker S, Seidel K, Giller K, Pongs O, Baldus M (2005) A concept for rapid protein-structure determination by solid-state NMR spectroscopy. Angew Chem Int Ed Engl 44:2089–2092 53. Malia TJ, Wagner G (2007) NMR structural investigation of the mitochondrial outer membrane protein VDAC and its interaction with antiapoptotic Bcl-xL. Biochemistry 46:514–525 54. Assadi-Porter FM, Tonelli M, Maillet E, Hallenga K, Benard O, Max M, Markley JL (2008) Direct NMR detection of the binding of functional ligands to the human sweet receptor, a heterodimeric family 3 GPCR. J Am Chem Soc 130:7212–7213 55. Etzkorn M, Kneuper H, Dunnwald P, Vijayan V, Kramer J, Griesinger C, Becker S, Unden G, Baldus M (2008) Plasticity of the PAS domain and a potential role for signal transduction in the histidine kinase DcuS. Nat Struct Mol Biol 15:1031–1039

56. Ader C, Schneider R, Hornig S, Velisetty P, Wilson EM, Lange A, Giller K, Ohmert I, Martin-Eauclaire MF, Trauner D, Becker S, Pongs O, Baldus M (2008) A structural link between inactivation and block of a K+ channel. Nat Struct Mol Biol 15:605–612 57. Lange A, Giller K, Hornig S, Martin-Eauclaire MF, Pongs O, Becker S, Baldus M (2006) Toxin-induced conformational changes in a potassium channel revealed by solid-state NMR. Nature 440:959–962 58. Etzkorn M, Martell S, Andronesi OC, Seidel K, Engelhard M, Baldus M (2007) Secondary structure, dynamics, and topology of a sevenhelix receptor in native membranes, studied by solid-state NMR spectroscopy. Angew Chem Int Ed Engl 46:459–462 59. Seidel K, Andronesi OC, Krebs J, Griesinger C, Young HS, Becker S, Baldus M (2008) Structural characterization of Ca(2+)ATPase-bound phospholamban in lipid bilayers by solid-state nuclear magnetic resonance (NMR) spectroscopy. Biochemistry 47: 4369–4376 60. Traaseth NJ, Verardi R, Torgersen KD, Karim CB, Thomas DD, Veglia G (2007) Spectroscopic validation of the pentameric structure of phospholamban. Proc Natl Acad Sci U S A 104: 14676–14681

Chapter 15 Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure of Membrane Proteins Philip L. Yeagle and Arlene D. Albert Abstract Structural data on membrane proteins, while crucial to understanding cellular function, are scarce due to difficulties in applying to membrane proteins the common techniques of structural biology. Fragments of membrane proteins have been shown to reflect, in many cases, the secondary structure of the parent protein with fidelity and are more amenable to study. This chapter provides many examples of how the study of membrane protein fragments has provided new insight into the structure of the parent membrane protein. Key words: Membrane protein, Structure, Peptide, Secondary structure, NMR, Loop, Transmembrane helix, Protein fragment, Structural biology

1. Introduction Membrane proteins enable and regulate a wide variety of cell functions such as cell signaling, membrane transport, biosynthesis, and cell morphology. Membrane proteins are major targets for drug development. Three-dimensional structure is as important for membrane proteins as it is for soluble proteins in understanding function, regulation, and external control by pharmaceuticals. A severe deficit of membrane protein structures at the atomic level, however, hampers our understanding of the function of these proteins in cellular membranes. The protein structure databases contain tens of thousands of atomic structures of proteins, but less than 0.5% of the structures are of membrane proteins. One database lists fewer than 200 unique membrane protein structures determined to atomic level resolution (1). This information deficit delineates the frontier of structure biology today: developing new approaches and refining existing Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_15, © Springer Science+Business Media, LLC 2010

283

284

Yeagle and Albert

techniques to increase the breadth of structural knowledge of membrane proteins. It is only with such structural information that mechanistic understanding of cellular processes can be expressed at the atomic level. And it is such structural information that informs most effectively the development of new interventions to modify function, such as therapies for diseases. The major methodology behind the available structural information for membrane proteins is X-ray crystallography, just as it is for soluble proteins. Many examples will be seen in other chapters in this volume. However, X-ray crystallography has generated far fewer structures of membrane proteins than it has for soluble proteins because membrane proteins, which are not soluble in aqueous media, are not readily amenable to the commonly used technologies for crystal growth required by X-ray crystallography. Therefore, alternative methodologies of structural biology have been explored in the search for new membrane protein structural information. The major alternative methodology for determining highresolution structures of soluble proteins is nuclear magnetic resonance (NMR). Modern NMR approaches have led to an explosion of structural data derived from proteins in solution. This is particularly important because it circumvents the crystallization requirement. A number of investigators have worked to translate this approach from soluble protein structural determination to the determination of the structures of membrane proteins. Unfortunately, the insolubility of membrane proteins in aqueous media has inhibited the application of solution NMR methodology. Isolated membrane proteins reconstituted into a lipid bilayer form a complex too large to experience rotational diffusion rapid enough for solution NMR. To address this problem, membrane proteins have been solubilized in detergent micelles, which are much smaller than reconstituted bilayers and hence experience much more rapid rotational diffusion. By using detergent solubilized proteins, structures for several b-barrel membrane proteins from bacteria (porins) have now been obtained by NMR. Despite the success achieved with b-barrel proteins, only one complete high-resolution structure has been published strictly from NMR data for a membrane protein containing a transmembrane a-helical bundle. Given the importance of this class of transmembrane proteins (likely the majority of transmembrane proteins), it was critical to develop other methodologies to obtain structural information. This imperative drove the development of an alternative approach to obtain structural information for membrane proteins built around an a-helical bundle. This approach that is based on determining the structures of fragments of membrane proteins is the subject of this chapter. This alternative approach was founded on the hypothesis that a-helices in a transmembrane bundle exhibit stable secondary

Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure

285

structure independent of the remainder of the protein (short-range interactions, including hydrogen bonds within the helix, stabilize this element of secondary structure). The hypothesis was extended to the secondary structure of short loops, such as a b-turn connecting two transmembrane helices, which also contain internal hydrogen bonds. Since loops and a-helices are stabilized by shortrange interactions, peptide fragments of the membrane protein corresponding to an individual loop or a-helix may well form the same secondary structure found in the intact protein. This chapter begins with peptide structures that correspond to small domains of membrane proteins. These peptides exhibit well-defined structures and in some cases can be shown to possess biological activity. Biological activity of the fragment lends credence to the deduction that these peptides are forming solution structures similar to what these fragments form in the intact protein. This section illustrates how important information of secondary structure can be obtained even from relatively small fragments of membrane proteins. This chapter continues with recent studies on fragments that contain two transmembrane helices connected by a loop. These studies reveal information about tertiary structure of the proteins, as well as the protein secondary structure because they show the packing of one transmembrane helix against the other. Next to be discussed is a set of membrane proteins for which structures of more than one domain have been determined based on peptide studies. These studies show that the structure of a membrane protein can be extensively defined by determining the secondary structures of the individual helices and loops. Finally, since structures of both helices and loops are independently stabilized by short-range interactions even when separate from the intact protein, one could conceive of an approach leading to the structure of an entire transmembrane helical bundle protein using the structures of these peptide fragments. This chapter ends with a description of the methodology that allows the assembly of the structures of the fragments into a structure of the whole protein. To do so, long distance constraints from measurements on intact proteins are exploited that reflect the threedimensional packing of the elements of secondary structure of the protein. A remarkably useful structure of the whole protein, entirely determined by experimental constraints, can be obtained (2, 3). This methodology was confirmed using bacteriorhodopsin for which several X-ray crystal structures were available for verification of the result. The success of this approach has important implications for membrane protein folding and stability. These concepts form the basis for a rapidly growing series of studies on a wide variety of membrane proteins, many of them G-protein coupled receptors (GPCR), using protein fragments to determine the secondary structure of the intact protein. The earliest

286

Yeagle and Albert

of these studies have been reviewed previously and will only be summarized here (4, 5). In most cases, the structures of these fragments provide the first atomic level insight into the structures of the corresponding membrane proteins.

2. Materials Peptide fragments are designed to encompass a loop, a transmembrane helix, or a helix-turn-helix of the parent protein. Standard structure prediction techniques are used to estimate the portion of the primary sequence of the protein that defines the loop, the transmembrane helix, or the helix-turn-helix. The hydrophobicity plot is the primary tool, with the addition where appropriate of factors like the preferred location of tryptophans at the interface between the hydrophobic and hydrophilic domains of a membrane and the inclusion on the end of fragments representing transmembrane helices of any charged residues in the immediate sequence to assist in solubility. The peptides are synthesized by solid-phase peptide synthesis. If desired, stable isotopes can be included at specific positions. As will be seen in the many examples given below, loop fragments can often be examined in aqueous buffer. However, transmembrane fragments are insoluble in water and must be studied in detergent micelles or in organic solvent. The most generally useful solvent that does not appear to perturb the secondary structure is dimethylsulfoxide. All these structural studies are performed using high resolution, high field NMR, with standard two- and three-dimensional techniques.

3. Methods 3.1. Secondary Structures of Fragments of Membrane Proteins Corresponding to Single Loops or to Single Transmembrane Helices

Structures of over 100 peptide fragments of helical transmembrane proteins have now been determined. In many cases, these peptides exhibit biological activity. It is reasonable to propose that the structure of the peptide in solution that is biologically active closely resembles the structure of the corresponding domain in the intact protein. A case in point is provided by the cytoplasmic loops of bovine rhodopsin (a GPCR) that exhibited both biological activity in solution (6) and structure in solution (7). In some cases, subsequent X-ray crystal structure results demonstrated unequivocally that many peptide fragments of membrane proteins are able to independently assume the structural motifs of helices and turns characteristic of the intact protein. This is

Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure

287

consistent with the stabilization of secondary structures through short-range interactions. These kinds of studies demonstrate that it is feasible to obtain relevant structural information for discrete domains of membrane proteins through experimentally accessible approaches on designed peptides. The following will discuss first transmembrane fragments that form helices, followed by fragments that form loops that in turn connect transmembrane helices. 3.1.1. Transmembrane Helical Fragments

In the following, many of the studies on single transmembrane fragments of membrane proteins are briefly summarized to provide an overview of what has been reported to date. At the end of this section, some general features of these structures are discussed. A number of studies have examined fragments of GPCRs. Peptide fragments were chosen based partly on hydropathy plots and were designed to encompass the entire sequence of the putative transmembrane helix. For example, a peptide fragment corresponding to the seventh transmembrane domain (TM) of the tachykinin NK-1 receptor was synthesized, and an a-helical solution structure was reported (8). Similar studies have been reported on fragments from the human adenosine A2a receptor (9), the a-factor receptor (Ste2p) from Saccharomyces cerevisiae (10), and bovine rhodopsin (11). Studies have reported a-helical structures from fragments encompassing transmembrane segments of transport proteins. Peptides corresponding to the transmembrane domain of the monomer of three forms of the potassium ion channel, Shaker, ROMK1, and minK, were each predominantly helical (12). Two of the putative transmembrane segments of the human erythrocyte anion transporter, band 3, were studied and predominantly a-helical structures were reported (13). A peptide containing the putative transmembrane domain of Isk, a voltage-gated potassium channel (residues 42–68), promoted ion transport and solution NMR studies showed an a-helical structure (14). A peptide corresponding to the putative TM7 of the yeast vacuolar H+ ATPase was synthesized, its solution structure determined by NMR, and a predominantly helical structure was observed (15, 16). A peptide containing the sequence of the seventh transmembrane segment of the Na+/H+ exchanger was synthesized. A helical structure, with a break at the double glycine in the helix, was observed (17). Almost all of the transmembrane segments of lactose permease have also been studied by NMR, and a-helical structures were reported (18). Several other examples of peptide fragments of the transmembrane domain of other transmembrane proteins have been reported to form a-helical structures as well. Peptides corresponding to two putative TM from phosphatidylglycerophosphate synthase (E. coli) formed stable helices in solution and in

288

Yeagle and Albert

SDS micelles (19). A peptide containing the sequence of the first TM domain of nAChR b2 subunit was studied in dodecylphosphocholine micelles, and while predominantly helical, the study revealed nonhelical sections with possible functional significance (20). A peptide corresponding to the transmembrane segment of human erythrocyte glycophorin was synthesized, and a helical structure was reported (21). A peptide containing the sequence of the transmembrane segment of FKBP-like protein twisted dwarf 1 (TWD1) from Arabidopsis thaliana was studied by NMR and found to be largely helical (22). NMR studies show a helical conformation for the transmembrane segment of PMP1, a transmembrane protein with a single transmembrane domain that regulates H+-ATPase (23). The Na+K+-ATPase regulatory protein FXYD1 was studied by NMR in micelles and a predominantly helical, though bent, structure was found (24). Subunit b of the F1F0ATP synthase shows an a-helix representing a TM in the amino terminus of this protein (25). The structures obtained from these fragments of the transmembrane domains of a wide variety of membrane proteins reveal that these fragments form stable a-helical structures. In a few cases, comparison with subsequently reported X-ray crystal structures became possible, and good structural correspondence was observed. One of the general characteristics of these studies is that these a-helical structures in solution showed greatest stability in the center of the fragment. The ends of these peptide structures are commonly frayed (see Fig. 1). This phenomenon is widely observed for structures of peptides of these sizes and suggests that these peptide fragments must be designed to be longer than the sequence for which a stable structure is desired. 3.1.2. Fragments Corresponding to Loops of Membrane Proteins

Loop regions of membrane proteins correspond to domains exterior to the membrane. Together the loops can form the site for interaction of biological ligands or drug molecules with the receptor. Individually, these loops may exhibit partial activity for binding ligands or other proteins. For example, the third extracellular loop of the cholecystokinin-2 receptor (26–31) binds a ligand for the receptor. Therefore, the structures of membrane proteins loops are of particular interest. Many studies have been reported, and are noted below, followed by some general observations at the end of this section. Peptide fragments from several GPCR’s have been studied. In some of these cases, a peptide fragment was designed to encompass a loop bounded by a short stretch of helix on either end. This fragment design has been called a helix-turn-helix motif based on the structural results (see below). Two fragments of the parathyroid hormone receptor have been studied by solution NMR techniques in detergent micelles. One peptide containing the amino acid sequence of the first

Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure

289

Fig. 1. Family of structures derived from solution NMR data for a peptide fragment corresponding to transmembrane helix 7. See ref. 19 for details. The figure demonstrates that the stable portions of these fragments are in central portion of the peptide and the ends of the solution structure are typically frayed.

extracellular loop showed a loop structure. The peptide was sufficiently long to contain a loop bounded by two short helices, forming a helix-turn-helix motif (32). A peptide containing the sequence of the third extracellular loop of the cholecystokinin-2 receptor (residues 352–379) revealed a helix-turn-helix motif (28). A 44-residue peptide of the human cannabinoid 1 receptor containing the amino acid sequence of the third cytoplasmic loop of this GPCR is biologically active and exhibits a helix-turn-helix motif (33) mimicking the turn found in the intact protein. A 34residue peptide fragment corresponding to the second intracellular loop of the bradykinin B2 receptor revealed a helix-turn-helix motif (27, 28). A peptide fragment containing the sequence of the second cytoplasmic loop, iC2, of the V1A vasopressin receptor adopted a helix-turn-helix motif, consistent with a turn in the protein connecting two transmembrane helices (34). The structure was very similar to the structure of the corresponding loop on rhodopsin (35). Other peptide fragments corresponding to loops in GPCRs were not long enough to form the helix-helix-turn motif, but nevertheless formed a stable structure in solution; in several cases,

290

Yeagle and Albert

a turn was observed. The peptide corresponding to the third cytoplasmic loop of the parathyroid hormone receptor formed a loop structure (36, 37). Peptide fragments of the first and second extracellular loops of the thromboxane A2 receptor were synthesized. In each case, loop structures were obtained (38, 39), consistent with the connection of these fragments to two transmembrane helices. A peptide fragment from the cytoplasmic face of the parathyroid hormone/parathyroid hormone related protein receptor was structured in solution (37). Two peptides were synthesized that spanned the third cytoplasmic loop of the rat angiotensin II AT1A receptor, a GPCR (40). Some helix was observed, a portion of which corresponds to the connection to a TM. The fragment peptide corresponding to the first extracellular loop forms a type 2 b turn (41). Peptide fragments of the turkey b-adrenergic receptor corresponding to the third intracellular loop showed helical structure, likely corresponding to the beginning of TM6 (42). The same region of the human b-adrenergic receptor was also found to be helical in detergent micelles but in water the peptide was disordered (43). The second cytoplasmic loop of the a2A adrenergic receptor was studied using a peptide fragment in DPC micelles. This peptide proved to be predominantly helical (44). A peptide fragment of the amino terminus of the neurokinin-1 receptor was synthesized, and the structure was determined. Some helix was observed, likely a portion of TM1. A 27 residue peptide corresponding to the third extracellular loop of this GPCR was made, and its structure showed a largely helical structure, consistent with a helix-turn-helix motif (33). The crystal structure of the GPCR rhodopsin shows a helical segment at the amino terminus that lies parallel to the plane of the bilayer and is referred to as helix 8. A peptide fragment corresponding to the putative helix 8 of the human cannabinoid 1 receptor was found to be predominantly helical (45). The putative helix 8 of the cannabinoid 2 receptor has also been synthesized and the structure determined, and an a-helix was observed in the presence of detergent micelles (46). A predominantly a-helical conformation was observed from a peptide containing the sequence of the corresponding region of the turkey b-adrenergic receptor (47). The collection of work on the helix 8 region of GPCRs suggested that this region is helical in the presence of a membrane mimetic but does not form a stable helix in the absence of the membrane mimetic (48). This could have important consequences in the activation of the receptor. In most cases, peptide fragments corresponding to loops (connecting two transmembrane helices) of membrane proteins formed loop structures in solution, with a helix-turn-helix motif when the peptide fragment was long enough. The short helices at the end of the fragment correspond to the ends of the transmembrane

Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure

291

helices. These studies suggest that loops connecting TM in transmembrane bundles may contribute to the stability of membrane proteins. For the most part, these solution structures are independent of the solution conditions. As in the case of the transmembrane fragments discussed above, the ends of the NMR-derived solution structures of these loops are frayed. 3.1.3. General Features of Membrane Protein Structure Accessible from Studies of Membrane Protein Fragments

The available structural literature on membrane protein fragments: 1. Reports structures that report with fidelity on the secondary structure of the intact protein 2. Shows structures of a transmembrane helix or a loop connecting two transmembrane helices 3. Is dominated by solution NMR studies, often in solvents like DMSO that dissolve hydrophobic peptides without perturbing their secondary structure 4. Hydropathy plots have long been used to predict the helical and loop regions of membrane proteins. While these plots can underestimate the length of the transmembrane helix, they have proved to be an important guide in choosing the peptide sequences to investigate. Another factor that is helpful in determining the appropriate length is the pattern of tryptophans locating in the interfacial region of the membrane, and thus occurring near the end of a transmembrane helix (49) 5. Structures obtained can be independent of solvent; some sequences have been examined in both DMSO and water and found to have the same structure (48, 50) These results suggest that it is feasible to explore the secondary structure of membrane proteins using this fragment approach, discovering, among other features, the stop and start points of helices, break points in helices, and the structures of loops connecting transmembrane helices. Under the most favorable circumstances, studies of the structure of membrane fragments can provide a nearly complete survey of the secondary structure of a membrane protein providing that the protein is built around a transmembrane helical bundle. (In the case of bacterial porins that are formed of b-barrels, studies of membrane fragments would not be expected to produce useful information since b-barrels require long-range interactions for stability.) Importantly, the available literature does not contain an example of a membrane protein fragment where the structure reflected incorrectly the secondary structure of the intact membrane protein. Of course some cases exist in which a fragment is

292

Yeagle and Albert

disordered in solution and thus provides no information on secondary structure. Consequently, studies on the structures of over 100 of these fragments of membrane proteins have not yet misled the investigator about the structure in the protein from which the fragments were derived. 3.2. Two-Helix Fragments

The work on fragments encompassing a single TM of a membrane protein does not provide information on the packing of one helix against another in a transmembrane helical bundle. A step in the direction of obtaining information on helix packing can be taken by studying a two-helix bundle connected by a loop. This helix-loop-helix structure can be expected to express some of the helix–helix packing constraints inherent in the intact transmembrane helical bundle. Several investigators have now explored the structure of such two-helix bundles. The oldest example of a structure of a two-helix bundle is not strictly speaking a member of this family of structures, but is very informative, nonetheless. The protein is human erythrocyte glycophorin and the two-helix bundle is not connected by a peptide loop because glycophorin has a single transmembrane domain and forms a stable dimer in the membrane. In this pioneering NMR structure, both the crossing angle of the two helices (−40°) and the interhelical packing can be determined, as well as the structure of the helix itself. Glycines are found to play an important role in the packing of the helices (21). Several such studies have focused on GPCRs. Extensive studies have been published on the structure of fragments of the single GPCR found in yeast, Ste2p (51–54). Several fragments containing two TM have been studied. Recent studies on the TM1-loop-TM2 fragment and the TM6-loop-TM7 fragment showed strong evidence for a helix-loop-helix structure, though a high resolution structure is yet to be reported (55). An NMR study of a fragment of the human cannabinoid CB2 receptor encompassing the first and second transmembrane domains of this protein reported an extensive assignment of the expressed fragment, but an atomic structure was not determined (56). More detailed structural information was obtained from other membrane proteins. The second and third transmembrane domains of the human glycine receptor were expressed as a TM-turn-TM fragment. The NMR structure of this fragment confirmed the anticipated helix-turn-helix motif and revealed a crossing angle of 44° between the two helices (57). Subunit c of the F1F0ATP synthase forms a helix-turn-helix as a monomer in organic solution. This NMR structure is consistent with known structural information from the intact transmembrane region. In this case, the helix-turn-helix motif represents the entire subunit (58). NMR studies have been reported recently on the transmembrane domain of the ErbB2 receptor that consists of two TM.

Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure

293

A helix-turn-helix motif was found, and the crossing angle of the two helices is −42° with glycines playing a role in helix–helix packing (59). A recent NMR study of the BNip3 transmembrane domain in lipid bicelles showed a helix-turn-helix motif and a helix-crossing angle of −45° (60). The transmembrane domain of the T cell receptor was studied as a zeta–zeta dimer in detergent micelles by NMR. A crossing angle of 23° was observed between the two helices (61). In summary, studies show that TM-loop-TM peptides from the transmembrane domains of membrane proteins constructed of helical bundles provide more information than can be obtained from studies of single TM. These polypeptides form the expected helix-turn-helix motif in micelles or bicelles. Furthermore, information about helix–helix packing that likely reflects the structure of the intact protein, including the helix crossing angles, is revealed. 3.3. Full Scan of Secondary Structure Using Membrane Protein Fragments

The studies described above encouraged a more systematic analysis of an entire membrane protein. The well-studied membrane protein, bacteriorhodopsin, was chosen. An overlapping set of peptide fragments that spanned the entire primary sequence of the protein was synthesized, and each fragment of bacteriorhodopsin was studied individually by solution NMR. The structures of peptides corresponding to transmembrane helical segments were a-helical as protein fragments. Peptides corresponding to loops formed loops as protein fragments. Comparison of these peptide structures to the crystal structure of bacteriorhodopsin revealed that the structures adopted by these fragments of bacteriorhodopsin reported faithfully on the secondary structure of the intact protein (3). These results suggest the hypothesis that scanning the entire primary structure of an integral membrane protein with overlapping fragments and determining the secondary structure of the fragments will provide information on the secondary structure of the whole protein. Several examples exist in which this approach has proven useful. EmrE is a multidrug resistance protein in E. coli. Models for the structure of this protein have included a helical bundle and a b-barrel. Four overlapping peptides were synthesized and their structures determined in SDS micelles, and the authors determined that the helical bundle was the better model for the transmembrane domain of this membrane protein (62). Human adenosine A2a receptor is a GPCR, and thus expected to contain a transmembrane bundle of seven helices. Seven peptides were synthesized, each one designed to encompass one putative transmembrane segment of the protein. Circular dichroism studies revealed that five of these peptides were extensively a-helical, consistent with the hypothesis of the structural model

294

Yeagle and Albert

for this protein. Two of the peptides showed considerably less helical content, suggesting a reduced stability of secondary structure in those transmembrane segments (9). The translocator protein TSPO binds cholesterol and has been suggested to contain five TM. Five peptides encompassing these five TM were synthesized, and their structures were determined by NMR. In each case, the putative TM exhibited largely helical conformation, thus supporting the model of a five TM bundle of helices for the structure of this protein (63). A prior report from the same lab revealed the interesting observation that a fragment containing the C-terminal region of the protein formed a helical structure capable of binding cholesterol (64). The a-factor receptor (Ste2p) from S. cerevisiae has been extensively studied using this fragment approach. Fragments corresponding to each of the transmembrane segments have been synthesized, and structures were determined (10). More recently, a much larger fragment containing the putative third cytoplasmic loop, TM7, and a substantial portion of the C terminus has been studied. Both the helical nature of TM7 was observed as was the putative eighth helix of this GPCR (65). Much has been learned about the structure of this GPCR from these studies, including a complete survey of the secondary structure of this membrane protein using fragments of the receptor (51–55). The data are consistent with a transmembrane helical bundle typical of other GPCRs. A set of 18 peptides spanning the entire sequence of rhodopsin was synthesized and the structure of each of the peptides was individually determined by NMR. Although these peptides were studied before any X-ray crystal structure was available, comparison of these structures to the crystal structures when they became available showed fidelity in reporting of the secondary structure of the protein by the structures of the fragments (11, 49, 50, 66–71). The secondary structure of the 12 TM protein, lactose permease, has been studied using a set of 12 peptides, each encompassing one of the TMs of this protein. As in the case of other membrane proteins described above, most of these 12 fragments of lactose permease showed substantial helical content. Interestingly some did not. Molecular dynamics calculations produced results consistent with the experimental NMR results; that is, some TMs were more stable than others. These data suggested that while the protein contained a transmembrane bundle of 12 helices, some of these helices are more stable than others within the tertiary structure of the protein. When mapped on the recent crystal structure of the protein, a remarkable asymmetry was observed within the protein with respect to stability that may have functional consequences (18, 72).

Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure

295

These studies have demonstrated that considerable information about membrane protein secondary structure (for membrane proteins with an a-helical transmembrane domain) can be obtained from structural analysis of peptide fragments of those membrane proteins. 3.4. ThreeDimensional Structures from Protein Fragments

It is possible to create a set of overlapping peptides that span the sequence of a membrane protein (built around a helical transmembrane domain) in which each peptide encompasses an individual transmembrane segment or a loop connecting two transmembrane segments, as introduced above. If each of the fragments expressed stable secondary structure, the opportunity would present to assemble the three-dimensional structure of the protein from the fragments if information existed to organize the elements of secondary structure in three dimensions. In 2001, bacteriorhodopsin was used to test this idea. An assumption was made that the secondary structure determined from studies of the fragments was the same as in the intact bacteriorhodopsin. This assumption was verified by comparing the structures of the fragments determined in solution by NMR to the available X-ray crystal structures of the intact bacteriorhodopsin. The methodology relied on experimental data from the intact protein that defined the arrangement of the transmembrane helices in three dimensions. These data consisted of point-to-point distances determined experimentally by a variety of techniques. A preliminary construct of the protein was then made by assembling the (overlapping) structures of the fragments into a continuous polypeptide spanning the whole sequence of bacteriorhodopsin. This construct expressed the secondary structure but not the tertiary structure of the protein. All of the available experimental distance constraints were then applied to the construct, including the NMR constraints from the studies of the fragments and the longdistance constraints from other experiments that defined the organization of the protein in three dimensions. This construct was then subjected to simulated annealing to allow an evolution of the structure to achieve a match between the structure and the experimental distance constraints. Finally, the whole structure was minimized with respect to energy under the experimental distance constraints. The result was a structure that agreed remarkably well with X-ray crystal structures of the protein (see Fig. 2), and could be characterized as a medium-resolution structure of bacteriorhodopsin. It should be emphasized that this successful result is entirely an experimental structure, driven by individual distance constraints from a variety of experiments (3). A more theoretical foundation to this approach was provided recently in which the process was defined by two steps: (1) “search the conformational space of membrane folds to find those matching a given set of distance constraints and (2) refine the helical bundles

296

Yeagle and Albert

Fig. 2. Three-dimensional structure of bacteriorhodopsin determined as described in the text (red) superimposed on one of the crystal structures (yellow) of bacteriorhodopsin (2BRD). (Figure prepared with VMD (79))

from step 1 using a Monte Carlo simulated annealing protocol designed for local minimization of a custom penalty function.” A key finding in this study is that the number of long distance constraints required to adequately organize in three dimensions a transmembrane helical bundle is surprisingly small (73). This result helped to explain the success of the work with bacteriorhodopsin described above, which also utilized a limited number of distance constraints defining the tertiary structure of the transmembrane bundle. This sparse constraints approach was further exploited in assembling a model of the transmembrane domain of lactose permease in which 99 interhelical distance constraints were utilized to define the helix packing of the transmembrane helical bundle of this protein (74). The process outlined above was utilized for the development of an experimentally based structure for the GPCR, rhodopsin. The entire sequence of rhodopsin was scanned with a set of overlapping peptides that encompassed either a putative TM or a loop connecting two TM. The secondary structure of each of these peptides was determined by NMR. (Later, when a crystal structure

Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure

297

became available, the assumption that the secondary structure in the peptides matched the structure in the intact protein was confirmed.) Because the peptides overlapped in sequence, it was possible to build a construct for the entire protein by linking together the fragment structures using the regions of overlap. A substantial number of long-range distance constraints from a variety of independent experiments on the intact protein were then written onto the construct, along with all the short-range constraints from the secondary structure determination in the fragments. This construct was then subjected to simulated annealing and finally to energy minimization. The resulting structure was eventually compared with the crystal structure of the protein and remarkable agreement was noted, best characterized as a medium resolution structure. This structure was determined entirely from experimental distance constraints (75). This approach was extended to the structure of the activated form of rhodopsin, called Meta II. This is the form that binds the G protein to activate the latter and is a transient species. Using a different set of long-range distance constraints obtained from experiments on the trapped activated form of the receptor, a new structure was determined. This structure was determined before any other atomic level structure for the activated form was published and provided a view into the changes that activation causes on the receptor structure (76). These studies reveal that experimentally based membrane protein structures are accessible by an alternative approach. The interactability of many membrane proteins to structure determination is the driving force for the studies described here. While high-resolution structures determined by X-ray crystallography and NMR remain the goal of membrane protein structural biology, alternative approaches have provided considerable information in the absence of X-ray crystal structures and NMR structures of the whole protein. These studies also reveal that the stability of elements of secondary structure are even more important to transmembrane proteins than they are already known to be for the stability of soluble proteins. Folding of newly synthesized transmembrane proteins likely proceeds through the formation of secondary structure, in particular, the transmembrane helical bundle, as a stable core of the protein as soon as the polypeptide chain encounters the hydrophobic interior of the membrane. 3.5. NMR Studies on Intact Membrane Proteins

It is appropriate to close this chapter with a brief consideration of the determination of the complete structure of a membrane protein with a helical transmembrane domain by NMR. For one membrane protein, diacylglycerol kinase, a complete assignment of the NMR data has been achieved (77). For several others, a partial assignment of NMR data from the intact protein has been

298

Yeagle and Albert

reported, including translocator protein (63) and signal peptidase (78). Clearly, determination of intact membrane protein structure for membrane proteins with a transmembrane helical bundle from NMR data alone is yet to come into its own, which emphasizes the importance of the methods reviewed in this chapter.

4. Notes Experimental difficulties arise in using fragments to study membrane protein secondary structure. The greatest difficulty can arise if nonperturbing conditions cannot be found in which the fragment exhibits stable secondary structure. Using strong helixstabilizing solvents such as trifluoroethanol to get around this problem is not very useful because it may be stabilizing a helix where none exists naturally. For example, in the study of the lactose permease by fragments, it was discovered that some fragments had little or no stability as separate fragments and that likely reflected on the internal stability of these transmembrane segments in the native protein (18). The structures themselves are often difficult to obtain because of chemical shift redundancies made worse by the lack of tertiary structure. References 1. http://blanco.biomol.uci.edu/Membrane_ Proteins_xtal.html 2. Katragadda M, Alderfer JL, Yeagle PL (2000) Solution structure of the loops of bacteriorhodopsin closely resemble the crystal structure. Biochim Biophys Acta 1466:1–6 3. Katragadda M, Alderfer JL, Yeagle PL (2001) Assembly of a polytopic membrane protein structure from the solution structures of overlapping peptide fragments of bacteriorhodopsin. Biophys J 81:1029–1036 4. Yeagle PL, Albert AD (2002) Use of nuclear magnetic resonance to study the threedimensional structure of rhodopsin. Methods Enzymol 343:223–231 5. Yeagle PL, Albert AD (2007) G-protein coupled receptor structure. Biochim Biophys Acta 1768:808–824 6. Konig B, Arendt A, McDowell JH, Kahlert M, Hargrave PA, Hofmann KP (1989) Three cytoplasmic loops of rhodopsin interact with transducin. Proc Natl Acad Sci U S A 86: 6878–6882 7. Yeagle PL, Alderfer JL, Albert AD (1995) Structure of the carboxyl terminal domain

8.

9.

10.

11.

12.

of bovine rhodopsin. Nat Struct Biol 2: 832–834 Berlose J, Convert O, Brunissen A, Chassaing G, Lavielle S (1994) Three dimensional structure of the highly conserved seventh transmembrane domain of G-protein-coupled receptors. FEBS Lett 225:827–843 Lazarova T, Brewin KA, Stoeber K, Robinson CR (2004) Characterization of peptides corresponding to the seven transmembrane domains of human adenosine A2a receptor. Biochemistry 43:12945–12954 Arshava B, Taran I, Xie H, Becker JM, Naider F (2002) High resolution NMR analysis of the seven transmembrane domains of a heptahelical receptor in organic–aqueous medium. Biopolymers 64:161–176 Katragadda M, Chopra A, Bennett M, Alderfer JL, Yeagle PL, Albert AD (2001) Structures of the transmembrane helices of the G-protein coupled receptor, rhodopsin. J Pept Res 58:79–89 Haris PI (1988) Synthetic peptide fragments as probes for structure determination of potassium ion-channel proteins. Biosci Rep 18:299–312

Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure 13. Gargaro AR, Bloomberg GB, Dempsey CE, Murray M, Tanner MJ (1994) The solution structures of the first and second transmembrane-spanning segments of band 3. Eur J Biochem 221:445–454 14. Aggeli A, Bannister ML, Bell M et al (1998) Conformation and ion-channeling activity of a 27-residue peptide modeled on the singletransmembrane segment of the IsK (minK) protein. Biochemistry 37:8121–8131 15. Duarte AM, de Jong ER, Wechselberger R, van Mierlo CP, Hemminga MA (2007) Segment TM7 from the cytoplasmic hemichannel from VO-H+-V-ATPase includes a flexible region that has a potential role in proton translocation. Biochim Biophys Acta 1768:2263–2270 16. Duarte AM, Wolfs CJ, van Nuland NA et al (2007) Structure and localization of an essential transmembrane segment of the proton translocation channel of yeast H+-V-ATPase. Biochim Biophys Acta 1768:218–227 17. Ding J, Rainey JK, Xu C, Sykes BD, Fliegel L (2006) Structural and functional characterization of transmembrane segment VII of the Na+/H+ exchanger isoform 1. J Biol Chem 281:29817–29829 18. Bennett M, D’Rozario R, Sansom M, Yeagle PL (2006) Asymmetric stability among the transmembrane helices of lactose permease. Biochemistry 45:8088–8095 19. Morein S, Trouard TP, Hauksson JB, Rilfors L, Arvidson G, Lindblom G (1996) Twodimensional 1H-NMR of transmembrane peptides from Escherichia coli phosphatidylglycerophosphate synthase in micelles. Eur J Biochem 241:489–497 20. Bondarenko V, Xu Y, Tang P (2007) Structure of the first transmembrane domain of the neuronal acetylcholine receptor beta2 subunit. Biophys J 92:1616–1622 21. MacKenzie KR, Prestegard JH, Engelman DM (1997) A transmembrane helix dimer: structure and implications. Science 276:131–133 22. Scheidt HA, Vogel A, Eckhoff A, Koenig BW, Huster D (2007) Solid-state NMR characterization of the putative membrane anchor of TWD1 from Arabidopsis thaliana. Eur Biophys J 36:393–404 23. Mousson F, Beswick V, Coic YM, HuynhDinh T, Sanson A, Neumann JM (2001) Investigating the conformational coupling between the transmembrane and cytoplasmic domains of a single-spanning membrane protein. A 1H-NMR study. FEBS Lett 505: 431–435

299

24. Teriete P, Franzin CM, Choi J, Marassi FM (2007) Structure of the Na, K-ATPase regulatory protein FXYD1 in micelles. Biochemistry 46:6774–6783 25. Dmitriev O, Jones PC, Jiang W, Fillingame RH (1999) Structure of the membrane domain of subunit b of the Escherichia coli F0F1 ATP synthase. J Biol Chem 274: 15598–15604 26. Pellegrini M, Mierke DF (1999) Molecular complex of cholecystokinin-8 and N-terminus of the cholecystokinin A receptor by NMR spectroscopy. Biochemistry 38: 14775–14783 27. Giragossian C, Mierke DF (2003) Determination of ligand–receptor interactions of cholecystokinin by nuclear magnetic resonance. Life Sci 73:705–713 28. Giragossian C, Schaschke N, Moroder L, Mierke DF (2004) Conformational and molecular modeling studies of beta-cyclodextrin– heptagastrin and the third extracellular loop of the cholecystokinin 2 receptor. Biochemistry 43:2724–2731 29. Giragossian C, Sugg EE, Szewczyk JR, Mierke DF (2003) Intermolecular interactions between peptidic and nonpeptidic agonists and the third extracellular loop of the cholecystokinin 1 receptor. J Med Chem 46: 3476–3482 30. Giragossian C, Mierke DF (2002) Intermolecular interactions between cholecystokinin-8 and the third extracellular loop of the cholecystokinin-2 receptor. Biochemistry 41: 4560–4566 31. Giragossian C, Mierke DF (2001) Intermolecular interactions between cholecystokinin-8 and the third extracellular loop of the cholecystokinin A receptor. Biochemistry 40: 3804–3809 32. Piserchio A, Bisello A, Rosenblatt M, Chorev M, Mierke DF (2000) Characterization of parathyroid hormone/receptor interactions: structure of the first extracellular loop. Biochemistry 39:8153–8160 33. Ulfers AL, Piserchio A, Mierke DF (2002) Extracellular domains of the neurokinin-1 receptor: structural characterization and interactions with substance P. Biopolymers 66:339–349 34. Demene H, Granier S, Muller D et al (2003) Active peptidic mimics of the second intracellular loop of the V(1A) vasopressin receptor are structurally related to the second intracellular rhodopsin loop: a combined 1H NMR and biochemical study. Biochemistry 42:8204–8213

300

Yeagle and Albert

35. Yeagle PL, Alderfer JL, Albert AD (1997) The first and second cytoplasmic loops of the G-protein receptor, rhodopsin, independently form b-turns. Biochemistry 36:3864–3869 36. Mierke DF, Royo M, Pelligrini M, Sun H, Chorev M (1996) Third cytoplasmic loop of the PTH/PTHrP receptor. J Am Chem Soc 118:8998–9004 37. Pellegrini M, Royo M, Chorev M, Mierke DF (1996) Conformational characterization of a peptide mimetic of the third cytoplasmic loop of the G-protein coupled parathyroid hormone/parathyroid hormone related protein receptor. Biopolymers 40:653–666 38. Ruan KH, So SP, Wu J, Li D, Huang A, Kung J (2001) Solution structure of the second extracellular loop of human thromboxane A2 receptor. Biochemistry 40:275–280 39. Wu J, So SP, Ruan KH (2003) Solution structure of the third extracellular loop of human thromboxane A2 receptor. Arch Biochem Biophys 414:287–293 40. Franzoni L, Nicastro G, Pertinhez TA et al (1999) Structure of two fragments of the third cytoplasmic loop of the rat angiotensin II AT1A receptor. Implications with respect to receptor activation and G-protein selection and coupling. J Biol Chem 274:227–235 41. Nicastro G, Peri F, Franzoni L, de Chiara C, Sartor G, Spisni A (2003) Conformational features of a synthetic model of the first extracellular loop of the angiotensin II AT1A receptor. J Pept Sci 9:229–243 42. Jung H, Windhaber R, Palm D, Schnackerz KD (1995) NMR and circular dichroism studies of synthetic peptides derived from the third intracellular loop of the beta-adrenoceptor. FEBS Lett 358:133–136 43. Katragadda M, Maciejewski MW, Yeagle PL (2004) Structural studies of the putative helix 8 in the human beta(2) adrenergic receptor: an NMR study. Biochim Biophys Acta 1663:74–81 44. Chung DA, Zuiderweg ER, Fowler CB, Soyer OS, Mosberg HI, Neubig RR (2002) NMR structure of the second intracellular loop of the alpha 2A adrenergic receptor: evidence for a novel cytoplasmic helix. Biochemistry 41:3596–3604 45. Choi G, Guo J, Makriyannis A (2005) The conformation of the cytoplasmic helix 8 of the CB1 cannabinoid receptor using NMR and circular dichroism. Biochim Biophys Acta 1668:1–9 46. Choi G, Landin J, Xie XQ (2002) The cytoplasmic helix of cannabinoid receptor CB2, a conformational study by circular dichroism

47.

48.

49. 50.

51.

52.

53.

54.

55.

56.

57.

and (1)H NMR spectroscopy in aqueous and membrane-like environments. J Pept Res 60:169–177 Jung H, Windhaber R, Palm D, Schnackerz KD (1996) Conformation of a beta-adrenoceptor-derived signal transducing peptide as inferred by circular dichroism and 1H NMR spectroscopy. Biochemistry 35:6399–6405 Yeagle PL, Alderfer JL, Albert AD (1996) Structure determination of the fourth cytoplasmic loop and carboxyl terminal domain of bovine rhodopsin. Mol Vis 2. http://www. molvis.org/molvis/v2/p12/ Wallace BA, Janes RW (1999) Tryptophans in membrane proteins. Adv Exp Med Biology 467:789–799 Yeagle PL, Danis C, Choi G, Alderfer JL, Albert AD (2000) Three dimensional structure of the seventh transmembrane helical domain of the G-protein receptor, rhodopsin. Mol Vis. http://www.molvis.org/molvis/v6/ a17/ Arshava B, Liu SF, Jiang H, Breslav M, Becker JM, Naider F (1998) Structure of segments of a G protein-coupled receptor: CD and NMR analysis of the Saccharomyces cerevisiae tridecapeptide pheromone receptor. Biopolymers 46:343–357 Xie HB, Ding FX, Schreiber D et al (2000) Synthesis and biophysical analysis of transmembrane domains of a Saccharomyces cerevisiae G protein-coupled receptor. Biochemistry 39:15462–15474 Estephan R, Englander J, Arshava B, Samples KL, Becker JM, Naider F (2005) Biosynthesis and NMR analysis of a 73-residue domain of a Saccharomyces cerevisiae G protein-coupled receptor. Biochemistry 44:11795–11810 Naider F, Khare S, Arshava B, Severino B, Russo J, Becker JM (2005) Synthetic peptides as probes for conformational preferences of domains of membrane receptors. Biopolymers 80:199–213 Cohen LS, Arshava B, Estephan R et al (2008) Expression and biophysical analysis of two double-transmembrane domain-containing fragments from a yeast G protein-coupled receptor. Biopolymers 90:117–130 Zheng H, Zhao J, Sheng W, Xie XQ (2006) A transmembrane helix-bundle from G-protein coupled receptor CB2: biosynthesis, purification, and NMR characterization. Biopolymers 83:46–61 Ma D, Liu Z, Li L, Tang P, Xu Y (2005) Structure and dynamics of the second and third transmembrane domains of human glycine receptor. Biochemistry 44:8790–8800

Membrane Protein Fragments Reveal Both Secondary and Tertiary Structure 58. Girvin ME, Rastogi VK, Abildgaard F, Markley JL, Fillingame RH (1998) Solution structure of the transmembrane H+transporting subunit c of the F1F0 ATP synthase. Biochemistry 37:8817–8824 59. Bocharov EV, Mineev KS, Volynsky PE et al (2008) Spatial structure of the dimeric transmembrane domain of the growth factor receptor ErbB2 presumably corresponding to the receptor active state. J Biol Chem 283: 6950–6956 60. Bocharov EV, Pustovalova YE, Pavlov KV et al (2007) Unique dimeric structure of BNip3 transmembrane domain suggests membrane permeabilization as a cell death trigger. J Biol Chem 282:16256–16266 61. Call ME, Schnell JR, Xu C, Lutz RA, Chou JJ, Wucherpfennig KW (2006) The structure of the zetazeta transmembrane dimer reveals features essential for its assembly with the T cell receptor. Cell 127:355–368 62. Venkatraman J, Nagana Gowda GA, Balaram P (2002) Structural analysis of synthetic peptide fragments from EmrE, a multidrug resistance protein, in a membrane-mimetic environment. Biochemistry 41:6631–6639 63. Murail S, Robert JC, Coic YM et al (2008) Secondary and tertiary structures of the transmembrane domains of the translocator protein TSPO determined by NMR. Stabilization of the TSPO tertiary fold upon ligand binding. Biochim Biophys Acta 1778:1375–1381 64. Jamin N, Neumann J-M, Ostuni MA, Thi Kim Ngoc Vu, Yao ZX, Murail S, Robert J-C, Giatzakis C, Papadopoulos V, Lacapere J-J (2005) Characterization of the cholesterol recognition amino acid consensus sequence of the peripheral-type benzodiazepine receptor. Mol Endocrinol 19(3):588–594 65. Neumoin A, Arshava B, Becker J, Zerbe O, Naider F (2007) NMR studies in dodecylphosphocholine of a fragment containing the seventh transmembrane helix of a G-proteincoupled receptor from Saccharomyces cerevisiae. Biophys J 93:467–482 66. Yeagle PL, Alderfer JL, Albert AD (1995) Structure of the third cytoplasmic loop of bovine rhodopsin. Biochemistry 34: 14621–14625 67. Dorey M, Hargrave PA, McDowell JH et al (1999) Effects of phosphorylation on the

68.

69. 70.

71.

72.

73.

74.

75.

76.

77.

78.

79.

301

structure of the G-protein receptor, rhodopsin. Biochim Biophys Acta 1416:217–224 Yeagle PL, Alderfer JL, Albert AD (1997) Three dimensional structure of the cytoplasmic face of the G protein receptor rhodopsin. Biochemistry 36:9649–9654 Albert AD, Yeagle PL (2000) NMR analysis of the three dimensional structure of rhodopsin domains. Methods Enzymol 315:107–115 Chopra A, Yeagle PL, Alderfer JA, Albert A (2000) Solution structure of the sixth transmembrane helix of the G-protein coupled receptor, rhodopsin. Biochim Biophys Acta 1463:1–5 Yeagle PL, Salloum A, Chopra A et al (2000) Structures of the intradiskal loops and amino terminus of the G-protein receptor, rhodopsin. J Pept Res 55:455–465 Bennett M, Yeagle JA, Maciejewski M, Ocampo J, Yeagle PL (2004) Stability of loops in the structure of lactose permease. Biochemistry 43:12829–12837 Sale K, Faulon JL, Gray GA, Schoeniger JS, Young MM (2004) Optimal bundling of transmembrane helices using sparse distance constraints. Protein Sci 13:2613–2627 Sorgen PL, Hu Y, Guan L, Kaback HR, Girvin ME (2002) An approach to membrane protein structure without crystals. Proc Natl Acad Sci U S A 99:14037–14040 Yeagle PL, Choi G, Albert AD (2001) Studies on the structure of the G-protein coupled receptor rhodopsin including the putative G-protein binding site in unactivated and activated forms. Biochemistry 40:11932–11937 Choi G, Landin J, Galan JF, Birge RR, Albert AD, Yeagle PL (2002) Structural studies of metarhodopsin II, the activated form of the G-protein coupled receptor, rhodopsin. Biochemistry 41:7318–7324 Oxenoid K, Kim HJ, Jacob J, Sonnichsen FD, Sanders CR (2004) NMR assignments for a helical 40 kDa membrane protein. J Am Chem Soc 126:5048–5049 Musial-Siwek M, Kendall DA, Yeagle PL (2008) Solution NMR of signal peptidase, a membrane protein. Biochim Biophys Acta 1778:937–944 Humphrey W, Dalke A, Schulten K (1996) VMD – visual molecular dynamics. J Mol Graph 14:33–38

as

Chapter 16 What Can We Learn from a Small Regulatory Membrane Protein? Gianluigi Veglia, Kim N. Ha, Lei Shi, Raffaello Verardi, and Nathaniel J. Traaseth Abstract This chapter reviews the molecular biology, biochemical, and NMR methods that we used to study the structural dynamics, membrane topology, and interaction of phospholamban (PLN), a small regulatory membrane protein involved in the regulation of the sarcoplasmic reticulum Ca-ATPase (SERCA). In particular, we show the progression of our research from the initial hypotheses toward understanding the molecular mechanisms of SERCA’s regulation, including the effects of PLN oligomerization and posttranslational phosphorylation. Finally, we show how the knowledge of the molecular mechanism of the structural dynamics and topology of free and bound proteins can lead to the rational design of PLN analogs for possible use in gene therapy. Key words: Phospholamban, SERCA, Ca-ATPase, Structural dynamics, Topology, Membrane proteins, NMR, PISEMA

1. Introduction Phospholamban (PLN) is a small integral membrane protein (52 residues) localized in the sarcoplasmic reticulum (SR) membrane that regulates the flow of calcium ions into the SR lumen of cardiac muscle (1). Specifically, PLN binds to SR calcium ATPase (SERCA), the enzyme responsible for calcium re-uptake into the SR lumen. In its unphosphorylated form, PLN inhibits SERCA by reducing its affinity for calcium. Upon b-adrenergic stimulation, PLN is phosphorylated by protein kinase A at S16 with concomitant relief of SERCA inhibition and restoration of SERCA activity. These cyclic events account for the relaxation (diastolic phase) of the heart muscle, which if disrupted, may evolve into heart Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_16, © Springer Science+Business Media, LLC 2010

303

304

Veglia et al.

failure (2). Due to their central role in cardiac muscle contractility, PLN and SERCA have become targets for new, alternative therapies for heart failure (3). What can be learned from the study of the structural and dynamic transitions of PLN free and bound to SERCA?, More importantly, can these studies contribute in the development of new clinical approaches to heart failure? Much progress has been made toward understanding the molecular mechanism of SERCA within the enzymatic cycle (4, 5). Chapters 9 and 15 of this volume give exquisite portraits of the latest findings, outlining the successes of x-ray crystallography in reconstructing the structural transitions of SERCA during enzyme turnover. Undeniably, the first crystal structure of SERCA (E1-Ca2, calcium-bound form) solved by Toyoshima and co-workers paved the way for the atomic level understanding of calcium translocation in the SR (6). To date, there are several crystal structures of SERCA under different experimental conditions that are allowing enzymologists to model the major conformational states of the enzyme during turnover (7). However, very few studies have been dedicated to the determination of the complexes between SERCA and its endogenous inhibitor PLN. This is probably due to the inherent dynamics of the SERCA/PLN complex that might complicate the formation of large, diffracting crystals. This hypothesis is reinforced by several fluorescence and EPR studies published by the Thomas and Squier Laboratories underscoring the dynamic nature of the SERCA/PLN complex (8–14). So far, the labs of Stokes and Young (15, 16) have produced the best images of this complex although the quality of the crystals does not define the interaction surface between the two proteins at the atomic level. NMR studies, on the other hand, have been focused on the structural dynamics of PLN in the unbound form. This is mainly due to the difficulties in producing isotopically labeled SERCA and analyzing its complex with PLN by solution-state NMR (Mw ~ 116 kDa). Our laboratory, in collaboration with the Thomas laboratory, has used a combination of solution and solid-state NMR methods to map the structural dynamics, topology, and interactions between PLN and SERCA. Based on our chemical shift perturbation data from solution NMR, we have constructed an allosteric model for SERCA regulation by PLN (11, 13, 17–19). This regulatory model has been now adopted by several groups in the fields of structural biology and muscle physiology to explain the effects of naturally occurring and engineered mutations in PLN. More importantly, this model has allowed us to rationally manipulate the structural dynamics of PLN and design new mutants that are possible candidates for gene therapy in heart failure treatment. In the following sections, we briefly review the molecular biology, biochemistry, and NMR approaches that we have used to study SERCA’s regulation by looking at PLN free (monomeric

What Can We Learn from a Small Regulatory Membrane Protein?

305

and pentameric) and bound to SERCA. We show the progression of our research from our initial hypotheses on the mechanism of SERCA regulation by PLN to our initial attempts to control PLN structural dynamics to tune SERCA’s function.

2. Materials 2.1. Expression and Purification of Monomeric and Pentameric PLN

1. pMal c2E maltose-binding protein vector (New England Biolabs, Ipswich, MA).

2.1.1. Plasmid Construction and Mutagenesis of AFA–PLN

3. Pfu Turbo DNA polymerase (Stratagene, La Jolla, CA).

2. Oligonucleotide primers (Biomedical Genomics Center, University of MN). 4. dNTP mix (Promega, Madison, WI). 5. T4 DNA ligase (Promega, Madison, WI). 6. 1× Luria–Bertani (LB) media. 7. Qiaprep Spin Miniprep Kit (Qiagen, Valencia, CA).

2.1.2. Expression of Unlabeled and 15N Uniformly Labeled PLN

1. BL21(DE3) strain E. coli cells (Novagen, Gibbstown, NJ). 2. 1× Luria–Bertani broth media. 3. M9 minimal media containing 12.8 g/L Na2HPO4·7H2O, 3 g/L KH2PO4, 0.5 g/L NaCl, and 1 g/L 15N-labeled NH4Cl, 2 g/L glucose, and 0.1% w/v ampicillin. 4. 1 M IPTG (Gold Biotechnology, St. Louis, MO). 5. Mineral cocktail: 6 mg/L CaCl2, 6 mg/L FeSO4, 1 mg/L MnCl2, 0.8 mg/L CoCl2, 0.7 mg/L ZnSO4, 0.3 mg/L CuCl2, 0.02 mg/L H3BO3, 0.25 mg/L (NH4)6MO7O24, and 5 mg/L EDTA. 6. Vitamin cocktail: 1 mg/L calcium pentothenate, 1 mg/L biotin, 1 mg/L folic acid, 1 mg/L niacinamide, and 1 mg/L pyridoxal phosphate.

2.1.3. Purification of PLN

1. Lysis buffer: 20 mM PBS pH 7.3, 120 mM NaCl, 8 mM EDTA, 0.1 mM DTT, 52.6 mM glycerol, 0.5 mg/mL pepstatin A, 0.5 mg/mL leupeptin, 2.5 mM lysozyme, and 4.5 mM Tween-20. 2. Amylose affinity chromatography resin (New England Biolabs, Ipswich, MA). 3. Wash buffer: 20 mM PBS pH 7.3, 120 mM NaCl. 4. Maltose elution buffer: 46 mM maltose in 20 mM PBS pH 7.3, 120 mM NaCl, 1 mM EDTA, and 3.1 mM NaN3. 5. Recombinant tobacco etch virus serine protease (Invitrogen, Carlsbad, CA or expressed and purified in our own lab) (see Note 2).

306

Veglia et al.

6. Dialysis membranes with 1,000-kDa molecular weight cutoff. 7. HPLC purification: H2O with 0.1% trifluoroacetic acid and isopropanol. 2.1.4. Quantification

1. Bio-Rad gradient 10–15% polyacrylamide gel (Bio-Rad, Hercules, CA). 2. Protein standard.

2.1.5. Phosphorylation of PLN at S16 by cAMP-Dependent Protein Kinase A

1. Phosphorylation buffer: 20 mM MOPS (pH 7.3), 1% b-Octylglucoside, 1 mM ATP, and 1 mM MgCl2.

2.2. SERCA Preparation

1. White muscle from the back and hind legs of rabbit.

2. cAMP-dependent protein kinase A catalytic subunit (Sigma) or recombinant kinase as described in ref. 20.

2. Extraction buffer: 20 mM MOPS and 0.1 M KCl, pH 7. 3. Sucrose buffer: 20 mM MOPS, 0.3 M sucrose, 1 mM NaN3. 4. Reactive red affinity resin (Sigma). 5. SERCA buffer: 5 mM DPC, 1 mM CaCl2, 1 mM MgCl2, 20 mM MOPS (pH 7.0), 20% glycerol, 0.25 mM DTT, and 4 mM ADP.

2.3. Functional Assays 2.3.1. SERCA PLN Co-reconstitution

1. 1,2-Dioleoyl-sn-glycero-3-phosphocholine (DOPC) and 1,2-dioleoyl-sn-glycero-phosphoethanolamine (DOPE) (Avanti Polar Lipids, Alabaster, AL). 2. Biobeads (Bio-Rad, Hercules, CA). 3. b-Octylglucoside (Sigma).

2.3.2. ATPase Activity Measurements

1. Coupled enzyme assay mix containing 0.5 mM phosphoenol pyruvate, 2.5 mM ATP, 0.2 mM NADH, 2 IU of pyruvate kinase, 2 IU of lactate dehydrogenase, and 1–2 mg calcium ionophore (A23187).

2.3.3. 31P NMR Activity Assays

1. Coupled enzyme assay mix containing a final concentration of 10 mM dodecylphosphocholine, 1 mM MgCl2, 80 mM ATP, 20 mM MOPS (pH 7.0), and 0.2% C12E8.

2.4. NMR Spectroscopy

1. Dodecylphosphocholine (DPC) (Avertec or Cambridge Isotope Laboratories).

2.4.1. Solution-State Samples for Titration

2. Solution-state NMR sample buffer containing 20 mM Na2PO4, 120 mM NaCl, and 0.1% NaN3. 3. Mini-dialysis chambers (Millipore, Billerica, MA). 4. Guanidinium–HCl (Sigma). 5. Dialysis buffer: 20 mM PBS and 50 mM b-mercaptoethanol.

What Can We Learn from a Small Regulatory Membrane Protein? 2.4.2. Oriented Samples and ssNMR Spectroscopy

307

1. 10% (w/v) sodium dodecyl sulfate (SDS). 2. DOPC and DOPE lipids. 3. Ultra filtration concentrator (Amicon, Millipore Corporation, Bedford, MA). 4. Glass plates, 8 × 12 mm (Marienfeld GmbH & Co, LaudaKonigshofen, Germany). 5. Trifluoroethanol. 6. Extruder (Northern Lipids Inc. Burnaby, BC, Canada).

3. Methods 3.1. Expression and Purification of Monomeric and Pentameric PLN

For structural studies of membrane proteins by NMR, it is necessary to find a robust expression system that allows for the purification of milligram quantities of pure protein. For this purpose, we optimized a fusion protein construct for both monomeric (cysteine null PLN with the following mutations C36A, C41F, and C46A, which we refer to as AFA–PLN) and pentameric species (21).

3.1.1. Construction of PLN and Mutant PLN Expression Plasmids

To maximize the expression, all of the codons were optimized for E. coli. The nucleic acid sequence was divided into four overlapping primers and three polymerase chain reactions (PCRs) were performed, with incorporation of a TEV protease cleavage site between MBP and PLN, an EcoRI restriction site at the 5¢ end, and a HindIII site at the 3¢ end (a map of the plasmid is available from ref. 21). Each PCR was performed with a total volume of 50 mL containing 0.06 mM of each oligonucleotide, 80 mg of parental template, 200 mM dNTP mix, and 5.0 U Pfu Turbo DNA polymerase. Each PCR was carried out in an Eppendorf Master Cycler system with standard temperature cycling suggested for Pfu Turbo DNA polymerase. AFA–PLN PCR product was amplified and then ligated into EcoRI and HindIII sites on the pMal c2E vector using T4 DNA ligase (Rationale for fusion partner is described in Note 1). The ligated product was used to transform XL1-Blue competent cells with transformants selected by growth on LB/ampicillin plates. For single-site mutations of PLN, the forward and reverse primers were designed with optimized codons for E. coli three residues before and after the target residue to be mutated. PCR was performed with Pfu DNA polymerase using the protocol described above. PCR products were then digested with Dpn-I in order to remove nonmutated methylated dsDNA and then were transformed into XL-Blue competent cells. Constructs were amplified and extracted using the Qiaquick Spin Miniprep kit.

3.1.2. PLN Expression

E. coli BL21(DE3) cells were transformed with MBP–PLN fusion constructs. One hundred milliliters of LB growth media with

308

Veglia et al.

50 mg/mL ampicillin was inoculated with a single colony and grown ~16 h at 25°C shaking at 250 rpm to reach an OD600 of ~1.0. The growth was then diluted (1:50) into M9 media containing mineral and vitamin cocktails and grown at 37°C to an OD600 of ~1.0. Protein expression was induced with 1 mM IPTG for 4.5 h at 37°C. All cells were harvested by centrifugation at 6,370 × g for 20 min at 4°C. Pellets were stored at −20°C. 3.1.3. PLN Purification

PLN pellets (~10–15 g) were resuspended in ~400 mL of lysis buffer and blended using a Sorval Omni-Mixer 17105. Cell lysates were then sonicated on ice for 15 min using a Branson Sonifier 450 sonicator at an output setting of 4 and a duty cycle of 45%. After sonication, lysates were spun down at 45,700 × g for 20 min at 4°C. The supernatant containing MBP/PLN fusion protein was applied to an amylose resin affinity chromatography column at 4°C. The column was washed with 12 column volumes of wash buffer until the OD280 was 80% 2H) provide large gains in both sensitivity and resolution (3). Using this approach, complete or nearly complete backbone resonance assignments of integral membrane proteins (up to 280 residues) were obtained, leading to the determination of several b-barrel global folds over the last decade (4–8). To circumvent the limited amount of long range 1H–1H NOEs in perdeuterated proteins, the development of efficient tools for obtaining additional conformational restraints such as restraints obtained from residual dipolar couplings (9, 10), paramagnetic relaxation enhancements experiments (11), or selective methyl protonated otherwise perdeuterated samples (12, 13), were reported for the structure refinement of membrane proteins (14–18). These approaches are particularly useful in the case of helical membrane proteins, which exhibit only few long-range NOEs, which are crucial for the global fold determination (15, 19, 20). This chapter highlights the techniques and experimental protocols used for the investigation of the molecular structure of the 216-residue transmembrane domain of the Outer Membrane Protein A from Klebsiella pneumoniae (KpOmpA) in detergent micelles by liquid-state NMR spectroscopy (21). Among the complete set of state-of-the-art NMR methods for backbone assignment of large protein, we performed a 3D 13C-TOCSY-(15N,1H)-TROSY (22) for aliphatic carbon side chain assignment, a 4D 15N,15N separated NOESY (23) for unambiguous assignment of HN–HN NOEs, and selective methyl protonation approaches for the measurement of distance constraints involving methyl groups. Because structure determination of membrane protein placed strong constraints on the expression system efficiency (with the requirement of tens of milligram amount of pure MPs labelled with three low abundant isotopes) and on protein stability (in the presence of highly concentrated detergent molecules), the overexpression of KpOmpA in E. coli inclusion bodies with different 2 H,13C,15N isotope labelling patterns, its in vitro refolding in nondenaturing detergent micelles, and an efficient and fast detergent screening method for NMR sample preparation are developed.

2. Materials 2.1. Cells Culture and Isotope Labelling

1. Freshly transformed BL21(DE3) cells harboring the plasmid pET21c-KpOmpA. The cDNA for the transmembrane domain of KpOmpA (Met1-Glu207) with a C-terminal

Solution-State NMR Spectroscopy for Investigating 3D Structure of Membrane Proteins

323

hexahistidine tag was subcloned by using standard PCR methods into the NheI/HindIII sites of pET21c expression vector (Novagen, France) from the complete KpOmpA sequence into pVALP40 vector (24). 2. Filter-sterilized solutions of 1 M isopropyl b-d-thiogalactopyranoside (IPTG) and of the appropriate antibiotic stored in single use aliquots at −20°C. Working concentration for media are typically 50 mM ampicillin and 1 mM IPTG. 3. Sterile Luria–Bertani (LB) (Euromedex, France) growth medium. 4. Sterile M9 minimal medium: For 1 L of growth medium, 12.8 g of Na2HPO4, 3.0 g of KH2PO4, 0.5 g of NaCl, 2 g of NH4Cl, 2 g of D-glucose, 2 mM of MgSO4, 0.1 mM of CaCl2, vitamins (0.01 mg of cobalamine, 0.1 mg of pyridoxal and folic acid, 0.5 mg of riboflavine, 1 mg of biotin, pantothenic acid and niacinamide, 25 mg of thiamine), traces (50 mg of EDTA, 60 mg of CaCl2, 60 mg of FeSO4, 4.4 mg of CuSO4, 12 mg of MnCl2, 0.2 mg of H3BO3, 7 mg of ZnSO4, 2 mg of ascorbic acid). Filter-sterilize. 5. Isotope labelled compounds (Cambridge Isotope Laboratory, USA): [15N]–NH4Cl (15N 99%); [13C6]-D-glucose (13C 99%); [13C6, 2H7]-D-glucose (13C 99%, 2H 97–98%); D2O (2H 99.98%); [3,3−2H2, 13C]-a-ketobutyrate (13C 98%, 3–3 2H2 98%); [3−2H, 13C]-a-ketoisovalerate (13C 97–98%, 2 H 98%). 6. All cultures were performed in 2-L flasks containing 500-ml culture medium. Incubation at 37°C at 220 rpm. 2.2. In Vitro Refolding and Protein Purification

1. Buffers: (A) 50 mM Tris/HCl, 200 mM NaCl, 5 mM EDTA, 0.05% Tween 20 (w/v) (Anatrace, USA) at pH = 8.5, (B) 50 mM Tris/HCl, 200 mM NaCl, 5 mM EDTA, 0,05% Tween 20 (w/v) (Anatrace, USA), 2 M Urea at pH = 8.5, (C), denaturing buffer) 25 mM Tris/HCl, 5 mM EDTA, 6 M guanidine hydrochloride (Gdn/HCl) at pH = 8.5, (D, refolding buffer) 25 mM Tris/HCl, 150 mM NaCl, 1% Anzergent 3-14 (w/v) (Anatrace, USA) identical to zwitergent 3-14(ZW 3-14) at pH = 8.5, (E, purification buffer) 25 mM Tris/HCl, 150 mM NaCl, 0.1% Anzergent 3-14(w/ v) (Anatrace, USA) at pH = 8.5. 2. Ni-NDA resin (Amersham, USA), conditioned according to the manufacturer’s procedure. 3. Dialysis membrane (MWCO 12,000–14,000 Da, SpectraPor, USA).

2.3. Detergent Screening and NMR Sample Preparation

1. Pure fraction of isotope labelled KpOmpA in 25 mM Tris/ HCl, 150 mM NaCl, 0.1% ZW 3-14 (w/v), pH = 8.5, at a protein final concentration of about ~15 mg/mL.

324

Renault et al.

2. Solutions for protein precipitation: Cold ethanol (−20°C), solution of (4 M) NaCl sterilized by filtration through 0.2 mm filter. 3. NMR buffer: 20 mM phosphate, NaCl 100 mM at pH 6.5. 4. Detergents: Dihexanoylphosphatidylcholine (DHPC, powder, Avanti Polar Lipids, USA), deuterated dihexanoylphosphatidylcholine (DHPC-d22, powder, CIL, USA), dodecylphosphocholine (DPC, powder, Anatrace, USA), Anzergent 3-14 (ZW 3-14, Anatrace, USA), tetraethylene glycol monooctyl ether (C8E4, Anatrace, USA). 2.4. NMR Spectroscopy, Data Analysis, and Structure Calculation

1. Four radiofrequency channel Bruker spectrometers using 1H (13C, 15N) triple resonance cryoprobe, and equipped with z pulsed field gradient coils. 2. Software: NMRpipe for NMR data processing (http://spin. niddk.nih.gov/NMRPipe); NMRView for spectra analysis, NMR assignment, and NOEs analysis (http://www.onemoonscientific.com/nmrview); CNS for structure calculation (http://cns-online.org/v1.21); PROCHECK for structure validation (http://www.biochem.ucl.ac.uk/~roman/ procheck/procheck.html); PYMOL for structure visualization (http://pymol.org); and other software with similar functions can be used.

3. Methods 3.1. Overexpression and Isotope Labelling Strategies of KpOmpA

The transmembrane domain of KpOmpA was conveniently overexpressed as a recombinant protein of 216 residues (with a C-terminus hexahistidine tag) in inclusion bodies after IPTG induction 1 mM at 37°C for 4 h in E. coli BL21(DE3) bacteria. Typical yields of KpOmpA are about 220 mg/L of rich medium after refolding and purification steps. Incorporation of labelled isotopes into a recombinant protein expressed in E. coli requires minimal growth media containing 15N, 13C, and 2H labeled metabolic sources. Although high deuteration levels are tolerated by the bacteria, a significant reduction of the growth rate and biomass yields is usually observed. Higher protein yields are usually obtained if bacteria are must be progressively adapted to the deuterium content for further improvement of protein production as described below. Protein yields were reported for each isotopic labelling patterns of KpOmpA (see Table 1)

3.1.1. Uniform ( 2H,13C,15N) Labelling

1. Inoculate 3 ml culture of LB-rich medium with a freshly transformed colony of BL21(DE3) cells and incubate the culture in a shaking incubator (220 rpm) at 37°C during 3 h (OD600 is about 2.0)

Solution-State NMR Spectroscopy for Investigating 3D Structure of Membrane Proteins

325

Table 1 Protein yields for different isotope labelling patterns Labelling

Yields (mg/l)a

No

>200

[u−2H,13C,15N]

80

[u−2H,13C,15N/I(d1)LV−13CH3]

40

[u−15N/10%13C]

80

Quantity of KpOmpA per liter of growth medium after refolding and purification steps

a

2. Spin down the cells (1,000 × g at room temperature) and resuspend the pellet in 50 ml of labeled M9/H2O minimal medium (containing 2 g/L of [15N]-NH4Cl and 2 g/L of [13C6]-D-glucose) to achieve an initial OD600 of 0.08. The culture was grown at 37°C until reaching an OD600 of 0.8. 3. Spin down the cells (1,000 × g at room temperature) and resuspend a small amount of cells in 150 mL of labeled M9/ D2O minimal medium (containing 99% D2O, 2 g/L of [15N]-NH4Cl and 2 g/L of [13C6, 2H7]-D-glucose) to achieve a starting OD600 of 0.08. Incubate the culture at 37°C until reaching an OD600 of 2.0. 4. Dilute the culture to a final volume of 1 L with “fresh” 13C, 15 N-M9/D2O medium and incubate at 37°C until the culture reaches an OD600 of 0.5. 5. Induce protein overexpression with IPTG (1 mM) for 4 to 30 h at 37°C (see Note 1). 6. Collect the cells by centrifugation (5,000 × g, at 4°C for 15 min) 3.1.2. Selective protonation of Isoleucine, Leucine, and Valine (ILV) Methyl Groups in a Perdeuterated Background

1. Use uniform (2H, 13C, 15N) labelling protocol described above until the step 4. 2. Approximately 1 h before induction (OD600 is about 0.3), add the isotope labeled precursors in the following proportions: 50 mg/L of [3,3−2H2, 13C]-a-ketobutyrate and 100 mg/L of [3−2H, 13C]-a-ketoisovalerate as final concentrations. Maintain the culture at 37°C for about 1 h until the culture reaches an OD600 of 0.5. 3. Induce protein overexpression with IPTG (1 mM) and collect cells after 3 to 4 hours in order to prevent generation of undesired isotopomers.

3.1.3. Fractional 13C (10%) labelling

1. Inocculate cells a 50 ml culture of LB-rich medium with freshly transformed cells and incubate the culture at 37 (220 rpm) incubated in a shaking incubator (220 rpm) at 37°C until reaching an OD600 of about 2.0.

326

Renault et al.

2. Spin down the cells (1,000 × g at room temperature) and resuspend the appropriate amount of cells in 1 L of labeled M9 minimal medium (containing 2 g/L of [15N]-NH4Cl, 0.2 g/L of [13C6]-D-glucose and 1.8 g/L of unlabelled D-glucose as main nitrogen and carbon sources, respectively) to achieve an initial OD600 of 0.08. Incubate the culture at 37°C until reaching an OD600 of 0.8. 3. Induce the protein overexpression with IPTG 1 mM as final concentration and continue the growth for 3 to 4 h at 37°C. 3.2. In Vitro Refolding from E. coli Inclusion Bodies

Overexpression in D2O leads to the production of fully deuterated proteins with 2H incorporation also at exchangeable backbone and side chain sites, resulting in severe losses of NMR signal. To ensure a complete 2H/1H exchange for amides in very stable structural motif, deuterated proteins must be unfolded and subsequently refolded in protonated medium. 1. Spin down the cells by centrifugation at 5,000 × g for 10 min at 4°C, Resuspend the pellet in the buffer A (10 mL per g of material), and disrupt the cells by sonication on ice, until obtaining a clear and homogeneous suspension (15 to 30 min, at 50 W using 50% duty cycle). 2. Harvest E. coli inclusion bodies by centrifugation at 10,000 × g at 4°C for 30 min) and resuspend the pellet in buffer B (10 mL/g of material). To ensure a complete removal of membrane contaminants from the inclusion bodie fraction, repeat this procedure three times and collect cleaned inclusions bodies by centrifugation (10,000 × g, at 4°C, for 30 min) 3. Solubilize resuspend the pellet the inclusion bodies in a denaturing buffer (buffer C, 4 mL/g of material) and incubate under vigorous stirring at room temperature until obtaining a complete complete solubilization of the protein (12 h). 4. Refold the protein by fast dilution into 15 volumes of “Gdn/ HCl-free” refolding buffer (buffer D) and incubate under gentle stirring at ambient temperature for 16 h. Remove insoluble material by centrifugation (10,000 × g at 4°C for 30 min) and retain the supernatant. 5. Dialyse the supernatant (16 h at 4°C) against purification buffer (buffer E). 6. Load the sample onto a Ni-NDA resin (Amersham) equilibrated in the same buffer E. Incubate under gentle stirring at ambient temperature for 2 h. Proteins were eluted by adding stepwise (0 mM, 20 mM, 400 mM, and 1 M) imidazole in the purification buffer (buffer E) (6 volumes of each eluant buffer at 1.0 ml/min).

Solution-State NMR Spectroscopy for Investigating 3D Structure of Membrane Proteins

327

7. Pool the pure protein fractions and dialyse against purification buffer (buffer E) for 16 h at 4°C to remove residual imidazole. Protein purity (typically > 95%) was checked on an SDS/ PAGE acrylamide gels and Commassie blue staining. Protein concentrations of the purified fraction were determined by OD280 measurement, using a molar extinction coefficient of 49,500/M−1·cm−1. 3.3. Detergent Screening and NMR Sample Preparation

A prerequisite for solution NMR structure determination is to find conditions allowing millimolar concentration solubility, conformational homogeneity, and high stability of the membrane protein (for days to weeks). So far, detergent micelles are the most efficient membrane mimetic for solution NMR structure determination purposes, capable of maintaining a native-fold over extended periods and retaining an overall size of MP/detergent complexes compatible with solution NMR techniques. However, NMR suitability of detergent molecules and membrane protein behaviour in different detergent remains poorly predictable. Screening the nature of detergents for proper and stable fold of the membrane protein is an essential step for structure determination by NMR spectroscopy. We introduced in the KpOmpA refolding protocol a precipitation step in cold ethanol to easily sceen several NMR buffers and detergents. Sample properties in each detergent were judged on the quality of 2D [15N, 1H]-TROSY spectrum of a 15N/2H-labeled protein 1. Precipitate 12 mg of isotope labelled protein by adding successively 0.15 volume of 4 M NaCl and 20 volumes of cold ethanol (−20°C) into an aliquot of the purified fraction KpOmpA/ZW 3-14 complexes. This procedure assumes the use of pure fractions of KpOmpA/ZW 3-14 complexes in 25 mM Tris/HCl, 150 mM NaCl at pH = 8.5 (purification buffer), at a final protein concentration of about 15 mg/ml. Vortex and leave the mixture at−20°C for 4–16 h. 2. Harvest the protein precipitate by centrifugation (15,000 × g at 4°C for 10 min). Resuspend the pellet in 2 ml of deionized water and homogenize the suspension by vigorous stirring at 4°C. Spin down the protein precipitate (15,000 × g at 4°C for 10 min) and remove the supernatant. To ensure a complete removal of residual salts and detergent molecules, repeat this procedure three times. 3. Resuspend the pellet in the desired NMR buffer to achieve a final protein concentration of about 1 mM. Incubate under gentle stirring at room temperature until complete dissolution of protein aggregates (for 4–16 h). If any, remove insoluble protein aggregates by centrifugation (10,000 × g) and concentrate the soluble fraction using concentrator (Vivaspin, 10 kD cutoff) until obtaining a final protein concentration of 1 mM.

328

Renault et al.

4. Record 2D [15N, 1H]-TROSY HSQC spectra to assess the quality of each sample, in terms of spectral dispersion, information completeness (i.e. check that the number of observed peaks is close to the expected amide resonances) and stability over several weeks. Four detergents were tested: ZW 3-14, C8E4, DPC, and DHPC. Among them, ZW 3-14, DPC, and DHPC solubilized properly the ethanol precipitate (Fig. 1). Detailed analysis of the [15N,1H]-TROSY spectra revealed a better spectral dispersion for the DHPC sample (KpOmpA 1 mM, DHPC 300 mM, phosphate buffer 20 mM, pH 6.5, NaCl 100 mM). 3.4. Solution-State NMR Experiments for Resonance Assignments and Structure Determination

The structure elucidation of KpOmpA has involved three distinct 2H, 13C, 15N isotope labelling patterns, nine NMR samples, five NMR spectrometers at different frequencies (500, 600, 700, 800, and 900 MHz) and 11 multidimensional heteronuclear experiments. For the sake of clarity, all the experimental parameters are reported in Table 2 and further commented below (see Notes 2–5). All NMR data were processed with NMRPipe, using linear prediction to improve the resolution in all indirectly detected dimensions, and analysed with NMRView.

Fig. 1. Detergent screening for KpOmpA NMR studies. Contour plots of 2D [15N,1H]-TROSY spectra of u-[13C,15N/80%2H]-KpOmpA solubilized in different types of detergent: 300 mM ZW 3-14, 300 mM DPC, and 300 mM DHPC. The protein concentration was ~1 mM dissolved in a 20-mM phosphate buffer (pH 6.5) containing 100 mM NaCl, 10% D2O. The NMR spectra were recorded at 313 K at a 1H Larmor frequency of 700 MHz using identical acquisition and process parameters.

Solution-State NMR Spectroscopy for Investigating 3D Structure of Membrane Proteins 3.4.1. Sequential Assignment

329

1. Sequential assignment using typo error Ca and typo error C b connection pathways. The 3D [15N,1H]-TROSY HNCACB, which detects both intraresidual and sequential correlation peaks with 13Ca,b typo error and 13Ca,b(i−1) at 15N(i)/1HN(i) position of residue i, was recorded at a proton Larmor frequency of 900 MHz to maximize N–H TROSY effect and cross-peaks dispersion. The 3D [15N,1H]-TROSY HN(CO)CACB, which detects only the sequential correlation peaks 13Ca(i−1) and 13Cb(i−1) at 15N(i)/1HN(i) position of residue i, was recorded at lower field (see Note 3). This experiment complements the [15N,1H]-TROSY HNCACB and allows the distinction between 13C(i−1) and 13C(i) peaks in [15N,1H]TROSY HNCACB. The combination of intraresidual and sequential connectivities between (15N, 1HN) and (13Ca, 13 b C ) pairs provides the basis for the main chain assignment procedure (Fig. 2a). 2. Sequential assignment using 13C¢ connection pathways. Similarly, the 3D [15N,1H]-TROSY HNCO and 3D [15N,1H]TROSY HN(CA)CO provide the assignment of C’ of both intra and interresidue (13C(i−1)) with (15N(i)/1HN(i)) pair (Fig. 2c) and alleviate most of the residual ambiguities of the 13 a 13 b C / C sequential assignment procedure, notably for cluster involving glycine residues with degenerated 13Ca resonances. 3. Sequential assignment using side chain 13Caliph connection pathway. The 3D (HNCA)CC-TOCSY-(CA)-[15N,1H]-TROSY experiment provides 13C aliphatic side assignment of both sequential (13Caliph(i−1)) and intraresidue (13Caliph(i)) with (15N(i)/1HN(i)) pair. The magnetization transfer along the side chain is operated via an out and back FLOPSY-8 spin lock scheme (13C–13C isotropic mixing period of 16.4 ms). This experiment provided additional sequential assignment of KpOmpA by direct and unambiguous identification of each residue (Fig. 2b), in particular when 13Ca/13Cb were degenerated.

3.4.2. Measurement of Backbone HN–HN NOEs Using a 4D 15N,15N Separated NOESY-TROSY Experiment

For perdeuterated b-barrel membrane proteins, NOEs between amide protons are essential distance restraints for the global fold determination. Perdeuteration of the protein strongly reduces the occurrence of spin diffusion, allowing detection of amide proton NOEs between protons separated by more than 5 Å. Instead of the 3D 15N separated NOESY-TROSY experiments, we preferred to perform the 15N,15N-separated HMQC-NOESY-TROSY (mixing time 250 ms), which provided again an unambiguous assignment of HN–HN NOEs by the introduction of an additional 15 N dimension without excessive loss of sensitivity as clearly shown in Fig. 3a, b. This experiment yielded 173 backbone HN–HN NOEs, including 81 cross-stranded long-range NOEs

10.0



36.5

6.3

85.5



200

184

1,536



sw(F3) (ppm)

sw(F4) (ppm)

t1 max (ms)

t2 max (ms)

t3 max (ms)

t4 max (ms)

t1 (data points)

t2 (data points)

t3 (data points)

t4 (data points)

F1 carrier (ppm) 117.5

117.5



1,024

104

200



85.2

5.4

54.8



10.0

64.0

117.5



1,024

64

200



85.2

17.3

54.8



10.0

12.3

117.4



1,024

156

74



61.1

27.7

16.8



12.0

16.0

31.0

117.3



1,024

208

168



64.0

8.1

34.6



10.0

63.7

29.9

117.3



1,024

72

160



64.0

10.1

33.0



10.0

17.8

30.0

800

117.2



640

80

128



64.0

36.0

42.3



10.0

2.2

30.0

500

18.0





1,024

512





56.1

58.1





13.0

25.0

700 (*)

8.8

512

64

32

56

54.2

13.0

9.3

10.5

11.8

29.5

20.0

3.2

800

A4

64.0

30.0

800

D

sw(F2) (ppm)

30.0

700 (*)

B2

30.0

600

B1

sw(F1) (ppm)

600

A3

900

A2

H frequency (MHz)

1

A1

A1

NMR sample

A1

HNCACB HN(CO) HN(CA) HNCO (HNCA)CC(CA) (H)CC(CA) H(CCCO) CT-HSQC 4D 15N,15NCACB CO (*) NH-TOCSY NH-TOCSY NH-TOCSY (*) separated HMQC-NOESYTROSY

Parameter

Experiment

117.5



1,536

104

172



71.3

14.3

32.0



12.0

16.0

30.0

900

B3

3D 15N,13Cseparated HMQC-NOESYTROSY

18.0



1,024

104

96



60.7

14.4

13.2



9.4

16.0

16.0

900

C

3D 13C,13Cseparated HMQCNOESY-HMQC

Table 2 Heteronuclear experiments, acquisition parameters and NMR samples used for chemical shifts assignments and structure calculation of kpOmpA in DHPC micelles

330 Renault et al.

4.59



8

F3 carrier (ppm) 4.57

F4 carrier (ppm) –

Scans number

3/21

Total measuring 6/20 time (days/h)

4/21

1.4

16



4.59

174.7

2/14

1.5

16



4.59

174.5

6/14

1.5

8



4.58

42.8

5/13

2.0

16



4.58

18.3

5/16

1.2

32



4.51

1.0

18 h

1.5

80





4.55

9/16

1.14

4

4.58

116.9

116.1

6/18

1.5

4



4.57

18.0

5/15

1.1

32



4.57

18.0

NMR samples and corresponding replica are noted as follows: (A1–An) uniformly [2H,13C,15N]-KpOmpA/DHPC; (B1–Bn) [u-2H, 13C, 15N/Leu, Val, Ile(d1)-13CH3]-KpOmpA/ DHPC; (C) [u-2H, 13C, 15N/Leu, Val, Ile(d1)- 13CH3]-KpOmpA/DHPC-d22; (D) u-[15N, 10% 13C]-KpOmpA/DHPC-d22. In each sample, the concentration of the protein was ~1 mM, dissolved in 20 mM phosphate buffer (pH 6.5) containing 100 mM NaCl, 300 mM DHPC, 10% D2O. All NMR experiments were carried out at 313 K on four radiofrequency channel BRUKER spectrometers equipped with triple resonance cryoprobes (unless indicated *) and z-gradient coils

1.4

Recycle delay (s) 1.4

8

43.5

F2 carrier (ppm) 43.5

Solution-State NMR Spectroscopy for Investigating 3D Structure of Membrane Proteins 331

332

Renault et al.

Fig. 2. Sequential assignment of KpOmpA using backbone and aliphatic side-chain carbon connection pathways. Selected 13 C(w1)/1HN(w3) strips of the (a) 3D [15N,1H]-TROSY HNCACB, (b) 3D (HNCA)CC-TOCSY-(CA)-[15N,1H]-TROSY, and (c) 3D 15 1 [ N, H]-TROSY HN(CA)CO showing sequential assignment of Tyr111 to Arg113 protein segment, through the use of backbone and side-chain carbon chemical shifts (highlighted by horizontal and vertical dashed lines). Strips were extracted at the 15N(w2) chemical shifts indicated at the top of the each panel.

(i−j ³ 5). The identification of HN–HN cross-stranded NOEs for each pair of neighbouring b-strands established clearly the antiparallel eight-stranded b-barrel arrangement (Fig. 3c). 3.4.3. Selective Methyl Protonation Approach to Structure Determination

1. The first step consists in the sequence-specific assignment of carbon and proton resonances of isoleucine (d1), leucine (d1,d2), and valine (g1,g2) methyl groups. 13C and 1H chemical shifts were obtained from two separated 3D experiments recorded on a u-[2H,13C,15N]/Ile(d1),Leu,Val-[13CH3]labelled KpOmpA/DHPC sample. The 3D (H)C(CC)TOCSY-(CA)-[15N,1H]-TROSY provides methyl 13C chemical shifts from both intraresidual and sequential methyl carbon to amide correlations (Fig. 4a). Proton chemical shifts can be independently obtained from sequential methyl proton to backbone amide correlations identified in the 3D H(CCCO) NH-TOCSY spectrum (Fig. 4b). Combined analysis of the two spectra resulted in a nearly complete sequence-specific assignment of proton and carbon resonances of isoleucine (d1), leucine (d1,d2), and valine (g1,g2) methyl groups of kpOmpA.

Solution-State NMR Spectroscopy for Investigating 3D Structure of Membrane Proteins

333

Fig. 3. Measurement of unambiguous backbone amide NOEs. (a) 2D 15N(w2)/1HN(w3) plane extracted from the 3D 15N-separated NOESY-TROSY at w1(1H) = 9.01 ppm corresponding to the proton amide resonance of T159 (coloured in grey). (b) Corresponding 2D 15N(w3)/1HN(w4) plane extracted from the 4D 15N,15N-separated HMQC-NOESY-TROSY spectrum at w1(1H) = 9.01 ppm and w2(15N) = 114.4 ppm at the proton and nitrogen amide frequencies of T159 (coloured in grey) providing unambiguous assignment of sequential and long-range HN–HN NOE correlations. The most intense correlation that identifies cross-stranded HN–HN NOE (underlined residues) establishes an anti-parallel orientation of sequential b-strands. (c) Topological representation of KpOmpA. Crossstranded HN–HN NOEs are represented with transversal dashed lines. Loops (L) and turns (T) are numbered from N to C-terminus. Residues located in b-strands are shown in squares, those that are partially assigned are coloured in light grey and residues that remain unassigned are coloured in dark grey.

Only methyl resonances of residues whose intraresidual and sequential 1H/15N backbone amide resonances are broadened beyond detection and remain unassigned, due to molecular motions at ms time scale.

334

Renault et al.

Fig. 4. Sequence specific and stereospecific assignment of 13C and 1H methyl resonances: the example of Leucine 66. (a) Section of 13C(w1)/1HN(w3) slices from 3D (H)C(CC)-TOCSY-(CA)-[15N,1H]-TROSY spectrum corresponding to intraresidual correlation of L66. (b) Section of 1H(w1)/1HN(w3) slices from 3D H(CCCO)NH-TOCSY spectrum corresponding to sequential correlation of L66. (c) Expanded region of 2D [13C,1H]-CT-HSQC corresponding to proR and proS methyl groups of L66 (positive and negative contour plots are coloured in black and grey, respectively).

2. Stereospecific assignment of 1H and 13C methyl resonances of valine (g1,g2) and leucine (d1,d2) prochiral methyl groups is a prerequisite for straightforward analysis of methyl-derived NOE restraints. They can be obtained using a u-[15N, 10% 13 C]-labelled KpOmpA/DHPCd22 sample (see Note 6). Under these circumstances, the pro-R and pro-S methyl groups became reliably distinguishable on a 2D [13C,1H]- constant time HSQC spectrum by the presence or the absence of 1 JCC coupling, respectively. With the use of a constant time period of 26.6 ms (1/Jcc), 13C/1H correlations derived from 13 C methyl directly bound to a 13C partner (pro-R) give opposite phase relative to cross-peaks from isolated 13C spins (pro-S) (Fig. 4c). Stereospecific assignment of methyl resonances is directly obtained by examination of the resulting spectrum. Only the isopropyl methyl groups showing a complete degeneracy of both carbon and proton frequencies remained indistinguishable. 3. Methyl-to-backbone-amide NOE (Me–HN NOE) and methylto-methyl NOE (Me–Me NOE) measurements are usually achieved using standard 3D 15N-separated and 13C-separated [1H,1H]-NOESY, respectively. However, the strong degeneracy of methyl proton frequencies seriously hampered the

Solution-State NMR Spectroscopy for Investigating 3D Structure of Membrane Proteins

335

assignment of such NOEs (Fig. 5a). For an unambiguous identification of Me–HN and Me–Me NOE correlations, information about 13C chemical shifts should be preferred since the latter offers higher chemical shift dispersion as compared with proton. The 3D 15N, 13C-separated HSQC-NOESY-TROSY experiment provided a non-ambiguous assignment of each Me–HN NOE correlations (243 NOEs) as shown in Fig. 5b. 4. Similarly, the Me–Me NOE measurement was achieved using a 3D 13C, 13C-separated HMQC-NOESY-HMQC instead of standard 13C-separated [1H,1H]-NOESY. In addition to Me–HN NOEs, this experiment provided a large number of NOE contacts between proximal methyl group, and hence abundant distance restraints homogeneously spread over the hydrophobic core of KpOmpA (Fig. 6). 3.5. Structure Calculation

Structural calculations for KpOmpA were carried out using HN–HN NOEs, Me–HN NOEs, Me–Me NOEs, dihedral angle restraints, and hydrogen bond distances restraints. 1. Assigned NOEs’ peaks were integrated individually and converted to distances using NOE analysis module of NMRView. For KpOmpA structure calculation, the distances were separated

Fig. 5. Profit of the 3D 15N, 13C-separated HMQC-NOESY-TROSY in the assignment of methyl NOEs. Strips extracted form 3D 15N-separated HMQC-NOESY-TROSY (a) and 3D 15 N, 13C-separated HMQC-NOESY-TROSY (b ) at proton and nitrogen amide resonances of Y102.

336

Renault et al.

Fig. 6. 3D structure calculation of KpOmpA. (Left ) Cartography of Me–HN and Me-Me NOEs-derived distance restraints involving methyl groups of Ile, Leu, and Val residues (denoted by dashed lines) onto the NMR structure of KpOmpA represented by the superimposition of the 20 lowest-energy CNS conformers in the b-barrel region (Right ). Ribbon representation of the average structure of KpOmpA derived from the 20 lowestenergy CNS conformers with a stick representation of the two aromatic girdles located at membrane interfaces. Structures were generated with PyMOL software.

into three classes, as having strong, medium, or weak NOEs, with upper limits of 3.4 Å, 4.7 Å, 8.0 Å for HN–HN NOEs, and 4.5 Å, 7.0 Å, 10.0 Å for both Me–HN and Me–Me NOEs. 2. Hydrogen bond distance restraints derived from the identification of slowly exchanging amide protons and from characteristics HN–HN NOEs. Experimentally, validated hydrogen bonds were implemented by means of two distance restraints with upper limits of 2.4 Å for O–HN and 3.2 Å for O–N. 3. Backbone f and j dihedral angle restraints were obtained by using the TALOS program, using chemical shift assignments corrected from shifts due to 2H isotopic and TROSY effect in NMRview. 4. Solution NMR structures of KpOmpA were calculated using CNS (v.1.1) simulated annealing protocols with 2,000 steps for the high temperature annealing stage, 6,000 steps for the first cooling stage using torsion angle dynamics, and 5,000 steps for the second cooling stage using Cartesian dynamics. An ensemble of 200 structures was generated, and the 20 lowest overall violation-energy conformers were selected to represent the 3D structure of KpOmpA in DHPC micelles.

Solution-State NMR Spectroscopy for Investigating 3D Structure of Membrane Proteins

337

5. The structural quality of the ensemble was analysed using PROCHECK, which gave 70.5% of residues in most favoured regions, 26.7% in additional allowed regions, 2.6% in generously allowed regions, and 0.2% in disallowed regions.

4. Notes 1. Since KpOmpA is expressed as intracellular inclusion bodies, the induction time (i.e. 30 h) was optimized in order to enhance the protein expression yields. 2. For standard out-and-back type triple resonance experiments used for backbone and side chain resonance assignments, maximum sensitivity is achieved with 100% deuteration. In these experiments, magnetization starts and finishes on the amide proton, which relaxes more slowly and the attenuation due to 13Ca transverse relaxation is more favourable when the protein is perdeuterated. 3. Because of large chemical shift anisotropy of carbonyl carbon and resulting fast transverse relaxation of 13C¢ nuclei at high fields, the 3D [15N,1H]-TROSY HN(CO)CACB, HNCO, and HN(CA)CO experiments should be performed at a 1H frequency equal or less than 700 MHz. All other NMR experiments should be preformed at the highest magnetic field available in order to expect the better sensitivity and resolution. 4. Standard 3D multidimensional NMR experiments, especially those in TROSY detection scheme, are usually written with a 15 N evolution in constant time mode in order to minimize the relaxation contribution. The evolution in a constant time manner limits the number of time domain data points in the 15 N dimension and hence the final spectral resolution (i.e. FID resolution about 30 Hz/point), which is a crucial parameter for the structure determination of macromolecule with high molecular weight. Therefore, each 3D multidimensional experiment described above was performed with a 15N evolution in semi-constant time manner in order to record 200 time domain data points in 15N dimension (FID resolution about 14 Hz/point). 5. Since the sample contains 300 mM of DHPC, echo/antiecho gradient field pulses for the 15N coherence selection should be adjusted carefully to remove efficiently the 1H signals of DHPC, particularly the intense signal arising from the choline methyl groups. 6. Samples for the 3D 13C,13C-separated HMQC-NOESY-HMQC and 2D [13C,1H]- constant time HSQC were prepared with

338

Renault et al.

deuterated DHPC (i.e. DHPC-d22) to prevent undesirable 1H signal from the methyl groups of the detergent acyl chains.

Acknowledgements These studies and the IPBS NMR equipment were financed by the French research ministry, CNRS, Université Paul Sabatier, the Région Midi-Pyrénées and European structural funds. Extended access to the EU-NMR facility in Frankfurt 6th Framework Program of the EC (contract number RII3-026145) has a crucial role in the development of this project and is duly acknowledged. We thank, in particular, Dr C. Richter, and Professor H. Schwalbe. The CIPF (P. Fabre) is acknowledged for providing the pVALP40 plasmid of KpOmpA. References 1. Tamm LK, Liang BY (2006) NMR of membrane proteins in solution. Prog Nucl Magn Reson Spectrosc 48:201–210 2. Pervushin K, Riek R, Wider G, Wuthrich K (1997) Attenuated T2 relaxation by mutual cancellation of dipole-dipole coupling and chemical shift anisotropy indicates an avenue to NMR structures of very large biological macromolecules in solution. Proc Natl Acad Sci U S A 94:12366–12371 3. Pervushin K (2000) Impact of transverse relaxation optimized soectroscopy (TROCSY) on NMR as a technique in structural biology. Q Rev Biophys 33:161–197 4. Arora A, Abildgaard F, Bushweller JH, Tamm LK (2001) Structure of outer membrane protein A transmembrane domain by NMR spectroscopy. Nat Struct Biol 8:334–338 5. Fernandez C, Adeishvili K, Wuthrich K (2001) Transverse relaxation-optimized NMR spectroscopy with the outer membrane protein OmpX in dihexanoyl phosphatidylcholine micelles. Proc Natl Acad Sci U S A 98: 2358–2363 6. Hwang PM, Choy WY, Lo EI, Chen L, Forman-Kay JD, Raetz CRH, Prive GG, Bishop RE, Kay LE (2002) Solution structure of the outer membrane enzyme PagP by NMR. Proc Natl Acad Sci U S A 99: 13560–13565 7. Johansson MU, Alioth S, Hu KF, Walser R, Koebnik R, Pervushin K (2007) A minimal

8.

9.

10. 11.

12.

13.

14.

transmembrane beta-barrel platform protein studied by nuclear magnetic resonance. Biochemistry 46:1128–1140 Liang BY, Tamm LK (2007) Structure of outer membrane protein G by solution NMR spectroscopy. Proc Natl Acad Sci U S A 104:16140–16145 Tjandra N, Bax A (1997) Direct measurement of distances and angles in biomolecules by NMR in a dilute liquid crystalline medium. Science 278:1111–1114 Prestegard JH, Bougault CM, Kishore AI (2004) Chem Rev 104:3519–3540 Battiste JL, Wagner G (2000) Utilization of siet-directed spin labelling and high resolution heteronuclear NMR for global fold determination of large proteins with limited nuclear overhauser effect data. Biochemistry 39:5355–5365 Rosen MK, Gardner KH, Willis RC, Parris WE, Pawson T, Kay LE (1996) Selective methyl group protonation of perdeuterated proteins. J Mol Biol 263:627–636 Goto NK, Gardner KH, Mueller GA, Willis RC, Kay LE (1999) A robust and cost-effective method for the production of Val, Leu, Ile (delta 1) methyl-protonated 15N-, 13C-, 2H-labeled proteins. J Biomol NMR 13: 369–374 Fernandez C, Hilty C, Wider G, Guntert P, Wuthrich K (2004) NMR structure of the integral membrane protein OmpX. J Mol Biol 336:1211–1221

Solution-State NMR Spectroscopy for Investigating 3D Structure of Membrane Proteins 15. Roosild TP, Greenwald J, Vega M, Castronovo S, Riek R, Choe S (2005) NMR structure of mistic, a membrane-integrating protein for membrane protein expression. Science 307: 1317–1321 16. Liang BY, Bushweller JH, Tamm LK (2006) Site-directed parallel spin-labelling and paramagnetic relaxation enhancement in structure determination of membrane proteins by solution NMR spectroscopy. J Am Chem Soc 128:4389–4397 17. Cierpicki T, Liang BY, Tamm LK, Bushweller JH (2006) Increasing the accuracy of solution NMR structures of membrane proteins by application of residual dipolar couplings. High-resolution structure of outer membrane protein A. J Am Chem Soc 128:6947–6951 18. Hiller S, Garces RG, Malia TJ, Orekhov VY, Colombini M, Wagner G (2008) Solution structure of the integral human membrane VDAC-1 in detergent micelles. Science 321:1206–1210 19. Teriete P, Franzin CM, Choi J, Marassi FM (2007) Structure of the Na, K-ATPase regulatory protein FXYD1 in micelles. Biochemistry 46:6774–6783

339

20. Shih SC, Stoica I, Goto NK (2008) Investigation of the utility of selective methyl protonation for determination of membrane protein structures. J Biomol NMR 42:49–58 21. Renault M, Saurel O, Czaplicki J, Demange P, Gervais V, Löhr F, Réat V, Piotto M, Milon A (2009) Solution state NMR structure and dynamics of KpOmpA, a 210 residue transmembrane domain possessing a high potential for immunology applications. J Mol Biol 385:117–130 22. Lohr F, Ruterjans H (2002) Correlation of backbone amide and side-chain (13)C resonances in perdeuterated proteins. J Magn Reson 156:10–18 23. Xia YL, Sze KH, Zhu G (2000) Transverse relaxation optimized 3D and 4D 15N/15N separated NOESY experiments of 15N labeled proteins. J Biomol NMR 18:261–268 24. Haeuw JF, Rauly I, Zanna L, Libon C, Andreoni C, Nguyen TN, Baussant T, Bonnefoy JY, Beck A (1998) The recombinant Klebsiella pneumoniae outer membrane protein OmpA has carrier properties for conjugated antigenic peptides. Eur J Biochem 255:446–454

as

Chapter 18 NMR Spectroscopy of Lipid Bilayers Axelle Grélard, Cécile Loudet, Anna Diller, and Erick J. Dufourc Abstract Knowledge of lipid structure and dynamics in a membranous environment is of first importance for deciphering cellular function. Sterols and sphingolipids are key molecules in maintaining membrane integrity and are the building blocks of membrane domains, such as “rafts”. Phosphatidyl inositols are crucial in signalling pathways as they are recognition sites at the membrane surface. Other lipids such as Phosphatidylethanolamines, Cardiolipins, or diacylglycerols are essential in fusion processes. It is fundamental to have techniques that can resolve the structure and dynamics of various classes of lipids in a membrane environment. Solid state NMR with its high resolution and wide line facets is a very powerful tool for such determinations. Here it is shown that multinuclear solid state NMR provides information on the nature of the membrane phase (bicelle, lamellar, hexagonal, micelle, cubic, etc.), its dynamics (fluid or gel, or liquid-ordered with cholesterol), and the molecular structure of embedded lipids when using the magic angle sample spinning (MAS) apparatus. Typical examples of relatively simple experiments are shown both with high resolution MAS and wide line NMR of lipids. Relaxation time measurements are also described to measure lipid motional processes from the picosecond to the second timescale. Key words: Solid state 1H–2H–14N–31P-NMR, Bicelles, Liposomes, Micelles, Gel and fluid phases, Hexagonal phases, Liquid-ordered state, Cholesterol, Sphingolipid, Diacylphosphatidylcholine, Diacylphophatidylethanolamine, Deuterium-labelled lipids, Magic angle sample spinning, Wide line spectra, Relaxation times, Order parameters

1. Introduction Lipids are one of the building blocks in cell biology and are present in many cellular locations. They may be found as small aggregates allowing their transport in water media or may be embedded in cellular membranes where their dynamics are much more reduced. In this case, they constitute the cement of the bilayer membrane and also play many signalling roles. For instance, cholesterol and sphingolipids are key molecules in maintaining membrane integrity and are the building blocks of membrane liquid-ordered phases, Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_18, © Springer Science+Business Media, LLC 2010

341

342

Grélard et al.

also named “rafts” (1, 2). Lipids such as phosphatidylinositol (PtdIns) are essential in signalling pathways as they are recognition sites at the membrane surface (3). In order to decipher their structure in their native membrane environment, Nuclear Magnetic Resonance is one of the most powerful techniques because it can act in media of reduced dynamics such as membranes or aggregates. NMR relies on the presence of active nuclei in atoms that constitute the lipid molecules. A lipid naturally contains protons (1H), carbon (only the 13C is active, but in low natural abundance), phosphorus (31P), oxygen (only the 17O is magnetically active but has a very low natural abundance) and nitrogen (14N). Some lipids may be chemically labelled with the hydrogen isotope, deuterium (2H), with fluorine (19F), carbon-13 (13C), or with other nuclei of interest for structural biology. Lipids are not water soluble; as a consequence, their structure and dynamics must be evaluated in a hydrated model membrane state (bicelles, liposomes, bilayers vesicles of various sizes, etc.) (4) or in real membranes (envelopes of living cells) (5).

2. Materials 2.1. Samples, Preparation Tools and Measuring Cells

1. Ninety percent ultra pure water (miliQ-system) + 10% deuterated water (2H2O), v:v (Eurisotop, Saint-Aubin, France). 2. Deuterium-depleted water, 1H2O (Isotec-Sigma-Aldrich, France). 3. Lipids: 1,2-Dipalmitoyl-2H62-sn-glycero-3-phosphocholine (16:0/16:0 DPPtdCho-2H62), 1,2-dimyristoyl-sn-glycero-3phosphocholine (14:0/14:0 DMPtdCho), 1,2-dicaproyl-snglycero-3-phosphocholine (6:0/6:0 DCPtdCho), Liver Phosphatidylethanolamine (PtdEth), Liver Phosphatidylcholine (PtdCho), brain Sphingomyelin (SM) and cholesterol are from Avanti Polar Lipids (USA). 1-Tetradecanoyl-2-(4-(4biphenyl)butanoyl)-sn-glycero-3-phosphocholine (14:0/BB TBBPtdCho) and 1,2-dimyristoyl-sn-glycero-3-phosphocholine 2H72 (14:0/14:0 DMPtdCho-2H72) were synthesized in the laboratory (6, 7). 4. Table top centrifuge Eppendorf (VWR, France). 5. Two millilitre micro centrifuge tubes (VWR, France). 6. MS2 Minishaker (VWR, France). 7. Water bath, thermostated at 50°C. 8. Liquid nitrogen, 50 ml in a 500 ml Dewar. 9. Zirconia (ZrO2) 4 mm diameter magic angle spinning (MAS) rotor of HR-MAS type (50 ml) and of CP-MAS type (100 ml) (Cortec, France).

NMR Spectroscopy of Lipid Bilayers 2.2. NMR Instrumentation for 31P-NMR

343

1. NMR Spectrometer equipped for solid state: Bruker Avance 300 operating at 121.5 MHz (Bruker, Wissembourg, France). 2. NMR Probe: 1H/X, 4 mm CP-MAS probe tuned at 121.5 MHz (31P channel) and 300.13 MHz (1H channel); equipped with temperature regulation (thermocouple and heater). 3. PC station with TopSpin software v2.0 (Bruker, Wissembourg, France). 4. Bruker BCU (air cooling) and VT (variable temperature) units to control sample temperature using dry airflow.

2.3. NMR Instrumentation for 2H-NMR

1. NMR Spectrometer equipped for solid state: Bruker Avance 500 operating at 76.8 MHz (Bruker, Wissembourg, France). 2. NMR Probe: Low Gamma 1H/X, 4 mm CP-MAS probe tuned at 76.8 MHz; equipped with temperature regulation (thermocouple and heater). 3. PC station with TopSpin software v2.1 (Bruker, Wissembourg, France). 4. Bruker BCU (air cooling) and VT (variable temperature) units to control sample temperature using dry airflow.

2.4. NMR Instrumentation for 14N-NMR

1. NMR Spectrometer equipped for solid state: Bruker Avance 500 operating at 36.1 MHz (Bruker, Wissembourg, France). 2. NMR Probe: PE triple 1H–X–Y CP probe tuned at 36.1 MHz on X channel; 50 W on the Y and 1H channels; equipped with temperature regulation (thermocouple and heater). Homemade 4 mm horizontal coil. 3. PC station with TopSpin software v2.1 (Bruker, Wissembourg, France). 4. Bruker BCU (air cooling) and VT (variable temperature) units to control sample temperature using dry airflow.

2.5. NMR Instrumentation for HR-MAS 1H-NMR

1. NMR Spectrometer equipped for solid state: Bruker Avance 500 operating at 500.16 MHz (Bruker, Wissembourg, France). 2. NMR Probe: 1H/13C, 2H-lock, z-gradient, 4 mm HR-MAS probe tuned at 500.16 MHz, equipped with temperature regulation (thermocouple and heater). 3. PC station with TopSpin software v2.1 (Bruker, Wissembourg, France). 4. Bruker BCU (air cooling) and VT (variable temperature) units to control temperature on sample using dry airflow. 5. Bruker pneumatic MAS unit to control sample rotation at the magic angle.

344

Grélard et al.

2.6. Data Analysis

1. PC computer loaded with Topspin 2.1 software (Bruker, Wissembourg, France) and Origin software V7.5 (OriginLab Corporation, Massachusetts, USA). 2. NMR Friend V1.2 plug-in software to import and treat NMR spectra (see Note 1) on Origin software developed by Sébastien Buchoux, UMR5248 CNRS-University Bordeaux 1, France (see Note 2).

3. Methods The membrane that defines the cell entity or the cell organelles is by its very nature a medium that is half way between a liquid and a solid. This state called soft matter is by definition a liquid crystalline medium whose anisotropic properties are essential for membrane function and cells. Molecules there embedded, such as lipids, proteins, drugs may undergo many dynamic processes such as lateral diffusion in the bilayer plane, rotational diffusion around the bilayer normal or transverse diffusion from one membrane leaflet to the other. They may also group as ordered patches in the membrane plane that are named “rafts” to picture the rigidity of these membrane domains that swim in a “sea” of more fluid lipids and proteins. Understanding structure and dynamics of membrane components will provide insight towards deciphering complex biological processes such as cell fusion, trafficking, apoptosis, energetic, signal transduction, etc. Solid state NMR is the only non-destructive, non-invasive and quantitative spectroscopy that can measure membrane structure and dynamics. There are two data categories that can be obtained using NMR, spectra and relaxation times. Spectra are in turn of two types “wide line” and “high resolution”. Wide line NMR (spectra may span several hundredths of kilohertz) can be sensitive to membrane macroscopic orientation and symmetry (Fig. 1) and molecular motions (Fig. 2). It is a useful tool to determine the nature of membrane phases (bicelles, lamellar, hexagonal, isotropic (micelles or cubic), etc.) (8, 9) and track membrane dynamics (membrane fluidity, fusion, gel (solidordered), liquid-ordered, and fluid (liquid-disordered) states). The well-known regulating effect of cholesterol on membrane phases (increasing fluidity of solid-ordered phases and decreasing that of liquid-disordered phases) can easily be monitored by using phospholipids or sterols that are deuterated (2, 10–12). Analysis of spectral moments or quadrupolar splittings allows describing membrane dynamics either globally (bilayer/molecule fluctuations) or locally by measuring the reorientation of individual chain segments in the bilayer core (Fig. 2). By taking advantage of

NMR Spectroscopy of Lipid Bilayers

345

Fig. 1. Nitrogen-14 and Phosphorus-31 solid state NMR spectra of lipids embedded in different membrane phases. Top row: spectra obtained for a mixture of large micrometric liposomes and nanometric micelles made of phosphatidylcholine lipids. Middle-top row: spectra of DMPtdCho/DCPtdCho bicelles in 80% D2O oriented such that the normal to the disc (400 nm) plane is perpendicular to the magnetic field direction, B0. Middle-bottom row: spectra of TBBPtdCho/DCPtdCho bicelles (biphenyl bicelles) in 80% D2O oriented such that the normal to the disc (800 nm) plane is parallel to the magnetic field direction, B0. Bottom row: spectra of phopholipid micelles (nm size). On the right hand side are represented cartoons to picture the nature of the lipid systems. L, I, B and B//, B0, and n stand respectively for lamellar, isotropic, bicelle normal perpendicular and parallel, magnetic field, and bilayer normal.

magnetic field orientational dependence, wide line NMR spectra can also probe average orientations of molecules embedded in membranes (membrane topology) (10, 11, 13) when used in conjunction with X-rays or neutrons structural information. The 3D structure of molecules in membranes is also obtained by making use of MAS, a technique by which the sample is rapidly spun at an angle of 54.7° with respect to the magnetic field (Fig. 4d) leading to pseudo “high-resolution” spectra (sharp lines of less than 1 Hz width), as in the liquid state, the sample being still in the membranous “liquid-crystalline” state (Fig. 4). All multidimensional NMR techniques used in solution NMR can hence be applied bringing assignment of resonances to molecular structure (Table 1, Fig. 4). Relaxation times (T1 and T2) are obtained by making use of specific pulse sequences (see Note 10) and bring

346

Grélard et al.

a

c + 30% Chol (lo)

G el (so)

b

d

+ 30% Chol (lo)

F luid(ld)

60

40

20 2H

0

−20 −40

−60 −80 80

40

20 2H

F requency (kH z)

e F irst spectral moment (kH z)

60

0

−20 −40

−60 −80

F requency (kH z)

f

62.5

so

140

lo

120

lo

100

50.0

+ 30% Chol + 30% Chol

80

37.5 25.0

ld

60

12.5

ld 40

Quadrupolar splitting (kH z)

80

0.0 0

10

20

30

40

Temperature (°C)

50

60

2

4

6

8

10

12

14

16

Labelled carbon position on lipid acyl chain

Fig. 2. Solid-state wide line 2H-NMR spectra of DPPtdCho-2H62 with and without cholesterol dispersed in water (liposomes). (a) DPPtdCho, T = 10°C, gel (so) phase; (b) DPPtdCho, T = 40°C, fluid (ld) phase; (c) DPPtdCho/cholesterol (2/1 molar), T = 10°C, lo phase; (d) DPPtdCho/cholesterol (2/1 molar), T = 40°C, lo phase. (e) First spectral moment as a function of temperature for DPPtdCho (filled symbol) and DPPtdCho/cholesterol (empty symbol) spectra. (f) Quadrupolar splittings for DPPtdCho acyl chains at 40°C in the presence (empty symbols) and absence (filled symbols) of cholesterol.

values ranging from microseconds to minutes (Fig. 5). Analysis of relaxation times in the frame of proper motional modes leads to the calculation of speed of molecular motion and activation energies. This allows describing membrane dynamics from the atomic level where intra-molecular motions dominate (nano-topicoseconds timescale), to the cell level where membrane hydrodynamic modes of motion play an important role (seconds time scale) (14–16).

NMR Spectroscopy of Lipid Bilayers

3.1. Determination of Lipid Phase Nature (Lamellar (Liposomes), Oriented-Bicelle, Isotropic) Using Wide Line 31P-NMR

347

1. Appropriate amounts of phospholipids are weighed (see Note 11), and a suitable volume of water is added to obtain a lipid hydration (water mass/total mass) 80–95% (w/w). Lipid concentration is of the order 10–200 mM. For bicelle samples, 14:0/BB TBBPtdCho and 14:0/14:0 DMPtdCho represent about 80 mol% of the total lipid, 6:0/6:0 DCPtdCho the remaining 20%. 2. The hydrated sample is then vigorously shaken in a vortex mixer, frozen in liquid nitrogen, and heated to 50°C for 10 min in a water bath. This cycle of liposome (multilamellar vesicles of micrometre size) or bicelle (bilayer discs of 400–800 nm diameters) formation is repeated three to five times until a homogeneous preparation is obtained at room temperature. 3. The resulting sample (may appear translucent when in bicelle phase) is transferred (see Note 12) into a 4-mm NMR ZrO2 rotor (100 ml) that is placed in the magnetic field. The measuring probe is properly tuned to 121.5 MHz (31P channel) and 300.13 MHz (1H channel) and the space homogeneity (“shimming”) is adjusted at best. 4. Using the variable temperature set up (VT and BCU units), samples are allowed to equilibrate 15–30 min at a given temperature before the time dependent NMR signal (FID) is acquired; the temperature is regulated to ±1°C (see Note 9). 5.

P-NMR spectroscopy is performed at 121.5 MHz. FID are acquired using a phase-cycled Hahn-echo pulse sequence with gated broad-band proton decoupling (17). Typical acquisition parameters are as follows: p/2 pulse width (P1) of 14.5 µs (see Notes 3 and 10) spectral window (SW) of 32 kHz, interpulse delay (D6) of 50 µs and a repetition delay (D1) of 5 s (see Note 4). Typically, 512 scans are recorded in 42 min (see Note 5). 31

6. Processing parameters: noise filtering of the FID with an exponential window (see Note 7), characterized by the Lorentzian line-broadening factor (LB = 10–50 Hz, see Note 6), is used. Fourier Transformation is applied to get the spectrum, and its phase is adjusted to correct for unavoidable time delays in acquisition. Reference is made to 85% H3PO4 (0 ppm). The width of the spectrum is measured and used to calculate the chemical shift anisotropy (CSA). 7. The spectrum (Fig. 1, 31P column, top) is characteristic of a non-oriented spectrum (broad and major axially symmetric pattern of 45 ppm CSA) superimposed on a small isotropic line (0 ppm). This is typical of a mixture of micrometre liposomes and nanometre micelles (sketches on the right).

348

Grélard et al.

Table 1 1 H chemical shifts of liposomal lipids in H2O/D2O (90/10). Ambient temperature, reference TSP (0 ppm). The sample is spun at 7 kHz at the magic angle

Head group

Backbone

Fatty acyl chains

N

Sphingomyelin

Liver PtdEth

Liver PtdCho

a

nd

3.97

3.99

b

nd

3.22

3.66

g

3.66

7.87

3.21

g1

nd

nd

nd

g2

5.31

5.29

5.31

g3

4.00

4.03

4.27

1

5.71

1′ 2

5.71

2.40

2.28

2′

2.22/2.31

nd

2.28

3

2.02

1.57

1.57

3′

1.60

nd

1.57

4

1.29

1.24

1.25

4′

1.29

2.00

1.25

5

1.29

1.24

1.25

5′

1.29

nd

1.25

6

1.29

1.24

1.25

6′

1.29

nd

1.25

7

1.29

1.24

1.25

7′

1.29

2.75

1.25

8

1.29

1.24

1.25

8′

1.29

nd

2.02

9

1.29

1.24

1.25

9′

1.29

nd

nd

10

1.29

1.24

1.25

10′

1.29

2.75

nd

11

1.29

1.24

1.25

11′

1.29

nd

2.77

12

1.29

1.24

1.25

(continued)

NMR Spectroscopy of Lipid Bilayers

349

Table 1 (continued) N

Sphingomyelin

Liver PtdEth

Liver PtdCho

12′

1.29

nd

nd

13

1.29

1.24

1.25

13′

1.29

2.75

nd

14

1.29

1.24

1.25

14′

1.29

nd

2.02

15

0.88

1.24

1.25

15′

1.29

nd

1.25

1.24

1.25

2.00

1.25

1.24

1.25

1.24

1.25

0.85

0.86

1.24

0.86

16 16′

1.29

17 17′

1.29

18 18′

0.88

19 19′

1.24

20′

0.85

nd not determined. Numbers separated by “/” (e.g. 2.22/2.31) indicate that there are two lines of indicated chemical shifts

8. The spectrum (Fig. 1, 31P column, middle-top) is much narrower and made of two sharp lines representing 14:0/14:0 DMPtdCho (major, −12 ppm) and 6:0/6:0 DCPtdCho (minor, −4 ppm) in a bicelle (disc) structure with the bilayer normal oriented at 90° with respect to the magnetic field direction (sketches on the right). 9. The spectrum (Fig. 1, 31P column, middle-bottom) is also made of two sharp lines representing 14:0/BB TBBPtdCho (major, 20 ppm) and 6:0/6:0 DCPtdCho (minor, 8 ppm) in a bicelle (disc) structure with the bilayer normal oriented at 0° with respect to the magnetic field direction (sketch on the right). 10. The spectrum (Fig. 1, 31P column, bottom) is characteristic of a phase with isotropic symmetry (cubic, micelles, etc.): a single, very intense sharp line appears and is centred at ca. 0 ppm.

350

Grélard et al.

3.2. Determination of Lipid Phase Nature (Lamellar (Liposomes), OrientedBicelle, Isotropic) Using Wide Line 14 N-NMR

1. Step 1 as in Subheading 3.1. 2. Step 2 as in Subheading 3.1. 3. The resulting sample (may appear translucent when in bicelle phase) is transferred into a 4-mm NMR ZrO2 rotor (100 ml) that is placed in the magnetic field. The measuring probe is properly tuned to 36.1 MHz (X-channel). 4. Step 4 as in Subheading 3.1. 5.

N-NMR spectroscopy is performed at 36.1 MHz. NMR spectra are acquired using a quadrupolar echo pulse sequence (18). Typical acquisition parameters are as follows: spectral window of 100 kHz; p/2 pulse width 10 µs (see Note 13), and interpulse delay of 200 µs. A recycle delay of 0.2 s is used. Typically, 40,000 scans are accumulated in 2.2 h. 14

6. Processing parameters: noise filtering of the FID with an exponential window, characterized by the Lorentzian line-broadening factor (LB = 100 Hz), is used. Fourier Transformation is applied to get the spectrum, and its phase is adjusted to correct for unavoidable time delays in acquisition. The spectrum centre is set to 0 ppm. 7. The spectrum (Fig. 1, 14N column, top) is characteristic of a non-oriented spectrum (broad axially symmetric powder pattern of ca. 9 kHz) superimposed on a isotropic line (0 Hz). This is typical of a mixture of micrometre liposomes and nanometre micelles (sketches on the right). 8. The spectrum (Fig. 1, 14N column, middle-top) is made of two sets of sharp doublets representing 14:0/14:0 DMPtdCho (major, ca. 7 kHz splitting) and 6:0/6:0 DCPtdCho (minor, ca. 1.5 kHz splitting) in a bicelle (disc) structure with the bilayer normal oriented at 90° with respect to the magnetic field direction (sketches on the right). 9. The spectrum (Fig. 1, 14N column, middle-bottom) is also made of two sets of sharp doublets representing 14:0/BB TBBPtdCho (major, ca. 13 kHz Splitting) and 6:0/6:0 DCPtdCho (minor, ca. 3 kHz splitting) in a bicelle (disc) structure with the bilayer normal oriented at 0° with respect to the magnetic field direction (sketch on the right). 10. The spectrum (Fig. 1, 14N column, bottom) is characteristic of a phase with isotropic symmetry (cubic, micelles, etc.): a single, very intense sharp line appears and is centred at ca. 0 Hz. 3.3. Determination of Lipid Dynamics in Non-oriented Multilamellar Bilayers (Gel, Fluid, LiquidOrdered) Using Wide Line 2H-NMR

1. Appropriate amounts of phospholipids are weighed to reach a final concentration of ca. 10–200 mM (see Note 11). 2. A suitable volume of deuterium-depleted water, 1H2O is added to obtain a lipid hydration of 80–90% (v/w). Hydration is defined as the mass of water over the total mass of the system (phospholipids and water).

NMR Spectroscopy of Lipid Bilayers

351

3. The hydrated sample is then vigorously shaken in a vortex mixer, frozen in liquid nitrogen, and heated to 50°C for 10 min in a water bath. This cycle of liposome (multilamellar vesicles) formation is repeated three to five times until a milky dispersion is obtained at room temperature. 4. The resulting dispersion is transferred into a 4-mm NMR ZrO2 rotor (100 ml) that is placed in the magnetic field (see Note 12). The measuring probe is properly tuned to 76.77 MHz (2H Channel), and the space homogeneity is adjusted at best. 5. As step 4 in Subheading 3.1. 6. 2H NMR spectroscopy is performed at 76.77 MHz. NMR spectra are acquired using a solid quadrupolar echo pulse sequence (18). Typical acquisition parameters are as follows: spectral window of 500 kHz, p/2 pulse width 3 µs (see Note 13), and interpulse delay of 30 µs. A recycle delay of 1.5 s is used. Typically, 512–1,024 scans are accumulated in 10–20 min. 7. Processing parameters: noise filtering of the FID with an exponential window, characterized by the Lorentzian linebroadening factor (LB = 50–300 Hz), is used. Fourier Transformation is applied to get the spectrum, and its phase is adjusted to correct for unavoidable time delays in acquisition. The peak-to-peak separation, called the quadrupolar splitting, that can be in principle obtained for non-equivalent C–2H bonds, can be measured on some spectra. Spectral moments can also be used to estimate width and shape changes in spectra (14, 19). 8. The spectrum at 10°C for DPPtdCho-2H62 liposomes (Fig. 2a) is characteristic of a gel phase where the quadrupolar interaction occurring for each C–2H bond (there are 62 deuterium labelled positions on the acyl chains of DPPtdCho2 H62) reveals non-axial symmetry due to the very restricted chain dynamics (solid-ordered phase, so) and to the symmetry of the phase. No peak-to-peak measurement can be measured on such a spectrum, except for the central broad doublet assigned to the chain-end deuterated methyl groups. The first moment is calculated using a routine built in the Origin software. 9. The spectrum obtained at 40°C for DPPtdCho-2H62 liposomes (Fig. 2b) is much narrower along the frequency axis and shows a shape characteristic of a lamellar fluid phase with axial symmetry (liquid-disordered phase, ld). The overall spectrum is nonetheless complex because it represents the superposition of all spectra coming from all C–2H bonds. Several peak-to-peak separations, quadrupolar splittings, can be measured and plotted as a function of the labelled carbon position along the acyl chain (Fig. 2f). Spectral deconvolution

352

Grélard et al.

to get oriented-like spectra (20, 21) can be applied here for more resolution (see Note 2). 10. The spectrum at 10°C for DPPtdCho-2H62/cholesterol (2/1) liposomes (Fig. 2c) is characteristic of a liquid-ordered phase (lo) where the quadrupolar interaction occurring for each C–2H bond reveals axial symmetry due to the axial diffusion of lipids in the membrane. 11. The spectrum at 40°C for DPPtdCho-2H62/cholesterol (2/1) liposomes (Fig. 2d) is characteristic of a liquid-ordered phase with greater dynamics than at 10°C (spectrum of smaller width). Several peak-to-peak separations, quadrupolar splittings, can be measured and plotted as a function of the labelled carbon position along the acyl chain (Fig. 2f). 12. Spectra were recorded as a function of temperature, the first moment calculated and plotted against temperature (Fig. 2e). Elevated values are found for gel phase temperatures and depict an ordered rigid state (so) whereas smaller values are found above the gel-to-fluid phase transition temperature (ca. 37°C) for pure DPPtdCho-2H62 liposomes and depict a disordered lamellar fluid phase (ld). In case of added cholesterol, the transition is nearly smoothed out demonstrating the regulating effect of cholesterol: it increases the fluidity of ordered phases and decreases that of fluid phases: this is the liquid-ordered state (lo). 13. Peak-to-peak separation, so-called quadrupolar splittings, measured for individual C–2H bonds report on local space and time angular fluctuations (order parameters, see Note 14). When plotted against labelled acyl chain position, they report on local dynamics (order/disorder) (Fig. 2f ). The greatest the quadrupolar splitting, the highest the order (less bond fluctuations). C–2H bonds close to the glycerol backbone (positions 2–8) are much more ordered than those near the bilayer centre (positions 12–16). 3.4. Determination of Lipid Molecular Structure in a Membrane Environment Using 1 H HR-MAS (Fig. 3)

1. Appropriate amounts of phospholipids are weighed to reach a final concentration of ca. 1–10 mM (see Note 11). 2. A suitable volume of ultra pure water mixed with deuterated water (90:10, v:v) containing traces of TSP (trimethylsilylpropionic acid) as a reference (0 ppm) is added to obtain a lipid hydration of 80–95% (v/w). Hydration is defined as the mass of water over the total mass of the system (phospholipids and water). 3. The hydrated sample is then vigorously shaken in a vortex mixer, frozen in liquid nitrogen, and heated to 50°C for 10 min in a water bath. This cycle of liposome (multilamellar vesicles) formation is repeated three to five times until a milky dispersion is obtained at room temperature.

NMR Spectroscopy of Lipid Bilayers

353

O

Liver PC

g

g

+

N

b a

g

O

P

3'

1'

O

2'

H O O

O-

G1

G2

O

G3

5'

4'

2 1

7'

6'

4

9'

8'

6

3

10'

8

5

11'

13'

12'

10

7

12

9

15'

14'

18

11

17'

16'

18'

16

13

15

18 17

O O

Liver PE +

H3N

b

a

1'

O

H O

O P O

G2

O-

G1

2'

3'

O

2 1

G3

4'

5'

6'

4 3

7'

8'

9'

6 5

10'

8

11'

12'

10

7

14'

15'

13'

9

12 11

17'

19'

16' 18

13

18'

16 15

20'

18 17

O

O 3'

1'

Sphingomyelin

g

g N+ g

O b

a

H

O P O O-

G1

2'

HN G2 G3

5' 4'

1

3 2

7' 6' 5

4

9'

7 6

11' 10'

8'

9 8

13' 12' 11

10

15' 14' 13

12

17' 16'

18'

15 14

H OH

Fig. 3. Molecular structures of selected lipids with numbering for Table 1.

4. The resulting dispersion is transferred into a 4-mm NMR ZrO2 rotor for “High-Resolution MAS NMR” (50 ml) that is placed in the magnetic field. 5. Speed for MAS is set to 7–10 kHz using the pneumatic unit device. Using the variable temperature set up (VT and BCU units), samples are allowed to equilibrate 15–30 min at 30°C before the time dependent NMR signal (FID) is acquired; the temperature is regulated to ±1°C. The measuring probe is properly tuned to 500.16 MHz and the space and time homogeneity is adjusted by a deuterium “lock” system using the deuterated solvent as a reference. 6. 1H-NMR spectroscopy is performed at 500.16 MHz. NMR spectra are acquired using a single pulse sequence with water suppression (22). Typical acquisition parameters are a pulse duration (P1) of 5.5 ms, a time domain (TD) set to 32 K points, a spectral width (SW) of 20 ppm and a relaxation delay (D1) of 3 s. The number of scans is set to 16. These conditions determine a total acquisition time of 90 s for the time dependent signal (FID). 7. Processing parameters: noise filtering with an exponential window of the FID, characterized by the Lorentzian linebroadening factor (LB = 0.3 Hz), is used. This noise filtering generally improves the signal-to-noise ratio, but at the cost of resolution. Fourier Transformation is applied to get the spectrum, and its phase is adjusted to correct for unavoidable time delays in acquisition (Fig. 4). Referencing (TSP rightmost resonance is set to 0 ppm), peak picking (measure of chemical

354

Grélard et al.

shifts in parts per million), and peak integration (measure line areas) are part of the relevant processing parameters. 8. The spectrum obtained in the absence of rotation (Fig. 4c) is characteristic of an unresolved proton spectrum of a membrane phase. Very little peak assignment can be made on such a spectrum. 9. The spectrum obtained for a MAS speed of 10 kHz (Fig. 4b) shows a resolution that is similar to that obtained in liquids (organic solution, Fig. 4a), the sample being still under the form of liposomes. Referencing, peak and peak integration can now be applied. 10. The strategy for structure elucidation is the same as for lipids in organic solvent solution; 2D-NMR experiments (COSY, a Chloroform solution

10

5

0

5

0

b Liposomes, Water, MAS B0 54.7 °

Rotor axis

d c 10 Liposomes, Water, Static

10

5

0

ppm (1H)

Fig. 4. 1H-NMR spectra of Liver PtdCho. (a) Lipid dissolved in CDCl3 solution. (b) Solid state “high-resolution” magic angle sample spinning (sketch in inset d) of lipid dispersed in water containing 10% D2O, MAS speed = 10 kHz. Water suppression was used during signal acquisition. (c) Spectrum without spinning. Note the change in x-scale from (b) to (c).

NMR Spectroscopy of Lipid Bilayers

355

TOCSY) are often needed for complete assignment. Table 1 reports the structural assignment of 1H chemical shifts for selected lipids (Fig. 3) under liposomal form, i.e., as water dispersions (see Note 8). 3.5. Determination of Lipid Internal Dynamics in Oriented Bilayers Using 2H-NMR Spin-Lattice Relaxation Time Measurements

1. Appropriate amounts of phospholipids are weighed to obtain a bicelle composition (ca. 80 mol% of 14:0/14:0 DMPtdCho2 H72 and 20 mol% of 6:0/6:0 DCPtdCho (23)) and dissolved in organic solvent (CHCl3/MeOH), the solvent is evaporated, water is added, the sample shaken in a vortex mixer and lyophilized to remove solvent traces. 2. A suitable volume of deuterium-depleted water is added to obtain a lipid hydration (water mass/total mass) 80–95% (w/w). Total lipid concentration is of the order of 200–50 mM. 3. The hydrated sample is then vigorously shaken in a vortex mixer, frozen in liquid nitrogen, and heated to 50°C for 10 min in a water bath. This cycle of bicelle (bilayer discs of 400 nm diameter) formation is repeated three to five times until a homogeneous translucent preparation is obtained at room temperature. 4. The resulting dispersion is transferred into a 4-mm NMR ZrO2 rotor (100 ml) that is placed in the magnetic field (see Note 12). The measuring probe is properly tuned to 76.77 MHz (2H channel) and the space homogeneity is adjusted at best. 5. As step 4 in Subheading 3.1. 6. 2H NMR spectroscopy is performed at 76.77 MHz. A series of ca. 30 NMR signals are acquired using an inversion recovery quadrupolar echo pulse sequence. Typical acquisition parameters are as follows: spectral window of 250 kHz, p/2 and p pulse widths of 3 and 6 µs (see Note 13), interpulse delay in the echo sequence of 30 µs, variable delays, t1, between the inverting p pulse and the first p/2 pulse of the echo sequence ranging between 1 and 1,500 ms. A recycle delay of 1.5 s is used and typically, 256 scans are accumulated per spectrum in 6 min. The whole experiment lasts for 4.5 h. 7. Processing parameters: noise filtering of all FIDs with the same exponential window, characterized by a Lorentzian linebroadening factor (LB) of 50 Hz. Fourier Transformation is applied to get the spectrum series. The zero order phase correction is adjusted on the last spectrum of the series and applied to all spectra (Fig. 5c). 8. Quadrupolar splitting can easily be measured on oriented spectra and attributed to labelled carbon positions in the molecular structure (Fig. 5b) using previous assignments (6).

356

Grélard et al.

Fig. 5. 2H-NMR relaxation of DMPtdCho-2H72 in DMPtdCho/DCPtdCho bicelles hydrated at 80% with 1H2O. (a) Rate of spin-lattice relaxation (1/T1z) as a function of labelled carbon position. (b) DMPtdCho-2H72 molecule showing carbon numbering. (c) Stack of Fourier transformed NMR signals after an inversion recovery quadrupolar echo sequence. X-axis is the classical frequency axis (kHz) and the t1 axis is that of delays between the inverting pulse and the first pulse of the echo sequence. Analysis of such data yields (a).

The area of each individual splitting is determined using a Bruker routine, and the area variation as function of t1 is fitted against the proper equation for inversion-recovery sequence in the Bruker package to obtain the individual spin lattice relaxation time T1 (see Note 15).

NMR Spectroscopy of Lipid Bilayers

357

9. Spin-lattice relaxation times measured for individual C–2H bonds report on molecular motions occurring at the nanosecond timescale. When plotted against labelled carbon position, they report on local dynamics (correlation times for segment or molecular motions) (Fig. 5a). The relaxation rate (inverse of T1) is the largest for C–2H bonds of the glycerol backbone (positions g1, g2, g3) and is much smaller for head group (a, b, g) or chain (2–14) positions. This indicates that the glycerol backbone possesses a very restricted motional freedom and can be considered as a semi-rigid “kneecap” separating two very fluid zones, the phosphocholine head group facing the water medium and the acyl chains constituting the oily bilayer interior.

4. Notes 1. Data may also be treated on the computer that drives the spectrometer. Most spectrometer-linked computers are now PCs. 2. Spectral moment calculations or spectral de-Pake-ing may not be available on computers driving spectrometers. They can nonetheless be implemented in PC by obtaining the source code from the authors (14, 21). 3. P1 is determined with a routine provided on all spectrometers (POPT on Bruker software). It must be determined especially for samples with different solvents (because of the different dielectric constants that may modify the P1 value). Note that elevated salt concentration may alter the P1 duration. 4. Relaxation delays are also determined using a routine provided on spectrometers, which allows calculation of spin lattice relaxation time, T1. D1 is usually set to 5 × T1 for optimum relaxation delay. 5. The number of scans is chosen such that the signal-to-noise ratio is about or greater than 50. 6. Line broadening is set in order to filter spectral noise without marked modification of peak line width. 7. More sophisticated window filtering can be used as they are available on most spectrometer software. 8. Many groups can be identified by their 1H chemical shifts using chemical shift tables (24) and coupling constants (multiplet separation). Two-Dimensional NMR experiments (COSY, TOCSY) are sometimes needed for complete assignment. 9. Experiments may sometimes be run overnight implementing a temperature variation from low to high temperatures and back to low to detect possible sample hysteresis.

358

Grélard et al.

10. A pulse sequence is a series of radiofrequency impulsions separated by specific delays, which allow selecting one or several magnetic or electric molecular interactions to be observed. The time dependent signals (FID) that are detected following a pulse sequence must be Fourier Transformed to obtain spectral data. 11. In case of lipid mixtures (DPPC/Cholesterol), lipids are dissolved in organic solvent (CHCl3/MeOH), the solvent is evaporated, water is added, the sample shaken in a vortex mixer and lyophilized to remove solvent traces All these experiments can be run with DPPtdCho-2H31 (less expensive) instead of DPPtdCho-2H62, the numbers of scans must then be increased by a factor 2. 12. In order to facilitate the transfer into the rotor, it may be wise to vary (increase or decrease) the temperature to be in a fluid state. The sample may then be easily poured into the rotor. 13. The pulse must be very short to acquire very wide spectra (hundredths of kilohertz). Using longer pulse widths will lead to intensity loss in the outer verges of the spectrum and hence errors in spectral moment calculation. 14. Order parameters, S, represent a time and space average of molecules or of molecular bonds with respect to the bilayer normal, z, in an axially symmetric situation, as in lipid bilayers. The SZ order parameter may vary from 1 (M1 of ca. 150 kHz or C–D2 quadrupolar splitting of 62.5 kHz) for fully rigid solid state systems to 0 for liquid systems. 15. Mz(t1) = Mz(t1 =฀∞) × (1 − 2 × exp(–t1/T1z)) where Mz(t1) is the z-magnetization at time t1 obtained by straight integration of individual lines; Mz(t1 =฀∞) is the Boltzmann equilibrium magnetization; t1 is the delay between the inverting p pulse and the first p/2 pulse of the echo sequence and T1z is the spinlattice (longitudinal) NMR relaxation time. References 1. Simons K, Ikonen E (2000) How cells handle cholesterol. Science 290:1721–1726 2. Beck JG, Mathieu D, Loudet C, Buchoux S, Dufourc EJ (2007) Plant sterols in “rafts”: a better way to regulate membrane thermal shocks. FASEB J 21:1714–1723 3. Byrne RD, Barona TM, Garnier M, Koster G, Katan M, Poccia DL, Larijani B (2004) Nuclear envelope assembly is promoted by phosphoinositide-specific PLC with selective recruitment of phosphatidylinositol enriched membranes. Biochem J 387:393–400 4. Dufourc EJ (2009) NMR for lipids and biomembranes. In: Wiley encyclopedia of

chemical Biology. Wiley, Chichester, England DOI: 10.1002/9780470048672. wecb 9780470048389 5. Gamier-Lhomme M, Grelard A, Byrne RD, Loudet C, Dufourc EJ, Larijani B (2007) Probing the dynamics of intact cells and nuclear envelope precursor membrane vesicles by deuterium solid state NMR spectroscopy. Biochim Biophys Acta Biomembr 1768:2516–2527 6. Aussenac F, Laguerre M, Schmitter JM, Dufourc EJ (2003) Detailed structure and dynamics of bicelle phospholipids using selectively and perdeuterated labels. A 2H-NMR and molecular mechanics study. Langmuir 19:10468–10479

NMR Spectroscopy of Lipid Bilayers 7. Loudet C, Manet S, Gineste S, Oda R, Achard MF, Dufourc EJ (2007) Biphenyl bicelle disks align perpendicular to magnetic fields on large temperature scales: a study combining synthesis, solid-state NMR, TEM, and SAXS. Biophys J 92:3949–3959 8. Marinov R, Dufourc EJ (1995) Cholesterol stabilizes the hexagonal type II phase of 1-palmitoyl-2-oleoyl sn glycero-3-phosphoethanolamine. A solid state 2H and 31P NMR study. J Chim Phys 92:1727–1731 9. Marinov R, Dufourc EJ (1996) Thermotropism and hydration properties of POPE and POPEcholesterol systems as revealed by solid state 2 H and 31P-NMR. Eur Biophys J 24:423–431 10. Aussenac F, Tavares M, Dufourc EJ (2003) Cholesterol dynamics in membranes of raft composition: a molecular point of view from 2 H and 31P solid state NMR. Biochemistry 42:1383–1390 11. Dufourc EJ, Parish EJ, Chitrakorn S, Smith ICP (1984) Structural and dynamical details of cholesterol-lipid interaction as revealed by deuterium NMR. Biochemistry 23: 6063–6071 12. Pott T, Maillet JC, Dufourc EJ (1995) Effects of pH and cholesterol on DMPA membranes: a solid state 2H- and 31P-NMR study. Biophys J 69:1897–1908 13. Marsan MP, Muller I, Ramos C, Rodriguez F, Dufourc EJ, Czaplicki J, Milon A (1999) Cholesterol orientation and dynamics in dimyristoylphosphatidylcholine bilayers: a solid state deuterium NMR analysis. Biophys J 76:351–359 14. Dufourc EJ (2006) Solid state NMR in biomembranes. In: Larijani B, Woscholski R, Rosser CA (eds) Chemical biology. Wiley, London, pp 113–131 15. Dufourc EJ, Mayer C, Stohrer J, Althoff G, Kothe G (1992) Dynamics of phosphate head

16.

17.

18.

19.

20.

21. 22.

23.

24.

359

groups in biomembranes. Comprehensive analysis using phosphorus-31 nuclear magnetic resonance lineshape and relaxation time measurements. Biophys J 61:42–47 Dufourc EJ, Smith ICP (1986) A detailed analysis of the motions of cholesterol in biological membranes by 2H-NMR relaxation. Chem Phys Lipids 41:123–135 Rance M, Byrd RA (1983) Obtaining highfidelity spin-1/2 powder spectra in anisotropic media: phase-cycled Hahn echo spectroscopy. J Magn Reson 52:221–240 Davis JH (1979) Deuterium magnetic resonance study of the gel and liquid crystalline phases of dipalmitoylphosphatidylcholine. Biophys J 27:339–358 Davis JH, Jeffrey KR, Bloom M, Valic MI, Higgs TP (1976) Quadrupolar echo deuteron magnetic resonance spectroscopy in ordered hydrocarbon chains. Chem Phys Lett 42:390–394 Bloom M, Davis JH, Mackay AL (1981) Direct determination of the oriented sample NMR spectrum for systems with local axial symmetry. Chem Phys Lett 80:198–201 Sternin E, Bloom M, MacKay AL (1983) De-Pake-ing of NMR Spectra. J Magn Reson 55:274–282 Piotto MVS, Sklenar V (1992) Gradienttailored excitation for single-quantum NMR spectroscopy of aqueous solutions. J Biomol NMR 2:661–666 Raffard G, Steinbruckner S, Arnold A, Davis JH, Dufourc EJ (2000) Temperaturecomposition diagram of dimyristoyl-dicaproyl phosphatidylcholine “bicelles” self-orienting in the magnetic field. A solid state 2H and 31PNMR study. Langmuir 16:7655–7662 Pretsch E, Bühlmann P, Affolter C (2000) Structure determination of organic compounds. Springer, Berlin

as

Part V Molecular Modelling

as

Chapter 19 Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis Catherine Etchebest and Gaelle Debret Abstract Membrane proteins play major roles in many biological processes such as signalling, transport, etc. They have been shown to be involved in the development of many diseases and have become important drug targets per se. The understanding of their functional properties may be facilitated if a 3D structure is available. However, in the case of membrane proteins, only a few 3D structures have been solved to date. Bioinformatics and molecular modelling approaches are thus powerful alternatives to fill the gap between the sequence and the structure. Here, a review of the most recent approaches is proposed together with guidelines on how to use them. In addition, insofar as important biological processes require conformational changes, we discuss some interesting methods aimed at exploring the dynamic behaviour of proteins in their membrane environment. The paper ends with a brief description of useful approaches for determining oligomerisation or ligand binding sites. Key words: Membrane proteins, Structure prediction, Bioinformatics, Molecular dynamics, Normal mode analysis, Docking, Conformational changes

1. Introduction For biologists working with membrane proteins, molecular modelling techniques have become a true companion. They are valuable tools that complement biophysical approaches by yielding, at an atomic level, structural and dynamic details for a given system. Their use extends from 3D structure prediction to fine structural and energetic details of ions transport as well as the study of conformational changes of proteins in the lipid environment. Even though the concepts are quite identical to those for soluble proteins, when dealing with membrane proteins, important differences exist in their practical usage. Indeed, the membrane environment, which represents an anisotropic medium compared to water, requires specific Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_19, © Springer Science+Business Media, LLC 2010

363

364

Etchebest and Debret

methods, protocols, and careful controls of different parameters for the characterisation of biologically pertinent features. This chapter describes the most used (and useful, it is hoped,) techniques for simulating membrane protein systems. We restrict our topic to the a-helical class of membrane proteins. The structural modelling of b-barrels proteins requires specific approaches that will not be detailed here. Indeed, b-strands are generally more difficult to predict because they are stabilised by long-range interactions. The first section is dedicated to tools aimed at constructing membrane protein structures from sequence information that could be guided by experimental data. We describe different strategies that can yield complete 3D-models when sequence relationship exists with known structures (homology modelling) or alternatively, we provide partial structural information such as the location of the transmembrane segments and/or the orientation of the N and C termini with respect to the intra/extra cellular medium. Some examples that illustrate the similarities and differences between the different methods are given. The second section focuses on structural analysis using sophisticated molecular simulation techniques, i.e., molecular dynamics (MD), normal modes analysis (NMA), steered molecular dynamics (SMD), driven molecular dynamics for deducing, and/or understanding functional characteristics of membrane proteins. These approaches enable, for instance, the exploration of the mechanical features of the protein, among which are domain flexibility, pathway of ions along channels with the corresponding free energies, etc. More importantly, molecular dynamics simulations are widely used for solving structures with nuclear magnetic resonance data (see chapters in Part IV of this volume). Steered molecular dynamics simulations also have become a true partner to Atomic Force Microscopic (AFM) experiments, providing further insight into the impact of the forces on the structure at an atomistic level. Even though most methods are practised by experts in the field, strong efforts are realised to make these tools as convivial as possible. For instance, the first difficulty lies in the elaboration of the system to be simulated, i.e., a protein embedded in a membrane in the presence of a solvent. Useful graphical interfaces are now available that allow the construction of a complete system. Lastly, we also introduce briefly the problem of constructing multimeric assemblies and ligand–protein complexes via docking methods. Generally, a supervised procedure is necessary because the membrane, which restrains the accessible regions for ligands, is frequently not considered. This issue is addressed in the light of a well-documented example. A rapid methodology survey is given for the different techniques, with a discussion of their capabilities and their limits. A few examples illustrate the topic with advices and recommendations.

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis

365

2. Materials A PC-computer with an internet connection is required. Most tools described below are available on web-servers and/or may be downloaded (freely). When dealing with molecular dynamics simulations, the calculations are CPU-time consuming. Therefore, a powerful computer is required in this case. Two main molecular dynamics softwares are freely available, namely, Gromacs and NAMD, and can be easily installed locally (see Note 1). They are particularly efficient on parallel architecture as developed on PC-clusters for instance. Charmm and Amber are also very popular programs, but they require an end-user licence and fee. Lastly, a graphical software is a valuable tool for exploring and analysing the 3D structure. Many programs exist, but the most popular ones in the community are undoubtedly PyMol and Vmd. Unfortunately, latest versions of Pymol are no longer available for free, and the free older versions (pre 1.0) are not maintained. Besides the program, numerous extensions developed by the community can be used via the Tk console for VMD or added through the plugin facilities for Pymol (see Note 2).

3. Methods 3.1. Tools for Bridging Modelling Structure from Sequence 3.1.1. With Available Known Homologues

Homology modelling is the most successful method for obtaining a reliable structure if a homolog exists in the Protein Data Bank. However, as a matter of fact, the number of membrane protein 3D structures solved at an atomistic level is considerably smaller compared to soluble proteins. Consequently, this approach is considerably limited to the few membrane protein families (see http:// blanco.biomol.uci.edi) available in the dataset. As an example, until recently, the G protein coupled receptor family, one of the largest membrane protein families with crucial functional properties, was represented by only one structure, the bovin rhodopsin (code PDB: 1F88 and 1HZX) (1, 2) with a rather weak resolution (2.8 Ǻ). Since, the protein was solved in different species (3, 4) and in different stages, i.e., free from ligand (5) or complexed with ligand. These last 2 years have seen the release of structures for two different GPCR members, the turkey b1-adrenergic GPCR (6) and the humanb2-adrenergic GPCR (7), and their engineered form (8, 9). Comparison of the structures of the rhodopsin and b1 adrenergic receptor clearly shows a strong similarity in the transmembrane (TM) domain (Fig. 1). Accordingly, a structural model of rhodopsin based on the structure of b1 adrenergic receptor (or reversely) might be reliable. However, this model strongly depends on the accuracy of the

366

Etchebest and Debret

Fig. 1. Superimposition of the structures of the bovine rhodopsin (pdb:1U19) in red and the b2 adrenergic receptor (pdb:2R1H). Left and middle: the aligned transmembrane core domains along two different views. Right: a view including the nonaligned regions in yellow. The T4-lysozyme region is omitted for clarity.

alignment. As an illustration, in Fig. 2, are given the sequence alignments between the two proteins, one using a powerful tool, Muscle (10, 11) for multiple sequence alignment (MSA) (see Note 3) and the other one deduced from Rapido (12), a new elegant software for structure superimposition (see Note 4). The alignments are far from being identical, but it may be noticed that in the transmembrane core, the alignments are very similar. In the present case, both alignments are difficult to obtain because the b2-adrenergic receptor is fused with T4-lysozyme. The use of a large set of homologous sequences permits generally to trap the main conserved (or similar) regions and yields an improvement of the alignment. As an advice, it is recommended to perform multiple sequences alignment for the target and for the template sequences separately and to correct the pairwise alignment between the template and the target accordingly. Information based on secondary structures and/or TM regions predictions are also valuable for improving the detection of similar regions. Numerous tools exist for the prediction of transmembrane domains mainly for a-helical bundles. Most of them rely on learning and data mining methods relying on databases dedicated to TM proteins structures (13–18). The most popular methods are based on sophisticated learning procedures such as Hidden Markov Models, Neural Networks, or Support Vectors Machine (HMMTop (19), TMHMM (20), PolyPHOBIUS (21), TMPpro (22), MEMSAT3 (23), SVMTop (24), k4HTM (25), and MemBrain (26)). Significant advances have recently been achieved, thanks to the incorporation of evolutionary information. One of the most interesting approaches based on physical principles demonstrates a success comparable with that of the best methods based on machine-learning techniques (27). This method relies on experimental data that describe the energetic of insertion into the endoplasmic reticulum membrane of a single TM segment embedded within a larger protein.

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis

367

Fig. 2. Sequence alignments between b2-adrenergic receptor (2RH1, first line) and rhodopsin (1U19, second line). (a) Alignment from MUSCLE and (b) alignment deduced from structural superimposition with RAPIDO (see Subheading 4).

Most approaches predict the number of TM segments, their limits, as well as the topology, i.e., the orientation of the protein with respect to the intra- or extra-cellular side. The prediction takes profit from a bias in the residues distribution in the loops located in the intracellular side. Indeed, it has been observed that these loops

368

Etchebest and Debret

are enriched in positive residues while the loops located on the extracellular side are depleted. This observation is known as the so-called “positive inside rule” (28). All these methods nevertheless suffer from the sparseness of the available structural data. Their performances tend to decrease when they are re-evaluated on new and larger datasets. Hence, a regular checking of the literature is recommended before selecting a method that claims to be the most accurate one. In addition, it has to be kept in mind that even with an accurate alignment, a structural homologous model requires further investigations and refinement to take into account possible distortions and/or slight displacements of the helices compared to the template structure. These differences in the transmembrane core domain, as well as the external regions, frequently nonconserved, correspond to the true signature of the sequence. They may be of considerable importance when dealing with fine recognition process and ligands specificity investigation (29). 3.1.2. Without Known Homologues

When no template structure is available or easy to detect, alternative approaches are required. These methods may be divided into two categories: threading (or fold recognition) for which a structural database is needed, and ab initio/de novo methods. In the last case, the structural modelling may be achieved by combining fragments for instance or by applying physical principles. In many cases, numerous models are constructed and sorted according to an appropriate scoring function. These approaches were initially developed and extensively tested on soluble proteins. They are currently being adapted for the prediction of transmembrane proteins. Threading requires a set of 3D structures and a scoring function that estimates the compatibility between the sequence and a 3D structure. This scoring function is frequently on the basis of statistical potentials learnt from a large databank of 3D structures. In case of membrane proteins, amino acids distribution in the transmembrane region is somewhat different and thus, statistical potentials from soluble proteins might not be appropriate. The TASSER approach, which combines threading and fragments combination, developed successfully for soluble proteins, was recently extended with specific adaptation for modelling the structure of all identified G Protein-coupled receptors in the human genome (30). The procedure was thus able to trap the seven-helices bundle topology but cannot be readily extended to other folds of membrane proteins. Similarly, a threading approach using the 3D structure of lactose permease from E. coli (LacY) was applied and permitted to detect proteins similar to LacY in prokaryotes and eukaryotes (31). Finally, when no putative template structure (evolutionary related or not) may be detected, ab initio methods are definitely required. Ab initio (or de novo) methods are supposed to be much easier to develop for transmembrane (TM)

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis

369

domains of these proteins. Indeed, it has been argued that the constraint of the membrane strongly limits the diversity of possible folds compared to soluble proteins. However, such methods have been essentially applied for the prediction of proteins mainly composed of a-helices. The protocol is generally hierarchical and consists (1) in predicting the TM helical segments, (2) in determining the relative positions of each helical axis, (3) in establishing the orientation of the helical faces (lipid vs. protein interior), (4) in positioning the side chains and the connecting loops conformations, and (5) in refining all the structure (32). Steps 1 and 3–5, even though difficult, are tractable. Step 2 is clearly the most difficult step because one needs to know which helix is close to which other one. For this purpose, experimental data, for instance, are required that can come from low-resolution electron microscopy maps (Chapter 13 this book: by Hinsen et al., “from electron microscopy maps to atomic structures using normal mode-based fitting”), and/or from any indirect biochemical or biophysical experiments, that could inform about the proximity between the helical segments. Since the pioneering work performed by Herzyk and Hubbard (33) and Baldwin (34, 35), many new protocols have been developed. However, they mainly focus on GPCR families (36–39). In these methods, the main prevailing hypotheses are roughly, the role of hydrophobicity, the role of the hydrophobic moment, and a specific “lipidic” moment that would point towards the lipid bilayer. The conservation patterns also help to orient the faces of the helices, the most conserved residues being supposed to point towards the interior of the protein (40). For instance, the LIPS server (41) (see Note 5) allows the prediction of the faces turned towards the lipid environment with a good efficiency. Finally, special care is taken for considering the specific packing forces that drive to the full folded structure. Preferred contacts in membrane protein structures were thus analysed and used to deduce statistical potentials. With their help and various rules, constraints are applied that guide the a-helix assemblies (42). Their success is still limited but quite promising because it is the only issue when no structural template exists. Most approaches consider helices as rigid bodies. Noteworthy, some methods begin to deal with local deformations of helices, which are frequently encountered in membrane proteins. Nowadays, such local deformations can be easily studied with synthetic peptides coupled to NMR studies (see above). Molecular mechanics computations can also be performed for studying the conformation of helices and per se, their putative deformations in an environment representative of the membrane properties (see next section). Lastly, interesting and promising success was obtained with Robetta method developed by Baker’s group for soluble proteins, and adapted for membrane proteins, with no need of any experimental data (43).

370

Etchebest and Debret

In summary, the recent literature shows some significant progress, but it is generally limited to few membrane protein families, mainly G coupled protein receptor for instance. Their success is high as the procedure takes advantage of experimental data to drive the conformational space exploration and to guide the finding of the 3D structure (44). An example of such an approach is described in detail in Chapter 20 in a concrete case with some advices. A systematic check of the literature is required before going further into the exploration of the modelled structure. 3.2. Tools for Exploring Function from Structure

When a 3D-structure is available, the next important step consists in exploring its functional properties. Different levels of exploration may be distinguished. The simplest one consists in analysing the structure itself and then its relation with the environment. Finally, the furthest step deals with the dynamical behaviour of the structure.

3.2.1. Structural Analysis

As in the case of soluble proteins, surface properties are useful information to understand functional properties. For instance, for receptor proteins, pockets have to be identified to propose putative binding sites. Different tools are available for determining pockets in proteins. CastP (45) is a method that provides us with different information such as the size of the pockets, the atoms that delineate the pockets and cavities, as well as the mouths of these pockets (available online, see Note 6). Naccess software developed by Hubbard and Thornton (46) is another useful tool that computes the accessibility of the residues. The accessible surface area S is generally measured by considering a probe with a 1.4 Ǻ radius i.e., a water molecule (but the value can be modified, see Note 7). The accessibility of a residue X is then defined as the ratio between the surface S and the accessible surface computed for the same residue in a tripeptide GXG, in an extended conformation. A residue is considered as buried when this ratio is smaller than 10% (25% is also an acceptable threshold for defining the burying limit). The state of the residue in the model structure may be compared with data deduced from experiments such as antigen–antibody recognition for instance. Indeed, if no important conformational rearrangement occurs during the recognition process, the residues have to be accessible before making the interaction. Accordingly, some models may be discarded or privileged. Finally, a convenient way to analyse a channel pore is to use the “Hole” approach developed by Smart et al. (47). The dimensions of the pore are obtained using a Monte Carlo simulated annealing procedure that finds the optimal pathway for a sphere with variable radius to squeeze through the channel (see Note 8).

3.2.1.1. Surface and Pockets

3.2.1.2. Helices Conformation and Relative Positioning

A first geometrical analysis is generally dedicated to the determination of the limits of the secondary structures. This is performed using for instance, DSSP (48) or Stride (49). Once the extents of

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis

371

the helices are obtained, their relative positioning may be defined with different geometrical parameters: distances between centres of mass, for instance, tilt angles or W angles that necessitate the helical axis computation (see Fig. 3). If the helix is not deformed, the calculation of the axis is obtained by diagonalisation of the inertia matrix of the structured segment defined in the previous step. The axis may be computed by considering the polypeptide backbone atoms or alternatively, the sole Ca atoms (see Note 9). Then it is possible to measure the tilt, i.e., the angle between the helical axis and the normal to the membrane plane. The main problem consists in determining this normal because the location of the membrane is only hypothesised. In many cases (but not in all cases), the principal inertia axis of the whole TM domain is aligned with this normal. Yet, it is sometimes necessary to apply transformations on the coordinates (e.g., rotations) to make the appropriate alignment. If the membrane protein is an oligomer, the membrane normal is generally considered as the rotation axis for superimposing one monomer on another one. In the absence of symmetry, the problem is much more complicated. A visual inspection and a human supervision may be sometimes necessary to retrieve the plane. The location of aromatic residues, which generally delimit the interfacial region and lie roughly in the same plane, can be a useful guide to define the membrane plane. Finally, Lomize et al. (13, 50) proposed a method that predicts, from the structure, the orientation of the protein in the membrane. It is determined by minimising the transfer energy, DGtransfer, from water to membrane interior, taking account different variables, and in a coordinate system whose axis Z coincides with the bilayer normal. The results for all membrane proteins are available online in a database (see Note 10). The calculation of the crossing W angle (also called packing angle) may be obtained using the approach defined initially by

Fig. 3. Definition of the tilt (t), azimuthal rotation (r), and crossing (W) angles.

372

Etchebest and Debret

Chothia et al. (51) or with more recent tools such as HA2 (see Note 11, (52)). The main steps consist in computing the mass centres of each helix in contact with another one, defining the vector that joins these two points, and then computing the dihedral angle defined by the two helical axes with respect to this vector. On the basis of the angle distribution in membrane proteins, significant differences were observed with packing angles in soluble proteins. However, when looking at structural homologues in soluble protein, similarities seem to exist that can be useful for deducing sequence–structure relationship (53). Besides, a recent study shows some differences between packing angles in channels and membrane bundles (also called membrane coils in the publication) (54). The determination of these parameters requires that the helices are mainly straight. Yet, in many cases, the helices are slightly deformed, curved, or even kinked. These deformations induce modifications in the local helical parameters that may deviate significantly from ideal a-helix characteristics. These parameters can be computed, for instance, with HELANAL (55, 56) that provides us with a detailed description of the helix properties: bending angles, average helical radius, twist angles, etc. (see Note 12). The Qhelix computational tool (57) complements this analysis by measuring the inter-helical angles (see Note 13). In our recent study (58), we also examined kink angles in TM helices and provided a method that permits to locate automatically the residue(s) responsible for the deformation as well as the amplitudes and the direction of the kinks (see Note 14). 3.2.2. Dynamics and Conformational Changes

The different approaches described above allow construction and analysis of model structure. They provide first elements to understand function and to delineate functional sites. Yet, in many cases, protein function is accomplished through conformational changes that range from a few angströms (e.g., side chain rotation) to tens of angströms displacements (domain motions) and extend from few picoseconds to seconds. Experimental approaches yield in most cases indirect information on the structures and the dynamics of the protein. In some cases, they can take profit from simulation and modelling techniques to interpret at an atomic scale, the data they observe (see others chapters in Parts III and IV in this volume).

3.2.2.1. Molecular Dynamics Simulations and Related Approaches

There are nowadays the most popular in silico approaches for exploring the time behaviour of a protein and its putative conformational changes with atomic details. However, the time scale that can be simulated is far from being comparable with the experiment time scale (see Chapter 21 for the concepts and a detailed discussion). For membrane proteins, the main problem relates to the lipid environment which increases considerably the

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis

373

number of atoms included in the system, and per se limits the time scale that could be simulated. A second problem lies in the representation of the environment. If experiments are conducted in a cellular medium, the environment is so complex that it can’t be truly represented in the simulations. In this way, direct comparison is quite limited and even prohibited. For in vitro experiments, the medium is simpler but not necessarily reproducible directly in the simulations. Indeed, most simulation methods require the computation of energies between chemical groups encountered in proteins, lipids, and any component participating in the system (e.g., prosthetic groups, ions, quinones, organic molecules, detergents, etc). These interaction energies are on the basis of empirical force fields with parameters issued from a fine calibration aiming at reproducing experimental data and/or sophisticated quantum mechanics computations. Nowadays, two main families of force fields are currently available. The most standard ones are founded on a full (or almost full) atomic description while the most recent ones reduce the number of particles by using a coarse-grained description (see Chapter 22). The last ones are frequently calibrated on the first ones. Most chemical groups, required for the simulation of proteins in a pure lipid bilayer, are present in the most exercised force fields (see ref. 59 for a review). For instance, the Charmm force field (revisited recently for some lipids (60–63)) contains, besides classical amino acid residues, parameters for the head groups of the major lipids. Similar lipid molecules can be approached with AMBER force field and can also be treated in Gromacs package through the parameterisation developed by Berger (see Chapter 21 for advices). However, the large diversity of lipids, or detergents used in experiments, in particular, the nature of the polar headgroup, necessitates the fine calibration of chemical groups, which are “unusual” compared to those encountered in proteins or DNA molecules. Consequently, before starting any simulation, it is necessary to verify that all the chemical groups represented in the system are available in the force field. A careful reading of the outputs of the construction of the system (topology, force field implementation) is thus required even if most packages alert the user of the lack of parameters (see Note 15). The second step consists in constructing the membrane and placing the protein in an appropriate position. Different pre-equilibrated (pure) solvated membranes (“lipid box”) are currently available and easily usable (see Note 16). It is however, generally necessary to enlarge the lipid box to accommodate a large protein such as the MscL (Mechanosensitive Channel of Large conductance) for instance. This is generally performed by duplicating the box and translating the generated copy along appropriate axes. This enlarged lipid box is afterwards equilibrated for a short period of time (few nanoseconds). Then, the protein is

374

Etchebest and Debret

placed in the centre of the resulting lipid box. The position of the protein along the normal axis of the membrane normal is the most difficult to determine. Besides experimental data, aromatic residues are a good help for determining the appropriate location along this axis. Indeed, they frequently anchor the protein at the membrane interface. In their absence, the most hydrophobic region of the membrane can be located and aligned with the region of the lipid acyl chains. Finally, the system lipids–protein is solvated. This can be done using a box of solvent (mainly water) pre-equilibrated (see Note 17). Once the system is defined, the simulation can be started, choosing appropriate conditions (for practical recommendations, see Chapter 21). Finally, analysis of different structural as well as energetic properties can be achieved along the time course of the simulations (see Subheading 3.2.1). The amplitude of motions strongly depends on the simulation time and consequently, large conformational changes are rarely observed. Essential dynamics is a convenient way to highlight relevant collective motions (64). The method is on the basis of the principal component analysis of the covariance matrix (see Note 18). Once the directions of the movement are obtained, trajectories can be generated along these privileged directions. It is therefore possible to construct new structures along a putative conformational pathway and to cross barriers using slightly larger amplitudes compared to the MD average fluctuations. Deformations along principal directions are closer to the “natural” deformation tendencies of the protein in comparison to classical steered molecular dynamics that generally require assumptions on the points to apply the forces and the directions of these forces. The main limitation in case of essential dynamics is of course to have beforehand classical MD simulations for obtaining the principal directions. In contrast, steered molecular dynamics are extremely useful for obtaining results comparable to AFM experiment results (65). Brownian dynamics (BD), in which a stochastic force replaces the explicit solvent, is a very interesting alternative for studying large systems and simulating long time processes. Few studies on membrane systems have been reported until now, but their applications are quite important (66). Besides BD simulations, other approximations may be introduced and consist mainly in simplifying the lipid environment representation. The simplifications range from coarse-grained models for lipids (see Chapter 22) to implicit membrane models. In this last case, the lipid bilayer is represented by slabs with different thickness and physical properties (67, 68). 3.2.2.2. Normal Mode Analysis

This approach, introduced in details in Chapter 13, enables to trap the main tendencies of a given structure to accomplish conformational changes. The procedure relies on the harmonic approximation of the energy surface. For a few years, crude energy models have been introduced. They do not necessitate large

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis

375

computer memory (as required with classical all-atoms force field) or significant computer time. We applied such an approach for studying the mechanical properties of MscL and obtained quite interesting results that pointed out putative regions involved in the opening process as well as the relevant directions of motions to reach an open state (see Note 19). 3.2.3. Tools for Predicting Multimeric Assemblies, Membrane Protein Complexes

In many cases, the tools for predicting multimers or more generally complexes involving protein–protein/protein–ligands interactions are on the basis of classical docking approaches similar to those used for soluble proteins (69). Docking procedures may be decomposed into different steps: the first stage of the process consists in exploring different positions of one protein (“the ligand”) with respect to the surface of the target (“the receptor”). Each position is scored with the aid of a simple energy function. At this stage, the partners are generally considered as rigid. In the second stage, a first set of optimal configurations (generally more than a thousand) is selected and refined with a more sophisticated scoring function. The flexibility of the ligand may be introduced as well as some local conformational changes in the receptor. The solutions are then ranked, and few complexes are finally considered. The docking procedures are all the more efficient that some experimental data are used to guide the exploration. In the absence of such data, predictions based on sequences features (mainly based on multiple sequence alignment) can be used. Illustrative of the application of such approaches is the recent oligomeric structure prediction of G protein coupled receptor (70). In this case, the interaction site is located inside the membrane domain while in the case of ligands, the interaction frequently occurs with regions close to or in contact with the solvent (see Note 20 and Chapter 23). As an example, we recently examined the binding sites of the DARC protein (Duffy antigen/ receptor for chemokine) supposed to interact with loops and/or termini regions (see Note 21 and Chapter 20). Once the ligand binding site is located, dedicated tools for drug design may be applied, namely screening libraries for detecting new leads. Finally, coupled with methods described above, alternative conformations may be explored to identify new putative binding sites. In this context, an interesting work, very recently published, explores direct and also indirect interactions on recognition controlled from distant sites in proteins, e.g., by changes in protein conformation and mobility. The approach, called proteochemometrics, seems to successfully predict the location of indirect effects on ligand recognition arising from distant sites in the receptors (71).

3.3. Conclusion

In summary, molecular modelling can be used to successfully construct models of membrane proteins either by complete or

376

Etchebest and Debret

partial homology or by ab initio strategies and thus, be relevant for a broad range of applications. It also explores conformational changes of these proteins either by placing atomic structure in lipid environment or by making a dynamic bridge between static view of membrane proteins gained from X-ray, EM, or NMR data. Its main limits reside in time scale reached (still far from classical enzymatic reaction (milliseconds)) and in the accuracy of the physical atomic description used (force field). However, immense progress has been made these last few years on both programs and equipments that should permit to overcome these difficulties. Finally, it is the method of choice for drug design as long as atomic structures are reliable.

4. Notes 1. NAMD may be downloaded from the URL: http://www. ks.uiuc.edu/Development/Download/download. cgi?PackageName=NAMD. A large number of pre-compiled binaries are available, for different architectures, including Windows and MacOS. In Windows environment, NAMD must be run from the MS-DOS command line. Cygwin package, a Unix-like environment for Windows, (http://www.cygwin. com/) may be installed to facilitate the usage of the program. NAMD can also be compiled from source files. Force field data, i.e., topology files and parameters, are also required. Various versions of the Charmm force field are available at http:// mackerell.umar yland.edu/CHARMM_ff_params.html. NAMD is also compatible with Amber (http://ambermd.org/ dbase.html) and X-PLOR force fields. Preparation of some input files are facilitated by the use of VMD that is, besides its graphical capacities, an excellent partner for NAMD. Gromacs is nowadays widely used in the membrane protein modellers’ community (see Chapters 21 and 22). Source codes are freely available at http://www.gromacs.org/ and pre-compiled binaries also exist for different architectures. Gromacs has become very popular because it is extremely fast and, importantly, is provided with numerous useful tools dedicated to the analysis of proteins structures, membrane characteristics, as well as many other physical properties such as diffusion coefficients, order parameter, radial distribution function, etc. Parallel computations may be performed using MPI library. The LAM (Local Area Multicomputer) package is generally used on local workstation networks.

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis

377

For some computations, the FFTW (Fast Fourier Transform) library is required. It may be downloaded at http://www. fftw.org/. It has to be installed before configuring Gromacs. Both NAMD and Gromacs necessitate a C compiler and eventually an f77 compiler. 2. For VMD, the Tk console allows extending the possibilities of computations without modifying the program itself. A library of scripts (Python or Tcl) is available at http:// www.ks.uiuc.edu/Research/vmd/script_library/. The users’ community also provides us with many useful additional tools, the main difficulty being to locate them. For Pymol, Tcl/Python scripts are made available under the Plugin menu once the files are copied in the PyMol modules/pmg_tk/startup directory. Some useful scripts can be found in the PyMol wiki (http://pymolwiki.org). 3. Muscle is available at http://www.ebi.ac.uk/Tools/muscle/ index.html. It is included in Seaview (http://pbil.univ-lyon1. fr/software/seaview.html), a graphical multiple sequence alignment editor, that also proposes ClustalW, the most popular program for multiple sequence alignment. Seaview binaries may be downloaded for different computer architectures (MacOS, Windows and Linux). The source code is freely available. 4. Rapido is a recent approach (http://webapps.embl-hamburg. de/rapido/) that permits the superimposition of 3D structures of proteins. It is able to take into account large structural changes such as hinge motions between domains. It is a very convenient way to locate rigid bodies vs. flexible regions. Besides its own Java applet (Jmol), it provides rasmol and Pymol scripts for easily visualising the results with the corresponding softwares. 5. The LIPS method is on the basis of a canonical model of the heptad repeat originally developed for coiled coils. The LIPS server (http://gila.bioengr.uic.edu/lab/larisa) requires a multiple sequence alignment for the TM domain studied. The helical domain is divided into seven surfaces. For each surface, are given a list of residues with different scores and three average values: the lipophilicity surface score, the entropy (a measure related to the conservation of a position in the multiple sequence alignment), and the LIPS score, the product of the two previous values. For the TM2-MscL, the LIPS score didn’t correctly assign the lipid-oriented face but the lipophilicity surface score was quite relevant. The entropy values strongly depend on the quality of the alignment and the diversity of the sequences used. A careful analysis of the MSA is thus recommended.

378

Etchebest and Debret

6. CastP server (http://sts-fw.bioengr.uic.edu/castp/calculation.php) uses a PDB file as input. The results are displayed through a quite convenient interface. It is possible to select a given pocket, to obtain different information such as its delimiting residues or its area and volume. If the user email is given (highly suggested), a set of files is sent, which contains all the useful results. In particular, the parameter Nmouth is indicated in the .pocInfo file. If the value is 0, it means that the pocket is a cavity fully buried in the protein. Distinction between pockets and cavities is important. In the first case, it could be a ligand binding site while in the second case (cavity), the void space could facilitate internal motions. A Pymol plugin may be downloaded (http://sts-fw.bioengr.uic.edu/ castp/pymol.php) and installed as described in Note 2. The plugin itself does not make CastP computations but enables to retrieve the results using the job identification number of a previous CastP request. 7. Naccess is available at http://www.bioinf.manchester.ac.uk/ naccess/. It can be easily installed on many platforms because the source files are given. The default probe radius is 1.4 Å (a water molecule radius). In some publications on membrane proteins, the probe radius is increased to 1.9 Å to better account for the accessible surface of atoms to CH2 groups of lipids. We developed a program in the context of Gromacs, to easily compute Naccess accessibilities along the time course of a Gromacs MD simulation. Some data for lipids have been included in the radii file. The program is available on request. We also considered a 3 Å probe radius when we tested the accessibility values for the Ca’s in a bundle. This raw probing enables to better compare the lipid accessible faces predicted from the sequences to those computed from the 3D structure. 8. The Hole program may be downloaded at (http://hole.biop. ox.ac.uk/hole). The main inputs are an initial position and a direction corresponding roughly to the channel axis (typically 0 0 1 if the channel is oriented along the Z axis). Apart from numerical results, the program can also generate some files for visualising the results with VMD. 9. The inertia axis for a given helical segment may be computed with a Python script that takes profit from the Python numerical libraries, i.e., Numpy or Scipy, that include linear algebra calculations. A part of a simple script developed by P. Poulain in our group is given below. The full script, available on request, also includes some tricks to visualise the axis with Pymol.

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis

379

10. The OPM database can be found at http://opm.phar.umich. edu/. It is not possible to make its own computation, but it is possible to send a request by mail to the authors. 11. The program HA2 can be downloaded at http://bioinfo.tau. ac.il/~sarel/HelAna.html and easily installed on a PC-Linux with Perl language programming available. However, it requires MatLab package that is not a freeware, even for academics. Similar routines could exist in the R package (freeware) and can be appropriately called in the main Perl routines. 12. Helanal (http://nucleix.mbu.iisc.ernet.in/helanal/helanal. shtml) only needs a pdb file and by default, it uses the information of the field “HELIX” in the pdb file (location and limits of the helical secondary structure), but it is possible to explicitly introduce these limits. Helanal yields much useful information about the helical geometrical parameters (twist, rise, pitch, etc.). It distinguishes linear, curved, and kinked helices. 13. QHelix (http://compbio.sookmyung.ac.kr/~qhelix/) yields interhelical axis values. The inputs are a pdb file, the position of the first residue in each helical segment, and the length of the helix. It is not possible to consider multichains complex unless some modifications are made in the pdb file, i.e., a renumbering of the residues. In contrast to Helanal, it does not consider the field HELIX as a default. A Python script may easily extract this information from the pdb file and writes in a correct way, the input for Qhelix (available on request). The angles are calculated for all pairs of helices. 14. The program, written in C Language, includes a loop running on the number of residues in the helix. First, a residue is selected along the helix and for each helical sub-domain this residue delimits, an inertia axis is computed. Then, the angle

380

Etchebest and Debret

between these two axes is calculated. The kink angle is defined as the angle with the largest value, when all the residues along the helix have been tested. The position of the kink corresponds to the residue where the angle is maximal. 15. In case of absence of parameters, it is possible to develop new parameters by transferring values from close fragments and adjusting them to reproduce experimental observables. A more rigorous approach consists in dividing the molecule of interest into small representative fragments, in performing sophisticated ab initio quantum chemistry computations, and in deducing empirical parameters, such as charges, coefficients for Lennard–Jones potentials, and torsion barriers, etc, which should reproduce the quantum computations (see Chapter 24). Then the parameters for fragments have to be combined, generally slightly modified when they are transferred to the whole molecule. This new set of parameters is then assessed on experimental measurements. Yet, it is important to keep in mind that the parameters for lipids generally are calibrated to reproduce experimental data measured on pure membranes that include one type of lipids at a time. The use of data measured on biological membranes i.e., a mixture of lipids with different polar head groups and different alkyl chains could thus be inappropriate. In the same spirit, experimental information on membranes (even pure) that accommodate a protein, are very difficult to obtain. 16. Different coordinates and topologies files for membranes preequilibrated with Gromacs software may be downloaded at Peter Tieleman’s web site (http://moose.bio.ucalgary.ca/ index.php?page=Structures_and_Topologies). Valuable information is also available from Dr. M. Karttunen’s web site (http://www.apmaths.uwo.ca/~mkarttu/downloads. shtml). In particular, a mixed membrane of POPG/POPE, as well as the parameters for a SDS micelle, is provided. The crude placement of the protein in the membrane leads to numerous overlaps with lipid atoms. In most cases, the lipids that overlap with any atoms of the proteins are removed. Roux’s group (http://thallium.bsd.uchicago.edu/RouxLab/ method.html) developed a slightly different strategy for embedding a protein in a membrane. The lipids from a preequilibrated and pre-hydrated set are randomly placed around the protein. The number of core–core overlaps between heavy atoms is reduced through convenient rotations and translations of the lipids and protein. To provide the initial positions in the membrane plane for each lipid, a simplified representation of the full lipid molecule is used. Then the packing of the particles with the protein is obtained with a MD simulation using appropriate restraints.

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis

381

17. With the genbox tool from Gromacs, water molecules are placed in any free volumes, whose size is sufficient to accommodate a water molecule. Consequently, it is sometimes necessary to remove (by hand) some water molecules located in the hydrophobic membrane domain. In case of MscL, we chose firstly to fully solvate the channel. We performed a MD simulation where the periplasmic loops and the side chains bordering the channel pore were free to move, while restraining all the TM domains of the protein. Afterwards, we placed the protein plus the equilibrated water molecules contained in the channel vestibule in a solvated membrane. 18. Principal component analysis can be performed with Gromacs tool. It is important to check that the cosine content of eigenvectors is different from one to differentiate true motions from random diffusion. The relevance of the analysis can be checked by comparing the directions computed for different parts of the simulation. If the directions are conserved whatever be the set of time frames chosen, the corresponding directions can be considered as significant. Finally, to focus on significant directions, it is also convenient to compute the contribution of each direction to the total fluctuations. In most cases, the very first directions contain the largest part of the motions. 19. A main question in NMA relates to the choice of modes for exploring a conformational change. The modes with the lowest frequencies are obviously the most appropriate ones because they are associated with the largest amplitudes of motions. Yet, in our study, we also found that the collectivity measure is a powerful index for selecting interesting and significant modes. This index evaluates the collective protein motions within a given mode or eigenvector (i.e., the number of atoms significantly affected). It is rarely used but we suggest considering it systematically. We chose the definition proposed by Bruschweiler (72). k =

 N  1 exp  − ∑ aAi2 log aAi2   i =1  N

where Ai is the amplitude of displacement and a is a normalisation factor such as ∑ aA 2 = 1. The conformational change i is maximal for a value of 1 and minimal for a value of 1/N . 20. In the case of ligands, the flexibility of the receptor may play a major role and has to be taken into account. However, most docking approaches only include flexibility at a final refinement stage. Hence, if the interaction requires complex conformational changes, the prediction could be completely misleading. New approaches aim at better introducing the

382

Etchebest and Debret

flexibility (73) but actually they do not take into account the conformational changes all along the docking process. 21. We have elaborated an original protocol for studying the DARC protein (see Chapter 20). The DARC protein is an erythrocyte seven TM-receptor that binds chemokines and also malaria parasites (Plasmodium vivax and Plasmodium knowlesi). Biological data show that the four extracellular domains (ECDs) of DARC are essential for its interaction with chemokines (CXC-L8 for instance), while the first (ECD1) is sufficient for the interaction with malaria erythrocyte-binding protein (duffy binding protein DBP). Importantly, ECD1 domain is a highly flexible 50 residues peptide with two sulfated tyrosines. When starting from a 3D structural model of DARC (44), including ECD1 in a free state structure, and using a rigid-body docking approach, we failed to find a solution compatible with all the experimental data for the binding sites of P. vivax DBP and CXC-L8. We finally combined semi-flexible and semi-rigid dockings taking into account ECD1 flexibility: (1) the flexible ECD1 domain is independently docked on a rigid ligand using the flexible docking tools of ICM (74), (2) in parallel, the rigid ligand is docked on the rigid TM domain, and (3) solutions from both steps are combined and only those compatible with bond constraints between ECD1 and TM1 are selected. This approach yields a 3D-model for DARC–CXC-L8 and DARC– DBP complexes, both consistent with available experimental data. Such models permitted us to identify and predict potential residue targets for mutagenesis, and to define the area of interaction for receptor and both ligands. References 1. Teller DC, Okada T, Behnke CA, Palczewski K, Stenkamp RE (2001) Advances in determination of a high-resolution three-dimensional structure of rhodopsin, a model of G-proteincoupled receptors (GPCRs). Biochemistry 40(26):7761–7772 2. Palczewski K, Kumasaka T, Hori T, Behnke CA, Motoshima H, Fox BA et al (2000) Crystal structure of rhodopsin: a G protein-coupled receptor. Science 289(5480):739–745 3. Murakami M, Kouyama T (2008) Crystal structure of squid rhodopsin. Nature 453(7193):363–367 4. Shimamura T, Hiraki K, Takahashi N, Hori T, Ago H, Masuda K et al (2008) Crystal structure of squid rhodopsin with intracellularly extended cytoplasmic region. J Biol Chem 283(26):17753–17756

5. Park JH, Scheerer P, Hofmann KP, Choe H, Ernst OP (2008) Crystal structure of the ligand-free G-protein-coupled receptor opsin. Nature 454(7201):183–187 6. Warne T, Serrano-Vega MJ, Baker JG, Moukhametzianov R, Edwards PC, Henderson R et al (2008) Structure of a beta1-adrenergic G-protein-coupled receptor. Nature 454(7203):486–491 7. Rasmussen SGF, Choi H, Rosenbaum DM, Kobilka TS, Thian FS, Edwards PC et al (2007) Crystal structure of the human beta2 adrenergic G-protein-coupled receptor. Nature 450(7168):383–387 8. Cherezov V, Rosenbaum DM, Hanson MA, Rasmussen SGF, Thian FS, Kobilka TS et al (2007) High-resolution crystal structure of an engineered human beta2-adrenergic G

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis

9.

10.

11.

12.

13.

14.

15.

16.

17.

18. 19. 20.

21.

protein-coupled receptor. Science 318(5854): 1258–1265 Hanson MA, Cherezov V, Griffith MT, Roth CB, Jaakola V, Chien EYT et al (2008) A specific cholesterol binding site is established by the 2.8 A structure of the human beta2adrenergic receptor. Structure 16(6): 897–905 Edgar R (2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5(1):113 Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32(5): 1792–1797 Mosca R, Schneider TR (2008) RAPIDO: a web server for the alignment of protein structures in the presence of conformational change. Nucleic Acids Res 36(Suppl 2):W42–W46 Lomize MA, Lomize AL, Pogozheva ID, Mosberg HI (2006) OPM: orientations of proteins in membranes database. Bioinformatics 22(5):623–625 Tusnády GE, Dosztányi Z, Simon I (2005) PDB_TM: selection and membrane localization of transmembrane proteins in the protein data bank. Nucleic Acids Res 33(Database issue):D275–D278 Tusnády GE, Kalmár L, Simon I (2008) TOPDB: topology data bank of transmembrane proteins. Nucleic Acids Res 36(Database issue):D234–D239 Tusnády GE, Kalmár L, Hegyi H, Tompa P, Simon I (2008) TOPDOM: database of domains and motifs with conservative location in transmembrane proteins. Bioinformatics 24(12):1469–1470 Tusnády GE, Dosztányi Z, Simon I (2004) Transmembrane proteins in the Protein Data Bank: identification and classification. Bioinformatics 20(17):2964–2972 Jayasinghe S, Hristova K, White SH (2001) MPtopo: a database of membrane protein topology. Protein Sci 10(2):455–458 Tusnády GE, Simon I (2001) The HMMTOP transmembrane topology prediction server. Bioinformatics 17(9):849–850 Krogh A, Larsson B, von Heijne G, Sonnhammer EL (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305(3):567–580 Käll L, Krogh A, Sonnhammer ELL (2005) An HMM posterior decoder for sequence feature prediction that includes homology information. Bioinformatics 21(Suppl 1):i251–i257

383

22. Ganapathiraju M, Balakrishnan N, Reddy R, Klein-Seetharaman J (2008) Transmembrane helix prediction using amino acid property features and latent semantic analysis. BMC Bioinformatics 9(Suppl 1):S4 23. Jones DT (2007) Improving the accuracy of transmembrane protein topology prediction using evolutionary information. Bioinformatics 23(5):538–544 24. Lo A, Chiu H, Sung T, Lyu P, Hsu W (2008) Enhanced membrane protein topology prediction using a hierarchical classification method and a new scoring function. J Proteome Res 7(2):487–496 25. Kitsas IK, Hadjileontiadis LJ, Panas SM (2008) Transmembrane helix prediction in proteins using hydrophobicity properties and higher-order statistics. Comput Biol Med 38(8):867–880 26. Shen H, Chou JJ (2008) MemBrain: improving the accuracy of predicting transmembrane helices. PLoS One 3(6):e2399 27. Bernsel A, Viklund H, Falk J, Lindahl E, von Heijne G, Elofsson A (2008) Prediction of membrane-protein topology from first principles. Proc Natl Acad Sci U S A 105(20): 7177–7181 28. von Heijne G, Gavel Y (1988) Topogenic signals in integral membrane proteins. Eur J Biochem 174(4):671–678 29. Kobilka B, Schertler GFX (2008) New G-protein-coupled receptor crystal structures: insights and limitations. Trends Pharmacol Sci 29(2):79–83 30. Zhang Y, Devries ME, Skolnick J (2006) Structure modeling of all identified G proteincoupled receptors in the human genome. PLoS Comput Biol 2(2):e13 31. Kasho VN, Smirnova IN, Kaback HR (2006) Sequence alignment and homology threading reveals prokaryotic and eukaryotic proteins similar to lactose permease. J Mol Biol 358(4):1060–1070 32. Etchebest C, Popot J (1997) Packing transmembrane a-helices into bundles: computational vs experimental approaches. In: von Heijne G (ed) Membrane protein assembly. Chapman & Hall, New York, pp 221–250 33. Herzyk P, Hubbard RE (1998) Combined biophysical and biochemical information confirms arrangement of transmembrane helices visible from the three-dimensional map of frog rhodopsin. J Mol Biol 281(4): 741–754 34. Baldwin JM (1993) The probable arrangement of the helices in G protein-coupled receptors. EMBO J 12(4):1693–1703

384

Etchebest and Debret

35. Baldwin JM, Schertler GF, Unger VM (1997) An alpha-carbon template for the transmembrane helices in the rhodopsin family of G-protein-coupled receptors. J Mol Biol 272(1):144–164 36. Elofsson A, von Heijne G (2007) Membrane protein structure: prediction versus reality. Annu Rev Biochem 76:125–140 37. Dastmalchi S, Church WB, Morris MB (2008) Modelling the structures of G protein-coupled receptors aided by three-dimensional validation. BMC Bioinformatics 9(Suppl 1):S14 38. Trabanino RJ, Hall SE, Vaidehi N, Floriano WB, Kam VWT, Goddard WA (2004) First principles predictions of the structure and function of G-protein-coupled receptors: validation for bovine rhodopsin. Biophys J 86(4):1904–1921 39. Vaidehi N, Floriano WB, Trabanino R, Hall SE, Freddolino P, Choi EJ et al (2002) Prediction of structure and function of G protein-coupled receptors. Proc Natl Acad Sci U S A 99(20):12622–12627 40. Park Y, Helms V (2006) Assembly of transmembrane helices of simple polytopic membrane proteins from sequence conservation patterns. Proteins 64(4):895–905 41. Adamian L, Liang J (2006) Prediction of transmembrane helix orientation in polytopic membrane proteins. BMC Struct Biol 6:13 42. McAllister SR, Floudas CA (2008) Alphahelical topology prediction and generation of distance restraints in membrane proteins. Biophys J 95(11):5281–5295 43. Yarov-Yarovoy V, Schonbrun J, Baker D (2006) Multipass membrane protein structure prediction using Rosetta. Proteins Struct Funct Bioinf 62(4):1010–1025 44. de Brevern AG, Wong H, Tournamille C, Colin Y, Le Van Kim C, Etchebest C (2005) A structural model of a seven-transmembrane helix receptor: the Duffy antigen/receptor for chemokine (DARC). Biochim Biophys Acta 1724(3):288–306 45. Binkowski TA, Naghibzadeh S, Liang J (2003) CASTp: computed atlas of surface topography of proteins. Nucleic Acids Res 31(13):3352–3355 46. Hubbard S, Thornton J (1996) Naccess V2.1.1: atomic solvent accessible area calculations. http://www.bioinf.manchester.ac.uk/ naccess/ 47. Smart OS, Neduvelil JG, Wang X, Wallace BA, Sansom MS (1996) HOLE: a program for the analysis of the pore dimensions of ion channel structural models. J Mol Graph 14(6):354–360, 376

48. Kabsch W, Sander C (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22(12):2577–2637 49. Heinig M, Frishman D (2004) STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res 32(Web server issue): W500–W502 50. Lomize AL, Pogozheva ID, Lomize MA, Mosberg HI (2006) Positioning of proteins in membranes: a computational approach. Protein Sci 15(6):1318–1333 51. Chothia C, Levitt M, Richardson D (1981) Helix to helix packing in proteins. J Mol Biol 145(1):215–250 52. Fleishman SJ, Ben-Tal N (2002) A novel scoring function for predicting the conformations of tightly packed pairs of transmembrane (alpha)-helices. J Mol Biol 321(2):363–378 53. Gimpelev M, Forrest LR, Murray D, Honig B (2004) Helical packing patterns in membrane and soluble proteins. Biophys J 87(6): 4075–4086 54. Hildebrand PW, Gunther S, Goede A, Forrest L, Frommel C, Preissner R (2008) Hydrogenbonding and packing features of membrane proteins: functional implications. Biophys J 94(6):1945–1953 55. Bansal M, Kumar S, Velavan R (2000) HELANAL: a program to characterize helix geometry in proteins. J Biomol Struct Dyn 17(5):811–819 56. Kumar S, Bansal M (1996) Structural and sequence characteristics of long alpha helices in globular proteins. Biophys J 71(3):1574–1586 57. Lee HS, Choi J, Yoon S (2007) QHELIX: a computational tool for the improved measurement of inter-helical angles in proteins. Protein J 26(8):556–561 58. Debret G, Valadié H, Stadler AM, Etchebest C (2007) New insights of membrane environment effects on MscL channel mechanics from theoretical approaches. Proteins 71(3):1183–1196 59. Guvench O, MacKerell AD (2008) Comparison of protein force fields for molecular dynamics simulations. Methods Mol Biol 443:63–88 60. Sonne J, Jensen MO, Hansen FY, Hemmingsen L, Peters GH (2007) Reparameterization of all-atom dipalmitoylphosphatidylcholine lipid parameters enables simulation of fluid bilayers at zero tension. Biophys J 92(12):4157–4167 61. Davis JE, Warren GL, Patel S (2008) Revised charge equilibration potential for liquid alkanes. J Phys Chem B 112(28):8298–8310

Critical Review of General Guidelines for Membrane Proteins Model Building and Analysis 62. Rosso L, Gould IR (2008) Structure and dynamics of phospholipid bilayers using recently developed general all-atom force fields. J Comput Chem 29(1):24–37 63. Högberg C, Nikitin AM, Lyubartsev AP (2008) Modification of the CHARMM force field for DMPC lipid bilayer. J Comput Chem 29(14):2359–2369 64. Kitao A, Go N (1999) Investigating protein dynamics in collective coordinate space. Curr Opin Struct Biol 9(2):164–169 65. Isralewitz B, Gao M, Schulten K (2001) Steered molecular dynamics and mechanical functions of proteins. Curr Opin Struct Biol 11(2):224–230 66. Haddadian EJ, Cheng MH, Coalson RD, Xu Y, Tang P (2008) In silico models for the human alpha4beta2 nicotinic acetylcholine receptor. J Phys Chem B 112(44): 13981–13990 67. Sperotto MM, May S, Baumgaertner A (2006) Modelling of proteins in membranes. Chem Phys Lipids 141(1–2):2–29

385

68. Feig M (2008) Implicit membrane models for membrane protein simulation. Methods Mol Biol 443:181–196 69. Ritchie DW (2008) Recent progress and future directions in protein–protein docking. Curr Protein Pept Sci 9(1):1–15 70. Reggio PH (2006) Computational methods in drug design: modeling G protein-coupled receptor monomers, dimers, and oligomers. AAPS J 8(2):E322–E336 71. Prusis P, Uhlén S, Petrovska R, Lapinsh M, Wikberg JES (2006) Prediction of indirect interactions in proteins. BMC Bioinformatics 7:167 72. Bruschweiler R (1995) Collective protein dynamics and nuclear spin relaxation. J Chem Phys 102(8):3396–3403 73. Andrusier N, Mashiach E, Nussinov R, Wolfson HJ (2008) Principles of flexible protein– protein docking. Proteins 73(2):271–289 74. Fernández-Recio J, Totrov M, Abagyan R (2003) ICM–DISCO docking by global energy optimization with fully flexible sidechains. Proteins 52(1):113–117

as

Chapter 20 3D Structural Models of Transmembrane Proteins Alexandre G. de Brevern Abstract Transmembrane proteins are macromolecules implicated in major biological processes and diseases. Because of their specific neighborhood, few transmembrane protein structures are currently available. The building of structural models of transmembrane proteins is a major research area. Because of the lack of available 3D structures, automatic homology modeling is not an efficient way of proposing pertinent structural models. Hence, most of the structural models of transmembrane proteins are developed through a more complex protocol that comprises the use of secondary structure prediction to complete the comparative modeling process. Then, refinement and assessment steps are performed go often to a novel comparative modeling process. Nowadays, it is also possible to take attention to the helix–helix and helix–lipid interactions, and even build quaternary structures. In all cases, the most important factor when proceeding to correct structural models is taking the experimental data into account. Key words: Transmembrane protein structures, Multiple sequence alignment, Structural models, Comparative modeling, Homology modeling, Secondary structure prediction, Helix–helix interaction, Helix–lipid interaction, Bilayer membrane, Protein docking

1. Introduction Transmembrane proteins represent ~25% of proteins coded by genomes. They are composed of two major classes: all-a, e.g., rhodopsin and all-b, e.g., outer membrane proteins. They support essential biological functions as receptors, transporters, or channels. They are embedded in lipid membrane that constitutes a very specific neighborhood. As a result of this particularity, obtaining experimental 3D transmembrane structures is difficult. The total number of transmembrane proteins in the Protein DataBank (1) is limited, comprising ~1% of available structures (2). The design of structural models becomes an important axis of research. Indeed, more than two-thirds of the marketed drugs target a transmembrane protein and 50%, specifically a GPCR (3). Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_20, © Springer Science+Business Media, LLC 2010

387

388

de Brevern

Thus, most of the time it is not possible to work with an experimental structure, and so, the 3D structural model is one of the important research fields for understanding biological mechanisms and interactions (4). We present in this chapter the classical pipeline to build 3D structural models. The most common way to propose 3D structural models is on the basis of a comparative modeling process coupled with transmembrane segment predictions. Nonetheless, the principle goes far beyond the classical homology modeling as often the target structure is not directly related to the query sequence, i.e., it is not possible to simply align the sequence of the protein queries and targets. Thus, it is an iterative process mainly based – when possible – on multiple sequence alignments, bibliographic and web researches, molecular refinement, and helix–helix and helix–lipid interaction prediction tools. Two papers can be read to have pertinent examples of such an approach (5, 6). The chosen structural models must encompass most of the biochemical features and reflect the known experimental data. They may be used to analyze functional interaction properties.

2. Materials A recent computer with an internet connection is sufficient. Numerous software programs are available on-line, but some must be locally installed. Most of these software programs can be used with the Windows Operating System (OS) while a few work only with Linux OS. Tables 1–3 summarize some of the available tools that can be used, e.g., the secondary structure and interaction prediction methods. Updated and additional links can be found at http://www. dsimb.inserm.fr/~debrevern/TM/index.html.

3. Methods The main principle of the approach is described in Fig. 1. The first step consists in the true knowledge of the protein of interest (POI). Three complementary approaches are important: multiple sequence alignment, bibliographic and on-line research, and secondary structure prediction. Then, the alignment of the POI (query) sequence with a (target) sequence of a protein with an available (template) 3D structure must be done. This information is sufficient to build protein structures; thanks to the comparative modeling approach, different structural models can be obtained. They must be refined and evaluated. Appropriate corrections must be done to the original

3D Structural Models of Transmembrane Proteins

389

Table 1 Web services and softwares Name

Purpose

Soft

Url

Protein DataBank

Database of available protein structures

On-line

http://www.rcsb.org/pdb/ home/home.do

Stephen White Laboratory

Transmembrane protein structures

On-line

http://blanco.biomol.uci.edu/ Membrane_Proteins_xtal.html

UniProt KB/ Swiss-Prot

Protein sequences

On-line

http://www.expasy.ch/sprot/

PDBTM

Information on transmembrane proteins

On-line

http://pdbtm.enzim.hu/

OMP

Orientations of proteins in membranes (OPM) database

On-line

http://opm.phar.umich.edu/

GPCRdb

Information on GPCR

On-line

http://www.gpcr.org/7tm/

PSI-BLAST

Search protein database

Linux/on-line

http://blast.ncbi.nlm.nih.gov/ Blast.cgi

Modeller

Homology modeling

Windows/Linux http://salilab.org/modeller/ modeller.html

Swiss model

Homology modeling (and other)

Windows/Linux http://swissmodel.expasy.org/ SWISS-MODEL.html

Homer

Homology modeling

On-line

http://protein.cribi.unipd.it/ Homer/

Consensus

Homology modeling

On-line

http://structure.bu.edu/ cgi-bin/consensus/consensus. cgi

Proteus 2

Homology modeling

On-line

http://wishart.biology.ualberta. ca/proteus2

HOMA

Homology modeling (and other)

On-line

http://www.nmr.cabm.rutgers. edu/HOMA/

Clustalw

Sequence alignment

All

http://www.ebi.ac.uk/Tools/ clustalw2/index.html

T-Coffee

Sequence alignment

Linux/on-line

http://www.ebi.ac.uk/Tools/tcoffee/

Muscle

Sequence alignment

Linux/on-line

http://www.drive5.com/ muscle/

rasmol

Protein visualization

Windows/Linux http://www.rasmol.org/

VMD

Protein visualization

Windows/Linux http://www.ks.uiuc.edu/ Research/vmd/

PyMol

Protein visualization

Windows/Linux http://pymol.sourceforge.net/ (continued)

390

de Brevern

Table 1 (continued) Name

Purpose

Soft

Url

WhatCheck

Structural model assessment

Linux

http://swift.cmbi.ru.nl/gv/ whatcheck/

ProCheck

Structural model assessment

Linux

http://www.biochem.ucl.ac. uk/~roman/procheck/ procheck.html

SCWRL

Side-chain replacement

Linux/on-line

http://dunbrack.fccc.edu/ SCWRL3.php http://www1.jcsg.org/prod/ scripts/scwrl/serve.cgi

Gromacs

Molecular modeling

Linux

http://www.gromacs.org/

Table 2 Transmembrane repetitive structure prediction web services Web services

Year

Url

TMbase

1993

http://www.ch.embnet.org/software/TMPRED_form. html

TopPred II

1994

http://mobyle.pasteur.fr/cgi-bin/MobylePortal/portal. py?form=toppred

TMAP

1994

http://bioinfo4.limbo.ifm.liu.se/tmap/index.html

MEMSAT

1994

http://saier-144-37.ucsd.edu/memsat.html

PredictProtein (include PhDTm)

1995

http://www.predictprotein.org/

TSEG

1998

http://www.genome.ad.jp/SIT/tsegdir/

TM-Finder

1999

http://www.ccb.sickkids.ca/tools/tmfinder/cgi-bin/ TMFinderForm.cgi

PRED-TMR 2

1999

http://athina.biol.uoa.gr/PRED-TMR2/

HMMTOP 2.0

2001

http://www.enzim.hu/hmmtop/

OrienTM

2001

http://o2.biol.uoa.gr/orienTM/

DAS-TMfilter (DAS 2.0)

2002

http://mendel.imp.ac.at/sat/DAS/DAS.html

SOSUI

2002

http://bp.nuap.nagoya-u.ac.jp/sosui/sosui_submit.html

SPLIT 4.0

2002

http://split.pmfst.hr/split/4/

THUMBUP-UMDHMM TMHP

2003

http://sparks.informatics.iupui.edu/Softwares-Services_ files/umdhmm.htm

All-a

(continued)

3D Structural Models of Transmembrane Proteins

391

Table 2 (continued) Web services

Year

Url

Phobius (new TMHMM)

2004

http://phobius.sbc.su.se/

ConPred II

2004

http://bioinfo.si.hirosaki-u.ac.jp/~ConPred2/

PRODIV-TMHMM

2004

http://www.pdc.kth.se/~hakanv/prodiv-tmhmm/

SVMtm

2004

http://ccb.imb.uq.edu.au/svmtm/svmtm_predictor. shtml

TUPS

2005

http://sparks.informatics.iupui.edu/Softwares-Services_ files/tups.htm

Localizome

2006

http://localodom.kobic.re.kr/LocaloDom/index.htm

MINNOU

2006

http://minnou.cchmc.org/

MEMSAT3

2007

http://bioinf.cs.ucl.ac.uk/memsat/

MemBrain

2008

http://chou.med.harvard.edu/bioinf/MemBrain/

BDM

2002

http://gpcr.biocomp.unibo.it/biodec/

TBBpred

2004

http://www.imtech.res.in/raghava/tbbpred/

PROFtmb

2004

http://www.rostlab.org/services/ProfTMB/index.html

BOMP

2004

http://www.bioinfo.no/tools/bomp

PRED-TMBB

2004

http://bioinformatics2.biol.uoa.gr/PRED-TMBB/ index.jsp

TMBETA-DISC

2005

http://psfs.cbrc.jp/tmbetadisc/

TMBETA-SVM

2005

http://tmbeta-svm.cbrc.jp/

TMBETA-NET

2005

http://psfs.cbrc.jp/tmbeta-net/

TMB-HMM

2006

http://bmbpcu36.leeds.ac.uk/~andy/betaBarrel/ TMB_Hunt_2/TMB_HMM.cgi

transFold

2006

http://bioinformatics.bc.edu/clotelab/transFold/

All-b

alignment until a structural model is found interesting. Then, many approaches can be performed to go deeper into the protein structural models (see the other chapters). 3.1. Sequence Analyses: Multiple Sequence Alignment and Bibliographic Researches

1. The correct sequence of the (query) POI must be found. When alternative splicing or mutants are common, careful attention must be paid to the selection of the right sequence. The expasy server (Uniprot) is a valuable web server as it is highly manually curated. 2. Related protein sequences can be found – thanks to PSIBLAST. As a first step, this search is performed using only the

392

de Brevern

Table 3 Helix–helix, helix–lipid and miscellaneous web services Web services

Year

Url

Helix analysis computational tool

2002

http://bioinfo.tau.ac.il/~sarel/HelAna.html (Not web service)

kprot

2004

Not reachable

TMLIP-H

2005

Not simply usable

ProperTM

2004

http://icb.med.cornell.edu/services/propertm/start

TMLIP-C

2005

Not simply usable

LIPS

2006

http://gila.bioengr.uic.edu/lab/larisa/lips.html

TMX

2007

http://service.bioinformatik.uni-saarland.de/tmx/ about.htm

TMRPres2D

2004

http://biophysics.biol.uoa.gr/TMRPres2D/

Archdb

2004

http://sbi.imim.es/cgi-bin/archdb/loops.pl

Triton

2008

http://ncbr.chemi.muni.cz/triton/

Helix–helix

Helix–lipid

Misc.

Protein DataBank. If one hit appears with a correct sequence identity, it must be used as the structural template. However, this is not often the case. 3. As the second step, related sequences must be found to analyze evolutionary information. The search is so done on the Uniprot and on the non-redundant databank of NCBI. As PSI-BLAST can take very redundant sequences and also select short sequences, the researcher must carefully select interesting proteins. The NCBI web service of PSI-BLAST is well conceived for such pruning schema (see Note 1). 4. PSI-BLAST has not been created to optimally align sequences, but to search for sequences related to one query sequence. The selected related proteins must then be aligned into a multiple alignment with dedicated software like Clustalw, T-Coffee, or Muscle. It allows locating conserved and non conserved regions of the protein sequences, i.e., more selective pressure can be expected to conserved regions. They are important for the protein fold/function. In contrast, nonconserved regions can be potentially more flexible. A phylogenetic approach – with a dendogram – is an efficient way to analyze the raw data (see Note 2).

3D Structural Models of Transmembrane Proteins

393

Fig. 1. Pipeline to build a structural model of a transmembrane protein structures. The protein sequence of interest is analyzed, thanks to multiple sequence analysis, bibliographic and on-line researches, and secondary structure prediction methods. Then a comparative modeling with constraints is performed. The structural models are refined and analyzed. It is an iterative process; the local modification of alignment gives new structural models which must be analyzed.

5. A bibliographic research on the POI is obvious, but it is also important to search information on related protein sequences. In the same way, a good knowledge of related protein folds is essential. For this purpose, dedicated transmembrane protein websites, e.g., Stephen White, PDBTM, and OMP, have a lot of pertinent information, both for the specialists and nonspecialists. If the protein is a GPCR, GPCRdb is also an interesting tool to obtain information of this kind of protein. 6. Biological and experimental data are essential to build pertinent structural models. Summarize the biological data in a table with the amino acid position, its kind, and the corresponding biological property(ies) (e.g., see Table 1 of (5)) (see Note 3). 3.2. Of Helices and Transmembrane Segments

1. Table 2 summarizes different and recognized transmembrane secondary prediction methods. These web services are on the basis of different approaches (hydrophobicity, sequence alignment, artificial neural networks…) and of different protein structure databanks (ranging from the oldest one to the most recent ones). The principle is to put the sequence in a form and wait for the results.

394

de Brevern

2. Results can be given in different ways ranging from the simple delimitation of the transmembrane repetitive secondary structure to the localization of extra- and intra-cellular loops and even a confidence index to the quality of the predictions (see Note 4). 3. As the number of available transmembrane protein structures is limited, the prediction rate is to be taken carefully when no homologous sequences is associated to a known structure. Summarize the results in a table (e.g., see Table 2 of (5)). The high number of different approaches allows finding average helix number and positions; they will serve to start the comparative modeling. Bibliographic data could also eliminate a method which seems inadequate for this protein, e.g., a method which misses half of the helices. In the same way, the confidence index can explain quantitatively why some regions are predicted very differently by the prediction methods. Indeed, some regions are hard-to-predict and require a special care (see Note 5). 4. An interesting point is to predict the secondary structure of the target structure, i.e., the structure of the sequence that will be aligned with the query sequence. It permits to look at the behaviors of each secondary structure prediction method. 3.3. Comparative Modeling and Other Approaches

1. Some web services able to propose structural models are available, e.g., Homer, Consensus, Proteus 2, or HOMA. As in the case of the other web services, the sequence must be put in a form. Numerous methods are not dedicated to transmembrane proteins and cannot find correct transmembrane models. Nonetheless, they must be tried as they can give some hints about the compatibility folds. 2. Comparative modeling can be done with Modeller software, which is the most known and most used homology modeling software. It uses an alignment between one query and one target sequence; the target sequence must have a known available (template) structure. The crucial part is the quality of the alignment. Concerning our purpose, it is also the converging of repetitive secondary structures. Figure 2 summarizes the principle: (a) The POI (query sequence) must be cut in transmembrane domain, i.e., the region with the transmembrane repetitive secondary structure (TRSS) and connecting loops, and the N and C termini regions. In Subheading 3.2, initial delineation of repetitive secondary structure has been performed. (b) The TRSSs of the target structure must be precisely located. (c) These TRSSs must be aligned with the corresponding TRSSs of the target structure. Same thing must be done for the loops, resulting in a global alignment of

3D Structural Models of Transmembrane Proteins

395

Fig. 2. Construction of the alignment of the transmembrane protein. The protein sequence must be cut into three parts: transmembrane domain and the N and C termini regions. (a) The predicted repetitive structures are placed on the sequence. (b) The repetitive structures are assigned on the template structure. (c) The correspondence between predicted and assigned regions must be made out. Each local alignment is optimized with dedicated software such as Clustalw. (d) The N and C termini regions can be treated as globular proteins, and specific predictions can so be made (or not). (e) A complete alignment is done corresponding to the entire protein. >P1;1F88 structureX:1F88:฀฀฀1฀:A฀:฀348฀:฀:RHODOPSIN฀:฀BOS฀TAURUS:฀฀฀฀2.80฀:-1.0฀ ----MNGTE-GP--------NFYVPFSNKTGVVRSPFEAPQYYL---A-EP WQ----FS--MLAAYMFLLIMLGFPINFLTLY-VTVQHKKLR-TP-LNYIL LNLAV-ADLF-MVFGGFTTTLYTSLHGYFVFGPTGC-NLE--GFFATLGGE IALWSLVVLAIERYVVVCKPMSNFRFGENH---AIMGVAF-TWVMALACAA PPLVGWSRYIPEGMQCSCGIDYYTPHEETNNE---SFVIYMFVVHFIIPLI VIFFCYGQLVFTVKEAAA----SATTQKAEKEVTRMVIIMVIAFLICWLPY AGVAFYI--FTHQ-----GSDFGP-----IFMTIPAFFAKTSAVYNPVIYI -MMNKQFRNCMVTTLCCGKNP--------------------* >P1;DARC_H1 sequence:DARC_H1:฀฀฀1฀:฀:฀336฀:฀:฀DARC฀HUMAN฀::฀: MGNCLHRAELSPSTENSSQLDFEDVWNSSYG-VNDSFPDGDYDANLEAAAP CHSCNLLDDSALP-FFILTSVLGILASSTVLFMLFRPLFRWQLCPGWP-VL AQLAVGSALFSIVVPVLAPG----------LGSTRSSALCSLGYCVWYGSA FA--QALLLG-------CHASLGHRLGAGQVPGLTLGLTVGIWGVAALLTL P-VT-LASGASGGL---CTLIYSTELKA----LQATHTVACLAIFVLLPLG --LFG-AK---GLKKALGMGPGPW-------------MNILWAWFIFWWPH -GVVLGLDFLVRSKLLLLSTCLAQQALDL-LLNLAEALAILHCVATPLLLA LFCHQATR-TLLPSL-----PLPEG-WSSHLDTLGSKS---*

Fig. 3. Example of protein sequence alignment usable by Modeller. The alignment was done between the sequence query DARC and the sequence of the target (structural template), the rhodopsin. The alignment file is named darc.ali.

transmembrane domain. (d) The N and C termini regions can be considered as globular. So, classical homology modeling can be done with software of Table 2. Depending on the size of and compatibility with related structures, it can be less or more complicated. (e) An entire alignment can also be done. Figure 3 shows an example of an alignment usable by Modeller

396

de Brevern

(PIR format). This example is the alignment of (target) rhodopsin with (query) DARC. 3. Modeller needs not only this alignment, but also the corresponding template structure and a simple script which summarizes all this information. The template structure needs to be strictly equivalent to the sequence of the structure, i.e., the sequence present in the PDB file. Indeed, it is classical to have not a complete equivalence of protein sequence and the resolved protein. For instance, the PDB structure of the rhodopsin (PDB code 1F88) has missing atoms within a helix. The correspondence between sequence and structure of rhodopsin is not complete. Figure 4 gives an example of a Modeller script. 4. A series of models (at least 100) must be generated. It corresponds to the a.ending in the Modeller script. The recent Modeller software use Python language (see Note 6). 5. It is often interesting to explicitly add constraints on TRSSs, i.e., to add in the script the position of TRSSs to force the conservation of repetitive structures in the generated models. In the same way, constraints on known disulfide bridges or distances between residues lead to enhanced structural models. In Fig. 4, the disulfide bond constraint has been added (class MyModel) for disulfide bonds 51–276 and 129–195 (see Note 7). #฀Homology฀modeling฀by฀the฀automodel฀class฀ from฀modeller฀import฀*฀฀฀฀฀฀฀฀฀฀฀฀฀฀#฀Load฀standard฀Modeller฀classes฀ from฀modeller.automodel฀import฀*฀฀฀฀#฀Load฀the฀automodel฀class฀ #฀Redefine฀the฀special_patches฀routine฀to฀include฀the฀additional฀disulfides฀ #฀(this฀routine฀is฀empty฀by฀default):฀ class฀MyModel(automodel):฀ ฀฀฀฀def฀special_patches(self,฀aln):฀ ฀฀฀฀฀฀฀฀#฀A฀disulfide฀between฀residues฀51฀and฀276:฀ ฀฀฀฀฀฀฀฀self.patch(residue_type='DISU',฀residues=(self.residues['51'],฀ ฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀self.residues['276']))฀ ฀฀฀฀฀฀฀฀#฀A฀disulfide฀between฀residues฀129฀and฀195:฀ ฀฀฀฀฀฀฀฀self.patch(residue_type='DISU',฀residues=(self.residues['129'],฀ ฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀self.residues['195']))฀ log.verbose()฀฀฀฀#฀request฀verbose฀output฀ env฀=฀environ()฀฀#฀create฀a฀new฀MODELLER฀environment฀to฀build฀this฀model฀in฀ #฀directories฀for฀input฀atom฀files฀ env.io.atom_files_directory฀=฀['.',฀'../atom_files']฀ a฀=฀automodel(env,฀ ฀฀฀฀฀฀฀฀฀฀฀฀฀฀alnfile฀฀=฀'darc.ali',฀฀฀฀฀#฀alignment฀filename฀ ฀฀฀฀฀฀฀฀฀฀฀฀฀฀knowns฀฀฀=฀'1F88',฀฀฀฀฀฀฀฀฀฀฀฀฀฀#฀codes฀of฀the฀templates฀ ฀฀฀฀฀฀฀฀฀฀฀฀฀฀sequence฀=฀'DARC_H1')฀฀฀฀฀฀฀฀฀฀฀#฀code฀of฀the฀target฀ a.starting_model=฀1฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀#฀index฀of฀the฀first฀model฀ a.ending_model฀฀=฀100฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀#฀index฀of฀the฀last฀model฀ ฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀#฀(determines฀how฀many฀models฀to ฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀#฀calculate)฀ a.make()฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀฀#฀do฀the฀actual฀homology฀modeling฀

Fig. 4. Example of Modeller script.

3D Structural Models of Transmembrane Proteins

3.4. Structural Models: Assessment and Refinement

397

1. Modeller or equivalent approaches are quite good to define a correct topology. However, the structural models must be refined to obtain a better geometry. Thus, a first step often consists in selecting few models from the generated ones. It is possible to only look at the objective and DOPE functions given by Modeller for each of the generated models. Nonetheless, these functions can be used carefully as they were not fully tested with transmembrane proteins. Moreover, these functions can be highly sensitive to flexible regions. For instance, the DARC protein has a long flexible region (5), namely ECD1; these functions were only underlining the different conformations of this loop (only 20% of the protein). They were not discriminating (see Note 8). 2. Side-chains must be replaced, thanks to a performing method, e.g., SCWRL. 3. A minimization of the generated structural models is also a good requirement. The most powerful software today is Gromacs (see Chapter 21 for details). 4. Then, a precise analysis of the models must be performed. Visualization of the protein structures with dedicated software, e.g., rasmol, PyMol, or VMD, is a first obligated step. 5. Recognition of different types of errors in 3D models can be done by different software, e.g., verify 3D or Prosa. They are on the basis of sequence–structure relationship statistics deduced from non-redundant databank. Moreover, no equivalent method is available for transmembrane proteins. Thus, only geometry of the structural models can be checked, e.g., atom distances, angle values… The most-used dedicated software are ProCheck and WhatCheck. They give numerous values with summaries. They can highlight part of the structural models with wrong geometry or particular residues with a strange conformation. An important point is also to look at the target structure. Indeed, the template structure can already have some local un-canonical conformation. 6. The chosen structural models must encompass most of the biochemical features and reflect the known experimental data. If experimental data have shown that some residues are accessible, they must be found accessible in the selected models. Otherwise, the alignment must be corrected and the process of comparative modeling and analyzes done again. For instance, the building of DARC (5) has needed ten consecutive rounds with manual corrections of the alignment (see Note 9). 7. Other information can be used (see the sections below).

3.5. Helix–Helix Interactions

1. Interaction between helices are well-described for globular proteins while for transmembrane some interesting works have been published. To have an idea of the interaction zones

398

de Brevern

between helices is a major advantage in proposing pertinent structural models. Table 3 shows the available methods of helix–helix and helix–lipid interactions. 2. A very elegant research (7) has shown the possibility to compute the compatibility between two helices and deduced rules. Specific researches have quantified these compatibility statistics. In this limited field, kPROT is the most renowned approach. Nonetheless, it is not sometimes reachable. Other approaches are not fully automated and so researchers need a strong background in bioinformatics to deal with this kind of approach. 3.6. Helix–Lipid Interactions

1. These interactions are also essential for the protein folding and biological functions. It was less analyzed than helix–helix interactions, but nowadays, the number of available methods is higher than for helix–helix interactions. Often, e.g., LIPS, they also provide hints to regions of interaction with other helices. 2. For LIPS, a multiple sequence alignment must be given, but the alignment concerns only one helix and must not contain a gap. So, important further work is required to prepare the data. 3. As a result, the web server does not give a single result, but the quantification of each possible face (7). Thus, a careful research of correspondence between these predictions and the selected structural models can be performed to improve the quality of the models.

3.7. Of Loops and Mutants

1. Specific research can be done on extra- and intra-cellular loops. It is especially interesting in the case of GPCRs. Indeed, it is a common hypothesis that the seven transmembrane helices have a strong conservation among their entire folds. This hypothesis is strongly supported by biochemical, biophysical, and biological experiments. Hence, a major divergence between the GPCRs is the conformation of loops. A database like ArchDB can be used to select (manually) some alternative potential conformations. Then, building of an alternative conformation can be generated, thanks to comparative modeling. The simplest approach is to use the selected models as the query and the template. In the target sequence, place gaps at the position of the loops. In the example of Fig. 3, it could correspond to putting two times the DARC_H1 sequence, one as the query sequence, one as the target sequence, and putting the obtained structural models as the template. In the target sequence, put gaps at the position of the desired loop. Add a new sequence of the length of query sequence, but only with gaps. Add also the name of this sequence in the Modeller script (line sequence).

3D Structural Models of Transmembrane Proteins

399

At the position, change the gaps by the sequence of the loop. In the same way, add its PDB file into the script (to the line knows). It is possible to add constraints to all the rest of the protein. Molecular dynamics (or a simulated annealing) can help to analyze the flexibility of the loops (see Chapter 21). Recently, a novel class (MyLoop) in Modeller has been proposed to refine the loops; it is on the basis of the optimization of Modeller’s DOPE function. 2. Another potentiality of in silico building of transmembrane protein structural models is the proposition of “supervised” mutants. As the final selected models reflect the biological data, an analysis of the surface underlines the important positions. It is possible to mutate in silico the residues (a specific class exist in Modeller) and analyze a possible consequence of this mutation. A simple idea is to observe the electrostatic properties of the protein. Electrostatic potential mapped on the molecular surface can be coarsely done with Swiss-PdbViewer (aka DeepView) from SwissModel. It allows ranking of the mutation to be proposed. 3. Quaternary structures can be computed in silico. Two approaches exist: (a) from a known available complex or (b) by docking. For the first possibility, it consists – roughly – in the comparative/homology modeling of the different partners. The hypothesis is that the interaction regions are re-conserved and so, must also be in the structural model complex. The second case is most common. If the protein complex is an homomer with a known symmetry, it is a constraint that can be directly used during the comparative modeling approach. Otherwise, each partner must be built and docking approach must be used. As for the building of one protein structural model, the building of a complete quaternary structure is greatly enhanced if biological data are included.

4. Notes 1. For the multiple sequence alignment, it is important to control the redundancy of the data as the length of the sequences. (a) A cluster of highly redundant sequences is not informative and bias the alignment analysis. (b) PSI-BLAST often keep small sequence fragments; they are not pertinent in the case of a protein multiple sequence alignment. 2. With a pertinent multiple sequence alignment, it is also possible to locate important residues in related sequences. These positions can be used as biological constraints, the most important ones.

400

de Brevern

3. Summarize the data in tables, for the prediction and for the important residues. Note that in a mutation abolishing a binding, an interaction is not always present directly at the binding site, but it can also be a crucial residue implicated in the maintaining of the fold (important is not always accessible). 4. The prediction index is a great tool to analyze the difficulty of prediction. Some regions are clearly repetitive structures embedded in the bilayer or loop swimming in water while others are complicated. These latter must be carefully checked at each step of the process. 5. As the number of available protein structure is quite low, it is very important to analyze properly the known folds and the rules (8) that govern this kind of protein folds. However, do not blindly follow the common features of transmembrane proteins. For instance, Tryptophan is known to be preferred at the lipid− water interface, but often it is not the case. 6. Multiple structural models must be developed to find at least a pertinent one. In the same way, the testing of alternative alignment is an essential task. Another interesting approach is to predict also structural models of related proteins. In the same way, if one structural model of a related protein is available, it can be used as a structural template. 7. Proline and kinks in repetitive structures are always a problem. For instance, rhodopsin has three kinked helices and the K+ channel one as essential features. 8. N and C termini regions of the transmembrane protein can be long fragments. In this case, it is possible to use classical methods available on the web, e.g., Proteus 2, and use the results as novel template only for one part of the template (see Subheading 3.7). In the same way, the use of multiple templates is an excellent tool to sample the potential conformations of the structural models. 9. As presented in this chapter, it must be noticed that the building of a pertinent transmembrane protein structural model is mainly a manual approach and that the most important constraint is the biological (experimental) one.

Acknowledgments This work was supported by French National Institute for Blood Transfusion (INTS) and French Institute for Health and Medical Care (INSERM) and University Paris 7 – Denis Diderot.

3D Structural Models of Transmembrane Proteins

401

References 1. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The Protein Data Bank. Nucleic Acids Res 28:235–242 2. Fleishman SJ, Unger VM, Ben-Tal N (2006) Transmembrane protein structures without X-rays. Trends Biochem Sci 31: 106–113 3. Klabunde T, Hessler G (2002) Drug design strategies for targeting G-protein-coupled receptors. Chembiochem 3:928–944 4. Radestock S, Weil T, Renner S (2008) Homology model-based virtual screening for GPCR ligands using docking and targetbiased scoring. J Chem Inf Model 48: 1104–1117

5. de Brevern AG, Wong H, Tournamille C, Colin Y, Le Van Kim C, Etchebest C (2005) A structural model of a seven-transmembrane helix receptor: the Duffy antigen/receptor for chemokine (DARC). Biochim Biophys Acta 1724:288–306 6. Hazai E, Bikadi Z (2008) Homology modeling of breast cancer resistance protein (ABCG2). J Struct Biol 162:63–74 7. Fleishman SJ, Ben-Tal N (2002) A novel scoring function for predicting the conformations of tightly packed pairs of transmembrane alpha-helices. J Mol Biol 321:363–378 8. von Heijne G (2007) The membrane protein universe: what’s out there and why bother? J Intern Med 261:543–557

as

Chapter 21 Molecular Dynamics of Membrane Peptides and Proteins: Principles and Comparison to Experimental Data Patrick F.J. Fuchs Abstract Molecular dynamics (MD) simulation is a standard tool used to assess the motion of biomolecules at atomic resolution. It requires a so-called “force field” that allows the evaluation of an empirical energy from the 3D coordinates of the atoms in the system. In this chapter, the application of MD simulations to membrane proteins and peptides is described with a particular emphasis on the comparison of MD results to experimental data. Such a comparison can be used either for (1) validating the results of a simulation, (2) interpreting an experiment at the atomic level, or (3) calibrating the force field. This last step is particularly important for the use of MD as a predictive tool. As an illustration, a comparison of 2H NMR experiments to MD simulations of a transmembrane peptide is presented and discussed. Key words: Molecular dynamics, Membrane protein, Membrane peptide, Comparison to experimental data, WALP23, 2H NMR

1. Introduction Molecular dynamics (MD) simulations have become a powerful tool for assessing the motion of biomolecules at atomic or molecular resolution (1–3). Using an empirical energy function, the basic principle of MD is to integrate Newton’s equations, giving thus a collection of coordinates of all the particles of the system (e.g., a membrane protein within a bilayer of lipids surrounded by water molecules) as a function of time (i.e., a trajectory). With the explosion of computer capabilities as well as improvements of computing methodologies and algorithms, it is now possible to simulate systems of hundreds of thousands of atoms over tens (sometimes hundreds) of nanoseconds (this aspect is further developed in the next chapter). MD applied to membrane objects (peptides and proteins) has started in the 1990s (4) and is now Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_21, © Springer Science+Business Media, LLC 2010

403

404

Fuchs

routinely used with an increasing number of studies (reviewed, for example, in refs. 5, 6 for membrane proteins). Most of the time, MD simulations complement one (or many) experimental work(s) (7, 8), allowing an atomic/molecular description of the process under study, such as a specific function of a membrane protein (e.g., channeling of ions across the membrane, signal transduction, etc.), the binding to the membrane of a peptide (e.g., binding of an antimicrobial peptide). This comparison is not always easy given the variety of information experiments can bring, such as functional data (e.g., effect of mutations on the protein activity) or biophysical data (e.g., orientations of chemical bonds derived from nuclear magnetic resonance). The approach also works the other way round, that is, MD can suggest new experiments by bringing a molecular description of the problem. In this chapter, I briefly describe the tools used for performing MD simulations specifically on membrane proteins/peptides and emphasize the strategies, allowing a sound comparison of simulations to experimental data. As a recent issue of Methods in Molecular Biology (Vol. 443) was entirely devoted to computer simulations, I just recall to the reader the basics of MD and then concentrate on the comparison between experimental data and simulations.

2. Theory Even if we manipulate “virtual” objects in the field of computer simulations (usually referred to as in silico), I briefly overview here what is needed to perform an MD simulation in terms of hardware, software, and other tools/algorithms/concepts. 2.1. Computers

The basic need when one wants to perform an MD simulation is a suitable computer or access to a computing facility. This can be (1) a single workstation, (2) a cluster of computers (many computers working together), or (3) a bunch of Central Processing Units (CPUs) on a super computer (see Note 1). Point (1) is the easiest way of performing an MD; it can even be done on a home computer (under the appropriate operating system, usually Unix or Unix-like). Current 2 GHz (gigahertz) computers with 64 bits multicore CPUs (of course, the higher the number of CPUs, the faster the simulation, thus the longer the trajectory one can simulate), such as Intel or AMD, work very well. Solution (2) is on the one hand more expensive and requires more complicated hardware and software administration than (1), but on the other hand, it gives really much more computing power. Solution (3) needs to write a (small) scientific project to be granted some hours of computation on a super computer. A list of the top 500 world

Molecular Dynamics of Membrane Peptides and Proteins

405

super computers is available at the following URL: http://www. top500.org/. Last but not least, MD simulations can generate many and very large files (to store the trajectories) and thus require a large amount of storage (at least a few hundreds of GigaBytes, even more depending on how big the system is, and how long the trajectory one wants to simulate). 2.2. Force Field

MD simulations require a so-called “force field” in order to be performed. A force field is a set of parameters to which a functional form of the potential energy is associated (9–12). Basically, it allows the calculation of the (empirical) energy of a molecular system from its 3D coordinates, as well as the forces acting on each atom using the first derivative of the energy with respect to the coordinates. The most commonly used force fields for simulations of peptides and proteins are AMBER (13), CHARMM (14), GROMOS (15), and OPLS (16) see e.g., refs. 3, 9–13 for more details). Simulations of membrane peptides/proteins within their native environment also require the use of a force field for phospholipids. Currently, two main force fields are available, the set of Berger (17) and that of CHARMM (18) recently reparameterized (19) (see Note 2). Berger lipids work with an old version of GROMOS (20) for the protein and have been modified in order to work with OPLS (21). A discussion on how to combine those two force fields has been recently published (22). Obviously, CHARMM lipids (18, 19) work with the CHARMM force field for proteins (14). Recently, another alternative became available with the development of phospholipid parameters compatible with the AMBER force field (23). Last, we need a model to simulate water. The two standard models are SPC (24) (working with GROMOS and Berger lipids) and TIP3 (25) (working with CHARMM, AMBER, and OPLS). Note that the use of the lipid force fields described above implies that we simulate explicitly their presence, that is, we construct the coordinates of each one of them and calculate explicitly the microscopic interactions between them and the protein/peptide; the same applies for water. The other alternative is to use an implicit representation of the membrane (26) where this latter is treated as a continuum (as well as for water). This has the advantage to speed up the calculation time, but the main drawbacks are the overestimated kinetics and the loss of microscopic details of the interactions between the protein/peptide with lipids and water. In the following, I focus only on explicit simulations, which are nowadays the standard approach for membrane systems.

2.3. Software

The most widely used software for MD of membrane peptides/ proteins are GROMACS (27) and NAMD (28). GROMACS natively implements the GROMOS force field and the Berger lipids have been imported there (29), whereas NAMD natively

406

Fuchs

implements the CHARMM force field. The advantage of these two programs is the high speed of the code and the efficiency of the parallelization (the efficiency to run the program on several CPUs) (see Note 3). There are some other alternatives such as CHARMM or AMBER, which are programs that implement their own force field, but they are generally slower than GROMACS or NAMD. 2.4. Protein/Peptide 3D Structure

MD simulations of a membrane protein or peptide of course require the 3D structure of the object one wants to embed in the bilayer. For membrane proteins, the first source is the protein data bank (30); Stephen White’s lab also maintains an up to date list of membrane proteins of known 3D structure (31). When no 3D structure of the protein of interest exists, it is possible to build a (homology) model based on a homolog which shares sufficient sequence identity (this topic is largely developed in the previous chapter). For peptides, it is generally easier as they are most of the time alpha-helical. One has just to take care of the partitioning of the peptide within the bilayer, that is, whether it is a transmembrane (that spans across the bilayer) or interfacial (that lies at the interface between the phospholipids and the aqueous phase) peptide. The initial structure can be easily constructed with the appropriate f/y angles of an a-helix using structure generators such as Pymol (32, 33), Swiss-Pdb viewer (34, 35), or some web services like Basic Builder hosted on the RPBS platform (36, 37).

2.5. Preparation of the System

Various strategies have been developed to embed the protein/ peptide within the bilayer (38–42). In the 1990s, the first membrane protein (or peptide) simulations used tricky strategies to shorten as much as possible the equilibration phase (4). Nowadays, with the speed of current computers and available MD codes, the fastest strategy just requires a pre-equilibrated bilayer of phospholipids, in which one can make a hole (slightly larger than the protein) (38) and/or adapt the lipids position around the protein (42). Then, an appropriate equilibration of a few nanoseconds (usually maintaining the protein fixed by position restraints) with pressure coupling will shrink the box and pack the lipids against the protein. Most of the time, this step still requires a manual input even if efforts are underway to help build protein/peptide/ membrane systems (40, 41).

2.6. Simulation Parameters

Once the object is embedded in the bilayer, it is time to run the MD using the appropriate algorithms and parameters. Two issues are of critical importance when dealing with membrane simulations, the way of controlling the pressure of the system and computing electrostatics (43). Nowadays, membrane simulations

Molecular Dynamics of Membrane Peptides and Proteins

407

are most of the time run under the so-called “NPT” ensemble, that is, the number of molecules (N) is fixed, and we couple the system to a thermostat and barostat to get the temperature (T) and pressure (P) constant, respectively; the other alternative is the use of the NPgT ensemble, where g stands for constant surface tension (see Note 2). Coupling the system to a pressure bath basically needs to scale the dimensions of the box (x,y,z), which can be done using the same scaling factor for all directions (isotropic coupling), the same scaling factor for x and y directions but a different one for the z direction (semi-isotropic coupling), or independently in all directions (anisotropic coupling). In all these possibilities, it is recommended that semi-isotropic coupling (see Note 4) be used when simulating lamellar systems. One popular algorithm for applying pressure coupling is the one of Berendsen (44) (see Note 5). The other important issue in membrane protein simulation is the way of computing electrostatics when evaluating the energy of the system. Nonbonded interactions (including van der Waals and electrostatics) represent the most expensive computational burden (see Note 3). To alleviate the computing effort, various strategies were created in the early time of MD in the 1970s and improved so far. They generally rely on the so-called cutoff. This latter stands for an atom–atom distance beyond which the interaction is considered equal to 0; since both electrostatics and van der Waals are inverse power functions (of the distance between atoms), they quickly tend to 0 at long distances. Although this approximation can be acceptable for simple systems, it has been demonstrated to cause numerous artifacts on highly charged systems such as ionic solutions. It leads to a wrong radial distribution of the ions, which tend to be separated by the cutoff distance while they should not (45). To circumvent this problem, two main techniques/algorithms are used in biomolecular simulations, the particle-mesh-Ewald (PME) (46, 47) and the reaction field (48). On pure phospholipids, the use of cutoff gives the wrong area per lipid; they indeed tend to be too packed, thus the membrane thickness is overestimated (49, 50). For simulations of membrane proteins (or peptides) within a bilayer of phospholipids, the use of cutoff has been shown to not affect directly the protein nor the water, but only the phospholipids (51). Nonetheless, it is still highly recommended not to use cutoff schemes if one wants to avoid artifacts due to a wrong thickness/area per lipid, which may cause conformational consequences due to different matching/mismatching conditions of the environment around the protein (or peptide). Currently, PME is the most correct and most used technique for computing electrostatics on membrane systems (49, 50) (see Note 6).

408

Fuchs

3. Comparison of MD Simulations to Experimental Data

3.1. General Considerations on the Comparison of MD to Experiments

In this section, I first highlight some issues to consider when one wants to compare an MD trajectory with experiments, notably sampling. I then briefly describe and discuss the practical aspects of comparing a simulation to both biochemical and biophysical data. I then give a complete example of such a comparison in a study of the orientation of a helical peptide within a phospholipid bilayer. Once the production run has been done, it is time to analyze all the relevant data one can extract from the trajectory(ies). Sooner or later, the results will have to be compared to experimental data, either to validate the simulation or to interpret the data at atomic/ molecular level (7, 8, 43, 52). This is not an easy task. The main reason of this comes from a difference of scale, both in time and space, of many orders of magnitude. The first issue is the quantity of molecules. During an experiment in the real world, one deals with a number of molecules in the sample that is on the order of the Avogadro number (~6.1023 molecules), whereas the simulation box represents a microscopic view of the system (103–105 molecules), with a single molecule of solute if we consider for example a membrane protein within a bilayer (surrounded by water). One of the drawbacks is that we neglect the interactions between the multiple copies of the object (dimerization, aggregation…). The second issue is the time scale. A typical experiment generally ranges from microseconds to seconds depending on the technique; we thus make the approximation that the quantity of interest averages in the same way during the full time of the experiment than in the simulated time. Consequently, we have to ensure that the simulation time is greater (at least of one order of magnitude) than the correlation time of the measured quantity, in order for this quantity to relax (see below). Some of the experimental results are even harder to predict since most of the biologically relevant motions take place on longer time scales (2, 53). However, there is an overlap that begins to arise between experiments and simulations (around the microsecond) (2), notably using coarse-grained models (54) (see next chapter for more considerations on this topic). What links both experiments (macroscopic) and simulations (microscopic) is statistical mechanics (see ref. 55 for a brief introduction, and all references therein), that is, in the limit of infinite sampling of simulations, macroscopic (bulk) properties can be explained by the individual interactions between the atoms of the system (thus on a microscopic description of the system). Of course, there is a first inherent limitation to this, since we sample a system in a simulation over a finite time (a few tens to

Molecular Dynamics of Membrane Peptides and Proteins

409

hundreds of nanoseconds with current computers). The ergodic hypothesis states that averaging a property over an infinite time is equivalent to averaging over an appropriate statistical ensemble of configurations of the system. Such an ensemble is most of the time impossible to obtain on complex biological macromolecules using standard MD simulations. At least, one has to make sure that the property of interest is at equilibrium by plotting its autocorrelation function (43); usually, an exponential decay is observed, which allows the extraction of the relaxation time of the observable. One has then to verify that the simulated time is (at least) greater than one order of magnitude of the relaxation time. It allows one to get good statistics to calculate a reliable average of the observable. If a given event is observed once in a simulation, it is highly recommended to run many clones of the same simulation (at least three using different initial conditions). It enhances the sampling and ensures that this event of interest does not depend on the initial conditions and thus occurs by chance. If the event is very rare, it might never occur in a classical MD simulation because the free energy barrier associated to it is too high. In such a case, free energy techniques are adapted to force the system to go from one state to another or along a given reaction coordinate (e.g., the position of a small solute along the main axis of a channel) even if a significant free energy barrier is present (55). Last, one limitation to bear in mind before starting any simulation is that it cannot reproduce the experimental complexity of the biological system under study (medium, molecules around, etc.). One spectacular example is the constitution of biological membranes, which contain several different types of lipid (phospholipids, sphingolipids, cholesterol, etc.) and are sometimes asymmetric such as in red blood cell membranes (56); simulations can hardly model this complexity, and we usually simplify the model by taking into account only the most represented lipid. 3.2. Comparison of MD to Biochemical Data

There is a great variety of information that can be extracted from biochemical experiments, which can give useful insights such as (structural) stability, ability to bind some ligands or to accomplish a given function (e.g., catalysis), etc. Usually, the experimentalist will observe the consequence of a given change in the “normal” system on a given observable. The purpose of this chapter is not to enumerate exhaustively all the possible experiments and consequences, but we can cite a few of them, e.g., the mutation of one (or a few) residue(s) according to the wild type protein, addition of a product/cofactor/inhibitor/drug, modification of a physical condition (temperature, pressure, ionic force), etc. For instance, if some parameters are available, it is possible to make a simulation of a protein with another molecule added to the system that has been shown experimentally to perturb the protein. As a response

410

Fuchs

to the change of condition, the experimentalist will often observe a qualitative or quantitative variation of the activity of the protein such as ligand binding, conductance of ions, ability to transmit the signal across the membrane etc. This can also be at higher level, such as the cellular level. Of course MD simulations cannot reproduce the complexity of the experiment but might be able to give some clues on the early molecular event(s) generated by the change of condition. For example, in the case of mutagenesis data, it is trivial to mutate in silico a residue and assess the effect of the mutation on the structure by running simulations of the wild type and of the mutant. One useful modeling technique for the study of mutation is the evaluation in silico of the free energy of a (bio)chemical process, for example, the variation of the free energy of binding of a ligand to a receptor between the wild type protein and one of its mutants; Chipot gives an example of such a calculation applied to the binding of a ligand to a GPCR (55). I now give two examples taken from recent literature that illustrate the comparison of experiments to MD simulations. The first deals with a very recent work on the SecYE translocon (57), which is involved in the translocation of proteins across the endoplasmic reticulum membrane. The authors could solve the crystal structure of an open state of SecYE by cocrystallizing the protein with an antibody. Additionally, they performed disulfide mapping experiments, which suggested that the closed form is more stable when nothing is bound to the protein, whereas the open form occurs when a ligand is bound. To assess this hypothesis, they ran an MD simulation of 100 ns starting from the open state (with nothing bound to it) embedded in the lipid bilayer. The MD confirmed the hypothesis since the protein reached the closed state over the 100 ns. This interesting example shows the synergy between MD and a typical biochemical experiment. A second example deals with the permeation of aquaporins and aquaglyceroporins (58). These latter allow an efficient and selective permeation of small solutes across the membrane. In this work, the authors studied their selectivity by computer simulations, using umbrella sampling (59). This technique permits the evaluation of the potential of mean force (PMF) along a chosen reaction coordinate, giving an estimation of the free energy barrier. In this case, the PMF was evaluated along the z position of the solute within the pore and gave the free energy barrier encountered by the solute when passing through the porin. Interestingly, the authors could evaluate that ammonia crosses a barrier of 12.5 kJ/mol, which is in line with permeation experiments. Additionally, this work allowed the authors to identify which region in the pore acts as a selectivity filter. In this example, the experiment validated the simulation, and this latter gave meaningful atomic/molecular details. More generally, PMF is

Molecular Dynamics of Membrane Peptides and Proteins

411

now a standard computational tool to examine permeation through protein channels and is often compared to binding and conductance experiments (60). For more examples on the comparison of experiments with MD simulations, one can read the following two references. Corry and Martinac showed how the computational studies complemented some experimental techniques (EPR, FRET spectroscopy, Patch Clamp) on the comprehension of bacterial mechanosensitive channels (MscL and MscS) (61). Dodson et al. give some more examples of successful comparisons between MD simulations and experiments that help understand the underlying biological mechanism under study (1). 3.3. Comparison of MD to Biophysical Data

I consider the case of biophysical data separated from biochemical data because of the nature of information that can be injected into or compared to the simulations. Usually, biophysical data can lead to fairly precise structural observable(s) and is thus very useful for e.g., lipid force field calibration (e.g., area per lipid from SAXS or NMR), or can very precisely be compared to simulation results. Moreover, it is often possible (and desirable) to back-calculate directly the experimental observable and compare it directly with the simulations results (e.g., a quadrupolar splitting evaluated from the orientation of a bond vector, see next section for an example). Nonetheless, comparing MD simulations to biophysical data is very challenging because of the difference of time scale. Usually, experiments based on NMR or X-ray and neutron scattering (SAXS, WAXS, SANS…) give access to useful observables. The main difficulty is that we generally have no idea of the shape of the distribution of these observables (8) as the experimental measurement gives a unique value (or set of values), which represents an average over the time scale of the experiment. Different distributions may be interpreted in different ways and lead to confusions. In the next section, I present an example that illustrates this problem; I notably emphasize the need to take into account the dynamics of the observable during the time course of the experiment, in order to interpret the experimental data at the structural point of view. Last, it is important to critically assess the source of experimental data, especially the limits of the technique of measurement and the conditions in which it has been measured. For instance, Nagle described in 2000 the variety of published values of the area per lipid of DPPC in a fluid phase (62), which was greater than 15 Å2. Knowing that lipid force fields are sensitive to differences of 1 to a few Å2, one has to do a critical choice of the source of experimental data before establishing any comparison. I now give a few examples of biophysical techniques that give useful information on pure lipid systems, which can directly be compared to simulations. From NMR 13C and 2H spin-lattice

412

Fuchs

relaxation experiments, it is possible to derive the orientation and dynamics of the C–H bond along the aliphatic chains (63, 64). From the simulation, it just requires the computation of the correlation function describing the orientation of a chosen C–H bond vector along the aliphatic chain of the lipid. This is then possible to evaluate the spectral density function from the Fourier transform of the correlation function and compare it directly to experiment (43). Lateral self-diffusion of lipids can be probed by fluorescence recovery after photobleaching (FRAP). From simulations, diffusion is usually evaluated using the mean square displacement (MSD) and the Einstein relation. Before evaluating the MSD, Wohlert and Edholm recommend that the motion of each leaflet relative to the other (as well as the motion of each leaflet relative to the solvent), which are artifacts due to the finite size of a simulation, be subtracted (64). They also propose a fitting procedure to extract two diffusion coefficients from the simulation: one for the fast diffusion of lipids at short time (3.0

10−7–10−5

Beta-sheet formation

>2.0

10−6–10−3

Local denaturation

0.5–1.0

10−5–101

Protein aggregation

>3.0

10−5–104

electrons in a molecule directly and focus on the dynamics of entire atoms, QM simulations concentrate on the dynamics of electrons and chemical bonds. This includes electronic distributions, excitations events, transition states, or the formation and breaking of bonds in chemical reactions. 2.2. Quantum Mechanics

Both MM and QM simulations on a molecular scale describe atoms as point masses. However, while MM simulations resort to an empirically derived energy function to model the electronic contributions, QM simulations consider some or all electrons in the system explicitly, using terms of spatial probability distributions to describe each electron by its wave function. This is done by solving or rather approximating the Schrödinger equation, for which an exact solution is only possible for the hydrogen atom. More complex systems are described through a linear combination of many hydrogen orbitals (6, 7). The approximation quality employed to solve the Schrödinger equation determines the range of system size and simulation time accessible by QM techniques. If a low level of approximation quality is used, one time-step can be performed to describe the electronic structure of a small protein, like crambin or a hemoglobin monomer. On a high level of approximation quality, characteristics such as bond lengths, vibrations, and absorption spectra can be computed from a triatomic system with an accuracy exceeding that of any direct experimental measurements today (8). While QM can also be used to derive the forces acting on each atom and thus predicting atomic motions – as in Born–Oppenheimer MD or Car–Parrinello MD (CPMD) (9) – the method’s main focus is on the high-detail description of the nature of chemical

426

Kandt and Monticelli

bonds, which does not necessarily require a time scale aspect. QM methods are also a crucial means in parameterizing new models for MM simulations. Solving the Schrödinger equation can be done in a timedependent or time-independent way. Both ways can be applied to compute a system’s dynamics over time as the terms of time dependency refer only to the way the wave function is used: in the time-dependent variant, the wave function is built only once at the beginning of the simulation and is later on propagated through time, whereas in the time-independent approach, the wave function is built several times throughout a simulation in a predefined interval of computation steps (10). Both approaches can be carried out using different sets of simulation techniques. Ab initio QM solves the Schrödinger equation without simplifying any of the mathematical operations necessary to compute the wave function, whereas semi-empirical QM employs mathematical simplifications and uses predetermined parameters in the calculation of the wave function interactions. In terms of complexity, computational cost and accuracy density functional theory (DFT) can be considered intermediate between ab intio and semi empirical QM. It also follows a different strategy to describe electronic structure by using terms of electron densities instead of electronic wave functions (11). Of these techniques, ab initio allows for the greatest accuracy but on the other hand is limited to the smallest system sizes of tens of atoms. DFT methods are currently among the most popular QM methods. Computationally less expensive than ab initio, they still retain a good accuracy on a number of systems of biological interest, like metalloenzymes (12), that is, systems up to hundreds of atoms. Very recently, it has become feasible to calculate whole proteins by means of DFT. More approximate semi-empirical methods can be used to model even larger systems, but can be rather inaccurate in more difficult cases such as the calculation of reaction energies. Ab initio, DFT, and semi-empirical techniques can be combined with molecular mechanical simulations in so-called hybrid methods, where part of the system of interest is treated using a QM method and the rest is treated using MM (typically with a molecular dynamics approach). Review articles on QM/ MM methods can be found in (12, 13). The QM region would normally include only the active site of the enzyme and the substrate, while the MM region would consist of the rest of the protein and the environment around it (membrane and solvent). QM/MM methods were introduced by Warshel and Levitt in 1976 (14) and, since then, have become increasingly popular in theoretical studies of enzymatic catalysis. The major advantage of QM/MM methods compared to plain QM methods lies in the description of the chemical environment around the site where the reaction takes place.

Membrane Protein Dynamics from Femtoseconds to Seconds

427

2.3. Molecular Mechanics

Molecular mechanics (MM) refers to a large number of simulation methods in which molecular systems are modeled using simple, empirical potential energy functions. In molecular mechanics, atoms are considered as point masses moving in an effective energy field, commonly referred to as force field. Electrons are not considered explicitly, instead their effect is accounted for indirectly. Force fields usually describe molecules in terms of elastic springs, bond lengths, bond angles, torsion angles, and point charges (although simpler forms are also used in some cases). MM includes a very diverse range of simulation techniques. Some of them – such as energy minimization, normal mode analysis, or Monte Carlo simulations – do not rely on equations of motion and therefore do not provide direct information on the dynamics of the system. In other methods – for example, Langevin dynamics (LD), Brownian dynamics (BD), dissipative particle dynamics (DPD), and molecular dynamics (MD) – the positions and momenta of all the atoms (or particles) are calculated as a function of time by integrating an equation of motion in discrete time steps. The result is referred to as “trajectory” and, for biomolecular systems, it typically consists of millions of steps (15) (i.e., nanoseconds of simulation time). The technique that is most widely used for the description of membrane protein dynamics is MD (15, 16). It is still rare that the entire time scale of the process of interest can be covered by MD simulations. However, the dynamics of motion, even on a relatively short time scale, can reveal trends of conformational or energetic changes that provide insight into protein function. Next to classical atomistic force fields, where each atom is treated explicitly, simplified representations can also be used, in which groups of atoms are represented by single interaction sites (beads). Such coarse-grained (CG) approaches are often used in combination with elastic network models, DPD and MD. Coarsegrained models were initially developed in the 1970s (17) and then applied mainly to lipids, surfactants, polymers, and proteins. Numerous CG representations have been developed for membrane proteins, reviewed in (18–21). Dynamics in CG simulations is often faster compared to the atomistic case, and has to be interpreted with care (22). Coarse-grained MD simulations provide a means toward mesoscale simulations, where system sizes on a magnitude of 106 particles and simulation times of microseconds are feasible. As the representation of the system’s electronic configuration does not change in the course of a molecular mechanics simulation, chemical reactions cannot be simulated directly (see Fig. 1).

2.4. Limitations of Computer Simulations

In principle, all properties of a system that depend on positions and velocities of the atoms can be calculated from a trajectory, using the tools of statistical mechanics. In practice, limitations are present in any simulation, related mainly to two aspects: (1)

428

Kandt and Monticelli

Fig. 1. While QM simulations concentrate on electron distributions and the dynamics of chemical bonds (a), molecular dynamics simulations do not consider the electrons directly and focus on the dynamics of entire atoms and molecules (b, c). Atomistic MD simulations consider all atoms in the system whereas in coarse-grained MD simulations groups of atoms are condensed into effective interaction sites (c).

the time and length scale of the simulation and (2) the accuracy in the calculation of the interactions. The limited size and time scale of the simulations is often referred to as the “sampling problem”: in order to sample correctly motions that take place on a certain time scale, the simulations are required to be at least as long as the time scale of the phenomena, and typically much longer. Moreover, the size of the modeled systems should be larger than the length scale of the motion one wants to characterize. As a consequence, it is still difficult to use computer simulations to study transformations and motions that take place on time scales beyond the microsecond and lengths beyond tens of nanometers. Unfortunately, this involves many biologically interesting phenomena. Techniques are available to overcome some of these problems by accelerating molecular motions and transformations. The development of these techniques is a very active area in the field of molecular simulations. In addition to the sampling problem, computer simulations are also limited by the accuracy of the calculations of the interactions between atoms. This problem is particularly important in MM simulations, which are based on force fields. Force fields are simplified descriptions of the interactions between atoms and contain a number of approximations. Comparison with experiments is normally used both in the development and in the validation of the force field. Once a certain force field has been shown to reproduce realistically some quantities experimentally measured, we can expect it to perform reasonably well in predicting properties that have not been measured, as long as the simulation conditions are similar. 2.5. Applications

In this section, we will discuss recent examples of QM and MM computer simulations of membrane proteins investigating biological phenomena occurring on a time scale of femtoseconds

Membrane Protein Dynamics from Femtoseconds to Seconds

429

to milliseconds. As the field of molecular modeling of membrane proteins is growing rapidly, giving a complete overview is not possible in the framework of this chapter. We will therefore limit our presentation to a few selected examples published in the last few years. Further literature reviewing computer simulations of membrane proteins can be found in references (15, 23–26). 2.5.1. Femtosecond to Picosecond Dynamics: Quantum Mechanics Simulations

Combined QM/MM methods have been widely used to address questions on the mechanism of specific enzymatic reactions and on the general principles of enzymatic catalysis, much debated in recent years. Most studies published so far using QM/MM techniques deal with the dynamics of water-soluble enzymes, while fewer studies have been published on membrane proteins. One example from the recent literature is about the KcsA potassium channel. Potassium channels are membrane proteins responsible for the transmission of electrical pulses in the nervous system (for a more detailed discussion of ion channels, see the next section). They conduct K+ ions at near diffusion limit (108 ions s−1 for each channel) while not allowing the conduction of other monovalent ions. Although the crystal structure of a few K+ channels has been solved (27–31), questions still remain on the origin of the high selectivity, as well as about the mechanism of opening and closing of the pore. The region responsible for the high selectivity of K+ channels is know as the “selectivity filter” of the protein. In KcsA, two ionizable residues are found close to the selectivity filter. In order to explore the molecular determinants of ion channel selectivity using computer simulations, it is necessary to model accurately the selectivity filter and the adjacent region, including the protonation state of all residues. The group of Rothlisberger used CPMD to investigate the KcsA potassium channel, focusing on the electronic structure of the selectivity filter, including polarization effects, charge transfer (32), and the protonation state of specific residues (33). It was found that two acidic residues in the proximity of the selectivity filter of KcsA, namely Glu71 and Asp80, share one proton. The proton exchange occurs on the picosecond time scale, a phenomenon that cannot be observed experimentally nor through classical MD simulations. In combination with experiments, the theoretical investigation suggested that the occupancy and structure of the ion selectivity filter, as well as ion translocation, depend on the protonation state of these residues.

2.5.2. Picosecond to Microsecond Dynamics: Molecular Mechanics Simulations

G protein-coupled receptors (GPCRs) form the largest known protein superfamily. Its members are of critical importance in a wide range of eukaryotic signaling processes (34). Sensing molecules or other stimuli outside the cell, GPCRs activate signal transduction cascades based on the second messenger mechanism. To date, the structures of two GPCRs are known at atomic resolution: the beta2 adrenergic receptor (35, 36) and rhodopsin (37). All the recent simulation studies have focused on rhodopsin, which

2.5.2.1. G Protein-Coupled Receptors

430

Kandt and Monticelli

is the primary light receptor in animal visual systems. Like in the photosynthetic bacteriorhodopsin (38, 39), retinal acts as a molecular light sensor, undergoing a cis to trans transition in its polyene chain within 200 fs upon light absorption (40, 41), which ultimately induces the closure or cGMP-gated ion channels within micro to milliseconds after the light stimulus (42, 43). Simulation studies of rhodopsin have focused on how lightinduced changes in the retinal chromophor propagate through the protein (44–48). They also investigated protein–lipid interactions (49, 50) and rhodopsin oligomerization behavior (51, 52). To determine which residues couple the changes in retinal configuration to the changes in the cytoplasmic transducin binding site, Kong and Karplus computed the correlation of residue–residue and residue–retinal interaction energies in multi-copy MD simulations of rhodopsin with and without simulated chromophor isomerization (48). Although the simulated time of 2–3 ns per run is too short compared to the microsecond scale of the actual structural changes, the authors did observe changes in residue interaction energy preceding the actual conformational transitions, based on which potential mutagenesis candidates were identified and cross-linking experiments proposed. MartinezMayorga et al. focused on the retinal micro-environment and how different protonation states affect the chromophor dynamics after light isomerization (46). Two opposing functional models have been proposed regarding the protonation state of two glutamates in the retinal binding pocket. According to the first model, Glu-181 would be protonated prior to isomerization but deprotonated after the isomerization (proton transfer to Glu-113) (53); the second model proposes that both glutamates remain deprotonated (54). To understand which functional model is more likely, rhodopsin was simulated in a bilayer environment for 1,500 and 2,000 ns after retinal isomerization at two different protonation states. From the trajectories, solid state ²H NMR spectra of the retinal methyl groups were computed and compared to experimental data. The simulated spectra were found in remarkable agreement with the second model. 25.2.2. Transport Across the Membrane: Transporters and Channels

Biological membranes are largely impermeable to ions and polar molecules. To achieve controlled permeability for such compounds, living cells employ two types of membrane proteins: channels and transporters. While channels facilitate substrate diffusion down its electrochemical gradient (passive transport), transporters draw from a source of energy – such as light, ATP hydrolysis, or concentration gradient – to drive thermodynamically uphill transport of the substrate against its electrochemical gradient (active transport). Transporters can be classified as either uniporters – transporting one substrate in one direction, symporters – two substrates in the same direction or antiporters – two

Membrane Protein Dynamics from Femtoseconds to Seconds

431

substrates in opposite directions. Numerous simulation studies of transporters were performed in the last few years on uniporters (55, 56), antiporters (57–59), and symporters (60). 2.5.2.3. Transporters

ABC transporters are uniporters found in all forms of life. Powered by ATP hydrolysis, they transport a broad range of substrates across the membrane, ranging from small molecules such as ions, sugars, and amino acids to larger compounds such as pharmaceutics, lipids, and oligopeptides (61, 62). In order to understand the mechanism of functioning of ABC transporters, Sonne et al. combined elastic network normal mode analysis (63) with a series of steered MD simulations on the vitamin B12 importer BtuCD in a POPE bilayer (56). Using the motor domains of another transporter trapped in the presence and absence of nucleotide as a template, the BtuCD motor domains were pushed together and pulled apart while monitoring the conformational response in the trans-membrane domains. Both simulation techniques found the same trends in conformational response, clearly supporting one functional model while contradicting the other. The favored model has later been confirmed by further ABC transporter crystal structures (62). A critical component for Escherichia coli to maintain cellular salt and pH homeostasis is the Na+/H+ antiporter NhaA (64). Though the crystal structure of the protein was determined in an inactive and thus sodium-free state, two aspartates located halfway through the protein had been proposed as Na+ binding sites based on earlier mutagenesis experiments (65). As the protonation state of these two aspartates was unknown, Arkin et al. simulated all four possible protonation scenarios, with NahA embedded in a POPE bilayer for 12–100 ns each (57). Monitoring water accessibility from each side of the membrane and sodium diffusion behavior after initial placement next to one of the aspartates, the authors were able to deduct a complete Na+/H+ antiporting cycle for NhaA.

2.5.2.4. Channels

Self-assembling in the host cell membrane into a heptameric pore, alpha hemolysin is a bacterial toxin that causes cell death by introducing unregulated membrane channels in the target membrane (66). Patch clamp experiments have shown that various solutes (with sizes up to 1,300 nucleotides single DNA or RNA) can translocate through the alpha hemolysin channel by applying a transmembrane electric potential (67). Wells and coworkers chose this channel to introduce a new method of steered MD that makes it possible to simulate in tens of nanoseconds transport events that naturally occur on a millisecond time scale (68). Applying a transmembrane electric potential acting on the solute only, the authors were able to simulate DNA and peptide translocation through alpha hemolysin. The simulations reproduced a number

432

Kandt and Monticelli

of experimental results, including ratios of DNA translocation velocities and amplitude of the applied potential. OpcA is an outer membrane channel in Neisseria meningitides – a major pathogen for inflammatory diseases such as meningtitis and septicaemia – and a key player in facilitating cell adhesion and the internalization of the bacterium within the host cells. Two crystal structures of the protein have been determined, revealing substantially different conformations in two of the extracellular loop regions in a region of the protein implicated in proteoglycan binding (69, 70). Luan et al. performed 20 ns MD simulation of each crystal structure, embedded in a POPC bilayer and in the unit cell of their respective crystal lattice (71). They could show that the different loop conformations arose from the different crystal packings, and that within the membrane environment the X-ray structures showed large conformational changes. These findings were combined into a new model, based on parts of the X-ray structures that were least affected by crystal contacts. The new model, subjected to another 20 ns simulation, proved to be more stable than the original crystal structures. 2.5.2.5. Ion Channels

Ion channels act as extremely selective gateways, allowing specific ions to pass in and out of the cell in response to various signals. They are fundamental for physiological processes such as the formation and transduction of nerve impulses, muscle contraction, and osmoregulation. Recent computational studies included computer simulations of potassium channels, mechanosensitive, and ligand-gated ion channels. The first X-ray crystal structure of an ion channel was determined for the potassium channel KcsA in 1998 (27). While KcsA has a high selectivity for potassium, the structurally similar NaK channel is able to conduct both K+ and Na+ ions (72). Noskov and Roux investigated the physical foundation for ion selectivity in these channels using a combination of simple dynamic models of a cation surrounded by different ligand components – such as carbonyl functions or water – as well as free energy calculations based on atomistic MD simulations of NaK embedded in a DPPC bilayer (73). They found the selectivity of the central ion binding sites in KcsA and NaK to be largely controlled by the hydration of the cation inside the channel. Oxygen atoms lining the ion pore were found to play an important role in this, having a different effect on ion selectivity, depending on whether an oxygen is donated by a carbonyl group or a water molecule. A central ion binding site dominated by carbonyl functions – as in KcsA – is found K+-selective, whereas the binding site in NaK is not, as there the cation is liganded predominantly by water. Being slightly wider in this region, the ion pore in NaK does not remove the ion’s hydration shell as efficiently as KcsA does.

Membrane Protein Dynamics from Femtoseconds to Seconds

433

2.5.3. Microsecond Dynamics: Coarse-Grained Simulations

A wide range of biologically interesting phenomena occurs on time scales that are currently out of reach for atomistic models. Vesicle fusion, self-assembly of large protein complexes, and signal transduction take place on microsecond or longer time scales. Simplification of the model is required to simulate these motions. In recent years, the use of coarse-grained models has become increasingly popular, and a large number of membrane protein systems has been investigated. Many different approaches have been developed to study lipid–protein interactions, reviewed in (19–21). It is impossible to give a broad overview that will do justice to any of these simulations. Some CG simulation studies of membrane peptides and proteins are selected here for a brief discussion.

2.5.3.1. Hydrophobic Mismatch

Membrane proteins normally present large hydrophobic surfaces in contact with lipid membranes. To minimize exposure of hydrophobic residues to the water environment, it is necessary that the hydrophobic thickness of the lipids surrounding the protein matches the hydrophobic thickness of the protein. It has been proposed that hydrophobic matching has an important role in several fundamental processes in cell membranes, e.g., in the secretory pathway in the Golgi, in lipid sorting around membrane proteins and in sequestering proteins with long transmembrane regions into lipid rafts (sphingolipid and cholesterol rich membrane patches). Hydrophobic mismatch has been proposed to result in protein-induced bilayer deformations, lipid-induced protein tilting, and aggregation. For many years, coarse-grained simulations have been used to explore the consequences of hydrophobic mismatch in proteincontaining membranes. Using Monte Carlo CG simulations, Sperotto and Mouritsen found that the perturbation of the bilayer decays exponentially with the distance from the protein (74). More detailed CG models were used by Smit and coworkers in DPD simulations to describe the effects of hydrophobic mismatch as a function of the size of the protein (75, 76). Small transmembrane peptides were predicted to display much larger tilt angles compared to larger proteins. These results were recently confirmed by experimental investigations by Marsh and coworkers (77).

2.5.3.2. Protein Aggregation and ProteinMembrane Self-Assembly

Among the numerous CG models recently published, the one by Marrink and coworkers (coined the MARTINI force field (22, 78, 79)) has been used to study protein aggregation and lipid– protein interactions. Based on the current state of knowledge, most membrane proteins appear to be oligomeric (80). Self-assembly of membrane proteins probably plays a role in sorting of membrane components. Rhodopsin, the light receptor involved in vision, also seems to form dimers in native membranes, according to recent experimental evidence (81). Periole used the MARTINI model to

434

Kandt and Monticelli

study the aggregation of rhodopsin in model membranes (51). The simulated systems contained up to 16 copies of the GPCR and 1,600 lipids, in addition to water, and were simulated for up to 8 ms. Spontaneous protein aggregation was observed to be dependent on the thickness of the lipid membrane, with hydrophobic mismatch promoting aggregation, in agreement with theoretical models. The bilayer was found to adapt to the local hydrophobic thickness of the protein, and the persistence length of the thickness alterations was 1–2 nm. Protein aggregation was found to proceed via a multi-stage mechanism, with the formation of an encounter complex followed by a rearrangement leading to the fully bound state. Shape complementarity was suggested to play an important role in the second stage of the formation of the complex. Mechanosensitive channels are membrane proteins that open in response to tension in cell membranes. They are involved in various physiological processes, like touch, hearing, and osmoregulation. When the tension in the membrane reaches a threshold value (gating tension), the channel opens a large pore with a diameter up to 4 nm for Eco-MscL (82). The threshold value can change in proteins from different organisms, and the time scale for the gating process is normally between milliseconds and seconds. The crystal structure of the mechanosensitive channel Tb-MscL, solved a few years ago (83), has been used by several groups as a starting point to simulate the gating mechanism and to predict the channel structure in the open state. Since the time scales of the biological process are not within reach of molecular simulation techniques, different methods have been used to speed up the conformational change. Marrink and coworkers used a CG force field to simulate the gating process of Tb-MscL and a mutant (84). In their simulations, high negative pressures were applied in the membrane plane in order to promote the channel opening. Both proteins were found to open in response to increasing membrane tension with an iris-like mechanism. The protein conformational changes were observed on the microsecond time scale, and the Tb-MscL mutant showed a more pronounced expansion of the pore, in good agreement with experimental measures. Folding and self-assembly of membrane proteins are fundamental problems in modern biology. For simple helical peptides, folding can take place on time scales as short as 10–100 ns and has been simulated in atomistic detail (85–87). For proteins including multiple helical regions or beta-sheets and combinations of different secondary structure elements, the folding process requires time scales of microseconds and beyond (88). The mechanism of insertion of proteins into membranes is thought to be different for helical and beta-sheet membrane proteins: for the helical ones, transmembrane helices can insert independently into

Membrane Protein Dynamics from Femtoseconds to Seconds

435

the membrane and then self-assemble (89), while for beta-barrels folding and insertion would occur at the same time (90). In order to explore this issue, Bond and Sansom simulated the insertion of glycophorin (alpha helical) and OmpA (beta-barrel) in detergent micelles and in lipid bilayers (91). Multiple CG simulations were carried out starting from random distributions of lipids and proteins. Lipids were observed to self-assemble spontaneously into bilayers both in the presence and in the absence of proteins (91). In the case of glycophorin, interfacial partitioning of the peptide was followed by spontaneous insertion into the bilayer and dimerization. This mechanism is consistent with the much debated two-state model of membrane protein folding. In the case of OmpA, the protein was quickly surrounded by lipids during the formation of the bilayer so that once the bilayer was formed the protein was already inserted in it. Comparison with atomistic simulations (92) showed good agreement in terms of lipid–protein interactions and tilt of the beta barrel relative to the bilayer normal. These results show that CG simulations can be used to insert a folded membrane protein into a lipid bilayer, and can therefore, provide starting structures for atomistic simulations to explore finer details of protein dynamics. 2.5.4. Beyond the Microsecond

Two factors contribute to the continuous increase in the time scale accessible to molecular simulations: the progress in computer technology and the development of new simulation techniques. In the following paragraph, we will briefly discuss some of the methodological advances that might allow us in the near future to explore protein dynamics on time scales well beyond the microsecond. The success of CG simulations paves the way to new developments in computer simulations of membrane proteins, with multiscale approaches among the most promising. In multiscale approaches, different resolutions can be applied in sequence or at the same time. The first case is more simple, since there is no interaction between descriptions at different levels of resolution. The second is more difficult to implement but also potentially more powerful, in that it allows to explore protein dynamics with the accuracy of all-atoms representations on length and time scales typical of coarse-grained representation. Different methodologies have recently been developed to simulate biological macromolecules simultaneously at different length and time scales. In the first approach, part of the system is treated with an atomistic representation and part with a less detailed model. This includes methodologies in which an atomistic protein is embedded in a continuum membrane (93, 94) and methods in which some molecules are described with atomistic detail and others at the coarse-grained level (95–97). In principle, different resolutions could also be used on different

436

Kandt and Monticelli

parts of the same molecule, like in the QM–MM approach. A large gain in the simulation speed is achieved through the reduction of degrees of freedom in the “less interesting” parts of the system, while higher accuracy is maintained in the “more interesting” parts. Another approach, developed by Lyman and Zuckerman, is based on the resolution exchange method: simulations are performed at the same time at different resolutions, from all-atoms to increasingly coarse-grained, and an exchange is attempted at discrete time intervals (98, 99). Energy barriers for large conformational changes are crossed easily in simulations running at lower resolution, while canonical sampling is achieved at the atomistic level. In all these approaches, the main difficulty lies in modeling correctly the interaction between components with different resolution. While it is difficult to quantify the sampling efficiency and the speed-up in the protein dynamics, multi-scale techniques allow us to explore conformational transformations that would take extremely long times, presently out of reach for traditional molecular dynamics methods. 2.6. Concluding Remarks

Computer simulations have become increasingly important tools in structural biology and particularly in the field of membrane protein studies. They are used both as an aid in the interpretation of experimental data and in the prediction of structure and dynamics. Depending on the details of the problem of interest, and particularly on the length scale and the time scale of the phenomena, a vast range of techniques can be employed, ranging from quantum mechanics to mesoscale approaches. In the present work, we have reviewed the basic methodological aspects of some of the most common simulation techniques, and described a few examples of their application to real-life problems. While current techniques allow us to explore membrane protein dynamics on time scales up to tens of microseconds, progress in computer technology and the ongoing development of multi-scale approaches bear the possibility to extend simulations beyond this limit in the near future.

Acknowledgments We thank Frank Wennmohs and Emppu Salonen for their fruitful discussions. This work is supported by the Academy of Finland and its Center of Excellence, and by the Ministerium für Innovation, Wissenschaft, Forschung und Technologie des Landes Nordrhein-Westfalen. CK is a junior research group leader funded by the NRW Rückkehrerprogramm.

Membrane Protein Dynamics from Femtoseconds to Seconds

437

References 1. Veldhuizen EJ, Haagsman HP (2000) Role of pulmonary surfactant components in surface film formation and dynamics. Biochim Biophys Acta 1467:255–270 2. Bernstein FC, Koetzle TF, Williams GJ et al (1977) The Protein Data Bank: a computerbased archival file for macromolecular structures. J Mol Biol 112:535–542 3. Berman HM, Westbrook J, Feng Z et al (2000) The Protein Data Bank. Nucleic Acids Res 28:235–242 4. Berman H, Henrick K, Nakamura H (2003) Announcing the worldwide Protein Data Bank. Nat Struct Biol 10:980 5. White SH (2004) The progress of membrane protein structure determination. Protein Sci 13:1948–1949 6. Leach AR (2001) Molecular modelling: principles and applications. Pearson Education, Harlow 7. Pauling L, Wilson EBJ (1985) Introduction to quantum mechanics with applications to chemistry. Courier Dover, New York 8. Neese F (2003) An improvement of the resolution of the identity approximation for the formation of the Coulomb matrix. J Comput Chem 24:1740–1747 9. Car R, Parrinello M (1985) Unified approach for molecular dynamics and density-functional theory. Phys Rev Lett 55:2471–2474 10. Messiah A (1963) Quantum mechanics. Wiley, New York 11. Jensen F (2006) Introduction to computational chemistry. Wiley, New York 12. Mulholland AJ (2007) Chemical accuracy in QM/MM calculations on enzyme-catalysed reactions. Chem Cent J 1:19 13. Mulholland AJ (2005) Modelling enzyme reaction mechanisms, specificity and catalysis. Drug Discov Today 10:1393–1402 14. Warshel A, Levitt M (1976) Theoretical studies of enzymic reactions: dielectric, electrostatic and steric stabilization of the carbonium ion in the reaction of lysozyme. J Mol Biol 103:227–249 15. Kandt C, Ash WL, Tieleman DP (2007) Setting up and running molecular dynamics simulations of membrane proteins. Methods 41:475–488 16. Tieleman DP, Marrink SJ, Berendsen HJC (1997) A computer perspective of membranes: molecular dynamics studies of lipid bilayer systems. Biochim Biophys Acta 1331:235–270

17. Levitt M, Warshel A (1975) Computer simulation of protein folding. Nature 253: 694–698 18. Shillcock JC, Lipowsky R (2006) The computational route from bilayer membranes to vesicle fusion. J Phys Condens Matter 18:S1191–S1219 19. Sperotto MM, May S, Baumgaertner A (2006) Modelling of proteins in membranes. Chem Phys Lipids 141:2–29 20. Venturoli M, Sperotto MM, Kranenburg M, Smit B (2006) Mesoscopic models of biological membranes. Phys Rep 437:1–54 21. Müller M, Katsov K, Schick M (2006) Biological and synthetic membranes: what can be learned from a coarse-grained description? Phys Rep 434:113–176 22. Monticelli L, Kandasamy SK, Periole X, Larson RG, Tieleman DP, Marrink SJ (2008) The MARTINI coarse-grained force field: Extension to proteins. J Chem Theory Comput 4:819–834 23. Grossfield A, Feller SE, Pitman MC (2007) Convergence of molecular dynamics simulations of membrane proteins. Proteins 67:31–40 24. Gumbart J, Wang Y, Aksimentiev A, Tajkhorshid E, Schulten K (2005) Molecular dynamics simulations of proteins in lipid bilayers. Curr Opin Struct Biol 15:423–431 25. Kandt C, Mátyus E, Tieleman DP (2008) Protein lipid interactions from a molecular dynamics simulation point of view. In: Nag K (ed) Structure & dynamics of membranous interfaces. Wiley-Interscience, Hoboken, NJ, pp 267–282 26. Lindahl E, Sansom MS (2008) Membrane proteins: molecular dynamics simulations. Curr Opin Struct Biol 18:425–431 27. Doyle DA, Morais Cabral J, Pfuetzner RA et al (1998) The structure of the potassium channel: molecular basis of K+ conduction and selectivity. Science 280:69–77 28. Jiang YX, Lee A, Chen JY, Cadene M, Chait BT, MacKinnon R (2002) Crystal structure and mechanism of a calcium-gated potassium channel. Nature 417:515–522 29. Jiang YX, Lee A, Chen JY et al (2003) X-ray structure of a voltage-dependent K+ channel. Nature 423:33–41 30. Kuo AL, Gulbis JM, Antcliff JF et al (2003) Crystal structure of the potassium channel KirBac1.1 in the closed state. Science 300:1922–1926

438

Kandt and Monticelli

31. Long SB, Campbell EB, MacKinnon R (2005) Crystal structure of a mammalian voltagedependent Shaker family K+ channel. Science 309:897–903 32. Bucher D, Raugei S, Guidoni L et al (2006) Polarization effects and charge transfer in the KcsA potassium channel. Biophys Chem 124:292–301 33. Bucher D, Guidoni L, Rothlisberger U (2007) The protonation state of the Glu-71/Asp-80 residues in the KcsA potassium channel: A first-principles QM/MM molecular dynamics study. Biophys J 93:2315–2324 34. Gether U (2000) Uncovering molecular mechanisms involved in activation of G protein-coupled receptors. Endocr Rev 21:90–113 35. Cherezov V, Rosenbaum DM, Hanson MA et al (2007) High-resolution crystal structure of an engineered human beta2-adrenergic G protein-coupled receptor. Science 318:1258–1265 36. Rasmussen SGF, Choi H-J, Rosenbaum DM et al (2007) Crystal structure of the human beta2 adrenergic G-protein-coupled receptor. Nature 450:383–387 37. Luecke H, Schobert B, Lanyi JK, Spudich EN, Spudich JL (2001) Crystal structure of sensory rhodopsin II at 2.4 angstroms: insights into color tuning and transducer interaction. Science 293:1499–1503 38. Kandt C, Gerwert K, Schlitter J (2005) Water dynamics simulation as a tool for probing proton transfer pathways in a heptahelical membrane protein. Proteins 58:528–537 39. Kandt C, Schlitter J, Gerwert K (2004) Dynamics of water molecules in the bacteriorhodopsin trimer in explicit lipid/water environment. Biophys J 86:705–717 40. Zgrabli G, Voãtchovsky K, Kindermann M, Haacke S, Chergui M (2005) Ultrafast excited state dynamics of the protonated Schiff base of all-trans retinal in solvents. Biophys J 88:2779–2788 41. Schenkl S, van Mourik F, van der Zwan G, Haacke S, Chergui M (2005) Probing the ultrafast charge translocation of photoexcited retinal in bacteriorhodopsin. Science 309:917–920 42. Okada T, Ernst OP, Palczewski K, Hofmann KP (2001) Activation of rhodopsin: new insights from structural and biochemical studies. Trends Biochem Sci 26:318–324 43. Meng EC, Bourne HR (2001) Receptor activation: what does the rhodopsin structure tell us? Trends Pharmacol Sci 22:587–593

44. Lemaitre V, Yeagle P, Watts A (2005) Molecular dynamics simulations of retinal in rhodopsin: from the dark-adapted state towards lumirhodopsin. Biochemistry 44:12667–12680 45. Isin B, Rader AJ, Dhiman HK, KleinSeetharaman J, Bahar I (2006) Predisposition of the dark state of rhodopsin to functional changes in structure. Proteins 65:970–983 46. Martinez-Mayorga K, Pitman MC, Grossfield A, Feller SE, Brown MF (2006) Retinal counterion switch mechanism in vision evaluated by molecular simulations. J Am Chem Soc 128:16502–16503 47. Crozier PS, Stevens MJ, Woolf TB (2007) How a small change in retinal leads to G-protein activation: initial events suggested by molecular dynamics calculations. Proteins 66:559–574 48. Kong Y, Karplus M (2007) The signaling pathway of rhodopsin. Structure 15:611–623 49. Grossfield A, Feller SE, Pitman MC (2006) A role for direct interactions in the modulation of rhodopsin by omega-3 polyunsaturated lipids. Proc Natl Acad Sci U S A 103:4888–4893 50. Cordomi A, Perez JJ (2007) Molecular dynamics simulations of rhodopsin in different one-component lipid bilayers. J Phys Chem B 111:7052–7063 51. Periole X, Huber T, Marrink SJ, Sakmar TP (2007) G protein-coupled receptors selfassemble in dynamics simulations of model bilayers. J Am Chem Soc 129:10126–10132 52. Niv MY, Filizola M (2008) Influence of oligomerization on the dynamics of G-protein coupled receptors as assessed by normal mode analysis. Proteins 71:575–586 53. Yan ECY, Kazmi MA, Ganim Z et al (2003) Retinal counterion switch in the photoactivation of the G protein-coupled receptor rhodopsin. Proc Natl Acad Sci U S A 100:9262–9267 54. Ludeke S, Beck R, Yan ECY, Sakmar TP, Siebert F, Vogel R (2005) The role of Glu181 in the photoactivation of rhodopsin. J Mol Biol 353:345–356 55. Ivetac A, Campbell JD, Sansom MSP (2007) Dynamics and function in a bacterial ABC transporter: simulation studies of the BtuCDF system and its components. Biochemistry 46:2767–2778 56. Sonne J, Kandt C, Peters GH, Hansen FY, Jensen M, Tieleman DP (2007) Simulation of the coupling between nucleotide binding and transmembrane domains in the ATP binding cassette transporter BtuCD. Biophys J 92:2727–2734

Membrane Protein Dynamics from Femtoseconds to Seconds 57. Arkin IT, Xu H, Mù J et al (2007) Mechanism of Na+/H+ antiporting. Science 317:799–803 58. Law CJ, Almqvist J, Bernstein A et al (2008) Salt-bridge dynamics control substrate-induced conformational change in the membrane transporter GlpT. J Mol Biol 378:826–837 59. Lu WC, Wang CZ, Yu EW, Ho KM (2006) Dynamics of the trimeric AcrB transporter protein inferred from a B-factor analysis of the crystal structure. Proteins 62:152–158 60. Holyoake J, Sansom MSP (2007) Conformational change in an MFS protein: MD simulations of LacY. Structure 15:873–884 61. Davidson AL, Dassa E, Orelle C, Chen J (2008) Structure, function, and evolution of bacterial ATP-binding cassette systems. Microbiol Mol Biol Rev 72:317–364 62. Moussatova A, Kandt C, O’Mara ML, Tieleman DP (2008) ATP-binding cassette transporters in Escherichia coli. Biochim Biophys Acta 1778:1757–1771 63. Tirion MM (1996) Large amplitude elastic motions in proteins from a single-parameter, atomic analysis. Phys Rev Lett 77:1905–1908 64. Hunte C, Screpanti E, Venturi M, Rimon A, Padan E, Michel H (2005) Structure of a Na+/H+ antiporter and insights into mechanism of action and regulation by pH. Nature 435:1197–1202 65. Inoue H, Noumi T, Tsuchiya T, Kanazawa H (1995) Essential aspartic acid residues, Asp-133, Asp-163 and Asp-164, in the transmembrane helices of a Na+/H+ antiporter (NhaA) from Escherichia coli. FEBS Lett 363:264–268 66. Song L, Hobaugh MR, Shustak C, Cheley S, Bayley H, Gouaux JE (1996) Structure of staphylococcal alpha-hemolysin, a heptameric transmembrane pore. Science 274:1859–1866 67. Akeson M, Branton D, Kasianowicz JJ, Brandin E, Deamer DW (1999) Microsecond time-scale discrimination among polycytidylic acid, polyadenylic acid, and polyuridylic acid as homopolymers or as segments within single RNA molecules. Biophys J 77:3227–3233 68. Wells DB, Abramkina V, Aksimentiev A (2007) Exploring transmembrane transport through alpha-hemolysin with grid-steered molecular dynamics. J Chem Phys 127:125101–125110 69. Prince SM, Achtman M, Derrick JP (2002) Crystal structure of the OpcA integral membrane adhesin from Neisseria meningitidis. Proc Natl Acad Sci U S A 99:3417–3421 70. Cherezov V, Liu W, Derrick JP et al (2008) In meso crystal structure and docking simulations suggest an alternative proteoglycan

71.

72. 73.

74.

75. 76.

77.

78. 79.

80. 81.

82.

83.

439

binding site in the OpcA outer membrane adhesin. Proteins 71:24–34 Luan B, Caffrey M, Aksimentiev A (2007) Structure refinement of the OpcA adhesin using molecular dynamics. Biophys J 93:3058–3069 Shi N, Ye S, Alam A, Chen L, Jiang Y (2006) Atomic structure of a Na+- and K+-conducting channel. Nature 440:570–574 Noskov SY, Roux B (2007) Importance of hydration and dynamics on the selectivity of the KcsA and NaK channels. J Gen Physiol 129:135–143 Sperotto MM, Mouritsen OG (1991) MonteCarlo simulation studies of lipid order parameter profiles near integral membrane-proteins. Biophys J 59:261–270 Kranenburg M, Venturoli M, Smit B (2003) Molecular simulations of mesoscopic bilayer phases. Phys Rev E 67:060901 Venturoli M, Smit B, Sperotto MM (2005) Simulation studies of protein-induced bilayer deformations, and lipid-induced protein tilting, on a mesoscopic model for lipid bilayers with embedded proteins. Biophys J 88:1778–1798 Ramakrishnan M, Qu J, Pocanschi CL, Kleinschmidt JH, Marsh D (2005) Orientation of beta-barrel proteins OmpA and FhuA in lipid membranes. chain length dependence from infrared dichroism. Biochemistry 44:3515–3523 Marrink SJ, de Vries AH, Mark AE (2004) Coarse grained model for semiquantitative lipid simulations. J Phys Chem B 108:750–760 Marrink SJ, Risselada HJ, Yefimov S, Tieleman DP, de Vries AH (2007) The MARTINI forcefield: coarse grained model for biomolecular simulations. J Phys Chem B 111:7812–7824 Engelman DM (2005) Membranes are more mosaic than fluid. Nature 438:578–580 Fotiadis D, Liang Y, Filipek S, Saperstein DA, Engel A, Palczewski K (2003) Atomic-force microscopy: rhodopsin dimers in native disc membranes. Nature 421:127–128 Cruickshank CC, Minchin RF, Le Dain AC, Martinac B (1997) Estimation of the pore size of the large-conductance mechanosensitive ion channel of Escherichia coli. Biophys J 73:1925–1931 Chang G, Spencer RH, Lee AT, Barclay MT, Rees DC (1998) Structure of the MscL homolog from Mycobacterium tuberculosis: a gated mechanosensitive ion channel. Science 282:2220–2226

440

Kandt and Monticelli

84. Yefimov S, van der Giessen E, Onck PR, Marrink SJ (2008) Mechanosensitive membrane channels in action. Biophys J 94:2994–3002 85. Duan Y, Pa K (1998) Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. Science 282:740–744 86. Zagrovic B, Snow CD, Shirts MR, Pande VS (2002) Simulation of folding of a small alphahelical protein in atomistic detail using worldwide-distributed computing. J Mol Biol 324:1051 87. Monticelli L, Tieleman DP, Colombo G (2005) Mechanism of helix nucleation and propagation: microscopic view from microsecond time scale MD simulations. J Phys Chem B 109:20064–20067 88. Munoz V, Thompson PA, Hofrichter J, Eaton WA (1997) Folding dynamics and mechanism of beta-hairpin formation. Nature 390:196–199 89. Popot JL, Engelman DM (1990) Membrane protein folding and oligomerization – the 2-stage model. Biochemistry 29:4031–4037 90. Tamm LK, Hong H, Liang BY (2004) Folding and assembly of beta-barrel membrane proteins. Biochim Biophys Acta 1666:250–263 91. Bond PJ, Sansom MSP (2006) Insertion and assembly of membrane proteins via simulation. J Am Chem Soc 128:2697–2704

92. Deol SS, Domene C, Bond PJ, Sansom MSP (2006) Anionic phospholipid interactions with the potassium channel KcsA: simulation studies. Biophys J 90:822–830 93. Im W, Feig M, Brooks CL III (2003) An implicit membrane generalized born theory for the study of structure, stability, and interactions of membrane proteins. Biophys J 85:2900–2918 94. Ayton G, Bardenhagen SG, McMurtry P, Sulsky D, Voth GA (2001) Interfacing continuum and molecular dynamics: an application to lipid bilayers. J Chem Phys 114:6913–6924 95. Villa E, Balaeff A, Mahadevan L, Schulten K (2004) Multiscale method for simulating protein–DNA complexes. Multiscale Model Simul 2:527–553 96. Villa E, Balaeff A, Schulten K (2005) Structural dynamics of the lac repressor–DNA complex revealed by a multiscale simulation. Proc Natl Acad Sci U S A 102:6783–6788 97. Shi Q, Izvekov S, Voth GA (2006) Mixed atomistic and coarse-grained molecular dynamics: simulation of a membrane-bound ion channel. J Phys Chem B 110:15045–15048 98. Lyman E, Ytreberg FM, Zuckerman DM (2006) Resolution exchange simulation. Phys Rev Lett 96:028105 99. Lyman E, Zuckerman DM (2006) Resolution exchange simulation with incremental coarsening. J Chem Theory Comput 2:656–666

Chapter 23 The Family of G Protein-Coupled Receptors: An Example of Membrane Proteins Irina G. Tikhonova and Daniel Fourmy Abstract The G protein coupled receptors belong to the largest group of membrane proteins that regulates many essential physiological properties and represents an important class of drug targets. In this chapter, we show how the synergy between a laboratory experiment and computational modeling leads to structural delineation of the ligand binding pocket and how the knowledge of ligand–protein interactions is used for rational local and global drug design in which the structural knowledge of a particular receptor and its ligands is used for drug design of this particular GPCR and others. Key words: G protein coupled receptor, Cholecystokinin, Molecular modeling, Binding site, Site-directed mutagenesis, Drug design

1. Introduction The G protein-coupled receptors (GPCRs) are the largest group of cell surface membrane proteins, which form a transmembrane bundle composed of seven membrane-spanning alpha helices connected by loop regions. Binding with endogenous mediators causes conformational changes in GPCRs (1, 2) that lead to high affinity interaction of GPCRs with the cognate G protein and in turn initiate numerous downstream signaling pathways in cells (3, 4). GPCRs are signal transmitters for hormones, neuromediators, cytokines, lipids, peptides, small molecules, and various sensory exogenous stimuli, such as light, odors, and taste. Consequently, GPCRs are regulators of many life important cell processes and universal drug targets against various diseases. Because of structure-based drug design is a rational way to design novel small molecule ligands and to improve binding and Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4_23, © Springer Science+Business Media, LLC 2010

441

442

Tikhonova and Fourmy

selectivity of old drugs, many laboratories have been working on structural delineation of the ligand binding site of GPCRs. As membrane proteins, GPCRs have difficulties to be expressed, purified and crystallized in a large scale. Therefore, there is a limited number of high-resolution GPCR structures today: bovine and squid rhodopsin (5–9) opsin (10), b2 and b1 adrenergic (11–14), and A2A adenosine (15) receptors. For decades, the structural insight of GPCRs ligand binding site has being mainly gained using indirect methods such as receptor mutagenesis, ligand structure– activity relationships (SAR), and receptor modeling. Thus, till 2007, only the crystal structure of bovine rhodopsin was available and used to construct the low-resolution homology-based models of GPCRs. Although, the sequence homology between light-activated rhodopsin and ligand-activated GPCRs is low (about 11–17%) and the second extracellular loop (EL2) of rhodopsin structure buries deeply into the helical bundle and closes a putative ligand binding cavity, several modeling strategies have been developed to delineate the ligand binding site of ligand-activated GPCRs based on the rhodopsin structure. In one approach, the EL2 was removed before the docking of known ligands and added back when the ligand interactions with transmembrane helices were defined by molecular docking and receptor mutagenesis (16, 17). In another approach, forced molecular dynamics simulations were applied straight away to drive the ligand–protein interactions in the initial homology model according to experimental data (18–20). In addition, molecular simulations of the rhodopsin-based homology model in water– lipid environment were used for the formation of a binding cavity by initially added spheres with a flexible Van der Waals radius, and the formed cavity then were employed for docking of ligands (21). In all these approaches, the iterative application of computational and experimental studies to establish ligand–protein interactions was important, in which molecular modeling results were used to generate hypothesis that were then validated by receptor mutagenesis, and the mutagenesis results were used, in turn, to refine computational models. Moreover, the docking of different available ligands into the binding cavity and its experimental validation allowed better to shape and optimize the binding site. The recently published first crystal structures of beta 1,2-adrenergic receptors (11–14) have proved the earlier anticipated larger volume of the binding cavity and the solvent exposed conformation of the EL2 in the ligand-activated GPCRs. The homology modeling based on these structures will simplify the construction of the ligand binding sites of GPCRs; hence, the ligand–protein contacts are still required to be proved by mutagenesis data, especially when the sequence homology of residues in the binding cavity between a template structure and a modeled GPCR is very low.

The Family of G Protein-Coupled Receptors: An Example of Membrane Proteins

443

In this review, we show the case study of the ligand binding site delineation for cholecystokinin receptor named as CCK1 by the International Union of Pharmacology that has been done in our laboratory for the past ten years and others, and then we provide the rational strategies for local and global GPCRs drug discovery in which the information of the ligand binding site of a particular GPCR is used for drug design of the particular GPCR and others.

2. Materials Cholecystokinin (CCK) is a regulatory peptide having high affinity for CCK receptors. Cholecystokinin shares its carboxylterminal pentapeptide sequence with gastrin, another regulatory peptide. Cholecystokinin and gastrin differ in their selectivity for the two CCK receptor subtypes, the CCK1 (CCK1R) and the CCK2 (CCK2R) receptors, on the basis of tyrosine sulfation at the seventh position (CCK) or at the sixth position (gastrin) from the carboxyl-terminus (22). CCK1R are mainly found in the periphery where they regulate pancreatic secretion, gallbladder, and gastrointestinal motility, but are also found in some areas of central nervous system where they regulate satiety and analgesia. At present, a large set of converging data related to binding sites of CCK1R is currently available, giving a good picture of the binding mode of natural and synthetic ligands to this receptor. The data were provided using essentially four complementary approaches, site-directed mutagenesis, photoaffinity labeling, NMR-NOE transfer, and three-dimensional modeling. In the laboratory, we have combined the use of site-directed mutagenesis and molecular modeling to delineate the binding sites of the CCK1R. A model of the CCK1R was constructed by homology modeling and refined on the basis of site-directed mutagenesis data as the homology model of the CCK1R constructed on the basis of rhodospin template could not accommodate the peptidic ligand CCK into its binding site (23).

3. Methods 3.1. Mapping of CCK1R Binding Site: Synergy Between Laboratory and In Silico Experiments

The first contact point between the CCK1R and CCK was defined on the basis of photoaffinity labeling results (24). Indeed, in the course of biochemical studies of the pancreatic CCK1R, we identified a CCK1R truncated from its terminal moiety, using a peptidic antagonist (JMV 179) as a photoaffinity label.

444

Tikhonova and Fourmy

As this truncated CCK1R was not labeled by a CCK-derived photoaffinity label, we hypothetized that the lacking region in the truncated CCK1R contained the attachment site of the CCKderived photoaffinity label. Moreover, in a competition assay, the labeled truncated CCK1R seemed capable of interacting with CCK, but with a lower affinity than did the intact CCK1R. These biochemical results were a first indication that residue(s) located between the N-terminus of the CCK1R and top of TM I were probably in interaction with the N-terminal moiety of CCK through which the photoaffinity group was attached (24). The next step was the construction and analysis of the N-terminally truncated CCK1R, which led to the identification of a region at the junction between the transmembrane helix 1 and the N-terminal moiety, between residues 38–42, which was involved in the binding of CCK (25). Residues Trp39 and Gln40 of the receptor were then shown to be important for recognition of the C-terminal nonapeptide of CCK as Trp39Phe and Gln40Asn mutants demonstrated parallel decreases in both binding affinity, and potency to induce accumulation of inositol phosphates (26). To determine which region of CCK interacts with Trp39 and Gln40 at the binding site, we compared binding affinities of Trp39Phe and Gln40Asn mutants for CCK analogs modified at their C- and N-terminal ends. We postulated that only peptides containing residues capable of interacting with aminoacids Trp39 and Gln40 of the receptor would bind to the mutated receptors Trp39Phe and Gln40Asn with decreased affinities relative to the wild-type receptor. Such experiments identified residues at the N-terminal of the nonapeptide of CCK as likely in interaction with Trp39 and Gln40 (26). At this stage of the work, a three dimensional model of the CCK1R was constructed using transmembrane helical arrangement found in the bacteriorhodopsin crystal structure as starting point since at this time, the high resolution crystal of rhodopsin was not available yet (27). The model was then modified according to the Baldwin model for G-protein coupled receptors and to the mutant data-base “input/output” information scheme defined in the Viseur program (28). Extracellular and intracellular loops connecting the helices were then added to the preliminary 7-helix bundle, and the structural model was optimized by the use of simulated annealing procedures. The entire system was finally relaxed and submitted to one nanosecond molecular dynamics with possible translational and rotational movements of individual TM helices taken into account. The positioning of the CCK peptide in the modeled CCK1R was achieved using a docking model in which the C-terminal moiety of the CCK-related ligands was placed in the middle of the receptor transmembrane region, while the N-terminal part was positioned in its extracellular region near the entrance of the putative binding pocket.

The Family of G Protein-Coupled Receptors: An Example of Membrane Proteins

445

Fig. 1. Ligands of the CCK1 receptor.

By doing so, it appeared that the N-terminal residues of CCK nonapeptide (Fig. 1) could be easily connected to the receptor through a strong hydrogen bond and salt bridge network involving residues Trp39 and Gln40 identified experimentally in our site-directed mutagenesis-based study (26). Based on this first set of data, subsequent studies were devoted to identification of determinants of the receptor in interaction with other key residues of CCK. We constrained the N-terminus of CCK in interaction with Trp39 and Gln40, while the rest of CCK was positioned inside the receptor grove, allowing inspection of the molecular electrostatic potentials. Then, the docking was improved by simulated annealing calculations. The resulting structure obtained for the ligand/receptor complex was further refined using molecular dynamics and energy minimization. In a second step, experiments were performed by mutating candidate residue(s) of the receptor binding site, which had been depicted in the 3D model, and analyzing extensively effects of mutation(s). By doing so, several critical contacts between the CCK1R binding site and CCK were successively discovered: Met195 and Arg197, located in the second extracellular loop, were shown to interact with the sulfated tyrosine (29, 30); Arg336 and Asn333 at the top of helix VI were demonstrated to pair with the Asp carboxylate and the C-terminal amide of CCK, respectively (31). Subsequently, a network of hydrophobic residues from helices III, V, VI, and VII all forming a binding pocket for the CCK C-terminal region was identified. This binding mode of the C-terminus of CCK into CCK1R thus obtained appeared in agreement with an NMR study of the interactions between CCK and a fragment of CCK1R comprising the top portion of helix VI and the third extracellular loop as well as a fragment including amino acids at the top of transmembrane segment I (32, 33). Using photoaffinity labeling, two

446

Tikhonova and Fourmy

hits in the CCK1R were identified. The first was a Trp at the top of TM I using a photoprobe with the reactive moiety within the C-terminal Phe of CCK, and the second was a His within the third extracellular loop using a probe with a benzophenone in the place of the Gly of CCK (Fig. 2) (34, 35). Accordingly, a model of binding of CCK to the CCK1R was proposed in which the C-terminus of CCK and the tyrosine sulfate were in interaction with Trp39 and Arg197, respectively, and the N-terminal moiety was in contact with the third extracellular loop of the receptor (36). This

Fig. 2. Serpentine representation of the CCK1R depicting residues involved in the binding site of the full agonist CCK (a). Schematic representation of the CCK1R binding site with docked CCK (b).

The Family of G Protein-Coupled Receptors: An Example of Membrane Proteins

447

second model for CCK binding mode into the CCK1R binding site is somewhat divergent from that obtained on the basis of sitedirected mutagenesis results (19). Further key support for the model placing the C-terminal end of CCK in an hydrophobic cavity formed by helices III, V, VI, and VII came from a study dedicated to the understanding of the molecular basis for partial agonist activity of JMV 180 (Fig. 1), a CCK analog having the C-terminal amidated phenylalanine substituted by a phenylethyl ester (37). In this study combining site-directed mutagenesis experiments, the use of CCK-related peptides modified at their C-terminus and molecular modeling, we demonstrated that partial agonism of JMV180 was due to flexibility of the phenylethyl ester moiety of the ligand which could not allow optimal interaction with key aromatic residue for receptor activation in helices V and VI (38). The key role in the process of receptor activation of such aromatic residue (especially Trp6.48) is a general feature of the group I of GPCRs, as it was explored for instance in rhodopsin and more recently in the CCK2R (39, 40). The structure of CCK1R.CCK complex obtained on the basis of site-directed mutagenesis data was used to study the binding site of several synthetic nonpeptide agonists and antagonists. According to docking and experimental data, nonpeptide agonists and antagonists most likely occupy a region in CCK receptors that interacts with the amidated tetrapeptide of the C-terminal part of CCK, i.e., Trp-Met-Asp-Phe-NH2, and moreover, there is an overlap between agonist and antagonist binding sites. For example, we showed that structurally related compounds, the agonist SR-146131 and the antagonist SR-27,897 of CCK1R (Fig. 1), interact with Arg336(6.58) and Asn333(6.55) and locate in two hydrophobic sub-pockets composed of amino acids of helices III/VI and I/III/VII, respectively (19, 41). Interestingly, in the case of agonist SR-146,131, there is a hydrophobic interaction between cyclohexane moiety of the ligand and Leu356(7.39) of the CCK1R, which is lacking in the binding mode of the antagonist SR-27897 (19, 41). We further combined site-directed mutagenesis studies, SAR, and dynamic-based docking in order to identify the binding site of pyridopyrimidine-derived antagonists of the CCK1R and understand their selectivity towards this receptor type versus the CCK2R (42). This study, again, provided unambiguous evidence that the binding site of these antagonists is overlapping that of the C-terminal tetrapeptide of CCK. However, in the course of this study, several orientations for the ligand within the binding site were found, with an identical probability of occurrence. Only combination of experimental and SAR data enabled us to propose a very likely binding mode (42). Collectively, data provided by these studies validated the binding site of the CCK1R.

448

Tikhonova and Fourmy

A similar strategy as that used for the CCK1R was applied to the CCK2R leading to a complete delineation of its binding site (43–46). Moreover, on this latter CCK receptor subtype, we worked at identification of networks involved in receptor switching from the inactive state to the active state (40). The computational approach consisted in modeling the active conformation from the inactive-rhodopsin-based model, using sets of experimental restraints, which characterize the active conformation in family A, and CCK interactions, identified by biophysical and site directed mutagenesis techniques, respectively (40). The exercise of comparison of both conformations in the CCK2R led to the identification of different network rearrangements, which have been experimentally validated to be involved in the control of the equilibrium between the two conformations and therefore in the mechanism of activation. In this study, conversion of the CCK2R from the inactive to active conformation was studied by targeted molecular dynamics, a method based on MD previously applied to other protein families (47). This method was then applied to complexes formed between the CCK2R and two structurally very closely related nonpeptide ligand having either partial agonist activity or inverse agonist activity. Targeted molecular dynamics was able to discriminate the two compounds; however, correct docking of the compounds into the CCK2R binding pocket required experimental investigations based on site-directed mutagenesis (48). 3.2. Local GPCR Drug Design

The general flowchart of structure-based GPCR drug discovery is presented in Fig. 3. In the previous paragraph, we show on the example of CCK1R how the structural knowledge of the ligand binding site of a GPCR was provided using a synergy of modeling and experimental data. Once the ligand binding site is delineated, a structure-based lead search campaign can be carried out using

Fig. 3. The G-protein coupled receptors structure-base local drug design flowchart.

The Family of G Protein-Coupled Receptors: An Example of Membrane Proteins

449

high-throughput docking or/and receptor-based pharmacophore search. In the high-throughput docking, each small molecule of chemical libraries is placed inside of the binding cavity based on preferable electrostatic and Van der Waals interactions with a receptor, and the selection of a focused library for the experimental test is performed using a chosen threshold value of scoring functions, which is established with docking results of known ligands. Since scoring functions are simplified and rapid estimation of a binding energy and homology-based 3D structures of GPCRs ligand binding sites are low-resolution models, it has been shown that rescoring of docking results by different force field-based, knowledge-based and, empirical scoring functions and/or their consensus scoring improve the discrimination of known compounds from the random decoy in the retrospective virtual screenings for dopamine D3, muscarinic M1, vasopressin V1a (18), cannabinoid 2 receptor (49), 5-hydroxytryptamine receptors 5-HT2c (50), and chemokine CCR1 (51). Moreover, recently applied for ranking of the metabotropic glutamate receptor (mGluR) subtype 5 and b2 adrenergic receptor ligands, ligand-receptor interaction fingerprint-based similarity has shown higher enrichment rates relative to the scoring functions (52, 53). When a GPCR has already plenty of diverse ligands, for example, biogenic GPCRs, the application of docking-based virtual screening with ligand-based methods, such as ligand-based pharmacophores and QSAR (quantitative structure activity relationships) models, increases significantly the enrichment rates in the retrospective virtual screenings (16). The docking-based virtual screening for alpha 1 adrenergic receptor (54), melanin-concentrating hormone receptor (55), serotonin 5HT1 and 5HT4, tachykinin NK1 receptor, dopamine D2 receptor, chemokine CCKR3 (56) and CCKR5 (57) receptor, free fatty acid receptor 1 FFAR1 (58), and thyrotropin-releasing hormone receptor TRH1 (59) lead to the identification of novel ligands with micro/nano molar activity. In receptor-based pharmacophore search, the ligand binding site is represented as a 3D-pharmacophore, in which important for ligand binding amino acid residues are coded as acceptor, donor, hydrophobic, or other features, and the shape of binding cavity is described by a molecular surface and formed a spatial constraint on the possible ligand atom location. The screening library of small molecules is flexibly aligned to the pharmacophore, and the focused library is selected using the defined threshold value of root-mean square distance (RMSD) between the query features and matching ligand atoms, and of Van der Waals (VdW) radii of selected atoms for spatial constraints. The threshold value of RMSD and VdW radii has to preliminary set using benchmark calculation of a set of known ligands in a random decoy.

450

Tikhonova and Fourmy

The application of this search was shown in retrospective screening of the metabotropic glutamate receptors (60). Since this method is too coarse-grained, it provides more rough and large libraries, which can be minimized by a subsequent docking procedure. Indeed, a receptor-based pharmacophore filter was used to create a focused library for TRH1 receptor in the multistep virtual screening leading to identification of diverse high potent antagonists (59). The structure-based optimization campaign is devoted to improve the binding properties of pharmacologically confirmed lead compounds or already known ligands by creating structural analogs using the information about the ligand binding site. The accurate docking of lead compounds provides visual information about what chemical groups can be incorporated to improve the affinity. 3D-quantitative structure–activity relationship models (QSAR) can be built using the set of known analogs with experimentally measured binding affinities and knowledge of ligand– protein interactions to predict in silico affinities of novel analogs. As learnt from previous paragraphs, the GPCR ligand binding site can be used for the discovery of novel ligands and improvement of the binding affinity. Can we use the ligand binding site to predict the efficacy of the ligands? Intuitively, to predict the agonistic and antagonistic properties, the knowledge of different GPCR conformations is required, especially the conformations of active and inactive states. It has been shown that the ligand binding site optimized based on the full potent agonist allows to better discriminate the agonists in retrospective screening than the ligand binding site optimized based on antagonists (18, 53). Indeed, the virtual screening of the FFA1 agonist binding site led to the discovery of full and partial agonists (58). 3.3. Global GPCRs Drug Design

For decades, many laboratories have been working to integrate chemistry, biology, and medicine of GPCRs with the aim of rational discovery of novel potent drugs. As a result of this work, huge amount of data related to structure-function relationships of receptors and SAR of the small molecules have been produced for many GPCRs. As gaining from the previous sections, structural information of the ligand binding site of GPCRs has been learnt indirectly through synergetic application of site-directed mutagenesis and molecular modeling. The information about GPCR mutagenesis data, sequences, and modeling is collected in several web pages (Fig. 4). A large number of small molecules were designed using medicinal chemistry, high-throughput and virtual screening approaches and tested for GPCR activity. The information about known ligands and their biological activity was used in turn to predict novel molecules using receptor and ligand based

The Family of G Protein-Coupled Receptors: An Example of Membrane Proteins

451

Fig. 4. G protein-coupled receptors internet resources.

Fig. 5. The G-protein coupled receptors structure-base global drug design flowchart.

molecular modeling approaches. Moreover, small molecule knowledge allowed to determine substructures or so-called privileges structures that are active in a particular receptor subtype (20). A database of GPCR ligands with the vast biological data is publicly available (Fig. 4). All these accumulated chemical and biological knowledge promote novel global GPCR drug design strategies (61) in which the knowledge gained from well-characterized receptors with plenty known ligands can be applied for the search of novel ligands of similar GPCRs but poor-characterized structurally and with a few known ligands (62) (Fig. 5). Thus, the sequence analysis of residues in the ligand binding sites of GPCRs can be done to find the closest homologous receptors to the receptor of interest, then known ligands for this receptor can be used as lead structures to generate high-potent ligands for the GPCR of interest. This approach was used to discover the first nonpeptide

452

Tikhonova and Fourmy

ligands for the somastotin receptor subtype 5 SST5 (63). A virtual screening methodology using the information about protein sequences and known nonnative ligands was developed to predict ligands for orphan GPCR receptors (64). References 1. Tikhonova IG, Best RB, Engel S, Gershengorn MC, Hummer G, Costanzi S (2008) Atomistic insights into rhodopsin activation from a dynamic model. J Am Chem Soc 130(31):10141–10149 2. Kobilka BK, Deupi X (2007) Conformational complexity of G-protein-coupled receptors. Trends Pharmacol Sci 28(8):397–406 3. Johnston CA, Siderovski DP (2007) Receptormediated activation of heterotrimeric G-proteins: current structural insights. Mol Pharmacol 72(2):219–230 4. Thompson MD, Cole DE, Jose PA (2008) Pharmacogenomics of G protein-coupled receptor signaling: insights from health and disease. Methods Mol Biol 448:77–107 5. Palczewski K, Kumasaka T, Hori T et al (2000) Crystal structure of rhodopsin: A G protein-coupled receptor. Science 289(5480):739–745 6. Teller DC, Okada T, Behnke CA, Palczewski K, Stenkamp RE (2001) Advances in determination of a high-resolution three-dimensional structure of rhodopsin, a model of G-proteincoupled receptors (GPCRs). Biochemistry 40(26):7761–7772 7. Li J, Edwards PC, Burghammer M, Villa C, Schertler GF (2004) Structure of bovine rhodopsin in a trigonal crystal form. J Mol Biol 343(5):1409–1438 8. Okada T, Sugihara M, Bondar AN, Elstner M, Entel P, Buss V (2004) The retinal conformation and its environment in rhodopsin in light of a new 2.2 A crystal structure. J Mol Biol 342(2):571–583 9. Murakami M, Kouyama T (2008) Crystal structure of squid rhodopsin. Nature 453(7193):363–367 10. Park JH, Scheerer P, Hofmann KP, Choe HW, Ernst OP (2008) Crystal structure of the ligand-free G-protein-coupled receptor opsin. Nature 454(7201):183–187 11. Warne T, Serrano-Vega MJ, Baker JG et al (2008) Structure of a beta1-adrenergic G-proteincoupled receptor. Nature 454(7203):486–491 12. Rosenbaum DM, Cherezov V, Hanson MA et al (2007) GPCR engineering yields highresolution structural insights into beta2-adrenergic receptor function. Science 318(5854): 1266–1273

13. Cherezov V, Rosenbaum DM, Hanson MA et al (2007) High-resolution crystal structure of an engineered human beta2-adrenergic G protein-coupled receptor. Science 318(5854):1258–1265 14. Rasmussen SG, Choi HJ, Rosenbaum DM et al (2007) Crystal structure of the human beta2 adrenergic G-protein-coupled receptor. Nature 450(7168):383–387 15. Jaakola VP, Griffith MT, Hanson MA et al (2008) The 2.6 Angstrom crystal structure of a human A2A adenosine receptor bound to an antagonist. Science 322(5905):1211–1217 16. Evers A, Hessler G, Matter H, Klabunde T (2005) Virtual screening of biogenic aminebinding G-protein coupled receptors: comparative evaluation of protein- and ligand-based virtual screening protocols. J Med Chem 48(17):5448–5465 17. Tikhonova IG, Sum CS, Neumann S et al (2007) Bidirectional, iterative approach to the structural delineation of the functional “chemoprint” in GPR40 for agonist recognition. J Med Chem 50(13):2981–2989 18. Bissantz C, Bernard P, Hibert M, Rognan D (2003) Protein-based virtual screening of chemical databases. II. Are homology models of G-Protein coupled receptors suitable targets? Proteins 50(1):5–25 19. Archer-Lahlou E, Tikhonova I, Escrieut C et al (2005) Modeled structure of a G-proteincoupled receptor: the cholecystokinin-1 receptor. J Med Chem 48(1):180–191 20. Klabunde T, Hessler G (2002) Drug design strategies for targeting G-protein-coupled receptors. Chembiochem 3(10):928–944 21. Krystek SR Jr, Kimura SR, Tebben AJ (2006) Modeling and active site refinement for G protein-coupled receptors: application to the beta-2 adrenergic receptor. J Comput Aided Mol Des 20(7–8):463–470 22. Dufresne M, Seva C, Fourmy D (2006) Cholecystokinin and gastrin receptors. Physiol Rev 86(3):805–847 23. Archer E, Maigret B, Escrieut C, Pradayrol L, Fourmy D (2003) Rhodopsin crystal: new template yielding realistic models of G-proteincoupled receptors? Trends Pharmacol Sci 24(1):36–40

The Family of G Protein-Coupled Receptors: An Example of Membrane Proteins 24. Poirot SS, Escrieut C, Dufresne M et al (1994) Photoaffinity labeling of rat pancreatic cholecystokinin type A receptor antagonist binding sites demonstrates the presence of a truncated cholecystokinin type A receptor. Mol Pharmacol 45(4):599–607 25. Kennedy K, Escrieut C, Dufresne M, Clerc P, Vaysse N, Fourmy D (1995) Identification of a region of the N-terminal of the human CCKA receptor essential for the high affinity interaction with agonist CCK. Biochem Biophys Res Commun 213(3):845–852 26. Kennedy K, Gigoux V, Escrieut C et al (1997) Identification of two amino acids of the human cholecystokinin-A receptor that interact with the N-terminal moiety of cholecystokinin. J Biol Chem 272(5):2920–2926 27. Henderson R, Baldwin JM, Ceska TA, Zemlin F, Beckmann E, Downing KH (1990) Model for the structure of bacteriorhodopsin based on high-resolution electron cryo-microscopy. J Mol Biol 213(4):899–929 28. Campagne F, Jestin R, Reversat JL, Bernassau JM, Maigret B (1999) Visualisation and integration of G protein-coupled receptor related information help the modelling: description and applications of the Viseur program. J Comput Aided Mol Des 13(6):625–643 29. Gigoux V, Maigret B, Escrieut C et al (1999) Arginine 197 of the cholecystokinin-A receptor binding site interacts with the sulfate of the peptide agonist cholecystokinin. Protein Sci 8(11):2347–2354 30. Gigoux V, Escrieut C, Silvente-Poirot S et al (1998) Met-195 of the cholecystokinin-A receptor interacts with the sulfated tyrosine of cholecystokinin and is crucial for receptor transition to high affinity state. J Biol Chem 273(23):14380–14386 31. Gigoux V, Escrieut C, Fehrentz JA et al (1999) Arginine 336 and asparagine 333 of the human cholecystokinin-A receptor binding site interact with the penultimate aspartic acid and the C-terminal amide of cholecystokinin. J Biol Chem 274(29):20457–20464 32. Giragossian C, Mierke DF (2001) Intermolecular interactions between cholecystokinin-8 and the third extracellular loop of the cholecystokinin A receptor. Biochemistry 40(13):3804–3809 33. Giragossian C, Sugg EE, Szewczyk JR, Mierke DF (2003) Intermolecular interactions between peptidic and nonpeptidic agonists and the third extracellular loop of the cholecystokinin 1 receptor. J Med Chem 46(16):3476–3482 34. Ji Z, Hadac EM, Henne RM, Patel SA, Lybrand TP, Miller LJ (1997) Direct identification of a distinct site of interaction between

35.

36.

37.

38.

39.

40.

41.

42.

43.

44.

453

the carboxyl-terminal residue of cholecystokinin and the type A cholecystokinin receptor using photoaffinity labeling. J Biol Chem 272(39): 24393–24401 Hadac EM, Pinon DI, Ji Z et al (1998) Direct identification of a second distinct site of contact between cholecystokinin and its receptor. J Biol Chem 273(21):12988–12993 Ding XQ, Pinon DI, Furse KE, Lybrand TP, Miller LJ (2002) Refinement of the conformation of a critical region of charge–charge interaction between cholecystokinin and its receptor. Mol Pharmacol 61(5):1041–1052 Galas MC, Lignon MF, Rodriguez M et al (1988) Structure–activity relationship studies on cholecystokinin: analogues with partial agonist activity. Am J Physiol 254(2 Pt 1): G176–G182 Archer-Lahlou E, Escrieut C, Clerc P et al (2005) Molecular mechanism underlying partial and full agonism mediated by the human cholecystokinin-1 receptor. J Biol Chem 280(11):10664–10674 Crocker E, Eilers M, Ahuja S et al (2006) Location of Trp265 in metarhodopsin II: implications for the activation mechanism of the visual receptor rhodopsin. J Mol Biol 357(1):163–172 Marco E, Foucaud M, Langer I, Escrieut C, Tikhonova IG, Fourmy D (2007) Mechanism of activation of a G protein-coupled receptor, the human cholecystokinin-2 receptor. J Biol Chem 282(39):28779–28790 Escrieut C, Gigoux V, Archer E et al (2002) The biologically crucial C terminus of cholecystokinin and the non-peptide agonist SR-146, 131 share a common binding site in the human CCK1 receptor. Evidence for a crucial role of Met-121 in the activation process. J Biol Chem 277(9):7546–7555 Martin-Martinez M, Marty A, Jourdan M et al (2005) Combination of molecular modeling, site-directed mutagenesis, and SAR studies to delineate the binding site of pyridopyrimidine antagonists on the human CCK1 receptor. J Med Chem 48(15):4842–4850 Silvente-Poirot S, Escrieut C, Gales C et al (1999) Evidence for a direct interaction between the penultimate aspartic acid of cholecystokinin and histidine 207, located in the second extracellular loop of the cholecystokinin B receptor. J Biol Chem 274(33): 23191–23197 Gales C, Poirot M, Taillefer J et al (2003) Identification of tyrosine 189 and asparagine 358 of the cholecystokinin 2 receptor in direct interaction with the crucial C-terminal amide of cholecystokinin by molecular modeling,

454

45.

46.

47.

48.

49.

50.

51.

52.

53.

54.

Tikhonova and Fourmy site-directed mutagenesis, and structure/affinity studies. Mol Pharmacol 63(5):973–982 Langer I, Tikhonova IG, Travers MA et al (2005) Evidence that interspecies polymorphism in the human and rat cholecystokinin receptor-2 affects structure of the binding site for the endogenous agonist cholecystokinin. J Biol Chem 280(23):22198–22204 Foucaud M, Tikhonova IG, Langer I et al (2006) Partial agonism, neutral antagonism, and inverse agonism at the human wild-type and constitutively active cholecystokinin-2 receptors. Mol Pharmacol 69(3):680–690 Rodriguez-Barrios F, Balzarini J, Gago F (2005) The molecular basis of resilience to the effect of the Lys103Asn mutation in nonnucleoside HIV-1 reverse transcriptase inhibitors studied by targeted molecular dynamics simulations. J Am Chem Soc 127(20): 7570–7578 Foucaud M, Marco E, Escrieut C, Low C, Kalindjian B, Fourmy D (2008) Linking nonpeptide ligand binding mode to activity at the human cholecystokinin-2 receptor. J Biol Chem 283(51):35860–35868 Chen JZ, Wang J, Xie XQ (2007) GPCR structure-based virtual screening approach for CB2 antagonist search. J Chem Inf Model 47(4):1626–1637 Bissantz C, Schalon C, Guba W, Stahl M (2005) Focused library design in GPCR projects on the example of 5-HT(2c) agonists: comparison of structure-based virtual screening with ligand-based search methods. Proteins 61(4):938–952 Vaidehi N, Schlyer S, Trabanino RJ et al (2006) Predictions of CCR1 chemokine receptor structure and BX 471 antagonist binding followed by experimental validation. J Biol Chem 281(37):27613–27620 Radestock S, Weil T, Renner S (2008) Homology model-based virtual screening for GPCR ligands using docking and target-biased scoring. J Chem Inf Model 48(5):1104–1117 de Graaf C, Rognan D (2008) Selective structure-based virtual screening for full and partial agonists of the beta2 adrenergic receptor. J Med Chem 51(16):4978–4985 Evers A, Klabunde T (2005) Structure-based drug discovery using GPCR homology modeling:

55.

56.

57.

58.

59.

60.

61. 62. 63.

64.

successful virtual screening for antagonists of the alpha1A adrenergic receptor. J Med Chem 48(4):1088–1097 Cavasotto CN, Orry AJ, Murgolo NJ et al (2008) Discovery of novel chemotypes to a G-protein-coupled receptor through ligandsteered homology modeling and structurebased virtual screening. J Med Chem 51(3):581–588 Becker OM, Marantz Y, Shacham S et al (2004) G protein-coupled receptors: in silico drug discovery in 3D. Proc Natl Acad Sci U S A 101(31):11304–11309 Kellenberger E, Springael JY, Parmentier M, Hachet-Haas M, Galzi JL, Rognan D (2007) Identification of nonpeptide CCR5 receptor agonists by structure-based virtual screening. J Med Chem 50(6):1294–1303 Tikhonova IG, Sum CS, Neumann S et al (2008) Discovery of novel agonists and antagonists of the free fatty acid receptor 1 (FFAR1) using virtual screening. J Med Chem 51(3):625–633 Engel S, Skoumbourdis AP, Childress J et al (2008) A virtual screen for diverse ligands: discovery of selective G protein-coupled receptor antagonists. J Am Chem Soc 130(15):5115–5123 Kratochwil NA, Malherbe P, Lindemann L et al (2005) An automated system for the analysis of G protein-coupled receptor transmembrane binding pockets: alignment, receptor-based pharmacophores, and their application. J Chem Inf Model 45(5): 1324–1336 Rognan D (2007) Chemogenomic approaches to rational drug design. Br J Pharmacol 152(1):38–52 Klabunde T (2007) Chemogenomic approaches to drug discovery: similar receptors bind similar ligands. Br J Pharmacol 152(1):5–7 Martin RE, Green LG, Guba W, Kratochwil N, Christ A (2007) Discovery of the first nonpeptidic, small-molecule, highly selective somatostatin receptor subtype 5 antagonists: a chemogenomics approach. J Med Chem 50(25):6291–6294 Bock JR, Gough DA (2005) Virtual screen for ligands of orphan G protein-coupled receptors. J Chem Inf Model 45(5):1402–1414

INDEX

A

C

Absorption................................................. 5–10, 16, 32, 38, 66, 81, 93, 150, 425, 430 ADP/ATP transporter (AAC, ANT, ANCP) expression .................................................................. 26 inhibitors bonkrelic acid (BA) .......................26, 107, 112, 115 carboxyatractyloside (CATR) ....................... 20, 21, 24–27, 107, 109, 110, 112, 115 purification .........................................20, 107, 113, 115 Alpha helices break points ............................................................. 291 helix–helix packing .......................................... 292, 293 helix-turn-helix......................... 286, 288–290, 292, 293 interactions with lipid .............................................. 279 structure ........................................................... 284–285 TM-turn-TM .......................................................... 292 Amphiphile ..................................................................... 80 Aquaporin (AQP)................... 171, 172, 174, 181–183, 410 ATP binding cassette transporter (ABCG2) expression ......................................................53, 55, 58, 59, 62, 65, 66, 68, 70, 71 inhibitors Hoechst ............................................33, 342, 59–61 vanadate ....................................................64, 67, 72 purification ....................... 48–49, 62–64, 66–69, 72, 73

Calcium ATPase (Ca-ATPase) inhibitor, Thapsigargin .............................124, 131, 238 Carbon films ..............................................6, 181, 189–190, 194–197, 203, 204 Cells epithelial .......................................................... 172, 179 HT-29.. .......................................................... 222–224 Sf9........................................................... 47, 48, 55–58 Channel..................................................... 84, 90, 98, 131, 171, 172, 183, 262, 272, 273, 287, 324, 343, 347, 350, 351, 355, 364, 370, 372, 373, 378, 381, 387, 400, 409, 411, 429–432, 434 Chimera software .................................................. 223, 231 Circular dichroism ...................................... 4, 51, 52, 64, 93 ClustalW software ..................................377, 389, 392, 395 Column affinity (Ni-NTA)...................................................... 30 gel filtration ....................................................... 63, 308 hydroxyapatite............................................................ 25 reverse phase ............................................................ 308 size exclusion ........................................................... 161 Cryoelectronmicroscopy (cryo-EM)...................... 187, 196 Crystallization ...............................................16, 20, 23, 24, 79–100, 107, 108, 110, 111, 115, 123, 124, 131, 144, 146–150, 153–157, 161–165, 187–204, 250, 284 Crystallography ...............................................79–100, 106, 110, 122, 161, 183, 187–204, 207–209, 217, 238, 239, 284, 297, 304, 412, 424

B Bacteria expression ........................6, 9, 29–31, 33, 263, 322, 337 inclusion bodies ..................................... 6, 9, 31, 35–38, 322, 326–327, 337 Bacteriorhodopsin (BR) ................................84–85, 87–90, 98–100, 187, 285, 293, 295, 296, 430, 444 Baculovirus/insect cell systems ............................ 48, 67–71 Beta barrels ............................................210, 216, 262–264, 267–270, 272, 275, 284, 291, 293, 322, 329, 332, 336, 364, 435 Beta sheet... ......................................................66, 425, 434 Bio Beads............................................... 5, 9–13, 16, 23–25, 107, 189, 192–193, 202–203, 211, 306, 309 Bioinformatics ............................................................... 398

D Data collection (data set collection).......................100, 144, 146, 150, 159–160, 166, 218 Detergents cholamidopropyl dimethylammonio hydroxy propanesulfonate (CHAPSO) ................. 89, 90 cholamidopropyl dimethylammonio propanesulfonate (CHAPS) ...................................39, 67, 69, 203 dihexanoylphosphatidylcholine (DHPC) ................. 32, 39, 89, 90, 266, 328, 338 dodecylmaltopyranoside (DDM) .................39, 62, 266

Jean-Jacques Lacapère (ed.), Membrane Protein Structure Determination: Methods and Protocols, Methods in Molecular Biology, vol. 654, DOI 10.1007/978-1-60761-762-4, © Springer Science+Business Media, LLC 2010

455

MEMBRANE PROTEIN STRUCTURE DETERMINATION 456 Index Detergents (Continued ) dodecylphosphocholine (DPC) .......................5–10, 12, 13, 39, 41, 42, 310, 315, 328 fos-choline 16 (FC-16).............................................. 67 lauroylsarcosine (sarcosyl) .................................... 31, 39 laurylamido dimethylpropylaminoxide (LAPAO) ................................................. 20, 25 lauryldimethylamine oxide (LDAO)................................................. 39, 266 lyso-phosphatidylglycerol (LPPG)........................................................ 266 micelle......................................................72, 81, 90, 97, 262, 263, 265, 276, 284, 286, 288, 290, 293, 310, 312–315, 321, 322, 327, 435 octaethylene glycol monododecyl ether (C12E8) ................................................. 39, 202 octylglucopyranoside (OG) ............................... 85, 100 perfluoro-octanoic acid (PFO) ............................ 67, 69 sodium dodecyl sulfate (SDS) .............................. 5–11, 13, 16, 31, 35, 39, 41, 69 tetraethylene glycol monooctyl ether (C8E4)........................................ 324, 328 triton X-100 (TX-100) .............................................. 39 zwittergent 3–14 (ZW 3–14) .......................... 324, 328 Dialysis buttons ............................................................. 193, 194 membranes ............................... 189, 193, 194, 306, 323 Diffraction electron .............................................................. 92, 108 neutron ...................................................................... 97 X-ray........................................................................91 Docking..... ....................................................134, 135, 239, 364, 375, 381, 382, 399, 442, 445, 447–450 DomainFinder software .................................240, 244–246 Dynamics................................................. 93, 105, 112, 113, 115, 255, 261–263, 271–272, 279, 294, 304, 305, 315, 316, 336, 341, 342, 344, 346, 350–352, 355–357, 364, 365, 372, 374, 399, 403–418, 423–436, 442, 444, 445, 448

Escherichia coli (E. Coli) ...................................... 6, 9, 29, 30, 33–35, 39, 42, 48, 49, 53, 71, 91, 189, 191, 210, 263, 271, 287, 293, 305, 307, 317, 322, 324, 326, 368, 431 Expression.............................................4, 5, 29–31, 33–35, 39, 43, 48, 53–55, 58, 59, 65, 68, 70, 124, 190, 210, 263–265, 277, 305, 307–308, 310, 316, 322, 323, 337

F Fluorescence (fluorescence recovery after photobleaching, FRAP) .......................... 51, 60, 66, 146, 150–153, 156, 162, 164, 412

G G coupled protein receptor (GPCR) ........................ 39, 89, 99, 141–166, 279, 285–294, 296, 365, 369, 387, 389, 393, 398, 410, 429, 434, 441–443, 447–452

H His-tag proteins................................................72, 161, 163 Hydrogen bonds ..............109, 129, 134, 285, 335, 336, 445 Hydrophobic clusters............................................. 134, 136

I ImageJ software ...............190, 201, 223, 227–229, 231, 234 Image processing ....................................182–183, 223, 226 Immobilized metal ion affinity chromatography (IMAC) ............................................. 26, 35–40 IMOD software .............................................213, 215, 218 Inflammation ......................................................... 222, 432 Ion transport, ion channel ............................................. 287 Isopropyl thio galactopyranoside (IPTG)................. 33–35, 53–55, 264, 305, 308, 310, 323–326

K Klebsiella pneumoniae (KpOmpA).......................... 321–338

L

E Electron microscopy cryo...... ............................................175, 177, 181–182, 191, 196, 207, 208, 237 crystallography .................. 183, 187–204, 207–209, 217 density maps .................................................... 239–241 fitting procedure .............................................. 240–244 freeze fracture ........................... 173, 177, 178, 181, 184 grids..... .....................................189–191, 195–197, 224 missing wedge.......................................................... 218 negative staining ...............................178–179, 190, 208 section.............................................................. 177–180 tomography...................................................... 221–234 Electrostatic interactions ................................109, 127, 128

Labelling carbon (12C and 13C) ......................................34, 43, 44, 264–267, 271, 276–279, 322–326, 328, 329, 332, 334, 335, 411 hydrogen (1H and 2H) .................................... 266–269, 271, 277, 278, 311, 313, 322, 324, 326–330, 334–338, 342, 343, 347–354 nitrogen (14N and 15N) ...................................34, 43, 44, 265–269, 271, 273, 275–278, 305, 311, 313, 315, 322–335, 337, 342, 343, 350, 416 perdeuteration...........................................266, 329–332 phosphorus (31P) ......................................275, 306, 310, 311, 342, 343, 345, 347, 349 selective methyl protonation .................................... 322

MEMBRANE PROTEIN STRUCTURE DETERMINATION 457 Index seleno methionine ...................................................... 91 stable isotope labelling ............................................. 267 Lactose permease............................................107, 294, 368 Lipid analysis.............................. 149, 188–189, 191–192, 207 azolectine ............................................................. 67, 72 bicelles.................................................89–90, 143, 262, 263, 266, 274–275, 279, 293, 344, 345, 347–350, 355, 356 bilayer................................................... 6, 11, 80, 84, 90, 146, 148, 171, 187, 188, 191, 209, 210, 212, 215–217, 274, 277, 284, 310, 312–314, 341–359, 369, 373, 374, 410, 412, 435 cholesterol................................................147, 165, 342, 344, 346, 352, 358, 409, 433 cubic phase (LCP) .........................................81, 83–85, 87, 88, 100, 107, 143–157, 159, 161–163, 165 detergent extraction .................................... 4, 5, 107, 209, 215 removal ..................................................5, 9–13, 16, 188, 189, 191, 193–194, 209–211, 309 gel and fluid phase ...................... 85, 351, 352, 411, 417 hexagonal phase ........................................156, 157, 162 lamellar phase ............................................98, 100, 156, 162, 163, 347, 350, 352 liposome ........................................................5, 11, 107, 210, 342, 347–350, 352, 354 liquid-ordered phase ........................................ 341, 352 membrane ................................................11, 48, 69, 71, 81, 113, 207–209, 212, 387, 433, 434 monoolein (MO) ...........................................81, 84–86, 98, 100, 144, 147, 151, 163, 164, 189 oriented ............................ 193, 347, 350–352, 355, 377 reconstitution .....................................72, 144, 146–148, 153, 188, 191, 192, 202, 203, 207, 209, 210, 266, 274, 306, 309, 313 sphingolipids.............................................341, 409, 433 sponge phase .................................................86, 87, 165 vesicle multilamellar ....................... 5, 67, 72, 347, 351, 352 unilamellar ................................................. 312, 313 Liquid chromatography fast protein liquid chromatography (FPLC) ................................... 30, 31, 36, 37, 44 high performance liquid chromatography (HPLC) .................. 44, 124, 144, 306, 308, 309 Loop............... .......................................108, 113, 133–135, 143, 158–160, 163, 183, 253, 264, 275, 285–296, 333, 367–369, 375, 379, 394, 397–400, 417, 432, 441, 442, 444, 446

M Mass spectroscopy (MALDI-TOF) .................4, 5, 30, 309

Medium for bacterial culture Luria Bertani (LB) ..................................30, 31, 33–35, 43, 49, 53, 54, 324, 325 minimal (M9) ....................... 34, 43, 236, 305, 323, 325 Membrane.. ...........................................3–17, 19–27, 29–44, 47–73, 79–100, 105–111, 113, 115, 120–122, 124–126, 128–136, 141–166, 171–185, 187–204, 207–218, 221–234, 238, 244–247, 250, 252, 253, 261–279, 283–298, 303–317, 321–338, 341, 342, 344–346, 352–355, 363–382, 387–390, 393–395, 397–400, 403–418, 423–436, 441–452 Merging data ................................................................. 160 Metal fluoride ................................................................ 123 Mitochondria inner membrane..................................19, 105, 106, 233 intermembrane space (IMS) .....................107–111, 113 outer membrane....................................................... 106 proteins (mitochondrial carrier family (MCF)) ............................................4, 5, 19–27, 30, 106–108, 110, 111, 113, 115, 222 ultra-structure ...........................................221, 224–225 Modeller software.................................................. 394, 396 Modelling ab initio.............................................368, 376, 380, 426 comparative ..............................................388, 393–399 homology ................................. 114, 115, 143, 364, 365, 376, 388, 389, 394, 395, 399, 406, 442, 443, 449 Molecular dynamics.............................................. 112–113, 115, 255, 294, 364, 365, 372–374, 399, 403–418, 426–428, 436, 442, 444, 445, 448 Molecular modelling ..............................240, 256, 363, 375 Molecular refinement .................................................... 388 Multi drug efflux pump (MexA, MexB, OprM) ................................ 208 Mutagenesis (mutants) ........................................21, 22, 26, 49, 122, 132, 143, 255, 267, 305, 316, 382, 410, 430, 431, 442, 445, 447, 448, 450

N Neutron diffraction .................................................................. 97 diffusion........................................................88, 90, 412 scattering .........................88, 90, 99, 100, 121, 411, 412 Nuclear magnetic resonance (NMR) chemical shift...........................................262, 267–269, 271, 272, 274, 275, 304, 314, 330, 336, 337, 347, 355, 357, 416 correlation spectroscopy (COSY ) .................... 354, 357 distance restrains...............................268–270, 278, 336 heteronuclear multiple-quantum coherence (HMQC).. 329, 333, 335, 337 heteronuclear single-quantum coherence (HSQC ) .............................265, 266, 271, 311, 317, 328, 330, 334, 335, 337

MEMBRANE PROTEIN STRUCTURE DETERMINATION 458 Index Nuclear magnetic resonance (NMR) (Continued ) high resolution magic angle spinning (HR-MAS )..................................342, 343, 352 magic angle ..............................................272, 275, 313, 342, 343, 348, 354, 416 nuclear overhauser enhancement (NOE) .................. 267–270, 313, 333–335, 443 nuclear overhauser enhancement spectroscopy (NOESY ) .................... 269, 322, 329–335, 337 oriented sample......................... 272, 275, 312–313, 414 paramagnetic relaxation enhancement (PRE) ...........................................268, 270, 322 phase modulated Lee–Goldburg scheme (PMLG ) ..................................................... 313 polarised inversion spin exchange at the magic angle (PISEMA) .................... 275, 276, 313–315, 416 polarity index slant angle (PISA)..................... 275, 276 quadrupolar splitting .......................................344, 346, 351, 352, 355, 358, 411, 413 relaxation time ..................................344–346, 355–358 residual dipolar coupling (RDC) ..............270–271, 322 softwares NMRpipe .................................................. 324, 328 NMRview ...................................324, 328, 335, 336 TALOS...................................................... 278, 336 solid state (ssNMR) ........... 89, 261–279, 304, 312–315, 343–346, 354, 358, 413, 416, 430 solution ..............................................43, 261–279, 284, 287–289, 291–293, 295, 304, 306, 310–315, 317, 321–338, 345, 354 total correlation spectroscopy (TOCSY ) ................ 322, 329, 330, 332, 334, 355, 357 transverse relaxation-optimized spectroscopy (TROSY ) HNCA(CO) ...............................267, 269, 271, 330 HNCACB ..................................267, 269, 329, 332 HN(CO)CACB ........................................ 329, 337 two pulse phase modulated sequence (TPPM) ....................................................... 313 wide line spectra .............................................. 344–352 Normal modes ...............................................113, 237–256, 364, 369, 374–375, 427, 431

P Peptide backbone ...................................................266, 275, 371 plane................................................242, 255, 275, 276, 290, 413, 414, 416–418, 434 side chain ..........................................242, 255, 266, 369 Phospholamban (PLN) ..................................271, 303–317 Phospholipids dimyristoil phosphatidyl choline (DMPC) ......................... 5, 11, 89, 90, 273, 416

dimyristoil phosphatidyl ethanolamine (DMPE) .................................................... 5, 11 dioleyl phosphatidyl (choline, ethanolamine) (DOPC, DOPE) .................................124, 209, 211, 273, 306, 307, 309, 310, 312, 313 egg yolk phosphatidic acid ....................................... 189 egg yolk phosphatidyl choline.................................. 189 egg yolk phosphatidyl ethanolamine........................ 189 1-palmitoyl-2-oleoylphosphatidyl choline (POPC) ............................................... 273, 432 1-palmitoyl-2-oleoylphosphatidyl glycerol (POPG) ............................................... 273, 380 phosphatidylinositol ................................................ 342 tetradecanoyl phosphocholine ................................. 342 Plasmids................. .......................................21–22, 30, 33, 48–50, 54, 71, 305, 307, 322 PROCHECK software ......................................... 324, 397 Protein analysis (absorption spectra) .......................7, 9, 32, 425 characterisation analytical centrifugation................................. 82, 93 light scattering ....................................11, 12, 82, 93 SDS-gels.............................................................. 81 concentration determination.................................... 5–9 concentrators .................................................50, 72, 82, 149, 161, 307, 327 data bank ............................ 79, 122, 263, 365, 406, 424 docking ....................................................364, 375, 381, 382, 399, 442, 444, 447–450 dynamics ..................................................51, 63, 64, 93, 113, 115, 122, 135, 142, 161, 233, 240, 244, 255, 261–263, 266, 271–272, 279, 294, 304, 305, 315, 336, 344, 363–365, 372–374, 376, 399, 403–418, 423–436, 442, 444, 445, 447, 448 fragments .................................................21, 22, 31, 83, 98, 144, 283–298, 368, 380, 399, 400, 445 over expression ...................................................4, 6, 29, 48, 53–59, 70, 71, 173, 263–264, 322, 324–326 polymers ...............................................42, 43, 221, 427 recombinant .................................................5, 6, 19–27, 29, 30, 33–35, 43, 306, 309, 324 refolding ............................. 39, 317, 322–324, 326, 327 sequence alignment..........................................366, 367, 375, 377, 388, 391–393, 395, 398, 399 Psi-blast software............................................389, 392, 399 P-type ATPases ..............................................120, 123, 238 Purification .............................................. 4–7, 9, 11, 16, 20, 22–25, 29–32, 35–42, 44, 48, 50, 62–64, 72, 73, 80, 81, 83, 92, 107, 108, 113, 120, 124, 161, 176, 210, 221, 264, 274, 305–310, 316, 323–327 PyMol software .....................................................240, 250, 324, 336, 365, 377, 378, 389, 397, 406

MEMBRANE PROTEIN STRUCTURE DETERMINATION 459 Index R Receptors...... ............................................. 5, 30, 39, 89, 99, 141–144, 147, 156, 163, 273, 285, 287–290, 292–294, 297, 365–368, 370, 375, 381, 382, 387, 410, 429–430, 433, 441–451 Resin embedded material .............................................. 233 Rhodopsin. ...............................98, 142, 144, 286, 287, 289, 290, 294, 296, 297, 365–367, 387, 395, 396, 400, 429, 430, 433, 434, 442, 444, 447, 448

S Saccharomyces cerevisiae ....................................................287 Sarcoplasmic reticulum ..........................................120, 121, 123, 124, 130, 239, 240, 303, 304, 310, 447 Screening..... ..............................................26, 93, 143–146, 148–153, 156, 157, 161, 162, 165, 190, 198–202, 204, 265, 279, 322–324, 327–328, 375, 449, 450, 452 Secondary structure .............................................64–66, 72, 93, 262, 267, 269, 275, 285–298, 366, 370, 379, 388, 393, 394, 434 Signal transduction ........................................142, 143, 161, 344, 404, 429, 433 Single particle analysis ........................................... 207–218 Solubilization.................................................25, 31, 35–36, 41, 44, 50, 62–64, 67–69, 72, 81, 82, 92, 106, 124, 192, 202, 210, 326 Spider software .............................................................. 182 Sterols............. ............................................................... 344 Structure evaluation......................................................... 410, 412 function relationship .......................................... 26, 450 prediction........................................... 65, 286, 328, 363, 366, 368, 369, 375, 388, 390, 393–395, 400, 436 protein data bank ................ 79, 122, 263, 365, 406, 424

Transmembrane beta barrel ................................................................ 435 domain............................................... 48, 126, 216, 266, 274, 279, 280, 287, 288, 292, 293, 295–297, 311, 312, 314, 315, 322, 324, 365, 366, 368–369, 394, 395 fragment ..................................................109, 113, 244, 253, 275, 285–287, 289–294, 298, 388, 406, 417, 444, 445 helix............ .............................. 284–292, 295, 298, 445 TSPO expression ...............................................30, 33–35, 265 ligands benzodiazepine ................................................ 5, 30 PK 11195 ................................................... 6, 13–16 purification ........................5, 7, 9, 16, 29–32, 35–42, 44 Two-dimensional crystals sheet (mono, multilayered) .............................. 111, 423 tubular...................................................................... 188

V Vapour diffusion hanging ................................................................ 83, 87 sitting drops ......................................................... 83, 87 Vectors pET15.. ............................................................... 30, 33 pET21b ..........................................................49, 53, 54 pET21c............................................................ 322, 323 pMalc2E .......................................................... 305, 307 Vibrational dynamics ..................................................... 122 Voltage dependant anionic channel (VDAC)............................ 26, 90, 106, 269, 270

W Water channel ............................................................... 172

T

X

Tertiary structure ............................ 107, 267, 283–298, 313 Three-dimensional crystals.............................................. 83 Three-dimensional reconstruction................................. 233 Tomograms algorithm ......................................................... 218, 230 backprojection ..................................209, 218, 229–230 weighted back projection ..................209, 218, 228, 230 TomoJ software ..............................................223, 226–231 Topology..... ...................................................128, 265, 279, 304, 314, 345, 367, 368, 373, 376, 397 Transfection bacteria ...................................................................... 33 cells....... ..........................................................33, 56, 57

Xmipp software ..................................................... 214, 216 X-ray crystallography .............................................80, 83, 106, 110, 122, 183, 188, 201, 202, 238, 279, 284, 297, 304, 412, 424 diffraction .....................................................91, 98, 122 scattering ................................ 86, 88, 90, 100, 411, 412

Y Yeast expression .....................................................26, 48, 124 strain.... ............................................... 21, 22, 26, 48, 49