6N2C

The Crystal Structure of Caldicellulosiruptor hydrothermalis Tapirin C-terminal domain


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.75 Å
  • R-Value Free: 0.211 
  • R-Value Work: 0.168 
  • R-Value Observed: 0.169 

wwPDB Validation   3D Report Full Report


This is version 2.0 of the entry. See complete history


Literature

Comparative Biochemical and Structural Analysis of Novel Cellulose Binding Proteins (Tapirins) from Extremely ThermophilicCaldicellulosiruptorSpecies.

Lee, L.L.Hart, W.S.Lunin, V.V.Alahuhta, M.Bomble, Y.J.Himmel, M.E.Blumer-Schuette, S.E.Adams, M.W.W.Kelly, R.M.

(2019) Appl Environ Microbiol 85

  • DOI: https://doi.org/10.1128/AEM.01983-18
  • Primary Citation of Related Structures:  
    6N2B, 6N2C

  • PubMed Abstract: 

    Genomes of extremely thermophilic Caldicellulosiruptor species encode novel cellulose binding proteins, called tāpirins, located proximate to the type IV pilus locus. The C-terminal domain of Caldicellulosiruptor kronotskyensis tāpirin 0844 (Calkro_0844) is structurally unique and has a cellulose binding affinity akin to that seen with family 3 carbohydrate binding modules (CBM3s). Here, full-length and C-terminal versions of tāpirins from Caldicellulosiruptor bescii (Athe_1870), Caldicellulosiruptor hydrothermalis (Calhy_0908), Caldicellulosiruptor kristjanssonii (Calkr_0826), and Caldicellulosiruptor naganoensis (NA10_0869) were produced recombinantly in Escherichia coli and compared to Calkro_0844. All five tāpirins bound to microcrystalline cellulose, switchgrass, poplar, and filter paper but not to xylan. Densitometry analysis of bound protein fractions visualized by SDS-PAGE revealed that Calhy_0908 and Calkr_0826 (from weakly cellulolytic species) associated with the cellulose substrates to a greater extent than Athe_1870, Calkro_0844, and NA10_0869 (from strongly cellulolytic species). Perhaps this relates to their specific needs to capture glucans released from lignocellulose by cellulases produced in Caldicellulosiruptor communities. Calkro_0844 and NA10_0869 share a higher degree of amino acid sequence identity (>80% identity) with each other than either does with Athe_1870 (∼50%). The levels of amino acid sequence identity of Calhy_0908 and Calkr_0826 to Calkro_0844 were only 16% and 36%, respectively, although the three-dimensional structures of their C-terminal binding regions were closely related. Unlike the parent strain, C. bescii mutants lacking the tāpirin genes did not bind to cellulose following short-term incubation, suggesting a role in cell association with plant biomass. Given the scarcity of carbohydrates in neutral terrestrial hot springs, tāpirins likely help scavenge carbohydrates from lignocellulose to support growth and survival of Caldicellulosiruptor species. IMPORTANCE The mechanisms by which microorganisms attach to and degrade lignocellulose are important to understand if effective approaches for conversion of plant biomass into fuels and chemicals are to be developed. Caldicellulosiruptor species grow on carbohydrates from lignocellulose at elevated temperatures and have biotechnological significance for that reason. Novel cellulose binding proteins, called tāpirins, are involved in the way that Caldicellulosiruptor species interact with microcrystalline cellulose, and additional information about the diversity of these proteins across the genus, including binding affinity and three-dimensional structural comparisons, is provided here.


  • Organizational Affiliation

    Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, North Carolina, USA.


Macromolecules
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
Tapirin
A, B
603Caldicellulosiruptor hydrothermalis 108Mutation(s): 0 
Gene Names: Calhy_0908
UniProt
Find proteins for E4Q7C4 (Caldicellulosiruptor hydrothermalis (strain DSM 18901 / VKM B-2411 / 108))
Explore E4Q7C4 
Go to UniProtKB:  E4Q7C4
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupE4Q7C4
Sequence Annotations
Expand
  • Reference Sequence
Small Molecules
Ligands 6 Unique
IDChains Name / Formula / InChI Key2D Diagram3D Interactions
GOL
Query on GOL

Download Ideal Coordinates CCD File 
E [auth A],
K [auth B]
GLYCEROL
C3 H8 O3
PEDCQBHIVMGVHV-UHFFFAOYSA-N
OXL
Query on OXL

Download Ideal Coordinates CCD File 
J [auth B]OXALATE ION
C2 O4
MUBZPKHOEPUJKR-UHFFFAOYSA-L
GOA
Query on GOA

Download Ideal Coordinates CCD File 
H [auth B]GLYCOLIC ACID
C2 H4 O3
AEMRFAOFKBGASW-UHFFFAOYSA-N
ZN
Query on ZN

Download Ideal Coordinates CCD File 
C [auth A],
F [auth B]
ZINC ION
Zn
PTFCDOFLOPIGGS-UHFFFAOYSA-N
EDO
Query on EDO

Download Ideal Coordinates CCD File 
D [auth A],
I [auth B]
1,2-ETHANEDIOL
C2 H6 O2
LYCAIKOWRPUZTN-UHFFFAOYSA-N
CL
Query on CL

Download Ideal Coordinates CCD File 
G [auth B]CHLORIDE ION
Cl
VEXZGXHMUGYJMC-UHFFFAOYSA-M
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.75 Å
  • R-Value Free: 0.211 
  • R-Value Work: 0.168 
  • R-Value Observed: 0.169 
  • Space Group: P 21 21 21
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 76.16α = 90
b = 90.43β = 90
c = 158.449γ = 90
Software Package:
Software NamePurpose
REFMACrefinement
PROTEUM PLUSdata collection
PROTEUM PLUSdata reduction
CRANK2phasing

Structure Validation

View Full Validation Report



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
Department of Energy (DOE, United States)United States--

Revision History  (Full details and data files)

  • Version 1.0: 2018-12-19
    Type: Initial release
  • Version 1.1: 2019-02-06
    Changes: Data collection, Database references
  • Version 1.2: 2019-12-04
    Changes: Author supporting evidence
  • Version 2.0: 2023-11-15
    Changes: Atomic model, Data collection, Database references, Derived calculations, Refinement description