Shedding light on the expansion and diversification of the Cdc48 protein family during the rise of the eukaryotic cell

15Citations
Citations of this article
25Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: A defining feature of eukaryotic cells is the presence of various distinct membrane-bound compartments with different metabolic roles. Material exchange between most compartments occurs via a sophisticated vesicle trafficking system. This intricate cellular architecture of eukaryotes appears to have emerged suddenly, about 2 billion years ago, from much less complex ancestors. How the eukaryotic cell acquired its internal complexity is poorly understood, partly because no prokaryotic precursors have been found for many key factors involved in compartmentalization. One exception is the Cdc48 protein family, which consists of several distinct classical ATPases associated with various cellular activities (AAA+) proteins with two consecutive AAA domains. Results: Here, we have classified the Cdc48 family through iterative use of hidden Markov models and tree building. We found only one type, Cdc48, in prokaryotes, although a set of eight diverged members that function at distinct subcellular compartments were retrieved from eukaryotes and were probably present in the last eukaryotic common ancestor (LECA). Pronounced changes in sequence and domain structure during the radiation into the LECA set are delineated. Moreover, our analysis brings to light lineage-specific losses and duplications that often reflect important biological changes. Remarkably, we also found evidence for internal duplications within the LECA set that probably occurred during the rise of the eukaryotic cell. Conclusions: Our analysis corroborates the idea that the diversification of the Cdc48 family is closely intertwined with the development of the compartments of the eukaryotic cell.

Figures

  • Fig. 1 Domain organization of the different human members of the Cdc48 family. The tandem D-domains, D1 and D2, are shown in grey. N-domains with φβ double barrel fold are shown in green; the deviating N-terminal domain of nuclear VCP-like (NVL) is shown as a white box. The putative second N-domains of Pex1 (N2) and of Pex6 (N1) are highlighted by dashed boxes. The larger inserts into the D1-domain of NVL and the D2-domain of Pex1 are shown in brown. The tail helices at the C-terminal end of D2-domains are indicated as black boxes. The bromodomain in the D2-domain of Yta7 is shown in blue and is located right after the N-terminal subdomain containing the Rossman fold. Note that vertebrates generally possess two Yta7 homologs, referred to as ATAD2 and ATAD2B; only one of the two human Yta7 variants is shown. Note that the detailed arrangement of the secondary structural elements of the two D-domains of Cdc48 is given in Additional file 3: Figure S2. The novel family member Spaf-like has been discovered in screens for chronic kidney disease [113, 114] and has also been found in several interactome studies (e.g. [64, 115–117], suggesting that it plays a role in selective protein degradation. Spaf-like constitutes a distinctive branch that has been not recognized clearly in earlier surveys, probably because this factor is present in only a few eukaryotic lineages. Generally, its domain structure is similar to that of Cdc48. However, as noted earlier [19], Spaf-like from Arabidopsis thaliana has no N-domain and contains a transmembrane region at its C-terminal end. As more sequence information is now available, we found that a C-terminal transmembrane region is shared by all Spaf-like from core eudicots, suggesting that the membrane anchor was gained in this lineage. By contrast, the loss of the N-domain appears to have occurred much earlier in plants, as we did not find it in most plants, apart from the green algae group Mamiellales (Ostreococcus, Micromonas). It cannot be excluded, however, that the absence of this domain in some species is caused by incomplete sequence assembly. Recurrently, we came across a few more diverged double-ring AAA ATPase sequences that formed longer branches in our phylogenetic trees and that appear to be more closely related to Cdc48 than to any other member of the family. As we discovered these sequences in several diverse lineages, including heterokonts, amoebozoa, a few green algae, and basal fungi, but not in animals, they might constitute another basal family member. We named this factor Cdc48-like, but cannot currently exclude the possiblity that Cdc48-like is a collection of more diverged Cdc48 variants that group together because of long-branch attraction
  • Fig. 2 Evolutionary tree of the individual AAA domains of the different Cdc48 family members. The tree was constructed from the individual AAA domains (D1- and D2-domains) of the Cdc48 family using a selection of 48 eukaryotic and 26 archaeal species, accounting for a total of 687 AAA domains. The representative species are listed in Additional file 2: Table S2. The different family members are highlighted in different colors, while D1- and D2-domains of the same protein have the same color. Statistical support values (likelihood-mapping/IQ-TREE support/RAxML support/ PhyML support) are given at selected inner edges. Most AAA domains form short branches and split into a D1 and a D2 subtree, in which the two domains of all archaeal Cdc48 are located close to the center of the tree, probably reflecting the fact that the eukaryotic family members are derived from a primordial VAT [19]. However, in the tree all archaea sequences, even from the recently found Lokiarchaeota, are well-separated from eukaryotic family members. The two more divergent D1-domains of the peroxins and the D2 domain of the N-ethylmaleimide sensitive factor (NSF) form long branches. Notably, the D2-domain of NSF is located in the D1 subtree, whereas the D1-domain of NSF is found in the D2 subtree. A similar pattern had been observed in earlier studies and it has been suggested that the two domains of NSF have been swapped during evolution [118]. Given the generally conserved architecture of the protein family, this scenario is not very likely [39]. It is much more probable that this branching pattern is caused by long-branch attraction. In fact, when we included the incomplete, long-branching D2-domain of Yta7, the branching pattern of the two NSF domains changed (Additional file 11: Figure S7)
  • Fig. 3 WebLogo representation of the key sequence elements of the Cdc48 family. Sequence logos were generated from alignments of the Ddomains of different Cdc48 family members using the WebLogo software [119]. Alignment contained more than 500 eukaryotes. The key regions involved in nucleotide binding and hydrolysis and the pore loop as defined by [13, 14, 18] are shown. The overall height of a stack indicates the sequence conservation at a certain position; the height of the symbols within the stack indicates the relative frequency of each amino acid at that position. The sequence logo of the entire alignment is provided in Additional file 12: Figue S8
  • Fig. 4 Evolutionary tree of the combined D1- and D2-domains for different Cdc48 family members. The tree was calculated using an alignment of concatenated D1- and D2-domains of 48 eukaryotic species comprising 270 sequences. The tree shows a distinct separation of the individual Cdc48 family members (highlighted in different colors as in Fig. 2). Statistical support values are annotated at selected inner edges. Note that the most divergent member, Yta7, was omitted
  • Table 1 Repertoire of the Cdc48 family members in selected parasitic lineages and species. A filled black circle indicates the presence of the particular family member in the genome of the organism. For comparison, the repertoire of Homo sapiens is given as well
  • Fig. 5 The evolutionary tree of Cdc48 supports the common ancestry of cryptophytes, alveolates, stramenopiles, and haptophytes (CASH lineages). The tree was calculated from the alignment of 687 Cdc48 sequences from a subset of eukaryotic species comprising archaeplastida and the CASH lineages. Statistical support values are annotated at selected inner edges. The tree splits into two main subsections. One subtree contains cytosolic Cdc48 working in the endoplasmic reticulum-associated protein degradation (ERAD) system. The other subtree includes a Cdc48 variant that transports proteins across the second-outermost membrane into the periplastidal compartment, a process referred to as symbiont-specific ERAD-like machinery (SELMA). The branching patterns supports the idea that SELMA Cdc48 is from a common red algal origin in CASH lineages. Note that the Cdc48 encoded by the nucleomorph of cryptophytes is more closely related to the ERAD Cdc48 of red algae. Although the sequences of SELMA Cdc48 diverged rapidly, its subtree still generally reflects the evolutionary relationships of the species. However, in the SELMA Cdc48 subtree, the missing lineages are those that have apparently lost their complex red plastid. Examples are ciliates, dinoflagellates, oomycetes, and cryptosporidians (Additional file 9: Table S3). All four nucleomorphs of cryptophyte algae contain genes for Cdc48 and nuclear VCP-like (NVL) [85], which is known to be involved in ribosome biogenesis. As the entire translation machinery is encoded in the nucleomorph, NVL might play a role in this process. Note that in previous studies, the nucleomorph-encoded Cdc48 and NVL were annotated as Cdc48a and b, respectively [85]. The nucleomorphs do not encode for N-ethylmaleimide sensitive factor (NSF) though, indicating that the red algae endosymbiont does not contain its own endomembrane system [76]

References Powered by Scopus

This article is free to access.

Get full text

This article is free to access.

Cited by Powered by Scopus

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Kienle, N., Kloepper, T. H., & Fasshauer, D. (2016). Shedding light on the expansion and diversification of the Cdc48 protein family during the rise of the eukaryotic cell. BMC Evolutionary Biology, 16(1), 1–17. https://doi.org/10.1186/s12862-016-0790-1

Readers over time

‘16‘17‘18‘19‘22‘23‘2402468

Readers' Seniority

Tooltip

Researcher 7

35%

Professor / Associate Prof. 6

30%

PhD / Post grad / Masters / Doc 6

30%

Lecturer / Post doc 1

5%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 8

44%

Biochemistry, Genetics and Molecular Bi... 7

39%

Chemistry 2

11%

Nursing and Health Professions 1

6%

Save time finding and organizing research with Mendeley

Sign up for free
0