Key findings are
the following:
• The sea urchin is
estimated to have 23,300 genes with representatives of nearly all
vertebrate gene families, although often
the families are not as large as in
vertebrates.
• Some genes thought to be
vertebrate-specific were found in the sea urchin
(deuterostome-specific); others were
identified in sea urchin but not the chordate
lineage, which suggests loss in the
vertebrates.
• Expansion of some gene
families occurred apparently independently in the sea
urchin and vertebrates.
• The sea urchin has a
diverse and sophisticated immune system mediated by an
astonishingly large repertoire of innate
pathogen recognition proteins.
• An extensive defensome
was identified.
• The sea urchin has
orthologs of genes associated with vision, hearing, balance, and
chemosensation in vertebrates, which
suggests hitherto unknown sensory
capabilities.
• Distinct genes for
biomineralization exist in the sea urchin and vertebrates.
• Orthologs of many human
disease–associated genes were found in the sea urchin.
Details:
1) Genome
features
A window on the genetic
landscape is scaffold-centric in S. purpuratus, because linkage and
cytogenetic maps are not
available. The 36.9% GC content of the genome is uniformly low
because assessment of the
average GC content by domains is consistent (36.8%), and the
distribution is tight (see
SOM). Genes from the OGS show no tendency to occupy regions of
higher- or lower-than-average
GC content. In fact, nearly all genes lie in regions of 35 to
39% GC.
2) The
Echinoderm Genome in the Context of Metazoan Evolution
Two of the
most abundant domains make up
3% of the total and mark genes that are involved in the
innate immune response. Others
define proteins associated with apoptosis and cell death
regulation, as well as
proteins that serve as downstream effectors in the Toll–interleukin 1
(IL-1) receptor (TIR) cascade.
The quinoprotein amine dehydrogenase domain seen in the
sea urchin set is 10 times as
abundant as in other representative genomes and may be used in
the systems of quinone-containing
pigments known to occur in these marine animals. The
large number of nucleosomal
histone domains found agrees with the long-established sea
urchin–specific expansion of
histone genes. In summary, the distribution of proteins among
these conserved families shows
the trend of expansion and shrinkage of the preexisting
protein families, rather than
frequent gene innovation or loss. Gene family sizes in the sea
urchin are more closely
correlated with what is seen in deuterostomes than what is seen in
the protostomes.
Of equal interest are the
sorts of proteins not found in sea urchins. The sea urchin gene set
shares with other bilaterian
gene models about 4000 domains, whereas 1375 domains from
other bilaterian genomes are
not found in the sea urchin set.
In agreement with the lack of
morphological evidence of gap
junctions in sea urchins, there are no gap junction proteins
(connexins, pannexins, and
innexins).
Also missing are several protein domains unique to
insects, such as insect
cuticle protein, chitin-binding protein, and several pheromone- or
odorant-binding proteins, as
well as a vertebrate invention—the Krüppel-associated box or
KRAB domain, a repressor
domain in zinc finger transcription factors (12).
Finally, searches
for specific subfamilies of G
protein–coupled receptors (GPCRs) that are known as
chemosensory and/or odorant
receptors in distinct bilaterian phyla failed to detect clear
representatives in the sea
urchin genome.
However, this failure more likely reflects the
independent evolution of these
receptors, rather than a lack of chemoreceptive molecules,
because the sea urchin genome
encodes close to 900 GPCRs of the same superfamily
(rhodopsin-type GPCRs),
several of which are expressed in sensory structures (13).
A
conservative way to compare
gene sets is to count the strict orthologs that give reciprocal
BLAST matches. Genes that are
genuine orthologs are likely to yield each other as a best
hit. Comparison of sea urchin,
fruit fly, nematode, ascidian, mouse, and human gene sets
(Fig. 2) indicates that the
greatest number of reciprocal best matches is observed between
mouse and human, which
reflects their close relation. The numbers of presumed orthologous
genes between the ascidian and
the two mammals are about equal, but are less than the
number counted between these
species and the sea urchin. The difference is consistent with
the lower gene number and
reduced genome size in the urochordates (4).
The number of reciprocal pairs
for sea urchin and mouse is about 1.5 times the matches
between proteins in sea urchin
and fruit fly. The number of nematode proteins matching
either sea urchin or fruit fly
is even lower. This is likely the result of the more rapid
sequence changes in the
nematode compared with the other species used in this analysis.
More than 75% of the genes
that are shared by sea urchin and fruit fly are also shared
between sea urchin and mouse.
Thus, these genes constitute a set of genes common to the
bilaterians, whereas the
additional sea urchin–mouse pairs are unique to the deuterostomes.
The sea urchin genome
consequently provides evidence for the now extremely robust
concept of the deuterostome
superclade. A 1908 concept that originated in the form of
embryos of dissimilar species
(14) is demonstrated by genomic comparisons.
3) Developmental
Genomics
In the 1980s, the sea urchin
embryo became the focus of cis-regulatory analyses of
embryonic gene expression, and
there was a great expansion of molecular explorations of
the developmental cell
biology, signaling interactions, and regulatory control systems of the
embryo. Analysis of the entire
genome facilitated the first large-scale correlation of the gene
regulatory network for
development, which represents the genomic control circuitry for
specification of the endoderm
and mesoderm of this embryo (15–17)with the encoded
potential of the sea urchin.
The embryo
transcriptome and regulome
Because of indirect
development in the sea urchin, embryogenesis is cleanly separated from
adult body plan formation, in
developmental process and in time, and therefore, it is possible
to estimate the genetic
repertoire specifically required for formation of a simple embryo
(10). Pooled mRNA preparations
from four stages of development, up to the mid-late
gastrula stage (48 hours),
were hybridized with a whole-genome tiling array. Expression of
about 12,000 to 13,000 genes,
as conservatively assessed, was seen during this early period,
indicating that ~52% of the
entire protein-coding capacity of the sea urchin genome is
Page 6
Science. Author
manuscript; available in PMC 2011 August 22.
NIH-PA Author Manuscript
NIH-PA Author Manuscript NIH-PA Author Manuscript
expressed during development
to the mid-late gastrula stage. An additional set of microarray
experiments extended the
interrogation of embryonic expression to the 3-day pluteus larva
stage (see SOM) (18).
The DNA binding domains of
transcription factor families are conserved across the
Bilateria, and these protein
domain motifs were used to extract the sea urchin homologs (see
SOM). For each identified
gene, if data were not already available, probes were built from
the genome sequence and used
to measure transcript concentration by quantitative
polymerase chain reaction with
a time series of embryo mRNAs, as well as to determine
spatial expression by
whole-mount in situ hybridization.
All bilaterian transcription
factor families were represented in the sea urchin with a few rare
exceptions (see below), so the
sea urchin data strongly substantiate the concept of a
panbilaterian regulatory tool
kit (19) or “regulome.” We found that 80% of the whole sea
urchin regulome (except the
zinc finger genes) was expressed by 48 hours of embryogenesis
(20), an even greater genetic
investment than the 52% total gene use in the same embryo.
4) Signal
transduction pathways
More than 1200 genes involved
in signal transduction were identified. Comparative analysis
highlights include the protein
kinases that mediate the majority of signaling and
coordination of complex
pathways in eukaryotes. The S. purpuratus genome has 353 protein
kinases, intermediate between
the core vertebrate set of 510 and the fruit fly and nematode
conserved sets of ~230.
Fine-scale classification and comparison with annotated kinomes
(21, 22) reveals a remarkable
parsimony. Indeed, with only 68% of the total number of
human kinases, the sea urchin
has members of 97% of the human kinase subfamilies,
lacking just four of those
subfamilies (Axl, FastK, H11, and NKF3),whereas Drosophila
lacks 20 and nematodes 32
(Fig. 3) (23). Most sea urchin kinase subfamilies have just a
single member, although many
are expanded in vertebrates; thus, the sea urchin kinome is
largely nonredundant. The sea
urchin therefore possesses a kinase diversity surprisingly
comparable to that of
vertebrates without the complexity. A small number of kinases were
more similar to insect than to
vertebrate homologs (including the Titin homolog Projection,
the Syk-like tyrosine kinase
Shark, and several guanylate cyclases), which indicated for the
first time the loss of kinase
classes in vertebrates (23). Expression profiling showed that
87% of the signaling kinases
and 80% of the 91 phosphatases were expressed in the embryo
(23, 24), which emphasized the
importance of signaling pathways in embryonic
development.
The small guanosine
triphosphatases (GTPases) function as molecular switches in signal
transduction, nuclear import and
export, lipid metabolism, and vesicle docking. Vertebrate
GTPase families were expanded
after their divergence from echinoderms, in part by wholegenome
duplications (25–27). The sea
urchin genome did not undergo a whole-genome
duplication, yet phylogenies
for four Ras GTPase families (Ras, Rho, Rab, and Arf) revealed
that local gene duplications
occurred (Fig. 4), which ultimately resulted in a comparable
number of monomeric GTPases in
the human and sea urchin genomes (28). Thus, expansion
of each family in vertebrates
and echinoderms was achieved by distinct mechanisms (genespecific
versus whole genome
duplication). More than 90% of the small GTPases are
expressed during sea urchin
embryogenesis, which suggests that the complexity of signaling
through GTPases is comparable
between sea urchins and vertebrates.
The Wnt family of secreted
signaling molecules plays a central role in specification and
patterning during embryonic
development. Phylogenetic analyses from cnidarian to human
indicate that of the 13 known
Wnt subfamilies, S. purpuratus has 11, missing Wnt2 and
Wnt11 homologs (Fig. 5). S.
purpuratus has WntA, previously reported as being absent
Page 7
Science. Author
manuscript; available in PMC 2011 August 22.
NIH-PA Author Manuscript
NIH-PA Author Manuscript NIH-PA Author Manuscript
from deuterostomes (29). Of
126 genes described as components of the Wnt signal
transduction machinery,
homologs of ~90% were present in the sea urchin genome, which
indicates a high level of
conservation of all three Wnt pathways (30). However, of 94 Wnt
transcriptional target genes
reported in the literature, mostly from vertebrates (31), only 53%
were found with high
confidence in the sea urchin genome (Fig. 6). The absent Wnt targets
include vertebrate adhesion
molecules, which were frequently missing from the sea urchin
genome (32), as well as
signaling receptors, which are more divergent and thus more
difficult to identify. In
contrast, most transcription factor targets of the Wnt pathway are
present in the genome, which
reflects a higher degree of conservation of transcription factor
families (20). Taken together,
the genomic analysis of signal transduction components
indicates that sea urchins
have signaling machinery strikingly comparable to that of
vertebrates, often without the
complexity that arises from genetic redundancy.
5) Defense
Systems
The need to deal with
physical, chemical, and biological challenges in the environment
underlies the evolution of an
array of defense gene families and pathways. One set of
protective mechanisms involves
the immune system, which responds to biotic stressors such
as pathogens. A second group
of genes comprises a chemical “defensome,” anetworkof
stress-sensing transcription
factors and defense proteins that transform and eliminate many
potentially toxic chemicals.
6) The sea
urchin immune system
The sea urchin has a greatly
expanded innate immunity repertoire compared with any other
animal studied to date (table
S5). Three classes of innate receptor proteins are particularly
increased (Fig. 7). These make
up a vast family of Toll-like receptors (TLRs), a similarly
large family of genes that
encode NACHT and leucine-rich repeat (LRR)–containing
proteins (NLRs), and a set of
genes encoding multiple scavenger receptor cysteine-rich
(SRCR) domain proteins of a
class highly expressed in the sea urchin immune cells or
coelomocytes (33, 34).
Receptors from each of these families participate in immunity by
recognizing nonself molecules
that are conserved in pathogens or by responding to self
molecules that indicate the
presence of infection (35). In contrast, homologs of signal
transduction proteins and
nuclear factor kappa B (NFκB)/Rel domain
transcription factors
that are known to function
further downstream of these genes were present in numbers
similar to those in other
invertebrate species. One of the more unexpected findings from our
analysis of sea urchin immune
genes was the identification of a Rag1/2-like gene cluster
(36). The presence of this
cluster, along with other recent findings (37), suggested the
possibility that these genes
had been part of animal genomes for longer than previously
considered. Further analysis
of the genomic insights into the innate immune system and the
underpinnings of vertebrate
adaptive immunity can be found in a review in this issue (38).
The
complement system
The complement system of
vertebrates is a complex array of soluble serum proteins and
cellular receptors arranged
into three activation pathways (classical, lectin, and alternative)
that converge and activate the
terminal or lytic pathway. This system opsonizes pathogenic
cells for phagocytosis and
sometimes activates the terminal pathway, which leads to
Page 8
Science. Author
manuscript; available in PMC 2011 August 22.
NIH-PA Author Manuscript
NIH-PA Author Manuscript NIH-PA Author Manuscript
pathogen destruction. An
invertebrate complement system was first identified in the sea
urchin [for reviews, see (39,
40)], and the analysis of the genome sequence presented a more
complete picture of this
important immune effector system. In chordates, collectins initiate
the lectin cascade through
members of the mannose-binding protein (MBP)–associated
protease (MASP)/C1r/C1s
family. Several genes encoding collectins, C1q and MBP, have
been predicted (39) and were
present in the genome; however, members of the MASP/C1r/
C1s family were not
identified. There was no evidence for the classical pathway, which
links the complement cascade
with immunoglobulin recognition in jawed vertebrates. The
alternative pathway is
initiated by members of the thioester protein family, which, in the sea
urchin, was somewhat expanded
with four genes. Two of the thioester proteins, SpC3 and
SpC3-2, are known to be
expressed, respectively, in coelomocytes and in embryos and
larvae. Furthermore, there
were three homologs of factor B, the second member of the
alternative pathway (41).
The terminal complement
pathway in vertebrates acts to destroy pathogens or pathogeninfected
cells with large pores called
membrane attack complexes (MACs). Twenty-eight
gene models were identified
that encode MAC-perforin domains, but none of these had the
additional domains expected
for terminal complement factors (C6 through C9). Instead,
these are members of a novel
and very interesting gene family with perforin-like structure.
In vertebrates, perforins
carry out cell-killing functions by cytotoxic lymphocytes through
the formation of small pores
in the cell membranes. If the complement system in the sea
urchin functions through
multiple lectin and alternative pathways in the absence of the lytic
functions of the terminal
pathway, the major activity of this system is expected to be
opsonization.
7) Homologs of
immune regulatory proteins
Cytokines are key regulators
of intercellular communication involving immune cells, acting
to coordinate vertebrate
immune systems. Genes encoding cytokines and their receptors
often evolve at a rapid pace,
and most families are known only from vertebrate systems.
Although members of many
cytokine, chemokine, and receptor families were not identified
in the sea urchin genome, a
number of important immune signaling homologs were present.
These included members of the
tumor necrosis factor (TNF) ligand and receptor
superfamilies, an IL-1
receptor and accessory proteins, two IL-17 receptor–like genes and
30 IL-17 family ligands, and
nine macrophage inhibitory factor (MIF)–like genes. Receptor
tyrosine kinases (RTKs)
included those that bind important growth factors that regulate cell
proliferation in vertebrate
hematopoietic systems. Of particular note, from the sea urchin
genome, were two vascular
endothelial growth factor (VEGF) receptor–like genes and a
Tie1/2 receptor, all of which
were expressed in adult coelomocytes. Many of these genes are
homologs of important
inflammatory regulators and growth factors in higher vertebrates,
and these sea urchin homologs
may have similar functions in regulating coelomocyte
differentiation and recruitment.
Representatives of nearly all
subclasses of important vertebrate hematopoietic and immune
transcription factors were
present in the sea urchin genome. Notably, the genome contained
homologs of immune
transcription factors that had not been identified previously outside of
chordates, including
PU.1/SpiB/SpiC, a member of the Ets subfamily, and a zinc finger gene
with similarity to the Ikaros
subfamily. Transcript prevalence measurements showed that
PU.1, the Ikaros-like gene and
homologs of Gata1/2/3, E2A/HEB/ITF2, and Stem Cell
Leukemia (SCL) were all
expressed at substantial levels in coelomocytes (41). This was
consistent with the presence
of conserved mechanisms of regulating gene expression among
sea urchin coelomocytes and
vertebrate blood cells.
Page 9
Science. Author
manuscript; available in PMC 2011 August 22.
NIH-PA Author Manuscript
NIH-PA Author Manuscript NIH-PA Author Manuscript
8) ABC
transporters
Many chemicals are removed
from cells by efflux proteins known as ATP-binding cassette
(ABC) or multidrug efflux
transporters. S. purpuratus has 65 ABC transporter genes in the
eight major subfamilies of
these genes [ABC A to H; (42)]. The ABCC family of multidrug
transporters is about 25%
larger than in other deuterostome genomes with at least 30 genes
in this family (nearly half of
the sea urchin ABC transporters), and 25 of these 30 genes
showed substantial mRNA
expression in eggs, embryos, or larvae. Much of the expansion is
in the Sp-ABCC5 and Sp-ABCC9
families, whereas orthologs of the vertebrate gene
ABCC2 (also called MRP2) are
absent. Because the ABCC family is known to generally
transport more hydrophilic
compounds than other transporter families, such as the ABCB
genes, sea urchins may have
increased need for transport of these compounds. ABCC efflux
activity has been described in
sea urchin embryos and, consistent with the genomic
expansion of the ABCC family,
the major activity in early embryos ensues from an ABCClike
efflux mechanism.
9) Cytochrome
P-450 monooxygenase (CYP)
Enzymes in the CYP1, CYP2,
CYP3, and CYP4 families carry out oxidative
biotransformation of chemicals
to more hydrophilic products. The sea urchin has 120 CYP
genes, and those related to
CYP gene families 1 to 4 constitute 80% of the total, which
suggests that there has been
selective pressure to expand functionality in these gene families
(42). Eleven CYP1-like genes
are present in the sea urchin genome, more than twice the
number in chordates. CYP2-like
and CYP3-like genes are also present at greater numbers
than in other deuterostomes.
In addition to the CYPs in families 1 to 4, the sea urchin
genome contains homologs of
proteins involved in developmental patterning (CYP26),
cholesterol synthesis (CYP51),
and metabolism (CYP27, CYP46). Homologs of some CYPs
with endogenous functions in
vertebrates were not found; however, (CYP19, androgen
aromatase; CYP8, prostacyclin
synthase; CYP11, pregnenolone synthase; CYP7,
cholesterol-7α-hydroxylase). These CYP genes in
concert with additional expanded
defensive gene families represent
a large diversification of defense gene families by the sea
urchin relative to mammals
(42).
10) Oxidative
defense and metal-complexing proteins
The metal-complexing proteins
include three metallothionein genes and three homologs of
phytochelatin synthase genes.
Genes for antioxidant proteins include three superoxide
dismutase (SOD) genes and a
gene encoding ovoperoxidase (an unusual peroxidase with
SOD-like activity), along with
one catalase, four glutathione peroxidase, and at least three
thioredoxin peroxidase genes.
Reactive oxygen detoxification genes may be important in
conferring the long life-span
of sea urchins, because oxidative damage is thought to be a
major factor in aging.
11) Diversity and
conservation in xenobiotic signaling
The diversity of genes
encoding xenobiotic-sensing transcription factors that regulate
biotransformation enzymes and
transporters was similar to other invertebrate genomes, but
in most cases lower than
vertebrates. For example, the sea urchin genome encoded a single
predicted CNC-bZIP protein
homologous to the four human CNC-bZIP proteins involved in
the response to oxidative
stress. There were two sea urchin homologs of the aryl
hydrocarbon receptor (AHR),
which in vertebrates mediates the transcriptional response to
polynuclear and halogenated
aromatic hydrocarbons and, in both protostomes and
deuterostomes, also regulates
specific developmental processes (43–45). One of the sea
urchin AHR homologs was more
closely related to the vertebrate AHR; the other shared
greatest sequence identity
with the Drosophila AHR homolog spineless. Sea urchins also
Page 10
Science. Author
manuscript; available in PMC 2011 August 22.
NIH-PA Author Manuscript
NIH-PA Author Manuscript NIH-PA Author Manuscript
had two genes encoding hypoxia-inducible
factors (HIFα subunits),
which regulate adaptive
responses to hypoxia, and a
gene encoding ARNT, a PAS protein that is a dimerization
partner for both AHRs and
HIFs.
Strongylocentrotus
purpuratus has 32 nuclear receptor (NR) genes (20), two-thirds the
number in humans, including
several with potential roles in chemical defense (42). The sea
urchin genome also contains
two peroxisome proliferator–activated receptor (PPAR, NR1C)
homologs and an NR1H gene
coorthologous to both liver X receptor (LXR) and farnesoid X
receptor (FXR) (42). Genes
homologous to the vertebrate xenobiotic sensor NR1I genes
[pregnane X receptor, PXR;
constitutive androstane receptor, CAR (46)] are absent,
although three NR1H-related
genes were found, which possibly form a new subfamily of
genes involved in xenobiotic
sensing.
Many of the defense genes are
expressed during development (10, 42), which suggests that
they have dual roles in
chemical defense and in developmental signaling. In several cases
(CYPs, AHR, NF-E2), the evolution
of pathways for chemical defense may have involved
recruitment from developmental
signaling pathways (42).
12) Nervous
System
The echinoderm nervous system
is the least well studied of all the major metazoan phyla.
For a number of technical
reasons, the structure and function of echinoderm nerves have
been neglected. Analysis of
the sea urchin genome has enabled an unprecedented glimpse
into the neural and sensory
functions and has revealed several novel molecular approaches
to the study of echinoderm
nervous systems (Table 3).
The nervous systems of
echinoderm larvae and adults are dispersed, but they are not simple
nerve nets. This organization
differs from both vertebrates, which do not have a dispersed
nervous system, and
hemichordates, which do have nerve nets (47). Adult sea urchins have
thousands of appendages, each
with sensory neurons, ganglia, and motor neurons arranged
in local reflex arcs. These
peripheral appendages are connected to each other and to radial
nerves, which provide overall
control and coordination (47, 48).
Nearly all of the genes
encoding known neurogenic transcription factors are present in the
sea urchin genome, and several
are expressed in neurogenic domains before gastrulation,
which indicates that they may
operate near the top of a conserved neural gene regulatory
network (47). Axon guidance
molecules known from other metazoans are also expressed in
the developing embryo.
Unexpectedly, genes encoding the neurotrophin-Trk receptor system
that were thought to be
vertebrate-specific because they were not found in Ciona, are present
in sea urchin, which suggests
a deuterostome origin and a potential loss in urochordates.
The genes required to
construct neurons and to transmit signals are present, but the
repertoire of neural genes and
the initial characterization of expression of a number of them
led to unexpected and
surprising conclusions. There appear to be no genes encoding gap
junction proteins, which
suggests that communication among neurons depends on chemical
synapses without ionic
coupling. The repertoire of sea urchin neurotransmitters is large, but
melatonin and adrenalin are
lacking, as they are in ascidians (4, 47). Cannabinoid,
lysophospholipid, and
melanocortin receptors are not present in urchins, but orthologs were
found in ascidians (4, 47). In
contrast, some sets of genes thought to be chordate-specific
have sea urchin orthologs, for
example, insulin and insulin-like growth factors (IGFs) that
are more similar to their
chordate counterparts than those of other invertebrates (47).
Overall, the genome contains
representatives of all five large superfamilies of GPCRs,
including those that mediate
signals from neuropeptides and peptide hormones. Both the
secretin and rhodopsin
superfamilies display marked lineage-specific expansions (13, 47).
Page 11
Science. Author
manuscript; available in PMC 2011 August 22.
NIH-PA Author Manuscript
NIH-PA Author Manuscript NIH-PA Author Manuscript
13) Sensory
systems
There were 200 to 700 putative
chemosensory genes that formed large clusters and lacked
introns, which are features of
chemosensory genes in vertebrates, but not in Caenorhabditis
elegans and Drosophila
melanogaster. Many of these genes encoded amino acid motifs that
were characteristic of
vertebrate chemosensory and odorant receptors (13, 47). Sea urchins
had an elaborate collection of
photoreceptor genes that quite surprisingly appeared to be
expressed in tube feet (13,
47). These included many genes encoding transcription factors
regulating retinal development
and a photorhodopsin gene.
Human Usher syndromes are
genetic diseases affecting hearing, balance, and retinitis
pigmentosa (retinal
photoreceptor degeneration). Most of the genes involved have been
identified, and they encode a
set of membrane and cytoskeletal proteins that form an
interacting network that
controls the arrangement of mechanosensory stereocilia in hair cells
of the mammalian ear. Many or
all of the proteins play some roles in photoreceptor
organization and/or
maintenance. Orthologs of virtually the entire set of membrane and
cytoskeletal proteins of the
Usher syndrome network were found in the sea urchin genome.
These include the very large
membrane proteins, usherin and VLGR-1 and large cadherins
(Cadh23 and possibly Pcad15),
all of which participate in forming links between stereocilia
in mammalian hair cells, as
well as myosin 7 and 15, two PDZ proteins (harmonin and
whirlin) and another adaptor
protein (SANS), which participate in linking these membrane
proteins to the cytoskeleton.
In addition, two membrane transporters, NBC (a candidate
Usher syndrome target known to
interact with harmonin) and TrpA1 (the mechanosensory
channel connected to the tip
links containing cadherin 23), have orthologs in the sea urchin
genome. Sea urchins do not have
ears or eyes, so they must deploy these proteins in other
sensory processes. Sea urchins
respond to light, touch, and displacement and probably use
some of same sensory genes
used by vertebrates.
14) The
Echinoderm Adhesome
The S. purpuratus genome
contained representatives of all the standard metazoan adhesion
receptors (table S7), but the
emphasis on different classes of receptors differed substantially
from that used by vertebrates.
The integrin family was intermediate in size between those of
protostomes and
vertebrates—several chordate-specific expansions of the integrin repertoire
were absent, and there were
some expansions unique (so far) to echinoderms. The cadherin
repertoire was also small
relative to vertebrates (a dozen or so instead of over a hundred),
and many chordate-specific
expansions were missing. Specialized large cadherins shared by
protostomes and vertebrates
were present, as well as some specialized large cadherins
previously thought to be
chordate-specific, but overall, the cadherin repertoire was more
invertebrate than vertebrate
in character. Sea urchins lacked the integrins and cadherins that
link to intermediate filaments
in vertebrates.
In contrast, sea urchins had
large repertoires of adhesion molecules containing
immunoglobulin superfamily,
fibronectin type 3 repeat (FN3), epidermal growth factor
(EGF), and LRR repeats. In
addition to the expansion of TLRs and NLRs mentioned above,
there are large expansions of
other LRR receptor families, including GPCRs (32). The key
neural adhesion systems
involved in regulating axonal outgrowth were present (netrin/Unc5/
DCC; Slit/Robo; and
semaphorins/plexins), as were adhesion molecules involved in
synaptogenesis (Agrin/MUSK;
and neurexin/neuroligins). This was not surprising because
these molecules were known in
both protostomes and vertebrates. However, structurally, the
synapses of echinoderms are
unusual because there are no direct synaptic contacts (49).
Some of them were expressed in
sea urchin embryos before there are any neurons,
suggesting that they may have
other roles as well.
Page 12
Science. Author
manuscript; available in PMC 2011 August 22.
NIH-PA Author Manuscript
NIH-PA Author Manuscript NIH-PA Author Manuscript
The basic metazoan basement
membrane extracellular matrix (ECM) tool kit was present—
two alpha-IV collagen genes,
perlecan, laminin subunits, nidogen, and collagen XV/XVIII.
There did not appear to be
much, if any, expansion of these gene families, as is found in
vertebrates, which suggests
that there is less diversity among basement membranes. Quite a
few ECM proteins present in
chordates, but not protostomes, were also missing in sea
urchins, including
fibronectins, tenascins, von Willebrand factor, vitronectin, most
vertebrate-type matrix
proteoglycans, and complex VWA/FN3 collagens among others (32).
Absence of these genes may be
related to the absences of neural crest migration, a high
shear endothelial-lined
vasculature and, of course, cartilage and bone.
In addition to the components
of Usher syndromes mentioned above, it was surprising to
find a clear ortholog of
reelin, a large ECM protein involved in establishing the layered
organization of neurons in the
vertebrate cerebral cortex. Reelin is mutated in the reeler
mouse, and mutations in the reeler
gene in humans have been associated with Norman-
Roberts-type lissencephaly
syndrome. Reelin has a unique domain composition and
organization (Reeler, EGF,
BNR) that has not been found outside chordates, but the sea
urchin genome included a very
good homolog of reelin. Receptors for reelin are believed to
include low-density
lipoprotein receptor–related proteins (LRPs), and there are a number of
these receptors in S.
purpuratus although it is as yet unclear whether they are reelin
receptors, lipoprotein
receptors, or something else. Similar receptors are also involved in
human disease
(atherosclerosis).
15) Biomineralization
Genes
Among the deuterostomes, only
echinoderms and vertebrates produce extensive skeletons.
The possible evolutionary
relations between biomineralization processes in these two groups
have been controversial.
Analysis of the S. purpuratus genome revealed major differences in
the proteins that mediate
biomineralization in echinoderms and vertebrates (50). First, there
were few sea urchin counterparts
of extracellular proteins that mediate biomineral deposition
in vertebrates. For example,
in vertebrates, an important class of proteins involved in
biomineralization is the
family of secreted, calcium-binding phosphoproteins, or SCPPs. Sea
urchins did not have
counterparts of SCPP genes, which supports the hypothesis that this
family arose via a series of
gene duplications after the echinoderm-chordate divergence (51).
Second, almost all of the
proteins that have been directly implicated in the control of
biomineralization in sea
urchins were specific to that clade. The echinoderm skeleton
consists of magnesium calcite
(as distinct from the calcium phosphate skeletons of
vertebrates) in which is
occluded many secreted matrix proteins. The sea urchin spicule
matrix proteins were encoded
by a family of 16 genes that are organized in small clusters
and likely proliferated by
gene duplication. Counterparts of sea urchin spicule matrix genes
were not found in vertebrates,
amphioxus, or ascidians. Likewise, other genes that have been
implicated in
biomineralization in sea urchins, including genes that encode the
transmembrane protein P16 and
MSP130, a glycosylphosphatidylinositol-linked
glycoprotein, were members of
small clusters of closely related genes without apparent
homologs in other
deuterostomes. The members of all three of these sea urchin–specific
gene families were expressed
specifically by the biomineral-forming cells of the embryo, the
primary mesenchyme cells [see
(50)]. As a whole, these findings highlighted substantial
differences in the primary
sequences of the proteins that mediate biomineralization in
echinoderms and vertebrates.
16) Cytoskeletal
genes
In addition to identifying
genes for all previously known S. purpuratus actins and tubulins,
one δ- and two ε-tubulin genes were found (52). Newly
identified motor protein genes
include members of four more
classes of myosin, and eight more families of kinesins. The
Page 13
Science. Author
manuscript; available in PMC 2011 August 22.
NIH-PA Author Manuscript
NIH-PA Author Manuscript NIH-PA Author Manuscript
first dynein cloned and
sequenced was from sea urchin, and although most S. purpuratus
dynein heavy chain genes
mapped one-to-one to mammalian homologs, Sp-DNAH9 mapped
one-to-three, as it was equidistant
between the closely similar mammalian genes DNAH9,
DNA11, and DNAH17 (52).
17) Conclusions
Our estimate of 23,300 genes
is similar to estimates for vertebrates, despite the fact that two
whole-genome duplications are
believed to have occurred in the chordate lineage after
divergence from the lineage
leading to the echinoderms (25–27).
From the analysis
presented here, it seems
likely that many mechanisms shaped the final genetic content of
these genomes. On the one
hand, there are cases of gene families that are expanded in
vertebrates compared with sea
urchin, including examples of the expected 4:1 ratio from two
duplications (15). However
other patterns are also found.
The nuclear receptor family is only
slightly reduced in sea urchin
compared with that of humans, which suggests gene loss
followed the vertebrate
duplications.
The unprecedented expansions of innate immune
system diversity contrast
sharply with the much smaller sets of counterparts that are present
in the sequenced genomes of
protostomes, Ciona, and vertebrates, an example of
independent expansion in the
sea urchin, whereas the GTPases described here have
expanded in sea urchin to
about the same numbers as in vertebrates. Thus, whereas the
duplications of the chordate
lineage were a contributor to the increased complexity of
vertebrates, regional
expansions clearly play a large role in the evolution of these animals.
The refinement of the
inventory of vertebrate-specific or protostome-specific genes likewise
benefits from the sea urchin genome.
Many more human genes have shared ancestry across
the deuterostomes, and in
fact, bilaterian genes are more broadly shared than had been
inferred from comparison of
the previously limited genome sequences. The new biological
niche sampled by the sea
urchin genome provides not only a clearer view of the
deuterostome and bilaterian
ancestor, but has also provided a number of surprises.
The
finding of sea urchin homologs
for sensory proteins related to vision and hearing in humans
may lead to interesting new
concepts of perception, and the extraordinary organization of
the sea urchin immune system
is different from any animal yet studied.
From a practical
standpoint, the sea urchin may
be a treasure trove. Because of the many pathways shared by
sea urchin and human, the sea
urchin genome includes a large number of human disease
gene orthologs. Many of the
genes described in the preceding sections fall into this category
(see tables S8 and S9) and
cover a surprising diversity of systems such as nervous,
endocrine, and blood systems,
as well as muscle and skeleton, as exemplified by the
Huntington and muscular
dystrophy genes.
Continued exploration of the sea urchin immune
system is expected to uncover
additional variations for protection against pathogens. The
immense diversity of
pathogen-binding motifs encoded in the sea urchin genome provides
an invaluable resource for
antimicrobial applications and the identification of new
deuterostome immune functions
with direct relevance to human health. These exciting
possibilities show that much
biodiversity is yet to be uncovered by sampling additional
evolutionary branches of the tree of life.
No comments:
Post a Comment