CuGenDBv2

Clc11G04750 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G04750
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Adenine nucleotide alpha hydrolases-like superfamily protein
Genome location: ClcChr11:4517484..4519660
RNA-Seq Expression: Clc11G04750
Synteny: Clc11G04750
Gene Ontology terms: NA
InterPro domains: IPR006015 - Universal stress protein A family; IPR006016 - UspA
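
The genome location field uses a `seqid:start..end` layout. As a minimal sketch of how that string could be read programmatically, the snippet below splits it into its parts and reports the span length. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a location string such as 'ClcChr11:4517484..4519660'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("ClcChr11:4517484..4519660")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2177 bp on ClcChr11.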


Homology
GenBank top hits | e value | % identity | Alignment
XP_004148842.1 uncharacterized protein LOC101210790 isoform X1 [Cucumis sativus] | 4.2e-92 | 97.74
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET ERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHL+HAVSNVKNELVYEFSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        EVAMVRTVARIVQGDAGKVICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPV+IVPGKEAGETSVI
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

XP_022943557.1 uncharacterized protein LOC111448293 [Cucurbita moschata] | 3.0e-90 | 95.48
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET  RRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVSNVKNELVYE+SQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        EVAMVRTVARIVQGDAGK+ICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGE SVI
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

XP_023512225.1 uncharacterized protein LOC111777016 [Cucurbita pepo subsp. pepo] | 3.9e-90 | 95.48
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET  RRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVSNVKNELVYE+SQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        EVAMVRTVARIVQGDAGK+ICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGE SVI
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

XP_031737219.1 uncharacterized protein LOC101210790 isoform X2 [Cucumis sativus] | 3.9e-90 | 97.18
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET ERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHL+HAVSNVKNELVYEFSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        EVAMVRTVARIVQGDAGKVICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPV+IVPGK AGETSVI
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

XP_038901616.1 uncharacterized protein LOC120088411 [Benincasa hispida] | 1.2e-91 | 97.74
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET ERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHL+HAVSNVKNELVYEFSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        EVAMVRTVARIVQGDAGKVICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGE SVI
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LL61 Usp domain-containing protein | 2.0e-92 | 97.74
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET ERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHL+HAVSNVKNELVYEFSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        EVAMVRTVARIVQGDAGKVICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPV+IVPGKEAGETSVI
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

A0A1S3CET4 uncharacterized protein LOC103499676 | 2.0e-92 | 97.74
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET ERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHL+HAVSNVKNELVYEFSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        EVAMVRTVARIVQGDAGKVICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPV+IVPGKEAGETSVI
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

A0A5D3DYQ0 Universal stress protein PHOS32 | 2.0e-92 | 97.74
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET ERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHL+HAVSNVKNELVYEFSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        EVAMVRTVARIVQGDAGKVICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPV+IVPGKEAGETSVI
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

A0A6J1FTC2 uncharacterized protein LOC111448293 | 1.4e-90 | 95.48
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET  RRRGRDIL+AVDHGPNSKHAFDWALIHFCRLADTIHL+HAVSNVKNELVYE+SQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        EVAMVRTVARIVQGDAGK+ICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGE SVI
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

A0A6J1JGJ7 uncharacterized protein LOC111484307 | 1.2e-89 | 93.79
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET  RRRGRDIL+AVDHGPNSKHAFDWA+IHFCRLADTIHL+HAVSNVKNELVYE+SQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        EV+MVRTVARIVQGDAG++ICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGE SVI
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

SwissProt top hits | e value | % identity | Alignment
Q8L4N1 Universal stress protein PHOS34 | 7.6e-04 | 24.86
Query:  PELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIH------------------------------AVSNVKNELVYEFSQGLMEKLA
        P     TP     R I +AVD    S  A  WA+ H+ R  D + ++H                              A      E    F+   +  LA
Subjt:  PELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIH------------------------------AVSNVKNELVYEFSQGLMEKLA

Query:  VEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQ---GSVSEHVFHNCKSAPVIIV
            E      +  +   D  + +C E E+L  +AVIMG+RG    +       GSVS++  H+C   PV++V
Subjt:  VEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQ---GSVSEHVFHNCKSAPVIIV

Q8VYN9 Universal stress protein PHOS32 | 4.4e-04 | 26.38
Query:  TPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSN------------VKNELVYEFSQGLMEKLAVEAFEVAMVRTVAR----------
        TP     R I +AVD    S  A  WA+ H+ R  D + L+H                +K ++    +Q    +   +AF    V  +A+          
Subjt:  TPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSN------------VKNELVYEFSQGLMEKLAVEAFEVAMVRTVAR----------

Query:  ---IVQGDAGKVICKEAEKLKPAAVIMGTRG----RSLIQSVLQGSVSEHVFHNCKSAPVIIV
           +   D  + +C E E+L  +AVIMG+RG    +        GSVS++  H+C   PV++V
Subjt:  ---IVQGDAGKVICKEAEKLKPAAVIMGTRG----RSLIQSVLQGSVSEHVFHNCKSAPVIIV

Arabidopsis top hits | e value | % identity | Alignment
AT2G21620.1 Adenine nucleotide alpha hydrolases-like superfamily protein | 2.1e-81 | 79.66
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF
        M+ L E+EEY++REV LPSLIPVVPEPELERE+ ERRRGRD+++AVDHGPNSKHAFDWAL+HFCRLADT+HL+HAVS+VKN++VYE SQ LMEKLAVEA+
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAF

Query:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        +VAMV++VAR+V+GDAGKVICKEAEK+KPAAVI+GTRGRSL++SVLQGSVSE+ FHNCKSAPVIIVPGKEAG+ S++
Subjt:  EVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

AT2G21620.2 Adenine nucleotide alpha hydrolases-like superfamily protein | 1.9e-79 | 77.05
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSN------VKNELVYEFSQGLMEK
        M+ L E+EEY++REV LPSLIPVVPEPELERE+ ERRRGRD+++AVDHGPNSKHAFDWAL+HFCRLADT+HL+HAVS+      VKN++VYE SQ LMEK
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSN------VKNELVYEFSQGLMEK

Query:  LAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
        LAVEA++VAMV++VAR+V+GDAGKVICKEAEK+KPAAVI+GTRGRSL++SVLQGSVSE+ FHNCKSAPVIIVPGKEAG+ S++
Subjt:  LAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein | 1.4e-08 | 23.66
Query:  SLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADT----------IHLIHAVSNVKNELVY-----------------------
        S +   PE   E E P     R +++A+D   +S +A  W + HF  L  T          + +IH  S   +   +                       
Subjt:  SLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADT----------IHLIHAVSNVKNELVY-----------------------

Query:  -EFSQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKE
         E S  L+ + A++      +RT   +++G+A ++IC+  EK+    +++G+RG   I+    GSVS++  H+     +I+ P KE
Subjt:  -EFSQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKE

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein | 1.8e-08 | 23.53
Query:  SLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADT----------IHLIHAVSNVKNELVY-----------------------
        S +   PE   E E P     R +++A+D   +S +A  W + HF  L  T          + +IH  S   +   +                       
Subjt:  SLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADT----------IHLIHAVSNVKNELVY-----------------------

Query:  --EFSQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKE
          E S  L+ + A++      +RT   +++G+A ++IC+  EK+    +++G+RG   I+    GSVS++  H+     +I+ P KE
Subjt:  --EFSQGLMEKLAVEAFEVAMVRTVARIVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKE

AT3G53990.1 Adenine nucleotide alpha hydrolases-like superfamily protein | 4.7e-09 | 28.3
Query:  RGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAV----SNVKNELVY----------EFSQ-GLMEKLAVEAFEVAM-----------VRTVARI
        + R+I IA+D   +SK+A  WA+ +     DTI++IH +       +N L +          EF +  +MEK  V+     +           V  V ++
Subjt:  RGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAV----SNVKNELVY----------EFSQ-GLMEKLAVEAFEVAM-----------VRTVARI

Query:  VQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKE
          GDA + +    + LK  +++MG+RG S +Q ++ GSVS  V  +    PV +V   E
Subjt:  VQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKE
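
The % identity values listed with the hits can be related to the alignment midlines (the unlabeled line between Query and Subjt). The sketch below assumes the usual BLAST-style convention: a letter in the midline marks an identical column, `+` marks a conservative substitution, a space marks a mismatch or gap, and the denominator is the full alignment length. The function name `percent_identity` is hypothetical. The midline must be cut at the same column offset as the Query line and must not be whitespace-stripped, because leading or trailing spaces are real alignment columns.

```python
def percent_identity(midline: str) -> float:
    """Return the percentage of alignment columns marked identical.

    Assumes a BLAST-style midline: letters are identities, '+' and ' '
    are non-identical columns (conservative substitution, mismatch or gap).
    """
    if not midline:
        raise ValueError("midline is empty")
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / len(midline)


# Midline of the first GenBank hit (XP_004148842.1), both blocks joined.
first_hit_midline = (
    "MDTLEEEEEYNWREVRLPSLIPVVPEPELERET ERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHL+HAVSNVKNELVYEFSQGLMEKLAVEAF"
    "EVAMVRTVARIVQGDAGKVICKEAEKLKPAAV+MGTRGRSLIQSVLQGSVSEHVFHNCKSAPV+IVPGKEAGETSVI"
)
print(f"{percent_identity(first_hit_midline):.2f}")
```

For that first hit the midline has four non-identical columns across the two blocks, which is consistent with the 97.74 listed for it.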


Sequences
CDS sequence
ATGGATACATTGGAGGAGGAAGAAGAATACAACTGGAGAGAAGTCCGGCTTCCGTCGCTGATCCCAGTGGTGCCGGAGCCAGAGCTTGAGAGAGAGACGCCGGAGAGACG
CCGTGGGCGAGACATCCTAATCGCCGTCGACCATGGCCCCAACAGCAAACACGCTTTCGATTGGGCTCTAATCCATTTCTGCCGCCTCGCTGACACCATCCATCTAATCC
ACGCCGTTTCCAATGTGAAGAATGAATTAGTTTATGAGTTCAGCCAAGGGCTGATGGAGAAGCTTGCAGTGGAGGCCTTTGAGGTGGCCATGGTTAGGACTGTGGCAAGG
ATTGTGCAGGGAGATGCAGGGAAGGTTATTTGTAAGGAAGCAGAGAAGTTGAAGCCTGCTGCTGTTATTATGGGCACCAGAGGAAGAAGCTTGATTCAAAGTGTCTTGCA
AGGAAGTGTGAGTGAGCATGTCTTTCACAACTGCAAATCAGCACCTGTTATTATAGTTCCTGGGAAAGAAGCTGGAGAAACATCTGTGATTTGA
mRNA sequence
AAAAAAGTAAGAGAATAAAAAGTATTTTTCTAAAAATAATTTGTTAGAAACAGCCAAACTTCAACAAACAATGAGGGACGAAATTTCTTCGGCTTCGGCGGCTGCAAAAA
GCAAAACAATCGAAAACACCCAACTCTGATTCCGGCCACCATTTTAAAGCCCCAATCACCGCCCCATTTCCCGTCTCTCCGATTCGATTCGATTCCTCAATCCAAACCGC
CCCTAAAACCCCAAATCACATCGCTAGATTTTGATCTTCTTGTTTTCTTTTTCGATGGATACATTGGAGGAGGAAGAAGAATACAACTGGAGAGAAGTCCGGCTTCCGTC
GCTGATCCCAGTGGTGCCGGAGCCAGAGCTTGAGAGAGAGACGCCGGAGAGACGCCGTGGGCGAGACATCCTAATCGCCGTCGACCATGGCCCCAACAGCAAACACGCTT
TCGATTGGGCTCTAATCCATTTCTGCCGCCTCGCTGACACCATCCATCTAATCCACGCCGTTTCCAATGTGAAGAATGAATTAGTTTATGAGTTCAGCCAAGGGCTGATG
GAGAAGCTTGCAGTGGAGGCCTTTGAGGTGGCCATGGTTAGGACTGTGGCAAGGATTGTGCAGGGAGATGCAGGGAAGGTTATTTGTAAGGAAGCAGAGAAGTTGAAGCC
TGCTGCTGTTATTATGGGCACCAGAGGAAGAAGCTTGATTCAAAGTGTCTTGCAAGGAAGTGTGAGTGAGCATGTCTTTCACAACTGCAAATCAGCACCTGTTATTATAG
TTCCTGGGAAAGAAGCTGGAGAAACATCTGTGATTTGAATGCCAACTGCTTGCTTGAAGCCTGAAATGAAACTACAATGTTTCTTCCTCAATTTGTGTTTTCTTTTTTTG
TTTTAGTATTAAAGGTTTAGGGTATTGTATTTAAGTTGTAAGAAGAATTTGATATTGTATTTATATATTTATATATATGTAATGTATAGTACTTGATTTCCAGCATTAAA
ATTTACGCCCTACATATTTTATGTTTTTAGTGTTTTTGAATGTGTGATTAGTAGCCAATTTAGA
Protein sequence
MDTLEEEEEYNWREVRLPSLIPVVPEPELERETPERRRGRDILIAVDHGPNSKHAFDWALIHFCRLADTIHLIHAVSNVKNELVYEFSQGLMEKLAVEAFEVAMVRTVAR
IVQGDAGKVICKEAEKLKPAAVIMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKEAGETSVI
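
The CDS and protein sequences above are related by translation. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the snippet below translates a coding sequence codon by codon. The names `CODON_TABLE` and `translate` are hypothetical, and the two fragments used are the first 30 and last 12 nucleotides of the CDS listed above.

```python
# Standard genetic code (NCBI table 1), amino acids listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; stop codons are rendered as '*'.

    Whitespace and line breaks are ignored. Raises ValueError if the length
    is not a multiple of three, and KeyError for non-ACGT characters.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGATACATTGGAGGAGGAAGAAGAATAC"))
print(translate("TCTGTGATTTGA"))
```

Pasting the full CDS into `translate` and dropping the trailing `*` gives a string that can be compared directly with the listed protein sequence.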