; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh20G008460 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh20G008460
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationCma_Chr20:4053476..4055838
RNA-Seq ExpressionCmaCh20G008460
SyntenyCmaCh20G008460
Gene Ontology termsNA
InterPro domainsIPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010929.1 hypothetical protein SDJN02_27727 [Cucurbita argyrosperma subsp. argyrosperma]4.7e-8898.22Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWA+IHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        EV+MVRTVARIVQGDAG+IICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

XP_022943557.1 uncharacterized protein LOC111448293 [Cucurbita moschata]4.7e-8898.22Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWA+IHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        EV+MVRTVARIVQGDAG+IICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

XP_022986619.1 uncharacterized protein LOC111484307 [Cucurbita maxima]5.6e-89100Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

XP_023512225.1 uncharacterized protein LOC111777016 [Cucurbita pepo subsp. pepo]1.4e-8797.63Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERET VRRRGRDILVAVDHGPNSKHAFDWA+IHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        EV+MVRTVARIVQGDAG+IICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

XP_038901616.1 uncharacterized protein LOC120088411 [Benincasa hispida]1.2e-8695.86Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERETA RRRGRDIL+AVDHGPNSKHAFDWA+IHFCRLADTIHLVHAVSNVKNELVYE+SQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        EV+MVRTVARIVQGDAG++ICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

TrEMBL top hitse value%identityAlignment
A0A0A0LL61 Usp domain-containing protein7.4e-8795.27Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERETA RRRGRDIL+AVDHGPNSKHAFDWA+IHFCRLADTIHLVHAVSNVKNELVYE+SQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        EV+MVRTVARIVQGDAG++ICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPV+IVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

A0A1S3CET4 uncharacterized protein LOC1034996767.4e-8795.27Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERETA RRRGRDIL+AVDHGPNSKHAFDWA+IHFCRLADTIHLVHAVSNVKNELVYE+SQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        EV+MVRTVARIVQGDAG++ICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPV+IVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

A0A5D3DYQ0 Universal stress protein PHOS327.4e-8795.27Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERETA RRRGRDIL+AVDHGPNSKHAFDWA+IHFCRLADTIHLVHAVSNVKNELVYE+SQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        EV+MVRTVARIVQGDAG++ICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPV+IVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

A0A6J1FTC2 uncharacterized protein LOC1114482932.3e-8898.22Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWA+IHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        EV+MVRTVARIVQGDAG+IICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

A0A6J1JGJ7 uncharacterized protein LOC1114843072.7e-89100Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

SwissProt top hitse value%identityAlignment
P87132 Uncharacterized protein C167.058.9e-0524.83Show/hide
Query:  RRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAV--------------SNVKNELVYEYSQGLMEKLAVEAFEVSMVRTVARIVQGDAGRIICK
        +R     + +D    S HA +WA+    R  DT+ +V  +               + + E + + ++ +++ L+    EV +   +  I    A  +I +
Subjt:  RRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAV--------------SNVKNELVYEYSQGLMEKLAVEAFEVSMVRTVARIVQGDAGRIICK

Query:  EAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKVR
          + ++P+ VVMG+RGRS ++ VL GS S ++  N  S PV++   K++
Subjt:  EAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGKVR

Arabidopsis top hitse value%identityAlignment
AT2G21620.1 Adenine nucleotide alpha hydrolases-like superfamily protein4.0e-7778.7Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF
        M+ L E+EEY++REV LPSLIPVVPEPELERE+  RRRGRD++VAVDHGPNSKHAFDWA++HFCRLADT+HLVHAVS+VKN++VYE SQ LMEKLAVEA+
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAF

Query:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        +V+MV++VAR+V+GDAG++ICKEAEK+KPAAV++GTRGRSL++SVLQGSVSE+ FHNCKSAPVIIVPGK
Subjt:  EVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

AT2G21620.2 Adenine nucleotide alpha hydrolases-like superfamily protein3.8e-7576Show/hide
Query:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSN------VKNELVYEYSQGLMEK
        M+ L E+EEY++REV LPSLIPVVPEPELERE+  RRRGRD++VAVDHGPNSKHAFDWA++HFCRLADT+HLVHAVS+      VKN++VYE SQ LMEK
Subjt:  MDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSN------VKNELVYEYSQGLMEK

Query:  LAVEAFEVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
        LAVEA++V+MV++VAR+V+GDAG++ICKEAEK+KPAAV++GTRGRSL++SVLQGSVSE+ FHNCKSAPVIIVPGK
Subjt:  LAVEAFEVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.6e-0724.86Show/hide
Query:  SLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNEL--------------------VY-------------
        S +   PE   E E A     + ++VA+D   +S +A  W I HF  L  T     A S +   +                    VY             
Subjt:  SLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNEL--------------------VY-------------

Query:  -EYSQGLMEKLAVEAFEVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
         E S  L+ + A++      +RT   +++G+A  +IC+  EK+    +V+G+RG   I+    GSVS++  H+     +I+ P K
Subjt:  -EYSQGLMEKLAVEAFEVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein3.3e-0724.73Show/hide
Query:  SLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNEL---------------------VY------------
        S +   PE   E E A     + ++VA+D   +S +A  W I HF  L  T     A S +   +                     VY            
Subjt:  SLIPVVPEPELERETAVRRRGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAVSNVKNEL---------------------VY------------

Query:  --EYSQGLMEKLAVEAFEVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK
          E S  L+ + A++      +RT   +++G+A  +IC+  EK+    +V+G+RG   I+    GSVS++  H+     +I+ P K
Subjt:  --EYSQGLMEKLAVEAFEVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIVPGK

AT3G53990.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.3e-0827.74Show/hide
Query:  RGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAV----SNVKNELVYEYSQGL-----------MEKLAVEAFEVSM-----------VRTVARI
        + R+I +A+D   +SK+A  WAI +     DTI+++H +       +N L ++    L           MEK  V+     +           V  V ++
Subjt:  RGRDILVAVDHGPNSKHAFDWAIIHFCRLADTIHLVHAV----SNVKNELVYEYSQGL-----------MEKLAVEAFEVSM-----------VRTVARI

Query:  VQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIV
          GDA   +    + LK  ++VMG+RG S +Q ++ GSVS  V  +    PV +V
Subjt:  VQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAPVIIV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGCCTTCTAGAAGCGATCAAAGCGGCTGCAAAAAGCAATTCAATCCAAAAGATCCATTTCGCCACCAACTTTTCTGATTCCGGCCACCATTTTAAATCCCCACC
CACCACCGATTTCCCATCTCTCCGATTCGATTTTCAAGTTTTTGATCGTCTTGTTTTCTCTCCGATGGATACATTAGAGGAGGAAGAAGAATACAACTGGAGAGAAGTCC
GGCTTCCGTCGCTGATCCCGGTAGTGCCGGAGCCAGAGCTAGAGAGAGAGACGGCGGTGAGACGTCGTGGCCGAGACATTCTCGTTGCCGTCGATCATGGTCCGAACAGC
AAACACGCTTTCGATTGGGCTATAATCCATTTCTGCCGCCTTGCCGACACCATCCATCTCGTCCACGCCGTTTCCAATGTGAAGAATGAATTGGTTTATGAGTATAGTCA
GGGGCTGATGGAGAAGCTTGCGGTGGAGGCCTTTGAGGTTTCCATGGTGAGGACTGTGGCAAGGATTGTGCAGGGAGATGCTGGGAGGATTATTTGCAAGGAAGCAGAGA
AGTTGAAGCCTGCTGCTGTTGTTATGGGCACTAGAGGAAGAAGCTTGATTCAAAGTGTTCTGCAGGGAAGTGTGAGTGAGCATGTCTTCCACAACTGCAAATCAGCACCT
GTTATTATAGTTCCTGGAAAAGTCAGACTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGGCCTTCTAGAAGCGATCAAAGCGGCTGCAAAAAGCAATTCAATCCAAAAGATCCATTTCGCCACCAACTTTTCTGATTCCGGCCACCATTTTAAATCCCCACC
CACCACCGATTTCCCATCTCTCCGATTCGATTTTCAAGTTTTTGATCGTCTTGTTTTCTCTCCGATGGATACATTAGAGGAGGAAGAAGAATACAACTGGAGAGAAGTCC
GGCTTCCGTCGCTGATCCCGGTAGTGCCGGAGCCAGAGCTAGAGAGAGAGACGGCGGTGAGACGTCGTGGCCGAGACATTCTCGTTGCCGTCGATCATGGTCCGAACAGC
AAACACGCTTTCGATTGGGCTATAATCCATTTCTGCCGCCTTGCCGACACCATCCATCTCGTCCACGCCGTTTCCAATGTGAAGAATGAATTGGTTTATGAGTATAGTCA
GGGGCTGATGGAGAAGCTTGCGGTGGAGGCCTTTGAGGTTTCCATGGTGAGGACTGTGGCAAGGATTGTGCAGGGAGATGCTGGGAGGATTATTTGCAAGGAAGCAGAGA
AGTTGAAGCCTGCTGCTGTTGTTATGGGCACTAGAGGAAGAAGCTTGATTCAAAGTGTTCTGCAGGGAAGTGTGAGTGAGCATGTCTTCCACAACTGCAAATCAGCACCT
GTTATTATAGTTCCTGGAAAAGTCAGACTTTGA
Protein sequenceShow/hide protein sequence
MGGLLEAIKAAAKSNSIQKIHFATNFSDSGHHFKSPPTTDFPSLRFDFQVFDRLVFSPMDTLEEEEEYNWREVRLPSLIPVVPEPELERETAVRRRGRDILVAVDHGPNS
KHAFDWAIIHFCRLADTIHLVHAVSNVKNELVYEYSQGLMEKLAVEAFEVSMVRTVARIVQGDAGRIICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEHVFHNCKSAP
VIIVPGKVRL