CuGenDBv2

PI0001776 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0001776
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Unknown protein
Genome location: chr10:15990092..15991156
RNA-Seq Expression: PI0001776
Synteny: PI0001776
Gene Ontology terms: NA
InterPro domains: NA
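The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the string can be split and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr10:15990092..15991156")
print(chrom, start, end, span_length(start, end))
# chr10 15990092 15991156 1065
```

Under a 0-based, half-open convention the span would be one base shorter, so the convention is worth confirming before comparing the result with sequence lengths.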


Homology
GenBank top hits (e value, % identity, alignment)
XP_004147694.1 uncharacterized protein LOC101221084 [Cucumis sativus] | e value: 4.5e-118 | % identity: 91.98
Query:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI
        ME P  K++NKKQKHHHQS A TPNP LSDYSFKPS AVKGLRFGGQFVVKSFTIRR WPLEFLQLLSLPPRYD    DD NKRPPFNSTAAFLPTNFTI
Subjt:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI

Query:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS
        LAHHAW+TLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIA GDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS
Subjt:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS

Query:  LNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH
        LNDFLDHTAMLALPNQRNISY GGGSSFTTAPV VVH
Subjt:  LNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH

XP_008461660.1 PREDICTED: uncharacterized protein LOC103500208 [Cucumis melo] | e value: 1.0e-125 | % identity: 96.65
Query:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDD--NKRPPFNSTAAFLPTNF
        ME PEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDD  NKR PFNST AF+PTNF
Subjt:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDD--NKRPPFNSTAAFLPTNF

Query:  TILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAV
        TILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAV
Subjt:  TILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAV

Query:  VSLNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH
        VSLNDFLDHTAMLALPNQRNI+Y GGGSSFTTAPVAVVH
Subjt:  VSLNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH

XP_022984700.1 uncharacterized protein LOC111482900 [Cucurbita maxima] | e value: 1.8e-106 | % identity: 80.97
Query:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI
        ME+ EKKNNNKKQKHHHQ G AT NP LSD+SFKPS AVKGLRFGGQFVVKSFTIRRAW +EFLQLLSLPP +       D+ R PFNST AFLPTNFTI
Subjt:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI

Query:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS
        LAHHAWHTLTLGLGTKKSK ILFVFATEALK AAGR+WP EI LG+ NRRLIRGLSGCEMARFK+RKGCLTFYIYAVREKGCFGFSAADDL TILQAV +
Subjt:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS

Query:  LNDFLDHTAMLALPNQRNISYGGGG----------SSFTTAPVAVVH
        LNDFLDHTAMLALPNQRNISYGGGG          ++FTT PVAVVH
Subjt:  LNDFLDHTAMLALPNQRNISYGGGG----------SSFTTAPVAVVH

XP_023549491.1 uncharacterized protein LOC111807888 [Cucurbita pepo subsp. pepo] | e value: 1.4e-106 | % identity: 82.38
Query:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI
        ME+ EKKNNNKKQKHHHQ GAAT NP LSD+SFKPS AVKGLRFGGQFVVKSFTIRRAW +EFLQLLSLPP +       D+ R PFNSTAAFLPTNFTI
Subjt:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI

Query:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS
        LAHHAWHTLTLGLGTKKSK ILFVFATEALK AAGR+WP EI LG+ NRRLIRGLSGCEMARFK+RKGCLTFYIYAVREKGCFGFSAADDL TILQAV +
Subjt:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS

Query:  LNDFLDHTAMLALPNQRNISYGGGGS-------SFTTAPVAVVH
        LNDFLDHTAMLALPNQRNISY  GGS       +FTT PVAVVH
Subjt:  LNDFLDHTAMLALPNQRNISYGGGGS-------SFTTAPVAVVH

XP_038892634.1 uncharacterized protein LOC120081660 [Benincasa hispida] | e value: 2.9e-117 | % identity: 90.46
Query:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYD---GGIHDD-DNKRPPFNSTAAFLPT
        MEN  KKNNNKKQKHHHQSG AT NP LSD+SFKPS AVKGLRFGGQF+VKSFTIRRAWPLEFLQLLSLPPRYD   GG+ DD +NKR PFNSTAAF+PT
Subjt:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYD---GGIHDD-DNKRPPFNSTAAFLPT

Query:  NFTILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQ
        NFTILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGR+WPAEI LGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQ
Subjt:  NFTILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQ

Query:  AVVSLNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH
        AVVSLNDFLDHTAM+ALPNQRNISYGGGG  FTT PVAVVH
Subjt:  AVVSLNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KQ89 Uncharacterized protein | e value: 2.2e-118 | % identity: 91.98
Query:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI
        ME P  K++NKKQKHHHQS A TPNP LSDYSFKPS AVKGLRFGGQFVVKSFTIRR WPLEFLQLLSLPPRYD    DD NKRPPFNSTAAFLPTNFTI
Subjt:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI

Query:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS
        LAHHAW+TLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIA GDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS
Subjt:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS

Query:  LNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH
        LNDFLDHTAMLALPNQRNISY GGGSSFTTAPV VVH
Subjt:  LNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH

A0A1S3CF88 uncharacterized protein LOC103500208 | e value: 4.9e-126 | % identity: 96.65
Query:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDD--NKRPPFNSTAAFLPTNF
        ME PEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDD  NKR PFNST AF+PTNF
Subjt:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDD--NKRPPFNSTAAFLPTNF

Query:  TILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAV
        TILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAV
Subjt:  TILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAV

Query:  VSLNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH
        VSLNDFLDHTAMLALPNQRNI+Y GGGSSFTTAPVAVVH
Subjt:  VSLNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH

A0A6J1FQS4 uncharacterized protein LOC111447604 | e value: 4.3e-106 | % identity: 81.74
Query:  MENPEKK--NNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDN--KRPPFNSTAAFLPT
        MEN EKK  NNNKKQKH HQSG AT NP+L+D+SFKPSTAVKGLRFG QFVVKSFTIRRAWPLEFLQLLSLPP +DG    D+N   R PFNSTA FLPT
Subjt:  MENPEKK--NNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDN--KRPPFNSTAAFLPT

Query:  NFTILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQ
        NFTILAHHAWH LTLGLGTKKSK ILFVF TEALKAA G  WPAE+ LGDVNRRLIRGLSGCEMARFKFRKGCLTFY+YAVRE+GCFGFSAADDLK ILQ
Subjt:  NFTILAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQ

Query:  AVVSLNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH
        AV++LNDFLD+TAMLALP+QR ISYGG GS F   PV VVH
Subjt:  AVVSLNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH

A0A6J1GZ93 uncharacterized protein LOC111458581 | e value: 1.9e-106 | % identity: 80.4
Query:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI
        ME+ EKKNNNKKQKHHHQ G AT NP LSD+SFKPS AVKGLRFGGQFVVKSFTIRRAW +EFLQLLSLPP +       D+ R PFNSTAAFLPTNFTI
Subjt:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI

Query:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS
        LAHHAWHTLTLGLGTKKSK ILFVFATEALK AAGR+WP EI LG+ NRRLIRGLSGCEMARFK+RKGCLTFYIYAVREKGCFGFSAADDL TILQAV +
Subjt:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS

Query:  LNDFLDHTAMLALPNQRNISYGGGGS-------------SFTTAPVAVVH
        LNDFLDHTAMLALPNQRNISY GGGS             +FTT PVAVVH
Subjt:  LNDFLDHTAMLALPNQRNISYGGGGS-------------SFTTAPVAVVH

A0A6J1J9B5 uncharacterized protein LOC111482900 | e value: 8.6e-107 | % identity: 80.97
Query:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI
        ME+ EKKNNNKKQKHHHQ G AT NP LSD+SFKPS AVKGLRFGGQFVVKSFTIRRAW +EFLQLLSLPP +       D+ R PFNST AFLPTNFTI
Subjt:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI

Query:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS
        LAHHAWHTLTLGLGTKKSK ILFVFATEALK AAGR+WP EI LG+ NRRLIRGLSGCEMARFK+RKGCLTFYIYAVREKGCFGFSAADDL TILQAV +
Subjt:  LAHHAWHTLTLGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVS

Query:  LNDFLDHTAMLALPNQRNISYGGGG----------SSFTTAPVAVVH
        LNDFLDHTAMLALPNQRNISYGGGG          ++FTT PVAVVH
Subjt:  LNDFLDHTAMLALPNQRNISYGGGG----------SSFTTAPVAVVH

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G55420.1 unknown protein | e value: 4.4e-71 | % identity: 58.92
Query:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI
        M   EKK   KK+K  HQS         SD SFKPS+ VKGL+FGGQ +VKSFTIRRA   E L+LLSLP           +  PP  STAA+LPTNFTI
Subjt:  MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTI

Query:  LAHHAWHTLTLGLGTKKSKAILFVFATEALK----AAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQ
        LAHHAWHTLTLGLGT+KSK ++FVF TEA+K    AA G +WP+EI LGDVN+++IR L   EMARFKFRKGC+TFY+YAVR  G  GF+AA+DLK ILQ
Subjt:  LAHHAWHTLTLGLGTKKSKAILFVFATEALK----AAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQ

Query:  AVVSLNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH
        AVV+L DF+DHTAML +P+Q++I+Y       +  P A+ H
Subjt:  AVVSLNDFLDHTAMLALPNQRNISYGGGGSSFTTAPVAVVH
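
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, a percent identity can be recomputed from a query line and its middle line as below. The helper `identity_from_midline` is hypothetical, and it assumes the leading `Query:  ` label and the equally wide indent of the middle line have already been stripped. The database may use a different denominator, so the result need not match the listed % identity exactly.

```python
def identity_from_midline(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline repeats the query residue."""
    if not query:
        raise ValueError("empty alignment")
    # Scraped text often loses trailing spaces, so pad the midline back
    # to the width of the query line before comparing column by column.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# Toy example: 4 of 6 columns carry an identical residue.
print(round(identity_from_midline("MENPEK", "ME P K"), 2))
```

For a multi-block alignment, the query lines and the middle lines would each be concatenated across blocks before calling the function.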


Sequences
CDS sequence
ATGGAAAATCCGGAGAAGAAAAACAATAACAAAAAGCAAAAACACCACCACCAATCCGGCGCCGCCACTCCCAATCCATCCCTTTCCGATTACTCCTTCAAACCTTCCAC
AGCCGTCAAAGGACTCCGATTCGGCGGCCAATTCGTTGTCAAATCCTTCACAATCCGCCGAGCTTGGCCTCTCGAATTTCTCCAACTCCTTTCTCTCCCACCGCGCTACG
ACGGCGGTATCCACGACGACGACAACAAACGACCTCCATTCAATTCCACGGCGGCGTTTCTTCCGACGAATTTCACGATTTTAGCGCATCACGCTTGGCACACATTAACG
CTCGGCCTCGGCACAAAGAAATCAAAAGCGATTCTGTTCGTGTTCGCGACGGAAGCTCTGAAGGCCGCCGCTGGCCGTGTCTGGCCGGCGGAAATTGCACTCGGCGATGT
GAATCGACGGCTGATTCGGGGATTAAGCGGCTGCGAAATGGCTAGGTTTAAATTTAGAAAAGGATGTTTAACTTTTTACATCTACGCGGTTCGTGAAAAAGGCTGCTTCG
GATTCTCCGCCGCCGATGATCTGAAAACGATTTTGCAGGCGGTTGTTTCGCTCAATGATTTTTTGGATCATACTGCCATGCTCGCCTTGCCTAATCAGAGAAATATTAGT
TACGGCGGCGGCGGAAGCAGCTTTACGACGGCGCCGGTCGCCGTTGTTCATTGA
mRNA sequence
GTTTTGTGTTGTTTTGTTTTGACCCCAAAATAATCAAAACCCAAATCCCAAATACCAAATCCCACCGACAAATTCAAATCAAAAAACCAAACAAAATGGAAAATCCGGAG
AAGAAAAACAATAACAAAAAGCAAAAACACCACCACCAATCCGGCGCCGCCACTCCCAATCCATCCCTTTCCGATTACTCCTTCAAACCTTCCACAGCCGTCAAAGGACT
CCGATTCGGCGGCCAATTCGTTGTCAAATCCTTCACAATCCGCCGAGCTTGGCCTCTCGAATTTCTCCAACTCCTTTCTCTCCCACCGCGCTACGACGGCGGTATCCACG
ACGACGACAACAAACGACCTCCATTCAATTCCACGGCGGCGTTTCTTCCGACGAATTTCACGATTTTAGCGCATCACGCTTGGCACACATTAACGCTCGGCCTCGGCACA
AAGAAATCAAAAGCGATTCTGTTCGTGTTCGCGACGGAAGCTCTGAAGGCCGCCGCTGGCCGTGTCTGGCCGGCGGAAATTGCACTCGGCGATGTGAATCGACGGCTGAT
TCGGGGATTAAGCGGCTGCGAAATGGCTAGGTTTAAATTTAGAAAAGGATGTTTAACTTTTTACATCTACGCGGTTCGTGAAAAAGGCTGCTTCGGATTCTCCGCCGCCG
ATGATCTGAAAACGATTTTGCAGGCGGTTGTTTCGCTCAATGATTTTTTGGATCATACTGCCATGCTCGCCTTGCCTAATCAGAGAAATATTAGTTACGGCGGCGGCGGA
AGCAGCTTTACGACGGCGCCGGTCGCCGTTGTTCATTGAATTTTTAAACCTCTGTTTCTTCTCTTTAAAAATGATATCTGTTCTCATCATCATCATCGTCTTCTTCTTCA
TCTTCTTTTTTTGGAAAAAAGAAATTTTGTTGTTTGACCTTTGATTGATTGTTGCAACTTTTTTCTTTTCTCCTTCCGTCTGAATCACTAGATTGAATTACTAAATCTTA
TATCAGAAAATTGATCCGTTTAACATTTTCTAGTTGTAATTTTCATATGTAATTAGATCATCTAATAAAGGGGAG
Protein sequence
MENPEKKNNNKKQKHHHQSGAATPNPSLSDYSFKPSTAVKGLRFGGQFVVKSFTIRRAWPLEFLQLLSLPPRYDGGIHDDDNKRPPFNSTAAFLPTNFTILAHHAWHTLT
LGLGTKKSKAILFVFATEALKAAAGRVWPAEIALGDVNRRLIRGLSGCEMARFKFRKGCLTFYIYAVREKGCFGFSAADDLKTILQAVVSLNDFLDHTAMLALPNQRNIS
YGGGGSSFTTAPVAVVH
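
The CDS and protein sequences listed above should be related by translation. As a minimal sketch, assuming the standard genetic code applies, the function below translates a CDS and stops at the first stop codon. The name `translate` and the table construction are illustrative choices and are not taken from the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases (e.g. N) raise KeyError here.
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS sequence listed above.
print(translate("ATGGAAAATCCGGAGAAGAAAAACAATAAC"))  # MENPEKKNNN
```

Passing the full CDS with its line breaks and comparing the result against the listed protein sequence gives a quick consistency check on the record.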