; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh05G008280 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh05G008280
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
Genome locationCmo_Chr05:5047884..5050449
RNA-Seq ExpressionCmoCh05G008280
SyntenyCmoCh05G008280
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138889.1 uncharacterized protein LOC101217347 [Cucumis sativus]5.6e-8488.52Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGLS SSSTSIS +HYNIH LFLLCNY+LLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGA+SGCAMASA+TGRW+GVHMVFTVLTAIFQGS+TVL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
        V+TRTA+FLGELKSYVREE G+VILKLGGGLSGLIFCLEW+VLVLAF LKYY+YVEGNG+GE LKRS KVQQFEDS WA PFP
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP

XP_022933163.1 uncharacterized protein LOC111440012 [Cucurbita moschata]4.2e-95100Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
        VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP

XP_022996789.1 uncharacterized protein LOC111491920 [Cucurbita maxima]6.6e-9398.91Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
        VYTRTANFLGELKSYVREESGTVILKL GGLSGLIFCLEWIVLVLAF LKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP

XP_023520915.1 uncharacterized protein LOC111784439 [Cucurbita pepo subsp. pepo]5.3e-9096.2Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAA+SCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVL AIFQGSLTVL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLK-RSGKVQQFEDSKWAPPFP
        VYTRTANFLGELKSYVREESGTVILKL GGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGS EDLK  +GKVQQFEDSKWAPPFP
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLK-RSGKVQQFEDSKWAPPFP

XP_038890643.1 uncharacterized protein LOC120080148 [Benincasa hispida]1.9e-8489.62Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGLS SSSTSISR+HYNIH LFLLCNY+LLGAASSCIFLTLSLRL+PSVCGFFIIFLHAFTIAGA+SGCAMASAATGRW+GVHMVFTVLTAIFQGS+TVL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
        V+TRT++FLGELKSYVREE G+VILKLGGGLSGLIFCLEWIVLVLAF LKYY+YVEGNGS E LKRSGKVQQFEDS WA PFP
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP

TrEMBL top hitse value%identityAlignment
A0A0A0LHB3 Uncharacterized protein2.7e-8488.52Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGLS SSSTSIS +HYNIH LFLLCNY+LLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGA+SGCAMASA+TGRW+GVHMVFTVLTAIFQGS+TVL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
        V+TRTA+FLGELKSYVREE G+VILKLGGGLSGLIFCLEW+VLVLAF LKYY+YVEGNG+GE LKRS KVQQFEDS WA PFP
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP

A0A5A7UCF0 Uncharacterized protein4.6e-8487.43Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGLS SSSTS+SR+HYNIH LFLLCNY+LLGAASSCIFLTLSLRL+PSVCGFFIIFLHAFTIAGA+SGCAMASA+TGRW+GVHMVFTVLTAIFQGS+TVL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
        V+TRT +FLGELKSYVREE G+VILKLGGGLSGLIFCLEW+VLVLAF LKYY+YVEGNG+GE LKRS KVQQFEDS WA PFP
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP

A0A5D3DJ66 Uncharacterized protein1.1e-8286.96Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGLS SSSTS+SR+HYNIH LFLLCNY+LLGAASSCIFLTLSLRL+PSVCGFFIIFLHAFTIAGA+SGCAMASA+TGRW+GVHMVFTVLTAIFQGS+TVL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGN-GSGEDLKRSGKVQQFEDSKWAPPFP
        V+TRT +FLGELKSYVREE G+VILKLGGGLSGLIFCLEW+VLVLAF LKYY+YVEGN G+GE LKRS KVQQFEDS WA PFP
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGN-GSGEDLKRSGKVQQFEDSKWAPPFP

A0A6J1F3Y9 uncharacterized protein LOC1114400122.0e-95100Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
        VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP

A0A6J1K9N5 uncharacterized protein LOC1114919203.2e-9398.91Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
        VYTRTANFLGELKSYVREESGTVILKL GGLSGLIFCLEWIVLVLAF LKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G02640.1 unknown protein4.3e-5863.89Show/hide
Query:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL
        MGL      SI  +HY  H LFL  NYVLLGA+SSCIFLTLSLRL+PS+CGFF+I LHA TIA A+SGCA AS    RWY  HM+ TVLTAIFQGS++VL
Subjt:  MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVL

Query:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAP
        ++T T+NFL  L SYVRE+  ++ILKL GGL  +IFCLEWIVLVLAF+LKYY YV+G+ +G  +KR+GKVQ  E  K +P
Subjt:  VYTRTANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAP

AT5G16250.1 unknown protein5.1e-5965.73Show/hide
Query:  SSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVLVYTRT
        SSS+ +  +HY+ H +FL  NY+LLGAASSCIFLTLSLRL+PS+CGF +I LHA TIA A+SGCA AS    RWY  HMV TVLTAIFQGS++VL++T T
Subjt:  SSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVLVYTRT

Query:  ANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSK-WAPPF
        + FLG LKSYVREE   VILKLGGGL  +IFCL+WIVLV AF+LKYY YV+G G G  +KR+GKVQ  E+ K W  PF
Subjt:  ANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSK-WAPPF

AT5G36710.1 unknown protein4.2e-4558.1Show/hide
Query:  ISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAM-----ASAATGRWYGVHMVFTVLTAIFQGSLTVLVYTRT
        +S++  N HN+FLLCNY+LLG+ASSCIFLT+SLRL PS+ G  +IFL+  TIA A+SGC++     ++ A+ R YG HMV TVLTAIFQG+++VL++TRT
Subjt:  ISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAM-----ASAATGRWYGVHMVFTVLTAIFQGSLTVLVYTRT

Query:  ANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKV-QQFEDSKWAPPFP
         +FL  LKSYVREE G VILKL GGL  L+FCLEWIVLVLAF LKY  Y++ +   +D     KV +Q ED K  P +P
Subjt:  ANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKV-QQFEDSKWAPPFP

AT5G36800.1 unknown protein4.2e-4558.1Show/hide
Query:  ISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAM-----ASAATGRWYGVHMVFTVLTAIFQGSLTVLVYTRT
        +S++  N HN+FLLCNY+LLG+ASSCIFLT+SLRL PS+ G  +IFL+  TIA A+SGC++     ++ A+ R YG HMV TVLTAIFQG+++VL++TRT
Subjt:  ISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAM-----ASAATGRWYGVHMVFTVLTAIFQGSLTVLVYTRT

Query:  ANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKV-QQFEDSKWAPPFP
         +FL  LKSYVREE G VILKL GGL  L+FCLEWIVLVLAF LKY  Y++ +   +D     KV +Q ED K  P +P
Subjt:  ANFLGELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKV-QQFEDSKWAPPFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCTTTCAACCTCTTCTTCAACCTCAATTTCTAGGGCTCACTACAACATCCACAATCTCTTCCTCCTCTGCAACTACGTCCTCCTCGGCGCCGCCTCCAGCTGCAT
TTTCCTAACCTTATCCCTCCGCCTCCTCCCTTCCGTTTGCGGTTTCTTCATCATCTTCCTCCACGCATTCACCATCGCCGGCGCTATCTCCGGCTGCGCCATGGCCTCTG
CCGCCACTGGCCGCTGGTACGGCGTCCACATGGTCTTCACCGTCCTCACCGCCATCTTCCAAGGCTCCCTCACGGTCCTGGTCTACACCAGAACAGCCAACTTCCTTGGC
GAACTGAAATCGTACGTCCGGGAGGAAAGCGGAACGGTGATTCTGAAATTGGGCGGCGGATTGAGCGGCTTGATCTTCTGTCTGGAGTGGATCGTCCTGGTTCTTGCGTT
TTGGTTGAAATACTATTTGTATGTTGAAGGAAATGGAAGTGGAGAGGATCTGAAGAGGAGTGGGAAGGTTCAGCAGTTTGAAGATTCCAAATGGGCGCCGCCTTTCCCTG
GTTCAATCAACAACAGGATTTACAGATATAAGGAAGGGGACAGACAATGGATCATAGAACAAAAGCTGGTAAAAAATGATCTTGGACGGCTGAACAGAGACAGACAACTA
AAGAATTCTATTCCAGAACACTATGCAGATCCATGCATGTTTGAATTTGTTCTTGCCTACCATCTGCAAGCCTCTCCTTACCCCAATTCATCATCATTAGCTGATTTTCT
GGCAGTTGAAAGTCCATTGGCTTCTGACGAATGCCCCAATGGCTTTTTCCTCTTGAACGAATTGAATGATCTATGGAGCGCCAAGAAGACATGCCGCCTGAAAGTAGAAT
ATTTCTTAAATCTTCTCTTCTCGTTCTACTTCCTTCTCAGTCCTAAAATAAGAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCTTTCAACCTCTTCTTCAACCTCAATTTCTAGGGCTCACTACAACATCCACAATCTCTTCCTCCTCTGCAACTACGTCCTCCTCGGCGCCGCCTCCAGCTGCAT
TTTCCTAACCTTATCCCTCCGCCTCCTCCCTTCCGTTTGCGGTTTCTTCATCATCTTCCTCCACGCATTCACCATCGCCGGCGCTATCTCCGGCTGCGCCATGGCCTCTG
CCGCCACTGGCCGCTGGTACGGCGTCCACATGGTCTTCACCGTCCTCACCGCCATCTTCCAAGGCTCCCTCACGGTCCTGGTCTACACCAGAACAGCCAACTTCCTTGGC
GAACTGAAATCGTACGTCCGGGAGGAAAGCGGAACGGTGATTCTGAAATTGGGCGGCGGATTGAGCGGCTTGATCTTCTGTCTGGAGTGGATCGTCCTGGTTCTTGCGTT
TTGGTTGAAATACTATTTGTATGTTGAAGGAAATGGAAGTGGAGAGGATCTGAAGAGGAGTGGGAAGGTTCAGCAGTTTGAAGATTCCAAATGGGCGCCGCCTTTCCCTG
GTTCAATCAACAACAGGATTTACAGATATAAGGAAGGGGACAGACAATGGATCATAGAACAAAAGCTGGTAAAAAATGATCTTGGACGGCTGAACAGAGACAGACAACTA
AAGAATTCTATTCCAGAACACTATGCAGATCCATGCATGTTTGAATTTGTTCTTGCCTACCATCTGCAAGCCTCTCCTTACCCCAATTCATCATCATTAGCTGATTTTCT
GGCAGTTGAAAGTCCATTGGCTTCTGACGAATGCCCCAATGGCTTTTTCCTCTTGAACGAATTGAATGATCTATGGAGCGCCAAGAAGACATGCCGCCTGAAAGTAGAAT
ATTTCTTAAATCTTCTCTTCTCGTTCTACTTCCTTCTCAGTCCTAAAATAAGAAGTTAA
Protein sequenceShow/hide protein sequence
MGLSTSSSTSISRAHYNIHNLFLLCNYVLLGAASSCIFLTLSLRLLPSVCGFFIIFLHAFTIAGAISGCAMASAATGRWYGVHMVFTVLTAIFQGSLTVLVYTRTANFLG
ELKSYVREESGTVILKLGGGLSGLIFCLEWIVLVLAFWLKYYLYVEGNGSGEDLKRSGKVQQFEDSKWAPPFPGSINNRIYRYKEGDRQWIIEQKLVKNDLGRLNRDRQL
KNSIPEHYADPCMFEFVLAYHLQASPYPNSSSLADFLAVESPLASDECPNGFFLLNELNDLWSAKKTCRLKVEYFLNLLFSFYFLLSPKIRS