; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028113 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028113
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionAA_kinase domain-containing protein
Genome locationscaffold2:44699465..44703731
RNA-Seq ExpressionSpg028113
SyntenySpg028113
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0005488 - binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN60970.1 hypothetical protein VITISV_026408 [Vitis vinifera]2.1e-1836.3Show/hide
Query:  VVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMME-GSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEH
        ++ ERGL  +  N P  T +I ER W  F   PQVA+VP+VREFYAN+ E      FVRGK + F   +IN F+ LP+I+ D Y  +     +  +++  
Subjt:  VVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMME-GSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEH

Query:  LCRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSD
        +  P     +   + + F S  LT + +AW+ FL   L  V H +D
Subjt:  LCRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSD

EOY13933.1 Uncharacterized protein TCM_032752 [Theobroma cacao]1.3e-2038.51Show/hide
Query:  VPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEG-STTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL
        +PERG+        E+   I +R W  F   P V VV +VREFYA ++E     +FVRGK +PF S +IN+    PNI+ D Y  Y  +H + ++II  L
Subjt:  VPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEG-STTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL

Query:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTE
        C   A      GE ++FK S +  + + W  F+ A+LLP  H+SD T+
Subjt:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTE

KAA0033353.1 putative S-locus lectin protein kinase family protein [Cucumis melo var. makuwa]6.0e-5037.27Show/hide
Query:  VVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL
        V+PERGL P   +QP+L  +I +RGW  F + P+ AVV IVREFYANM+EGS+ SFVRG+ + FD  +IN++Y LPN +RD Y  YAS H + HQII  L
Subjt:  VVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL

Query:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVST
        C+P A  +I  GE I FKSS+LTV N+ WH F+CAKLLPV H S  T                                                     
Subjt:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVST

Query:  LGPERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAA
                                                         + RA+ +   +T R++DVGKVI+ S   IR+   T GLGH SLIT LCR  
Subjt:  LGPERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAA

Query:  GVVWDPREEISHPATATDGNFI
        GVVW+ +EE+  P    D NFI
Subjt:  GVVWDPREEISHPATATDGNFI

KGN46897.1 hypothetical protein Csa_020731 [Cucumis sativus]6.6e-4936.65Show/hide
Query:  VVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL
        V+PERGL P   +QP+L  +I +RGW  F + P+ AV+ IVREFYANM+EGS+ SFVRG+ + FD  +IN++Y LPN +RD Y+ YAS H + HQII  L
Subjt:  VVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL

Query:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVST
        C+P A  +I  GE I FKSS+LTV N+ WH F+CAKLLPV H S  T                                                     
Subjt:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVST

Query:  LGPERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAA
                                                         + RA+ +   +T R++DVGKVI  S   IR+   T GLGH SLIT LCR  
Subjt:  LGPERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAA

Query:  GVVWDPREEISHPATATDGNFI
        GVVW+ +EE+  P    D +FI
Subjt:  GVVWDPREEISHPATATDGNFI

XP_008458668.1 PREDICTED: uncharacterized protein LOC103497996 [Cucumis melo]5.4e-4336.33Show/hide
Query:  ERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHLCRPEATGLIRRGEAINFKSSDL
        +RGW  F + P+ AVV IVREFYANM+EGS+ SFVRG+ + FD  +IN++Y LPN +RD Y  YAS H + HQII  LC+P A  +I  GE I FKSS+L
Subjt:  ERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHLCRPEATGLIRRGEAINFKSSDL

Query:  TVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVSTLGPERLDAATTSAPRKPDVTAS
        TV N+ WH F+CAKLLPV H S  T                                                                           
Subjt:  TVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVSTLGPERLDAATTSAPRKPDVTAS

Query:  RRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAAGVVWDPREEISHPATATDGNFI
                                   + RA+ +   +T R++DVGKVI+ S   IR+   T GLGH SLIT LCR  GVVW+ +EE+  P    D NFI
Subjt:  RRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAAGVVWDPREEISHPATATDGNFI

TrEMBL top hitse value%identityAlignment
A0A061FAJ6 Uncharacterized protein6.3e-2138.51Show/hide
Query:  VPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEG-STTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL
        +PERG+        E+   I +R W  F   P V VV +VREFYA ++E     +FVRGK +PF S +IN+    PNI+ D Y  Y  +H + ++II  L
Subjt:  VPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEG-STTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL

Query:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTE
        C   A      GE ++FK S +  + + W  F+ A+LLP  H+SD T+
Subjt:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTE

A0A0A0KER1 Uncharacterized protein3.2e-4936.65Show/hide
Query:  VVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL
        V+PERGL P   +QP+L  +I +RGW  F + P+ AV+ IVREFYANM+EGS+ SFVRG+ + FD  +IN++Y LPN +RD Y+ YAS H + HQII  L
Subjt:  VVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL

Query:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVST
        C+P A  +I  GE I FKSS+LTV N+ WH F+CAKLLPV H S  T                                                     
Subjt:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVST

Query:  LGPERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAA
                                                         + RA+ +   +T R++DVGKVI  S   IR+   T GLGH SLIT LCR  
Subjt:  LGPERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAA

Query:  GVVWDPREEISHPATATDGNFI
        GVVW+ +EE+  P    D +FI
Subjt:  GVVWDPREEISHPATATDGNFI

A0A0A0KNI1 AA_kinase domain-containing protein2.3e-3130.94Show/hide
Query:  PERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHLCR
        PERGL P   +QP+L  +I +RGW  F + P+ AV+ IVREFYANM+EGS+ SFVRG+ + FD  +IN++Y LPN +RD Y+ YAS H + HQII  LC+
Subjt:  PERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHLCR

Query:  PEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVSTLG
        P A                       W       LLP+ H S  T                                                       
Subjt:  PEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVSTLG

Query:  PERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAAGV
                                                       + RA+ +   +T R++DVGKVI  S   IR+   T GLGH SLIT LCR  GV
Subjt:  PERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAAGV

Query:  VWDPREEISHPATATDGNFI
        VW+ +EE+  P    D +FI
Subjt:  VWDPREEISHPATATDGNFI

A0A1S3C7Y0 uncharacterized protein LOC1034979962.6e-4336.33Show/hide
Query:  ERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHLCRPEATGLIRRGEAINFKSSDL
        +RGW  F + P+ AVV IVREFYANM+EGS+ SFVRG+ + FD  +IN++Y LPN +RD Y  YAS H + HQII  LC+P A  +I  GE I FKSS+L
Subjt:  ERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHLCRPEATGLIRRGEAINFKSSDL

Query:  TVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVSTLGPERLDAATTSAPRKPDVTAS
        TV N+ WH F+CAKLLPV H S  T                                                                           
Subjt:  TVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVSTLGPERLDAATTSAPRKPDVTAS

Query:  RRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAAGVVWDPREEISHPATATDGNFI
                                   + RA+ +   +T R++DVGKVI+ S   IR+   T GLGH SLIT LCR  GVVW+ +EE+  P    D NFI
Subjt:  RRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAAGVVWDPREEISHPATATDGNFI

A0A5D3BBY3 Putative S-locus lectin protein kinase family protein2.9e-5037.27Show/hide
Query:  VVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL
        V+PERGL P   +QP+L  +I +RGW  F + P+ AVV IVREFYANM+EGS+ SFVRG+ + FD  +IN++Y LPN +RD Y  YAS H + HQII  L
Subjt:  VVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASINQFYILPNIDRDGYNDYASNHFNAHQIIEHL

Query:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVST
        C+P A  +I  GE I FKSS+LTV N+ WH F+CAKLLPV H S  T                                                     
Subjt:  CRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPKKNEERRQKSAYEASRRPSVST

Query:  LGPERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAA
                                                         + RA+ +   +T R++DVGKVI+ S   IR+   T GLGH SLIT LCR  
Subjt:  LGPERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGHPSLITTLCRAA

Query:  GVVWDPREEISHPATATDGNFI
        GVVW+ +EE+  P    D NFI
Subjt:  GVVWDPREEISHPATATDGNFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGACGTGGAATCTGGGTAAAAACCCCAGATTTCGAAAGCTTCGGTTAAATTTTGTTTATAAGGAGGAAGGGCCTTTAAGTGTTGGAGTTTTTGGACCCATTGATCT
TGGATCCTTAGGGGTAGTTCCTGAAAGAGGGTTAGTTCCAAGTGCCCAAAACCAACCTGAGCTCACCCATAGTATAGCTGAGAGAGGTTGGGGTACGTTCACTAGGCACC
CTCAGGTCGCTGTAGTCCCTATCGTTAGAGAATTTTACGCTAACATGATGGAAGGATCCACTACTTCCTTTGTTAGAGGTAAAATGATCCCTTTCGACTCGGCCTCTATT
AACCAATTTTATATCCTTCCCAATATCGATAGGGATGGGTACAATGATTATGCAAGTAATCACTTTAATGCCCATCAAATTATAGAGCATCTATGTAGGCCAGAGGCAAC
TGGGTTAATAAGAAGGGGTGAAGCAATAAACTTTAAATCCTCTGATCTGACAGTGGATAACAGAGCATGGCATAGTTTTCTCTGTGCCAAACTCTTACCTGTGATGCATC
TGAGTGATAAAACCGAGTTTGAGAAGCCAAACCAAGAATTTTACCAACTTTGGGCCATAAAAGAAGAAAAGAAAAGAAGATATTTGAAGTGGAGGCCCAAAAAGCCCAAA
AAGAATGAAGAAAGGAGGCAAAAATCTGCATATGAGGCTTCTAGAAGGCCTAGCGTCTCGACGCTAGGACCAGAGCGTCTCGACGCTGCGACAACTTCTGCTCCAAGAAA
ACCTGACGTCACAGCGTCTCGACGCTGTGACGTAGCGTCTCGACGCTGTGCGTCTTGGGCAGATTCGATACTCGGGATACTTTTCGTTTTATATTACTTGCGATACCGTG
CGCTTGCGGTAGCGATCACATCAACCGACCGCAATATAGACGTAGGAAAAGTTATATATTCTTCCACGCGCCGGATTCGCAGAGGAACGACCACAGTAGGTCTAGGGCAC
CCTTCCCTCATCACAACACTCTGTAGAGCCGCTGGGGTCGTGTGGGACCCCCGTGAAGAGATTAGCCATCCTGCAACTGCGACTGATGGGAACTTCATCACGACTAGATT
TAGGGAGCCTGGGCCTAGGATAACCCACCCACCACCTCCACCGCAACAGGAAGAACCAAAAGAGAAGCCCCAAGCTCAGGGAGAGCAGCCCCACTTCGATCTGAAGGATT
GGAATTCCCAATTCCGTTACAGCGGAAGCAATTGGACCTGCTCCTCTAATGAACAACCTATTTATGGTCCAACCAGTAAACAGAAAGTCCCTCTTGGGCCAGTGAGAGGG
CGGGATCCCTTTGTTCAAGACCCGGAGTCACCACTTAAGGGAACACTCATCTACTTCTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCGACGTGGAATCTGGGTAAAAACCCCAGATTTCGAAAGCTTCGGTTAAATTTTGTTTATAAGGAGGAAGGGCCTTTAAGTGTTGGAGTTTTTGGACCCATTGATCT
TGGATCCTTAGGGGTAGTTCCTGAAAGAGGGTTAGTTCCAAGTGCCCAAAACCAACCTGAGCTCACCCATAGTATAGCTGAGAGAGGTTGGGGTACGTTCACTAGGCACC
CTCAGGTCGCTGTAGTCCCTATCGTTAGAGAATTTTACGCTAACATGATGGAAGGATCCACTACTTCCTTTGTTAGAGGTAAAATGATCCCTTTCGACTCGGCCTCTATT
AACCAATTTTATATCCTTCCCAATATCGATAGGGATGGGTACAATGATTATGCAAGTAATCACTTTAATGCCCATCAAATTATAGAGCATCTATGTAGGCCAGAGGCAAC
TGGGTTAATAAGAAGGGGTGAAGCAATAAACTTTAAATCCTCTGATCTGACAGTGGATAACAGAGCATGGCATAGTTTTCTCTGTGCCAAACTCTTACCTGTGATGCATC
TGAGTGATAAAACCGAGTTTGAGAAGCCAAACCAAGAATTTTACCAACTTTGGGCCATAAAAGAAGAAAAGAAAAGAAGATATTTGAAGTGGAGGCCCAAAAAGCCCAAA
AAGAATGAAGAAAGGAGGCAAAAATCTGCATATGAGGCTTCTAGAAGGCCTAGCGTCTCGACGCTAGGACCAGAGCGTCTCGACGCTGCGACAACTTCTGCTCCAAGAAA
ACCTGACGTCACAGCGTCTCGACGCTGTGACGTAGCGTCTCGACGCTGTGCGTCTTGGGCAGATTCGATACTCGGGATACTTTTCGTTTTATATTACTTGCGATACCGTG
CGCTTGCGGTAGCGATCACATCAACCGACCGCAATATAGACGTAGGAAAAGTTATATATTCTTCCACGCGCCGGATTCGCAGAGGAACGACCACAGTAGGTCTAGGGCAC
CCTTCCCTCATCACAACACTCTGTAGAGCCGCTGGGGTCGTGTGGGACCCCCGTGAAGAGATTAGCCATCCTGCAACTGCGACTGATGGGAACTTCATCACGACTAGATT
TAGGGAGCCTGGGCCTAGGATAACCCACCCACCACCTCCACCGCAACAGGAAGAACCAAAAGAGAAGCCCCAAGCTCAGGGAGAGCAGCCCCACTTCGATCTGAAGGATT
GGAATTCCCAATTCCGTTACAGCGGAAGCAATTGGACCTGCTCCTCTAATGAACAACCTATTTATGGTCCAACCAGTAAACAGAAAGTCCCTCTTGGGCCAGTGAGAGGG
CGGGATCCCTTTGTTCAAGACCCGGAGTCACCACTTAAGGGAACACTCATCTACTTCTCCTAG
Protein sequenceShow/hide protein sequence
MPTWNLGKNPRFRKLRLNFVYKEEGPLSVGVFGPIDLGSLGVVPERGLVPSAQNQPELTHSIAERGWGTFTRHPQVAVVPIVREFYANMMEGSTTSFVRGKMIPFDSASI
NQFYILPNIDRDGYNDYASNHFNAHQIIEHLCRPEATGLIRRGEAINFKSSDLTVDNRAWHSFLCAKLLPVMHLSDKTEFEKPNQEFYQLWAIKEEKKRRYLKWRPKKPK
KNEERRQKSAYEASRRPSVSTLGPERLDAATTSAPRKPDVTASRRCDVASRRCASWADSILGILFVLYYLRYRALAVAITSTDRNIDVGKVIYSSTRRIRRGTTTVGLGH
PSLITTLCRAAGVVWDPREEISHPATATDGNFITTRFREPGPRITHPPPPPQQEEPKEKPQAQGEQPHFDLKDWNSQFRYSGSNWTCSSNEQPIYGPTSKQKVPLGPVRG
RDPFVQDPESPLKGTLIYFS