CuGenDBv2

Sgr008580 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr008580
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: S-protein homolog
Genome location: tig00007012:70731..74534
RNA-Seq Expression: Sgr008580
Synteny: Sgr008580
Gene Ontology terms: GO:0060320 - rejection of self pollen (biological process)
                     GO:0005576 - extracellular region (cellular component)
InterPro domains: IPR010264 - Plant self-incompatibility S1
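
The genome location field follows a contig:start..end pattern. A minimal Python sketch for splitting it into parts is given here; the function name parse_location is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00007012:70731..74534")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene spans 3804 bp on the contig, which is longer than the 840 nt CDS listed under Sequences.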


Homology
GenBank top hits
XP_016902699.1 PREDICTED: uncharacterized protein LOC107991826 [Cucumis melo] (e value: 2.2e-55; % identity: 62.2)
Query:  LLHAPSGACMRQVAPVDAPAMGVRSLKKHFLV--FLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIW
        L++ P   C  +   + +    +RS+K HFLV  FL VL +AI +P +A  L +WHIH+VNGLSN Q LFVHC+SKD+DLG+  LS GTEFNWTF++N W
Subjt:  LLHAPSGACMRQVAPVDAPAMGVRSLKKHFLV--FLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIW

Query:  NTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVHN
        +TTLFWCYL+ PN +SVSFE+FWVE+ SIWL+YRCF SNC WTAKDDGIYL+DNP  RD LVHN
Subjt:  NTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVHN

XP_022143772.1 S-protein homolog 74-like [Momordica charantia] (e value: 1.8e-65; % identity: 81.82)
Query:  AMGVRSLKKHFLVFLLVLWVAIAEPIEA-TELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFE
        AMGVRS KKHFLVFLLVL + I EPIEA +EL+KW IH+VNGLSNGQ LFVHCKSKDNDLGEHNL+SGTEFNWTFRVN+WNTTLFWCYL  P+G+S SF+
Subjt:  AMGVRSLKKHFLVFLLVLWVAIAEPIEA-TELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFE

Query:  SFWVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH
         FWVEK SIWL+YRC+ SNC WTAKDDGIYLRDNPVQRD+LVH
Subjt:  SFWVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH

XP_022143780.1 S-protein homolog 1-like [Momordica charantia] (e value: 9.3e-54; % identity: 73.88)
Query:  KHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSI
        KHFLVFL V  +AI E IEA  L KW IH+ N LSN Q+LFVHCKSK++DLGEHNLS GTEFNW FRVNIW+TTL+WCYL+ PNGQSVSF++FWVEK SI
Subjt:  KHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSI

Query:  WLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH
        WLYY+C +SNC W AKDDGIYLR+NP  RDV VH
Subjt:  WLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH

XP_022143829.1 S-protein homolog 74-like [Momordica charantia] (e value: 3.1e-57; % identity: 71.63)
Query:  MGVRSLKKHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESF
        MG R LKKHFLV LL L +AI EP  + EL++W+IH+VNGL NG++LFVHCKS+D+DLGE NL  G EF+WTFRVN+ +TTLFWC+LR P+ QSVSF++F
Subjt:  MGVRSLKKHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESF

Query:  WVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH
        WVEK+SIWL+YRC+D+NC WTAKDDG+YLRDNPVQRDVLVH
Subjt:  WVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH

XP_031745090.1 S-protein homolog 1-like [Cucumis sativus] (e value: 3.3e-51; % identity: 67.88)
Query:  LKKHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKS
        +KK  +V L VL +AI E  +A EL KWHIH+VNGLSNGQIL  HCKSKDNDLGE  L +GTEFNW FRVN WNTTLFWCYL+ PNGQ  SFESFW+E  
Subjt:  LKKHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKS

Query:  SIWLYYRCFDSNCFWTAKDDGIYLRDN-PVQRDVLVH
        S+WLY  CF+ NC WTAKDDGIYL+DN    +D+L+H
Subjt:  SIWLYYRCFDSNCFWTAKDDGIYLRDN-PVQRDVLVH

TrEMBL top hits
A0A1S4E390 S-protein homolog (e value: 1.1e-55; % identity: 62.2)
Query:  LLHAPSGACMRQVAPVDAPAMGVRSLKKHFLV--FLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIW
        L++ P   C  +   + +    +RS+K HFLV  FL VL +AI +P +A  L +WHIH+VNGLSN Q LFVHC+SKD+DLG+  LS GTEFNWTF++N W
Subjt:  LLHAPSGACMRQVAPVDAPAMGVRSLKKHFLV--FLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIW

Query:  NTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVHN
        +TTLFWCYL+ PN +SVSFE+FWVE+ SIWL+YRCF SNC WTAKDDGIYL+DNP  RD LVHN
Subjt:  NTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVHN

A0A6J1CPR8 S-protein homolog (e value: 4.5e-54; % identity: 73.88)
Query:  KHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSI
        KHFLVFL V  +AI E IEA  L KW IH+ N LSN Q+LFVHCKSK++DLGEHNLS GTEFNW FRVNIW+TTL+WCYL+ PNGQSVSF++FWVEK SI
Subjt:  KHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSI

Query:  WLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH
        WLYY+C +SNC W AKDDGIYLR+NP  RDV VH
Subjt:  WLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH

A0A6J1CQH6 S-protein homolog (e value: 1.5e-57; % identity: 71.63)
Query:  MGVRSLKKHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESF
        MG R LKKHFLV LL L +AI EP  + EL++W+IH+VNGL NG++LFVHCKS+D+DLGE NL  G EF+WTFRVN+ +TTLFWC+LR P+ QSVSF++F
Subjt:  MGVRSLKKHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESF

Query:  WVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH
        WVEK+SIWL+YRC+D+NC WTAKDDG+YLRDNPVQRDVLVH
Subjt:  WVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH

A0A6J1CRU0 S-protein homolog (e value: 8.7e-66; % identity: 81.82)
Query:  AMGVRSLKKHFLVFLLVLWVAIAEPIEA-TELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFE
        AMGVRS KKHFLVFLLVL + I EPIEA +EL+KW IH+VNGLSNGQ LFVHCKSKDNDLGEHNL+SGTEFNWTFRVN+WNTTLFWCYL  P+G+S SF+
Subjt:  AMGVRSLKKHFLVFLLVLWVAIAEPIEA-TELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFE

Query:  SFWVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH
         FWVEK SIWL+YRC+ SNC WTAKDDGIYLRDNPVQRD+LVH
Subjt:  SFWVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH

A0A6J1HAC3 S-protein homolog (e value: 3.7e-40; % identity: 56.03)
Query:  MGVRSLKKHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESF
        M   SLK   LVFL+   +A+A    +    KW IH+ N LSNGQ +FVHCKSKDNDLGEH L++GTEF W F+VN W+TTLFWCYLR PNG  ++F++F
Subjt:  MGVRSLKKHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESF

Query:  WVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH
        WVEK + WL  +C  + C WTA+D+GIYL+DN    D  VH
Subjt:  WVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVH

SwissProt top hits
F4JLS0 S-protein homolog 1 (e value: 5.1e-23; % identity: 43.24)
Query:  ELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIY
        ++ +W + +VNGL+ G+ LF+HCKSK++DLGE NL     F+W F  N+ ++T FWCY+   NG  ++   FW     + L++RC   NC WTAK DG+Y
Subjt:  ELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIY

Query:  LRDNPVQRDVL
        L ++    DVL
Subjt:  LRDNPVQRDVL

P0DN92 S-protein homolog 24 (e value: 7.0e-12; % identity: 34.35)
Query:  FLVFLLVLWVAIAEPI---EATELRKWHIHMVN-GLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKS
        F+V ++V+ +  +E +   EA E  + H+  V     N  +L +HCKS+D+DLG H L+ G  F W F VN   +TL++C       +   FE +   + 
Subjt:  FLVFLLVLWVAIAEPI---EATELRKWHIHMVN-GLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKS

Query:  SIWLYYRCFDSNCFWTAKDDGIY-LRDNPVQ
            +YRC  +NC W A+ DGIY   ++PV+
Subjt:  SIWLYYRCFDSNCFWTAKDDGIY-LRDNPVQ

Q2HQ46 S-protein homolog 74 (e value: 3.0e-23; % identity: 41.74)
Query:  IEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKD
        I   ++ +W + + NGL+ G+ LF+HCKSK+NDLG+ NL     F+W F  N+ ++TLFWCY+   +G  ++ + FW     + L++RC   NC WTAK+
Subjt:  IEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKD

Query:  DGIYLRDNPVQRDVL
        DG+YL ++ +  DVL
Subjt:  DGIYLRDNPVQRDVL

Q40975 Self-incompatibility protein S1 (e value: 6.5e-10; % identity: 32.98)
Query:  IHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIY
        + ++N   NG+ + +HC+SKDNDL    ++SG + +++FR + ++TT F+C L+        F S+  ++       RC  S C W   DDG+Y
Subjt:  IHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIY

Q9FI84 S-protein homolog 27 (e value: 6.5e-10; % identity: 37.93)
Query:  SNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIY
        +N  +L +HCKSKD+DLG H    G  + W F VN  N+TL++C           F+    E+     +YRC   NC W AK D +Y
Subjt:  SNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIY
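
In the alignment blocks above, the unlabelled middle line marks identical residues with the residue letter and conservative substitutions with "+". A small Python sketch of how identity could be tallied from a query line and its midline follows. The function name midline_stats is hypothetical, and the example uses only the short closing fragment of the F4JLS0 alignment, so its percentage is for that fragment alone and is not the whole-hit value reported in the table.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style midline.

    Letters in the midline mark identical residues; '+' marks
    conservative substitutions; spaces mark mismatches or gaps.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100.0 * identities / length, 2),
    }


# Closing fragment of the F4JLS0 alignment (query line and midline only).
stats = midline_stats("LRDNPVQRDVL", "L ++    DVL")
print(stats)
```

Summing identities and lengths over every block of a hit, rather than one fragment, would be the way to compare against the reported % identity column.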

Arabidopsis top hits
AT4G16295.1 S-protein homologue 1 (e value: 3.7e-24; % identity: 43.24)
Query:  ELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIY
        ++ +W + +VNGL+ G+ LF+HCKSK++DLGE NL     F+W F  N+ ++T FWCY+   NG  ++   FW     + L++RC   NC WTAK DG+Y
Subjt:  ELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIY

Query:  LRDNPVQRDVL
        L ++    DVL
Subjt:  LRDNPVQRDVL

AT4G29035.1 Plant self-incompatibility protein S1 family (e value: 2.1e-24; % identity: 41.74)
Query:  IEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKD
        I   ++ +W + + NGL+ G+ LF+HCKSK+NDLG+ NL     F+W F  N+ ++TLFWCY+   +G  ++ + FW     + L++RC   NC WTAK+
Subjt:  IEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKD

Query:  DGIYLRDNPVQRDVL
        DG+YL ++ +  DVL
Subjt:  DGIYLRDNPVQRDVL

AT5G04350.1 Plant self-incompatibility protein S1 family (e value: 7.6e-14; % identity: 31.3)
Query:  HFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYL-RTPNGQS----VSFESFWVE
        +  +F +V+ + I       E+ +  + + N L + ++L VHC+SKD+DLGEH L  G ++ +TF  NIW TT F C + + PN +     V++E+ W  
Subjt:  HFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYL-RTPNGQS----VSFESFWVE

Query:  KSSIWLYYRCFDSNCFWTAKDDGIYLRDNPV
                +  +++C W  ++DGIY   + V
Subjt:  KSSIWLYYRCFDSNCFWTAKDDGIYLRDNPV

AT5G06020.1 Plant self-incompatibility protein S1 family (e value: 4.6e-11; % identity: 37.93)
Query:  SNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIY
        +N  +L +HCKSKD+DLG H    G  + W F VN  N+TL++C           F+    E+     +YRC   NC W AK D +Y
Subjt:  SNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIY

AT5G06030.1 Plant self-incompatibility protein S1 family (e value: 6.7e-10; % identity: 29.93)
Query:  FLVFLLVLWVAIAEPI---EATELRKWHIHMVNGLS-NGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKS
        F++ ++V+ +  +E +   +A E  + H+  V   + N  +L +HCKS+D+DLG H L+ G  F W F VN   +TL +C           F  +   + 
Subjt:  FLVFLLVLWVAIAEPI---EATELRKWHIHMVNGLS-NGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNTTLFWCYLRTPNGQSVSFESFWVEKS

Query:  SIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVHN
            +YRC  +NC W A+ DG +   +   R  L +N
Subjt:  SIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVHN


Sequences
CDS sequence
ATGCAGCCGGCACCCCCGTGTGGTGCCCGACTCTTGCATGCACCATCTGGTGCATGCATGCGCCAGGTGGCGCCTGTGGACGCGCCAGCAATGGGAGTGAGATCTCTGAA
GAAGCACTTTTTGGTTTTCTTGCTTGTCTTGTGGGTGGCCATAGCTGAGCCAATCGAGGCTACCGAGCTGAGGAAATGGCACATTCACATGGTGAATGGGTTGAGCAACG
GACAAATTCTCTTTGTGCACTGCAAATCGAAGGACAATGATCTAGGCGAACACAATCTCAGCAGCGGAACTGAATTCAACTGGACTTTCAGAGTGAACATTTGGAACACG
ACACTGTTTTGGTGTTACCTGCGAACGCCAAATGGACAATCTGTATCATTCGAGTCCTTTTGGGTTGAGAAGAGCTCGATTTGGCTCTATTACAGATGTTTCGATTCTAA
CTGCTTTTGGACTGCAAAAGATGATGGAATCTACCTAAGAGACAATCCTGTTCAAAGAGATGTTTTGGTTCATAACTTGATTGCTGCCGCAGGGCCATCATCGGGTGAAG
CGGCGAGACGGCTGACGACGGCGCTCCACGCCTTTACGTGGGGAAGCGGCCACCGCGAGGAGGCCGAGCGAATTGCCGTCGAGGCGGTGGAGCTGGTTATCATGGCGGAG
GCGCGTGATGATGGACTGAGCTCGAGTGAGGCTCGAATGGGTGTTCTGATGAGGGCAGTGGGTATATGTTATCCAGGCCTCATCGGTTTTCCTCTGCACGTAGCAGTGAT
TAGAGTCTCTGTTCTAAGGAACTGGATGATGAAATCGCTGACGACAGAGGGGTCGGTGGAGTTGGTTTGA
mRNA sequence
ATGCAGCCGGCACCCCCGTGTGGTGCCCGACTCTTGCATGCACCATCTGGTGCATGCATGCGCCAGGTGGCGCCTGTGGACGCGCCAGCAATGGGAGTGAGATCTCTGAA
GAAGCACTTTTTGGTTTTCTTGCTTGTCTTGTGGGTGGCCATAGCTGAGCCAATCGAGGCTACCGAGCTGAGGAAATGGCACATTCACATGGTGAATGGGTTGAGCAACG
GACAAATTCTCTTTGTGCACTGCAAATCGAAGGACAATGATCTAGGCGAACACAATCTCAGCAGCGGAACTGAATTCAACTGGACTTTCAGAGTGAACATTTGGAACACG
ACACTGTTTTGGTGTTACCTGCGAACGCCAAATGGACAATCTGTATCATTCGAGTCCTTTTGGGTTGAGAAGAGCTCGATTTGGCTCTATTACAGATGTTTCGATTCTAA
CTGCTTTTGGACTGCAAAAGATGATGGAATCTACCTAAGAGACAATCCTGTTCAAAGAGATGTTTTGGTTCATAACTTGATTGCTGCCGCAGGGCCATCATCGGGTGAAG
CGGCGAGACGGCTGACGACGGCGCTCCACGCCTTTACGTGGGGAAGCGGCCACCGCGAGGAGGCCGAGCGAATTGCCGTCGAGGCGGTGGAGCTGGTTATCATGGCGGAG
GCGCGTGATGATGGACTGAGCTCGAGTGAGGCTCGAATGGGTGTTCTGATGAGGGCAGTGGGTATATGTTATCCAGGCCTCATCGGTTTTCCTCTGCACGTAGCAGTGAT
TAGAGTCTCTGTTCTAAGGAACTGGATGATGAAATCGCTGACGACAGAGGGGTCGGTGGAGTTGGTTTGA
Protein sequence
MQPAPPCGARLLHAPSGACMRQVAPVDAPAMGVRSLKKHFLVFLLVLWVAIAEPIEATELRKWHIHMVNGLSNGQILFVHCKSKDNDLGEHNLSSGTEFNWTFRVNIWNT
TLFWCYLRTPNGQSVSFESFWVEKSSIWLYYRCFDSNCFWTAKDDGIYLRDNPVQRDVLVHNLIAAAGPSSGEAARRLTTALHAFTWGSGHREEAERIAVEAVELVIMAE
ARDDGLSSSEARMGVLMRAVGICYPGLIGFPLHVAVIRVSVLRNWMMKSLTTEGSVELV
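
The protein sequence above should correspond to a translation of the CDS. A minimal Python sketch for checking this is shown below; it assumes the standard nuclear genetic code, and the names CODON_TABLE and translate are illustrative, not part of any database tool. Only the first 27 and the last 15 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCAGCCGGCACCCCCGTGTGGTGCC"  # first 27 nt of the CDS
cds_end = "GTGGAGTTGGTTTGA"                # last 15 nt of the CDS, in frame
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first nine and last four residues of the listed protein; running translate on the full CDS, joined into a single line, extends the same check to the whole sequence.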