; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0016796 (gene) of Chayote v1 genome

Gene IDSed0016796
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF616)
Genome locationLG08:36265444..36271162
RNA-Seq ExpressionSed0016796
SyntenySed0016796
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012433.1 hypothetical protein SDJN02_25185, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-28986.1Show/hide
Query:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMM--KEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAM
        MTGGSLGLRS SYG+LDKQL  +VSPIQ  RKPSKMM  KEKDYLFPWICKFV RKKVGMLLLCIVSAAVFLWVLY+GKGED+Q GQHIQHVSINNSI M
Subjt:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMM--KEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAM

Query:  SFREPSSEETLDGSSF-LAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALM
        SFRE S+EE +D +S+ LAKG ETSSLAS PPPP    PPP PPPPPS+PPPA+FLGYTLPPGHPC+NFA+PPPPADKKRTGPRPCPVCYLPVEEAVALM
Subjt:  SFREPSSEETLDGSSF-LAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALM

Query:  PNASSYSSVKSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQP
        PNASSYS VK+LEYIY ENL RE EFGGSDFGGYPTLAQR DSFDVRESMR+HCGF+GG KPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDV+NQP
Subjt:  PNASSYSSVKSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQP

Query:  NNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNA
         NIS+YA+ TVCFFMF DEETE  LKETGILESSKKIGLWRIVVVH LPYKD+RRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKN+
Subjt:  NNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNA

Query:  TFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKI
        TFAISRHYRRFDVF+EADANKAA KYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVI+REHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKI
Subjt:  TFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKI

Query:  MAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAG-TRDNGL
        MAKTNWTVNMFLDCERRNFVVQKYHRD+L+Q+ASPV  AVHPPPLPPS P          SE+ SSL RKAS R+SRE RSRRHRKV+AG T+ N L
Subjt:  MAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAG-TRDNGL

XP_008442599.1 PREDICTED: uncharacterized protein LOC103486418 [Cucumis melo]1.5e-29585.93Show/hide
Query:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSF
        MTGGSLGLRSGSYG+LDKQLN +VSPIQ ARKPSKMMKEKDYLFPWICKFV RKKVGMLLLC+VSAAVFLWVLY+GKGED +EGQ IQ VSINNS+ MSF
Subjt:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSF

Query:  REPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALMPN
        RE S+E+ +D  SS LAKGIETSS A  PPPPPP             PPPA+FLGYTLPPGHPC+NFA+PPPPADKKRTGPRPCPVCYLPVEEAVALMPN
Subjt:  REPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALMPN

Query:  ASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPN
        ASS S V K+L+YIY ENL RE EFGGSDFGGYPTLAQR DSFD+RESMRVHCGF+GG KPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQP+
Subjt:  ASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPN

Query:  NISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNAT
        NISDYAK+TVCFFMF DEETEA LKE GILESSKKIGLWRI+VVH LPYKD+RRTGKIPKLL+HRMFPNARYSLW+DGKLELVVDPYQ+LERFLWRKNAT
Subjt:  NISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNAT

Query:  FAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM
        FAISRHY+RFDVF EADANKAA KYDNASIDFQ+DFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM
Subjt:  FAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM

Query:  AKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAGTRDNGLS
        AKTNWT+NMFLDCERRNFV+QKYHRDVL+QKA  VPMAVHPPPLPPSPP +++NPV DS S++VSSLPRKASPRR+RE RSRRHRKVAAGT+DN LS
Subjt:  AKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAGTRDNGLS

XP_022994363.1 uncharacterized protein LOC111490105 isoform X1 [Cucurbita maxima]5.0e-29186Show/hide
Query:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMM--KEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAM
        MTGGSLGLRS SYG+LDKQL  +VSPIQ  RKPSKMM  KEKDYLFPWICKFV RKKVGMLLLCIVSAAVFLWVLY+GKGED+Q GQHIQHVSINNSI M
Subjt:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMM--KEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAM

Query:  SFREPSSEETLDGSSF-LAKGIETSSLAS--HPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVA
        SFRE S+EE +D +S+ LA+G ETSSLAS   PPPPPP  PPP PPPPPS+PPPA+FLGYTLPPGHPC+NFA+PPPPADKKRTGPRPCPVCYLPVEEAVA
Subjt:  SFREPSSEETLDGSSF-LAKGIETSSLAS--HPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVA

Query:  LMPNASSYSSVKSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVIN
        LMPNASSYS VK+LEYIY ENL RE EFGGSDFGGYP LAQR DSFDVRESMR+HCGF+ G KPGR TGFDINDDDL+DMEQC GVVVASAIFGNFDV+N
Subjt:  LMPNASSYSSVKSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVIN

Query:  QPNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRK
        QP NIS+YA+ TVCFFMF DEETE  LKETGILESSKKIGLWRIVVVH LPYKD+RRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRK
Subjt:  QPNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRK

Query:  NATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRD
        N+TFAISRHYRRFDVF+EADANKAA KYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVI+REHVPISNLFSCLWFNEVDRFTSRDQISFSTVRD
Subjt:  NATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRD

Query:  KIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPS-PPSIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAG-TRDNGL
        KIMAKTNWT+NMFLDCERRNFVVQKYHRD+L+Q+ASPV  AVHPPPLPPS P SIINPV +S SE+ SSL RKAS R+SRE RSRRHRKV+AG T+ N L
Subjt:  KIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPS-PPSIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAG-TRDNGL

XP_031736022.1 uncharacterized protein LOC101209711 [Cucumis sativus]4.2e-29886.77Show/hide
Query:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSF
        MTGGSLGLRSGSYG+LDKQLN +VSPIQ ARKPSKMMKEKDYLFPWICKFV RKKVGMLLLC+VSAAVFLWVLY+GKGED +EGQHIQ VSINNSI M+F
Subjt:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSF

Query:  REPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALMPN
        RE S+E+ +D  SS +AKGIETSSLA  PPPPPP  PPP PPPPP  PPPA+FLGYTLPPGHPC+NFA+PPPPADKKRTGPRPCPVCYLPVEEAVALMPN
Subjt:  REPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALMPN

Query:  ASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPN
        ASS S V K L+YIY ENL RE EFGGSDFGGYPT+AQR DSFD+RESMRVHCGF+GG KPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQP 
Subjt:  ASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPN

Query:  NISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNAT
        NIS+YAK+TVCFFMF DEETEA LKETGILESSKKIGLWRI+VVH LPYKD+RRTGKIPKLL+HRMFPNARYSLWIDGKLELVVDPYQ+LERFLWRKNAT
Subjt:  NISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNAT

Query:  FAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM
        FAIS+HY+RFDVF EADANKAA KYDNASIDFQ+DFYVKEGLTPYSEAKLPITSDVPEGCVI+REHVPISNLFSCLWFNEVDRFTSRDQISF+TVRDKIM
Subjt:  FAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM

Query:  AKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAGTRDNGLS
        AKTNWT+NMFLDCERRNFV+QKYHRDVL+QKA   PMAVHPPPLPPSPP S++NPV +SSS++VSSLPRKASPRR+RE RSRRHRKVAAGT+DN  S
Subjt:  AKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAGTRDNGLS

XP_038895516.1 uncharacterized protein LOC120083734 [Benincasa hispida]1.9e-29084.81Show/hide
Query:  MTGGSLGLRSGSYGSLDKQLNKL--VSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAM
        MTGGSLGLRSGSYG+LDKQLN +  VSPIQ ARKPSKMMKEKDYLFPWICKFV RKKVGMLLLC+VSAAVFLWVLY+GKGEDAQEGQHIQ VSINNSI M
Subjt:  MTGGSLGLRSGSYGSLDKQLNKL--VSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAM

Query:  SFREPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALM
        S+RE S+E+ +D  SS LAKGI+ SSLAS PP                 PPPA+FLGYTLPPGHPC+NFA+PPPPADKKRTGPRPCPVCYLPVEEAVALM
Subjt:  SFREPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALM

Query:  PNASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQ
        PNASS S V K L+YIY ENL RE EFGGSDFGGYPTLAQR DSFD+RESMRVHCGF+GG KPGRNTGFDINDDDLHDMEQCRGV+VASAIFGNFDVINQ
Subjt:  PNASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQ

Query:  PNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKN
        P NIS+YAK+TVCFFMF DEETEA LK TGILESSKKIGLWRI+VVH LPYKD+RRTGKIPKLL+HRMFPNARYSLWIDGKLELVVDPYQILERFLWRKN
Subjt:  PNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKN

Query:  ATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDK
        ATFAISRHY+RFDVF EADANKAA KYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVI+REHVPISNLFSCLWFNEVDRFTSRDQISFS VRDK
Subjt:  ATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDK

Query:  IMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAGTRDNGLS
        IMAKTNWT+NMF+DCERRNFV+QKYHRDVL+QKA  VPMAVHPPPLPPS P S++NPV DSSS++VSSLPRK SP+R+RE RSRRHRKVAAG +DN LS
Subjt:  IMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAGTRDNGLS

TrEMBL top hitse value%identityAlignment
A0A0A0LRX1 Uncharacterized protein4.6e-29885.06Show/hide
Query:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSF
        MTGGSLGLRSGSYG+LDKQLN +VSPIQ ARKPSKMMKEKDYLFPWICKFV RKKVGMLLLC+VSAAVFLWVLY+GKGED +EGQHIQ VSINNSI M+F
Subjt:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSF

Query:  REPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPP------------PPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCY
        RE S+E+ +D  SS +AKGIETSSLA  PPPPPP  PPP PP            PPP  PPPA+FLGYTLPPGHPC+NFA+PPPPADKKRTGPRPCPVCY
Subjt:  REPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPP------------PPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCY

Query:  LPVEEAVALMPNASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASA
        LPVEEAVALMPNASS S V K L+YIY ENL RE EFGGSDFGGYPT+AQR DSFD+RESMRVHCGF+GG KPGRNTGFDINDDDLHDMEQCRGVVVASA
Subjt:  LPVEEAVALMPNASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASA

Query:  IFGNFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQ
        IFGNFDVINQP NIS+YAK+TVCFFMF DEETEA LKETGILESSKKIGLWRI+VVH LPYKD+RRTGKIPKLL+HRMFPNARYSLWIDGKLELVVDPYQ
Subjt:  IFGNFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQ

Query:  ILERFLWRKNATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRD
        +LERFLWRKNATFAIS+HY+RFDVF EADANKAA KYDNASIDFQ+DFYVKEGLTPYSEAKLPITSDVPEGCVI+REHVPISNLFSCLWFNEVDRFTSRD
Subjt:  ILERFLWRKNATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRD

Query:  QISFSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVA
        QISF+TVRDKIMAKTNWT+NMFLDCERRNFV+QKYHRDVL+QKA   PMAVHPPPLPPSPP S++NPV +SSS++VSSLPRKASPRR+RE RSRRHRKVA
Subjt:  QISFSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVA

Query:  AGTRDNGLS
        AGT+DN  S
Subjt:  AGTRDNGLS

A0A1S3B5K6 uncharacterized protein LOC1034864187.3e-29685.93Show/hide
Query:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSF
        MTGGSLGLRSGSYG+LDKQLN +VSPIQ ARKPSKMMKEKDYLFPWICKFV RKKVGMLLLC+VSAAVFLWVLY+GKGED +EGQ IQ VSINNS+ MSF
Subjt:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSF

Query:  REPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALMPN
        RE S+E+ +D  SS LAKGIETSS A  PPPPPP             PPPA+FLGYTLPPGHPC+NFA+PPPPADKKRTGPRPCPVCYLPVEEAVALMPN
Subjt:  REPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALMPN

Query:  ASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPN
        ASS S V K+L+YIY ENL RE EFGGSDFGGYPTLAQR DSFD+RESMRVHCGF+GG KPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQP+
Subjt:  ASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPN

Query:  NISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNAT
        NISDYAK+TVCFFMF DEETEA LKE GILESSKKIGLWRI+VVH LPYKD+RRTGKIPKLL+HRMFPNARYSLW+DGKLELVVDPYQ+LERFLWRKNAT
Subjt:  NISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNAT

Query:  FAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM
        FAISRHY+RFDVF EADANKAA KYDNASIDFQ+DFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM
Subjt:  FAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM

Query:  AKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAGTRDNGLS
        AKTNWT+NMFLDCERRNFV+QKYHRDVL+QKA  VPMAVHPPPLPPSPP +++NPV DS S++VSSLPRKASPRR+RE RSRRHRKVAAGT+DN LS
Subjt:  AKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAGTRDNGLS

A0A5A7UU08 F3H9.11 protein isoform 17.3e-29685.93Show/hide
Query:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSF
        MTGGSLGLRSGSYG+LDKQLN +VSPIQ ARKPSKMMKEKDYLFPWICKFV RKKVGMLLLC+VSAAVFLWVLY+GKGED +EGQ IQ VSINNS+ MSF
Subjt:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSF

Query:  REPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALMPN
        RE S+E+ +D  SS LAKGIETSS A  PPPPPP             PPPA+FLGYTLPPGHPC+NFA+PPPPADKKRTGPRPCPVCYLPVEEAVALMPN
Subjt:  REPSSEETLD-GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALMPN

Query:  ASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPN
        ASS S V K+L+YIY ENL RE EFGGSDFGGYPTLAQR DSFD+RESMRVHCGF+GG KPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQP+
Subjt:  ASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPN

Query:  NISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNAT
        NISDYAK+TVCFFMF DEETEA LKE GILESSKKIGLWRI+VVH LPYKD+RRTGKIPKLL+HRMFPNARYSLW+DGKLELVVDPYQ+LERFLWRKNAT
Subjt:  NISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNAT

Query:  FAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM
        FAISRHY+RFDVF EADANKAA KYDNASIDFQ+DFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM
Subjt:  FAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM

Query:  AKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAGTRDNGLS
        AKTNWT+NMFLDCERRNFV+QKYHRDVL+QKA  VPMAVHPPPLPPSPP +++NPV DS S++VSSLPRKASPRR+RE RSRRHRKVAAGT+DN LS
Subjt:  AKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPP-SIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAGTRDNGLS

A0A6J1GRP0 uncharacterized protein LOC1114568855.6e-28885.59Show/hide
Query:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMM--KEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAM
        MTGGSLGLRS SYG+LDKQL  +VSPIQ  RKPSKMM  KEKDYLFPWICKFV RKKVGMLLLCIVSAAVFLWVLY+GKGED+Q GQHIQHVSINNSI M
Subjt:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMM--KEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAM

Query:  SFREPSSEETLDGSSF-LAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALM
        SFRE S+EE +D +S+ LAKG ETSSLAS PPPPPP      PPPPPS+PPPA+FLGYTLPPGHPC+ F +PPPPADKKRTGPRPCPVCYLPVEEAVALM
Subjt:  SFREPSSEETLDGSSF-LAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALM

Query:  PNASSYSSVKSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQP
        PNASSYS VK+LEYIY ENL RE EFGGSDFGGYPTLAQR DSFDVRESMR+HCGF+GG KPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDV+NQP
Subjt:  PNASSYSSVKSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQP

Query:  NNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNA
         NIS+YA+ TVCFFMF DEETE  LKETGILESSKKIGLWRIVVVH LPYKD+RRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKN+
Subjt:  NNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNA

Query:  TFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKI
        TFAISRHYRRFDVF+EADANKAA KYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVI+REHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKI
Subjt:  TFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKI

Query:  MAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAG-TRDNGL
        MAKTNWTVNMFLDCERRNFVVQKYHRD+L+Q+ASPV  AVHPPPLPPS P          SE+ SSL RKAS R+SRE RSRRHRKV+AG T+ N L
Subjt:  MAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAG-TRDNGL

A0A6J1K4Y6 uncharacterized protein LOC111490105 isoform X12.4e-29186Show/hide
Query:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMM--KEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAM
        MTGGSLGLRS SYG+LDKQL  +VSPIQ  RKPSKMM  KEKDYLFPWICKFV RKKVGMLLLCIVSAAVFLWVLY+GKGED+Q GQHIQHVSINNSI M
Subjt:  MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMM--KEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAM

Query:  SFREPSSEETLDGSSF-LAKGIETSSLAS--HPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVA
        SFRE S+EE +D +S+ LA+G ETSSLAS   PPPPPP  PPP PPPPPS+PPPA+FLGYTLPPGHPC+NFA+PPPPADKKRTGPRPCPVCYLPVEEAVA
Subjt:  SFREPSSEETLDGSSF-LAKGIETSSLAS--HPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVA

Query:  LMPNASSYSSVKSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVIN
        LMPNASSYS VK+LEYIY ENL RE EFGGSDFGGYP LAQR DSFDVRESMR+HCGF+ G KPGR TGFDINDDDL+DMEQC GVVVASAIFGNFDV+N
Subjt:  LMPNASSYSSVKSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVIN

Query:  QPNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRK
        QP NIS+YA+ TVCFFMF DEETE  LKETGILESSKKIGLWRIVVVH LPYKD+RRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRK
Subjt:  QPNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRK

Query:  NATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRD
        N+TFAISRHYRRFDVF+EADANKAA KYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVI+REHVPISNLFSCLWFNEVDRFTSRDQISFSTVRD
Subjt:  NATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRD

Query:  KIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPS-PPSIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAG-TRDNGL
        KIMAKTNWT+NMFLDCERRNFVVQKYHRD+L+Q+ASPV  AVHPPPLPPS P SIINPV +S SE+ SSL RKAS R+SRE RSRRHRKV+AG T+ N L
Subjt:  KIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPS-PPSIINPVKDSSSEKVSSLPRKASPRRSRE-RSRRHRKVAAG-TRDNGL

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI701.6e-20764.44Show/hide
Query:  MTGGSLGLRSGSYGSLDKQ-LNKLVSPIQIA----RKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNS
        MTG  LG+RS SYGSL+K  LN +V PIQI      KPSKM K+++ +  WICKF  RKKVGMLLL ++SA VFL VLY+GKGED+QEGQ    +  N S
Subjt:  MTGGSLGLRSGSYGSLDKQ-LNKLVSPIQIA----RKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNS

Query:  IAMSFRE--PSSEE---TLDGSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPV
          +++     ++EE    +   SF AK +                           PPP  FLGY+LP GHPC++F +PPPPAD+KRTGPRPCPVCYLPV
Subjt:  IAMSFRE--PSSEE---TLDGSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPV

Query:  EEAVALMPNASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFG
        EEAVALMPNA S+S V K+L YIY E L+RE EFGGSDFGGYPTL  R DSFD++E+M VHCGF+ G +PGRNTGFDI++ DL +M+QCRG+VVASA+F 
Subjt:  EEAVALMPNASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFG

Query:  NFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILE
         FD +  P NIS YA+ TVCF+MF DEETE+ LK    L+ +KK+G+WR+VVVH LPY D RR GK+PKLL+HRMFPNARYSLWIDGKLELVVDPYQILE
Subjt:  NFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILE

Query:  RFLWRKNATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQIS
        RFLWRKNATFAISRHY+RFDV  EA+ANKAA KYDNASIDFQVDFY  EGLTPYS AKLPITSDVPEGCVI+REHVPISNLF+CLWFNEVDRFTSRDQIS
Subjt:  RFLWRKNATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQIS

Query:  FSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRERSRRHRKVAAGTR
        FSTVRDKI AKTNWTV+MFLDCERRNFVVQ+YHR   ++ A   P   + PP PPSPP    PV  S     S LPRK S  R+    RR R   +G R
Subjt:  FSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRERSRRHRKVAAGTR

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)1.1e-20864.44Show/hide
Query:  MTGGSLGLRSGSYGSLDKQ-LNKLVSPIQIA----RKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNS
        MTG  LG+RS SYGSL+K  LN +V PIQI      KPSKM K+++ +  WICKF  RKKVGMLLL ++SA VFL VLY+GKGED+QEGQ    +  N S
Subjt:  MTGGSLGLRSGSYGSLDKQ-LNKLVSPIQIA----RKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNS

Query:  IAMSFRE--PSSEE---TLDGSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPV
          +++     ++EE    +   SF AK +                           PPP  FLGY+LP GHPC++F +PPPPAD+KRTGPRPCPVCYLPV
Subjt:  IAMSFRE--PSSEE---TLDGSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPV

Query:  EEAVALMPNASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFG
        EEAVALMPNA S+S V K+L YIY E L+RE EFGGSDFGGYPTL  R DSFD++E+M VHCGF+ G +PGRNTGFDI++ DL +M+QCRG+VVASA+F 
Subjt:  EEAVALMPNASSYSSV-KSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFG

Query:  NFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILE
         FD +  P NIS YA+ TVCF+MF DEETE+ LK    L+ +KK+G+WR+VVVH LPY D RR GK+PKLL+HRMFPNARYSLWIDGKLELVVDPYQILE
Subjt:  NFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKETGILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILE

Query:  RFLWRKNATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQIS
        RFLWRKNATFAISRHY+RFDV  EA+ANKAA KYDNASIDFQVDFY  EGLTPYS AKLPITSDVPEGCVI+REHVPISNLF+CLWFNEVDRFTSRDQIS
Subjt:  RFLWRKNATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQIS

Query:  FSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRERSRRHRKVAAGTR
        FSTVRDKI AKTNWTV+MFLDCERRNFVVQ+YHR   ++ A   P   + PP PPSPP    PV  S     S LPRK S  R+    RR R   +G R
Subjt:  FSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRERSRRHRKVAAGTR

AT1G34550.1 Protein of unknown function (DUF616)3.7e-7443.25Show/hide
Query:  ADKKRTGPR----PCPVCYLPVEEAVALMPNASSYSSVKSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDIN
        +D KR G R     C +  L   + + + P  +  S+  SL+YI  E+   E E     F G+ +L +R DSF V +  ++HCGF+ G K   +TGFD+ 
Subjt:  ADKKRTGPR----PCPVCYLPVEEAVALMPNASSYSSVKSLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDIN

Query:  DDDLHDMEQCRGVVVASAIFGNFDVINQPNN--ISDYAKSTVCFFMFTDEETEAGLKETG-ILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMF
        +DD + + +C  + V+S IFGN D +  P N  IS  ++  VCF +F DE T   L   G   + +  IGLW++VVV  LPY D RR GKIPK+L HR+F
Subjt:  DDDLHDMEQCRGVVVASAIFGNFDVINQPNN--ISDYAKSTVCFFMFTDEETEAGLKETG-ILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMF

Query:  PNARYSLWIDGKLELVVDPYQILERFLWRKNATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPY--SEAKLPITSDVPEGCVIIRE
        P+ARYS+W+D KL L +DP  ILE FLWRK   +AIS HY R  +++E   NK   KY++  I+ Q  FY  +GLT +  S+    + S+VPEG  I+R 
Subjt:  PNARYSLWIDGKLELVVDPYQILERFLWRKNATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPY--SEAKLPITSDVPEGCVIIRE

Query:  HVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIMAKT---NWTVNMFLDCERRNFVVQKYHR
        H P+SNLFSCLWFNEV+RFT RDQ+SF+    K+        + ++MF DCERR       HR
Subjt:  HVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIMAKT---NWTVNMFLDCERRNFVVQKYHR

AT1G53040.1 Protein of unknown function (DUF616)5.4e-15052.03Show/hide
Query:  KEKDYLFPWI-CKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQH--IQHVSINNSIAMSFREPSSEETLDGSS------FLAKGIETSSL-ASH
        KEK+    ++ C ++ R++V MLLL  ++  VF+   Y    E      H  I+ +   ++     RE +S  T +  +      FL  GI  S +  +H
Subjt:  KEKDYLFPWI-CKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQH--IQHVSINNSIAMSFREPSSEETLDGSS------FLAKGIETSSL-ASH

Query:  PPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAM-PPPPADKKRTGPRPCPVCYLPVEEAVALMPNASSYSS-VKSLEYIYGEN-LSREMEFG
           PPP LP  H                     HPC +F+  PPPP   +R GPRPCPVCYLP EEA+A MP     S  +K+L YI  E+ +  E   G
Subjt:  PPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAM-PPPPADKKRTGPRPCPVCYLPVEEAVALMPNASSYSS-VKSLEYIYGEN-LSREMEFG

Query:  GSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKE
        GS+FGGYP+L  R +SFD++ESM VHCGFI G KPG  TGFDI++D LH+++Q   V+VASAIFG +D+I +P NIS+ A+  + F+MF DEET   LK 
Subjt:  GSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKE

Query:  T-GILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNATFAISRHYRRFDVFKEADANKAAAKY
        T    + +K++GLWRI+VVH +PY D+RR GK+PKLL+HR+FPN RYS+W+D KL+LVVDPYQILERFLWR N++FAISRHYRRFDVF EA+ANKAA KY
Subjt:  T-GILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNATFAISRHYRRFDVFKEADANKAAAKY

Query:  DNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHR
        DNASID+QV+FY KEGLTPY+EAKLPITSDVPEGC IIREH+PI+NLF+C+WFNEVDRFTSRDQ+SF+  RDKI  K +W++NMFLDCERRNFV Q YHR
Subjt:  DNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHR

Query:  DVLKQKASP-VPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRERSRRHRKVAAGTRD
        DVL     P     V P PL       + P    +  + +  P K +P   +   RRHRKV+AG R+
Subjt:  DVLKQKASP-VPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRERSRRHRKVAAGTRD

AT1G53040.2 Protein of unknown function (DUF616)5.4e-15052.03Show/hide
Query:  KEKDYLFPWI-CKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQH--IQHVSINNSIAMSFREPSSEETLDGSS------FLAKGIETSSL-ASH
        KEK+    ++ C ++ R++V MLLL  ++  VF+   Y    E      H  I+ +   ++     RE +S  T +  +      FL  GI  S +  +H
Subjt:  KEKDYLFPWI-CKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQH--IQHVSINNSIAMSFREPSSEETLDGSS------FLAKGIETSSL-ASH

Query:  PPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAM-PPPPADKKRTGPRPCPVCYLPVEEAVALMPNASSYSS-VKSLEYIYGEN-LSREMEFG
           PPP LP  H                     HPC +F+  PPPP   +R GPRPCPVCYLP EEA+A MP     S  +K+L YI  E+ +  E   G
Subjt:  PPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAM-PPPPADKKRTGPRPCPVCYLPVEEAVALMPNASSYSS-VKSLEYIYGEN-LSREMEFG

Query:  GSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKE
        GS+FGGYP+L  R +SFD++ESM VHCGFI G KPG  TGFDI++D LH+++Q   V+VASAIFG +D+I +P NIS+ A+  + F+MF DEET   LK 
Subjt:  GSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKE

Query:  T-GILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNATFAISRHYRRFDVFKEADANKAAAKY
        T    + +K++GLWRI+VVH +PY D+RR GK+PKLL+HR+FPN RYS+W+D KL+LVVDPYQILERFLWR N++FAISRHYRRFDVF EA+ANKAA KY
Subjt:  T-GILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNATFAISRHYRRFDVFKEADANKAAAKY

Query:  DNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHR
        DNASID+QV+FY KEGLTPY+EAKLPITSDVPEGC IIREH+PI+NLF+C+WFNEVDRFTSRDQ+SF+  RDKI  K +W++NMFLDCERRNFV Q YHR
Subjt:  DNASIDFQVDFYVKEGLTPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHR

Query:  DVLKQKASP-VPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRERSRRHRKVAAGTRD
        DVL     P     V P PL       + P    +  + +  P K +P   +   RRHRKV+AG R+
Subjt:  DVLKQKASP-VPMAVHPPPLPPSPPSIINPVKDSSSEKVSSLPRKASPRRSRERSRRHRKVAAGTRD

AT4G09630.1 Protein of unknown function (DUF616)1.4e-7344.68Show/hide
Query:  SLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPNN--ISDYAK
        SL+YI  E+     E     F G+ +L +R DSF V+E  ++HCGF+   +   +TGFD+ +DD + + +C  + V S IFGN D +  P N  +S  ++
Subjt:  SLEYIYGENLSREMEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPNN--ISDYAK

Query:  STVCFFMFTDEETEAGLKETG-ILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNATFAISRH
          VCF +F DE T   L   G + + +  +GLW++VVV  LPY D RR GKIPKLL HR+F +ARYS+W+D KL L +DP  ILE FLWR+   +AIS H
Subjt:  STVCFFMFTDEETEAGLKETG-ILESSKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNATFAISRH

Query:  YRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPY--SEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM---A
        Y R  +++E   NK   KY++  ID Q +FY  +GLT +  S+    + S+VPEG  I+REH P+SNLFSCLWFNEV+RFT RDQ+SF+    K+     
Subjt:  YRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGLTPY--SEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIM---A

Query:  KTNWTVNMFLDCERRNFVVQKYHRDVLKQ
         T + ++MF DCERR       HR   K+
Subjt:  KTNWTVNMFLDCERRNFVVQKYHRDVLKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGGAGGGTCATTGGGACTTCGATCGGGGAGTTATGGGTCATTGGATAAGCAGCTGAACAAATTAGTTTCGCCGATCCAAATCGCACGTAAGCCCTCCAAGATGAT
GAAGGAGAAGGATTATTTGTTTCCTTGGATCTGCAAGTTCGTCCGTAGAAAGAAGGTTGGGATGCTGCTTCTCTGTATTGTTTCTGCTGCGGTTTTCCTCTGGGTGCTGT
ATATGGGCAAAGGTGAGGATGCTCAAGAAGGACAGCATATCCAGCACGTCAGCATTAACAATAGCATAGCTATGAGTTTCAGGGAACCTTCATCTGAAGAAACTTTGGAT
GGTAGTTCTTTTTTGGCAAAGGGGATAGAGACATCTTCATTGGCATCCCATCCTCCTCCTCCTCCTCCTTCTCTGCCTCCGCCTCATCCTCCTCCTCCGCCTTCCATTCC
TCCACCAGCAGTTTTCCTGGGTTATACTCTTCCACCAGGACACCCATGTAGTAATTTTGCTATGCCTCCCCCACCTGCAGATAAAAAGAGAACCGGTCCGAGGCCATGCC
CTGTATGTTATCTTCCCGTGGAAGAAGCTGTTGCCTTGATGCCAAATGCCTCATCATATTCATCTGTTAAAAGTTTGGAATACATTTATGGGGAAAATTTAAGTAGAGAG
ATGGAATTTGGAGGTTCAGACTTTGGTGGATATCCTACTTTAGCCCAGAGGGCTGATTCTTTTGATGTAAGGGAGTCAATGAGGGTGCACTGTGGGTTCATCGGAGGAGC
CAAACCTGGTCGCAACACAGGTTTTGATATCAATGATGATGACCTTCATGACATGGAGCAGTGTCGTGGCGTGGTTGTTGCATCTGCAATCTTTGGAAATTTTGATGTTA
TAAATCAGCCAAATAACATTAGTGACTATGCCAAGAGCACCGTTTGCTTCTTCATGTTTACTGATGAAGAAACAGAAGCAGGGTTAAAGGAAACGGGTATCCTAGAAAGC
AGCAAGAAAATTGGATTGTGGAGAATCGTCGTGGTCCATAAGTTACCTTACAAAGACTCAAGACGTACCGGAAAAATCCCTAAACTTTTGATGCACAGAATGTTTCCCAA
TGCTCGATATTCTCTTTGGATCGATGGAAAACTTGAGCTTGTTGTGGACCCATATCAAATTCTTGAAAGGTTCTTGTGGAGAAAAAATGCTACATTTGCAATTTCTAGAC
ATTACAGACGCTTCGATGTGTTTAAGGAAGCTGATGCAAATAAAGCTGCCGCAAAGTATGATAATGCTTCTATTGACTTTCAGGTTGATTTTTATGTAAAGGAAGGTTTG
ACTCCGTATTCCGAAGCCAAGCTTCCCATTACAAGTGATGTTCCAGAGGGATGTGTGATCATTAGAGAGCATGTACCTATTAGCAATTTGTTCAGTTGCCTTTGGTTCAA
TGAAGTTGATCGTTTTACATCGAGAGATCAAATTAGTTTTTCTACTGTTAGAGACAAAATAATGGCAAAAACAAATTGGACAGTCAATATGTTCTTGGACTGTGAAAGGC
GCAATTTCGTGGTTCAGAAATATCATAGAGACGTCCTTAAACAAAAGGCTTCTCCTGTTCCTATGGCTGTCCATCCCCCACCCCTTCCACCTTCTCCACCTTCTATAATT
AATCCAGTCAAGGACTCATCGTCCGAAAAAGTTTCAAGTTTACCGAGGAAGGCTTCCCCAAGGCGAAGTCGAGAGAGGTCCAGGCGTCATCGTAAAGTCGCTGCAGGTAC
AAGGGACAATGGTTTGAGTTGA
mRNA sequenceShow/hide mRNA sequence
TGAATATTCAACCACTGAATCGCCTCCCCTCCCTATCAACAAACGAATATAAACTGAATGCAAAAACTTGATTAATTTGGTTTTGGCGATCCCAGATTATTCTCGATTTC
AAAAACCTAGATTCATTCCCTCCCTCCCCTATCCGTTTCCAATACTTTCCCGCAAAATAAATTCATCTTTTTCTGAGTAAATTCGTTTGGTAGTAAGCAAAATCGCCGTA
TATACAGGGAATTTGTACTCGTTCACATTTGGGGATTCAGATACAGATTTGATCCTGCCGCCATTCATTTTGGTAGTAATCGTGTCGTGAAATCTGAGCTGCATTTGGGT
TTTCAGTGAGTGCCAATTGGCATAGTCGAATTTGGGCTTCATTTGGGAGGTGGGTCTTGAAATCTTCGAGGTTTGAAGCAAATATTGAGGCGTTTCTATGGTGGGTTTGT
GTTGCAGAGTGATTAGAGGGGGGTTATGAGAAGAACACATGTTAATTTGGAGGAACAATGACTGGAGGGTCATTGGGACTTCGATCGGGGAGTTATGGGTCATTGGATAA
GCAGCTGAACAAATTAGTTTCGCCGATCCAAATCGCACGTAAGCCCTCCAAGATGATGAAGGAGAAGGATTATTTGTTTCCTTGGATCTGCAAGTTCGTCCGTAGAAAGA
AGGTTGGGATGCTGCTTCTCTGTATTGTTTCTGCTGCGGTTTTCCTCTGGGTGCTGTATATGGGCAAAGGTGAGGATGCTCAAGAAGGACAGCATATCCAGCACGTCAGC
ATTAACAATAGCATAGCTATGAGTTTCAGGGAACCTTCATCTGAAGAAACTTTGGATGGTAGTTCTTTTTTGGCAAAGGGGATAGAGACATCTTCATTGGCATCCCATCC
TCCTCCTCCTCCTCCTTCTCTGCCTCCGCCTCATCCTCCTCCTCCGCCTTCCATTCCTCCACCAGCAGTTTTCCTGGGTTATACTCTTCCACCAGGACACCCATGTAGTA
ATTTTGCTATGCCTCCCCCACCTGCAGATAAAAAGAGAACCGGTCCGAGGCCATGCCCTGTATGTTATCTTCCCGTGGAAGAAGCTGTTGCCTTGATGCCAAATGCCTCA
TCATATTCATCTGTTAAAAGTTTGGAATACATTTATGGGGAAAATTTAAGTAGAGAGATGGAATTTGGAGGTTCAGACTTTGGTGGATATCCTACTTTAGCCCAGAGGGC
TGATTCTTTTGATGTAAGGGAGTCAATGAGGGTGCACTGTGGGTTCATCGGAGGAGCCAAACCTGGTCGCAACACAGGTTTTGATATCAATGATGATGACCTTCATGACA
TGGAGCAGTGTCGTGGCGTGGTTGTTGCATCTGCAATCTTTGGAAATTTTGATGTTATAAATCAGCCAAATAACATTAGTGACTATGCCAAGAGCACCGTTTGCTTCTTC
ATGTTTACTGATGAAGAAACAGAAGCAGGGTTAAAGGAAACGGGTATCCTAGAAAGCAGCAAGAAAATTGGATTGTGGAGAATCGTCGTGGTCCATAAGTTACCTTACAA
AGACTCAAGACGTACCGGAAAAATCCCTAAACTTTTGATGCACAGAATGTTTCCCAATGCTCGATATTCTCTTTGGATCGATGGAAAACTTGAGCTTGTTGTGGACCCAT
ATCAAATTCTTGAAAGGTTCTTGTGGAGAAAAAATGCTACATTTGCAATTTCTAGACATTACAGACGCTTCGATGTGTTTAAGGAAGCTGATGCAAATAAAGCTGCCGCA
AAGTATGATAATGCTTCTATTGACTTTCAGGTTGATTTTTATGTAAAGGAAGGTTTGACTCCGTATTCCGAAGCCAAGCTTCCCATTACAAGTGATGTTCCAGAGGGATG
TGTGATCATTAGAGAGCATGTACCTATTAGCAATTTGTTCAGTTGCCTTTGGTTCAATGAAGTTGATCGTTTTACATCGAGAGATCAAATTAGTTTTTCTACTGTTAGAG
ACAAAATAATGGCAAAAACAAATTGGACAGTCAATATGTTCTTGGACTGTGAAAGGCGCAATTTCGTGGTTCAGAAATATCATAGAGACGTCCTTAAACAAAAGGCTTCT
CCTGTTCCTATGGCTGTCCATCCCCCACCCCTTCCACCTTCTCCACCTTCTATAATTAATCCAGTCAAGGACTCATCGTCCGAAAAAGTTTCAAGTTTACCGAGGAAGGC
TTCCCCAAGGCGAAGTCGAGAGAGGTCCAGGCGTCATCGTAAAGTCGCTGCAGGTACAAGGGACAATGGTTTGAGTTGAAGTTTTGTTGAGAAGAGTTATTTTTTAATGT
TTTTTTCTACAACCCCTTTTAAAATATACAGTGGGGCTAGGATGGCTGTGCAAGATGAGTGCTCTAATTCCTGTTTCTACTTAGCTCATTTTCTCATTCATTCTTTTCCT
TTTGGACTTCCCATTCATTTGATGTATATAGAGCAGAGCAAACTATATTTAATAATAACTGTTTTTATAGGTTGCTGCCAAATTCCTACTGGTTTCACAGCCCATCCAAG
C
Protein sequenceShow/hide protein sequence
MTGGSLGLRSGSYGSLDKQLNKLVSPIQIARKPSKMMKEKDYLFPWICKFVRRKKVGMLLLCIVSAAVFLWVLYMGKGEDAQEGQHIQHVSINNSIAMSFREPSSEETLD
GSSFLAKGIETSSLASHPPPPPPSLPPPHPPPPPSIPPPAVFLGYTLPPGHPCSNFAMPPPPADKKRTGPRPCPVCYLPVEEAVALMPNASSYSSVKSLEYIYGENLSRE
MEFGGSDFGGYPTLAQRADSFDVRESMRVHCGFIGGAKPGRNTGFDINDDDLHDMEQCRGVVVASAIFGNFDVINQPNNISDYAKSTVCFFMFTDEETEAGLKETGILES
SKKIGLWRIVVVHKLPYKDSRRTGKIPKLLMHRMFPNARYSLWIDGKLELVVDPYQILERFLWRKNATFAISRHYRRFDVFKEADANKAAAKYDNASIDFQVDFYVKEGL
TPYSEAKLPITSDVPEGCVIIREHVPISNLFSCLWFNEVDRFTSRDQISFSTVRDKIMAKTNWTVNMFLDCERRNFVVQKYHRDVLKQKASPVPMAVHPPPLPPSPPSII
NPVKDSSSEKVSSLPRKASPRRSRERSRRHRKVAAGTRDNGLS