; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030108 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030108
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionMucin-2
Genome locationscaffold6:8762943..8765030
RNA-Seq ExpressionSpg030108
SyntenySpg030108
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146564.1 uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus]3.5e-19182.02Show/hide
Query:  MRRRANADDAADLRPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAA
        MRRR + D   D RPV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGS+KQRKRIGHAVLVPEPSPS+ + H+NTL+SPDIVLPFAA
Subjt:  MRRRANADDAADLRPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAA

Query:  PPSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPFTPPESILLTTPSSPEVPFAQY-QPSLQK
        PPSSPVS LQSEPPSA QSPTA +SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTPFTPPESI LTTPSSPEVPFAQ+ QP+L K
Subjt:  PPSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPFTPPESILLTTPSSPEVPFAQY-QPSLQK

Query:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHDF
         ESD+QY FP DDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPD DFAS GSQF NF LEVPPTLLNLDK  IH+WRQ QS+DSC Q+S  F SS+DF
Subjt:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHDF

Query:  DLNPQTSESMSDHHATNESQNIQILID--GWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAK
         LNPQTSESMSDHHATNESQNIQILID    +EEEP A NHRFSFELSD D LL+S  SKPLESNELAV SSPI EPFET KE SP G  TSN  EEK K
Subjt:  DLNPQTSESMSDHHATNESQNIQILID--GWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAK

Query:  AEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN
        A+G+EA++ QEHHS+TLGSVKEFNFDNGNGSD   PNINS+WW N
Subjt:  AEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN

XP_022136623.1 uncharacterized protein At1g76660-like [Momordica charantia]9.8e-19482.74Show/hide
Query:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP
        MRRR +AD  ADL PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS +   +NTL+SPDIVLPFAAP
Subjt:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK
        PSSPVSFLQSEPPSATQSPTA LSFTSLTANMYSPDGPSSIFA+GPFAHETQLVSPPLNFST+TT+PST PFTPPESI LTTPSSPEVPFAQY QPS QK
Subjt:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK

Query:  AESDDQY-PFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHD
         ESD QY  FP DDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPD DF  SGS FSNF +EVPPTLLNLD+  I  WR  QSSDSC QNS G+ SS+D
Subjt:  AESDDQY-PFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHD

Query:  FDLNPQTSESMSDHHATNESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAKA
        F LNPQTSES+SD+HA+NE  NIQIL DG Q +E AAANHRFSFELSDEDALL+S E+KPLESNELAV SSPI EP ETAKETS VGG TSN TEE+ KA
Subjt:  FDLNPQTSESMSDHHATNESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAKA

Query:  EGEEANEHQ--EHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN
        +GEE + HQ  EHHS+TLG+VKEFNFDNGNG D LKPNINS WWAN
Subjt:  EGEEANEHQ--EHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN

XP_023522163.1 uncharacterized protein At1g76660-like [Cucurbita pepo subsp. pepo]1.5e-18982.71Show/hide
Query:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP
        MRRRA+A DAADLRP+NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS  +AHQN+L+SPDIVLPFAAP
Subjt:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTP-FTPPESILLTTPSSPEVPFAQY-QPSLQK
        PSSPVSFLQSEPPSATQSP+  LSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTP FTPPESI LTTPSSPEVPFAQ+ QP+LQK
Subjt:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTP-FTPPESILLTTPSSPEVPFAQY-QPSLQK

Query:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDKQIHSWRQGQSSDSCNQNSTGFTSS-HDF
        AESDDQY  P DDFQSYQFYPGSP+S+LISPRS IS SGASSPLPDLDFASS SQFSNFSL+VPP LLNLD      RQGQSSDSC QNS GF S+  DF
Subjt:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDKQIHSWRQGQSSDSCNQNSTGFTSS-HDF

Query:  DLNPQTSESMSDHHATNESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAKAE
        DLNP+TS+SM      NESQNIQILIDG Q EEP   NHRFSFELSDED+LLR+ ESKPLESN +AV SSP+ E FETAKETS  GG +SNG EEKA A+
Subjt:  DLNPQTSESMSDHHATNESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAKAE

Query:  GEEANEHQE-HHSITLGSVKEFNFDNGNGSDALKPNINSDWWANFGSVKVR
        GEEAN+HQE HHS TLGSV EFNFDNGNGS+ALKPNINSDWWAN   V+ +
Subjt:  GEEANEHQE-HHSITLGSVKEFNFDNGNGSDALKPNINSDWWANFGSVKVR

XP_023529207.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo]3.9e-19082.74Show/hide
Query:  MRRRANAD-DAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAA
        MRRRA+AD DAADLRP+NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS  +AHQN+L+SPDIVLPFAA
Subjt:  MRRRANAD-DAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAA

Query:  PPSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTP-FTPPESILLTTPSSPEVPFAQY-QPSLQ
        PPSSPVSFLQSEPPSATQSP+  LSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTP FTPPESI LTTPSSPEVPFAQ+ QP+LQ
Subjt:  PPSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTP-FTPPESILLTTPSSPEVPFAQY-QPSLQ

Query:  KAESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDKQIHSWRQGQSSDSCNQNSTGFTSS-HD
        KAESDDQY  P DDFQSYQFYPGSP+S+LISPRS IS SGASSPLPDLDFASS SQFSNFSL+VPP LLNLD      RQGQSSDSC QNS GF S+  D
Subjt:  KAESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDKQIHSWRQGQSSDSCNQNSTGFTSS-HD

Query:  FDLNPQTSESMSDHHATNESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAKA
        FDLNP+TS+SM      NESQNIQILIDG Q EEP   NHRFSFELSDED+LLR+ ESKPLESN +AV SSP+ E FETAKETS  GG +SNG EEKA A
Subjt:  FDLNPQTSESMSDHHATNESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAKA

Query:  EGEEANEHQE-HHSITLGSVKEFNFDNGNGSDALKPNINSDWWANFGSVKVR
        +GEEAN+HQE HHS TLGSV EFNFDNGNGS+ALKPNINSDWWAN   V+ +
Subjt:  EGEEANEHQE-HHSITLGSVKEFNFDNGNGSDALKPNINSDWWANFGSVKVR

XP_038884079.1 uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida]5.2e-19583.63Show/hide
Query:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP
        MRRR + DD+   RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE SPS+ ++H+N+L+SPDIVLPFAAP
Subjt:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK
        PSSPVSFLQSEPPSATQSPTA +SFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST PFTPPESI LTTPSSPEVPFAQ+ QP+LQK
Subjt:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK

Query:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDKQ-IHSWRQGQSSDSCNQNSTGFTSSHDF
        +ESD QYPFP DDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPD DFAS GSQF NF LEVPPTLLNLDKQ IH+WRQ QS+DSC Q+S    SS+DF
Subjt:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDKQ-IHSWRQGQSSDSCNQNSTGFTSSHDF

Query:  DLNPQTSESMSDHHATNESQNIQILIDGWQEEE--PAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPV-GGQTSNGTEEKA
         LNPQTSESMSDHHATNESQNIQILIDG Q+EE  P A NHRFSFELSD DALL+S  SKPL+SNE+AV SSPI EPFETAKE SPV    TSN TE K 
Subjt:  DLNPQTSESMSDHHATNESQNIQILIDGWQEEE--PAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPV-GGQTSNGTEEKA

Query:  KAEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN
        KAE EEA++HQEHHSITLGSVKEFNFDNGNGSD  K N+NS+WW N
Subjt:  KAEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN

TrEMBL top hitse value%identityAlignment
A0A1S3BSB0 uncharacterized protein LOC103493162 isoform X17.3e-18780.94Show/hide
Query:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAA
        MRRR + D   D RPVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS+ + H+NTL+SPDIVLPFAA
Subjt:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAA

Query:  PPSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQ
        PPSSPVS LQSEPPSA QSPTA +SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESI LTTPSSPEVPFAQ+  PSLQ
Subjt:  PPSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQ

Query:  KAESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHD
        K ESD+QY FP DDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPD DFAS GSQF NF LEVPPTL NLDK  IH+WRQ QS+DSC Q+S  F SS+D
Subjt:  KAESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHD

Query:  FDLNPQTSESMSDHHATNESQNIQILID--GWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKA
        F LNP TSESM DHHATNESQNIQILID    +EEEP A NHRFSFELSD D L +S  SKPLESNEL V SSPI EPFET KE SP G  TSN  EEK 
Subjt:  FDLNPQTSESMSDHHATNESQNIQILID--GWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKA

Query:  KAEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN
        KA+G+EA++HQEHHS+ LGSVKEFNFDN NGSD   P INSDWW N
Subjt:  KAEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN

A0A1S3BSY8 uncharacterized protein LOC103493162 isoform X23.0e-18881.12Show/hide
Query:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP
        MRRR + D   D RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS+ + H+NTL+SPDIVLPFAAP
Subjt:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK
        PSSPVS LQSEPPSA QSPTA +SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESI LTTPSSPEVPFAQ+  PSLQK
Subjt:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK

Query:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHDF
         ESD+QY FP DDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPD DFAS GSQF NF LEVPPTL NLDK  IH+WRQ QS+DSC Q+S  F SS+DF
Subjt:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHDF

Query:  DLNPQTSESMSDHHATNESQNIQILID--GWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAK
         LNP TSESM DHHATNESQNIQILID    +EEEP A NHRFSFELSD D L +S  SKPLESNEL V SSPI EPFET KE SP G  TSN  EEK K
Subjt:  DLNPQTSESMSDHHATNESQNIQILID--GWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAK

Query:  AEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN
        A+G+EA++HQEHHS+ LGSVKEFNFDN NGSD   P INSDWW N
Subjt:  AEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN

A0A5A7TUB1 Mucin-29.6e-18780.45Show/hide
Query:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP
        MRRR + D   D RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS+ + H+NTL+SPDIVLPFAAP
Subjt:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK
        PSSPVS LQSEPPSA QSPTA +SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESI LTTPSSPEVPFAQ+  PS QK
Subjt:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK

Query:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHDF
         ESD+QY FP DDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPD DFAS GSQF NF L+VPPTL N+DK  IH+WRQ QS+DSC Q+S  F SS+DF
Subjt:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHDF

Query:  DLNPQTSESMSDHHATNESQNIQILID--GWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAK
         LNP TSESM DHHATNESQNIQILID    +EEEP A NHRFSFELSD D L +S  SKPLESNEL V SSPI EPFET KE SP G  TSN  EEK K
Subjt:  DLNPQTSESMSDHHATNESQNIQILID--GWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAK

Query:  AEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN
        A+G+EA++HQEHHS+ LGSVKEFNFDN NGSD   P INSDWW N
Subjt:  AEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN

A0A5D3CYQ2 Mucin-23.0e-18881.12Show/hide
Query:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP
        MRRR + D   D RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS+ + H+NTL+SPDIVLPFAAP
Subjt:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK
        PSSPVS LQSEPPSA QSPTA +SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESI LTTPSSPEVPFAQ+  PSLQK
Subjt:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK

Query:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHDF
         ESD+QY FP DDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPD DFAS GSQF NF LEVPPTL NLDK  IH+WRQ QS+DSC Q+S  F SS+DF
Subjt:  AESDDQYPFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHDF

Query:  DLNPQTSESMSDHHATNESQNIQILID--GWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAK
         LNP TSESM DHHATNESQNIQILID    +EEEP A NHRFSFELSD D L +S  SKPLESNEL V SSPI EPFET KE SP G  TSN  EEK K
Subjt:  DLNPQTSESMSDHHATNESQNIQILID--GWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAK

Query:  AEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN
        A+G+EA++HQEHHS+ LGSVKEFNFDN NGSD   P INSDWW N
Subjt:  AEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN

A0A6J1C828 uncharacterized protein At1g76660-like4.7e-19482.74Show/hide
Query:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP
        MRRR +AD  ADL PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS +   +NTL+SPDIVLPFAAP
Subjt:  MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAP

Query:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK
        PSSPVSFLQSEPPSATQSPTA LSFTSLTANMYSPDGPSSIFA+GPFAHETQLVSPPLNFST+TT+PST PFTPPESI LTTPSSPEVPFAQY QPS QK
Subjt:  PSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQY-QPSLQK

Query:  AESDDQY-PFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHD
         ESD QY  FP DDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPD DF  SGS FSNF +EVPPTLLNLD+  I  WR  QSSDSC QNS G+ SS+D
Subjt:  AESDDQY-PFP-DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHD

Query:  FDLNPQTSESMSDHHATNESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAKA
        F LNPQTSES+SD+HA+NE  NIQIL DG Q +E AAANHRFSFELSDEDALL+S E+KPLESNELAV SSPI EP ETAKETS VGG TSN TEE+ KA
Subjt:  FDLNPQTSESMSDHHATNESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAKA

Query:  EGEEANEHQ--EHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN
        +GEE + HQ  EHHS+TLG+VKEFNFDNGNG D LKPNINS WWAN
Subjt:  EGEEANEHQ--EHHSITLGSVKEFNFDNGNGSDALKPNINSDWWAN

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766602.2e-3148.08Show/hide
Query:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE----PSPSADDAHQ----NTLESPDIVLPFAAPPSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSP
        Q++RWG C  ++ CF S K  KRI  A  +PE     +   + AHQ    N   +  I L   APPSSP SF  S  PS TQSP     + SL AN  SP
Subjt:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE----PSPSADDAHQ----NTLESPDIVLPFAAPPSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFT-PPESILLTTPSSPEVPFAQYQPSLQKAESDDQYPFPDDFQSYQFYPGSPISHLISPRSVI
         GP SS++A GP+AHETQLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A++  S    ++  +  + D   +Y  YPGSP S L SP S  
Subjt:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFT-PPESILLTTPSSPEVPFAQYQPSLQKAESDDQYPFPDDFQSYQFYPGSPISHLISPRSVI

Query:  SRSGASSP
        S  G  SP
Subjt:  SRSGASSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)1.1e-4952.27Show/hide
Query:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEP---SPSADDAHQNTLESPDIVLPFAAPPSSPVSFLQSEP
        NN F TI AAA AIA+ D R  +++ + +KR+W + WS+  CFGS +QRKRIG++VLVPEP   S S      +   S    LPF APPSSP SF QSEP
Subjt:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEP---SPSADDAHQNTLESPDIVLPFAAPPSSPVSFLQSEP

Query:  PSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPP---ESILL--TTPSSPEVPFAQYQPSLQKAESDDQYP
        PSATQSP   LSF+ L  N        SIFAIGP+AHETQLVSPP+ FST TTEPS+ P TPP    SI L  TTPSSPEVPFAQ   S  +  S   Y 
Subjt:  PSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPP---ESILL--TTPSSPEVPFAQYQPSLQKAESDDQYP

Query:  FP----DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLN
        FP     +FQ YQ  PGSP+  LISP      SG +SP PD       S F +F +  PP LL+
Subjt:  FP----DDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLN

AT1G76660.1 FUNCTIONS IN: molecular_function unknown1.6e-3248.08Show/hide
Query:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE----PSPSADDAHQ----NTLESPDIVLPFAAPPSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSP
        Q++RWG C  ++ CF S K  KRI  A  +PE     +   + AHQ    N   +  I L   APPSSP SF  S  PS TQSP     + SL AN  SP
Subjt:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE----PSPSADDAHQ----NTLESPDIVLPFAAPPSSPVSFLQSEPPSATQSPTATLSFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFT-PPESILLTTPSSPEVPFAQYQPSLQKAESDDQYPFPDDFQSYQFYPGSPISHLISPRSVI
         GP SS++A GP+AHETQLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A++  S    ++  +  + D   +Y  YPGSP S L SP S  
Subjt:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFT-PPESILLTTPSSPEVPFAQYQPSLQKAESDDQYPFPDDFQSYQFYPGSPISHLISPRSVI

Query:  SRSGASSP
        S  G  SP
Subjt:  SRSGASSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein2.0e-4337.53Show/hide
Query:  LRPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS---ADDAHQNTLESPDIVLPFAAPPSSPVSFL
        +R VNN +  T+ AAA AI + + R  + ++VQK+R GS WS+YWCFGS K  KRIGHAVLVPEP+ S         ++  S  I +PF APPSSP SFL
Subjt:  LRPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS---ADDAHQNTLESPDIVLPFAAPPSSPVSFL

Query:  QSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQ-YQPSLQKAESDDQYP
         S PPSA+ +P   L   SLT N      P S F IGP+AHETQ V+PP+ FS  TTEPST PFTPP      +PSSPEVPFAQ    SL++A  +    
Subjt:  QSEPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESILLTTPSSPEVPFAQ-YQPSLQKAESDDQYP

Query:  FPD-------DFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHDFDL
                  +F+S Q YPGSP  +LISP      SG SSP P             F +  PP  L  +      W     S S      G +      L
Subjt:  FPD-------DFQSYQFYPGSPISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSWRQGQSSDSCNQNSTGFTSSHDFDL

Query:  NPQTSESMSDHHATNESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSP---VGGQTSNGTEEKA--
         P  S+  S     N ++ +  +  G               ++S+  +L  S       ++E  VV  P R  FE   E            +G+ EKA  
Subjt:  NPQTSESMSDHHATNESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSP---VGGQTSNGTEEKA--

Query:  --------KAEGE-EANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWANFGSVKVRPPGPDHS
                K  GE E+ + Q+  S + GS KEF FD+ N  + +   I S+WWAN    KV   G DHS
Subjt:  --------KAEGE-EANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWANFGSVKVRPPGPDHS

AT5G52430.1 hydroxyproline-rich glycoprotein family protein1.4e-5237.42Show/hide
Query:  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADD--AHQNTLESPDIVLPFAAPPSSPVSFLQSEPP
        VNN+ +T+ AAA AI T + R  + ++ QK RWG CWS+Y CFG+ K  KRIG+AVLVPEP  S       QN+  S  +VLPF APPSSP SFLQS+P 
Subjt:  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADD--AHQNTLESPDIVLPFAAPPSSPVSFLQSEPP

Query:  SATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPP--ESILLTTPSSPEVPFAQ-YQPSLQKAESDDQYPFPD
        S + SP   L   SLT+N +SP  P S+F +GP+A+ETQ V+PP+ FS   TEPST P+TPP   S+ +TTPSSPEVPFAQ    SL+    D       
Subjt:  SATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPP--ESILLTTPSSPEVPFAQ-YQPSLQKAESDDQYPFPD

Query:  -------DFQSYQFYPGSP-ISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSW--RQGQSSDSCNQNSTGFTSSHDFDL
               +F+S Q  PGSP   +LISP SVIS SG SSP P        S    F +  PP  L  +      W  R G  S +   + +G  S      
Subjt:  -------DFQSYQFYPGSP-ISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDK-QIHSW--RQGQSSDSCNQNSTGFTSSHDFDL

Query:  NPQ-TSESMSDHHAT----NESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKA
         P+  S +++ ++ T    N+   +  L +     E   A+HR SFEL+ ED                  ++S +    +       +  + S+ T+ + 
Subjt:  NPQ-TSESMSDHHAT----NESQNIQILIDGWQEEEPAAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKA

Query:  KAEGEEANEHQEHHSI------TLGSVKEFNFDNGNGSDALKPNINSDWWANFGSVK
          E    +   E H I      ++GS KEF FDN    +  K   NS  W+ F  ++
Subjt:  KAEGEEANEHQEHHSI------TLGSVKEFNFDNGNGSDALKPNINSDWWANFGSVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACGACGTGCGAATGCTGATGATGCTGCTGATCTGAGGCCTGTAAATAACACTTTTCAAACCATTACTGCAGCCGCCGATGCGATCGCCACCGTCGATCATCGTTT
TCCTCGGGCTACTGCCGTTCAGAAAAGAAGATGGGGCAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGAAAACGAATTGGGCACGCTGTCCTGGTCC
CAGAACCAAGTCCTTCAGCTGATGATGCTCATCAAAATACATTGGAATCACCAGACATTGTGCTTCCTTTTGCTGCGCCTCCCTCTTCCCCTGTATCCTTCCTTCAATCA
GAGCCACCTTCTGCTACACAATCACCTACAGCTACACTCTCTTTCACTTCTCTGACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCAATTTTTGCCATTGGCCCATT
TGCTCATGAAACACAACTAGTGTCTCCACCTCTGAATTTCTCTACACTCACCACTGAACCATCAACTCCCTTCACTCCTCCTGAGTCTATCCTCTTGACTACACCTTCTT
CCCCTGAAGTTCCTTTTGCTCAGTATCAACCTAGCCTTCAGAAAGCTGAGTCTGATGACCAATATCCATTTCCTGATGACTTTCAATCTTATCAATTCTATCCCGGCAGC
CCAATCAGTCACCTCATATCACCACGCTCAGTCATTTCTCGTTCTGGGGCATCGTCACCTTTGCCAGACTTGGATTTTGCTTCTTCTGGTTCTCAATTTTCTAATTTTTC
GTTAGAAGTTCCACCTACGCTATTGAACCTTGACAAGCAAATTCATAGCTGGCGACAAGGGCAAAGTTCTGATTCTTGCAATCAAAATTCTACAGGATTCACATCGAGTC
ATGATTTTGATTTGAATCCTCAAACTTCAGAATCTATGTCAGATCACCACGCAACAAATGAATCCCAAAACATTCAAATTCTCATTGATGGATGGCAAGAGGAGGAGCCT
GCTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATGCTTTATTAAGAAGCGCAGAAAGTAAGCCACTGGAATCAAATGAACTTGCAGTTGTATCATCTCC
AATACGCGAACCATTTGAAACGGCTAAAGAAACTTCTCCTGTTGGTGGTCAAACCTCAAATGGTACAGAAGAAAAGGCAAAAGCAGAGGGTGAAGAAGCAAATGAGCATC
AAGAACATCATTCCATTACTCTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATGCACTCAAGCCTAATATCAACTCAGACTGGTGGGCCAATTTT
GGGTCTGTGAAGGTACGGCCGCCGGGGCCTGATCATTCTTTCCAGACACAGGGAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGACGACGTGCGAATGCTGATGATGCTGCTGATCTGAGGCCTGTAAATAACACTTTTCAAACCATTACTGCAGCCGCCGATGCGATCGCCACCGTCGATCATCGTTT
TCCTCGGGCTACTGCCGTTCAGAAAAGAAGATGGGGCAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGAAAACGAATTGGGCACGCTGTCCTGGTCC
CAGAACCAAGTCCTTCAGCTGATGATGCTCATCAAAATACATTGGAATCACCAGACATTGTGCTTCCTTTTGCTGCGCCTCCCTCTTCCCCTGTATCCTTCCTTCAATCA
GAGCCACCTTCTGCTACACAATCACCTACAGCTACACTCTCTTTCACTTCTCTGACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCAATTTTTGCCATTGGCCCATT
TGCTCATGAAACACAACTAGTGTCTCCACCTCTGAATTTCTCTACACTCACCACTGAACCATCAACTCCCTTCACTCCTCCTGAGTCTATCCTCTTGACTACACCTTCTT
CCCCTGAAGTTCCTTTTGCTCAGTATCAACCTAGCCTTCAGAAAGCTGAGTCTGATGACCAATATCCATTTCCTGATGACTTTCAATCTTATCAATTCTATCCCGGCAGC
CCAATCAGTCACCTCATATCACCACGCTCAGTCATTTCTCGTTCTGGGGCATCGTCACCTTTGCCAGACTTGGATTTTGCTTCTTCTGGTTCTCAATTTTCTAATTTTTC
GTTAGAAGTTCCACCTACGCTATTGAACCTTGACAAGCAAATTCATAGCTGGCGACAAGGGCAAAGTTCTGATTCTTGCAATCAAAATTCTACAGGATTCACATCGAGTC
ATGATTTTGATTTGAATCCTCAAACTTCAGAATCTATGTCAGATCACCACGCAACAAATGAATCCCAAAACATTCAAATTCTCATTGATGGATGGCAAGAGGAGGAGCCT
GCTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATGCTTTATTAAGAAGCGCAGAAAGTAAGCCACTGGAATCAAATGAACTTGCAGTTGTATCATCTCC
AATACGCGAACCATTTGAAACGGCTAAAGAAACTTCTCCTGTTGGTGGTCAAACCTCAAATGGTACAGAAGAAAAGGCAAAAGCAGAGGGTGAAGAAGCAAATGAGCATC
AAGAACATCATTCCATTACTCTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATGCACTCAAGCCTAATATCAACTCAGACTGGTGGGCCAATTTT
GGGTCTGTGAAGGTACGGCCGCCGGGGCCTGATCATTCTTTCCAGACACAGGGAAGATGA
Protein sequenceShow/hide protein sequence
MRRRANADDAADLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSADDAHQNTLESPDIVLPFAAPPSSPVSFLQS
EPPSATQSPTATLSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPFTPPESILLTTPSSPEVPFAQYQPSLQKAESDDQYPFPDDFQSYQFYPGS
PISHLISPRSVISRSGASSPLPDLDFASSGSQFSNFSLEVPPTLLNLDKQIHSWRQGQSSDSCNQNSTGFTSSHDFDLNPQTSESMSDHHATNESQNIQILIDGWQEEEP
AAANHRFSFELSDEDALLRSAESKPLESNELAVVSSPIREPFETAKETSPVGGQTSNGTEEKAKAEGEEANEHQEHHSITLGSVKEFNFDNGNGSDALKPNINSDWWANF
GSVKVRPPGPDHSFQTQGR