; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0014988 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0014988
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionProline-rich protein 36
Genome locationchr12:23808456..23810865
RNA-Seq ExpressionPI0014988
SyntenyPI0014988
Gene Ontology termsGO:0009737 - response to abscisic acid (biological process)
InterPro domainsIPR037491 - Low-temperature-induced 78kDa/65kDa


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067764.1 proline-rich protein 36 [Cucumis melo var. makuwa]8.4e-24585.05Show/hide
Query:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED
        MSQFHHMQ YGGNA+PTMEQLLR+G+ASRW PT ASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDEN TPSWGYNLDEDE+EE+
Subjt:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED

Query:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENT-----------------RKEVAKPSS
        G+DAEYLGAPMYESELAPEDCKENARQHPRA+PVIAENH LANTIKLAFGHDEKPSNS ET SQM V+SSI N+                  KEVAKPSS
Subjt:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENT-----------------RKEVAKPSS

Query:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE
        P+KTLTETMTEKLAPVYSTVTDATHAIASKIQSLTI++PS+SSTRSSPSTP+KASS TI+ TCSDPIATQ+P L KGTEQIWDKGVSVKEYLMHKFEPGE
Subjt:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE

Query:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQP----L
        DERALSQVLSDALSPRAKPG+GVVEK+REAVNSMLRAGDAPQ KSTHLTAKSSSQVEVAP+PVA  SVA+SSSRAEKAPELVAAKSVR+E+T  P    L
Subjt:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQP----L

Query:  AAKPSSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSL
        AAKPSSSVEKTPQAVAAKSSSH+EVAPQAI A+HLAAKPS      PQAI ATHLAA+P PQA AAPQAI ATHLAAKPSL AEAT HPMVTHLAAKPSL
Subjt:  AAKPSSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSL

Query:  PAEVTPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN
        PAE TP PMVTHLAAKPSLPAEATPQPIVTHLAAKSS+S P+FTTTHRVAEEE+LERILQAN
Subjt:  PAEVTPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN

KGN65746.1 hypothetical protein Csa_020029 [Cucumis sativus]4.8e-24084.77Show/hide
Query:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED
        MSQFHH+QT+GG ASPTMEQLLR+GDASRW PTQASPVL KDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDEN TPSWGYNLDE+E+EE+
Subjt:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED

Query:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTR-----------------KEVAKPSS
        G+DAEYLGAPMYESELAPEDCKENARQHPRA+PVIAE+H LANTIKLAFGH+EKPSNS ET SQM V+SSI N++                 KEVAK SS
Subjt:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTR-----------------KEVAKPSS

Query:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE
        P+KTLTET+TEKLAPVYSTVTDATHAIASKIQSLTI+APSDS      STP+KASSPTIQ   SDPIATQAP L KGTEQ+WDKGVSVKEYLMHKFEPGE
Subjt:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE

Query:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQPLAAKP
        DERALSQVLSDALSPRAKPGVGVVEK+REAVNSMLRAGD PQPKSTHLTAKSSSQVEVAP+PVA  SVAKSSSRAEKAPELVAAKSVR+E+TLQ LAAKP
Subjt:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQPLAAKP

Query:  SSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEV
        SSSV+KT +AVAAKSSSH+EVAPQAI AVHL  K SPQAKAAPQAI AT+LAA+P P+A AAPQAILAT L AKPSLPAEATPHPMVTHLAAKPSLPAE 
Subjt:  SSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEV

Query:  TPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN
        TP P+VTHLAAKPSLPAEATPQPIVTHLAAKSSSS PIFTTTHRVAEEE+LERILQAN
Subjt:  TPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN

XP_016902102.1 PREDICTED: proline-rich protein 36 [Cucumis melo]2.2e-22182.19Show/hide
Query:  EGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEEDGIDAEYLGAP-------------
        +G+ASRW PT ASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDEN TPSWGYNLDEDE+EE+G+DAEYLGAP             
Subjt:  EGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEEDGIDAEYLGAP-------------

Query:  -MYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENT-----------------RKEVAKPSSPSKTLTETM
         +YESELAPEDCKENARQHPRA+PVIAENH LANTIKLAFGHDEKPSNS ET SQM V+SSI N+                  KEVAKPSSP+KTLTETM
Subjt:  -MYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENT-----------------RKEVAKPSSPSKTLTETM

Query:  TEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVL
        TEKLAPVYSTVTDATHAIASKIQSLTI++PS+SSTRSSPSTP+KASS TI+ TCSDPIATQ+P L KGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVL
Subjt:  TEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVL

Query:  SDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQP----LAAKPSSSVE
        SDALSPRAKPG+GVVEK+REAVNSMLRAGDAPQ KSTHLTAKSSSQVEVAP+PVA  SVA+SSSRAEKAPELVAAKSVR+E+T  P    LAAKPSSSVE
Subjt:  SDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQP----LAAKPSSSVE

Query:  KTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEVTPHPM
        KTPQAVAAKSSSH+EVAPQAI A+HLAAKPS      PQAI ATHLAA+P PQA AAPQAI ATHLAAKPSL AEAT HPMVTHLAAKPSLPAE TP PM
Subjt:  KTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEVTPHPM

Query:  VTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHR
        VTHLAAKPSLPAEATPQPIVTHLAAKSS+S P+FTTTHR
Subjt:  VTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHR

XP_031744345.1 mucin-20 [Cucumis sativus]3.3e-23384.56Show/hide
Query:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED
        MSQFHH+QT+GG ASPTMEQLLR+GDASRW PTQASPVL KDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDEN TPSWGYNLDE+E+EE+
Subjt:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED

Query:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTR-----------------KEVAKPSS
        G+DAEYLGAPMYESELAPEDCKENARQHPRA+PVIAE+H LANTIKLAFGH+EKPSNS ET SQM V+SSI N++                 KEVAK SS
Subjt:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTR-----------------KEVAKPSS

Query:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE
        P+KTLTET+TEKLAPVYSTVTDATHAIASKIQSLTI+APSDS      STP+KASSPTIQ   SDPIATQAP L KGTEQ+WDKGVSVKEYLMHKFEPGE
Subjt:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE

Query:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQPLAAKP
        DERALSQVLSDALSPRAKPGVGVVEK+REAVNSMLRAGD PQPKSTHLTAKSSSQVEVAP+PVA  SVAKSSSRAEKAPELVAAKSVR+E+TLQ LAAKP
Subjt:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQPLAAKP

Query:  SSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEV
        SSSV+KT +AVAAKSSSH+EVAPQAI AVHL  K SPQAKAAPQAI AT+LAA+P P+A AAPQAILAT L AKPSLPAEATPHPMVTHLAAKPSLPAE 
Subjt:  SSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEV

Query:  TPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHR
        TP P+VTHLAAKPSLPAEATPQPIVTHLAAKSSSS PIFTTTHR
Subjt:  TPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHR

XP_038894200.1 mediator of DNA damage checkpoint protein 1 [Benincasa hispida]1.1e-20474.42Show/hide
Query:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED
        MSQF+HMQ YGGNASPTMEQLLREGDASRWSPT ASPVLVKDHD E++ +HHQKKSVFTKVKEKAKKLR SLSNKKRHGEDENTTPSWGYNLDEDEDEED
Subjt:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED

Query:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTR-----------------KEVAKPSS
         IDAEYLGAPMYESELAPEDCKENARQHPRA+PVIAENH+LANTIKLA G DEKP NS ETSS M+V SS  N++                  E+AKPSS
Subjt:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTR-----------------KEVAKPSS

Query:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE
        PSKTLTE +TEKLAPVYSTVTDATHAIASKIQSLT++APS+SSTRSSPSTPRKASSPTIQKT SDP ATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE
Subjt:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE

Query:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKS-VRSEATLQP----
        DERALSQVLSDALSPR KPGVGVVEKMREAVNSMLRAG+  QPK+THL+AKSSSQVEVAP+PVA   V KSSSRAEKAPE VAAKS VR+E T QP    
Subjt:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKS-VRSEATLQP----

Query:  LAAKPSSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPS
        LAAKPSSSVEK PQ VAAKSSSH+EVAPQAI AVHLAAKPS QAKAAPQA+                                                 
Subjt:  LAAKPSSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPS

Query:  LPAEVTPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN
                 + TH  AKPSLP EATPQP+V HLA KSS+S PIFTTTHRVAEEE+LERILQAN
Subjt:  LPAEVTPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN

TrEMBL top hitse value%identityAlignment
A0A0A0M0K1 Uncharacterized protein2.3e-24084.77Show/hide
Query:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED
        MSQFHH+QT+GG ASPTMEQLLR+GDASRW PTQASPVL KDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDEN TPSWGYNLDE+E+EE+
Subjt:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED

Query:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTR-----------------KEVAKPSS
        G+DAEYLGAPMYESELAPEDCKENARQHPRA+PVIAE+H LANTIKLAFGH+EKPSNS ET SQM V+SSI N++                 KEVAK SS
Subjt:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTR-----------------KEVAKPSS

Query:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE
        P+KTLTET+TEKLAPVYSTVTDATHAIASKIQSLTI+APSDS      STP+KASSPTIQ   SDPIATQAP L KGTEQ+WDKGVSVKEYLMHKFEPGE
Subjt:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE

Query:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQPLAAKP
        DERALSQVLSDALSPRAKPGVGVVEK+REAVNSMLRAGD PQPKSTHLTAKSSSQVEVAP+PVA  SVAKSSSRAEKAPELVAAKSVR+E+TLQ LAAKP
Subjt:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQPLAAKP

Query:  SSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEV
        SSSV+KT +AVAAKSSSH+EVAPQAI AVHL  K SPQAKAAPQAI AT+LAA+P P+A AAPQAILAT L AKPSLPAEATPHPMVTHLAAKPSLPAE 
Subjt:  SSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEV

Query:  TPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN
        TP P+VTHLAAKPSLPAEATPQPIVTHLAAKSSSS PIFTTTHRVAEEE+LERILQAN
Subjt:  TPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN

A0A1S4E1K0 proline-rich protein 361.1e-22182.19Show/hide
Query:  EGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEEDGIDAEYLGAP-------------
        +G+ASRW PT ASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDEN TPSWGYNLDEDE+EE+G+DAEYLGAP             
Subjt:  EGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEEDGIDAEYLGAP-------------

Query:  -MYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENT-----------------RKEVAKPSSPSKTLTETM
         +YESELAPEDCKENARQHPRA+PVIAENH LANTIKLAFGHDEKPSNS ET SQM V+SSI N+                  KEVAKPSSP+KTLTETM
Subjt:  -MYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENT-----------------RKEVAKPSSPSKTLTETM

Query:  TEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVL
        TEKLAPVYSTVTDATHAIASKIQSLTI++PS+SSTRSSPSTP+KASS TI+ TCSDPIATQ+P L KGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVL
Subjt:  TEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVL

Query:  SDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQP----LAAKPSSSVE
        SDALSPRAKPG+GVVEK+REAVNSMLRAGDAPQ KSTHLTAKSSSQVEVAP+PVA  SVA+SSSRAEKAPELVAAKSVR+E+T  P    LAAKPSSSVE
Subjt:  SDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQP----LAAKPSSSVE

Query:  KTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEVTPHPM
        KTPQAVAAKSSSH+EVAPQAI A+HLAAKPS      PQAI ATHLAA+P PQA AAPQAI ATHLAAKPSL AEAT HPMVTHLAAKPSLPAE TP PM
Subjt:  KTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEVTPHPM

Query:  VTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHR
        VTHLAAKPSLPAEATPQPIVTHLAAKSS+S P+FTTTHR
Subjt:  VTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHR

A0A5A7VK47 Proline-rich protein 364.1e-24585.05Show/hide
Query:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED
        MSQFHHMQ YGGNA+PTMEQLLR+G+ASRW PT ASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDEN TPSWGYNLDEDE+EE+
Subjt:  MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEED

Query:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENT-----------------RKEVAKPSS
        G+DAEYLGAPMYESELAPEDCKENARQHPRA+PVIAENH LANTIKLAFGHDEKPSNS ET SQM V+SSI N+                  KEVAKPSS
Subjt:  GIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENT-----------------RKEVAKPSS

Query:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE
        P+KTLTETMTEKLAPVYSTVTDATHAIASKIQSLTI++PS+SSTRSSPSTP+KASS TI+ TCSDPIATQ+P L KGTEQIWDKGVSVKEYLMHKFEPGE
Subjt:  PSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGE

Query:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQP----L
        DERALSQVLSDALSPRAKPG+GVVEK+REAVNSMLRAGDAPQ KSTHLTAKSSSQVEVAP+PVA  SVA+SSSRAEKAPELVAAKSVR+E+T  P    L
Subjt:  DERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQP----L

Query:  AAKPSSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSL
        AAKPSSSVEKTPQAVAAKSSSH+EVAPQAI A+HLAAKPS      PQAI ATHLAA+P PQA AAPQAI ATHLAAKPSL AEAT HPMVTHLAAKPSL
Subjt:  AAKPSSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSL

Query:  PAEVTPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN
        PAE TP PMVTHLAAKPSLPAEATPQPIVTHLAAKSS+S P+FTTTHRVAEEE+LERILQAN
Subjt:  PAEVTPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN

A0A6J1CJ52 microtubule-associated protein 1A1.2e-16763.95Show/hide
Query:  MSQFHHMQTYGGNASPTMEQLLRE--GDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDE
        MSQF+HMQTYGGNASP +EQL R+   DASRW P+ +SP L +DHD E+DIDHHQKKSVFTKVKEKAKKLR +LSNKK+HGEDEN TPSWGYNLDEDE+E
Subjt:  MSQFHHMQTYGGNASPTMEQLLRE--GDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDE

Query:  EDGIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTRK-------------------EVA
        +DG DAEYLGAPMYESELAPEDCKENARQHPRA+PVIAENH+LANTIKLAFG DEKP NS ETSS+ V   S+EN  +                   E A
Subjt:  EDGIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTRK-------------------EVA

Query:  KPSSPSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKF
        KP SP KTLTE +TEKLAPVYSTVTDATHAIASKI SLT++APS  S RSSP+TP KASSP   +  S   A+QA KLGKGTEQIWDKGVSVKEYLMHKF
Subjt:  KPSSPSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKF

Query:  EPGEDERALSQVLSDALSPRAKP-GVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVE-VAPRPVATRSVAKSSSRAEKAPELVAA-----KSVRS
        EPGEDERALSQVLS+ALSPR  P  VGVVEKM+EAVNSMLRA D P+P++ HL  KSSS+ E  AP  VA     KS S+ +KAPE VA       S ++
Subjt:  EPGEDERALSQVLSDALSPRAKP-GVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVE-VAPRPVATRSVAKSSSRAEKAPELVAA-----KSVRS

Query:  EATLQP----LAAKPSSSVEKTPQAV----AAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEA
        +   QP    LAAKPSS  EK P+ V    A KSSS +E APQA+   HLA+K S Q + APQA+  THLAA+   Q   APQA+LA H+ AKP+  AE 
Subjt:  EATLQP----LAAKPSSSVEKTPQAV----AAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLPQAMAAPQAILATHLAAKPSLPAEA

Query:  TPHPMVTHLAAKPSLPAEVTPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN
         P  ++T                  T+LAAKPS   EA PQP+  HL AKSSS+ P FTT HRVAEEE+LERILQ N
Subjt:  TPHPMVTHLAAKPSLPAEVTPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN

A0A6J1GBR4 uncharacterized protein LOC1114525233.2e-11769.65Show/hide
Query:  EGD-ASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDE-DEDEEDGIDAEYLGAPMYESELAPEDC
        EGD +S+WSP  + PV  KD D E+DIDHHQKKSVF KVKEKAKKLR +LS+KK+HGEDEN TPSWGYNLDE +E+EED +DAEYLGAPMYESELAPE C
Subjt:  EGD-ASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDE-DEDEEDGIDAEYLGAPMYESELAPEDC

Query:  KENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTR-----------------KEVAKPSSPSKTLTETMTEKLAPVYSTVT
        KENARQHPRANPVIAENH+LAN I LA G DEKP  S E SS+M V+SS+ N +                  E AKPSSPSKTLTE +TEKLAPVYSTVT
Subjt:  KENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTR-----------------KEVAKPSSPSKTLTETMTEKLAPVYSTVT

Query:  DATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVLSDALSPRAKPG-
        DATHA+ASKIQSLT +APS+ S  SSP+TP+ A SP  QK  S P+  QA +LGKGTEQIWDKG SVKEYLM KFEPGEDERALS+VL DALSP   PG 
Subjt:  DATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVLSDALSPRAKPG-

Query:  VGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATR
        VGVVEKMREAV+SML+A +APQP +THL AKS  ++E AP+PVA++
Subjt:  VGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVATR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G46750.1 unknown protein9.8e-4239.02Show/hide
Query:  HMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDH-HQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPS------------WGY--
        H+   G  +  T+ Q  + G+AS+ SP+  SP        E+D +H H KKS+F+K+K+KAKKL++SLS KKRH E+ + T S             GY  
Subjt:  HMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDH-HQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPS------------WGY--

Query:  ----------NLDEDEDEEDGIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILA---------------NTIK-----------LAFGHDEK
                   + E+  EE+  D EYLGAPMYES+ APE+ KE ARQHPR NPVI E ++L+               N  K           L+    EK
Subjt:  ----------NLDEDEDEEDGIDAEYLGAPMYESELAPEDCKENARQHPRANPVIAENHILA---------------NTIK-----------LAFGHDEK

Query:  PSNSLETSSQMVVDSSIENTRKEVA---KPSSPSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIA
        P++S  T+   +V  S E TRKE     +P SPSKT+TET+TEKLAP Y+ V+DAT AI  KIQ +    P++     +       ++ T Q T      
Subjt:  PSNSLETSSQMVVDSSIENTRKEVA---KPSSPSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTITAPSDSSTRSSPSTPRKASSPTIQKTCSDPIA

Query:  TQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVLSDALSPR----AKPGVGVVEKMREAVNS
              G  T ++WDKGVS+KEY+  KFEP ED+R LS+V+S A+SPR         G    M  A NS
Subjt:  TQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVLSDALSPR----AKPGVGVVEKMREAVNS

AT4G25580.1 CAP160 protein4.8e-0428Show/hide
Query:  DSSIENTRKEVAKPSSPSKTLTETMTEKLAPVYSTVTD----ATHAIASKI--------QSLTITAPSDSSTRSSPSTPRKASSPTIQKT--CSDPIATQ
        DS  +   +E+ +P+  S       T+K++   S V D    A +A+ASK+        ++    A + SS     ST     +P  +K       + T+
Subjt:  DSSIENTRKEVAKPSSPSKTLTETMTEKLAPVYSTVTD----ATHAIASKI--------QSLTITAPSDSSTRSSPSTPRKASSPTIQKT--CSDPIATQ

Query:  APKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVLSDAL------SPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVA
         P  G GTEQ  D+GVS KE+L  K  PGE+++ALS+V+++ L         A P  G+V +  E      R G    P S   T +  +  E     +A
Subjt:  APKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVLSDAL------SPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTHLTAKSSSQVEVAPRPVA

Query:  TRSVAKSSSRAEKAPELVAAKSVRS
         +     +S      E V  KS  S
Subjt:  TRSVAKSSSRAEKAPELVAAKSVRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAATTCCATCACATGCAAACATATGGTGGCAATGCTAGCCCAACCATGGAACAACTTCTCCGAGAAGGGGATGCTTCAAGATGGTCCCCAACACAGGCATCTCC
AGTGCTCGTAAAGGACCACGACCTCGAAGATGATATCGATCACCATCAGAAGAAATCAGTTTTCACCAAGGTAAAGGAAAAAGCAAAGAAACTGAGAAACTCGCTCAGCA
ACAAGAAGAGGCACGGTGAAGACGAGAACACTACGCCGTCGTGGGGTTATAACTTGGATGAGGATGAGGATGAGGAAGACGGTATCGATGCTGAATATCTTGGAGCCCCA
ATGTATGAATCAGAGCTCGCACCAGAGGACTGCAAAGAGAATGCTAGGCAGCATCCTCGAGCAAATCCTGTGATTGCTGAGAATCATATTTTAGCGAATACCATAAAGCT
TGCCTTTGGACATGACGAAAAACCTTCCAACTCTCTAGAGACATCGTCTCAAATGGTCGTTGATTCGTCCATTGAAAATACAAGAAAGGAGGTTGCTAAGCCTTCTAGTC
CAAGCAAGACACTAACTGAAACTATGACTGAGAAGTTGGCACCAGTTTATTCCACTGTTACTGATGCTACTCATGCTATTGCTTCAAAGATCCAGAGCCTCACCATCACT
GCTCCATCCGATTCATCAACTCGTTCTTCACCATCAACTCCAAGAAAAGCTTCTTCCCCAACAATCCAAAAGACATGCTCTGACCCAATAGCTACACAAGCACCAAAGCT
TGGTAAAGGAACTGAGCAGATATGGGATAAAGGAGTTTCAGTAAAGGAGTACTTGATGCACAAGTTTGAGCCAGGTGAAGATGAGAGAGCACTTTCTCAGGTGTTGTCCG
ATGCACTGAGTCCGAGAGCAAAGCCTGGTGTAGGTGTAGTAGAGAAGATGAGAGAAGCAGTAAATTCTATGTTGCGAGCTGGGGACGCACCACAGCCAAAGTCTACGCAT
TTGACTGCAAAATCTTCGTCGCAGGTTGAAGTGGCACCACGACCAGTGGCTACACGCTCGGTTGCGAAATCTTCATCAAGGGCTGAGAAGGCACCAGAGCTAGTGGCTGC
AAAATCAGTGCGATCTGAAGCGACACTGCAACCATTGGCTGCAAAACCTTCATCCTCAGTGGAGAAGACACCACAAGCGGTGGCTGCAAAATCTTCATCACATTCCGAGG
TGGCACCACAAGCAATTCAGGCAGTACATTTGGCTGCAAAACCTTCACCACAGGCCAAGGCAGCACCACAGGCAATACAGGCTACACATTTGGCTGCACAACCTTTGCCA
CAGGCCATGGCAGCACCACAGGCAATACTGGCTACACATTTGGCTGCAAAACCTTCCTTACCAGCGGAGGCAACACCACATCCAATGGTTACACATTTGGCTGCAAAACC
TTCCTTACCGGCAGAGGTAACACCACATCCAATGGTTACACATTTGGCTGCAAAACCTTCCTTACCAGCGGAGGCAACACCACAACCAATAGTTACACATTTAGCTGCAA
AATCTTCATCGAGCGTGCCAATCTTTACCACCACCCATAGAGTGGCTGAGGAAGAAAGCCTTGAAAGAATACTCCAAGCAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCAATTCCATCACATGCAAACATATGGTGGCAATGCTAGCCCAACCATGGAACAACTTCTCCGAGAAGGGGATGCTTCAAGATGGTCCCCAACACAGGCATCTCC
AGTGCTCGTAAAGGACCACGACCTCGAAGATGATATCGATCACCATCAGAAGAAATCAGTTTTCACCAAGGTAAAGGAAAAAGCAAAGAAACTGAGAAACTCGCTCAGCA
ACAAGAAGAGGCACGGTGAAGACGAGAACACTACGCCGTCGTGGGGTTATAACTTGGATGAGGATGAGGATGAGGAAGACGGTATCGATGCTGAATATCTTGGAGCCCCA
ATGTATGAATCAGAGCTCGCACCAGAGGACTGCAAAGAGAATGCTAGGCAGCATCCTCGAGCAAATCCTGTGATTGCTGAGAATCATATTTTAGCGAATACCATAAAGCT
TGCCTTTGGACATGACGAAAAACCTTCCAACTCTCTAGAGACATCGTCTCAAATGGTCGTTGATTCGTCCATTGAAAATACAAGAAAGGAGGTTGCTAAGCCTTCTAGTC
CAAGCAAGACACTAACTGAAACTATGACTGAGAAGTTGGCACCAGTTTATTCCACTGTTACTGATGCTACTCATGCTATTGCTTCAAAGATCCAGAGCCTCACCATCACT
GCTCCATCCGATTCATCAACTCGTTCTTCACCATCAACTCCAAGAAAAGCTTCTTCCCCAACAATCCAAAAGACATGCTCTGACCCAATAGCTACACAAGCACCAAAGCT
TGGTAAAGGAACTGAGCAGATATGGGATAAAGGAGTTTCAGTAAAGGAGTACTTGATGCACAAGTTTGAGCCAGGTGAAGATGAGAGAGCACTTTCTCAGGTGTTGTCCG
ATGCACTGAGTCCGAGAGCAAAGCCTGGTGTAGGTGTAGTAGAGAAGATGAGAGAAGCAGTAAATTCTATGTTGCGAGCTGGGGACGCACCACAGCCAAAGTCTACGCAT
TTGACTGCAAAATCTTCGTCGCAGGTTGAAGTGGCACCACGACCAGTGGCTACACGCTCGGTTGCGAAATCTTCATCAAGGGCTGAGAAGGCACCAGAGCTAGTGGCTGC
AAAATCAGTGCGATCTGAAGCGACACTGCAACCATTGGCTGCAAAACCTTCATCCTCAGTGGAGAAGACACCACAAGCGGTGGCTGCAAAATCTTCATCACATTCCGAGG
TGGCACCACAAGCAATTCAGGCAGTACATTTGGCTGCAAAACCTTCACCACAGGCCAAGGCAGCACCACAGGCAATACAGGCTACACATTTGGCTGCACAACCTTTGCCA
CAGGCCATGGCAGCACCACAGGCAATACTGGCTACACATTTGGCTGCAAAACCTTCCTTACCAGCGGAGGCAACACCACATCCAATGGTTACACATTTGGCTGCAAAACC
TTCCTTACCGGCAGAGGTAACACCACATCCAATGGTTACACATTTGGCTGCAAAACCTTCCTTACCAGCGGAGGCAACACCACAACCAATAGTTACACATTTAGCTGCAA
AATCTTCATCGAGCGTGCCAATCTTTACCACCACCCATAGAGTGGCTGAGGAAGAAAGCCTTGAAAGAATACTCCAAGCAAATTAA
Protein sequenceShow/hide protein sequence
MSQFHHMQTYGGNASPTMEQLLREGDASRWSPTQASPVLVKDHDLEDDIDHHQKKSVFTKVKEKAKKLRNSLSNKKRHGEDENTTPSWGYNLDEDEDEEDGIDAEYLGAP
MYESELAPEDCKENARQHPRANPVIAENHILANTIKLAFGHDEKPSNSLETSSQMVVDSSIENTRKEVAKPSSPSKTLTETMTEKLAPVYSTVTDATHAIASKIQSLTIT
APSDSSTRSSPSTPRKASSPTIQKTCSDPIATQAPKLGKGTEQIWDKGVSVKEYLMHKFEPGEDERALSQVLSDALSPRAKPGVGVVEKMREAVNSMLRAGDAPQPKSTH
LTAKSSSQVEVAPRPVATRSVAKSSSRAEKAPELVAAKSVRSEATLQPLAAKPSSSVEKTPQAVAAKSSSHSEVAPQAIQAVHLAAKPSPQAKAAPQAIQATHLAAQPLP
QAMAAPQAILATHLAAKPSLPAEATPHPMVTHLAAKPSLPAEVTPHPMVTHLAAKPSLPAEATPQPIVTHLAAKSSSSVPIFTTTHRVAEEESLERILQAN