; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G018390 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G018390
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionproline-, glutamic acid- and leucine-rich protein 1-like
Genome locationCG_Chr05:30648591..30656154
RNA-Seq ExpressionClCG05G018390
SyntenyClCG05G018390
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR012583 - Pre-rRNA-processing protein RIX1, N-terminal
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601219.1 hypothetical protein SDJN03_06452, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0079.7Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA
        MAAFNLVANMYD ALKPRL+HKLLREHVPDDKR FNDHSELSKVVS+IKIHNLLSES   MDQ LIDSWKSAVDSWVNRLF+LLSNDM      PDKCWA
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA

Query:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA
        GI+LLGVTCQQC+SSRFLASYTEWLH+LLPH+QTDS FLKVASCASISDLFLRLGR QSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLD AVNLLC LIA
Subjt:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA

Query:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN
        FFPFTI RHYDSAEAAIVSKI+SGKC SNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLN+AFQG GEDSKG+E +RLLIPPGK+PPPPLGCN
Subjt:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN

Query:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL
        S+SE S DKIT+SSER LT  ISTLM CCSTMITSSYNHQVAVPIRPLLA+V+RVLTVDGSLPPTSVPFMTSLQQES+  ELPALHSDSLDLLIAIVK L
Subjt:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL

Query:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
        R                                               SQLLPHAASIVRL+VKYFKKCVSAELRVKVYAVAK LMMSLGVGMAASLARD
Subjt:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD

Query:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV
        VIDNALVDLNPVDNES +PSSVNPK+ Q ELLQH+KKRKRPSVPTSMKGQHERH  GD    SS MSTSV+LRIAALEALETLLTL GALR+EE WRAKV
Subjt:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV

Query:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD
        EHLLITAATSSFEWP+ASDDIFF+ANE IEVW DYQLA FRAL  S LS+VHVRPLALAQGLELF +GKQENG+KLAEFCA ALL MEVLIHPRVLPLSD
Subjt:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD

Query:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDA-GNALNNDEMTYNTSNDLE-EASANGLVSIET
        FLP  LSS EP ATYKFQED YFGS+ SSKLLK+DTQ  EQ+  ++DD+F YDR  A++IEEAPIRDA GN +N+ EMTYN SNDLE E  ANGLVSIET
Subjt:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDA-GNALNNDEMTYNTSNDLE-EASANGLVSIET

Query:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE
        PK TEQA  A ITEVGVVEK DVFA       P+SSKSNKT +DF  D G  LL EDDFPDIIDADPDTDYE
Subjt:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE

KAG7032014.1 hypothetical protein SDJN02_06056, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0077.75Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA
        MAAFNLVANMYD ALKPRL+HKLLREHVPDDKR FNDHSELSKVVS+IKIHNLLSES   MDQ LIDSWKSAVDSWVNRLF+LLSNDM      PDKCWA
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA

Query:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA
        GI+LLGVTCQQC+SSRFLASYTEWLH+LLPH+QTDS FLKVASCASISDLFLRLGR QSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLD AVNLLC LIA
Subjt:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA

Query:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN
        FFPFTI RHYDSAEAAIVSKI+SGKC SNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLN+AFQG GEDSKG+E +RLLIPPGK+PPPPLGCN
Subjt:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN

Query:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL
        S+SE S DKIT+SSER LT  ISTLM CCSTMITSSYNHQVAVPIRPLLA+V+RVLTVDGSLPPTSVPFMTSLQQES+                      
Subjt:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL

Query:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
                                                         QLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAK LMMSLGVGMAASLARD
Subjt:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD

Query:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV
        VIDNALVDLNPVDNES +PSSVNPK+ QRELLQH+KKRKRPSVPTSMKGQHERH  GD    SS MSTSVHLRIAALEALETLLTL GALR+EEGWRAKV
Subjt:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV

Query:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD
        EHLLITAATSSFEWP+ASDDIFF+ANE IEVW DYQLA FRAL  S LS+VHVRPLALAQGLELF +GKQENG+KLAEFCA ALL MEVLIHPRVLPLSD
Subjt:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD

Query:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDA-GNALNNDEMTYNTSNDLE-EASANGLVSIET
        FLP  LSS EP ATYKFQED YFGS+ SSKLLK+DTQ  EQ+  ++DD+F YDR  A++IEEAPIRDA GN +N+ EMTYN SNDLE E  ANGLVSIET
Subjt:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDA-GNALNNDEMTYNTSNDLE-EASANGLVSIET

Query:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE
        PK TEQA  A ITEVGVVEK DVFA       P+SSKSNKT +DF  D G  LL EDDFPDIIDADPDTDYE
Subjt:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE

XP_022956971.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita moschata]0.0e+0080.05Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA
        MAAFNLVANMYD ALKPRL+HKLLREHVPDDKR FNDHSELSKVVS+IKIHNLLSES   MDQ LIDSWKSAVDSWVNRLF+LLSNDM      PDKCWA
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA

Query:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA
        GIILLGVTCQQC+SSRFLASYTEWLH+LLPH+QTDS FLKVASCASISDLFLRLGR QSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLD AVNLLC LIA
Subjt:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA

Query:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN
        FFPFTI RHYDSAEAAIVSKI+SGKC SNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLN+AFQG GEDSKG+E +RLLIPPGK+PPPPLGCN
Subjt:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN

Query:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL
        S+SE S DKIT+SSER LT  ISTLM CCSTMITSSYNHQVAVPIRPLLA+V+RVLTVDGSLPPTSVPFMTSLQQES+  ELPALHSDSLDLLIAIVK L
Subjt:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL

Query:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
        R                                               SQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAK LMMSLGVGMAASLARD
Subjt:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD

Query:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV
        VIDNALVDLNPVDNES +PSSVNPK+ QRELLQH+KKRKRPSVPTSMKGQHERH  GD    SS MSTSVHLRIAALEALETLLTL GALR+EEGWRAKV
Subjt:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV

Query:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD
        EHLLITAATSSFEWP+ASDDIFF+ANE IEVW DYQLA FRAL  S LS+VHVRPLALAQGLELF +GKQENG+KLAEFCA ALL MEVLIHPRVLPLSD
Subjt:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD

Query:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDA-GNALNNDEMTYNTSNDLE-EASANGLVSIET
        FLP  LSS EP ATYKFQED YFGS+ SSKLLK+DTQ  EQ+  ++DD+F YDR  A++IEEAPIRDA GN +N+ EMTYN SNDLE E  ANGLVSIET
Subjt:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDA-GNALNNDEMTYNTSNDLE-EASANGLVSIET

Query:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE
        PK TEQA  A +TEVGVVEK DVFA       P+SSKS+KT +DF  D G  LL EDDFPDIIDADPDTDYE
Subjt:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE

XP_023517133.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0080.05Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA
        MAAFNLV NMYD ALKPRL+HKLLREHVPDDKR FNDHSELSKVVS+IKIHNLLSES   MDQ LIDSWKSAVDSWVNRLF+LLSNDM      PDKCWA
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA

Query:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA
        GI+LLGVTCQQC+SSRFLASYTEWLH+LLPH+QTDS FLKVASCASISDLFLRLGR QSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLD AVNLLC LIA
Subjt:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA

Query:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN
        FFPFTI RHY SAEAAIVSKI+SGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLN+AFQG GEDSKG+E +RLLIPPGK+PPPPLGCN
Subjt:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN

Query:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL
        S+SE S DKIT+SSER LT  ISTLM CCSTMITSSYNHQVAVPIRPLLA+VERVLTVDGSLPPTSVPFMTSLQQES+  ELPALHSDSLDLLIAIVK L
Subjt:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL

Query:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
        R                                               SQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAK LMMSLGVGMAASLARD
Subjt:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD

Query:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV
        VIDNALVDLNPVDN+S +PSSVNPK+ Q ELLQH+KKRKRPSVPTSMKGQHERH  GD    SS MSTSVHLRIAALEALETLLTL GALR+EEGWRAKV
Subjt:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV

Query:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD
        EHLLITAATSSFEWP+ASDDIFF+ANESIEVW DYQLA FRAL  S LSAVH+RPLALAQGLELF +GKQENG+KLAEFCA ALL MEVLIHPRVLPLSD
Subjt:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD

Query:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDA-GNALNNDEMTYNTSNDLE-EASANGLVSIET
        FLP  LSS EP ATYKFQED YFGS+ SSKLLKVDTQ  EQ+  ++DD+F YDR  A++IEEAPIRDA GN +N+ EMTYN SNDLE E  ANGLVSIET
Subjt:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDA-GNALNNDEMTYNTSNDLE-EASANGLVSIET

Query:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE
        PK TEQA  A ITEVGVVEK DVFA       P+SSKS+KT +DF  D G  LL EDDFPDIIDADPDTDYE
Subjt:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE

XP_038892364.1 proline-, glutamic acid- and leucine-rich protein 1 isoform X1 [Benincasa hispida]0.0e+0083.94Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA
        MAAFNL+ANMYD ALKPRLLHKLLREHVPD KR FNDHSELS+VVSVIK HNLLSESSS MDQ LIDSWKSAVDSWVNRLF+LLSNDM      PDKCWA
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA

Query:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA
        GIILLGVTCQQC+SSRFLASYTEWLHKLLPH+QTDS FLKVASCASISDLFLRLGR QS KKDGTSCAGKVIQPV+KLLHDD+TEAVLDT+VNLLCNLIA
Subjt:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA

Query:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN
        FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLA LPKSKGDEDSWSLLMQKILLSID HLN+AFQG GEDSK NE  RLL+PPGKDPPP LGCN
Subjt:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN

Query:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL
        S+SEGSLDK+TKSSERTLTS ISTLM+CCSTMIT SYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQES+  ELPALHSDSLDLLIAIVKSL
Subjt:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL

Query:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
        R                                               SQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
Subjt:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD

Query:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV
        VIDNALVDLNPVDNESS+PSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDIT+SS MST+VHLRIAALEALETLLTL GALRSEEGWRAKV
Subjt:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV

Query:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD
        EHLLITAATSS EWPRASDD+FFQAN SIEVWVDYQLA FRAL  S LSAVHVRPLALAQGLELF +GKQENGTKLAEFCA ALL MEVLIHPRVLPLSD
Subjt:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD

Query:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDLE-EASANGLVSIETP
        FLP  LSS EP A YKFQED YFGS+ SSKLLKVD Q  EQ+A  + DDF YDR VADDIEEAPIRDAGN L+NDEMTYNTSND+E E SANGL +IETP
Subjt:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDLE-EASANGLVSIETP

Query:  KRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYEE
        KRTEQATAA I+EVGVVE+DDVF +ASMNS P+SSKS+K  EDFKRDPG NLL EDDFPDIIDADPDTDYEE
Subjt:  KRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYEE

TrEMBL top hitse value%identityAlignment
A0A5D3CDD1 Proline-, glutamic acid-and leucine-rich protein 10.0e+0077.55Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA
        MAAFNLVA+MYD ALKPRLLHKLLREHVPDDKR F+D+SELS VVS++  H+LLSESSS  DQ LIDSWKSAVDSWVNRLF+LLSNDMHPIFLKPDKCWA
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA

Query:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA
        GIILLGVTCQ+C+SSRFLASYTEWLHKLLPHMQTDS FLKVASCASI DLF RLGR QSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLD AVNLLC LI 
Subjt:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA

Query:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN
        FFPFTI RHYDSAEAAIVSKIFSGKCSSNMLKKLA CLASLPKSKGDEDSWSLL+QKILLSI+S LN+ FQG GEDSKG+EFVRLLI PGKDPPPPLGC 
Subjt:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN

Query:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL
        S SEGSLDKI KSSER L S ISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLT+DGSLPPTSVPFMTSLQQES+ LELP LHS SLDLL+AIVKSL
Subjt:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL

Query:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
        R                                               SQLLPHAASIVRL VKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASL+RD
Subjt:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD

Query:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV
        VIDN LVDLNPV+NES   S+VNPKDTQR+  QHH KRKRPSVPTSMKGQHER+EP +DIT SS   TSVHLRIAALEAL+TLLT  GALRSEEGWRAK+
Subjt:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV

Query:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD
        EHLLIT ATSS EWPRASDD FFQANESI VWVDYQLA F AL  S LSAVHVRPLALAQGLELF +GKQENGTKL EFCA ALL MEVLIHPRVLPLSD
Subjt:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD

Query:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDLE-EASANGLVSIETP
        FLP  LSS EP A +KFQED YF S  S KLLKV TQ  EQ A+                 EA IRD  + L+N+EMTY+ SND+E E SAN L +IE P
Subjt:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDLE-EASANGLVSIETP

Query:  KRTEQATAAVITEVGVVEKDD-VFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYEE
        KRTEQ TAA I+E GVV +DD VFA+ASMNS PISSKS K  EDF RD   NLL EDDFPDIIDADPDTDYEE
Subjt:  KRTEQATAAVITEVGVVEKDD-VFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYEE

A0A6J1DBX6 proline-, glutamic acid- and leucine-rich protein 1 isoform X10.0e+0075.72Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA
        MAAFNLVANMYD ALKPRLLHKLLREHVPDDKR F+DHSELS  VS+IKIHNLLSESSS  DQ LIDSWKSAVDSWV+RLF+LLSNDM      PDKCWA
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA

Query:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA
        GIILLGVTCQQC+SSRFLASYTEWL KLLPH+QTDS FLKVA+CAS+SDLF RL R Q+VKKDGTSCAGK+IQPV+KLLHDDN+EAV + AVNLL  LIA
Subjt:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA

Query:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN
        FFPFT+ RHYDSAEAAIVSKIFSGKCS NMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSID+HLN+AFQG GEDS+G+E VRLLIPPGKDPPPPLGCN
Subjt:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN

Query:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL
        S+  GS DKITKSSER LTS ISTLM CCSTMITSSY HQVAVPIRPLLALVERVL VDGSLPPTSVPFMTSLQQES+  ELP LHS+ LDLLIAI+KSL
Subjt:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL

Query:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
        R                                               SQLLP+AASIVRLIVKYFKKCVSAELRVKVYAVAK LMMSLGVGMAASLARD
Subjt:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD

Query:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV
        V++NAL+DLNPVDNE+  PSSVN KDTQRE +QHHKKRKRPSVPTS++ Q ERH  GD   ++  MST V LRIAALEALETLLTL GALRSEEGWR K+
Subjt:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV

Query:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD
        E LL TAATSSF+WPRASD+  FQ +ESIEVW DYQLA FR L  S LSAVHVRPLALAQGLELF RGKQE+GTKLAEFCA ALL MEVLIHPRVLPLSD
Subjt:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD

Query:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ---EQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSND-LEEASANGLVSIET
        FLP  LSS+E  +TYKF+E+ +F  L SSK+LK+DT    EQ+A D+DDDFL++ EVADDIEEAPIR+AGN +N+ E TYNTSND  +EAS  G  S ET
Subjt:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ---EQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSND-LEEASANGLVSIET

Query:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYEE
        PKR+EQ TAA IT+VGVVEKDD F +AS+N  P+S KS+KT +DF+RD G NLL EDDFPDIIDADPDTDYEE
Subjt:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYEE

A0A6J1FZZ0 proline-, glutamic acid- and leucine-rich protein 1-like0.0e+0076.43Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA
        MAAFNLVANMYD ALKPRLLHKLLREHVPDDK+ FNDHSELSKVVS++KIHNLLSESSS MDQ L+DSWKSAVDSWVNRL +LLSNDM      PDKCWA
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA

Query:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA
        GIILLG TCQQC+SSRFLASY +WLHKLLPH+QTDS FLKVA+CASISDLFLRLGR  +VKKDGTSCAGKVIQPVIKLLHDDNTEAVLD AVNLLC LIA
Subjt:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA

Query:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN
        FFPFTI RHYDSAEAAIVSKIFSG CS NMLKKLAHCLASLPKSKGDEDSW++LMQKILLSID HLN+AFQG GEDS+GNE VRLLIPPGK+PPPPLGCN
Subjt:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN

Query:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL
        S +EGS DK+TKSSER LTSIISTLM CCSTMITSSY HQVAVPIRPLLALVER+LTVDGSLPP SVPFMTSLQQES+  ELP LHSDSLDLLIAI+KSL
Subjt:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL

Query:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
        R                                               SQLLPHAA IVRLIVKYFKKCVSAELRVKVYAVAK LMMSLGVGMAASLARD
Subjt:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD

Query:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV
        VIDN LVDLNPVDNES  PSSVNPKD QREL QHHKKRKRP VPTS K QHE H  G     SS  STSV LRIAALEALETLLTL GALR+EEGW AKV
Subjt:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV

Query:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD
        EHLLITAA SSFEWP ASDD+FFQ NESIEVW DYQLA FRAL  S LSAVH+RPLALAQGL+LF RGKQE GTKL EFCA ALL +EVLIHPRVLPLSD
Subjt:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD

Query:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKV-DT--QEQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDLEEA-SANGLVSIET
        F P  LSS EP ATYK  ED Y G + S K LK+ DT   +Q+A D+DDDFLYDREVADDIEEAPIRDAGN +NN+  TYNTSN+LE   SA+ L + ET
Subjt:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKV-DT--QEQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDLEEA-SANGLVSIET

Query:  PKRTEQA-TAAVITE-VGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE
        PKRT+Q  TAA IT+  G+VEKDDVFA+A MNS P+S KS+            NLL EDDFPDIIDADPDTD E
Subjt:  PKRTEQA-TAAVITE-VGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE

A0A6J1GXZ0 proline-, glutamic acid- and leucine-rich protein 1-like isoform X20.0e+0077.82Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA
        MAAFNLVANMYD ALKPRL+HKLLREHVPDDKR FNDHSELSKVVS+IKIHNLLSES   MDQ LIDSWKSAVDSWVNRLF+LLSNDM      PDKCWA
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA

Query:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA
        GIILLGVTCQQC+SSRFLASYTEWLH+LLPH+QTDS FLKVASCASISDLFLRLGR QSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLD AVNLLC LIA
Subjt:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA

Query:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN
        FFPFTI RHYDSAEAAIVSKI+SGKC SNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLN+AFQG GEDSKG+E +RLLIPPGK+PPPPLGCN
Subjt:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN

Query:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL
        S+SE S DKIT+SSER LT  ISTLM CCSTMITSSYNHQVAVPIRPLLA+V+RVLTVDGSLPPTSVPFMTSLQQES+  ELPALHSDSLDLLIAIVK L
Subjt:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL

Query:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
        R                                               SQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAK LMMSLGVGMAASLARD
Subjt:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD

Query:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV
        VIDNALVDLNPVDNES +PSSVNPK+ QRELLQH+KKRKRPSVPTSMKGQHERH  GD    SS MSTSVHLRIAALEALETLLTL GALR+EEGWRAKV
Subjt:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV

Query:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD
        EHLLITAATSSFEWP+ASDDIFF+ANE IEVW DYQLA FRAL  S LS+VHVRPLALAQGLELF +GKQENG+KLAEFCA ALL MEVLIHPRVLPLSD
Subjt:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD

Query:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDLEEASANGLVSIETPK
        FLP  LSS EP ATYKFQED YFGS+ SSKLLK+DTQ  EQ+  ++DD+F YDR  A++IEEAPIRDA                            ETPK
Subjt:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDLEEASANGLVSIETPK

Query:  RTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE
         TEQA  A +TEVGVVEK DVFA       P+SSKS+KT +DF  D G  LL EDDFPDIIDADPDTDYE
Subjt:  RTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE

A0A6J1GYU8 proline-, glutamic acid- and leucine-rich protein 1-like isoform X10.0e+0080.05Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA
        MAAFNLVANMYD ALKPRL+HKLLREHVPDDKR FNDHSELSKVVS+IKIHNLLSES   MDQ LIDSWKSAVDSWVNRLF+LLSNDM      PDKCWA
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWA

Query:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA
        GIILLGVTCQQC+SSRFLASYTEWLH+LLPH+QTDS FLKVASCASISDLFLRLGR QSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLD AVNLLC LIA
Subjt:  GIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIA

Query:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN
        FFPFTI RHYDSAEAAIVSKI+SGKC SNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLN+AFQG GEDSKG+E +RLLIPPGK+PPPPLGCN
Subjt:  FFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCN

Query:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL
        S+SE S DKIT+SSER LT  ISTLM CCSTMITSSYNHQVAVPIRPLLA+V+RVLTVDGSLPPTSVPFMTSLQQES+  ELPALHSDSLDLLIAIVK L
Subjt:  SMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSL

Query:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD
        R                                               SQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAK LMMSLGVGMAASLARD
Subjt:  RRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARD

Query:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV
        VIDNALVDLNPVDNES +PSSVNPK+ QRELLQH+KKRKRPSVPTSMKGQHERH  GD    SS MSTSVHLRIAALEALETLLTL GALR+EEGWRAKV
Subjt:  VIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKV

Query:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD
        EHLLITAATSSFEWP+ASDDIFF+ANE IEVW DYQLA FRAL  S LS+VHVRPLALAQGLELF +GKQENG+KLAEFCA ALL MEVLIHPRVLPLSD
Subjt:  EHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSD

Query:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDA-GNALNNDEMTYNTSNDLE-EASANGLVSIET
        FLP  LSS EP ATYKFQED YFGS+ SSKLLK+DTQ  EQ+  ++DD+F YDR  A++IEEAPIRDA GN +N+ EMTYN SNDLE E  ANGLVSIET
Subjt:  FLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQ--EQTAADVDDDFLYDREVADDIEEAPIRDA-GNALNNDEMTYNTSNDLE-EASANGLVSIET

Query:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE
        PK TEQA  A +TEVGVVEK DVFA       P+SSKS+KT +DF  D G  LL EDDFPDIIDADPDTDYE
Subjt:  PKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30240.1 FUNCTIONS IN: binding; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Armadillo-type fold (InterPro:IPR016024); Has 165 Blast hits to 164 proteins in 73 species: Archae - 0; Bacteria - 0; Metazoa - 47; Fungi - 68; Plants - 46; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink).2.5e-14839.36Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSES-SSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCW
        MA+F    +M DL LKP++L  LL E+VP++K+   +   LSKVVS I  H LLSES  + +DQ L    KSAVD WV RL  L+S+DM      PDK W
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSES-SSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCW

Query:  AGIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQ--TDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCN
         GI L+GVTCQ+C+S RF  SY+ W + LL H++    S  ++VASC SISDL  RL R  + KKD  S A K+I P+IKLL +D++EA+L+  V+LL  
Subjt:  AGIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQ--TDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCN

Query:  LIAFFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPL
        ++  FP     +YD  EAAI SKIFS K SSNMLKK AH LA LPK+KGDE +WSL+MQK+L+SI+ HLN+ FQG  E++KG + ++ L PPGKD P PL
Subjt:  LIAFFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPL

Query:  GCNSMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIV
        G  +   G LD  + +SE+ + S +S LM C STM+T+SY  ++ +P+  LL+LVERVL V+GSLP    PFMT +QQE +  ELPALHS +L+LL A +
Subjt:  GCNSMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIV

Query:  KSLRRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASL
        KS+R                                               SQLLP+AAS+VRL+  YF+KC   ELR+K+Y++  +L+ S+  GMA  L
Subjt:  KSLRRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASL

Query:  ARDVIDNALVDLNPVDNESSE-PSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGW
        A++V+ NA VDL+    E+ +  SS NP  T   LLQ   K+++ S   +     E   P       + + + + L+IA+LEALETLLT+ GAL S + W
Subjt:  ARDVIDNALVDLNPVDNESSE-PSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGW

Query:  RAKVEHLLITAATSSFEWPRASDDIFF-QANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRV
        R  V++LL+T AT++ E   A+ + +    N+S    V++QLA  RA   SL+S   VRP  LA+GLELF  GK + G K+A FCA AL+ +EV+IHPR 
Subjt:  RAKVEHLLITAATSSFEWPRASDDIFF-QANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRV

Query:  LPLSDFLPACLSSTEPLATYKFQEDTYFGS------------LISSKLLKVDTQEQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDL
        LPL            P  + +F E   FGS            +I+     +  + Q  ADV  +    R +   +   P++++      +++    S  +
Subjt:  LPLSDFLPACLSSTEPLATYKFQEDTYFGS------------LISSKLLKVDTQEQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDL

Query:  EE-----ASANGLVSIETPKRTEQATAAVITEVGVVEKDDVFAD---ASMNSYPISSKSNKTEE-----------DFKRDPGPNLLAEDDFPDIIDADPD
        ++     AS NG    + P++  + +   +T+  V    D + +    +     ++ K +  EE           +   DP P+L   D      D+D D
Subjt:  EE-----ASANGLVSIETPKRTEQATAAVITEVGVVEKDDVFAD---ASMNSYPISSKSNKTEE-----------DFKRDPGPNLLAEDDFPDIIDADPD

Query:  TD
         +
Subjt:  TD

AT1G30240.2 unknown protein7.1e-15139.47Show/hide
Query:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSES-SSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCW
        MA+F    +M DL LKP++L  LL E+VP++K+   +   LSKVVS I  H LLSES  + +DQ L    KSAVD WV RL  L+S+DM      PDK W
Subjt:  MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSES-SSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCW

Query:  AGIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQ--TDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCN
         GI L+GVTCQ+C+S RF  SY+ W + LL H++    S  ++VASC SISDL  RL R  + KKD  S A K+I P+IKLL +D++EA+L+  V+LL  
Subjt:  AGIILLGVTCQQCNSSRFLASYTEWLHKLLPHMQ--TDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCN

Query:  LIAFFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPL
        ++  FP     +YD  EAAI SKIFS K SSNMLKK AH LA LPK+KGDE +WSL+MQK+L+SI+ HLN+ FQG  E++KG + ++ L PPGKD P PL
Subjt:  LIAFFPFTIQRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPL

Query:  GCNSMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIV
        G  +   G LD  + +SE+ + S +S LM C STM+T+SY  ++ +P+  LL+LVERVL V+GSLP    PFMT +QQE +  ELPALHS +L+LL A +
Subjt:  GCNSMSEGSLDKITKSSERTLTSIISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIV

Query:  KSLRRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASL
        KS+R                                               SQLLP+AAS+VRL+  YF+KC   ELR+K+Y++  +L+ S+G+GMA  L
Subjt:  KSLRRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVYILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASL

Query:  ARDVIDNALVDLNPVDNESSE-PSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGW
        A++V+ NA VDL+    E+ +  SS NP  T   LLQ   K+++ S   +     E   P       + + + + L+IA+LEALETLLT+ GAL S + W
Subjt:  ARDVIDNALVDLNPVDNESSE-PSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGW

Query:  RAKVEHLLITAATSSFEWPRASDDIFF-QANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRV
        R  V++LL+T AT++ E   A+ + +    N+S    V++QLA  RA   SL+S   VRP  LA+GLELF  GK + G K+A FCA AL+ +EV+IHPR 
Subjt:  RAKVEHLLITAATSSFEWPRASDDIFF-QANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQGLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRV

Query:  LPLSDFLPACLSSTEPLATYKFQEDTYFGS------------LISSKLLKVDTQEQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDL
        LPL            P  + +F E   FGS            +I+     +  + Q  ADV  +    R +   +   P++++      +++    S  +
Subjt:  LPLSDFLPACLSSTEPLATYKFQEDTYFGS------------LISSKLLKVDTQEQTAADVDDDFLYDREVADDIEEAPIRDAGNALNNDEMTYNTSNDL

Query:  EE-----ASANGLVSIETPKRTEQATAAVITEVGVVEKDDVFAD---ASMNSYPISSKSNKTEE-----------DFKRDPGPNLLAEDDFPDIIDADPD
        ++     AS NG    + P++  + +   +T+  V    D + +    +     ++ K +  EE           +   DP P+L   D      D+D D
Subjt:  EE-----ASANGLVSIETPKRTEQATAAVITEVGVVEKDDVFAD---ASMNSYPISSKSNKTEE-----------DFKRDPGPNLLAEDDFPDIIDADPD

Query:  TD
         +
Subjt:  TD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCCTTCAATCTCGTTGCGAATATGTATGACCTGGCTTTGAAGCCTCGCTTGCTACACAAACTTCTTAGGGAGCACGTTCCTGACGATAAGCGTGGGTTTAATGA
TCATTCGGAACTTTCAAAGGTGGTTTCTGTGATCAAAATCCACAATCTCCTCTCTGAATCCTCGTCTCCCATGGACCAAACGCTGATTGATAGCTGGAAATCCGCCGTTG
ATTCCTGGGTCAACCGCTTGTTTATTCTTCTCTCCAATGATATGCATCCAATCTTTCTAAAGCCTGATAAATGTTGGGCGGGAATCATTTTACTCGGAGTGACTTGTCAA
CAATGCAACTCTAGTCGTTTCTTGGCATCATATACAGAATGGCTTCACAAGCTTTTACCTCATATGCAGACAGATTCTCCGTTTCTGAAGGTGGCCTCTTGTGCTTCGAT
CTCAGATTTATTCTTGAGATTGGGTAGAGTTCAAAGTGTAAAGAAAGATGGGACTTCTTGTGCTGGGAAGGTCATTCAACCAGTTATAAAGTTGTTGCATGATGATAATA
CTGAAGCTGTTTTGGACACTGCAGTTAATCTATTATGCAATCTGATAGCTTTCTTCCCCTTTACAATCCAACGTCATTATGACTCTGCCGAAGCTGCAATTGTTTCAAAA
ATATTTTCAGGAAAGTGTAGTTCTAACATGCTGAAGAAGCTTGCTCATTGCCTGGCATCACTTCCAAAATCAAAAGGAGACGAAGATAGCTGGTCTTTACTAATGCAGAA
GATTTTGTTATCCATCGATAGTCACTTGAATGATGCCTTCCAAGGGAGTGGTGAAGATTCAAAAGGCAATGAATTTGTAAGGTTACTGATTCCACCAGGAAAAGATCCTC
CACCACCTTTAGGTTGTAATTCAATGTCTGAAGGTTCCTTAGACAAAATAACAAAGAGCTCAGAGCGAACGTTAACATCAATTATTTCAACCTTGATGGTTTGCTGTTCC
ACAATGATAACAAGTTCATACAACCATCAGGTTGCAGTTCCCATTCGCCCTTTATTAGCTCTTGTTGAGAGAGTGCTGACAGTGGACGGTTCTTTGCCACCCACTTCAGT
GCCATTTATGACATCTCTGCAGCAAGAGTCACTGTATTTAGAACTTCCGGCACTGCATTCAGACAGTCTGGATCTCCTTATTGCCATAGTTAAGAGCCTTCGCAGGCAAG
GCATCTACTATTCAACTGCACGCCCTATATACCATAAAACCTTAATGCAGACCCAGGCTCTTTCACCTAGCCATTGTCAATACAAATATGGATGCTATATTTCTGTTTAT
ATATTGATTATTAAATTCCTTTGCAGTCAATTGTTACCACATGCTGCATCAATTGTACGACTTATTGTGAAGTACTTCAAGAAGTGTGTCTCTGCAGAACTGAGAGTAAA
AGTCTACGCAGTTGCTAAGTCATTGATGATGTCTTTGGGCGTTGGAATGGCTGCATCTCTTGCACGAGATGTGATTGACAATGCACTAGTCGATTTGAACCCTGTTGATA
ATGAGAGTTCTGAACCATCTAGTGTGAATCCAAAGGACACACAAAGAGAATTGCTGCAACACCATAAGAAGAGAAAACGTCCTTCAGTTCCCACCTCCATGAAAGGGCAG
CACGAGAGGCATGAACCAGGGGACGACATTACCAACAGCAGCCGTATGTCTACCTCAGTCCACTTAAGGATAGCTGCACTTGAGGCTTTGGAGACTCTTCTTACATTGGT
TGGTGCTTTGAGATCTGAAGAAGGGTGGCGTGCAAAAGTTGAACATCTTTTAATAACAGCTGCAACATCTTCTTTTGAATGGCCACGAGCCTCAGACGACATCTTTTTCC
AAGCTAATGAATCTATTGAGGTTTGGGTGGATTATCAGTTGGCAACATTTCGTGCACTACGGACTTCATTGTTGTCTGCTGTCCATGTACGCCCTCTGGCTTTGGCTCAA
GGTCTTGAGCTTTTCCATAGAGGTAAACAAGAAAATGGAACTAAACTTGCTGAATTCTGTGCCAAAGCTCTCTTAGACATGGAGGTCCTAATACATCCAAGGGTGCTTCC
CCTCTCCGATTTTTTGCCTGCATGTTTGAGCTCTACTGAACCTCTAGCTACCTATAAATTCCAGGAAGATACGTACTTCGGTAGTCTGATTTCTAGCAAATTGTTGAAGG
TCGACACGCAAGAACAGACTGCCGCCGATGTGGACGACGATTTCTTGTATGATAGAGAAGTTGCAGATGACATTGAAGAGGCTCCAATTAGAGATGCAGGTAATGCTCTA
AATAACGATGAAATGACATATAACACTTCAAATGATCTCGAGGAGGCTTCTGCAAATGGCCTGGTGAGTATAGAAACGCCCAAGAGGACGGAGCAGGCCACTGCAGCAGT
CATCACAGAAGTAGGGGTTGTAGAGAAAGATGATGTCTTTGCTGATGCAAGTATGAATAGTTATCCCATCTCATCAAAATCCAATAAAACCGAAGAAGATTTCAAACGAG
ATCCAGGTCCGAATTTGTTGGCAGAAGATGATTTCCCTGATATTATTGATGCAGATCCTGATACAGACTATGAAGAGTGA
mRNA sequenceShow/hide mRNA sequence
TGTGCATTATATGGTAGTCGAGATTGTACTAGCAACTTACAGTTGCCTAATCGTTTTGATAGTTTGTAGACATTGAAATCTCAGTGGTTCAATCAAGATGGCGGCCTTCA
ATCTCGTTGCGAATATGTATGACCTGGCTTTGAAGCCTCGCTTGCTACACAAACTTCTTAGGGAGCACGTTCCTGACGATAAGCGTGGGTTTAATGATCATTCGGAACTT
TCAAAGGTGGTTTCTGTGATCAAAATCCACAATCTCCTCTCTGAATCCTCGTCTCCCATGGACCAAACGCTGATTGATAGCTGGAAATCCGCCGTTGATTCCTGGGTCAA
CCGCTTGTTTATTCTTCTCTCCAATGATATGCATCCAATCTTTCTAAAGCCTGATAAATGTTGGGCGGGAATCATTTTACTCGGAGTGACTTGTCAACAATGCAACTCTA
GTCGTTTCTTGGCATCATATACAGAATGGCTTCACAAGCTTTTACCTCATATGCAGACAGATTCTCCGTTTCTGAAGGTGGCCTCTTGTGCTTCGATCTCAGATTTATTC
TTGAGATTGGGTAGAGTTCAAAGTGTAAAGAAAGATGGGACTTCTTGTGCTGGGAAGGTCATTCAACCAGTTATAAAGTTGTTGCATGATGATAATACTGAAGCTGTTTT
GGACACTGCAGTTAATCTATTATGCAATCTGATAGCTTTCTTCCCCTTTACAATCCAACGTCATTATGACTCTGCCGAAGCTGCAATTGTTTCAAAAATATTTTCAGGAA
AGTGTAGTTCTAACATGCTGAAGAAGCTTGCTCATTGCCTGGCATCACTTCCAAAATCAAAAGGAGACGAAGATAGCTGGTCTTTACTAATGCAGAAGATTTTGTTATCC
ATCGATAGTCACTTGAATGATGCCTTCCAAGGGAGTGGTGAAGATTCAAAAGGCAATGAATTTGTAAGGTTACTGATTCCACCAGGAAAAGATCCTCCACCACCTTTAGG
TTGTAATTCAATGTCTGAAGGTTCCTTAGACAAAATAACAAAGAGCTCAGAGCGAACGTTAACATCAATTATTTCAACCTTGATGGTTTGCTGTTCCACAATGATAACAA
GTTCATACAACCATCAGGTTGCAGTTCCCATTCGCCCTTTATTAGCTCTTGTTGAGAGAGTGCTGACAGTGGACGGTTCTTTGCCACCCACTTCAGTGCCATTTATGACA
TCTCTGCAGCAAGAGTCACTGTATTTAGAACTTCCGGCACTGCATTCAGACAGTCTGGATCTCCTTATTGCCATAGTTAAGAGCCTTCGCAGGCAAGGCATCTACTATTC
AACTGCACGCCCTATATACCATAAAACCTTAATGCAGACCCAGGCTCTTTCACCTAGCCATTGTCAATACAAATATGGATGCTATATTTCTGTTTATATATTGATTATTA
AATTCCTTTGCAGTCAATTGTTACCACATGCTGCATCAATTGTACGACTTATTGTGAAGTACTTCAAGAAGTGTGTCTCTGCAGAACTGAGAGTAAAAGTCTACGCAGTT
GCTAAGTCATTGATGATGTCTTTGGGCGTTGGAATGGCTGCATCTCTTGCACGAGATGTGATTGACAATGCACTAGTCGATTTGAACCCTGTTGATAATGAGAGTTCTGA
ACCATCTAGTGTGAATCCAAAGGACACACAAAGAGAATTGCTGCAACACCATAAGAAGAGAAAACGTCCTTCAGTTCCCACCTCCATGAAAGGGCAGCACGAGAGGCATG
AACCAGGGGACGACATTACCAACAGCAGCCGTATGTCTACCTCAGTCCACTTAAGGATAGCTGCACTTGAGGCTTTGGAGACTCTTCTTACATTGGTTGGTGCTTTGAGA
TCTGAAGAAGGGTGGCGTGCAAAAGTTGAACATCTTTTAATAACAGCTGCAACATCTTCTTTTGAATGGCCACGAGCCTCAGACGACATCTTTTTCCAAGCTAATGAATC
TATTGAGGTTTGGGTGGATTATCAGTTGGCAACATTTCGTGCACTACGGACTTCATTGTTGTCTGCTGTCCATGTACGCCCTCTGGCTTTGGCTCAAGGTCTTGAGCTTT
TCCATAGAGGTAAACAAGAAAATGGAACTAAACTTGCTGAATTCTGTGCCAAAGCTCTCTTAGACATGGAGGTCCTAATACATCCAAGGGTGCTTCCCCTCTCCGATTTT
TTGCCTGCATGTTTGAGCTCTACTGAACCTCTAGCTACCTATAAATTCCAGGAAGATACGTACTTCGGTAGTCTGATTTCTAGCAAATTGTTGAAGGTCGACACGCAAGA
ACAGACTGCCGCCGATGTGGACGACGATTTCTTGTATGATAGAGAAGTTGCAGATGACATTGAAGAGGCTCCAATTAGAGATGCAGGTAATGCTCTAAATAACGATGAAA
TGACATATAACACTTCAAATGATCTCGAGGAGGCTTCTGCAAATGGCCTGGTGAGTATAGAAACGCCCAAGAGGACGGAGCAGGCCACTGCAGCAGTCATCACAGAAGTA
GGGGTTGTAGAGAAAGATGATGTCTTTGCTGATGCAAGTATGAATAGTTATCCCATCTCATCAAAATCCAATAAAACCGAAGAAGATTTCAAACGAGATCCAGGTCCGAA
TTTGTTGGCAGAAGATGATTTCCCTGATATTATTGATGCAGATCCTGATACAGACTATGAAGAGTGAACAAAAGTACTGGAAATCCCGACTCAATTTTGTAGCTTTAAGA
GTTTAGAATTCAAGATTAATTATTGTTGTGTTCTATTTCATGTCACCATAGTTTGAGTTGATATTGAGAATGACTATGTATATAAAAAGATGGAAGTTAAAGAAATTGCA
AAAGCTTTGGTTTTAGAATG
Protein sequenceShow/hide protein sequence
MAAFNLVANMYDLALKPRLLHKLLREHVPDDKRGFNDHSELSKVVSVIKIHNLLSESSSPMDQTLIDSWKSAVDSWVNRLFILLSNDMHPIFLKPDKCWAGIILLGVTCQ
QCNSSRFLASYTEWLHKLLPHMQTDSPFLKVASCASISDLFLRLGRVQSVKKDGTSCAGKVIQPVIKLLHDDNTEAVLDTAVNLLCNLIAFFPFTIQRHYDSAEAAIVSK
IFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNDAFQGSGEDSKGNEFVRLLIPPGKDPPPPLGCNSMSEGSLDKITKSSERTLTSIISTLMVCCS
TMITSSYNHQVAVPIRPLLALVERVLTVDGSLPPTSVPFMTSLQQESLYLELPALHSDSLDLLIAIVKSLRRQGIYYSTARPIYHKTLMQTQALSPSHCQYKYGCYISVY
ILIIKFLCSQLLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKSLMMSLGVGMAASLARDVIDNALVDLNPVDNESSEPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQ
HERHEPGDDITNSSRMSTSVHLRIAALEALETLLTLVGALRSEEGWRAKVEHLLITAATSSFEWPRASDDIFFQANESIEVWVDYQLATFRALRTSLLSAVHVRPLALAQ
GLELFHRGKQENGTKLAEFCAKALLDMEVLIHPRVLPLSDFLPACLSSTEPLATYKFQEDTYFGSLISSKLLKVDTQEQTAADVDDDFLYDREVADDIEEAPIRDAGNAL
NNDEMTYNTSNDLEEASANGLVSIETPKRTEQATAAVITEVGVVEKDDVFADASMNSYPISSKSNKTEEDFKRDPGPNLLAEDDFPDIIDADPDTDYEE