CuGenDBv2

Sgr028883 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028883
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: proline-, glutamic acid- and leucine-rich protein 1
Genome location: tig00153210:1170070..1182147
RNA-Seq Expression: Sgr028883
Synteny: Sgr028883
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains:
IPR011989 - Armadillo-like helical
IPR012583 - Pre-rRNA-processing protein RIX1, N-terminal
IPR016024 - Armadillo-type fold


Homology

GenBank top hits | e value | % identity | Alignment
KAG6573885.1 hypothetical protein SDJN03_27772, partial [Cucurbita argyrosperma subsp. sororia] | 0.0e+00 | 84.34
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRLLHKLLREHVPDDK+TF+DHSELSKVVS++KIHNLLSESSSSMDQKL+D+WKSAVDSW+ RL +LLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI
         TCQQCSSSRFL SY +W HKLLPH+QTDSQFLKVA+CASISDLF RL RF NVKKDGTSCAGK+IQP +KLLHD+NTEAVL+AAVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS
        HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSW++LMQKIL+SID HLNEAFQGIGEDS+G+EVVRLLIPPGK+PPPPLGCNS   GS
Subjt:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS

Query:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
        FDK+TKSSER+LTS ISTLM CCSTMITSSYP+QVAVPIRPLLALVER+L VDGSLPP SVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
Subjt:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP

Query:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER
        HAA IVRLIVKYFKKCVSAELRVKVYAVAKLLM+SLGVGMAASLARDVI+N L DLNPVDNE+ APSSV+ KD QRE  QHHKKRKR  V T   EQHE 
Subjt:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER

Query:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP
        HGS D+T+S M T V LR AALEALETLLTLAGALR+EEGWRAK+EHLLITAATSSFEWP ASD++F QTNESIEVW DYQLAAFRALLASFLSAVH+RP
Subjt:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP

Query:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKI-DTRGMEQSAPDLYDDFLYDR
        LALAQGL+LFRRGK E GTKL EFCAHALLA+EVLIHPRVLPL DF PVHLSSPEPQATYKI E+MY  GMNS K LKI DT GM+QSAPDL DDFLYDR
Subjt:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKI-DTRGMEQSAPDLYDDFLYDR

Query:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD
         V DDIEEA IRDAGNE N+  TTYNT+N+     SA+ L   ETPKR++Q+  AAAITD
Subjt:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD

KAG7012950.1 hypothetical protein SDJN02_25703 [Cucurbita argyrosperma subsp. argyrosperma] | 0.0e+00 | 84.47
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRLLHKLLREHVPDDK+TF+DHSELSKVVS++KIHNLLSESSSSMDQKL+D+WKSAVDSW+ RL +LLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI
         TCQQCSSSRFL SY +W HKLLPH+QTDSQFLKVA+CASISDLF RL RF NVKKDGTSCAGK+IQPV+KLLHD+NTEAVL+AAVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS
        HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSW++LMQKIL+SID HLNEAFQGIGEDS+G+EVVRLLIPPGK+PPPPLGCNS   GS
Subjt:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS

Query:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
        FDK+TKSSER+LTS ISTLM CCSTMITSSYP+QVAVPIRPLLALVER+L VDGSLPP SVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
Subjt:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP

Query:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER
        HAA IVRLIVKYFKKCVSAELRVKVYAVAKLLM+SLGVGMAASLARDVI+N L DLNPVDNE+ APSSV+ KD QRE  QHHKKRKR  V T   EQHE 
Subjt:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER

Query:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP
        HGS D+T+S M T V LR AALEALETLLTLAGALR+EEGWRAK+EHLLITAATSSFEWP ASD++F QTNESIEVW DYQLAAFRALLASFLSAVH+RP
Subjt:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP

Query:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKI-DTRGMEQSAPDLYDDFLYDR
        LALAQGL+LFRRGK E GTKL EFCAHALLA+EVLIHPRVLPL DF PVHLSSPEPQATYKI E+MY  GMNS K LKI DT GM+QSAPDL DDFLYDR
Subjt:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKI-DTRGMEQSAPDLYDDFLYDR

Query:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD
         V DDIEEA IRDAGNE N+  TTYNT+N+     SA+ L   ETPKR++Q+  AAAITD
Subjt:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD

XP_022150576.1 proline-, glutamic acid- and leucine-rich protein 1 isoform X1 [Momordica charantia] | 0.0e+00 | 87.86
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELS  VS+IKIHNLLSESSSS DQKL+D+WKSAVDSW++RLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI
        VTCQQCSSSRFL SYTEW  KLLPHIQTDSQFLKVA+CAS+SDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHD+N+EAV EAAVNLL TLIAFFPFT+
Subjt:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS
        HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKIL+SID+HLNEAFQGIGEDS+GSEVVRLLIPPGKDPPPPLGCNSLPGGS
Subjt:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS

Query:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
        FDKITKSSERLLTSSISTLMFCCSTMITSSYP+QVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQES+CSELPTLHS+ LDLLIAIIKSLRSQLLP
Subjt:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP

Query:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER
        +AASIVRLIVKYFKKCVSAELRVKVYAVAKLLM+SLGVGMAASLARDV+ENAL DLNPVDNEN APSSV+SKDTQRE +QHHKKRKR SV T   +Q ER
Subjt:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER

Query:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP
        HGSGDV N  M TPV LR AALEALETLLTLAGALRSEEGWR KIE LL TAATSSF+WPRASDN   QT+ESIEVWTDYQLAAFR LLASFLSAVHVRP
Subjt:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP

Query:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKIDT-RGMEQSAPDLYDDFLYDR
        LALAQGLELFRRGK E+GTKLAEFCAHALLA+EVLIHPRVLPL DFLPVHLSS E Q+TYK EENM+FDG+NS K+LKIDT +G+EQSAPDL DDFL++ 
Subjt:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKIDT-RGMEQSAPDLYDDFLYDR

Query:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITDEGVWKK
         V DDIEEA IR+AGNE NDGETTYNT+ND  KEAS  G    ETPKRSEQ+T AAAITD GV +K
Subjt:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITDEGVWKK

XP_022968338.1 proline-, glutamic acid- and leucine-rich protein 1 [Cucurbita maxima] | 0.0e+00 | 84.21
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG
        MAAFNLV NMYDPALKPRL+HKLLREHVPDDK+TF+DHSELSKVVS++KIHNLLSESSSSMDQKL+D+WKSAVDSW+ RL +LLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI
        VTCQQCSSSRFL SY +W HKLLPH+QTDS FLKVA+CASISDLF RL RF NVKKDGTSCAGK+IQPV+KLLHD+NTE VL+ AVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS
        HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSW++LMQKIL+SID HLNEAFQGIGEDS+G+EVVRLLIPPGK+PPPPLGCNS   GS
Subjt:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS

Query:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
        FDK+TKSSE++LTS ISTLMFCCSTMITSSYPNQVAVPIRPLLALVER+L VDGSLPP SVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
Subjt:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP

Query:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER
        HAA IVRLIVKYFKKCVSAELRVKVYAVAKLLM+SLGVGMAASL RDVI+N L DLNPVDNE+  PSSV+ KD Q EL QHHKKRKR  V T   EQHE 
Subjt:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER

Query:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP
        HGS D+T+S+M T V LR AALEALETLLTLAGALR+EEGWRAK+EHLLITAATSSFEWP ASD+IF QTNESIEVW DYQLAAFRALLASFLSAVH+RP
Subjt:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP

Query:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKI-DTRGMEQSAPDLYDDFLYDR
        LALAQGLELFRRGK E GTKL +FCAHALLA+EVLIHPRVLPL DF PVHLSSPEPQATYKI E+MYF GMNS K LKI DTR M+QSAPDL DDFLYDR
Subjt:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKI-DTRGMEQSAPDLYDDFLYDR

Query:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD
         V DDIEEA IRDAGNE N+  TTYNT+N+     SA+ L   ETPKR+EQ+  AAAITD
Subjt:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD

XP_023542346.1 proline-, glutamic acid- and leucine-rich protein 1-like [Cucurbita pepo subsp. pepo] | 0.0e+00 | 85
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRLLHKLLREHVPDDK+TF DHSELSKVVS++KIHNLLSESSSSMDQKL+D+WKSAVDSW+ RL +LLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI
        VTCQQCSSSRFL SY +W HKLLPH+QTDSQFLKVA+CASISDLF RL RF NVKKDGTSCAGK+IQPV+KLLHD+NTEAVL+AAVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS
        HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSW++LMQKIL+SID HLNEAFQGIGEDS+G+EVVRLLIPPGK+PPPPLGCNS   GS
Subjt:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS

Query:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
        FDK+TKSSER+LTS ISTLMFCCSTMITSSYP+QVAVPIRPLLALVER+LMVDGSLPP SVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
Subjt:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP

Query:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER
        HAA IVRLIVKYFKKCVSAELRVK YAVAKLLM+SLGVGMAASLARDVI+N L DLNPVDNE+ APSSV+ KD QREL QHHKKRKR  V T   EQHE 
Subjt:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER

Query:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP
        HGS D+T+S M T V LR AALEALETLLTLAGALR+EEGWRAK++HLLITAATSSFEWP ASD++F QTNESIEVW DYQLAAFRALLASFLSAVH+RP
Subjt:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP

Query:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLK-IDTRGMEQSAPDLYDDFLYDR
        LALAQGL+LFRRGK E GTKL EFCAHALLA+EVLIHPRVLPL DFLPVHLSSPEPQATYKI E+MYF GMNS K LK IDT GM+QSAPDL DDFLYDR
Subjt:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLK-IDTRGMEQSAPDLYDDFLYDR

Query:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD
         V DDIEEA IRDA NE N+  TTYNT+N+     SA+ L   ETPKR+EQ+  AAAITD
Subjt:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD

TrEMBL top hits | e value | % identity | Alignment
A0A6J1DBX6 proline-, glutamic acid- and leucine-rich protein 1 isoform X1 | 0.0e+00 | 87.86
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELS  VS+IKIHNLLSESSSS DQKL+D+WKSAVDSW++RLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI
        VTCQQCSSSRFL SYTEW  KLLPHIQTDSQFLKVA+CAS+SDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHD+N+EAV EAAVNLL TLIAFFPFT+
Subjt:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS
        HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKIL+SID+HLNEAFQGIGEDS+GSEVVRLLIPPGKDPPPPLGCNSLPGGS
Subjt:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS

Query:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
        FDKITKSSERLLTSSISTLMFCCSTMITSSYP+QVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQES+CSELPTLHS+ LDLLIAIIKSLRSQLLP
Subjt:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP

Query:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER
        +AASIVRLIVKYFKKCVSAELRVKVYAVAKLLM+SLGVGMAASLARDV+ENAL DLNPVDNEN APSSV+SKDTQRE +QHHKKRKR SV T   +Q ER
Subjt:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER

Query:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP
        HGSGDV N  M TPV LR AALEALETLLTLAGALRSEEGWR KIE LL TAATSSF+WPRASDN   QT+ESIEVWTDYQLAAFR LLASFLSAVHVRP
Subjt:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP

Query:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKIDT-RGMEQSAPDLYDDFLYDR
        LALAQGLELFRRGK E+GTKLAEFCAHALLA+EVLIHPRVLPL DFLPVHLSS E Q+TYK EENM+FDG+NS K+LKIDT +G+EQSAPDL DDFL++ 
Subjt:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKIDT-RGMEQSAPDLYDDFLYDR

Query:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITDEGVWKK
         V DDIEEA IR+AGNE NDGETTYNT+ND  KEAS  G    ETPKRSEQ+T AAAITD GV +K
Subjt:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITDEGVWKK

A0A6J1FZZ0 proline-, glutamic acid- and leucine-rich protein 1-like | 0.0e+00 | 84.21
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRLLHKLLREHVPDDK+TF+DHSELSKVVS++KIHNLLSESSSSMDQKL+D+WKSAVDSW+ RL +LLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI
         TCQQCSSSRFL SY +W HKLLPH+QTDSQFLKVA+CASISDLF RL RF NVKKDGTSCAGK+IQPV+KLLHD+NTEAVL+AAVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS
        HRHYDSAEAAIVSKIFSG CSFNMLKKLAHCLASLPKSKGDEDSW++LMQKIL+SID HLNEAFQGIGEDS+G+EVVRLLIPPGK+PPPPLGCNS   GS
Subjt:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS

Query:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
        FDK+TKSSER+LTS ISTLMFCCSTMITSSYP+QVAVPIRPLLALVER+L VDGSLPP SVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
Subjt:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP

Query:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER
        HAA IVRLIVKYFKKCVSAELRVKVYAVAKLLM+SLGVGMAASLARDVI+N L DLNPVDNE+ APSSV+ KD QREL QHHKKRKR  V T   EQHE 
Subjt:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER

Query:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP
        HGS D+T+S   T V LR AALEALETLLTLAGALR+EEGW AK+EHLLITAA SSFEWP ASD++F QTNESIEVW DYQLAAFRALLASFLSAVH+RP
Subjt:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP

Query:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKI-DTRGMEQSAPDLYDDFLYDR
        LALAQGL+LFRRGK E GTKL EFCAHALLA+EVLIHPRVLPL DF PVHLSSPEPQATYKI E+MY  GMNS K LKI DT GM+QSAPDL DDFLYDR
Subjt:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKI-DTRGMEQSAPDLYDDFLYDR

Query:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD
         V DDIEEA IRDAGNE N+  TTYNT+N+     SA+ L   ETPKR++Q+  AAAITD
Subjt:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD

A0A6J1GXZ0 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 | 0.0e+00 | 81.46
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRL+HKLLREHVPDDKR F+DHSELSKVVS+IKIHNLLSES  SMDQKL+D+WKSAVDSW+ RLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI
        VTCQQCSSSRFL SYTEW H+LLPH+QTDSQFLKVASCASISDLF RL RFQ+VKKDGTSCAGK+IQPV+KLLHD+NTEAVL+AAVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS
        HRHYDSAEAAIVSKI+SGKC  NMLKKLAHCLASLPKSKGDEDSWSLLMQKIL+SIDSHLNEAFQGIGEDSKG EV+RLLIPPGK+PPPPLGCNSL   S
Subjt:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS

Query:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
        FDKIT+SSER+LT SISTLMFCCSTMITSSY +QVAVPIRPLLA+V+RVL VDGSLPPTSVPFMTSLQQESMCSELP LHSDSLDLLIAI+K LRSQLLP
Subjt:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP

Query:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTTE---QHER
        HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLM+SLGVGMAASLARDVI+NAL DLNPVDNE+  PSSV+ K+ QRELLQH+KKRKR SV T    QHER
Subjt:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTTE---QHER

Query:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP
        HGSGD+T+S M T V LR AALEALETLLTLAGALR+EEGWRAK+EHLLITAATSSFEWP+ASD+IF + NE IEVW DYQLAAFRALLASFLS+VHVRP
Subjt:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP

Query:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKIDTRGMEQSAPDLYDDFLYDRG
        LALAQGLELFR+GK E G+KLAEFCAHALLA+EVLIHPRVLPL DFLPV LSSPEPQATYK +E+MYF  M S KLLKIDT+GMEQS P+L D+F YDR 
Subjt:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKIDTRGMEQSAPDLYDDFLYDRG

Query:  VTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITDEGVWKKM
          ++IEEA IRDA                             ETPK +EQ  A AA+T+ GV +K+
Subjt:  VTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITDEGVWKKM

A0A6J1GYU8 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 | 0.0e+00 | 83.57
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRL+HKLLREHVPDDKR F+DHSELSKVVS+IKIHNLLSES  SMDQKL+D+WKSAVDSW+ RLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI
        VTCQQCSSSRFL SYTEW H+LLPH+QTDSQFLKVASCASISDLF RL RFQ+VKKDGTSCAGK+IQPV+KLLHD+NTEAVL+AAVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS
        HRHYDSAEAAIVSKI+SGKC  NMLKKLAHCLASLPKSKGDEDSWSLLMQKIL+SIDSHLNEAFQGIGEDSKG EV+RLLIPPGK+PPPPLGCNSL   S
Subjt:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS

Query:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
        FDKIT+SSER+LT SISTLMFCCSTMITSSY +QVAVPIRPLLA+V+RVL VDGSLPPTSVPFMTSLQQESMCSELP LHSDSLDLLIAI+K LRSQLLP
Subjt:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP

Query:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTTE---QHER
        HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLM+SLGVGMAASLARDVI+NAL DLNPVDNE+  PSSV+ K+ QRELLQH+KKRKR SV T    QHER
Subjt:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTTE---QHER

Query:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP
        HGSGD+T+S M T V LR AALEALETLLTLAGALR+EEGWRAK+EHLLITAATSSFEWP+ASD+IF + NE IEVW DYQLAAFRALLASFLS+VHVRP
Subjt:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP

Query:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKIDTRGMEQSAPDLYDDFLYDRG
        LALAQGLELFR+GK E G+KLAEFCAHALLA+EVLIHPRVLPL DFLPV LSSPEPQATYK +E+MYF  M S KLLKIDT+GMEQS P+L D+F YDR 
Subjt:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKIDTRGMEQSAPDLYDDFLYDRG

Query:  VTDDIEEASIRDA-GNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITDEGVWKKM
          ++IEEA IRDA GN  ND E TYN +ND  KE  ANGL  IETPK +EQ  A AA+T+ GV +K+
Subjt:  VTDDIEEASIRDA-GNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITDEGVWKKM

A0A6J1HXR1 proline-, glutamic acid- and leucine-rich protein 1 | 0.0e+00 | 84.21
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG
        MAAFNLV NMYDPALKPRL+HKLLREHVPDDK+TF+DHSELSKVVS++KIHNLLSESSSSMDQKL+D+WKSAVDSW+ RL +LLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI
        VTCQQCSSSRFL SY +W HKLLPH+QTDS FLKVA+CASISDLF RL RF NVKKDGTSCAGK+IQPV+KLLHD+NTE VL+ AVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS
        HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSW++LMQKIL+SID HLNEAFQGIGEDS+G+EVVRLLIPPGK+PPPPLGCNS   GS
Subjt:  HRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLPGGS

Query:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
        FDK+TKSSE++LTS ISTLMFCCSTMITSSYPNQVAVPIRPLLALVER+L VDGSLPP SVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP
Subjt:  FDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLP

Query:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER
        HAA IVRLIVKYFKKCVSAELRVKVYAVAKLLM+SLGVGMAASL RDVI+N L DLNPVDNE+  PSSV+ KD Q EL QHHKKRKR  V T   EQHE 
Subjt:  HAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTT---EQHER

Query:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP
        HGS D+T+S+M T V LR AALEALETLLTLAGALR+EEGWRAK+EHLLITAATSSFEWP ASD+IF QTNESIEVW DYQLAAFRALLASFLSAVH+RP
Subjt:  HGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRP

Query:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKI-DTRGMEQSAPDLYDDFLYDR
        LALAQGLELFRRGK E GTKL +FCAHALLA+EVLIHPRVLPL DF PVHLSSPEPQATYKI E+MYF GMNS K LKI DTR M+QSAPDL DDFLYDR
Subjt:  LALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKI-DTRGMEQSAPDLYDDFLYDR

Query:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD
         V DDIEEA IRDAGNE N+  TTYNT+N+     SA+ L   ETPKR+EQ+  AAAITD
Subjt:  GVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANGLPCIETPKRSEQDTAAAAITD

SwissProt top hits | e value | % identity | Alignment
Q56B11 Proline-, glutamic acid- and leucine-rich protein 1 | 2.6e-07 | 22.05
Query:  GIILLGVTCQQCSSSRFLTSYTEWFHKLLPHIQT-DSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLI
        G+ LL +   +  +  F      W   +   +Q+ DS      + A + DL    S+   + +D ++     +   L  L  +  ++ LE     +   +
Subjt:  GIILLGVTCQQCSSSRFLTSYTEWFHKLLPHIQT-DSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLI

Query:  AFFPFTIHRHYDSAEAAIVSKIFSGKCSFN-MLKKLA-HCLASLPK-----SKG--DEDSWSLLMQKILISIDSHLNEAFQGIGE---DSKGSEVVRLLI
         +FP    R     +  + S   S   S N  L++LA  C + LP      S+G    ++W   +  +L S+ S L   F+        S+G  V  LL 
Subjt:  AFFPFTIHRHYDSAEAAIVSKIFSGKCSFN-MLKKLA-HCLASLPK-----SKG--DEDSWSLLMQKILISIDSHLNEAFQGIGE---DSKGSEVVRLLI

Query:  PPGKDPPPPLGCNSLPGGSFDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHS
        P   D    L                    L    S L  C   M++S +   V+VP++ +L L+ R+L +       ++  +       +   LP+LH 
Subjt:  PPGKDPPPPLGCNSLPGGSFDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHS

Query:  DSLDLLIAIIKSLRSQLLPHAASIVRLIVKYFKKCVS-------------AELRVKVYAVAKLLM----ISLGVGMAASLARDVIENALGDLNPVDNENG
        ++LDLL A+I +   +LL   A I RL+ +      +             + +R KVYA+ +L +     S G+    +    ++ + L D++P  +   
Subjt:  DSLDLLIAIIKSLRSQLLPHAASIVRLIVKYFKKCVS-------------AELRVKVYAVAKLLM----ISLGVGMAASLARDVIENALGDLNPVDNENG

Query:  APSSVSSKDTQRELLQHHKKRKRLSVTTEQHERHGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIF
          S+  S D     LQ  K      +  +  E            +    +  AAL  L   + + G L  EE  R ++  L++    S  +      + +
Subjt:  APSSVSSKDTQRELLQHHKKRKRLSVTTEQHERHGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIF

Query:  SQTNESIEVWTDYQLAAFRALLASFLSAVHVRPLALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLD---FLPVHLSSPEPQA
        + +   +E+        +R LLA  L+     P  L+  L+ F  G+ E   +++ FC+ AL+    L HPRV PL       P     P P+A
Subjt:  SQTNESIEVWTDYQLAAFRALLASFLSAVHVRPLALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLD---FLPVHLSSPEPQA

Q9DBD5 Proline-, glutamic acid- and leucine-rich protein 1 | 2.1e-09 | 22.93
Query:  GIILLGVTCQQCSSSRFLTSYTEWFHKLLPHIQT-DSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLI
        G+ LL +   +  +  F      W   +   +Q+ DS      + A + DL    S+   + +D ++     +   L  L  E  ++ LE     +   +
Subjt:  GIILLGVTCQQCSSSRFLTSYTEWFHKLLPHIQT-DSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLI

Query:  AFFPFTIHRHYDSAEAAIVSKIFSGKCSFN-MLKKLA-HCLASLPK-----SKG--DEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSE--VVRLLIP
         +FP    R   S +  + S   S   S N  L++LA  C + LP      S+G    ++W   +  +L S+ S L   F+        SE   + +L+ 
Subjt:  AFFPFTIHRHYDSAEAAIVSKIFSGKCSFN-MLKKLA-HCLASLPK-----SKG--DEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSE--VVRLLIP

Query:  PGKDPPPPLGCNSLPGGSFDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSD
          +D            G+   + +  +R      S L  C   M++S +   V+VP++ +L L+ R+L +       ++  +       +   LP+LH +
Subjt:  PGKDPPPPLGCNSLPGGSFDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSD

Query:  SLDLLIAIIKSLRSQLLPHAASIVRLIVKYFKKCVS-------------AELRVKVYAVAKLLM----ISLGVGMAASLARDVIENALGDLNPVDNENGA
        +LDLL A+I +  S+LL   A I RL+ +      +             + +R KVYA+ +L +     S G+    +    ++ + L D++P  +    
Subjt:  SLDLLIAIIKSLRSQLLPHAASIVRLIVKYFKKCVS-------------AELRVKVYAVAKLLM----ISLGVGMAASLARDVIENALGDLNPVDNENGA

Query:  PSSVSSKDTQRELLQHHKKRKRLSVTTEQHERHGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFS
         S+  S D     LQ  K      +  +  E            +    +  AAL  L   + + G L  EE  R    H L+     S +      +  S
Subjt:  PSSVSSKDTQRELLQHHKKRKRLSVTTEQHERHGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFS

Query:  QTNESIEVWTDYQLAAFRALLASFLSAVHVRPLALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLD---FLPVHLSSPEPQA
          N S       +L  +R LLA  L+     P  LA  L+ F  G+ E   +++ FC+ AL+    L HPRV PL       P     P P+A
Subjt:  QTNESIEVWTDYQLAAFRALLASFLSAVHVRPLALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLD---FLPVHLSSPEPQA

Arabidopsis top hits | e value | % identity | Alignment
AT1G30240.1 FUNCTIONS IN: binding; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Armadillo-type fold (InterPro:IPR016024); Has 165 Blast hits to 164 proteins in 73 species: Archae - 0; Bacteria - 0; Metazoa - 47; Fungi - 68; Plants - 46; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). | 1.5e-167 | 50.91
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSES-SSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILL
        MA+F    +M D  LKP++L  LL E+VP++K+   +   LSKVVS I  H LLSES  +S+DQKL    KSAVD W+ RL  L+S+DMPDK W GI L+
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSES-SSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILL

Query:  GVTCQQCSSSRFLTSYTEWFHKLLPHIQ--TDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFP
        GVTCQ+CSS RF  SY+ WF+ LL H++    S+ ++VASC SISDL +RLSRF N KKD  S A K+I P++KLL ++++EA+LE  V+LL T++  FP
Subjt:  GVTCQQCSSSRFLTSYTEWFHKLLPHIQ--TDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFP

Query:  FTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLP
           H +YD  EAAI SKIFS K S NMLKK AH LA LPK+KGDE +WSL+MQK+LISI+ HLN  FQG+ E++KG++ ++ L PPGKD P PLG  +  
Subjt:  FTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLP

Query:  GGSFDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQ
         G  D  + +SE+L+ S +S LMFC STM+T+SY +++ +P+  LL+LVERVL+V+GSLP    PFMT +QQE +C+ELP LHS +L+LL A +KS+RSQ
Subjt:  GGSFDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQ

Query:  LLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNEN-GAPSSVSSKDTQRELLQH-HKKRKRLSVTTEQH
        LLP+AAS+VRL+  YF+KC   ELR+K+Y++   L+ S+  GMA  LA++V+ NA  DL+    E     SS +   T   LLQ   KKRK   V  E  
Subjt:  LLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNEN-GAPSSVSSKDTQRELLQH-HKKRKRLSVTTEQH

Query:  ERHGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFE--WPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAV
               + ++++ +P++L+ A+LEALETLLT+ GAL S + WR  +++LL+T AT++ E  W  A +      N+S     ++QLAA RA  AS +S  
Subjt:  ERHGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFE--WPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAV

Query:  HVRPLALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQA
         VRP  LA+GLELFR GKL+ G K+A FCAHAL+++EV+IHPR LP LD LP  LS+  P++
Subjt:  HVRPLALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQA

AT1G30240.2 unknown protein | 5.5e-170 | 51.06
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSES-SSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILL
        MA+F    +M D  LKP++L  LL E+VP++K+   +   LSKVVS I  H LLSES  +S+DQKL    KSAVD W+ RL  L+S+DMPDK W GI L+
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSES-SSSMDQKLVDNWKSAVDSWIERLFLLLSNDMPDKCWAGIILL

Query:  GVTCQQCSSSRFLTSYTEWFHKLLPHIQ--TDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFP
        GVTCQ+CSS RF  SY+ WF+ LL H++    S+ ++VASC SISDL +RLSRF N KKD  S A K+I P++KLL ++++EA+LE  V+LL T++  FP
Subjt:  GVTCQQCSSSRFLTSYTEWFHKLLPHIQ--TDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAAVNLLCTLIAFFP

Query:  FTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLP
           H +YD  EAAI SKIFS K S NMLKK AH LA LPK+KGDE +WSL+MQK+LISI+ HLN  FQG+ E++KG++ ++ L PPGKD P PLG  +  
Subjt:  FTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNSLP

Query:  GGSFDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQ
         G  D  + +SE+L+ S +S LMFC STM+T+SY +++ +P+  LL+LVERVL+V+GSLP    PFMT +QQE +C+ELP LHS +L+LL A +KS+RSQ
Subjt:  GGSFDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQ

Query:  LLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNEN-GAPSSVSSKDTQRELLQH-HKKRKRLSVTTEQH
        LLP+AAS+VRL+  YF+KC   ELR+K+Y++   L+ S+G+GMA  LA++V+ NA  DL+    E     SS +   T   LLQ   KKRK   V  E  
Subjt:  LLPHAASIVRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNEN-GAPSSVSSKDTQRELLQH-HKKRKRLSVTTEQH

Query:  ERHGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFE--WPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAV
               + ++++ +P++L+ A+LEALETLLT+ GAL S + WR  +++LL+T AT++ E  W  A +      N+S     ++QLAA RA  AS +S  
Subjt:  ERHGSGDVTNSYMHTPVTLRTAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFE--WPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAV

Query:  HVRPLALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQA
         VRP  LA+GLELFR GKL+ G K+A FCAHAL+++EV+IHPR LP LD LP  LS+  P++
Subjt:  HVRPLALAQGLELFRRGKLETGTKLAEFCAHALLAVEVLIHPRVLPLLDFLPVHLSSPEPQA


Sequences
CDS sequence
ATGTCAGGCACACCAGGGAGGCACCAGTGGCTGCAATCAGAGGATAGACCTTCCTGCAGACATTGCTGTGATGTCTTGAAGATATACCGGGAATCTCATCCTTCTCAGTA
CGCCCTGAAGCACGAGCAACTGTCCTGGAACGTGTTGGTGGCTGAAAGGAATCGTACAACGGATACGACTGATCGAAAACCCATTTCCCATCTGAAAAATCGCATCTCCT
CTGTGAATCCCCGCCGCTCTGAACTATATGAACTTCGTCGTCTTCTTCGCTCAGCCAGGACTCGTCGTCTTCTGAACCAAACGGCGATGAGTTTCCATGCTCGGTGGAAC
TCCTCCCCGCAGCTCTTGCTTCTACTCTTCGTGTTCCTCCCGGCTTCATCTTCCAACTTTGACTCAGCCGAAGGTCTTCTACATTCTAGGGTTTCAAAACTCAGAATGGC
GGCCTTCAATCTTGTTGCGAATATGTATGACCCAGCTTTGAAGCCTCGCTTGCTTCACAAACTTCTTAGAGAGCACGTTCCGGATGATAAGCGGACATTTCATGATCATT
CGGAACTCTCAAAGGTGGTTTCTATTATCAAAATCCACAATCTACTCTCCGAATCTTCGTCTTCCATGGACCAAAAACTGGTAGATAACTGGAAATCCGCAGTTGATTCC
TGGATCGAACGCTTGTTTCTTCTTCTTTCTAACGACATGCCTGATAAATGTTGGGCGGGAATCATTTTACTGGGAGTGACCTGTCAACAATGCAGCTCTAGTCGTTTCTT
GACATCATACACAGAATGGTTTCACAAGCTTTTACCTCACATTCAGACAGATTCTCAGTTTCTGAAGGTGGCCTCTTGTGCTTCGATCTCAGATTTATTCTCAAGATTGA
GTAGATTTCAAAATGTAAAGAAAGATGGGACTTCTTGTGCTGGAAAGATCATTCAACCAGTTCTTAAGCTGTTACATGATGAAAATACAGAAGCTGTTTTGGAAGCTGCA
GTGAATCTATTATGCACTCTGATAGCTTTCTTTCCCTTTACAATTCATCGTCATTACGATTCTGCTGAAGCTGCAATTGTTTCAAAAATCTTTTCAGGAAAGTGTAGTTT
CAATATGCTGAAGAAGCTTGCCCATTGCCTAGCGTCACTTCCAAAATCAAAAGGAGATGAAGATAGCTGGTCTTTACTGATGCAGAAAATTTTGATATCAATTGATAGTC
ACTTGAATGAGGCCTTCCAAGGCATTGGTGAAGACTCAAAAGGTAGTGAAGTTGTAAGGCTGCTGATTCCACCAGGAAAAGATCCTCCACCACCATTAGGTTGTAATTCA
TTGCCAGGAGGTTCCTTTGACAAAATAACAAAGAGCTCAGAGCGATTGTTGACATCTAGTATCTCGACCTTGATGTTTTGCTGTTCTACAATGATAACAAGTTCATACCC
CAATCAGGTGGCAGTTCCAATTCGCCCTTTGTTAGCTCTTGTTGAGAGAGTGCTGATGGTGGATGGTTCTTTGCCACCCACGTCAGTGCCATTTATGACTTCTCTGCAAC
AAGAGTCAATGTGTTCAGAACTTCCAACACTGCATTCAGACAGTTTGGATCTTCTCATTGCCATCATCAAGAGTCTTCGCAGTCAATTGTTACCACATGCTGCATCTATT
GTGCGTCTCATTGTGAAGTACTTCAAGAAGTGTGTCTCTGCAGAACTGAGAGTAAAGGTCTATGCAGTTGCTAAGTTATTGATGATATCTTTGGGCGTTGGAATGGCTGC
ATCTCTTGCACGAGATGTGATTGAGAATGCACTAGGTGACTTGAACCCTGTTGATAATGAGAATGGTGCCCCATCTAGTGTGAGTTCAAAGGACACTCAAAGAGAGTTGC
TGCAACACCATAAGAAGAGAAAACGTCTTTCAGTTACCACAGAGCAGCATGAGAGGCATGGATCAGGGGACGTTACCAACAGCTATATGCATACTCCAGTCACTTTGAGG
ACAGCTGCGCTTGAGGCTTTGGAGACTCTTCTTACATTGGCTGGTGCTTTGAGATCTGAAGAAGGGTGGCGTGCAAAAATTGAACATCTTTTAATAACAGCCGCAACATC
TTCTTTTGAATGGCCACGGGCCTCAGACAACATTTTTTCCCAAACTAATGAATCTATTGAGGTTTGGACGGATTATCAGCTGGCAGCATTTCGTGCACTGCTAGCTTCAT
TTTTGTCTGCTGTCCATGTACGCCCTCTGGCTTTGGCTCAAGGTCTTGAGCTTTTCCGTAGAGGTAAACTAGAAACTGGAACTAAACTAGCTGAATTCTGTGCCCATGCT
CTTTTAGCCGTGGAGGTCCTAATACATCCAAGGGTACTTCCCCTGTTGGATTTCTTGCCCGTGCATTTGAGCTCTCCTGAACCACAAGCTACCTATAAAATTGAGGAAAA
TATGTACTTCGATGGTATGAATTCTCGCAAATTGTTGAAGATTGACACACGGGGCATGGAGCAGAGTGCCCCTGATTTGTACGACGATTTCTTGTACGATAGAGGGGTTA
CAGATGACATTGAAGAGGCTTCAATTAGAGATGCAGGTAACGAGCCAAATGATGGCGAAACAACATACAACACCGCAAATGATCCTGGCAAGGAGGCCTCTGCCAATGGC
CTGCCGTGTATAGAAACTCCCAAGAGGAGTGAGCAGGACACTGCAGCAGCAGCCATCACAGATGAAGGAGTTTGGAAAAAGATGATGTCTTTGCTAATGCAA
mRNA sequence
ATGTCAGGCACACCAGGGAGGCACCAGTGGCTGCAATCAGAGGATAGACCTTCCTGCAGACATTGCTGTGATGTCTTGAAGATATACCGGGAATCTCATCCTTCTCAGTA
CGCCCTGAAGCACGAGCAACTGTCCTGGAACGTGTTGGTGGCTGAAAGGAATCGTACAACGGATACGACTGATCGAAAACCCATTTCCCATCTGAAAAATCGCATCTCCT
CTGTGAATCCCCGCCGCTCTGAACTATATGAACTTCGTCGTCTTCTTCGCTCAGCCAGGACTCGTCGTCTTCTGAACCAAACGGCGATGAGTTTCCATGCTCGGTGGAAC
TCCTCCCCGCAGCTCTTGCTTCTACTCTTCGTGTTCCTCCCGGCTTCATCTTCCAACTTTGACTCAGCCGAAGGTCTTCTACATTCTAGGGTTTCAAAACTCAGAATGGC
GGCCTTCAATCTTGTTGCGAATATGTATGACCCAGCTTTGAAGCCTCGCTTGCTTCACAAACTTCTTAGAGAGCACGTTCCGGATGATAAGCGGACATTTCATGATCATT
CGGAACTCTCAAAGGTGGTTTCTATTATCAAAATCCACAATCTACTCTCCGAATCTTCGTCTTCCATGGACCAAAAACTGGTAGATAACTGGAAATCCGCAGTTGATTCC
TGGATCGAACGCTTGTTTCTTCTTCTTTCTAACGACATGCCTGATAAATGTTGGGCGGGAATCATTTTACTGGGAGTGACCTGTCAACAATGCAGCTCTAGTCGTTTCTT
GACATCATACACAGAATGGTTTCACAAGCTTTTACCTCACATTCAGACAGATTCTCAGTTTCTGAAGGTGGCCTCTTGTGCTTCGATCTCAGATTTATTCTCAAGATTGA
GTAGATTTCAAAATGTAAAGAAAGATGGGACTTCTTGTGCTGGAAAGATCATTCAACCAGTTCTTAAGCTGTTACATGATGAAAATACAGAAGCTGTTTTGGAAGCTGCA
GTGAATCTATTATGCACTCTGATAGCTTTCTTTCCCTTTACAATTCATCGTCATTACGATTCTGCTGAAGCTGCAATTGTTTCAAAAATCTTTTCAGGAAAGTGTAGTTT
CAATATGCTGAAGAAGCTTGCCCATTGCCTAGCGTCACTTCCAAAATCAAAAGGAGATGAAGATAGCTGGTCTTTACTGATGCAGAAAATTTTGATATCAATTGATAGTC
ACTTGAATGAGGCCTTCCAAGGCATTGGTGAAGACTCAAAAGGTAGTGAAGTTGTAAGGCTGCTGATTCCACCAGGAAAAGATCCTCCACCACCATTAGGTTGTAATTCA
TTGCCAGGAGGTTCCTTTGACAAAATAACAAAGAGCTCAGAGCGATTGTTGACATCTAGTATCTCGACCTTGATGTTTTGCTGTTCTACAATGATAACAAGTTCATACCC
CAATCAGGTGGCAGTTCCAATTCGCCCTTTGTTAGCTCTTGTTGAGAGAGTGCTGATGGTGGATGGTTCTTTGCCACCCACGTCAGTGCCATTTATGACTTCTCTGCAAC
AAGAGTCAATGTGTTCAGAACTTCCAACACTGCATTCAGACAGTTTGGATCTTCTCATTGCCATCATCAAGAGTCTTCGCAGTCAATTGTTACCACATGCTGCATCTATT
GTGCGTCTCATTGTGAAGTACTTCAAGAAGTGTGTCTCTGCAGAACTGAGAGTAAAGGTCTATGCAGTTGCTAAGTTATTGATGATATCTTTGGGCGTTGGAATGGCTGC
ATCTCTTGCACGAGATGTGATTGAGAATGCACTAGGTGACTTGAACCCTGTTGATAATGAGAATGGTGCCCCATCTAGTGTGAGTTCAAAGGACACTCAAAGAGAGTTGC
TGCAACACCATAAGAAGAGAAAACGTCTTTCAGTTACCACAGAGCAGCATGAGAGGCATGGATCAGGGGACGTTACCAACAGCTATATGCATACTCCAGTCACTTTGAGG
ACAGCTGCGCTTGAGGCTTTGGAGACTCTTCTTACATTGGCTGGTGCTTTGAGATCTGAAGAAGGGTGGCGTGCAAAAATTGAACATCTTTTAATAACAGCCGCAACATC
TTCTTTTGAATGGCCACGGGCCTCAGACAACATTTTTTCCCAAACTAATGAATCTATTGAGGTTTGGACGGATTATCAGCTGGCAGCATTTCGTGCACTGCTAGCTTCAT
TTTTGTCTGCTGTCCATGTACGCCCTCTGGCTTTGGCTCAAGGTCTTGAGCTTTTCCGTAGAGGTAAACTAGAAACTGGAACTAAACTAGCTGAATTCTGTGCCCATGCT
CTTTTAGCCGTGGAGGTCCTAATACATCCAAGGGTACTTCCCCTGTTGGATTTCTTGCCCGTGCATTTGAGCTCTCCTGAACCACAAGCTACCTATAAAATTGAGGAAAA
TATGTACTTCGATGGTATGAATTCTCGCAAATTGTTGAAGATTGACACACGGGGCATGGAGCAGAGTGCCCCTGATTTGTACGACGATTTCTTGTACGATAGAGGGGTTA
CAGATGACATTGAAGAGGCTTCAATTAGAGATGCAGGTAACGAGCCAAATGATGGCGAAACAACATACAACACCGCAAATGATCCTGGCAAGGAGGCCTCTGCCAATGGC
CTGCCGTGTATAGAAACTCCCAAGAGGAGTGAGCAGGACACTGCAGCAGCAGCCATCACAGATGAAGGAGTTTGGAAAAAGATGATGTCTTTGCTAATGCAA
Protein sequence
MSGTPGRHQWLQSEDRPSCRHCCDVLKIYRESHPSQYALKHEQLSWNVLVAERNRTTDTTDRKPISHLKNRISSVNPRRSELYELRRLLRSARTRRLLNQTAMSFHARWN
SSPQLLLLLFVFLPASSSNFDSAEGLLHSRVSKLRMAAFNLVANMYDPALKPRLLHKLLREHVPDDKRTFHDHSELSKVVSIIKIHNLLSESSSSMDQKLVDNWKSAVDS
WIERLFLLLSNDMPDKCWAGIILLGVTCQQCSSSRFLTSYTEWFHKLLPHIQTDSQFLKVASCASISDLFSRLSRFQNVKKDGTSCAGKIIQPVLKLLHDENTEAVLEAA
VNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSFNMLKKLAHCLASLPKSKGDEDSWSLLMQKILISIDSHLNEAFQGIGEDSKGSEVVRLLIPPGKDPPPPLGCNS
LPGGSFDKITKSSERLLTSSISTLMFCCSTMITSSYPNQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQESMCSELPTLHSDSLDLLIAIIKSLRSQLLPHAASI
VRLIVKYFKKCVSAELRVKVYAVAKLLMISLGVGMAASLARDVIENALGDLNPVDNENGAPSSVSSKDTQRELLQHHKKRKRLSVTTEQHERHGSGDVTNSYMHTPVTLR
TAALEALETLLTLAGALRSEEGWRAKIEHLLITAATSSFEWPRASDNIFSQTNESIEVWTDYQLAAFRALLASFLSAVHVRPLALAQGLELFRRGKLETGTKLAEFCAHA
LLAVEVLIHPRVLPLLDFLPVHLSSPEPQATYKIEENMYFDGMNSRKLLKIDTRGMEQSAPDLYDDFLYDRGVTDDIEEASIRDAGNEPNDGETTYNTANDPGKEASANG
LPCIETPKRSEQDTAAAAITDEGVWKKMMSLLMQ