; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015617 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015617
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionproline-, glutamic acid- and leucine-rich protein 1
Genome locationChr02:28134282..28141505
RNA-Seq ExpressionHG10015617
SyntenyHG10015617
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR012583 - Pre-rRNA-processing protein RIX1, N-terminal
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601219.1 hypothetical protein SDJN03_06452, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0077.64Show/hide
Query:  SGSTEMAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAG
        SGS +MAAFNLVANMYDPALKPRL+HKLLREHVPDDKRAFNDH ELSKVVS+IK+HNLLSES  SMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAG
Subjt:  SGSTEMAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAG

Query:  IILLGVTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAF
        I+LLGVTCQQCSSSRFLASYTEWL +LLPH+Q                                     VIQPV+KLLHDDNTEAVLD AVNLLCTLIAF
Subjt:  IILLGVTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAF

Query:  FPFTIHRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYS
        FPFTIHRHYDSAEAAIVSKI+SGKC SNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGED KG + +RLL+PPGK+ PPPLGC S
Subjt:  FPFTIHRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYS

Query:  SSEGSLDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD-------------------------
         SE S DKIT+SSER LT SISTLM CCSTMITSSYNHQVAVPIRPLLA+V+RVL VDGSLPPTSVPFMTSLQQ+                         
Subjt:  SSEGSLDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD-------------------------

Query:  -QLLPHAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMK
         QLLPHAASIVRL++KYFKKCVS+ELRVKVYAVAKLLMMS+GVGMAA LARDVIDNALVDLNPVDNES DPSSVNPK+ Q ELLQH+KKRKRPSVPTSMK
Subjt:  -QLLPHAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMK

Query:  GQHERHEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFL
        GQHERH  GD   +SSCMSTSV+LRIAALEALETL T+AGALR+EE   AKVEHLLIT ATSSFEWP+ SDDIFF+ANE IEVW DYQL AFRALLASFL
Subjt:  GQHERHEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFL

Query:  SAVHVRPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVD
        S+VHVRPLALAQGLELFR+GKQENG+KLA+FCA ALLAMEVLIHPRVLPLS+FLPV LSSPEPQA YKFQED+YF SM SSKLLK+DTQ MEQS PEL D
Subjt:  SAVHVRPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVD

Query:  DFLYDRGVADDIEEAPIRDA-GNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDP
        +F YDR  A++IEEAPIRDA GN +++ EMT+N SND+EKEP ANGL +IETPK  EQA  AAI+E+GVVE+ DVFA      SPMSSKSNK +DF  D 
Subjt:  DFLYDRGVADDIEEAPIRDA-GNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDP

Query:  GSNLLPDDDFPDIIDADPDTDYE
        GS LL +DDFPDIIDADPDTDYE
Subjt:  GSNLLPDDDFPDIIDADPDTDYE

KAG7032014.1 hypothetical protein SDJN02_06056, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0080.48Show/hide
Query:  SGSTEMAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAG
        SGS +MAAFNLVANMYDPALKPRL+HKLLREHVPDDKRAFNDH ELSKVVS+IK+HNLLSES  SMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAG
Subjt:  SGSTEMAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAG

Query:  IILLGVTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAF
        I+LLGVTCQQCSSSRFLASYTEWL +LLPH+Q                                     VIQPV+KLLHDDNTEAVLD AVNLLCTLIAF
Subjt:  IILLGVTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAF

Query:  FPFTIHRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYS
        FPFTIHRHYDSAEAAIVSKI+SGKC SNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGED KG + +RLL+PPGK+ PPPLGC S
Subjt:  FPFTIHRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYS

Query:  SSEGSLDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--QLLPHAASIVRLIMKYFKKCVSS
         SE S DKIT+SSER LT SISTLM CCSTMITSSYNHQVAVPIRPLLA+V+RVL VDGSLPPTSVPFMTSLQQ+  QLLPHAASIVRLI+KYFKKCVS+
Subjt:  SSEGSLDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--QLLPHAASIVRLIMKYFKKCVSS

Query:  ELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITSSSCMSTSVHL
        ELRVKVYAVAKLLMMS+GVGMAA LARDVIDNALVDLNPVDNES DPSSVNPK+ QRELLQH+KKRKRPSVPTSMKGQHERH  GD   +SSCMSTSVHL
Subjt:  ELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITSSSCMSTSVHL

Query:  RIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHVRPLALAQGLELFRRGKQEN
        RIAALEALETL T+AGALR+EEG  AKVEHLLIT ATSSFEWP+ SDDIFF+ANE IEVW DYQL AFRALLASFLS+VHVRPLALAQGLELFR+GKQEN
Subjt:  RIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHVRPLALAQGLELFRRGKQEN

Query:  GTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYDRGVADDIEEAPIRDA-GNS
        G+KLA+FCA ALLAMEVLIHPRVLPLS+FLPV LSSPEPQA YKFQED+YF SM SSKLLK+DTQ MEQS PEL D+F YDR  A++IEEAPIRDA GN 
Subjt:  GTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYDRGVADDIEEAPIRDA-GNS

Query:  LDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLLPDDDFPDIIDADPDTDYE
        +++ EMT+N SND+EKEP ANGL +IETPK  EQA  AAI+E+GVVE+ DVFA      SPMSSKSNK +DF  D GS LL +DDFPDIIDADPDTDYE
Subjt:  LDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLLPDDDFPDIIDADPDTDYE

XP_022956971.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita moschata]0.0e+0078.12Show/hide
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRL+HKLLREHVPDDKRAFNDH ELSKVVS+IK+HNLLSES  SMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI
        VTCQQCSSSRFLASYTEWL +LLPH+Q                                     VIQPV+KLLHDDNTEAVLD AVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS
        HRHYDSAEAAIVSKI+SGKC SNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGED KG + +RLL+PPGK+ PPPLGC S SE S
Subjt:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS

Query:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP
         DKIT+SSER LT SISTLM CCSTMITSSYNHQVAVPIRPLLA+V+RVL VDGSLPPTSVPFMTSLQQ+                          QLLP
Subjt:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP

Query:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER
        HAASIVRLI+KYFKKCVS+ELRVKVYAVAKLLMMS+GVGMAA LARDVIDNALVDLNPVDNES DPSSVNPK+ QRELLQH+KKRKRPSVPTSMKGQHER
Subjt:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER

Query:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV
        H  GD   +SSCMSTSVHLRIAALEALETL T+AGALR+EEG  AKVEHLLIT ATSSFEWP+ SDDIFF+ANE IEVW DYQL AFRALLASFLS+VHV
Subjt:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV

Query:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD
        RPLALAQGLELFR+GKQENG+KLA+FCA ALLAMEVLIHPRVLPLS+FLPV LSSPEPQA YKFQED+YF SM SSKLLK+DTQ MEQS PEL D+F YD
Subjt:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD

Query:  RGVADDIEEAPIRDA-GNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLL
        R  A++IEEAPIRDA GN +++ EMT+N SND+EKEP ANGL +IETPK  EQA  AA++E+GVVE+ DVFA      SPMSSKS+K +DF  D GS LL
Subjt:  RGVADDIEEAPIRDA-GNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLL

Query:  PDDDFPDIIDADPDTDYE
         +DDFPDIIDADPDTDYE
Subjt:  PDDDFPDIIDADPDTDYE

XP_023517133.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0078Show/hide
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
        MAAFNLV NMYDPALKPRL+HKLLREHVPDDKRAFNDH ELSKVVS+IK+HNLLSES  SMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGI+LLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI
        VTCQQCSSSRFLASYTEWL +LLPH+Q                                     VIQPV+KLLHDDNTEAVLD AVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS
        HRHY SAEAAIVSKI+SGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGED KG + +RLL+PPGK+ PPPLGC S SE S
Subjt:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS

Query:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP
         DKIT+SSER LT SISTLM CCSTMITSSYNHQVAVPIRPLLA+VERVL VDGSLPPTSVPFMTSLQQ+                          QLLP
Subjt:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP

Query:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER
        HAASIVRLI+KYFKKCVS+ELRVKVYAVAKLLMMS+GVGMAA LARDVIDNALVDLNPVDN+S DPSSVNPK+ Q ELLQH+KKRKRPSVPTSMKGQHER
Subjt:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER

Query:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV
        H  GD   +SSCMSTSVHLRIAALEALETL T+AGALR+EEG  AKVEHLLIT ATSSFEWP+ SDDIFF+ANESIEVW DYQL AFRALLASFLSAVH+
Subjt:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV

Query:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD
        RPLALAQGLELFR+GKQENG+KLA+FCA ALLAMEVLIHPRVLPLS+FLPV LSSPEPQA YKFQED+YF SM SSKLLKVDTQ MEQS PEL D+F YD
Subjt:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD

Query:  RGVADDIEEAPIRDA-GNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLL
        R  A++IEEAPIRDA GN +++ EMT+N SND+E EP ANGL +IETPK  EQA  AAI+E+GVVE+ DVFA      SPMSSKS+K +DF  D GS LL
Subjt:  RGVADDIEEAPIRDA-GNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLL

Query:  PDDDFPDIIDADPDTDYE
         +DDFPDIIDADPDTDYE
Subjt:  PDDDFPDIIDADPDTDYE

XP_038892364.1 proline-, glutamic acid- and leucine-rich protein 1 isoform X1 [Benincasa hispida]0.0e+0084.35Show/hide
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
        MAAFNL+ANMYDPALKPRLLHKLLREHVPD KRAFNDH ELS+VVSVIK HNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI
        VTCQQCSSSRFLASYTEWL KLLPH+Q                                     VIQPV+KLLHDD+TEAVLDT+VNLLC LIAFFPFTI
Subjt:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS
         RHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLA LPKSKGDEDSWSLLMQKILLSID HLNEAFQGIGED K  +  RLLVPPGKD PP LGC S SEGS
Subjt:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS

Query:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP
        LDK+TKSSERTLTSSISTLM+CCSTMIT SYNHQVAVPIRPLLALVERVL VDGSLPPTSVPFMTSLQQ+                          QLLP
Subjt:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP

Query:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER
        HAASIVRLI+KYFKKCVS+ELRVKVYAVAK LMMS+GVGMAA LARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER
Subjt:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER

Query:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV
        HEPGDDITSSSCMST+VHLRIAALEALETL T+AGALRSEEG  AKVEHLLIT ATSS EWPR SDD+FFQAN SIEVWVDYQL AFRALLASFLSAVHV
Subjt:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV

Query:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD
        RPLALAQGLELFR+GKQENGTKLA+FCA ALLAMEVLIHPRVLPLS+FLPV LSSPEPQAAYKFQED+YF SMNSSKLLKVD QSMEQSAP+LVDDF YD
Subjt:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD

Query:  RGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLLP
        RGVADDIEEAPIRDAGN L NDEMT+NTSNDIEKEPSANGLANIETPKR EQATAAAISE+GVVEQDDVF N SMNSSPMSSKS+KIEDF+RDPGSNLLP
Subjt:  RGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLLP

Query:  DDDFPDIIDADPDTDYEE
        +DDFPDIIDADPDTDYEE
Subjt:  DDDFPDIIDADPDTDYEE

TrEMBL top hitse value%identityAlignment
A0A1S3BFV9 proline-, glutamic acid- and leucine-rich protein 10.0e+0076.43Show/hide
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
        MAAFNLVA+MYDPALKPRLLHKLLREHVPDDKRAF+D+ ELS VVS++  H+LLSESSSS DQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI
        VTCQ+CSSSRFLASYTEWL KLLPHMQ                                     VIQPV+KLLHDDNTEAVLD AVNLLCTLI FFPFTI
Subjt:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS
        HRHYDSAEAAIVSKIFSGKCSSNMLKKLA CLASLPKSKGDEDSWSLL+QKILLSI+S LNE FQGIGED KG +FVRLL+ PGKD PPPLGC SSSEGS
Subjt:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS

Query:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP
        LDKI KSSER L SSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVL +DGSLPPTSVPFMTSLQQ+                          QLLP
Subjt:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP

Query:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER
        HAASIVRL +KYFKKCVS+ELRVKVYAVAK LMMS+GVGMAA L+RDVIDN LVDLNPV+NES   S+VNPKDTQR+  QHH KRKRPSVPTSMKGQHER
Subjt:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER

Query:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV
        +EP +DIT SSC  TSVHLRIAALEAL+TL T AGALRSEEG  AK+EHLLIT ATSS EWPR SDD FFQANESI VWVDYQL AF ALLASFLSAVHV
Subjt:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV

Query:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD
        RPLALAQGLELFR+GKQENGTKL +FCA ALLAMEVLIHPRVLPLS+FLP+ LSSPEPQAA+KFQED+YF+S +S KLLKV TQSMEQ A E +      
Subjt:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD

Query:  RGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDD-VFANGSMNSSPMSSKSNKIEDFERDPGSNLL
          + DD+           LDN+EMT++ SNDIE EPSAN LANIE PKR EQ TAAAISE GVV QDD VFAN SMNSSP+SSKS KIEDF RD  SNLL
Subjt:  RGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDD-VFANGSMNSSPMSSKSNKIEDFERDPGSNLL

Query:  PDDDFPDIIDADPDTDYEE
         +DDFPDIIDADPDTDYEE
Subjt:  PDDDFPDIIDADPDTDYEE

A0A5A7SZC8 Proline-, glutamic acid-and leucine-rich protein 10.0e+0076.43Show/hide
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
        MAAFNLVA+MYDPALKPRLLHKLLREHVPDDKRAF+D+ ELS VVS++  H+LLSESSSS DQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI
        VTCQ+CSSSRFLASYTEWL KLLPHMQ                                     VIQPV+KLLHDDNTEAVLD AVNLLCTLI FFPFTI
Subjt:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS
        HRHYDSAEAAIVSKIFSGKCSSNMLKKLA CLASLPKSKGDEDSWSLL+QKILLSI+S LNE FQGIGED KG +FVRLL+ PGKD PPPLGC SSSEGS
Subjt:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS

Query:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP
        LDKI KSSER L SSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVL +DGSLPPTSVPFMTSLQQ+                          QLLP
Subjt:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP

Query:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER
        HAASIVRL +KYFKKCVS+ELRVKVYAVAK LMMS+GVGMAA L+RDVIDN LVDLNPV+NES   S+VNPKDTQR+  QHH KRKRPSVPTSMKGQHER
Subjt:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER

Query:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV
        +EP +DIT SSC  TSVHLRIAALEAL+TL T AGALRSEEG  AK+EHLLIT ATSS EWPR SDD FFQANESI VWVDYQL AF ALLASFLSAVHV
Subjt:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV

Query:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD
        RPLALAQGLELFR+GKQENGTKL +FCA ALLAMEVLIHPRVLPLS+FLP+ LSSPEPQAA+KFQED+YF+S +S KLLKV TQSMEQ A E +      
Subjt:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD

Query:  RGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDD-VFANGSMNSSPMSSKSNKIEDFERDPGSNLL
          + DD+           LDN+EMT++ SNDIE EPSAN LANIE PKR EQ TAAAISE GVV QDD VFAN SMNSSP+SSKS KIEDF RD  SNLL
Subjt:  RGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDD-VFANGSMNSSPMSSKSNKIEDFERDPGSNLL

Query:  PDDDFPDIIDADPDTDYEE
         +DDFPDIIDADPDTDYEE
Subjt:  PDDDFPDIIDADPDTDYEE

A0A6J1DBX6 proline-, glutamic acid- and leucine-rich protein 1 isoform X10.0e+0074.48Show/hide
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRLLHKLLREHVPDDKR F+DH ELS  VS+IK+HNLLSESSSS DQKLIDSWKSAVDSWV+RLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI
        VTCQQCSSSRFLASYTEWL KLLPH+Q                                     +IQPV+KLLHDDN+EAV + AVNLL TLIAFFPFT+
Subjt:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS
        HRHYDSAEAAIVSKIFSGKCS NMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSID+HLNEAFQGIGED +G + VRLL+PPGKD PPPLGC S   GS
Subjt:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS

Query:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP
         DKITKSSER LTSSISTLM CCSTMITSSY HQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQ+                          QLLP
Subjt:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP

Query:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER
        +AASIVRLI+KYFKKCVS+ELRVKVYAVAKLLMMS+GVGMAA LARDV++NAL+DLNPVDNE+  PSSVN KDTQRE +QHHKKRKRPSVPTS++ Q ER
Subjt:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER

Query:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV
        H  GD    +  MST V LRIAALEALETL T+AGALRSEEG   K+E LL T ATSSF+WPR SD+  FQ +ESIEVW DYQL AFR LLASFLSAVHV
Subjt:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV

Query:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDT-QSMEQSAPELVDDFLY
        RPLALAQGLELFRRGKQE+GTKLA+FCA ALLAMEVLIHPRVLPLS+FLPVHLSS E Q+ YKF+E+++F  +NSSK+LK+DT Q +EQSAP+L DDFL+
Subjt:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDT-QSMEQSAPELVDDFLY

Query:  DRGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLL
        +  VADDIEEAPIR+AGN +++ E T+NTSND  KE S  G ++ ETPKR EQ TAAAI+++GVVE+DD F N S+N SPMS KS+K +DFERD GSNLL
Subjt:  DRGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLL

Query:  PDDDFPDIIDADPDTDYEE
         +DDFPDIIDADPDTDYEE
Subjt:  PDDDFPDIIDADPDTDYEE

A0A6J1GXZ0 proline-, glutamic acid- and leucine-rich protein 1-like isoform X20.0e+0076.01Show/hide
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRL+HKLLREHVPDDKRAFNDH ELSKVVS+IK+HNLLSES  SMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI
        VTCQQCSSSRFLASYTEWL +LLPH+Q                                     VIQPV+KLLHDDNTEAVLD AVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS
        HRHYDSAEAAIVSKI+SGKC SNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGED KG + +RLL+PPGK+ PPPLGC S SE S
Subjt:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS

Query:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP
         DKIT+SSER LT SISTLM CCSTMITSSYNHQVAVPIRPLLA+V+RVL VDGSLPPTSVPFMTSLQQ+                          QLLP
Subjt:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP

Query:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER
        HAASIVRLI+KYFKKCVS+ELRVKVYAVAKLLMMS+GVGMAA LARDVIDNALVDLNPVDNES DPSSVNPK+ QRELLQH+KKRKRPSVPTSMKGQHER
Subjt:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER

Query:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV
        H  GD   +SSCMSTSVHLRIAALEALETL T+AGALR+EEG  AKVEHLLIT ATSSFEWP+ SDDIFF+ANE IEVW DYQL AFRALLASFLS+VHV
Subjt:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV

Query:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD
        RPLALAQGLELFR+GKQENG+KLA+FCA ALLAMEVLIHPRVLPLS+FLPV LSSPEPQA YKFQED+YF SM SSKLLK+DTQ MEQS PEL D+F YD
Subjt:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD

Query:  RGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLLP
        R  A++IEEAPIRDA                             ETPK  EQA  AA++E+GVVE+ DVFA      SPMSSKS+K +DF  D GS LL 
Subjt:  RGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLLP

Query:  DDDFPDIIDADPDTDYE
        +DDFPDIIDADPDTDYE
Subjt:  DDDFPDIIDADPDTDYE

A0A6J1GYU8 proline-, glutamic acid- and leucine-rich protein 1-like isoform X10.0e+0078.12Show/hide
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
        MAAFNLVANMYDPALKPRL+HKLLREHVPDDKRAFNDH ELSKVVS+IK+HNLLSES  SMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLG

Query:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI
        VTCQQCSSSRFLASYTEWL +LLPH+Q                                     VIQPV+KLLHDDNTEAVLD AVNLLCTLIAFFPFTI
Subjt:  VTCQQCSSSRFLASYTEWLLKLLPHMQ-------------------------------------VIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTI

Query:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS
        HRHYDSAEAAIVSKI+SGKC SNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGED KG + +RLL+PPGK+ PPPLGC S SE S
Subjt:  HRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSSEGS

Query:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP
         DKIT+SSER LT SISTLM CCSTMITSSYNHQVAVPIRPLLA+V+RVL VDGSLPPTSVPFMTSLQQ+                          QLLP
Subjt:  LDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------QLLP

Query:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER
        HAASIVRLI+KYFKKCVS+ELRVKVYAVAKLLMMS+GVGMAA LARDVIDNALVDLNPVDNES DPSSVNPK+ QRELLQH+KKRKRPSVPTSMKGQHER
Subjt:  HAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHER

Query:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV
        H  GD   +SSCMSTSVHLRIAALEALETL T+AGALR+EEG  AKVEHLLIT ATSSFEWP+ SDDIFF+ANE IEVW DYQL AFRALLASFLS+VHV
Subjt:  HEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHV

Query:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD
        RPLALAQGLELFR+GKQENG+KLA+FCA ALLAMEVLIHPRVLPLS+FLPV LSSPEPQA YKFQED+YF SM SSKLLK+DTQ MEQS PEL D+F YD
Subjt:  RPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYD

Query:  RGVADDIEEAPIRDA-GNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLL
        R  A++IEEAPIRDA GN +++ EMT+N SND+EKEP ANGL +IETPK  EQA  AA++E+GVVE+ DVFA      SPMSSKS+K +DF  D GS LL
Subjt:  RGVADDIEEAPIRDA-GNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLL

Query:  PDDDFPDIIDADPDTDYE
         +DDFPDIIDADPDTDYE
Subjt:  PDDDFPDIIDADPDTDYE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30240.1 FUNCTIONS IN: binding; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Armadillo-type fold (InterPro:IPR016024); Has 165 Blast hits to 164 proteins in 73 species: Archae - 0; Bacteria - 0; Metazoa - 47; Fungi - 68; Plants - 46; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink).3.0e-13038.53Show/hide
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSES-SSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILL
        MA+F    +M D  LKP++L  LL E+VP++K+   + L LSKVVS I  H LLSES  +S+DQKL    KSAVD WV RL  L+S+DMPDK W GI L+
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSES-SSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILL

Query:  GVTCQQCSSSRFLASYTEWLLKLLPHM---------------------------------------QVIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFP
        GVTCQ+CSS RF  SY+ W   LL H+                                       ++I P++KLL +D++EA+L+  V+LL T++  FP
Subjt:  GVTCQQCSSSRFLASYTEWLLKLLPHM---------------------------------------QVIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFP

Query:  FTIHRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSS
           H +YD  EAAI SKIFS K SSNMLKK AH LA LPK+KGDE +WSL+MQK+L+SI+ HLN  FQG+ E+ KG   ++ L PPGKD+P PLG     
Subjt:  FTIHRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSS

Query:  EGSLDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------Q
         G LD  + +SE+ + S +S LM C STM+T+SY  ++ +P+  LL+LVERVL+V+GSLP    PFMT +QQ+                          Q
Subjt:  EGSLDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------Q

Query:  LLPHAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSD-PSSVNPKDTQRELLQHHKKRKRPSVPTSMKG
        LLP+AAS+VRL+  YF+KC   ELR+K+Y++   L+ SM  GMA  LA++V+ NA VDL+    E+ D  SS NP  T   LLQ   K+++ S   +   
Subjt:  LLPHAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSD-PSSVNPKDTQRELLQHHKKRKRPSVPTSMKG

Query:  QHERHEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFE--WPRTSDDIFFQANESIEVWVDYQLVAFRALLASF
          E   P + + S       + L+IA+LEALETL T+ GAL S+  R + V++LL+T AT++ E  W   ++      N+S    V++QL A RA  AS 
Subjt:  QHERHEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFE--WPRTSDDIFFQANESIEVWVDYQLVAFRALLASF

Query:  LSAVHVRPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSME---QSAP
        +S   VRP  LA+GLELFR GK + G K+A FCA AL+++EV+IHPR LPL   LP  LS+  P++     E     ++N   ++  D   +    Q+  
Subjt:  LSAVHVRPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSME---QSAP

Query:  ELVDDFLYDRGVAD--DIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISE--------------IGVVEQDDVFANGS-
        ++  +    R +     ++E+     GN L    ++ +  +  +   S NG    + P+++ + +   +++               G  E +D+    S 
Subjt:  ELVDDFLYDRGVAD--DIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISE--------------IGVVEQDDVFANGS-

Query:  MNSSPMSSKSNKIEDFERDPGSNLLPDDDFPDIIDADPDTD
        M  + +  K   + + + DP  +L   D      D+D D +
Subjt:  MNSSPMSSKSNKIEDFERDPGSNLLPDDDFPDIIDADPDTD

AT1G30240.2 unknown protein8.5e-13338.64Show/hide
Query:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSES-SSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILL
        MA+F    +M D  LKP++L  LL E+VP++K+   + L LSKVVS I  H LLSES  +S+DQKL    KSAVD WV RL  L+S+DMPDK W GI L+
Subjt:  MAAFNLVANMYDPALKPRLLHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSES-SSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILL

Query:  GVTCQQCSSSRFLASYTEWLLKLLPHM---------------------------------------QVIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFP
        GVTCQ+CSS RF  SY+ W   LL H+                                       ++I P++KLL +D++EA+L+  V+LL T++  FP
Subjt:  GVTCQQCSSSRFLASYTEWLLKLLPHM---------------------------------------QVIQPVVKLLHDDNTEAVLDTAVNLLCTLIAFFP

Query:  FTIHRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSS
           H +YD  EAAI SKIFS K SSNMLKK AH LA LPK+KGDE +WSL+MQK+L+SI+ HLN  FQG+ E+ KG   ++ L PPGKD+P PLG     
Subjt:  FTIHRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDFVRLLVPPGKDTPPPLGCYSSS

Query:  EGSLDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------Q
         G LD  + +SE+ + S +S LM C STM+T+SY  ++ +P+  LL+LVERVL+V+GSLP    PFMT +QQ+                          Q
Subjt:  EGSLDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQD--------------------------Q

Query:  LLPHAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSD-PSSVNPKDTQRELLQHHKKRKRPSVPTSMKG
        LLP+AAS+VRL+  YF+KC   ELR+K+Y++   L+ SMG+GMA  LA++V+ NA VDL+    E+ D  SS NP  T   LLQ   K+++ S   +   
Subjt:  LLPHAASIVRLIMKYFKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSD-PSSVNPKDTQRELLQHHKKRKRPSVPTSMKG

Query:  QHERHEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFE--WPRTSDDIFFQANESIEVWVDYQLVAFRALLASF
          E   P + + S       + L+IA+LEALETL T+ GAL S+  R + V++LL+T AT++ E  W   ++      N+S    V++QL A RA  AS 
Subjt:  QHERHEPGDDITSSSCMSTSVHLRIAALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFE--WPRTSDDIFFQANESIEVWVDYQLVAFRALLASF

Query:  LSAVHVRPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSME---QSAP
        +S   VRP  LA+GLELFR GK + G K+A FCA AL+++EV+IHPR LPL   LP  LS+  P++     E     ++N   ++  D   +    Q+  
Subjt:  LSAVHVRPLALAQGLELFRRGKQENGTKLADFCAFALLAMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSME---QSAP

Query:  ELVDDFLYDRGVAD--DIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISE--------------IGVVEQDDVFANGS-
        ++  +    R +     ++E+     GN L    ++ +  +  +   S NG    + P+++ + +   +++               G  E +D+    S 
Subjt:  ELVDDFLYDRGVAD--DIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLANIETPKRIEQATAAAISE--------------IGVVEQDDVFANGS-

Query:  MNSSPMSSKSNKIEDFERDPGSNLLPDDDFPDIIDADPDTD
        M  + +  K   + + + DP  +L   D      D+D D +
Subjt:  MNSSPMSSKSNKIEDFERDPGSNLLPDDDFPDIIDADPDTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCTGCAGTTAAGCGGCGGCGAGAAGCAACGAGTGGCTCTAGCTCGAGCATTCTTGAAAGCACCTTTCTATTCTTCTTTCCTATTTCTCAAAACCGAAAATCGAAC
CGCACCGAAACCAAACCGAATTATCGGATTCAGTTTGACTCCCAAACCGAACCGAACCGTGAACACCCGCTCTGACTCCTTTTCGTGTTCCTCTCGGCTCCATATTTTTC
ATCGTCAATCTTTGATTCAGCCGAAGAAGGCTCGCGGAAGTGGTTCAACCGAAATGGCGGCCTTTAATCTCGTTGCCAATATGTATGACCCGGCTTTGAAGCCTCGCTTG
CTCCACAAACTTCTTAGGGAGCACGTTCCTGACGATAAGCGGGCGTTTAATGATCATTTGGAACTCTCAAAGGTGGTTTCTGTGATCAAAATGCACAATCTCCTCTCTGA
ATCCTCGTCTTCCATGGACCAAAAGCTGATCGATAGCTGGAAATCCGCGGTTGATTCCTGGGTCAACCGCTTGTTTCTTCTTCTCTCCAATGATATGCCTGATAAATGTT
GGGCGGGAATCATTTTACTCGGAGTGACTTGTCAACAATGCAGCTCTAGTCGTTTCTTGGCATCATATACAGAGTGGCTTCTCAAGCTTTTACCTCACATGCAGGTCATT
CAACCAGTTGTTAAGCTGTTGCATGATGATAATACAGAAGCAGTTTTGGACACTGCAGTTAATTTATTATGCACTCTGATAGCTTTCTTCCCCTTTACAATCCATCGTCA
TTATGACTCTGCCGAAGCTGCAATTGTTTCAAAAATATTTTCAGGAAAGTGTAGTTCTAACATGCTGAAGAAGCTTGCCCATTGCCTGGCATCTCTTCCAAAATCAAAAG
GAGATGAAGATAGCTGGTCTTTACTAATGCAGAAGATTTTGTTATCCATCGATAGTCACTTGAATGAGGCCTTCCAAGGCATTGGTGAAGATCCAAAAGGCGAGGATTTT
GTAAGGTTACTGGTTCCCCCAGGAAAAGATACTCCACCACCCTTAGGTTGTTATTCATCGTCAGAAGGTTCCTTAGACAAAATAACAAAGAGCTCAGAGCGAACATTAAC
ATCTAGTATTTCAACCTTGATGGTTTGCTGTTCCACAATGATCACAAGTTCATACAACCATCAGGTGGCAGTTCCAATTCGCCCGTTATTAGCTCTTGTTGAGAGAGTGC
TGATGGTGGATGGTTCTTTGCCACCCACTTCAGTGCCATTTATGACATCTCTGCAGCAAGATCAATTGTTACCACATGCTGCATCTATTGTACGACTCATCATGAAGTAC
TTCAAGAAGTGTGTCTCTTCAGAACTGAGAGTAAAAGTCTACGCAGTTGCTAAGTTATTGATGATGTCTATGGGCGTTGGAATGGCTGCATTTCTTGCACGAGATGTGAT
TGACAATGCACTAGTTGATTTAAACCCTGTTGATAATGAGAGTTCTGATCCATCTAGTGTGAATCCAAAGGACACACAAAGAGAATTGCTGCAACACCATAAGAAGAGAA
AACGTCCTTCAGTTCCCACTTCCATGAAAGGGCAGCACGAGAGGCATGAACCAGGGGACGACATTACCAGCAGCAGCTGTATGTCTACCTCAGTCCACTTGAGGATAGCT
GCACTTGAGGCTTTGGAGACTCTTTTTACAGTGGCTGGTGCTTTGAGATCTGAAGAAGGGCGGCATGCAAAAGTCGAACATCTTTTAATAACAGTTGCAACATCTTCTTT
TGAATGGCCACGAACCTCAGACGACATCTTTTTCCAAGCTAATGAATCTATTGAGGTTTGGGTGGATTATCAGTTGGTGGCATTTCGTGCACTACTGGCTTCATTTTTGT
CTGCTGTCCATGTACGCCCTCTGGCTTTGGCTCAAGGTCTTGAGCTCTTCCGTAGAGGTAAACAAGAAAATGGAACTAAACTTGCCGACTTCTGTGCCTTTGCTCTCTTA
GCCATGGAGGTCCTAATACATCCGAGGGTCCTTCCCCTCTCGAATTTCTTGCCGGTGCACTTGAGCTCTCCTGAACCACAAGCTGCCTATAAATTCCAGGAAGACGTATA
CTTCAGTAGTATGAATTCTAGCAAACTGTTGAAAGTCGACACGCAAAGCATGGAGCAGAGTGCCCCTGAGTTGGTCGATGATTTCTTGTATGATAGAGGAGTTGCAGATG
ACATTGAAGAGGCTCCAATTAGAGATGCAGGTAATTCGCTAGATAACGATGAAATGACATTTAACACCTCAAATGATATCGAAAAGGAGCCCTCTGCAAATGGCCTGGCG
AATATAGAAACGCCCAAGAGGATCGAGCAGGCCACTGCAGCAGCCATCTCAGAAATAGGGGTTGTAGAACAAGATGATGTCTTTGCTAATGGGAGTATGAATAGTTCCCC
CATGTCATCAAAATCCAATAAAATTGAAGATTTCGAACGTGATCCGGGATCGAATTTGTTGCCAGATGATGATTTCCCTGATATTATTGATGCAGATCCTGATACAGACT
ATGAAGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCTGCAGTTAAGCGGCGGCGAGAAGCAACGAGTGGCTCTAGCTCGAGCATTCTTGAAAGCACCTTTCTATTCTTCTTTCCTATTTCTCAAAACCGAAAATCGAAC
CGCACCGAAACCAAACCGAATTATCGGATTCAGTTTGACTCCCAAACCGAACCGAACCGTGAACACCCGCTCTGACTCCTTTTCGTGTTCCTCTCGGCTCCATATTTTTC
ATCGTCAATCTTTGATTCAGCCGAAGAAGGCTCGCGGAAGTGGTTCAACCGAAATGGCGGCCTTTAATCTCGTTGCCAATATGTATGACCCGGCTTTGAAGCCTCGCTTG
CTCCACAAACTTCTTAGGGAGCACGTTCCTGACGATAAGCGGGCGTTTAATGATCATTTGGAACTCTCAAAGGTGGTTTCTGTGATCAAAATGCACAATCTCCTCTCTGA
ATCCTCGTCTTCCATGGACCAAAAGCTGATCGATAGCTGGAAATCCGCGGTTGATTCCTGGGTCAACCGCTTGTTTCTTCTTCTCTCCAATGATATGCCTGATAAATGTT
GGGCGGGAATCATTTTACTCGGAGTGACTTGTCAACAATGCAGCTCTAGTCGTTTCTTGGCATCATATACAGAGTGGCTTCTCAAGCTTTTACCTCACATGCAGGTCATT
CAACCAGTTGTTAAGCTGTTGCATGATGATAATACAGAAGCAGTTTTGGACACTGCAGTTAATTTATTATGCACTCTGATAGCTTTCTTCCCCTTTACAATCCATCGTCA
TTATGACTCTGCCGAAGCTGCAATTGTTTCAAAAATATTTTCAGGAAAGTGTAGTTCTAACATGCTGAAGAAGCTTGCCCATTGCCTGGCATCTCTTCCAAAATCAAAAG
GAGATGAAGATAGCTGGTCTTTACTAATGCAGAAGATTTTGTTATCCATCGATAGTCACTTGAATGAGGCCTTCCAAGGCATTGGTGAAGATCCAAAAGGCGAGGATTTT
GTAAGGTTACTGGTTCCCCCAGGAAAAGATACTCCACCACCCTTAGGTTGTTATTCATCGTCAGAAGGTTCCTTAGACAAAATAACAAAGAGCTCAGAGCGAACATTAAC
ATCTAGTATTTCAACCTTGATGGTTTGCTGTTCCACAATGATCACAAGTTCATACAACCATCAGGTGGCAGTTCCAATTCGCCCGTTATTAGCTCTTGTTGAGAGAGTGC
TGATGGTGGATGGTTCTTTGCCACCCACTTCAGTGCCATTTATGACATCTCTGCAGCAAGATCAATTGTTACCACATGCTGCATCTATTGTACGACTCATCATGAAGTAC
TTCAAGAAGTGTGTCTCTTCAGAACTGAGAGTAAAAGTCTACGCAGTTGCTAAGTTATTGATGATGTCTATGGGCGTTGGAATGGCTGCATTTCTTGCACGAGATGTGAT
TGACAATGCACTAGTTGATTTAAACCCTGTTGATAATGAGAGTTCTGATCCATCTAGTGTGAATCCAAAGGACACACAAAGAGAATTGCTGCAACACCATAAGAAGAGAA
AACGTCCTTCAGTTCCCACTTCCATGAAAGGGCAGCACGAGAGGCATGAACCAGGGGACGACATTACCAGCAGCAGCTGTATGTCTACCTCAGTCCACTTGAGGATAGCT
GCACTTGAGGCTTTGGAGACTCTTTTTACAGTGGCTGGTGCTTTGAGATCTGAAGAAGGGCGGCATGCAAAAGTCGAACATCTTTTAATAACAGTTGCAACATCTTCTTT
TGAATGGCCACGAACCTCAGACGACATCTTTTTCCAAGCTAATGAATCTATTGAGGTTTGGGTGGATTATCAGTTGGTGGCATTTCGTGCACTACTGGCTTCATTTTTGT
CTGCTGTCCATGTACGCCCTCTGGCTTTGGCTCAAGGTCTTGAGCTCTTCCGTAGAGGTAAACAAGAAAATGGAACTAAACTTGCCGACTTCTGTGCCTTTGCTCTCTTA
GCCATGGAGGTCCTAATACATCCGAGGGTCCTTCCCCTCTCGAATTTCTTGCCGGTGCACTTGAGCTCTCCTGAACCACAAGCTGCCTATAAATTCCAGGAAGACGTATA
CTTCAGTAGTATGAATTCTAGCAAACTGTTGAAAGTCGACACGCAAAGCATGGAGCAGAGTGCCCCTGAGTTGGTCGATGATTTCTTGTATGATAGAGGAGTTGCAGATG
ACATTGAAGAGGCTCCAATTAGAGATGCAGGTAATTCGCTAGATAACGATGAAATGACATTTAACACCTCAAATGATATCGAAAAGGAGCCCTCTGCAAATGGCCTGGCG
AATATAGAAACGCCCAAGAGGATCGAGCAGGCCACTGCAGCAGCCATCTCAGAAATAGGGGTTGTAGAACAAGATGATGTCTTTGCTAATGGGAGTATGAATAGTTCCCC
CATGTCATCAAAATCCAATAAAATTGAAGATTTCGAACGTGATCCGGGATCGAATTTGTTGCCAGATGATGATTTCCCTGATATTATTGATGCAGATCCTGATACAGACT
ATGAAGAGTGA
Protein sequenceShow/hide protein sequence
MSLQLSGGEKQRVALARAFLKAPFYSSFLFLKTENRTAPKPNRIIGFSLTPKPNRTVNTRSDSFSCSSRLHIFHRQSLIQPKKARGSGSTEMAAFNLVANMYDPALKPRL
LHKLLREHVPDDKRAFNDHLELSKVVSVIKMHNLLSESSSSMDQKLIDSWKSAVDSWVNRLFLLLSNDMPDKCWAGIILLGVTCQQCSSSRFLASYTEWLLKLLPHMQVI
QPVVKLLHDDNTEAVLDTAVNLLCTLIAFFPFTIHRHYDSAEAAIVSKIFSGKCSSNMLKKLAHCLASLPKSKGDEDSWSLLMQKILLSIDSHLNEAFQGIGEDPKGEDF
VRLLVPPGKDTPPPLGCYSSSEGSLDKITKSSERTLTSSISTLMVCCSTMITSSYNHQVAVPIRPLLALVERVLMVDGSLPPTSVPFMTSLQQDQLLPHAASIVRLIMKY
FKKCVSSELRVKVYAVAKLLMMSMGVGMAAFLARDVIDNALVDLNPVDNESSDPSSVNPKDTQRELLQHHKKRKRPSVPTSMKGQHERHEPGDDITSSSCMSTSVHLRIA
ALEALETLFTVAGALRSEEGRHAKVEHLLITVATSSFEWPRTSDDIFFQANESIEVWVDYQLVAFRALLASFLSAVHVRPLALAQGLELFRRGKQENGTKLADFCAFALL
AMEVLIHPRVLPLSNFLPVHLSSPEPQAAYKFQEDVYFSSMNSSKLLKVDTQSMEQSAPELVDDFLYDRGVADDIEEAPIRDAGNSLDNDEMTFNTSNDIEKEPSANGLA
NIETPKRIEQATAAAISEIGVVEQDDVFANGSMNSSPMSSKSNKIEDFERDPGSNLLPDDDFPDIIDADPDTDYEE