CuGenDBv2

Carg19394 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg19394
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Cysteine protease
Genome location: Carg_Chr19:6862225..6864524
RNA-Seq Expression: Carg19394
Synteny: Carg19394
Gene Ontology terms:
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domains:
IPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase
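The Genome location field above appears to follow a `seqid:start..end` pattern. As a minimal sketch, assuming both coordinates are 1-based and inclusive (the page does not state this), the span can be split and measured as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(loc: str):
    """Split a 'seqid:start..end' string into (seqid, start, end, length)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return seqid, start, end, end - start + 1


seqid, start, end, length = parse_location("Carg_Chr19:6862225..6864524")
print(seqid, start, end, length)  # Carg_Chr19 6862225 6864524 2300
```

Under that assumption the gene spans 2,300 bp on Carg_Chr19.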


Homology
GenBank top hits (e value, % identity, alignment)
KAG6572056.1 Cysteine protease RD19A, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.7e-137; % identity: 82.17)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST
        MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST

Query:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL
        IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQ                                      
Subjt:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL

Query:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEHLVHSYVQRGWIMEFCWWVMA
                      CDPEEAGSCDSGCNGGLMN AFEYTLKAGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEHLVHSYVQRGWIMEFCWWVMA
Subjt:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEHLVHSYVQRGWIMEFCWWVMA

Query:  QLTIGSSKNSWGEK
        QLTIGSSK + GEK
Subjt:  QLTIGSSKNSWGEK

KAG7011723.1 Cysteine protease RD19A, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 6.3e-201; % identity: 100)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST
        MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST

Query:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL
        IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL
Subjt:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL

Query:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEHLVHSYVQRGWIMEFCWWVMA
        VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEHLVHSYVQRGWIMEFCWWVMA
Subjt:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEHLVHSYVQRGWIMEFCWWVMA

Query:  QLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVHTPIAAAGQ
        QLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVHTPIAAAGQ
Subjt:  QLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVHTPIAAAGQ

XP_004148554.2 cysteine protease RD19A [Cucumis sativus] (e value: 3.2e-120; % identity: 63.59)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVV-NDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS
        MDR+FFL AVI A   ATLCSSEPL S HSVE DGD LIRQVV NDGDF    LGAEHHFSLFKRRFGKSYATEEEHD RFKIFKANMR+A   QSFDPS
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVV-NDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS

Query:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE
         IHGVTQFSDLTP EFR+AFLGLR +RLRLPVDTN A ILPTENLPI FDWR+ G VT VKNQ                  WSFSTTGALEGANFLATGE
Subjt:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE

Query:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVI-SLDEDQIAANLVEH------
        LVSLSEQQLVDCDHECDPEE  +CDSGCNGGLMN AFEYTLK            AG DR  C  D+SKIAAS+A+FSV+ S+DEDQIAANLV++      
Subjt:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVI-SLDEDQIAANLVEH------

Query:  ---------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH
                                   L+  Y   G+    + +  +W++      S   SWGE GYY+ICRGRNICGVDSLVSTVAAVH
Subjt:  ---------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH

XP_008448073.1 PREDICTED: cysteine proteinase RD19a-like [Cucumis melo] (e value: 2.5e-117; % identity: 62.31)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVV-NDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS
        MD +FFL AVI    AATLC SE L SPHSV+ D D  IRQVV +DGDF R  LGAEHHFSLFKRRFGKSYATEEEHD RFKIFKANMR+A   QSFDPS
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVV-NDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS

Query:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE
         IHG+TQFSDLTP EFR+AFLGLR +RLRLPVDTN A ILPTENLPI FDWRERGAVT VKNQ                  WSFSTTGA+EGANFLATG+
Subjt:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE

Query:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVI-SLDEDQIAANLVEH------
        LVSLSEQQLVDCDHECDPEE  +CDSGCNGGLMN AFEYTLKA            G D   C  D+SKIAAS+ANFSV+ SLDEDQIAANLV++      
Subjt:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVI-SLDEDQIAANLVEH------

Query:  ---------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH
                                   L+  Y   G+    + +  +W++      S   +WGE GYY+ICRGRNICGVDSLVSTVAAVH
Subjt:  ---------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH

XP_038887474.1 probable cysteine protease RD19B [Benincasa hispida] (e value: 2.1e-124; % identity: 64.75)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDF-KRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS
        MD SF L AVIAA A ATLCSSE L SP S+E DGD LIRQVV+DG+F  RLPLGAEHHFSLFK+RFGKSYATEEEHD RFKIF+ANMR+A   QSFDPS
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDF-KRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS

Query:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE
         IHG+TQFSDLTP EFR+AFLGLR +RLRLPVDTN A ILPTENLPI FDWRERGAVT VKNQ                  WSFSTTGALEGANFLATGE
Subjt:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE

Query:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH-------
        LVSLSEQQLVDCDHECDPEEA SCDSGCNGGLMN AFEYTLKA            G DR  C  D+SKIAASVANFSV+SLDEDQIAANLV++       
Subjt:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH-------

Query:  --------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH---TPIAAAGQ
                                  L+  Y   G+    + +  +W++      S   +WGE GYY+ICRGRNICGVDSLVSTVAAVH   T IAAAGQ
Subjt:  --------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH---TPIAAAGQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K2B5 Papain-like cysteine proteinase isoform I (e value: 1.5e-120; % identity: 63.59)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVV-NDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS
        MDR+FFL AVI A   ATLCSSEPL S HSVE DGD LIRQVV NDGDF    LGAEHHFSLFKRRFGKSYATEEEHD RFKIFKANMR+A   QSFDPS
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVV-NDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS

Query:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE
         IHGVTQFSDLTP EFR+AFLGLR +RLRLPVDTN A ILPTENLPI FDWR+ G VT VKNQ                  WSFSTTGALEGANFLATGE
Subjt:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE

Query:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVI-SLDEDQIAANLVEH------
        LVSLSEQQLVDCDHECDPEE  +CDSGCNGGLMN AFEYTLK            AG DR  C  D+SKIAAS+A+FSV+ S+DEDQIAANLV++      
Subjt:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVI-SLDEDQIAANLVEH------

Query:  ---------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH
                                   L+  Y   G+    + +  +W++      S   SWGE GYY+ICRGRNICGVDSLVSTVAAVH
Subjt:  ---------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH

A0A1S3BIU9 cysteine proteinase RD19a-like (e value: 1.2e-117; % identity: 62.31)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVV-NDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS
        MD +FFL AVI    AATLC SE L SPHSV+ D D  IRQVV +DGDF R  LGAEHHFSLFKRRFGKSYATEEEHD RFKIFKANMR+A   QSFDPS
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVV-NDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS

Query:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE
         IHG+TQFSDLTP EFR+AFLGLR +RLRLPVDTN A ILPTENLPI FDWRERGAVT VKNQ                  WSFSTTGA+EGANFLATG+
Subjt:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE

Query:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVI-SLDEDQIAANLVEH------
        LVSLSEQQLVDCDHECDPEE  +CDSGCNGGLMN AFEYTLKA            G D   C  D+SKIAAS+ANFSV+ SLDEDQIAANLV++      
Subjt:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVI-SLDEDQIAANLVEH------

Query:  ---------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH
                                   L+  Y   G+    + +  +W++      S   +WGE GYY+ICRGRNICGVDSLVSTVAAVH
Subjt:  ---------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH

A0A5A7SS65 Cysteine proteinase RD19a-like (e value: 3.4e-112; % identity: 61.03)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVV-NDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS
        MD +FFL AVI    AATLC SE L SPHSV+ D D  IRQVV +D DF R  LGAEHHFSLFKRRFGKSYATEEEHD RFKIFKANMR+A   QSFDPS
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVV-NDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS

Query:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE
         IHG+TQFSDLTP EFR+AFLGLR +RLRLPVDTN A ILPTENLPI FDWRERGAVT VKNQ                  WSFSTTGA+EGANFLATG+
Subjt:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE

Query:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVI-SLDEDQIAANLVEH------
        LVSLSEQQLVDCDH    EE  +CDSGCNGGLMN AFEYTLKA            G D   C  D+SKIAAS+ANFSV+ SLDEDQIAANLV++      
Subjt:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVI-SLDEDQIAANLVEH------

Query:  ---------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH
                                   L+  Y   G+    + +  +W++      S   +WGE GYY+ICRGRNICGVDSLVSTVAAVH
Subjt:  ---------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVH

A0A6J1F4R7 probable cysteine protease RD19B (e value: 2.3e-116; % identity: 61.11)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST
        MDRSFFL +V+ A AAA LCSSE LAS +SV      LIRQV++DG+   LPL AEHHF LFKR+FGKSYATEEEH+ RF+IFKANMR+AL  QSFDPS 
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST

Query:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL
        IHGVTQFSDLT SEF++ FLGLR +RL+LP+D N A ILPTENLP  FDWRERGAVT VKNQ                  WSFSTTGALEGANFLATGEL
Subjt:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL

Query:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH--------
        VSLSEQQLVDCDHECD EEAGSCDSGCNGGLMN A EYTLK             G DRE CK DRSKIAASVANFSV+SLDEDQIAANLV++        
Subjt:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH--------

Query:  -------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVHTPIAAAGQ
                                 L+  Y   G+    + +  +W++      S   +WGE GYYRIC+GRNICGVDSLVSTVAAV TPI A  +
Subjt:  -------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVHTPIAAAGQ

A0A6J1I0L8 probable cysteine protease RD19B (e value: 1.9e-115; % identity: 60.61)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST
        MDRSF L  V+   AAA LCSSE LAS +SV      LIRQV++DG+   LPL AEHHF LFKR+FGKSYATEEEH+ RF+IF+ANMR+AL  QSFDPS 
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST

Query:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL
        IHGVTQFSDLT SEF++ FLGLR +RL+LP+  N A ILPTENLP  FDWRERGAVT VKNQ                  WSFSTTGALEGANFLATGEL
Subjt:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL

Query:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH--------
        VSLSEQQLVDCDHECD EEAGSCDSGCNGGLMN A EYTLK             G DRE CK DRSKIAASVANFSV+SLDEDQIAANLV++        
Subjt:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH--------

Query:  -------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVHTPIAAAGQ
                                 L+  Y   G+    + +  +W++      S   +WGE GYYRIC+GRNICGVDSLVSTVAAVHTPIAA  +
Subjt:  -------------------------LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAAVHTPIAAAGQ

SwissProt top hits (e value, % identity, alignment)
P25804 Cysteine proteinase 15A (e value: 2.6e-88; % identity: 50)
Query:  MDRSF-FLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS
        MDR F F L + AA A A    +           + D +IRQVV D +   L L AEHHF+ FK +F KSYAT+EEHD RF +FK+N+ +A   Q+ DP+
Subjt:  MDRSF-FLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPS

Query:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE
          HG+T+FSDLT SEFR  FLGL++ RLRLP     A ILPT NLP  FDWRE+GAVT VK+Q                  W+FSTTGALEGA++LATG+
Subjt:  TIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGE

Query:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAG----------CDRE-ACKLDRSKIAASVANFSVISLDEDQIAANLVEH-LVHSYVQ
        LVSLSEQQLVDCDH CDPE+AGSCDSGCNGGLMN AFEY L++G            R+ +CK D+SK+ ASV+NFSV++LDEDQIAANLV++  +   + 
Subjt:  LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAG----------CDRE-ACKLDRSKIAASVANFSVISLDEDQIAANLVEH-LVHSYVQ

Query:  RGWIMEF-----CWWVMAQ-------LTIGSSK---------------------NSWGEKGYYRICRGRNICGVDSLVSTVAAVHT
          W+  +     C +V A+       L +G  K                      +WGE+GYY+ICRGRN+CGVDS+VSTVAA  +
Subjt:  RGWIMEF-----CWWVMAQ-------LTIGSSK---------------------NSWGEKGYYRICRGRNICGVDSLVSTVAAVHT

P43295 Probable cysteine protease RD19B (e value: 3.6e-90; % identity: 53.22)
Query:  SVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRRNRLRL
        SV  D D LIRQVV++ + K   L +E HF+LFK++FGK Y + EEH  RF +FKAN+ +A+  Q  DPS  HGVTQFSDLT SEFR   LG+ +   +L
Subjt:  SVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRRNRLRL

Query:  PVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGCNG
        P D N A ILPT+NLP  FDWR+RGAVT VKNQ                  WSFSTTGALEGA+FLATG+LVSLSEQQLVDCDHECDPEE GSCDSGCNG
Subjt:  PVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGCNG

Query:  GLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH---------------------------------LVHS
        GLMN AFEYTLK             G D  +CKLDRSKI ASV+NFSV+S++EDQIAANL+++                                 L+  
Subjt:  GLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH---------------------------------LVHS

Query:  YVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAA
        Y   G+    + E  +W++      S   SWGE G+Y+IC+GRNICGVDSLVSTVAA
Subjt:  YVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAA

P43296 Cysteine protease RD19A (e value: 3.2e-91; % identity: 54.8)
Query:  DGDTL-IRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVD
        DGD L IRQVV  G  +   L +E HFSLFKR+FGK YA+ EEHD RF +FKAN+R+A   Q  DPS  HGVTQFSDLT SEFR+  LG+ R+  +LP D
Subjt:  DGDTL-IRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVD

Query:  TNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM
         N A ILPTENLP  FDWR+ GAVT VKNQ                  WSFS TGALEGANFLATG+LVSLSEQQLVDCDHECDPEEA SCDSGCNGGLM
Subjt:  TNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM

Query:  NIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH---------------------------------LVHSYVQ
        N AFEYTLK             G D + CKLD+SKI ASV+NFSVIS+DE+QIAANLV++                                 L+  Y  
Subjt:  NIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH---------------------------------LVHSYVQ

Query:  RGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAA
         G+      E  +W++      S   +WGE G+Y+IC+GRNICGVDS+VSTVAA
Subjt:  RGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAA

Q10716 Cysteine proteinase 1 (e value: 1.9e-75; % identity: 44.39)
Query:  RSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIH
        R   LL++ +A A A    +E            D LIRQVV  GD   L L AE HF  F +RFGKSY   +EH  R  +FK N+R+A   Q  DPS  H
Subjt:  RSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIH

Query:  GVTQFSDLTPSEFREAFLGLRRNR----LRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATG
        GVT+FSDLTP+EFR  +LGLR++R      L    + A +LPT+ LP  FDWR+ GAV  VKNQ                  WSFS +GALEGA++LATG
Subjt:  GVTQFSDLTPSEFREAFLGLRRNR----LRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATG

Query:  ELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAG---CDRE--------ACKLDRSKIAASVANFSVISLDEDQIAANLVE--------
        +L  LSEQQ VDCDHECD  E  SCDSGCNGGLM  AF Y  KAG    +++         CK D+SKI ASV NFSV+S+DE QI+ANL++        
Subjt:  ELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAG---CDRE--------ACKLDRSKIAASVANFSVISLDEDQIAANLVE--------

Query:  --------------------HLVHSYVQRGW---------IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRG---RNICGVDSLVSTVAAVH
                            HL H  +  G+         + +  +W++      S   +WGE GYY+ICRG   RN CGVDS+VSTV+AVH
Subjt:  --------------------HLVHSYVQRGW---------IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRG---RNICGVDSLVSTVAAVH

Q9SUL1 Probable cysteine protease RD19C (e value: 3.8e-92; % identity: 51.81)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST
        MDR  F   + A   A +L S+        V       IRQVV + + ++L L AEHHF+LFK ++ K+YAT+ EHD RF++FKAN+R+A   Q  DPS 
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST

Query:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL
        +HGVTQFSDLTP EFR  FLGL+R   RLP DT TA ILPT +LP  FDWRE+GAVT VKNQ             +    WSFS  GALEGA+FLAT EL
Subjt:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL

Query:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH-LVHSYVQ
        VSLSEQQLVDCDHECDP +A SCDSGC+GGLMN AFEY LKA            G D  ACK D+SKI ASV+NFSV+S DEDQIAANLV+H  +   + 
Subjt:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH-LVHSYVQ

Query:  RGWIMEF-----CWWVMAQ--------LTIGSS---------------KNS----WGEKGYYRICRG-RNICGVDSLVSTVAAVHT
          W+  +     C +V ++        +  GSS               KNS    WGE GYY+ICRG  N+CG+D++VSTVAAVHT
Subjt:  RGWIMEF-----CWWVMAQ--------LTIGSS---------------KNS----WGEKGYYRICRG-RNICGVDSLVSTVAAVHT

Arabidopsis top hits (e value, % identity, alignment)
AT1G20850.1 xylem cysteine peptidase 2 (e value: 4.9e-26; % identity: 37.78)
Query:  FGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATIL--PTENLPIHFDWRERGAVTAVKNQD
        F K+Y T EE  LRF++FK N++          S   G+ +F+DL+  EF++ +LGL+ + +R   + + A       E +P   DWR++GAV  VKNQ 
Subjt:  FGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATIL--PTENLPIHFDWRERGAVTAVKNQD

Query:  IRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAG
                         W+FST  A+EG N + TG L +LSEQ+L+DCD         + ++GCNGGLM+ AFEY +K G
Subjt:  IRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKAG

AT2G21430.1 Papain family cysteine protease (e value: 2.5e-91; % identity: 53.22)
Query:  SVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRRNRLRL
        SV  D D LIRQVV++ + K   L +E HF+LFK++FGK Y + EEH  RF +FKAN+ +A+  Q  DPS  HGVTQFSDLT SEFR   LG+ +   +L
Subjt:  SVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRRNRLRL

Query:  PVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGCNG
        P D N A ILPT+NLP  FDWR+RGAVT VKNQ                  WSFSTTGALEGA+FLATG+LVSLSEQQLVDCDHECDPEE GSCDSGCNG
Subjt:  PVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGCNG

Query:  GLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH---------------------------------LVHS
        GLMN AFEYTLK             G D  +CKLDRSKI ASV+NFSV+S++EDQIAANL+++                                 L+  
Subjt:  GLMNIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH---------------------------------LVHS

Query:  YVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAA
        Y   G+    + E  +W++      S   SWGE G+Y+IC+GRNICGVDSLVSTVAA
Subjt:  YVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAA

AT3G54940.2 Papain family cysteine protease (e value: 3.9e-60; % identity: 39.34)
Query:  HSVECDGDTLIRQVVNDGDFKR---LPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRR-
        H V    D  IRQV  D    R   L    E  F LF   +GK+Y+T EE+  R  IF  N+ +A   Q  DPS +HGVTQFSDLT  EF+  + G+   
Subjt:  HSVECDGDTLIRQVVNDGDFKR---LPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRR-

Query:  NRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCD
           R       A ++  + LP  FDWRE+G VT VKNQ                  W+FSTTGA EGA+F++TG+L+SLSEQQLVDCD  CDP++  +CD
Subjt:  NRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCD

Query:  SGCNGGLMNIAFEYTLKAG-----------CDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH---------------------------------
        +GC GGLM  A+EY ++AG             R  CK D  K+A  V NF+ I LDE+QIAANLV H                                 
Subjt:  SGCNGGLMNIAFEYTLKAG-----------CDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH---------------------------------

Query:  -LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVA
         L+  Y  +G+    +    +W++      S    WGE GYY++CRG +ICG++S+VS VA
Subjt:  -LVHSYVQRGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVA

AT4G16190.1 Papain family cysteine protease (e value: 2.7e-93; % identity: 51.81)
Query:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST
        MDR  F   + A   A +L S+        V       IRQVV + + ++L L AEHHF+LFK ++ K+YAT+ EHD RF++FKAN+R+A   Q  DPS 
Subjt:  MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPST

Query:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL
        +HGVTQFSDLTP EFR  FLGL+R   RLP DT TA ILPT +LP  FDWRE+GAVT VKNQ             +    WSFS  GALEGA+FLAT EL
Subjt:  IHGVTQFSDLTPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGEL

Query:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH-LVHSYVQ
        VSLSEQQLVDCDHECDP +A SCDSGC+GGLMN AFEY LKA            G D  ACK D+SKI ASV+NFSV+S DEDQIAANLV+H  +   + 
Subjt:  VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNIAFEYTLKA------------GCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH-LVHSYVQ

Query:  RGWIMEF-----CWWVMAQ--------LTIGSS---------------KNS----WGEKGYYRICRG-RNICGVDSLVSTVAAVHT
          W+  +     C +V ++        +  GSS               KNS    WGE GYY+ICRG  N+CG+D++VSTVAAVHT
Subjt:  RGWIMEF-----CWWVMAQ--------LTIGSS---------------KNS----WGEKGYYRICRG-RNICGVDSLVSTVAAVHT

AT4G39090.1 Papain family cysteine protease (e value: 2.3e-92; % identity: 54.8)
Query:  DGDTL-IRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVD
        DGD L IRQVV  G  +   L +E HFSLFKR+FGK YA+ EEHD RF +FKAN+R+A   Q  DPS  HGVTQFSDLT SEFR+  LG+ R+  +LP D
Subjt:  DGDTL-IRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDLTPSEFREAFLGLRRNRLRLPVD

Query:  TNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM
         N A ILPTENLP  FDWR+ GAVT VKNQ                  WSFS TGALEGANFLATG+LVSLSEQQLVDCDHECDPEEA SCDSGCNGGLM
Subjt:  TNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM

Query:  NIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH---------------------------------LVHSYVQ
        N AFEYTLK             G D + CKLD+SKI ASV+NFSVIS+DE+QIAANLV++                                 L+  Y  
Subjt:  NIAFEYTLK------------AGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEH---------------------------------LVHSYVQ

Query:  RGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAA
         G+      E  +W++      S   +WGE G+Y+IC+GRNICGVDS+VSTVAA
Subjt:  RGW----IMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDSLVSTVAA
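In the alignment blocks above, the unlabeled middle line repeats a residue where query and subject agree, shows `+` for a similar residue, and is blank otherwise. As a rough sketch of how a per-block identity could be tallied from that line (the helper `identity_from_midline` is hypothetical, and the search tool's reported % identity is computed over the whole alignment, so a single block will generally not reproduce it):

```python
def identity_from_midline(midline: str, aligned_length: int) -> float:
    """Percent identity for one alignment block.

    Letters in the midline are counted as identical positions; '+' and
    spaces are not. aligned_length is the number of alignment columns.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / aligned_length


# Final block of the first GenBank hit (KAG6572056.1) as shown on this page.
query = "QLTIGSSKNSWGEK"
midline = "QLTIGSSK + GEK"
print(round(identity_from_midline(midline, len(query)), 2))  # 78.57
```

Here 11 of the 14 columns carry a letter in the midline, which gives 78.57% for that block alone.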


Sequences
CDS sequence
ATGGATCGGAGTTTCTTTTTGCTCGCCGTGATCGCCGCCACCGCCGCCGCCACCCTGTGCTCATCGGAGCCTTTGGCCTCACCGCATTCCGTCGAATGCGACGGCGATAC
GCTGATTCGTCAAGTTGTCAACGATGGAGATTTCAAGCGTCTCCCTCTCGGAGCGGAACATCATTTTTCGCTCTTCAAGCGAAGGTTCGGGAAATCATATGCCACCGAGG
AAGAGCACGATCTTAGGTTCAAGATCTTCAAGGCTAATATGCGACAGGCGCTGTGCCAGCAGTCATTTGATCCGTCCACCATTCATGGCGTCACTCAGTTCTCTGATTTG
ACCCCTTCCGAGTTTAGGGAGGCCTTTCTAGGGCTCCGAAGAAACCGTCTCAGGCTTCCTGTTGATACCAATACGGCTACGATTCTTCCTACGGAGAATCTTCCGATTCA
TTTTGATTGGAGAGAACGTGGTGCTGTAACTGCTGTAAAAAATCAGGATATACGCCTATTCCATTTGGTTACCTTTGATTTCTGGATCTTGCGGATCCTGTGGAGTTTCA
GTACAACCGGTGCTCTTGAAGGTGCTAACTTCCTTGCGACAGGGGAACTTGTTAGCTTAAGCGAACAGCAGCTGGTAGATTGTGATCACGAGTGTGATCCAGAGGAAGCT
GGTTCCTGTGACTCTGGTTGCAATGGTGGCTTGATGAACATTGCATTTGAATACACATTAAAAGCTGGTTGCGATCGTGAAGCCTGCAAGTTGGACAGGTCCAAGATCGC
TGCATCAGTTGCCAATTTCAGTGTTATTTCACTTGATGAGGATCAAATTGCTGCAAATCTGGTGGAGCATCTTGTCCATTCATATGTTCAAAGAGGTTGGATCATGGAGT
TTTGCTGGTGGGTTATGGCTCAGCTGACTATTGGATCATCAAAAAACTCATGGGGAGAAAAAGGATACTACAGGATCTGCAGAGGAAGGAATATTTGTGGAGTTGATTCC
TTGGTGTCAACTGTTGCAGCTGTTCATACCCCAATTGCAGCAGCAGGTCAGTAA
mRNA sequence
ATGGATCGGAGTTTCTTTTTGCTCGCCGTGATCGCCGCCACCGCCGCCGCCACCCTGTGCTCATCGGAGCCTTTGGCCTCACCGCATTCCGTCGAATGCGACGGCGATAC
GCTGATTCGTCAAGTTGTCAACGATGGAGATTTCAAGCGTCTCCCTCTCGGAGCGGAACATCATTTTTCGCTCTTCAAGCGAAGGTTCGGGAAATCATATGCCACCGAGG
AAGAGCACGATCTTAGGTTCAAGATCTTCAAGGCTAATATGCGACAGGCGCTGTGCCAGCAGTCATTTGATCCGTCCACCATTCATGGCGTCACTCAGTTCTCTGATTTG
ACCCCTTCCGAGTTTAGGGAGGCCTTTCTAGGGCTCCGAAGAAACCGTCTCAGGCTTCCTGTTGATACCAATACGGCTACGATTCTTCCTACGGAGAATCTTCCGATTCA
TTTTGATTGGAGAGAACGTGGTGCTGTAACTGCTGTAAAAAATCAGGATATACGCCTATTCCATTTGGTTACCTTTGATTTCTGGATCTTGCGGATCCTGTGGAGTTTCA
GTACAACCGGTGCTCTTGAAGGTGCTAACTTCCTTGCGACAGGGGAACTTGTTAGCTTAAGCGAACAGCAGCTGGTAGATTGTGATCACGAGTGTGATCCAGAGGAAGCT
GGTTCCTGTGACTCTGGTTGCAATGGTGGCTTGATGAACATTGCATTTGAATACACATTAAAAGCTGGTTGCGATCGTGAAGCCTGCAAGTTGGACAGGTCCAAGATCGC
TGCATCAGTTGCCAATTTCAGTGTTATTTCACTTGATGAGGATCAAATTGCTGCAAATCTGGTGGAGCATCTTGTCCATTCATATGTTCAAAGAGGTTGGATCATGGAGT
TTTGCTGGTGGGTTATGGCTCAGCTGACTATTGGATCATCAAAAAACTCATGGGGAGAAAAAGGATACTACAGGATCTGCAGAGGAAGGAATATTTGTGGAGTTGATTCC
TTGGTGTCAACTGTTGCAGCTGTTCATACCCCAATTGCAGCAGCAGGTCAGTAA
Protein sequence
MDRSFFLLAVIAATAAATLCSSEPLASPHSVECDGDTLIRQVVNDGDFKRLPLGAEHHFSLFKRRFGKSYATEEEHDLRFKIFKANMRQALCQQSFDPSTIHGVTQFSDL
TPSEFREAFLGLRRNRLRLPVDTNTATILPTENLPIHFDWRERGAVTAVKNQDIRLFHLVTFDFWILRILWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEA
GSCDSGCNGGLMNIAFEYTLKAGCDREACKLDRSKIAASVANFSVISLDEDQIAANLVEHLVHSYVQRGWIMEFCWWVMAQLTIGSSKNSWGEKGYYRICRGRNICGVDS
LVSTVAAVHTPIAAAGQ
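The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check, assuming the standard genetic code and using only the first 27 and last 15 nucleotides of the CDS as listed on this page (the names `CODON_TABLE` and `translate` are illustrative, not part of any database tooling):

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGATCGGAGTTTCTTTTTGCTCGCC"))  # MDRSFFLLA
print(translate("GCAGCAGGTCAGTAA"))              # AAGQ
```

Both fragments agree with the start (MDRSFFLLA) and end (AAGQ) of the protein sequence shown above. Passing the full CDS to `translate` would extend the same comparison to the whole record.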