CuGenDBv2

Cucsat.G412 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G412
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Inhibitor_I29 domain-containing protein
Genome location: ctg1:8597750..8598463
RNA-Seq Expression: Cucsat.G412
Synteny: Cucsat.G412
Gene Ontology terms: GO:0008233 - peptidase activity (molecular function)
InterPro domains: IPR013201 - Cathepsin propeptide inhibitor domain (I29)
                  IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAE8057439.1 hypothetical protein FH972_014132 [Carpinus fangiana] (e value: 4.50e-11; identity: 51.67%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL
        F SWMS+H K YES EEKL+RF IF+  LKHI + NK+ +    GLN+++DL++ EF ++
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL

KAE8650303.1 hypothetical protein Csa_010836 [Cucumis sativus] (e value: 4.99e-96; identity: 71.88%)
Query:  MKMLVALRSYFIRNRGLEERILRHVIQEEQDFIRKLELDEAQGLKPDVVDSTLEQDFKEAQGLKPDVVESTLEQDFKEDFVKTDKPW-------------
        MKMLVALRSYFIRNRGLEERILRHVIQEEQDFIRKLELDEAQGLKPDVVDSTLEQDFKEAQGLKPDVVESTLEQDFKEDFVKTDKPW             
Subjt:  MKMLVALRSYFIRNRGLEERILRHVIQEEQDFIRKLELDEAQGLKPDVVDSTLEQDFKEAQGLKPDVVESTLEQDFKEDFVKTDKPW-------------

Query:  --------------------------------------------------GLLEDAERSKDWKKFASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLN
                                                          GLLEDAERSKDWKKFASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLN
Subjt:  --------------------------------------------------GLLEDAERSKDWKKFASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLN

Query:  KEDNGCTFGLNQYSDLTNSEFNRL
        KEDNGCTFGLNQYSDLTNSEFNRL
Subjt:  KEDNGCTFGLNQYSDLTNSEFNRL

KAF8405593.1 hypothetical protein HHK36_010500 [Tetracentron sinense] (e value: 4.58e-11; identity: 51.67%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL
        F SWMS+H+K YES EEKL+RF +F+G L+HI + NK+ +    GLN+++DL+N EF ++
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL

MBA0666684.1 hypothetical protein [Gossypium klotzschianum] (e value: 2.24e-10; identity: 51.67%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL
        F SW+S+H K YES EEKL RF +F+  LKHI K NKE +    GLN+++DL++ EF ++
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL

XP_038887495.1 pro-cathepsin H-like [Benincasa hispida] (e value: 3.34e-18; identity: 62.12%)
Query:  DAERSKDWKKFASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSE
        DA  S+DW+ F SWMS H KKY S+EE LYRFG+F+  LK I+KLNK   GCTFG N +SDLT  E
Subjt:  DAERSKDWKKFASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L4P3 Inhibitor_I29 domain-containing protein (e value: 7.53e-82; identity: 63.95%)
Query:  MLVALRSYFIRNRGLEERILRHVIQEEQDFIRKLELDEAQGLKPDV-------VD---STLEQDFKEAQGLKPDVVESTLEQDFKEDFVKTDKPW-----
        MLVALRSYFIRNRGLEERILRHVIQEEQDFIRKLE+        D        +D   STLEQDFKEAQGLKPDVVESTLEQDFKEDFVKTDKPW     
Subjt:  MLVALRSYFIRNRGLEERILRHVIQEEQDFIRKLELDEAQGLKPDV-------VD---STLEQDFKEAQGLKPDVVESTLEQDFKEDFVKTDKPW-----

Query:  ----------------------------------------------------------GLLEDAERSKDWKKFASWMSEHKKKYESDEEKLYRFGIFRGE
                                                                  GLLEDAERSKDWKKFASWMSEHKKKYESDEEKLYRFGIFRGE
Subjt:  ----------------------------------------------------------GLLEDAERSKDWKKFASWMSEHKKKYESDEEKLYRFGIFRGE

Query:  LKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRLV
        LKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRLV
Subjt:  LKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRLV

A0A5N6R8Z3 Inhibitor_I29 domain-containing protein (e value: 2.18e-11; identity: 51.67%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL
        F SWMS+H K YES EEKL+RF IF+  LKHI + NK+ +    GLN+++DL++ EF ++
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL

A0A7J8VW64 Inhibitor_I29 domain-containing protein (Fragment) (e value: 1.09e-10; identity: 51.67%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL
        F SW+S+H K YES EEKL RF +F+  LKHI K NKE +    GLN+++DL++ EF ++
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL

A0A7J9CR48 Inhibitor_I29 domain-containing protein (Fragment) (e value: 1.52e-10; identity: 51.67%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL
        F SW+S+H K YES EEKL RF +F+  LKHI K NKE +    GLN+++DL++ EF ++
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL

A0A7J9K832 Inhibitor_I29 domain-containing protein (Fragment) (e value: 1.09e-10; identity: 51.67%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL
        F SW+S+H K YES EEKL RF +F+  LKHI K NKE +    GLN+++DL++ EF ++
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL

SwissProt top hits (e value, % identity, alignment)
O23791 Fruit bromelain (e value: 5.5e-09; identity: 45%)
Query:  KKFASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLN-KEDNGCTFGLNQYSDLTNSEF
        K+F  WM+E+ + Y+ D+EK+ RF IF+  +KHI+  N + +N  T G+NQ++D+T SEF
Subjt:  KKFASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLN-KEDNGCTFGLNQYSDLTNSEF

O65493 Cysteine protease XCP1 (e value: 2.9e-10; identity: 54.39%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEF
        F SWMSEH K Y+S EEK++RF +FR  L HI + N E N    GLN+++DLT+ EF
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEF

P05994 Papaya proteinase 4 (e value: 3.8e-10; identity: 54.39%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEF
        F SWM +H K Y++ +EKLYRF IF+  LK+I + NK  NG   GLN++SDL+N EF
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEF

P10056 Caricain (e value: 5.0e-10; identity: 51.72%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFN
        F SWM  H K YE+ +EKLYRF IF+  L +I + NK++N    GLN+++DL+N EFN
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFN

P14080 Chymopapain (e value: 1.1e-09; identity: 50.85%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNR
        F SWM +H K YES +EK+YRF IFR  L +I + NK++N    GLN ++DL+N EF +
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNR

Arabidopsis top hits (e value, % identity, alignment)
AT1G20850.1 xylem cysteine peptidase 2 (e value: 5.6e-09; identity: 41.67%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL
        F +W+S  +K YE+ EEK  RF +F+  LKHI + NK+      GLN+++DL++ EF ++
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL

AT4G35350.1 xylem cysteine peptidase 1 (e value: 2.1e-11; identity: 54.39%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEF
        F SWMSEH K Y+S EEK++RF +FR  L HI + N E N    GLN+++DLT+ EF
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEF

AT4G35350.2 xylem cysteine peptidase 1 (e value: 2.1e-11; identity: 54.39%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEF
        F SWMSEH K Y+S EEK++RF +FR  L HI + N E N    GLN+++DLT+ EF
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEF

AT4G39090.1 Papain family cysteine protease (e value: 1.0e-05; identity: 38.98%)
Query:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNR
        F+ +  +  K Y S+EE  YRF +F+  L+  ++  K D   T G+ Q+SDLT SEF +
Subjt:  FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNR

AT5G50260.1 Cysteine proteinases superfamily protein (e value: 3.4e-06; identity: 38.71%)
Query:  WKKFASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNR
        W+ +  W S H     S EEK  RF +F+  +KHI + NK+D      LN++ D+T+ EF R
Subjt:  WKKFASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNR
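
The % identity values listed for the hits appear to follow from the middle line of each alignment: letters mark identical residues, `+` marks conservative substitutions, and spaces mark mismatches. The sketch below recomputes the value for the first GenBank hit under that assumption. The function name `percent_identity` is hypothetical and is not part of CuGenDB or BLAST.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block.

    Assumes BLAST-style output: the midline repeats the residue letter at
    identical positions, shows '+' for conservative substitutions, and a
    space for mismatches or gaps.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the aligned query")
    # Only letters on the midline count as identities; '+' and ' ' do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# First GenBank hit (KAE8057439.1), copied from the alignment above
# with the leading indentation of the midline removed.
query = "FASWMSEHKKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRL"
midline = "F SWMS+H K YES EEKL+RF IF+  LKHI + NK+ +    GLN+++DL++ EF ++"
print(f"{percent_identity(query, midline):.2f}")  # 51.67
```

Dividing by the full aligned length (60 positions here, 31 of them identical) matches the 51.67 listed for this hit.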


Sequences
CDS sequence
AACATGAAAATGTTAGTAGCACTTCGTTCGTATTTCATTCGCAATCGAGGTCTGGAAGAACGCATCCTTCGTCACGTAATACAAGAAGAACAAGATTTTATAAGGAAATT
GGAGCTGGATGAAGCACAAGGTTTGAAGCCGGATGTTGTGGATTCAACTCTCGAACAAGATTTCAAGGAGGCACAAGGTTTGAAGCCGGATGTTGTGGAATCAACCCTGG
AACAAGATTTCAAGGAGGATTTTGTTAAGACAGACAAGCCATGGGGATTACTGGAAGATGCAGAACGTTCTAAGGATTGGAAGAAGTTCGCTTCATGGATGTCGGAGCAC
AAGAAGAAGTACGAGAGCGATGAAGAGAAGTTGTATAGGTTTGGGATATTCAGAGGAGAACTGAAACATATTAAGAAGTTGAACAAGGAGGACAATGGATGTACCTTTGG
GTTGAATCAGTACTCAGACCTAACCAATTCTGAATTTAACAGACTCGTA
mRNA sequence
AACATGAAAATGTTAGTAGCACTTCGTTCGTATTTCATTCGCAATCGAGGTCTGGAAGAACGCATCCTTCGTCACGTAATACAAGAAGAACAAGATTTTATAAGGAAATT
GGAGCTGGATGAAGCACAAGGTTTGAAGCCGGATGTTGTGGATTCAACTCTCGAACAAGATTTCAAGGAGGCACAAGGTTTGAAGCCGGATGTTGTGGAATCAACCCTGG
AACAAGATTTCAAGGAGGATTTTGTTAAGACAGACAAGCCATGGGGATTACTGGAAGATGCAGAACGTTCTAAGGATTGGAAGAAGTTCGCTTCATGGATGTCGGAGCAC
AAGAAGAAGTACGAGAGCGATGAAGAGAAGTTGTATAGGTTTGGGATATTCAGAGGAGAACTGAAACATATTAAGAAGTTGAACAAGGAGGACAATGGATGTACCTTTGG
GTTGAATCAGTACTCAGACCTAACCAATTCTGAATTTAACAGACTCGTA
Protein sequence
NMKMLVALRSYFIRNRGLEERILRHVIQEEQDFIRKLELDEAQGLKPDVVDSTLEQDFKEAQGLKPDVVESTLEQDFKEDFVKTDKPWGLLEDAERSKDWKKFASWMSEH
KKKYESDEEKLYRFGIFRGELKHIKKLNKEDNGCTFGLNQYSDLTNSEFNRLV
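
The protein sequence above appears to be a frame-1 translation of the CDS starting at its first base (`AAC` gives the leading `N`), not at the first `ATG`. The following minimal sketch checks this with the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative, and the use of the standard code is an assumption about how the record was produced.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1 from its first base."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 18 and last 12 bases of the CDS listed above.
print(translate("AACATGAAAATGTTAGTA"))  # NMKMLV
print(translate("AACAGACTCGTA"))        # NRLV
```

Both fragments agree with the start (`NMKMLV`) and end (`NRLV`) of the listed protein sequence.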