; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G000510 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G000510
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionCysteine proteinases superfamily protein
Genome locationchr02:472585..473845
RNA-Seq ExpressionLsi02G000510
SyntenyLsi02G000510
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OBS76497.1 hypothetical protein A6R68_17053, partial [Neotoma lepida]7.6e-1142.27Show/hide
Query:  AIRITTELRKL---KKEIYHGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKIKESDN
        ++ I+  LR     K  +Y+ P+  +HL   P HAVL+VG+G+E+NGQKYWL+KNSWGE+WG  GY K++++      N   + +  SYP   E  N
Subjt:  AIRITTELRKL---KKEIYHGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKIKESDN

XP_022949074.1 zingipain-2-like [Cucurbita moschata]8.6e-2353.78Show/hide
Query:  IKIAAYERIV----DIN--ILNSKIGAAIRITTELRKLKKEIYHGPENVNHLL--QNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNII
        +KI ++ RI     D+N  +L+  I   IR T E R L+ EIY+GPE+VN LL    P HAVL+VGFG E+N QKYW+IKNSWGE+W D GYGKISQNI+
Subjt:  IKIAAYERIV----DIN--ILNSKIGAAIRITTELRKLKKEIYHGPENVNHLL--QNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNII

Query:  ETGKNSTHLLSRFSYPKIK
         T +    L+ R SYP I+
Subjt:  ETGKNSTHLLSRFSYPKIK

XP_023524541.1 zingipain-2-like [Cucurbita pepo subsp. pepo]4.0e-2056.86Show/hide
Query:  IKIAAYERI------VDINILNSKIGAAIRITTELRKLKKEIYHGPENVNHLL--QNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNII
        IKI +++RI      VD  +L+  I   IR T E + L+ EIY+GPE+VN LL    P HAVL+VGFG E+N QKYW+IKNSWGE+W D GYGKISQNI+
Subjt:  IKIAAYERI------VDINILNSKIGAAIRITTELRKLKKEIYHGPENVNHLL--QNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNII

Query:  ET
         T
Subjt:  ET

XP_030939750.1 ervatamin-B-like [Quercus lobata]4.4e-1140Show/hide
Query:  IKIAAYERIVDIN-------ILNSKIGAAIRITTELRKLKKEIYHGPENVNHLLQNP--LHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQN-
        + I  ++RI D +       +    + AA+ I  E  KLK EIY GP++   + + P  +HA+L+VG+  E NG+ YWLI+NSWG+ WG  GYGKI ++ 
Subjt:  IKIAAYERIVDIN-------ILNSKIGAAIRITTELRKLKKEIYHGPENVNHLLQNP--LHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQN-

Query:  --IIETGKNSTH---LLSRFSYPKI
           I T KN      L+ R SYP I
Subjt:  --IIETGKNSTH---LLSRFSYPKI

XP_030939752.1 ervatamin-B-like [Quercus lobata]1.5e-1139.52Show/hide
Query:  IKIAAYERIVDIN-------ILNSKIGAAIRITTELRKLKKEIYHGPEN-VNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQN--
        + I  +E+I D +       +    + AA+ I  E  KLK EIY GP++ +   +   +HA+L+VG+  E NG+ YW+I+NSWGE WG  GYGKI ++  
Subjt:  IKIAAYERIVDIN-------ILNSKIGAAIRITTELRKLKKEIYHGPEN-VNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQN--

Query:  -IIETGKNSTH---LLSRFSYPKI
          I T KN      L+ R SYP I
Subjt:  -IIETGKNSTH---LLSRFSYPKI

TrEMBL top hitse value%identityAlignment
A0A061ICE2 Cathepsin L1-like isoform 11.1e-1041.49Show/hide
Query:  AIRITTELRKL---KKEIYHGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKIKE
        +I I+  LR L   K   Y+ P+  NH    P H+VL+VG+G+E++GQKYWL+KNSWGE+WG  GY KI+++      N   + +  +YP + +
Subjt:  AIRITTELRKL---KKEIYHGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKIKE

A0A1A6HFT7 Uncharacterized protein (Fragment)3.7e-1142.27Show/hide
Query:  AIRITTELRKL---KKEIYHGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKIKESDN
        ++ I+  LR     K  +Y+ P+  +HL   P HAVL+VG+G+E+NGQKYWL+KNSWGE+WG  GY K++++      N   + +  SYP   E  N
Subjt:  AIRITTELRKL---KKEIYHGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKIKESDN

A0A6J1GBS9 zingipain-2-like4.2e-2353.78Show/hide
Query:  IKIAAYERIV----DIN--ILNSKIGAAIRITTELRKLKKEIYHGPENVNHLL--QNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNII
        +KI ++ RI     D+N  +L+  I   IR T E R L+ EIY+GPE+VN LL    P HAVL+VGFG E+N QKYW+IKNSWGE+W D GYGKISQNI+
Subjt:  IKIAAYERIV----DIN--ILNSKIGAAIRITTELRKLKKEIYHGPENVNHLL--QNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNII

Query:  ETGKNSTHLLSRFSYPKIK
         T +    L+ R SYP I+
Subjt:  ETGKNSTHLLSRFSYPKIK

A0A7N2MRH3 Pept_C1 domain-containing protein7.4e-1239.52Show/hide
Query:  IKIAAYERIVDIN-------ILNSKIGAAIRITTELRKLKKEIYHGPEN-VNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQN--
        + I  +E+I D +       +    + AA+ I  E  KLK EIY GP++ +   +   +HA+L+VG+  E NG+ YW+I+NSWGE WG  GYGKI ++  
Subjt:  IKIAAYERIVDIN-------ILNSKIGAAIRITTELRKLKKEIYHGPEN-VNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQN--

Query:  -IIETGKNSTH---LLSRFSYPKI
          I T KN      L+ R SYP I
Subjt:  -IIETGKNSTH---LLSRFSYPKI

A0A7N2RC65 Pept_C1 domain-containing protein2.2e-1140Show/hide
Query:  IKIAAYERIVDIN-------ILNSKIGAAIRITTELRKLKKEIYHGPENVNHLLQNP--LHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQN-
        + I  ++RI D +       +    + AA+ I  E  KLK EIY GP++   + + P  +HA+L+VG+  E NG+ YWLI+NSWG+ WG  GYGKI ++ 
Subjt:  IKIAAYERIVDIN-------ILNSKIGAAIRITTELRKLKKEIYHGPENVNHLLQNP--LHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQN-

Query:  --IIETGKNSTH---LLSRFSYPKI
           I T KN      L+ R SYP I
Subjt:  --IIETGKNSTH---LLSRFSYPKI

SwissProt top hitse value%identityAlignment
O46427 Pro-cathepsin H6.0e-1138.89Show/hide
Query:  ERIVDINILNSKIGAAIRITTELRKLKKEIYHG------PENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTH
        E +V+   L + +  A  +T +    +K IY        P+ VN       HAVL VG+G+E NG  YW++KNSWG QWG  GY      +IE GKN   
Subjt:  ERIVDINILNSKIGAAIRITTELRKLKKEIYHG------PENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTH

Query:  LLSRFSYP
        L +  SYP
Subjt:  LLSRFSYP

P05167 Thiol protease aleurain3.9e-1039.8Show/hide
Query:  IGAAIRITTELRKLKKEIYHG------PENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKI
        +  A ++    R+ K  +Y        P++VN       HAVL VG+G E NG  YWLIKNSWG  WGD GY K     +E GKN   + +  SYP +
Subjt:  IGAAIRITTELRKLKKEIYHG------PENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKI

P49935 Pro-cathepsin H2.3e-1037.74Show/hide
Query:  IVDINILNSKIGAAIRITTELRKLKKEIYHG------PENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLL
        +V+   L + +  A  +T +    K  +Y        P+ VN       HAVL VG+G E NG  YW++KNSWG QWG+ GY      +IE GKN   L 
Subjt:  IVDINILNSKIGAAIRITTELRKLKKEIYHG------PENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLL

Query:  SRFSYP
        +  SYP
Subjt:  SRFSYP

Q3T0I2 Pro-cathepsin H3.0e-1037.04Show/hide
Query:  ERIVDINILNSKIGAAIRITTELRKLKKEIYHG------PENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTH
        E +V+   L++ +  A  +T +    +K IY        P+ VN       HAVL VG+G+E  G  YW++KNSWG  WG KGY      +IE GKN   
Subjt:  ERIVDINILNSKIGAAIRITTELRKLKKEIYHG------PENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTH

Query:  LLSRFSYP
        L +  S+P
Subjt:  LLSRFSYP

Q95029 Cathepsin L3.9e-1047.62Show/hide
Query:  QNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKI
        QN  H VL+VGFG + +G+ YWL+KNSWG  WGDKG+ K+ +N     +N   + S  SYP +
Subjt:  QNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKI

Arabidopsis top hitse value%identityAlignment
AT3G45310.1 Cysteine proteinases superfamily protein4.0e-1038.78Show/hide
Query:  IGAAIRITTELRKLKKEIY------HGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKI
        +  A  +  E R  KK ++      + P +VN       HAVL VG+G E++   YWLIKNSWG +WGD GY K     +E GKN   + +  SYP +
Subjt:  IGAAIRITTELRKLKKEIY------HGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKI

AT3G48350.1 Cysteine proteinases superfamily protein6.8e-1043.86Show/hide
Query:  HAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYP
        H V+IVG+G+  NG KYW+++NSWG +WG+ GY +I + I E  +    +    SYP
Subjt:  HAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYP

AT4G23520.1 Cysteine proteinases superfamily protein1.4e-1047.44Show/hide
Query:  IYHGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKIKES
        IY+GP        N  HA++IVG+G E NGQ YW+++NSWG  WGD GY KI++N  E  K    +    SYP IK S
Subjt:  IYHGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKIKES

AT5G45890.1 senescence-associated gene 121.8e-1045.76Show/hide
Query:  HAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKI
        HAV  +G+G+  NG KYW+IKNSWG +WG+ GY +I Q  ++  +    L  + SYP I
Subjt:  HAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKI

AT5G60360.1 aleurain-like protease1.8e-1040Show/hide
Query:  IGAAIRITTELRKLKKEIYHGPENVNHLLQNPL---HAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKI
        +  A  +    R  K  +Y      +H    P+   HAVL VG+G E +G  YWLIKNSWG  WGDKGY K     +E GKN   + +  SYP +
Subjt:  IGAAIRITTELRKLKKEIYHGPENVNHLLQNPL---HAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKNSTHLLSRFSYPKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATTGTTGTTGGACAATTATCGCTGCGGAAATGGATAAAGATTGCAGCATATGAAAGGATAGTTGACATTAATATTTTAAATTCAAAGATTGGAGCTGCTATAAG
AATTACTACCGAGCTACGAAAATTGAAAAAGGAAATATATCATGGCCCAGAAAATGTGAACCATTTGTTACAGAACCCTCTACATGCAGTTTTGATAGTAGGGTTTGGCC
AAGAAAATAACGGTCAAAAGTATTGGCTTATCAAGAATTCATGGGGTGAGCAATGGGGAGACAAAGGTTATGGAAAAATATCCCAAAACATCATTGAAACGGGGAAAAAT
TCAACACATTTGTTGTCAAGATTTTCTTATCCGAAAATCAAAGAGAGTGATAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATTGTTGTTGGACAATTATCGCTGCGGAAATGGATAAAGATTGCAGCATATGAAAGGATAGTTGACATTAATATTTTAAATTCAAAGATTGGAGCTGCTATAAG
AATTACTACCGAGCTACGAAAATTGAAAAAGGAAATATATCATGGCCCAGAAAATGTGAACCATTTGTTACAGAACCCTCTACATGCAGTTTTGATAGTAGGGTTTGGCC
AAGAAAATAACGGTCAAAAGTATTGGCTTATCAAGAATTCATGGGGTGAGCAATGGGGAGACAAAGGTTATGGAAAAATATCCCAAAACATCATTGAAACGGGGAAAAAT
TCAACACATTTGTTGTCAAGATTTTCTTATCCGAAAATCAAAGAGAGTGATAACTAA
Protein sequenceShow/hide protein sequence
MAIVVGQLSLRKWIKIAAYERIVDINILNSKIGAAIRITTELRKLKKEIYHGPENVNHLLQNPLHAVLIVGFGQENNGQKYWLIKNSWGEQWGDKGYGKISQNIIETGKN
STHLLSRFSYPKIKESDN