CuGenDBv2

Lag0014440 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014440
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: cysteine proteinase inhibitor 1-like
Genome location: chr12:812312..813001
RNA-Seq Expression: Lag0014440
Synteny: Lag0014440
Gene Ontology terms: GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domains: IPR009994 - Phloem filament PP1
IPR027214 - Cystatin
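
The genome location above can be turned into a sequence name, a start, an end, and a span length. The sketch below is only an illustration: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr12:812312..813001")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 690 bp on chr12.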


Homology
GenBank top hits (e value, % identity, alignment)
KAG6571861.1 Cysteine proteinase inhibitor 1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 7.5e-35; % identity: 61.34)
Query:  AVVVNVGQYDSNHRVINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQ
        + +VNVGQYD  HRVIN  KD CPSGWT I D++E  V+E AK  V EYN+EY E LKY  I R W+ME KEGG+DY F+LEAMDC+G V++YKA+VSE 
Subjt:  AVVVNVGQYDSNHRVINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQ

Query:  NYGGEKTRKLKSFKLVPKN
          GGEK  KLK+F L  +N
Subjt:  NYGGEKTRKLKSFKLVPKN

KAG6588756.1 hypothetical protein SDJN03_17321, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.1e-14; % identity: 48.45)
Query:  CPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPKN
        CPS W  I +V E  VQ VAKFAV +YN+E+KE+ KY  I  GWFME K   + +   +E  DCLG V +   +VSE+    EK RKL+S KL+ K+
Subjt:  CPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPKN

KAG6588757.1 hypothetical protein SDJN03_17322, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.0e-15; % identity: 49.48)
Query:  CPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPKN
        CPS W  I DV E  VQ VAKFAV +YN+ +KE+ KY  I  GWFME K   + Y   +E  DCLG +     +VSE+    EK RKL+S KL+ KN
Subjt:  CPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPKN

KAG6588758.1 hypothetical protein SDJN03_17323, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.0e-15; % identity: 48.45)
Query:  CPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPKN
        CPS W  I DV E  VQ +AKFAV +YN+ +KE+ KY  I  GWFME K   + Y   +E  DCLG + +   +VSE+    EK RKL+S KL+ KN
Subjt:  CPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPKN

XP_023522970.1 uncharacterized protein LOC111787034 [Cucurbita pepo subsp. pepo] (e value: 7.8e-16; % identity: 42.86)
Query:  AVVVNVGQYDSNHRVINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQ
        AVV ++   D   ++       CPS W  I DV E  VQ +AKFAV +YN+ +KE+ KY  I  GWFME K   + +   +E  DCLG + +   +VSE+
Subjt:  AVVVNVGQYDSNHRVINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQ

Query:  NYGGEKTRKLKSFKLVPKN
           GEK RKL+S KL+ KN
Subjt:  NYGGEKTRKLKSFKLVPKN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TIS8 Phloem filament protein (e value: 9.0e-10; % identity: 39.58)
Query:  WTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFI----LEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPK
        W  I+D+ +P VQ+VAK AV ++N +  ++L Y  I +GW+   +E  ++YA +    L   DC G V ++KALV E  +G +K R LKSF+++ K
Subjt:  WTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFI----LEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPK

A0A6J1EI90 uncharacterized protein LOC111434592 (e value: 6.9e-10; % identity: 36.96)
Query:  WTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPK
        W  I +++  +VQEV+KFA+ ++NV+  ++LKY  I  GW+ME  +  + +   L+A DCL  V  Y+A V  +N+ G++ + ++SFKL+ +
Subjt:  WTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPK

A0A6J1EJ75 uncharacterized protein LOC111434636 isoform X1 (e value: 6.9e-10; % identity: 36.96)
Query:  WTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPK
        W  I +++  +VQEV+KFA+ ++NV+  ++LKY  I  GW+ME  +  + +   L+A DCL  V  Y+A V  +N+ G++ + ++SFKL+ +
Subjt:  WTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPK

A0A6J1ELQ9 uncharacterized protein LOC111434515 (e value: 2.6e-09; % identity: 40.91)
Query:  WTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK
        W  I +V+ P VQE+AKFAV E+N +  E LKY  +  GWFM+  +  + + F L+A D LG V  Y+A+V  +++  ++ + L+SFK
Subjt:  WTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK

A0A6J1EQ39 uncharacterized protein LOC111434636 isoform X2 (e value: 6.9e-10; % identity: 36.96)
Query:  WTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPK
        W  I +++  +VQEV+KFA+ ++NV+  ++LKY  I  GW+ME  +  + +   L+A DCL  V  Y+A V  +N+ G++ + ++SFKL+ +
Subjt:  WTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFKLVPK

SwissProt top hits (e value, % identity, alignment)
P86472 Cysteine proteinase inhibitor 1 (e value: 5.8e-06; % identity: 32.35)
Query:  VINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK
        V+ G K     GW  I ++    VQ+VA+FAV+E+N +  + L+Y  + RG+   Q   G +Y  ++ A D    V  Y+A+V ++ +     R L SF+
Subjt:  VINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK

Query:  LV
         V
Subjt:  LV

Q10J94 Cysteine proteinase inhibitor 8 (e value: 2.0e-06; % identity: 33.71)
Query:  GWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK
        GW+ I DV +P++QE+  +AV  +     + L++  +  G   +Q   GM+Y  ++ A D  G    Y A+V EQ++    TR+L SFK
Subjt:  GWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK

Q10Q46 Cysteine proteinase inhibitor 6 (e value: 1.7e-05; % identity: 35.05)
Query:  AKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYN-VEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCV-DKYKALVSEQNYGGEKTRKLKSF
        A  + P GW+ I ++++P++QE+ ++A+TE N V   + L +  +  G   +Q   GM+Y   +EA    G V   Y A+V EQ +    TRKL SF
Subjt:  AKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYN-VEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCV-DKYKALVSEQNYGGEKTRKLKSF

Q41916 Cysteine proteinase inhibitor 5 (e value: 2.4e-04; % identity: 33.71)
Query:  GWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK
        GW+ I +V +P V E+ +FAV+EYN   +  LK+  +  G    Q   G +Y   + A D  G    Y A+V ++ +   K R L SF+
Subjt:  GWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK

Q6TPK4 Cysteine proteinase inhibitor 1 (e value: 1.3e-05; % identity: 32.35)
Query:  VINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK
        V+ G K     GW  I  +    VQ+VA+FAV+E+N +  + L+Y  + RG+   Q   G +Y  ++ A D    V  Y+A+V ++ +     R L SF+
Subjt:  VINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK

Query:  LV
         V
Subjt:  LV

Arabidopsis top hits (e value, % identity, alignment)
AT4G16500.1 Cystatin/monellin superfamily protein (e value: 2.5e-04; % identity: 35.29)
Query:  IVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK
        I +V +P V  VAK+A+ E+N E KE L +  +  G    Q   G  Y   + A D  G +  Y+A+V E+ +   K+  L+SFK
Subjt:  IVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK

AT5G47550.1 Cystatin/monellin superfamily protein (e value: 1.7e-05; % identity: 33.71)
Query:  GWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK
        GW+ I +V +P V E+ +FAV+EYN   +  LK+  +  G    Q   G +Y   + A D  G    Y A+V ++ +   K R L SF+
Subjt:  GWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNYGGEKTRKLKSFK
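
The % identity values in the hit tables can be re-derived from the alignment rows. The sketch below is a rough check, not the database's own method. It assumes the unlabelled middle row follows the usual BLAST pairwise convention (a residue letter for an identical column, `+` for a conservative substitution, a space otherwise) and that identity is counted over all aligned columns. The function name `percent_identity` is hypothetical. The data are the Query and middle rows of the first GenBank hit (KAG6571861.1), with the row labels stripped.

```python
def percent_identity(query_rows, midline_rows):
    """Identical columns divided by aligned columns, as a percentage."""
    aligned = sum(len(row) for row in query_rows)
    # Only residue letters in the middle row mark identical columns.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / aligned


query_rows = [
    "AVVVNVGQYDSNHRVINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQ",
    "NYGGEKTRKLKSFKLVPKN",
]
midline_rows = [
    "+ +VNVGQYD  HRVIN  KD CPSGWT I D++E  V+E AK  V EYN+EY E LKY  I R W+ME KEGG+DY F+LEAMDC+G V++YKA+VSE ",
    "  GGEK  KLK+F L  +N",
]

print(round(percent_identity(query_rows, midline_rows), 2))
```

For this hit the count is 73 identical columns out of 119, which agrees with the 61.34 listed for KAG6571861.1.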


Sequences
CDS sequence
ATGTCGGAGATGAAGAAAAACCGTGCCGTGGTTGTCAATGTTGGTCAATATGATTCCAACCATCGTGTGATAAATGGAGCGAAGGATAGTTGTCCAAGCGGATGGACATC
AATCGTTGATGTCGAAGAGCCCTATGTGCAAGAAGTTGCAAAGTTTGCCGTGACAGAGTACAACGTCGAATATAAGGAAAATCTAAAATACACGTGCATTGCGCGTGGCT
GGTTTATGGAACAGAAGGAAGGTGGCATGGATTATGCCTTCATTCTTGAGGCGATGGACTGTCTTGGATGTGTGGACAAATATAAGGCTCTTGTTTCAGAACAAAACTAT
GGAGGTGAAAAAACTAGAAAGCTCAAATCTTTCAAGCTTGTCCCAAAGAACTGA
mRNA sequence
ATGTCGGAGATGAAGAAAAACCGTGCCGTGGTTGTCAATGTTGGTCAATATGATTCCAACCATCGTGTGATAAATGGAGCGAAGGATAGTTGTCCAAGCGGATGGACATC
AATCGTTGATGTCGAAGAGCCCTATGTGCAAGAAGTTGCAAAGTTTGCCGTGACAGAGTACAACGTCGAATATAAGGAAAATCTAAAATACACGTGCATTGCGCGTGGCT
GGTTTATGGAACAGAAGGAAGGTGGCATGGATTATGCCTTCATTCTTGAGGCGATGGACTGTCTTGGATGTGTGGACAAATATAAGGCTCTTGTTTCAGAACAAAACTAT
GGAGGTGAAAAAACTAGAAAGCTCAAATCTTTCAAGCTTGTCCCAAAGAACTGA
Protein sequence
MSEMKKNRAVVVNVGQYDSNHRVINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNY
GGEKTRKLKSFKLVPKN
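
As a consistency check, the CDS above can be translated and compared with the listed protein. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers, and the sequences are copied from this record with the line breaks removed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


CDS = (
    "ATGTCGGAGATGAAGAAAAACCGTGCCGTGGTTGTCAATGTTGGTCAATATGATTCCAACCATCGTGTGATAAATGGAGCGAAGGATAGTTGTCCAAGCGGATGGACATC"
    "AATCGTTGATGTCGAAGAGCCCTATGTGCAAGAAGTTGCAAAGTTTGCCGTGACAGAGTACAACGTCGAATATAAGGAAAATCTAAAATACACGTGCATTGCGCGTGGCT"
    "GGTTTATGGAACAGAAGGAAGGTGGCATGGATTATGCCTTCATTCTTGAGGCGATGGACTGTCTTGGATGTGTGGACAAATATAAGGCTCTTGTTTCAGAACAAAACTAT"
    "GGAGGTGAAAAAACTAGAAAGCTCAAATCTTTCAAGCTTGTCCCAAAGAACTGA"
)
PROTEIN = (
    "MSEMKKNRAVVVNVGQYDSNHRVINGAKDSCPSGWTSIVDVEEPYVQEVAKFAVTEYNVEYKENLKYTCIARGWFMEQKEGGMDYAFILEAMDCLGCVDKYKALVSEQNY"
    "GGEKTRKLKSFKLVPKN"
)

print(len(CDS), len(PROTEIN), translate(CDS) == PROTEIN)
```

The CDS is 384 nt (127 codons plus a TGA stop), matching the 127-residue protein.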