; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025558 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025558
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUlp1-like peptidase
Genome locationscaffold13:31282979..31285873
RNA-Seq ExpressionSpg025558
SyntenySpg025558
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]1.2e-1631.72Show/hide
Query:  SNVIFEMTIRSLKKALSQ--ATQRDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFGLT--EVV
        S++IF+ TI SLK AL    +  +     +   +E YSLY FP+AFQ  AYETIS+L++        DAIP   RWSC +S  +  L+SE+F  T  +V 
Subjt:  SNVIFEMTIRSLKKALSQ--ATQRDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFGLT--EVV

Query:  MQLVSSDAELEHMRRIVLPLQLEALVLPPQL--EAPVFPPQPEPMNDANLDHLVGSDRGLEEVGLDRGSPTKDADMVGLDEQSAHEGLPEGVGKTCQCDC
          L+++DA+ +HM R++LP ++  +  PP +   A V  P   P   A  D     + G  E       P  DA  V     SA++G  EG+ K  +   
Subjt:  MQLVSSDAELEHMRRIVLPLQLEALVLPPQL--EAPVFPPQPEPMNDANLDHLVGSDRGLEEVGLDRGSPTKDADMVGLDEQSAHEGLPEGVGKTCQCDC

Query:  KQSYES-LDRRMKEMESDVKGMKSD-------LESIKKYLCRLSKGKLVVDPTKYLGPNRSAAGDEPSDKGKDHVVEEGRGGVSEDSMVEDKGVESNSHE
        K  ++  + RR+K +++ V  ++         L+ I+ YL +L+KGK   D +KY G       D PSD+  D   E  +      SM ED+  + +   
Subjt:  KQSYES-LDRRMKEMESDVKGMKSD-------LESIKKYLCRLSKGKLVVDPTKYLGPNRSAAGDEPSDKGKDHVVEEGRGGVSEDSMVEDKGVESNSHE

Query:  VQEIVEPRE
         +++   +E
Subjt:  VQEIVEPRE

XP_022154965.1 uncharacterized protein LOC111022110 [Momordica charantia]1.3e-1834.52Show/hide
Query:  SNVIFEMTIRSLKKALSQATQ--RDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFG--LTEVV
        S++IFE T+ SLK AL    +  +  VA ++  +E YSLY FP+AFQ  AYETIS+L+  VA  +N DAIP   RWSC++S ++  L  E+F    ++VV
Subjt:  SNVIFEMTIRSLKKALSQATQ--RDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFG--LTEVV

Query:  MQLVSSDAELEHMRRIVLPLQLEALVLPPQLEAPVFPPQP-----EPM------NDANLDHLVGSDRGLEEVGLDRGSPTKD---ADMVGLD---EQSAH
        ++L ++D E +HM R++ P             APV PP P     EP+        + +   VG    L++V  D  SP  D    D++G D   +Q   
Subjt:  MQLVSSDAELEHMRRIVLPLQLEALVLPPQLEAPVFPPQP-----EPM------NDANLDHLVGSDRGLEEVGLDRGSPTKD---ADMVGLD---EQSAH

Query:  EGLPEGVGKTCQCDCKQSYESLDRRMKEMESDVKGMKSDLESIKKYLCRLSK
        +   E   K  +    +    L  R+  +E+ + GM +D++ IKK++ RL+K
Subjt:  EGLPEGVGKTCQCDCKQSYESLDRRMKEMESDVKGMKSDLESIKKYLCRLSK

XP_022155154.1 uncharacterized protein LOC111022296 [Momordica charantia]5.7e-1137.63Show/hide
Query:  LNVIYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVLLVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM
        +  +Y+ FN+  NHWVM+C D + G++ V DS  A+T DA+L+++L+ +  V+  LL K   +  + +LP+  W I R +S P Q    DCG+
Subjt:  LNVIYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVLLVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia]2.3e-1240.86Show/hide
Query:  LNVIYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVLLVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM
        +  +Y+ FN+ GNHWVM+C D + G++ V DS  A+TS A+L+++L  + TV+  LL K  V+  + +LP+  W I R +S P Q +  DCG+
Subjt:  LNVIYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVLLVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]5.2e-1244.95Show/hide
Query:  SNVIFEMTIRSLKKALSQATQ--RDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFG--LTEVV
        S++IFE T+ SLK AL    +  +  VA ++  +E YSLY FP+AFQ  AYETIS+L+  VA  +N DAIP   RWSC++S ++  L  E+F    ++VV
Subjt:  SNVIFEMTIRSLKKALSQATQ--RDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFG--LTEVV

Query:  MQLVSSDAE
        ++L ++D E
Subjt:  MQLVSSDAE

TrEMBL top hitse value%identityAlignment
A0A6J1DJX9 uncharacterized protein LOC1110207575.8e-1731.72Show/hide
Query:  SNVIFEMTIRSLKKALSQ--ATQRDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFGLT--EVV
        S++IF+ TI SLK AL    +  +     +   +E YSLY FP+AFQ  AYETIS+L++        DAIP   RWSC +S  +  L+SE+F  T  +V 
Subjt:  SNVIFEMTIRSLKKALSQ--ATQRDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFGLT--EVV

Query:  MQLVSSDAELEHMRRIVLPLQLEALVLPPQL--EAPVFPPQPEPMNDANLDHLVGSDRGLEEVGLDRGSPTKDADMVGLDEQSAHEGLPEGVGKTCQCDC
          L+++DA+ +HM R++LP ++  +  PP +   A V  P   P   A  D     + G  E       P  DA  V     SA++G  EG+ K  +   
Subjt:  MQLVSSDAELEHMRRIVLPLQLEALVLPPQL--EAPVFPPQPEPMNDANLDHLVGSDRGLEEVGLDRGSPTKDADMVGLDEQSAHEGLPEGVGKTCQCDC

Query:  KQSYES-LDRRMKEMESDVKGMKSD-------LESIKKYLCRLSKGKLVVDPTKYLGPNRSAAGDEPSDKGKDHVVEEGRGGVSEDSMVEDKGVESNSHE
        K  ++  + RR+K +++ V  ++         L+ I+ YL +L+KGK   D +KY G       D PSD+  D   E  +      SM ED+  + +   
Subjt:  KQSYES-LDRRMKEMESDVKGMKSD-------LESIKKYLCRLSKGKLVVDPTKYLGPNRSAAGDEPSDKGKDHVVEEGRGGVSEDSMVEDKGVESNSHE

Query:  VQEIVEPRE
         +++   +E
Subjt:  VQEIVEPRE

A0A6J1DL40 uncharacterized protein LOC1110221106.2e-1934.52Show/hide
Query:  SNVIFEMTIRSLKKALSQATQ--RDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFG--LTEVV
        S++IFE T+ SLK AL    +  +  VA ++  +E YSLY FP+AFQ  AYETIS+L+  VA  +N DAIP   RWSC++S ++  L  E+F    ++VV
Subjt:  SNVIFEMTIRSLKKALSQATQ--RDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFG--LTEVV

Query:  MQLVSSDAELEHMRRIVLPLQLEALVLPPQLEAPVFPPQP-----EPM------NDANLDHLVGSDRGLEEVGLDRGSPTKD---ADMVGLD---EQSAH
        ++L ++D E +HM R++ P             APV PP P     EP+        + +   VG    L++V  D  SP  D    D++G D   +Q   
Subjt:  MQLVSSDAELEHMRRIVLPLQLEALVLPPQLEAPVFPPQP-----EPM------NDANLDHLVGSDRGLEEVGLDRGSPTKD---ADMVGLD---EQSAH

Query:  EGLPEGVGKTCQCDCKQSYESLDRRMKEMESDVKGMKSDLESIKKYLCRLSK
        +   E   K  +    +    L  R+  +E+ + GM +D++ IKK++ RL+K
Subjt:  EGLPEGVGKTCQCDCKQSYESLDRRMKEMESDVKGMKSDLESIKKYLCRLSK

A0A6J1DPE8 uncharacterized protein LOC1110222962.8e-1137.63Show/hide
Query:  LNVIYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVLLVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM
        +  +Y+ FN+  NHWVM+C D + G++ V DS  A+T DA+L+++L+ +  V+  LL K   +  + +LP+  W I R +S P Q    DCG+
Subjt:  LNVIYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVLLVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM

A0A6J1DQZ3 uncharacterized protein LOC1110234421.1e-1240.86Show/hide
Query:  LNVIYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVLLVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM
        +  +Y+ FN+ GNHWVM+C D + G++ V DS  A+TS A+L+++L  + TV+  LL K  V+  + +LP+  W I R +S P Q +  DCG+
Subjt:  LNVIYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVLLVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM

A0A6J1DRZ7 uncharacterized protein LOC1110238472.5e-1244.95Show/hide
Query:  SNVIFEMTIRSLKKALSQATQ--RDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFG--LTEVV
        S++IFE T+ SLK AL    +  +  VA ++  +E YSLY FP+AFQ  AYETIS+L+  VA  +N DAIP   RWSC++S ++  L  E+F    ++VV
Subjt:  SNVIFEMTIRSLKKALSQATQ--RDAVAGETGRLERYSLYDFPHAFQD-AYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFG--LTEVV

Query:  MQLVSSDAE
        ++L ++D E
Subjt:  MQLVSSDAE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases7.7e-0633.33Show/hide
Query:  IYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVLLVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM
        +YM FN    HWV +C DL   K+T+LDS   L  DA L  EL  LA  +L  LF+     +   + +  + + R   +P  ++  D G+
Subjt:  IYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVLLVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGCTATCAGCAATGTAATTTTTGAGATGACCATAAGGAGTTTGAAGAAAGCACTTAGTCAGGCCACCCAAAGAGACGCCGTGGCTGGAGAGACTGGTCGATTGGA
AAGATATAGTCTTTACGACTTTCCACATGCTTTTCAGGATGCGTATGAGACTATATCATCTCTGACGAATTGTGTTGCGAACTGGATGAACCAGGATGCGATCCCACCCT
TTTCTCGATGGTCATGCTCCCATTCTCCTTCGTACACCCAACTTAGCAGCGAGATATTTGGCTTGACGGAAGTAGTAATGCAATTGGTGTCGAGCGATGCAGAGCTCGAA
CACATGCGTCGTATCGTTTTGCCGCTACAACTAGAGGCCCTTGTTTTGCCACCACAACTAGAGGCCCCTGTTTTCCCACCACAACCAGAACCAATGAATGATGCAAACTT
AGATCATCTTGTGGGGAGTGATAGAGGGTTAGAGGAGGTTGGTTTGGATAGGGGTTCTCCGACAAAGGATGCAGATATGGTTGGGCTCGATGAACAATCGGCACATGAGG
GTCTACCTGAAGGCGTGGGTAAGACCTGCCAGTGTGACTGCAAGCAATCATACGAGTCACTAGACCGACGGATGAAGGAGATGGAATCCGATGTAAAAGGGATGAAATCT
GACTTAGAGTCGATCAAGAAGTACTTGTGTCGGTTATCTAAGGGTAAATTGGTGGTTGATCCTACCAAGTATTTGGGTCCCAACCGTAGTGCAGCAGGTGATGAACCATC
TGATAAAGGAAAGGACCATGTCGTGGAGGAGGGGCGTGGTGGGGTTTCAGAAGATTCGATGGTAGAGGACAAGGGTGTTGAATCAAACTCCCATGAGGTTCAAGAGATCG
TGGAACCTAGAGAAATAGTAAAGCGTCTGGGAGATCGGAAGAGAACTTTTTCTTGGAAACTTCGAACTCCATGGAAGGATACGAGGGCAGGGGTCAAAAAACAAAAGGTC
ATGCCATACAACCTCCTAACTGAGATACCTAAGAAGCTTGATAAACGTTTCCAAAAGTGGTTGGACAACATGGAGGTCATTGACTTGCTTTCTATGTTCGTCTGGAAGAA
ACTGCAACAATGGGCGGACTTGTGTCGTTGGAAAATTTTCACTGCAGATATTATTGTTACCTTAAACGTGATTTACATGTCATTCAACCTCGGGGGAAACCATTGGGTTA
TGGTATGTGCTGATCTCATAGTGGGCAAGTTGACCGTCCTCGATTCATTCACAGCGCTGACATCCGATGCAACCTTGAAGAAGGAGTTGAGCACTTTAGCCACAGTACTA
CTAGTTCTACTGTTCAAGTGCGATGTCATGAAGGCAAAGTCGCATCTCCCAGTTCACGAATGGGAAATCCATAGAGATAGTTCAGTGCCTCACCAGACGAACGGTGTGGA
CTGTGGTATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCGCTATCAGCAATGTAATTTTTGAGATGACCATAAGGAGTTTGAAGAAAGCACTTAGTCAGGCCACCCAAAGAGACGCCGTGGCTGGAGAGACTGGTCGATTGGA
AAGATATAGTCTTTACGACTTTCCACATGCTTTTCAGGATGCGTATGAGACTATATCATCTCTGACGAATTGTGTTGCGAACTGGATGAACCAGGATGCGATCCCACCCT
TTTCTCGATGGTCATGCTCCCATTCTCCTTCGTACACCCAACTTAGCAGCGAGATATTTGGCTTGACGGAAGTAGTAATGCAATTGGTGTCGAGCGATGCAGAGCTCGAA
CACATGCGTCGTATCGTTTTGCCGCTACAACTAGAGGCCCTTGTTTTGCCACCACAACTAGAGGCCCCTGTTTTCCCACCACAACCAGAACCAATGAATGATGCAAACTT
AGATCATCTTGTGGGGAGTGATAGAGGGTTAGAGGAGGTTGGTTTGGATAGGGGTTCTCCGACAAAGGATGCAGATATGGTTGGGCTCGATGAACAATCGGCACATGAGG
GTCTACCTGAAGGCGTGGGTAAGACCTGCCAGTGTGACTGCAAGCAATCATACGAGTCACTAGACCGACGGATGAAGGAGATGGAATCCGATGTAAAAGGGATGAAATCT
GACTTAGAGTCGATCAAGAAGTACTTGTGTCGGTTATCTAAGGGTAAATTGGTGGTTGATCCTACCAAGTATTTGGGTCCCAACCGTAGTGCAGCAGGTGATGAACCATC
TGATAAAGGAAAGGACCATGTCGTGGAGGAGGGGCGTGGTGGGGTTTCAGAAGATTCGATGGTAGAGGACAAGGGTGTTGAATCAAACTCCCATGAGGTTCAAGAGATCG
TGGAACCTAGAGAAATAGTAAAGCGTCTGGGAGATCGGAAGAGAACTTTTTCTTGGAAACTTCGAACTCCATGGAAGGATACGAGGGCAGGGGTCAAAAAACAAAAGGTC
ATGCCATACAACCTCCTAACTGAGATACCTAAGAAGCTTGATAAACGTTTCCAAAAGTGGTTGGACAACATGGAGGTCATTGACTTGCTTTCTATGTTCGTCTGGAAGAA
ACTGCAACAATGGGCGGACTTGTGTCGTTGGAAAATTTTCACTGCAGATATTATTGTTACCTTAAACGTGATTTACATGTCATTCAACCTCGGGGGAAACCATTGGGTTA
TGGTATGTGCTGATCTCATAGTGGGCAAGTTGACCGTCCTCGATTCATTCACAGCGCTGACATCCGATGCAACCTTGAAGAAGGAGTTGAGCACTTTAGCCACAGTACTA
CTAGTTCTACTGTTCAAGTGCGATGTCATGAAGGCAAAGTCGCATCTCCCAGTTCACGAATGGGAAATCCATAGAGATAGTTCAGTGCCTCACCAGACGAACGGTGTGGA
CTGTGGTATGTGA
Protein sequenceShow/hide protein sequence
MLAISNVIFEMTIRSLKKALSQATQRDAVAGETGRLERYSLYDFPHAFQDAYETISSLTNCVANWMNQDAIPPFSRWSCSHSPSYTQLSSEIFGLTEVVMQLVSSDAELE
HMRRIVLPLQLEALVLPPQLEAPVFPPQPEPMNDANLDHLVGSDRGLEEVGLDRGSPTKDADMVGLDEQSAHEGLPEGVGKTCQCDCKQSYESLDRRMKEMESDVKGMKS
DLESIKKYLCRLSKGKLVVDPTKYLGPNRSAAGDEPSDKGKDHVVEEGRGGVSEDSMVEDKGVESNSHEVQEIVEPREIVKRLGDRKRTFSWKLRTPWKDTRAGVKKQKV
MPYNLLTEIPKKLDKRFQKWLDNMEVIDLLSMFVWKKLQQWADLCRWKIFTADIIVTLNVIYMSFNLGGNHWVMVCADLIVGKLTVLDSFTALTSDATLKKELSTLATVL
LVLLFKCDVMKAKSHLPVHEWEIHRDSSVPHQTNGVDCGM