; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027633 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027633
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationscaffold6:37798587..37801461
RNA-Seq ExpressionSpg027633
SyntenySpg027633
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK03389.1 uncharacterized protein E5676_scaffold84663G00280 [Cucumis melo var. makuwa]2.9e-1232.82Show/hide
Query:  MVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDDL
        MV++   C+LVYI +LW   +E     NF V+D + I+      + R R+L      +   Q + IPYN G HW+L V+++R+N VY+LDSL   + +D+
Subjt:  MVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDDL

Query:  KHVVNT-------SHKLVLRLENSGEDQGGC
          V+N        +  + L + +  E   GC
Subjt:  KHVVNT-------SHKLVLRLENSGEDQGGC

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]6.3e-1536.94Show/hide
Query:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD
        +M+++  +C+L YIA+LW  +E       F +VD   I+P   + E R R+L     ++   Q++ IPY  G HW+L ++N+R+N VY+LDSL   + +D
Subjt:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD

Query:  LKHVVNTSHKL
         + V+NTS K+
Subjt:  LKHVVNTSHKL

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]6.3e-1536.94Show/hide
Query:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD
        +M+++  +C+L YIA+LW  +E       F +VD   I+P   + E R R+L     ++   Q++ IPY  G HW+L ++N+R+N VY+LDSL   + +D
Subjt:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD

Query:  LKHVVNTSHKL
         + V+NTS K+
Subjt:  LKHVVNTSHKL

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]6.3e-1536.94Show/hide
Query:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD
        +M+++  +C+L YIA+LW  +E       F +VD   I+P   + E R R+L     ++   Q++ IPY  G HW+L ++N+R+N VY+LDSL   + +D
Subjt:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD

Query:  LKHVVNTSHKL
         + V+NTS K+
Subjt:  LKHVVNTSHKL

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]1.7e-1234.86Show/hide
Query:  MVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDDL
        M ++  +C+L YIA LW    ++     F +VD   I+      E R+++L     ++   Q++ IPYN G HWIL ++N+++N VY++DSL   +L++ 
Subjt:  MVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDDL

Query:  KHVVNTSHK
        + V+NTS K
Subjt:  KHVVNTSHK

TrEMBL top hitse value%identityAlignment
A0A5A7U441 Transposase2.4e-1235.78Show/hide
Query:  MVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDDL
        MV++   C+L YI +LW   +E     NF V+D + I+      + R+R+L      +   Q + IPYN G HW+L V+++R+N VY+LDSL   + +D+
Subjt:  MVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDDL

Query:  KHVVNTSHK
          V+N   K
Subjt:  KHVVNTSHK

A0A5D3BZ91 ULP_PROTEASE domain-containing protein1.4e-1232.82Show/hide
Query:  MVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDDL
        MV++   C+LVYI +LW   +E     NF V+D + I+      + R R+L      +   Q + IPYN G HW+L V+++R+N VY+LDSL   + +D+
Subjt:  MVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDDL

Query:  KHVVNT-------SHKLVLRLENSGEDQGGC
          V+N        +  + L + +  E   GC
Subjt:  KHVVNT-------SHKLVLRLENSGEDQGGC

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X13.1e-1536.94Show/hide
Query:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD
        +M+++  +C+L YIA+LW  +E       F +VD   I+P   + E R R+L     ++   Q++ IPY  G HW+L ++N+R+N VY+LDSL   + +D
Subjt:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD

Query:  LKHVVNTSHKL
         + V+NTS K+
Subjt:  LKHVVNTSHKL

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X43.1e-1536.94Show/hide
Query:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD
        +M+++  +C+L YIA+LW  +E       F +VD   I+P   + E R R+L     ++   Q++ IPY  G HW+L ++N+R+N VY+LDSL   + +D
Subjt:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD

Query:  LKHVVNTSHKL
         + V+NTS K+
Subjt:  LKHVVNTSHKL

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X23.1e-1536.94Show/hide
Query:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD
        +M+++  +C+L YIA+LW  +E       F +VD   I+P   + E R R+L     ++   Q++ IPY  G HW+L ++N+R+N VY+LDSL   + +D
Subjt:  SMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVFSLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDD

Query:  LKHVVNTSHKL
         + V+NTS K+
Subjt:  LKHVVNTSHKL

SwissProt top hitse value%identityAlignment
Q9FH04 Alcohol dehydrogenase-like 71.8e-0477.78Show/hide
Query:  VIIEMTSGGADYCFECVGMASLVHEAF
        VI EMT GGADYCFECVG +SLV EA+
Subjt:  VIIEMTSGGADYCFECVGMASLVHEAF

Arabidopsis top hitse value%identityAlignment
AT1G22430.1 GroES-like zinc-binding dehydrogenase family protein8.2e-0574.07Show/hide
Query:  VIIEMTSGGADYCFECVGMASLVHEAF
        VI EMT GG DY FECVG+ASL++EAF
Subjt:  VIIEMTSGGADYCFECVGMASLVHEAF

AT1G22430.2 GroES-like zinc-binding dehydrogenase family protein8.2e-0574.07Show/hide
Query:  VIIEMTSGGADYCFECVGMASLVHEAF
        VI EMT GG DY FECVG+ASL++EAF
Subjt:  VIIEMTSGGADYCFECVGMASLVHEAF

AT1G22440.1 Zinc-binding alcohol dehydrogenase family protein5.3e-0470.37Show/hide
Query:  VIIEMTSGGADYCFECVGMASLVHEAF
        VI EMT  GADY FEC+G+ASL+ EAF
Subjt:  VIIEMTSGGADYCFECVGMASLVHEAF

AT4G22110.1 GroES-like zinc-binding dehydrogenase family protein5.3e-0467.86Show/hide
Query:  VIIEMTSGGADYCFECVGMASLVHEAFN
        VI EMT GG DY FECVG+ SL+ EAF+
Subjt:  VIIEMTSGGADYCFECVGMASLVHEAFN

AT5G42250.1 Zinc-binding alcohol dehydrogenase family protein1.3e-0577.78Show/hide
Query:  VIIEMTSGGADYCFECVGMASLVHEAF
        VI EMT GGADYCFECVG +SLV EA+
Subjt:  VIIEMTSGGADYCFECVGMASLVHEAF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCGAAAAAGTTTGGAGTTACTGAGTTTGTTCATTCTGGAAGTCTTGGGGGATAAATCTGTAATTATTGAAATGACTAGTGGTGGTGCAGACTATTGCTTTGAATG
TGTGGGAATGGCTTCCTTAGTTCACGAAGCATTTAATTTGCATGCTGCAGACAGTATGGTCGACGTATCAAATACTTGTGTATTGGTCTATATTGCGTTCCTTTGGAAGC
ATTTTGAGGAGACTGGTAGACTAGACAATTTTAAGGTCGTGGACTCAAACGACATTGCACCGATGTTTGGGACGCTAGAAAAACGTGCAAGAAGTTTAACTACCGTCTTT
TCTTTACTACAACCAGGGCAAATGATATTCATTCCATATAATCCTGGGAATCACTGGATATTGTGTGTTGTAAATGTAAGAGACAACACCGTTTATCTATTGGACTCCTT
ACATCCTAATCTCTTGGATGACCTCAAACATGTTGTAAACACCTCCCACAAGCTCGTTCTAAGGCTAGAGAATAGCGGGGAAGATCAAGGTGGTTGTCTCGAGCCCGTTC
GTGAGAAGAAAACAACAAAGGAGTTGCTTGATTCAGCCATAAAAGGTTGGTTTATCGAAACTCTGTTTTCGGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGCGAAAAAGTTTGGAGTTACTGAGTTTGTTCATTCTGGAAGTCTTGGGGGATAAATCTGTAATTATTGAAATGACTAGTGGTGGTGCAGACTATTGCTTTGAATG
TGTGGGAATGGCTTCCTTAGTTCACGAAGCATTTAATTTGCATGCTGCAGACAGTATGGTCGACGTATCAAATACTTGTGTATTGGTCTATATTGCGTTCCTTTGGAAGC
ATTTTGAGGAGACTGGTAGACTAGACAATTTTAAGGTCGTGGACTCAAACGACATTGCACCGATGTTTGGGACGCTAGAAAAACGTGCAAGAAGTTTAACTACCGTCTTT
TCTTTACTACAACCAGGGCAAATGATATTCATTCCATATAATCCTGGGAATCACTGGATATTGTGTGTTGTAAATGTAAGAGACAACACCGTTTATCTATTGGACTCCTT
ACATCCTAATCTCTTGGATGACCTCAAACATGTTGTAAACACCTCCCACAAGCTCGTTCTAAGGCTAGAGAATAGCGGGGAAGATCAAGGTGGTTGTCTCGAGCCCGTTC
GTGAGAAGAAAACAACAAAGGAGTTGCTTGATTCAGCCATAAAAGGTTGGTTTATCGAAACTCTGTTTTCGGTTTAG
Protein sequenceShow/hide protein sequence
MQRKSLELLSLFILEVLGDKSVIIEMTSGGADYCFECVGMASLVHEAFNLHAADSMVDVSNTCVLVYIAFLWKHFEETGRLDNFKVVDSNDIAPMFGTLEKRARSLTTVF
SLLQPGQMIFIPYNPGNHWILCVVNVRDNTVYLLDSLHPNLLDDLKHVVNTSHKLVLRLENSGEDQGGCLEPVREKKTTKELLDSAIKGWFIETLFSV