; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038480 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038480
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUlp1-like peptidase
Genome locationchr2:18453234..18456633
RNA-Seq ExpressionLag0038480
SyntenyLag0038480
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148137.1 uncharacterized protein LOC111016890 [Momordica charantia]1.7e-1432.65Show/hide
Query:  LTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWV-LGIEQGRGG----------------RAELKVLCHVVPSYFGRSGLWIQGRNSLWK
        + + W + + + R       DHN+PW+  D+VY P N+GG HWV LGI+  +G                   ELK +C ++P+     G++   R  L  
Subjt:  LTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWV-LGIEQGRGG----------------RAELKVLCHVVPSYFGRSGLWIQGRNSLWK

Query:  MASPSGKIKAAQRDS-GDCGVFVCKFLEYDVTGSSFDTLTQDMMLNF
        +     +++  Q+ S  DCG+F  ++ EYD TGS+ DTLTQD ++ F
Subjt:  MASPSGKIKAAQRDS-GDCGVFVCKFLEYDVTGSSFDTLTQDMMLNF

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]7.3e-1333.91Show/hide
Query:  NFLRSDDGPYKELSSG--VHPRDLT-HEWT-KASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHW-VLGIEQGRGG----------------RAELK
        NFLRS DG Y  + S   +  R  + ++W  +A +++      + D++  W  VD VYLPYN+GG+HW V+ I+   G                   ELK
Subjt:  NFLRSDDGPYKELSSG--VHPRDLT-HEWT-KASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHW-VLGIEQGRGG----------------RAELK

Query:  VLCHVVPSYFGRSGLWIQGRN---SLWKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQDMMLNF
         +  ++P+   R G+ +   N   + W++   S      Q   GDCG+F   F EYDVT  SFDTLTQ  M  F
Subjt:  VLCHVVPSYFGRSGLWIQGRN---SLWKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQDMMLNF

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]2.8e-1230.43Show/hide
Query:  NFLRSDDGPYKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWV-LGIEQGRGG----------------RAELKVLCH
        N LR  DGPY  +  GV P   T++W +   + R       D++  W+  D+VY   N+GG HWV +GI+   G                    LK +C 
Subjt:  NFLRSDDGPYKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWV-LGIEQGRGG----------------RAELKVLCH

Query:  VVPSYFGRSGLWIQGRNSLWKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQ
        ++P+    SG+     N               Q    DC +F  +F EYDV GS  DTL Q
Subjt:  VVPSYFGRSGLWIQGRNSLWKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQ

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida]7.8e-1530.99Show/hide
Query:  DGKRRKNFLRSDDGPYKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWVL-----------------GIEQGRGGRAE
        DG++ +NFLR D                T +W+    V++    ++ D+++PW+ VD VY+P+NL G+HWVL                  +        E
Subjt:  DGKRRKNFLRSDDGPYKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWVL-----------------GIEQGRGGRAE

Query:  LKVLCHVVPSYFGRSGLWIQGRNSL---WKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQD
        ++++C   P      G  ++  N L   W +   +   ++ Q +SGDCG+F  KF EYDVTGS   TLTQD
Subjt:  LKVLCHVVPSYFGRSGLWIQGRNSL---WKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQD

XP_038885861.1 sentrin-specific protease [Benincasa hispida]4.6e-1532.37Show/hide
Query:  DGKRRKNFLRSDDGPYKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWVL-----------------GIEQGRGGRAE
        DG++ +NFLR D                T +W+K +NV++    ++ D+++PW+ VD +Y+P+NL  +HWVL                  +        E
Subjt:  DGKRRKNFLRSDDGPYKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWVL-----------------GIEQGRGGRAE

Query:  LKVLCHVVPSYFGRSGLWIQGRNSL---WKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQDMM
        ++ LC         S + ++  N L   W +   +      Q  SGDCG+F CKF EYDVTGS  DTLTQD M
Subjt:  LKVLCHVVPSYFGRSGLWIQGRNSL---WKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQDMM

TrEMBL top hitse value%identityAlignment
A0A5A7V832 Ulp1-like peptidase2.4e-0932.67Show/hide
Query:  YKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWVLGIEQGRGGRAELKVLCHVVPSYFGRSGLW-IQGRNSLWKMASP
        YKE      P D   E+     V+ S+     D   PW +VD VY P+N+ G HWV            L  L  +VP     +G +  +GR+S +K   P
Subjt:  YKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWVLGIEQGRGGRAELKVLCHVVPSYFGRSGLW-IQGRNSLWKMASP

Query:  SGKIKA--AQRDSGDCGVFVCKFLEYDVTGSSFDTLTQDMMLNFEGNMLY
           + +   QR+S DCGVF  K+ EY   G   DTL Q+ M  F   + +
Subjt:  SGKIKA--AQRDSGDCGVFVCKFLEYDVTGSSFDTLTQDMMLNFEGNMLY

A0A6J1D492 uncharacterized protein LOC1110168908.5e-1532.65Show/hide
Query:  LTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWV-LGIEQGRGG----------------RAELKVLCHVVPSYFGRSGLWIQGRNSLWK
        + + W + + + R       DHN+PW+  D+VY P N+GG HWV LGI+  +G                   ELK +C ++P+     G++   R  L  
Subjt:  LTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWV-LGIEQGRGG----------------RAELKVLCHVVPSYFGRSGLWIQGRNSLWK

Query:  MASPSGKIKAAQRDS-GDCGVFVCKFLEYDVTGSSFDTLTQDMMLNF
        +     +++  Q+ S  DCG+F  ++ EYD TGS+ DTLTQD ++ F
Subjt:  MASPSGKIKAAQRDS-GDCGVFVCKFLEYDVTGSSFDTLTQDMMLNF

A0A6J1DID7 uncharacterized protein LOC1110207823.1e-0930.94Show/hide
Query:  HEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWV-LGIEQGRGG----------------RAELKVLCHVVPSYFGRSGLWIQGRNSLWKMA
        ++W +   + R       D++ PW+  D+VY P N+GG HWV +GI+   G                    LK +C ++P     SG+ +  R  L  + 
Subjt:  HEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWV-LGIEQGRGG----------------RAELKVLCHVVPSYFGRSGLWIQGRNSLWKMA

Query:  SPSGKIKAAQRDS-GDCGVFVCKFLEYDVTGSSFDTLTQ
            +    Q+    DCG+F  +F EYDVTGS  DTL Q
Subjt:  SPSGKIKAAQRDS-GDCGVFVCKFLEYDVTGSSFDTLTQ

A0A6J1DLV0 uncharacterized protein LOC1110216463.6e-1333.91Show/hide
Query:  NFLRSDDGPYKELSSG--VHPRDLT-HEWT-KASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHW-VLGIEQGRGG----------------RAELK
        NFLRS DG Y  + S   +  R  + ++W  +A +++      + D++  W  VD VYLPYN+GG+HW V+ I+   G                   ELK
Subjt:  NFLRSDDGPYKELSSG--VHPRDLT-HEWT-KASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHW-VLGIEQGRGG----------------RAELK

Query:  VLCHVVPSYFGRSGLWIQGRN---SLWKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQDMMLNF
         +  ++P+   R G+ +   N   + W++   S      Q   GDCG+F   F EYDVT  SFDTLTQ  M  F
Subjt:  VLCHVVPSYFGRSGLWIQGRN---SLWKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQDMMLNF

A0A6J1DY60 uncharacterized protein LOC1110252731.4e-1230.43Show/hide
Query:  NFLRSDDGPYKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWV-LGIEQGRGG----------------RAELKVLCH
        N LR  DGPY  +  GV P   T++W +   + R       D++  W+  D+VY   N+GG HWV +GI+   G                    LK +C 
Subjt:  NFLRSDDGPYKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPYNLGGLHWV-LGIEQGRGG----------------RAELKVLCH

Query:  VVPSYFGRSGLWIQGRNSLWKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQ
        ++P+    SG+     N               Q    DC +F  +F EYDV GS  DTL Q
Subjt:  VVPSYFGRSGLWIQGRNSLWKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGATAATCAGTTCGTGCTTGCTCGAAGATTCGTGTTGGGTTGTTGGGTTTTCTGTTCGCAGACGAAGCATCCCGGTCGGGATGAATATGGGATGCATATTGCTTG
GCCCGATCCCGGTCGGGCCACCAGATTATTCCCTAACCTCCGATTTGAGAATGATGAGGACACAGTTAAGATAGCCTTGTTTTATTTCATCGAGCTTGCGATGATGGGGA
GAGAAAGGAAACAACAAATGGATACAAGCCTGCTAGGCAAGGGTCACATTTCAACTTTTGGCCACAGAGGAAGAGGTTCAATTATGGACCGTGTGATGGAGCCGTCACAT
GCCCACTTCCATCCTACCTCATTCTCCTCCTCCAGCTCCATTCCATCACTTCCTGGTATGCATGTTGACGATGTAGATGCTAAGACTCATGATAGGACGGAGGATGTTGG
GACTAGTTCTGAGGCTCTGACAGAACTTGCAAGAAGATCTGGATCTGGCGCGAAAGGAGCAGACACCTCGTTGATTGTGGCTACTGGTTCAGTCGTTCAGACTGACACCC
AACAAAAAGCTCTAGTAGTGGAAGGGACCGGGTGGTGGTGGTGGTCCAGTTCAAGGGTGGAGAAGGGAATCATTGATATAGATGAGGGGGATAATGGTAAAGATCGGGAA
CACACTGATAGTGTTGAACTTGTAGTGGACAAGGAAGGTGTCCAGTCTACTTCACAACAAAACGAGCCCATTGAACGACGGGGGACTCGGAAGAGGAAGACTGCAAGGAA
GCTGAGAACTCCATGGAAAGACACAAGGGAAGACGGGAAGAGGCGAAAGAATTTTTTGAGAAGTGACGACGGTCCGTACAAAGAGCTAAGCAGTGGCGTACACCCCCGGG
ACTTGACACACGAATGGACCAAGGCATCAAACGTCATGAGGTCGCGAGGGAGTGAGAATTTAGATCACAATATCCCATGGACCACTGTTGATGTAGTGTACTTGCCTTAT
AATCTCGGTGGTCTCCATTGGGTTTTGGGCATTGAACAAGGAAGAGGTGGTCGAGCAGAGTTGAAGGTCCTTTGCCACGTCGTGCCTAGTTACTTTGGAAGATCGGGGCT
ATGGATTCAAGGAAGGAACTCTCTGTGGAAGATGGCCTCTCCGTCTGGAAAAATCAAGGCCGCACAGCGTGATAGTGGTGACTGTGGAGTGTTTGTATGTAAATTCTTAG
AGTACGATGTAACAGGGTCGTCATTCGACACCCTTACTCAAGATATGATGCTGAATTTCGAAGGCAATATGTTGTACAATTGTGGGCCAATATGCCATTTTCATAGTGAT
TTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCGATAATCAGTTCGTGCTTGCTCGAAGATTCGTGTTGGGTTGTTGGGTTTTCTGTTCGCAGACGAAGCATCCCGGTCGGGATGAATATGGGATGCATATTGCTTG
GCCCGATCCCGGTCGGGCCACCAGATTATTCCCTAACCTCCGATTTGAGAATGATGAGGACACAGTTAAGATAGCCTTGTTTTATTTCATCGAGCTTGCGATGATGGGGA
GAGAAAGGAAACAACAAATGGATACAAGCCTGCTAGGCAAGGGTCACATTTCAACTTTTGGCCACAGAGGAAGAGGTTCAATTATGGACCGTGTGATGGAGCCGTCACAT
GCCCACTTCCATCCTACCTCATTCTCCTCCTCCAGCTCCATTCCATCACTTCCTGGTATGCATGTTGACGATGTAGATGCTAAGACTCATGATAGGACGGAGGATGTTGG
GACTAGTTCTGAGGCTCTGACAGAACTTGCAAGAAGATCTGGATCTGGCGCGAAAGGAGCAGACACCTCGTTGATTGTGGCTACTGGTTCAGTCGTTCAGACTGACACCC
AACAAAAAGCTCTAGTAGTGGAAGGGACCGGGTGGTGGTGGTGGTCCAGTTCAAGGGTGGAGAAGGGAATCATTGATATAGATGAGGGGGATAATGGTAAAGATCGGGAA
CACACTGATAGTGTTGAACTTGTAGTGGACAAGGAAGGTGTCCAGTCTACTTCACAACAAAACGAGCCCATTGAACGACGGGGGACTCGGAAGAGGAAGACTGCAAGGAA
GCTGAGAACTCCATGGAAAGACACAAGGGAAGACGGGAAGAGGCGAAAGAATTTTTTGAGAAGTGACGACGGTCCGTACAAAGAGCTAAGCAGTGGCGTACACCCCCGGG
ACTTGACACACGAATGGACCAAGGCATCAAACGTCATGAGGTCGCGAGGGAGTGAGAATTTAGATCACAATATCCCATGGACCACTGTTGATGTAGTGTACTTGCCTTAT
AATCTCGGTGGTCTCCATTGGGTTTTGGGCATTGAACAAGGAAGAGGTGGTCGAGCAGAGTTGAAGGTCCTTTGCCACGTCGTGCCTAGTTACTTTGGAAGATCGGGGCT
ATGGATTCAAGGAAGGAACTCTCTGTGGAAGATGGCCTCTCCGTCTGGAAAAATCAAGGCCGCACAGCGTGATAGTGGTGACTGTGGAGTGTTTGTATGTAAATTCTTAG
AGTACGATGTAACAGGGTCGTCATTCGACACCCTTACTCAAGATATGATGCTGAATTTCGAAGGCAATATGTTGTACAATTGTGGGCCAATATGCCATTTTCATAGTGAT
TTGTAG
Protein sequenceShow/hide protein sequence
MTDNQFVLARRFVLGCWVFCSQTKHPGRDEYGMHIAWPDPGRATRLFPNLRFENDEDTVKIALFYFIELAMMGRERKQQMDTSLLGKGHISTFGHRGRGSIMDRVMEPSH
AHFHPTSFSSSSSIPSLPGMHVDDVDAKTHDRTEDVGTSSEALTELARRSGSGAKGADTSLIVATGSVVQTDTQQKALVVEGTGWWWWSSSRVEKGIIDIDEGDNGKDRE
HTDSVELVVDKEGVQSTSQQNEPIERRGTRKRKTARKLRTPWKDTREDGKRRKNFLRSDDGPYKELSSGVHPRDLTHEWTKASNVMRSRGSENLDHNIPWTTVDVVYLPY
NLGGLHWVLGIEQGRGGRAELKVLCHVVPSYFGRSGLWIQGRNSLWKMASPSGKIKAAQRDSGDCGVFVCKFLEYDVTGSSFDTLTQDMMLNFEGNMLYNCGPICHFHSD
L