CuGenDBv2

Spg000202 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg000202
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: ULP_PROTEASE domain-containing protein
Genome location: scaffold6:21309227..21317072
RNA-Seq Expression: Spg000202
Synteny: Spg000202
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus]; e value 5.0e-37; % identity 36.6
Query:  DELAAANQGQDILTEALGAPDIEGML-----------------------------------------------------EEWGKTEGSQS----KKSIEV
        DELAA  +GQDILTEALG P+  G +                                                     +E  +T  SQS    KK+ E 
Subjt:  DELAAANQGQDILTEALGAPDIEGML-----------------------------------------------------EEWGKTEGSQS----KKSIEV

Query:  ESKASRTKTKGKEIVDEPDKVL------ESKEVLEGTPCRLAVTSVDNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETL
        + +  +   KGK +V +P+++L      E + +L+G PC LA+ S+DN+ AIG M+ES V CPTIHG+PLGA+N+RV VD+I  +DV + IP+ GEIETL
Subjt:  ESKASRTKTKGKEIVDEPDKVL------ESKEVLEGTPCRLAVTSVDNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETL

Query:  SQAKGSFVAWPRELVILNNPKK----------------DDLGNLVSILSGLWTPVCPLGP------------TGSSFRALSDDIMQYCGMAEIGYSCILV
        +QA G+FVAWPR+LVIL   KK                 D+   + +L+        +                 +     DDI+QYCGM EIGYSCIL 
Subjt:  SQAKGSFVAWPRELVILNNPKK----------------DDLGNLVSILSGLWTPVCPLGP------------TGSSFRALSDDIMQYCGMAEIGYSCILV

Query:  YIAYLW
        YIA LW
Subjt:  YIAYLW

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]; e value 2.9e-40; % identity 42.69
Query:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV
        DELAA ++G+DILTEALG  +  G +   G                KT+  Q  KS    S  S+ K+KGKEIV+  +++ +  ++ +EG PC LAV SV
Subjt:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV

Query:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------
        DNI A+GT+++++V CPT+HGVPLG +NVRV+VD++ D+   I IPV GEIETL+Q  G FVAWPR LVIL+  K                         
Subjt:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------

Query:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW
        L N   +LS        +  +   F          +DIMQYC M EIGYSCIL YIAYLW
Subjt:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]; e value 2.9e-40; % identity 42.69
Query:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV
        DELAA ++G+DILTEALG  +  G +   G                KT+  Q  KS    S  S+ K+KGKEIV+  +++ +  ++ +EG PC LAV SV
Subjt:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV

Query:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------
        DNI A+GT+++++V CPT+HGVPLG +NVRV+VD++ D+   I IPV GEIETL+Q  G FVAWPR LVIL+  K                         
Subjt:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------

Query:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW
        L N   +LS        +  +   F          +DIMQYC M EIGYSCIL YIAYLW
Subjt:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]; e value 2.9e-40; % identity 42.69
Query:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV
        DELAA ++G+DILTEALG  +  G +   G                KT+  Q  KS    S  S+ K+KGKEIV+  +++ +  ++ +EG PC LAV SV
Subjt:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV

Query:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------
        DNI A+GT+++++V CPT+HGVPLG +NVRV+VD++ D+   I IPV GEIETL+Q  G FVAWPR LVIL+  K                         
Subjt:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------

Query:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW
        L N   +LS        +  +   F          +DIMQYC M EIGYSCIL YIAYLW
Subjt:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]; e value 2.9e-40; % identity 42.69
Query:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV
        DELAA ++G+DILTEALG  +  G +   G                KT+  Q  KS    S  S+ K+KGKEIV+  +++ +  ++ +EG PC LAV SV
Subjt:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV

Query:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------
        DNI A+GT+++++V CPT+HGVPLG +NVRV+VD++ D+   I IPV GEIETL+Q  G FVAWPR LVIL+  K                         
Subjt:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------

Query:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW
        L N   +LS        +  +   F          +DIMQYC M EIGYSCIL YIAYLW
Subjt:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X1; e value 3.0e-35; % identity 35.95
Query:  DELAAANQGQDILTEALGAPDIEGMLEEWG---------------------------------KTEGSQSKKSIEVE------------SKASRTKTKGK
        DELAA  +GQDILTEALG P+  G +   G                                 + E  QSK   E +            S  SR KTKGK
Subjt:  DELAAANQGQDILTEALGAPDIEGMLEEWG---------------------------------KTEGSQSKKSIEVE------------SKASRTKTKGK

Query:  E------------IVDEPDKVL------ESKEVLEGTPCRLAVTSVDNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETL
        +            +V E ++ L      E + + +G PC LA+ S+DN+ A+G M+ES V CPTIHG+PLGA N+RV VD+   +DV + IP+ G+IETL
Subjt:  E------------IVDEPDKVL------ESKEVLEGTPCRLAVTSVDNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETL

Query:  SQAKGSFVAWPRELVILNNPKK----------------DDLGNLVSILSGLWTPVCPLGP------------TGSSFRALSDDIMQYCGMAEIGYSCILV
        +QA G+FVAWPR+LVI+   KK                 D+   + +L+        +                 +     DDI+QYCGM EIGYSCIL 
Subjt:  SQAKGSFVAWPRELVILNNPKK----------------DDLGNLVSILSGLWTPVCPLGP------------TGSSFRALSDDIMQYCGMAEIGYSCILV

Query:  YIAYLW
        YIA LW
Subjt:  YIAYLW

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X1; e value 1.4e-40; % identity 42.69
Query:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV
        DELAA ++G+DILTEALG  +  G +   G                KT+  Q  KS    S  S+ K+KGKEIV+  +++ +  ++ +EG PC LAV SV
Subjt:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV

Query:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------
        DNI A+GT+++++V CPT+HGVPLG +NVRV+VD++ D+   I IPV GEIETL+Q  G FVAWPR LVIL+  K                         
Subjt:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------

Query:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW
        L N   +LS        +  +   F          +DIMQYC M EIGYSCIL YIAYLW
Subjt:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X4; e value 1.4e-40; % identity 42.69
Query:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV
        DELAA ++G+DILTEALG  +  G +   G                KT+  Q  KS    S  S+ K+KGKEIV+  +++ +  ++ +EG PC LAV SV
Subjt:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV

Query:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------
        DNI A+GT+++++V CPT+HGVPLG +NVRV+VD++ D+   I IPV GEIETL+Q  G FVAWPR LVIL+  K                         
Subjt:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------

Query:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW
        L N   +LS        +  +   F          +DIMQYC M EIGYSCIL YIAYLW
Subjt:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW

A0A6J1C398 uncharacterized protein LOC111007859 isoform X3; e value 1.4e-40; % identity 42.69
Query:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV
        DELAA ++G+DILTEALG  +  G +   G                KT+  Q  KS    S  S+ K+KGKEIV+  +++ +  ++ +EG PC LAV SV
Subjt:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV

Query:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------
        DNI A+GT+++++V CPT+HGVPLG +NVRV+VD++ D+   I IPV GEIETL+Q  G FVAWPR LVIL+  K                         
Subjt:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------

Query:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW
        L N   +LS        +  +   F          +DIMQYC M EIGYSCIL YIAYLW
Subjt:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X2; e value 1.4e-40; % identity 42.69
Query:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV
        DELAA ++G+DILTEALG  +  G +   G                KT+  Q  KS    S  S+ K+KGKEIV+  +++ +  ++ +EG PC LAV SV
Subjt:  DELAAANQGQDILTEALGAPDIEGMLEEWG----------------KTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKV-LESKEVLEGTPCRLAVTSV

Query:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------
        DNI A+GT+++++V CPT+HGVPLG +NVRV+VD++ D+   I IPV GEIETL+Q  G FVAWPR LVIL+  K                         
Subjt:  DNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDD----------------------

Query:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW
        L N   +LS        +  +   F          +DIMQYC M EIGYSCIL YIAYLW
Subjt:  LGNLVSILSGLWTPVCPLGPTGSSF------RALSDDIMQYCGMAEIGYSCILVYIAYLW

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences

CDS sequence
ATGGAAACGAAAAGGAGAAGTCTTCACTGCGTCGTCGTCTGGAATCGAGAAGTAGAAGATGAAGAGATGCGGCTGGGAGGAATCGAGAATGAGAAGAGAAGACTAAAGAC
TGAAGGTTATATTCCGTTGTTGACGTTGAGTGTACTCCGTGACAACGATGCTGTCGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCTA
GATCAAGCTCCAGTTACTTTCCAAACCAGGTTATGTCAGGGCTGCACCATGTGTATGTTGAAAGGTTCAGCGTTTCAATAGGGTCAACAGGGATCGTTAGAGGTGACGAT
GTCTGTTGGCTTCACGCCGTCTTTCGGGCTAAGCTAGCAGGTGGTCCGGGAGGGGGTGTGACAGACACCGGTGGAACTGGAGAAATGCGGTCACATTTGATTTTTACCTT
CCCAGAAATGCGATCGCATTTCTGGGACGAGCTAGCTGCAGCAAATCAAGGTCAAGATATATTGACTGAAGCATTAGGCGCGCCAGACATAGAGGGCATGTTAGAGGAGT
GGGGTAAGACTGAAGGCTCACAATCAAAGAAGTCAATAGAGGTAGAAAGTAAGGCTTCAAGGACAAAAACGAAAGGAAAGGAGATTGTTGACGAGCCAGACAAAGTGTTA
GAGTCAAAAGAAGTGTTAGAGGGAACACCATGTCGCTTGGCTGTAACTTCAGTGGATAACATTGCTGCCATAGGCACAATGTATGAATCTAGTGTCGGATGTCCAACGAT
TCATGGAGTACCACTAGGAGCCAATAATGTTCGAGTGGTGGTGGATATGATCACAGACAAAGATGTTCTCATACAAATTCCTGTGGTTGGAGAAATAGAGACGCTTAGCC
AAGCAAAGGGTAGCTTCGTGGCATGGCCTCGCGAGCTTGTGATTTTGAATAACCCGAAAAAGGATGACCTAGGCAACTTGGTCTCAATCCTGAGTGGATTATGGACTCCT
GTCTGTCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGTGATGACATCATGCAATATTGTGGGATGGCTGAGATAGGGTACTCATGTATACTGGTGTACATTGC
GTATCTTTGGACTTTTAACACGAAAAAAGCATATACGCAAGAAGAAATTGACGAGGTTCGGATTGAGTGGGCAGGGTTCATGGGAAGATTTATGTAG

mRNA sequence
ATGGAAACGAAAAGGAGAAGTCTTCACTGCGTCGTCGTCTGGAATCGAGAAGTAGAAGATGAAGAGATGCGGCTGGGAGGAATCGAGAATGAGAAGAGAAGACTAAAGAC
TGAAGGTTATATTCCGTTGTTGACGTTGAGTGTACTCCGTGACAACGATGCTGTCGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCTA
GATCAAGCTCCAGTTACTTTCCAAACCAGGTTATGTCAGGGCTGCACCATGTGTATGTTGAAAGGTTCAGCGTTTCAATAGGGTCAACAGGGATCGTTAGAGGTGACGAT
GTCTGTTGGCTTCACGCCGTCTTTCGGGCTAAGCTAGCAGGTGGTCCGGGAGGGGGTGTGACAGACACCGGTGGAACTGGAGAAATGCGGTCACATTTGATTTTTACCTT
CCCAGAAATGCGATCGCATTTCTGGGACGAGCTAGCTGCAGCAAATCAAGGTCAAGATATATTGACTGAAGCATTAGGCGCGCCAGACATAGAGGGCATGTTAGAGGAGT
GGGGTAAGACTGAAGGCTCACAATCAAAGAAGTCAATAGAGGTAGAAAGTAAGGCTTCAAGGACAAAAACGAAAGGAAAGGAGATTGTTGACGAGCCAGACAAAGTGTTA
GAGTCAAAAGAAGTGTTAGAGGGAACACCATGTCGCTTGGCTGTAACTTCAGTGGATAACATTGCTGCCATAGGCACAATGTATGAATCTAGTGTCGGATGTCCAACGAT
TCATGGAGTACCACTAGGAGCCAATAATGTTCGAGTGGTGGTGGATATGATCACAGACAAAGATGTTCTCATACAAATTCCTGTGGTTGGAGAAATAGAGACGCTTAGCC
AAGCAAAGGGTAGCTTCGTGGCATGGCCTCGCGAGCTTGTGATTTTGAATAACCCGAAAAAGGATGACCTAGGCAACTTGGTCTCAATCCTGAGTGGATTATGGACTCCT
GTCTGTCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGTGATGACATCATGCAATATTGTGGGATGGCTGAGATAGGGTACTCATGTATACTGGTGTACATTGC
GTATCTTTGGACTTTTAACACGAAAAAAGCATATACGCAAGAAGAAATTGACGAGGTTCGGATTGAGTGGGCAGGGTTCATGGGAAGATTTATGTAG

Protein sequence
METKRRSLHCVVVWNREVEDEEMRLGGIENEKRRLKTEGYIPLLTLSVLRDNDAVVEIELPVPDTLPTSAESSRSSSSYFPNQVMSGLHHVYVERFSVSIGSTGIVRGDD
VCWLHAVFRAKLAGGPGGGVTDTGGTGEMRSHLIFTFPEMRSHFWDELAAANQGQDILTEALGAPDIEGMLEEWGKTEGSQSKKSIEVESKASRTKTKGKEIVDEPDKVL
ESKEVLEGTPCRLAVTSVDNIAAIGTMYESSVGCPTIHGVPLGANNVRVVVDMITDKDVLIQIPVVGEIETLSQAKGSFVAWPRELVILNNPKKDDLGNLVSILSGLWTP
VCPLGPTGSSFRALSDDIMQYCGMAEIGYSCILVYIAYLWTFNTKKAYTQEEIDEVRIEWAGFMGRFM
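
Consistency check (illustrative sketch)
The CDS and protein sequences above can be checked against each other by translating the nucleotide string. The sketch below assumes the standard nuclear genetic code applies to this gene. The `translate` function and the table construction are hypothetical helpers written for this note, not part of CuGenDB. Only the first ten codons of the CDS are included for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    across several lines can be passed in directly. Codons containing
    characters outside A/C/G/T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
cds_start = "ATGGAAACGAAAAGGAGAAGTCTTCACTGC"
print(translate(cds_start))  # METKRRSLHC
```

Passing the full CDS instead of `cds_start` should reproduce the protein sequence listed above, with the terminal TAG stop codon excluded.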