; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003030 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003030
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationChr11:16487251..16491044
RNA-Seq ExpressionHG10003030
SyntenyHG10003030
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ESR41651.1 hypothetical protein CICLE_v10012304mg [Citrus clementina]1.2e-3839.26Show/hide
Query:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGW----------------------------------------------------
        M ++ PY+GV G +  +NL+VA+DQ S TN+W+  GPP QL+VILAGW                                                    
Subjt:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGW----------------------------------------------------

Query:  --------------QDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPK
                      QD+ +GNWW  + +D   +GY+ KELF +LS GA+ VAWGGIA  GKNG+SPP+GSG L N +FR  CYIR I+YV+ QN    P 
Subjt:  --------------QDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPK

Query:  QTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG
           L+Q++  S CYGL+D + CG   +MYYC  +GG GG+CG
Subjt:  QTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG

OMP03992.1 hypothetical protein COLO4_10038 [Corchorus olitorius]2.7e-3843.94Show/hide
Query:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPP---QQLSVILAGWQ-------------------DRKSGNWWFSIYEDGKAIGYFQKELFPNL
        M E   YYG+ G + AYNLSV   Q SS NLWV  GP     QL+VIL GW                    D ++GNWW +I +    +GY+ KEL   L
Subjt:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPP---QQLSVILAGWQ-------------------DRKSGNWWFSIYEDGKAIGYFQKELFPNL

Query:  SNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG
        SNGA  VAWGGIA   K G SPP+GSG+ PN ++  +C+ + ++++N  +  + P      QYI  S CY L D + C G+ +M YCFT+GGPGGKCG
Subjt:  SNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG

XP_022145286.1 uncharacterized protein LOC111014774 [Momordica charantia]3.0e-4548.6Show/hide
Query:  YYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSV-----------ILAGWQDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGK
        YYGV G  S YNLSVAQDQ+SS+N+W++GGPP+ L+V           +   W DR +G+WW ++ +    IGY+ KELF +L++GA+QVAWGGIAK   
Subjt:  YYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSV-----------ILAGWQDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGK

Query:  NGMSPPLGSGNLP-NGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG
        NGMSPPLG+G+ P NG +  ACY + I Y++  N G+ P    +  ++ +S CYGL D       D MY+CFT+GGPGG
Subjt:  NGMSPPLGSGNLP-NGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]5.4e-4749.48Show/hide
Query:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQ------------------DRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGA
        M +   YYG +G++S YNLSVAQDQ+SS+N+W+IGGPPQ  +VILAGWQ                  DR +GNWW ++ E  K IGY+ KELF +L++G 
Subjt:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQ------------------DRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGA

Query:  KQVAWGGIAKKGKNGMSPPLGSGNLPN-GNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG
        +QVAWGGIAK   NGMSPPLG+G+ PN   +  ACY R + YV++ N G  P       Y+ ++ CY L +  TCGG +  YYC T+GGPGG
Subjt:  KQVAWGGIAKKGKNGMSPPLGSGNLPN-GNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG

XP_024038072.1 uncharacterized protein LOC18039972 [Citrus clementina]1.2e-3839.26Show/hide
Query:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGW----------------------------------------------------
        M ++ PY+GV G +  +NL+VA+DQ S TN+W+  GPP QL+VILAGW                                                    
Subjt:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGW----------------------------------------------------

Query:  --------------QDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPK
                      QD+ +GNWW  + +D   +GY+ KELF +LS GA+ VAWGGIA  GKNG+SPP+GSG L N +FR  CYIR I+YV+ QN    P 
Subjt:  --------------QDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPK

Query:  QTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG
           L+Q++  S CYGL+D + CG   +MYYC  +GG GG+CG
Subjt:  QTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG

TrEMBL top hitse value%identityAlignment
A0A1R3KA80 Uncharacterized protein1.3e-3843.94Show/hide
Query:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPP---QQLSVILAGWQ-------------------DRKSGNWWFSIYEDGKAIGYFQKELFPNL
        M E   YYG+ G + AYNLSV   Q SS NLWV  GP     QL+VIL GW                    D ++GNWW +I +    +GY+ KEL   L
Subjt:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPP---QQLSVILAGWQ-------------------DRKSGNWWFSIYEDGKAIGYFQKELFPNL

Query:  SNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG
        SNGA  VAWGGIA   K G SPP+GSG+ PN ++  +C+ + ++++N  +  + P      QYI  S CY L D + C G+ +M YCFT+GGPGGKCG
Subjt:  SNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG

A0A2H5QMC5 Uncharacterized protein5.9e-3939.26Show/hide
Query:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGW----------------------------------------------------
        M ++ PY+GV G +  +NL+VA+DQ S TN+W+  GPP QL+VILAGW                                                    
Subjt:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGW----------------------------------------------------

Query:  --------------QDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPK
                      QD+ +GNWW  + +D   +GY+ KELF +LS GA+ VAWGGIA  GKNG+SPP+GSG L N +FR  CYIR I+YV+ QN    P 
Subjt:  --------------QDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPK

Query:  QTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG
           L+Q++  S CYGL+D + CG   +MYYC  +GG GG+CG
Subjt:  QTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG

A0A6J1CVJ6 uncharacterized protein LOC1110147772.6e-4749.48Show/hide
Query:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQ------------------DRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGA
        M +   YYG +G++S YNLSVAQDQ+SS+N+W+IGGPPQ  +VILAGWQ                  DR +GNWW ++ E  K IGY+ KELF +L++G 
Subjt:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQ------------------DRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGA

Query:  KQVAWGGIAKKGKNGMSPPLGSGNLPN-GNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG
        +QVAWGGIAK   NGMSPPLG+G+ PN   +  ACY R + YV++ N G  P       Y+ ++ CY L +  TCGG +  YYC T+GGPGG
Subjt:  KQVAWGGIAKKGKNGMSPPLGSGNLPN-GNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG

A0A6J1CVW9 uncharacterized protein LOC1110147741.4e-4548.6Show/hide
Query:  YYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSV-----------ILAGWQDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGK
        YYGV G  S YNLSVAQDQ+SS+N+W++GGPP+ L+V           +   W DR +G+WW ++ +    IGY+ KELF +L++GA+QVAWGGIAK   
Subjt:  YYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSV-----------ILAGWQDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGK

Query:  NGMSPPLGSGNLP-NGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG
        NGMSPPLG+G+ P NG +  ACY + I Y++  N G+ P    +  ++ +S CYGL D       D MY+CFT+GGPGG
Subjt:  NGMSPPLGSGNLP-NGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG

V4SWW2 Uncharacterized protein5.9e-3939.26Show/hide
Query:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGW----------------------------------------------------
        M ++ PY+GV G +  +NL+VA+DQ S TN+W+  GPP QL+VILAGW                                                    
Subjt:  MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGW----------------------------------------------------

Query:  --------------QDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPK
                      QD+ +GNWW  + +D   +GY+ KELF +LS GA+ VAWGGIA  GKNG+SPP+GSG L N +FR  CYIR I+YV+ QN    P 
Subjt:  --------------QDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPK

Query:  QTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG
           L+Q++  S CYGL+D + CG   +MYYC  +GG GG+CG
Subjt:  QTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G35250.1 Protein of Unknown Function (DUF239)2.0e-1531.39Show/hide
Query:  LSVILAGWQDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQ
        L+  L  +QD  SGNW   ++++   IGY+ KELF +L+NGA  V +GG   +  +G+SPP+G+G  P  +F+   +   +  +N     +  +  +++ 
Subjt:  LSVILAGWQDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQ

Query:  YIGDSKCYGLQDDRTCGGFDQMY-YCFTYGGPGGKCG
        Y+    C+      T  G+ +     F++GGPGG CG
Subjt:  YIGDSKCYGLQDDRTCGGFDQMY-YCFTYGGPGGKCG

AT4G23390.1 Protein of Unknown Function (DUF239)5.0e-1431.16Show/hide
Query:  QQLSVILAGWQDRKSGNWWFSIYEDGKAIGYFQKELF--PNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGI-PPKQ
        QQ  + ++ +QD  + +WWF +  + + IGY+ K LF    L++GA  V WGG         SP +GSG+ P   F+ A Y+ G++ + D    +  P  
Subjt:  QQLSVILAGWQDRKSGNWWFSIYEDGKAIGYFQKELF--PNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGI-PPKQ

Query:  TELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG
        + L+ +     CY +Q     G F        +GGPGG
Subjt:  TELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG

AT5G11660.1 Protein of Unknown Function (DUF239)6.3e-1731.25Show/hide
Query:  GVTGTMSAYNLSVAQDQASSTNLWVIGGP----PQQLSVILAGWQDRKSGNWWFSIYEDGKA---IGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPP
        G  GT     +     Q S T+L     P     Q+ +V  +  Q++  GNWW +     +    IGY+ KELF  + N    V   G  +   +G+SPP
Subjt:  GVTGTMSAYNLSVAQDQASSTNLWVIGGP----PQQLSVILAGWQDRKSGNWWFSIYEDGKA---IGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPP

Query:  LGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG-KCG
        +G+G LP+ +   + +++G++ V+        K+ +L++ + D+KCYGL+D +    F +    FTYGGPGG  CG
Subjt:  LGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG-KCG

AT5G25410.1 Protein of Unknown Function (DUF239)1.3e-1725.1Show/hide
Query:  NENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGW-----------------------------------------------------
        ++N PY+GV  + S + L++ +DQAS   L+V  G   Q++ I AGW                                                     
Subjt:  NENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGW-----------------------------------------------------

Query:  -------QDRKSGNWWFS---IYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTEL
               QD+++GNWW +          +GY+ KELF  + NGA  V  GG  +    G SPP+G+G  P G+ + +     I  +N            +
Subjt:  -------QDRKSGNWWFS---IYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTEL

Query:  QQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGK-CG
        ++ +   KCYG+  D+       + + F YGG GG+ CG
Subjt:  QQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGK-CG

AT5G60380.1 Protein of Unknown Function (DUF239)2.9e-1425.42Show/hide
Query:  PYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAG----------------------------------------------------------
        PY+G+    +AYNL++ +DQAS + +++  G   +++ I  G                                                          
Subjt:  PYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAG----------------------------------------------------------

Query:  -W---QDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLP-NGNFRVACY----IRGIRYVNDQNLGIPPKQTEL
         W   QD ++ NWW         IGY+ KELF  + NGA  V  GG+ +   +G+SPP+G+G  P  G  R A +    +   +Y   +    P     +
Subjt:  -W---QDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLP-NGNFRVACY----IRGIRYVNDQNLGIPPKQTEL

Query:  QQYIGDSKCYGLQ-DDRTCGGFDQMYYCFTYGGPGG-KCG
         + +  S+CYGL+   R       + Y F YGGPGG  CG
Subjt:  QQYIGDSKCYGLQ-DDRTCGGFDQMYYCFTYGGPGG-KCG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGAAAATCAGCCATACTATGGAGTGACTGGAACAATGTCTGCTTACAATTTAAGTGTTGCTCAAGACCAAGCTTCATCTACAAACTTATGGGTTATTGGTGGCCC
TCCACAACAGCTTAGTGTCATTCTTGCTGGATGGCAGGACCGAAAGTCAGGAAATTGGTGGTTTTCAATTTATGAAGATGGGAAAGCGATCGGATATTTTCAAAAAGAGT
TGTTTCCAAATCTAAGCAATGGGGCAAAACAAGTAGCTTGGGGAGGCATTGCAAAGAAAGGGAAGAATGGAATGAGCCCTCCATTAGGGAGTGGAAATTTACCTAATGGA
AACTTTAGGGTTGCATGTTACATTAGAGGAATTAGATATGTGAATGATCAAAACTTGGGAATACCTCCAAAGCAAACTGAACTTCAACAATATATTGGGGACTCTAAATG
TTATGGTTTGCAAGATGATAGAACTTGTGGGGGTTTTGATCAAATGTATTATTGCTTCACATATGGTGGACCCGGTGGGAAATGTGGGGCTGCTGTTAAAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACGAAAATCAGCCATACTATGGAGTGACTGGAACAATGTCTGCTTACAATTTAAGTGTTGCTCAAGACCAAGCTTCATCTACAAACTTATGGGTTATTGGTGGCCC
TCCACAACAGCTTAGTGTCATTCTTGCTGGATGGCAGGACCGAAAGTCAGGAAATTGGTGGTTTTCAATTTATGAAGATGGGAAAGCGATCGGATATTTTCAAAAAGAGT
TGTTTCCAAATCTAAGCAATGGGGCAAAACAAGTAGCTTGGGGAGGCATTGCAAAGAAAGGGAAGAATGGAATGAGCCCTCCATTAGGGAGTGGAAATTTACCTAATGGA
AACTTTAGGGTTGCATGTTACATTAGAGGAATTAGATATGTGAATGATCAAAACTTGGGAATACCTCCAAAGCAAACTGAACTTCAACAATATATTGGGGACTCTAAATG
TTATGGTTTGCAAGATGATAGAACTTGTGGGGGTTTTGATCAAATGTATTATTGCTTCACATATGGTGGACCCGGTGGGAAATGTGGGGCTGCTGTTAAAACTTAA
Protein sequenceShow/hide protein sequence
MNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQDRKSGNWWFSIYEDGKAIGYFQKELFPNLSNGAKQVAWGGIAKKGKNGMSPPLGSGNLPNG
NFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCGAAVKT