CuGenDBv2

HG10019958 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10019958
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Myb family transcription factor family protein
Genome location: Chr04:27318982..27319500
RNA-Seq Expression: HG10019958
Synteny: HG10019958
Gene Ontology terms: GO:0000166 - nucleotide binding (molecular function); GO:0003774 - motor activity (molecular function)
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
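
The genome location field uses a `Chr:start..end` form. As a rough illustration (not part of CuGenDB itself), the sketch below parses that string and computes the span length. It assumes the coordinates are 1-based and inclusive, and `parse_location` is a hypothetical helper name.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string like 'Chr04:27318982..27319500' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr04:27318982..27319500")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, span)
```

Under that assumption the span is 519 bp, which is a multiple of three (173 codons including the stop codon).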


Homology
GenBank top hits (e value, % identity, alignment)
TYK25410.1 uncharacterized protein E5676_scaffold352G005520 [Cucumis melo var. makuwa]; e value: 9.4e-49; % identity: 72.46
Query:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY
        S  SSLQVL S SSS+LRL +KFKA+LQTLI S+ARAISRAKTTA          ++IALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLT+DH  
Subjt:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY

Query:  SGAA--DQLGGYLQWLEERD-----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESY
           A  DQLGGYLQWLEERD     N+NN     ED+ + VNEIDKLAEIFIA CHEKFKLEKQESY
Subjt:  SGAA--DQLGGYLQWLEERD-----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESY

XP_008443602.1 PREDICTED: uncharacterized protein LOC103487158 [Cucumis melo]; e value: 5.2e-55; % identity: 74.16
Query:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY
        S  SSLQVL S SSS+LRL +KFKA+LQTLI S+ARAISRAKTTA          ++IALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLT+DH  
Subjt:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY

Query:  SGAA--DQLGGYLQWLEERD-----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF
           A  DQLGGYLQWLEERD     N+NN     ED+ + VNEIDKLAEIFIA CHEKFKLEKQESYRRFQDMMARSF
Subjt:  SGAA--DQLGGYLQWLEERD-----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF

XP_022983138.1 uncharacterized protein LOC111481779 [Cucurbita maxima]; e value: 2.5e-49; % identity: 69.41
Query:  VSQISSLQVLASSSSSSLRLVKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY
        +S  SSLQ+  SSSS        KA+LQTLILS+ARAISRAKTTALHILKQANHQS+IA KRNK KLL+GSFRLHYNWCS SSN   HV P  LTWD   
Subjt:  VSQISSLQVLASSSSSSLRLVKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY

Query:  SGAADQLGGYLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARS
        SGAAD L GYLQWLE+RD       K+E+   H NEIDKLA+IFIA CHEKF+LEKQESYR+FQ+M ARS
Subjt:  SGAADQLGGYLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARS

XP_031739057.1 uncharacterized protein LOC116402823 [Cucumis sativus]; e value: 1.5e-54; % identity: 72.47
Query:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY
        S  SSLQVL S SSS+LRL +KFKA+LQTLILS+ARAISRAKTTA          ++ ALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLT DH  
Subjt:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY

Query:  ---SGAADQLGGYLQWLEERD----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF
            G  DQLGGYLQWLEERD    +++N  ++D+ +   VNEIDKLAEIFIA CHEKFKLEKQESYRRFQDMMARSF
Subjt:  ---SGAADQLGGYLQWLEERD----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF

XP_038906153.1 uncharacterized protein LOC120092033 [Benincasa hispida]; e value: 5.0e-58; % identity: 78.61
Query:  SQISSLQVLASSS--SSSLRLVKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKY-SHVTPAVLTWDH
        S  SSLQ+L SSS  S S  LVKFKAVLQTLILS+ARAISRAKTTA HILKQANHQ +IALKRNKKKLLYGSFRLHYNWCSVSSN Y SHVTP V+TWDH
Subjt:  SQISSLQVLASSS--SSSLRLVKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKY-SHVTPAVLTWDH

Query:  EYS----GAADQLGGYLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMM
        EYS    G  DQLGGYL+WLEER+N NNKI  +E     VNEIDKLAEIFIA  HEKFKLEKQESYRRFQDM+
Subjt:  EYS----GAADQLGGYLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LBV5 Uncharacterized protein; e value: 7.3e-55; % identity: 72.47
Query:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY
        S  SSLQVL S SSS+LRL +KFKA+LQTLILS+ARAISRAKTTA          ++ ALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLT DH  
Subjt:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY

Query:  ---SGAADQLGGYLQWLEERD----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF
            G  DQLGGYLQWLEERD    +++N  ++D+ +   VNEIDKLAEIFIA CHEKFKLEKQESYRRFQDMMARSF
Subjt:  ---SGAADQLGGYLQWLEERD----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF

A0A1S3B8H1 uncharacterized protein LOC103487158; e value: 2.5e-55; % identity: 74.16
Query:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY
        S  SSLQVL S SSS+LRL +KFKA+LQTLI S+ARAISRAKTTA          ++IALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLT+DH  
Subjt:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY

Query:  SGAA--DQLGGYLQWLEERD-----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF
           A  DQLGGYLQWLEERD     N+NN     ED+ + VNEIDKLAEIFIA CHEKFKLEKQESYRRFQDMMARSF
Subjt:  SGAA--DQLGGYLQWLEERD-----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF

A0A5A7TJT8 Uncharacterized protein; e value: 2.5e-55; % identity: 74.16
Query:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY
        S  SSLQVL S SSS+LRL +KFKA+LQTLI S+ARAISRAKTTA          ++IALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLT+DH  
Subjt:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY

Query:  SGAA--DQLGGYLQWLEERD-----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF
           A  DQLGGYLQWLEERD     N+NN     ED+ + VNEIDKLAEIFIA CHEKFKLEKQESYRRFQDMMARSF
Subjt:  SGAA--DQLGGYLQWLEERD-----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF

A0A5D3DQ67 Uncharacterized protein; e value: 4.6e-49; % identity: 72.46
Query:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY
        S  SSLQVL S SSS+LRL +KFKA+LQTLI S+ARAISRAKTTA          ++IALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLT+DH  
Subjt:  SQISSLQVLASSSSSSLRL-VKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY

Query:  SGAA--DQLGGYLQWLEERD-----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESY
           A  DQLGGYLQWLEERD     N+NN     ED+ + VNEIDKLAEIFIA CHEKFKLEKQESY
Subjt:  SGAA--DQLGGYLQWLEERD-----NSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESY

A0A6J1J6X1 uncharacterized protein LOC111481779; e value: 1.2e-49; % identity: 69.41
Query:  VSQISSLQVLASSSSSSLRLVKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY
        +S  SSLQ+  SSSS        KA+LQTLILS+ARAISRAKTTALHILKQANHQS+IA KRNK KLL+GSFRLHYNWCS SSN   HV P  LTWD   
Subjt:  VSQISSLQVLASSSSSSLRLVKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEY

Query:  SGAADQLGGYLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARS
        SGAAD L GYLQWLE+RD       K+E+   H NEIDKLA+IFIA CHEKF+LEKQESYR+FQ+M ARS
Subjt:  SGAADQLGGYLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G42180.1 unknown protein; e value: 1.8e-13; % identity: 36.11
Query:  LQVLASSSSSSLRLVK--FKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKY------SHVT-----PAV
        +Q+  SSSSS    +K  F  ++   +  + R++SRA++  + I            K NKK+L    F + +     S N++      SHV      P  
Subjt:  LQVLASSSSSSLRLVK--FKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKY------SHVT-----PAV

Query:  LTWDHEYSGAADQLGGYLQWLEERDNSNNKIIKDEDQADH---VNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARS
         + D       +    YLQWLEER + NN I  D+   +     ++ID+LA+ FIA CHEKF LEK ESYRRFQDM+ARS
Subjt:  LTWDHEYSGAADQLGGYLQWLEERDNSNNKIIKDEDQADH---VNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARS

AT3G57950.1 unknown protein; e value: 1.4e-21; % identity: 39.55
Query:  ASSSSSSLRLVKFKAVLQTL----ILSVARAISRAKTTALHILKQANHQSSIAL--------KRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPA--VLTW
        +SSSSSS   +K K ++Q L    +    RA+++AK+  L I K  ++     L         +N++K+ +GSFRLHYNWCS      SHV P      +
Subjt:  ASSSSSSLRLVKFKAVLQTL----ILSVARAISRAKTTALHILKQANHQSSIAL--------KRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPA--VLTW

Query:  DHEYSGAAD----QLGGYLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMAR
           Y    +    QL GYL+WLE +   + + I D       ++ID LA++FIA CHEKF LEK ESYRRFQ+M+ R
Subjt:  DHEYSGAAD----QLGGYLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMAR

AT5G06790.1 unknown protein; e value: 3.2e-18; % identity: 37.14
Query:  SQISSLQVLASSSSSSLRLVKFKAVLQTLILS----VARAISRAKTTALHIL--KQANHQSSIAL-------KRNKKKLLYGSFRLHYNWCSVSSNKYSH
        S +SS Q  +SSS +S  + K K+++QTLI+S    + R ISR  +  + +L  KQ N  S  +L       K+ K  +L+GSFRLHYN+CS      SH
Subjt:  SQISSLQVLASSSSSSLRLVKFKAVLQTLILS----VARAISRAKTTALHIL--KQANHQSSIAL-------KRNKKKLLYGSFRLHYNWCSVSSNKYSH

Query:  VTPAVL-----------------TWDHEYSGAA-----------DQLGGYLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESY
        V P                    TW+  YS  +            QL  YL+ LE++        ++E+    +NEIDKLA+ FIA CHEKF LEK +SY
Subjt:  VTPAVL-----------------TWDHEYSGAA-----------DQLGGYLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESY

Query:  RRFQDMMARS
        RR Q  + RS
Subjt:  RRFQDMMARS


Sequences
CDS sequence
ATGGTTTCCCAAATTTCTTCACTCCAAGTTTTGGCTTCATCTTCATCTTCAAGCTTAAGACTGGTGAAATTCAAAGCTGTTTTGCAGACTCTCATTCTTTCTGTGGCTAG
AGCTATCTCCCGAGCCAAAACGACGGCGCTTCACATCTTAAAACAAGCCAATCATCAATCCTCTATAGCTTTGAAGAGGAACAAAAAGAAGCTTCTTTATGGCTCCTTCA
GACTCCACTACAATTGGTGCTCTGTTTCTTCTAATAAATATTCTCACGTGACTCCCGCAGTGCTCACGTGGGACCATGAATACTCCGGCGCCGCCGACCAGCTTGGTGGG
TATTTGCAGTGGCTGGAGGAGAGAGATAATAGTAATAATAAGATTATTAAAGATGAAGATCAGGCTGATCATGTTAATGAGATTGATAAATTGGCTGAGATTTTTATTGC
AACATGTCATGAGAAATTCAAGCTTGAAAAACAAGAGTCTTACAGGAGATTTCAAGACATGATGGCCAGAAGCTTTTGA
mRNA sequence
ATGGTTTCCCAAATTTCTTCACTCCAAGTTTTGGCTTCATCTTCATCTTCAAGCTTAAGACTGGTGAAATTCAAAGCTGTTTTGCAGACTCTCATTCTTTCTGTGGCTAG
AGCTATCTCCCGAGCCAAAACGACGGCGCTTCACATCTTAAAACAAGCCAATCATCAATCCTCTATAGCTTTGAAGAGGAACAAAAAGAAGCTTCTTTATGGCTCCTTCA
GACTCCACTACAATTGGTGCTCTGTTTCTTCTAATAAATATTCTCACGTGACTCCCGCAGTGCTCACGTGGGACCATGAATACTCCGGCGCCGCCGACCAGCTTGGTGGG
TATTTGCAGTGGCTGGAGGAGAGAGATAATAGTAATAATAAGATTATTAAAGATGAAGATCAGGCTGATCATGTTAATGAGATTGATAAATTGGCTGAGATTTTTATTGC
AACATGTCATGAGAAATTCAAGCTTGAAAAACAAGAGTCTTACAGGAGATTTCAAGACATGATGGCCAGAAGCTTTTGA
Protein sequence
MVSQISSLQVLASSSSSSLRLVKFKAVLQTLILSVARAISRAKTTALHILKQANHQSSIALKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTWDHEYSGAADQLGG
YLQWLEERDNSNNKIIKDEDQADHVNEIDKLAEIFIATCHEKFKLEKQESYRRFQDMMARSF
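
The CDS and protein sequences listed here can be cross-checked by translation. The following is a minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, and for brevity only the first 30 and last 24 nucleotides of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGTTTCCCAAATTTCTTCACTCCAAGTT"  # first 30 nt of the CDS
cds_end = "GACATGATGGCCAGAAGCTTTTGA"          # last 24 nt of the CDS (in frame)
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MVSQISSLQV` and `DMMARSF`, matching the start and end of the protein sequence above. Passing the full CDS to the same function would check the whole record.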