; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014731 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014731
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionMacro domain-containing protein
Genome locationChr02:18806176..18810021
RNA-Seq ExpressionHG10014731
SyntenyHG10014731
Gene Ontology termsNA
InterPro domainsIPR002589 - Macro domain
IPR043472 - Macro domain-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600880.1 hypothetical protein SDJN03_06113, partial [Cucurbita argyrosperma subsp. sororia]7.6e-8094.87Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN SSNPQALLRSAYRNSLAVAKENNIQYIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN
        PAISCGVYRYP+DEAATIALST++EFS GLKEVHFVLY+SDIYNVWL KANELLKN
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN

KAG7031514.1 hypothetical protein SDJN02_05554 [Cucurbita argyrosperma subsp. argyrosperma]7.6e-8094.87Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN SSNPQALLRSAYRNSLAVAKENNIQYIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN
        PAISCGVYRYP+DEAATIALST++EFS GLKEVHFVLY+SDIYNVWL KANELLKN
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN

XP_008456945.1 PREDICTED: macro domain-containing protein VPA0103 isoform X2 [Cucumis melo]3.8e-7994.87Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPANEVMLGGGGADGAIHNAAGPDLV+ACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN S NPQALLRSAYRNSLAVAKENNIQYIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN
        PAISCGV+RYPYDEAATIALSTI+EFSQGLKEVHFVLYA DIYNVWL KANELLKN
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN

XP_022943121.1 uncharacterized protein LOC111447945 [Cucurbita moschata]7.6e-8094.87Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN SSNPQALLRSAYRNSLAVAKENNIQYIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN
        PAISCGVYRYP+DEAATIALST++EFS GLKEVHFVLY+SDIYNVWL KANELLKN
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN

XP_038891619.1 macro domain-containing protein VPA0103, partial [Benincasa hispida]1.3e-7994.23Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPAN+VMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN SSNPQALLRSAYRNSLAVAKENNIQYIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN
        PAISCGV+RYPYDEAAT+ALST++EFSQGLKEVHFVLYASDIYNVWL KAN+LLKN
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN

TrEMBL top hitse value%identityAlignment
A0A1S3C4C8 macro domain-containing protein VPA0103 isoform X21.8e-7994.87Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPANEVMLGGGGADGAIHNAAGPDLV+ACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN S NPQALLRSAYRNSLAVAKENNIQYIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN
        PAISCGV+RYPYDEAATIALSTI+EFSQGLKEVHFVLYA DIYNVWL KANELLKN
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN

A0A1S3C4G0 macro domain-containing protein VPA0103 isoform X11.3e-7791.93Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPANEVMLGGGGADGAIHNAAGPDLV+ACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN S NPQALLRSAYRNSLAVAKENNIQYIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKE-----VHFVLYASDIYNVWLAKANELLKN
        PAISCGV+RYPYDEAATIALSTI+EFSQGLKE     VHFVLYA DIYNVWL KANELLKN
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKE-----VHFVLYASDIYNVWLAKANELLKN

A0A6J1CYJ5 uncharacterized protein LOC1110159682.1e-7590.38Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPANEVMLGGGGADGAIHNAAGPDLVQACY+V EVQPGIRCPTGEARITPGF LPASHVIHTVGPIY  SSNPQALLRSAYRNSLAVAKENNIQYIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN
        PAISCGV+RYPYDEAATIA+STI+EFS+ LKEVHFVL++SDIY+VWL KANELLKN
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN

A0A6J1FQU9 uncharacterized protein LOC1114479453.7e-8094.87Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN SSNPQALLRSAYRNSLAVAKENNIQYIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN
        PAISCGVYRYP+DEAATIALST++EFS GLKEVHFVLY+SDIYNVWL KANELLKN
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN

A0A6J1JJ63 uncharacterized protein LOC1114849184.1e-7993.59Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN +SNPQALLRSAYRNSLAVAKENNIQYIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN
        PAISCGVYRYP+DEAATIALST++EFS GLKEVHFVLY+SDIYNVWL  ANELLKN
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN

SwissProt top hitse value%identityAlignment
Q87JZ5 Macro domain-containing protein VPA01031.6e-3550Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VN AN  MLGGGG DGAIH AAGP L+ ACY+V +V  GIRCP G+ARIT    L A +VIH VGPIY+  ++P+ +L SAY+ SL +A  N+ Q +A 
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVW
        PAISCGVY YP  EAA +A++  +       ++ F L++ ++ ++W
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVW

Q8P5Z8 Macro domain-containing protein XCC31845.9e-3550Show/hide
Query:  WRGRSVDLFVTMILVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSS-NPQALLRSAYRNS
        W+G    L V  ++VN ANE +LGGGG DGAIH AAGP L++AC ++ EV+PG+RCPTGE RIT GF L A H+ HTVGP++     N    L + Y  S
Subjt:  WRGRSVDLFVTMILVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSS-NPQALLRSAYRNS

Query:  LAVAKENNIQYIAFPAISCGVYRYPYDEAATIALSTIREFSQGLK-EVHFVLYA
        L +A++  +  IAFPAISCG+Y YP  +AA IA++  R++ +  K   H VL A
Subjt:  LAVAKENNIQYIAFPAISCGVYRYPYDEAATIALSTIREFSQGLK-EVHFVLYA

Q8PHB6 Macro domain-containing protein XAC33435.9e-3549.35Show/hide
Query:  WRGRSVDLFVTMILVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTS-SNPQALLRSAYRNS
        W+G   +L V  ++VN ANE +LGGGG DGAIH AAGP L++AC ++ +V+PG+RCPTGE RIT GF L A H+ HTVGP++     N    L + Y  S
Subjt:  WRGRSVDLFVTMILVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTS-SNPQALLRSAYRNS

Query:  LAVAKENNIQYIAFPAISCGVYRYPYDEAATIALSTIREFSQGLK-EVHFVLYA
        L +A++  +  IAFPAISCG+Y YP  +AA IA++  R++ +  K   H VL A
Subjt:  LAVAKENNIQYIAFPAISCGVYRYPYDEAATIALSTIREFSQGLK-EVHFVLYA

Q8Y2K1 Macro domain-containing protein RSc03342.7e-3247.06Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQ-ALLRSAYRNSLAVAKENNIQYIA
        +VN AN  +LGGGG DGAIH AAGP+L++AC ++        C TG+A+ITPGF LPA ++IHTVGPI+      + ALL + YRNSLA+AK+++++ IA
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQ-ALLRSAYRNSLAVAKENNIQYIA

Query:  FPAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANE
        FP IS GVY +P   AA IA+ T+RE    L ++ F  +++    ++    NE
Subjt:  FPAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANE

Q9HXU7 Macro domain-containing protein PA36933.6e-3249.06Show/hide
Query:  WRGRSVDLFVTMILVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQA-LLRSAYRNS
        W+G    L V  I VN AN  +LGGGG DGAIH AAG +LV AC  +        C TGEA+IT GFRLPA+HVIHTVGP++    N +A LL S YR S
Subjt:  WRGRSVDLFVTMILVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQA-LLRSAYRNS

Query:  LAVAKENNIQYIAFPAISCGVYRYPYDEAATIALSTI---REFSQGLKEVHFVLYASDI
        LA+A++     +AFPAISCG+Y YP ++AA IA+  +   R     L+E+  V + S +
Subjt:  LAVAKENNIQYIAFPAISCGVYRYPYDEAATIALSTI---REFSQGLKEVHFVLYASDI

Arabidopsis top hitse value%identityAlignment
AT1G69340.1 appr-1-p processing enzyme family protein2.0e-1434Show/hide
Query:  MFPWRGRSVDLFVTMILVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQA--LLRSA
        ++ WRG   +L V  + VN  NE +     + G +H AAGP L + C ++        C TG A++T  + LPA  VIHTVGP Y    +  A   L   
Subjt:  MFPWRGRSVDLFVTMILVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQA--LLRSA

Query:  YRNSLAVAKENNIQYIAFPAISCGVYRYPYDEAATIALSTIREFSQGLKE
        YR+ L +  ++ +Q IA   I      YP + AA +A+ T+R F +  K+
Subjt:  YRNSLAVAKENNIQYIAFPAISCGVYRYPYDEAATIALSTIREFSQGLKE

AT2G40600.1 appr-1-p processing enzyme family protein4.0e-6371.61Show/hide
Query:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF
        +VNPANE MLGGGGADGAIH AAGP L  ACY V EV+PG+RCPTGEARITPGF LPAS VIHTVGPIY++  NPQ  L ++Y+NSL VAKENNI+YIAF
Subjt:  LVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAF

Query:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLK
        PAISCG+Y YP+DEAA I +STI++FS   KEVHFVL+A DI++VW+ KA E+L+
Subjt:  PAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGTGAGGGACGCTGGAGAGAATGTTGGGAAAGGGATTGAGACTTCTCATAGAGAGGAGGAAGGTTGTATTGCCTTAGGGCAAGGAGATACCTTACATATTGGTTT
TTGGAGGGGGCTAGGTTTAATGTTTCCTTGGAGGGGTCGATCTGTAGACCTTTTTGTAACTATGATCTTGGTTAATCCAGCAAATGAGGTAATGCTTGGAGGTGGTGGTG
CTGATGGAGCCATACATAATGCTGCTGGGCCAGATCTCGTACAAGCATGTTATTCTGTCCAAGAAGTCCAACCTGGAATCCGTTGTCCAACTGGAGAAGCAAGGATTACA
CCAGGTTTTCGGTTGCCAGCGTCTCATGTAATCCATACTGTTGGACCTATCTACAACACCAGTAGTAACCCCCAGGCCCTACTGAGAAGCGCATATAGAAATTCCTTGGC
TGTGGCAAAGGAGAATAACATTCAATATATTGCATTTCCTGCCATATCCTGTGGTGTATATCGATATCCTTATGATGAAGCTGCCACAATAGCCTTATCTACCATTAGAG
AGTTTTCCCAGGGCTTGAAAGAAGTGCACTTTGTCCTTTATGCTTCTGATATTTACAATGTTTGGTTGGCCAAAGCAAATGAACTGCTCAAGAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGTGAGGGACGCTGGAGAGAATGTTGGGAAAGGGATTGAGACTTCTCATAGAGAGGAGGAAGGTTGTATTGCCTTAGGGCAAGGAGATACCTTACATATTGGTTT
TTGGAGGGGGCTAGGTTTAATGTTTCCTTGGAGGGGTCGATCTGTAGACCTTTTTGTAACTATGATCTTGGTTAATCCAGCAAATGAGGTAATGCTTGGAGGTGGTGGTG
CTGATGGAGCCATACATAATGCTGCTGGGCCAGATCTCGTACAAGCATGTTATTCTGTCCAAGAAGTCCAACCTGGAATCCGTTGTCCAACTGGAGAAGCAAGGATTACA
CCAGGTTTTCGGTTGCCAGCGTCTCATGTAATCCATACTGTTGGACCTATCTACAACACCAGTAGTAACCCCCAGGCCCTACTGAGAAGCGCATATAGAAATTCCTTGGC
TGTGGCAAAGGAGAATAACATTCAATATATTGCATTTCCTGCCATATCCTGTGGTGTATATCGATATCCTTATGATGAAGCTGCCACAATAGCCTTATCTACCATTAGAG
AGTTTTCCCAGGGCTTGAAAGAAGTGCACTTTGTCCTTTATGCTTCTGATATTTACAATGTTTGGTTGGCCAAAGCAAATGAACTGCTCAAGAACTAG
Protein sequenceShow/hide protein sequence
MGVRDAGENVGKGIETSHREEEGCIALGQGDTLHIGFWRGLGLMFPWRGRSVDLFVTMILVNPANEVMLGGGGADGAIHNAAGPDLVQACYSVQEVQPGIRCPTGEARIT
PGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVYRYPYDEAATIALSTIREFSQGLKEVHFVLYASDIYNVWLAKANELLKN