CuGenDBv2

HG10020017 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10020017
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pollen preferential protein
Genome location: Chr04:27989555..27990034
RNA-Seq Expression: HG10020017
Synteny: HG10020017
Gene Ontology terms: NA
InterPro domains: NA
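The genome location above can be cross-checked against the sequences listed further down this page. The following is a minimal sketch, assuming the coordinates are 1-based with both ends included (the page does not state the convention); the helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string like 'Chr04:27989555..27990034' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: 1-based coordinates, both ends inclusive.
    return end - start + 1


chrom, start, end = parse_location("Chr04:27989555..27990034")
length = span_length(start, end)

# The protein sequence on this page has 159 residues; adding the stop codon
# gives 160 codons, which should match the span if the gene has no introns.
protein_length = 159
print(chrom, length, length == 3 * (protein_length + 1))  # Chr04 480 True
```

Under that assumption the genomic span is 480 bp, the same length as the CDS and mRNA sequences listed below, which is what one would expect for a gene without introns or annotated UTRs.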


Homology
GenBank top hits (e value, % identity, alignment)
KAA0043783.1 uncharacterized protein E6C27_scaffold236G001150 [Cucumis melo var. makuwa] (e value: 2.0e-69; % identity: 85.53)
Query:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE
        MSN+IMLRPPSSNRRQPLL SKSASG VRFAEVAGGT AECAAVCCCCPC VINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGV+PARR R+S GG DE
Subjt:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE

Query:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK
        TDIQILS GK++YS EP+GQ++EET+RKVMELEKEMWEIFYSTGFWRSPS+R+Q SI++
Subjt:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK

XP_011652052.1 uncharacterized protein LOC105434984 [Cucumis sativus] (e value: 2.0e-69; % identity: 86.79)
Query:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE
        MSN+IM RPPSSNRRQPLLTSKSASG VRFAEVAGGT AECAAVCCCCPC VINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGV PARR RFS GG DE
Subjt:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE

Query:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK
        TDIQILS GK VYS EP+GQ++ ET+RKVMELEKEMWEIFYSTGFWRSPS+RDQ SI++
Subjt:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK

XP_016899649.1 PREDICTED: uncharacterized protein LOC103486710 [Cucumis melo] (e value: 3.1e-62; % identity: 80.5)
Query:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE
        MSN+IMLRPPSSNRRQPLL SKSASG VRFAEVAGGT AECAAVCCCCPC VINFLVLAIYKVPAGLCRRALRTKRRQRLKKK            GG DE
Subjt:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE

Query:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK
        TDIQILS GK++YS EP+GQ++EET+RKVMELEKEMWEIFYSTGFWRSPS+R+Q SI++
Subjt:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK

XP_023539516.1 uncharacterized protein LOC111800159 [Cucurbita pepo subsp. pepo] (e value: 1.8e-54; % identity: 73.08)
Query:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE
        MSN +M RPP S R QPLL SKS  G +RFAEVAGGT AECAAVCCCCPCFV++FLVLAIYKVPAGLCRRALRT+RR  L +KG LPARR R S    DE
Subjt:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE

Query:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS
         D+QIL+ GKT+ + E KG+KS+ET+RKVMELE EMWE FY TGFWRSPSQRDQI+
Subjt:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS

XP_038905288.1 uncharacterized protein LOC120091362 [Benincasa hispida] (e value: 6.6e-73; % identity: 90.38)
Query:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE
        MSN+IMLRPPS+NRRQPLLTSKS SG VRFAEVAGGT AECAAVCCCCPC V+NFLVLAIYKVPAGLCRRALRTKRRQRLKKKG +PARR RFSGGG DE
Subjt:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE

Query:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS
        TDIQILS+GKTVYS EPKGQKSEET+R+VMELEKEMWEIFYSTGFWRSPSQRDQIS
Subjt:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LGV0 Uncharacterized protein (e value: 9.7e-70; % identity: 86.79)
Query:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE
        MSN+IM RPPSSNRRQPLLTSKSASG VRFAEVAGGT AECAAVCCCCPC VINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGV PARR RFS GG DE
Subjt:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE

Query:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK
        TDIQILS GK VYS EP+GQ++ ET+RKVMELEKEMWEIFYSTGFWRSPS+RDQ SI++
Subjt:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK

A0A1S4DUJ2 uncharacterized protein LOC103486710 (e value: 1.5e-62; % identity: 80.5)
Query:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE
        MSN+IMLRPPSSNRRQPLL SKSASG VRFAEVAGGT AECAAVCCCCPC VINFLVLAIYKVPAGLCRRALRTKRRQRLKKK            GG DE
Subjt:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE

Query:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK
        TDIQILS GK++YS EP+GQ++EET+RKVMELEKEMWEIFYSTGFWRSPS+R+Q SI++
Subjt:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK

A0A5D3DQ13 Uncharacterized protein (e value: 9.7e-70; % identity: 85.53)
Query:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE
        MSN+IMLRPPSSNRRQPLL SKSASG VRFAEVAGGT AECAAVCCCCPC VINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGV+PARR R+S GG DE
Subjt:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE

Query:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK
        TDIQILS GK++YS EP+GQ++EET+RKVMELEKEMWEIFYSTGFWRSPS+R+Q SI++
Subjt:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK

A0A6J1F7R8 uncharacterized protein LOC111441632 (e value: 1.0e-50; % identity: 70.06)
Query:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRL-KKKGVLPARRVRFSGGGLD
        MSN++M RP  SN RQ L+ SKS SG VRFAEVAGGT AECAAV CCCPC  +NFL+LAIYKVPAGLCRRALRTK RQ + KKK + PAR  RF GG LD
Subjt:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRL-KKKGVLPARRVRFSGGGLD

Query:  ETDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS
        + DIQ++++ KTVY        SEE +RKV+ELEKEMW+IFYSTGFWRSPSQRDQIS
Subjt:  ETDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS

A0A6J1GVU4 uncharacterized protein LOC111457605 (e value: 1.1e-54; % identity: 72.44)
Query:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE
        MSN +M RPP S R QPLL SKS  G +RFAEVAGGT AECAAVCCCCPCFV++FLVLAIYKVPAGLCRRALRT+RR  L +KG LP+RR R S    DE
Subjt:  MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE

Query:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS
         D+QIL+ GK +++ E KG+KS+ET+RKVMELE EMWE FY TGFWRSPSQRDQIS
Subjt:  TDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G27180.1 unknown protein (e value: 3.7e-21; % identity: 40.24)
Query:  MSNSIMLRPP-------SSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRL--KKKGVLPARRV
        M+  ++L+ P       S+ R  P  T+  +  R +  EVAGG AAECAAV CCCPC V+N +VLA+YKVPA +C++A R  +R+R   K+ G+L +   
Subjt:  MSNSIMLRPP-------SSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRL--KKKGVLPARRV

Query:  RFS----GGGLDETDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS
          S       L+E D+      + V+  E      E  D  V+ LE EM + FY  GFWRSPSQ+D  S
Subjt:  RFS----GGGLDETDIQILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS

AT3G11690.1 unknown protein (e value: 1.9e-30; % identity: 42.46)
Query:  PPSSNRRQPLL-------TSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPA-------------
        P  SNRRQPLL       + +++ G    AE  GGT A CAAV CCCPC ++N LVLAIYKVP G+CRRA+R++RR++L K G+LP              
Subjt:  PPSSNRRQPLL-------TSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPA-------------

Query:  -RRVRFSGGGLDETDI----------QILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS
         +   F+   LD  D+           +  +GK+V +     ++++E D  V+ LEKEMW  FY  GFWRSPSQR+ +S
Subjt:  -RRVRFSGGGLDETDI----------QILSVGKTVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQIS

AT5G06380.1 unknown protein (e value: 2.1e-24; % identity: 45.58)
Query:  RQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDETDIQILSVGKTVYS
        R+ L T     G    AE  GGT A CAA+C C PC V+N +VLA+YK+P GLCRRA+R  RR+RL KK  + + R  F  GG  +           V+ 
Subjt:  RQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDETDIQILSVGKTVYS

Query:  LEPKG--QKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK
        LE +   ++ EE D  V+ LEKEMW  FYS GFWRS SQ +  S  K
Subjt:  LEPKG--QKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK

AT5G14690.1 unknown protein (e value: 7.0e-04; % identity: 41.79)
Query:  AGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE
        A    A+C A+ CCCPC +IN L L + KVP  + RR L    R + KK+ V+  R+ R +  G DE
Subjt:  AGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDE


Sequences
CDS sequence
ATGTCGAATTCGATTATGCTCCGTCCGCCATCGTCGAACCGTCGGCAACCATTGCTAACGAGCAAATCCGCTTCCGGAAGGGTCCGGTTCGCGGAGGTAGCTGGCGGTAC
GGCGGCGGAGTGTGCTGCCGTATGCTGTTGTTGTCCTTGTTTTGTTATTAATTTCCTTGTACTCGCCATTTACAAGGTTCCGGCTGGTCTCTGCCGCCGTGCTTTGAGGA
CGAAGCGCCGGCAGAGGTTGAAGAAGAAAGGAGTTCTTCCGGCGAGGCGTGTCCGGTTCTCCGGTGGGGGACTCGACGAGACGGATATTCAGATTCTTTCGGTGGGAAAA
ACGGTGTACTCGTTGGAACCGAAAGGACAGAAATCGGAGGAGACGGATAGGAAAGTGATGGAACTGGAAAAAGAGATGTGGGAGATTTTCTACAGTACTGGATTCTGGAG
AAGCCCTTCACAGAGAGATCAAATTTCGATCGCTAAATGA
mRNA sequence
ATGTCGAATTCGATTATGCTCCGTCCGCCATCGTCGAACCGTCGGCAACCATTGCTAACGAGCAAATCCGCTTCCGGAAGGGTCCGGTTCGCGGAGGTAGCTGGCGGTAC
GGCGGCGGAGTGTGCTGCCGTATGCTGTTGTTGTCCTTGTTTTGTTATTAATTTCCTTGTACTCGCCATTTACAAGGTTCCGGCTGGTCTCTGCCGCCGTGCTTTGAGGA
CGAAGCGCCGGCAGAGGTTGAAGAAGAAAGGAGTTCTTCCGGCGAGGCGTGTCCGGTTCTCCGGTGGGGGACTCGACGAGACGGATATTCAGATTCTTTCGGTGGGAAAA
ACGGTGTACTCGTTGGAACCGAAAGGACAGAAATCGGAGGAGACGGATAGGAAAGTGATGGAACTGGAAAAAGAGATGTGGGAGATTTTCTACAGTACTGGATTCTGGAG
AAGCCCTTCACAGAGAGATCAAATTTCGATCGCTAAATGA
Protein sequence
MSNSIMLRPPSSNRRQPLLTSKSASGRVRFAEVAGGTAAECAAVCCCCPCFVINFLVLAIYKVPAGLCRRALRTKRRQRLKKKGVLPARRVRFSGGGLDETDIQILSVGK
TVYSLEPKGQKSEETDRKVMELEKEMWEIFYSTGFWRSPSQRDQISIAK