; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012235 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012235
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionMicrospore-specific promoter 2, putative
Genome locationChr01:19191430..19193979
RNA-Seq ExpressionHG10012235
SyntenyHG10012235
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011652652.1 uncharacterized protein LOC105435041 isoform X2 [Cucumis sativus]1.6e-7179.07Show/hide
Query:  MEEDDNAKSFHHPQH---SSSSMTLFSRLDHLDFVMMDLEEKKQRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSG
        ME+DDN K F HPQ    SSSSMTLFSRLDHLDFVM +L EKKQRLER FG SNLEEGM  R IS+DVALKDTY KGSLLDRVAALE+RL QLCLEM+SG
Subjt:  MEEDDNAKSFHHPQH---SSSSMTLFSRLDHLDFVMMDLEEKKQRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSG

Query:  SSSNPSSL-TSSQT--SVEISSSSSPKQFCRGQ-PSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQ-VVGKTRTEKDEVGSCKNVKKG-
        SSSNPSSL TSSQT  S+EI+SSSSPK+FC GQ  SSSYPTFHYPN+G TSQ+SQ  QEKPQRQQQKKKQQS  KGQ VVGKT TEKDEVGSCKNVKKG 
Subjt:  SSSNPSSL-TSSQT--SVEISSSSSPKQFCRGQ-PSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQ-VVGKTRTEKDEVGSCKNVKKG-

Query:  -IPCLKWPHLRMFGC
             KWPHLRMFGC
Subjt:  -IPCLKWPHLRMFGC

XP_022159710.1 uncharacterized protein LOC111026052 [Momordica charantia]8.0e-7176.85Show/hide
Query:  DDNAKSFHHPQH-SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPS
        +D+AK  H PQH SSSSMTL SRLDHLDFVM  LE K             + +ERRC+SLDVALKDTYFKGSLLDRVAALE+RLFQLCL+MDSGSS+NPS
Subjt:  DDNAKSFHHPQH-SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPS

Query:  SLTSSQTSVEISSSSSPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQVVGKTRT-EKDEVGSCKNVKKGIPCLKWPHLRM
        S TS++ SVEISSSSSPKQFCRG+PSSSYPTFHYP+HGGTSQISQV QEKPQR QQKKKQ SPSKGQ +GKTR+  KDE GSCKNVKKGIP  KWPHLRM
Subjt:  SLTSSQTSVEISSSSSPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQVVGKTRT-EKDEVGSCKNVKKGIPCLKWPHLRM

Query:  FGC
        FGC
Subjt:  FGC

XP_022996286.1 uncharacterized protein LOC111491560 [Cucurbita maxima]4.5e-6677.39Show/hide
Query:  HHPQH--SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPSSLTSSQ
        HHPQH  SSSSMTLFSRLD++D VM  L EKKQRLER G      G+E++CISLDVALKDTYFKGSLLDRVA+LE+RLFQLCLEMDSGSSSNPSSL S+Q
Subjt:  HHPQH--SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPSSLTSSQ

Query:  TSVEISSSS-SPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKP--QRQQQKKKQQSPSKGQVVGKTRTEKDEVGSCKNVKKGIPCLKWPHLRMFGC
        TS +ISSSS +PKQFCRG+PSSSYP FHYP+HGGTS+ISQ IQEKP  QR +QKKKQQSP K Q +GKTR+ KDE GSCKNVKKGIP  KW HLRMFGC
Subjt:  TSVEISSSS-SPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKP--QRQQQKKKQQSPSKGQVVGKTRTEKDEVGSCKNVKKGIPCLKWPHLRMFGC

XP_031737938.1 uncharacterized protein LOC105435041 isoform X1 [Cucumis sativus]5.2e-7078.34Show/hide
Query:  MEEDDNAKSFHHPQH---SSSSMTLFSRLDHLDFVMMDLEEKKQRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRL--FQLCLEMD
        ME+DDN K F HPQ    SSSSMTLFSRLDHLDFVM +L EKKQRLER FG SNLEEGM  R IS+DVALKDTY KGSLLDRVAALE+RL   QLCLEM+
Subjt:  MEEDDNAKSFHHPQH---SSSSMTLFSRLDHLDFVMMDLEEKKQRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRL--FQLCLEMD

Query:  SGSSSNPSSL-TSSQT--SVEISSSSSPKQFCRGQ-PSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQ-VVGKTRTEKDEVGSCKNVKK
        SGSSSNPSSL TSSQT  S+EI+SSSSPK+FC GQ  SSSYPTFHYPN+G TSQ+SQ  QEKPQRQQQKKKQQS  KGQ VVGKT TEKDEVGSCKNVKK
Subjt:  SGSSSNPSSL-TSSQT--SVEISSSSSPKQFCRGQ-PSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQ-VVGKTRTEKDEVGSCKNVKK

Query:  G--IPCLKWPHLRMFGC
        G      KWPHLRMFGC
Subjt:  G--IPCLKWPHLRMFGC

XP_038888525.1 uncharacterized protein LOC120078343 [Benincasa hispida]1.2e-9089.37Show/hide
Query:  MEEDDNAKSFHHPQH---SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGS
        ME+D+ AK  HHPQH   SSSSMTLFSRLDHLDFVM DL EKKQRL RFGGSNLEEGME+RCISLDVALKDTYFKGSLLDRVA LENRLFQLCLEMDSG+
Subjt:  MEEDDNAKSFHHPQH---SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGS

Query:  SSNPSSLTSSQTSVEISSSSSPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQVVGKTRTEKDEVGSCKNVKKGIPCLKWP
        SSNPSSLTSSQTSVEI SSSSPKQFCRGQPSSSYPTFHYPNHG TSQISQ IQEKPQRQQQKK Q+S SKGQ VGKTRTEKDEVGSCKNVKKGIP LKWP
Subjt:  SSNPSSLTSSQTSVEISSSSSPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQVVGKTRTEKDEVGSCKNVKKGIPCLKWP

Query:  HLRMFGC
        HLRMFGC
Subjt:  HLRMFGC

TrEMBL top hitse value%identityAlignment
A0A0A0LEC6 Uncharacterized protein7.8e-7279.07Show/hide
Query:  MEEDDNAKSFHHPQH---SSSSMTLFSRLDHLDFVMMDLEEKKQRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSG
        ME+DDN K F HPQ    SSSSMTLFSRLDHLDFVM +L EKKQRLER FG SNLEEGM  R IS+DVALKDTY KGSLLDRVAALE+RL QLCLEM+SG
Subjt:  MEEDDNAKSFHHPQH---SSSSMTLFSRLDHLDFVMMDLEEKKQRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSG

Query:  SSSNPSSL-TSSQT--SVEISSSSSPKQFCRGQ-PSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQ-VVGKTRTEKDEVGSCKNVKKG-
        SSSNPSSL TSSQT  S+EI+SSSSPK+FC GQ  SSSYPTFHYPN+G TSQ+SQ  QEKPQRQQQKKKQQS  KGQ VVGKT TEKDEVGSCKNVKKG 
Subjt:  SSSNPSSL-TSSQT--SVEISSSSSPKQFCRGQ-PSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQ-VVGKTRTEKDEVGSCKNVKKG-

Query:  -IPCLKWPHLRMFGC
             KWPHLRMFGC
Subjt:  -IPCLKWPHLRMFGC

A0A5D3E4W5 Uncharacterized protein1.2e-4880.13Show/hide
Query:  MERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDS-GSSSNPSSLT-SSQTSVE-ISSSSSPKQFCRGQPS--SSYPTFHYPNHGGTSQISQVI
        M  R IS+DVALKDTYFKGSLLDRVAALE+RL QLCLEM+S GSSSNPSSLT SSQTS+E I+SSSSPK FC GQ S  SSYPTFHYPN+G TSQISQ  
Subjt:  MERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDS-GSSSNPSSLT-SSQTSVE-ISSSSSPKQFCRGQPS--SSYPTFHYPNHGGTSQISQVI

Query:  QEKPQRQQQKKKQQSPSKGQ-VVGKTRTEKD-EVGSCKNVKKGIPCLKWPHLRMFG
        QEKPQRQQQKKKQQS  KGQ VV KT TEKD EVGSCKNVKKG P  KWPHLR+FG
Subjt:  QEKPQRQQQKKKQQSPSKGQ-VVGKTRTEKD-EVGSCKNVKKGIPCLKWPHLRMFG

A0A6J1DZI5 uncharacterized protein LOC1110260523.9e-7176.85Show/hide
Query:  DDNAKSFHHPQH-SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPS
        +D+AK  H PQH SSSSMTL SRLDHLDFVM  LE K             + +ERRC+SLDVALKDTYFKGSLLDRVAALE+RLFQLCL+MDSGSS+NPS
Subjt:  DDNAKSFHHPQH-SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPS

Query:  SLTSSQTSVEISSSSSPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQVVGKTRT-EKDEVGSCKNVKKGIPCLKWPHLRM
        S TS++ SVEISSSSSPKQFCRG+PSSSYPTFHYP+HGGTSQISQV QEKPQR QQKKKQ SPSKGQ +GKTR+  KDE GSCKNVKKGIP  KWPHLRM
Subjt:  SLTSSQTSVEISSSSSPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQVVGKTRT-EKDEVGSCKNVKKGIPCLKWPHLRM

Query:  FGC
        FGC
Subjt:  FGC

A0A6J1ES90 uncharacterized protein LOC1114372641.9e-6575.36Show/hide
Query:  DDNAKSF-HHPQH--SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSN
        +D AK   HHPQH  SSSSMTLFSRLD++D VM  L EKKQRLER G      G+E++CISLDVALKDTYFKGSLLDRVA+LE+RLFQLCLEMDSGSSSN
Subjt:  DDNAKSF-HHPQH--SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSN

Query:  PSSLTSSQTSVEISSSS-SPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEK--PQRQQQKKKQQSPSKGQVVGKTRTEKDEVGSCKNVKKGIPCLKWP
        PSSL S+QTS +ISSSS +PKQFCRG+PSSSYP FHYP+HGGTS+ISQ IQEK   QR +QKKKQQSP K Q +GKTR+ KDE GSCKNVKKGIP  KW 
Subjt:  PSSLTSSQTSVEISSSS-SPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEK--PQRQQQKKKQQSPSKGQVVGKTRTEKDEVGSCKNVKKGIPCLKWP

Query:  HLRMFGC
        HLRMFGC
Subjt:  HLRMFGC

A0A6J1K4A8 uncharacterized protein LOC1114915602.2e-6677.39Show/hide
Query:  HHPQH--SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPSSLTSSQ
        HHPQH  SSSSMTLFSRLD++D VM  L EKKQRLER G      G+E++CISLDVALKDTYFKGSLLDRVA+LE+RLFQLCLEMDSGSSSNPSSL S+Q
Subjt:  HHPQH--SSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPSSLTSSQ

Query:  TSVEISSSS-SPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKP--QRQQQKKKQQSPSKGQVVGKTRTEKDEVGSCKNVKKGIPCLKWPHLRMFGC
        TS +ISSSS +PKQFCRG+PSSSYP FHYP+HGGTS+ISQ IQEKP  QR +QKKKQQSP K Q +GKTR+ KDE GSCKNVKKGIP  KW HLRMFGC
Subjt:  TSVEISSSS-SPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKP--QRQQQKKKQQSPSKGQVVGKTRTEKDEVGSCKNVKKGIPCLKWPHLRMFGC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G46795.1 microspore-specific promoter 24.6e-1633.99Show/hide
Query:  KSFHHPQHSSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPSSLTSS
        ++    +HS SS+ + SRL+HLDFV+ +L E++Q L ++   +      R  I    A+++ YFKGSLLDR+AALE RLFQ+CLE++S S S+    TS+
Subjt:  KSFHHPQHSSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPSSLTSS

Query:  QTSVEISSSSSPKQFCRGQP--SSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQVVGK--TRTEKDEVG-SCKNVKKGIPC-LKWPHLRM
          S E SS    +   +  P  SS+   FH P      Q  Q ++E  ++ +++K+++   +  ++ K   +T+K++   +CK  KK      KW    +
Subjt:  QTSVEISSSSSPKQFCRGQP--SSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQVVGK--TRTEKDEVG-SCKNVKKGIPC-LKWPHLRM

Query:  FGC
         GC
Subjt:  FGC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAGATGATAATGCAAAGTCATTCCATCATCCACAACATTCCTCTTCCTCAATGACCCTTTTTTCTAGATTGGATCATTTGGATTTTGTTATGATGGATTTGGA
GGAGAAAAAACAGAGATTGGAAAGATTTGGAGGAAGTAATTTAGAAGAAGGAATGGAAAGAAGATGCATATCATTGGATGTTGCACTAAAAGATACTTACTTTAAAGGTT
CATTGTTGGATCGAGTGGCAGCTCTTGAGAATAGACTTTTTCAGCTATGTTTGGAGATGGATTCAGGGAGCTCCTCAAATCCTTCATCATTAACTTCATCACAAACATCA
GTAGAGATTAGTTCTTCTTCTTCTCCAAAACAATTTTGCAGAGGACAACCATCTTCTTCATACCCCACATTCCACTATCCCAACCATGGAGGAACCTCACAAATTTCTCA
AGTTATTCAGGAGAAGCCTCAAAGACAACAACAGAAAAAGAAGCAACAGAGTCCTTCAAAAGGCCAAGTAGTTGGTAAGACAAGGACTGAAAAAGATGAAGTGGGATCTT
GCAAAAATGTGAAGAAAGGGATTCCTTGTCTCAAATGGCCACACTTGAGAATGTTTGGTTGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAGATGATAATGCAAAGTCATTCCATCATCCACAACATTCCTCTTCCTCAATGACCCTTTTTTCTAGATTGGATCATTTGGATTTTGTTATGATGGATTTGGA
GGAGAAAAAACAGAGATTGGAAAGATTTGGAGGAAGTAATTTAGAAGAAGGAATGGAAAGAAGATGCATATCATTGGATGTTGCACTAAAAGATACTTACTTTAAAGGTT
CATTGTTGGATCGAGTGGCAGCTCTTGAGAATAGACTTTTTCAGCTATGTTTGGAGATGGATTCAGGGAGCTCCTCAAATCCTTCATCATTAACTTCATCACAAACATCA
GTAGAGATTAGTTCTTCTTCTTCTCCAAAACAATTTTGCAGAGGACAACCATCTTCTTCATACCCCACATTCCACTATCCCAACCATGGAGGAACCTCACAAATTTCTCA
AGTTATTCAGGAGAAGCCTCAAAGACAACAACAGAAAAAGAAGCAACAGAGTCCTTCAAAAGGCCAAGTAGTTGGTAAGACAAGGACTGAAAAAGATGAAGTGGGATCTT
GCAAAAATGTGAAGAAAGGGATTCCTTGTCTCAAATGGCCACACTTGAGAATGTTTGGTTGCTAA
Protein sequenceShow/hide protein sequence
MEEDDNAKSFHHPQHSSSSMTLFSRLDHLDFVMMDLEEKKQRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNPSSLTSSQTS
VEISSSSSPKQFCRGQPSSSYPTFHYPNHGGTSQISQVIQEKPQRQQQKKKQQSPSKGQVVGKTRTEKDEVGSCKNVKKGIPCLKWPHLRMFGC