; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G12810 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G12810
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionMicrospore-specific promoter 2, putative
Genome locationClcChr09:12178093..12180645
RNA-Seq ExpressionClc09G12810
SyntenyClc09G12810
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011652652.1 uncharacterized protein LOC105435041 isoform X2 [Cucumis sativus]3.5e-7178.67Show/hide
Query:  EDDNAKLFHHPQH---SSSSMTLFSRLDHLDFVMKDLEKKHRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSS
        +DDN KLF HPQ    SSSSMTLFSRLDHLDFVMK+LEKK RLER FG SNLEEGM  R IS+DVALKDTY KGSLLDRVAALE+RL QLCLEM+SGSSS
Subjt:  EDDNAKLFHHPQH---SSSSMTLFSRLDHLDFVMKDLEKKHRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSS

Query:  NLSSL-TSSQT--SVEISSSSSPKNFCRGQ-SSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQ-VVGKTRTEKEEVGSCKNVKKG--IPS
        N SSL TSSQT  S+EI+SSSSPK FC GQ SSSSYPTFHYPN+  TSQ+SQ QEKPQRQQ+KKKQQS  KGQ VVGKT TEK+EVGSCKNVKKG    +
Subjt:  NLSSL-TSSQT--SVEISSSSSPKNFCRGQ-SSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQ-VVGKTRTEKEEVGSCKNVKKG--IPS

Query:  LKWPHLRMFGC
         KWPHLRMFGC
Subjt:  LKWPHLRMFGC

XP_022159710.1 uncharacterized protein LOC111026052 [Momordica charantia]4.3e-6975.86Show/hide
Query:  MEDDNAKLFHHPQH-SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNL
        MEDD AKL H PQH SSSSMTL SRLDHLDFVMK LE+K            + +ERRC+SLDVALKDTYFKGSLLDRVAALE+RLFQLCL+MDSGSS+N 
Subjt:  MEDDNAKLFHHPQH-SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNL

Query:  SSLTSSQTSVEISSSSSPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQVVGKTRT-EKEEVGSCKNVKKGIPSLKWPHLRM
        SS TS++ SVEISSSSSPK FCRG+ SSSYPTFHYP+H GTSQISQ+QEKPQR Q+KKKQ SPSKGQ +GKTR+  K+E GSCKNVKKGIP  KWPHLRM
Subjt:  SSLTSSQTSVEISSSSSPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQVVGKTRT-EKEEVGSCKNVKKGIPSLKWPHLRM

Query:  FGC
        FGC
Subjt:  FGC

XP_023532991.1 uncharacterized protein LOC111795001 [Cucurbita pepo subsp. pepo]1.1e-6473.3Show/hide
Query:  MEDDNAKLFHHPQH--SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSN
        MED    L HHPQH  SSSSMTLFSRLD++D VMK LEKK RLER G      G+E++CISLDVALKDTYFKGSLLDRVA+LE+RLFQLCLEMDSGSSSN
Subjt:  MEDDNAKLFHHPQH--SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSN

Query:  LSSLTSSQTSVEISSSS-SPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKP--QRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPSLKWPH
         SSL S+QTS +ISSSS +PK FCRG+ SSSYP FHYP+H GTS+ISQ+QEKP  QR ++KKKQQSP K Q +GKTR+ K+E GSCKNVKKGIP  KW H
Subjt:  LSSLTSSQTSVEISSSS-SPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKP--QRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPSLKWPH

Query:  LRMFGC
        LRMFGC
Subjt:  LRMFGC

XP_031737938.1 uncharacterized protein LOC105435041 isoform X1 [Cucumis sativus]1.1e-6977.93Show/hide
Query:  EDDNAKLFHHPQH---SSSSMTLFSRLDHLDFVMKDLEKKHRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRL--FQLCLEMDSGS
        +DDN KLF HPQ    SSSSMTLFSRLDHLDFVMK+LEKK RLER FG SNLEEGM  R IS+DVALKDTY KGSLLDRVAALE+RL   QLCLEM+SGS
Subjt:  EDDNAKLFHHPQH---SSSSMTLFSRLDHLDFVMKDLEKKHRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRL--FQLCLEMDSGS

Query:  SSNLSSL-TSSQT--SVEISSSSSPKNFCRGQ-SSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQ-VVGKTRTEKEEVGSCKNVKKG--I
        SSN SSL TSSQT  S+EI+SSSSPK FC GQ SSSSYPTFHYPN+  TSQ+SQ QEKPQRQQ+KKKQQS  KGQ VVGKT TEK+EVGSCKNVKKG   
Subjt:  SSNLSSL-TSSQT--SVEISSSSSPKNFCRGQ-SSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQ-VVGKTRTEKEEVGSCKNVKKG--I

Query:  PSLKWPHLRMFGC
         + KWPHLRMFGC
Subjt:  PSLKWPHLRMFGC

XP_038888525.1 uncharacterized protein LOC120078343 [Benincasa hispida]1.4e-8887.68Show/hide
Query:  EDDNAKLFHHPQH---SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSN
        +D+ AKL HHPQH   SSSSMTLFSRLDHLDFVMKDLEKK RL RFGGSNLEEGME+RCISLDVALKDTYFKGSLLDRVA LENRLFQLCLEMDSG+SSN
Subjt:  EDDNAKLFHHPQH---SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSN

Query:  LSSLTSSQTSVEISSSSSPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPSLKWPHLRM
         SSLTSSQTSVEI SSSSPK FCRGQ SSSYPTFHYPNH  TSQISQ+QEKPQRQQ+KK Q+S SKGQ VGKTRTEK+EVGSCKNVKKGIPSLKWPHLRM
Subjt:  LSSLTSSQTSVEISSSSSPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPSLKWPHLRM

Query:  FGC
        FGC
Subjt:  FGC

TrEMBL top hitse value%identityAlignment
A0A0A0LEC6 Uncharacterized protein1.7e-7178.67Show/hide
Query:  EDDNAKLFHHPQH---SSSSMTLFSRLDHLDFVMKDLEKKHRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSS
        +DDN KLF HPQ    SSSSMTLFSRLDHLDFVMK+LEKK RLER FG SNLEEGM  R IS+DVALKDTY KGSLLDRVAALE+RL QLCLEM+SGSSS
Subjt:  EDDNAKLFHHPQH---SSSSMTLFSRLDHLDFVMKDLEKKHRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSS

Query:  NLSSL-TSSQT--SVEISSSSSPKNFCRGQ-SSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQ-VVGKTRTEKEEVGSCKNVKKG--IPS
        N SSL TSSQT  S+EI+SSSSPK FC GQ SSSSYPTFHYPN+  TSQ+SQ QEKPQRQQ+KKKQQS  KGQ VVGKT TEK+EVGSCKNVKKG    +
Subjt:  NLSSL-TSSQT--SVEISSSSSPKNFCRGQ-SSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQ-VVGKTRTEKEEVGSCKNVKKG--IPS

Query:  LKWPHLRMFGC
         KWPHLRMFGC
Subjt:  LKWPHLRMFGC

A0A1S3CRK1 uncharacterized protein LOC1035035419.1e-4980.77Show/hide
Query:  DDNAKLFHHPQH---SSSSMTLFSRLDHLDFVMKDLEKKHRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDS-GSSS
        DDN KLFH PQ    SSSSMTLFSRLDHLDFVMKDLEKK RLER FGGSNLEEGM  R IS+DVALKDTYFKGSLLDRVAALE+RL QLCLEM+S GSSS
Subjt:  DDNAKLFHHPQH---SSSSMTLFSRLDHLDFVMKDLEKKHRLER-FGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDS-GSSS

Query:  NLSSLT-SSQTSVE-ISSSSSPKNFCRGQSS--SSYPTFHYPNHEGTSQISQLQEK
        N SSLT SSQTS+E I+SSSSPK FC GQSS  SSYPTFHYPN+  TSQISQ Q K
Subjt:  NLSSLT-SSQTSVE-ISSSSSPKNFCRGQSS--SSYPTFHYPNHEGTSQISQLQEK

A0A6J1DZI5 uncharacterized protein LOC1110260522.1e-6975.86Show/hide
Query:  MEDDNAKLFHHPQH-SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNL
        MEDD AKL H PQH SSSSMTL SRLDHLDFVMK LE+K            + +ERRC+SLDVALKDTYFKGSLLDRVAALE+RLFQLCL+MDSGSS+N 
Subjt:  MEDDNAKLFHHPQH-SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNL

Query:  SSLTSSQTSVEISSSSSPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQVVGKTRT-EKEEVGSCKNVKKGIPSLKWPHLRM
        SS TS++ SVEISSSSSPK FCRG+ SSSYPTFHYP+H GTSQISQ+QEKPQR Q+KKKQ SPSKGQ +GKTR+  K+E GSCKNVKKGIP  KWPHLRM
Subjt:  SSLTSSQTSVEISSSSSPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQVVGKTRT-EKEEVGSCKNVKKGIPSLKWPHLRM

Query:  FGC
        FGC
Subjt:  FGC

A0A6J1ES90 uncharacterized protein LOC1114372644.5e-6472.82Show/hide
Query:  MEDDNAKLFHHPQH--SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSN
        MED    L HHPQH  SSSSMTLFSRLD++D VMK LEKK RLER G      G+E++CISLDVALKDTYFKGSLLDRVA+LE+RLFQLCLEMDSGSSSN
Subjt:  MEDDNAKLFHHPQH--SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSN

Query:  LSSLTSSQTSVEISSSS-SPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEK--PQRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPSLKWPH
         SSL S+QTS +ISSSS +PK FCRG+ SSSYP FHYP+H GTS+ISQ+QEK   QR ++KKKQQSP K Q +GKTR+ K+E GSCKNVKKGIP  KW H
Subjt:  LSSLTSSQTSVEISSSS-SPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEK--PQRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPSLKWPH

Query:  LRMFGC
        LRMFGC
Subjt:  LRMFGC

A0A6J1K4A8 uncharacterized protein LOC1114915601.5e-6473.66Show/hide
Query:  DDNAKLF-HHPQH--SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNL
        +D AKL  HHPQH  SSSSMTLFSRLD++D VMK LEKK RLER G      G+E++CISLDVALKDTYFKGSLLDRVA+LE+RLFQLCLEMDSGSSSN 
Subjt:  DDNAKLF-HHPQH--SSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNL

Query:  SSLTSSQTSVEISSSS-SPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKP--QRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPSLKWPHL
        SSL S+QTS +ISSSS +PK FCRG+ SSSYP FHYP+H GTS+ISQ+QEKP  QR ++KKKQQSP K Q +GKTR+ K+E GSCKNVKKGIP  KW HL
Subjt:  SSLTSSQTSVEISSSS-SPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKP--QRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPSLKWPHL

Query:  RMFGC
        RMFGC
Subjt:  RMFGC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G46795.1 microspore-specific promoter 29.1e-1734.21Show/hide
Query:  QHSSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNLSSLTSSQTSVEIS
        +HS SS+ + SRL+HLDFV+K+LE++  L ++   +      R  I    A+++ YFKGSLLDR+AALE RLFQ+CLE++S S S+ S+  S +TS +  
Subjt:  QHSSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNLSSLTSSQTSVEIS

Query:  SSSSPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPS-LKWPHLRMFGC
             K       SS+   FH P  +    + +++EK + ++E++        +   K   + +   +CK  KK   S  KW    + GC
Subjt:  SSSSPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPS-LKWPHLRMFGC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATGATAATGCAAAGTTATTCCATCATCCACAACATTCCTCTTCCTCAATGACCCTTTTTTCTAGATTGGATCATTTGGATTTTGTTATGAAGGACTTGGAGAA
AAAACATAGATTGGAAAGATTTGGAGGCAGTAATTTAGAAGAAGGAATGGAAAGAAGATGCATATCATTGGATGTTGCACTAAAAGATACTTACTTTAAAGGTTCATTGT
TGGATCGAGTGGCAGCTCTTGAGAATAGACTTTTTCAGCTATGTTTGGAGATGGATTCAGGCAGCTCCTCAAATCTTTCATCATTAACTTCATCACAAACATCAGTAGAG
ATTAGTTCTTCTTCTTCTCCAAAAAATTTTTGTAGAGGACAATCATCTTCTTCATATCCCACATTCCATTATCCCAACCATGAAGGAACTTCACAAATTTCTCAACTTCA
GGAGAAGCCTCAAAGACAACAAGAGAAAAAGAAGCAACAAAGTCCTTCAAAAGGGCAAGTAGTTGGTAAGACAAGGACTGAAAAGGAAGAAGTAGGATCTTGCAAAAATG
TGAAGAAAGGGATTCCTTCTCTCAAATGGCCACACTTGAGAATGTTCGGTTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGATGATAATGCAAAGTTATTCCATCATCCACAACATTCCTCTTCCTCAATGACCCTTTTTTCTAGATTGGATCATTTGGATTTTGTTATGAAGGACTTGGAGAA
AAAACATAGATTGGAAAGATTTGGAGGCAGTAATTTAGAAGAAGGAATGGAAAGAAGATGCATATCATTGGATGTTGCACTAAAAGATACTTACTTTAAAGGTTCATTGT
TGGATCGAGTGGCAGCTCTTGAGAATAGACTTTTTCAGCTATGTTTGGAGATGGATTCAGGCAGCTCCTCAAATCTTTCATCATTAACTTCATCACAAACATCAGTAGAG
ATTAGTTCTTCTTCTTCTCCAAAAAATTTTTGTAGAGGACAATCATCTTCTTCATATCCCACATTCCATTATCCCAACCATGAAGGAACTTCACAAATTTCTCAACTTCA
GGAGAAGCCTCAAAGACAACAAGAGAAAAAGAAGCAACAAAGTCCTTCAAAAGGGCAAGTAGTTGGTAAGACAAGGACTGAAAAGGAAGAAGTAGGATCTTGCAAAAATG
TGAAGAAAGGGATTCCTTCTCTCAAATGGCCACACTTGAGAATGTTCGGTTGTTAA
Protein sequenceShow/hide protein sequence
MEDDNAKLFHHPQHSSSSMTLFSRLDHLDFVMKDLEKKHRLERFGGSNLEEGMERRCISLDVALKDTYFKGSLLDRVAALENRLFQLCLEMDSGSSSNLSSLTSSQTSVE
ISSSSSPKNFCRGQSSSSYPTFHYPNHEGTSQISQLQEKPQRQQEKKKQQSPSKGQVVGKTRTEKEEVGSCKNVKKGIPSLKWPHLRMFGC