; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG02G012930 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG02G012930
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionDUF4228 domain-containing protein
Genome locationCG_Chr02:26686015..26686856
RNA-Seq ExpressionClCG02G012930
SyntenyClCG02G012930
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045798.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa]6.0e-3893.26Show/hide
Query:  MDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAIKASSALRTLQSPFF
        MDG IEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLS DENLQIGQIYFLLPLSLA SPLSLPDLCNFAIKASSALR + SPFF
Subjt:  MDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAIKASSALRTLQSPFF

KAE8651930.1 hypothetical protein Csa_006329 [Cucumis sativus]6.2e-5181.34Show/hide
Query:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQ-SNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPL
        MG CASTQ T +  NPK G++ PKQQQ QQQ  NLDSIK++HMDG IEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLS DENLQ+GQIYFLLPL
Subjt:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQ-SNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPL

Query:  SLAQSPLSLPDLCNFAIKASSALRTLQSPFFSKE
        SLA SPLSLPDLCNFAIKASSALR +QS  F  E
Subjt:  SLAQSPLSLPDLCNFAIKASSALRTLQSPFFSKE

KAG6571421.1 hypothetical protein SDJN03_30336, partial [Cucurbita argyrosperma subsp. sororia]1.4e-4778.46Show/hide
Query:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLS
        MG CAST T T+  NPK GMRL      QQQ NLDSIK+IHMDG ++EFSDPIKASKITS NPN  LCNS++M IGS VPSLSPDENLQ+GQIYFLLPLS
Subjt:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLS

Query:  LAQSPLSLPDLCNFAIKASSALRTLQSPFF
        LAQSPLSLPDLCNFAIKASSAL TLQSP F
Subjt:  LAQSPLSLPDLCNFAIKASSALRTLQSPFF

XP_008457799.2 PREDICTED: uncharacterized protein LOC103497401 [Cucumis melo]6.2e-5182.09Show/hide
Query:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQS----NLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFL
        MG CASTQ T +  NPK G++  KQQQ QQQ     NLDSIK+IHMDG IEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLS DENLQIGQIYFL
Subjt:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQS----NLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFL

Query:  LPLSLAQSPLSLPDLCNFAIKASSALRTLQSPFF
        LPLSLA SPLSLPDLCNFAIKASSALR + SPFF
Subjt:  LPLSLAQSPLSLPDLCNFAIKASSALRTLQSPFF

XP_022158695.1 uncharacterized protein LOC111025158 [Momordica charantia]2.4e-3465.85Show/hide
Query:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLS
        MG C STQTT    + + G+ + + +   ++S   +IKI+HMDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+P  
Subjt:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLS

Query:  LAQSPLSLPDLCNFAIKASSALR
         A +PLSLPDLCN AIKASSALR
Subjt:  LAQSPLSLPDLCNFAIKASSALR

TrEMBL top hitse value%identityAlignment
A0A0A0LJJ9 Uncharacterized protein5.1e-5182.44Show/hide
Query:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQ-SNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPL
        MG CASTQ T +  NPK G++ PKQQQ QQQ  NLDSIK++HMDG IEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLS DENLQ+GQIYFLLPL
Subjt:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQ-SNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPL

Query:  SLAQSPLSLPDLCNFAIKASSALRTLQSPFF
        SLA SPLSLPDLCNFAIKASSALR +QS  F
Subjt:  SLAQSPLSLPDLCNFAIKASSALRTLQSPFF

A0A1S3C6C0 uncharacterized protein LOC1034974013.0e-5182.09Show/hide
Query:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQS----NLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFL
        MG CASTQ T +  NPK G++  KQQQ QQQ     NLDSIK+IHMDG IEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLS DENLQIGQIYFL
Subjt:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQS----NLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFL

Query:  LPLSLAQSPLSLPDLCNFAIKASSALRTLQSPFF
        LPLSLA SPLSLPDLCNFAIKASSALR + SPFF
Subjt:  LPLSLAQSPLSLPDLCNFAIKASSALRTLQSPFF

A0A3S3P874 Uncharacterized protein3.7e-2552.8Show/hide
Query:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLS
        MG CAS QTT                  Q++S   + K+IHMDG ++EF  PIKA  I S+NP  FLC+SE ML+ S VP +  DE LQ+GQIYFL+PL 
Subjt:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLS

Query:  LAQSPLSLPDLCNFAIKASSALRTL
         +QSPLSLPDLC  AIKAS+AL +L
Subjt:  LAQSPLSLPDLCNFAIKASSALRTL

A0A5A7TSG7 DUF4228 domain-containing protein2.9e-3893.26Show/hide
Query:  MDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAIKASSALRTLQSPFF
        MDG IEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLS DENLQIGQIYFLLPLSLA SPLSLPDLCNFAIKASSALR + SPFF
Subjt:  MDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAIKASSALRTLQSPFF

A0A6J1DWJ0 uncharacterized protein LOC1110251581.1e-3465.85Show/hide
Query:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLS
        MG C STQTT    + + G+ + + +   ++S   +IKI+HMDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+P  
Subjt:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLS

Query:  LAQSPLSLPDLCNFAIKASSALR
         A +PLSLPDLCN AIKASSALR
Subjt:  LAQSPLSLPDLCNFAIKASSALR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G76600.1 unknown protein1.3e-1133.02Show/hide
Query:  QHQQQSNLDSIKIIHMDGCIEEFSDPIKASKI----------TSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAI
        +++  S+  + KI+ ++G + E+  P+ AS++          +S + ++FLCNS+ +     +P++  DE LQ  QIYF+LP+S  Q  LS  D+   A+
Subjt:  QHQQQSNLDSIKIIHMDGCIEEFSDPIKASKI----------TSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAI

Query:  KASSAL
        KAS A+
Subjt:  KASSAL

AT2G23690.1 unknown protein2.2e-1438.71Show/hide
Query:  QQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAIKASSAL
        + + + + K+I  DG + EF+ P+K   +  +NP  F+CNS+ M   + V ++S DE  Q+GQ+YF LPLS     L   ++   A+KASSAL
Subjt:  QQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAIKASSAL

AT3G50800.1 unknown protein8.4e-1437.23Show/hide
Query:  QQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAIKASSAL
        ++    ++ K+I  DG ++EFS P+K  +I  +NP  F+CNS+ M     V ++   E+L+ G++YF+LPL+    PL   ++   A+KASSAL
Subjt:  QQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAIKASSAL

AT4G37240.1 unknown protein7.6e-1535.25Show/hide
Query:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLS
        MG C+S+++T + T                       K+I  DG + EF++P+K   +  + P  F+CNS+ M     V ++S DE LQ+GQIYF LPL 
Subjt:  MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLS

Query:  LAQSPLSLPDLCNFAIKASSAL
          + PL   ++   A+KASSAL
Subjt:  LAQSPLSLPDLCNFAIKASSAL

AT5G66580.1 unknown protein2.4e-1643.18Show/hide
Query:  DSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAIKASSAL
        DS K+I +DG ++EFS P+K  +I  +NP  F+CNS++M     V +++ +E L+ GQ+YF+LPL+    PL   ++   A+KASSAL
Subjt:  DSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPDLCNFAIKASSAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGCTTGTGCTTCGACTCAAACAACGACAATGCCAACAAATCCGAAATTCGGAATGAGATTGCCAAAACAACAACAACATCAACAACAATCAAATCTGGATTCAAT
CAAGATCATTCACATGGACGGCTGCATCGAGGAGTTTTCCGATCCAATCAAAGCCTCCAAAATCACATCTCGAAACCCTAACTTCTTCCTCTGTAACTCCGAGCAAATGT
TAATCGGTAGCTGCGTTCCTTCTCTTTCCCCCGATGAAAACCTTCAAATCGGCCAAATCTACTTCCTCCTCCCTCTCTCTCTCGCTCAATCCCCTCTTTCTCTACCCGAT
CTCTGCAATTTCGCCATTAAAGCTTCCTCTGCTCTTCGAACGCTTCAATCTCCATTCTTCAGCAAAGAATGTTATGGTGGTGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGGCTTGTGCTTCGACTCAAACAACGACAATGCCAACAAATCCGAAATTCGGAATGAGATTGCCAAAACAACAACAACATCAACAACAATCAAATCTGGATTCAAT
CAAGATCATTCACATGGACGGCTGCATCGAGGAGTTTTCCGATCCAATCAAAGCCTCCAAAATCACATCTCGAAACCCTAACTTCTTCCTCTGTAACTCCGAGCAAATGT
TAATCGGTAGCTGCGTTCCTTCTCTTTCCCCCGATGAAAACCTTCAAATCGGCCAAATCTACTTCCTCCTCCCTCTCTCTCTCGCTCAATCCCCTCTTTCTCTACCCGAT
CTCTGCAATTTCGCCATTAAAGCTTCCTCTGCTCTTCGAACGCTTCAATCTCCATTCTTCAGCAAAGAATGTTATGGTGGTGACTAG
Protein sequenceShow/hide protein sequence
MGACASTQTTTMPTNPKFGMRLPKQQQHQQQSNLDSIKIIHMDGCIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSPDENLQIGQIYFLLPLSLAQSPLSLPD
LCNFAIKASSALRTLQSPFFSKECYGGD