CuGenDBv2

ClCG02G007100 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG02G007100
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Magnesium transporter 4 isoform 1
Genome location: CG_Chr02:8147691..8156191
RNA-Seq Expression: ClCG02G007100
Synteny: ClCG02G007100
Gene Ontology terms:
  GO:0015979 - photosynthesis (biological process)
  GO:0009507 - chloroplast (cellular component)
  GO:0009522 - photosystem I (cellular component)
InterPro domains: IPR008796 - Photosystem I reaction centre subunit N, chloroplastic
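
The Genome location value follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention), the span of the locus can be computed as follows. The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(location: str):
    """Split 'seqid:start..end' into its parts and compute the span length.

    Assumes 1-based, inclusive coordinates (not confirmed by the record).
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    return match["seqid"], start, end, end - start + 1

seqid, start, end, length = parse_location("CG_Chr02:8147691..8156191")
print(seqid, start, end, length)
```

Under that assumption the locus spans 8501 bp on CG_Chr02.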


Homology
GenBank top hits | e value | % identity | Alignment
KAG6588719.1 hypothetical protein SDJN03_17284, partial [Cucurbita argyrosperma subsp. sororia] | 5.1e-35 | 77.6
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYK
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+   P T T T +A S   RRGL+LS  AAVAA    AVDSRT+LLKRYLKKSEENKEKN+KERLES+YK
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYK

Query:  RNYKDYFEFVEGSVKNKNELSEAEK
        RNYKDYFEFVEGS+KNK+ELSEAEK
Subjt:  RNYKDYFEFVEGSVKNKNELSEAEK

XP_004136934.1 uncharacterized protein LOC101214221 [Cucumis sativus] | 4.4e-39 | 81.75
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYK
        MSSIGQSILMALAVTLNKFASSNVQSVQRN+      AT TAT  S  GRR LLLS +A  +AA    VDSRTELLKRYLKKSEENKEKN+KERLESYYK
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYK

Query:  RNYKDYFEFVEGSVKNKNELSEAEKG
        RNYKDYFEFVEGSVKNKNELSEAEKG
Subjt:  RNYKDYFEFVEGSVKNKNELSEAEKG

XP_008455049.1 PREDICTED: uncharacterized protein LOC103495319 [Cucumis melo] | 1.4e-37 | 81.89
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIA-AVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYY
        MSSIGQSILMALAVTLNKFASSNVQSVQRN+A        TAT  S  GRR LLLS +A A  AA   AVDSRTELLKRYLKKSEENKEKN+KERLESYY
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIA-AVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYY

Query:  KRNYKDYFEFVEGSVKNKNELSEAEKG
        KRNYKDYFEFVEGSVKNKNELSEAEKG
Subjt:  KRNYKDYFEFVEGSVKNKNELSEAEKG

XP_022989008.1 uncharacterized protein LOC111486201 [Cucurbita maxima] | 3.0e-35 | 78.91
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANG--RRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESY
        MSSIGQ+ILMALA+TLN+FASSNVQSVQR   NKP+T  TTAT  +++   RRGLLLS  AAVAA    AVDSRTELLKRYLKKSEENKEKN+KERLES+
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANG--RRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESY

Query:  YKRNYKDYFEFVEGSVKNKNELSEAEKG
        YKRNYKDYFEFVEGS+KNK+ELSEAEKG
Subjt:  YKRNYKDYFEFVEGSVKNKNELSEAEKG

XP_038887440.1 uncharacterized protein LOC120077574 [Benincasa hispida] | 3.1e-48 | 92.86
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYK
        MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTA  TAT GS  GRRGLLLSA+AA AA PEEAVDSRTELLKRYLKKSEENKEKN+KERLESYYK
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYK

Query:  RNYKDYFEFVEGSVKNKNELSEAEKG
        RNYKDYFEFVEGSVKNKNELSEAEKG
Subjt:  RNYKDYFEFVEGSVKNKNELSEAEKG

TrEMBL top hits | e value | % identity | Alignment
A0A1S3C176 uncharacterized protein LOC103495319 | 6.9e-38 | 81.89
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIA-AVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYY
        MSSIGQSILMALAVTLNKFASSNVQSVQRN+A        TAT  S  GRR LLLS +A A  AA   AVDSRTELLKRYLKKSEENKEKN+KERLESYY
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIA-AVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYY

Query:  KRNYKDYFEFVEGSVKNKNELSEAEKG
        KRNYKDYFEFVEGSVKNKNELSEAEKG
Subjt:  KRNYKDYFEFVEGSVKNKNELSEAEKG

A0A6J1D574 uncharacterized protein LOC111017388 isoform X1 | 1.0e-33 | 76.8
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYK
        MSSIGQSILMALAVT+NKFASSNVQSV RNQ        + A A S  GRRGLL SA+AA A AP   VDSRTELLKRYLKKSE+NKEKN+KERL+SYYK
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYK

Query:  RNYKDYFEFVEGSVKNKNELSEAEK
        RNYKDYFEFVEGSV+NK+ELSE EK
Subjt:  RNYKDYFEFVEGSVKNKNELSEAEK

A0A6J1EM63 uncharacterized protein LOC111434615 | 3.2e-35 | 76.15
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANG----RRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLE
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+   P T T T +A ++      RRGLLLS  AAVAA    AVDSRTELLKRYLKKSEENKEKN+KERLE
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANG----RRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLE

Query:  SYYKRNYKDYFEFVEGSVKNKNELSEAEKG
        S+YKRNYKDYFEFVEGS+KNK+ELSEAEKG
Subjt:  SYYKRNYKDYFEFVEGSVKNKNELSEAEKG

A0A6J1JNZ7 uncharacterized protein LOC111486201 | 1.4e-35 | 78.91
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANG--RRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESY
        MSSIGQ+ILMALA+TLN+FASSNVQSVQR   NKP+T  TTAT  +++   RRGLLLS  AAVAA    AVDSRTELLKRYLKKSEENKEKN+KERLES+
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANG--RRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESY

Query:  YKRNYKDYFEFVEGSVKNKNELSEAEKG
        YKRNYKDYFEFVEGS+KNK+ELSEAEKG
Subjt:  YKRNYKDYFEFVEGSVKNKNELSEAEKG

A0A6P4AUL6 uncharacterized protein LOC107427022 isoform X1 | 6.9e-30 | 66.67
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYK
        MSSIGQSILMAL VT+NKFASSNV +V R Q + P + +TT     ANGRRGLLLS + A +   E+  DSRT+LLK+YLKKSEENK KN+KERL+SYYK
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYK

Query:  RNYKDYFEFVEGSVKNK-NELSEAEKGSL
        RNYKDYFEF EG+++ K  ELSE+EKG L
Subjt:  RNYKDYFEFVEGSVKNK-NELSEAEKGSL

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G49975.1 INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Photosystem I reaction centre subunit N (InterPro:IPR008796); Has 34 Blast hits to 34 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 6.0e-26 | 60.16
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSAN-GRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYY
        MSSI QSILMAL VT+NK+ASSNVQ+V+RN      T   + TA  A+ GRR +L S+ + +AA    A+ S  +LL++YLKK+EENK KN+KERL+S+Y
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSAN-GRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYY

Query:  KRNYKDYFEFVEGSVKNKN--ELSEAEK
        KRNYKDYFEFVEGS+K K   ELSE+EK
Subjt:  KRNYKDYFEFVEGSVKNKN--ELSEAEK


Sequences
CDS sequence
ATGAGTTCCATCGGCCAAAGCATTCTGATGGCCCTCGCCGTCACTCTCAACAAATTCGCTTCCTCCAACGTTCAATCCGTTCAGAGAAACCAAGCCAACAAGCCTCGCAC
CGCCACCACCACTGCCACCGCCGGTTCTGCAAATGGAAGAAGAGGCCTCCTCTTGTCTGCCATTGCCGCCGTGGCTGCCGCTCCCGAAGAAGCCGTGGACTCCAGAACCG
AGCTGCTAAAAAGGTACCTCAAGAAGTCTGAAGAAAACAAAGAAAAGAATGAGAAGGAGAGATTGGAGAGTTACTACAAGAGAAATTACAAAGATTATTTTGAGTTTGTT
GAAGGATCAGTGAAGAATAAGAACGAACTTTCAGAAGCTGAAAAAGGAAGTTTACTGATTGTTGAGTTTGAGGAAGGTTTGGAAATTCCGGCGAAGATAGTCTTCAAAGG
TTGTAGTGAGAAGAGTGTGATTACTAGTGATGTTGAGTCTGCTGAAGACACGTTTGATGATATCATTCCTATTAAGGTATTAAGCCAAAAAAGAAAGAAGCAAGCTAGTT
CTACTGGCTATGATACGTCTCTCCCTTCAAAGAAAATTCAGATTCTGAGTGGTGAGAGGATGATGATACTGGTTTTGAGTGTGAAAAATCTAAAGTCTCTGAAGGAAAAC
AAACATGATGCCCAAGTAAGTAGCACATGGGGCATGGAACGTCGCAATGCTCAAGGGTACTGCAGCACGACGCGCCTACAAAGGCATAATTAA
mRNA sequence
ATGAGTTCCATCGGCCAAAGCATTCTGATGGCCCTCGCCGTCACTCTCAACAAATTCGCTTCCTCCAACGTTCAATCCGTTCAGAGAAACCAAGCCAACAAGCCTCGCAC
CGCCACCACCACTGCCACCGCCGGTTCTGCAAATGGAAGAAGAGGCCTCCTCTTGTCTGCCATTGCCGCCGTGGCTGCCGCTCCCGAAGAAGCCGTGGACTCCAGAACCG
AGCTGCTAAAAAGGTACCTCAAGAAGTCTGAAGAAAACAAAGAAAAGAATGAGAAGGAGAGATTGGAGAGTTACTACAAGAGAAATTACAAAGATTATTTTGAGTTTGTT
GAAGGATCAGTGAAGAATAAGAACGAACTTTCAGAAGCTGAAAAAGGAAGTTTACTGATTGTTGAGTTTGAGGAAGGTTTGGAAATTCCGGCGAAGATAGTCTTCAAAGG
TTGTAGTGAGAAGAGTGTGATTACTAGTGATGTTGAGTCTGCTGAAGACACGTTTGATGATATCATTCCTATTAAGGTATTAAGCCAAAAAAGAAAGAAGCAAGCTAGTT
CTACTGGCTATGATACGTCTCTCCCTTCAAAGAAAATTCAGATTCTGAGTGGTGAGAGGATGATGATACTGGTTTTGAGTGTGAAAAATCTAAAGTCTCTGAAGGAAAAC
AAACATGATGCCCAAGTAAGTAGCACATGGGGCATGGAACGTCGCAATGCTCAAGGGTACTGCAGCACGACGCGCCTACAAAGGCATAATTAA
Protein sequence
MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPRTATTTATAGSANGRRGLLLSAIAAVAAAPEEAVDSRTELLKRYLKKSEENKEKNEKERLESYYKRNYKDYFEFV
EGSVKNKNELSEAEKGSLLIVEFEEGLEIPAKIVFKGCSEKSVITSDVESAEDTFDDIIPIKVLSQKRKKQASSTGYDTSLPSKKIQILSGERMMILVLSVKNLKSLKEN
KHDAQVSSTWGMERRNAQGYCSTTRLQRHN
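
As a quick consistency check between the CDS and protein records above, the sketch below translates a nucleotide string with the standard genetic code in plain Python. The function name `translate` and the way the codon table is built are illustrative choices, not part of CuGenDB. Only the first 30 nt and the last 12 nt of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAGTTCCATCGGCCAAAGCATTCTGATG"  # first 30 nt of the CDS
cds_end = "AGGCATAATTAA"                      # last 12 nt of the CDS, in frame

print(translate(cds_start))
print(translate(cds_end))
```

Run as written, this prints MSSIGQSILM and RHN, which match the first ten and last three residues of the protein sequence. Passing the full CDS, with line breaks joined, to the same function checks the whole record.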