CuGenDBv2

HG10018717 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10018717
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Magnesium transporter 4 isoform 1
Genome location: Chr04:7415284..7417390
RNA-Seq Expression: HG10018717
Synteny: HG10018717
Gene Ontology terms: GO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009522 - photosystem I (cellular component)
InterPro domains: IPR008796 - Photosystem I reaction centre subunit N, chloroplastic
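
The genome location string can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); the function name `parse_location` is hypothetical and not part of CuGenDBv2.

```python
import re

# Pattern for strings of the form "Chr04:7415284..7417390".
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


chrom, start, end = parse_location("Chr04:7415284..7417390")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2107 bp on Chr04.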


Homology
GenBank top hits (e value, % identity, alignment)
KAG7022506.1 hypothetical protein SDJN02_16238, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 4.7e-43; % identity: 79.29)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTT--TTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERL
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+   PPTT  TT+A+ +A S I RRGL+LSA    A AA   AVDSRTELLKRYLKKSE+NKEKNDKERL
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTT--TTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERL

Query:  ESYYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGS+KNK+ELSEAEKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

XP_004136934.1 uncharacterized protein LOC101214221 [Cucumis sativus] (e value: 5.6e-44; % identity: 81.16)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES
        MSSIGQSILMALAVTLNKFASSNVQSVQRN+A         ATAT  S IGRR LLLS +A  +AAA    VDSRTELLKRYLKKSE+NKEKNDKERLES
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES

Query:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        YYKRNYKDYFEFVEGSVKNKNELSEAEKGI+EWLKR+K
Subjt:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

XP_008455049.1 PREDICTED: uncharacterized protein LOC103495319 [Cucumis melo] (e value: 2.3e-45; % identity: 81.88)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES
        MSSIGQSILMALAVTLNKFASSNVQSVQRN+          ATAT  S IGRR LLLS VA  + AA   AVDSRTELLKRYLKKSE+NKEKNDKERLES
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES

Query:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        YYKRNYKDYFEFVEGSVKNKNELSEAEKGI+EWLKRNK
Subjt:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

XP_022927848.1 uncharacterized protein LOC111434615 [Cucurbita moschata] (e value: 2.8e-43; % identity: 80)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTT--TTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERL
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+   PPTT  TT+A+ +A S I RRGLLLSA    A AA   AVDSRTELLKRYLKKSE+NKEKNDKERL
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTT--TTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERL

Query:  ESYYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGS+KNK+ELSEAEKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

XP_038887440.1 uncharacterized protein LOC120077574 [Benincasa hispida] (e value: 5.1e-53; % identity: 92.03)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES
        MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKP T    ATAT GS IGRRGLLLSAVA+ AAA PEEAVDSRTELLKRYLKKSE+NKEKNDKERLES
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES

Query:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
Subjt:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C176 uncharacterized protein LOC103495319 (e value: 1.1e-45; % identity: 81.88)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES
        MSSIGQSILMALAVTLNKFASSNVQSVQRN+          ATAT  S IGRR LLLS VA  + AA   AVDSRTELLKRYLKKSE+NKEKNDKERLES
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES

Query:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        YYKRNYKDYFEFVEGSVKNKNELSEAEKGI+EWLKRNK
Subjt:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

A0A2I4HLI8 uncharacterized protein LOC109019213 (e value: 3.9e-35; % identity: 69.78)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES
        MSSIGQSILMAL VT+N+FASSNVQ+V R +   P +TTT  T    S IGRR LLL    ST  AAP+ A DSRT+LLK+YLKKSE+NK KNDKERL+S
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES

Query:  YYKRNYKDYFEFVEGSVK-NKNELSEAEKGIIEWLKRNK
        YYKRNYKDYFEFVEG+ K N+ +LSEAEKGII+WL+RNK
Subjt:  YYKRNYKDYFEFVEGSVK-NKNELSEAEKGIIEWLKRNK

A0A6J1D574 uncharacterized protein LOC111017388 isoform X1 (e value: 4.1e-40; % identity: 77.54)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES
        MSSIGQSILMALAVT+NKFASSNVQSV RNQ          + A A S IGRRGLL SAVA  AA AP   VDSRTELLKRYLKKSE NKEKNDKERL+S
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES

Query:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        YYKRNYKDYFEFVEGSV+NK+ELSE EK IIEWL+RNK
Subjt:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

A0A6J1EM63 uncharacterized protein LOC111434615 (e value: 1.3e-43; % identity: 80)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTT--TTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERL
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+   PPTT  TT+A+ +A S I RRGLLLSA    A AA   AVDSRTELLKRYLKKSE+NKEKNDKERL
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTT--TTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERL

Query:  ESYYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGS+KNK+ELSEAEKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

A0A6J1JNZ7 uncharacterized protein LOC111486201 (e value: 6.7e-43; % identity: 80.43)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+   PPTT   AT +A S I RRGLLLSA    A AA   AVDSRTELLKRYLKKSE+NKEKNDKERLES
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES

Query:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        +YKRNYKDYFEFVEGS+KNK+ELSEAEKGIIEWLKRNK
Subjt:  YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G49975.1 INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Photosystem I reaction centre subunit N (InterPro:IPR008796); Has 34 Blast hits to 34 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 2.6e-31; % identity: 58.57)
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES
        MSSI QSILMAL VT+NK+ASSNVQ+V+RN   +   T   A       +GRR +L S+ +  AA     A+ S  +LL++YLKK+E+NK KNDKERL+S
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLES

Query:  YYKRNYKDYFEFVEGSVKNKN--ELSEAEKGIIEWLKRNK
        +YKRNYKDYFEFVEGS+K K   ELSE+EK I+EWLK NK
Subjt:  YYKRNYKDYFEFVEGSVKNKN--ELSEAEKGIIEWLKRNK
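
The % identity values can be cross-checked against the alignment blocks. In BLAST-style output the middle line repeats the residue letter where query and subject agree, shows `+` for a similar but non-identical residue, and is blank otherwise. The following is a minimal sketch, assuming identity is counted over all alignment columns (gap columns included) and that the segments are passed in with the `Query:` label and leading indentation stripped; the function name `percent_identity` is hypothetical and not part of CuGenDBv2.

```python
def percent_identity(query_segments: list[str], midline_segments: list[str]) -> float:
    """Return identical columns as a percentage of all alignment columns.

    query_segments: the aligned query rows (gaps written as '-').
    midline_segments: the matching middle rows of the alignment.
    """
    # Alignment length is taken from the query rows, because trailing
    # blanks in a middle row may have been trimmed.
    columns = sum(len(segment) for segment in query_segments)
    if columns == 0:
        raise ValueError("empty alignment")
    # A letter in the middle row marks an identical pair; '+' and ' ' do not.
    identical = sum(ch.isalpha() for segment in midline_segments for ch in segment)
    return 100.0 * identical / columns


# Last segment of the Cucumis sativus (XP_004136934.1) alignment only.
query_tail = ["YYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK"]
midline_tail = ["YYKRNYKDYFEFVEGSVKNKNELSEAEKGI+EWLKR+K"]
print(round(percent_identity(query_tail, midline_tail), 2))
```

Passing every segment of a hit, rather than only the tail, gives the figure to compare with the % identity reported for that hit.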


Sequences
CDS sequence
ATGAGTTCCATCGGCCAAAGCATTCTCATGGCCCTCGCCGTCACTCTCAACAAATTCGCTTCCTCTAACGTTCAATCCGTTCAGAGAAACCAAGCCAACAAGCCTCCCAC
CACCACCACCGCCGCCACTGCCACCGCCGGTTCTGCAATCGGAAGAAGAGGCCTCCTCTTGTCCGCCGTTGCTTCCACCGCCGCCGCCGCTCCTGAAGAAGCCGTCGACT
CCAGAACCGAGCTGCTAAAAAGGTACCTCAAGAAGTCTGAAGATAACAAAGAAAAGAATGACAAGGAGAGATTGGAAAGTTACTACAAGCGAAATTACAAAGATTATTTT
GAGTTTGTTGAAGGATCAGTGAAGAATAAGAACGAACTTTCAGAAGCTGAAAAAGGTATAATTGAGTGGCTTAAACGAAATAAATAA
mRNA sequence
ATGAGTTCCATCGGCCAAAGCATTCTCATGGCCCTCGCCGTCACTCTCAACAAATTCGCTTCCTCTAACGTTCAATCCGTTCAGAGAAACCAAGCCAACAAGCCTCCCAC
CACCACCACCGCCGCCACTGCCACCGCCGGTTCTGCAATCGGAAGAAGAGGCCTCCTCTTGTCCGCCGTTGCTTCCACCGCCGCCGCCGCTCCTGAAGAAGCCGTCGACT
CCAGAACCGAGCTGCTAAAAAGGTACCTCAAGAAGTCTGAAGATAACAAAGAAAAGAATGACAAGGAGAGATTGGAAAGTTACTACAAGCGAAATTACAAAGATTATTTT
GAGTTTGTTGAAGGATCAGTGAAGAATAAGAACGAACTTTCAGAAGCTGAAAAAGGTATAATTGAGTGGCTTAAACGAAATAAATAA
Protein sequence
MSSIGQSILMALAVTLNKFASSNVQSVQRNQANKPPTTTTAATATAGSAIGRRGLLLSAVASTAAAAPEEAVDSRTELLKRYLKKSEDNKEKNDKERLESYYKRNYKDYF
EFVEGSVKNKNELSEAEKGIIEWLKRNK
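
The protein sequence should follow from the CDS by translation. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAGTTCCATCGGCCAAAGC"  # first 7 codons of the CDS above
cds_end = "CGAAATAAATAA"             # last 3 codons plus the stop codon
print(translate(cds_start))
print(translate(cds_end))
```

Joining the four wrapped CDS lines into one string and passing it to `translate` gives a sequence that can be compared with the listed protein.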