; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014526 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014526
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionINVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages;
Genome locationchr12:1707613..1709613
RNA-Seq ExpressionLag0014526
SyntenyLag0014526
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009522 - photosystem I (cellular component)
InterPro domainsIPR008796 - Photosystem I reaction centre subunit N, chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008455049.1 PREDICTED: uncharacterized protein LOC103495319 [Cucumis melo]8.0e-4379.41Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY
        MSSIGQSILMALAVTLNKFASSNVQSVQ+NK    +TAT     +S IGRR LLLS +A A+  A   AVDSRT+LLKRYLKKSEENKEKNDKERLESYY
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY

Query:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK
        KRNYKDYFEFVEGS+KNK+ELSE+EKGI+EWLKRNK
Subjt:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK

XP_022927848.1 uncharacterized protein LOC111434615 [Cucurbita moschata]1.0e-4277.86Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTAT----AATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERL
        MSSIGQ+ILMALA+TLN+FASSNVQSVQ+NK  TP T T    A+T+ +S I RRGLLLSA  AA       AVDSRT+LLKRYLKKSEENKEKNDKERL
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTAT----AATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERL

Query:  ESYYKRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGSLKNK ELSE+EKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK

XP_022989008.1 uncharacterized protein LOC111486201 [Cucurbita maxima]6.1e-4380.15Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY
        MSSIGQ+ILMALA+TLN+FASSNVQSVQ+NK  TP T TA T+ +S I RRGLLLSA  AA       AVDSRT+LLKRYLKKSEENKEKNDKERLES+Y
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY

Query:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK
        KRNYKDYFEFVEGSLKNK ELSE+EKGIIEWLKRNK
Subjt:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK

XP_023531124.1 uncharacterized protein LOC111793462 [Cucurbita pepo subsp. pepo]1.4e-4277.86Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATA----GASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERL
        MSSIGQ+ILMALA+TLN+FASSNVQSVQ+NK  TP T TA T+     +S I RRGLLLSA  AA       AVDSRT+LLKRYLKKSEENKEKNDKERL
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATA----GASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERL

Query:  ESYYKRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGSLKNK ELSE+EKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK

XP_038887440.1 uncharacterized protein LOC120077574 [Benincasa hispida]8.8e-5087.5Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY
        MSSIGQSILMALAVTLNKFASSNVQSVQ+N+ N P TATA T   S IGRRGLLLSA+AAAAAT PE+AVDSRT+LLKRYLKKSEENKEKNDKERLESYY
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY

Query:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK
        KRNYKDYFEFVEGS+KNK+ELSE+EKGIIEWLKRNK
Subjt:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK

TrEMBL top hitse value%identityAlignment
A0A1S3C176 uncharacterized protein LOC1034953193.9e-4379.41Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY
        MSSIGQSILMALAVTLNKFASSNVQSVQ+NK    +TAT     +S IGRR LLLS +A A+  A   AVDSRT+LLKRYLKKSEENKEKNDKERLESYY
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY

Query:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK
        KRNYKDYFEFVEGS+KNK+ELSE+EKGI+EWLKRNK
Subjt:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK

A0A2H5P8W1 Uncharacterized protein1.1e-3466.67Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATA---ATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLE
        MSSIGQSILMAL VT+NK+AS NVQ+V + +  TP T  A   A A    IGRRGLLLS++ AA     +   DS+TQLL++YLKKSEENK KNDKER++
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATA---ATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLE

Query:  SYYKRNYKDYFEFVEGSLKNKS--ELSESEKGIIEWLKRNK
        SYYKRNYKDYF+F+EGSLK KS  ELSESEKGI+EWLK NK
Subjt:  SYYKRNYKDYFEFVEGSLKNKS--ELSESEKGIIEWLKRNK

A0A6J1D574 uncharacterized protein LOC111017388 isoform X12.3e-4075.74Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY
        MSSIGQSILMALAVT+NKFASSNVQSV +N+        +A A AS IGRRGLL SA+AAA A      VDSRT+LLKRYLKKSE+NKEKNDKERL+SYY
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY

Query:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK
        KRNYKDYFEFVEGS++NKSELSE+EK IIEWL+RNK
Subjt:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK

A0A6J1EM63 uncharacterized protein LOC1114346155.0e-4377.86Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTAT----AATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERL
        MSSIGQ+ILMALA+TLN+FASSNVQSVQ+NK  TP T T    A+T+ +S I RRGLLLSA  AA       AVDSRT+LLKRYLKKSEENKEKNDKERL
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTAT----AATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERL

Query:  ESYYKRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGSLKNK ELSE+EKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK

A0A6J1JNZ7 uncharacterized protein LOC1114862013.0e-4380.15Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY
        MSSIGQ+ILMALA+TLN+FASSNVQSVQ+NK  TP T TA T+ +S I RRGLLLSA  AA       AVDSRT+LLKRYLKKSEENKEKNDKERLES+Y
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY

Query:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK
        KRNYKDYFEFVEGSLKNK ELSE+EKGIIEWLKRNK
Subjt:  KRNYKDYFEFVEGSLKNKSELSESEKGIIEWLKRNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49975.1 INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Photosystem I reaction centre subunit N (InterPro:IPR008796); Has 34 Blast hits to 34 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).5.2e-3261.59Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY
        MSSI QSILMAL VT+NK+ASSNVQ+V++N     S     TA  + +GRR +L S+ +  AA     A+ S  QLL++YLKK+EENK KNDKERL+S+Y
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYY

Query:  KRNYKDYFEFVEGSLKNK--SELSESEKGIIEWLKRNK
        KRNYKDYFEFVEGS+K K  +ELSESEK I+EWLK NK
Subjt:  KRNYKDYFEFVEGSLKNK--SELSESEKGIIEWLKRNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCCATCGGCCAAAGCATTCTGATGGCTCTCGCCGTCACTCTCAACAAATTCGCTTCCTCTAACGTTCAATCTGTTCAGAAAAACAAACTCAACACACCATCTAC
CGCCACCGCTGCCACCGCCGGCGCTTCTCAAATTGGAAGAAGAGGCCTCCTCTTGTCTGCCATCGCCGCCGCCGCCGCCACCGCTCCCGAAGACGCCGTCGACTCCAGAA
CCCAACTGCTAAAAAGATACCTAAAGAAGTCCGAAGAAAACAAAGAAAAGAATGACAAGGAGAGATTGGAGAGTTACTACAAGCGAAATTACAAAGATTATTTTGAATTT
GTTGAAGGATCTTTGAAGAATAAGAGTGAACTTTCAGAGTCTGAGAAAGGTATTATTGAGTGGCTTAAGCGAAACAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCCATCGGCCAAAGCATTCTGATGGCTCTCGCCGTCACTCTCAACAAATTCGCTTCCTCTAACGTTCAATCTGTTCAGAAAAACAAACTCAACACACCATCTAC
CGCCACCGCTGCCACCGCCGGCGCTTCTCAAATTGGAAGAAGAGGCCTCCTCTTGTCTGCCATCGCCGCCGCCGCCGCCACCGCTCCCGAAGACGCCGTCGACTCCAGAA
CCCAACTGCTAAAAAGATACCTAAAGAAGTCCGAAGAAAACAAAGAAAAGAATGACAAGGAGAGATTGGAGAGTTACTACAAGCGAAATTACAAAGATTATTTTGAATTT
GTTGAAGGATCTTTGAAGAATAAGAGTGAACTTTCAGAGTCTGAGAAAGGTATTATTGAGTGGCTTAAGCGAAACAAATGA
Protein sequenceShow/hide protein sequence
MSSIGQSILMALAVTLNKFASSNVQSVQKNKLNTPSTATAATAGASQIGRRGLLLSAIAAAAATAPEDAVDSRTQLLKRYLKKSEENKEKNDKERLESYYKRNYKDYFEF
VEGSLKNKSELSESEKGIIEWLKRNK