; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016091 (gene) of Snake gourd v1 genome

Gene IDTan0016091
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPhotosystem I reaction centre subunit N
Genome locationLG02:15614369..15617208
RNA-Seq ExpressionTan0016091
SyntenyTan0016091
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009522 - photosystem I (cellular component)
InterPro domainsIPR008796 - Photosystem I reaction centre subunit N, chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008455049.1 PREDICTED: uncharacterized protein LOC103495319 [Cucumis melo]7.8e-4381.95Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN
        MSSIGQSILMALAVTLNKFASSNVQSVQRN+     TAT S S IGRR LLLS V  A+ AA    VDSRTELLKRYLKKSEENKEKNDKERLESYYKRN
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN

Query:  YKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK
        YKDYFEFVEGS+KNK ELS++EKGI+EWLKRNK
Subjt:  YKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK

XP_022927848.1 uncharacterized protein LOC111434615 [Cucurbita moschata]2.3e-4277.14Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATA-------SGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERL
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+PK+ PT TA       + S+I RR LLLSA  AAA       VDSRTELLKRYLKKSEENKEKNDKERL
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATA-------SGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERL

Query:  ESYYKRNYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGSLKNK ELS++EKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK

XP_022989008.1 uncharacterized protein LOC111486201 [Cucurbita maxima]1.3e-4279.26Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPT--ATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYK
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+PK+ PT   T++ S+I RR LLLSA  AAA       VDSRTELLKRYLKKSEENKEKNDKERLES+YK
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPT--ATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYK

Query:  RNYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK
        RNYKDYFEFVEGSLKNK ELS++EKGIIEWLKRNK
Subjt:  RNYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK

XP_023531124.1 uncharacterized protein LOC111793462 [Cucurbita pepo subsp. pepo]2.3e-4277.14Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATA-------SGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERL
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+PK+ PT TA       + S+I RR LLLSA  AAA       VDSRTELLKRYLKKSEENKEKNDKERL
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATA-------SGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERL

Query:  ESYYKRNYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGSLKNK ELS++EKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK

XP_038887440.1 uncharacterized protein LOC120077574 [Benincasa hispida]6.2e-4888.06Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATA-SGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKR
        MSSIGQSILMALAVTLNKFASSNVQSVQRNQ     TATA +GS IGRR LLLSAV AAAAA PEE VDSRTELLKRYLKKSEENKEKNDKERLESYYKR
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATA-SGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKR

Query:  NYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK
        NYKDYFEFVEGS+KNK ELS++EKGIIEWLKRNK
Subjt:  NYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK

TrEMBL top hitse value%identityAlignment
A0A1S3C176 uncharacterized protein LOC1034953193.8e-4381.95Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN
        MSSIGQSILMALAVTLNKFASSNVQSVQRN+     TAT S S IGRR LLLS V  A+ AA    VDSRTELLKRYLKKSEENKEKNDKERLESYYKRN
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN

Query:  YKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK
        YKDYFEFVEGS+KNK ELS++EKGI+EWLKRNK
Subjt:  YKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK

A0A6A1UJ56 Uncharacterized protein1.4e-3468.89Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQ-PKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKR
        MSS+GQSILMAL VT+N+FASSNVQ+V R +  K++ T T + S+IGRR LLLS V    AAAP+ P DSR ELLK+Y KKS+ENK KNDKERL+SYYKR
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQ-PKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKR

Query:  NYKDYFEFVEGSLKNK-AELSDSEKGIIEWLKRNK
        NYKDYFEFVEGSLK K  +LS+SEKGI++WL+ NK
Subjt:  NYKDYFEFVEGSLKNK-AELSDSEKGIIEWLKRNK

A0A6J1D574 uncharacterized protein LOC111017388 isoform X11.6e-4177.44Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN
        MSSIGQSILMALAVT+NKFASSNVQSV RNQ     +A A+ S+IGRR LL SAV AA A     PVDSRTELLKRYLKKSE+NKEKNDKERL+SYYKRN
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN

Query:  YKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK
        YKDYFEFVEGS++NK+ELS++EK IIEWL+RNK
Subjt:  YKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK

A0A6J1EM63 uncharacterized protein LOC1114346151.1e-4277.14Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATA-------SGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERL
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+PK+ PT TA       + S+I RR LLLSA  AAA       VDSRTELLKRYLKKSEENKEKNDKERL
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATA-------SGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERL

Query:  ESYYKRNYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGSLKNK ELS++EKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK

A0A6J1JNZ7 uncharacterized protein LOC1114862016.4e-4379.26Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPT--ATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYK
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRN+PK+ PT   T++ S+I RR LLLSA  AAA       VDSRTELLKRYLKKSEENKEKNDKERLES+YK
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPT--ATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYK

Query:  RNYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK
        RNYKDYFEFVEGSLKNK ELS++EKGIIEWLKRNK
Subjt:  RNYKDYFEFVEGSLKNKAELSDSEKGIIEWLKRNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49975.1 INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Photosystem I reaction centre subunit N (InterPro:IPR008796); Has 34 Blast hits to 34 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.7e-3362.96Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN
        MSSI QSILMAL VT+NK+ASSNVQ+V+RN  K   + TA  +++GRR +L S+    AAA     + S  +LL++YLKK+EENK KNDKERL+S+YKRN
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN

Query:  YKDYFEFVEGSLKNK--AELSDSEKGIIEWLKRNK
        YKDYFEFVEGS+K K  AELS+SEK I+EWLK NK
Subjt:  YKDYFEFVEGSLKNK--AELSDSEKGIIEWLKRNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCCATCGGGCAAAGCATTCTGATGGCTCTCGCCGTCACTCTCAATAAATTCGCTTCCTCAAACGTTCAATCCGTTCAGAGAAACCAACCCAAAAGCTCACCCAC
CGCCACCGCCTCCGGTTCTGAAATCGGAAGAAGAGCCCTCCTCTTGTCCGCCGTTGGTGCCGCCGCCGCTGCCGCTCCTGAAGAACCCGTCGACTCCAGAACCGAGCTGC
TAAAAAGGTACCTAAAGAAGTCCGAAGAAAACAAAGAAAAGAATGACAAGGAGAGATTGGAGAGCTACTACAAGCGAAATTACAAAGATTATTTTGAGTTTGTTGAAGGA
TCTTTGAAGAATAAGGCTGAACTTTCTGACTCTGAGAAAGGTATTATTGAGTGGCTTAAGCGAAACAAATAA
mRNA sequenceShow/hide mRNA sequence
CTTTAATAGAGCTCTATGGACCAAGAAAGCCTCATCTCTTCTTCAACCTCATCCTCATCTTCAGACTGAAAACTGTCACTCCGATGAGTTCCATCGGGCAAAGCATTCTG
ATGGCTCTCGCCGTCACTCTCAATAAATTCGCTTCCTCAAACGTTCAATCCGTTCAGAGAAACCAACCCAAAAGCTCACCCACCGCCACCGCCTCCGGTTCTGAAATCGG
AAGAAGAGCCCTCCTCTTGTCCGCCGTTGGTGCCGCCGCCGCTGCCGCTCCTGAAGAACCCGTCGACTCCAGAACCGAGCTGCTAAAAAGGTACCTAAAGAAGTCCGAAG
AAAACAAAGAAAAGAATGACAAGGAGAGATTGGAGAGCTACTACAAGCGAAATTACAAAGATTATTTTGAGTTTGTTGAAGGATCTTTGAAGAATAAGGCTGAACTTTCT
GACTCTGAGAAAGGTATTATTGAGTGGCTTAAGCGAAACAAATAAAATTGATCATCTTTTTTCATATTCATTCAGTTAATTCATCTATCAGTTTCTTCTATTACATCAAT
TAATTACAACAATAGTTTTAATCTAGGTTGGCTTGGTAATAATTTTATTTTTAGTTTTATGATTTTGAAAACTAAGTTTGTGTAGTGTGATAATTTGTTTTTGAAATGAA
ATGTAAAAACTACATCTGAATTATTGGGGTTAATCACACATTTGATCTTTTCATTTGAGCCCTTATCTCTCAAGTTTTTCTTAAAGAAAAAAGGGTTAAATTACAAGTTT
AGTCCCTAAAATTTCAAGGTTGTATCTATTTGGTCCCTAAACTTTGAAAAGCATCAAATAGGTACATGAACTTTCAAGTTTATACCTAATAAGTCTCTGAACTCTTAAAT
GTGTCTACTAAGTTATTCAACTTTCAATTTTATGTTTAATAGGTTCTTGAGTTTTCAATTTTGTGTCCACAAGCCATTGACCAAGTTGACATTTTTAAAATTTATAAACT
TACTGAACACAAAATTGAAAGTTCAAATTTGACACTTTTAAAGTTCATACACCAAATAGACACAAACTTGAATGTTCAGGGACTAAACTTGTATTTAACCTATAACCCTC
TCGAAATTGCTTAGTAATGTAACATTTATATTCCGTTTGGTAACCATTGGTTTTTTGTTTTTGGGTTTTTAGAAATTATGCTTGTTTTTCTACAATTTAATTTCCTTCCT
CTTCTCTAGAATGATTTCCATCTTTTTTGTGGTACCACTTAAATTTCTAGCCAATTTTTTTTTTAAAAAATAAAGTTTTTGAAAACTATTTTTTTTTAGTTTTCAAAACT
TAGCTTGGTTTTTTTAAAATATATAGGTACAAAGTAGATAAGAAAACATAAAACCACATTGGTAAAAGTGGTGTTTTTAGGCTTTATTTTTAAAAAACCAAAAACTAAAA
ACAAAATGGTTATCAAATGGGGCTTTAATTACTAAGATTTCATAATTATTCAAATCAAACAGTAATAACAACTTCCATGGTTTTTGTAATTTTATTTTTGTACTGTCAGT
GAATTACTAGTTTTTGAAAATTATTACGATAAAATACCATTTTGGTCTCTGTACTTTAAACAGTGTTCTATTTTTGTCTCTATATTTTCAAATGTCCAATTTTAGTTTCT
GTACTTTTAATAAATCTTAAAACTGG
Protein sequenceShow/hide protein sequence
MSSIGQSILMALAVTLNKFASSNVQSVQRNQPKSSPTATASGSEIGRRALLLSAVGAAAAAAPEEPVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKDYFEFVEG
SLKNKAELSDSEKGIIEWLKRNK