; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031859 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031859
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein THYLAKOID FORMATION1, chloroplastic
Genome locationchr11:16942921..16944344
RNA-Seq ExpressionLag0031859
SyntenyLag0031859
Gene Ontology termsGO:0010027 - thylakoid membrane organization (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645760.1 hypothetical protein Csa_020345 [Cucumis sativus]4.9e-1080.85Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDAKK EEWA+S TA SLVEFASREGEV+SILKDIAE+A SKG+F+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo]4.9e-1080.85Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDA+KLEEWA+S TA SLVEFASREGEV+SILKDIAE+A SKG+F+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

XP_022136235.1 protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia]4.9e-1080.85Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDAKKLEEWA+S TA SLVEFAS+EGEV+SILKDIAE+A  KGSF+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

XP_022136236.1 protein THYLAKOID FORMATION1, chloroplastic isoform X2 [Momordica charantia]4.9e-1080.85Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDAKKLEEWA+S TA SLVEFAS+EGEV+SILKDIAE+A  KGSF+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

XP_022969189.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima]4.9e-1080.85Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDA+KLEEWA+S TA SLVEFASREGEV+SILKDIAE+A SKG+F+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

TrEMBL top hitse value%identityAlignment
A0A0A0K3P0 Uncharacterized protein2.4e-1080.85Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDAKK EEWA+S TA SLVEFASREGEV+SILKDIAE+A SKG+F+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

A0A5D3C7D3 Protein THYLAKOID FORMATION12.4e-1080.85Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDA+KLEEWA+S TA SLVEFASREGEV+SILKDIAE+A SKG+F+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

A0A6J1C2Y9 protein THYLAKOID FORMATION1, chloroplastic isoform X22.4e-1080.85Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDAKKLEEWA+S TA SLVEFAS+EGEV+SILKDIAE+A  KGSF+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

A0A6J1HVN0 protein THYLAKOID FORMATION1, chloroplastic-like isoform X12.4e-1080.85Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDA+KLEEWA+S TA SLVEFASREGEV+SILKDIAE+A SKG+F+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X22.4e-1080.85Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDA+KLEEWA+S TA SLVEFASREGEV+SILKDIAE+A SKG+F+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

SwissProt top hitse value%identityAlignment
Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic1.9e-0959.57Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        R DA+K+EEWA+S    SLVEF+S++GE+++ILKDI+E+A+ KGSF+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic1.6e-0857.45Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDA+K+EEWA+S T+ SLV+F+S+EG+++++LKDIA +A SK  F+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN

Arabidopsis top hitse value%identityAlignment
AT2G20890.1 photosystem II reaction center PSB29 protein1.2e-0957.45Show/hide
Query:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN
        RIDA+K+EEWA+S T+ SLV+F+S+EG+++++LKDIA +A SK  F+
Subjt:  RIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATAACTTCACTGGAAATAATATTCCTACCTTTGACTATTCTTTCAATAATTTCAAAAATGTTTCTTTATTTTTAGTATTTTCTGTTATTCCGTCTAGAATTGATGC
TAAAAAATTGGAAGAGTGGGCTCAATCTCATACTGCAACTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTCAGAGTATCTTGAAGGACATTGCAGAACAAGCAC
GGAGTAAGGGGAGTTTCAATGATGACATCACGTCAATTTTACCATGGGCGTTTGGAATCCGAAATTTTACCGTTGAAGAGAAGGAGGCTAAGCGAAGAAAAAGACAACAG
AGAGTTGAAGAACAAGAAAGAGCGAGAGAGGAGGAGGTTGTGGCAAAAGAAGATGAAAACCCAAAAGAATCTGACAAACCGAATCAAGAAACAGAGGCTGAAGGTGAGCA
TACCAAGGAGACAACACCGGAGCCGGTGCAGGAGGCCCATGTTGAAATCATTATGCCTGAACCACCCAAGCGCCGCCGCATCAAAAGGAAGGCGGGTCGCGTGAGGGTGG
TTCGGAACACTCCATCGCCTCCGACGTCGGACTCTGAGGAAGAAAGAAGGGAAGCTGAGAATAAGGAAAAAGAAGAAGAGGCAAGAAAGGAAGAAGAAGAACGTTTGCGC
GAACAGAGAGAAAGCAAGGGCAAAGGAATTGCCGAAGCATCGGGAGAAATTGAGGAGCCGAGGGCGCCATTCATTCGCTACGTCAACGATTTTGCCCGAGCAAAATACCA
GGAGGTGCTGAAACGAGACTTCTTGTTCGAACGTGGATTTGGCAATGATTTACCTAGGTTTTTGGAGTCTGGAATAGTGAACCTCGGTTGGAGGCAATTTTGTGCGAAGC
TAGAACCTGTGAATGCCAACATTGTTCGAGAATTTTACGCCAACCTTGACATTAAGAATGATTTTGAGGTTATTGTTCGAGGAGTGCCTGTACAGTGGAGCCCTGAGGCC
ATCAATGAAATGTTCGATCTCCAGGATTTTCCGCATGTCGTTTTTAATGAGATGGTGGCTGCACCATCTAGTGATCAACTGAGTGCGGCTGTCCGGGAGGTAGGCATTGA
GGGGGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATAACTTCACTGGAAATAATATTCCTACCTTTGACTATTCTTTCAATAATTTCAAAAATGTTTCTTTATTTTTAGTATTTTCTGTTATTCCGTCTAGAATTGATGC
TAAAAAATTGGAAGAGTGGGCTCAATCTCATACTGCAACTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTCAGAGTATCTTGAAGGACATTGCAGAACAAGCAC
GGAGTAAGGGGAGTTTCAATGATGACATCACGTCAATTTTACCATGGGCGTTTGGAATCCGAAATTTTACCGTTGAAGAGAAGGAGGCTAAGCGAAGAAAAAGACAACAG
AGAGTTGAAGAACAAGAAAGAGCGAGAGAGGAGGAGGTTGTGGCAAAAGAAGATGAAAACCCAAAAGAATCTGACAAACCGAATCAAGAAACAGAGGCTGAAGGTGAGCA
TACCAAGGAGACAACACCGGAGCCGGTGCAGGAGGCCCATGTTGAAATCATTATGCCTGAACCACCCAAGCGCCGCCGCATCAAAAGGAAGGCGGGTCGCGTGAGGGTGG
TTCGGAACACTCCATCGCCTCCGACGTCGGACTCTGAGGAAGAAAGAAGGGAAGCTGAGAATAAGGAAAAAGAAGAAGAGGCAAGAAAGGAAGAAGAAGAACGTTTGCGC
GAACAGAGAGAAAGCAAGGGCAAAGGAATTGCCGAAGCATCGGGAGAAATTGAGGAGCCGAGGGCGCCATTCATTCGCTACGTCAACGATTTTGCCCGAGCAAAATACCA
GGAGGTGCTGAAACGAGACTTCTTGTTCGAACGTGGATTTGGCAATGATTTACCTAGGTTTTTGGAGTCTGGAATAGTGAACCTCGGTTGGAGGCAATTTTGTGCGAAGC
TAGAACCTGTGAATGCCAACATTGTTCGAGAATTTTACGCCAACCTTGACATTAAGAATGATTTTGAGGTTATTGTTCGAGGAGTGCCTGTACAGTGGAGCCCTGAGGCC
ATCAATGAAATGTTCGATCTCCAGGATTTTCCGCATGTCGTTTTTAATGAGATGGTGGCTGCACCATCTAGTGATCAACTGAGTGCGGCTGTCCGGGAGGTAGGCATTGA
GGGGGCCTAG
Protein sequenceShow/hide protein sequence
MYNFTGNNIPTFDYSFNNFKNVSLFLVFSVIPSRIDAKKLEEWAQSHTATSLVEFASREGEVQSILKDIAEQARSKGSFNDDITSILPWAFGIRNFTVEEKEAKRRKRQQ
RVEEQERAREEEVVAKEDENPKESDKPNQETEAEGEHTKETTPEPVQEAHVEIIMPEPPKRRRIKRKAGRVRVVRNTPSPPTSDSEEERREAENKEKEEEARKEEEERLR
EQRESKGKGIAEASGEIEEPRAPFIRYVNDFARAKYQEVLKRDFLFERGFGNDLPRFLESGIVNLGWRQFCAKLEPVNANIVREFYANLDIKNDFEVIVRGVPVQWSPEA
INEMFDLQDFPHVVFNEMVAAPSSDQLSAAVREVGIEGA