; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G04230 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G04230
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionprotein THYLAKOID FORMATION1, chloroplastic
Genome locationChr7:3242950..3245643
RNA-Seq ExpressionCSPI07G04230
SyntenyCSPI07G04230
Gene Ontology termsGO:0010027 - thylakoid membrane organization (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domainsIPR017499 - Protein Thf1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645760.1 hypothetical protein Csa_020345 [Cucumis sativus]1.6e-160100Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

XP_004136805.1 protein THYLAKOID FORMATION1, chloroplastic [Cucumis sativus]4.2e-161100Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo]6.5e-15496.31Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR  +PSSRS SSNF GF FRTS+FTHYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

XP_022136235.1 protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia]5.9e-14791.58Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCS+RRLL+PS+RS +SNF GF FRTSVF HYS VR S++SSRMV+HCMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKK EEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

XP_038888611.1 protein THYLAKOID FORMATION1, chloroplastic [Benincasa hispida]5.1e-15194.63Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAV SISFSTLNQCSDRRL +PS+RS +SNF GF FRTSVF+HYSRVRASTFSS MVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDA+K EEWARSQTAASLV+FASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCA+LNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

TrEMBL top hitse value%identityAlignment
A0A0A0K3P0 Uncharacterized protein2.0e-161100Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic3.1e-15496.31Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR  +PSSRS SSNF GF FRTS+FTHYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

A0A5D3C7D3 Protein THYLAKOID FORMATION13.1e-15496.31Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR  +PSSRS SSNF GF FRTS+FTHYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

A0A6J1C3B8 protein THYLAKOID FORMATION1, chloroplastic isoform X12.8e-14791.58Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCS+RRLL+PS+RS +SNF GF FRTSVF HYS VR S++SSRMV+HCMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKK EEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X22.0e-14590.27Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCSDRRL +PS+RS +S+F GF FR SVF HYS VR S+FSSRMVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

SwissProt top hitse value%identityAlignment
B0C3M8 Protein Thf11.1e-3436.99Show/hide
Query:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWA
        ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ M+GY  + D++AIF A  KA   DP Q + D ++  E A
Subjt:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWA

Query:  RSQTAASLVEF---ASREG--EVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKL
        +S++A  ++++   A+  G  E++  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LNI +  + +DL++YR  L K+
Subjt:  RSQTAASLVEF---ASREG--EVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKL

Query:  VQAKELLKEYVDREKKKRD
         Q ++ + + ++ +KK+R+
Subjt:  VQAKELLKEYVDREKKKRD

Q116P5 Protein Thf17.7e-3335.65Show/hide
Query:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+GY   ED+ +IF A I+   EDP +YR DAK  E+ A   
Subjt:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQ

Query:  TAASLVEFASREGEVES--ILKDIAERAGSKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKEL
        +A+ ++ +      +++   L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ + +D+D+Y + L ++ QA+  
Subjt:  TAASLVEFASREGEVES--ILKDIAERAGSKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKEL

Query:  LKEYVDREKKKRDERA
        +++ +   +KKR++R+
Subjt:  LKEYVDREKKKRDERA

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic1.0e-10668.6Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAAV S+SFS + Q ++R+  + SSRS  +    F FR++       VR+S  +SR V+HC  S+  D+ TVA+TKL FL AYKRPIP++YNTVLQELIV
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLMEGYPS+EDR AIF+AYI+AL EDPEQYR DA+K EEWAR+Q A +LV+F+S+EGE+E+I KDIA+RAG+K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEY
         F YSR FA+GLFRLLELAN T+P+ILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER  +Q ANE +TKCLG+Y
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEY

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic8.1e-9967.71Show/hide
Query:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAA++S+ F+ L + +D R   PS+ + ++         SV     R R     SR V+ C++   DV  TVAETK+NFLK+YKRPI SIY+TVLQEL+V
Subjt:  MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKG
        QQHLMRYK TY+YD VFALGFVTVYDQLMEGYPS+EDR+AIF+AYI ALNEDPEQYR DA+K EEWARSQ   SLVEF+S++GE+E+ILKDI+ERA  KG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITK
        +FSYSRFFA+GLFRLLELANATEP+IL+KLCAALNI+K+ VDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ER+ +  +NEA+TK
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITK

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic1.8e-10671.13Show/hide
Query:  AVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFS-SRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        A++S+SF  L Q SD+     SSR  +S          + T +SR+  ++ S S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQ
Subjt:  AVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFS-SRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDA+K EEWARSQT+ASLV+F+S+EG++E++LKDIA RAGSK  
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE
        FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE I+KCLG+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE

Arabidopsis top hitse value%identityAlignment
AT2G20890.1 photosystem II reaction center PSB29 protein1.3e-10771.13Show/hide
Query:  AVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFS-SRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        A++S+SF  L Q SD+     SSR  +S          + T +SR+  ++ S S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQ
Subjt:  AVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFS-SRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDA+K EEWARSQT+ASLV+F+S+EG++E++LKDIA RAGSK  
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE
        FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE I+KCLG+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTGTTAATTCCATTTCATTCTCTACACTAAACCAATGTTCCGATAGGAGATTGCTGCTTCCCTCCTCTCGTTCGCACTCCTCCAATTTCCACGGCTTTCCTTT
TCGTACTAGCGTTTTCACTCATTATTCCCGAGTACGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACCACTGTGGCCGAGA
CAAAATTGAACTTCCTTAAGGCCTATAAACGGCCTATCCCTAGTATTTACAACACGGTTCTGCAAGAATTGATTGTTCAGCAGCATTTGATGAGGTATAAGAGGACATAC
CGTTATGATCCTGTTTTCGCTCTTGGATTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGAAGCCATCTTCCAAGCGTATATTAAGGC
TTTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTTGAAGAGTGGGCTCGATCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAG
TTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGGAGCAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTATTTCGTCTCCTTGAATTGGCAAATGCT
ACTGAGCCCAGTATCCTGGAAAAGCTCTGTGCCGCTTTAAATATCGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGTAACCTGCTTTCAAAGTTGGTTCAGGC
GAAAGAGCTCCTAAAGGAATATGTCGACAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCA
TGCAGACTGGTTTATAA
mRNA sequenceShow/hide mRNA sequence
TGGCCTCTCAGTCTCACCCTTCTCTTCTTTCTTCCTTCTTTCTCTTTCTTTTTCTTTCTTTCTAAAAAATTTCTTCGAATTTCATTTTCTATTCCTTTTTTCTCCCTCTC
TTCTTCGATATGAAATCCATTTTCTCCATAAATTCCTAAGCTTCGCAGATTTTTCCTCATTCTTCTTCAATGGCGGCTGTTAATTCCATTTCATTCTCTACACTAAACCA
ATGTTCCGATAGGAGATTGCTGCTTCCCTCCTCTCGTTCGCACTCCTCCAATTTCCACGGCTTTCCTTTTCGTACTAGCGTTTTCACTCATTATTCCCGAGTACGAGCAT
CCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACCACTGTGGCCGAGACAAAATTGAACTTCCTTAAGGCCTATAAACGGCCTATCCCT
AGTATTTACAACACGGTTCTGCAAGAATTGATTGTTCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCTCTTGGATTTGTTACTGTATA
TGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGAAGCCATCTTCCAAGCGTATATTAAGGCTTTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAA
AATTTGAAGAGTGGGCTCGATCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGGAGC
AAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTATTTCGTCTCCTTGAATTGGCAAATGCTACTGAGCCCAGTATCCTGGAAAAGCTCTGTGCCGCTTTAAA
TATCGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGTAACCTGCTTTCAAAGTTGGTTCAGGCGAAAGAGCTCCTAAAGGAATATGTCGACAGAGAGAAGAAGA
AAAGAGATGAGAGGGCTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCATGCAGACTGGTTTATAAAAGGTAATTGGAGTCGCTACATAA
TTTGAGATAGACTTGAGAGAGTTTATAGCAATATTATTCTAAATACTTGTTAGATATGCATCTATAATGTATTATGTTGGGTCTCATGCATTTGGTAAATTTTGTATTCA
GCCACGGCTCTACCTTTATTTCTCATTTGATATAACCACCACTCAAGCTATTTTTTAATCATTTGTATTACATACTTTGAGTGCAACTGTCAATACAAAATGCTTTCGCA
CTCTGTGCAGTAATCACTCCTCCAGGGATATTTGACCA
Protein sequenceShow/hide protein sequence
MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANA
TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGL