CuGenDBv2

HG10005347 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10005347
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: protein THYLAKOID FORMATION1, chloroplastic
Genome location: Chr07:1712410..1714835
RNA-Seq Expression: HG10005347
Synteny: HG10005347
Gene Ontology terms:
GO:0010027 - thylakoid membrane organization (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domains: IPR017499 - Protein Thf1


Homology
GenBank top hits
KAE8645760.1 hypothetical protein Csa_020345 [Cucumis sativus] | e-value: 4.6e-152 | identity: 95.29%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRL +PS+RS +SNF GF FRTSVF+HYSRVRASTFSSRMVIHCMSAGTDVT VAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTG

XP_004136805.1 protein THYLAKOID FORMATION1, chloroplastic [Cucumis sativus] | e-value: 1.6e-152 | identity: 95.3%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRL +PS+RS +SNF GF FRTSVF+HYSRVRASTFSSRMVIHCMSAGTDVT VAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] | e-value: 8.4e-154 | identity: 94.97%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR PVPS+RSL+SNFDGFRFRTS+F+HYSRVR STFSSRMVIHCMSAGTDVT VAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

XP_022969189.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima] | e-value: 6.9e-148 | identity: 91.28%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCSDRRLP+PSARSLAS+FDGFRFR SVF HYS VR S+FSSRMVIHCM++GTDVT VAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LM+GYPSDEDRDAIFQAYI ALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

XP_038888611.1 protein THYLAKOID FORMATION1, chloroplastic [Benincasa hispida] | e-value: 3.4e-155 | identity: 96.64%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAV SISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSS MVIHCMSAGTDVT VAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDRDAIFQAYI+ALNEDPEQYRIDA+K EEWARSQTAASLV+FASREGEVESTLKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCA+LNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

TrEMBL top hits
A0A0A0K3P0 Uncharacterized protein | e-value: 7.7e-153 | identity: 95.3%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRL +PS+RS +SNF GF FRTSVF+HYSRVRASTFSSRMVIHCMSAGTDVT VAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic | e-value: 4.1e-154 | identity: 94.97%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR PVPS+RSL+SNFDGFRFRTS+F+HYSRVR STFSSRMVIHCMSAGTDVT VAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

A0A5D3C7D3 Protein THYLAKOID FORMATION1 | e-value: 4.1e-154 | identity: 94.97%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR PVPS+RSL+SNFDGFRFRTS+F+HYSRVR STFSSRMVIHCMSAGTDVT VAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

A0A6J1GJN0 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 | e-value: 1.7e-147 | identity: 90.6%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCSDRRLP+PSARSLAS+FDGFRFR SVF HYS VR  +F+SRMVIHCM++GTDVT VAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LM+GYPSDEDRDAIFQAYI ALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 | e-value: 3.3e-148 | identity: 91.28%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCSDRRLP+PSARSLAS+FDGFRFR SVF HYS VR S+FSSRMVIHCM++GTDVT VAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LM+GYPSDEDRDAIFQAYI ALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

SwissProt top hits
B0C3M8 Protein Thf1 | e-value: 1.8e-34 | identity: 37.67%
Query:  VAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQT
        V++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ MDGY  + D+DAIF A  +A   DP Q + D ++  E A+S++
Subjt:  VAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQT

Query:  AASLVEF---ASREG--EVESTLKDIAERAGSKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAK
        A  ++++   A+  G  E++  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LNI +  + +DL++YR  L K+ Q +
Subjt:  AASLVEF---ASREG--EVESTLKDIAERAGSKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAK

Query:  ELLKEYVDREKKKRD
        + + + ++ +KK+R+
Subjt:  ELLKEYVDREKKKRD

Q116P5 Protein Thf1 | e-value: 1.3e-32 | identity: 35.81%
Query:  VAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQT
        V++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M GY   ED+ +IF A I+   EDP +YR DAK  E+ A   +
Subjt:  VAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQT

Query:  AASLVEFASREGEVEST--LKDIAERAGSKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELL
        A+ ++ +      +++T  L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ + +D+D+Y + L ++ QA+  +
Subjt:  AASLVEFASREGEVEST--LKDIAERAGSKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELL

Query:  KEYVDREKKKRDERT
        ++ +   +KKR++R+
Subjt:  KEYVDREKKKRDERT

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic | e-value: 1.2e-107 | identity: 68.6%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHC-MSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAAV S+SFS + Q ++R+  V S+RS+    D FRFR++       VR+S  +SR V+HC  S+  D+  VA+TKL FL AYKRPIP++YNTVLQELIV
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHC-MSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLM+GYPS+EDR+AIF+AYIEAL EDPEQYR DA+K EEWAR+Q A +LV+F+S+EGE+E+  KDIA+RAG+K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEY
         F YSR FA+GLFRLLELAN T+P+ILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER  +Q ANE +TKCLG+Y
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEY

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic | e-value: 3.6e-99 | identity: 67.71%
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTP-VAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAA++S+ F+ L + +D R   PS  + A+         SV     R R     SR V+ C++   DV P VAETK+NFLK+YKRPI SIY+TVLQEL+V
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTP-VAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKG
        QQHLMRYK TY+YD VFALGFVTVYDQLM+GYPS+EDRDAIF+AYI ALNEDPEQYR DA+K EEWARSQ   SLVEF+S++GE+E+ LKDI+ERA  KG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITK
        +FSYSRFFA+GLFRLLELANATEP+IL+KLCAALNI+K+ VDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ER+ +  +NEA+TK
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITK

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic | e-value: 1.6e-107 | identity: 73.1%
Query:  AVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ
        A++S+SF  L Q SD+     S+R LAS     R  T  FS  S    ST  S+ +IHCMS  T DV PV+ETK  FLKAYKRPIPSIYNTVLQELIVQQ
Subjt:  AVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGNF
        HLMRYK+TYRYDPVFALGFVTVYDQLM+GYPSD+DRDAIF+AYIEALNEDP+QYRIDA+K EEWARSQT+ASLV+F+S+EG++E+ LKDIA RAGSK  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGNF

Query:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGE
        SYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ER  SQ ANE I+KCLG+
Subjt:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGE

Arabidopsis top hits
AT2G20890.1 photosystem II reaction center PSB29 protein | e-value: 1.1e-108 | identity: 73.1%
Query:  AVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ
        A++S+SF  L Q SD+     S+R LAS     R  T  FS  S    ST  S+ +IHCMS  T DV PV+ETK  FLKAYKRPIPSIYNTVLQELIVQQ
Subjt:  AVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGNF
        HLMRYK+TYRYDPVFALGFVTVYDQLM+GYPSD+DRDAIF+AYIEALNEDP+QYRIDA+K EEWARSQT+ASLV+F+S+EG++E+ LKDIA RAGSK  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGNF

Query:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGE
        SYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ER  SQ ANE I+KCLG+
Subjt:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGE
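
The % identity values reported for the hits above can be related to the alignment midlines (the unlabeled line between Query and Subjt). The following is a minimal sketch, assuming the usual BLAST-style convention in which a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is illustrative and is not part of CuGenDB. The input must keep any trailing spaces so that its length equals the alignment length.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style midline string.

    Assumption: letters mark identical residues; '+' and spaces
    mark non-identical positions. The midline length is taken as
    the alignment length.
    """
    if not midline:
        raise ValueError("midline must not be empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Ten-column fragment of the first GenBank midline (query: RRLPVPSARS)
print(percent_identity("RRL +PS+RS"))  # 7 identical of 10 columns
```

For a full hit, the midline segments of all alignment blocks would be concatenated before calling the function.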


Sequences
CDS sequence
ATGGCGGCTGTTAATTCCATTTCATTCTCAACATTGAACCAATGTTCTGATAGAAGATTGCCGGTTCCGTCCGCTCGTTCACTCGCCTCCAATTTCGACGGCTTTCGTTT
TCGTACGAGCGTTTTCAGTCATTATTCCCGAGTTCGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACACCTGTAGCCGAGA
CAAAATTGAACTTCCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAACACTGTTCTGCAAGAATTGATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATAC
CGTTATGACCCTGTTTTCGCTCTTGGTTTTGTTACTGTATATGATCAGCTTATGGACGGGTACCCTAGCGATGAGGATCGTGATGCCATCTTCCAAGCATACATTGAGGC
ATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTTGAAGAGTGGGCTCGGTCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAG
TTGAGAGTACTTTGAAGGACATTGCAGAACGAGCAGGGAGTAAGGGGAATTTTAGTTACAGCCGATTTTTTGCTATTGGACTTTTTCGACTCCTTGAATTGGCAAATGCT
ACTGAACCCAGTATCCTGGAAAAGCTCTGTGCCGCTTTAAACATAGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGC
GAAAGAGCTCCTAAAGGAATACGTCGATAGAGAGAAGAAGAAAAGAGATGAGAGGACTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCA
TGCAGACTGGTTTATAA
mRNA sequence
ATGGCGGCTGTTAATTCCATTTCATTCTCAACATTGAACCAATGTTCTGATAGAAGATTGCCGGTTCCGTCCGCTCGTTCACTCGCCTCCAATTTCGACGGCTTTCGTTT
TCGTACGAGCGTTTTCAGTCATTATTCCCGAGTTCGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACACCTGTAGCCGAGA
CAAAATTGAACTTCCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAACACTGTTCTGCAAGAATTGATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATAC
CGTTATGACCCTGTTTTCGCTCTTGGTTTTGTTACTGTATATGATCAGCTTATGGACGGGTACCCTAGCGATGAGGATCGTGATGCCATCTTCCAAGCATACATTGAGGC
ATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTTGAAGAGTGGGCTCGGTCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAG
TTGAGAGTACTTTGAAGGACATTGCAGAACGAGCAGGGAGTAAGGGGAATTTTAGTTACAGCCGATTTTTTGCTATTGGACTTTTTCGACTCCTTGAATTGGCAAATGCT
ACTGAACCCAGTATCCTGGAAAAGCTCTGTGCCGCTTTAAACATAGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGC
GAAAGAGCTCCTAAAGGAATACGTCGATAGAGAGAAGAAGAAAAGAGATGAGAGGACTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCA
TGCAGACTGGTTTATAA
Protein sequence
MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTPVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGNFSYSRFFAIGLFRLLELANA
TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
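
The CDS and protein records can be cross-checked by translating the coding sequence. The following is a minimal sketch, assuming the standard nuclear genetic code. The helper name `translate` is illustrative, and only the first 24 nucleotides and the last 9 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code, codons ordered with bases T, C, A, G
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the CDS: matches the start of the protein record (MAAVNSIS)
print(translate("ATGGCGGCTGTTAATTCCATTTCA"))
# End of the CDS: last two residues (GL) followed by the TAA stop codon
print(translate("GGTTTATAA"))
```

Passing the full CDS, with line breaks removed, should reproduce the protein sequence listed above if the record is internally consistent.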