CuGenDBv2

Lsi11G015090 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi11G015090
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: protein THYLAKOID FORMATION1, chloroplastic
Genome location: chr11:23555771..23558670
RNA-Seq Expression: Lsi11G015090
Synteny: Lsi11G015090
Gene Ontology terms:
GO:0010027 - thylakoid membrane organization (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domains: IPR017499 - Protein Thf1
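
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The names `Region` and `parse_region` are hypothetical helpers introduced here, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state explicitly.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval parsed from a 'seqid:start..end' string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a location string such as 'chr11:23555771..23558670'."""
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Region(seqid, start, end)


region = parse_region("chr11:23555771..23558670")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the gene span is longer than the mRNA listed further down the page, which would be consistent with introns in the gene model.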


Homology
GenBank top hits (e value, % identity, alignment)
KAE8645760.1 hypothetical protein Csa_020345 [Cucumis sativus] | e value: 7.9e-152 | % identity: 95.29
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRL +PS+RS +SNF GF FRTSVF+HYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDAKK EEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTG

XP_004136805.1 protein THYLAKOID FORMATION1, chloroplastic [Cucumis sativus] | e value: 2.7e-152 | % identity: 95.3
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRL +PS+RS +SNF GF FRTSVF+HYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDAKK EEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] | e value: 9.9e-155 | % identity: 95.64
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR PVPS+RSL+SNFDGFRFRTS+F+HYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDA+KLEEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

XP_022969189.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima] | e value: 8.1e-149 | % identity: 91.95
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCSDRRLP+PSARSLAS+FDGFRFR SVF HYS VR S+FSSRMVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LM+GYPSDEDRDAIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

XP_038888611.1 protein THYLAKOID FORMATION1, chloroplastic [Benincasa hispida] | e value: 4.0e-156 | % identity: 97.32
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAV SISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSS MVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDRDAIFQAYI+ALNEDPEQYRIDA+KLEEWARSQTAASLV+FASREGEVESTLKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCA+LNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K3P0 Uncharacterized protein | e value: 1.3e-152 | % identity: 95.3
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRL +PS+RS +SNF GF FRTSVF+HYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDAKK EEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic | e value: 4.8e-155 | % identity: 95.64
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR PVPS+RSL+SNFDGFRFRTS+F+HYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDA+KLEEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

A0A5D3C7D3 Protein THYLAKOID FORMATION1 | e value: 4.8e-155 | % identity: 95.64
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR PVPS+RSL+SNFDGFRFRTS+F+HYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDR+AIFQAYI+ALNEDPEQYRIDA+KLEEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

A0A6J1GJN0 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 | e value: 2.0e-148 | % identity: 91.28
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCSDRRLP+PSARSLAS+FDGFRFR SVF HYS VR  +F+SRMVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LM+GYPSDEDRDAIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 | e value: 3.9e-149 | % identity: 91.95
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCSDRRLP+PSARSLAS+FDGFRFR SVF HYS VR S+FSSRMVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LM+GYPSDEDRDAIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL

SwissProt top hits (e value, % identity, alignment)
B0C3M8 Protein Thf1 | e value: 1.6e-35 | % identity: 37.9
Query:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWA
        ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ MDGY  + D+DAIF A  +A   DP Q + D ++L E A
Subjt:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWA

Query:  RSQTAASLVEF---ASREG--EVESTLKDIAERAGSKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKL
        +S++A  ++++   A+  G  E++  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LNI +  + +DL++YR  L K+
Subjt:  RSQTAASLVEF---ASREG--EVESTLKDIAERAGSKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKL

Query:  VQAKELLKEYVDREKKKRD
         Q ++ + + ++ +KK+R+
Subjt:  VQAKELLKEYVDREKKKRD

Q116P5 Protein Thf1 | e value: 1.2e-33 | % identity: 36.57
Query:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M GY   ED+ +IF A I+   EDP +YR DAK LE+ A   
Subjt:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQ

Query:  TAASLVEFASREGEVEST--LKDIAERAGSKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKEL
        +A+ ++ +      +++T  L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ + +D+D+Y + L ++ QA+  
Subjt:  TAASLVEFASREGEVEST--LKDIAERAGSKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKEL

Query:  LKEYVDREKKKRDERT
        +++ +   +KKR++R+
Subjt:  LKEYVDREKKKRDERT

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic | e value: 1.5e-108 | % identity: 69.28
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAAV S+SFS + Q ++R+  V S+RS+    D FRFR++       VR+S  +SR V+HC  S+  D+ TVA+TKL FL AYKRPIP++YNTVLQELIV
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLM+GYPS+EDR+AIF+AYIEAL EDPEQYR DA+KLEEWAR+Q A +LV+F+S+EGE+E+  KDIA+RAG+K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEY
         F YSR FA+GLFRLLELAN T+P+ILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER  +Q ANE +TKCLG+Y
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEY

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic | e value: 6.1e-99 | % identity: 67.71
Query:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAA++S+ F+ L + +D R   PS  + A+         SV     R R     SR V+ C++   DV  TVAETK+NFLK+YKRPI SIY+TVLQEL+V
Subjt:  MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKG
        QQHLMRYK TY+YD VFALGFVTVYDQLM+GYPS+EDRDAIF+AYI ALNEDPEQYR DA+K+EEWARSQ   SLVEF+S++GE+E+ LKDI+ERA  KG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITK
        +FSYSRFFA+GLFRLLELANATEP+IL+KLCAALNI+K+ VDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ER+ +  +NEA+TK
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITK

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic | e value: 1.0e-106 | % identity: 72.76
Query:  AVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ
        A++S+SF  L Q SD+     S+R LAS     R  T  FS  S    ST  S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQQ
Subjt:  AVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGNF
        HLMRYK+TYRYDPVFALGFVTVYDQLM+GYPSD+DRDAIF+AYIEALNEDP+QYRIDA+K+EEWARSQT+ASLV+F+S+EG++E+ LKDIA RAGSK  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGNF

Query:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGE
        SYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ER  SQ ANE I+KCLG+
Subjt:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGE

Arabidopsis top hits (e value, % identity, alignment)
AT2G20890.1 photosystem II reaction center PSB29 protein | e value: 7.4e-108 | % identity: 72.76
Query:  AVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ
        A++S+SF  L Q SD+     S+R LAS     R  T  FS  S    ST  S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQQ
Subjt:  AVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGNF
        HLMRYK+TYRYDPVFALGFVTVYDQLM+GYPSD+DRDAIF+AYIEALNEDP+QYRIDA+K+EEWARSQT+ASLV+F+S+EG++E+ LKDIA RAGSK  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGNF

Query:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGE
        SYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ER  SQ ANE I+KCLG+
Subjt:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGE
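
In the alignment blocks above, the unlabelled middle line repeats the residue where query and hit are identical, shows `+` for a conservative substitution, and is blank at a mismatch. As a rough sketch, percent identity for one block can be estimated from the query line and that midline. The function name `midline_identity` is hypothetical, and the value it returns covers only the fragment passed in, so it will not necessarily reproduce the % identity figures reported for the full alignments.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block, from its query line and midline.

    Both strings must already have the 'Query:  ' label (or the matching
    leading spaces of the midline) removed, so that columns line up.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    # A column is identical when the midline repeats the query residue.
    identical = sum(
        1 for q, m in zip(query, midline) if q == m and q not in "- "
    )
    return 100.0 * identical / len(query)


# First ten columns of the Benincasa hispida alignment shown above.
print(round(midline_identity("MAAVNSISFS", "MAAV SISFS"), 2))
```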


Sequences
CDS sequence
ATGGCGGCTGTTAATTCCATTTCATTCTCAACATTGAACCAATGTTCTGATAGAAGATTGCCGGTTCCGTCCGCTCGTTCACTCGCCTCCAATTTCGACGGCTTTCGTTT
TCGTACGAGCGTTTTCAGTCATTATTCCCGAGTTCGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACAACTGTAGCCGAGA
CAAAATTGAACTTCCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAACACTGTTCTGCAAGAATTGATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATAC
CGTTATGACCCTGTTTTCGCTCTTGGTTTTGTTACTGTATATGATCAGCTTATGGACGGGTACCCTAGCGATGAGGATCGTGATGCCATCTTCCAAGCATACATTGAGGC
ATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAG
TTGAGAGTACTTTGAAGGACATTGCAGAACGAGCAGGGAGTAAAGGGAATTTTAGTTACAGCCGATTTTTTGCTATTGGACTTTTTCGACTCCTTGAATTGGCAAATGCT
ACTGAACCCAGTATCCTGGAAAAGCTCTGTGCCGCTTTAAACATAGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGC
GAAAGAGCTCCTAAAGGAATACGTCGATAGAGAGAAGAAGAAAAGAGATGAGAGGACTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCA
TGCAGACTGGTTTATAA
mRNA sequence
CTCAGTCTCACCCTCTCTTCTTTCCTTTTTTCCCAGAAAATTTCTTCGAATTTCATTTTCTATACGTCTTTCTCCCTCTCTTCTTCGATATGAAATCCATTTTCTCCAAA
AATTCTTAAGCTTCGCAGATTTTTCCTCATTCTTCTTCAATGGCGGCTGTTAATTCCATTTCATTCTCAACATTGAACCAATGTTCTGATAGAAGATTGCCGGTTCCGTC
CGCTCGTTCACTCGCCTCCAATTTCGACGGCTTTCGTTTTCGTACGAGCGTTTTCAGTCATTATTCCCGAGTTCGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATT
GCATGTCCGCCGGAACAGATGTGACAACTGTAGCCGAGACAAAATTGAACTTCCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAACACTGTTCTGCAAGAATTG
ATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATACCGTTATGACCCTGTTTTCGCTCTTGGTTTTGTTACTGTATATGATCAGCTTATGGACGGGTACCCTAGCGA
TGAGGATCGTGATGCCATCTTCCAAGCATACATTGAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTG
CAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTACTTTGAAGGACATTGCAGAACGAGCAGGGAGTAAAGGGAATTTTAGTTACAGCCGATTTTTT
GCTATTGGACTTTTTCGACTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCCTGGAAAAGCTCTGTGCCGCTTTAAACATAGACAAAAAAGGTGTAGACCGAGACCT
TGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGCGAAAGAGCTCCTAAAGGAATACGTCGATAGAGAGAAGAAGAAAAGAGATGAGAGGACTGGATCACAGACAG
CTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCATGCAGACTGGTTTATAAAAAGGTGATTGGAGCATTTCCTAATTTGAGATAGACTTGAGAGTTATAGCAATA
TTCTAAATACCTGATAGATATGCATCTGTAATGTATTGTTGGGTCTCATGCATTTGGTAAATTTTGTATTCAGCCAGGCAGAACCTTTATTTCTCATTTGACGTGACCAC
CTCAAGCTATTTTTTAATCATTTGTATTACATATTTTGAGTGCAATTGTCAATACAAATGCTTTCGCACTCTGTGCAGTAATCTATTTGACCATCTGCTTTTGTATGTGC
AATCATTTATGGGTTATTATGATATTTCACCCTTTGCTTTATGGTCTAAA
Protein sequence
MAAVNSISFSTLNQCSDRRLPVPSARSLASNFDGFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMDGYPSDEDRDAIFQAYIEALNEDPEQYRIDAKKLEEWARSQTAASLVEFASREGEVESTLKDIAERAGSKGNFSYSRFFAIGLFRLLELANA
TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERTGSQTANEAITKCLGEYSMQTGL
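
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (an assumption; the page does not name a translation table) and uses a hypothetical helper `translate`. The two short inputs are the first 18 and the last 27 bases of the CDS.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove line breaks and other whitespace, then normalise case.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the open reading frame
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGCGGCTGTTAATTCC"))           # start of the CDS
print(translate("GAATACAGCATGCAGACTGGTTTATAA"))  # end of the CDS, including the stop codon
```

Passing the full CDS, with its line breaks left in place, should give a string that can be compared directly with the protein sequence once its lines are joined.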