CuGenDBv2

Sed0025834 (gene) of Chayote v1 genome

Gene ID: Sed0025834
Organism: Sechium edule (Chayote v1)
Description: protein THYLAKOID FORMATION1, chloroplastic
Genome location: LG05:33079862..33085079
RNA-Seq Expression: Sed0025834
Synteny: Sed0025834
Gene Ontology terms:
GO:0010027 - thylakoid membrane organization (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domains: IPR017499 - Protein Thf1
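
The Genome location field follows a scaffold:start..end layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the field can be split and the gene span computed as below. The names parse_location and span_length are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'LG05:33079862..33085079' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return abs(end - start) + 1

scaffold, start, end = parse_location("LG05:33079862..33085079")
print(scaffold, start, end, span_length(start, end))
```

Under that assumption the locus spans 5218 bp, which is longer than the mRNA listed in the Sequences section; the difference would be explained by introns, although the record does not list the exon structure.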


Homology
GenBank top hits (e value, % identity, alignment)
XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo]; e value 3.4e-139; identity 88.26%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ
        MAAVN +SFSTL+QCS+R  PVPS RSL+ NF GFRFR+S+F   S VR ST +SR VI C +AGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL
        FSYSRFFAIGLFRLLELAN TEPSILEKLCAALN+DKK VDRDLDVYRNLLSKLVQAKELLKEYI+REKKKRDERAGSQ ANE ITKCLGE+SMQ GL
Subjt:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL

XP_022136235.1 protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia]; e value 2.8e-141; identity 89.9%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ
        MAAVN VSFS LSQCSER L VPS RSLA NF GFRFR+SVFC  SGVRTS+ +SR V+ C +AGTDVTTVAETK+NFLK+YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFASKEGEVESILKDIAERAG KGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAG
        FSYSRFFAIGLFRLLELAN TEPSILEKLCAALNV+KKSVDRDLDVYRNLLSKLVQAKELLKEY++REKKKRDERAGSQ ANE ITKCLGE+SMQ G
Subjt:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAG

XP_022952157.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita moschata]; e value 5.8e-139; identity 87.25%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ
        MAAVN VSFS LSQCS+R LP+PS RSLA +F GFRFR SVFC  SGVRT + NSR VI C A+GTDVTTVAETK+NFLK YKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL
        FSYSRFFAIGLFRLLELAN +EPSILEKLCAALNVDKK VDRDLDVYRNLLSKLVQAKELLKEY++REKKKRDERAGSQ A+E ITKCLGE+SMQ GL
Subjt:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL

XP_022969189.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima]; e value 1.0e-138; identity 87.25%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ
        MAAVN VSFS LSQCS+R LP+PS RSLA +F GFRFR SVFC  SGVRTS+ +SR VI C A+GTDVTTVAETK+NFLK YKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL
        FSYSRFFAIGLFRLLELAN +EPSILEKLCAALNVDKK VDRDLDVYRNLLSKLVQAKELLKEY++REKKKRDERAGSQ A+E ITKCLGE+SMQ GL
Subjt:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL

XP_023554556.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo]; e value 3.8e-138; identity 86.58%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ
        MAAVN VSF+ LSQCS+R LP+PS RSLA +F GFRFR SVFC  SGVRTS+ +SR VI C A+GTDVTTVAETK+NFLK YKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL
        FSYSRFFAIGLFRLLELAN +EPSILEKLCAALNVDKK VDRDLDVYRNLLSKLVQAKELLKEY++REKKKRDERAGSQ A+E ITKCLGE+SMQ GL
Subjt:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic; e value 1.7e-139; identity 88.26%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ
        MAAVN +SFSTL+QCS+R  PVPS RSL+ NF GFRFR+S+F   S VR ST +SR VI C +AGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL
        FSYSRFFAIGLFRLLELAN TEPSILEKLCAALN+DKK VDRDLDVYRNLLSKLVQAKELLKEYI+REKKKRDERAGSQ ANE ITKCLGE+SMQ GL
Subjt:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL

A0A5D3C7D3 Protein THYLAKOID FORMATION1; e value 1.7e-139; identity 88.26%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ
        MAAVN +SFSTL+QCS+R  PVPS RSL+ NF GFRFR+S+F   S VR ST +SR VI C +AGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL
        FSYSRFFAIGLFRLLELAN TEPSILEKLCAALN+DKK VDRDLDVYRNLLSKLVQAKELLKEYI+REKKKRDERAGSQ ANE ITKCLGE+SMQ GL
Subjt:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL

A0A6J1C3B8 protein THYLAKOID FORMATION1, chloroplastic isoform X1; e value 1.4e-141; identity 89.9%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ
        MAAVN VSFS LSQCSER L VPS RSLA NF GFRFR+SVFC  SGVRTS+ +SR V+ C +AGTDVTTVAETK+NFLK+YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFASKEGEVESILKDIAERAG KGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAG
        FSYSRFFAIGLFRLLELAN TEPSILEKLCAALNV+KKSVDRDLDVYRNLLSKLVQAKELLKEY++REKKKRDERAGSQ ANE ITKCLGE+SMQ G
Subjt:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAG

A0A6J1GJN0 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2; e value 2.8e-139; identity 87.25%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ
        MAAVN VSFS LSQCS+R LP+PS RSLA +F GFRFR SVFC  SGVRT + NSR VI C A+GTDVTTVAETK+NFLK YKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL
        FSYSRFFAIGLFRLLELAN +EPSILEKLCAALNVDKK VDRDLDVYRNLLSKLVQAKELLKEY++REKKKRDERAGSQ A+E ITKCLGE+SMQ GL
Subjt:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2; e value 4.8e-139; identity 87.25%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ
        MAAVN VSFS LSQCS+R LP+PS RSLA +F GFRFR SVFC  SGVRTS+ +SR VI C A+GTDVTTVAETK+NFLK YKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL
        FSYSRFFAIGLFRLLELAN +EPSILEKLCAALNVDKK VDRDLDVYRNLLSKLVQAKELLKEY++REKKKRDERAGSQ A+E ITKCLGE+SMQ GL
Subjt:  FSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL

SwissProt top hits (e value, % identity, alignment)
B0C3M8 Protein Thf1; e value 1.1e-36; identity 36.91%
Query:  DVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWA
        ++ TV++TK  F  ++ RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ M+GY  + D++AIF A  KA   DP Q + D Q+L E A
Subjt:  DVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWA

Query:  RSQTAASLVEF---ASKEG--EVESILKDIAERAGSKGSFSYSRFFAIGLFRLLELA--NVTE-----PSILEKLCAALNVDKKSVDRDLDVYRNLLSKL
        +S++A  ++++   A+  G  E++  L++IA+       F YSR FAIGLF LLEL+  N+T+        L  +C  LN+ +  + +DL++YR  L K+
Subjt:  RSQTAASLVEF---ASKEG--EVESILKDIAERAGSKGSFSYSRFFAIGLFRLLELA--NVTE-----PSILEKLCAALNVDKKSVDRDLDVYRNLLSKL

Query:  VQAKELLKEYIEREKKKRD-ERAGSQAANETIT
         Q ++ + + +E +KK+R+ ++A  + +++T T
Subjt:  VQAKELLKEYIEREKKKRD-ERAGSQAANETIT

Q116P5 Protein Thf1; e value 3.4e-33; identity 36.57%
Query:  TVAETKSNFLKMYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+GY   ED+ +IF A I+   EDP +YR DA+ LE+ A   
Subjt:  TVAETKSNFLKMYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ

Query:  TAASLVEF--ASKEGEVESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANV-------TEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKEL
        +A+ ++ +   SK  +    L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ + +D+D+Y + L ++ QA+  
Subjt:  TAASLVEF--ASKEGEVESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANV-------TEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKEL

Query:  LKEYIEREKKKRDERA
        +++ +   +KKR++R+
Subjt:  LKEYIEREKKKRDERA

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic; e value 9.5e-108; identity 70.99%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCT-AAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIV
        MAAV  VSFS ++Q +ER   V S RS+      FRFRS+       VR+S S SR V+ CT ++  D+ TVA+TK  FL  YKRPIP++YNTVLQELIV
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCT-AAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLMEGYPS+EDR AIF+AYI+AL EDPEQYR DAQKLEEWAR+Q A +LV+F+SKEGE+E+I KDIA+RAG+K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKG

Query:  SFSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEF
         F YSR FA+GLFRLLELANVT+P+ILEKLCAALNV+KKSVDRDLDVYRNLLSKLVQAKELLKEY+EREKKKR ER  +Q ANET+TKCLG++
Subjt:  SFSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEF

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic; e value 2.8e-99; identity 67.35%
Query:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDV-TTVAETKSNFLKMYKRPIPSIYNTVLQELIV
        MAA++ + F+ L + ++     PS  + A   G      SV  RR         SRSV++C A   DV  TVAETK NFLK YKRPI SIY+TVLQEL+V
Subjt:  MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDV-TTVAETKSNFLKMYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKG
        QQHLMRYK TY+YD VFALGFVTVYDQLMEGYPS+EDR+AIF+AYI ALNEDPEQYR DAQK+EEWARSQ   SLVEF+SK+GE+E+ILKDI+ERA  KG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKG

Query:  SFSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFS
        SFSYSRFFA+GLFRLLELAN TEP+IL+KLCAALN++K+SVDRDLDVYRN+LSKLVQAKELLKEY+EREKKKR+ER+ +  +NE +TK  G  +
Subjt:  SFSYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFS

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic; e value 1.7e-104; identity 71.38%
Query:  AVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGT-DVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQQ
        A++ +SF  L Q S++     S R LA      R   + F R S    S S S+S+I C +  T DV  V+ETKS FLK YKRPIPSIYNTVLQELIVQQ
Subjt:  AVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGT-DVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGSF
        HLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQT+ASLV+F+SKEG++E++LKDIA RAGSK  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGSF

Query:  SYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGE
        SYSRFFA+GLFRLLELA+ T+P++L+KLCA+LN++KKSVDRDLDVYRNLLSKLVQAKELLKEY+EREKKK+ ERA SQ ANETI+KCLG+
Subjt:  SYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGE

Arabidopsis top hits (e value, % identity, alignment)
AT2G20890.1 photosystem II reaction center PSB29 protein; e value 1.2e-105; identity 71.38%
Query:  AVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGT-DVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQQ
        A++ +SF  L Q S++     S R LA      R   + F R S    S S S+S+I C +  T DV  V+ETKS FLK YKRPIPSIYNTVLQELIVQQ
Subjt:  AVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGT-DVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGSF
        HLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQT+ASLV+F+SKEG++E++LKDIA RAGSK  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGSF

Query:  SYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGE
        SYSRFFA+GLFRLLELA+ T+P++L+KLCA+LN++KKSVDRDLDVYRNLLSKLVQAKELLKEY+EREKKK+ ERA SQ ANETI+KCLG+
Subjt:  SYSRFFAIGLFRLLELANVTEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGE


Sequences
CDS sequence
ATGGCGGCTGTTAATCCCGTTTCGTTCTCAACATTGAGTCAGTGTTCTGAGAGGTGGTTGCCGGTTCCGTCGCCTCGTTCACTCGCCTTCAATTTCGGCGGTTTCCGATT
TCGTTCGAGCGTTTTCTGTCGTCGTTCCGGAGTACGAACCTCGACTTCCAATTCTCGCTCGGTTATTCAATGCACGGCCGCCGGAACAGATGTGACGACTGTTGCCGAGA
CTAAATCAAACTTCCTCAAGATGTATAAACGGCCTATCCCTAGCATATACAACACTGTTCTGCAAGAGTTGATCGTGCAGCAGCACTTGATGAGGTATAAGAGGACATAC
CGTTATGATCCTGTTTTTGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGAGAGGCCATTTTCCAAGCATACATTAAGGC
ATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAGTTGGAAGAATGGGCTCGGTCTCAAACTGCAGCTTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAGG
TTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGAAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTCGCTATTGGGCTATTTCGACTTCTGGAATTGGCAAACGTT
ACTGAACCCAGTATCCTGGAAAAGCTTTGTGCCGCTTTGAATGTTGACAAAAAAAGCGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAGGC
CAAAGAACTCCTAAAGGAATACATCGAAAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGGCAGCTAATGAGACCATAACAAAATGCTTGGGAGAATTCAGCA
TGCAGGCTGGTTTGTGA
mRNA sequence
CTCTTTTCTTTTATTCTTTTCCCATAAAACATTTCTTCAATTTTCACATTCTGAACGATTTTTTCCAACTCTTCTTCAATATGAAATCCATTTTCTTTGTAAGTCTGTAA
GAATCGCAAGTTTCTTCTCAATTCCTCTGCAATGGCGGCTGTTAATCCCGTTTCGTTCTCAACATTGAGTCAGTGTTCTGAGAGGTGGTTGCCGGTTCCGTCGCCTCGTT
CACTCGCCTTCAATTTCGGCGGTTTCCGATTTCGTTCGAGCGTTTTCTGTCGTCGTTCCGGAGTACGAACCTCGACTTCCAATTCTCGCTCGGTTATTCAATGCACGGCC
GCCGGAACAGATGTGACGACTGTTGCCGAGACTAAATCAAACTTCCTCAAGATGTATAAACGGCCTATCCCTAGCATATACAACACTGTTCTGCAAGAGTTGATCGTGCA
GCAGCACTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTTGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATC
GAGAGGCCATTTTCCAAGCATACATTAAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAGTTGGAAGAATGGGCTCGGTCTCAAACTGCAGCTTCA
TTGGTTGAGTTTGCATCAAAAGAAGGAGAGGTTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGAAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTCGCTATTGG
GCTATTTCGACTTCTGGAATTGGCAAACGTTACTGAACCCAGTATCCTGGAAAAGCTTTGTGCCGCTTTGAATGTTGACAAAAAAAGCGTAGACCGAGACCTTGATGTAT
ACCGCAACCTGCTTTCAAAGTTGGTTCAGGCCAAAGAACTCCTAAAGGAATACATCGAAAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGGCAGCTAATGAG
ACCATAACAAAATGCTTGGGAGAATTCAGCATGCAGGCTGGTTTGTGAGAGCTGATAACATCAAGTGGAGCACTGCTTAATTGGATGTAGATTTGAGTTATAGCAATATT
CTGAATCATGGTAGATATGCATTTGTAATGTATTGTTGGGTTTCGAGCATTTGATGAATTTTGTATTCAGCCAGGCACTACCTTAATTCAAGATTTTTTTTTTAATAATT
TGTATTACATATTTTTAGTGCAATTGTCTATACAGATGCTTAGATATTTGATAATTTGCTTTTCAATATGCAATCATTTCTGGATTCTTAATCAGTTATAAGCAGCATAT
GCATCGAAG
Protein sequence
MAAVNPVSFSTLSQCSERWLPVPSPRSLAFNFGGFRFRSSVFCRRSGVRTSTSNSRSVIQCTAAGTDVTTVAETKSNFLKMYKRPIPSIYNTVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANV
TEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKRDERAGSQAANETITKCLGEFSMQAGL
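
The CDS and protein sequences above can be cross-checked by translating one into the other. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly, and uses a hypothetical translate helper. Only the first and last codons of the CDS are shown as input to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGGCTGTTAATCCC"))  # first six codons of the CDS
print(translate("GCTGGTTTGTGA"))        # last four codons, ending in TGA
```

The two fragments translate to MAAVNP and AGL, which match the start and end of the protein sequence listed above.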