CuGenDBv2

CmoCh11G019300 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh11G019300
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: protein THYLAKOID FORMATION1, chloroplastic-like
Genome location: Cmo_Chr11:13362845..13366235
RNA-Seq Expression: CmoCh11G019300
Synteny: CmoCh11G019300
Gene Ontology terms:
    GO:0010027 - thylakoid membrane organization (biological process)
    GO:0010207 - photosystem II assembly (biological process)
    GO:0045037 - protein import into chloroplast stroma (biological process)
    GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domains: IPR017499 - Protein Thf1
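The genome location is written in a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the span can be parsed and its length computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split 'chrom:start..end' into (chrom, start, end, length).

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


print(parse_location("Cmo_Chr11:13362845..13366235"))
# ('Cmo_Chr11', 13362845, 13366235, 3391)
```

Under that assumption the gene model spans 3391 bp of chromosome 11.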


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589193.1 Protein THYLAKOID FORMATION1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e value: 4.6e-149 | % identity: 98.61
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSPVPSAR+LASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMS GTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEF SKEGEVES+LKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] | e value: 1.5e-139 | % identity: 90.94
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNS+SFST++Q SDRR PVPS+RSL+SNFDGFRFR+S+F H+S VR S+FSSR+VIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYN+VLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ+AASLVEFAS+EGEVES+LKDIAERA SKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEA+TK
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

XP_022136235.1 protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia] | e value: 3.3e-139 | % identity: 91.99
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFS +SQ S+RR  VPSARSLASNFDGFRFR+SVF H+SGVRTSS+SSR+V+HCMSAGTDVTTVAETK NFLK YKRPIPSIYN+VLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQ+AASLVEFASKEGEVES+LKDIAERA  KGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNV+KKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEA+TK
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

XP_022930881.1 protein THYLAKOID FORMATION1, chloroplastic-like [Cucurbita moschata] | e value: 1.4e-150 | % identity: 100
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

XP_022988874.1 protein THYLAKOID FORMATION1, chloroplastic-like [Cucurbita maxima] | e value: 7.1e-150 | % identity: 98.95
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSP+PSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMS GTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVES+LKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic | e value: 7.2e-140 | % identity: 90.94
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNS+SFST++Q SDRR PVPS+RSL+SNFDGFRFR+S+F H+S VR S+FSSR+VIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYN+VLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ+AASLVEFAS+EGEVES+LKDIAERA SKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEA+TK
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

A0A5D3C7D3 Protein THYLAKOID FORMATION1 | e value: 7.2e-140 | % identity: 90.94
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNS+SFST++Q SDRR PVPS+RSL+SNFDGFRFR+S+F H+S VR S+FSSR+VIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYN+VLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ+AASLVEFAS+EGEVES+LKDIAERA SKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEA+TK
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

A0A6J1C3B8 protein THYLAKOID FORMATION1, chloroplastic isoform X1 | e value: 1.6e-139 | % identity: 91.99
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFS +SQ S+RR  VPSARSLASNFDGFRFR+SVF H+SGVRTSS+SSR+V+HCMSAGTDVTTVAETK NFLK YKRPIPSIYN+VLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQ+AASLVEFASKEGEVES+LKDIAERA  KGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNV+KKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEA+TK
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

A0A6J1ES50 protein THYLAKOID FORMATION1, chloroplastic-like | e value: 6.9e-151 | % identity: 100
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

A0A6J1JMT7 protein THYLAKOID FORMATION1, chloroplastic-like | e value: 3.4e-150 | % identity: 98.95
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSP+PSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMS GTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVES+LKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

SwissProt top hits (e value, % identity, alignment)
B0C3M8 Protein Thf1 | e value: 9.3e-36 | % identity: 38.36
Query:  DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWA
        ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ MDGY  + D++AIF A  KA   DP Q + D Q+L E A
Subjt:  DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWA

Query:  RSQSAASLVEF---ASKEG--EVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNVDKKSVDRDLDVYRNLLSKL
        +S+SA  ++++   A+  G  E++  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LN+ +  + +DL++YR  L K+
Subjt:  RSQSAASLVEF---ASKEG--EVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNVDKKSVDRDLDVYRNLLSKL

Query:  VQAKELLKEYVDREKKKRD
         Q ++ + + ++ +KK+R+
Subjt:  VQAKELLKEYVDREKKKRD

Q116P5 Protein Thf1 | e value: 1.6e-32 | % identity: 37.04
Query:  TVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M GY   ED+ +IF A I+   EDP +YR DA+ LE+ A   
Subjt:  TVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ

Query:  SAASLVEF--ASKEGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKEL
        SA+ ++ +   SK  +    L+D     +    F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ + +D+D+Y + L ++ QA+  
Subjt:  SAASLVEF--ASKEGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKEL

Query:  LKEYVDREKKKRDERA
        +++ +   +KKR++R+
Subjt:  LKEYVDREKKKRDERA

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic | e value: 2.9e-106 | % identity: 70.83
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIV
        MAAV SVSFS ++Q ++R+S V S+RS+    D FRFRS+  +    VR+S+ +SR V+HC  S+  D+ TVA+TKL FL AYKRPIP++YN+VLQELIV
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLM+GYPS+EDR AIF+AYI+AL EDPEQYR DAQKLEEWAR+Q+A +LV+F+SKEGE+E++ KDIA+RA +K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKG

Query:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
         F YSR FA+GLFRLLELAN T+P+ILEKLCAALNV+KKSVDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER  +Q ANE VTK
Subjt:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic | e value: 8.5e-98 | % identity: 67.36
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNSVLQELIV
        MAA++S+ F+ + + +D R   PS  + A+         SV             SR V+ C++   DV  TVAETK+NFLK+YKRPI SIY++VLQEL+V
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNSVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKG
        QQHLMRYK TY+YD VFALGFVTVYDQLM+GYPS+EDR+AIF+AYI ALNEDPEQYR DAQK+EEWARSQ+  SLVEF+SK+GE+E++LKDI+ERA  KG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKG

Query:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        SFSYSRFFA+GLFRLLELANATEP+IL+KLCAALN++K+SVDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ER+ +  +NEAVTK
Subjt:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic | e value: 1.3e-101 | % identity: 70.28
Query:  AVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQ
        A++S+SF  + Q SD+ S   S+R LAS       R    +    + + S +S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYN+VLQELIVQQ
Subjt:  AVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGSF
        HLMRYK+TYRYDPVFALGFVTVYDQLM+GYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQ++ASLV+F+SKEG++E+VLKDIA RA SK  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGSF

Query:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        SYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN++KKSVDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE ++K
Subjt:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK

Arabidopsis top hits (e value, % identity, alignment)
AT2G20890.1 photosystem II reaction center PSB29 protein | e value: 9.1e-103 | % identity: 70.28
Query:  AVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQ
        A++S+SF  + Q SD+ S   S+R LAS       R    +    + + S +S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYN+VLQELIVQQ
Subjt:  AVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGSF
        HLMRYK+TYRYDPVFALGFVTVYDQLM+GYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQ++ASLV+F+SKEG++E+VLKDIA RA SK  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGSF

Query:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
        SYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN++KKSVDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE ++K
Subjt:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
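The % identity values in the hit tables can be related to the middle row of each alignment block. As a rough sketch, assuming the usual BLAST-style convention in which a letter in the middle row marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap, identity is the share of letters among the aligned positions. The function name `percent_identity` and the toy strings are illustrative only, not taken from the page's own tooling.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Return the percentage of aligned positions that are identical.

    Letters in the midline count as identities; '+' and spaces do not.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Toy example: one conservative substitution in ten aligned positions.
query = "MAAVNSVSFS"
midline = "MAAVNS+SFS"
print(percent_identity(midline, len(query)))  # 90.0
```

For the first GenBank hit, the middle rows as displayed appear to contain four non-identical positions across 287 aligned residues, and 283/287 rounds to the reported 98.61.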


Sequences
CDS sequence
ATGGCGGCTGTTAATTCCGTGTCATTCTCAACGGTAAGTCAGTTTTCTGATCGAAGGTCGCCGGTTCCGTCGGCTCGTTCACTCGCCTCGAATTTCGACGGGTTTCGTTT
CCGTTCTAGTGTTTTCTATCATCATTCGGGAGTTCGAACCTCGAGTTTCAGTTCTCGCTTGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACGACTGTAGCTGAGA
CTAAATTGAACTTTCTAAAGGCGTATAAACGACCTATCCCCAGCATATACAACTCTGTTCTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACGTAC
CGTTATGATCCTGTTTTCGCCCTCGGTTTTGTTACTGTATATGATCAGCTTATGGATGGGTACCCTAGCGATGAGGATCGGGAGGCCATTTTCCAAGCCTACATTAAGGC
GTTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGTCTGCAGCTTCATTGGTTGAATTTGCATCAAAAGAAGGAGAAG
TTGAGAGTGTTTTGAAAGACATTGCAGAACGAGCTGCGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGACTCCTCGAATTGGCAAATGCT
ACTGAACCCAGTATCTTGGAAAAGCTCTGTGCCGCTTTAAATGTTGACAAAAAAAGTGTGGACCGAGACCTTGATGTATACCGCAACCTGCTTTCGAAGTTGGTTCAGGC
GAAGGAGCTCCTAAAGGAATACGTTGATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAACGAGGCCGTAACAAAATGA
mRNA sequence
ATGGCGGCTGTTAATTCCGTGTCATTCTCAACGGTAAGTCAGTTTTCTGATCGAAGGTCGCCGGTTCCGTCGGCTCGTTCACTCGCCTCGAATTTCGACGGGTTTCGTTT
CCGTTCTAGTGTTTTCTATCATCATTCGGGAGTTCGAACCTCGAGTTTCAGTTCTCGCTTGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACGACTGTAGCTGAGA
CTAAATTGAACTTTCTAAAGGCGTATAAACGACCTATCCCCAGCATATACAACTCTGTTCTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACGTAC
CGTTATGATCCTGTTTTCGCCCTCGGTTTTGTTACTGTATATGATCAGCTTATGGATGGGTACCCTAGCGATGAGGATCGGGAGGCCATTTTCCAAGCCTACATTAAGGC
GTTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGTCTGCAGCTTCATTGGTTGAATTTGCATCAAAAGAAGGAGAAG
TTGAGAGTGTTTTGAAAGACATTGCAGAACGAGCTGCGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGACTCCTCGAATTGGCAAATGCT
ACTGAACCCAGTATCTTGGAAAAGCTCTGTGCCGCTTTAAATGTTGACAAAAAAAGTGTGGACCGAGACCTTGATGTATACCGCAACCTGCTTTCGAAGTTGGTTCAGGC
GAAGGAGCTCCTAAAGGAATACGTTGATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAACGAGGCCGTAACAAAATGAATGATTGGGTGAATAC
AGCATAGAGAGTGGTTTTTGAGAGCTGATGACATCAATTGGAGCACTCACCGCCTAATTTGAGATAAGACTTTAAAGAGTTATAGCAATATTCTAAATACCTGATAGATA
TGCATTTGTACTGTATTGTTGGGTCTTCTGCATTTGGTAAATTTTGTATTCTGCCAGCTTCTACTTTTATTCTCATTTGTATTACATATTTTGAGTGCAATTGTCTATAC
AAATGCTTTCGCACTCTGTGCAGTAATCATATTTGTCTCTCTCTTGACCACCA
Protein sequence
MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANA
TEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
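The protein sequence should correspond to a translation of the CDS. A minimal sketch of that check, assuming the standard nuclear genetic code (an assumption; the page does not name a translation table), is shown below. The helper `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first; a trailing partial
    codon is ignored.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGCGGCTGTTAATTCCGTGTCATTCTCA"  # first 30 nt of the CDS
print(translate(cds_start))  # MAAVNSVSFS
```

Passing the full CDS with its line breaks to `translate` and comparing the result against the protein sequence would extend the same check to the whole record.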