; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025114 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025114
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein THYLAKOID FORMATION1, chloroplastic
Genome locationtig00003412:1460492..1463120
RNA-Seq ExpressionSgr025114
SyntenySgr025114
Gene Ontology termsGO:0010027 - thylakoid membrane organization (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domainsIPR017499 - Protein Thf1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo]7.5e-12581.61Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFSTL+QCS+RR  VPS+RSL+SNF+GFRFRTS+F HYS VR S+ SS MVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEG                          IDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG
        F+YSRFFA+GLFRLLELANATEPSILEKLCAALN++K+ VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+  GSQTANEAITKCLGEYSMQTG
Subjt:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG

XP_022136235.1 protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia]2.2e-13286.33Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQCSERRLLVPSARSLASNF+GFRFRTSVFCHYSGVRTSS SS MV+HCMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEG                          IDA+KLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
Subjt:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTGF
        F+YSRFFA+GLFRLLELANATEPSILEKLCAALNVNK+SVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+  GSQTANEAITKCLGEYSMQTGF
Subjt:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTGF

XP_022952157.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita moschata]1.7e-12481.61Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQCS+RRL +PSARSLAS+F+GFRFR SVFCHYSGVRT S +S MVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYKKTYRYDPVFALGFVTVYD+LMEG                          IDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG
        F+YSRFFA+GLFRLLELANA+EPSILEKLCAALNV+K+ VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+  GSQTA+EAITKCLGEYSMQTG
Subjt:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG

XP_022969189.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima]3.4e-12582.27Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQCS+RRL +PSARSLAS+F+GFRFR SVFCHYSGVRTSS SS MVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYKKTYRYDPVFALGFVTVYD+LMEG                          IDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG
        F+YSRFFA+GLFRLLELANA+EPSILEKLCAALNV+K+ VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+  GSQTA+EAITKCLGEYSMQTG
Subjt:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG

XP_023554556.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo]8.9e-12682.27Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSF+ LSQCS+RRL +PSARSLAS+F+GFRFR SVFCHYSGVRTSS SS MVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYKKTYRYDPVFALGFVTVYD+LMEG                          IDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAGGKG+
Subjt:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG
        F+YSRFFA+GLFRLLELANA+EPSILEKLCAALNV+K+ VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+  GSQTA+EAITKCLGEYSMQTG
Subjt:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG

TrEMBL top hitse value%identityAlignment
A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic3.6e-12581.61Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFSTL+QCS+RR  VPS+RSL+SNF+GFRFRTS+F HYS VR S+ SS MVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEG                          IDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG
        F+YSRFFA+GLFRLLELANATEPSILEKLCAALN++K+ VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+  GSQTANEAITKCLGEYSMQTG
Subjt:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG

A0A5D3C7D3 Protein THYLAKOID FORMATION13.6e-12581.61Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFSTL+QCS+RR  VPS+RSL+SNF+GFRFRTS+F HYS VR S+ SS MVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEG                          IDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG
        F+YSRFFA+GLFRLLELANATEPSILEKLCAALN++K+ VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+  GSQTANEAITKCLGEYSMQTG
Subjt:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG

A0A6J1C3B8 protein THYLAKOID FORMATION1, chloroplastic isoform X11.1e-13286.33Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQCSERRLLVPSARSLASNF+GFRFRTSVFCHYSGVRTSS SS MV+HCMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEG                          IDA+KLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
Subjt:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTGF
        F+YSRFFA+GLFRLLELANATEPSILEKLCAALNVNK+SVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+  GSQTANEAITKCLGEYSMQTGF
Subjt:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTGF

A0A6J1GJN0 protein THYLAKOID FORMATION1, chloroplastic-like isoform X28.1e-12581.61Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQCS+RRL +PSARSLAS+F+GFRFR SVFCHYSGVRT S +S MVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYKKTYRYDPVFALGFVTVYD+LMEG                          IDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG
        F+YSRFFA+GLFRLLELANA+EPSILEKLCAALNV+K+ VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+  GSQTA+EAITKCLGEYSMQTG
Subjt:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X21.6e-12582.27Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQCS+RRL +PSARSLAS+F+GFRFR SVFCHYSGVRTSS SS MVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYKKTYRYDPVFALGFVTVYD+LMEG                          IDAQKLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG
        F+YSRFFA+GLFRLLELANA+EPSILEKLCAALNV+K+ VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+  GSQTA+EAITKCLGEYSMQTG
Subjt:  FNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTG

SwissProt top hitse value%identityAlignment
B0C3M8 Protein Thf11.1e-2531.09Show/hide
Query:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVTVYDQLMEGI--------------------------DAQKLEEWA
        ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R  + +RYDP+FALG  T +D+ M+G                           D Q+L E A
Subjt:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVTVYDQLMEGI--------------------------DAQKLEEWA

Query:  RSQTAASLVEF---ASKEG--EVESILKDIAERAGGKGSFNYSRFFAVGLFRLLELA--NATE-----PSILEKLCAALNVNKRSVDRDLDVYRNLLSKL
        +S++A  ++++   A+  G  E++  L++IA+       F YSR FA+GLF LLEL+  N T+        L  +C  LN+++  + +DL++YR  L K+
Subjt:  RSQTAASLVEF---ASKEG--EVESILKDIAERAGGKGSFNYSRFFAVGLFRLLELA--NATE-----PSILEKLCAALNVNKRSVDRDLDVYRNLLSKL

Query:  VQAKELLKEYVDREKKKRD----ERSGTGSQTANEAIT
         Q ++ + + ++ +KK+R+    ++ G+      EA T
Subjt:  VQAKELLKEYVDREKKKRD----ERSGTGSQTANEAIT

Q116P5 Protein Thf17.3e-2231.02Show/hide
Query:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVTVYDQLMEGI--------------------------DAQKLEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+G                           DA+ LE+ A   
Subjt:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVTVYDQLMEGI--------------------------DAQKLEEWARSQ

Query:  TAASLVEF--ASKEGEVESILKDIAERAGGKGSFNYSRFFAVGLFRLLELANA-------TEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKEL
        +A+ ++ +   SK  +    L+D          F YSR FA+GLF LLE+ +             L+K+C +LN+ +  + +D+D+Y + L ++ QA+  
Subjt:  TAASLVEF--ASKEGEVESILKDIAERAGGKGSFNYSRFFAVGLFRLLELANA-------TEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKEL

Query:  LKEYVDREKKKRDERS
        +++ +   +KKR++RS
Subjt:  LKEYVDREKKKRDERS

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic7.9e-9364.41Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAAV SVSFS ++Q +ER+  V S+RS+    + FRFR++       VR+S+S+S  V+HC  S+  D+ TVA+TKL FL AYKRPIP++YNTVLQELIV
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKG
        QQHL RYKK+Y+YDPVFALGFVTVYDQLMEG                           DAQKLEEWAR+Q A +LV+F+SKEGE+E+I KDIA+RAG K 
Subjt:  QQHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKG

Query:  SFNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEY
         F YSR FAVGLFRLLELAN T+P+ILEKLCAALNVNK+SVDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER    +Q ANE +TKCLG+Y
Subjt:  SFNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEY

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic2.8e-8261.38Show/hide
Query:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAA++S+ F+ L + ++ R   PS  + A+         SV             S  V+ C++   DV  TVAETK+NFLK+YKRPI SIY+TVLQEL+V
Subjt:  MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKG
        QQHLMRYK TY+YD VFALGFVTVYDQLMEG                           DAQK+EEWARSQ   SLVEF+SK+GE+E+ILKDI+ERA GKG
Subjt:  QQHLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKG

Query:  SFNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITK
        SF+YSRFFAVGLFRLLELANATEP+IL+KLCAALN+NKRSVDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ERS T    +NEA+TK
Subjt:  SFNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITK

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic8.5e-8764.73Show/hide
Query:  AVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ
        A++S+SF  L Q S++     S+R LAS     R  T  F   S    S S+S  +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQQ
Subjt:  AVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ

Query:  HLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGSF
        HLMRYKKTYRYDPVFALGFVTVYDQLMEG                          IDAQK+EEWARSQT+ASLV+F+SKEG++E++LKDIA RAG K  F
Subjt:  HLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGSF

Query:  NYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGE
        +YSRFFAVGLFRLLELA+AT+P++L+KLCA+LN+NK+SVDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ER+   SQ ANE I+KCLG+
Subjt:  NYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGE

Arabidopsis top hitse value%identityAlignment
AT2G20890.1 photosystem II reaction center PSB29 protein6.0e-8864.73Show/hide
Query:  AVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ
        A++S+SF  L Q S++     S+R LAS     R  T  F   S    S S+S  +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQQ
Subjt:  AVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ

Query:  HLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGSF
        HLMRYKKTYRYDPVFALGFVTVYDQLMEG                          IDAQK+EEWARSQT+ASLV+F+SKEG++E++LKDIA RAG K  F
Subjt:  HLMRYKKTYRYDPVFALGFVTVYDQLMEG--------------------------IDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGSF

Query:  NYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGE
        +YSRFFAVGLFRLLELA+AT+P++L+KLCA+LN+NK+SVDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ER+   SQ ANE I+KCLG+
Subjt:  NYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTGTTAATTCCGTGTCATTCTCAACATTAAGTCAATGTTCTGAAAGAAGATTGTTGGTTCCGTCGGCTCGTTCACTAGCCTCGAATTTCAACGGGTTTCGTTT
TCGTACGAGCGTTTTCTGTCATTATTCGGGAGTTCGGACATCGAGTTCCAGTTCTCACATGGTTATTCATTGCATGTCAGCCGGAACAGATGTGACCACTGTAGCTGAGA
CAAAGTTGAACTTCCTCAAGGCGTATAAGCGGCCCATCCCTAGCATTTACAACACTGTTCTGCAAGAGTTGATTGTCCAGCAGCATTTGATGAGGTATAAGAAGACATAC
CGTTATGACCCTGTTTTCGCCCTCGGTTTTGTTACTGTATATGATCAACTTATGGAAGGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCAGCTTC
ATTGGTTGAGTTTGCATCAAAAGAAGGAGAAGTTGAGAGTATTTTGAAGGACATCGCAGAACGGGCAGGGGGTAAGGGGAGTTTCAATTACAGCCGATTTTTTGCTGTTG
GGCTCTTTCGCCTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGCTCTGTGCAGCTTTGAATGTTAACAAAAGAAGTGTGGATCGAGACCTTGATGTA
TACCGCAACCTGCTTTCGAAGTTGGTTCAGGCAAAAGAGCTCCTAAAGGAATACGTTGATAGAGAGAAGAAGAAAAGAGATGAGAGGTCTGGAACTGGATCACAGACAGC
TAATGAGGCCATAACAAAATGCTTGGGAGAATACAGCATGCAGACTGGTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCTGTTAATTCCGTGTCATTCTCAACATTAAGTCAATGTTCTGAAAGAAGATTGTTGGTTCCGTCGGCTCGTTCACTAGCCTCGAATTTCAACGGGTTTCGTTT
TCGTACGAGCGTTTTCTGTCATTATTCGGGAGTTCGGACATCGAGTTCCAGTTCTCACATGGTTATTCATTGCATGTCAGCCGGAACAGATGTGACCACTGTAGCTGAGA
CAAAGTTGAACTTCCTCAAGGCGTATAAGCGGCCCATCCCTAGCATTTACAACACTGTTCTGCAAGAGTTGATTGTCCAGCAGCATTTGATGAGGTATAAGAAGACATAC
CGTTATGACCCTGTTTTCGCCCTCGGTTTTGTTACTGTATATGATCAACTTATGGAAGGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCAGCTTC
ATTGGTTGAGTTTGCATCAAAAGAAGGAGAAGTTGAGAGTATTTTGAAGGACATCGCAGAACGGGCAGGGGGTAAGGGGAGTTTCAATTACAGCCGATTTTTTGCTGTTG
GGCTCTTTCGCCTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGCTCTGTGCAGCTTTGAATGTTAACAAAAGAAGTGTGGATCGAGACCTTGATGTA
TACCGCAACCTGCTTTCGAAGTTGGTTCAGGCAAAAGAGCTCCTAAAGGAATACGTTGATAGAGAGAAGAAGAAAAGAGATGAGAGGTCTGGAACTGGATCACAGACAGC
TAATGAGGCCATAACAAAATGCTTGGGAGAATACAGCATGCAGACTGGTTTTTGA
Protein sequenceShow/hide protein sequence
MAAVNSVSFSTLSQCSERRLLVPSARSLASNFNGFRFRTSVFCHYSGVRTSSSSSHMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTY
RYDPVFALGFVTVYDQLMEGIDAQKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGSFNYSRFFAVGLFRLLELANATEPSILEKLCAALNVNKRSVDRDLDV
YRNLLSKLVQAKELLKEYVDREKKKRDERSGTGSQTANEAITKCLGEYSMQTGF