CuGenDBv2

Clc02G01460 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G01460
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: protein THYLAKOID FORMATION1, chloroplastic
Genome location: ClcChr02:1426390..1429732
RNA-Seq Expression: Clc02G01460
Synteny: Clc02G01460
Gene Ontology terms:
  GO:0010027 - thylakoid membrane organization (biological process)
  GO:0010207 - photosystem II assembly (biological process)
  GO:0045037 - protein import into chloroplast stroma (biological process)
  GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domains: IPR017499 - Protein Thf1
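
The genome location string follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span of the gene on the chromosome could be derived as follows. The helper name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

# Hypothetical helper; assumes "chromosome:start..end" with 1-based,
# inclusive coordinates (the convention is not stated on this page).
def parse_location(location):
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chromosome = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start")
    # Inclusive span: both endpoints count as part of the gene.
    return chromosome, start, end, end - start + 1

chromosome, start, end, span = parse_location("ClcChr02:1426390..1429732")
print(chromosome, start, end, span)  # ClcChr02 1426390 1429732 3343
```

Under a 0-based, half-open convention the span would be one base shorter, so the coordinate assumption should be confirmed before the number is used.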


Homology
GenBank top hits (hit; e value; % identity; alignment)
KAE8645760.1 hypothetical protein Csa_020345 [Cucumis sativus]; e value: 2.5e-150; % identity: 94.61
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRRL +PS RS +SNF  F FRTSVF+HYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKK EEWARSQTA SLVEFASREGEVES LKDIAERAG+KGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTG

XP_004136805.1 protein THYLAKOID FORMATION1, chloroplastic [Cucumis sativus]; e value: 8.7e-151; % identity: 94.63
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRRL +PS RS +SNF  F FRTSVF+HYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKK EEWARSQTA SLVEFASREGEVES LKDIAERAG+KGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo]; e value: 3.2e-153; % identity: 94.97
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRR PVPS RSL+SNFD FRFRTS+F+HYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAG+KGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

XP_022969189.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima]; e value: 1.7e-146; % identity: 90.6
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+S S L+QCSDRRLP+PS RSLAS+FD FRFR SVF HYS VR S+FSSRMVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDRDAIFQAYI ALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAG+KGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

XP_038888611.1 protein THYLAKOID FORMATION1, chloroplastic [Benincasa hispida]; e value: 2.9e-154; % identity: 96.31
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAV SIS STLNQCSDRRLPVPS RSLASNFD FRFRTSVFSHYSRVRASTFSS MVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLV+FASREGEVESTLKDIAERAG+KGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCA+LNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

TrEMBL top hits (hit; e value; % identity; alignment)
A0A0A0K3P0 Uncharacterized protein; e value: 4.2e-151; % identity: 94.63
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRRL +PS RS +SNF  F FRTSVF+HYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKK EEWARSQTA SLVEFASREGEVES LKDIAERAG+KGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic; e value: 1.5e-153; % identity: 94.97
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRR PVPS RSL+SNFD FRFRTS+F+HYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAG+KGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

A0A5D3C7D3 Protein THYLAKOID FORMATION1; e value: 1.5e-153; % identity: 94.97
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRR PVPS RSL+SNFD FRFRTS+F+HYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAG+KGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

A0A6J1C3B8 protein THYLAKOID FORMATION1, chloroplastic isoform X1; e value: 1.4e-146; % identity: 91.25
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+S S L+QCS+RRL VPS RSLASNFD FRFRTSVF HYS VR S++SSRMV+HCMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKKLEEWARSQTA SLVEFAS+EGEVES LKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTG

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2; e value: 8.2e-147; % identity: 90.6
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+S S L+QCSDRRLP+PS RSLAS+FD FRFR SVF HYS VR S+FSSRMVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDRDAIFQAYI ALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAG+KGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDER+GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

SwissProt top hits (hit; e value; % identity; alignment)
B0C3M8 Protein Thf1; e value: 7.4e-36; % identity: 37.9
Query:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWA
        ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ M+GY  + D+DAIF A  KA   DP Q + D ++L E A
Subjt:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWA

Query:  RSQTATSLVEF---ASREG--EVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKL
        +S++A  ++++   A+  G  E++  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LNI +  + +DL++YR  L K+
Subjt:  RSQTATSLVEF---ASREG--EVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKL

Query:  VQAKELLKEYVDREKKKRD
         Q ++ + + ++ +KK+R+
Subjt:  VQAKELLKEYVDREKKKRD

Q116P5 Protein Thf1; e value: 2.4e-34; % identity: 37.04
Query:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+GY   ED+ +IF A I+   EDP +YR DAK LE+ A   
Subjt:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQ

Query:  TATSLVEFASREGEVEST--LKDIAERAGNKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKEL
        +A+ ++ +      +++T  L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ + +D+D+Y + L ++ QA+  
Subjt:  TATSLVEFASREGEVEST--LKDIAERAGNKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKEL

Query:  LKEYVDREKKKRDERS
        +++ +   +KKR++RS
Subjt:  LKEYVDREKKKRDERS

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic; e value: 5.5e-108; % identity: 68.94
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAAV S+S S + Q ++R+  V S RS+    D FRFR++       VR+S  +SR V+HC  S+  D+ TVA+TKL FL AYKRPIP++YNTVLQELIV
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLMEGYPS+EDR+AIF+AYI+AL EDPEQYR DA+KLEEWAR+Q A +LV+F+S+EGE+E+  KDIA+RAG K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEY
         F YSR FA+GLFRLLELAN T+P+ILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER  +Q ANE +TKCLG+Y
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEY

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic; e value: 1.2e-99; % identity: 68.4
Query:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAA++S+  + L + +D R   PS  + A+   A     SV     R R     SR V+ C++   DV  TVAETK+NFLK+YKRPI SIY+TVLQEL+V
Subjt:  MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKG
        QQHLMRYK TY+YD VFALGFVTVYDQLMEGYPS+EDRDAIF+AYI ALNEDPEQYR DA+K+EEWARSQ   SLVEF+S++GE+E+ LKDI+ERA  KG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITK
        +FSYSRFFA+GLFRLLELANATEP+IL+KLCAALNI+K+ VDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ERS +  +NEA+TK
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITK

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic; e value: 2.0e-105; % identity: 72.07
Query:  AVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ
        A++S+S   L Q SD+     S R LAS   A R  T  FS  S    ST  S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQQ
Subjt:  AVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGNF
        HLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DRDAIF+AYI+ALNEDP+QYRIDA+K+EEWARSQT+ SLV+F+S+EG++E+ LKDIA RAG+K  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGNF

Query:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGE
        SYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ER+ SQ ANE I+KCLG+
Subjt:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGE

Arabidopsis top hits (hit; e value; % identity; alignment)
AT2G20890.1 photosystem II reaction center PSB29 protein; e value: 1.4e-106; % identity: 72.07
Query:  AVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ
        A++S+S   L Q SD+     S R LAS   A R  T  FS  S    ST  S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQQ
Subjt:  AVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGNF
        HLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DRDAIF+AYI+ALNEDP+QYRIDA+K+EEWARSQT+ SLV+F+S+EG++E+ LKDIA RAG+K  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGNF

Query:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGE
        SYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ER+ SQ ANE I+KCLG+
Subjt:  SYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGE


Sequences
CDS sequence
ATGGCGGCTGTTAATTCCATTTCAATCTCAACATTAAACCAATGTTCTGATAGAAGATTGCCGGTTCCGTCCCCTCGTTCACTCGCCTCCAATTTCGACGCCTTTCGTTT
TCGTACGAGCGTTTTCAGTCATTATTCCCGAGTAAGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCTGCCGGAACAGATGTGACCACTGTAGCCGAGA
CAAAATTGAACTTCCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAACACTGTTCTGCAAGAATTGATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATAC
CGTTATGACCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGATGCCATCTTCCAAGCATACATTAAGGC
ATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCCACTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAG
TTGAGAGTACGTTGAAGGACATTGCAGAACGGGCAGGGAATAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTTTTCCGACTCCTTGAATTGGCAAATGCT
ACTGAACCCAGTATCTTGGAAAAGCTCTGTGCTGCTTTAAACATTGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGC
GAAGGAGCTCCTAAAGGAATACGTCGATAGAGAGAAGAAGAAAAGAGATGAGAGGAGTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCA
TGCAGACTGGTTTGTAA
mRNA sequence
AATATTTCTGAAAGCCCAATCCATATTGGACCTGGCCCATACCAACTATGCCACGTGTCGATCTTTTACTGGCCTCTTTGGGAAGGTGTAGAAATTAACACTTGGATAAG
CCTCTCTATCATGGCCTCTCAGTCTCACCCTCTCTTCTTTCCATTTTTCCCAGAAAATTTCTTCCAATTTCATTCTCTATACCTCTTTTCTCCCTCTCTTCTTCGATATG
AAACCCATTTTCTCCAAAAATTCATAAGCTTCCTACATTTTCCCTCATTCTTCTTTAATGGCGGCTGTTAATTCCATTTCAATCTCAACATTAAACCAATGTTCTGATAG
AAGATTGCCGGTTCCGTCCCCTCGTTCACTCGCCTCCAATTTCGACGCCTTTCGTTTTCGTACGAGCGTTTTCAGTCATTATTCCCGAGTAAGAGCATCCACCTTCAGTT
CTCGCATGGTCATTCATTGCATGTCTGCCGGAACAGATGTGACCACTGTAGCCGAGACAAAATTGAACTTCCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAAC
ACTGTTCTGCAAGAATTGATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATACCGTTATGACCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTAT
GGAAGGGTACCCTAGCGATGAGGATCGTGATGCCATCTTCCAAGCATACATTAAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAAGAGT
GGGCTCGGTCTCAGACTGCCACTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTACGTTGAAGGACATTGCAGAACGGGCAGGGAATAAGGGGAATTTC
AGTTACAGCCGATTTTTTGCTATTGGACTTTTCCGACTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGCTCTGTGCTGCTTTAAACATTGACAAAAA
AGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGCGAAGGAGCTCCTAAAGGAATACGTCGATAGAGAGAAGAAGAAAAGAGATGAGA
GGAGTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCATGCAGACTGGTTTGTAAAGAAAAGATGATTGGAGCACTTCCTAATTTGAGATA
GACTTGAGAGTTTATAGCAATATTCTAAATTACCCTATAGATATGCATCTGTAATGAATTGTTGGGTCTCATGCATTTGGTAAATTTTGTATTCAGCCAGGCACTACCTT
TATTTCTCATTTGATATAACCACCACTCAAGCTATTTTTTAATCATTTGTATTACATATTTTAGAGCGCAATTGTTAATACAAATGCTATCGCACTCTGTGCAGTAATCA
CTCTCCAAGTGATATTTGACCATCTGGCTTTTGTATTGTGCAATCATTTCTGGATTATTATGATCTTTCACCCTTTGCTTTTGTAGGGTCTAAATTCATTCAGTTACAAA
CATTTAGGATTGTTTTTAAATACAAATTGGCTAGGGGATTTGAACTACGAACCTCTTGATTGCACCTATGTTAGTTGAGTTGAGCTTTTTTTTGGCACATTTAGGATTTT
TATTGATGTTTTAAGTGATACGACCTCCAAAGTTTAGATGTATACATTTAATTTTTTTCCCCTTTTTAACCCATTGTTGAAGACATGAGCTTTCTTGAACACATGATTTG
TTTTATGGGTGTTATTTAGATTGTGTAGAGTTGTGAAGCTCAATGACAAAATTTAATTAAATTAGTTAACT
Protein sequence
MAAVNSISISTLNQCSDRRLPVPSPRSLASNFDAFRFRTSVFSHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGNKGNFSYSRFFAIGLFRLLELANA
TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
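
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The function name `translate` and the short test fragments are illustrative choices; the fragments are the first 30 nucleotides and the last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# T, C, A, G at each position and the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)

# Start of the CDS: matches the first ten residues of the protein.
print(translate("ATGGCGGCTGTTAATTCCATTTCAATCTCA"))  # MAAVNSISIS
# End of the CDS, including the TAA stop codon.
print(translate("ACTGGTTTGTAA"))  # TGL
```

Passing the full CDS (all lines joined) to `translate` and comparing the result with the joined protein sequence would extend the same check to the whole gene.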