; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G018630 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G018630
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionprotein THYLAKOID FORMATION1, chloroplastic
Genome locationCicolChr02:1451735..1454719
RNA-Seq ExpressionCcUC02G018630
SyntenyCcUC02G018630
Gene Ontology termsGO:0010027 - thylakoid membrane organization (biological process)
GO:0010182 - sugar mediated signaling pathway (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0015996 - chlorophyll catabolic process (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
GO:0050829 - defense response to Gram-negative bacterium (biological process)
GO:1902458 - positive regulation of stomatal opening (biological process)
GO:1903426 - regulation of reactive oxygen species biosynthetic process (biological process)
GO:2000070 - regulation of response to water deprivation (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009527 - plastid outer membrane (cellular component)
GO:0009528 - plastid inner membrane (cellular component)
GO:0009532 - plastid stroma (cellular component)
GO:0010319 - stromule (cellular component)
InterPro domainsIPR017499 - Protein Thf1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645760.1 hypothetical protein Csa_020345 [Cucumis sativus]2.5e-15094.28Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRRL +PS+RS +SNF GF FRTS F+HYSRVRASTFSSRMVIHCM AGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKK EEWARSQTA SLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTG

XP_004136805.1 protein THYLAKOID FORMATION1, chloroplastic [Cucumis sativus]8.7e-15194.3Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRRL +PS+RS +SNF GF FRTS F+HYSRVRASTFSSRMVIHCM AGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKK EEWARSQTA SLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo]6.4e-15495.64Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRR PVPS+RSL+SNFDGFRFRTS F+HYSRVR STFSSRMVIHCM AGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

XP_022969189.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima]2.6e-14790.94Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+S S L+QCSDRRLP+PSARSLAS+FDGFRFR S F HYS VR S+FSSRMVIHCM +GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDRDAIFQAYI ALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

XP_038888611.1 protein THYLAKOID FORMATION1, chloroplastic [Benincasa hispida]1.3e-15496.31Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAV SIS STLNQCSDRRLPVPSARSLASNFDGFRFRTS FSHYSRVRASTFSS MVIHCM AGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLV+FASREGEVESTLKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCA+LNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

TrEMBL top hitse value%identityAlignment
A0A0A0K3P0 Uncharacterized protein4.2e-15194.3Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRRL +PS+RS +SNF GF FRTS F+HYSRVRASTFSSRMVIHCM AGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAKK EEWARSQTA SLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic3.1e-15495.64Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRR PVPS+RSL+SNFDGFRFRTS F+HYSRVR STFSSRMVIHCM AGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

A0A5D3C7D3 Protein THYLAKOID FORMATION13.1e-15495.64Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSIS STLNQCSDRR PVPS+RSL+SNFDGFRFRTS F+HYSRVR STFSSRMVIHCM AGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDER+GSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

A0A6J1GJN0 protein THYLAKOID FORMATION1, chloroplastic-like isoform X26.3e-14790.27Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+S S L+QCSDRRLP+PSARSLAS+FDGFRFR S F HYS VR  +F+SRMVIHCM +GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDRDAIFQAYI ALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X21.3e-14790.94Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+S S L+QCSDRRLP+PSARSLAS+FDGFRFR S F HYS VR S+FSSRMVIHCM +GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDRDAIFQAYI ALNEDPEQYRIDA+KLEEWARSQTA SLVEFASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDER+GSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL

SwissProt top hitse value%identityAlignment
B0C3M8 Protein Thf15.7e-3637.9Show/hide
Query:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWA
        ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ M+GY  + D+DAIF A  KA   DP Q + D ++L E A
Subjt:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWA

Query:  RSQTATSLVEF---ASREG--EVESTLKDIAERAGSKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKL
        +S++A  ++++   A+  G  E++  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LNI +  + +DL++YR  L K+
Subjt:  RSQTATSLVEF---ASREG--EVESTLKDIAERAGSKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKL

Query:  VQAKELLKEYIDREKKKRD
         Q ++ + + ++ +KK+R+
Subjt:  VQAKELLKEYIDREKKKRD

Q116P5 Protein Thf11.8e-3437.04Show/hide
Query:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+GY   ED+ +IF A I+   EDP +YR DAK LE+ A   
Subjt:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQ

Query:  TATSLVEFASREGEVEST--LKDIAERAGSKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKEL
        +A+ ++ +      +++T  L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ + +D+D+Y + L ++ QA+  
Subjt:  TATSLVEFASREGEVEST--LKDIAERAGSKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKEL

Query:  LKEYIDREKKKRDERS
        +++ +   +KKR++RS
Subjt:  LKEYIDREKKKRDERS

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic3.3e-10868.6Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHC-MCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAAV S+S S + Q ++R+  V S+RS+    D FRFR++F      VR+S  +SR V+HC   +  D+ TVA+TKL FL AYKRPIP++YNTVLQELIV
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHC-MCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLMEGYPS+EDR+AIF+AYI+AL EDPEQYR DA+KLEEWAR+Q A +LV+F+S+EGE+E+  KDIA+RAG+K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEY
         F YSR FA+GLFRLLELAN T+P+ILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEY++REKKKR ER  +Q ANE +TKCLG+Y
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEY

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic1.8e-9867.36Show/hide
Query:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAA++S+  + L + +D R   PS  + A+         S       VR     SR V+ C+    DV  TVAETK+NFLK+YKRPI SIY+TVLQEL+V
Subjt:  MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKG
        QQHLMRYK TY+YD VFALGFVTVYDQLMEGYPS+EDRDAIF+AYI ALNEDPEQYR DA+K+EEWARSQ   SLVEF+S++GE+E+ LKDI+ERA  KG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITK
        +FSYSRFFA+GLFRLLELANATEP+IL+KLCAALNI+K+ VDRDLDVYRN+LSKLVQAKELLKEY++REKKKR+ERS +  +NEA+TK
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITK

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic5.7e-10570.79Show/hide
Query:  AVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFS-SRMVIHCMCAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        A++S+S   L Q SD+     S+R LAS     R  T F    SR+  ++ S S+ +IHCM   T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQ
Subjt:  AVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFS-SRMVIHCMCAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DRDAIF+AYI+ALNEDP+QYRIDA+K+EEWARSQT+ SLV+F+S+EG++E+ LKDIA RAGSK  
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGE
        FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEY++REKKK+ ER+ SQ ANE I+KCLG+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGE

Arabidopsis top hitse value%identityAlignment
AT2G20890.1 photosystem II reaction center PSB29 protein4.1e-10670.79Show/hide
Query:  AVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFS-SRMVIHCMCAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        A++S+S   L Q SD+     S+R LAS     R  T F    SR+  ++ S S+ +IHCM   T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQ
Subjt:  AVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFS-SRMVIHCMCAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DRDAIF+AYI+ALNEDP+QYRIDA+K+EEWARSQT+ SLV+F+S+EG++E+ LKDIA RAGSK  
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGE
        FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEY++REKKK+ ER+ SQ ANE I+KCLG+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTGTTAATTCCATTTCAATCTCAACATTAAACCAATGTTCTGATAGAAGATTGCCGGTTCCGTCCGCTCGTTCACTCGCCTCCAATTTCGACGGCTTTCGTTT
TCGTACGAGCTTTTTCAGTCATTATTCCCGAGTTAGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTGTGCCGGAACAGATGTGACCACTGTAGCCGAGA
CAAAATTGAACTTCCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAACACTGTTCTGCAGGAATTGATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATAC
CGTTATGACCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGATGCCATCTTCCAAGCATACATTAAGGC
ATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCCACTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAG
TTGAGAGTACGTTGAAGGACATTGCAGAACGAGCAGGGAGTAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTTTTCCGACTCCTTGAATTGGCAAATGCT
ACAGAACCCAGTATCTTGGAAAAGCTCTGTGCAGCTTTAAACATTGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGC
GAAGGAGCTCCTAAAGGAATACATCGATAGAGAGAAGAAGAAAAGAGATGAGAGGAGTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCA
TGCAGACTGGTTTGTAA
mRNA sequenceShow/hide mRNA sequence
CTTTTTGGGAAGGTGTAGAAATTAACACTTGGATAAGCCTCTCCATCATGGCCTCTCAGTCTCACCCTCTCTTCTTTCCATTTTTCCCAGAAAATTCCTTCGAATTTCAT
TCTCTATACGTCTTTTCTCCCTCTCTTCTTCGATATGAAACCCATTTTCTCCAAAAATTCATAAGCTTCGTACATTTTTCCTCATTCTTCTTTAATGGCGGCTGTTAATT
CCATTTCAATCTCAACATTAAACCAATGTTCTGATAGAAGATTGCCGGTTCCGTCCGCTCGTTCACTCGCCTCCAATTTCGACGGCTTTCGTTTTCGTACGAGCTTTTTC
AGTCATTATTCCCGAGTTAGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTGTGCCGGAACAGATGTGACCACTGTAGCCGAGACAAAATTGAACTTCCT
CAAGGCGTATAAACGGCCTATCCCTAGCATTTACAACACTGTTCTGCAGGAATTGATTGTTCAACAGCATTTGATGAGGTATAAGAGGACATACCGTTATGACCCTGTTT
TCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGATGCCATCTTCCAAGCATACATTAAGGCATTGAATGAGGATCCA
GAGCAATATAGAATTGATGCTAAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCCACTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTACGTTGAA
GGACATTGCAGAACGAGCAGGGAGTAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTTTTCCGACTCCTTGAATTGGCAAATGCTACAGAACCCAGTATCT
TGGAAAAGCTCTGTGCAGCTTTAAACATTGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAAGCGAAGGAGCTCCTAAAG
GAATACATCGATAGAGAGAAGAAGAAAAGAGATGAGAGGAGTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCATGCAGACTGGTTTGTA
AAGAAAAGATGATTGGAGCACTTCCTAATTTGAGATAGACTTGGGAGTTTATAGCAATATTCTAAATTACCCTATAGATATGCATCTGTAATGTATTGTTGGGTCTCATG
CATTTGGTAAATTTTGTATTCAGCCAGGCACTACCTTTATTTCTCATTTGATATAACCACCACTCAAGCTATTTTTTAATCATTTGTATTACATATTTTAGAGTGCAATT
GTCAATACAAATGCTATCGCACTCTGTGCAGTAATCACGCTTCAAGTGATATTCGACCATCTGGCTTTTGTATGTCAATCATTTCTGGATTATTATGATCTTTCGCCCTT
TGCTTTTGTATGGTCTAAATTCATTCAGTTACAAACATTTAGGATTGT
Protein sequenceShow/hide protein sequence
MAAVNSISISTLNQCSDRRLPVPSARSLASNFDGFRFRTSFFSHYSRVRASTFSSRMVIHCMCAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMEGYPSDEDRDAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTATSLVEFASREGEVESTLKDIAERAGSKGNFSYSRFFAIGLFRLLELANA
TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERSGSQTANEAITKCLGEYSMQTGL