CuGenDBv2

IVF0001066 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0001066
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: protein THYLAKOID FORMATION1, chloroplastic
Genome location: chr01:2664052..2666808
RNA-Seq Expression: IVF0001066
Synteny: IVF0001066
Gene Ontology terms: GO:0010027 - thylakoid membrane organization (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domains: IPR017499 - Protein Thf1
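The genome location above packs a chromosome name and two coordinates into one string. As a minimal sketch, the snippet below splits that string and computes the length of the genomic span. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr01:2664052..2666808")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2757 bp on chr01, which covers introns and untranslated regions as well as the coding sequence.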


Homology
GenBank top hits (e value, % identity, alignment)
KAE8645760.1 hypothetical protein Csa_020345 [Cucumis sativus] | e value: 3.95e-192 | % identity: 96.3
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR  +PSSRS SSNF GF FRTS+FTHYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTG

XP_004136805.1 protein THYLAKOID FORMATION1, chloroplastic [Cucumis sativus] | e value: 3.63e-198 | % identity: 96.31
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR  +PSSRS SSNF GF FRTS+FTHYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] | e value: 2.60e-207 | % identity: 100
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

XP_022969189.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima] | e value: 2.62e-191 | % identity: 91.28
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCSDRR P+PS+RSL+S+FDGFRFR S+F HYS VR S+FSSRMVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

XP_038888611.1 protein THYLAKOID FORMATION1, chloroplastic [Benincasa hispida] | e value: 8.93e-199 | % identity: 95.64
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAV SISFSTLNQCSDRR PVPS+RSL+SNFDGFRFRTS+F+HYSRVR STFSS MVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDR+AIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLV+FASREGEVES LKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCA+LNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K3P0 Uncharacterized protein | e value: 3.1e-154 | % identity: 96.31
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRR  +PSSRS SSNF GF FRTS+FTHYSRVR STFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+K EEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic | e value: 3.4e-161 | % identity: 100
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

A0A5D3C7D3 Protein THYLAKOID FORMATION1 | e value: 3.4e-161 | % identity: 100
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

A0A6J1C3B8 protein THYLAKOID FORMATION1, chloroplastic isoform X1 | e value: 2.6e-148 | % identity: 91.58
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCS+RR  VPS+RSL+SNFDGFRFRTS+F HYS VR S++SSRMV+HCMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTG

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 | e value: 5.1e-149 | % identity: 91.28
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCSDRR P+PS+RSL+S+FDGFRFR S+F HYS VR S+FSSRMVIHCM++GTDVTTVAETK NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALN+DKKGVDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTA+EAITKCLGEYSMQTGL
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL

SwissProt top hits (e value, % identity, alignment)
B0C3M8 Protein Thf1 | e value: 1.3e-35 | % identity: 37.9
Query:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWA
        ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ M+GY  + D++AIF A  KA   DP Q + D Q+L E A
Subjt:  DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWA

Query:  RSQTAASLVEF---ASREG--EVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKL
        +S++A  ++++   A+  G  E++  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LNI +  + +DL++YR  L K+
Subjt:  RSQTAASLVEF---ASREG--EVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNIDKKGVDRDLDVYRNLLSKL

Query:  VQAKELLKEYIDREKKKRD
         Q ++ + + ++ +KK+R+
Subjt:  VQAKELLKEYIDREKKKRD

Q116P5 Protein Thf1 | e value: 5.9e-33 | % identity: 35.65
Query:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+GY   ED+ +IF A I+   EDP +YR DA+ LE+ A   
Subjt:  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ

Query:  TAASLVEFASREGEVES--ILKDIAERAGSKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKEL
        +A+ ++ +      +++   L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ + +D+D+Y + L ++ QA+  
Subjt:  TAASLVEFASREGEVES--ILKDIAERAGSKGNFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKEL

Query:  LKEYIDREKKKRDERA
        +++ +   +KKR++R+
Subjt:  LKEYIDREKKKRDERA

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic | e value: 7.7e-110 | % identity: 69.97
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAAV S+SFS + Q ++R+  V SSRS+    D FRFR++       VR S  +SR V+HC  S+  D+ TVA+TKL FL AYKRPIP++YNTVLQELIV
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHC-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLMEGYPS+EDR AIF+AYI+AL EDPEQYR DAQKLEEWAR+Q A +LV+F+S+EGE+E+I KDIA+RAG+K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEY
         F YSR FA+GLFRLLELAN T+P+ILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEY++REKKKR ER  +Q ANE +TKCLG+Y
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEY

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic | e value: 4.3e-100 | % identity: 67.71
Query:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV
        MAA++S+ F+ L + +D R   PS+ + ++         S       VRP    SR V+ C++   DV  TVAETK+NFLK+YKRPI SIY+TVLQEL+V
Subjt:  MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKG
        QQHLMRYK TY+YD VFALGFVTVYDQLMEGYPS+EDR+AIF+AYI ALNEDPEQYR DAQK+EEWARSQ   SLVEF+S++GE+E+ILKDI+ERA  KG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKG

Query:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITK
        +FSYSRFFA+GLFRLLELANATEP+IL+KLCAALNI+K+ VDRDLDVYRN+LSKLVQAKELLKEY++REKKKR+ER+ +  +NEA+TK
Subjt:  NFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITK

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic | e value: 1.6e-107 | % identity: 71.48
Query:  AVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVR-PSTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        A++S+SF  L Q SD+     SSR L+S          + T +SR+   S  +S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQ
Subjt:  AVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVR-PSTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQT+ASLV+F+S+EG++E++LKDIA RAGSK  
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGE
        FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEY++REKKK+ ERA SQ ANE I+KCLG+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGE

Arabidopsis top hits (e value, % identity, alignment)
AT2G20890.1 photosystem II reaction center PSB29 protein | e value: 1.1e-108 | % identity: 71.48
Query:  AVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVR-PSTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ
        A++S+SF  L Q SD+     SSR L+S          + T +SR+   S  +S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQ
Subjt:  AVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVR-PSTFSSRMVIHCMSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQT+ASLV+F+S+EG++E++LKDIA RAGSK  
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGN

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGE
        FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK VDRDLDVYRNLLSKLVQAKELLKEY++REKKK+ ERA SQ ANE I+KCLG+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGE
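In the alignment blocks above, the unlabelled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and leaves a space for a mismatch. As a rough sketch of how a percent-identity figure relates to that line, the snippet below counts identical positions over the aligned length. The function name `percent_identity` is hypothetical, and the exact formula this database uses for its % identity column is an assumption (identities divided by alignment length, as in typical BLAST reports).

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns where the midline repeats the query residue.

    ASSUMPTION: identity = identical columns / alignment length, gaps included.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("alignment is empty")
    # A column counts as identical when the midline shows the same residue
    # as the query; '+', spaces and gap characters do not count.
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return 100.0 * identical / len(query)


# Short fragment taken from the third alignment block of the top hits above,
# where a single I -> V substitution is shown as '+'.
fragment_query = "KEYIDREKKKRD"
fragment_midline = "KEY+DREKKKRD"
print(round(percent_identity(fragment_query, fragment_midline), 2))
```

The fragment has 11 identical columns out of 12. The reported figures are computed over each hit's full alignment, so a fragment like this only illustrates the counting.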


Sequences
CDS sequence
ATGGCGGCTGTTAATTCCATTTCATTCTCAACACTAAACCAATGTTCCGATAGGAGATTCCCGGTTCCGTCCTCTCGTTCCCTTTCCTCCAATTTCGACGGCTTCCGTTT
TCGTACGAGCCTTTTCACTCATTATTCCCGAGTTCGACCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACCACTGTGGCCGAGA
CAAAATTGAACTTCCTCAAGGCCTATAAACGGCCTATCCCTAGTATTTACAACACGGTTCTGCAAGAATTGATTGTTCAGCAGCATTTGATGAGGTATAAGAGGACATAC
CGTTATGATCCTGTTTTCGCTCTTGGATTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGAAGCCATCTTTCAAGCGTACATTAAGGC
ATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGATCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAG
TTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGGAGCAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTATTTCGTCTTCTTGAATTGGCAAATGCT
ACTGAGCCCAGTATCTTGGAAAAGCTCTGTGCCGCTCTAAACATCGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAGGC
AAAAGAGCTCCTAAAGGAATACATCGACAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAATGAGGCAATAACTAAATGCTTGGGAGAATACAGCA
TGCAGACCGGTTTATAA
mRNA sequence
TGGCCTCTCAGTCTCACCCTTCTCTTCCTTCTTTCTTTCTTTCCTTCTTTATTTTTTTTTCTCAAAAATTTCTTCGAATTTCATTTTCTATTCCTTTTTTTTTCTCCCTC
TCTTCTTCGATATGAAATCCAGTTTCTCCAAAAATTCCTAAGCTTCGCAGATTTTTCCTCATTCCTATTCAATGGCGGCTGTTAATTCCATTTCATTCTCAACACTAAAC
CAATGTTCCGATAGGAGATTCCCGGTTCCGTCCTCTCGTTCCCTTTCCTCCAATTTCGACGGCTTCCGTTTTCGTACGAGCCTTTTCACTCATTATTCCCGAGTTCGACC
ATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACCACTGTGGCCGAGACAAAATTGAACTTCCTCAAGGCCTATAAACGGCCTATCC
CTAGTATTTACAACACGGTTCTGCAAGAATTGATTGTTCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCTCTTGGATTTGTTACTGTA
TATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGAAGCCATCTTTCAAGCGTACATTAAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCA
AAAATTGGAAGAGTGGGCTCGATCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGGA
GCAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTATTTCGTCTTCTTGAATTGGCAAATGCTACTGAGCCCAGTATCTTGGAAAAGCTCTGTGCCGCTCTA
AACATCGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAGGCAAAAGAGCTCCTAAAGGAATACATCGACAGAGAGAAGAA
GAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAATGAGGCAATAACTAAATGCTTGGGAGAATACAGCATGCAGACCGGTTTATAAGAGGTAATTGCAGACGCTACCT
AATTTGAGATAAAACTTGAGAGAGTTTATAGCAATATTATTCTAAATACTTGTTAGATATGCATCTATAATGTATTATGTTGGGTCTCATGCATTTGGTAAATTTTGTAT
TCAGCCAGGGCTCTACCTTTATTTCTCATTTGATATAACCACCACTCGAGCTATTTTTTAATCATTGTATTACATATTTTGAGTGCAACTGTCAATACAAAATGCTTTCG
CACTCTGTGCAGTAATTACTCCTCCAGGGATATTTGAC
Protein sequence
MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANA
TEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTGL
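The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code applies to this gene. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, keeping '*' for stops."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 18 nt and last 9 nt of the CDS listed above.
print(translate("ATGGCGGCTGTTAATTCC"))  # start of the protein
print(translate("GGTTTATAA"))           # end of the protein plus stop codon
```

The first call yields `MAAVNS` and the second `GL*`, matching the first six and last two residues of the protein sequence listed above. To check the whole record, join the CDS lines into one string, pass it to the same function, and drop the trailing `*`.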