; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS022422 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS022422
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein THYLAKOID FORMATION1, chloroplastic
Genome locationscaffold47:3243842..3246422
RNA-Seq ExpressionMS022422
SyntenyMS022422
Gene Ontology termsGO:0010027 - thylakoid membrane organization (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domainsIPR017499 - Protein Thf1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo]2.6e-14791.58Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCS+RR  VPS+RSL+SNFDGFRFRTS+F HYS VR S++SSRMV+HCMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

XP_022136235.1 protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia]1.7e-159100Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF

XP_022952157.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita moschata]1.5e-14791.58Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFSALSQCS+RRL +PSARSLAS+FDGFRFR SVFCHYSGVRT S++SRMV+HCM++GTDVTTVAETKANFLK YKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNV+KK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

XP_022969189.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima]5.3e-14891.92Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQCS+RRL +PSARSLAS+FDGFRFR SVFCHYSGVRTSS+SSRMV+HCM++GTDVTTVAETKANFLK YKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNV+KK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

XP_023554556.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo]8.1e-14992.26Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        MAAVNSVSF+ALSQCS+RRL +PSARSLAS+FDGFRFR SVFCHYSGVRTSS+SSRMV+HCM++GTDVTTVAETKANFLK YKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+EGEVESILKDIAERAGGKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNV+KK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

TrEMBL top hitse value%identityAlignment
A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic1.3e-14791.58Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCS+RR  VPS+RSL+SNFDGFRFRTS+F HYS VR S++SSRMV+HCMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

A0A5D3C7D3 Protein THYLAKOID FORMATION11.3e-14791.58Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFS L+QCS+RR  VPS+RSL+SNFDGFRFRTS+F HYS VR S++SSRMV+HCMSAGTDVTTVAETK NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN++KK VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

A0A6J1C3B8 protein THYLAKOID FORMATION1, chloroplastic isoform X18.5e-160100Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF

A0A6J1GJN0 protein THYLAKOID FORMATION1, chloroplastic-like isoform X27.4e-14891.58Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFSALSQCS+RRL +PSARSLAS+FDGFRFR SVFCHYSGVRT S++SRMV+HCM++GTDVTTVAETKANFLK YKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNV+KK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X22.6e-14891.92Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQCS+RRL +PSARSLAS+FDGFRFR SVFCHYSGVRTSS+SSRMV+HCM++GTDVTTVAETKANFLK YKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDA+KLEEWARSQTAASLVEFAS+EGEVESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNV+KK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTA+EAITKCLGEYSMQTG
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG

SwissProt top hitse value%identityAlignment
B0C3M8 Protein Thf11.6e-3536.99Show/hide
Query:  DVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWA
        ++ TV++TK  F  ++ RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ M+GY  + D++AIF A  KA   DP Q + D ++L E A
Subjt:  DVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWA

Query:  RSQTAASLVEF---ASKEG--EVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNVNKKSVDRDLDVYRNLLSKL
        +S++A  ++++   A+  G  E++  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LN+++  + +DL++YR  L K+
Subjt:  RSQTAASLVEF---ASKEG--EVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNVNKKSVDRDLDVYRNLLSKL

Query:  VQAKELLKEYVDREKKKRD
         Q ++ + + ++ +KK+R+
Subjt:  VQAKELLKEYVDREKKKRD

Q116P5 Protein Thf13.4e-3337.04Show/hide
Query:  TVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+GY   ED+ +IF A I+   EDP +YR DAK LE+ A   
Subjt:  TVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQ

Query:  TAASLVEF--ASKEGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKEL
        +A+ ++ +   SK  +    L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ + +D+D+Y + L ++ QA+  
Subjt:  TAASLVEF--ASKEGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKEL

Query:  LKEYVDREKKKRDERA
        +++ +   +KKR++R+
Subjt:  LKEYVDREKKKRDERA

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic1.3e-10971.67Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHC-MSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIV
        MAAV SVSFSA++Q +ER+  V S+RS+    D FRFR++       VR+S+ +SR VVHC  S+  D+ TVA+TK  FL  YKRPIP++YNTVLQELIV
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHC-MSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLMEGYPS+EDR AIF+AYI+AL EDPEQYR DA+KLEEWAR+Q A +LV+F+SKEGE+E+I KDIA+RAG K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKG

Query:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEY
         F YSR FA+GLFRLLELAN T+P+ILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER  +Q ANE +TKCLG+Y
Subjt:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEY

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic3.3e-10069.1Show/hide
Query:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDV-TTVAETKANFLKVYKRPIPSIYNTVLQELIV
        MAA++S+ F+AL + ++ R   PS  + A+         SV             SR VV C++   DV  TVAETK NFLK YKRPI SIY+TVLQEL+V
Subjt:  MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDV-TTVAETKANFLKVYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKG
        QQHLMRYK TY+YD VFALGFVTVYDQLMEGYPS+EDR+AIF+AYI ALNEDPEQYR DA+K+EEWARSQ   SLVEF+SK+GE+E+ILKDI+ERA GKG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKG

Query:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITK
        SFSYSRFFA+GLFRLLELANATEP+IL+KLCAALN+NK+SVDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ER+ +  +NEA+TK
Subjt:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITK

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic4.0e-10670.79Show/hide
Query:  AVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYS-SRMVVHCMSAGT-DVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        A++S+SF AL Q S++     S+R LAS          +   +S +  +S S S+ ++HCMS  T DV  V+ETK+ FLK YKRPIPSIYNTVLQELIVQ
Subjt:  AVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYS-SRMVVHCMSAGT-DVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDA+K+EEWARSQT+ASLV+F+SKEG++E++LKDIA RAG K  
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE
        FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN+NKKSVDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE I+KCLG+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE

Arabidopsis top hitse value%identityAlignment
AT2G20890.1 photosystem II reaction center PSB29 protein2.8e-10770.79Show/hide
Query:  AVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYS-SRMVVHCMSAGT-DVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ
        A++S+SF AL Q S++     S+R LAS          +   +S +  +S S S+ ++HCMS  T DV  V+ETK+ FLK YKRPIPSIYNTVLQELIVQ
Subjt:  AVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYS-SRMVVHCMSAGT-DVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDA+K+EEWARSQT+ASLV+F+SKEG++E++LKDIA RAG K  
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE
        FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN+NKKSVDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE I+KCLG+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTGTTAATTCCGTGTCATTCTCCGCATTAAGTCAATGTTCTGAAAGAAGATTGCTGGTTCCTTCGGCTCGTTCACTAGCCTCGAATTTCGACGGGTTTCGTTT
TCGTACAAGCGTTTTCTGCCATTATTCGGGAGTTCGGACATCGAGTTACAGTTCTCGAATGGTCGTCCATTGCATGTCTGCCGGAACAGATGTGACCACCGTGGCCGAGA
CAAAGGCGAACTTCCTCAAGGTGTATAAGCGGCCTATTCCTAGCATTTACAATACTGTTCTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACATAC
CGTTATGATCCTGTTTTCGCCCTTGGTTTTGTTACAGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGAGAGGCCATTTTCCAAGCATATATTAAGGC
ATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAGGAGTGGGCTCGGTCTCAGACAGCAGCTTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAAG
TTGAGAGTATTTTAAAGGACATTGCAGAACGGGCAGGGGGTAAGGGGAGTTTCAGTTACAGCCGTTTTTTTGCTATTGGGCTGTTTCGACTCCTTGAATTGGCCAATGCT
ACCGAACCCAGTATCTTGGAAAAGCTCTGTGCTGCTTTAAATGTCAACAAAAAAAGTGTGGACCGAGACCTAGATGTCTACCGCAACCTGCTTTCAAAGTTGGTTCAGGC
AAAAGAGCTCCTAAAGGAATACGTGGATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCTCAGACAGCTAATGAGGCCATAACAAAATGCTTGGGAGAGTACAGCA
TGCAGACTGGTTTT
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCTGTTAATTCCGTGTCATTCTCCGCATTAAGTCAATGTTCTGAAAGAAGATTGCTGGTTCCTTCGGCTCGTTCACTAGCCTCGAATTTCGACGGGTTTCGTTT
TCGTACAAGCGTTTTCTGCCATTATTCGGGAGTTCGGACATCGAGTTACAGTTCTCGAATGGTCGTCCATTGCATGTCTGCCGGAACAGATGTGACCACCGTGGCCGAGA
CAAAGGCGAACTTCCTCAAGGTGTATAAGCGGCCTATTCCTAGCATTTACAATACTGTTCTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACATAC
CGTTATGATCCTGTTTTCGCCCTTGGTTTTGTTACAGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGAGAGGCCATTTTCCAAGCATATATTAAGGC
ATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTGGAGGAGTGGGCTCGGTCTCAGACAGCAGCTTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAAG
TTGAGAGTATTTTAAAGGACATTGCAGAACGGGCAGGGGGTAAGGGGAGTTTCAGTTACAGCCGTTTTTTTGCTATTGGGCTGTTTCGACTCCTTGAATTGGCCAATGCT
ACCGAACCCAGTATCTTGGAAAAGCTCTGTGCTGCTTTAAATGTCAACAAAAAAAGTGTGGACCGAGACCTAGATGTCTACCGCAACCTGCTTTCAAAGTTGGTTCAGGC
AAAAGAGCTCCTAAAGGAATACGTGGATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCTCAGACAGCTAATGAGGCCATAACAAAATGCTTGGGAGAGTACAGCA
TGCAGACTGGTTTT
Protein sequenceShow/hide protein sequence
MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVHCMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASKEGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANA
TEPSILEKLCAALNVNKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGF