CuGenDBv2

Tan0018023 (gene) of Snake gourd v1 genome

Gene ID: Tan0018023
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein THYLAKOID FORMATION1, chloroplastic
Genome location: LG02:2293785..2296717
RNA-Seq Expression: Tan0018023
Synteny: Tan0018023
Gene Ontology terms:
GO:0010027 - thylakoid membrane organization (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domains: IPR017499 - Protein Thf1
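
As a small illustration of how the genome location field can be read, the sketch below splits a `sequence:start..end` string into its parts and computes the span. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_location(location):
    """Split a string like 'LG02:2293785..2296717' into (sequence ID, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)

seq_id, start, end = parse_location("LG02:2293785..2296717")
# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span = end - start + 1
print(seq_id, start, end, span)
```

Under that assumption the span covers the whole gene region on LG02, which may be longer than the mRNA if the gene contains introns.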


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo] (e-value: 1.2e-144; identity: 90.94%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFSTL+QC+DRR PV S+RSL+SNFDGFRFR+S+F HYS VR S FSSR+VIHCMSAGTDVTTVAETK NFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGE ESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERA    GSQTANEAITKCLGEY++
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
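
The middle line of each alignment block can be related to the % identity figure. The sketch below assumes the common BLAST-style convention, which this page does not spell out: a letter marks an identical residue, `+` marks a similar substitution, and a space marks a mismatch or gap. It also assumes identity is taken over the number of aligned columns. The function name `percent_identity` is hypothetical, and the input is a toy midline rather than a full hit.

```python
def percent_identity(midline):
    """Percent of alignment columns whose midline character is a residue letter."""
    if not midline:
        raise ValueError("empty midline")
    # Letters are identical positions; '+' and spaces are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# Toy 10-column midline: nine identical residues and one '+' (similar) column.
print(percent_identity("MAAVNS+SFS"))
```

To apply this to a full hit, the midline segments of all blocks would be concatenated first, with any leading layout padding removed.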

XP_022136235.1 protein THYLAKOID FORMATION1, chloroplastic isoform X1 [Momordica charantia] (e-value: 1.0e-146; identity: 92.62%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQC++RRL V SARSLASNFDGFRFR+SVFCHYSGVRTS +SSR+V+HCMSAGTDVTTVAETK+NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFASKEGE ESILKDIAERAG KGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNV+KKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTANEAITKCLGEY++
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL

XP_022952157.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita moschata] (e-value: 4.6e-144; identity: 89.93%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQC+DRRLP+ SARSLAS+FDGFRFR SVFCHYSGVRT  F+SR+VIHCM++GTDVTTVAETK+NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGE ESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNVDKK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTA+EAITKCLGEY++
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL

XP_022969189.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita maxima] (e-value: 9.3e-145; identity: 90.6%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQC+DRRLP+ SARSLAS+FDGFRFR SVFCHYSGVRTS FSSR+VIHCM++GTDVTTVAETK+NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGE ESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNVDKK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTA+EAITKCLGEY++
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL

XP_023554556.1 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo] (e-value: 3.5e-144; identity: 89.93%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSF+ LSQC+DRRLP+ SARSLAS+FDGFRFR SVFCHYSGVRTS FSSR+VIHCM++GTDVTTVAETK+NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGE ESILKDIAERAG KG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNVDKK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTA+EAITKCLGEY++
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic (e-value: 5.9e-145; identity: 90.94%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFSTL+QC+DRR PV S+RSL+SNFDGFRFR+S+F HYS VR S FSSR+VIHCMSAGTDVTTVAETK NFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGE ESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERA    GSQTANEAITKCLGEY++
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL

A0A5D3C7D3 Protein THYLAKOID FORMATION1 (e-value: 5.9e-145; identity: 90.94%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNS+SFSTL+QC+DRR PV S+RSL+SNFDGFRFR+S+F HYS VR S FSSR+VIHCMSAGTDVTTVAETK NFLKAYKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGE ESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK VDRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERA    GSQTANEAITKCLGEY++
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL

A0A6J1C3B8 protein THYLAKOID FORMATION1, chloroplastic isoform X1 (e-value: 4.8e-147; identity: 92.62%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQC++RRL V SARSLASNFDGFRFR+SVFCHYSGVRTS +SSR+V+HCMSAGTDVTTVAETK+NFLK YKRPIPSIYNTVLQELIVQ
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQTAASLVEFASKEGE ESILKDIAERAG KGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
        FSYSRFFAIGLFRLLELANATEPSILEKLCAALNV+KKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTANEAITKCLGEY++
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL

A0A6J1GJN0 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 (e-value: 2.2e-144; identity: 89.93%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQC+DRRLP+ SARSLAS+FDGFRFR SVFCHYSGVRT  F+SR+VIHCM++GTDVTTVAETK+NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGE ESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNVDKK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTA+EAITKCLGEY++
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL

A0A6J1I1V8 protein THYLAKOID FORMATION1, chloroplastic-like isoform X2 (e-value: 4.5e-145; identity: 90.6%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        MAAVNSVSFS LSQC+DRRLP+ SARSLAS+FDGFRFR SVFCHYSGVRTS FSSR+VIHCM++GTDVTTVAETK+NFLKAYKRPIPSIYNTV+QELIVQ
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYD+LMEGYPSDEDR+AIFQAYI ALNEDPEQYRIDAQKLEEWARSQTAASLVEFAS+EGE ESILKDIAERAGSKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
        FSYSRFFAIGLFRLLELANA+EPSILEKLCAALNVDKK VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA    GSQTA+EAITKCLGEY++
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL

SwissProt top hits (e-value, % identity, alignment)
B0C3M8 Protein Thf1 (e-value: 1.3e-35; identity: 36.97%)
Query:  DVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWA
        ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ M+GY  + D++AIF A  KA   DP Q + D Q+L E A
Subjt:  DVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWA

Query:  RSQTAASLVEF---ASKEG--EAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNVDKKSVDRDLDVYRNLLSKL
        +S++A  ++++   A+  G  E +  L++IA+       F YSR FAIGLF LLEL+  N T+        L  +C  LN+ +  + +DL++YR  L K+
Subjt:  RSQTAASLVEF---ASKEG--EAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALNVDKKSVDRDLDVYRNLLSKL

Query:  VQAKELLKEYVDREKKKR--DERAGSGSGSQTANEAIT
         Q ++ + + ++ +KK+R  D+    GS      EA T
Subjt:  VQAKELLKEYVDREKKKR--DERAGSGSGSQTANEAIT

Q116P5 Protein Thf1 (e-value: 5.9e-33; identity: 35.62%)
Query:  TVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M+GY   ED+ +IF A I+   EDP +YR DA+ LE+ A   
Subjt:  TVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ

Query:  TAASLVEF--ASKEGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKEL
        +A+ ++ +   SK  +    L+D          F YSR FAIGLF LLE+ +             L+K+C +LN+ ++ + +D+D+Y + L ++ QA+  
Subjt:  TAASLVEF--ASKEGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKEL

Query:  LKEYVDREKKKRDERA-----GSGSGSQTANEA
        +++ +   +KKR++R+      S SG++T+ ++
Subjt:  LKEYVDREKKKRDERA-----GSGSGSQTANEA

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic (e-value: 1.0e-109; identity: 71.04%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHC-MSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIV
        MAAV SVSFS ++Q A+R+  VSS+RS+    D FRFRS+       VR+S  +SR V+HC  S+  D+ TVA+TK  FL AYKRPIP++YNTVLQELIV
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHC-MSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLMEGYPS+EDR AIF+AYI+AL EDPEQYR DAQKLEEWAR+Q A +LV+F+SKEGE E+I KDIA+RAG+K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKG

Query:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEY
         F YSR FA+GLFRLLELAN T+P+ILEKLCAALNV+KKSVDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER      +Q ANE +TKCLG+Y
Subjt:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEY

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic (e-value: 4.0e-98; identity: 67.11%)
Query:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDV-TTVAETKSNFLKAYKRPIPSIYNTVLQELIV
        MAA++S+ F+ L + AD R   ++A + A          +V       R     SR V+ C++   DV  TVAETK NFLK+YKRPI SIY+TVLQEL+V
Subjt:  MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDV-TTVAETKSNFLKAYKRPIPSIYNTVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKG
        QQHLMRYK TY+YD VFALGFVTVYDQLMEGYPS+EDR+AIF+AYI ALNEDPEQYR DAQK+EEWARSQ   SLVEF+SK+GE E+ILKDI+ERA  KG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKG

Query:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYN
        SFSYSRFFA+GLFRLLELANATEP+IL+KLCAALN++K+SVDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ER    S +  +NEA+TK  G  N
Subjt:  SFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYN

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic (e-value: 1.7e-104; identity: 70.51%)
Query:  AVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVR-TSGFSSRLVIHCMSAGT-DVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        A++S+SF  L Q +D+    +S+R LAS          +   +S +   S  +S+ +IHCMS  T DV  V+ETKS FLKAYKRPIPSIYNTVLQELIVQ
Subjt:  AVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVR-TSGFSSRLVIHCMSAGT-DVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQT+ASLV+F+SKEG+ E++LKDIA RAGSK  
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGE
        FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN++KKSVDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA     SQ ANE I+KCLG+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGE

Arabidopsis top hits (e-value, % identity, alignment)
AT2G20890.1 photosystem II reaction center PSB29 protein (e-value: 1.2e-105; identity: 70.51%)
Query:  AVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVR-TSGFSSRLVIHCMSAGT-DVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ
        A++S+SF  L Q +D+    +S+R LAS          +   +S +   S  +S+ +IHCMS  T DV  V+ETKS FLKAYKRPIPSIYNTVLQELIVQ
Subjt:  AVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVR-TSGFSSRLVIHCMSAGT-DVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS
        QHLMRYK+TYRYDPVFALGFVTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQT+ASLV+F+SKEG+ E++LKDIA RAGSK  
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGE
        FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN++KKSVDRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA     SQ ANE I+KCLG+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGE


Sequences
CDS sequence
ATGGCGGCTGTTAATTCCGTATCATTCTCAACATTAAGTCAATGTGCTGATAGAAGGTTGCCGGTTTCGTCGGCTCGTTCACTCGCCTCGAATTTCGACGGGTTCCGTTT
TCGTTCGAGCGTTTTCTGTCATTATTCGGGAGTTCGAACCTCGGGTTTCAGTTCTCGCTTGGTTATTCATTGCATGTCCGCCGGAACAGATGTGACTACTGTAGCCGAGA
CTAAATCGAACTTTCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAATACTGTTCTACAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACATAC
CGTTATGATCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGAGAGGCCATTTTCCAAGCATACATTAAGGC
ATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAGG
CTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGACTCCTTGAATTGGCAAATGCT
ACTGAACCCAGTATCTTGGAAAAGCTCTGTGCCGCTTTAAATGTTGACAAAAAGAGCGTGGACAGAGACCTTGATGTATACCGCAACCTGCTTTCAAAGTTGGTTCAGGC
GAAAGAGCTCTTAAAGGAATATGTTGACAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCTGGATCTGGATCACAGACAGCTAATGAGGCAATAACAAAATGCTTGG
GAGAATACAACTTGTAG
mRNA sequence
CACCCGCTCTTTCTTTCTTTTTTCCCAGAACTTTTCTTCCAATTTCATGTTCTATAGGATTTTTCTCCCACTCTTCTTCTTCGATATGAAATCCATTTGCTCTGGAAGTT
CGTAAGCTTTTCAAGTTTCTTCTCATTCTTCTACAATGGCGGCTGTTAATTCCGTATCATTCTCAACATTAAGTCAATGTGCTGATAGAAGGTTGCCGGTTTCGTCGGCT
CGTTCACTCGCCTCGAATTTCGACGGGTTCCGTTTTCGTTCGAGCGTTTTCTGTCATTATTCGGGAGTTCGAACCTCGGGTTTCAGTTCTCGCTTGGTTATTCATTGCAT
GTCCGCCGGAACAGATGTGACTACTGTAGCCGAGACTAAATCGAACTTTCTCAAGGCGTATAAACGGCCTATCCCTAGCATTTACAATACTGTTCTACAAGAGTTGATTG
TGCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCCCTTGGTTTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAG
GATCGAGAGGCCATTTTCCAAGCATACATTAAGGCATTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGACTGCAGC
TTCATTGGTTGAGTTTGCATCAAAAGAAGGAGAGGCTGAGAGTATTTTGAAGGACATTGCAGAACGAGCAGGGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTA
TTGGGCTATTTCGACTCCTTGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGCTCTGTGCCGCTTTAAATGTTGACAAAAAGAGCGTGGACAGAGACCTTGAT
GTATACCGCAACCTGCTTTCAAAGTTGGTTCAGGCGAAAGAGCTCTTAAAGGAATATGTTGACAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCTGGATCTGGATC
ACAGACAGCTAATGAGGCAATAACAAAATGCTTGGGAGAATACAACTTGTAGACTGGTTTTTGAGAGCTGATTACATCAATTGGAGCACTGCCTAATTTGAGATAAGACT
TGAGAGTTATAGCAATATTCTAAATTTACCTGATAGATATGCTGCATTTGGTAATGTGTTGTTGGGTCTTTTCGCATTTGGTAAATTTTGTATTAAGCCAGGCACTACCT
TTATTCAAGCTATTTTTTAATCATTTGTATTACATATTTTGAGTGCAATTGTCAATACAAATGCTTTCGCACTCTGTGTAGTAATCATATTTTGCAGTTCTCTCCTTCAA
GGGATATTTGACCATTTACTTTTGTTTGTGCAATTATTTCTGGATTCTTAGTGAGTTCAAAATATATATATTGAAGGTATATGAGAA
Protein sequence
MAAVNSVSFSTLSQCADRRLPVSSARSLASNFDGFRFRSSVFCHYSGVRTSGFSSRLVIHCMSAGTDVTTVAETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASKEGEAESILKDIAERAGSKGSFSYSRFFAIGLFRLLELANA
TEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSGSGSQTANEAITKCLGEYNL
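
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is one minimal way to do that, assuming the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The `translate` helper is hypothetical and not part of any CuGenDB tooling. The two inputs are the first nine codons of the CDS and its last six codons (joined across the line wrap).

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGGCTGTTAATTCCGTATCATTC"))  # start of the CDS
print(translate("GGAGAATACAACTTGTAG"))           # end of the CDS, including the stop codon
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the protein sequence gives a quick consistency check between the two fields.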