; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg19098 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg19098
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionprotein THYLAKOID FORMATION1, chloroplastic-like
Genome locationCarg_Chr11:12352665..12355239
RNA-Seq ExpressionCarg19098
SyntenyCarg19098
Gene Ontology termsGO:0010027 - thylakoid membrane organization (biological process)
GO:0010207 - photosystem II assembly (biological process)
GO:0045037 - protein import into chloroplast stroma (biological process)
GO:0045038 - protein import into chloroplast thylakoid membrane (biological process)
InterPro domainsIPR017499 - Protein Thf1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589193.1 Protein THYLAKOID FORMATION1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]3.5e-11898.69Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSPVPSAR+LASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEF SKEGEVESILKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKV
        FSYSRFFAIGLFRLLELANATEPSILEK+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKV

KAG7022895.1 Protein THYLAKOID FORMATION1, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]2.7e-123100Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKVYFIFLFVI
        FSYSRFFAIGLFRLLELANATEPSILEKVYFIFLFVI
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKVYFIFLFVI

XP_008455361.1 PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Cucumis melo]1.9e-10889.96Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNS+SFST++Q SDRR PVPS+RSL+SNFDGFRFR+S+F H+S VR S+FSSR+VIHCMS GTDVTTVAETKLNFLKAYKRPIPSIYN+VLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ+AASLVEFAS+EGEVESILKDIAERA SKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKV
        FSYSRFFAIGLFRLLELANATEPSILEK+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKV

XP_022930881.1 protein THYLAKOID FORMATION1, chloroplastic-like [Cucurbita moschata]2.0e-11898.69Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMS GTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVES+LKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKV
        FSYSRFFAIGLFRLLELANATEPSILEK+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKV

XP_022988874.1 protein THYLAKOID FORMATION1, chloroplastic-like [Cucurbita maxima]5.3e-11999.13Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSP+PSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKV
        FSYSRFFAIGLFRLLELANATEPSILEK+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKV

TrEMBL top hitse value%identityAlignment
A0A1S3C0V5 protein THYLAKOID FORMATION1, chloroplastic9.2e-10989.96Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNS+SFST++Q SDRR PVPS+RSL+SNFDGFRFR+S+F H+S VR S+FSSR+VIHCMS GTDVTTVAETKLNFLKAYKRPIPSIYN+VLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ+AASLVEFAS+EGEVESILKDIAERA SKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKV
        FSYSRFFAIGLFRLLELANATEPSILEK+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKV

A0A5D3C7D3 Protein THYLAKOID FORMATION19.2e-10989.96Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNS+SFST++Q SDRR PVPS+RSL+SNFDGFRFR+S+F H+S VR S+FSSR+VIHCMS GTDVTTVAETKLNFLKAYKRPIPSIYN+VLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ+AASLVEFAS+EGEVESILKDIAERA SKG+
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKV
        FSYSRFFAIGLFRLLELANATEPSILEK+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKV

A0A6J1C3B8 protein THYLAKOID FORMATION1, chloroplastic isoform X12.7e-10890.39Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFS +SQ S+RR  VPSARSLASNFDGFRFR+SVF H+SGVRTSS+SSR+V+HCMS GTDVTTVAETK NFLK YKRPIPSIYN+VLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQ+AASLVEFASKEGEVESILKDIAERA  KGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKV
        FSYSRFFAIGLFRLLELANATEPSILEK+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKV

A0A6J1ES50 protein THYLAKOID FORMATION1, chloroplastic-like9.8e-11998.69Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMS GTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVES+LKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKV
        FSYSRFFAIGLFRLLELANATEPSILEK+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKV

A0A6J1JMT7 protein THYLAKOID FORMATION1, chloroplastic-like2.6e-11999.13Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
        MAAVNSVSFSTVSQFSDRRSP+PSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQ

Query:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
        QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS
Subjt:  QHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGS

Query:  FSYSRFFAIGLFRLLELANATEPSILEKV
        FSYSRFFAIGLFRLLELANATEPSILEK+
Subjt:  FSYSRFFAIGLFRLLELANATEPSILEKV

SwissProt top hitse value%identityAlignment
B0C3M8 Protein Thf12.2e-2743.31Show/hide
Query:  DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWA
        ++ TV++TK  F   + RP+ S+Y  V++EL+V+ HL+R    +RYDP+FALG  T +D+ MDGY  + D++AIF A  KA   DP Q + D Q+L E A
Subjt:  DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWA

Query:  RSQSAASLVEF---ASKEG--EVESILKDIAERAASKGSFSYSRFFAIGLFRLLELA
        +S+SA  ++++   A+  G  E++  L++IA+       F YSR FAIGLF LLEL+
Subjt:  RSQSAASLVEF---ASKEG--EVESILKDIAERAASKGSFSYSRFFAIGLFRLLELA

Q116P5 Protein Thf15.5e-2642.76Show/hide
Query:  TVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ
        TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +D+ M GY   ED+ +IF A I+   EDP +YR DA+ LE+ A   
Subjt:  TVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ

Query:  SAASLVEF--ASKEGEVESILKDIAERAASKGSFSYSRFFAIGLFRLLELAN
        SA+ ++ +   SK  +    L+D     +    F YSR FAIGLF LLE+ +
Subjt:  SAASLVEF--ASKEGEVESILKDIAERAASKGSFSYSRFFAIGLFRLLELAN

Q7XAB8 Protein THYLAKOID FORMATION1, chloroplastic7.8e-8166.96Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHC-MSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIV
        MAAV SVSFS ++Q ++R+S V S+RS+    D FRFRS+  +    VR+S+ +SR V+HC  S+  D+ TVA+TKL FL AYKRPIP++YN+VLQELIV
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHC-MSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKG
        QQHL RYK++Y+YDPVFALGFVTVYDQLM+GYPS+EDR AIF+AYI+AL EDPEQYR DAQKLEEWAR+Q+A +LV+F+SKEGE+E+I KDIA+RA +K 
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKG

Query:  SFSYSRFFAIGLFRLLELANATEPSILEKV
         F YSR FA+GLFRLLELAN T+P+ILEK+
Subjt:  SFSYSRFFAIGLFRLLELANATEPSILEKV

Q84PB7 Protein THYLAKOID FORMATION1, chloroplastic1.8e-7264.78Show/hide
Query:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDV-TTVAETKLNFLKAYKRPIPSIYNSVLQELIV
        MAA++S+ F+ + + +D R   PS  + A+         SV             SR V+ C++T  DV  TVAETK+NFLK+YKRPI SIY++VLQEL+V
Subjt:  MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDV-TTVAETKLNFLKAYKRPIPSIYNSVLQELIV

Query:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKG
        QQHLMRYK TY+YD VFALGFVTVYDQLM+GYPS+EDR+AIF+AYI ALNEDPEQYR DAQK+EEWARSQ+  SLVEF+SK+GE+E+ILKDI+ERA  KG
Subjt:  QQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKG

Query:  SFSYSRFFAIGLFRLLELANATEPSILEKV
        SFSYSRFFA+GLFRLLELANATEP+IL+K+
Subjt:  SFSYSRFFAIGLFRLLELANATEPSILEKV

Q9SKT0 Protein THYLAKOID FORMATION 1, chloroplastic1.3e-7566.67Show/hide
Query:  AVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGT-DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQ
        A++S+SF  + Q SD+ S   S+R LAS       R    +    + + S +S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYN+VLQELIVQQ
Subjt:  AVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGT-DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGSF
        HLMRYK+TYRYDPVFALGFVTVYDQLM+GYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQ++ASLV+F+SKEG++E++LKDIA RA SK  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGSF

Query:  SYSRFFAIGLFRLLELANATEPSILEKV
        SYSRFFA+GLFRLLELA+AT+P++L+K+
Subjt:  SYSRFFAIGLFRLLELANATEPSILEKV

Arabidopsis top hitse value%identityAlignment
AT2G20890.1 photosystem II reaction center PSB29 protein9.2e-7766.67Show/hide
Query:  AVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGT-DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQ
        A++S+SF  + Q SD+ S   S+R LAS       R    +    + + S +S+ +IHCMS  T DV  V+ETK  FLKAYKRPIPSIYN+VLQELIVQQ
Subjt:  AVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGT-DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQ

Query:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGSF
        HLMRYK+TYRYDPVFALGFVTVYDQLM+GYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQ++ASLV+F+SKEG++E++LKDIA RA SK  F
Subjt:  HLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGSF

Query:  SYSRFFAIGLFRLLELANATEPSILEKV
        SYSRFFA+GLFRLLELA+AT+P++L+K+
Subjt:  SYSRFFAIGLFRLLELANATEPSILEKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTGTTAATTCCGTGTCATTCTCAACGGTTAGTCAGTTTTCTGATCGAAGGTCGCCGGTTCCGTCGGCTCGTTCACTCGCCTCGAATTTCGACGGGTTTCGTTT
CCGTTCTAGTGTTTTCTATCATCATTCGGGAGTTCGAACCTCGAGTTTCAGTTCTCGCTTGGTCATTCATTGCATGTCCACCGGAACAGATGTGACGACTGTAGCTGAGA
CTAAATTGAACTTTCTAAAGGCGTATAAACGACCTATCCCTAGCATATACAACTCTGTTTTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACGTAC
CGTTATGATCCTGTTTTCGCCCTCGGTTTTGTTACTGTATATGATCAGCTTATGGATGGGTACCCTAGCGATGAGGATCGGGAGGCCATTTTCCAAGCCTACATTAAGGC
GCTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGTCTGCAGCTTCATTGGTTGAATTTGCATCAAAAGAAGGAGAAG
TTGAGAGTATTTTGAAAGACATTGCAGAACGAGCTGCGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGACTCCTCGAATTGGCAAATGCT
ACTGAACCCAGTATCTTGGAAAAGGTTTATTTTATCTTCCTTTTTGTGATATAG
mRNA sequenceShow/hide mRNA sequence
TCTCAGTTTCGTCCTCTCTTGTTTCTTTCGTTTCTTTTTTTTTTTTTCTTCTCCCAGAAAAAATCTTCGACATGAAATCCTGTTTCTCTGGAAGTTCGTAAGATTTGCAA
ATTTCTTCTCATTCTTCTGCAATGGCGGCTGTTAATTCCGTGTCATTCTCAACGGTTAGTCAGTTTTCTGATCGAAGGTCGCCGGTTCCGTCGGCTCGTTCACTCGCCTC
GAATTTCGACGGGTTTCGTTTCCGTTCTAGTGTTTTCTATCATCATTCGGGAGTTCGAACCTCGAGTTTCAGTTCTCGCTTGGTCATTCATTGCATGTCCACCGGAACAG
ATGTGACGACTGTAGCTGAGACTAAATTGAACTTTCTAAAGGCGTATAAACGACCTATCCCTAGCATATACAACTCTGTTTTGCAAGAGTTGATTGTGCAGCAGCATTTG
ATGAGGTATAAGAGGACGTACCGTTATGATCCTGTTTTCGCCCTCGGTTTTGTTACTGTATATGATCAGCTTATGGATGGGTACCCTAGCGATGAGGATCGGGAGGCCAT
TTTCCAAGCCTACATTAAGGCGCTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGTCTGCAGCTTCATTGGTTGAAT
TTGCATCAAAAGAAGGAGAAGTTGAGAGTATTTTGAAAGACATTGCAGAACGAGCTGCGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGA
CTCCTCGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGGTTTATTTTATCTTCCTTTTTGTGATATAGCTTTTAAGTAGATGACGATTTTGAGGTGATAGTAT
ATCCAGCCTATTATTGCTAAATTAAACATTTACATGATAGCCAGCCCCTCCTCTCATAGTACTTTTACTTATAGATTATATTCTTTCTCTGAGTGCTGCCATTCGTAAAG
TGTAATTGTGTTTCTTGCTCATTATTATCTTTTTCACTCATTAGAGAAATTGGATCCTGATGGAAGAGTGCTTAACCTTATTCAGTTCGATAGAATTAGGTTCAAGGTAC
TCTTTCCTGCATAAGTCTCCATTTTACTTTTCTAGGTAACACCAATCTTTTTGTAGTATTTGGTTGAAGAGAAATATGAGCTAACCATTTTTTGGAACATTCCCCTTTGC
TCGGGACAATCCCACATACACAATTACAGTGTTTTTGTGGGATTTCCTTCTCCTTTGTGGAGATTGTACTGATCTTGTGGATGTAGTTTATGCTCTTTAGTAGTCTCAAC
TGGTCCTTTTCATTCAGTTTATTAGCTCTTGTGAAGGGAAACATACATATGTTCCTCTGCCCCCATCAACAACAGTTATCATGAACTATTGCTTTAATTGCCCATCTCTA
TTTAGCTTGTCTATCACCAAACATACTTTTATTTTTCTAAAACAGCTAAAGCCATGTAATGGAGATTTTACCAAGTGTTGACATCTAACACTTTGCTGCTTCTGAACAGC
TCTGTGCCGCTTTAAATGTTGACAAAAAAAGTGTGGACCGAGACCTTGATGTATACCGCAACCTGCTTTCGAAGTTGGTTCAGGCGAAAGAGCTCCTAAAGGAATACGTT
GATAGGTAAGAGATTTTTAAGTTCATGGTTCCTGGCCTGGTAGATTAACATACCCACAATCTGCTCTTTCACACCACCATATTTTCTTGGATTTTATGAACCCATAATCG
TTTTGAAAACAATCCAGTATTTGTCAAATCCTCTACAACTCACCGTCATATCATTTATATTTACTAGATATCCGCCACTTGGCTGTGTTTTAAGATGAATGTGATGTTTT
ATGAAAGTTTAAAT
Protein sequenceShow/hide protein sequence
MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTY
RYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESILKDIAERAASKGSFSYSRFFAIGLFRLLELANA
TEPSILEKVYFIFLFVI