; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016004 (gene) of Snake gourd v1 genome

Gene IDTan0016004
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPSII 6.1 kDa protein
Genome locationLG08:3871609..3872878
RNA-Seq ExpressionTan0016004
SyntenyTan0016004
Gene Ontology termsGO:0042549 - photosystem II stabilization (biological process)
GO:0009523 - photosystem II (cellular component)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009806 - Photosystem II PsbW, class 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456049.1 PREDICTED: photosystem II reaction center W protein, chloroplastic-like isoform X2 [Cucumis melo]2.3e-4779.7Show/hide
Query:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVA----AAVAAAGPAVALVDERLSTEGTGLPFGLSNNI
        MASTLL HS L+LKGS+K+LPS TMAA RLP LQKM+ G IKC++EEKPEGK S+ESM+  TVA    AA+AAAGPAVALVDER+STEGTGLPFGLSNN+
Subjt:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVA----AAVAAAGPAVALVDERLSTEGTGLPFGLSNNI

Query:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        LGWILAG+FGLIWALYFVYASSL+EDEDSGLSL
Subjt:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

XP_022937622.1 photosystem II reaction center W protein, chloroplastic-like [Cucurbita moschata]7.1e-4981.95Show/hide
Query:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAA----AVAAAGPAVALVDERLSTEGTGLPFGLSNNI
        MASTLLSHSG+ LKGS K+LP P MAA RLP +QKM+KGGIKC+LEEK EGK SKES+IS TVAA     VA AGPAVALVDERLSTEGTGLPFGLSNN+
Subjt:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAA----AVAAAGPAVALVDERLSTEGTGLPFGLSNNI

Query:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        LGWILAG+FG IWALYFVYASSLEEDEDSGLSL
Subjt:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

XP_022966002.1 photosystem II reaction center W protein, chloroplastic-like [Cucurbita maxima]1.7e-4781.95Show/hide
Query:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAA----AVAAAGPAVALVDERLSTEGTGLPFGLSNNI
        MASTLLSHSG+ LKGS K+LP P MAA RL  +QK +KGGIKCRLEEK E KSSKESMIS TVAA     VA AGPAVALVDERLSTEGTGLPFGLSNN+
Subjt:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAA----AVAAAGPAVALVDERLSTEGTGLPFGLSNNI

Query:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        LGWILAG+FG IWALYFVYASSLEEDEDSGLSL
Subjt:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

XP_023521133.1 photosystem II reaction center W protein, chloroplastic-like [Cucurbita pepo subsp. pepo]1.2e-4882.71Show/hide
Query:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAA----AVAAAGPAVALVDERLSTEGTGLPFGLSNNI
        MASTLLSHSG+ LKGS K+LP P MAA RL  +QKM+KGGIKCRLEEK EGK SKESMIS TVAA     VA AGPAVALVDERLSTEGTGLPFGLSNN+
Subjt:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAA----AVAAAGPAVALVDERLSTEGTGLPFGLSNNI

Query:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        LGWILAG+FG IWALYFVYASSLEEDEDSGLSL
Subjt:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

XP_038890119.1 photosystem II reaction center W protein, chloroplastic-like [Benincasa hispida]1.1e-4981.95Show/hide
Query:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVA----AAVAAAGPAVALVDERLSTEGTGLPFGLSNNI
        MASTLLSHS L+LKGS+K+LPSP +AA RLP LQKM+KGGIKCR+EEKPEGK S+ES++S T+A    AA+AAAGPAVALVDERLSTEGTGLPFGLSNN+
Subjt:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVA----AAVAAAGPAVALVDERLSTEGTGLPFGLSNNI

Query:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        LGWILAGVF LIWALYFVYASSL+EDEDSGLSL
Subjt:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

TrEMBL top hitse value%identityAlignment
A0A0A0LJA2 PSII 6.1 kDa protein3.5e-4679.85Show/hide
Query:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKS-SKESMISATVA----AAVAAAGPAVALVDERLSTEGTGLPFGLSNN
        MASTLL HS L+LKGS+K+L SPTMAA RLP LQKM+KGGIKC +EEKPEGK  S+ESM+  TV     AA+AAAGPAVALVDERLSTEGTGLPFGLSNN
Subjt:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKS-SKESMISATVA----AAVAAAGPAVALVDERLSTEGTGLPFGLSNN

Query:  ILGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        +LGWILA VF LIWALYFVYASSL+EDEDSGLSL
Subjt:  ILGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

A0A1S3C2B0 PSII 6.1 kDa protein1.1e-4779.7Show/hide
Query:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVA----AAVAAAGPAVALVDERLSTEGTGLPFGLSNNI
        MASTLL HS L+LKGS+K+LPS TMAA RLP LQKM+ G IKC++EEKPEGK S+ESM+  TVA    AA+AAAGPAVALVDER+STEGTGLPFGLSNN+
Subjt:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVA----AAVAAAGPAVALVDERLSTEGTGLPFGLSNNI

Query:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        LGWILAG+FGLIWALYFVYASSL+EDEDSGLSL
Subjt:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

A0A6J1CNY5 PSII 6.1 kDa protein3.3e-4477.44Show/hide
Query:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISAT----VAAAVAAAGPAVALVDERLSTEGTGLPFGLSNNI
        MASTLLSHSGL LKGS K+ P P MAA RL  + KM+KGGIKC++EE+PE KSS ES +S T    VAA VAAAGPA ALVDERLSTEGTGLPFGLSNN+
Subjt:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISAT----VAAAVAAAGPAVALVDERLSTEGTGLPFGLSNNI

Query:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        L WILAGVF LIWALYFVYASSL+EDEDSGLSL
Subjt:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

A0A6J1FBR0 PSII 6.1 kDa protein3.4e-4981.95Show/hide
Query:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAA----AVAAAGPAVALVDERLSTEGTGLPFGLSNNI
        MASTLLSHSG+ LKGS K+LP P MAA RLP +QKM+KGGIKC+LEEK EGK SKES+IS TVAA     VA AGPAVALVDERLSTEGTGLPFGLSNN+
Subjt:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAA----AVAAAGPAVALVDERLSTEGTGLPFGLSNNI

Query:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        LGWILAG+FG IWALYFVYASSLEEDEDSGLSL
Subjt:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

A0A6J1HLU7 PSII 6.1 kDa protein8.4e-4881.95Show/hide
Query:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAA----AVAAAGPAVALVDERLSTEGTGLPFGLSNNI
        MASTLLSHSG+ LKGS K+LP P MAA RL  +QK +KGGIKCRLEEK E KSSKESMIS TVAA     VA AGPAVALVDERLSTEGTGLPFGLSNN+
Subjt:  MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAA----AVAAAGPAVALVDERLSTEGTGLPFGLSNNI

Query:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        LGWILAG+FG IWALYFVYASSLEEDEDSGLSL
Subjt:  LGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

SwissProt top hitse value%identityAlignment
Q39194 Photosystem II reaction center W protein, chloroplastic5.1e-2658.12Show/hide
Query:  ILPSPTMAARR----LPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAAAVAA--AGPAVALVDERLSTEGTGLPFGLSNNILGWILAGVFGLIWALY
        +L  PT+A       LP + K +KGG++C +E K    S   + +SA   AA+ A  + PA+ALVDER+STEGTGLPFGLSNN+LGWIL GVFGLIW  +
Subjt:  ILPSPTMAARR----LPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAAAVAA--AGPAVALVDERLSTEGTGLPFGLSNNILGWILAGVFGLIWALY

Query:  FVYASSLEEDEDSGLSL
        FVY SSLEEDE+SGLSL
Subjt:  FVYASSLEEDEDSGLSL

Q41387 Photosystem II reaction center W protein, chloroplastic1.0e-2653.73Show/hide
Query:  ASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPE----GKSSKESMISATVAAAVAA--AGPAVALVDERLSTEGTGLPFGLSNN
        +++L++ + LV      +  SP +    LP + K  K  + C +E KP       ++ +SM ++ +AAA AA  + PA+ALVDER+STEGTGLPFGLSNN
Subjt:  ASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPE----GKSSKESMISATVAAAVAA--AGPAVALVDERLSTEGTGLPFGLSNN

Query:  ILGWILAGVFGLIWALYFVYASSLEEDEDSGLSL
        +LGWIL GVFGLIWALYFVYAS LEEDE+SGLSL
Subjt:  ILGWILAGVFGLIWALYFVYASSLEEDEDSGLSL

Q5ZBY9 Photosystem II reaction center W protein, chloroplastic1.4e-2051.55Show/hide
Query:  LPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAAAVAAAGPAVALVDERLSTEGTGLPFGLSNNILGWILAGVFGLIWALYFVYASSLEEDEDSG
        LP+++      ++C   ++    ++      A++ A  A A PA+ALVDER+STEGTGL  GLSNN+LGWIL GVFGLIW+LY +Y S LEEDE+SG
Subjt:  LPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAAAVAAAGPAVALVDERLSTEGTGLPFGLSNNILGWILAGVFGLIWALYFVYASSLEEDEDSG

Q9SPI9 Photosystem II reaction center W protein, chloroplastic2.1e-1138.18Show/hide
Query:  SPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAAAVAAAGPAVALVDERLSTEGTGLPFGLSNNILGWILAGVFGLIWALYFVYASSLE-
        +P +AA+ +          + C+        + K +M+S     A  AA PA ALVDER++ +GTG PFG+++ +LGW+L GVFG +WA++F+    L  
Subjt:  SPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAAAVAAAGPAVALVDERLSTEGTGLPFGLSNNILGWILAGVFGLIWALYFVYASSLE-

Query:  -EDEDSGLSL
         ED D GL L
Subjt:  -EDEDSGLSL

Arabidopsis top hitse value%identityAlignment
AT2G30570.1 photosystem II reaction center W3.6e-2758.12Show/hide
Query:  ILPSPTMAARR----LPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAAAVAA--AGPAVALVDERLSTEGTGLPFGLSNNILGWILAGVFGLIWALY
        +L  PT+A       LP + K +KGG++C +E K    S   + +SA   AA+ A  + PA+ALVDER+STEGTGLPFGLSNN+LGWIL GVFGLIW  +
Subjt:  ILPSPTMAARR----LPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAAAVAA--AGPAVALVDERLSTEGTGLPFGLSNNILGWILAGVFGLIWALY

Query:  FVYASSLEEDEDSGLSL
        FVY SSLEEDE+SGLSL
Subjt:  FVYASSLEEDEDSGLSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTCTGCTTTCTCACTCTGGGCTTGTACTCAAAGGATCAGCCAAGATTCTTCCATCGCCAACAATGGCAGCTCGCAGGTTGCCTGAGCTGCAAAAGATGGA
GAAGGGAGGGATTAAGTGCAGGTTGGAAGAGAAGCCAGAAGGAAAGAGCTCAAAAGAAAGCATGATTTCGGCGACGGTGGCGGCTGCGGTGGCAGCAGCGGGTCCGGCGG
TGGCGCTGGTGGATGAGAGGTTGAGCACAGAAGGGACAGGGCTGCCATTTGGTTTAAGCAACAACATTCTGGGATGGATTCTGGCTGGGGTGTTTGGTTTAATTTGGGCT
CTCTATTTTGTTTATGCTTCATCCTTGGAGGAAGATGAAGATTCTGGTTTGTCACTCTGA
mRNA sequenceShow/hide mRNA sequence
TTTTCTTTTAACTAACCATTAGAAGAAAAATCTGTAGATCTGAGTGTTACAGCCAAACAAAGCTTTTCCCTTCCCCACCTGTAGCAGGTTTTTTTGGCCGGAAAAAATGG
CTTCCACTCTGCTTTCTCACTCTGGGCTTGTACTCAAAGGATCAGCCAAGATTCTTCCATCGCCAACAATGGCAGCTCGCAGGTTGCCTGAGCTGCAAAAGATGGAGAAG
GGAGGGATTAAGTGCAGGTTGGAAGAGAAGCCAGAAGGAAAGAGCTCAAAAGAAAGCATGATTTCGGCGACGGTGGCGGCTGCGGTGGCAGCAGCGGGTCCGGCGGTGGC
GCTGGTGGATGAGAGGTTGAGCACAGAAGGGACAGGGCTGCCATTTGGTTTAAGCAACAACATTCTGGGATGGATTCTGGCTGGGGTGTTTGGTTTAATTTGGGCTCTCT
ATTTTGTTTATGCTTCATCCTTGGAGGAAGATGAAGATTCTGGTTTGTCACTCTGAATCAATCACAACATTTCTTGATAAAAAAATTGAATATGATTTTAAATGGTCAGT
AAACAAGTATATATATGAAGCATAGCTAAATAATGAGCTTGATCAAAACATATTTATTATC
Protein sequenceShow/hide protein sequence
MASTLLSHSGLVLKGSAKILPSPTMAARRLPELQKMEKGGIKCRLEEKPEGKSSKESMISATVAAAVAAAGPAVALVDERLSTEGTGLPFGLSNNILGWILAGVFGLIWA
LYFVYASSLEEDEDSGLSL