CuGenDBv2

Tan0008084 (gene) of Snake gourd v1 genome

Gene ID: Tan0008084
Organism: Trichosanthes anguina (Snake gourd v1)
Description: photosystem II 5 kDa protein, chloroplastic
Genome location: LG07:12130932..12131593
RNA-Seq Expression: Tan0008084
Synteny: Tan0008084
Gene Ontology terms: NA
InterPro domains: IPR040296 - Photosystem II 5kDa protein, chloroplastic
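
The genome location above is written in a `sequence:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so), the string can be split and the span computed as follows. `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'LG07:12130932..12131593' into (sequence_id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seq_id, start, end = parse_location("LG07:12130932..12131593")
print(seq_id, start, end, span_length(start, end))
```

Under that assumption the gene spans 662 bp on LG07.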


Homology
GenBank top hits (e-value; % identity; alignment)
XP_004143237.1 photosystem II 5 kDa protein, chloroplastic [Cucumis sativus]; e-value 2.1e-30; identity 77.36%
Query:  MASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMPT
        MASMAMTASFLP TA K PS++ RR L+V KAS     SN N+++KNVKVESRQGRRELVAAAVTVAAA+LAK AMADEP  G+PEAKQKYA ICVTMPT
Subjt:  MASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMPT

Query:  ARICRK
        ARICRK
Subjt:  ARICRK

XP_008449703.1 PREDICTED: photosystem II 5 kDa protein, chloroplastic [Cucumis melo]; e-value 4.0e-34; identity 70.59%
Query:  INYILF---PLTHLLLSTISPSLTNTHSLKMASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAAS
        INYILF   P+  L+ +  +P+ +N   L MASMAMTASFLP TA K PSA+ RRAL+V KAS   + SN NL++KNVKVESRQGRRELVAAAVTVAAA+
Subjt:  INYILF---PLTHLLLSTISPSLTNTHSLKMASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAAS

Query:  LAKVAMADEPKPGTPEAKQKYAVICVTMPTARICRK
        LAK AMADEP  G+PEAKQKYA ICVTMPTARICRK
Subjt:  LAKVAMADEPKPGTPEAKQKYAVICVTMPTARICRK

XP_022148791.1 photosystem II 5 kDa protein, chloroplastic [Momordica charantia]; e-value 9.6e-28; identity 73.58%
Query:  MASMAMTASFLPVTAKHPSASTRRALVVVKASV---DGSNTNLDI-KNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMP
        MAS A+ ASFLP+TAKHPSA+TRRAL+V KAS    D S  NL++ ++V+VES QGRRELVAAAVTVAAA+LAK AMADEPK G+ EAKQKYA +CVTMP
Subjt:  MASMAMTASFLPVTAKHPSASTRRALVVVKASV---DGSNTNLDI-KNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMP

Query:  TARICR
        TARICR
Subjt:  TARICR

XP_022944588.1 photosystem II 5 kDa protein, chloroplastic [Cucurbita moschata]; e-value 4.8e-27; identity 72.73%
Query:  MASMAMTASFLPVTAKHPS-ASTRRALVVVKASV------DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICV
        MASM++TASFLP TA  PS A+ RRAL V KAS         SN NL+IKNVKVES QGRRELVAAAVTVAAA+ AK AMADEP  G+ EAKQKYA ICV
Subjt:  MASMAMTASFLPVTAKHPS-ASTRRALVVVKASV------DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICV

Query:  TMPTARICRK
        TMPTARICRK
Subjt:  TMPTARICRK

XP_038901593.1 photosystem II 5 kDa protein, chloroplastic [Benincasa hispida]; e-value 1.1e-31; identity 81.13%
Query:  MASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMPT
        MASMAMTASFLP TA K PS + RRALVV KAS   D SN NL+IKNVKVE+RQGRRELVAAAVTVAAA+LAK AMADEP  G+PEAKQKYA ICVTMPT
Subjt:  MASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMPT

Query:  ARICRK
        ARICRK
Subjt:  ARICRK

TrEMBL top hits (e-value; % identity; alignment)
A0A0A0KIB2 Uncharacterized protein; e-value 1.0e-30; identity 77.36%
Query:  MASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMPT
        MASMAMTASFLP TA K PS++ RR L+V KAS     SN N+++KNVKVESRQGRRELVAAAVTVAAA+LAK AMADEP  G+PEAKQKYA ICVTMPT
Subjt:  MASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMPT

Query:  ARICRK
        ARICRK
Subjt:  ARICRK

A0A1S3BML8 photosystem II 5 kDa protein, chloroplastic; e-value 2.0e-34; identity 70.59%
Query:  INYILF---PLTHLLLSTISPSLTNTHSLKMASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAAS
        INYILF   P+  L+ +  +P+ +N   L MASMAMTASFLP TA K PSA+ RRAL+V KAS   + SN NL++KNVKVESRQGRRELVAAAVTVAAA+
Subjt:  INYILF---PLTHLLLSTISPSLTNTHSLKMASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAAS

Query:  LAKVAMADEPKPGTPEAKQKYAVICVTMPTARICRK
        LAK AMADEP  G+PEAKQKYA ICVTMPTARICRK
Subjt:  LAKVAMADEPKPGTPEAKQKYAVICVTMPTARICRK

A0A5A7TID9 Photosystem II 5 kDa protein; e-value 2.0e-34; identity 70.59%
Query:  INYILF---PLTHLLLSTISPSLTNTHSLKMASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAAS
        INYILF   P+  L+ +  +P+ +N   L MASMAMTASFLP TA K PSA+ RRAL+V KAS   + SN NL++KNVKVESRQGRRELVAAAVTVAAA+
Subjt:  INYILF---PLTHLLLSTISPSLTNTHSLKMASMAMTASFLPVTA-KHPSASTRRALVVVKASV--DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAAS

Query:  LAKVAMADEPKPGTPEAKQKYAVICVTMPTARICRK
        LAK AMADEP  G+PEAKQKYA ICVTMPTARICRK
Subjt:  LAKVAMADEPKPGTPEAKQKYAVICVTMPTARICRK

A0A6J1D6G0 photosystem II 5 kDa protein, chloroplastic; e-value 4.7e-28; identity 73.58%
Query:  MASMAMTASFLPVTAKHPSASTRRALVVVKASV---DGSNTNLDI-KNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMP
        MAS A+ ASFLP+TAKHPSA+TRRAL+V KAS    D S  NL++ ++V+VES QGRRELVAAAVTVAAA+LAK AMADEPK G+ EAKQKYA +CVTMP
Subjt:  MASMAMTASFLPVTAKHPSASTRRALVVVKASV---DGSNTNLDI-KNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMP

Query:  TARICR
        TARICR
Subjt:  TARICR

A0A6J1J874 photosystem II 5 kDa protein, chloroplastic; e-value 2.3e-27; identity 72.73%
Query:  MASMAMTASFLPVTAKHPS-ASTRRALVVVKASV------DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICV
        MASM++TASFLP TA  PS A+ RRAL V KAS         SN NL+IKNVKVES QGRRELVAAAVTVAAA+ AK AMADEP  G+ EAKQKYA ICV
Subjt:  MASMAMTASFLPVTAKHPS-ASTRRALVVVKASV------DGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICV

Query:  TMPTARICRK
        TMPTARICRK
Subjt:  TMPTARICRK

SwissProt top hits (e-value; % identity; alignment)
B3EWI4 Photosystem II 5 kDa protein, chloroplastic; e-value 3.3e-15; identity 54.21%
Query:  MASMAMTASFL----PVTAKHPSASTRRALVVVKASVDGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMAD-EPKPGTPEAKQKYAVICVTM
        MAS+ M +SFL       AK PSA+ RR +V+VKA  +G N N+ I   +     GRREL  A    AA S+AK AMAD EPK GTPEAK+KY+ +CVT 
Subjt:  MASMAMTASFL----PVTAKHPSASTRRALVVVKASVDGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMAD-EPKPGTPEAKQKYAVICVTM

Query:  PTARICR
        PTARICR
Subjt:  PTARICR

P31336 Photosystem II 5 kDa protein, chloroplastic; e-value 2.9e-11; identity 47.62%
Query:  MASMAMTASFLPVT--AKHPSASTRRALVVVKASVDGSNTNLDIKNV-KVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMPT
        MAS+ MT SFL  T   K     T+R LVV  A+      ++ +    K E   GRRE++ AA   A  S+A VA A EPK G+ EAK+ YA +CVTMPT
Subjt:  MASMAMTASFLPVT--AKHPSASTRRALVVVKASVDGSNTNLDIKNV-KVESRQGRRELVAAAVTVAAASLAKVAMADEPKPGTPEAKQKYAVICVTMPT

Query:  ARICR
        ARICR
Subjt:  ARICR

Q39195 Photosystem II 5 kDa protein, chloroplastic; e-value 4.7e-17; identity 53.85%
Query:  MASMAMTASFLPVTAKHPSASTRRALVVVKASVDGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMA--DEPKPGTPEAKQKYAVICVTMPTA
        MASM MTA+F P  AK PSA+  R L VV+AS   +  +L++K  +  S   RR+L+  A   A  SLAKVAMA  +EPK GT  AK+KYA +CVTMPTA
Subjt:  MASMAMTASFLPVTAKHPSASTRRALVVVKASVDGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMA--DEPKPGTPEAKQKYAVICVTMPTA

Query:  RICR
        +ICR
Subjt:  RICR

Arabidopsis top hits (e-value; % identity; alignment)
AT1G51400.1 Photosystem II 5 kD protein; e-value 8.7e-19; identity 55.66%
Query:  MASMAMTASFLPVTAKHP---SASTRRALVVVKASVDGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMA-DEPKPGTPEAKQKYAVICVTMP
        MASM MT+SFLP  +  P   S+++RR+L VVKAS   + T+L+ K  + +S + RR+LV  A   A  SLAKVAMA DEPK GT  AK+KYA +CVTMP
Subjt:  MASMAMTASFLPVTAKHP---SASTRRALVVVKASVDGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMA-DEPKPGTPEAKQKYAVICVTMP

Query:  TARICR
        TA+ICR
Subjt:  TARICR

AT3G21055.1 photosystem II subunit T; e-value 3.3e-18; identity 53.85%
Query:  MASMAMTASFLPVTAKHPSASTRRALVVVKASVDGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMA--DEPKPGTPEAKQKYAVICVTMPTA
        MASM MTA+F P  AK PSA+  R L VV+AS   +  +L++K  +  S   RR+L+  A   A  SLAKVAMA  +EPK GT  AK+KYA +CVTMPTA
Subjt:  MASMAMTASFLPVTAKHPSASTRRALVVVKASVDGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMA--DEPKPGTPEAKQKYAVICVTMPTA

Query:  RICR
        +ICR
Subjt:  RICR


Sequences
CDS sequence
ATGAATTGGGCACAATCCAAAACAATAAACTATATCCTCTTTCCCCTCACCCACTTACTACTGTCAACTATCTCTCCATCTCTTACAAACACACATTCCCTAAAAATGGC
ATCCATGGCTATGACAGCCTCCTTCCTCCCTGTCACGGCCAAGCACCCCTCCGCCTCCACTCGAAGAGCCCTCGTCGTCGTCAAGGCCTCCGTCGATGGCTCGAATACCA
ACTTGGATATCAAGAATGTGAAGGTTGAGAGCAGGCAAGGGCGAAGGGAGTTGGTGGCAGCCGCCGTGACAGTGGCAGCGGCGAGCTTGGCAAAGGTTGCAATGGCGGAC
GAGCCTAAGCCTGGCACCCCTGAAGCCAAGCAGAAGTATGCTGTCATTTGTGTAACAATGCCCACTGCACGTATTTGCCGCAAGTGA
mRNA sequence
TGATGTTTCATTGAAGAATCAACATGAACCCCTTGAATGAATTGGGCACAATCCAAAACAATAAACTATATCCTCTTTCCCCTCACCCACTTACTACTGTCAACTATCTC
TCCATCTCTTACAAACACACATTCCCTAAAAATGGCATCCATGGCTATGACAGCCTCCTTCCTCCCTGTCACGGCCAAGCACCCCTCCGCCTCCACTCGAAGAGCCCTCG
TCGTCGTCAAGGCCTCCGTCGATGGCTCGAATACCAACTTGGATATCAAGAATGTGAAGGTTGAGAGCAGGCAAGGGCGAAGGGAGTTGGTGGCAGCCGCCGTGACAGTG
GCAGCGGCGAGCTTGGCAAAGGTTGCAATGGCGGACGAGCCTAAGCCTGGCACCCCTGAAGCCAAGCAGAAGTATGCTGTCATTTGTGTAACAATGCCCACTGCACGTAT
TTGCCGCAAGTGATTTCTAGAGATGGTCTCTCTCTGTGTTCTGTCTTCTTTCTTTCTCTCCTTCTCTTTAAACTCTTTCGTTTGGTTTCAAAATTCAGAGTTTGATTGTA
AAATCTGTAAAAGAAAGCTATGAATCTCTTTTCGTGTCTTTCTCTGATTTTACTTCAGTAATTTTTCACATTTTATCATCATAACACTTAAATAAACACCATATTTGAAT
AA
Protein sequence
MNWAQSKTINYILFPLTHLLLSTISPSLTNTHSLKMASMAMTASFLPVTAKHPSASTRRALVVVKASVDGSNTNLDIKNVKVESRQGRRELVAAAVTVAAASLAKVAMAD
EPKPGTPEAKQKYAVICVTMPTARICRK
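
The CDS and protein sequences above can be cross-checked by translating one into the other. The sketch below is a minimal illustration that assumes the standard genetic code, which the record does not name explicitly; `translate` is a hypothetical helper, not a CuGenDB function. It translates the first six codons of the CDS and the final wrapped CDS line, which in this record happens to start on a codon boundary.

```python
from itertools import product

# Standard genetic code (assumed), listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

# First six codons of the CDS as listed in this record.
cds_start = "ATGAATTGGGCACAATCC"
# Final wrapped line of the CDS as listed in this record.
cds_last_line = (
    "GAGCCTAAGCCTGGCACCCCTGAAGCCAAGCAGAAGTATGCTGTCATTTGTGTAACAATGCCCACTGCACGTATTTGCCGCAAGTGA"
)

print(translate(cds_start))      # start of the protein
print(translate(cds_last_line))  # end of the protein plus the stop codon
```

The two translations give `MNWAQS` and `EPKPGTPEAKQKYAVICVTMPTARICRK*`, which agree with the beginning and the last line of the protein sequence listed above.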