; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009684 (gene) of Snake gourd v1 genome

Gene IDTan0009684
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionphotosystem II core complex proteins psbY, chloroplastic
Genome locationLG02:90852180..90852794
RNA-Seq ExpressionTan0009684
SyntenyTan0009684
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0034219 - carbohydrate transmembrane transport (biological process)
GO:0045454 - cell redox homeostasis (biological process)
GO:0009523 - photosystem II (cellular component)
GO:0009534 - chloroplast thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030145 - manganese ion binding (molecular function)
GO:0051119 - sugar transmembrane transporter activity (molecular function)
InterPro domainsIPR009388 - Photosystem II PsbY
IPR038760 - Photosystem II PsbY, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598253.1 Photosystem II core complex proteins psbY, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]4.3e-8588.24Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL
        MAA+MATMAVLSVKCTSINST+  N PK IPKPISLLSLQNLPKGLIS K+ E+ NLS FLS TAIAGAVFS LS SDPAFAAQQIA+IAADGDNRG+AL
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR
        LLPLIPAIAWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFM  APDASASE+AMI ADASSSD RGQLLLFVV PAILWVLYNILQPALNQLNR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR

Query:  MRSE
        MRSE
Subjt:  MRSE

KGN62604.2 hypothetical protein Csa_018753 [Cucumis sativus]7.9e-8790.2Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL
        MAA+MATMAVLSVKCTSINSTKT N  K IPKPISLLSLQNLPKGLIS K+ +N NLSTFLS TAIAGAVF+TL  SDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR
        LLPLIPA+AWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFM  APDASASEIAMI ADASSSD RGQLLLFVVAPAILWVLYNILQPALNQLNR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR

Query:  MRSE
        MRSE
Subjt:  MRSE

XP_004142959.3 photosystem II core complex proteins psbY, chloroplastic [Cucumis sativus]7.9e-8790.2Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL
        MAA+MATMAVLSVKCTSINSTKT N  K IPKPISLLSLQNLPKGLIS K+ +N NLSTFLS TAIAGAVF+TL  SDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR
        LLPLIPA+AWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFM  APDASASEIAMI ADASSSD RGQLLLFVVAPAILWVLYNILQPALNQLNR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR

Query:  MRSE
        MRSE
Subjt:  MRSE

XP_008444395.1 PREDICTED: photosystem II core complex proteins psbY, chloroplastic [Cucumis melo]6.1e-8790.69Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL
        MAA+MATMAVLSVKCTSINSTKT N  K IPKPISLLSLQNLPKGLIS K+ EN NLSTFLS TAIAGAVF+TL  SDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR
        LLPLIPAIAWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFM  APDASASEIAMI ADASSSD RGQLLLFVVAPAILWVLYNILQPALNQLNR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR

Query:  MRSE
        MRS+
Subjt:  MRSE

XP_038886078.1 LOW QUALITY PROTEIN: photosystem II core complex proteins psbY, chloroplastic-like [Benincasa hispida]4.2e-8890.69Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL
        MAA+MATMAVLSVKCTSINSTK  N PKPIPKPISLLSLQNLPKGL+S K+ +N NLST LS TAIAGAVFSTLS SDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR
        LLPLIPAIAWVLFNILQPALNQ+N+MRSDKGVIIGLGLGGLTASGFM  APDASASEIAMI ADASSSD RGQLLLFVVAPAILWVLYNILQPALNQLNR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR

Query:  MRSE
        MRSE
Subjt:  MRSE

TrEMBL top hitse value%identityAlignment
A0A0A0LKA1 Uncharacterized protein3.8e-8790.2Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL
        MAA+MATMAVLSVKCTSINSTKT N  K IPKPISLLSLQNLPKGLIS K+ +N NLSTFLS TAIAGAVF+TL  SDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR
        LLPLIPA+AWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFM  APDASASEIAMI ADASSSD RGQLLLFVVAPAILWVLYNILQPALNQLNR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR

Query:  MRSE
        MRSE
Subjt:  MRSE

A0A1S3B9R1 photosystem II core complex proteins psbY, chloroplastic2.9e-8790.69Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL
        MAA+MATMAVLSVKCTSINSTKT N  K IPKPISLLSLQNLPKGLIS K+ EN NLSTFLS TAIAGAVF+TL  SDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR
        LLPLIPAIAWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFM  APDASASEIAMI ADASSSD RGQLLLFVVAPAILWVLYNILQPALNQLNR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR

Query:  MRSE
        MRS+
Subjt:  MRSE

A0A5A7UYJ2 Photosystem II core complex proteins psbY2.9e-8790.69Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL
        MAA+MATMAVLSVKCTSINSTKT N  K IPKPISLLSLQNLPKGLIS K+ EN NLSTFLS TAIAGAVF+TL  SDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR
        LLPLIPAIAWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFM  APDASASEIAMI ADASSSD RGQLLLFVVAPAILWVLYNILQPALNQLNR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR

Query:  MRSE
        MRS+
Subjt:  MRSE

A0A6J1BQF3 photosystem II core complex proteins psbY, chloroplastic1.5e-8386.76Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL
        MAA+MATMAVLSVKCTSINS+K    PKPI  PISLLSLQNLPK LIS KA++NPNLSTFLS TAIAGAVFS  S SDPAFAAQQIAEIAA+GDNRGLAL
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR
        LLPLIPAIAWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGL AS FM  APDASA+EIAMI ADASSSD RGQLLLFV++PAILWVLYNILQPALNQLNR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR

Query:  MRSE
        MRSE
Subjt:  MRSE

F6GVJ9 Uncharacterized protein4.4e-7579.9Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL
        MAA++ATMA+L+ KC SINS K  N+ KP  KPISLLS+QNLPKGL + K++EN NLST L+GTAIAGA+FSTLS  DPA AAQQIAEI ADGDNRGLAL
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR
        LLP+IPAIAWVLFNILQPALNQLN+MRS KGVIIGLGLGGL ASGFM+  P ASASEIA + ADA+SSD RGQLLLFVV PAILWVLYNILQPALNQLNR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNR

Query:  MRSE
        MRSE
Subjt:  MRSE

SwissProt top hitse value%identityAlignment
O49347 Photosystem II core complex proteins psbY, chloroplastic1.7e-4756.52Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIA---ADGDNRG
        MAA+MAT    + KC S+N       P P PK    L  Q   K  IS      PN+S  ++ TA+AGAVFS+LS S+PA A QQIA++A   A  DNRG
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIA---ADGDNRG

Query:  LALLLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGL-GGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALN
        LALLLP++PAIAWVL+NILQPA+NQ+N+MR  KG+++GLG+ GGL ASG +   P+A A+      A A+SSD RGQLLL VV PA+LWVLYNILQPALN
Subjt:  LALLLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGL-GGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALN

Query:  QLNRMRS
        Q+N+MRS
Subjt:  QLNRMRS

P80470 Photosystem II core complex proteins psbY, chloroplastic6.8e-4957.01Show/hide
Query:  MAASMA-TMAVLSVKCTSINSTKTQNI-PKPIPKPISL----LSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAAD--
        MAA+MA TMAVL+ KC ++N+ KT +  PKP  KPISL    LS   LP G           LS  ++  AIAGAVF+TL   DPAFA QQ+A+IAA+  
Subjt:  MAASMA-TMAVLSVKCTSINSTKTQNI-PKPIPKPISL----LSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAAD--

Query:  -GDNRGLALLLPLIPAIAWVLFNILQPALNQLNRMRSD-KGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNI
          DNRGLALLLP+IPA+ WVLFNILQPALNQ+N+MR++ K  I+GLGL GL  SG ++  P+A A+   +    A  SD RG LLL VV PAI WVL+NI
Subjt:  -GDNRGLALLLPLIPAIAWVLFNILQPALNQLNRMRSD-KGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNI

Query:  LQPALNQLNRMRSE
        LQPALNQLN+MRS+
Subjt:  LQPALNQLNRMRSE

Arabidopsis top hitse value%identityAlignment
AT1G67740.1 photosystem II BY1.2e-4856.52Show/hide
Query:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIA---ADGDNRG
        MAA+MAT    + KC S+N       P P PK    L  Q   K  IS      PN+S  ++ TA+AGAVFS+LS S+PA A QQIA++A   A  DNRG
Subjt:  MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIA---ADGDNRG

Query:  LALLLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGL-GGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALN
        LALLLP++PAIAWVL+NILQPA+NQ+N+MR  KG+++GLG+ GGL ASG +   P+A A+      A A+SSD RGQLLL VV PA+LWVLYNILQPALN
Subjt:  LALLLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGL-GGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALN

Query:  QLNRMRS
        Q+N+MRS
Subjt:  QLNRMRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCTTCCATGGCTACAATGGCAGTGCTCAGTGTCAAGTGCACAAGCATCAACTCCACCAAAACTCAGAACATCCCAAAGCCAATCCCCAAACCCATCTCCCTCCT
CTCTCTCCAGAACCTTCCAAAAGGACTAATCTCACCAAAAGCTACTGAAAATCCCAACTTATCAACCTTCCTCTCCGGCACCGCCATCGCCGGGGCTGTCTTCTCAACCT
TGAGCTTATCAGATCCTGCTTTTGCAGCCCAACAAATTGCAGAGATAGCCGCCGACGGTGACAACCGTGGTTTGGCCCTTTTGCTACCCCTTATTCCGGCTATAGCATGG
GTTCTGTTCAACATACTACAGCCAGCACTCAACCAGCTCAACAGAATGCGCAGTGACAAGGGGGTGATAATTGGGTTGGGATTAGGGGGGTTGACTGCATCAGGGTTTAT
GATGAACGCACCTGATGCTTCGGCCAGTGAGATCGCCATGATTGCCGCCGATGCCTCTTCAAGTGATGGCAGGGGGCAGCTTCTGCTGTTTGTCGTAGCACCCGCCATTC
TCTGGGTTCTGTACAACATTCTACAGCCAGCTTTGAATCAGCTCAACAGGATGAGATCCGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCTTCCATGGCTACAATGGCAGTGCTCAGTGTCAAGTGCACAAGCATCAACTCCACCAAAACTCAGAACATCCCAAAGCCAATCCCCAAACCCATCTCCCTCCT
CTCTCTCCAGAACCTTCCAAAAGGACTAATCTCACCAAAAGCTACTGAAAATCCCAACTTATCAACCTTCCTCTCCGGCACCGCCATCGCCGGGGCTGTCTTCTCAACCT
TGAGCTTATCAGATCCTGCTTTTGCAGCCCAACAAATTGCAGAGATAGCCGCCGACGGTGACAACCGTGGTTTGGCCCTTTTGCTACCCCTTATTCCGGCTATAGCATGG
GTTCTGTTCAACATACTACAGCCAGCACTCAACCAGCTCAACAGAATGCGCAGTGACAAGGGGGTGATAATTGGGTTGGGATTAGGGGGGTTGACTGCATCAGGGTTTAT
GATGAACGCACCTGATGCTTCGGCCAGTGAGATCGCCATGATTGCCGCCGATGCCTCTTCAAGTGATGGCAGGGGGCAGCTTCTGCTGTTTGTCGTAGCACCCGCCATTC
TCTGGGTTCTGTACAACATTCTACAGCCAGCTTTGAATCAGCTCAACAGGATGAGATCCGAGTGA
Protein sequenceShow/hide protein sequence
MAASMATMAVLSVKCTSINSTKTQNIPKPIPKPISLLSLQNLPKGLISPKATENPNLSTFLSGTAIAGAVFSTLSLSDPAFAAQQIAEIAADGDNRGLALLLPLIPAIAW
VLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMMNAPDASASEIAMIAADASSSDGRGQLLLFVVAPAILWVLYNILQPALNQLNRMRSE