; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10005089 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10005089
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionphotosystem II core complex proteins psbY, chloroplastic
Genome locationChr08:22803403..22804011
RNA-Seq ExpressionHG10005089
SyntenyHG10005089
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0034219 - carbohydrate transmembrane transport (biological process)
GO:0045454 - cell redox homeostasis (biological process)
GO:0009523 - photosystem II (cellular component)
GO:0009534 - chloroplast thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030145 - manganese ion binding (molecular function)
GO:0051119 - sugar transmembrane transporter activity (molecular function)
InterPro domainsIPR009388 - Photosystem II PsbY
IPR038760 - Photosystem II PsbY, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029231.1 Photosystem II core complex proteins psbY, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]6.6e-9495.05Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
        MAATMATMAVLSVKCTSINST+NHNTPK IPKPISLLSLQNLPKGLISSKSN +SNLSTFLSSTAIAGAVFS LSSSDPAFAAQQIA+IAADGDNRG+AL
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASG MYAPDASASE+AMIADASSSDSRGQLLLFVV PAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

KGN62604.2 hypothetical protein Csa_018753 [Cucumis sativus]6.0e-9596.53Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
        MAATMATMAVLSVKCTSINSTK HNT K IPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVF+TL SSDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPA+AWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASG MYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

XP_004142959.3 photosystem II core complex proteins psbY, chloroplastic [Cucumis sativus]6.0e-9596.53Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
        MAATMATMAVLSVKCTSINSTK HNT K IPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVF+TL SSDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPA+AWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASG MYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

XP_008444395.1 PREDICTED: photosystem II core complex proteins psbY, chloroplastic [Cucumis melo]6.6e-9495.54Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
        MAATMATMAVLSVKCTSINSTK HNT K IPKPISLLSLQNLPKGLISSKSN+NSNLSTFLSSTAIAGAVF+TL +SDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASG MYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        S+
Subjt:  SE

XP_038886078.1 LOW QUALITY PROTEIN: photosystem II core complex proteins psbY, chloroplastic-like [Benincasa hispida]1.7e-9798.02Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
        MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGL+SSKSNQNSNLST LSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQIN+MRSDKGVIIGLGLGGLTASG MYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

TrEMBL top hitse value%identityAlignment
A0A0A0LKA1 Uncharacterized protein2.9e-9596.53Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
        MAATMATMAVLSVKCTSINSTK HNT K IPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVF+TL SSDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPA+AWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASG MYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

A0A1S3B9R1 photosystem II core complex proteins psbY, chloroplastic3.2e-9495.54Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
        MAATMATMAVLSVKCTSINSTK HNT K IPKPISLLSLQNLPKGLISSKSN+NSNLSTFLSSTAIAGAVF+TL +SDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASG MYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        S+
Subjt:  SE

A0A5A7UYJ2 Photosystem II core complex proteins psbY3.2e-9495.54Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
        MAATMATMAVLSVKCTSINSTK HNT K IPKPISLLSLQNLPKGLISSKSN+NSNLSTFLSSTAIAGAVF+TL +SDPAFAAQQIAEIAADGDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASG MYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        S+
Subjt:  SE

A0A6J1BQF3 photosystem II core complex proteins psbY, chloroplastic1.1e-8991.09Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
        MAATMATMAVLSVKCTSINS+KNH TPKPI  PISLLSLQNLPK LISSK++QN NLSTFLSSTAIAGAVFS  SSSDPAFAAQQIAEIAA+GDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGL AS  MYAPDASA+EIAMIADASSSD+RGQLLLFV++PAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

F6GVJ9 Uncharacterized protein6.1e-7780.69Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL
        MAAT+ATMA+L+ KC SINS KN N  KP  KPISLLS+QNLPKGL + KS++N NLST L+ TAIAGA+FSTLSS DPA AAQQIAEI ADGDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLP+IPAIAWVLFNILQPALNQ+N+MRS KGVIIGLGLGGL ASG M  P ASASEIA +ADA+SSD+RGQLLLFVV PAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

SwissProt top hitse value%identityAlignment
O49347 Photosystem II core complex proteins psbY, chloroplastic1.8e-4657.56Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIA---ADGDNRG
        MAA MAT    + KC S+N       P P PK    L  Q   K  IS  +    N+S  ++STA+AGAVFS+LS S+PA A QQIA++A   A  DNRG
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIA---ADGDNRG

Query:  LALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGL-GGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQL
        LALLLP++PAIAWVL+NILQPA+NQ+N+MR  KG+++GLG+ GGL ASGL+  P  + +  A  A A+SSDSRGQLLL VV PA+LWVLYNILQPALNQ+
Subjt:  LALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGL-GGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQL

Query:  NRMRS
        N+MRS
Subjt:  NRMRS

P80470 Photosystem II core complex proteins psbY, chloroplastic6.1e-5060.77Show/hide
Query:  MAATMA-TMAVLSVKCTSINSTKNHNT-PKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAD---GDN
        MAATMA TMAVL+ KC ++N+ K  +T PKP  KPISL      P GL +SK      LS  +++ AIAGAVF+TL S DPAFA QQ+A+IAA+    DN
Subjt:  MAATMA-TMAVLSVKCTSINSTKNHNT-PKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAD---GDN

Query:  RGLALLLPLIPAIAWVLFNILQPALNQINRMRSD-KGVIIGLGLGGLTASGLMYA-PDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPAL
        RGLALLLP+IPA+ WVLFNILQPALNQIN+MR++ K  I+GLGL GL  SGL+ A P+A A+   +   A  SD+RG LLL VV PAI WVL+NILQPAL
Subjt:  RGLALLLPLIPAIAWVLFNILQPALNQINRMRSD-KGVIIGLGLGGLTASGLMYA-PDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPAL

Query:  NQLNRMRSE
        NQLN+MRS+
Subjt:  NQLNRMRSE

Arabidopsis top hitse value%identityAlignment
AT1G67740.1 photosystem II BY1.3e-4757.56Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIA---ADGDNRG
        MAA MAT    + KC S+N       P P PK    L  Q   K  IS  +    N+S  ++STA+AGAVFS+LS S+PA A QQIA++A   A  DNRG
Subjt:  MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIA---ADGDNRG

Query:  LALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGL-GGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQL
        LALLLP++PAIAWVL+NILQPA+NQ+N+MR  KG+++GLG+ GGL ASGL+  P  + +  A  A A+SSDSRGQLLL VV PA+LWVLYNILQPALNQ+
Subjt:  LALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGL-GGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQL

Query:  NRMRS
        N+MRS
Subjt:  NRMRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCTACCATGGCTACAATGGCAGTGCTCAGTGTCAAGTGCACAAGCATCAACTCCACCAAAAACCACAACACCCCAAAGCCAATCCCCAAACCCATCTCCCTCCT
CTCTCTCCAAAACCTTCCCAAAGGACTAATCTCATCAAAATCTAATCAAAATTCCAACTTATCAACCTTCCTCTCCAGCACCGCCATCGCCGGAGCTGTCTTTTCAACCT
TGAGCTCATCGGATCCTGCATTTGCAGCCCAACAAATTGCAGAGATAGCAGCTGATGGCGACAACCGCGGTTTAGCCCTTTTGCTACCTCTTATTCCGGCCATAGCATGG
GTTCTGTTCAACATATTACAGCCAGCACTCAATCAGATCAACAGAATGCGCAGTGACAAGGGTGTGATAATTGGGTTGGGATTAGGGGGGTTGACTGCATCAGGGCTTAT
GTACGCACCTGATGCTTCGGCCAGCGAGATCGCCATGATTGCCGATGCTTCTTCAAGTGATAGCAGGGGCCAGCTTCTGCTGTTTGTCGTAGCACCAGCCATTCTTTGGG
TGCTGTACAACATTCTACAGCCGGCTTTGAATCAGCTCAACAGGATGAGGTCCGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCTACCATGGCTACAATGGCAGTGCTCAGTGTCAAGTGCACAAGCATCAACTCCACCAAAAACCACAACACCCCAAAGCCAATCCCCAAACCCATCTCCCTCCT
CTCTCTCCAAAACCTTCCCAAAGGACTAATCTCATCAAAATCTAATCAAAATTCCAACTTATCAACCTTCCTCTCCAGCACCGCCATCGCCGGAGCTGTCTTTTCAACCT
TGAGCTCATCGGATCCTGCATTTGCAGCCCAACAAATTGCAGAGATAGCAGCTGATGGCGACAACCGCGGTTTAGCCCTTTTGCTACCTCTTATTCCGGCCATAGCATGG
GTTCTGTTCAACATATTACAGCCAGCACTCAATCAGATCAACAGAATGCGCAGTGACAAGGGTGTGATAATTGGGTTGGGATTAGGGGGGTTGACTGCATCAGGGCTTAT
GTACGCACCTGATGCTTCGGCCAGCGAGATCGCCATGATTGCCGATGCTTCTTCAAGTGATAGCAGGGGCCAGCTTCTGCTGTTTGTCGTAGCACCAGCCATTCTTTGGG
TGCTGTACAACATTCTACAGCCGGCTTTGAATCAGCTCAACAGGATGAGGTCCGAGTGA
Protein sequenceShow/hide protein sequence
MAATMATMAVLSVKCTSINSTKNHNTPKPIPKPISLLSLQNLPKGLISSKSNQNSNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAADGDNRGLALLLPLIPAIAW
VLFNILQPALNQINRMRSDKGVIIGLGLGGLTASGLMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMRSE