; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004678 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004678
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionphotosystem II core complex proteins psbY, chloroplastic
Genome locationchr6:6036605..6037210
RNA-Seq ExpressionLag0004678
SyntenyLag0004678
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0034219 - carbohydrate transmembrane transport (biological process)
GO:0045454 - cell redox homeostasis (biological process)
GO:0009523 - photosystem II (cellular component)
GO:0009534 - chloroplast thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030145 - manganese ion binding (molecular function)
GO:0051119 - sugar transmembrane transporter activity (molecular function)
InterPro domainsIPR009388 - Photosystem II PsbY
IPR038760 - Photosystem II PsbY, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598253.1 Photosystem II core complex proteins psbY, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.5e-9093.07Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL
        MAATMATMAVLSVKCTSINST+N NTPKLIPKPISLLSLQNLPKGLISSK+NE  NLS FLSSTAIAGAVFS LSSSDPAFAAQQIA+IAA+ DNRG+AL
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFMYAPDASASE+AMIADASSSDSRGQLLLFVV PAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

KAG7029231.1 Photosystem II core complex proteins psbY, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-9093.07Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL
        MAATMATMAVLSVKCTSINST+N NTPKLIPKPISLLSLQNLPKGLISSK+N   NLSTFLSSTAIAGAVFS LSSSDPAFAAQQIA+IAA+ DNRG+AL
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFMYAPDASASE+AMIADASSSDSRGQLLLFVV PAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

KGN62604.2 hypothetical protein Csa_018753 [Cucumis sativus]3.4e-9093.07Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL
        MAATMATMAVLSVKCTSINSTK  NT K+IPKPISLLSLQNLPKGLISSK+N+  NLSTFLSSTAIAGAVF+TL SSDPAFAAQQIAEIAA+ DNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPA+AWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

XP_004142959.3 photosystem II core complex proteins psbY, chloroplastic [Cucumis sativus]3.4e-9093.07Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL
        MAATMATMAVLSVKCTSINSTK  NT K+IPKPISLLSLQNLPKGLISSK+N+  NLSTFLSSTAIAGAVF+TL SSDPAFAAQQIAEIAA+ DNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPA+AWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

XP_038886078.1 LOW QUALITY PROTEIN: photosystem II core complex proteins psbY, chloroplastic-like [Benincasa hispida]5.2e-9194.06Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL
        MAATMATMAVLSVKCTSINSTKN NTPK IPKPISLLSLQNLPKGL+SSK+N+  NLST LSSTAIAGAVFSTLSSSDPAFAAQQIAEIAA+ DNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQ+N+MRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

TrEMBL top hitse value%identityAlignment
A0A0A0LKA1 Uncharacterized protein1.6e-9093.07Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL
        MAATMATMAVLSVKCTSINSTK  NT K+IPKPISLLSLQNLPKGLISSK+N+  NLSTFLSSTAIAGAVF+TL SSDPAFAAQQIAEIAA+ DNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPA+AWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

A0A1S3B9R1 photosystem II core complex proteins psbY, chloroplastic3.7e-9093.07Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL
        MAATMATMAVLSVKCTSINSTK  NT K+IPKPISLLSLQNLPKGLISSK+NE  NLSTFLSSTAIAGAVF+TL +SDPAFAAQQIAEIAA+ DNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        S+
Subjt:  SE

A0A5A7UYJ2 Photosystem II core complex proteins psbY3.7e-9093.07Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL
        MAATMATMAVLSVKCTSINSTK  NT K+IPKPISLLSLQNLPKGLISSK+NE  NLSTFLSSTAIAGAVF+TL +SDPAFAAQQIAEIAA+ DNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        S+
Subjt:  SE

A0A6J1BQF3 photosystem II core complex proteins psbY, chloroplastic3.2e-8690.1Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL
        MAATMATMAVLSVKCTSINS+KN  TPK I  PISLLSLQNLPK LISSKA++ PNLSTFLSSTAIAGAVFS  SSSDPAFAAQQIAEIAAE DNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE-DNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQ+NRMRSDKGVIIGLGLGGL AS FMYAPDASA+EIAMIADASSSD+RGQLLLFV++PAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

F6GVJ9 Uncharacterized protein3.3e-7580.1Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAEDNRGLALL
        MAAT+ATMA+L+ KC SINS KN N  K   KPISLLS+QNLPKGL + K++E  NLST L+ TAIAGA+FSTLSS DPA AAQQIAEIA  DNRGLALL
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAEDNRGLALL

Query:  LPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMRS
        LP+IPAIAWVLFNILQPALNQLN+MRS KGVIIGLGLGGL ASGFM  P ASASEIA +ADA+SSD+RGQLLLFVV PAILWVLYNILQPALNQLNRMRS
Subjt:  LPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMRS

Query:  E
        E
Subjt:  E

SwissProt top hitse value%identityAlignment
O49347 Photosystem II core complex proteins psbY, chloroplastic6.3e-4757.07Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAA----EDNRG
        MAA MAT    + KC S+    NP+ PK        L  Q   K  IS      PN+S  ++STA+AGAVFS+LS S+PA A QQIA++AA     DNRG
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAA----EDNRG

Query:  LALLLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGL-GGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQL
        LALLLP++PAIAWVL+NILQPA+NQ+N+MR  KG+++GLG+ GGL ASG +  P  + +  A  A A+SSDSRGQLLL VV PA+LWVLYNILQPALNQ+
Subjt:  LALLLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGL-GGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQL

Query:  NRMRS
        N+MRS
Subjt:  NRMRS

P80470 Photosystem II core complex proteins psbY, chloroplastic2.0e-4860Show/hide
Query:  MAATMA-TMAVLSVKCTSINSTKNPNT-PKLIPKPISLLSLQNLPKGLISSKANEIP-NLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE----D
        MAATMA TMAVL+ KC ++N+ K  +T PK   KPISL      P GL +SK   +P  LS  +++ AIAGAVF+TL S DPAFA QQ+A+IAAE    D
Subjt:  MAATMA-TMAVLSVKCTSINSTKNPNT-PKLIPKPISLLSLQNLPKGLISSKANEIP-NLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAE----D

Query:  NRGLALLLPLIPAIAWVLFNILQPALNQLNRMRSD-KGVIIGLGLGGLTASGFMYA-PDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPA
        NRGLALLLP+IPA+ WVLFNILQPALNQ+N+MR++ K  I+GLGL GL  SG + A P+A A+   +   A  SD+RG LLL VV PAI WVL+NILQPA
Subjt:  NRGLALLLPLIPAIAWVLFNILQPALNQLNRMRSD-KGVIIGLGLGGLTASGFMYA-PDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPA

Query:  LNQLNRMRSE
        LNQLN+MRS+
Subjt:  LNQLNRMRSE

Arabidopsis top hitse value%identityAlignment
AT1G67740.1 photosystem II BY4.5e-4857.07Show/hide
Query:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAA----EDNRG
        MAA MAT    + KC S+    NP+ PK        L  Q   K  IS      PN+S  ++STA+AGAVFS+LS S+PA A QQIA++AA     DNRG
Subjt:  MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAA----EDNRG

Query:  LALLLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGL-GGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQL
        LALLLP++PAIAWVL+NILQPA+NQ+N+MR  KG+++GLG+ GGL ASG +  P  + +  A  A A+SSDSRGQLLL VV PA+LWVLYNILQPALNQ+
Subjt:  LALLLPLIPAIAWVLFNILQPALNQLNRMRSDKGVIIGLGL-GGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQL

Query:  NRMRS
        N+MRS
Subjt:  NRMRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCAACCATGGCTACAATGGCAGTGCTCAGTGTCAAGTGCACAAGCATCAACTCCACCAAAAACCCCAACACCCCAAAGCTGATCCCCAAACCCATCTCGCTCCT
CTCTCTCCAGAATCTTCCAAAAGGACTGATCTCATCAAAAGCTAATGAAATTCCCAACCTATCAACCTTCCTCTCCAGCACCGCCATCGCCGGAGCTGTCTTCTCGACCT
TGAGCTCATCAGATCCTGCTTTTGCAGCCCAACAAATTGCAGAGATAGCAGCCGAGGACAACCGTGGTTTAGCCCTTCTGCTACCCCTTATTCCTGCCATAGCATGGGTT
CTGTTCAACATACTACAGCCAGCACTCAACCAGCTCAACAGAATGCGCAGTGACAAGGGTGTGATAATTGGGTTGGGATTAGGGGGATTGACTGCATCAGGGTTTATGTA
CGCACCTGATGCTTCGGCCAGTGAGATCGCCATGATCGCTGATGCTTCTTCGAGTGATAGCAGGGGGCAGCTTCTGCTGTTTGTCGTTGCACCCGCCATTCTTTGGGTGC
TGTACAACATTCTACAGCCAGCTTTGAATCAGCTCAACAGGATGAGATCCGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCAACCATGGCTACAATGGCAGTGCTCAGTGTCAAGTGCACAAGCATCAACTCCACCAAAAACCCCAACACCCCAAAGCTGATCCCCAAACCCATCTCGCTCCT
CTCTCTCCAGAATCTTCCAAAAGGACTGATCTCATCAAAAGCTAATGAAATTCCCAACCTATCAACCTTCCTCTCCAGCACCGCCATCGCCGGAGCTGTCTTCTCGACCT
TGAGCTCATCAGATCCTGCTTTTGCAGCCCAACAAATTGCAGAGATAGCAGCCGAGGACAACCGTGGTTTAGCCCTTCTGCTACCCCTTATTCCTGCCATAGCATGGGTT
CTGTTCAACATACTACAGCCAGCACTCAACCAGCTCAACAGAATGCGCAGTGACAAGGGTGTGATAATTGGGTTGGGATTAGGGGGATTGACTGCATCAGGGTTTATGTA
CGCACCTGATGCTTCGGCCAGTGAGATCGCCATGATCGCTGATGCTTCTTCGAGTGATAGCAGGGGGCAGCTTCTGCTGTTTGTCGTTGCACCCGCCATTCTTTGGGTGC
TGTACAACATTCTACAGCCAGCTTTGAATCAGCTCAACAGGATGAGATCCGAGTGA
Protein sequenceShow/hide protein sequence
MAATMATMAVLSVKCTSINSTKNPNTPKLIPKPISLLSLQNLPKGLISSKANEIPNLSTFLSSTAIAGAVFSTLSSSDPAFAAQQIAEIAAEDNRGLALLLPLIPAIAWV
LFNILQPALNQLNRMRSDKGVIIGLGLGGLTASGFMYAPDASASEIAMIADASSSDSRGQLLLFVVAPAILWVLYNILQPALNQLNRMRSE