; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001888 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001888
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionlight-harvesting complex-like protein 3 isotype 1, chloroplastic
Genome locationChr11:1410521..1413337
RNA-Seq ExpressionHG10001888
SyntenyHG10001888
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576837.1 Mediator of RNA polymerase II transcription subunit 31, partial [Cucurbita argyrosperma subsp. sororia]9.2e-7778.12Show/hide
Query:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNM
        MASIAISAS PRS TS NHV  K+QHA Q RP YSS  KNPT ++I T +VGDRD F A +RH  VASA +KLEED DD NHG+ F D+RWKNGTWDLNM
Subjt:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNM

Query:  FVKNGKMDWEGVIIAGRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE
        FVKNGKMDWEGVI+AGRAAMIGFFMAY+VDALTGIGIVGQSGNFI K+A+F+TVIGVL+FRQT+DIEGLRK+AEEATFYDKQWQ+SWQN NE
Subjt:  FVKNGKMDWEGVIIAGRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE

XP_022922577.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita moschata]6.0e-6863.45Show/hide
Query:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNM
        MASIAISAS PRS TS NHV   KQ   QARP YSS TKNPT ++I T  VGDRD   A +RH  VASA +KLEED DD NHG+ F D+RWKNGTWDLNM
Subjt:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNM

Query:  FVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTV
        FVKNGKMDWEGVI+A                                              GRAAMIGFFMAY+VDALTGIGIVGQSGNFI KAA+F+TV
Subjt:  FVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTV

Query:  IGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE
        IGVLLFRQT+DIEGLRK+AEEATFYDKQWQ+SWQN NE
Subjt:  IGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE

XP_022984403.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita maxima]1.4e-6964.02Show/hide
Query:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRD-GFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLN
        MASIAISAS  RS TS NHV  K+QHA QARP YSS TKNPT ++I T DVG+RD  F A +RH  VASA +KLEED DD NHG+ F D+RWKNGTWDLN
Subjt:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRD-GFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLN

Query:  MFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLT
        MFVKNGKMDWEGVI+A                                              GRAAMIGFFMAY+VDALTGIGIVGQSGNFI KAA+F+T
Subjt:  MFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLT

Query:  VIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE
        VIGVLLFRQT+DIEGLRK+AEEATFYDKQWQ+SWQN NE
Subjt:  VIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE

XP_023551912.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita pepo subsp. pepo]7.5e-7163.87Show/hide
Query:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNM
        MASIAISAS PRS TS NHV   +QHA QARP +SS TKNPT ++I T DVGDRD F A +RH  VASA +KLEED DD NHG+ F D+RWKNGTWDLNM
Subjt:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNM

Query:  FVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTV
        FVKNGKMDWEGVI+A                                              GRAAMIGFFMAY+VDALTGIGIVGQSGNFI K+A+F+TV
Subjt:  FVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTV

Query:  IGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE
        IGVLLFRQT+DIEGLRK+AEEATFYDKQWQ+SWQN NE
Subjt:  IGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE

XP_038875279.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Benincasa hispida]2.6e-7165.97Show/hide
Query:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNM
        MASIAISASLPR+STS NHVF+K               KNPT KVISTLDVGDRDG GA ERH  VASAREK+EEDNDDWNHGEKF DKRWKNGTWDLNM
Subjt:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNM

Query:  FVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTV
        FVKNGKMDWEGVI+A                                              GRAAMIGFFMAY+VDALTGIGIVGQSGNFI KAA+F TV
Subjt:  FVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTV

Query:  IGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE
        IGVLLFRQTEDIE LR IAEEAT YDKQWQ+SWQN NE
Subjt:  IGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE

TrEMBL top hitse value%identityAlignment
A0A0A0KUF5 Uncharacterized protein8.1e-6361.76Show/hide
Query:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEK--VASAREKLEED-NDDW-NHGEKFKDKRWKNGTW
        MASIAISASLPR+S SN+    KKQ  H A+P YSSTTKNPT KVISTLDV +RD  GAA+++ +   ASA  K EED NDDW N+GE+F DKRWKNGTW
Subjt:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEK--VASAREKLEED-NDDW-NHGEKFKDKRWKNGTW

Query:  DLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAV
        DLNMFV+NGKMDWEGVI+                                               GRAAMIGFFM Y VDALTG+GIVGQSGNFI K A+
Subjt:  DLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAV

Query:  FLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQ
        FLTVIGVLLFRQ+EDIE LR IAEEATFYDKQWQSSWQ
Subjt:  FLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQ

A0A1S3BXB8 uncharacterized protein LOC1034944455.1e-6562.24Show/hide
Query:  MASIAISASLPRSSTSNNHVFMKK--QHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEK----VASAREKLEEDNDDW-NHGEKFKDKRWKN
        MASIAISASLPR++ + NHV MKK  QHAH A+PAYSS TKNPT KVISTLDVG+RD   A ++H       +SA  K+EEDNDDW N+GE+F DKRWKN
Subjt:  MASIAISASLPRSSTSNNHVFMKK--QHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEK----VASAREKLEEDNDDW-NHGEKFKDKRWKN

Query:  GTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGK
        GTWDLNMFVKNGKMDWEGVI+                                               GRAAMIGFFM Y VDALTGIGIVGQSGNFI K
Subjt:  GTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGK

Query:  AAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQ
         A+FLTVIGVLLFRQ+ED+E LR IAEEATFYDKQWQSSWQ
Subjt:  AAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQ

A0A5D3CYT8 Chlorophyll A-B binding protein5.1e-6562.24Show/hide
Query:  MASIAISASLPRSSTSNNHVFMKK--QHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEK----VASAREKLEEDNDDW-NHGEKFKDKRWKN
        MASIAISASLPR++ + NHV MKK  QHAH A+PAYSS TKNPT KVISTLDVG+RD   A ++H       +SA  K+EEDNDDW N+GE+F DKRWKN
Subjt:  MASIAISASLPRSSTSNNHVFMKK--QHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEK----VASAREKLEEDNDDW-NHGEKFKDKRWKN

Query:  GTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGK
        GTWDLNMFVKNGKMDWEGVI+                                               GRAAMIGFFM Y VDALTGIGIVGQSGNFI K
Subjt:  GTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGK

Query:  AAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQ
         A+FLTVIGVLLFRQ+ED+E LR IAEEATFYDKQWQSSWQ
Subjt:  AAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQ

A0A6J1E3P1 light-harvesting complex-like protein 3 isotype 1, chloroplastic2.9e-6863.45Show/hide
Query:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNM
        MASIAISAS PRS TS NHV   KQ   QARP YSS TKNPT ++I T  VGDRD   A +RH  VASA +KLEED DD NHG+ F D+RWKNGTWDLNM
Subjt:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNM

Query:  FVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTV
        FVKNGKMDWEGVI+A                                              GRAAMIGFFMAY+VDALTGIGIVGQSGNFI KAA+F+TV
Subjt:  FVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTV

Query:  IGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE
        IGVLLFRQT+DIEGLRK+AEEATFYDKQWQ+SWQN NE
Subjt:  IGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE

A0A6J1JAE2 light-harvesting complex-like protein 3 isotype 1, chloroplastic6.9e-7064.02Show/hide
Query:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRD-GFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLN
        MASIAISAS  RS TS NHV  K+QHA QARP YSS TKNPT ++I T DVG+RD  F A +RH  VASA +KLEED DD NHG+ F D+RWKNGTWDLN
Subjt:  MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRD-GFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLN

Query:  MFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLT
        MFVKNGKMDWEGVI+A                                              GRAAMIGFFMAY+VDALTGIGIVGQSGNFI KAA+F+T
Subjt:  MFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLT

Query:  VIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE
        VIGVLLFRQT+DIEGLRK+AEEATFYDKQWQ+SWQN NE
Subjt:  VIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNE

SwissProt top hitse value%identityAlignment
Q6NKS4 Light-harvesting complex-like protein 3 isotype 2, chloroplastic7.2e-2438.67Show/hide
Query:  KFKDKRWKNGTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIV
        K+++ +W NGTWDL  F K+GK DW+ VI++                                              GRAAMIGFFMAY VD+LTG+G+V
Subjt:  KFKDKRWKNGTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIV

Query:  GQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQ
         Q GNF  K  +F+ V GVL  R+ ED++ L+ + +E T YDKQWQ++W+
Subjt:  GQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQ

Q9SYX1 Light-harvesting complex-like protein 3 isotype 1, chloroplastic2.6e-2640.65Show/hide
Query:  KFKDKRWKNGTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIV
        KF+D RW NGTWDL  F K+GK DW+ VI+A                                              GRAAMIGFFMAY VD+LTG+G+V
Subjt:  KFKDKRWKNGTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIV

Query:  GQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNEK
         Q GNF  K  +F+ V GVL  R+ ED++ L+ + +E T YDKQWQ++W+N +++
Subjt:  GQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNEK

Arabidopsis top hitse value%identityAlignment
AT4G17600.1 Chlorophyll A-B binding family protein1.9e-2740.65Show/hide
Query:  KFKDKRWKNGTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIV
        KF+D RW NGTWDL  F K+GK DW+ VI+A                                              GRAAMIGFFMAY VD+LTG+G+V
Subjt:  KFKDKRWKNGTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIV

Query:  GQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNEK
         Q GNF  K  +F+ V GVL  R+ ED++ L+ + +E T YDKQWQ++W+N +++
Subjt:  GQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNEK

AT5G47110.1 Chlorophyll A-B binding family protein5.1e-2538.67Show/hide
Query:  KFKDKRWKNGTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIV
        K+++ +W NGTWDL  F K+GK DW+ VI++                                              GRAAMIGFFMAY VD+LTG+G+V
Subjt:  KFKDKRWKNGTWDLNMFVKNGKMDWEGVIIA----------------------------------------------GRAAMIGFFMAYVVDALTGIGIV

Query:  GQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQ
         Q GNF  K  +F+ V GVL  R+ ED++ L+ + +E T YDKQWQ++W+
Subjt:  GQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAATTGCTATCTCTGCCTCATTGCCAAGATCATCCACTTCAAATAATCATGTCTTCATGAAGAAACAACATGCTCATCAAGCTAGACCTGCCTATTCCTCAAC
GACGAAAAACCCGACATCAAAGGTAATTAGTACTCTCGACGTCGGAGACAGAGATGGTTTCGGTGCTGCGGAACGTCATGAAAAGGTTGCATCAGCTAGGGAGAAACTAG
AGGAAGATAATGATGATTGGAACCATGGTGAAAAGTTCAAAGATAAGAGATGGAAAAATGGAACTTGGGATCTCAATATGTTTGTAAAGAATGGTAAGATGGATTGGGAA
GGTGTCATTATTGCAGGTAGGGCAGCCATGATAGGTTTTTTCATGGCCTATGTAGTGGATGCATTAACAGGAATTGGAATAGTTGGGCAAAGTGGTAATTTCATAGGCAA
AGCAGCTGTTTTTCTAACAGTCATTGGTGTGTTGCTGTTTAGACAAACTGAAGACATTGAGGGCTTAAGAAAGATAGCTGAAGAAGCCACCTTTTATGACAAGCAATGGC
AATCTTCATGGCAAAACCATAATGAAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAATTGCTATCTCTGCCTCATTGCCAAGATCATCCACTTCAAATAATCATGTCTTCATGAAGAAACAACATGCTCATCAAGCTAGACCTGCCTATTCCTCAAC
GACGAAAAACCCGACATCAAAGGTAATTAGTACTCTCGACGTCGGAGACAGAGATGGTTTCGGTGCTGCGGAACGTCATGAAAAGGTTGCATCAGCTAGGGAGAAACTAG
AGGAAGATAATGATGATTGGAACCATGGTGAAAAGTTCAAAGATAAGAGATGGAAAAATGGAACTTGGGATCTCAATATGTTTGTAAAGAATGGTAAGATGGATTGGGAA
GGTGTCATTATTGCAGGTAGGGCAGCCATGATAGGTTTTTTCATGGCCTATGTAGTGGATGCATTAACAGGAATTGGAATAGTTGGGCAAAGTGGTAATTTCATAGGCAA
AGCAGCTGTTTTTCTAACAGTCATTGGTGTGTTGCTGTTTAGACAAACTGAAGACATTGAGGGCTTAAGAAAGATAGCTGAAGAAGCCACCTTTTATGACAAGCAATGGC
AATCTTCATGGCAAAACCATAATGAAAAATAA
Protein sequenceShow/hide protein sequence
MASIAISASLPRSSTSNNHVFMKKQHAHQARPAYSSTTKNPTSKVISTLDVGDRDGFGAAERHEKVASAREKLEEDNDDWNHGEKFKDKRWKNGTWDLNMFVKNGKMDWE
GVIIAGRAAMIGFFMAYVVDALTGIGIVGQSGNFIGKAAVFLTVIGVLLFRQTEDIEGLRKIAEEATFYDKQWQSSWQNHNEK