CuGenDBv2

Moc08g39960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g39960
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Chlorophyll A-B binding protein
Genome location: chr8:30577268..30583730
RNA-Seq Expression: Moc08g39960
Synteny: Moc08g39960
Gene Ontology terms:
    GO:0009507 - chloroplast (cellular component)
    GO:0009579 - thylakoid (cellular component)
    GO:0016021 - integral component of membrane (cellular component)
    GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR023329 - Chlorophyll a/b binding domain superfamily
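
The genome location in the record above is given as a `chromosome:start..end` string. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names `Region` and `parse_location` are hypothetical helpers for illustration, not part of any CuGenDB tooling, and the length assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'chrom:start..end' string into a Region."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Region(chrom, start, end)


region = parse_location("chr8:30577268..30583730")
print(region.chrom, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption the locus spans 6463 bp, considerably longer than the 417-nt CDS listed further down the record.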


Homology
GenBank top hits (e value, % identity, alignment)
XP_008451793.1 PREDICTED: uncharacterized protein LOC103492970 isoform X1 [Cucumis melo] | e value: 6.8e-50 | % identity: 80.15
Query:  SQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAINGAWAMIGL
        SQYLSFRH+HP ATFSR GW+RDQ  G+ THRTRGQAFRI    NVSP KDG +K+VIMVDPLEAKRMAAK+MEKIKAKEK KRRRQIEAINGAWAMIGL
Subjt:  SQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAINGAWAMIGL

Query:  TAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
        TAGLVIEGQTGKGILAQL DYF+ +++ F+R
Subjt:  TAGLVIEGQTGKGILAQLTDYFNTVVHVFVR

XP_022153344.1 uncharacterized protein LOC111020859 [Momordica charantia] | e value: 3.5e-70 | % identity: 100
Query:  MASTALSSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAING
        MASTALSSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAING
Subjt:  MASTALSSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAING

Query:  AWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
        AWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
Subjt:  AWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR

XP_022937783.1 uncharacterized protein LOC111444076 isoform X1 [Cucurbita moschata] | e value: 3.9e-53 | % identity: 79.58
Query:  MASTAL----SSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIE
        MASTAL        LSFRH HP ATFSRWGWNRDQ  GK THRTRGQAFRILANPNVSPGKD  +K+VIMVDPLEAKR+AAK+MEKIKAKEK KRRRQIE
Subjt:  MASTAL----SSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIE

Query:  AINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
        AINGAWAMIGLTAGL++EGQTGKGILAQL  Y   VV+ FVR
Subjt:  AINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR

XP_022969695.1 uncharacterized protein LOC111468645 isoform X1 [Cucurbita maxima] | e value: 9.5e-52 | % identity: 75.68
Query:  MASTAL----------SSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLK
        MASTAL          SSQ LSFRH H  ATFSRWGW+RD+  G  THRTRGQAFRILANPNVSPGKD  +K+VIMVDPLEAKR+AAK+MEKIKAKEK K
Subjt:  MASTAL----------SSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLK

Query:  RRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
        RRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQL  Y   VV+ FVR
Subjt:  RRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR

XP_023538319.1 uncharacterized protein LOC111799137 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 2.3e-53 | % identity: 79.58
Query:  MASTAL----SSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIE
        MASTAL        LSFRH HP ATFSRWGWNRDQ  GK THRTRGQAFRILANPNVSPGKD  +K+VIMVDPLEAKR+AAK+MEKIKAKEK KRRRQIE
Subjt:  MASTAL----SSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIE

Query:  AINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
        AINGAWAMIGLTAGL++EGQTGKGILAQL  Y   VV+ FVR
Subjt:  AINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BSE0 uncharacterized protein LOC103492970 isoform X1 | e value: 3.3e-50 | % identity: 80.15
Query:  SQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAINGAWAMIGL
        SQYLSFRH+HP ATFSR GW+RDQ  G+ THRTRGQAFRI    NVSP KDG +K+VIMVDPLEAKRMAAK+MEKIKAKEK KRRRQIEAINGAWAMIGL
Subjt:  SQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAINGAWAMIGL

Query:  TAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
        TAGLVIEGQTGKGILAQL DYF+ +++ F+R
Subjt:  TAGLVIEGQTGKGILAQLTDYFNTVVHVFVR

A0A6J1DIT0 uncharacterized protein LOC111020859 | e value: 1.7e-70 | % identity: 100
Query:  MASTALSSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAING
        MASTALSSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAING
Subjt:  MASTALSSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAING

Query:  AWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
        AWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
Subjt:  AWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR

A0A6J1FC77 uncharacterized protein LOC111444076 isoform X1 | e value: 1.9e-53 | % identity: 79.58
Query:  MASTAL----SSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIE
        MASTAL        LSFRH HP ATFSRWGWNRDQ  GK THRTRGQAFRILANPNVSPGKD  +K+VIMVDPLEAKR+AAK+MEKIKAKEK KRRRQIE
Subjt:  MASTAL----SSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIE

Query:  AINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
        AINGAWAMIGLTAGL++EGQTGKGILAQL  Y   VV+ FVR
Subjt:  AINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR

A0A6J1FHS7 uncharacterized protein LOC111444076 isoform X2 | e value: 5.8e-47 | % identity: 74.64
Query:  MASTALSSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAING
        M  T+LS+     +H  P +  SRWGWNRDQ  GK THRTRGQAFRILANPNVSPGKD  +K+VIMVDPLEAKR+AAK+MEKIKAKEK KRRRQIEAING
Subjt:  MASTALSSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAING

Query:  AWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
        AWAMIGLTAGL++EGQTGKGILAQL  Y   VV+ FVR
Subjt:  AWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR

A0A6J1I0N1 uncharacterized protein LOC111468645 isoform X1 | e value: 4.6e-52 | % identity: 75.68
Query:  MASTAL----------SSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLK
        MASTAL          SSQ LSFRH H  ATFSRWGW+RD+  G  THRTRGQAFRILANPNVSPGKD  +K+VIMVDPLEAKR+AAK+MEKIKAKEK K
Subjt:  MASTAL----------SSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLK

Query:  RRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR
        RRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQL  Y   VV+ FVR
Subjt:  RRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVFVR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G28025.1 unknown protein | e value: 1.2e-31 | % identity: 59.84
Query:  RHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVS----PGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAINGAWAMIGLTA
        R A P ++  + G  R Q    + +R R    R+LANPNVS    PGK    KEVIMVDPLEAKR+A+KQME+IK +EK +RRR+IEAINGAWA+IGL  
Subjt:  RHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVS----PGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAINGAWAMIGLTA

Query:  GLVIEGQTGKGILAQLTDYFNTVVHVF
        GLVIE QTGKGILAQL  Y++ VVH+F
Subjt:  GLVIEGQTGKGILAQLTDYFNTVVHVF

AT4G28025.2 unknown protein | e value: 1.2e-31 | % identity: 56.94
Query:  MASTALSSQYLSFRHAHPPATFS----RWGWNRDQGGGKITHRTRGQAFRILANPNVS----PGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRR
        M  T+ SS    F      A FS    R G  R Q    + +R R    R+LANPNVS    PGK    KEVIMVDPLEAKR+A+KQME+IK +EK +RR
Subjt:  MASTALSSQYLSFRHAHPPATFS----RWGWNRDQGGGKITHRTRGQAFRILANPNVS----PGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRR

Query:  RQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVF
        R+IEAINGAWA+IGL  GLVIE QTGKGILAQL  Y++ VVH+F
Subjt:  RQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLTDYFNTVVHVF


Sequences
CDS sequence
ATGGCTTCCACTGCCTTGTCTTCCCAATACCTCTCTTTCCGCCATGCCCATCCTCCTGCAACTTTCTCCAGGTGGGGTTGGAACAGGGATCAAGGTGGAGGCAAAATTAC
CCATAGAACAAGGGGTCAAGCGTTTCGAATCTTGGCCAACCCTAATGTCTCTCCTGGGAAAGATGGCTTTGTTAAAGAGGTGATCATGGTAGATCCTTTGGAAGCCAAAC
GAATGGCTGCGAAACAAATGGAAAAGATAAAAGCAAAAGAGAAATTGAAGCGAAGACGTCAAATAGAAGCGATTAATGGAGCATGGGCAATGATTGGTCTCACTGCAGGG
CTCGTCATCGAAGGTCAAACTGGAAAAGGCATTCTAGCGCAGTTGACCGATTACTTCAACACCGTTGTCCATGTCTTTGTACGATAG
mRNA sequence
ATGGCTTCCACTGCCTTGTCTTCCCAATACCTCTCTTTCCGCCATGCCCATCCTCCTGCAACTTTCTCCAGGTGGGGTTGGAACAGGGATCAAGGTGGAGGCAAAATTAC
CCATAGAACAAGGGGTCAAGCGTTTCGAATCTTGGCCAACCCTAATGTCTCTCCTGGGAAAGATGGCTTTGTTAAAGAGGTGATCATGGTAGATCCTTTGGAAGCCAAAC
GAATGGCTGCGAAACAAATGGAAAAGATAAAAGCAAAAGAGAAATTGAAGCGAAGACGTCAAATAGAAGCGATTAATGGAGCATGGGCAATGATTGGTCTCACTGCAGGG
CTCGTCATCGAAGGTCAAACTGGAAAAGGCATTCTAGCGCAGTTGACCGATTACTTCAACACCGTTGTCCATGTCTTTGTACGATAG
Protein sequence
MASTALSSQYLSFRHAHPPATFSRWGWNRDQGGGKITHRTRGQAFRILANPNVSPGKDGFVKEVIMVDPLEAKRMAAKQMEKIKAKEKLKRRRQIEAINGAWAMIGLTAG
LVIEGQTGKGILAQLTDYFNTVVHVFVR
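
The CDS and protein sequences above are related by translation. As a rough sketch of how that relationship could be checked, the Python below translates a coding sequence codon by codon. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. `translate` is a hypothetical helper written for this illustration, and the two test strings are simply the first 30 and last 24 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (bases in T, C, A, G order; '*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; stop codons appear as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped records
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    # A codon containing anything other than A, C, G, T raises KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


head = "ATGGCTTCCACTGCCTTGTCTTCCCAATAC"  # first 30 nt of the CDS
tail = "GTTGTCCATGTCTTTGTACGATAG"        # last 24 nt of the CDS
print(translate(head))  # MASTALSSQY
print(translate(tail))  # VVHVFVR*
```

Both fragments agree with the start (MASTALSSQY) and end (VVHVFVR) of the protein sequence, with the final TAG codon read as the stop.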