; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003808 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003808
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionLEA_2 domain-containing protein
Genome locationscaffold127:677941..678513
RNA-Seq ExpressionMS003808
SyntenyMS003808
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589903.1 hypothetical protein SDJN03_15326, partial [Cucurbita argyrosperma subsp. sororia]7.0e-7781.58Show/hide
Query:  SSSRDDSVPVPYSLLPPN-AAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVR
        S S+D S+PVPYS +PPN AA QNVVVLSLYRPP +R+RRLLRLCA YSAAFLLLSA  FLLFP+DPSLQLVRLKLN + VRLLP ++LDLSFSASVRVR
Subjt:  SSSRDDSVPVPYSLLPPN-AAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVR

Query:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        N NFFSLDYNYLGVSVGYRGRRLGFVSS+GGRVSARG SYVNATLDLNG ++IHD  +L+EDL  GI+PFDTETEVEG MGLFFIKFPIK
Subjt:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

XP_004148717.1 uncharacterized protein LOC101219269 [Cucumis sativus]9.7e-7982.2Show/hide
Query:  MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV
        MTSSS DDSVPVPY+L+P NAA QNVVVLSLYRPP  R RRLLRLCAFYSAAFLLL AVAFLLFP+DPSLQLVRLKLNR+KV L+PV+ LDLSFS S+RV
Subjt:  MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        RN NFFSL+YN+LGVSVGYRGRRLG+VSSEGGRVSARG SYVNATLDLNG EV+HD +YL+ DL  GI+PFDTET+VEG MGLFFIK PIK
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

XP_022144909.1 uncharacterized protein LOC111014473 [Momordica charantia]1.2e-97100Show/hide
Query:  MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV
        MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV
Subjt:  MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

XP_022960913.1 uncharacterized protein LOC111461574 [Cucurbita moschata]2.4e-7781.58Show/hide
Query:  SSSRDDSVPVPYSLLPPN-AAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVR
        S S+D S+PVPYS +PPN AA QN+VVLSLYRPP +R+RRLLRLC  YSAAFLLLSAV FLLFP+DPSLQLVRLKLN + VRLLP ++LDLSFSASVRVR
Subjt:  SSSRDDSVPVPYSLLPPN-AAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVR

Query:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        NNNFFSLDYNYLGVSVGYRGRRLGFVSS+GGRVSARG SYVNATLDLNG ++IHD  +L+EDL  GI+PFDTETEVEG MGLFFIKFPIK
Subjt:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

XP_022987870.1 uncharacterized protein LOC111485280 [Cucurbita maxima]1.4e-7782.11Show/hide
Query:  SSSRDDSVPVPYSLLPPN-AAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVR
        S S+D S+PVPYS +PPN AA QNVVVLSLYRPP +R+RRLLRLCA YSAAFLLLSAV FLLFP+DPSLQLVRLKLN +KVRLLP ++LDLSFSASVRVR
Subjt:  SSSRDDSVPVPYSLLPPN-AAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVR

Query:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        N NFFSLDYNYLGVSVG+RGRRLGFVSS+GGRVSARG SYVNATLDLNG ++IHD  +L+EDL  GI+PFDTETEVEG MGLFFIKFPIK
Subjt:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

TrEMBL top hitse value%identityAlignment
A0A0A0LTV4 LEA_2 domain-containing protein4.7e-7982.2Show/hide
Query:  MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV
        MTSSS DDSVPVPY+L+P NAA QNVVVLSLYRPP  R RRLLRLCAFYSAAFLLL AVAFLLFP+DPSLQLVRLKLNR+KV L+PV+ LDLSFS S+RV
Subjt:  MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        RN NFFSL+YN+LGVSVGYRGRRLG+VSSEGGRVSARG SYVNATLDLNG EV+HD +YL+ DL  GI+PFDTET+VEG MGLFFIK PIK
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

A0A5A7TX90 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 19.2e-7580.1Show/hide
Query:  MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV
        MT+SS DDSVPVPY+LL  NAA QNVVVLSLYRP   R RRLLRL AFYSAAFLLL AVAFLLFP+DPSLQLVRLKLNR+KV L+P + LDLSFS S+RV
Subjt:  MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        RN NFFSL+YN+LGVSVGYRGRRLG+VSS GGRVSARG SYVNATLDLNG EV+HD +YL+ DL  GI+PFDTETEVEG MGLFFIK PIK
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

A0A6J1CTN0 uncharacterized protein LOC1110144735.9e-98100Show/hide
Query:  MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV
        MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV
Subjt:  MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

A0A6J1HAC8 uncharacterized protein LOC1114615741.2e-7781.58Show/hide
Query:  SSSRDDSVPVPYSLLPPN-AAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVR
        S S+D S+PVPYS +PPN AA QN+VVLSLYRPP +R+RRLLRLC  YSAAFLLLSAV FLLFP+DPSLQLVRLKLN + VRLLP ++LDLSFSASVRVR
Subjt:  SSSRDDSVPVPYSLLPPN-AAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVR

Query:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        NNNFFSLDYNYLGVSVGYRGRRLGFVSS+GGRVSARG SYVNATLDLNG ++IHD  +L+EDL  GI+PFDTETEVEG MGLFFIKFPIK
Subjt:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

A0A6J1JI07 uncharacterized protein LOC1114852806.8e-7882.11Show/hide
Query:  SSSRDDSVPVPYSLLPPN-AAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVR
        S S+D S+PVPYS +PPN AA QNVVVLSLYRPP +R+RRLLRLCA YSAAFLLLSAV FLLFP+DPSLQLVRLKLN +KVRLLP ++LDLSFSASVRVR
Subjt:  SSSRDDSVPVPYSLLPPN-AAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVR

Query:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        N NFFSLDYNYLGVSVG+RGRRLGFVSS+GGRVSARG SYVNATLDLNG ++IHD  +L+EDL  GI+PFDTETEVEG MGLFFIKFPIK
Subjt:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.3e-2836.67Show/hide
Query:  YSLLPPNAAHQ--NVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVRNNNFFSLDYN
        Y  LP +++H+  + V++S +  P  RRR ++ +     A+ L+     ++ +P+DP ++++R+K++ + V   PV  +D++   +++V N + +S D+ 
Subjt:  YSLLPPNAAHQ--NVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVRNNNFFSLDYN

Query:  YLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
         L V++ YRG+ LG VSS+GG V+A G SY++A  +L+G  V  D I+LI DLA G V FDT TE  G +G+ F +FP+K
Subjt:  YLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.3e-2836.67Show/hide
Query:  YSLLPPNAAHQ--NVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVRNNNFFSLDYN
        Y  LP +++H+  + V++S +  P  RRR ++ +     A+ L+     ++ +P+DP ++++R+K++ + V   PV  +D++   +++V N + +S D+ 
Subjt:  YSLLPPNAAHQ--NVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVRNNNFFSLDYN

Query:  YLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
         L V++ YRG+ LG VSS+GG V+A G SY++A  +L+G  V  D I+LI DLA G V FDT TE  G +G+ F +FP+K
Subjt:  YLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.9e-4247.4Show/hide
Query:  SSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRL---LRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVR
        +SS+ +   +PY+ LP +   Q+V++L+ YR  R RR  L   LR    ++A  LLLSA  +LL+P+DP + + R+ LN + V     + LDLSFS +++
Subjt:  SSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRL---LRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVR

Query:  VRNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK
        VRN +FFSLDY+ L VS+GYRGR LG V S+GG + AR  SY++ATL+L+G EV+HD IYLI DLA G++PFDT  +V+G +G+     PI+
Subjt:  VRNNNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCTCCAGCTCCAGGGACGATTCTGTCCCCGTGCCCTACTCTCTTCTTCCTCCAAATGCCGCTCACCAAAACGTCGTCGTATTGTCCCTCTACCGTCCCCCACGATT
CCGGCGTCGGCGGCTTCTGCGCCTCTGTGCCTTCTACTCCGCCGCCTTCCTCCTCCTCTCCGCCGTAGCCTTTCTTCTTTTCCCGGCCGATCCGTCGCTACAACTCGTCC
GATTGAAACTCAACCGCCTCAAAGTCCGTCTGTTGCCTGTTCTCCTCCTTGACCTATCTTTCTCTGCTTCTGTTAGGGTTCGCAATAACAATTTCTTCTCTCTCGACTAC
AATTACCTCGGCGTTTCCGTCGGGTACCGTGGAAGGCGACTCGGATTTGTGAGCTCGGAGGGCGGCCGAGTGTCTGCTCGAGGCTTGTCTTACGTGAATGCCACTCTCGA
TTTGAATGGCTTCGAGGTCATCCACGACGGCATTTACTTGATCGAGGATTTGGCGACGGGTATCGTCCCGTTCGATACGGAGACAGAGGTGGAAGGATACATGGGGCTTT
TCTTTATCAAATTCCCGATTAAG
mRNA sequenceShow/hide mRNA sequence
ATGACCTCCAGCTCCAGGGACGATTCTGTCCCCGTGCCCTACTCTCTTCTTCCTCCAAATGCCGCTCACCAAAACGTCGTCGTATTGTCCCTCTACCGTCCCCCACGATT
CCGGCGTCGGCGGCTTCTGCGCCTCTGTGCCTTCTACTCCGCCGCCTTCCTCCTCCTCTCCGCCGTAGCCTTTCTTCTTTTCCCGGCCGATCCGTCGCTACAACTCGTCC
GATTGAAACTCAACCGCCTCAAAGTCCGTCTGTTGCCTGTTCTCCTCCTTGACCTATCTTTCTCTGCTTCTGTTAGGGTTCGCAATAACAATTTCTTCTCTCTCGACTAC
AATTACCTCGGCGTTTCCGTCGGGTACCGTGGAAGGCGACTCGGATTTGTGAGCTCGGAGGGCGGCCGAGTGTCTGCTCGAGGCTTGTCTTACGTGAATGCCACTCTCGA
TTTGAATGGCTTCGAGGTCATCCACGACGGCATTTACTTGATCGAGGATTTGGCGACGGGTATCGTCCCGTTCGATACGGAGACAGAGGTGGAAGGATACATGGGGCTTT
TCTTTATCAAATTCCCGATTAAG
Protein sequenceShow/hide protein sequence
MTSSSRDDSVPVPYSLLPPNAAHQNVVVLSLYRPPRFRRRRLLRLCAFYSAAFLLLSAVAFLLFPADPSLQLVRLKLNRLKVRLLPVLLLDLSFSASVRVRNNNFFSLDY
NYLGVSVGYRGRRLGFVSSEGGRVSARGLSYVNATLDLNGFEVIHDGIYLIEDLATGIVPFDTETEVEGYMGLFFIKFPIK