CuGenDBv2

MS009629 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS009629
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: LEA_2 domain-containing protein
Genome location: scaffold813:2857161..2857976
RNA-Seq Expression: MS009629
Synteny: MS009629
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
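
The genome location field uses a `seqid:start..end` layout. A minimal sketch of splitting it into its parts follows; the function names are hypothetical, and the span length assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold813:2857161..2857976")
print(seqid, start, end, span_length(start, end))
```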


Homology
GenBank top hits (e value, %identity, alignment)
KAG6575419.1 hypothetical protein SDJN03_26058, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.1e-86 | %identity: 79.72
Query:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
        MDVLAS T+FSN FCC QICT+YTLKRFC+FLLFIA+FS+IA+ IAALPVI LLKPR+PIFSL+SLRLDWYNIS+ S S F+SSVFTLTL SQNPN+I I
Subjt:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDAT E VE+K++GDVGVEL VLHMAV+KMKVALNC+V+V+YR LNF+ E+
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV

Query:  LDNGSGTIKKAL
        L NG+ T+ KAL
Subjt:  LDNGSGTIKKAL

XP_022146483.1 uncharacterized protein LOC111015690 isoform X1 [Momordica charantia] | e value: 2.7e-106 | %identity: 99.06
Query:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
        MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
Subjt:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV
        KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATREL EIKVVGDVGVELLVLHMAVLKMKVALNCDV VNYRELNFRNEV
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV

Query:  LDNGSGTIKKAL
        LDNGSGTIKKAL
Subjt:  LDNGSGTIKKAL

XP_022146491.1 uncharacterized protein LOC111015690 isoform X2 [Momordica charantia] | e value: 2.7e-106 | %identity: 99.06
Query:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
        MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
Subjt:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV
        KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATREL EIKVVGDVGVELLVLHMAVLKMKVALNCDV VNYRELNFRNEV
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV

Query:  LDNGSGTIKKAL
        LDNGSGTIKKAL
Subjt:  LDNGSGTIKKAL

XP_022992383.1 uncharacterized protein LOC111488707 isoform X1 [Cucurbita maxima] | e value: 8.2e-87 | %identity: 79.72
Query:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
        MDVLAS T+FSN FCC QICT+YTLKRFC+FLLFIA+FS+IA+ IAALPVI LLKPR+PIFSL+SLRLDWYNIS+ SGS F+SSVFTLTL SQNPN+I I
Subjt:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDAT E VE+K++GD+GVEL VLHMAV+KMKVALNC+V+V+YR LNFR ++
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV

Query:  LDNGSGTIKKAL
        L NG+ T+ KAL
Subjt:  LDNGSGTIKKAL

XP_023548218.1 uncharacterized protein LOC111806923 [Cucurbita pepo subsp. pepo] | e value: 1.8e-86 | %identity: 79.72
Query:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
        MDVLAS T+FSN FCC QICT+YTLKRFC+FLLFIA+FS+IA+ IAALPVI LLKPR+PIFSL+SLRLDWYNIS+ S S F+SSVFTLTLNS+NPN+I I
Subjt:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDAT E VE+K++GDVGVEL VLHMAV+KMKVALNC+V+V+YR LNFR ++
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV

Query:  LDNGSGTIKKAL
        L NG+ T+ KAL
Subjt:  LDNGSGTIKKAL

TrEMBL top hits (e value, %identity, alignment)
A0A6J1CXE3 uncharacterized protein LOC111015690 isoform X3 | e value: 8.6e-82 | %identity: 83.02
Query:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
        MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLD                               
Subjt:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV
           PSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATREL EIKVVGDVGVELLVLHMAVLKMKVALNCDV VNYRELNFRNEV
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV

Query:  LDNGSGTIKKAL
        LDNGSGTIKKAL
Subjt:  LDNGSGTIKKAL

A0A6J1CYP8 uncharacterized protein LOC111015690 isoform X1 | e value: 1.3e-106 | %identity: 99.06
Query:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
        MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
Subjt:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV
        KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATREL EIKVVGDVGVELLVLHMAVLKMKVALNCDV VNYRELNFRNEV
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV

Query:  LDNGSGTIKKAL
        LDNGSGTIKKAL
Subjt:  LDNGSGTIKKAL

A0A6J1CZI3 uncharacterized protein LOC111015690 isoform X2 | e value: 1.3e-106 | %identity: 99.06
Query:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
        MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
Subjt:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV
        KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATREL EIKVVGDVGVELLVLHMAVLKMKVALNCDV VNYRELNFRNEV
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV

Query:  LDNGSGTIKKAL
        LDNGSGTIKKAL
Subjt:  LDNGSGTIKKAL

A0A6J1GP37 uncharacterized protein LOC111456218 isoform X1 | e value: 2.6e-86 | %identity: 79.25
Query:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
        MDVLAS T+FS+ FCC QICT+YTLKRFC+FLLFIA+FS+IA+ IAALPVI LLKPR+PIFSL+SLRLDWYNIS+ S S F+SSVFTLTLNS+NPN+I I
Subjt:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDAT E VE+K++GDVGVEL VLHMAV+KMKVALNC+V+V+YR LNFR ++
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV

Query:  LDNGSGTIKKAL
        L NG+ T+ KAL
Subjt:  LDNGSGTIKKAL

A0A6J1JZ17 uncharacterized protein LOC111488707 isoform X1 | e value: 4.0e-87 | %identity: 79.72
Query:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI
        MDVLAS T+FSN FCC QICT+YTLKRFC+FLLFIA+FS+IA+ IAALPVI LLKPR+PIFSL+SLRLDWYNIS+ SGS F+SSVFTLTL SQNPN+I I
Subjt:  MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDAT E VE+K++GD+GVEL VLHMAV+KMKVALNC+V+V+YR LNFR ++
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEV

Query:  LDNGSGTIKKAL
        L NG+ T+ KAL
Subjt:  LDNGSGTIKKAL

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | e value: 1.3e-10 | %identity: 31.46
Query:  CIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSG-----SSFVSSVFTLTLNSQNPNRIAIKYSPSRLLVIYDGNAVIGTIRVP
        C+FLLF   F  + VL   L VIL +KP++P F LQ + + +  IS  S      ++ +S    +   + NPN++ I+Y  S   V+Y G   +G   VP
Subjt:  CIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSG-----SSFVSSVFTLTLNSQNPNRIAIKYSPSRLLVIYDGNAVIGTIRVP

Query:  EVFQPARSDDRSVRTRLLLHRFNV------DLF-DAT-RELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRE
          +Q A S  ++V   + + R N+      DL  DA+  + VE+ V GDVG ++ V++     ++V++NC + ++ R+
Subjt:  EVFQPARSDDRSVRTRLLLHRFNV------DLF-DAT-RELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRE

AT3G44380.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | e value: 4.9e-05 | %identity: 35.56
Query:  AVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAIKYSPSRLLVIYDGNAVIGTIRVPEVFQPARS
        A    A   +L  KP++P F L S+ L     S+K     + +   LT++  NPN  AI YS +++ ++YDG  V+G+  V    QPARS
Subjt:  AVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAIKYSPSRLLVIYDGNAVIGTIRVPEVFQPARS
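
The unlabelled middle line of each alignment block above appears to follow the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. Assuming that convention holds here, the following sketch counts identities and positives for one block. The function name is hypothetical, and the figures it gives for a single block need not match the page's overall %identity, whose denominator is not stated.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    ASSUMES a BLAST-style midline: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces are often lost in extraction, so pad to the query width.
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last block of the first GenBank hit, with the 8-character line prefix removed.
identities, positives, length = midline_stats("LDNGSGTIKKAL", "L NG+ T+ KAL")
print(f"{identities}/{length} identical, {positives}/{length} positive")
```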


Sequences
CDS sequence
ATGGATGTTCTTGCATCTCGAACTCAATTTTCCAATTGCTTTTGCTGCCAACAGATCTGTACGATTTATACTCTGAAACGATTCTGCATTTTTCTTCTCTTCATCGCTAT
CTTTTCAGTTATCGCAGTTCTGATCGCCGCATTGCCTGTAATTCTCCTGTTGAAGCCTCGAGAGCCGATTTTTTCTCTCCAGTCGTTGCGATTGGATTGGTACAACATTA
GCGTCAAATCTGGCTCTTCGTTTGTTTCTTCTGTTTTCACTCTCACTCTCAATTCTCAAAACCCTAACAGAATCGCCATTAAGTACAGTCCGTCGAGGCTCCTCGTGATC
TACGATGGAAACGCCGTGATCGGAACGATTCGAGTCCCTGAAGTTTTCCAGCCGGCTCGCAGCGACGATCGGAGTGTTCGAACTCGTCTGTTGTTGCATCGATTCAATGT
CGATTTGTTCGACGCGACGCGCGAGTTGGTTGAGATCAAAGTTGTTGGCGACGTTGGAGTGGAGCTGCTCGTGCTTCATATGGCCGTGTTGAAGATGAAGGTTGCTCTGA
ATTGCGATGTGAATGTCAATTACAGAGAGCTTAATTTCAGAAATGAAGTACTTGACAATGGATCAGGAACTATAAAAAAAGCTCTG
mRNA sequence
ATGGATGTTCTTGCATCTCGAACTCAATTTTCCAATTGCTTTTGCTGCCAACAGATCTGTACGATTTATACTCTGAAACGATTCTGCATTTTTCTTCTCTTCATCGCTAT
CTTTTCAGTTATCGCAGTTCTGATCGCCGCATTGCCTGTAATTCTCCTGTTGAAGCCTCGAGAGCCGATTTTTTCTCTCCAGTCGTTGCGATTGGATTGGTACAACATTA
GCGTCAAATCTGGCTCTTCGTTTGTTTCTTCTGTTTTCACTCTCACTCTCAATTCTCAAAACCCTAACAGAATCGCCATTAAGTACAGTCCGTCGAGGCTCCTCGTGATC
TACGATGGAAACGCCGTGATCGGAACGATTCGAGTCCCTGAAGTTTTCCAGCCGGCTCGCAGCGACGATCGGAGTGTTCGAACTCGTCTGTTGTTGCATCGATTCAATGT
CGATTTGTTCGACGCGACGCGCGAGTTGGTTGAGATCAAAGTTGTTGGCGACGTTGGAGTGGAGCTGCTCGTGCTTCATATGGCCGTGTTGAAGATGAAGGTTGCTCTGA
ATTGCGATGTGAATGTCAATTACAGAGAGCTTAATTTCAGAAATGAAGTACTTGACAATGGATCAGGAACTATAAAAAAAGCTCTG
Protein sequence
MDVLASRTQFSNCFCCQQICTIYTLKRFCIFLLFIAIFSVIAVLIAALPVILLLKPREPIFSLQSLRLDWYNISVKSGSSFVSSVFTLTLNSQNPNRIAIKYSPSRLLVI
YDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDATRELVEIKVVGDVGVELLVLHMAVLKMKVALNCDVNVNYRELNFRNEVLDNGSGTIKKAL
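
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly. The `translate` helper is hypothetical and is shown on the first 24 nucleotides of the CDS only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; unknown codons become 'X', stops become '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


# First eight codons of the CDS sequence listed above.
print(translate("ATGGATGTTCTTGCATCTCGAACT"))
```

Pasting the full CDS, line breaks included, into `translate` should give a string that can be compared directly with the protein sequence.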