; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g38230 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g38230
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLate embryogenesis abundant protein (LEA) family protein
Genome locationchr8:28535089..28537365
RNA-Seq ExpressionMoc08g38230
SyntenyMoc08g38230
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598970.1 hypothetical protein SDJN03_08748, partial [Cucurbita argyrosperma subsp. sororia]1.1e-3058.49Show/hide
Query:  MASFTLLKTLPKFCHLGA-AIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMG
        M S+TL K+LPKF H GA AIARRS  SN  LL  SNPK +HT  PQD       EAKDAINP ANE M PGE         Y  A   + V EK  DM 
Subjt:  MASFTLLKTLPKFCHLGA-AIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMG

Query:  GMVMS---EISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNR
        GM+ +   E+S KAK+ MEAA DSA RAKDTVVE TK+SK+FVKANA++V+K MNTKNR
Subjt:  GMVMS---EISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNR

XP_022144549.1 uncharacterized protein At4g13230-like [Momordica charantia]1.9e-6490.38Show/hide
Query:  MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGG
        MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGES        Y  A   Q VREKASDMGG
Subjt:  MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGG

Query:  MVMSEISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNRP
        MV SEISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNRP
Subjt:  MVMSEISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNRP

XP_022933143.1 uncharacterized protein At4g13230 [Cucurbita moschata]4.7e-3159.12Show/hide
Query:  MASFTLLKTLPKFCHLGA-AIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMG
        M S TL K+LPKF H GA AIARRS  SN  LL  SNPKF+HT  PQD       EAKDAINP ANE M PGE         Y  A   + V EK  DM 
Subjt:  MASFTLLKTLPKFCHLGA-AIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMG

Query:  GMVMS---EISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNR
        GM+ +   E+S KAK+ MEAA DSA RAKDTVVE TK+SK+FVKANA++V+K MNTKNR
Subjt:  GMVMS---EISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNR

XP_023546410.1 uncharacterized protein At4g13230 [Cucurbita pepo subsp. pepo]5.2e-3057.86Show/hide
Query:  MASFTLLKTLPKFCHLGA-AIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMG
        M S TL K+LPKF H GA AIARRS  SN  LL  SNPK +HT  PQD +     EAKDAI P ANE M PGE         Y  A   + V EK  DM 
Subjt:  MASFTLLKTLPKFCHLGA-AIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMG

Query:  GMVMS---EISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNR
        GM+ +   E+S KAK+ MEAA DSA RAKDTVVE TK+SK+FVKANA++V+K MNTKNR
Subjt:  GMVMS---EISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNR

XP_038889268.1 uncharacterized protein At4g13230-like [Benincasa hispida]3.9e-3359.01Show/hide
Query:  MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGG
        MAS TL K LPKF H GAAI RRST SN  L+L SNPKF+HT   QD       EAKDAINPGANE MMPGE+        Y  A   + V EK  DM G
Subjt:  MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGG

Query:  MVMSE---ISEKAKQTMEAAWDS----AQRAKDTVVEATKESKEFVKANAESVKKSMNTKN
        MV ++   +S KAKQ MEAAWDS    AQRAKDT+V+   +SK+FVKAN +SV+KSMNTKN
Subjt:  MVMSE---ISEKAKQTMEAAWDS----AQRAKDTVVEATKESKEFVKANAESVKKSMNTKN

TrEMBL top hitse value%identityAlignment
A0A2N9J0W5 Uncharacterized protein2.9e-1844.1Show/hide
Query:  MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGG
        MAS +L+  LPKF H GAAIARR+T  NP L + SNP+ +   S  +A    +  A DA+  GAN+A   G+                  V +K  DM G
Subjt:  MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGG

Query:  MVMS---EISEKAKQTMEAAW----DSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKN
        M+ +   +++EKAKQT + AW    D+AQ+AKDTV+   +ESKE +K NAE+VK+SMNTKN
Subjt:  MVMS---EISEKAKQTMEAAW----DSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKN

A0A2N9J2L7 Uncharacterized protein9.0e-2044.72Show/hide
Query:  MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGG
        MAS +L+  LPKF H GAAIARR+T  NP L + SNP+ +   S  +A    +  A DA+  GAN+A   G++   K+ +        + V +K  DM G
Subjt:  MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGG

Query:  MVMS---EISEKAKQTMEAAW----DSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKN
        M+ +   +++EKAKQT + AW    D+AQ+AKDTV+   +ESKE +K NAE+VK+SMNTKN
Subjt:  MVMS---EISEKAKQTMEAAW----DSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKN

A0A6J1CSM2 uncharacterized protein At4g13230-like9.2e-6590.38Show/hide
Query:  MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGG
        MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGES        Y  A   Q VREKASDMGG
Subjt:  MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGG

Query:  MVMSEISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNRP
        MV SEISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNRP
Subjt:  MVMSEISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNRP

A0A6J1F3W7 uncharacterized protein At4g132302.3e-3159.12Show/hide
Query:  MASFTLLKTLPKFCHLGA-AIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMG
        M S TL K+LPKF H GA AIARRS  SN  LL  SNPKF+HT  PQD       EAKDAINP ANE M PGE         Y  A   + V EK  DM 
Subjt:  MASFTLLKTLPKFCHLGA-AIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMG

Query:  GMVMS---EISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNR
        GM+ +   E+S KAK+ MEAA DSA RAKDTVVE TK+SK+FVKANA++V+K MNTKNR
Subjt:  GMVMS---EISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNR

A0A6J1ICL4 uncharacterized protein At4g132308.1e-2957.23Show/hide
Query:  MASFTLLKTLPKFCHLGAA-IARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMG
        M S TL K+LPKF H GAA IA RS  SN  L   SNPKF+HT  PQD       EAKDAINP ANE M  GE         Y  A   + V EK  DM 
Subjt:  MASFTLLKTLPKFCHLGAA-IARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMG

Query:  GMVMS---EISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNR
        GM+ +   E+S KAKQ MEAA DSA RAKD+VVE TK+SK+FVKANA++V+K MNTKNR
Subjt:  GMVMS---EISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNR

SwissProt top hitse value%identityAlignment
Q8LFD5 Uncharacterized protein At4g132301.2e-0540.7Show/hide
Query:  KSCACYCVAQPTQWVREKASDMGGMVMSE----ISEKAKQTMEAAW----DSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKN
        K   C    +  Q V +KA D G   +S+    + +KAK T E AW    D+ ++ KDTV   T+E+KE +KA A++V++SMNTKN
Subjt:  KSCACYCVAQPTQWVREKASDMGGMVMSE----ISEKAKQTMEAAW----DSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKN

Arabidopsis top hitse value%identityAlignment
AT4G13230.1 Late embryogenesis abundant protein (LEA) family protein8.6e-0740.7Show/hide
Query:  KSCACYCVAQPTQWVREKASDMGGMVMSE----ISEKAKQTMEAAW----DSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKN
        K   C    +  Q V +KA D G   +S+    + +KAK T E AW    D+ ++ KDTV   T+E+KE +KA A++V++SMNTKN
Subjt:  KSCACYCVAQPTQWVREKASDMGGMVMSE----ISEKAKQTMEAAW----DSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGCTTTACTCTCCTCAAAACCCTTCCAAAGTTTTGTCATCTTGGTGCTGCAATTGCAAGGAGATCTACTGCCTCCAACCCTATCCTCCTTTTGCCTTCT
AACCCTAAATTTATGCATACAGGTTCACCACAAGATGCCATCAACCCAGGAGCCCCAGAAGCCAAAGATGCCATAAACCCAGGAGCTAATGAAGCAATGATGCCT
GGTGAAAGTTTTGGATTCAAGTCATGTGCTTGTTATTGTGTTGCTCAACCCACGCAATGGGTGAGGGAAAAGGCAAGCGATATGGGAGGCATGGTGATGAGTGAG
ATTTCAGAGAAGGCGAAGCAGACGATGGAAGCGGCGTGGGACTCCGCCCAGAGGGCAAAAGACACAGTGGTGGAGGCTACCAAAGAATCTAAGGAATTTGTCAAA
GCAAACGCAGAATCTGTCAAGAAGAGCATGAACACCAAGAATCGTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGCTTTACTCTCCTCAAAACCCTTCCAAAGTTTTGTCATCTTGGTGCTGCAATTGCAAGGAGATCTACTGCCTCCAACCCTATCCTCCTTTTGCCTTCT
AACCCTAAATTTATGCATACAGGTTCACCACAAGATGCCATCAACCCAGGAGCCCCAGAAGCCAAAGATGCCATAAACCCAGGAGCTAATGAAGCAATGATGCCT
GGTGAAAGTTTTGGATTCAAGTCATGTGCTTGTTATTGTGTTGCTCAACCCACGCAATGGGTGAGGGAAAAGGCAAGCGATATGGGAGGCATGGTGATGAGTGAG
ATTTCAGAGAAGGCGAAGCAGACGATGGAAGCGGCGTGGGACTCCGCCCAGAGGGCAAAAGACACAGTGGTGGAGGCTACCAAAGAATCTAAGGAATTTGTCAAA
GCAAACGCAGAATCTGTCAAGAAGAGCATGAACACCAAGAATCGTCCTTGA
Protein sequenceShow/hide protein sequence
MASFTLLKTLPKFCHLGAAIARRSTASNPILLLPSNPKFMHTGSPQDAINPGAPEAKDAINPGANEAMMPGESFGFKSCACYCVAQPTQWVREKASDMGGMVMSE
ISEKAKQTMEAAWDSAQRAKDTVVEATKESKEFVKANAESVKKSMNTKNRP