CuGenDBv2

MS002471 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS002471
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Late embryogenesis abundant protein (LEA) family protein
Genome location: scaffold318:339637..340904
RNA-Seq Expression: MS002471
Synteny: MS002471
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
GAV70662.1 hypothetical protein CFOL_v3_14160, partial [Cephalotus follicularis] (e value: 8.8e-27; % identity: 59.7)
Query:  SVLVRSPWSSRVFVSVRPR--HASSSQFAAAAAADGAAQGVKRA---TEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVK
        +VL+R+PWSSRVFV V  R  HAS S+  A++A D A Q  K+     +K  E+VKDKA+S AE+VT+K ++ A KV ETAQDL  KAKQTVQ+AWGSVK
Subjt:  SVLVRSPWSSRVFVSVRPR--HASSSQFAAAAAADGAAQGVKRA---TEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVK

Query:  DTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKS
        DTT  IK+ VVGKAEESKE+IK+TA+N+K  +K+
Subjt:  DTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKS

XP_008457975.1 PREDICTED: uncharacterized protein At4g13230 [Cucumis melo] (e value: 1.3e-51; % identity: 81.21)
Query:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK
        MAS LA++AT+PKL RAGS++V +PWSS+VFVS+RPRH SS     +AAADGAAQ VKR TEKAAEEVKDKA SAAEEVTQKTK+VAGKVSETAQD+AGK
Subjt:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC
        AKQTV++AWGSVKDTTQNIKEKVVGKAEESKEAIKDTA+NIK+ IKSNC
Subjt:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC

XP_011659168.1 uncharacterized protein At4g13230 [Cucumis sativus] (e value: 8.2e-49; % identity: 79.19)
Query:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK
        MAS LA++ATIPKL RAGS++V +PWSS VFVS+RPRH SS     AAAAD AA  VKR TE+AAEEVKDKA SAA+EVTQ+TKEVAGKVSETAQD+AGK
Subjt:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC
        AKQTV++AWGSVKDTTQNIK+KVVGKAEESKEAIKDTA+NIK  IKSNC
Subjt:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC

XP_022152517.1 uncharacterized protein At4g13230 [Momordica charantia] (e value: 1.1e-64; % identity: 99.33)
Query:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK
        MASSLASAATI KLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK
Subjt:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC
        AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC
Subjt:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC

XP_038897424.1 uncharacterized protein At4g13230 [Benincasa hispida] (e value: 3.0e-51; % identity: 83.22)
Query:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK
        MAS LA +AT+PKL+RA S+ VR+PWSS+VFVSVRP+H SS    AAAAADGAAQ VKRATEKAAE+VKDKA SAA+EVTQKTKEVAGKVSETAQDLAGK
Subjt:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC
        AKQTVQ+AWGSVKDTTQNIKEKVVGKAEESKEAIKDTADN+K  IK+NC
Subjt:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K9U4 Uncharacterized protein (e value: 4.0e-49; % identity: 79.19)
Query:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK
        MAS LA++ATIPKL RAGS++V +PWSS VFVS+RPRH SS     AAAAD AA  VKR TE+AAEEVKDKA SAA+EVTQ+TKEVAGKVSETAQD+AGK
Subjt:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC
        AKQTV++AWGSVKDTTQNIK+KVVGKAEESKEAIKDTA+NIK  IKSNC
Subjt:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC

A0A1Q3BS03 Uncharacterized protein (Fragment) (e value: 4.3e-27; % identity: 59.7)
Query:  SVLVRSPWSSRVFVSVRPR--HASSSQFAAAAAADGAAQGVKRA---TEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVK
        +VL+R+PWSSRVFV V  R  HAS S+  A++A D A Q  K+     +K  E+VKDKA+S AE+VT+K ++ A KV ETAQDL  KAKQTVQ+AWGSVK
Subjt:  SVLVRSPWSSRVFVSVRPR--HASSSQFAAAAAADGAAQGVKRA---TEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVK

Query:  DTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKS
        DTT  IK+ VVGKAEESKE+IK+TA+N+K  +K+
Subjt:  DTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKS

A0A1S3C7D4 uncharacterized protein At4g13230 (e value: 6.5e-52; % identity: 81.21)
Query:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK
        MAS LA++AT+PKL RAGS++V +PWSS+VFVS+RPRH SS     +AAADGAAQ VKR TEKAAEEVKDKA SAAEEVTQKTK+VAGKVSETAQD+AGK
Subjt:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC
        AKQTV++AWGSVKDTTQNIKEKVVGKAEESKEAIKDTA+NIK+ IKSNC
Subjt:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC

A0A5B7AK94 Uncharacterized protein (Fragment) (e value: 3.3e-27; % identity: 54.19)
Query:  SSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRP-------RHASSSQFAAA--AAADGAAQGVKRA------TEKAAEEVKDKASSAAEEVTQKTKEVA
        +S+A   T+ K      + VRSPWSSRVFV + P       RH S++   +A   AAD A QG   A       +K  +++K+KA+S AE+VT K+K+VA
Subjt:  SSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRP-------RHASSSQFAAA--AAADGAAQGVKRA------TEKAAEEVKDKASSAAEEVTQKTKEVA

Query:  GKVSETAQDLAGKAKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIK
        GKVSETAQDLA KAKQT Q+AWGSVKDTTQ IKE V GKAEESK++IKD+A+N+K
Subjt:  GKVSETAQDLAGKAKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIK

A0A6J1DGG7 uncharacterized protein At4g13230 (e value: 5.1e-65; % identity: 99.33)
Query:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK
        MASSLASAATI KLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK
Subjt:  MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC
        AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC
Subjt:  AKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC

SwissProt top hits (e value, % identity, alignment)
Q8LFD5 Uncharacterized protein At4g13230 (e value: 5.0e-09; % identity: 45.35)
Query:  VKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIK
        V   T    + V DKA+ A ++V +K  E A  +S+ A +L  KAK T +EAW  VKDTT+ IK+ V GK EE+KE+IK TA  ++
Subjt:  VKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIK

Arabidopsis top hits (e value, % identity, alignment)
AT1G04670.1 unknown protein (e value: 2.9e-04; % identity: 33.33)
Query:  AATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFA-----AAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAK
        A+ + K+  +     RS    R FVS  PR     + A        AA+   +G K+  E   E ++D AS+ A  VT+ TK+V  KV+ET   +  KAK
Subjt:  AATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFA-----AAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQEAWGSVKDTTQNIKEKVVG
         +V    G+ K+ T  IK K++G
Subjt:  QTVQEAWGSVKDTTQNIKEKVVG

AT4G13230.1 Late embryogenesis abundant protein (LEA) family protein (e value: 3.6e-10; % identity: 45.35)
Query:  VKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIK
        V   T    + V DKA+ A ++V +K  E A  +S+ A +L  KAK T +EAW  VKDTT+ IK+ V GK EE+KE+IK TA  ++
Subjt:  VKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADNIK

AT4G21020.1 Late embryogenesis abundant protein (LEA) family protein (e value: 3.8e-04; % identity: 33.02)
Query:  SSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADN
        + ++ A AA D A +G  +A +K A E K++A   A E  +K K+ A    E A+D A + K  V E      D  ++ KEK    AE++ +  K+ A +
Subjt:  SSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQNIKEKVVGKAEESKEAIKDTADN

Query:  IKDKIK
         K+K+K
Subjt:  IKDKIK


Sequences
CDS sequence
ATGGCGAGCAGCTTAGCCTCAGCAGCAACTATTCCGAAGCTTAATCGTGCTGGATCTGTTCTTGTGCGCAGCCCATGGAGCTCTAGGGTTTTTGTCTCTGTCAGACCAAG
ACATGCAAGTTCTTCACAGTTTGCTGCAGCAGCAGCAGCTGATGGAGCGGCTCAGGGTGTGAAGCGAGCCACAGAGAAGGCTGCCGAAGAAGTCAAAGATAAGGCTTCCT
CTGCCGCCGAAGAGGTGACTCAGAAAACAAAGGAAGTTGCAGGGAAAGTGAGTGAAACAGCACAGGATTTGGCTGGGAAGGCAAAGCAAACAGTTCAAGAAGCATGGGGA
TCTGTGAAGGATACAACTCAGAACATCAAAGAGAAGGTTGTTGGCAAAGCTGAGGAATCCAAAGAAGCCATTAAAGACACTGCAGACAACATCAAAGACAAAATCAAGTC
AAACTGT
mRNA sequence
ATGGCGAGCAGCTTAGCCTCAGCAGCAACTATTCCGAAGCTTAATCGTGCTGGATCTGTTCTTGTGCGCAGCCCATGGAGCTCTAGGGTTTTTGTCTCTGTCAGACCAAG
ACATGCAAGTTCTTCACAGTTTGCTGCAGCAGCAGCAGCTGATGGAGCGGCTCAGGGTGTGAAGCGAGCCACAGAGAAGGCTGCCGAAGAAGTCAAAGATAAGGCTTCCT
CTGCCGCCGAAGAGGTGACTCAGAAAACAAAGGAAGTTGCAGGGAAAGTGAGTGAAACAGCACAGGATTTGGCTGGGAAGGCAAAGCAAACAGTTCAAGAAGCATGGGGA
TCTGTGAAGGATACAACTCAGAACATCAAAGAGAAGGTTGTTGGCAAAGCTGAGGAATCCAAAGAAGCCATTAAAGACACTGCAGACAACATCAAAGACAAAATCAAGTC
AAACTGT
Protein sequence
MASSLASAATIPKLNRAGSVLVRSPWSSRVFVSVRPRHASSSQFAAAAAADGAAQGVKRATEKAAEEVKDKASSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWG
SVKDTTQNIKEKVVGKAEESKEAIKDTADNIKDKIKSNC
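
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS. The Python sketch below is one possible way to do that and is not part of the database record. It assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids."""
    # Remove line breaks and other whitespace so wrapped sequences can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 48 nucleotides of the CDS shown in this record.
cds_prefix = "ATGGCGAGCAGCTTAGCCTCAGCAGCAACTATTCCGAAGCTTAATCGT"
print(translate(cds_prefix))  # MASSLASAATIPKLNR
```

The printed residues match the start of the protein sequence in this record. Pasting the full CDS into `translate` should reproduce the whole protein, since the listed CDS is 447 nucleotides long and the protein is 149 residues.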