CuGenDBv2

PI0004692 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0004692
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Late embryogenesis abundant protein (LEA) family protein
Genome location: chr01:20027371..20029309
RNA-Seq Expression: PI0004692
Synteny: PI0004692
Gene Ontology terms: NA
InterPro domains: NA
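
The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr01:20027371..20029309")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.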


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
GAV70662.1 hypothetical protein CFOL_v3_14160, partial [Cephalotus follicularis] | e value: 8.7e-27 | % identity: 55.97
Query:  SIIVHNPWSSKVFVSLRPRHV----SSQISAAAADGAAQVVKR---TTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSVK
        ++++ NPWSS+VFV +  R +    S Q +++A D A Q  K+     +K  E+VKDKA S AE+VT+K ++ A KV ETAQDL  KAKQTV+DAWGSVK
Subjt:  SIIVHNPWSSKVFVSLRPRHV----SSQISAAAADGAAQVVKR---TTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSVK

Query:  DTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKS
        DTT  IK+ VVGKAEESKE+IK+TAEN+K N+K+
Subjt:  DTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKS

XP_008457975.1 PREDICTED: uncharacterized protein At4g13230 [Cucumis melo] | e value: 1.3e-62 | % identity: 94.56
Query:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAK
        MASFLAT ATLPKLQRAGSIIVHNPWSSKVFVSLRPRH SSQ+S AAADGAAQ VKRTTEKAAEEVKDKAVSAAEEVTQKTK+VAGKV+ETAQD+AGKAK
Subjt:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAK

Query:  QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC
        QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC
Subjt:  QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC

XP_011659168.1 uncharacterized protein At4g13230 [Cucumis sativus] | e value: 1.6e-60 | % identity: 89.8
Query:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAK
        MASFLAT AT+PKLQRAGSIIVHNPWSS VFVSLRPRH SSQ++AAAAD AA VVKRTTE+AAEEVKDKAVSAA+EVTQ+TKEVAGKV+ETAQD+AGKAK
Subjt:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAK

Query:  QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC
        QTVEDAWGSVKDTTQNIK+KVVGKAEESKEAIKDTAENIK NIKSNC
Subjt:  QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC

XP_022152517.1 uncharacterized protein At4g13230 [Momordica charantia] | e value: 9.2e-53 | % identity: 81.88
Query:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQ--ISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGK
        MAS LA+ AT+ KL RAGS++V +PWSS+VFVS+RPRH SS    +AAAADGAAQ VKR TEKAAEEVKDKA SAAEEVTQKTKEVAGKV+ETAQDLAGK
Subjt:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQ--ISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGK

Query:  AKQTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC
        AKQTV++AWGSVKDTTQNIKEKVVGKAEESKEAIKDTA+NIK+ IKSNC
Subjt:  AKQTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC

XP_038897424.1 uncharacterized protein At4g13230 [Benincasa hispida] | e value: 3.1e-56 | % identity: 85.03
Query:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAK
        MASFLA  ATLPKL RA SI V  PWSSKVFVS+RP+H SS+++AAAADGAAQVVKR TEKAAE+VKDKAVSAA+EVTQKTKEVAGKV+ETAQDLAGKAK
Subjt:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAK

Query:  QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC
        QTV+DAWGSVKDTTQNIKEKVVGKAEESKEAIKDTA+N+K NIK+NC
Subjt:  QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0K9U4 Uncharacterized protein | e value: 7.6e-61 | % identity: 89.8
Query:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAK
        MASFLAT AT+PKLQRAGSIIVHNPWSS VFVSLRPRH SSQ++AAAAD AA VVKRTTE+AAEEVKDKAVSAA+EVTQ+TKEVAGKV+ETAQD+AGKAK
Subjt:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAK

Query:  QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC
        QTVEDAWGSVKDTTQNIK+KVVGKAEESKEAIKDTAENIK NIKSNC
Subjt:  QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC

A0A1Q3BS03 Uncharacterized protein (Fragment) | e value: 4.2e-27 | % identity: 55.97
Query:  SIIVHNPWSSKVFVSLRPRHV----SSQISAAAADGAAQVVKR---TTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSVK
        ++++ NPWSS+VFV +  R +    S Q +++A D A Q  K+     +K  E+VKDKA S AE+VT+K ++ A KV ETAQDL  KAKQTV+DAWGSVK
Subjt:  SIIVHNPWSSKVFVSLRPRHV----SSQISAAAADGAAQVVKR---TTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSVK

Query:  DTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKS
        DTT  IK+ VVGKAEESKE+IK+TAEN+K N+K+
Subjt:  DTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKS

A0A1S3C7D4 uncharacterized protein At4g13230 | e value: 6.2e-63 | % identity: 94.56
Query:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAK
        MASFLAT ATLPKLQRAGSIIVHNPWSSKVFVSLRPRH SSQ+S AAADGAAQ VKRTTEKAAEEVKDKAVSAAEEVTQKTK+VAGKV+ETAQD+AGKAK
Subjt:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAK

Query:  QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC
        QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC
Subjt:  QTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC

A0A5B7AK94 Uncharacterized protein (Fragment) | e value: 1.2e-26 | % identity: 50
Query:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRP-------RHVSSQISAAAADGAAQVVKR----------TTEKAAEEVKDKAVSAAEEVTQKTKE
        MAS +A   TL K Q    I V +PWSS+VFV + P       RH S+  + +A + AA    +            +K  +++K+KA S AE+VT K+K+
Subjt:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRP-------RHVSSQISAAAADGAAQVVKR----------TTEKAAEEVKDKAVSAAEEVTQKTKE

Query:  VAGKVNETAQDLAGKAKQTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKS
        VAGKV+ETAQDLA KAKQT +DAWGSVKDTTQ IKE V GKAEESK++IKD+AEN+K ++ +
Subjt:  VAGKVNETAQDLAGKAKQTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKS

A0A6J1DGG7 uncharacterized protein At4g13230 | e value: 4.5e-53 | % identity: 81.88
Query:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQ--ISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGK
        MAS LA+ AT+ KL RAGS++V +PWSS+VFVS+RPRH SS    +AAAADGAAQ VKR TEKAAEEVKDKA SAAEEVTQKTKEVAGKV+ETAQDLAGK
Subjt:  MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQ--ISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGK

Query:  AKQTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC
        AKQTV++AWGSVKDTTQNIKEKVVGKAEESKEAIKDTA+NIK+ IKSNC
Subjt:  AKQTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q8LFD5 Uncharacterized protein At4g13230 | e value: 4.5e-10 | % identity: 41.94
Query:  QVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKS
        ++V  TT    + V DKA  A ++V +K  E A  +++ A +L  KAK T E+AW  VKDTT+ IK+ V GK EE+KE+IK TA+ ++ ++ +
Subjt:  QVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKS

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G04670.1 unknown protein | e value: 6.4e-04 | % identity: 33.66
Query:  KVFVSLRPRHVSSQISAAAAD---GAAQVVK---RTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSVKDTTQNIKEKVV
        + FVS  PR +  +  A        AA+ VK   +  ++  E ++D A + A  VT+ TK+V  KV ET   +  KAK +V    G+ K+ T  IK K++
Subjt:  KVFVSLRPRHVSSQISAAAAD---GAAQVVK---RTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSVKDTTQNIKEKVV

Query:  G
        G
Subjt:  G

AT4G13230.1 Late embryogenesis abundant protein (LEA) family protein | e value: 3.2e-11 | % identity: 41.94
Query:  QVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKS
        ++V  TT    + V DKA  A ++V +K  E A  +++ A +L  KAK T E+AW  VKDTT+ IK+ V GK EE+KE+IK TA+ ++ ++ +
Subjt:  QVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKS
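
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a similar one, and a space for a mismatch or gap. The sketch below counts identities in one segment from such a line. It is only an illustration: the helper name `count_identities` is hypothetical, the example line is the second segment of the first GenBank hit, and the reported % identity of a hit covers the whole alignment, so a single segment will not reproduce it.

```python
def count_identities(midline: str) -> int:
    """Count columns where the match line carries a residue letter (identity)."""
    return sum(1 for ch in midline if ch.isalpha())


def percent_identity(midline: str) -> float:
    """Identities divided by the number of aligned columns in this segment."""
    if not midline:
        raise ValueError("empty match line")
    return 100.0 * count_identities(midline) / len(midline)


# Match line of one alignment segment, copied with its spacing intact.
segment = "DTT  IK+ VVGKAEESKE+IK+TAEN+K N+K+"
print(count_identities(segment), len(segment), round(percent_identity(segment), 2))
```

To approximate the tabulated value, the match lines of all segments of a hit would need to be concatenated first, keeping leading and trailing spaces.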


Sequences
CDS sequence
ATGGCAAGCTTTTTGGCAACACCAGCAACCCTTCCAAAGCTTCAACGTGCTGGATCTATTATTGTACACAACCCGTGGAGTTCTAAAGTTTTTGTCTCTCTCAGACCAAG
ACATGTAAGTTCACAGATTAGTGCAGCAGCAGCTGATGGAGCTGCTCAAGTTGTGAAGCGAACCACCGAGAAGGCCGCCGAAGAAGTCAAAGATAAGGCCGTCTCTGCTG
CCGAGGAGGTGACTCAAAAAACAAAAGAAGTCGCAGGGAAGGTGAATGAAACAGCACAAGATTTGGCTGGAAAGGCAAAGCAAACAGTAGAAGATGCATGGGGATCAGTG
AAGGATACAACTCAAAACATTAAAGAGAAAGTCGTTGGGAAAGCTGAAGAATCTAAAGAAGCCATTAAAGACACTGCAGAAAACATCAAAAACAATATCAAGTCAAACTG
TTGA
mRNA sequence
TTTCATCAGCTGAGATCTGATTATAAGTGTAAAAAAAAAGTGAACTAATTAATTAGAGATAAAAATAATAATTATTAGATTGAAGAATGGCAAGCTTTTTGGCAACACCA
GCAACCCTTCCAAAGCTTCAACGTGCTGGATCTATTATTGTACACAACCCGTGGAGTTCTAAAGTTTTTGTCTCTCTCAGACCAAGACATGTAAGTTCACAGATTAGTGC
AGCAGCAGCTGATGGAGCTGCTCAAGTTGTGAAGCGAACCACCGAGAAGGCCGCCGAAGAAGTCAAAGATAAGGCCGTCTCTGCTGCCGAGGAGGTGACTCAAAAAACAA
AAGAAGTCGCAGGGAAGGTGAATGAAACAGCACAAGATTTGGCTGGAAAGGCAAAGCAAACAGTAGAAGATGCATGGGGATCAGTGAAGGATACAACTCAAAACATTAAA
GAGAAAGTCGTTGGGAAAGCTGAAGAATCTAAAGAAGCCATTAAAGACACTGCAGAAAACATCAAAAACAATATCAAGTCAAACTGTTGAACATTATTCCTTAGATTTCT
TCTATATTATTATTACACTTGCATCTATTATATTTTCCATAATGCTTTTTCATTATGTTCCCCATTTCATACAAAATGGGAAACTTTTGTTTCTCAAGTTGGTTACCATA
TTCAACATTAGTGCCCATTAATTGGTATATTAAATAAAATCCTCACTATCTTCTTTC
Protein sequence
MASFLATPATLPKLQRAGSIIVHNPWSSKVFVSLRPRHVSSQISAAAADGAAQVVKRTTEKAAEEVKDKAVSAAEEVTQKTKEVAGKVNETAQDLAGKAKQTVEDAWGSV
KDTTQNIKEKVVGKAEESKEAIKDTAENIKNNIKSNC
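
The CDS and protein sequences can be checked against each other by translating the CDS. This is a minimal sketch assuming the standard nuclear genetic code, which the record does not state explicitly; `translate` is a hypothetical helper, and only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS sequence listed above.
print(translate("ATGGCAAGCTTTTTGGCAACACCAGCAACC"))
```

Running the same function on the full CDS, with line breaks removed, should give the listed protein sequence if the record is internally consistent.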