; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G09680 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G09680
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLate embryogenesis abundant protein (LEA) family protein
Genome locationClcChr09:8367228..8369194
RNA-Seq ExpressionClc09G09680
SyntenyClc09G09680
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAV70662.1 hypothetical protein CFOL_v3_14160, partial [Cephalotus follicularis]3.3e-2657.89Show/hide
Query:  SILVPTPWSSKVFVIVRPRH----GSSKVAAAATDGAAQVVKRA---TEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVK
        ++L+  PWSS+VFV V  R     GS + A++A D A Q  K+     +K  E+VKDKA S AE+VT+K ++ A KV ETAQDL  KAKQTVQDAWGSVK
Subjt:  SILVPTPWSSKVFVIVRPRH----GSSKVAAAATDGAAQVVKRA---TEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVK

Query:  DTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIK
        DTT  IK+ VVGKAEESKE+IK+TA+N+K+N+K
Subjt:  DTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIK

XP_008457975.1 PREDICTED: uncharacterized protein At4g13230 [Cucumis melo]1.8e-5686.39Show/hide
Query:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MASFLA SAT+PKL RAGSI+V  PWSSKVFV +RPRHGSS+V+AAA DGAAQ VKR TEKAAEEVKDKAVSAAEEVTQKTK+VAGKVSETAQD+AGKAK
Subjt:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC
        QTV+DAWGSVKDTTQNIKEKVVGKAEESKEAIKDTA+N+K NIK+NC
Subjt:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC

XP_011659168.1 uncharacterized protein At4g13230 [Cucumis sativus]2.0e-5583.67Show/hide
Query:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MASFLA SATIPKL RAGSI+V  PWSS VFV +RPRHGSS+V AAA D AA VVKR TE+AAEEVKDKAVSAA+EVTQ+TKEVAGKVSETAQD+AGKAK
Subjt:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC
        QTV+DAWGSVKDTTQNIK+KVVGKAEESKEAIKDTA+N+K+NIK+NC
Subjt:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC

XP_022152517.1 uncharacterized protein At4g13230 [Momordica charantia]9.2e-5383.89Show/hide
Query:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSK--VAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGK
        MAS LA +ATI KL+RAGS+LV +PWSS+VFV VRPRH SS    AAAA DGAAQ VKRATEKAAEEVKDKA SAAEEVTQKTKEVAGKVSETAQDLAGK
Subjt:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSK--VAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC
        AKQTVQ+AWGSVKDTTQNIKEKVVGKAEESKEAIKDTA N+K  IK+NC
Subjt:  AKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC

XP_038897424.1 uncharacterized protein At4g13230 [Benincasa hispida]2.0e-6091.84Show/hide
Query:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MASFLAPSAT+PKLHRA SI V TPWSSKVFV VRP+H SSKVAAAA DGAAQVVKRATEKAAE+VKDKAVSAA+EVTQKTKEVAGKVSETAQDLAGKAK
Subjt:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC
        QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTA NLK+NIKTNC
Subjt:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC

TrEMBL top hitse value%identityAlignment
A0A0A0K9U4 Uncharacterized protein9.6e-5683.67Show/hide
Query:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MASFLA SATIPKL RAGSI+V  PWSS VFV +RPRHGSS+V AAA D AA VVKR TE+AAEEVKDKAVSAA+EVTQ+TKEVAGKVSETAQD+AGKAK
Subjt:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC
        QTV+DAWGSVKDTTQNIK+KVVGKAEESKEAIKDTA+N+K+NIK+NC
Subjt:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC

A0A1Q3BS03 Uncharacterized protein (Fragment)1.6e-2657.89Show/hide
Query:  SILVPTPWSSKVFVIVRPRH----GSSKVAAAATDGAAQVVKRA---TEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVK
        ++L+  PWSS+VFV V  R     GS + A++A D A Q  K+     +K  E+VKDKA S AE+VT+K ++ A KV ETAQDL  KAKQTVQDAWGSVK
Subjt:  SILVPTPWSSKVFVIVRPRH----GSSKVAAAATDGAAQVVKRA---TEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVK

Query:  DTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIK
        DTT  IK+ VVGKAEESKE+IK+TA+N+K+N+K
Subjt:  DTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIK

A0A1S3C7D4 uncharacterized protein At4g132308.7e-5786.39Show/hide
Query:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MASFLA SAT+PKL RAGSI+V  PWSSKVFV +RPRHGSS+V+AAA DGAAQ VKR TEKAAEEVKDKAVSAAEEVTQKTK+VAGKVSETAQD+AGKAK
Subjt:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC
        QTV+DAWGSVKDTTQNIKEKVVGKAEESKEAIKDTA+N+K NIK+NC
Subjt:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC

A0A5B7AK94 Uncharacterized protein (Fragment)2.1e-2653.09Show/hide
Query:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRP-------RHGS--------SKVAAAATDGAAQVVKRATE--KAAEEVKDKAVSAAEEVTQKTKE
        MAS +A + T+ K      I V +PWSS+VFV + P       RH S        +K A AA  GA    + A E  K  +++K+KA S AE+VT K+K+
Subjt:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRP-------RHGS--------SKVAAAATDGAAQVVKRATE--KAAEEVKDKAVSAAEEVTQKTKE

Query:  VAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKT
        VAGKVSETAQDLA KAKQT QDAWGSVKDTTQ IKE V GKAEESK++IKD+A+N+K+++ T
Subjt:  VAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKT

A0A6J1DGG7 uncharacterized protein At4g132304.5e-5383.89Show/hide
Query:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSK--VAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGK
        MAS LA +ATI KL+RAGS+LV +PWSS+VFV VRPRH SS    AAAA DGAAQ VKRATEKAAEEVKDKA SAAEEVTQKTKEVAGKVSETAQDLAGK
Subjt:  MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSK--VAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC
        AKQTVQ+AWGSVKDTTQNIKEKVVGKAEESKEAIKDTA N+K  IK+NC
Subjt:  AKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC

SwissProt top hitse value%identityAlignment
Q8LFD5 Uncharacterized protein At4g132307.7e-1041.94Show/hide
Query:  QVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKT
        ++V   T    + V DKA  A ++V +K  E A  +S+ A +L  KAK T ++AW  VKDTT+ IK+ V GK EE+KE+IK TA+ +++++ T
Subjt:  QVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKT

Arabidopsis top hitse value%identityAlignment
AT4G13230.1 Late embryogenesis abundant protein (LEA) family protein5.4e-1141.94Show/hide
Query:  QVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKT
        ++V   T    + V DKA  A ++V +K  E A  +S+ A +L  KAK T ++AW  VKDTT+ IK+ V GK EE+KE+IK TA+ +++++ T
Subjt:  QVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGCTTCTTGGCTCCATCAGCAACTATTCCAAAGCTTCATCGTGCTGGATCTATTCTTGTACCTACCCCATGGAGTTCTAAAGTTTTTGTCATTGTCAGACCAAG
ACATGGAAGTTCAAAGGTTGCTGCCGCAGCAACTGATGGAGCTGCTCAAGTTGTGAAGCGAGCCACTGAGAAGGCTGCCGAAGAAGTCAAAGATAAGGCTGTCTCTGCCG
CTGAAGAGGTGACTCAAAAAACAAAGGAAGTGGCAGGGAAGGTATCTGAAACAGCACAAGATTTGGCTGGGAAGGCAAAGCAAACAGTTCAAGATGCATGGGGTTCAGTG
AAGGATACAACTCAAAACATCAAAGAGAAAGTGGTTGGCAAAGCTGAAGAATCCAAAGAAGCCATTAAAGACACTGCACAGAACCTCAAACAGAATATCAAGACAAATTG
TTGA
mRNA sequenceShow/hide mRNA sequence
CTTCAATATTTCAAAGTCTTAAAACAAAATATTCATAAGCTGAGATCCGATCATTAAAACAGAGAAAAAAAAAATTTACATTGAAGAATGGCAAGCTTCTTGGCTCCATC
AGCAACTATTCCAAAGCTTCATCGTGCTGGATCTATTCTTGTACCTACCCCATGGAGTTCTAAAGTTTTTGTCATTGTCAGACCAAGACATGGAAGTTCAAAGGTTGCTG
CCGCAGCAACTGATGGAGCTGCTCAAGTTGTGAAGCGAGCCACTGAGAAGGCTGCCGAAGAAGTCAAAGATAAGGCTGTCTCTGCCGCTGAAGAGGTGACTCAAAAAACA
AAGGAAGTGGCAGGGAAGGTATCTGAAACAGCACAAGATTTGGCTGGGAAGGCAAAGCAAACAGTTCAAGATGCATGGGGTTCAGTGAAGGATACAACTCAAAACATCAA
AGAGAAAGTGGTTGGCAAAGCTGAAGAATCCAAAGAAGCCATTAAAGACACTGCACAGAACCTCAAACAGAATATCAAGACAAATTGTTGATTCCTATTTTTCTCTACTT
TAATTCCTTCACCTATTATTATTTTCCATAATGGTTCATTCCCCCATTTCATGGAAAATGAAAATGGGAATTTTTGTTTCTGGAGTATGGTTTTTTTGTTGTTGACTTTT
GAGCATGTTCAACTTTTTAGTCCATTAATTGATATTTGAAATAAAATGTCACTCTTTTCCTTTTCCATATTGTTGCTTATTTATTGCTTGCAAAGTTCAATTTAATCATT
GCTAATATATATGCTTC
Protein sequenceShow/hide protein sequence
MASFLAPSATIPKLHRAGSILVPTPWSSKVFVIVRPRHGSSKVAAAATDGAAQVVKRATEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSV
KDTTQNIKEKVVGKAEESKEAIKDTAQNLKQNIKTNC