; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G192330 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G192330
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionLate embryogenesis abundant protein, group 2
Genome locationCiama_Chr10:26867318..26871378
RNA-Seq ExpressionCaUC10G192330
SyntenyCaUC10G192330
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018082.1 hypothetical protein SDJN02_19948, partial [Cucurbita argyrosperma subsp. argyrosperma]6.5e-15883.38Show/hide
Query:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG----HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIV
        +M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPLPSDTFPNG    HHH NPTQEASRFTLSHYSSS GSNHG GTDNGEARL+V
Subjt:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG----HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIV

Query:  GCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGT
        G   G D  EE+ E  + +EE YYG+K+RGCWK YFTYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVTNPP P+ISVKV E+++FMLGEGVDKTGVGT
Subjt:  GCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGT

Query:  KILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWK
        KILTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIATSQGPR+YAESGTTTF+L+VGIS KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWK
Subjt:  KILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWK

Query:  IIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTSR
        II+P FHRRV+CLLV+  AYDRKRHTR FNSTCLTSR
Subjt:  IIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTSR

XP_004149613.1 uncharacterized protein LOC101209149 [Cucumis sativus]7.0e-16886.78Show/hide
Query:  MSEKSSLRHSL-RRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG--HHHCNPTQEASRFTLSHYSSSRGSNHG
        MSE++ L  +L   VME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN   HHH NPTQEASRFTLSHYSSSRGSNHG
Subjt:  MSEKSSLRHSL-RRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG--HHHCNPTQEASRFTLSHYSSSRGSNHG

Query:  AGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFM
        AGTDNGEARLIVG  NG DC+EEEEE E G+EEGYYGK+KRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPII+VKVGEIEEFM
Subjt:  AGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFM

Query:  LGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIR
        LGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIA SQGPR+YAESG T F+LSVG SNK MYGAGRDMEDKL+SG+GLELTIR
Subjt:  LGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIR

Query:  VNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS
        +NFISNYRVVWK I PHFHR V+CLL+L K YDR  HTRSFNSTC TS
Subjt:  VNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS

XP_008461795.2 PREDICTED: uncharacterized protein LOC103500312 [Cucumis melo]3.7e-16986.78Show/hide
Query:  MSEKSSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHG
        MSE++   H    VME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN    HHH NPTQEASRFTLSHYSSSRGSNHG
Subjt:  MSEKSSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHG

Query:  AGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFM
        AGTDNGEARLIVG  +GRDC EEEEED +G+EEGYYGK+KRGCWKRYFTYR+SDSNAWICLQLSWRAIFSMGIALLVFY+VTNPPSPIISVKVGEI+EFM
Subjt:  AGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFM

Query:  LGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIR
        LGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPR+YAESG T F LSVG SNK MYGAGR+MEDKL+SGMGLELTIR
Subjt:  LGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIR

Query:  VNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS
        +NFISNYRVVWK I PHFHR V+CLL+LGKAYDRKRHT SFNSTC TS
Subjt:  VNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS

XP_022934269.1 uncharacterized protein LOC111441481 [Cucurbita moschata]3.8e-15883.58Show/hide
Query:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG
        +M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPLPSDTFPNG   HHH NPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG
Subjt:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG

Query:  CSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTK
           G D  EE++E  + +EE YYG+K+RGCWK YFTYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVTNPP P+ISVKV E++EFMLGEGVDKTGVGTK
Subjt:  CSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTK

Query:  ILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKI
        ILTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIATSQGPR+YAESGTTTF+L+VGIS KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKI
Subjt:  ILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKI

Query:  IRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS
        I+P FHRRV+CLLV+   YDRKRHTR FNSTCLTS
Subjt:  IRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS

XP_038905771.1 uncharacterized protein LOC120091726 [Benincasa hispida]5.1e-18794.27Show/hide
Query:  MSEKSSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG----HHHCNPTQEASRFTLSHYSSSRGSNH
        MSEKSSLRHSLRRVMEAAEEQEAVLFHSYPC+YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG    HHH NPTQEASRFTLSHYSSSRGSNH
Subjt:  MSEKSSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG----HHHCNPTQEASRFTLSHYSSSRGSNH

Query:  GAGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEF
        GAGTDNGE RLIVG  NGRDC+EE+E DEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEF
Subjt:  GAGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEF

Query:  MLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTI
        MLGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPR+YAESGTTTF LSVG SNKPMYGAGRDMEDKLESGMGLELTI
Subjt:  MLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTI

Query:  RVNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS
        R+NFISNYRVVWK IRPHFHR VECLLVLGKAYDRKRHTRSFNSTCL S
Subjt:  RVNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS

TrEMBL top hitse value%identityAlignment
A0A0A0LD21 Uncharacterized protein3.4e-16886.78Show/hide
Query:  MSEKSSLRHSL-RRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG--HHHCNPTQEASRFTLSHYSSSRGSNHG
        MSE++ L  +L   VME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN   HHH NPTQEASRFTLSHYSSSRGSNHG
Subjt:  MSEKSSLRHSL-RRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG--HHHCNPTQEASRFTLSHYSSSRGSNHG

Query:  AGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFM
        AGTDNGEARLIVG  NG DC+EEEEE E G+EEGYYGK+KRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPII+VKVGEIEEFM
Subjt:  AGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFM

Query:  LGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIR
        LGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIA SQGPR+YAESG T F+LSVG SNK MYGAGRDMEDKL+SG+GLELTIR
Subjt:  LGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIR

Query:  VNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS
        +NFISNYRVVWK I PHFHR V+CLL+L K YDR  HTRSFNSTC TS
Subjt:  VNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS

A0A1S3CFE9 uncharacterized protein LOC1035003121.8e-16986.78Show/hide
Query:  MSEKSSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHG
        MSE++   H    VME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN    HHH NPTQEASRFTLSHYSSSRGSNHG
Subjt:  MSEKSSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHG

Query:  AGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFM
        AGTDNGEARLIVG  +GRDC EEEEED +G+EEGYYGK+KRGCWKRYFTYR+SDSNAWICLQLSWRAIFSMGIALLVFY+VTNPPSPIISVKVGEI+EFM
Subjt:  AGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFM

Query:  LGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIR
        LGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPR+YAESG T F LSVG SNK MYGAGR+MEDKL+SGMGLELTIR
Subjt:  LGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIR

Query:  VNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS
        +NFISNYRVVWK I PHFHR V+CLL+LGKAYDRKRHT SFNSTC TS
Subjt:  VNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS

A0A6J1DGR2 uncharacterized protein LOC1110203362.7e-15786.49Show/hide
Query:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG-HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCS
        MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPL SDTFP G HHH N TQEASR TLS YSSSR SNHGAGTDNGEARLIVG  
Subjt:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG-HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCS

Query:  NGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKIL
        NGR+ DEE EED  GDEEGYYGKK+RGCWK YFTYRNSDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISVK+G +EEFMLGEGVDKTGVGTKIL
Subjt:  NGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKIL

Query:  TCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIR
        TCN TM+V VDN+SKLFGLHILPPSLH+SFGPLPIATSQG R+YAESGTTTFQLSVG SN+ MYGAGR MED LESGMGLEL IR+NFISNYRVVWKIIR
Subjt:  TCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIR

Query:  PHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS
        PHF  RVEC LVLGK YDRKRHTRSFNSTCLTS
Subjt:  PHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS

A0A6J1F239 uncharacterized protein LOC1114414811.9e-15883.58Show/hide
Query:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG
        +M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPLPSDTFPNG   HHH NPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG
Subjt:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG

Query:  CSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTK
           G D  EE++E  + +EE YYG+K+RGCWK YFTYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVTNPP P+ISVKV E++EFMLGEGVDKTGVGTK
Subjt:  CSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTK

Query:  ILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKI
        ILTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIATSQGPR+YAESGTTTF+L+VGIS KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKI
Subjt:  ILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKI

Query:  IRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS
        I+P FHRRV+CLLV+   YDRKRHTR FNSTCLTS
Subjt:  IRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS

A0A6J1J6W9 uncharacterized protein LOC1114819094.6e-15783.28Show/hide
Query:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG
        +M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG   HHH N TQEASRFTLSHYSSS GSNHG GTDNGEARL+VG
Subjt:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG

Query:  CSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTK
           G D  EE+ E  + +EE YYGKK+RGCWK YFTYRNSD+NAWICLQLSWRA+FSMG+ALLVFYIVTNPP PIISV+V E++EFMLGEGVDKTGVGTK
Subjt:  CSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTK

Query:  ILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKI
        ILTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIATSQGPR+YAESGTTTF+L+VG S KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKI
Subjt:  ILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKI

Query:  IRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS
        I+P FHR V+CLLV+  AYDRKRHTR FNSTCLTS
Subjt:  IRPHFHRRVECLLVLGKAYDRKRHTRSFNSTCLTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45688.1 unknown protein6.4e-1024.53Show/hide
Query:  YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSS--RGSNHGAGTDNGEARLIVGCSNGRDCDEEEEED--EDGD
        YYVQSPS  SH +         S+   SP+ S      H H +  + +   + S +S S   GS      D  + +   G    ++C   EEE   +DGD
Subjt:  YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSS--RGSNHGAGTDNGEARLIVGCSNGRDCDEEEEED--EDGD

Query:  EEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSM--GIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHS
         +G  G  +R                  C  L++   F +  G   L+ Y    P  P I+VK    E   +  G D  GVGT ++T N T+ ++  N  
Subjt:  EEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSM--GIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHS

Query:  KLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNK-PMYGAGRDM----------EDKLESGMGLEL----------TIRVNFISNYR
          FG+H+    + +SF  + I +    + Y    +    L   I  K P+YG+G  +          + K + G  + +           + ++F+   R
Subjt:  KLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNK-PMYGAGRDM----------EDKLESGMGLEL----------TIRVNFISNYR

Query:  --VVWKIIRPHFHRRVEC
          V+ K+++P F++++EC
Subjt:  --VVWKIIRPHFHRRVEC

AT3G08490.1 BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant protein, group 2 (TAIR:AT3G24600.1)7.2e-5452.82Show/hide
Query:  KRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMS
        KR      S+S+ WI LQ+ WR +FS+G+ALLVFYI T PP P IS ++G   +FML EGVD  GV TK LT NC+  +I+DN S +FGLHI PPS+   
Subjt:  KRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMS

Query:  FGPLPIATSQGPRMYAES-GTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHT
        FGPL  A +QGP++Y  S  +TTFQL +  +N+ MYGAG +M D L S  GL L +R + IS+YRVVW II P +H +VECLL+L    D++RH+
Subjt:  FGPLPIATSQGPRMYAES-GTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRKRHT

AT3G24600.1 Late embryogenesis abundant protein, group 21.9e-1429.45Show/hide
Query:  VFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQL-SVGISN
        V +  ++P SPI+SVK  +I  F  GEG+D+TGV TKIL+ N ++ V +D+ +  FG+H+   +  ++F  L +AT Q    Y    +    +  +  + 
Subjt:  VFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQL-SVGISN

Query:  KPMYGAGRDMEDKLESG---MGLELTIRVNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRK
         P+YGAG  +    + G   + LE  IR    S   ++ K+++      V C   +  +   K
Subjt:  KPMYGAGRDMEDKLESG---MGLELTIRVNFISNYRVVWKIIRPHFHRRVECLLVLGKAYDRK

AT4G35170.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.9e-0722.76Show/hide
Query:  NPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLV
        N   + S F   H+S +  S++     +G  R         D D    EDED DE     +K+R   + Y               L +  + +  +  L+
Subjt:  NPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLV

Query:  FYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQ-GPRMYAESGTTTFQLSVGISNK
         + V+   +PI ++K   +E   +  G D++GV T +LT N T+ ++  N +  F +H+    L +S+  L +A+ Q G            +  V     
Subjt:  FYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQ-GPRMYAESGTTTFQLSVGISNK

Query:  PMYG------AGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIRPHFHRRVECLLV-----LGKAYD
        P+YG        R   D++   + L  T+R    +   V+ ++++  FH  ++C +      LGK  D
Subjt:  PMYG------AGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIRPHFHRRVECLLV-----LGKAYD

AT5G42860.1 unknown protein4.4e-1124.68Show/hide
Query:  AYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEG
        AY+VQSPS  SH       +   +    SP+ S   P+ H        +SRF  S  + S+   H      GE +  +         EEE   +DGD   
Subjt:  AYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCSNGRDCDEEEEEDEDGDEEG

Query:  YYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIAL--LVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLF
                        R  ++    C  L++   FS+  A   L+ Y    P  P ISVK    E+  +  G D  G+GT ++T N T+ ++  N    F
Subjt:  YYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIAL--LVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLF

Query:  GLHILPPSLHMSFGPLPIATSQGPRMY-AESGTTTFQLSVGISNKPMYGAGRDM----------EDKLESG---------MGLELTIRVNFISNYR--VV
        G+H+    + +SF  + I +    + Y +     T  ++V     P+YG+G  +          + K + G             + +R+NF    R  V+
Subjt:  GLHILPPSLHMSFGPLPIATSQGPRMY-AESGTTTFQLSVGISNKPMYGAGRDM----------EDKLESG---------MGLELTIRVNFISNYR--VV

Query:  WKIIRPHFHRRVECLL
         K+++P F++R+ CL+
Subjt:  WKIIRPHFHRRVECLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGAGAAATCATCTCTCCGCCACTCTCTCCGCCGTGTGATGGAGGCTGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTCTTATCCATGTGCTTATTACGTACAAAG
CCCCTCCACCCTCTCTCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCCTCCGCTTGCCACTCGCCCCTCCCCTCCGACACCTTCCCCAACGGCCACCACCACT
GCAACCCGACCCAGGAAGCCTCTCGCTTCACCCTCTCCCACTACTCATCCTCCCGGGGCTCGAACCATGGGGCCGGGACCGACAATGGTGAGGCTCGTCTGATAGTCGGT
TGCAGCAATGGTCGAGATTGCGATGAGGAGGAGGAGGAGGACGAGGACGGAGACGAGGAAGGGTATTATGGGAAGAAAAAAAGAGGTTGTTGGAAGAGGTATTTTACGTA
TAGGAATTCGGATTCTAATGCATGGATTTGCTTGCAGTTGAGTTGGAGAGCAATTTTCAGTATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAACCCTCCCTCAC
CAATCATTTCTGTTAAGGTGGGAGAAATAGAAGAGTTTATGCTAGGGGAAGGAGTGGACAAAACAGGGGTTGGAACTAAGATTCTAACATGCAATTGCACAATGAATGTA
ATTGTGGACAACCATTCTAAGCTTTTTGGCCTTCACATCCTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTCCCTATTGCCACTTCACAAGGTCCAAGAATGTATGC
TGAGAGTGGAACGACGACGTTTCAATTAAGCGTGGGCATTAGCAATAAGCCAATGTACGGTGCGGGGAGGGACATGGAAGACAAGCTTGAATCAGGGATGGGATTGGAGC
TTACAATTCGAGTCAATTTCATTTCAAATTATAGAGTAGTTTGGAAAATCATAAGGCCCCACTTTCATCGTCGTGTCGAATGCTTATTGGTCCTCGGAAAAGCCTATGAT
AGGAAGCGTCACACCCGATCCTTCAATAGTACTTGCCTAACTTCTCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGAGAAATCATCTCTCCGCCACTCTCTCCGCCGTGTGATGGAGGCTGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTCTTATCCATGTGCTTATTACGTACAAAG
CCCCTCCACCCTCTCTCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCCTCCGCTTGCCACTCGCCCCTCCCCTCCGACACCTTCCCCAACGGCCACCACCACT
GCAACCCGACCCAGGAAGCCTCTCGCTTCACCCTCTCCCACTACTCATCCTCCCGGGGCTCGAACCATGGGGCCGGGACCGACAATGGTGAGGCTCGTCTGATAGTCGGT
TGCAGCAATGGTCGAGATTGCGATGAGGAGGAGGAGGAGGACGAGGACGGAGACGAGGAAGGGTATTATGGGAAGAAAAAAAGAGGTTGTTGGAAGAGGTATTTTACGTA
TAGGAATTCGGATTCTAATGCATGGATTTGCTTGCAGTTGAGTTGGAGAGCAATTTTCAGTATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAACCCTCCCTCAC
CAATCATTTCTGTTAAGGTGGGAGAAATAGAAGAGTTTATGCTAGGGGAAGGAGTGGACAAAACAGGGGTTGGAACTAAGATTCTAACATGCAATTGCACAATGAATGTA
ATTGTGGACAACCATTCTAAGCTTTTTGGCCTTCACATCCTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTCCCTATTGCCACTTCACAAGGTCCAAGAATGTATGC
TGAGAGTGGAACGACGACGTTTCAATTAAGCGTGGGCATTAGCAATAAGCCAATGTACGGTGCGGGGAGGGACATGGAAGACAAGCTTGAATCAGGGATGGGATTGGAGC
TTACAATTCGAGTCAATTTCATTTCAAATTATAGAGTAGTTTGGAAAATCATAAGGCCCCACTTTCATCGTCGTGTCGAATGCTTATTGGTCCTCGGAAAAGCCTATGAT
AGGAAGCGTCACACCCGATCCTTCAATAGTACTTGCCTAACTTCTCGATGATCATTGTGAACCAACAAATTTTGCTTTTTAATGATAAAAAAAA
Protein sequenceShow/hide protein sequence
MSEKSSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG
CSNGRDCDEEEEEDEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMNV
IVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIRPHFHRRVECLLVLGKAYD
RKRHTRSFNSTCLTSR