CuGenDBv2

CmUC10G193740 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC10G193740
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Late embryogenesis abundant protein, group 2
Genome location: CmU531Chr10:24899775..24904172
RNA-Seq Expression: CmUC10G193740
Synteny: CmUC10G193740
Gene Ontology terms: NA
InterPro domains: NA
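The genome location field above has the form sequence ID, colon, start, two dots, end. A minimal Python sketch for splitting such a string and computing the span length follows. The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Hypothetical helper, not a CuGenDB API: matches "<seqid>:<start>..<end>".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    # Normalize so that start <= end, in case the coordinates are reversed.
    if start > end:
        start, end = end, start
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CmU531Chr10:24899775..24904172")
print(seqid, start, end, span_length(start, end))
# prints: CmU531Chr10 24899775 24904172 4398
```

Under that assumption the gene spans 4398 bp on CmU531Chr10. If the coordinates were 0-based and half-open instead, the length would be `end - start`.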


Homology
GenBank top hits (e value, % identity, alignment)
KAG7018082.1 hypothetical protein SDJN02_19948, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.1e-158; identity: 82.74%)
Query:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG----HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIV
        +M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPLPSDTFPNG    HHH NPTQEASRFTLSHYSSS GSNHG GTDNGEARL+V
Subjt:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG----HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIV

Query:  GCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTK
        G  +G   +E+ E+ + +EE YYG+K+RGCWK YFTYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVTNPP P+ISVKV E+++FMLGEGVDKTGVGTK
Subjt:  GCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTK

Query:  ILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKI
        ILTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIATSQGPR+YAESGTTTF+L+VGIS KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKI
Subjt:  ILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKI

Query:  IRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTSR
        I+P FHRRV+C LV+  AYDRKRHTR FNSTCLTSR
Subjt:  IRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTSR

XP_004149613.1 uncharacterized protein LOC101209149 [Cucumis sativus] (e value: 1.2e-167; identity: 85.88%)
Query:  ISEKSSLRHSL-RRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG--HHHCNPTQEASRFTLSHYSSSRGSNHG
        +SE++ L  +L   VME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN   HHH NPTQEASRFTLSHYSSSRGSNHG
Subjt:  ISEKSSLRHSL-RRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG--HHHCNPTQEASRFTLSHYSSSRGSNHG

Query:  AGTDNGEARLIVGCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFML
        AGTDNGEARLIVG  NG DC+EEEEE +G+EEGYYGK+KRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPII+VKV EIEEFML
Subjt:  AGTDNGEARLIVGCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFML

Query:  GEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRV
        GEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIA SQGPR+YAESG T F+LSVG SNK MYGAGRDMEDKL+SG+GLELTIR+
Subjt:  GEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRV

Query:  NFISNYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS
        NFISNYRVVWK I PHFHR V+C L+L K YDR  HTRSFNSTC TS
Subjt:  NFISNYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS

XP_008461795.2 PREDICTED: uncharacterized protein LOC103500312 [Cucumis melo] (e value: 1.4e-168; identity: 86.3%)
Query:  SSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTD
        S   H    VME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN    HHH NPTQEASRFTLSHYSSSRGSNHGAGTD
Subjt:  SSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTD

Query:  NGEARLIVGCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGV
        NGEARLIVG  +GRDC+EEEE+ +G+EEGYYGK+KRGCWKRYFTYR+SDSNAWICLQLSWRAIFSMGIALLVFY+VTNPPSPIISVKV EI+EFMLGEGV
Subjt:  NGEARLIVGCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGV

Query:  DKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFIS
        DKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPR+YAESG T F LSVG SNK MYGAGR+MEDKL+SGMGLELTIR+NFIS
Subjt:  DKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFIS

Query:  NYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS
        NYRVVWK I PHFHR V+C L+LGKAYDRKRHT SFNSTC TS
Subjt:  NYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS

XP_022934269.1 uncharacterized protein LOC111441481 [Cucurbita moschata] (e value: 3.9e-158; identity: 82.93%)
Query:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG
        +M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPLPSDTFPNG   HHH NPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG
Subjt:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG

Query:  CSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKI
          +G   +E++E+ + +EE YYG+K+RGCWK YFTYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVTNPP P+ISVKV E++EFMLGEGVDKTGVGTKI
Subjt:  CSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKI

Query:  LTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKII
        LTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIATSQGPR+YAESGTTTF+L+VGIS KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKII
Subjt:  LTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS
        +P FHRRV+C LV+   YDRKRHTR FNSTCLTS
Subjt:  RPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS

XP_038905771.1 uncharacterized protein LOC120091726 [Benincasa hispida] (e value: 9.3e-184; identity: 93.12%)
Query:  ISEKSSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG----HHHCNPTQEASRFTLSHYSSSRGSNH
        +SEKSSLRHSLRRVMEAAEEQEAVLFHSYPC+YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG    HHH NPTQEASRFTLSHYSSSRGSNH
Subjt:  ISEKSSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG----HHHCNPTQEASRFTLSHYSSSRGSNH

Query:  GAGTDNGEARLIVGCSNGRDCDEEEE-EEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEF
        GAGTDNGE RLIVG  NGRDC+EE+E +EDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKV EIEEF
Subjt:  GAGTDNGEARLIVGCSNGRDCDEEEE-EEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEF

Query:  MLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTI
        MLGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPR+YAESGTTTF LSVG SNKPMYGAGRDMEDKLESGMGLELTI
Subjt:  MLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTI

Query:  RVNFISNYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS
        R+NFISNYRVVWK IRPHFHR VEC LVLGKAYDRKRHTRSFNSTCL S
Subjt:  RVNFISNYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LD21 Uncharacterized protein (e value: 5.9e-168; identity: 85.88%)
Query:  ISEKSSLRHSL-RRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG--HHHCNPTQEASRFTLSHYSSSRGSNHG
        +SE++ L  +L   VME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN   HHH NPTQEASRFTLSHYSSSRGSNHG
Subjt:  ISEKSSLRHSL-RRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG--HHHCNPTQEASRFTLSHYSSSRGSNHG

Query:  AGTDNGEARLIVGCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFML
        AGTDNGEARLIVG  NG DC+EEEEE +G+EEGYYGK+KRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPII+VKV EIEEFML
Subjt:  AGTDNGEARLIVGCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFML

Query:  GEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRV
        GEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIA SQGPR+YAESG T F+LSVG SNK MYGAGRDMEDKL+SG+GLELTIR+
Subjt:  GEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRV

Query:  NFISNYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS
        NFISNYRVVWK I PHFHR V+C L+L K YDR  HTRSFNSTC TS
Subjt:  NFISNYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS

A0A1S3CFE9 uncharacterized protein LOC103500312 (e value: 6.9e-169; identity: 86.3%)
Query:  SSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTD
        S   H    VME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN    HHH NPTQEASRFTLSHYSSSRGSNHGAGTD
Subjt:  SSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTD

Query:  NGEARLIVGCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGV
        NGEARLIVG  +GRDC+EEEE+ +G+EEGYYGK+KRGCWKRYFTYR+SDSNAWICLQLSWRAIFSMGIALLVFY+VTNPPSPIISVKV EI+EFMLGEGV
Subjt:  NGEARLIVGCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGV

Query:  DKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFIS
        DKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPR+YAESG T F LSVG SNK MYGAGR+MEDKL+SGMGLELTIR+NFIS
Subjt:  DKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFIS

Query:  NYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS
        NYRVVWK I PHFHR V+C L+LGKAYDRKRHT SFNSTC TS
Subjt:  NYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS

A0A6J1DGR2 uncharacterized protein LOC111020336 (e value: 1.7e-154; identity: 85.59%)
Query:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG-HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCS
        MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPL SDTFP G HHH N TQEASR TLS YSSSR SNHGAGTDNGEARLIVG  
Subjt:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG-HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCS

Query:  NGRDCDEEEEEED-GDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKIL
        NGR+ DEE EE+  GDEEGYYGKK+RGCWK YFTYRNSDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISVK+  +EEFMLGEGVDKTGVGTKIL
Subjt:  NGRDCDEEEEEED-GDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKIL

Query:  TCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIR
        TCN TM+V VDN+SKLFGLHILPPSLH+SFGPLPIATSQG R+YAESGTTTF+LSVG SN+ MYGAGR MED LESGMGLEL IR+NFISNYRVVWKIIR
Subjt:  TCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIR

Query:  PHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS
        PHF  RVEC LVLGK YDRKRHTRSFNSTCLTS
Subjt:  PHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS

A0A6J1F239 uncharacterized protein LOC111441481 (e value: 1.9e-158; identity: 82.93%)
Query:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG
        +M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPLPSDTFPNG   HHH NPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG
Subjt:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG

Query:  CSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKI
          +G   +E++E+ + +EE YYG+K+RGCWK YFTYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVTNPP P+ISVKV E++EFMLGEGVDKTGVGTKI
Subjt:  CSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKI

Query:  LTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKII
        LTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIATSQGPR+YAESGTTTF+L+VGIS KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKII
Subjt:  LTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS
        +P FHRRV+C LV+   YDRKRHTR FNSTCLTS
Subjt:  RPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS

A0A6J1J6W9 uncharacterized protein LOC111481909 (e value: 4.7e-157; identity: 82.63%)
Query:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG
        +M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG   HHH N TQEASRFTLSHYSSS GSNHG GTDNGEARL+VG
Subjt:  VMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNG---HHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG

Query:  CSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKI
          +G   +E+ E+ + +EE YYGKK+RGCWK YFTYRNSD+NAWICLQLSWRA+FSMG+ALLVFYIVTNPP PIISV+V E++EFMLGEGVDKTGVGTKI
Subjt:  CSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKI

Query:  LTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKII
        LTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIATSQGPR+YAESGTTTF+L+VG S KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKII
Subjt:  LTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS
        +P FHR V+C LV+  AYDRKRHTR FNSTCLTS
Subjt:  RPHFHRRVECFLVLGKAYDRKRHTRSFNSTCLTS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G45688.1 unknown protein (e value: 3.8e-10; identity: 24.53%)
Query:  YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSS--RGSNHGAGTDNGEARLIVGCSNGRDCDEEEEE---EDGD
        YYVQSPS  SH +         S+   SP+ S      H H +  + +   + S +S S   GS      D  + +   G    ++C   EEE   +DGD
Subjt:  YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSS--RGSNHGAGTDNGEARLIVGCSNGRDCDEEEEE---EDGD

Query:  EEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSM--GIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHS
         +G  G  +R                  C  L++   F +  G   L+ Y    P  P I+VK    E   +  G D  GVGT ++T N T+ ++  N  
Subjt:  EEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSM--GIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHS

Query:  KLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNK-PMYGAGRDM----------EDKLESGMGLEL----------TIRVNFISNYR
          FG+H+    + +SF  + I +    + Y    +    L   I  K P+YG+G  +          + K + G  + +           + ++F+   R
Subjt:  KLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNK-PMYGAGRDM----------EDKLESGMGLEL----------TIRVNFISNYR

Query:  --VVWKIIRPHFHRRVEC
          V+ K+++P F++++EC
Subjt:  --VVWKIIRPHFHRRVEC

AT2G41990.1 CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864) (e value: 1.5e-06; identity: 21.84%)
Query:  YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCSNGRDCDEEEEEEDGDEEGYY
        YYVQSPS      + D+   +  S C S + S T P+ ++HC+P   +   + S +S     ++       E R  +        D +++ + GD++   
Subjt:  YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCSNGRDCDEEEEEEDGDEEGYY

Query:  GKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHI
                     +RN     W+ L +    IF   +  L+ +  +    P ++VK   + +  L  G D +GV T +L+ N T+ +   N S  F +H+
Subjt:  GKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHI

Query:  LPPSLHMSFGPLPIATSQGPRM-YAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIRPHFHRRVECFLVL
            L + +  L +++ +  +     +G T     V     P+YG      D L     L L + +   S   ++ +++   F+ R+ C   L
Subjt:  LPPSLHMSFGPLPIATSQGPRM-YAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIRPHFHRRVECFLVL

AT3G08490.1 BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant protein, group 2 (TAIR:AT3G24600.1) (e value: 5.3e-52; identity: 51.28%)
Query:  KRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMS
        KR      S+S+ WI LQ+ WR +FS+G+ALLVFYI T PP P IS ++    +FML EGVD  GV TK LT NC+  +I+DN S +FGLHI PPS+   
Subjt:  KRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMS

Query:  FGPLPIATSQGPRMYAES-GTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHT
        FGPL  A +QGP++Y  S  +TTF+L +  +N+ MYGAG +M D L S  GL L +R + IS+YRVVW II P +H +VEC L+L    D++RH+
Subjt:  FGPLPIATSQGPRMYAES-GTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIRPHFHRRVECFLVLGKAYDRKRHT

AT3G24600.1 Late embryogenesis abundant protein, group 2 (e value: 2.0e-14; identity: 29.45%)
Query:  VFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKL-SVGISN
        V +  ++P SPI+SVK  +I  F  GEG+D+TGV TKIL+ N ++ V +D+ +  FG+H+   +  ++F  L +AT Q    Y    +    +  +  + 
Subjt:  VFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKL-SVGISN

Query:  KPMYGAGRDMEDKLESG---MGLELTIRVNFISNYRVVWKIIRPHFHRRVECFLVLGKAYDRK
         P+YGAG  +    + G   + LE  IR    S   ++ K+++      V C   +  +   K
Subjt:  KPMYGAGRDMEDKLESG---MGLELTIRVNFISNYRVVWKIIRPHFHRRVECFLVLGKAYDRK

AT5G42860.1 unknown protein (e value: 3.8e-10; identity: 24.13%)
Query:  AYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCSNGRDCDEEEEEEDGDEEGY
        AY+VQSPS  SH       +   +    SP+ S                     SH SSSR S        G A    G       +EE   +DGD    
Subjt:  AYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGCSNGRDCDEEEEEEDGDEEGY

Query:  YGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIAL--LVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFG
                       R  ++    C  L++   FS+  A   L+ Y    P  P ISVK    E+  +  G D  G+GT ++T N T+ ++  N    FG
Subjt:  YGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIAL--LVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFG

Query:  LHILPPSLHMSFGPLPIATSQGPRMY-AESGTTTFKLSVGISNKPMYGAGRDM----------EDKLESG---------MGLELTIRVNFISNYR--VVW
        +H+    + +SF  + I +    + Y +     T  ++V     P+YG+G  +          + K + G             + +R+NF    R  V+ 
Subjt:  LHILPPSLHMSFGPLPIATSQGPRMY-AESGTTTFKLSVGISNKPMYGAGRDM----------EDKLESG---------MGLELTIRVNFISNYR--VVW

Query:  KIIRPHFHRRVECFL
        K+++P F++R+ C +
Subjt:  KIIRPHFHRRVECFL


Sequences
CDS sequence
ATGGCCGTCCCATTATTTCTCTTTCCCATCTCAGAGAAATCATCTCTCCGCCACTCTCTCCGCCGTGTGATGGAGGCTGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTC
TTATCCATGTGCTTATTACGTACAAAGCCCCTCCACCCTCTCTCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCCTCCGCTTGCCACTCGCCCCTCCCCTCCG
ACACCTTCCCCAACGGCCACCACCACTGCAACCCGACCCAGGAAGCCTCTCGCTTCACCCTCTCCCACTACTCATCCTCCCGGGGCTCGAACCATGGGGCCGGGACCGAC
AATGGTGAGGCTCGTCTGATAGTCGGTTGCAGCAATGGTCGAGATTGCGATGAGGAGGAGGAGGAGGAGGACGGAGACGAGGAAGGGTATTATGGGAAGAAAAAAAGAGG
TTGTTGGAAGAGGTATTTTACGTATAGGAATTCGGATTCTAATGCATGGATTTGCTTGCAGTTGAGTTGGAGAGCAATTTTCAGTATGGGAATTGCTTTGCTTGTGTTTT
ACATTGTCACTAACCCTCCCTCACCAATCATTTCTGTTAAGGTGGAAGAAATAGAAGAGTTTATGCTAGGGGAAGGAGTGGACAAAACAGGGGTTGGAACTAAGATTCTA
ACATGCAATTGCACAATGAATGTAATTGTGGACAACCATTCTAAGCTTTTTGGCCTTCACATCCTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTCCCTATTGCTAC
TTCACAAGGTCCAAGAATGTATGCTGAGAGTGGAACGACGACGTTTAAATTAAGCGTGGGCATTAGCAATAAGCCAATGTACGGTGCGGGGAGGGACATGGAAGACAAGC
TTGAATCAGGGATGGGATTGGAGCTTACAATTCGAGTCAATTTCATTTCAAATTATAGAGTAGTTTGGAAAATCATAAGGCCCCACTTTCATCGTCGTGTCGAATGCTTT
TTGGTCCTCGGAAAAGCCTATGATAGGAAGCGTCACACCCGATCCTTCAATAGTACTTGCTTAACTTCTCGATGA
mRNA sequence
ATGGCCGTCCCATTATTTCTCTTTCCCATCTCAGAGAAATCATCTCTCCGCCACTCTCTCCGCCGTGTGATGGAGGCTGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTC
TTATCCATGTGCTTATTACGTACAAAGCCCCTCCACCCTCTCTCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCCTCCGCTTGCCACTCGCCCCTCCCCTCCG
ACACCTTCCCCAACGGCCACCACCACTGCAACCCGACCCAGGAAGCCTCTCGCTTCACCCTCTCCCACTACTCATCCTCCCGGGGCTCGAACCATGGGGCCGGGACCGAC
AATGGTGAGGCTCGTCTGATAGTCGGTTGCAGCAATGGTCGAGATTGCGATGAGGAGGAGGAGGAGGAGGACGGAGACGAGGAAGGGTATTATGGGAAGAAAAAAAGAGG
TTGTTGGAAGAGGTATTTTACGTATAGGAATTCGGATTCTAATGCATGGATTTGCTTGCAGTTGAGTTGGAGAGCAATTTTCAGTATGGGAATTGCTTTGCTTGTGTTTT
ACATTGTCACTAACCCTCCCTCACCAATCATTTCTGTTAAGGTGGAAGAAATAGAAGAGTTTATGCTAGGGGAAGGAGTGGACAAAACAGGGGTTGGAACTAAGATTCTA
ACATGCAATTGCACAATGAATGTAATTGTGGACAACCATTCTAAGCTTTTTGGCCTTCACATCCTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTCCCTATTGCTAC
TTCACAAGGTCCAAGAATGTATGCTGAGAGTGGAACGACGACGTTTAAATTAAGCGTGGGCATTAGCAATAAGCCAATGTACGGTGCGGGGAGGGACATGGAAGACAAGC
TTGAATCAGGGATGGGATTGGAGCTTACAATTCGAGTCAATTTCATTTCAAATTATAGAGTAGTTTGGAAAATCATAAGGCCCCACTTTCATCGTCGTGTCGAATGCTTT
TTGGTCCTCGGAAAAGCCTATGATAGGAAGCGTCACACCCGATCCTTCAATAGTACTTGCTTAACTTCTCGATGA
Protein sequence
MAVPLFLFPISEKSSLRHSLRRVMEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGHHHCNPTQEASRFTLSHYSSSRGSNHGAGTD
NGEARLIVGCSNGRDCDEEEEEEDGDEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVEEIEEFMLGEGVDKTGVGTKIL
TCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFKLSVGISNKPMYGAGRDMEDKLESGMGLELTIRVNFISNYRVVWKIIRPHFHRRVECF
LVLGKAYDRKRHTRSFNSTCLTSR