; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019421 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019421
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionLate embryogenesis abundant protein, group 2
Genome locationChr04:21697118..21700585
RNA-Seq ExpressionHG10019421
SyntenyHG10019421
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149613.1 uncharacterized protein LOC101209149 [Cucumis sativus]1.4e-17090.39Show/hide
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV-HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
        ME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN  HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV-HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG

Query:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKIL
        NG DC EE+EE+G+G EEGYYGK+KRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPII+VKVGEIEEFMLGEGVDKTGVGTKIL
Subjt:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKIL

Query:  TCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR
        TCNCTMDVIVDN+SKLFGLHILPPSLHMSFGPLPIA SQGPRLYAESG T F+LSVG SNK MYGAGRDMEDKL+SG GLELTIR+NFISNYRVVWK I 
Subjt:  TCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR

Query:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        PHFHR VQCLL+L K YDR  HTRSFNSTC TS
Subjt:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

XP_008461795.2 PREDICTED: uncharacterized protein LOC103500312 [Cucumis melo]5.9e-17290.42Show/hide
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV--HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
        ME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN   HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV--HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKI
        G+GRDC EE+EEDG+G EEGYYGK+KRGCWKRYFTYR+SDSNAWICLQLSWRAIFSMGIALLVFY+VTNPPSPIISVKVGEI+EFMLGEGVDKTGVGTKI
Subjt:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKI

Query:  LTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII
        LTCNCTMDVIVDN+SKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESG T F LSVG SNK MYGAGR+MEDKL+SG GLELTIR+NFISNYRVVWK I
Subjt:  LTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
         PHFHR VQCLL+LGKAYDRKRHT SFNSTC TS
Subjt:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

XP_022152674.1 uncharacterized protein LOC111020336 [Momordica charantia]2.3e-16087.09Show/hide
Query:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
        MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPL SDTFP  HHHH N TQEASR TLS YSSSR SNHGAGTDNGEARLIVGRG
Subjt:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG

Query:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKIL
        NGR+ DEE+EEDG G EEGYYGKK+RGCWK YFTYRNSDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISVK+G +EEFMLGEGVDKTGVGTKIL
Subjt:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKIL

Query:  TCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR
        TCN TMDV VDNNSKLFGLHILPPSLH+SFGPLPIATSQG RLYAESGTTTFQLSVG SN+ MYGAGR MED LESG GLEL IR+NFISNYRVVWKIIR
Subjt:  TCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR

Query:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        PHF  RV+C L+LGK YDRKRHTRSFNSTCLTS
Subjt:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

XP_022934269.1 uncharacterized protein LOC111441481 [Cucurbita moschata]4.4e-15984.43Show/hide
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN--VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPLPSDTFPN   HHHHRNPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG 
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN--VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKI
        G+G    EE++E  +  EE YYG+K+RGCWK YFTYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVTNPP P+ISVKV E++EFMLGEGVDKTGVGTKI
Subjt:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKI

Query:  LTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII
        LTCNCTMDVIVDN SKLF LHILPPSLHMSFGPLPIATSQGPRLYAESGTTTF+L+VGIS KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKII
Subjt:  LTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        +P FHRRV CLL++   YDRKRHTR FNSTCLTS
Subjt:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

XP_038905771.1 uncharacterized protein LOC120091726 [Benincasa hispida]1.3e-17994.03Show/hide
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN---VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG
        MEAAEEQEAVLFHSYPC+YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN    HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGE RLIVG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN---VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG

Query:  RGNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTK
        RGNGRDC+EEQE D DG EEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTK
Subjt:  RGNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTK

Query:  ILTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKI
        ILTCNCTMDVIVDN+SKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTF LSVG SNKPMYGAGRDMEDKLESG GLELTIR+NFISNYRVVWK 
Subjt:  ILTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKI

Query:  IRPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        IRPHFHR V+CLL+LGKAYDRKRHTRSFNSTCL S
Subjt:  IRPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

TrEMBL top hitse value%identityAlignment
A0A0A0LD21 Uncharacterized protein7.0e-17190.39Show/hide
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV-HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
        ME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN  HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV-HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG

Query:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKIL
        NG DC EE+EE+G+G EEGYYGK+KRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPII+VKVGEIEEFMLGEGVDKTGVGTKIL
Subjt:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKIL

Query:  TCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR
        TCNCTMDVIVDN+SKLFGLHILPPSLHMSFGPLPIA SQGPRLYAESG T F+LSVG SNK MYGAGRDMEDKL+SG GLELTIR+NFISNYRVVWK I 
Subjt:  TCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR

Query:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        PHFHR VQCLL+L K YDR  HTRSFNSTC TS
Subjt:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

A0A1S3CFE9 uncharacterized protein LOC1035003122.8e-17290.42Show/hide
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV--HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
        ME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN   HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV--HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKI
        G+GRDC EE+EEDG+G EEGYYGK+KRGCWKRYFTYR+SDSNAWICLQLSWRAIFSMGIALLVFY+VTNPPSPIISVKVGEI+EFMLGEGVDKTGVGTKI
Subjt:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKI

Query:  LTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII
        LTCNCTMDVIVDN+SKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESG T F LSVG SNK MYGAGR+MEDKL+SG GLELTIR+NFISNYRVVWK I
Subjt:  LTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
         PHFHR VQCLL+LGKAYDRKRHT SFNSTC TS
Subjt:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

A0A6J1DGR2 uncharacterized protein LOC1110203361.1e-16087.09Show/hide
Query:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
        MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPL SDTFP  HHHH N TQEASR TLS YSSSR SNHGAGTDNGEARLIVGRG
Subjt:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG

Query:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKIL
        NGR+ DEE+EEDG G EEGYYGKK+RGCWK YFTYRNSDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISVK+G +EEFMLGEGVDKTGVGTKIL
Subjt:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKIL

Query:  TCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR
        TCN TMDV VDNNSKLFGLHILPPSLH+SFGPLPIATSQG RLYAESGTTTFQLSVG SN+ MYGAGR MED LESG GLEL IR+NFISNYRVVWKIIR
Subjt:  TCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR

Query:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        PHF  RV+C L+LGK YDRKRHTRSFNSTCLTS
Subjt:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

A0A6J1F239 uncharacterized protein LOC1114414812.1e-15984.43Show/hide
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN--VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPLPSDTFPN   HHHHRNPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG 
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN--VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKI
        G+G    EE++E  +  EE YYG+K+RGCWK YFTYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVTNPP P+ISVKV E++EFMLGEGVDKTGVGTKI
Subjt:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKI

Query:  LTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII
        LTCNCTMDVIVDN SKLF LHILPPSLHMSFGPLPIATSQGPRLYAESGTTTF+L+VGIS KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKII
Subjt:  LTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        +P FHRRV CLL++   YDRKRHTR FNSTCLTS
Subjt:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

A0A6J1J6W9 uncharacterized protein LOC1114819094.4e-15783.83Show/hide
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVH--HHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN    HHHRN TQEASRFTLSHYSSS GSNHG GTDNGEARL+VG 
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVH--HHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKI
        G+G    EE+ E  +  EE YYGKK+RGCWK YFTYRNSD+NAWICLQLSWRA+FSMG+ALLVFYIVTNPP PIISV+V E++EFMLGEGVDKTGVGTKI
Subjt:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKI

Query:  LTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII
        LTCNCTMDVIVDN SKLF LHILPPSLHMSFGPLPIATSQGPRLYAESGTTTF+L+VG S KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKII
Subjt:  LTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        +P FHR V CLL++  AYDRKRHTR FNSTCLTS
Subjt:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45688.1 unknown protein2.6e-0823.66Show/hide
Query:  YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGNGRDCDEEQEED--GDGGE
        YYVQSPS  SH +         S+   SP+ S    +      +    +SRF+ S    SR  N     D  + +   G    ++C   +EE    DG  
Subjt:  YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGNGRDCDEEQEED--GDGGE

Query:  EGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSM--GIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSK
        +G  G  +R                  C  L++   F +  G   L+ Y    P  P I+VK    E   +  G D  GVGT ++T N T+ ++  N   
Subjt:  EGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSM--GIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSK

Query:  LFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNK-PMYGAGRDM----------EDKLESGTGLEL----------TIRVNFISNYR-
         FG+H+    + +SF  + I +    + Y    +    L   I  K P+YG+G  +          + K + G  + +           + ++F+   R 
Subjt:  LFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNK-PMYGAGRDM----------EDKLESGTGLEL----------TIRVNFISNYR-

Query:  -VVWKIIRPHFHRRVQC
         V+ K+++P F+++++C
Subjt:  -VVWKIIRPHFHRRVQC

AT1G45688.2 unknown protein1.3e-0426.13Show/hide
Query:  YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGNGRDCDEEQEED--GDGGE
        YYVQSPS  SH +         S+   SP+ S    +      +    +SRF+ S    SR  N     D  + +   G    ++C   +EE    DG  
Subjt:  YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGNGRDCDEEQEED--GDGGE

Query:  EGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSM--GIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSK
        +G  G  +R                  C  L++   F +  G   L+ Y    P  P I+VK    E   +  G D  GVGT ++T N T+ ++  N   
Subjt:  EGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSM--GIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSK

Query:  LFGLHILPPSLHMSFGPLPIAT
         FG+H+    + +SF  + I +
Subjt:  LFGLHILPPSLHMSFGPLPIAT

AT3G08490.1 BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant protein, group 2 (TAIR:AT3G24600.1)3.1e-5452.82Show/hide
Query:  KRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSKLFGLHILPPSLHMS
        KR      S+S+ WI LQ+ WR +FS+G+ALLVFYI T PP P IS ++G   +FML EGVD  GV TK LT NC+  +I+DN S +FGLHI PPS+   
Subjt:  KRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSKLFGLHILPPSLHMS

Query:  FGPLPIATSQGPRLYAES-GTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIRPHFHRRVQCLLILGKAYDRKRHT
        FGPL  A +QGP+LY  S  +TTFQL +  +N+ MYGAG +M D L S  GL L +R + IS+YRVVW II P +H +V+CLL+L    D++RH+
Subjt:  FGPLPIATSQGPRLYAES-GTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIRPHFHRRVQCLLILGKAYDRKRHT

AT3G24600.1 Late embryogenesis abundant protein, group 21.8e-1429.45Show/hide
Query:  VFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQL-SVGISN
        V +  ++P SPI+SVK  +I  F  GEG+D+TGV TKIL+ N ++ V +D+ +  FG+H+   +  ++F  L +AT Q    Y    +    +  +  + 
Subjt:  VFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPRLYAESGTTTFQL-SVGISN

Query:  KPMYGAGRDMEDKLESG---TGLELTIRVNFISNYRVVWKIIRPHFHRRVQCLLILGKAYDRK
         P+YGAG  +    + G     LE  IR    S   ++ K+++      V C   +  +   K
Subjt:  KPMYGAGRDMEDKLESG---TGLELTIRVNFISNYRVVWKIIRPHFHRRVQCLLILGKAYDRK

AT5G42860.1 unknown protein8.5e-1225.65Show/hide
Query:  CLQLSWRAIFSMGIAL--LVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPR
        C  L++   FS+  A   L+ Y    P  P ISVK    E+  +  G D  G+GT ++T N T+ ++  N    FG+H+    + +SF  + I +    +
Subjt:  CLQLSWRAIFSMGIAL--LVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSKLFGLHILPPSLHMSFGPLPIATSQGPR

Query:  LY-AESGTTTFQLSVGISNKPMYGAGRDM----------EDKLESG---------TGLELTIRVNFISNYR--VVWKIIRPHFHRRVQCLL
         Y +     T  ++V     P+YG+G  +          + K + G             + +R+NF    R  V+ K+++P F++R+ CL+
Subjt:  LY-AESGTTTFQLSVGISNKPMYGAGRDM----------EDKLESG---------TGLELTIRVNFISNYR--VVWKIIRPHFHRRVQCLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCTGCCGAGGAGCAAGAAGCCGTTCTCTTCCACTCCTATCCATGTGCTTATTACGTACAAAGCCCCTCTACCCTCTCCCACGCCAACAGCTCCGACATCCGAAA
CCCCGCCGAGTCCTCGGCTTGCCACTCGCCTCTTCCCTCAGACACTTTTCCCAACGTCCACCACCACCACCGCAACCCGACTCAAGAAGCCTCTCGCTTCACTCTCTCCC
ACTATTCATCCTCCCGTGGCTCAAACCATGGGGCCGGGACCGACAATGGCGAGGCTCGCTTGATAGTTGGTCGTGGCAATGGTCGAGATTGTGACGAGGAGCAGGAGGAG
GATGGGGACGGAGGCGAGGAAGGGTATTATGGGAAGAAAAAAAGAGGTTGTTGGAAGAGGTATTTTACGTATAGGAATTCGGATTCTAATGCATGGATTTGCTTGCAGTT
GAGTTGGAGGGCAATTTTCAGTATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAACCCTCCTTCACCAATCATTTCTGTTAAGGTGGGAGAAATAGAAGAGTTCA
TGCTAGGGGAAGGAGTGGACAAAACAGGGGTTGGAACTAAGATTCTAACATGCAATTGCACAATGGATGTAATTGTGGATAACAATTCTAAGCTTTTTGGCCTTCACATT
CTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTTCCTATTGCAACTTCACAAGGTCCAAGATTGTATGCTGAGAGTGGAACGACGACGTTTCAATTAAGCGTGGGCAT
TAGCAACAAGCCGATGTACGGTGCGGGAAGGGACATGGAAGACAAGCTTGAATCAGGAACGGGATTGGAGCTTACAATTCGAGTCAATTTCATTTCAAATTATAGAGTAG
TTTGGAAAATCATAAGGCCCCACTTTCATCGTCGTGTCCAATGCTTATTGATCCTTGGAAAAGCCTACGATAGGAAGCGTCACACCCGATCCTTCAATAGTACTTGCTTA
ACTTCTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCTGCCGAGGAGCAAGAAGCCGTTCTCTTCCACTCCTATCCATGTGCTTATTACGTACAAAGCCCCTCTACCCTCTCCCACGCCAACAGCTCCGACATCCGAAA
CCCCGCCGAGTCCTCGGCTTGCCACTCGCCTCTTCCCTCAGACACTTTTCCCAACGTCCACCACCACCACCGCAACCCGACTCAAGAAGCCTCTCGCTTCACTCTCTCCC
ACTATTCATCCTCCCGTGGCTCAAACCATGGGGCCGGGACCGACAATGGCGAGGCTCGCTTGATAGTTGGTCGTGGCAATGGTCGAGATTGTGACGAGGAGCAGGAGGAG
GATGGGGACGGAGGCGAGGAAGGGTATTATGGGAAGAAAAAAAGAGGTTGTTGGAAGAGGTATTTTACGTATAGGAATTCGGATTCTAATGCATGGATTTGCTTGCAGTT
GAGTTGGAGGGCAATTTTCAGTATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAACCCTCCTTCACCAATCATTTCTGTTAAGGTGGGAGAAATAGAAGAGTTCA
TGCTAGGGGAAGGAGTGGACAAAACAGGGGTTGGAACTAAGATTCTAACATGCAATTGCACAATGGATGTAATTGTGGATAACAATTCTAAGCTTTTTGGCCTTCACATT
CTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTTCCTATTGCAACTTCACAAGGTCCAAGATTGTATGCTGAGAGTGGAACGACGACGTTTCAATTAAGCGTGGGCAT
TAGCAACAAGCCGATGTACGGTGCGGGAAGGGACATGGAAGACAAGCTTGAATCAGGAACGGGATTGGAGCTTACAATTCGAGTCAATTTCATTTCAAATTATAGAGTAG
TTTGGAAAATCATAAGGCCCCACTTTCATCGTCGTGTCCAATGCTTATTGATCCTTGGAAAAGCCTACGATAGGAAGCGTCACACCCGATCCTTCAATAGTACTTGCTTA
ACTTCTTCATGA
Protein sequenceShow/hide protein sequence
MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGNGRDCDEEQEE
DGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNNSKLFGLHI
LPPSLHMSFGPLPIATSQGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIRPHFHRRVQCLLILGKAYDRKRHTRSFNSTCL
TSS