CuGenDBv2

Tan0005918 (gene) of Snake gourd v1 genome

Gene ID: Tan0005918
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Late embryogenesis abundant protein, group 2
Genome location: LG04:83053698..83059914
RNA-Seq Expression: Tan0005918
Synteny: Tan0005918
Gene Ontology terms: NA
InterPro domains: NA
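
The genome location field packs a sequence ID and two coordinates into one string. As a minimal sketch, the helper below splits that string and reports the span length. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates (an assumption)."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG04:83053698..83059914")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 6217 bp on LG04, which is longer than the mRNA listed further down and would be consistent with the locus containing introns.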


Homology
GenBank top hits (e value, % identity, alignment)
KAG6581577.1 hypothetical protein SDJN03_21579, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.2e-154, % identity: 82.53)
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPL SD FPNG  HHHHHRNPTQEASRFTLS YSSS GSNHG GTDNGEARL+VG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG

Query:  HGRDQDEKQDEDENEEE-YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILT
         G   +EK+++ E EEE YYGRKR GCWKTY TYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVT PP P+ISV+V EV+EFMLGEGVDKTGVGTKILT
Subjt:  HGRDQDEKQDEDENEEE-YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILT

Query:  CNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRP
        CNCT++VIVDN+SKLF LHILPPSLH+SFGPLPIATSQGPRLYAESGTTTF+L+VG + KPMYGAGR++ED LESG GLEL IRLNFISNYRVVWKII+P
Subjt:  CNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRP

Query:  HFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS
         FHRRV+C+LV+  AY RKRHTR  NSTCL S
Subjt:  HFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS

KAG7018082.1 hypothetical protein SDJN02_19948, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.2e-155, % identity: 83.13)
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPL SD FPNG  HHHHHRNPTQEASRFTLS YSSS GSNHG GTDNGEARL+VG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG

Query:  HGRDQDEKQDEDENEEE-YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILT
         G   +EK+++ E EEE YYGRKRRGCWKTY TYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVT PP P+ISV+V EV++FMLGEGVDKTGVGTKILT
Subjt:  HGRDQDEKQDEDENEEE-YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILT

Query:  CNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRP
        CNCT++VIVDN+SKLF LHILPPSLH+SFGPLPIATSQGPRLYAESGTTTF+L+VGI+ KPMYGAGR++ED LESG GLEL IRLNFISNYRVVWKII+P
Subjt:  CNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRP

Query:  HFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS
         FHRRV+C+LV+  AYDRKRHTR  NSTCL S
Subjt:  HFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS

XP_008461795.2 PREDICTED: uncharacterized protein LOC103500312 [Cucumis melo] (e value: 2.5e-154, % identity: 82.63)
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG
        ME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSD RNPAE S CHSPL SD FPN  HHHHHHRNPTQEASRFTLS YSSSRGSNHG GTDNGEARLIVG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG

Query:  H--GRD-QDEKQDEDENEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKI
           GRD ++E++D + NEE YYG+++RGCWK Y TYR+SDSNAWICLQLSWRAIFSMGIALLVFY+VT PPSPIISV+VGE++EFMLGEGVDKTGVGTKI
Subjt:  H--GRD-QDEKQDEDENEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKI

Query:  LTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKII
        LTCNCT++VIVDNHSKLFGLHILPPSLH+SFGPLPIATSQGPRLYAESG T F LSVG +NK MYGAGR+MED L+SG GLEL IRLNFISNYRVVWK I
Subjt:  LTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKII

Query:  RPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS
         PHFHR V+C+L+LGKAYDRKRHT S NSTC  S
Subjt:  RPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS

XP_022934269.1 uncharacterized protein LOC111441481 [Cucurbita moschata] (e value: 4.2e-154, % identity: 83.13)
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPL SD FPNG   HHHHRNPTQEASRFTLS YSSS GSNHG GTDNGEARL+VG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG

Query:  HGRDQDEKQDEDENEEE-YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILT
         G   +EKQ++ E EEE YYGRKRRGCWKTY TYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVT PP P+ISV+V EV+EFMLGEGVDKTGVGTKILT
Subjt:  HGRDQDEKQDEDENEEE-YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILT

Query:  CNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRP
        CNCT++VIVDN+SKLF LHILPPSLH+SFGPLPIATSQGPRLYAESGTTTF+L+VGI+ KPMYGAGR++ED LESG GLEL IRLNFISNYRVVWKII+P
Subjt:  CNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRP

Query:  HFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS
         FHRRV+C+LV+   YDRKRHTR  NSTCL S
Subjt:  HFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS

XP_038905771.1 uncharacterized protein LOC120091726 [Benincasa hispida] (e value: 4.8e-166, % identity: 88.36)
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIV-
        MEAAEEQEAVLFHSYPC+YYVQSPSTLSHANSSD RNPAESSACHSPL SD FPNG HHHHHHRNPTQEASRFTLS YSSSRGSNHG GTDNGE RLIV 
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIV-

Query:  -GHGRDQDEKQ--DEDENEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTK
         G+GRD +E+Q  DED +EE YYG+K+RGCWK Y TYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVT PPSPIISV+VGE+EEFMLGEGVDKTGVGTK
Subjt:  -GHGRDQDEKQ--DEDENEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTK

Query:  ILTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKI
        ILTCNCT++VIVDNHSKLFGLHILPPSLH+SFGPLPIATSQGPRLYAESGTTTF LSVG +NKPMYGAGRDMED LESG GLEL IRLNFISNYRVVWK 
Subjt:  ILTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKI

Query:  IRPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS
        IRPHFHR VEC+LVLGKAYDRKRHTRS NSTCLPS
Subjt:  IRPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LD21 Uncharacterized protein (e value: 5.0e-153, % identity: 82.04)
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG
        ME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSD RNPAE S CHSPL SD FPN   HHHHHRNPTQEASRFTLS YSSSRGSNHG GTDNGEARLIVG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG

Query:  HGRDQDEKQDEDE---NEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKI
         G   D +++E+E   NEE YYG+++RGCWK Y TYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVT PPSPII+V+VGE+EEFMLGEGVDKTGVGTKI
Subjt:  HGRDQDEKQDEDE---NEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKI

Query:  LTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKII
        LTCNCT++VIVDNHSKLFGLHILPPSLH+SFGPLPIA SQGPRLYAESG T F+LSVG +NK MYGAGRDMED L+SG GLEL IRLNFISNYRVVWK I
Subjt:  LTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKII

Query:  RPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS
         PHFHR V+C+L+L K YDR  HTRS NSTC  S
Subjt:  RPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS

A0A1S3CFE9 uncharacterized protein LOC103500312 (e value: 1.2e-154, % identity: 82.63)
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG
        ME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSD RNPAE S CHSPL SD FPN  HHHHHHRNPTQEASRFTLS YSSSRGSNHG GTDNGEARLIVG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG

Query:  H--GRD-QDEKQDEDENEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKI
           GRD ++E++D + NEE YYG+++RGCWK Y TYR+SDSNAWICLQLSWRAIFSMGIALLVFY+VT PPSPIISV+VGE++EFMLGEGVDKTGVGTKI
Subjt:  H--GRD-QDEKQDEDENEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKI

Query:  LTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKII
        LTCNCT++VIVDNHSKLFGLHILPPSLH+SFGPLPIATSQGPRLYAESG T F LSVG +NK MYGAGR+MED L+SG GLEL IRLNFISNYRVVWK I
Subjt:  LTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKII

Query:  RPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS
         PHFHR V+C+L+LGKAYDRKRHT S NSTC  S
Subjt:  RPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS

A0A6J1DGR2 uncharacterized protein LOC111020336 (e value: 3.0e-150, % identity: 83.63)
Query:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIV
        MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSD RN AESSACHSPLRSD FP GHHHHH   N TQEASR TLSRYSSSR SNHG GTDNGEARLIV
Subjt:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIV

Query:  --GHGRDQDEKQDED--ENEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGT
          G+GR+ DE+++ED   +EE YYG+KRRGCWKTY TYRNSDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISV++G VEEFMLGEGVDKTGVGT
Subjt:  --GHGRDQDEKQDED--ENEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGT

Query:  KILTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWK
        KILTCN T++V VDN+SKLFGLHILPPSLH+SFGPLPIATSQG RLYAESGTTTFQLSVG +N+ MYGAGR MEDMLESG GLEL IRLNFISNYRVVWK
Subjt:  KILTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWK

Query:  IIRPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS
        IIRPHF  RVEC LVLGK YDRKRHTRS NSTCL S
Subjt:  IIRPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS

A0A6J1F239 uncharacterized protein LOC111441481 (e value: 2.0e-154, % identity: 83.13)
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPL SD FPNG   HHHHRNPTQEASRFTLS YSSS GSNHG GTDNGEARL+VG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG

Query:  HGRDQDEKQDEDENEEE-YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILT
         G   +EKQ++ E EEE YYGRKRRGCWKTY TYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVT PP P+ISV+V EV+EFMLGEGVDKTGVGTKILT
Subjt:  HGRDQDEKQDEDENEEE-YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILT

Query:  CNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRP
        CNCT++VIVDN+SKLF LHILPPSLH+SFGPLPIATSQGPRLYAESGTTTF+L+VGI+ KPMYGAGR++ED LESG GLEL IRLNFISNYRVVWKII+P
Subjt:  CNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRP

Query:  HFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS
         FHRRV+C+LV+   YDRKRHTR  NSTCL S
Subjt:  HFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS

A0A6J1J6W9 uncharacterized protein LOC111481909 (e value: 5.2e-150, % identity: 81.63)
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPL SD FPNG    HHHRN TQEASRFTLS YSSS GSNHG GTDNGEARL+VG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVG

Query:  HGRDQDEKQDEDENEEE-YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILT
         G   +EK+++ E EEE YYG+KRRGCWKTY TYRNSD+NAWICLQLSWRA+FSMG+ALLVFYIVT PP PIISVQV EV+EFMLGEGVDKTGVGTKILT
Subjt:  HGRDQDEKQDEDENEEE-YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILT

Query:  CNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRP
        CNCT++VIVDN+SKLF LHILPPSLH+SFGPLPIATSQGPRLYAESGTTTF+L+VG + KPMYGAGR++ED LESG GLEL IRLNFISNYRVVWKII+P
Subjt:  CNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRP

Query:  HFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS
         FHR V+C+LV+  AYDRKRHTR  NSTCL S
Subjt:  HFHRRVECVLVLGKAYDRKRHTRSLNSTCLPS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G45688.1 unknown protein (e value: 1.7e-07, % identity: 24.61)
Query:  YYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVGHGRDQDEKQDEDENEEEY
        YYVQSPS  SH +         S+   SP+ S   P+ H     H   +  +SRF+ S    SR  N   G+         GHG ++  K+     EE  
Subjt:  YYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVGHGRDQDEKQDEDENEEEY

Query:  YGRKRRGCWKTYCTYRNSDSNAWI---CLQLSWRAIFSM--GIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSK
                        + D +  +   C  L++   F +  G   L+ Y   KP  P I+V+    E   +  G D  GVGT ++T N T+ ++  N   
Subjt:  YGRKRRGCWKTYCTYRNSDSNAWI---CLQLSWRAIFSM--GIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSK

Query:  LFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNK-PMYGAGRDM----------EDMLESGTGLEL----------RIRLNFISNYR-
         FG+H+    + LSF  + I +    + Y    +    L   I  K P+YG+G  +          +   + G  + +           + L+F+   R 
Subjt:  LFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNK-PMYGAGRDM----------EDMLESGTGLEL----------RIRLNFISNYR-

Query:  -VVWKIIRPHFHRRVEC
         V+ K+++P F++++EC
Subjt:  -VVWKIIRPHFHRRVEC

AT2G41990.1 CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864) (e value: 2.2e-07, % identity: 24.05)
Query:  YYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHH---HHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVGHGRDQDEKQDEDENE
        YYVQSPS      + D    +  S C S + S   P+ +H    HH   + T   S   L  Y S R           E R  +  G D+ +  D+D+  
Subjt:  YYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHH---HHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVGHGRDQDEKQDEDENE

Query:  EEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSKLF
                         +RN     W+ L +    IF   +  L+ +  +K   P ++V+   V +  L  G D +GV T +L+ N TV +   N S  F
Subjt:  EEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSKLF

Query:  GLHILPPSLHLSFGPLPIATSQGPRL-YAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRPHFHRRVECVLVLGKA
         +H+    L L +  L +++ +  +     +G T     V  +  P+YG      D L     L L + +   S   ++ +++   F+ R+ C   L   
Subjt:  GLHILPPSLHLSFGPLPIATSQGPRL-YAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRPHFHRRVECVLVLGKA

Query:  YDRKRHTRSLNSTCLP
        +  K  + SL  +C+P
Subjt:  YDRKRHTRSLNSTCLP

AT3G08490.1 BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant protein, group 2 (TAIR:AT3G24600.1) (e value: 1.2e-53, % identity: 54.55)
Query:  SDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIAT
        S+S+ WI LQ+ WR +FS+G+ALLVFYI T+PP P IS ++G   +FML EGVD  GV TK LT NC+  +I+DN S +FGLHI PPS+   FGPL  A 
Subjt:  SDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIAT

Query:  SQGPRLYAES-GTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRPHFHRRVECVLVLGKAYDRKRHT
        +QGP+LY  S  +TTFQL +   N+ MYGAG +M DML S  GL L +R + IS+YRVVW II P +H +VEC+L+L    D++RH+
Subjt:  SQGPRLYAES-GTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRPHFHRRVECVLVLGKAYDRKRHT

AT3G24600.1 Late embryogenesis abundant protein, group 2 (e value: 5.3e-14, % identity: 27.5)
Query:  VFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQL-SVGINN
        V +  + P SPI+SV+  ++  F  GEG+D+TGV TKIL+ N +V V +D+ +  FG+H+   +  L+F  L +AT Q    Y    +    +  +    
Subjt:  VFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSKLFGLHILPPSLHLSFGPLPIATSQGPRLYAESGTTTFQL-SVGINN

Query:  KPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRPHFHRRVECVLVLGKAYDRK
         P+YGAG  +    + G  + +++     S   ++ K+++      V C   +  +   K
Subjt:  KPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRPHFHRRVECVLVLGKAYDRK

AT5G42860.1 unknown protein (e value: 2.3e-09, % identity: 24.68)
Query:  AYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVGHGRDQDEKQDEDENEEE
        AY+VQSPS  SH    D    A S      L S   P G   H H       +SRF  S+ + S+   H      GE +  +    +++   D+ + E+E
Subjt:  AYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVGHGRDQDEKQDEDENEEE

Query:  YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIAL--LVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSKLF
           R+                    C  L++   FS+  A   L+ Y   KP  P ISV+    E+  +  G D  G+GT ++T N T+ ++  N    F
Subjt:  YYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIAL--LVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSKLF

Query:  GLHILPPSLHLSFGPLPIATSQGPRLY-AESGTTTFQLSVGINNKPMYGAGRDMED-------------------MLESGTGLELRIRLNFISNYR--VV
        G+H+    + LSF  + I +    + Y +     T  ++V  +  P+YG+G  +                     +        + +RLNF    R  V+
Subjt:  GLHILPPSLHLSFGPLPIATSQGPRLY-AESGTTTFQLSVGINNKPMYGAGRDMED-------------------MLESGTGLELRIRLNFISNYR--VV

Query:  WKIIRPHFHRRVECVL
         K+++P F++R+ C++
Subjt:  WKIIRPHFHRRVECVL
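
Each alignment above has a middle line in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the hypothetical helpers below count identities and positives from such a midline and turn them into a percentage. They assume the midline is passed with its whitespace intact and that the alignment length includes gap columns. Whitespace may have been altered when this page was captured, so counts taken from the text here may not reproduce the reported % identity values.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) counted from a BLASTP-style midline."""
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


def percent_identity(identities: int, alignment_length: int) -> float:
    """Identities as a percentage of all alignment columns, gaps included."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return 100.0 * identities / alignment_length


# Toy example: the first ten columns of one midline, not a full alignment.
identities, positives = midline_stats("M+AAE+QE V")
print(identities, positives, percent_identity(identities, 10))
```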


Sequences
CDS sequence
ATGGAGGCCGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTCATATCCATGTGCTTATTACGTCCAAAGCCCCTCCACCCTCTCCCACGCCAACAGCTCCGACAACCGAAA
CCCCGCCGAGTCGTCGGCCTGTCACTCGCCTCTCCGCTCCGACAACTTTCCCAATGGCCATCACCACCACCACCATCACCGCAACCCAACTCAGGAAGCCTCTCGCTTCA
CTCTCTCCCGCTACTCCTCCTCCCGTGGCTCGAACCATGGGGTCGGGACCGACAATGGCGAGGCTCGCCTGATCGTTGGTCACGGTCGTGATCAGGACGAGAAGCAGGAC
GAGGACGAAAACGAGGAAGAGTATTATGGGAGGAAGAGGAGAGGATGTTGGAAGACATATTGTACGTATAGAAATTCGGACTCTAATGCATGGATTTGCTTGCAATTGAG
TTGGAGGGCAATTTTCAGTATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAAGCCTCCCTCACCCATCATTTCTGTTCAGGTGGGAGAAGTGGAAGAATTTATGT
TAGGGGAAGGAGTGGACAAAACAGGGGTTGGAACTAAGATCCTTACATGCAATTGCACAGTGAATGTAATTGTGGACAATCACTCTAAGCTCTTCGGCCTTCACATTCTT
CCTCCATCTCTTCATTTGTCTTTTGGGCCTCTCCCCATTGCTACTTCACAAGGTCCAAGATTATATGCTGAGAGTGGAACGACGACGTTTCAATTAAGCGTCGGCATCAA
CAATAAGCCAATGTATGGTGCAGGGAGGGATATGGAAGACATGCTTGAATCAGGAACAGGATTGGAGCTCAGAATTCGACTCAATTTCATTTCCAACTATAGGGTAGTTT
GGAAAATCATAAGGCCTCACTTTCATCGTCGTGTCGAATGCGTATTGGTCCTCGGAAAAGCCTACGATAGGAAGCGTCACACCCGATCCCTCAATAGTACTTGCCTACCT
TCTTGA
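
The CDS can be checked against the protein sequence listed on this page by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code and a CDS made only of A, C, G and T. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) and drop one terminal stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # A codon containing anything other than A/C/G/T raises KeyError here.
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First nine codons of the CDS above.
print(translate("ATGGAGGCCGCCGAAGAGCAAGAAGCC"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check on the record.
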
mRNA sequence
AAGTTTAAAAGCCCTCCCATTATATGCTATTTCCCTTGTCAGAGAAATAATCTCTCCGCCACTCTCTCCGCCGCTGATGGAGGCCGCCGAAGAGCAAGAAGCCGTTCTCT
TCCACTCATATCCATGTGCTTATTACGTCCAAAGCCCCTCCACCCTCTCCCACGCCAACAGCTCCGACAACCGAAACCCCGCCGAGTCGTCGGCCTGTCACTCGCCTCTC
CGCTCCGACAACTTTCCCAATGGCCATCACCACCACCACCATCACCGCAACCCAACTCAGGAAGCCTCTCGCTTCACTCTCTCCCGCTACTCCTCCTCCCGTGGCTCGAA
CCATGGGGTCGGGACCGACAATGGCGAGGCTCGCCTGATCGTTGGTCACGGTCGTGATCAGGACGAGAAGCAGGACGAGGACGAAAACGAGGAAGAGTATTATGGGAGGA
AGAGGAGAGGATGTTGGAAGACATATTGTACGTATAGAAATTCGGACTCTAATGCATGGATTTGCTTGCAATTGAGTTGGAGGGCAATTTTCAGTATGGGAATTGCTTTG
CTTGTGTTTTACATTGTCACTAAGCCTCCCTCACCCATCATTTCTGTTCAGGTGGGAGAAGTGGAAGAATTTATGTTAGGGGAAGGAGTGGACAAAACAGGGGTTGGAAC
TAAGATCCTTACATGCAATTGCACAGTGAATGTAATTGTGGACAATCACTCTAAGCTCTTCGGCCTTCACATTCTTCCTCCATCTCTTCATTTGTCTTTTGGGCCTCTCC
CCATTGCTACTTCACAAGGTCCAAGATTATATGCTGAGAGTGGAACGACGACGTTTCAATTAAGCGTCGGCATCAACAATAAGCCAATGTATGGTGCAGGGAGGGATATG
GAAGACATGCTTGAATCAGGAACAGGATTGGAGCTCAGAATTCGACTCAATTTCATTTCCAACTATAGGGTAGTTTGGAAAATCATAAGGCCTCACTTTCATCGTCGTGT
CGAATGCGTATTGGTCCTCGGAAAAGCCTACGATAGGAAGCGTCACACCCGATCCCTCAATAGTACTTGCCTACCTTCTTGATCATCTTCAACCAACAAATTAATTAAAT
TTTGCTTCTAAATTAATGATAAAAGAAAAACAACGAG
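
The mRNA extends beyond the CDS at both ends, so the untranslated region lengths can be found by locating the CDS inside the mRNA. This sketch assumes the mRNA is the spliced transcript and contains the CDS as one contiguous substring. `utr_lengths` is a hypothetical helper name.

```python
def utr_lengths(mrna: str, cds: str) -> tuple[int, int]:
    """Return (5' UTR length, 3' UTR length) by locating the CDS in the mRNA."""
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - (start + len(cds))


# Toy example with short made-up fragments, not the full sequences.
print(utr_lengths("GCTGATGGAGTGATCATC", "ATGGAGTGA"))
```
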
Protein sequence
MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLRSDNFPNGHHHHHHHRNPTQEASRFTLSRYSSSRGSNHGVGTDNGEARLIVGHGRDQDEKQD
EDENEEEYYGRKRRGCWKTYCTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVQVGEVEEFMLGEGVDKTGVGTKILTCNCTVNVIVDNHSKLFGLHIL
PPSLHLSFGPLPIATSQGPRLYAESGTTTFQLSVGINNKPMYGAGRDMEDMLESGTGLELRIRLNFISNYRVVWKIIRPHFHRRVECVLVLGKAYDRKRHTRSLNSTCLP
S