; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS016100 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS016100
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionGroup 2, putative
Genome locationscaffold9_2:383183..386307
RNA-Seq ExpressionMS016100
SyntenyMS016100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149613.1 uncharacterized protein LOC101209149 [Cucumis sativus]9.7e-15182.99Show/hide
Query:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHH-NATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGR
        ME A E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AE S CHSPL SDTFP  HHHHH N TQEASRFTLS YSSSRGSNHGAGTDNGEARLIVGR
Subjt:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHH-NATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGREGD-EEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTK
        GNG  GD EE EE+G G+EEGYYGK++RGCWK YFTYRNSDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP I+VK+G +EEFMLGEGVDKTGVGTK
Subjt:  GNGREGD-EEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTK

Query:  ILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI
        ILTCN TMDV VDN+SKLFGLHILPPSLH+SFGPLPIA SQGPRLYAESG T F+LSVGTSN+AMYGAGR MED L+SG+GLEL IRLNFISNYRVVWK 
Subjt:  ILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI

Query:  IRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS
        I PHF   V+C L+L K YDR  HTRSFNSTC TS
Subjt:  IRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS

XP_008461795.2 PREDICTED: uncharacterized protein LOC103500312 [Cucumis melo]1.5e-15183.33Show/hide
Query:  EDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHH--NATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGRGNGR
        E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AE S CHSPL SDTFP  HHHHH  N TQEASRFTLS YSSSRGSNHGAGTDNGEARLIVGRG+GR
Subjt:  EDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHH--NATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGRGNGR

Query:  EGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCN
        + +EE EEDG G+EEGYYGK++RGCWK YFTYR+SDSNAWI LQLSWRAIFSMGIALLVFY+VT PPSP ISVK+G ++EFMLGEGVDKTGVGTKILTCN
Subjt:  EGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCN

Query:  FTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHF
         TMDV VDN+SKLFGLHILPPSLH+SFGPLPIATSQGPRLYAESG T F LSVGTSN+AMYGAGR MED L+SGMGLEL IRLNFISNYRVVWK I PHF
Subjt:  FTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHF

Query:  RHRVECSLVLEKGYDRKRHTRSFNSTCLTS
           V+C L+L K YDRKRHT SFNSTC TS
Subjt:  RHRVECSLVLEKGYDRKRHTRSFNSTCLTS

XP_022152674.1 uncharacterized protein LOC111020336 [Momordica charantia]6.7e-18498.5Show/hide
Query:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGRG
        MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHHNATQEASR TLSRYSSSR SNHGAGTDNGEARLIVGRG
Subjt:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGRG

Query:  NGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKIL
        NGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVT PPSPNISVKMGGVEEFMLGEGVDKTGVGTKIL
Subjt:  NGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKIL

Query:  TCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIR
        TCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQG RLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIR
Subjt:  TCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIR

Query:  PHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS
        PHFRHRVECSLVL KGYDRKRHTRSFNSTCLTS
Subjt:  PHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS

XP_022934269.1 uncharacterized protein LOC111441481 [Cucurbita moschata]1.2e-14379.4Show/hide
Query:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTG--HHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVG
        M  A EDQE VLFHSYPCAYYVQSPST+SHANSSD RN AESSACHSPL SDTFP G  HHHH N TQEASRFTLS YSSS GSNHG GTDNGEARL+VG
Subjt:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTG--HHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVG

Query:  RGNGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTK
         G+G E  +E+ E+   +EE YYG+KRRGCWKTYFTYRNSDSNAWI LQLSWRA+FSMG+ALLVFYIVT PP P ISVK+  V+EFMLGEGVDKTGVGTK
Subjt:  RGNGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTK

Query:  ILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI
        ILTCN TMDV VDN SKLF LHILPPSLH+SFGPLPIATSQGPRLYAESGTTTF+L+VG S + MYGAGR +ED LESG GLEL IRLNFISNYRVVWKI
Subjt:  ILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI

Query:  IRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS
        I+P F  RV+C LV++  YDRKRHTR FNSTCLTS
Subjt:  IRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS

XP_038905771.1 uncharacterized protein LOC120091726 [Benincasa hispida]6.3e-15886.01Show/hide
Query:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTG---HHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIV
        MEAA E+QEAVLFHSYPC+YYVQSPST+SHANSSDIRN AESSACHSPL SDTFP G   HHHH N TQEASRFTLS YSSSRGSNHGAGTDNGE RLIV
Subjt:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTG---HHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIV

Query:  GRGNGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGT
        GRGNGR+ +EE+E D  GDEEGYYGKK+RGCWK YFTYRNSDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISVK+G +EEFMLGEGVDKTGVGT
Subjt:  GRGNGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGT

Query:  KILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWK
        KILTCN TMDV VDN+SKLFGLHILPPSLH+SFGPLPIATSQGPRLYAESGTTTF LSVGTSN+ MYGAGR MED LESGMGLEL IRLNFISNYRVVWK
Subjt:  KILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWK

Query:  IIRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS
         IRPHF   VEC LVL K YDRKRHTRSFNSTCL S
Subjt:  IIRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS

TrEMBL top hitse value%identityAlignment
A0A0A0LD21 Uncharacterized protein4.7e-15182.99Show/hide
Query:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHH-NATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGR
        ME A E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AE S CHSPL SDTFP  HHHHH N TQEASRFTLS YSSSRGSNHGAGTDNGEARLIVGR
Subjt:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHH-NATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGREGD-EEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTK
        GNG  GD EE EE+G G+EEGYYGK++RGCWK YFTYRNSDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP I+VK+G +EEFMLGEGVDKTGVGTK
Subjt:  GNGREGD-EEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTK

Query:  ILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI
        ILTCN TMDV VDN+SKLFGLHILPPSLH+SFGPLPIA SQGPRLYAESG T F+LSVGTSN+AMYGAGR MED L+SG+GLEL IRLNFISNYRVVWK 
Subjt:  ILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI

Query:  IRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS
        I PHF   V+C L+L K YDR  HTRSFNSTC TS
Subjt:  IRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS

A0A1S3CFE9 uncharacterized protein LOC1035003127.3e-15283.33Show/hide
Query:  EDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHH--NATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGRGNGR
        E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AE S CHSPL SDTFP  HHHHH  N TQEASRFTLS YSSSRGSNHGAGTDNGEARLIVGRG+GR
Subjt:  EDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHH--NATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGRGNGR

Query:  EGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCN
        + +EE EEDG G+EEGYYGK++RGCWK YFTYR+SDSNAWI LQLSWRAIFSMGIALLVFY+VT PPSP ISVK+G ++EFMLGEGVDKTGVGTKILTCN
Subjt:  EGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCN

Query:  FTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHF
         TMDV VDN+SKLFGLHILPPSLH+SFGPLPIATSQGPRLYAESG T F LSVGTSN+AMYGAGR MED L+SGMGLEL IRLNFISNYRVVWK I PHF
Subjt:  FTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHF

Query:  RHRVECSLVLEKGYDRKRHTRSFNSTCLTS
           V+C L+L K YDRKRHT SFNSTC TS
Subjt:  RHRVECSLVLEKGYDRKRHTRSFNSTCLTS

A0A6J1DGR2 uncharacterized protein LOC1110203363.2e-18498.5Show/hide
Query:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGRG
        MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHHNATQEASR TLSRYSSSR SNHGAGTDNGEARLIVGRG
Subjt:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGRG

Query:  NGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKIL
        NGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVT PPSPNISVKMGGVEEFMLGEGVDKTGVGTKIL
Subjt:  NGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKIL

Query:  TCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIR
        TCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQG RLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIR
Subjt:  TCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIR

Query:  PHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS
        PHFRHRVECSLVL KGYDRKRHTRSFNSTCLTS
Subjt:  PHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS

A0A6J1F239 uncharacterized protein LOC1114414815.6e-14479.4Show/hide
Query:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTG--HHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVG
        M  A EDQE VLFHSYPCAYYVQSPST+SHANSSD RN AESSACHSPL SDTFP G  HHHH N TQEASRFTLS YSSS GSNHG GTDNGEARL+VG
Subjt:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTG--HHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVG

Query:  RGNGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTK
         G+G E  +E+ E+   +EE YYG+KRRGCWKTYFTYRNSDSNAWI LQLSWRA+FSMG+ALLVFYIVT PP P ISVK+  V+EFMLGEGVDKTGVGTK
Subjt:  RGNGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTK

Query:  ILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI
        ILTCN TMDV VDN SKLF LHILPPSLH+SFGPLPIATSQGPRLYAESGTTTF+L+VG S + MYGAGR +ED LESG GLEL IRLNFISNYRVVWKI
Subjt:  ILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI

Query:  IRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS
        I+P F  RV+C LV++  YDRKRHTR FNSTCLTS
Subjt:  IRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS

A0A6J1J6W9 uncharacterized protein LOC1114819091.6e-14379.1Show/hide
Query:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGH--HHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVG
        M  A EDQE VLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPL SDTFP G   HHH N TQEASRFTLS YSSS GSNHG GTDNGEARL+VG
Subjt:  MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGH--HHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVG

Query:  RGNGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTK
         G+G E   E+ E+   +EE YYGKKRRGCWKTYFTYRNSD+NAWI LQLSWRA+FSMG+ALLVFYIVT PP P ISV++  V+EFMLGEGVDKTGVGTK
Subjt:  RGNGREGDEEREEDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTK

Query:  ILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI
        ILTCN TMDV VDN SKLF LHILPPSLH+SFGPLPIATSQGPRLYAESGTTTF+L+VGTS + MYGAGR +ED LESG GLEL IRLNFISNYRVVWKI
Subjt:  ILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI

Query:  IRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS
        I+P F   V+C LV++  YDRKRHTR FNSTCLTS
Subjt:  IRPHFRHRVECSLVLEKGYDRKRHTRSFNSTCLTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41990.1 CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864)2.2e-0723.93Show/hide
Query:  YYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHH---HHHNATQEASRFT---LSRYSSSRGSNHGAGTDNGEARLIVGRGNGREGDEEREEDG
        YYVQSPS      + D+   +  S C S + S T P  +H    HH+     SRF+   L  Y S R           E R  +      +GD+  + DG
Subjt:  YYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHH---HHHNATQEASRFT---LSRYSSSRGSNHGAGTDNGEARLIVGRGNGREGDEEREEDG

Query:  AGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNN
          D++                +RN     W+LL +    IF   +  L+ +  +K   P ++VK   V +  L  G D +GV T +L+ N T+ +   N 
Subjt:  AGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNN

Query:  SKLFGLHILPPSLHISFGPLPIATSQGPRL-YAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHFRHRVECSLV
        S  F +H+    L + +  L +++ +  +     +G T     V      +YG      D L   + L +++     S   ++ +++   F  R+ CS  
Subjt:  SKLFGLHILPPSLHISFGPLPIATSQGPRL-YAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHFRHRVECSLV

Query:  LEKGY
        L+  +
Subjt:  LEKGY

AT3G08490.1 BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant protein, group 2 (TAIR:AT3G24600.1)5.8e-5342.16Show/hide
Query:  YYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHHNATQEASRFTLSRYSS-SRGSNHGAGTDNGEARLIVGRGNGREGDEEREEDGAGDEE
        Y+VQSPSTV H   S+ +         SP+RSD+ P             + F   RYS   R S +    ++ E RL+                      
Subjt:  YYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHHNATQEASRFTLSRYSS-SRGSNHGAGTDNGEARLIVGRGNGREGDEEREEDGAGDEE

Query:  GYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNNSKLFG
                           S+S+ WI+LQ+ WR +FS+G+ALLVFYI T+PP PNIS ++G   +FML EGVD  GV TK LT N +  + +DN S +FG
Subjt:  GYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNNSKLFG

Query:  LHILPPSLHISFGPLPIATSQGPRLYAES-GTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHFRHRVECSLVLEKGY
        LHI PPS+   FGPL  A +QGP+LY  S  +TTFQL + T+NRAMYGAG  M DML S  GL L++R + IS+YRVVW II P + H+VEC L+L    
Subjt:  LHILPPSLHISFGPLPIATSQGPRLYAES-GTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHFRHRVECSLVLEKGY

Query:  DRKRHT
        D++RH+
Subjt:  DRKRHT

AT3G24600.1 Late embryogenesis abundant protein, group 25.9e-1326.97Show/hide
Query:  ILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRL
        +L+ L    +FS+  ++L  +  + P SP +SVK   +  F  GEG+D+TGV TKIL+ N ++ VT+D+ +  FG+H+   +  ++F  L +AT Q    
Subjt:  ILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATSQGPRL

Query:  YAESGTTTFQLSVGTSNRA-MYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHFRHRVECSLVLEKGYDRK
        Y    +    +   T     +YGAG  +    + G  + + +     S   ++ K+++    + V CS  +      K
Subjt:  YAESGTTTFQLSVGTSNRA-MYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHFRHRVECSLVLEKGYDRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCCGCCGGCGAAGATCAAGAAGCCGTTCTCTTCCACTCATATCCATGTGCTTATTACGTCCAAAGCCCCTCCACCGTCTCCCACGCCAACAGCTCCGACATCCG
AAACGCGGCCGAGTCGTCAGCCTGCCACTCCCCTCTCCGCTCCGACACCTTTCCCACCGGCCACCACCACCACCACAACGCGACCCAGGAGGCCTCTCGCTTCACCCTCT
CCCGCTACTCCTCATCTCGCGGGTCCAACCACGGGGCCGGGACCGACAATGGCGAGGCCCGCCTGATCGTCGGTCGTGGCAACGGCCGTGAGGGGGACGAGGAGCGGGAG
GAGGACGGGGCCGGAGACGAGGAAGGGTATTATGGGAAGAAGAGGAGAGGTTGTTGGAAGACATATTTTACGTATAGGAATTCAGATTCAAATGCATGGATTTTATTGCA
GCTAAGTTGGAGGGCAATTTTCAGCATGGGAATTGCTTTGCTTGTGTTCTATATTGTCACTAAGCCTCCCTCGCCCAACATTTCTGTTAAGATGGGAGGAGTGGAAGAAT
TTATGCTGGGGGAAGGAGTGGACAAAACAGGAGTTGGAACTAAGATTCTAACATGCAATTTCACAATGGATGTAACTGTGGACAATAATTCTAAGCTCTTTGGCCTTCAC
ATTCTTCCTCCATCTCTTCACATCTCATTTGGGCCTCTCCCCATTGCAACTTCTCAAGGTCCAAGATTGTATGCAGAGAGTGGAACGACGACGTTCCAGTTAAGCGTGGG
CACAAGCAATAGGGCAATGTATGGTGCGGGGAGGAGCATGGAAGACATGCTTGAATCAGGAATGGGATTGGAGCTCATGATTCGACTCAATTTCATCTCCAATTATCGGG
TGGTTTGGAAGATTATAAGGCCCCACTTTCGTCACCGTGTCGAATGCTCATTGGTTCTCGAGAAAGGATACGATAGGAAGCGTCACACACGATCCTTCAATAGTACTTGC
TTAACTTCT
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCCGCCGGCGAAGATCAAGAAGCCGTTCTCTTCCACTCATATCCATGTGCTTATTACGTCCAAAGCCCCTCCACCGTCTCCCACGCCAACAGCTCCGACATCCG
AAACGCGGCCGAGTCGTCAGCCTGCCACTCCCCTCTCCGCTCCGACACCTTTCCCACCGGCCACCACCACCACCACAACGCGACCCAGGAGGCCTCTCGCTTCACCCTCT
CCCGCTACTCCTCATCTCGCGGGTCCAACCACGGGGCCGGGACCGACAATGGCGAGGCCCGCCTGATCGTCGGTCGTGGCAACGGCCGTGAGGGGGACGAGGAGCGGGAG
GAGGACGGGGCCGGAGACGAGGAAGGGTATTATGGGAAGAAGAGGAGAGGTTGTTGGAAGACATATTTTACGTATAGGAATTCAGATTCAAATGCATGGATTTTATTGCA
GCTAAGTTGGAGGGCAATTTTCAGCATGGGAATTGCTTTGCTTGTGTTCTATATTGTCACTAAGCCTCCCTCGCCCAACATTTCTGTTAAGATGGGAGGAGTGGAAGAAT
TTATGCTGGGGGAAGGAGTGGACAAAACAGGAGTTGGAACTAAGATTCTAACATGCAATTTCACAATGGATGTAACTGTGGACAATAATTCTAAGCTCTTTGGCCTTCAC
ATTCTTCCTCCATCTCTTCACATCTCATTTGGGCCTCTCCCCATTGCAACTTCTCAAGGTCCAAGATTGTATGCAGAGAGTGGAACGACGACGTTCCAGTTAAGCGTGGG
CACAAGCAATAGGGCAATGTATGGTGCGGGGAGGAGCATGGAAGACATGCTTGAATCAGGAATGGGATTGGAGCTCATGATTCGACTCAATTTCATCTCCAATTATCGGG
TGGTTTGGAAGATTATAAGGCCCCACTTTCGTCACCGTGTCGAATGCTCATTGGTTCTCGAGAAAGGATACGATAGGAAGCGTCACACACGATCCTTCAATAGTACTTGC
TTAACTTCT
Protein sequenceShow/hide protein sequence
MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHHHHNATQEASRFTLSRYSSSRGSNHGAGTDNGEARLIVGRGNGREGDEERE
EDGAGDEEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTKPPSPNISVKMGGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNNSKLFGLH
ILPPSLHISFGPLPIATSQGPRLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKIIRPHFRHRVECSLVLEKGYDRKRHTRSFNSTC
LTS