; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g1502 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g1502
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF506)
Genome locationMC06:22338260..22344956
RNA-Seq ExpressionMC06g1502
SyntenyMC06g1502
Gene Ontology termsNA
InterPro domainsIPR006502 - Protein of unknown function PDDEXK-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571445.1 hypothetical protein SDJN03_28173, partial [Cucurbita argyrosperma subsp. sororia]3.74e-22083.2Show/hide
Query:  MKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSD-CFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGRSRC
        MKIQPIDFD+AEEAAR ELVKP VKS KLKRLFE+QF NVLRNSAEKANFEE NVNKDSSD   S LEPSS+CLA MVQNFIEDNNEKQFSASRCGRSRC
Subjt:  MKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSD-CFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGRSRC

Query:  NCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPSYP
        NCFNGNYTDSSEEELD H GFGD+ FSSGGEAWELLKSLIPCT+VHERNLLADTARIVEKNKVCKRKDNLARQ+VTNGLLALGYDASICKSHWEKSP++P
Subjt:  NCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPSYP

Query:  AGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSLSI
        AGDYEY+DVII+GERLLID+DFRSEFEIARSTKSY+ ILQL+P+I+VG P RLQRIVS++SEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSLS 
Subjt:  AGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSLSI

Query:  LGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED
        LGP+ ESK  L N Q+  R  AEKSVD NE G V+             VKEWKPPE+KPKSSS+GARNLKIVTGLASVIED
Subjt:  LGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED

XP_022145977.1 uncharacterized protein LOC111015299 [Momordica charantia]1.18e-277100Show/hide
Query:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR
        MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR
Subjt:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR

Query:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP
        SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP
Subjt:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP

Query:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS
        SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS
Subjt:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS

Query:  LSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP
        LSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP
Subjt:  LSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP

XP_022931152.1 uncharacterized protein LOC111437420 [Cucurbita moschata]2.07e-22082.12Show/hide
Query:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR
        MPFQMKIQPIDFD+ +EAAR ELVKPA KS KLKRLFE+QF NVLRNSAEK NFEEL+VNK+SSD FS LEPSSICLA MVQNF+EDNNEKQFS SRCGR
Subjt:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR

Query:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP
        SRCNCFNGNYTDSS+EE D   GFGD+KFSSGGEAWELLKSLIPC+SVHERN+LADTARIVEKNKVCKRKDNLAR+IVTNGLLALGYDASICKSHWEKSP
Subjt:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP

Query:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS
        SYPAGDYEYVDVIIEGERLLID+DFRSEFEIARSTKSY++ILQL+P+I+VGK  RLQRIVS+VSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH RASS
Subjt:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS

Query:  LSILGPNSESKQPLENFQLEPRQV--AEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED
        L+ILG N ESK PLEN Q++P      EKS D NEF E               VKEWKPPE+KPKSSS+G RNLKIVTGLASVIED
Subjt:  LSILGPNSESKQPLENFQLEPRQV--AEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED

XP_022963817.1 uncharacterized protein LOC111464003, partial [Cucurbita moschata]4.58e-22083.03Show/hide
Query:  FQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSD-CFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGRS
        FQMKIQPIDFD+AEEAAR ELVKP VKS KLKRLFE+QF NVLRNSAEKANFEE NVNKDSSD   S LEPSS+CLA MVQNFIEDNNEKQFSASRCGRS
Subjt:  FQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSD-CFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGRS

Query:  RCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPS
        RCNCFNGNYTDSSEEELD H GFGD+ FSSGGEAWELLKSLIPCT+VHERNLLADTARIVEKNKVCKRKDNLARQ+VTNGLLALGYDA ICKSHWEKSP+
Subjt:  RCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPS

Query:  YPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSL
        +PAGDYEY+DVII GERLLID+DFRSEFEIARSTKSY+ ILQL+P+I+VG P RLQRIVS++SEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSL
Subjt:  YPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSL

Query:  SILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED
        S LGP+ ESK  L N Q+  R  AEKSVD NE G V+             VKEWKPPE+KPKSSS+GARNLKIVTGLASVIED
Subjt:  SILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED

XP_038888024.1 uncharacterized protein LOC120077958 [Benincasa hispida]6.28e-22483.9Show/hide
Query:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSD-CFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCG
        MPFQMKIQPIDFD+ EEAAR ELVKP VKS KLKRLFE+QF NVLRNSAEKANFEELN NKDSSD   S LEPSSICLA MVQNFIED+NEKQFSASRCG
Subjt:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSD-CFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCG

Query:  RSRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKS
        RSRCNCFNGNYTDSSEE++D H GFGD+ FSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLAR+IVTNGLLALGYD+SICKSHWEKS
Subjt:  RSRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKS

Query:  PSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRAS
        P+YPAGDYEYVDVIIEGERLLID+DFRSEFEIARSTKSY++ILQLVP+I+VG P RLQRIVS+VSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRAS
Subjt:  PSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRAS

Query:  SLSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED
        SLSILGP+ ESK  LEN Q       EK VDRN  GE+           E  VKEWKPPE+KPKSSS+GARNLKIVTGLASVIED
Subjt:  SLSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED

TrEMBL top hitse value%identityAlignment
A0A1S4DYH4 uncharacterized protein LOC1034912597.13e-21982.81Show/hide
Query:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR
        MPFQMKIQPIDFD+ EEAAR ELVKP VKS KLKRLFE+QF NVLRNSAEKANFEELN NKDSSD  + LEPSS+CLA MVQNFIEDNNEKQFSASRCGR
Subjt:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR

Query:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP
        SRCNCFNGN TDSSEE+LDSHG FGD+ FSSGGEAWELLKSL+PCT+VHERNLLADTARIVEKNKVCKRKDNLAR+IVTNGLLALGYDASICKSHWEKSP
Subjt:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP

Query:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS
        +YPAGDYEY+DVIIEGERLLID+D RSEFEIARSTKSY++ILQL+P+I+VG P RLQRIVS+VSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS
Subjt:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS

Query:  LSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED
        LS  GP+ ESK  +EN        AEKSVDRN  GE++     T+ V    VKEWKPPE+KPKSSS+GARNLKIVTGLASVIED
Subjt:  LSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED

A0A6J1CY78 uncharacterized protein LOC1110152995.70e-278100Show/hide
Query:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR
        MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR
Subjt:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR

Query:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP
        SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP
Subjt:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP

Query:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS
        SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS
Subjt:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS

Query:  LSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP
        LSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP
Subjt:  LSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP

A0A6J1ESR4 uncharacterized protein LOC1114374201.00e-22082.12Show/hide
Query:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR
        MPFQMKIQPIDFD+ +EAAR ELVKPA KS KLKRLFE+QF NVLRNSAEK NFEEL+VNK+SSD FS LEPSSICLA MVQNF+EDNNEKQFS SRCGR
Subjt:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR

Query:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP
        SRCNCFNGNYTDSS+EE D   GFGD+KFSSGGEAWELLKSLIPC+SVHERN+LADTARIVEKNKVCKRKDNLAR+IVTNGLLALGYDASICKSHWEKSP
Subjt:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP

Query:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS
        SYPAGDYEYVDVIIEGERLLID+DFRSEFEIARSTKSY++ILQL+P+I+VGK  RLQRIVS+VSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH RASS
Subjt:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS

Query:  LSILGPNSESKQPLENFQLEPRQV--AEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED
        L+ILG N ESK PLEN Q++P      EKS D NEF E               VKEWKPPE+KPKSSS+G RNLKIVTGLASVIED
Subjt:  LSILGPNSESKQPLENFQLEPRQV--AEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED

A0A6J1HG87 uncharacterized protein LOC1114640032.22e-22083.03Show/hide
Query:  FQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSD-CFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGRS
        FQMKIQPIDFD+AEEAAR ELVKP VKS KLKRLFE+QF NVLRNSAEKANFEE NVNKDSSD   S LEPSS+CLA MVQNFIEDNNEKQFSASRCGRS
Subjt:  FQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSD-CFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGRS

Query:  RCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPS
        RCNCFNGNYTDSSEEELD H GFGD+ FSSGGEAWELLKSLIPCT+VHERNLLADTARIVEKNKVCKRKDNLARQ+VTNGLLALGYDA ICKSHWEKSP+
Subjt:  RCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPS

Query:  YPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSL
        +PAGDYEY+DVII GERLLID+DFRSEFEIARSTKSY+ ILQL+P+I+VG P RLQRIVS++SEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSL
Subjt:  YPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSL

Query:  SILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED
        S LGP+ ESK  L N Q+  R  AEKSVD NE G V+             VKEWKPPE+KPKSSS+GARNLKIVTGLASVIED
Subjt:  SILGPNSESKQPLENFQLEPRQVAEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED

A0A6J1K2U7 uncharacterized protein LOC1114912067.00e-21981.65Show/hide
Query:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR
        MPFQMKIQPIDFD+ +EAAR ELVKPA KS KLKRLFE+QF NVLRNSAEK NFE+L+VNK+SSD FS LEPSSICLA MVQNFIEDNNEKQF  SRCGR
Subjt:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGR

Query:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP
        +RCNCFNGNYTDSS+EE D   GFGD+KFSSGGEAWELLKSLIPC+SVHERN+LADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP
Subjt:  SRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSP

Query:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS
        SYPAGDYEYVDVIIEGERLLID+DFRSEFEIARSTKSY++ILQL+P+I+VGK  RLQRIVS+VSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH RASS
Subjt:  SYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS

Query:  LSILGP-NSESKQPLENFQLEPRQV--AEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED
        L+ILG  N ESK PLEN Q++P      EKS+D NEF E+              VKEWKPPE+KPKSSS+G RNLKIVTGLASVIED
Subjt:  LSILGP-NSESKQPLENFQLEPRQV--AEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38820.1 Protein of unknown function (DUF506)1.3e-7550Show/hide
Query:  MPFQMKIQPID-FDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEK--ANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNN--EKQFSA
        MP  MKIQPID  D +EE    E ++   K S+LKRLFE+QF N  +N +EK   +  E  +++ +S  F   EPSS+CLAKMV NF+EDNN  EKQ   
Subjt:  MPFQMKIQPID-FDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEK--ANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNN--EKQFSA

Query:  SRCGRSRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSH
         RCGRSRCNCF+G+ T+SS++E +           S GEA E+LKSL+ C S+  RNLL D  +I E +                      YDA++CKS 
Subjt:  SRCGRSRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSH

Query:  WEKSPSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH
        WEKSPS PAG+YEYVDVI++GERLLID+DF+S+FEIAR+TK+Y+++LQ +P+I+VGK  RLQ+I+ ++ +AAKQSLKKKG+ VPPWR+AEYVK+KWLS H
Subjt:  WEKSPSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH

Query:  IRASSLSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFG
        +R    S    N E KQ       E  +V  +SV    FG
Subjt:  IRASSLSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFG

AT2G38820.2 Protein of unknown function (DUF506)1.4e-8553.24Show/hide
Query:  MPFQMKIQPID-FDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEK--ANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNN--EKQFSA
        MP  MKIQPID  D +EE    E ++   K S+LKRLFE+QF N  +N +EK   +  E  +++ +S  F   EPSS+CLAKMV NF+EDNN  EKQ   
Subjt:  MPFQMKIQPID-FDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEK--ANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNN--EKQFSA

Query:  SRCGRSRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSH
         RCGRSRCNCF+G+ T+SS++E +           S GEA E+LKSL+ C S+  RNLL D  +I E +K CK KD    + V NGL++LGYDA++CKS 
Subjt:  SRCGRSRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSH

Query:  WEKSPSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH
        WEKSPS PAG+YEYVDVI++GERLLID+DF+S+FEIAR+TK+Y+++LQ +P+I+VGK  RLQ+I+ ++ +AAKQSLKKKG+ VPPWR+AEYVK+KWLS H
Subjt:  WEKSPSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH

Query:  IRASSLSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFG
        +R    S    N E KQ       E  +V  +SV    FG
Subjt:  IRASSLSILGPNSESKQPLENFQLEPRQVAEKSVDRNEFG

AT3G22970.1 Protein of unknown function (DUF506)1.3e-9651.91Show/hide
Query:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVL---EPSSICLAKMVQNFIEDNNEKQFSASR
        MPF MKIQPID DS+   AR E     V  S+LKRLF++ F NVLRNS      +   V      C  V+   EPSS+CLAKMVQNFIE+NNEKQ   ++
Subjt:  MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVL---EPSSICLAKMVQNFIEDNNEKQFSASR

Query:  CGRSRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWE
        CGR+RCNCFNGN   SS++E D  GG  D     G +A + LKSLIPCT+V ERNLLAD A+IV+KNK  KRKD++ ++IV  GLL+L Y++SICKS W+
Subjt:  CGRSRCNCFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWE

Query:  KSPSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIR
        KSPS+PAG+YEY+DVII  ERL+IDVDFRSEF+IAR T  Y+ +LQ +PFI+VGK  RL +IV ++SEAAKQSLKKKGMP PPWRKAEY+++KWLS + R
Subjt:  KSPSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIR

Query:  ASSLSILGPNSESKQPLENFQLEPRQVAEKSVDRNE----FGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP
        AS + +     E+    +    +     EK VD  E    F E  L P V +              V+        R +K VTGLAS+ +++P
Subjt:  ASSLSILGPNSESKQPLENFQLEPRQVAEKSVDRNE----FGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP

AT3G22970.2 Protein of unknown function (DUF506)1.0e-6151.38Show/hide
Query:  LLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKS
        ++ SLIPCT+V ERNLLAD A+IV+KNK  KRKD++ ++IV  GLL+L Y++SICKS W+KSPS+PAG+YEY+DVII  ERL+IDVDFRSEF+IAR T  
Subjt:  LLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKS

Query:  YRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSLSILGPNSESKQPLENFQLEPRQVAEKSVDRNE----
        Y+ +LQ +PFI+VGK  RL +IV ++SEAAKQSLKKKGMP PPWRKAEY+++KWLS + RAS + +     E+    +    +     EK VD  E    
Subjt:  YRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSLSILGPNSESKQPLENFQLEPRQVAEKSVDRNE----

Query:  FGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP
        F E  L P V +              V+        R +K VTGLAS+ +++P
Subjt:  FGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP

AT4G14620.1 Protein of unknown function (DUF506)2.2e-7245.24Show/hide
Query:  MKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGRSRCN
        MKIQPI+ D    A R+E     V  S+LKRL ++ F  +       +N E+L ++ D     +  EPS   LAKMVQN++E+NN+KQ    R    RCN
Subjt:  MKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGRSRCN

Query:  CFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPSYPA
        CFNGN  D S++ELD    F D         ++  KSLI C S  E++LL +  +I+EKNK  KRKD L R+IV + L +LGYD+SICKS W+K+ S PA
Subjt:  CFNGNYTDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPSYPA

Query:  GDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSLSIL
        G+YEY+DVI+ GERL+ID+DFRSEFEIAR T  Y+ +LQ +P I+VGK  R+++IVS+VSEA+KQSLKKKGM  PPWRKA+Y++AKWLS + R       
Subjt:  GDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSLSIL

Query:  GPNSESKQPLENFQLEPRQVAEKSVDRNE----FGEVDL-----GPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED
          NS  K+P        + VAE  +D +E    F E  L      P  ++G ++D V E     VK        +  K+VTGLA + ++
Subjt:  GPNSESKQPLENFQLEPRQVAEKSVDRNE----FGEVDL-----GPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTTCAAATGAAGATCCAGCCAATCGATTTCGACAGTGCCGAAGAAGCAGCTCGGTTAGAGCTGGTTAAGCCGGCCGTGAAGTCGTCGAAGTTGAAGCGACTATT
TGAAAAACAGTTCCATAACGTACTGAGGAATTCGGCAGAGAAGGCAAACTTTGAAGAATTGAATGTTAACAAAGACAGCTCCGACTGCTTCTCTGTGTTGGAGCCTAGCT
CTATTTGCTTGGCGAAGATGGTCCAGAATTTCATAGAGGACAACAATGAGAAACAATTCAGTGCGTCTAGGTGTGGTCGTAGTCGCTGCAACTGCTTCAATGGAAACTAT
ACGGACAGCTCCGAAGAGGAATTGGATTCACATGGTGGTTTTGGCGATGCCAAATTTTCTTCCGGTGGTGAGGCTTGGGAACTTCTGAAGAGCTTAATTCCTTGTACGAG
TGTTCACGAGAGGAATTTATTAGCAGACACGGCCAGGATCGTGGAGAAGAACAAGGTCTGCAAGCGCAAAGACAACTTAGCAAGACAGATTGTTACTAATGGATTGCTGG
CTCTCGGATACGACGCTTCCATCTGCAAATCTCACTGGGAAAAGTCCCCCTCATATCCGGCTGGGGATTACGAATACGTTGACGTGATTATCGAAGGAGAACGCTTACTG
ATAGACGTCGACTTCCGATCAGAATTTGAAATTGCGCGGTCAACCAAAAGTTACAGAACAATCCTCCAACTTGTTCCTTTTATCTACGTCGGCAAGCCTGGTCGGCTCCA
GAGGATCGTATCCGTCGTATCAGAGGCTGCAAAACAGAGCTTAAAAAAGAAGGGGATGCCTGTTCCCCCATGGCGAAAAGCCGAGTATGTCAAAGCGAAGTGGCTCTCTC
CTCATATTCGTGCGTCATCCTTGTCGATTTTGGGTCCAAATTCTGAGTCAAAGCAACCCCTTGAAAACTTCCAACTTGAGCCCCGTCAAGTGGCCGAGAAATCGGTGGAT
CGTAACGAGTTTGGAGAGGTAGACTTGGGGCCATCGGTAACCATTGGTGTTGAGGAGGACACTGTGAAAGAATGGAAGCCTCCGGAGGTTAAGCCAAAAAGCTCATCGCT
TGGGGCTAGAAATCTTAAGATCGTAACTGGTTTGGCATCGGTTATTGAAGACGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
TAATAACCTTCTCGTCCATTAGGCTTCTGGTCGCAATTTTCAACTAAGTTGAGCCTTGCCTCCCCTAATGAAGGAGGGCATTATCGTTATTTCGCTTTGACACGAGAATT
TCACCGGAGTGGGTTGGTTGGTTTTCTCCAGCTGCCCTTCGCCCACGCCACACGATCATGTGGCGGTTGTGAACGACTCCGTGTATGAGGGTCATTGGCAAACTCGTGAT
TTTGAATTTTTGTTGAGGGCATTTTCGGCATTTAGTTTTTCCCAAATATCGTCACCCAAACATAACGTGGAGGAACTTCGCAAGAGCCACGAAGCAAATTTGTCGACTGC
TCGACACGTGGTCCTTGCGGGTGATAATGTCGTTTTAACGCCACGCCGGATTCCACGTGGACATGAATCCTAGTCAGCACCTCCGATTTTTCTTTTTAAAGACCCCGAAG
ACTTCTCATTTCTCTTCCATCGTCCACTGAAAGAAAGCCGAGAGCGTACTTAAGTCTTAAACCGAGTTCATTTCTACCTTCCGTCTCCCTGATTTTCTTGGCCGGCGGCT
ACTGTGCCACGCCGGAAAATACGCTCCTCGCCGGAAATTATAGAGTTTCTTATCTTCGGTTGCGTTGCAGATACGGTCTGGGCACTCTGACCGTATCTGATTTCCCCAAG
CAAAAGCGACTTTACTTGGCTGAGTTTCGAAGTTATTCTATATTCTGTTTAAAAATAAGGAGAAAAACTACATTATAATTATTAGGGAATAGGTTTCTGATTTTTAGATA
TCTCAAATTTGTCCGAGTACATTTAATTTTTGGGGGCAAAACTCTGGATTTGTGTGTCGATTTTATTTTCTGATAATCTGTAATCAGATACGCTTCAGAGTCGTCGATGG
CGAATTCAAACGAACTGTAGGTTGTGGAAGTCGCATTTTCCGTCTGCTACCATAATACTGTTTTGCTTATTTTTCTGTAAACACACTTGTCCGTTCTCGAATCTTGAATT
AACCGAAGGAATTGAAGATTACGAGCTTCAATTTTGGAAAATGGTGAAGTTAATAGTTGTTGAGATCTGAGTATCTGAGAGGTGCTTAAATTTTCTCGGAAGAGCTTACG
GCTTCTCTGATCATGCCATTTCAAATGAAGATCCAGCCAATCGATTTCGACAGTGCCGAAGAAGCAGCTCGGTTAGAGCTGGTTAAGCCGGCCGTGAAGTCGTCGAAGTT
GAAGCGACTATTTGAAAAACAGTTCCATAACGTACTGAGGAATTCGGCAGAGAAGGCAAACTTTGAAGAATTGAATGTTAACAAAGACAGCTCCGACTGCTTCTCTGTGT
TGGAGCCTAGCTCTATTTGCTTGGCGAAGATGGTCCAGAATTTCATAGAGGACAACAATGAGAAACAATTCAGTGCGTCTAGGTGTGGTCGTAGTCGCTGCAACTGCTTC
AATGGAAACTATACGGACAGCTCCGAAGAGGAATTGGATTCACATGGTGGTTTTGGCGATGCCAAATTTTCTTCCGGTGGTGAGGCTTGGGAACTTCTGAAGAGCTTAAT
TCCTTGTACGAGTGTTCACGAGAGGAATTTATTAGCAGACACGGCCAGGATCGTGGAGAAGAACAAGGTCTGCAAGCGCAAAGACAACTTAGCAAGACAGATTGTTACTA
ATGGATTGCTGGCTCTCGGATACGACGCTTCCATCTGCAAATCTCACTGGGAAAAGTCCCCCTCATATCCGGCTGGGGATTACGAATACGTTGACGTGATTATCGAAGGA
GAACGCTTACTGATAGACGTCGACTTCCGATCAGAATTTGAAATTGCGCGGTCAACCAAAAGTTACAGAACAATCCTCCAACTTGTTCCTTTTATCTACGTCGGCAAGCC
TGGTCGGCTCCAGAGGATCGTATCCGTCGTATCAGAGGCTGCAAAACAGAGCTTAAAAAAGAAGGGGATGCCTGTTCCCCCATGGCGAAAAGCCGAGTATGTCAAAGCGA
AGTGGCTCTCTCCTCATATTCGTGCGTCATCCTTGTCGATTTTGGGTCCAAATTCTGAGTCAAAGCAACCCCTTGAAAACTTCCAACTTGAGCCCCGTCAAGTGGCCGAG
AAATCGGTGGATCGTAACGAGTTTGGAGAGGTAGACTTGGGGCCATCGGTAACCATTGGTGTTGAGGAGGACACTGTGAAAGAATGGAAGCCTCCGGAGGTTAAGCCAAA
AAGCTCATCGCTTGGGGCTAGAAATCTTAAGATCGTAACTGGTTTGGCATCGGTTATTGAAGACGAGCCATGAAATTTTTTGTTTTGTTTCTTCTCTCCTTCTTTGGTTT
TTGACTACCGTATAAACAAATAGTTATATACAACCTATGTATGGTGTTAACCTTCAAATCCCCTTCTGGTTTTTGTGTAAGGGTTTGGATATTACTCAAATCAATAAAAT
TTGAAAACATGAATAGTTAACAGCTATTAAGTTTGTAAAGAGCCTTTTTCTTCAACACCCCCCTCCCTCCCCCATAAAAAGAGGGGAAAAAAATCACTCTCCTGGTCCTG
GACAGATGATCTGGTTTCTGCTTGCATCCATCTTCTGCCATGTATTGCCATCATCATCCAGATTTAAGTGAAGGTGCAGCTGGAAACAGTGAAGGTAAAGGAACTAACTT
TATGATTGAAGAAGATTACATAAAGTATTGGTGTCTCGAGCTCCGTGAAATGAGAATATCTTAAAAAGAAGGAAAGCAAAGCAAAGAAAAGCAGAGATCCAGAGTCAATA
TCGGGTATGACTTTTTCTCGATTATATAAAGTATAATACTTAATAGTAGGTTACTTGCTAACATTGTAAAGGTAGCACCCGTTCCTTTTCACCCAACAAAAAAAAGAGAG
AGAAATTATATTATATTTCTTTGCTTCTAAATTTCCTGCTTCTTAATTAATTTGATTTGATGGGAAGATGTTTATGATGATTCTGGACATTCAGGTAAAATAGGTATAAA
AATAAGCACAGAAATTTACCTGAAAAGACCCAACTTGATCATCTTGCTTACTCTTTCATATTTGCTTAATTAAAATTGTAAGTATATGCATGTGATGAAATGTTAAAGAG
AGCTTTATTGGAAATGGGGCAGGGTGCATATTCTTTTTAGACATAGGGATTCTGCTCCTTTTGGTAGGCAAAGCAAATCAAAGGTAAAACGTAATTGGTTTCCTCTACTT
GTGGCCATGGAAGTCAATATGTTTTGCTTGTGCTATTCAATTTCATGTTCTACTCCCCCACACAACTCCACCCTCTTGCATGATGGCATCTCAAATGTAGTCTTTTCATA
CTGTAGCACACCAAAATTTAGGGACACAATGTTTTAAATAAATATTTTCTAAAATATCATAAGTTAGCCTAAAACTATTTTTAAAATATCCGTCTTTTGAATTTAAAACA
CTTTATTTATCATCTTGTTTTGGTCATGTTCAAACGATTTCCTTATAGTATTAGATCCTTATGGAAGGGTTATTAAGACCCGTTAGAAATTTATGATACAATTCTAATTA
GGAATAGTTTTGAAAATTAAAGTACAGATGTGCCTTGAAAATGTTCTCTTGAGATTTTTTTTAGTTCAAGATCAACCATGGAAAAGGTGGGTTCATTGCAATTTCTAATC
AGTTAGAAGTTAGCTAACTATTAGTTATTGTTAGTTGATTAAGATGTTGTATTCTCCCTCAATTTAAGTGGGGAACTCATCATCCTTTAAACGTGAATAAGGAAGTTATA
CGCCACCTTTTTATTTTAGATTATGCAGTGTCAAAGAGTTTTTAGAGATGGCTGTAAAAATTTCTTGGGGGTTCCACGTATTTCTTTTCTTTTAAATGTAATAACTAGAC
TTTCTAGATTAAATAAATATTTAAAGAAAAGATGTTTTAGAAATATCTTAAGCTGAATTCCTTTGAACAATGATAGATTAAGAAGAATTTTTGGATAAAAGATAGATTAA
GAAGTGTCTGGAGGGATCTAAAACAATCTATTTATTTATTTGGGCTTGAAGTATAAAGCAAGATGTTTGTACATATTGGGCTGAAAGGATAGATTGTTCATTTGGGTTCG
AAGGATAAAGTGAGATGTTTGTACGTATTAGGCAGGGGAAAATAAAGTGAAGAGAATTAAGAGTATGAGATAGCTTTCTTCTAAAGAGTACCTATAAAATAAAAAAGTTT
ATTTCTTTAAGGGATCTCTTTCTTCTTGTACTAATTTATTTAACAAAATAAAATAATTGTCAATCTGAAATTAACTTAATTAGTTGTCAGTTCGACGCTCTACCTTCACA
AGGAAAGAGTAAACTTTGTCTACAAGTTTTCAATTGAAAAAGGAATGCCTTTAGACATTACGATGACTTGACAAATAAATTAGGTGTAAATATATTTAAATTAAAAAAAT
GGAATGCCAGAAATTGTAAGCAAATGTCAATTTAAATAAGGTACGAAGAAAATTAACGTTGGTGGCCGTGAGGTTCGCTTCGCTAGGACTTCCTTTTGATAAGCAAATTT
TACATTAGGTATGTTATTGTTAGAAATTATTTATTATTTATCGTTAGATATTAGGATTGTTTTGCACCCTAATATAATTTAGGGGAAATATCAATTTTTACTCCAATTTT
TTAACATATGAATTAAATTCGCAGAATTTGGC
Protein sequenceShow/hide protein sequence
MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVNKDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGRSRCNCFNGNY
TDSSEEELDSHGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTNGLLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLL
IDVDFRSEFEIARSTKSYRTILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSLSILGPNSESKQPLENFQLEPRQVAEKSVD
RNEFGEVDLGPSVTIGVEEDTVKEWKPPEVKPKSSSLGARNLKIVTGLASVIEDEP