; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011624 (gene) of Snake gourd v1 genome

Gene IDTan0011624
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationLG11:47746717..47748021
RNA-Seq ExpressionTan0011624
SyntenyTan0011624
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131634.1 UPF0481 protein At3g47200-like [Momordica charantia]2.1e-9147.1Show/hide
Query:  MEENNLGHELTLEESIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHH-HRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESK
        ++E +  H +   E + K L     D  E SIYRVPKRL NMN KAY P+VI+IGPFHH ++ +L+VT+  KLQ L+SYL+R+   VE VV   ++WE++
Subjt:  MEENNLGHELTLEESIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHH-HRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESK

Query:  ARSCYAEPINMKSDDFVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYL
        ARSCY EPI M +D FV M+L+D                            A+  +I  D+ MLENQLPFFVLQ L+DL P + +  +  SLI L+  + 
Subjt:  ARSCYAEPINMKSDDFVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYL

Query:  SNGLVRDYKLLLPTQVKVNHFIDLLSLFYLPSSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVR
        S  +         +   V H +DLLSL++LP  DT     K + DK   L+ P V+ELC+A V +KK  EA  LMDI+FKNGVLEIP   I D FE  VR
Subjt:  SNGLVRDYKLLLPTQVKVNHFIDLLSLFYLPSSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVR

Query:  NLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDFYY-QQMSEDLHAYCKKWWHRSMATLRRD
        NLMAF+HY    + ++Y + Y  F+D +ISTE+DV LL +  II+N IGGS++EVS++FN+L KY  IP   +Y   +++ LH +CKKWW RS ATL+RD
Subjt:  NLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDFYY-QQMSEDLHAYCKKWWHRSMATLRRD

Query:  YFNSPWACISFIGATFLIILTFLQTVFSGLS
        YFNSPWA IS + AT++IILT LQT+F+ +S
Subjt:  YFNSPWACISFIGATFLIILTFLQTVFSGLS

XP_022132118.1 UPF0481 protein At3g47200-like [Momordica charantia]1.7e-12056.07Show/hide
Query:  ELTLEESIEKHLQQESP-DFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEP
        +L+LE SI K L+++ P   +EW IYRVPKRL +M   AY P+VIAIGPFHH R DL+ T+  KL C  +YL RI   V+ VVA AR WE KAR  YAEP
Subjt:  ELTLEESIEKHLQQESP-DFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEP

Query:  INMKSDDFVKMMLVDA----------------------------IWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDY
        INM SDDFV+MML+DA                            I  ++ ++L MLENQLPFFVLQ LFDLFP  K K + IS I L  ++LSNGL+R Y
Subjt:  INMKSDDFVKMMLVDA----------------------------IWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDY

Query:  KLL--LPTQVKVNHFIDLLSLFYLPSSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFD
         L   + +  +VNH +DLL  +Y+PS DTE Y +      ++ LLPP++++LC+A VKVKKAI A+SL+DI+FK GVL+IP F IHDDFEIYVRNLMAF+
Subjt:  KLL--LPTQVKVNHFIDLLSLFYLPSSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFD

Query:  HYRILE-DDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPW
         Y + E DD++YV+ YIEF+DGLIST EDV LL KE II+N IGGSN+EVS++FNNLCK TPIP  FY+   S++LH +C+KWW RS ATLRRDYF+SPW
Subjt:  HYRILE-DDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPW

Query:  ACISFIGATFLIILTFLQTVFSGLSISN
        A IS   ATFLI+L  LQT+F+  S  N
Subjt:  ACISFIGATFLIILTFLQTVFSGLSISN

XP_022158989.1 UPF0481 protein At3g47200-like isoform X1 [Momordica charantia]4.3e-9247.15Show/hide
Query:  NNLGHELTLEE---SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKA
        NN+ H   ++E   SI+K LQ+  P   E +I+RVP+RL   N +AYMP++I+IGPFHH R DLM  E  KL+ L+ YL R NF +E  V   RSWE+ A
Subjt:  NNLGHELTLEE---SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKA

Query:  RSCYAEPINMKSDDFVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLS
        R+CYAEPINM SD+FVKMMLVD                            A+  ++  DLIMLENQLPFFVLQ LFD F LE     G+S + L   + +
Subjt:  RSCYAEPINMKSDDFVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLS

Query:  NG-LVRDYKLLLPTQV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDD
         G L++   L LP  V     KVNH +D LS +Y P+  +  ++  + +  +K    PP+V+EL +A +  KKA+ AK +MDI+FK+ VL+IP   I D 
Subjt:  NG-LVRDYKLLLPTQV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDD

Query:  FEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSM
        FE YVRNLMAF+ Y    +D KY + Y  F++GLIS E+DV+LL K  II N IGG+N+EVS +FN+LCK   +  D   +  ++E LH +C   W++ M
Subjt:  FEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSM

Query:  ATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLSIS
        A+LRRDYFN+PWA ISF+ A FLI+LTFLQT+FS +S+S
Subjt:  ATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLSIS

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]4.3e-9247.15Show/hide
Query:  NNLGHELTLEE---SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKA
        NN+ H   ++E   SI+K LQ+  P   E +I+RVP+RL   N +AYMP++I+IGPFHH R DLM  E  KL+ L+ YL R NF +E  V   RSWE+ A
Subjt:  NNLGHELTLEE---SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKA

Query:  RSCYAEPINMKSDDFVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLS
        R+CYAEPINM SD+FVKMMLVD                            A+  ++  DLIMLENQLPFFVLQ LFD F LE     G+S + L   + +
Subjt:  RSCYAEPINMKSDDFVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLS

Query:  NG-LVRDYKLLLPTQV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDD
         G L++   L LP  V     KVNH +D LS +Y P+  +  ++  + +  +K    PP+V+EL +A +  KKA+ AK +MDI+FK+ VL+IP   I D 
Subjt:  NG-LVRDYKLLLPTQV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDD

Query:  FEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSM
        FE YVRNLMAF+ Y    +D KY + Y  F++GLIS E+DV+LL K  II N IGG+N+EVS +FN+LCK   +  D   +  ++E LH +C   W++ M
Subjt:  FEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSM

Query:  ATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLSIS
        A+LRRDYFN+PWA ISF+ A FLI+LTFLQT+FS +S+S
Subjt:  ATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLSIS

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia]2.5e-9247.76Show/hide
Query:  SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPINMKSDD
        SI+K LQ+  P   E +I+RVP+RL   N +AYMP++I+IGPFHH R DLM  E  KL+ L+ YL R NF +E  V   RSWE+ AR+CYAEPINM SD+
Subjt:  SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPINMKSDD

Query:  FVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNG-LVRDYKLLLPT
        FVKMMLVD                            A+  ++  DLIMLENQLPFFVLQ LFD F LE     G+S + L   + + G L++   L LP 
Subjt:  FVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNG-LVRDYKLLLPT

Query:  QV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFDHY
         V     KVNH +D LS +Y P+  +  ++  + +  +K    PP+V+EL +A +  KKA+ AK +MDI+FK+ VL+IP   I D FE YVRNLMAF+ Y
Subjt:  QV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFDHY

Query:  RILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWAC
            +D KY + Y  F++GLIS E+DV+LL K  II N IGG+N+EVS +FN+LCK   +  D   +  ++E LH +C   W++ MA+LRRDYFN+PWA 
Subjt:  RILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWAC

Query:  ISFIGATFLIILTFLQTVFSGLSIS
        ISF+ A FLI+LTFLQT+FS +S+S
Subjt:  ISFIGATFLIILTFLQTVFSGLSIS

TrEMBL top hitse value%identityAlignment
A0A0A0LM96 Uncharacterized protein1.2e-9549.25Show/hide
Query:  SIEKHLQQ-ESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHH-RIDLMVTESLKLQCLNSYLYRIN----------FRVEDVVAKARSWESKARS
        SIEK L Q  S    + SIYRVPK+L  MN KAY P++I+IGPF++H   +L+  E  KLQ  N++L+R+N            + D+V KA+SW  +AR+
Subjt:  SIEKHLQQ-ESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHH-RIDLMVTESLKLQCLNSYLYRIN----------FRVEDVVAKARSWESKARS

Query:  CYAEPINMKSDDFVKMMLVDAIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDYKLLLPTQVKVNHFIDLLSLFYLP
        CYAE INM  +DF+KMMLVD    +I+ DLI LENQLPFFVLQHLFDL P  K  D+      L  +YL+ G + +Y+      +K  HFID LS +++P
Subjt:  CYAEPINMKSDDFVKMMLVDAIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDYKLLLPTQVKVNHFIDLLSLFYLP

Query:  SSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLIST
            E   + +   +  +++PPS++ELC+A V +KKA   K LM+I F+NG+LEIP   I D FE  +RNL+AF+H+ + E +N YV+ Y+ FMD LIST
Subjt:  SSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLIST

Query:  EEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKY-TPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLS
        E+DV LL KEKII+N IGGS+ EVS++FNNLCK+ +  PND Y+  +SE L  +C +WW+++ A+L+ +YFN+PWA ISF  AT L++LT LQTVFS +S
Subjt:  EEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKY-TPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLS

A0A6J1BVD4 UPF0481 protein At3g47200-like8.1e-12156.07Show/hide
Query:  ELTLEESIEKHLQQESP-DFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEP
        +L+LE SI K L+++ P   +EW IYRVPKRL +M   AY P+VIAIGPFHH R DL+ T+  KL C  +YL RI   V+ VVA AR WE KAR  YAEP
Subjt:  ELTLEESIEKHLQQESP-DFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEP

Query:  INMKSDDFVKMMLVDA----------------------------IWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDY
        INM SDDFV+MML+DA                            I  ++ ++L MLENQLPFFVLQ LFDLFP  K K + IS I L  ++LSNGL+R Y
Subjt:  INMKSDDFVKMMLVDA----------------------------IWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDY

Query:  KLL--LPTQVKVNHFIDLLSLFYLPSSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFD
         L   + +  +VNH +DLL  +Y+PS DTE Y +      ++ LLPP++++LC+A VKVKKAI A+SL+DI+FK GVL+IP F IHDDFEIYVRNLMAF+
Subjt:  KLL--LPTQVKVNHFIDLLSLFYLPSSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFD

Query:  HYRILE-DDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPW
         Y + E DD++YV+ YIEF+DGLIST EDV LL KE II+N IGGSN+EVS++FNNLCK TPIP  FY+   S++LH +C+KWW RS ATLRRDYF+SPW
Subjt:  HYRILE-DDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPW

Query:  ACISFIGATFLIILTFLQTVFSGLSISN
        A IS   ATFLI+L  LQT+F+  S  N
Subjt:  ACISFIGATFLIILTFLQTVFSGLSISN

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X22.1e-9247.15Show/hide
Query:  NNLGHELTLEE---SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKA
        NN+ H   ++E   SI+K LQ+  P   E +I+RVP+RL   N +AYMP++I+IGPFHH R DLM  E  KL+ L+ YL R NF +E  V   RSWE+ A
Subjt:  NNLGHELTLEE---SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKA

Query:  RSCYAEPINMKSDDFVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLS
        R+CYAEPINM SD+FVKMMLVD                            A+  ++  DLIMLENQLPFFVLQ LFD F LE     G+S + L   + +
Subjt:  RSCYAEPINMKSDDFVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLS

Query:  NG-LVRDYKLLLPTQV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDD
         G L++   L LP  V     KVNH +D LS +Y P+  +  ++  + +  +K    PP+V+EL +A +  KKA+ AK +MDI+FK+ VL+IP   I D 
Subjt:  NG-LVRDYKLLLPTQV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDD

Query:  FEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSM
        FE YVRNLMAF+ Y    +D KY + Y  F++GLIS E+DV+LL K  II N IGG+N+EVS +FN+LCK   +  D   +  ++E LH +C   W++ M
Subjt:  FEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSM

Query:  ATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLSIS
        A+LRRDYFN+PWA ISF+ A FLI+LTFLQT+FS +S+S
Subjt:  ATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLSIS

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X31.2e-9247.76Show/hide
Query:  SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPINMKSDD
        SI+K LQ+  P   E +I+RVP+RL   N +AYMP++I+IGPFHH R DLM  E  KL+ L+ YL R NF +E  V   RSWE+ AR+CYAEPINM SD+
Subjt:  SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPINMKSDD

Query:  FVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNG-LVRDYKLLLPT
        FVKMMLVD                            A+  ++  DLIMLENQLPFFVLQ LFD F LE     G+S + L   + + G L++   L LP 
Subjt:  FVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNG-LVRDYKLLLPT

Query:  QV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFDHY
         V     KVNH +D LS +Y P+  +  ++  + +  +K    PP+V+EL +A +  KKA+ AK +MDI+FK+ VL+IP   I D FE YVRNLMAF+ Y
Subjt:  QV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFDHY

Query:  RILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWAC
            +D KY + Y  F++GLIS E+DV+LL K  II N IGG+N+EVS +FN+LCK   +  D   +  ++E LH +C   W++ MA+LRRDYFN+PWA 
Subjt:  RILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWAC

Query:  ISFIGATFLIILTFLQTVFSGLSIS
        ISF+ A FLI+LTFLQT+FS +S+S
Subjt:  ISFIGATFLIILTFLQTVFSGLSIS

A0A6J1E120 UPF0481 protein At3g47200-like isoform X12.1e-9247.15Show/hide
Query:  NNLGHELTLEE---SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKA
        NN+ H   ++E   SI+K LQ+  P   E +I+RVP+RL   N +AYMP++I+IGPFHH R DLM  E  KL+ L+ YL R NF +E  V   RSWE+ A
Subjt:  NNLGHELTLEE---SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKA

Query:  RSCYAEPINMKSDDFVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLS
        R+CYAEPINM SD+FVKMMLVD                            A+  ++  DLIMLENQLPFFVLQ LFD F LE     G+S + L   + +
Subjt:  RSCYAEPINMKSDDFVKMMLVD----------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLS

Query:  NG-LVRDYKLLLPTQV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDD
         G L++   L LP  V     KVNH +D LS +Y P+  +  ++  + +  +K    PP+V+EL +A +  KKA+ AK +MDI+FK+ VL+IP   I D 
Subjt:  NG-LVRDYKLLLPTQV-----KVNHFIDLLSLFYLPSSDTEGYSKKTRS-DKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDD

Query:  FEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSM
        FE YVRNLMAF+ Y    +D KY + Y  F++GLIS E+DV+LL K  II N IGG+N+EVS +FN+LCK   +  D   +  ++E LH +C   W++ M
Subjt:  FEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDF-YYQQMSEDLHAYCKKWWHRSM

Query:  ATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLSIS
        A+LRRDYFN+PWA ISF+ A FLI+LTFLQT+FS +S+S
Subjt:  ATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLSIS

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026457.0e-2122.74Show/hide
Query:  LTLEESIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRIN-FRVEDVVAKARSWESKARSCYAEPI
        + +++S++  L++   +    SI+ VPK L   +  +Y P  ++IGP+H  + +L   E  KL        + N FR  D+V K +S E K R+CY + I
Subjt:  LTLEESIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRIN-FRVEDVVAKARSWESKARSCYAEPI

Query:  NMKSDDFVKMMLVDAIWF----------------------NINRDLIMLENQLPFFVLQHL--FDLFPLEKKKDDGISLIGLMAEYLSNGLVR--DYKLL
            +  + +M VD+ +                        I RD++M+ENQ+P FVL+    F L   E   D  +S++  + + LS  +++  D ++L
Subjt:  NMKSDDFVKMMLVDAIWF----------------------NINRDLIMLENQLPFFVLQHL--FDLFPLEKKKDDGISLIGLMAEYLSNGLVR--DYKLL

Query:  LPTQVKVNHFIDLLSLFYLP---SSDTEGYSKKTRSDKK---------------------------VLLLP-----------------------------
             + NH +D L    +P     + E   ++ R+D+                            +L  P                             
Subjt:  LPTQVKVNHFIDLLSLFYLP---SSDTEGYSKKTRSDKK---------------------------VLLLP-----------------------------

Query:  -----------------------PSVSELCKASVKVKKAIEAKSLMDITF--KNGVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLD-YIEFMD
                               PSVS+L KA V+ K      ++  +TF   +G   +P   +  + E  +RNL+A   Y         V   Y E ++
Subjt:  -----------------------PSVSELCKASVKVKKAIEAKSLMDITF--KNGVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLD-YIEFMD

Query:  GLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQ
        G+I +EEDV LL ++ ++V+ +  S++E ++M+N + K   +    +  +  ED++ Y    W   +  L   Y    W  ++F+ A  L++L  LQ
Subjt:  GLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQ

Q9SD53 UPF0481 protein At3g472003.7e-3027.69Show/hide
Query:  SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFR--VEDVVAKA-RSWESKARSCYAEPINMK
        S E  L  ES       I+RVP+    +N KAY P+V++IGP+H+    L + +  K + L  +L     +   E+V+ KA    E K R  Y+E +   
Subjt:  SIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFR--VEDVVAKA-RSWESKARSCYAEPINMK

Query:  SDDFVKMMLVDAIWF--------------------------NINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDYKLL-LP
          D + MM++D  +                           +I  DL++LENQ+PFFVLQ L+    +  K      L  +   +  N + ++       
Subjt:  SDDFVKMMLVDAIWF--------------------------NINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDYKLL-LP

Query:  TQVKVNHFIDLLSLFYLPSSDT--------------EGYSKKTRS-DKKVLLLPPSVSELCKASVKVK-KAIEAKSLMDITFKNGVLEIPCFVIHDDFEI
           K  H +DL+   +LP++                EG S    S D K + L  S   L    +K + +  +  S++++  K   L+IP          
Subjt:  TQVKVNHFIDLLSLFYLPSSDT--------------EGYSKKTRS-DKKVLLLPPSVSELCKASVKVK-KAIEAKSLMDITFKNGVLEIPCFVIHDDFEI

Query:  YVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPND-FYYQQMSEDLHAYCKKWWHRSMATL
        +  N +AF+ +    D +  +  YI FM  L++ EEDVT L  +K+I+    GSN EVS+ F  + K      D  Y   + + ++ Y KKW++   A  
Subjt:  YVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPND-FYYQQMSEDLHAYCKKWWHRSMATL

Query:  RRDYFNSPWACISFIGATFLIILTFLQTVFSGLSISN
        R  +F SPW  +S     F+I+LT LQ+  + LS  N
Subjt:  RRDYFNSPWACISFIGATFLIILTFLQTVFSGLSISN

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)1.9e-4529.75Show/hide
Query:  LTLEESIEK-HLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPI
        +++ + +E+ H   ++  + +  IYRVP  L+  + K+Y P+ +++GP+HH +  L   +  K + +N  L R N  ++  +   R  E KAR+CY  P+
Subjt:  LTLEESIEK-HLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPI

Query:  NMKSDDFVKMMLVDAIW------------------------------FNINRDLIMLENQLPFFVLQHLFDL------------------FPLEKKKDDG
        ++ S++F++M+++D  +                               +I RD++MLENQLP FVL  L +L                  F      D+ 
Subjt:  NMKSDDFVKMMLVDAIW------------------------------FNINRDLIMLENQLPFFVLQHLFDL------------------FPLEKKKDDG

Query:  ISLIGLMAEYLSNGLVRDYKLLLPTQVKVNHFIDLLSLFYLPSS-------DTEGYSKKTR-SDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKN
        ++  G     L N L RD        +   H +D+     L SS         + +S+ TR +DK+   L   V+EL +A +K ++  +     D+ FKN
Subjt:  ISLIGLMAEYLSNGLVRDYKLLLPTQVKVNHFIDLLSLFYLPSS-------DTEGYSKKTR-SDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKN

Query:  GVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPI-PNDFYYQQMSED
        G LEIP  +IHD  +    NL+AF+   I  D +  +  YI FMD LI + EDV+ L    II + + GS+ EV+ +FN LC+       D Y  ++S +
Subjt:  GVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPI-PNDFYYQQMSED

Query:  LHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQTVFS
        ++ Y    W+   ATL+  YFN+PWA +SF  A  L++LTF Q+ ++
Subjt:  LHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQTVFS

AT3G50150.1 Plant protein of unknown function (DUF247)1.4e-4831.32Show/hide
Query:  LTLEESIEKHLQQESPD-FSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPI
        +++++ +EK L  ++ + + +  IYRVP  L+  + K+Y+P+ ++IGP+HH ++ L   E  K + +N  + R    +E  +   +  E +AR+CY  PI
Subjt:  LTLEESIEKHLQQESPD-FSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPI

Query:  NMK-SDDFVKMMLVD------------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVR
        +MK S++F +M+++D                               +  +I RD+IMLENQLP FVL  L  L      +    +  G++AE      VR
Subjt:  NMK-SDDFVKMMLVD------------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVR

Query:  DYKLLLPTQVKVN---------------------HFIDLLSLFYLPSSDTEG----YSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKN
         +K L+PT   +                      H +D+     + SS+T      Y   +  +K+  L+   V+EL  A V   +  E   L DI FKN
Subjt:  DYKLLLPTQVKVN---------------------HFIDLLSLFYLPSSDTEG----YSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKN

Query:  GVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPI-PNDFYYQQMSED
        G L+IP  +IHD  +    NL+AF+       +N  +  YI FMD LI++ +DV+ L  + II + + GS+ EV+ +FN LCK     P D Y  Q+S +
Subjt:  GVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPI-PNDFYYQQMSED

Query:  LHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQTVFS
        ++ Y  + W+   ATLR+ YFN+PWA  SF  A  L+ LTF Q+ F+
Subjt:  LHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQTVFS

AT3G50160.1 Plant protein of unknown function (DUF247)6.2e-4933.75Show/hide
Query:  IYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPINMKSDDFVKMMLVDAIWF-----
        IYRVP  L+  + K+YMP++++IGP+HH    LM  E  K + +N  + R    +E  +   +  E KAR+CY  PINM  ++F++M+++D ++      
Subjt:  IYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPINMKSDDFVKMMLVDAIWF-----

Query:  -------------------------NINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDYKLLLPTQVKVN-----HFIDLL
                                 +I RD++MLENQLP+ VL+ L     L+ ++ D +  + +            ++ LLPT+  +      H +D+L
Subjt:  -------------------------NINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDYKLLLPTQVKVN-----HFIDLL

Query:  SLFYLPSSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFM
            L SS T      +  +K+   L   V+EL  A V+  +  E     DI FKNG L+IP  +IHD  +    NL+AF+   I    +K +  YI FM
Subjt:  SLFYLPSSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFM

Query:  DGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPI-PNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQT
        D LI++ EDV+ L    II N + GS+ EVS +FN L K     PND Y   ++ +++ Y ++ W+   ATLR  YFN+PWA  SFI A  L+I TF Q+
Subjt:  DGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPI-PNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQT

Query:  VFS
         F+
Subjt:  VFS

AT3G50170.1 Plant protein of unknown function (DUF247)1.7e-4631.76Show/hide
Query:  SIEKHLQQESPD-----FSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPIN
        SI   L+Q   D     + +  IYRVP  L+  + K+Y P+ +++GP+HH +  L   E  K + LN  L R+  R+E      R  E KAR+CY  PI+
Subjt:  SIEKHLQQESPD-----FSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPIN

Query:  MKSDDFVKMMLVD------------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDY
        +  ++F +M+++D                               +  +I RD+IMLENQLP FVL  L +L  L  +   GI +  +  ++    +    
Subjt:  MKSDDFVKMMLVD------------------------------AIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDY

Query:  KLLLPTQVKVN----------------HFIDLLSLFYLPSSDT-------EGYSKKTR-SDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVL
         L  P Q K+                 H +D+     L SS T       +  ++ TR  DK+   L   V+EL +A VK +K  +     DI FKNG L
Subjt:  KLLLPTQVKVN----------------HFIDLLSLFYLPSSDT-------EGYSKKTR-SDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVL

Query:  EIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPI-PNDFYYQQMSEDLHA
        EIP  +IHD  +    NL+AF+   I  + + ++  YI FMD LI++ EDV+ L    II + + GS+ EV+ +FN LC+     P D +  ++S D++ 
Subjt:  EIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPI-PNDFYYQQMSEDLHA

Query:  YCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQTVFS
        Y  + W+   ATL   YFN+PWA  SF  A  L++LT  Q+ ++
Subjt:  YCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQTVFS

AT4G31980.1 unknown protein1.1e-5835.12Show/hide
Query:  EENNLGHELTLEESIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKAR
        E  N      L +SI+  L   S   ++  IY+VP +L+ +N  AY P +++ GP H  + +L   E  K + L S++ R N  +ED+V  AR+WE  AR
Subjt:  EENNLGHELTLEESIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKAR

Query:  SCYAEPINMKSDDFVKMMLVDAIWF---------------------------NINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNG
        SCYAE + + SD+FV+M++VD  +                            ++ RD+I++ENQLPFFV++ +F L  L   +    S+I L   + S  
Subjt:  SCYAEPINMKSDDFVKMMLVDAIWF---------------------------NINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNG

Query:  LVR--DYKLLLPTQVKVNHFIDLLSLFYLPSSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRN
        L R  D K +     +  HF+DLL   YLP    +      + D       P  +EL  A V+ K A  +  L+DI+F +GVL+IP  V+ D  E   +N
Subjt:  LVR--DYKLLLPTQVKVNHFIDLLSLFYLPSSDTEGYSKKTRSDKKVLLLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRN

Query:  LMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYF
        ++ F+  R     NK  LDYI  +   I +  D  LL    IIVN +G S  +VS +FN++ K       FY+  +SE+L AYC   W+R  A LRRDYF
Subjt:  LMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMFNNLCKYTPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYF

Query:  NSPWACISFIGATFLIILTFLQTVFSGLSI
        ++PWA  S   A  L++LTF+Q+V S L++
Subjt:  NSPWACISFIGATFLIILTFLQTVFSGLSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGAATAATTTGGGTCATGAATTGACCCTCGAGGAATCCATTGAAAAACATCTCCAGCAAGAATCTCCTGACTTTTCAGAATGGAGCATTTATCGTGTTCCCAA
ACGGCTAAAAAACATGAATTGTAAAGCTTATATGCCTGAAGTCATTGCCATCGGCCCTTTTCACCATCATCGAATAGACTTGATGGTCACAGAATCACTTAAACTCCAAT
GTCTAAATAGTTATCTATATCGTATAAATTTCAGAGTTGAGGATGTTGTGGCAAAGGCTCGAAGTTGGGAGAGCAAAGCTCGGAGTTGCTATGCCGAACCCATAAACATG
AAGAGCGATGATTTTGTGAAAATGATGCTCGTGGATGCTATCTGGTTCAATATAAATCGTGACTTGATAATGCTTGAAAACCAACTTCCTTTCTTCGTTCTTCAACATCT
ATTTGACCTATTTCCACTCGAAAAAAAGAAAGATGATGGCATCTCCTTGATAGGACTTATGGCAGAATATTTGTCCAATGGGTTGGTAAGAGATTACAAGCTGCTGCTTC
CCACACAAGTAAAGGTAAACCACTTCATTGACTTATTAAGCTTATTCTACCTCCCCTCAAGTGATACAGAAGGGTATAGCAAAAAAACTCGGTCGGATAAAAAGGTTCTT
CTTCTTCCCCCAAGTGTGAGTGAGCTATGCAAGGCTAGTGTCAAAGTAAAGAAAGCAATAGAAGCCAAAAGCTTGATGGACATAACTTTCAAAAATGGAGTTCTAGAAAT
TCCATGTTTTGTAATTCATGACGACTTCGAAATCTACGTACGAAATTTGATGGCATTTGATCATTACCGTATACTAGAAGATGATAACAAGTATGTACTGGATTATATTG
AGTTCATGGATGGTTTGATAAGCACAGAGGAAGATGTGACTTTACTTGCCAAGGAAAAAATTATAGTCAACCTTATCGGTGGGAGTAACGAAGAAGTTTCGAAAATGTTT
AACAATTTATGTAAATACACTCCAATTCCAAATGATTTTTACTACCAGCAAATGAGCGAAGACTTACATGCTTATTGTAAGAAATGGTGGCACAGATCGATGGCTACACT
GAGACGTGACTATTTCAATAGTCCATGGGCTTGTATCTCATTTATTGGGGCTACATTCCTCATTATCCTCACTTTCCTCCAAACCGTGTTTTCAGGTTTATCCATTTCGA
ATAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGAATAATTTGGGTCATGAATTGACCCTCGAGGAATCCATTGAAAAACATCTCCAGCAAGAATCTCCTGACTTTTCAGAATGGAGCATTTATCGTGTTCCCAA
ACGGCTAAAAAACATGAATTGTAAAGCTTATATGCCTGAAGTCATTGCCATCGGCCCTTTTCACCATCATCGAATAGACTTGATGGTCACAGAATCACTTAAACTCCAAT
GTCTAAATAGTTATCTATATCGTATAAATTTCAGAGTTGAGGATGTTGTGGCAAAGGCTCGAAGTTGGGAGAGCAAAGCTCGGAGTTGCTATGCCGAACCCATAAACATG
AAGAGCGATGATTTTGTGAAAATGATGCTCGTGGATGCTATCTGGTTCAATATAAATCGTGACTTGATAATGCTTGAAAACCAACTTCCTTTCTTCGTTCTTCAACATCT
ATTTGACCTATTTCCACTCGAAAAAAAGAAAGATGATGGCATCTCCTTGATAGGACTTATGGCAGAATATTTGTCCAATGGGTTGGTAAGAGATTACAAGCTGCTGCTTC
CCACACAAGTAAAGGTAAACCACTTCATTGACTTATTAAGCTTATTCTACCTCCCCTCAAGTGATACAGAAGGGTATAGCAAAAAAACTCGGTCGGATAAAAAGGTTCTT
CTTCTTCCCCCAAGTGTGAGTGAGCTATGCAAGGCTAGTGTCAAAGTAAAGAAAGCAATAGAAGCCAAAAGCTTGATGGACATAACTTTCAAAAATGGAGTTCTAGAAAT
TCCATGTTTTGTAATTCATGACGACTTCGAAATCTACGTACGAAATTTGATGGCATTTGATCATTACCGTATACTAGAAGATGATAACAAGTATGTACTGGATTATATTG
AGTTCATGGATGGTTTGATAAGCACAGAGGAAGATGTGACTTTACTTGCCAAGGAAAAAATTATAGTCAACCTTATCGGTGGGAGTAACGAAGAAGTTTCGAAAATGTTT
AACAATTTATGTAAATACACTCCAATTCCAAATGATTTTTACTACCAGCAAATGAGCGAAGACTTACATGCTTATTGTAAGAAATGGTGGCACAGATCGATGGCTACACT
GAGACGTGACTATTTCAATAGTCCATGGGCTTGTATCTCATTTATTGGGGCTACATTCCTCATTATCCTCACTTTCCTCCAAACCGTGTTTTCAGGTTTATCCATTTCGA
ATAAGTAA
Protein sequenceShow/hide protein sequence
MEENNLGHELTLEESIEKHLQQESPDFSEWSIYRVPKRLKNMNCKAYMPEVIAIGPFHHHRIDLMVTESLKLQCLNSYLYRINFRVEDVVAKARSWESKARSCYAEPINM
KSDDFVKMMLVDAIWFNINRDLIMLENQLPFFVLQHLFDLFPLEKKKDDGISLIGLMAEYLSNGLVRDYKLLLPTQVKVNHFIDLLSLFYLPSSDTEGYSKKTRSDKKVL
LLPPSVSELCKASVKVKKAIEAKSLMDITFKNGVLEIPCFVIHDDFEIYVRNLMAFDHYRILEDDNKYVLDYIEFMDGLISTEEDVTLLAKEKIIVNLIGGSNEEVSKMF
NNLCKYTPIPNDFYYQQMSEDLHAYCKKWWHRSMATLRRDYFNSPWACISFIGATFLIILTFLQTVFSGLSISNK