; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014557 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014557
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationtig00000729:545144..546506
RNA-Seq ExpressionSgr014557
SyntenySgr014557
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131634.1 UPF0481 protein At3g47200-like [Momordica charantia]1.7e-8344.63Show/hide
Query:  NIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGRE-HLKAMEWHKLQCLYIYLRCLNMNVEAAVEIVQ-------------
        ++  SI+   K+L    I P+C+IYRV  RL+ +N  AYTPQV+SIGPFH+  + +L   + HKLQ L  YL  + M VEA V+I Q             
Subjt:  NIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGRE-HLKAMEWHKLQCLYIYLRCLNMNVEAAVEIVQ-------------

Query:  ---------------------FMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR
                             F+IL ++ +    NG D SFYEA+  D+Y D  MLENQLP FVL+ L+D +  + + +  S ++ I   FF   + N  
Subjt:  ---------------------FMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR

Query:  MVIPSYVSEGKAKHLVDFLSSCHVPSYDTIYKNG-IAFLRPPTLTALHEAGVRILKATNKP-LMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLR
          IP +VS    KHLVD LS   +P  DT  ++    +L  P +T L EAGV I K      LMDISFK+GVL IPP  I D+FE   RN+MAFE +   
Subjt:  MVIPSYVSEGKAKHLVDFLSSCHVPSYDTIYKNG-IAFLRPPTLTALHEAGVRILKATNKP-LMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLR

Query:  SGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVV
        +      +Y  FLD +ISTEKD  LLV+AGI+ N+IGGSD+E+++LFN+L K V+IPGG  Y   +   LHD+CKK WP+ KATL+R+YFN+PW  IS+V
Subjt:  SGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVV

Query:  AATFLILLTLLQTLFSALS
        AAT++I+LTLLQT+F+A+S
Subjt:  AATFLILLTLLQTLFSALS

XP_022132066.1 UPF0481 protein At3g47200-like [Momordica charantia]1.1e-7740.83Show/hide
Query:  MEPEHVGTHQQNYSPFDIEKDADSEQFRLDNLNIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLY
        ME +H+ T       +D+ K  D  +  L+  ++  S++   +KL    I+ +C+IYRVS RL  IN  AYTPQ +SIGPFH+G++   AME  KL+ L 
Subjt:  MEPEHVGTHQQNYSPFDIEKDADSEQFRLDNLNIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLY

Query:  IYLRCLNMNVEAAVEIVQ----------------------FMILVHHRF------------HQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLF
         YLR + M +E A EI Q                       M+LV   F              T   L+ + ++AI  D+Y+D I+LENQLP F+LECL 
Subjt:  IYLRCLNMNVEAAVEIVQ----------------------FMILVHHRF------------HQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLF

Query:  DRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSYVSEGKAKHLVDFLSSCHVPSYDTIYKNGIAFLR---PPTLTALHEAGVRILKAT--NKPLMDI
        D+ S         FV F   F        +R +I   +   K  HLVDFLS  +     T   + + + +   PPT T L EAGV   KAT   + +MDI
Subjt:  DRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSYVSEGKAKHLVDFLSSCHVPSYDTIYKNGIAFLR---PPTLTALHEAGVRILKAT--NKPLMDI

Query:  SFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDM
         FKDGVL+IP  +I+D FE Y RN++A+E + +      + +Y+ FLD LISTE+D SLLVKAGI+TNNIGG++E+++KLFN+LCK++ I     YY D+
Subjt:  SFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDM

Query:  ANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIINTK
        +  LH YC+  W +  A+LRR+YFNTPW  IS +AATFL+LLT +Q ++SA+S   +K
Subjt:  ANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIINTK

XP_022132118.1 UPF0481 protein At3g47200-like [Momordica charantia]2.3e-8042.42Show/hide
Query:  HQQNY------SPFDIEKDAD---SEQFRLDNLNIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCL
        HQQ+Y      +P+  +   D    +Q   D+L++  SI +  ++    + A  C IYRV  RL  +   AYTP+V++IGPFH+GR  L A +  KL C 
Subjt:  HQQNY------SPFDIEKDAD---SEQFRLDNLNIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCL

Query:  YIYLRCLNMNVEAAVE----------------------------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECL
          YL  +  +V+  V                                   IV+ MI+++    +T  G D+ F + I  DLYQ+  MLENQLP FVL+ L
Subjt:  YIYLRCLNMNVEAAVE----------------------------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECL

Query:  FDRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSYVSEGKAKHLVDFLSSCHVPSYDT-------IYKNGIAFLRPPTLTALHEAGVRILKATN-KP
        FD     ++   +SF++ +   F  NGL+    +     S  +  HLVD L   +VPS DT         KN I FL PPT+T L EAGV++ KA   + 
Subjt:  FDRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSYVSEGKAKHLVDFLSSCHVPSYDT-------IYKNGIAFLRPPTLTALHEAGVRILKATN-KP

Query:  LMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGN--VTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGD
        L+DISFK GVL IPPF+I+D+FE Y RN+MAFEQ+ +R    +  V  YI FLDGLIST +D +LLVK GI+ N+IGGS++E+++LFNNLCK   IP   
Subjt:  LMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGN--VTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGD

Query:  KYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIIN
         Y+Y  +  LHD+C+K WP+ KATLRR+YF++PW  ISV AATFLILL LLQT+F+A S  N
Subjt:  KYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIIN

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]2.0e-7643.56Show/hide
Query:  IANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAVE------------------
        + +SI++  ++L    +A +CNI+RV  RL+  N  AY PQ++SIGPFH+GR+ L  ME HKL+ L  YLR  N  +E  V                   
Subjt:  IANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAVE------------------

Query:  ----------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR-M
                        IV+ M++V     +T    D   + A+  DLY D IMLENQLP FVL+ LFD+ SLE     LSF++  H F+    L+  R +
Subjt:  ----------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR-M

Query:  VIPS--YVSEGKAKHLVDFLSSCHVPSYDTI--YKNGIAFLR-----PPTLTALHEAGVRILKATN-KPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMA
         +P    +S  K  HLVDFLS  + P+  ++    + +A  R     PPT+T L EAG+   KA   K +MDISFKD VL IPP +I D FE Y RN+MA
Subjt:  VIPS--YVSEGKAKHLVDFLSSCHVPSYDTI--YKNGIAFLR-----PPTLTALHEAGVRILKATN-KPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMA

Query:  FEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTP
        FEQ+   +      +Y  FL+GLIS E+D SLLVKA I+TN IGG+++E++ LFN+LCK+V + G    +  +  ALH++C  RW K  A+LRR+YFNTP
Subjt:  FEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTP

Query:  WTLISVVAATFLILLTLLQTLFSALSI
        W  IS VAA FLILLT LQTLFSA+S+
Subjt:  WTLISVVAATFLILLTLLQTLFSALSI

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia]2.0e-7643.56Show/hide
Query:  IANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAVE------------------
        + +SI++  ++L    +A +CNI+RV  RL+  N  AY PQ++SIGPFH+GR+ L  ME HKL+ L  YLR  N  +E  V                   
Subjt:  IANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAVE------------------

Query:  ----------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR-M
                        IV+ M++V     +T    D   + A+  DLY D IMLENQLP FVL+ LFD+ SLE     LSF++  H F+    L+  R +
Subjt:  ----------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR-M

Query:  VIPS--YVSEGKAKHLVDFLSSCHVPSYDTI--YKNGIAFLR-----PPTLTALHEAGVRILKATN-KPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMA
         +P    +S  K  HLVDFLS  + P+  ++    + +A  R     PPT+T L EAG+   KA   K +MDISFKD VL IPP +I D FE Y RN+MA
Subjt:  VIPS--YVSEGKAKHLVDFLSSCHVPSYDTI--YKNGIAFLR-----PPTLTALHEAGVRILKATN-KPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMA

Query:  FEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTP
        FEQ+   +      +Y  FL+GLIS E+D SLLVKA I+TN IGG+++E++ LFN+LCK+V + G    +  +  ALH++C  RW K  A+LRR+YFNTP
Subjt:  FEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTP

Query:  WTLISVVAATFLILLTLLQTLFSALSI
        W  IS VAA FLILLT LQTLFSA+S+
Subjt:  WTLISVVAATFLILLTLLQTLFSALSI

TrEMBL top hitse value%identityAlignment
A0A6J1BQT6 UPF0481 protein At3g47200-like8.2e-8444.63Show/hide
Query:  NIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGRE-HLKAMEWHKLQCLYIYLRCLNMNVEAAVEIVQ-------------
        ++  SI+   K+L    I P+C+IYRV  RL+ +N  AYTPQV+SIGPFH+  + +L   + HKLQ L  YL  + M VEA V+I Q             
Subjt:  NIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGRE-HLKAMEWHKLQCLYIYLRCLNMNVEAAVEIVQ-------------

Query:  ---------------------FMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR
                             F+IL ++ +    NG D SFYEA+  D+Y D  MLENQLP FVL+ L+D +  + + +  S ++ I   FF   + N  
Subjt:  ---------------------FMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR

Query:  MVIPSYVSEGKAKHLVDFLSSCHVPSYDTIYKNG-IAFLRPPTLTALHEAGVRILKATNKP-LMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLR
          IP +VS    KHLVD LS   +P  DT  ++    +L  P +T L EAGV I K      LMDISFK+GVL IPP  I D+FE   RN+MAFE +   
Subjt:  MVIPSYVSEGKAKHLVDFLSSCHVPSYDTIYKNG-IAFLRPPTLTALHEAGVRILKATNKP-LMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLR

Query:  SGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVV
        +      +Y  FLD +ISTEKD  LLV+AGI+ N+IGGSD+E+++LFN+L K V+IPGG  Y   +   LHD+CKK WP+ KATL+R+YFN+PW  IS+V
Subjt:  SGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVV

Query:  AATFLILLTLLQTLFSALS
        AAT++I+LTLLQT+F+A+S
Subjt:  AATFLILLTLLQTLFSALS

A0A6J1BR71 UPF0481 protein At3g47200-like5.1e-7840.83Show/hide
Query:  MEPEHVGTHQQNYSPFDIEKDADSEQFRLDNLNIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLY
        ME +H+ T       +D+ K  D  +  L+  ++  S++   +KL    I+ +C+IYRVS RL  IN  AYTPQ +SIGPFH+G++   AME  KL+ L 
Subjt:  MEPEHVGTHQQNYSPFDIEKDADSEQFRLDNLNIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLY

Query:  IYLRCLNMNVEAAVEIVQ----------------------FMILVHHRF------------HQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLF
         YLR + M +E A EI Q                       M+LV   F              T   L+ + ++AI  D+Y+D I+LENQLP F+LECL 
Subjt:  IYLRCLNMNVEAAVEIVQ----------------------FMILVHHRF------------HQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLF

Query:  DRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSYVSEGKAKHLVDFLSSCHVPSYDTIYKNGIAFLR---PPTLTALHEAGVRILKAT--NKPLMDI
        D+ S         FV F   F        +R +I   +   K  HLVDFLS  +     T   + + + +   PPT T L EAGV   KAT   + +MDI
Subjt:  DRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSYVSEGKAKHLVDFLSSCHVPSYDTIYKNGIAFLR---PPTLTALHEAGVRILKAT--NKPLMDI

Query:  SFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDM
         FKDGVL+IP  +I+D FE Y RN++A+E + +      + +Y+ FLD LISTE+D SLLVKAGI+TNNIGG++E+++KLFN+LCK++ I     YY D+
Subjt:  SFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDM

Query:  ANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIINTK
        +  LH YC+  W +  A+LRR+YFNTPW  IS +AATFL+LLT +Q ++SA+S   +K
Subjt:  ANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIINTK

A0A6J1BVD4 UPF0481 protein At3g47200-like1.1e-8042.42Show/hide
Query:  HQQNY------SPFDIEKDAD---SEQFRLDNLNIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCL
        HQQ+Y      +P+  +   D    +Q   D+L++  SI +  ++    + A  C IYRV  RL  +   AYTP+V++IGPFH+GR  L A +  KL C 
Subjt:  HQQNY------SPFDIEKDAD---SEQFRLDNLNIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCL

Query:  YIYLRCLNMNVEAAVE----------------------------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECL
          YL  +  +V+  V                                   IV+ MI+++    +T  G D+ F + I  DLYQ+  MLENQLP FVL+ L
Subjt:  YIYLRCLNMNVEAAVE----------------------------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECL

Query:  FDRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSYVSEGKAKHLVDFLSSCHVPSYDT-------IYKNGIAFLRPPTLTALHEAGVRILKATN-KP
        FD     ++   +SF++ +   F  NGL+    +     S  +  HLVD L   +VPS DT         KN I FL PPT+T L EAGV++ KA   + 
Subjt:  FDRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSYVSEGKAKHLVDFLSSCHVPSYDT-------IYKNGIAFLRPPTLTALHEAGVRILKATN-KP

Query:  LMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGN--VTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGD
        L+DISFK GVL IPPF+I+D+FE Y RN+MAFEQ+ +R    +  V  YI FLDGLIST +D +LLVK GI+ N+IGGS++E+++LFNNLCK   IP   
Subjt:  LMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGN--VTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGD

Query:  KYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIIN
         Y+Y  +  LHD+C+K WP+ KATLRR+YF++PW  ISV AATFLILL LLQT+F+A S  N
Subjt:  KYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIIN

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X29.7e-7743.56Show/hide
Query:  IANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAVE------------------
        + +SI++  ++L    +A +CNI+RV  RL+  N  AY PQ++SIGPFH+GR+ L  ME HKL+ L  YLR  N  +E  V                   
Subjt:  IANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAVE------------------

Query:  ----------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR-M
                        IV+ M++V     +T    D   + A+  DLY D IMLENQLP FVL+ LFD+ SLE     LSF++  H F+    L+  R +
Subjt:  ----------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR-M

Query:  VIPS--YVSEGKAKHLVDFLSSCHVPSYDTI--YKNGIAFLR-----PPTLTALHEAGVRILKATN-KPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMA
         +P    +S  K  HLVDFLS  + P+  ++    + +A  R     PPT+T L EAG+   KA   K +MDISFKD VL IPP +I D FE Y RN+MA
Subjt:  VIPS--YVSEGKAKHLVDFLSSCHVPSYDTI--YKNGIAFLR-----PPTLTALHEAGVRILKATN-KPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMA

Query:  FEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTP
        FEQ+   +      +Y  FL+GLIS E+D SLLVKA I+TN IGG+++E++ LFN+LCK+V + G    +  +  ALH++C  RW K  A+LRR+YFNTP
Subjt:  FEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTP

Query:  WTLISVVAATFLILLTLLQTLFSALSI
        W  IS VAA FLILLT LQTLFSA+S+
Subjt:  WTLISVVAATFLILLTLLQTLFSALSI

A0A6J1E120 UPF0481 protein At3g47200-like isoform X19.7e-7743.56Show/hide
Query:  IANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAVE------------------
        + +SI++  ++L    +A +CNI+RV  RL+  N  AY PQ++SIGPFH+GR+ L  ME HKL+ L  YLR  N  +E  V                   
Subjt:  IANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAVE------------------

Query:  ----------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR-M
                        IV+ M++V     +T    D   + A+  DLY D IMLENQLP FVL+ LFD+ SLE     LSF++  H F+    L+  R +
Subjt:  ----------------IVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSR-M

Query:  VIPS--YVSEGKAKHLVDFLSSCHVPSYDTI--YKNGIAFLR-----PPTLTALHEAGVRILKATN-KPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMA
         +P    +S  K  HLVDFLS  + P+  ++    + +A  R     PPT+T L EAG+   KA   K +MDISFKD VL IPP +I D FE Y RN+MA
Subjt:  VIPS--YVSEGKAKHLVDFLSSCHVPSYDTI--YKNGIAFLR-----PPTLTALHEAGVRILKATN-KPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMA

Query:  FEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTP
        FEQ+   +      +Y  FL+GLIS E+D SLLVKA I+TN IGG+++E++ LFN+LCK+V + G    +  +  ALH++C  RW K  A+LRR+YFNTP
Subjt:  FEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTP

Query:  WTLISVVAATFLILLTLLQTLFSALSI
        W  IS VAA FLILLT LQTLFSA+S+
Subjt:  WTLISVVAATFLILLTLLQTLFSALSI

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026451.2e-1030.17Show/hide
Query:  PTLTALHEAGVRILKATNKPLMDISF--KDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSD
        P+++ LH+AGVR     +  +  ++F    G   +P   +  N E   RN++A+E     SGP   T+Y   ++G+I +E+D  LL + G+L + +  SD
Subjt:  PTLTALHEAGVRILKATNKPLMDISF--KDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSD

Query:  EEIAKLFNNLCKEVTIPGG---DKYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFS
        +E A+++N + K V +      DK   D    ++ Y   RW      L   Y    W +++ +AA  L++L  LQ LFS
Subjt:  EEIAKLFNNLCKEVTIPGG---DKYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFS

Q9SD53 UPF0481 protein At3g472001.2e-3125.61Show/hide
Query:  NIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCL-------NMNVEAAVEI---------
        N  +S  ++   LL S+    C I+RV    VA+N  AY P+VVSIGP+HYG +HL+ ++ HK + L ++L          N+ V+A V++         
Subjt:  NIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCL-------NMNVEAAVEI---------

Query:  --------VQFMILVHHRFHQ-----TANGLDVS-----FYEAIRPDLYQDFIMLENQLPLFVLECLF--DRLSLERDDRKLSFVKFIHNFFFDNGLVNS
                + FM+++   F        +  +++S         +   +  D ++LENQ+P FVL+ L+   ++ +  D  +++F       FF N +   
Subjt:  --------VQFMILVHHRFHQ-----TANGLDVS-----FYEAIRPDLYQDFIMLENQLPLFVLECLF--DRLSLERDDRKLSFVKFIHNFFFDNGLVNS

Query:  RMVIPSYVSEGKAKHLVDFL--------------------------SSCHVPSYDTIYKNGIAFLRPPTLTALHEAGVRILKATNKPLMDISFKDGVLNI
              +    KAKHL+D +                           S +VPS D+     +  +       L     R+ ++    ++++  K   L I
Subjt:  RMVIPSYVSEGKAKHLVDFL--------------------------SSCHVPSYDTIYKNGIAFLRPPTLTALHEAGVRILKATNKPLMDISFKDGVLNI

Query:  PPFKIYDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCK
        P  +       +  N +AFEQF   S    +T YI F+  L++ E+D + L    ++  N  GS+ E+++ F  + K+V       Y  ++   +++Y K
Subjt:  PPFKIYDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCK

Query:  KRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIINTK
        K +    A  R  +F +PWT +S  A  F+ILLT+LQ+  + LS +N K
Subjt:  KRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIINTK

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)9.6e-3727.42Show/hide
Query:  IQRQFKKL-----LSSSIA--PKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYL-RCLNMNVEAAVEIVQFMILVHHRFHQ
        I+R  KKL     L SS A  P C+I+RV   ++  N   Y P+VVSIGP+H G+  LK +E HK + L + L R  N+ +E  ++ V+ +  V    + 
Subjt:  IQRQFKKL-----LSSSIA--PKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYL-RCLNMNVEAAVEIVQFMILVHHRFHQ

Query:  TANGLDVSFYEA---------------------------------IRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNS
            +D   +                                   + P  Y+DF+ LENQ+P FVLE LF+    + ++   + ++ +   FF+N +  +
Subjt:  TANGLDVSFYEA---------------------------------IRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNS

Query:  RMVIPSYVSEGKAKHLVDFL-------SSCHVPSYDTIYKNGIAFLRPPTLTALHEAGVRILKATN-KPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMA
           +  +  E +AKHL+D L       S  H P      K  +      +++ L  AG+++ +  + +  + + F+ G + +P   + D    +  N +A
Subjt:  RMVIPSYVSEGKAKHLVDFL-------SSCHVPSYDTIYKNGIAFLRPPTLTALHEAGVRILKATN-KPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMA

Query:  FEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTP
        +EQ  +     + T Y   LD L +T KD   L    I+ N   G+D E+AK  N+L ++V       Y  D+   +++Y K  W    AT +  YFN+P
Subjt:  FEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTP

Query:  WTLISVVAATFLILLTLLQTLFS
        W+ +S +AA  L++L+++QT+++
Subjt:  WTLISVVAATFLILLTLLQTLFS

AT3G50150.1 Plant protein of unknown function (DUF247)2.9e-4131.19Show/hide
Query:  SIQRQFKKLLS---SSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAV-------------------
        SI+ + +K LS   ++   K  IYRV   L   +  +Y PQ VSIGP+H+G+ HL+ ME HK + + + +     N+E  +                   
Subjt:  SIQRQFKKLLS---SSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAV-------------------

Query:  -----EIVQFMIL----VHHRFHQTANGLDVSFYEAIRP---------DLYQDFIMLENQLPLFVLECLFDRLSLER-DDRKLSFVKFIHNFFFDNGLVN
             E  + ++L    V   F  T  G     Y    P          + +D IMLENQLPLFVL+ L   L L+     +   V  +   FF   +  
Subjt:  -----EIVQFMIL----VHHRFHQTANGLDVSFYEAIRP---------DLYQDFIMLENQLPLFVLECLFDRLSLER-DDRKLSFVKFIHNFFFDNGLVN

Query:  SRMVIP---SYVSEGKAKHLVD-------------FLSSCHVPSYDTIYKNGIAFLRPPTL----TALHEAGVRILKATNKPLMDISFKDGVLNIPPFKI
        S ++     S  S+ K+  L D              + S    +  T Y++     +   L    T L  AGV  ++     L DI FK+G L IP   I
Subjt:  SRMVIP---SYVSEGKAKHLVD-------------FLSSCHVPSYDTIYKNGIAFLRPPTL----TALHEAGVRILKATNKPLMDISFKDGVLNIPPFKI

Query:  YDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPK
        +D  +    N++AFEQ   +S   N+T YI F+D LI++ +D S L   GI+ + + GSD E+A LFN LCKEV     D Y   ++  ++ Y  ++W  
Subjt:  YDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPK

Query:  WKATLRREYFNTPWTLISVVAATFLILLTLLQTLFS
         KATLR++YFN PW   S  AA  L+ LT  Q+ F+
Subjt:  WKATLRREYFNTPWTLISVVAATFLILLTLLQTLFS

AT3G50170.1 Plant protein of unknown function (DUF247)7.9e-3930.37Show/hide
Query:  SSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVE-----------------------AAVEIVQFMIL--
        ++I  K  IYRV + L   +  +Y PQ VS+GP+H+G++ L+ ME HK + L   L+ L   +E                       +  E  + ++L  
Subjt:  SSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVE-----------------------AAVEIVQFMIL--

Query:  --VHHRFHQTANGLDVSFYEAIRP---------DLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFD--------------NGLVNSR
          V   F  T  G     Y    P          + +D IMLENQLPLFVL+ L + L L   + +   V  +   FFD              + L+N  
Subjt:  --VHHRFHQTANGLDVSFYEAIRP---------DLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFD--------------NGLVNSR

Query:  MVIPSYVSEGKAKHLVD-----FLSSCHVPSYDTIYKNGIAFLR---------PPTLTALHEAGVRILKATNKPLMDISFKDGVLNIPPFKIYDNFEPYG
              + +    H +D      L S   P+  ++ K      R            +T L EAGV+  K       DI FK+G L IP   I+D  +   
Subjt:  MVIPSYVSEGKAKHLVD-----FLSSCHVPSYDTIYKNGIAFLR---------PPTLTALHEAGVRILKATNKPLMDISFKDGVLNIPPFKIYDNFEPYG

Query:  RNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRRE
         N++AFEQ  + S   ++T YI F+D LI++ +D S L   GI+ + + GSD E+A LFN LC+EV     D +   ++  ++ Y  ++W   KATL  +
Subjt:  RNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRRE

Query:  YFNTPWTLISVVAATFLILLTLLQTLFS
        YFN PW   S  AA  L+LLTL Q+ ++
Subjt:  YFNTPWTLISVVAATFLILLTLLQTLFS

AT4G31980.1 unknown protein3.5e-4732.62Show/hide
Query:  NLNIANSIQRQFKKLLS--SSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAV--------------
        N N  +++    K  L+  SS++ KC IY+V N+L  +N  AYTP++VS GP H G+E L+AME  K + L  ++   N ++E  V              
Subjt:  NLNIANSIQRQFKKLLS--SSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNVEAAV--------------

Query:  ---------EIVQFMI---------LVHHRFHQTANGLDVSFYEAIR-PDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVN
                 E V+ ++         L+   + +     D  F  ++   D+ +D I++ENQLP FV++ +F  L         S ++     F       
Subjt:  ---------EIVQFMI---------LVHHRFHQTANGLDVSFYEAIR-PDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVN

Query:  SRMVIPSYVSEGKAKHLVDFLSSCHVPSYD-TIYKNGIAFLRPPTLTALHEAGVRILKA-TNKPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFR
        SR+    +++E   +H VD L SC++P +   +    +     P  T LH AGVR   A T+  L+DISF DGVL IP   + D  E   +N++ FEQ R
Subjt:  SRMVIPSYVSEGKAKHLVDFLSSCHVPSYD-TIYKNGIAFLRPPTLTALHEAGVRILKA-TNKPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFR

Query:  LRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDM-ANALHDYCKKRWPKWKATLRREYFNTPWTLI
          +   N   YI  L   I +  DA LL+ +GI+ N +G S  +++ LFN++ KEV      ++Y+ M +  L  YC   W +WKA LRR+YF+ PW + 
Subjt:  LRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEEIAKLFNNLCKEVTIPGGDKYYYDM-ANALHDYCKKRWPKWKATLRREYFNTPWTLI

Query:  SVVAATFLILLTLLQTLFSALSI
        SV AA  L+LLT +Q++ S L++
Subjt:  SVVAATFLILLTLLQTLFSALSI

AT5G11290.1 Plant protein of unknown function (DUF247)3.3e-3737.77Show/hide
Query:  DLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSY---VSEGKAKHLVDFLSSCHVPSYDTIYKNGIAFLRPPTLT
        D+  D ++LENQLP FV+E +F  L ++         + IHN F         M IPS+   +S+ K  H VD L S H+P     +  G   +    L+
Subjt:  DLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSY---VSEGKAKHLVDFLSSCHVPSYDTIYKNGIAFLRPPTLT

Query:  A--LHEAGVRILKATNKP-LMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQ-FRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEE
        A  +  AGV++  A N    +DISF +GVL IP  KI D  E   RN++ FEQ  RL         Y+ FL   I +  DA L +  GI+ N  G + E+
Subjt:  A--LHEAGVRILKATNKP-LMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQ-FRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNNIGGSDEE

Query:  IAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSI
        +++LFN++ KE +  G   YY  +   L  +C   W KWKATLRR+YF+ PW+  SVVAA  L+LLT +Q + S L++
Subjt:  IAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCCGAACATGTTGGAACCCATCAACAGAACTACAGTCCTTTCGATATTGAAAAAGATGCTGACTCTGAACAGTTTCGTCTGGATAATTTGAACATCGCGAATTC
CATACAGAGACAGTTTAAGAAATTGCTTTCTTCTTCCATCGCTCCAAAATGCAACATCTATCGAGTTTCCAACCGACTAGTCGCCATTAATCATGCAGCCTATACGCCTC
AAGTCGTTTCCATCGGCCCTTTTCACTATGGTCGAGAGCATTTGAAGGCCATGGAATGGCATAAGCTTCAGTGTCTCTATATTTACCTACGCTGTTTAAATATGAATGTT
GAGGCTGCCGTTGAAATCGTTCAGTTTATGATACTAGTTCATCACAGATTCCACCAAACTGCAAACGGGTTAGATGTTTCATTCTATGAAGCTATAAGGCCTGATTTATA
TCAAGACTTTATAATGCTTGAGAATCAACTTCCTCTTTTTGTTCTTGAGTGTCTATTTGACAGACTTTCACTCGAAAGAGACGATAGAAAACTCTCCTTTGTAAAATTTA
TACACAATTTTTTCTTTGATAATGGGTTGGTAAATTCTCGTATGGTAATTCCTAGTTATGTCTCCGAAGGAAAAGCAAAACACTTGGTCGATTTCTTAAGCTCCTGCCAC
GTCCCCTCTTATGATACAATCTACAAGAATGGAATCGCTTTTCTGCGTCCCCCAACTTTAACCGCGCTCCATGAGGCTGGTGTTAGGATCTTGAAAGCAACAAACAAACC
CTTGATGGACATAAGCTTCAAAGATGGGGTTCTAAATATACCACCTTTCAAAATTTACGATAACTTCGAACCCTATGGGCGGAACATGATGGCGTTTGAGCAGTTCCGCT
TACGTTCTGGACCGGGGAATGTAACCAAGTATATTGCATTTCTAGATGGCTTGATAAGCACAGAGAAAGACGCGAGTTTACTTGTGAAGGCGGGAATCCTAACCAACAAT
ATTGGTGGCAGTGACGAAGAAATTGCAAAACTGTTTAACAATCTATGTAAAGAGGTGACCATTCCAGGTGGTGACAAGTACTACTACGATATGGCCAACGCTTTACATGA
CTACTGCAAGAAACGGTGGCCCAAGTGGAAAGCTACACTGAGACGTGAGTATTTCAATACGCCATGGACTTTAATCTCCGTCGTAGCTGCAACCTTCCTCATTCTCCTCA
CTCTCCTGCAAACCCTATTTTCTGCTTTATCGATTATCAATACCAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCCCGAACATGTTGGAACCCATCAACAGAACTACAGTCCTTTCGATATTGAAAAAGATGCTGACTCTGAACAGTTTCGTCTGGATAATTTGAACATCGCGAATTC
CATACAGAGACAGTTTAAGAAATTGCTTTCTTCTTCCATCGCTCCAAAATGCAACATCTATCGAGTTTCCAACCGACTAGTCGCCATTAATCATGCAGCCTATACGCCTC
AAGTCGTTTCCATCGGCCCTTTTCACTATGGTCGAGAGCATTTGAAGGCCATGGAATGGCATAAGCTTCAGTGTCTCTATATTTACCTACGCTGTTTAAATATGAATGTT
GAGGCTGCCGTTGAAATCGTTCAGTTTATGATACTAGTTCATCACAGATTCCACCAAACTGCAAACGGGTTAGATGTTTCATTCTATGAAGCTATAAGGCCTGATTTATA
TCAAGACTTTATAATGCTTGAGAATCAACTTCCTCTTTTTGTTCTTGAGTGTCTATTTGACAGACTTTCACTCGAAAGAGACGATAGAAAACTCTCCTTTGTAAAATTTA
TACACAATTTTTTCTTTGATAATGGGTTGGTAAATTCTCGTATGGTAATTCCTAGTTATGTCTCCGAAGGAAAAGCAAAACACTTGGTCGATTTCTTAAGCTCCTGCCAC
GTCCCCTCTTATGATACAATCTACAAGAATGGAATCGCTTTTCTGCGTCCCCCAACTTTAACCGCGCTCCATGAGGCTGGTGTTAGGATCTTGAAAGCAACAAACAAACC
CTTGATGGACATAAGCTTCAAAGATGGGGTTCTAAATATACCACCTTTCAAAATTTACGATAACTTCGAACCCTATGGGCGGAACATGATGGCGTTTGAGCAGTTCCGCT
TACGTTCTGGACCGGGGAATGTAACCAAGTATATTGCATTTCTAGATGGCTTGATAAGCACAGAGAAAGACGCGAGTTTACTTGTGAAGGCGGGAATCCTAACCAACAAT
ATTGGTGGCAGTGACGAAGAAATTGCAAAACTGTTTAACAATCTATGTAAAGAGGTGACCATTCCAGGTGGTGACAAGTACTACTACGATATGGCCAACGCTTTACATGA
CTACTGCAAGAAACGGTGGCCCAAGTGGAAAGCTACACTGAGACGTGAGTATTTCAATACGCCATGGACTTTAATCTCCGTCGTAGCTGCAACCTTCCTCATTCTCCTCA
CTCTCCTGCAAACCCTATTTTCTGCTTTATCGATTATCAATACCAAGTAA
Protein sequenceShow/hide protein sequence
MEPEHVGTHQQNYSPFDIEKDADSEQFRLDNLNIANSIQRQFKKLLSSSIAPKCNIYRVSNRLVAINHAAYTPQVVSIGPFHYGREHLKAMEWHKLQCLYIYLRCLNMNV
EAAVEIVQFMILVHHRFHQTANGLDVSFYEAIRPDLYQDFIMLENQLPLFVLECLFDRLSLERDDRKLSFVKFIHNFFFDNGLVNSRMVIPSYVSEGKAKHLVDFLSSCH
VPSYDTIYKNGIAFLRPPTLTALHEAGVRILKATNKPLMDISFKDGVLNIPPFKIYDNFEPYGRNMMAFEQFRLRSGPGNVTKYIAFLDGLISTEKDASLLVKAGILTNN
IGGSDEEIAKLFNNLCKEVTIPGGDKYYYDMANALHDYCKKRWPKWKATLRREYFNTPWTLISVVAATFLILLTLLQTLFSALSIINTK