; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022317 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022317
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionacid phosphatase 1-like
Genome locationtig00154107:531813..542031
RNA-Seq ExpressionSgr022317
SyntenySgr022317
Gene Ontology termsNA
InterPro domainsIPR005519 - Acid phosphatase, class B-like
IPR023214 - HAD superfamily
IPR036412 - HAD-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD5329430.1 unnamed protein product [Arabidopsis thaliana]6.3e-11143.68Show/hide
Query:  FILALLAFASAAAAASRPI-IREFPKQSLLRAVGGADSKCESWKFSFEANNLRS-RTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFAATVNLK
        FI+AL       A +SR     + P+ S+        S CESW+ + E NN+   + +P +C  ++ +Y+  NG      YD+  V   A ++A TV + 
Subjt:  FILALLAFASAAAAASRPI-IREFPKQSLLRAVGGADSKCESWKFSFEANNLRS-RTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFAATVNLK

Query:  GDGKDAWIFDIDDTLLSNVLYFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWEKLIMRGP
        GDGKDAW+FDID+TLLSN+ Y+  +GYG      IK +     G       +L LYK L+  GF  I+LT RDE  RS TE  L    Y  W +L++RG 
Subjt:  GDGKDAWIFDIDDTLLSNVLYFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWEKLIMRGP

Query:  EDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRSECIRGNTSS
         DQGK  + +KSE+ +++V+EGY +HGN GDQW DLLG    +R+FK+PNPI            W      F VF F       PS   R+  I+   S 
Subjt:  EDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRSECIRGNTSS

Query:  GRR------------------SVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFN
        G R                   +P  CV+ V EY N D++LSD   + D++L FA++V  ++GDGKD W+FD+DETLL+N+ YY+ +G+GSEPY+D  F+
Subjt:  GRR------------------SVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFN

Query:  EWVNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSEL
        EWV +G APA  ASLRLYN LKKLGF I LLTGR E QR  T+ NL  AGYS W+ L+L                    RG +D+GK A  YKSE+RS+L
Subjt:  EWVNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSEL

Query:  VKQGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP
        +++G+KI+G+SGDQWSDL G+A+A RSFK+PNPMYYIP
Subjt:  VKQGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP

CAE6170045.1 unnamed protein product [Arabidopsis arenosa]3.8e-10842.91Show/hide
Query:  HEGAGERASAINAALVEELLHSFLAMAFLKSFLFILALLAFASAAAAASRPIIREFPKQSLLRAVGGADSKCESWKFSFEANNLRS-RTVPKECIKFVVS
        H  AG++ S +    V     SF     +    FILAL A     A +SR     F K     ++    S CESW+ + E NN  + + VP +C  +V +
Subjt:  HEGAGERASAINAALVEELLHSFLAMAFLKSFLFILALLAFASAAAAASRPIIREFPKQSLLRAVGGADSKCESWKFSFEANNLRS-RTVPKECIKFVVS

Query:  YMVDNGSNSRYFYDLSYVVDSANEFAATVNLKGDGKDAWIFDIDDTLLSNVLYFLESGYGQ--VLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKA
        Y+  NG      YD+  V   A  +A TV L  DGKDAW+FDID+TLLSN+ Y+   GYG     N +  + V       +    +L LYK L+  GF  
Subjt:  YMVDNGSNSRYFYDLSYVVDSANEFAATVNLKGDGKDAWIFDIDDTLLSNVLYFLESGYGQ--VLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKA

Query:  IILTERDESTRSSTENILLRNSYTDWEKLIMRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWL
        I+LT RDES RS TE  L    Y  W +L++RG  DQGK  + +KSE+ +++V+EGY++HGN GDQW DL G     R+FK+PNPI         F    
Subjt:  IILTERDESTRSSTENILLRNSYTDWEKLIMRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWL

Query:  LLRSRFSVFSFFSPQPPPPSPKRRSE---CIRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNL
        L+    S+ + F   P     +           N +     +P  CV+ V EY N D++ SD + +AD++L FA++V  ++GDGKD W+FD+DETLL+N+
Subjt:  LLRSRFSVFSFFSPQPPPPSPKRRSE---CIRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNL

Query:  PYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRG
         YY+ +G+GSEPY+D SF+EWV +G APA  ASLRLYN LKKLGF I LLTGR E QR  T+ NL  AGYS W+ L+L                    RG
Subjt:  PYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRG

Query:  SSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP
         +D+GK A  YKSE+RS+L+++G+KI+G+SGDQWSDL+G+A+A RSFK+PNP+YYIP
Subjt:  SSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP

KAF3603479.1 hypothetical protein F2Q69_00033971 [Brassica cretica]2.6e-10141.28Show/hide
Query:  LAMAFLKSFLFILALLAFASAAAAASRPIIREFPKQSLLRAVGGADSKCESWKFSFEANNLRS-RTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSAN
        + ++ L   LF LA     S++    RP+I E  +          +  C SW+F+ E NNL   +T+P EC  +V  Y++  G    Y  DL  V + AN
Subjt:  LAMAFLKSFLFILALLAFASAAAAASRPIIREFPKQSLLRAVGGADSKCESWKFSFEANNLRS-RTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSAN

Query:  EFAATVNLKG-DGKDAWIFDIDDTLLSNVLYFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYT
         +A++    G DGKD WIFDID+TLLSN+ Y+LE G G  +    K D     G +  I  +L LY+ ++  G+K I+LT R E+ R  T   L+   + 
Subjt:  EFAATVNLKG-DGKDAWIFDIDDTLLSNVLYFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYT

Query:  DWEKLIMRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPI---TRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSP
        +W++LI+R  +D  K  ++FKSEK  E+V+EGYR+ GN GDQW DLLGS  + R+FK+PNPI   + + P S     + +L+   +VF          S 
Subjt:  DWEKLIMRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPI---TRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSP

Query:  KRRSECIRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNK
          +    +       +++P  C  +V++Y     Y+ D E V++ +  +A +      DGKD W+FD+DETLLSNLPYY E+G G E ++ T F++WV K
Subjt:  KRRSECIRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNK

Query:  GLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGY
        G APAI  SL+LY K+K LG+K+ LLTGR E+ R  T +NL  AG++NWD LILR+       LD             D+ K A  +KSEKR E+VK+GY
Subjt:  GLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGY

Query:  KIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP
        +I+G+SGDQWSDL+G A+++RSFKLPNPMYYIP
Subjt:  KIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP

KAF3788723.1 Acid phosphatase 1 [Nymphaea thermarum]5.2e-10543.4Show/hide
Query:  FILALLAFASAAAAASRPIIREFPKQSLLRAVGGADS----KCESWKFSFEANNL-RSRTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFAATV
        F++ LLA  +    A    I+    +S  R  G   S     C SW F  E N      TVP+ C+ +V  YM    +   Y  D + V   A EFA +V
Subjt:  FILALLAFASAAAAASRPIIREFPKQSLLRAVGGADS----KCESWKFSFEANNL-RSRTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFAATV

Query:  NLKGDGKDAWIFDIDDTLLSNVLYFLESGYG-QVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWEKLI
        N+ GDGKDAW+FDIDDTL+S + Y+ E GYG +V N    D+ +    ++  +     LY  L   GF+  ++T R ES R+ T + L +  Y+ W+KLI
Subjt:  NLKGDGKDAWIFDIDDTLLSNVLYFLESGYG-QVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWEKLI

Query:  MRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRSEC---
        +R    +GK   ++KS+K  E+ +EGYR+HG+ GDQW DLLG+ + TR+FK+PNPI  ++            R R  V     P            C   
Subjt:  MRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRSEC---

Query:  ---IRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLA
           +  N +    +VP  CV +V++Y    RY SDS  V+  +  FA  V  +AGDGKDAWVFD+DETLLSNLPYY  NGFGSE +N+ +F+EW  +G+A
Subjt:  ---IRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLA

Query:  PAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQ
        PA+P SL+LY  LK LGF++FLLTGRSE QR  T  NL  AGY +W+ LIL                    R   D+GK AI YKSE+R E+ K+GY+I 
Subjt:  PAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQ

Query:  GSSGDQWSDLVGYALAKRSFKLPNPMYYIP
        GSSGDQWSDL+G+A+A RSFKLPNPMYYIP
Subjt:  GSSGDQWSDLVGYALAKRSFKLPNPMYYIP

KAG5409474.1 hypothetical protein IGI04_005793 [Brassica rapa subsp. trilocularis]1.5e-9637.68Show/hide
Query:  PAMASLKTSLSALVTAAASQPIIP----ISPGKSHVQPD-----EGSTFS-KHIWKLGWAHETATPKKRFFHEGAGE--RASAINAALVEELLHSFLAMA
        P + SLK          AS P  P    I P    V  D     +G T+  K I K    H ++ P       GA E  +A+     +  + +     M 
Subjt:  PAMASLKTSLSALVTAAASQPIIP----ISPGKSHVQPD-----EGSTFS-KHIWKLGWAHETATPKKRFFHEGAGE--RASAINAALVEELLHSFLAMA

Query:  FLKSFLFILALLAFASAAAAASRPIIREFPKQSLLRAVGG-ADSKCESWKFSFEANNLRS-RTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFA
               +++L + A +   +    I ++P +   R  G   +  C SW+F+ E NNL    T+P EC  +V  Y++  G    Y  DL  V + A+ FA
Subjt:  FLKSFLFILALLAFASAAAAASRPIIREFPKQSLLRAVGG-ADSKCESWKFSFEANNLRS-RTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFA

Query:  ATVNLK-GDGKDAWIFDIDDTLLSNVLYFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWE
        +TV+L  GDGKDAWIFDID+TLLSN+ Y+++ G+G  L    + D     G++  I  +L LY+ +   G++  +LT R ES R  T   L+   +  W+
Subjt:  ATVNLK-GDGKDAWIFDIDDTLLSNVLYFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWE

Query:  KLIMRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRSEC
        KLI+R P++Q K  +++KSEK  E+V+EGYR+ GN GDQW DLLGS  + R+FK+ NPI  + P+S   RP  LL                         
Subjt:  KLIMRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRSEC

Query:  IRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVV--AGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAP
                     +PC   V++Y     Y++D E V++ +  FA TV     AGDGKDAW+FD+DETLLSNLPY+ E+GFG E ++ + F++WV +G+AP
Subjt:  IRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVV--AGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAP

Query:  AIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQG
        AI  SL+LY ++  LG+++FLLTGR ES R  T +NL  AG+ NWD LILR+                      ++ K A  YKSEKR E+VK+G++I+G
Subjt:  AIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQG

Query:  SSGDQWSDLVGYALAKRSFKLPNPMYYIP
        + G+QWSDL+G ++++RSFKL NPMYYIP
Subjt:  SSGDQWSDLVGYALAKRSFKLPNPMYYIP

TrEMBL top hitse value%identityAlignment
A0A6J1DPG8 acid phosphatase 1-like9.9e-8669.49Show/hide
Query:  PSPKRRSECIRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEW
        PS +     +  N +    SVPRPCV+FVR+YFN DRYLSDSEAVAD+SL+FA ++NV    GKDAWVFDVDETLLSNLPYYR +GFGSEPYNDTSFNEW
Subjt:  PSPKRRSECIRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEW

Query:  VNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVK
        VN+G+APA+PASLRL+NK+K LG KIFLLTGR E+QRA TQQNL  AGYS W+ LIL                    RG+ DEGKKAIAYKSEKRSEL +
Subjt:  VNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVK

Query:  QGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP
        +GY IQGSSGDQWSDLVGYAL+KRSFKLPNPMYYIP
Subjt:  QGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP

A0A6J1KB36 acid phosphatase 1-like3.3e-8168.61Show/hide
Query:  NTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPAS
        N +    S+PRPCV FV++YFN+ RYLSDS +VA +S NFA +VNV  GDG DAWVFDVDETLLSNLPYY++NGFGSEPYN+TSFNEWV KGLAP +PAS
Subjt:  NTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPAS

Query:  LRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQ
        L LY ++KKLGFKIF+LTGR+E QRA T+QNL  AGYS W+ LIL                    RG  DEGKKAI YKSEKR ELVKQGY+IQGSSGDQ
Subjt:  LRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQ

Query:  WSDLVGYALAKRSFKLPNPMYYI
        WSDL+G+ALAKRSFKLPNPMYY+
Subjt:  WSDLVGYALAKRSFKLPNPMYYI

A0A6J1KFL8 acid phosphatase 1-like2.1e-8069.2Show/hide
Query:  NTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPAS
        N +   ++VPRPC+EFVREYFN  RYLSDS AVA++SL FA +V V    G+DAWVFDVDETLLSNLPYYR NGFGS+P+NDTSFNEWVN G AP +P S
Subjt:  NTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPAS

Query:  LRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQ
        LRLY KLK LGFKIFLLTGR ESQR  TQQNL QAGYS W+ LIL                    RG  DEGKKA  YKSEKR+ELVKQGY IQG+SGDQ
Subjt:  LRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQ

Query:  WSDLVGYALAKRSFKLPNPMYYIP
        WSDL+G+ALAKRSFKLPN MYYIP
Subjt:  WSDLVGYALAKRSFKLPNPMYYIP

A0A7G2F1H0 (thale cress) hypothetical protein3.0e-11143.68Show/hide
Query:  FILALLAFASAAAAASRPI-IREFPKQSLLRAVGGADSKCESWKFSFEANNLRS-RTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFAATVNLK
        FI+AL       A +SR     + P+ S+        S CESW+ + E NN+   + +P +C  ++ +Y+  NG      YD+  V   A ++A TV + 
Subjt:  FILALLAFASAAAAASRPI-IREFPKQSLLRAVGGADSKCESWKFSFEANNLRS-RTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFAATVNLK

Query:  GDGKDAWIFDIDDTLLSNVLYFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWEKLIMRGP
        GDGKDAW+FDID+TLLSN+ Y+  +GYG      IK +     G       +L LYK L+  GF  I+LT RDE  RS TE  L    Y  W +L++RG 
Subjt:  GDGKDAWIFDIDDTLLSNVLYFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWEKLIMRGP

Query:  EDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRSECIRGNTSS
         DQGK  + +KSE+ +++V+EGY +HGN GDQW DLLG    +R+FK+PNPI            W      F VF F       PS   R+  I+   S 
Subjt:  EDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRSECIRGNTSS

Query:  GRR------------------SVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFN
        G R                   +P  CV+ V EY N D++LSD   + D++L FA++V  ++GDGKD W+FD+DETLL+N+ YY+ +G+GSEPY+D  F+
Subjt:  GRR------------------SVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFN

Query:  EWVNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSEL
        EWV +G APA  ASLRLYN LKKLGF I LLTGR E QR  T+ NL  AGYS W+ L+L                    RG +D+GK A  YKSE+RS+L
Subjt:  EWVNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSEL

Query:  VKQGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP
        +++G+KI+G+SGDQWSDL G+A+A RSFK+PNPMYYIP
Subjt:  VKQGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP

A0A7J6EUM8 Uncharacterized protein3.3e-8136.67Show/hide
Query:  LFILALLAFASAAAAASRPIIREFPKQSLLRAVGGADS------KCESWKFSFEANNLRS-RTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFA
        LF+  L+  ASA  ++  P+I    +  L+R   G+         C SW+F  E NN+ + +TVP  C  ++  YM+     S+Y  D   +   A   A
Subjt:  LFILALLAFASAAAAASRPIIREFPKQSLLRAVGGADS------KCESWKFSFEANNLRS-RTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFA

Query:  ATVNLKGDGKDAWIFDIDDTLLSNVLYFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWEK
        +++N+  +G DAW+FDID+T LSN+ Y+ E+G+G V     K +     G +  +  +L LY  L   G K + +T R E  R  TE  L    Y  W++
Subjt:  ATVNLKGDGKDAWIFDIDDTLLSNVLYFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWEK

Query:  LIMRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRS---
        L+++     GK    +KS +  ++ E+GYR+ GN GDQW DLLG+    RTFK+P+P+            +++ + + S F   S Q     P+  S   
Subjt:  LIMRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVGDQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRS---

Query:  -----EC------IRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKD-AWVFDVDETLLSNLPYYRENGFGSEPYNDT
              C      +  N+    +++P  C ++V  Y   D+Y  DS+A+ D +  + +T+N+      D  WVFD+DET LSNLPYY ++GFG E +N T
Subjt:  -----EC------IRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKD-AWVFDVDETLLSNLPYYRENGFGSEPYNDT

Query:  SFNEWVNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKR
        SFNEWV  G A A+P SL+LY KL +L  KI  LTGR+E QR  T+ NL   GY++WD L+L                    +G +  GK A  YKS +R
Subjt:  SFNEWVNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKR

Query:  SELVKQGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYI
          L K+GYKI G+ GDQWSDL+G  +  R+FKLP+PMYYI
Subjt:  SELVKQGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYI

SwissProt top hitse value%identityAlignment
O49195 Vegetative storage protein 11.8e-2833.79Show/hide
Query:  SVPRPCVEFVREY-FNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKG-LAPAIPASLRLYN
        +VP  C  +V +Y   S +Y  DS+ V   +  +A+ +  +  D  + W+FD+D+TLLS++PYY + G+G+E     ++  W+  G   P +P +L LY 
Subjt:  SVPRPCVEFVREY-FNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKG-LAPAIPASLRLYN

Query:  KLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLV
         L +LG +  +++ R +     T +NL   G + W  LIL+ + ++L                       + YKS+ R+ LVK+GY I G+ GDQW+DLV
Subjt:  KLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLV

Query:  GYALAKRSFKLPNPMYYIP
              R FKLPNP+YY+P
Subjt:  GYALAKRSFKLPNPMYYIP

P10742 Stem 31 kDa glycoprotein (Fragment)2.2e-4241.67Show/hide
Query:  GRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLY
        G  ++P  CVE  +EY + ++Y SDS+ V   +  +A  + V     KD +VF +D T+LSN+PYY+++G+G E +N T ++EWVNKG APA+P +L+ Y
Subjt:  GRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLY

Query:  NKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDL
        NKL  LGFKI  L+GR+  ++A T+ NL +AGY  W+ LIL+                            A++YK+  R +L++QGY I G  GDQWSDL
Subjt:  NKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDL

Query:  V-GYALAKRSFKLPNP
        + G+    R+FKLPNP
Subjt:  V-GYALAKRSFKLPNP

P10743 Stem 31 kDa glycoprotein1.7e-4242.01Show/hide
Query:  RSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNK
        +++P  CVE  ++Y N +++ SDS+ V   +  +A    V      D ++F +D T+LSN+PYY ++G+G E +N+T ++EWVNKG APA+P +L+ YNK
Subjt:  RSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNK

Query:  LKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILR-THLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLV
        L  LGFKI  L+GR   + A T+ NL +AG+  W+ LIL+  HL                         A++YKS  R  L++QGY+I G  GDQWSDL+
Subjt:  LKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILR-THLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLV

Query:  G-YALAKRSFKLPNPMYYI
        G +    R+FKLPNPMYYI
Subjt:  G-YALAKRSFKLPNPMYYI

P15490 Stem 28 kDa glycoprotein1.4e-4442.27Show/hide
Query:  GRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLY
        G  ++P  CVE  +EY + ++Y SDS+ V   +  +A  + V     KD +VF +D T+LSN+PYY+++G+G E +N T ++EWVNKG APA+P +L+ Y
Subjt:  GRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLY

Query:  NKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDL
        NKL  LGFKI  L+GR+  ++A T+ NL +AGY  W+ LIL+                            A++YK+  R +L++QGY I G  GDQWSDL
Subjt:  NKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDL

Query:  V-GYALAKRSFKLPNPMYYI
        + G+    R+FKLPNP+YYI
Subjt:  V-GYALAKRSFKLPNPMYYI

P27061 Acid phosphatase 11.5e-5446.02Show/hide
Query:  IRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAI
        +  N  S  +++P  C ++V+EY     Y  + + V+D +  +A++V+ +  DG+D W+FDVDETLLSNLPYY ++ +G E ++D  F++WV  G APA+
Subjt:  IRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAI

Query:  PASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSS
         +SL+LY ++ KLGFK+FLLTGRSE  R+ T +NL  AG+ +W  LIL                    RGS D GK A  YKSE+R+ +V++G++I G+S
Subjt:  PASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSS

Query:  GDQWSDLVGYALAKRSFKLPNPMYYI
        GDQWSDL+G +++ RSFKLPNPMYYI
Subjt:  GDQWSDLVGYALAKRSFKLPNPMYYI

Arabidopsis top hitse value%identityAlignment
AT4G25150.1 HAD superfamily, subfamily IIIB acid phosphatase2.5e-5750Show/hide
Query:  RSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNK
        +++P  C ++V++Y   + Y+ D E V++ +  +A +     GDGKD W+FD+DETLLSNLPYY E+G G E ++ + F+ WV KG+APAI  SL+LY K
Subjt:  RSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNK

Query:  LKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVG
        +  LG+K+ LLTGR E+ R  T +NL  AG+ NWD LILR+       LD             D  K A  YKSEKR E+VK+GY+I+G+SGDQWSDL+G
Subjt:  LKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVG

Query:  YALAKRSFKLPNPMYYIP
         A+++RSFKLPNPMYYIP
Subjt:  YALAKRSFKLPNPMYYIP

AT4G29260.1 HAD superfamily, subfamily IIIB acid phosphatase4.6e-6755.09Show/hide
Query:  VPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNKLK
        +P  CV+ V EY N D++LSD   + D++L FA++V  ++GDGKD W+FD+DETLL+N+ YY+ +G+GSEPY+D  F+EWV +G APA  ASLRLYN LK
Subjt:  VPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNKLK

Query:  KLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVGYA
        KLGF I LLTGR E QR  T+ NL  AGYS W+ L+L                    RG +D+GK A  YKSE+RS+L+++G+KI+G+SGDQWSDL G+A
Subjt:  KLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVGYA

Query:  LAKRSFKLPNPMYYIP
        +A RSFK+PNPMYYIP
Subjt:  LAKRSFKLPNPMYYIP

AT4G29270.1 HAD superfamily, subfamily IIIB acid phosphatase1.1e-6051.16Show/hide
Query:  VPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNKLK
        +P  C  +++ Y N  ++  D + VA +++++A+TV  V GDGKDAWVFD+DETLLSN+ YY+ NG+GSEPY+   +NE V KG  P   ASLRLY  LK
Subjt:  VPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNKLK

Query:  KLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVGYA
        KLGF I LLTGR E  R+ T++NL  AGY  W+ L+L                    RG +D+GK A  YKSE+RS++VK+GY I G++GDQWSDL+G+A
Subjt:  KLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVGYA

Query:  LAKRSFKLPNPMYYI
        +A RSFK+PNPMYY+
Subjt:  LAKRSFKLPNPMYYI

AT5G44020.1 HAD superfamily, subfamily IIIB acid phosphatase1.3e-4239.91Show/hide
Query:  VPRPCVEFVREYFNSDRYLSDSEAVADFSLNF--AETVNVVAGDGKDAWVFDVDETLLSNLPYYRENG-FGSEPYNDTSFNEWVNKGLAPAIPASLRLYN
        VP+ CV FV++Y  S +Y  D E   D ++ +           DG DAW+FD+D+TLLS +PY++ NG FG E  N T F EW N G APA+P  ++LY+
Subjt:  VPRPCVEFVREYFNSDRYLSDSEAVADFSLNF--AETVNVVAGDGKDAWVFDVDETLLSNLPYYRENG-FGSEPYNDTSFNEWVNKGLAPAIPASLRLYN

Query:  KLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLV
        ++++ GFKIFL++ R E  R+ T +NL +AGY +W +L+L                    RG  DE K    YK++ R+ L   GY++ G  G QW+   
Subjt:  KLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLV

Query:  GYALAKRSFKLPNPMYYI
        G  + KR+FKLPN +YY+
Subjt:  GYALAKRSFKLPNPMYYI

AT5G51260.1 HAD superfamily, subfamily IIIB acid phosphatase3.0e-5849.08Show/hide
Query:  RSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNK
        +++P  C ++V++Y     YL+D E V++ +L FA ++   +GDGKD W+FD+DETLLSNLPYY ++GFG E ++ + F++WV +G+APAI  SL+LY +
Subjt:  RSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETVNVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNK

Query:  LKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVG
        +  LG+K+FLLTGR ES R  T +NL  AG+ NWD LILR+                      ++ K A  YKSEKR E+VK+GY+I+G+SGDQWSDL+G
Subjt:  LKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAILDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVG

Query:  YALAKRSFKLPNPMYYIP
         ++++RSFKL NPMYYIP
Subjt:  YALAKRSFKLPNPMYYIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCCCTATGGAGGAGTTTTCCGGCCATGGCTTCCCTCAAAACGTCTCTCTCCGCCCTGGTTACCGCCGCTGCCTCCCAACCGATCATCCCGATATCTCCCGGAAA
AAGCCATGTACAACCTGATGAAGGAAGCACGTTTTCAAAGCACATTTGGAAATTGGGATGGGCCCATGAGACTGCGACCCCTAAGAAACGATTTTTTCACGAGGGAGCCG
GGGAGCGAGCGTCGGCAATAAATGCAGCCCTTGTGGAAGAGTTGCTCCACAGTTTTCTGGCCATGGCTTTCCTCAAATCCTTTCTTTTCATTCTTGCCCTTCTTGCCTTC
GCTTCCGCCGCCGCCGCAGCCTCCCGACCGATCATTCGGGAATTTCCCAAACAGAGCCTTCTGCGAGCTGTTGGCGGAGCGGATTCGAAATGCGAGAGCTGGAAATTCTC
CTTCGAAGCCAATAACCTTAGAAGCCGGACCGTCCCAAAGGAGTGCATTAAATTTGTTGTGAGCTATATGGTGGACAATGGCAGTAATAGTCGTTATTTCTACGACTTGT
CGTACGTCGTTGATAGCGCCAATGAATTCGCAGCGACGGTGAATCTCAAAGGCGACGGAAAGGACGCTTGGATCTTCGATATTGACGACACACTGCTCTCCAATGTGCTT
TATTTCTTGGAGAGTGGATACGGGCAAGTTCTAAATTTGATCATCAAAGACGATGTTAAATCAAATCTCGGGCTGTCATATCCTATATTTTACAACTTATTCCTTTACAA
AGGGCTTCAAATCGCGGGCTTCAAGGCTATTATATTAACTGAGAGAGATGAATCTACGAGGAGTTCCACCGAAAATATCCTTCTGAGAAATAGCTACACTGATTGGGAGA
AACTTATCATGAGGGGACCTGAGGATCAAGGCAAAGAAGTAAGTGTGTTCAAATCAGAAAAGATAGCAGAGTTGGTAGAAGAAGGATATAGATTGCACGGGAATGTCGGA
GATCAGTGGAGGGATTTGTTAGGCTCCCCATCGACAACACGTACCTTCAAAATTCCTAATCCTATTACCCGCAGCGAACCAGACTCCAGAACTTTCCGGCCATGGCTGCT
CCTCCGATCTCGTTTCTCTGTCTTCTCATTCTTCTCGCCACAGCCTCCTCCGCCGTCTCCCAAACGGCGATCAGAATGTATCCGAGGGAACACGTCGTCCGGGCGGAGGT
CTGTGCCCCGGCCGTGCGTGGAGTTCGTCCGGGAGTACTTCAACAGCGACCGGTATCTTTCGGACTCTGAGGCGGTGGCAGATTTCTCGTTGAACTTCGCTGAGACGGTG
AACGTGGTGGCCGGCGACGGGAAGGACGCTTGGGTTTTCGACGTGGACGAGACGTTGCTTTCGAATTTGCCATATTATAGAGAAAATGGGTTCGGGTCGGAGCCATACAA
CGATACTTCTTTCAACGAGTGGGTGAACAAGGGTTTGGCTCCAGCGATACCAGCCAGCTTAAGGCTATACAATAAGCTCAAAAAGCTTGGATTCAAGATTTTTCTTTTAA
CTGGCCGAAGTGAATCTCAAAGAGCCCCCACCCAACAGAACCTCCATCAAGCTGGCTATTCCAACTGGGACAGCCTCATCTTGAGAACTCATTTGACTGAACTTGCAATT
CTTGATTCTTGGGAAAAAAAAAATTATGAAATTAGGGGATCTTCTGATGAAGGAAAGAAAGCTATTGCATACAAATCAGAGAAGAGATCTGAATTGGTGAAACAAGGTTA
CAAAATTCAAGGAAGCTCTGGAGATCAATGGAGTGATTTGGTGGGCTATGCTCTCGCAAAAAGATCCTTCAAGCTCCCAAATCCAATGTATTACATTCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCCCCTATGGAGGAGTTTTCCGGCCATGGCTTCCCTCAAAACGTCTCTCTCCGCCCTGGTTACCGCCGCTGCCTCCCAACCGATCATCCCGATATCTCCCGGAAA
AAGCCATGTACAACCTGATGAAGGAAGCACGTTTTCAAAGCACATTTGGAAATTGGGATGGGCCCATGAGACTGCGACCCCTAAGAAACGATTTTTTCACGAGGGAGCCG
GGGAGCGAGCGTCGGCAATAAATGCAGCCCTTGTGGAAGAGTTGCTCCACAGTTTTCTGGCCATGGCTTTCCTCAAATCCTTTCTTTTCATTCTTGCCCTTCTTGCCTTC
GCTTCCGCCGCCGCCGCAGCCTCCCGACCGATCATTCGGGAATTTCCCAAACAGAGCCTTCTGCGAGCTGTTGGCGGAGCGGATTCGAAATGCGAGAGCTGGAAATTCTC
CTTCGAAGCCAATAACCTTAGAAGCCGGACCGTCCCAAAGGAGTGCATTAAATTTGTTGTGAGCTATATGGTGGACAATGGCAGTAATAGTCGTTATTTCTACGACTTGT
CGTACGTCGTTGATAGCGCCAATGAATTCGCAGCGACGGTGAATCTCAAAGGCGACGGAAAGGACGCTTGGATCTTCGATATTGACGACACACTGCTCTCCAATGTGCTT
TATTTCTTGGAGAGTGGATACGGGCAAGTTCTAAATTTGATCATCAAAGACGATGTTAAATCAAATCTCGGGCTGTCATATCCTATATTTTACAACTTATTCCTTTACAA
AGGGCTTCAAATCGCGGGCTTCAAGGCTATTATATTAACTGAGAGAGATGAATCTACGAGGAGTTCCACCGAAAATATCCTTCTGAGAAATAGCTACACTGATTGGGAGA
AACTTATCATGAGGGGACCTGAGGATCAAGGCAAAGAAGTAAGTGTGTTCAAATCAGAAAAGATAGCAGAGTTGGTAGAAGAAGGATATAGATTGCACGGGAATGTCGGA
GATCAGTGGAGGGATTTGTTAGGCTCCCCATCGACAACACGTACCTTCAAAATTCCTAATCCTATTACCCGCAGCGAACCAGACTCCAGAACTTTCCGGCCATGGCTGCT
CCTCCGATCTCGTTTCTCTGTCTTCTCATTCTTCTCGCCACAGCCTCCTCCGCCGTCTCCCAAACGGCGATCAGAATGTATCCGAGGGAACACGTCGTCCGGGCGGAGGT
CTGTGCCCCGGCCGTGCGTGGAGTTCGTCCGGGAGTACTTCAACAGCGACCGGTATCTTTCGGACTCTGAGGCGGTGGCAGATTTCTCGTTGAACTTCGCTGAGACGGTG
AACGTGGTGGCCGGCGACGGGAAGGACGCTTGGGTTTTCGACGTGGACGAGACGTTGCTTTCGAATTTGCCATATTATAGAGAAAATGGGTTCGGGTCGGAGCCATACAA
CGATACTTCTTTCAACGAGTGGGTGAACAAGGGTTTGGCTCCAGCGATACCAGCCAGCTTAAGGCTATACAATAAGCTCAAAAAGCTTGGATTCAAGATTTTTCTTTTAA
CTGGCCGAAGTGAATCTCAAAGAGCCCCCACCCAACAGAACCTCCATCAAGCTGGCTATTCCAACTGGGACAGCCTCATCTTGAGAACTCATTTGACTGAACTTGCAATT
CTTGATTCTTGGGAAAAAAAAAATTATGAAATTAGGGGATCTTCTGATGAAGGAAAGAAAGCTATTGCATACAAATCAGAGAAGAGATCTGAATTGGTGAAACAAGGTTA
CAAAATTCAAGGAAGCTCTGGAGATCAATGGAGTGATTTGGTGGGCTATGCTCTCGCAAAAAGATCCTTCAAGCTCCCAAATCCAATGTATTACATTCCCTAA
Protein sequenceShow/hide protein sequence
MEPLWRSFPAMASLKTSLSALVTAAASQPIIPISPGKSHVQPDEGSTFSKHIWKLGWAHETATPKKRFFHEGAGERASAINAALVEELLHSFLAMAFLKSFLFILALLAF
ASAAAAASRPIIREFPKQSLLRAVGGADSKCESWKFSFEANNLRSRTVPKECIKFVVSYMVDNGSNSRYFYDLSYVVDSANEFAATVNLKGDGKDAWIFDIDDTLLSNVL
YFLESGYGQVLNLIIKDDVKSNLGLSYPIFYNLFLYKGLQIAGFKAIILTERDESTRSSTENILLRNSYTDWEKLIMRGPEDQGKEVSVFKSEKIAELVEEGYRLHGNVG
DQWRDLLGSPSTTRTFKIPNPITRSEPDSRTFRPWLLLRSRFSVFSFFSPQPPPPSPKRRSECIRGNTSSGRRSVPRPCVEFVREYFNSDRYLSDSEAVADFSLNFAETV
NVVAGDGKDAWVFDVDETLLSNLPYYRENGFGSEPYNDTSFNEWVNKGLAPAIPASLRLYNKLKKLGFKIFLLTGRSESQRAPTQQNLHQAGYSNWDSLILRTHLTELAI
LDSWEKKNYEIRGSSDEGKKAIAYKSEKRSELVKQGYKIQGSSGDQWSDLVGYALAKRSFKLPNPMYYIP