CuGenDBv2

Sgr022242 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022242
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: transcription factor CSA-like
Genome location: tig00154002:702730..703947
RNA-Seq Expression: Sgr022242
Synteny: Sgr022242
Gene Ontology terms: NA
InterPro domains:
  IPR001005 - SANT/Myb domain
  IPR009057 - Homeobox-like domain superfamily
  IPR017930 - Myb domain


Homology
GenBank top hits (e value, % identity, alignment)
RVW67202.1 Transcription factor MYB105 [Vitis vinifera]; e value 1.5e-85; identity 48.84%
Query:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------
        MV ++ G  + S+NPN       SSSQES  +    S+VENGR  WGFPF  N T   L         + K S CSD G GEN                 
Subjt:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------

Query:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFP
            GKETDSG SKLCARGHWRPAEDT LKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF+EEEE+RLM AH++YGNKWAMIARLFP
Subjt:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFP

Query:  GRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF-------------------
        GRTDNAVKNHWHVIMARK REQ SAYRRRKLSQ             CRDTA      PP+ +   S  +      +PFA F                   
Subjt:  GRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF-------------------

Query:  ------------YSVSRANHTPFDLIPPGPEISNEMEVVSSRRHRS---TSDDENHLSAAYYP--PLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEA
                    ++   A  TPF+    GP+  + + + S  +HRS     D+ N+ ++ +YP  P +MM MQQ+N +N  +  PNS  S       + A
Subjt:  ------------YSVSRANHTPFDLIPPGPEISNEMEVVSSRRHRS---TSDDENHLSAAYYP--PLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEA

Query:  SSSVSEDDGVGRRLESVPPPFFDFLGVGAT
        +   S ++ V    E++PPPF DFLGVGAT
Subjt:  SSSVSEDDGVGRRLESVPPPFFDFLGVGAT

XP_010651935.1 PREDICTED: transcription factor RAX1 [Vitis vinifera]; e value 1.1e-85; identity 49.3%
Query:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------
        MV ++ G  + S+NPN       SSSQES  +    S+VENGR  WGFPF  N T   L         + K S CSD G GEN                 
Subjt:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------

Query:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFP
            GKETDSG SKLCARGHWRPAEDT LKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF+EEEE+RLM AH++YGNKWAMIARLFP
Subjt:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFP

Query:  GRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF-------------------
        GRTDNAVKNHWHVIMARK REQ SAYRRRKLSQ             CRDTA      PP+ +   S  +      +PFA F                   
Subjt:  GRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF-------------------

Query:  ----YSVSR--------ANHTPFDLIPPGPEISNEMEVVSSRRHRS---TSDDENHLSAAYYP--PLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEA
             S S+        A  TPF+    GP+  + + + S  +HRS     D+ N+ ++ +YP  P +MM MQQ+N +N  +  PNS  S       + A
Subjt:  ----YSVSR--------ANHTPFDLIPPGPEISNEMEVVSSRRHRS---TSDDENHLSAAYYP--PLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEA

Query:  SSSVSEDDGVGRRLESVPPPFFDFLGVGAT
        +   S ++ V    E++PPPF DFLGVGAT
Subjt:  SSSVSEDDGVGRRLESVPPPFFDFLGVGAT

XP_015895032.1 transcription factor CSA-like [Ziziphus jujuba]; e value 1.6e-87; identity 50.82%
Query:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDISKV----ENGRGFWGFPFPSNPTTTPLSSEDKIS-GCSDYGVGE------NKPKG-----------GK
        MV ++ G  S SL+PN  G   S SS ESY+ISK+    +NGR FWG PFPS+        E K S  CSD  +GE      NKP+            GK
Subjt:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDISKV----ENGRGFWGFPFPSNPTTTPLSSEDKIS-GCSDYGVGE------NKPKG-----------GK

Query:  ETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNA
        ETD G SKLCARGHWRPAED+ LKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEE+RLMQAH+IYGNKWAMIARLFPGRTDNA
Subjt:  ETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNA

Query:  VKNHWHVIMARKCREQFSAYRRRKLSQPACR-----------DTAAAPPHCLSQP----LTKASPSYPFANFYS-------------------------V
        VKNHWHVIMARK REQ SAYRRRKLSQ   R            + AA  +C + P    L+  + SYPF   +S                          
Subjt:  VKNHWHVIMARKCREQFSAYRRRKLSQPACR-----------DTAAAPPHCLSQP----LTKASPSYPFANFYS-------------------------V

Query:  SRANH-------------TPFDLIPPGPEISNEMEVVSSRR--HRSTSDDENHLSAAYYPPLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVS
          +NH             TPFD   PG +    M ++   R   RS  + E  +  A+YPP  +  MQQ +NY+ LH +  S  S      G    SS  
Subjt:  SRANH-------------TPFDLIPPGPEISNEMEVVSSRR--HRSTSDDENHLSAAYYPPLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVS

Query:  EDDGVGRRLESVPPPFFDFLGVGAT
          +GVG   E++PPPF DFLGVGAT
Subjt:  EDDGVGRRLESVPPPFFDFLGVGAT

XP_021691872.1 myb-like protein Q [Hevea brasiliensis]; e value 5.7e-85; identity 50.84%
Query:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDISKVENGRGFWGFPFPSNPTTTPLSSEDKISGCSDYGVGENKP-----------------KGGKETDSG
        MV ++ G  S SL+PN    GV   +  S++   +E GR  W FPF  +  ++ +  E K S CSD   GEN                     GKETDSG
Subjt:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDISKVENGRGFWGFPFPSNPTTTPLSSEDKISGCSDYGVGENKP-----------------KGGKETDSG

Query:  QSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHW
        QSKLCARGHWRPAEDT LKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEE+RLMQAH++YGNKWAMIARLFPGRTDNAVKNHW
Subjt:  QSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHW

Query:  HVIMARKCREQFSAYRRRKLSQP-----------ACRD--TAAAPP-HCLSQP----LTKASPSYPFANFYS------------------VSR-------
        HVIMARK REQ SAYRRRKL+Q             CRD  T A PP +CLS P    L+  SP YP   F+S                  VS        
Subjt:  HVIMARKCREQFSAYRRRKLSQP-----------ACRD--TAAAPP-HCLSQP----LTKASPSYPFANFYS------------------VSR-------

Query:  ----ANHTPFDLIPPGPEISNEMEVVSSRRHRSTSDDENHLSAAYYPPL---MMMGMQQTNNYNFLHRIPNSNVSKSDNILGSE----ASSSVSEDDGVG
            A  TPFD    GP+ ++ + + S  R      DE H+S  +YP L    +  MQQ+N  N  +   +S  S    +  +E     SSSV+E+   G
Subjt:  ----ANHTPFDLIPPGPEISNEMEVVSSRRHRSTSDDENHLSAAYYPPL---MMMGMQQTNNYNFLHRIPNSNVSKSDNILGSE----ASSSVSEDDGVG

Query:  RRLESVPPPFFDFLGVGAT
          LE++PPPF DFLGVGAT
Subjt:  RRLESVPPPFFDFLGVGAT

XP_034696697.1 transcription factor MYB120-like [Vitis riparia]; e value 8.8e-86; identity 49.65%
Query:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------
        MV ++ G  + S+NPN       SSSQES  +    S+VENGR  WGFPF  N T   L         + K S CSD G GEN                 
Subjt:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------

Query:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFP
            GKETDSG SKLCARGHWRPAEDT LKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF+EEEE+RLM AH++YGNKWAMIARLFP
Subjt:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFP

Query:  GRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF-------------------
        GRTDNAVKNHWHVIMARK REQ SAYRRRKLSQ             CRDTA      PP+ +   S  +      +PFA F                   
Subjt:  GRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF-------------------

Query:  ----YSVSR--------ANHTPFDLIPPGPEISNEMEVVSSRR--HRSTSDDENHLSAAY--YPPLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEAS
             S S+        A  TPF+    GP+  + + + S  R   R   +  N++S  Y  YPPL MM MQQ+N +N  +  PNS  S       + A+
Subjt:  ----YSVSR--------ANHTPFDLIPPGPEISNEMEVVSSRR--HRSTSDDENHLSAAY--YPPLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEAS

Query:  SSVSEDDGVGRRLESVPPPFFDFLGVGAT
           S ++ V    E++PPPF DFLGVGAT
Subjt:  SSVSEDDGVGRRLESVPPPFFDFLGVGAT

TrEMBL top hits (e value, % identity, alignment)
A0A438G4R3 Transcription factor MYB105; e value 7.2e-86; identity 48.84%
Query:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------
        MV ++ G  + S+NPN       SSSQES  +    S+VENGR  WGFPF  N T   L         + K S CSD G GEN                 
Subjt:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------

Query:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFP
            GKETDSG SKLCARGHWRPAEDT LKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF+EEEE+RLM AH++YGNKWAMIARLFP
Subjt:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFP

Query:  GRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF-------------------
        GRTDNAVKNHWHVIMARK REQ SAYRRRKLSQ             CRDTA      PP+ +   S  +      +PFA F                   
Subjt:  GRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF-------------------

Query:  ------------YSVSRANHTPFDLIPPGPEISNEMEVVSSRRHRS---TSDDENHLSAAYYP--PLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEA
                    ++   A  TPF+    GP+  + + + S  +HRS     D+ N+ ++ +YP  P +MM MQQ+N +N  +  PNS  S       + A
Subjt:  ------------YSVSRANHTPFDLIPPGPEISNEMEVVSSRRHRS---TSDDENHLSAAYYP--PLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEA

Query:  SSSVSEDDGVGRRLESVPPPFFDFLGVGAT
        +   S ++ V    E++PPPF DFLGVGAT
Subjt:  SSSVSEDDGVGRRLESVPPPFFDFLGVGAT

A0A438G7H9 Transcription factor MYB105; e value 2.0e-83; identity 48.07%
Query:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------
        MV ++ G  + S+NPN       SSSQES  +    S+VENGR  WGFPF  N T   L         + K S CSD G GEN                 
Subjt:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------

Query:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRS-----------GKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYG
            GKETDSG SKLCARGHWRPAEDT LKELVALYGPQNWNLIAEKLEGRS           GKSCRLRWFNQLDPRINRRAF+EEEE+RLM AH++YG
Subjt:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRS-----------GKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYG

Query:  NKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF--------
        NKWAMIARLFPGRTDNAVKNHWHVIMARK REQ SAYRRRKLSQ             CRDTA      PP+ +   S  +      +PFA F        
Subjt:  NKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF--------

Query:  ---------------YSVSR--------ANHTPFDLIPPGPEISNEMEVVSSRRHRS---TSDDENHLSAAYYP--PLMMMGMQQTNNYNFLHRIPNSNV
                        S S+        A  TPF+    GP+  + + + S  +HRS     D+ N+ ++ +YP  P +MM MQQ+N +N  +  PNS  
Subjt:  ---------------YSVSR--------ANHTPFDLIPPGPEISNEMEVVSSRRHRS---TSDDENHLSAAYYP--PLMMMGMQQTNNYNFLHRIPNSNV

Query:  SKSDNILGSEASSSVSEDDGVGRRLESVPPPFFDFLGVGAT
        S       + A+   S ++ V    E++PPPF DFLGVGAT
Subjt:  SKSDNILGSEASSSVSEDDGVGRRLESVPPPFFDFLGVGAT

A0A6P4B003 transcription factor CSA-like; e value 7.7e-88; identity 50.82%
Query:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDISKV----ENGRGFWGFPFPSNPTTTPLSSEDKIS-GCSDYGVGE------NKPKG-----------GK
        MV ++ G  S SL+PN  G   S SS ESY+ISK+    +NGR FWG PFPS+        E K S  CSD  +GE      NKP+            GK
Subjt:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDISKV----ENGRGFWGFPFPSNPTTTPLSSEDKIS-GCSDYGVGE------NKPKG-----------GK

Query:  ETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNA
        ETD G SKLCARGHWRPAED+ LKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEE+RLMQAH+IYGNKWAMIARLFPGRTDNA
Subjt:  ETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNA

Query:  VKNHWHVIMARKCREQFSAYRRRKLSQPACR-----------DTAAAPPHCLSQP----LTKASPSYPFANFYS-------------------------V
        VKNHWHVIMARK REQ SAYRRRKLSQ   R            + AA  +C + P    L+  + SYPF   +S                          
Subjt:  VKNHWHVIMARKCREQFSAYRRRKLSQPACR-----------DTAAAPPHCLSQP----LTKASPSYPFANFYS-------------------------V

Query:  SRANH-------------TPFDLIPPGPEISNEMEVVSSRR--HRSTSDDENHLSAAYYPPLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVS
          +NH             TPFD   PG +    M ++   R   RS  + E  +  A+YPP  +  MQQ +NY+ LH +  S  S      G    SS  
Subjt:  SRANH-------------TPFDLIPPGPEISNEMEVVSSRR--HRSTSDDENHLSAAYYPPLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVS

Query:  EDDGVGRRLESVPPPFFDFLGVGAT
          +GVG   E++PPPF DFLGVGAT
Subjt:  EDDGVGRRLESVPPPFFDFLGVGAT

A0A7J9AU72 Uncharacterized protein; e value 1.5e-83; identity 49.76%
Query:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESY--DISKVENGRGFWGFPFPSNPTTTPLSS-------EDKISGCSD-YG-----------VGENKPK----
        MV  + G  S   N   LG  V SS+Q+SY   + ++EN R  WGF F  N  T            E + S CSD +G           + E  P     
Subjt:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESY--DISKVENGRGFWGFPFPSNPTTTPLSS-------EDKISGCSD-YG-----------VGENKPK----

Query:  GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRT
         GKETDSGQSKLCARGHWRPAEDT LKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEE+RLMQAH++YGNKWAMIARLFPGRT
Subjt:  GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRT

Query:  DNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTAA---APPHCLSQPLTKAS--PSYPFANFYSVSR-------------------
        DNAVKNHWHVIMARK REQ +AYRRRKLSQP            CRD A     PP+CL+ P  +      Y F  F   +                    
Subjt:  DNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTAA---APPHCLSQPLTKAS--PSYPFANFYSVSR-------------------

Query:  ---ANHTPFDLIPPGPEISNEMEVVSSRRHRSTSDDENHLSAAY----YPPLMMMGMQQTNNYNFL-HRIPNSNVSKSDNILGSEASSSV-SEDDGVGRR
           A   PFD I PG + ++ M + S  R      DE  +S  Y    + P  +M MQQ+    FL  +    + + +  I GSE SSSV          
Subjt:  ---ANHTPFDLIPPGPEISNEMEVVSSRRHRSTSDDENHLSAAY----YPPLMMMGMQQTNNYNFL-HRIPNSNVSKSDNILGSEASSSV-SEDDGVGRR

Query:  LESVPPPFFDFLGVGA
         E+VPPPF DFLGVGA
Subjt:  LESVPPPFFDFLGVGA

F6HPQ3 Uncharacterized protein; e value 5.5e-86; identity 49.3%
Query:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------
        MV ++ G  + S+NPN       SSSQES  +    S+VENGR  WGFPF  N T   L         + K S CSD G GEN                 
Subjt:  MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDI----SKVENGRGFWGFPFPSNPTTTPLS-------SEDKISGCSDYGVGENKPK--------------

Query:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFP
            GKETDSG SKLCARGHWRPAEDT LKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF+EEEE+RLM AH++YGNKWAMIARLFP
Subjt:  ---GGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFP

Query:  GRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF-------------------
        GRTDNAVKNHWHVIMARK REQ SAYRRRKLSQ             CRDTA      PP+ +   S  +      +PFA F                   
Subjt:  GRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP-----------ACRDTA----AAPPHCL---SQPLTKASPSYPFANF-------------------

Query:  ----YSVSR--------ANHTPFDLIPPGPEISNEMEVVSSRRHRS---TSDDENHLSAAYYP--PLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEA
             S S+        A  TPF+    GP+  + + + S  +HRS     D+ N+ ++ +YP  P +MM MQQ+N +N  +  PNS  S       + A
Subjt:  ----YSVSR--------ANHTPFDLIPPGPEISNEMEVVSSRRHRS---TSDDENHLSAAYYP--PLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEA

Query:  SSSVSEDDGVGRRLESVPPPFFDFLGVGAT
        +   S ++ V    E++PPPF DFLGVGAT
Subjt:  SSSVSEDDGVGRRLESVPPPFFDFLGVGAT

SwissProt top hits (e value, % identity, alignment)
Q5NBM8 Transcription factor CSA; e value 8.6e-60; identity 63.16%
Query:  GFWGFPFPSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSG------QSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWF
        GF+G P P  P T                VGE + +  +ET+ G      Q KLCARGHWRPAED  LK+LVA YGPQNWNLIAEKL+GRSGKSCRLRWF
Subjt:  GFWGFPFPSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSG------QSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWF

Query:  NQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQPL
        NQLDPRINRRAFTEEEE+RLM AH+ YGNKWA+IARLFPGRTDNAVKNHWHV+MAR+ REQ  A+RRRK S  +     A  P    QP+
Subjt:  NQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQPL

Q6R053 Transcription factor MYB56; e value 3.6e-50; identity 62.73%
Query:  PSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSG-QSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF
        P N + + L SED+         GEN+        SG  +K+C+RGHWRP ED  LKELVA +GPQNWNLI+  L GRSGKSCRLRWFNQLDPRIN+RAF
Subjt:  PSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSG-QSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF

Query:  TEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP
        TEEEE RL+ AH+ YGNKWA+I+RLFPGRTDNAVKNHWHVIMAR+ RE      +R+  QP
Subjt:  TEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP

Q9FX36 Transcription factor MYB54; e value 2.0e-48; identity 46.36%
Query:  LCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVI
        +C+RGHWRPAED  LK+LV  YGP NWN IA KL GRSGKSCRLRWFNQLDPRINR  FTEEEE+RL+ AH+I+GN+W++IARLFPGRTDNAVKNHWHVI
Subjt:  LCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVI

Query:  MARKCREQFSAYRRRKLSQP-ACRDTAAAPPHCLSQPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVVSSRRHRSTSDDENHLSAAYYPPL
        MAR         R R+ S+P     T ++     S+ +  +S  Y   N+ S  R    P D I    + S+   +   +   +     NH +     P+
Subjt:  MARKCREQFSAYRRRKLSQP-ACRDTAAAPPHCLSQPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVVSSRRHRSTSDDENHLSAAYYPPL

Query:  MMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVSEDDGVGRRLESVPPPFFDFLGVG
                  YNFL    N++ +KS+ I      S  S+ D    + ES   PFFDFL VG
Subjt:  MMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVSEDDGVGRRLESVPPPFFDFLGVG

Q9LQX5 Transcription factor MYB117; e value 2.7e-61; identity 48.51%
Query:  NKPKGGKE-TDSGQS--------KLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGN
        N   G KE TDSGQS         +  RGHWRPAED  LKELV++YGPQNWNLIAEKL+GRSGKSCRLRWFNQLDPRINRRAFTEEEE+RLMQAH++YGN
Subjt:  NKPKGGKE-TDSGQS--------KLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGN

Query:  KWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVV
        KWAMIARLFPGRTDN+VKNHWHV+MARK RE  SAYRRRKL            PH     LT      P  N++S    NH       P PE +    +V
Subjt:  KWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVV

Query:  SSRRHRSTSDDENHLSAAYY---------PPL--------MMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVSEDDGVGRRLESV--PPPFFDFL
        +   +   + D N L   ++         PP+        MM+G       + L  IP+ + S  +    ++A   +  D       E     P FFDFL
Subjt:  SSRRHRSTSDDENHLSAAYY---------PPL--------MMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVSEDDGVGRRLESV--PPPFFDFL

Query:  GVG
        G+G
Subjt:  GVG

Q9SEZ4 Transcription factor MYB105; e value 2.8e-58; identity 43.67%
Query:  SSSSQESYDISKVENGRGFWGFPFPSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSGQSKL-CARGHWRPAEDTMLKELVALYGPQNWNLIAEKLE
        +SS   S +I   ++ R ++          T    ED     SDY     K      +    SK   +RGHWRPAEDT LKELVA+YGPQNWNLIAEKL+
Subjt:  SSSSQESYDISKVENGRGFWGFPFPSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSGQSKL-CARGHWRPAEDTMLKELVALYGPQNWNLIAEKLE

Query:  GRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQ
        GRSGKSCRLRWFNQLDPRINRRAFTEEEE+RLMQAH++YGNKWAMIARLFPGRTDN+VKNHWHVIMARK REQ S+YRRRK +  + +      PH  + 
Subjt:  GRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQ

Query:  -PLTKASPSYPFANFYSVSRANHTPFDL-IPPGPEISNEMEVVSSRRHRSTSDDENHLSAAYYPPLMMMG-----MQQTNNYNFLHRIPNSNVSKSDNIL
           T+ + ++       ++ ++H    L +P  P   +E              +E+ L    +   MM+G      Q+   ++FL++   S + +  N  
Subjt:  -PLTKASPSYPFANFYSVSRANHTPFDL-IPPGPEISNEMEVVSSRRHRSTSDDENHLSAAYYPPLMMMG-----MQQTNNYNFLHRIPNSNVSKSDNIL

Query:  GSEASSSVSEDDGVGRRLESVPPPFFDFLGVG
                          E   PPFFDFLG+G
Subjt:  GSEASSSVSEDDGVGRRLESVPPPFFDFLGVG

Arabidopsis top hits (e value, % identity, alignment)
AT1G26780.1 myb domain protein 117; e value 1.4e-60; identity 57.73%
Query:  NKPKGGKE-TDSGQS--------KLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGN
        N   G KE TDSGQS         +  RGHWRPAED  LKELV++YGPQNWNLIAEKL+GRSGKSCRLRWFNQLDPRINRRAFTEEEE+RLMQAH++YGN
Subjt:  NKPKGGKE-TDSGQS--------KLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGN

Query:  KWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVV
        KWAMIARLFPGRTDN+VKNHWHV+MARK RE  SAYRRRKL            PH     LT      P  N++S    NH       P PE +    +V
Subjt:  KWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVV

Query:  SSRRHRSTSDDENHLSAAYY
        +   +   + D N L   ++
Subjt:  SSRRHRSTSDDENHLSAAYY

AT1G26780.2 myb domain protein 117; e value 1.9e-62; identity 48.51%
Query:  NKPKGGKE-TDSGQS--------KLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGN
        N   G KE TDSGQS         +  RGHWRPAED  LKELV++YGPQNWNLIAEKL+GRSGKSCRLRWFNQLDPRINRRAFTEEEE+RLMQAH++YGN
Subjt:  NKPKGGKE-TDSGQS--------KLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGN

Query:  KWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVV
        KWAMIARLFPGRTDN+VKNHWHV+MARK RE  SAYRRRKL            PH     LT      P  N++S    NH       P PE +    +V
Subjt:  KWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVV

Query:  SSRRHRSTSDDENHLSAAYY---------PPL--------MMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVSEDDGVGRRLESV--PPPFFDFL
        +   +   + D N L   ++         PP+        MM+G       + L  IP+ + S  +    ++A   +  D       E     P FFDFL
Subjt:  SSRRHRSTSDDENHLSAAYY---------PPL--------MMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVSEDDGVGRRLESV--PPPFFDFL

Query:  GVG
        G+G
Subjt:  GVG

AT1G69560.1 myb domain protein 105; e value 2.0e-59; identity 43.67%
Query:  SSSSQESYDISKVENGRGFWGFPFPSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSGQSKL-CARGHWRPAEDTMLKELVALYGPQNWNLIAEKLE
        +SS   S +I   ++ R ++          T    ED     SDY     K      +    SK   +RGHWRPAEDT LKELVA+YGPQNWNLIAEKL+
Subjt:  SSSSQESYDISKVENGRGFWGFPFPSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSGQSKL-CARGHWRPAEDTMLKELVALYGPQNWNLIAEKLE

Query:  GRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQ
        GRSGKSCRLRWFNQLDPRINRRAFTEEEE+RLMQAH++YGNKWAMIARLFPGRTDN+VKNHWHVIMARK REQ S+YRRRK +  + +      PH  + 
Subjt:  GRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQ

Query:  -PLTKASPSYPFANFYSVSRANHTPFDL-IPPGPEISNEMEVVSSRRHRSTSDDENHLSAAYYPPLMMMG-----MQQTNNYNFLHRIPNSNVSKSDNIL
           T+ + ++       ++ ++H    L +P  P   +E              +E+ L    +   MM+G      Q+   ++FL++   S + +  N  
Subjt:  -PLTKASPSYPFANFYSVSRANHTPFDL-IPPGPEISNEMEVVSSRRHRSTSDDENHLSAAYYPPLMMMG-----MQQTNNYNFLHRIPNSNVSKSDNIL

Query:  GSEASSSVSEDDGVGRRLESVPPPFFDFLGVG
                          E   PPFFDFLG+G
Subjt:  GSEASSSVSEDDGVGRRLESVPPPFFDFLGVG

AT3G29020.2 myb domain protein 110; e value 1.7e-50; identity 40.99%
Query:  KGGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGR
        K  K+     S++C+RGHWR +EDT L ELV++YGPQNWN IAE ++GR+GKSCRLRWFNQLDPRIN+RAF++EEE+RL+ AH+ +GNKWAMIA+LF GR
Subjt:  KGGKETDSGQSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGR

Query:  TDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVVSSR----RHRST
        TDNA+KNHWHV+MARK R+Q S+Y +R                              F      S  +H  F+L P   +   ++ +        +  +T
Subjt:  TDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLSQPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVVSSR----RHRST

Query:  SDDENHLSAAYYPPLMMMGMQQTNNYNF------LHRIPNSNVSKSDNILGSEASSSVSEDDGVGRRLESV-PPPFFDFLGVG
        +    +L   Y    M M     +   F      L    +     S + L   +SS+  E   V R  E++ PP F DFLGVG
Subjt:  SDDENHLSAAYYPPLMMMGMQQTNNYNF------LHRIPNSNVSKSDNILGSEASSSVSEDDGVGRRLESV-PPPFFDFLGVG

AT5G17800.1 myb domain protein 56; e value 2.6e-51; identity 62.73%
Query:  PSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSG-QSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF
        P N + + L SED+         GEN+        SG  +K+C+RGHWRP ED  LKELVA +GPQNWNLI+  L GRSGKSCRLRWFNQLDPRIN+RAF
Subjt:  PSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSG-QSKLCARGHWRPAEDTMLKELVALYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF

Query:  TEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP
        TEEEE RL+ AH+ YGNKWA+I+RLFPGRTDNAVKNHWHVIMAR+ RE      +R+  QP
Subjt:  TEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQP


Sequences
CDS sequence
ATGGTTTGTTCTGAGTTTGGTTGTCGCTCTACCTCCCTCAATCCCAACCAATTAGGACGAGGGGTTTCTTCTTCGTCTCAAGAGAGTTACGACATCTCTAAAGTTGAAAA
TGGGAGAGGTTTTTGGGGTTTTCCATTTCCTAGTAACCCCACGACAACTCCTCTAAGCTCCGAGGACAAGATCTCAGGGTGCAGCGATTATGGGGTTGGCGAAAACAAGC
CCAAAGGAGGAAAGGAGACGGACAGTGGACAGTCGAAGCTTTGTGCAAGAGGACATTGGAGGCCCGCTGAAGACACCATGCTCAAGGAACTCGTCGCTCTGTATGGCCCT
CAGAACTGGAACCTTATAGCCGAGAAACTGGAGGGGAGATCTGGTAAAAGTTGCAGACTGAGATGGTTTAACCAGTTGGACCCGAGGATCAATCGAAGAGCCTTCACAGA
AGAAGAGGAAGATAGGCTAATGCAAGCTCATAAAATTTACGGTAACAAATGGGCCATGATAGCCAGGCTTTTCCCAGGAAGAACTGATAATGCTGTGAAAAACCATTGGC
ATGTTATAATGGCTAGAAAATGCAGAGAACAGTTCAGTGCTTACCGGAGGCGGAAGCTGAGCCAACCGGCGTGCCGAGACACAGCCGCCGCGCCGCCACATTGCCTCAGT
CAGCCACTCACAAAAGCTTCTCCTTCTTACCCATTTGCAAACTTTTACAGCGTGTCGCGTGCAAATCACACACCTTTTGATCTCATCCCGCCTGGTCCAGAAATCAGCAA
CGAAATGGAAGTTGTATCGTCTCGCCGCCATAGAAGTACCTCAGATGATGAGAACCATCTTTCTGCTGCATATTATCCACCATTGATGATGATGGGAATGCAACAGACAA
ACAATTACAACTTTCTTCACCGAATTCCGAACTCAAACGTTTCAAAATCCGATAATATTTTGGGGAGTGAAGCGTCGTCGTCGGTGAGCGAGGACGACGGAGTTGGTAGG
CGTTTGGAGAGCGTTCCACCGCCGTTCTTCGACTTTCTTGGAGTAGGAGCCACATGA
mRNA sequence
ATGGTTTGTTCTGAGTTTGGTTGTCGCTCTACCTCCCTCAATCCCAACCAATTAGGACGAGGGGTTTCTTCTTCGTCTCAAGAGAGTTACGACATCTCTAAAGTTGAAAA
TGGGAGAGGTTTTTGGGGTTTTCCATTTCCTAGTAACCCCACGACAACTCCTCTAAGCTCCGAGGACAAGATCTCAGGGTGCAGCGATTATGGGGTTGGCGAAAACAAGC
CCAAAGGAGGAAAGGAGACGGACAGTGGACAGTCGAAGCTTTGTGCAAGAGGACATTGGAGGCCCGCTGAAGACACCATGCTCAAGGAACTCGTCGCTCTGTATGGCCCT
CAGAACTGGAACCTTATAGCCGAGAAACTGGAGGGGAGATCTGGTAAAAGTTGCAGACTGAGATGGTTTAACCAGTTGGACCCGAGGATCAATCGAAGAGCCTTCACAGA
AGAAGAGGAAGATAGGCTAATGCAAGCTCATAAAATTTACGGTAACAAATGGGCCATGATAGCCAGGCTTTTCCCAGGAAGAACTGATAATGCTGTGAAAAACCATTGGC
ATGTTATAATGGCTAGAAAATGCAGAGAACAGTTCAGTGCTTACCGGAGGCGGAAGCTGAGCCAACCGGCGTGCCGAGACACAGCCGCCGCGCCGCCACATTGCCTCAGT
CAGCCACTCACAAAAGCTTCTCCTTCTTACCCATTTGCAAACTTTTACAGCGTGTCGCGTGCAAATCACACACCTTTTGATCTCATCCCGCCTGGTCCAGAAATCAGCAA
CGAAATGGAAGTTGTATCGTCTCGCCGCCATAGAAGTACCTCAGATGATGAGAACCATCTTTCTGCTGCATATTATCCACCATTGATGATGATGGGAATGCAACAGACAA
ACAATTACAACTTTCTTCACCGAATTCCGAACTCAAACGTTTCAAAATCCGATAATATTTTGGGGAGTGAAGCGTCGTCGTCGGTGAGCGAGGACGACGGAGTTGGTAGG
CGTTTGGAGAGCGTTCCACCGCCGTTCTTCGACTTTCTTGGAGTAGGAGCCACATGA
Protein sequence
MVCSEFGCRSTSLNPNQLGRGVSSSSQESYDISKVENGRGFWGFPFPSNPTTTPLSSEDKISGCSDYGVGENKPKGGKETDSGQSKLCARGHWRPAEDTMLKELVALYGP
QNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFTEEEEDRLMQAHKIYGNKWAMIARLFPGRTDNAVKNHWHVIMARKCREQFSAYRRRKLSQPACRDTAAAPPHCLS
QPLTKASPSYPFANFYSVSRANHTPFDLIPPGPEISNEMEVVSSRRHRSTSDDENHLSAAYYPPLMMMGMQQTNNYNFLHRIPNSNVSKSDNILGSEASSSVSEDDGVGR
RLESVPPPFFDFLGVGAT