; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G018740 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G018740
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionMYB transcription factor
Genome locationCG_Chr01:33231743..33233218
RNA-Seq ExpressionClCG01G018740
SyntenyClCG01G018740
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604035.1 Transcription factor MYB92, partial [Cucurbita argyrosperma subsp. sororia]4.8e-12165.65Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKLI HIQNHG G+WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF++EEE+ ILNLHS+LGNKWSAIA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHP--------QILQYLDQYHLLQPP----SLTNIINSINPTEIE
        DNEIKNFWNTH+K+KLI+MGFDPMTHRP+T+I  SL HII LA+LK+LM NP WEE          Q+ ++     LLQPP    SLTN   + NPTEIE
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHP--------QILQYLDQYHLLQPP----SLTNIINSINPTEIE

Query:  IMSLINSLSSPLFQNRMIMTQNYTNDNNNNP------LPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTA
        IM+LINS    LF+N+M      T  N N P      L  L TDS GL  SHLPSLEI+PN TN YET PF  KEMC  QNNEV   NN  WQLPSSST 
Subjt:  IMSLINSLSSPLFQNRMIMTQNYTNDNNNNP------LPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTA

Query:  ASP-------PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL
         SP       P  NI HDGPSSN +N GDACS+SSF GSSA IWPDHLLLE +  H+ P L
Subjt:  ASP-------PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL

XP_004143558.1 transcription factor MYB92 [Cucumis sativus]2.4e-12870.17Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKLI HIQNHG G+WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF++EEE+ ILNLHS+LGNKWSAIA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQIL----------QYLDQYHLLQPPSLTNIINSINPTEIEIM
        DNEIKNFWNTH+K+KLI+MGFDPMTHRP+T+I  SL HII LA+LK+LM NP WEE    L          QYL   +LLQPPSL  I N+ NPTEI+IM
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQIL----------QYLDQYHLLQPPSLTNIINSINPTEIEIM

Query:  SLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIP-NNTNIYETTPFNVKEMCVVQNNEVCD-TNNSPWQLPSSSTAASP---
        +LINSLSSPLF+N++ +TQN  N+N N PL  LGTDSV L  SHLPSLE++P NNTN YET PF  KEMC  QNNEVCD  NNSPWQLP SSTA SP   
Subjt:  SLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIP-NNTNIYETTPFNVKEMCVVQNNEVCD-TNNSPWQLPSSSTAASP---

Query:  -PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL
         P  NI ++GPSSN +N GDACSSSS  GSS+ IWPDHLLLE +  H+ P L
Subjt:  -PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL

XP_008440591.1 PREDICTED: myb-related protein Hv1 [Cucumis melo]1.5e-12769.32Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKLI HI+NHG G+WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF++EEE+ ILNLHS+LGNKWSAIA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQIL----------QYLDQYHLLQPPSLTNIINSINPTEIEIM
        DNEIKNFWNTH+K+KLI+MG+DPMTHRP+T+I  SL HII LA+LK+LM NP WEE    L          QYL   +LLQPPSL N  N+ NPTEIEIM
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQIL----------QYLDQYHLLQPPSLTNIINSINPTEIEIM

Query:  SLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIP-NNTNIYETTPFNVKEMCVVQNNEVCD-TNNSPWQLPSSSTAASP---
        +LINSLSSP+F+N++ +TQN  N++ N PL  LGTDS+ L  SHLPSLE++P NNTN YETT F  KEMC  QNNEVCD  NNSPWQLPSSSTA SP   
Subjt:  SLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIP-NNTNIYETTPFNVKEMCVVQNNEVCD-TNNSPWQLPSSSTAASP---

Query:  -PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL
         P  NI ++GPSSN +N GDACSSSS  GSS+ IWPDHLLLE +  H+ P L
Subjt:  -PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL

XP_023544463.1 transcription factor MYB41-like [Cucurbita pepo subsp. pepo]2.8e-12165.65Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKLI HIQNHG G+WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF++EEE+ ILNLHS+LGNKWSAIA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHP--------QILQYLDQYHLLQPP----SLTNIINSINPTEIE
        DNEIKNFWNTH+K+KLI+MGFDPMTHRP+T+I  SL HII LA+LK+LM NP WEE          Q+ ++     LLQPP    SLTN   + NPTEIE
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHP--------QILQYLDQYHLLQPP----SLTNIINSINPTEIE

Query:  IMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPS------LGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTA
        IM+LINS    LF+N+M      T  N N PL +      L TDS GL  SHLPSLEI+PN TN YET PF  KEMC  QNNEV   NN  WQLPSSST 
Subjt:  IMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPS------LGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTA

Query:  ASP-------PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL
         SP       P  NI HDGPSSN +N GDACS+SSF GSSA IWPDHLLLE +  H+ P L
Subjt:  ASP-------PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL

XP_038883366.1 transcription factor MYB53-like [Benincasa hispida]1.0e-13471.67Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKLI HI+NHG G+WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF++EEE+ ILNLHS+LGNKWSAIA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQIL----------QYLDQYHLLQPPSLTNI---INSINPTEI
        DNEIKNFWNTH+K+KLI+MGFDPMTHRP+T+I  SL HII LA+LK+LM NP WEE    L          QYL   +LLQPPSLTNI    N+ NPTEI
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQIL----------QYLDQYHLLQPPSLTNI---INSINPTEI

Query:  EIMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTAASPPH
        EIM+LINSLSSPLF+N+M MTQ      NNNP+  LGTDSVGL  SHLPSLE++PNNTN YETTPF  KEM   QNNEVCD NN+PWQL SSSTA SPP 
Subjt:  EIMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTAASPPH

Query:  ----RNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL
             NI HDGPSSN+NN GDACSSSSF GSS  IWPDHLLLE +  H+ P L
Subjt:  ----RNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL

TrEMBL top hitse value%identityAlignment
A0A1S3B278 myb-related protein Hv17.5e-12869.32Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKLI HI+NHG G+WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF++EEE+ ILNLHS+LGNKWSAIA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQIL----------QYLDQYHLLQPPSLTNIINSINPTEIEIM
        DNEIKNFWNTH+K+KLI+MG+DPMTHRP+T+I  SL HII LA+LK+LM NP WEE    L          QYL   +LLQPPSL N  N+ NPTEIEIM
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQIL----------QYLDQYHLLQPPSLTNIINSINPTEIEIM

Query:  SLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIP-NNTNIYETTPFNVKEMCVVQNNEVCD-TNNSPWQLPSSSTAASP---
        +LINSLSSP+F+N++ +TQN  N++ N PL  LGTDS+ L  SHLPSLE++P NNTN YETT F  KEMC  QNNEVCD  NNSPWQLPSSSTA SP   
Subjt:  SLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIP-NNTNIYETTPFNVKEMCVVQNNEVCD-TNNSPWQLPSSSTAASP---

Query:  -PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL
         P  NI ++GPSSN +N GDACSSSS  GSS+ IWPDHLLLE +  H+ P L
Subjt:  -PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL

A0A5A7T2B1 Myb-related protein Hv17.5e-12869.32Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKLI HI+NHG G+WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF++EEE+ ILNLHS+LGNKWSAIA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQIL----------QYLDQYHLLQPPSLTNIINSINPTEIEIM
        DNEIKNFWNTH+K+KLI+MG+DPMTHRP+T+I  SL HII LA+LK+LM NP WEE    L          QYL   +LLQPPSL N  N+ NPTEIEIM
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQIL----------QYLDQYHLLQPPSLTNIINSINPTEIEIM

Query:  SLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIP-NNTNIYETTPFNVKEMCVVQNNEVCD-TNNSPWQLPSSSTAASP---
        +LINSLSSP+F+N++ +TQN  N++ N PL  LGTDS+ L  SHLPSLE++P NNTN YETT F  KEMC  QNNEVCD  NNSPWQLPSSSTA SP   
Subjt:  SLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIP-NNTNIYETTPFNVKEMCVVQNNEVCD-TNNSPWQLPSSSTAASP---

Query:  -PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL
         P  NI ++GPSSN +N GDACSSSS  GSS+ IWPDHLLLE +  H+ P L
Subjt:  -PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL

A0A6J1BVM9 transcription factor MYB41-like9.5e-11563.38Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKL+ HIQ HG G+WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF++EEE+ ILNLHS+LGNKWSAIA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHP--------QILQYLDQYHLLQPPSLTNIINSINPTEIEIMSL
        DNEIKNFWNTH+K+KLI+MGFDPMTHRP+T+I  SL HII LA+LK+LM NP WEE          Q+ ++    +LLQPP+ +  +N+ NP+EIEIM+L
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHP--------QILQYLDQYHLLQPPSLTNIINSINPTEIEIMSL

Query:  INSLSSPLFQNRMIMTQNYTNDNNNN-----PLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFN--VKEMCVVQNNEVCDTN-NSPWQLPSSSTAAS
        INSLSSPL +N +    +++N+NN N      L  LGTDS+ L  SHLPSLEI  N TN YET P N   KEM   Q+++  +++ NSPWQLPSSST  S
Subjt:  INSLSSPLFQNRMIMTQNYTNDNNNN-----PLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFN--VKEMCVVQNNEVCDTN-NSPWQLPSSSTAAS

Query:  ---PPHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL
           PPH NIN    S++ NN GDACS+SSF GSSA IWPDHLLLE    H+ P L
Subjt:  ---PPHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL

A0A6J1GF42 transcription factor MYB41-like5.2e-12165.37Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKLI HIQNHG G+WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF++EEE+ ILNLHS+LGNKWSAIA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHP--------QILQYLDQYHLLQPP----SLTNIINSINPTEIE
        DNEIKNFWNTH+K+KLI+MGFDPMTHRP+T+I  SL HII LA+LK+LM NP WEE          Q+ ++     LLQPP    SLTN   + NPTEIE
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHP--------QILQYLDQYHLLQPP----SLTNIINSINPTEIE

Query:  IMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPS------LGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTA
        IM+LINS    LF+++M      T  N N PL +      L TDS GL  SHLPSLEI+PN TN YET PF  KEMC  QNNEV   NN  WQLPSSST 
Subjt:  IMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPS------LGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTA

Query:  ASP-------PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL
         SP       P  NI HDGPSSN +N GDACS+SSF GSSA IWPDHLLLE +  H+ P L
Subjt:  ASP-------PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL

A0A6J1ITC6 transcription factor MYB41-like3.0e-12165.65Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKLI HIQNHG G+WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF+ EEE+ ILNLHS+LGNKWSAIA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHP--------QILQYLDQYHLLQPP----SLTNIINSINPTEIE
        DNEIKNFWNTH+K+KLI+MGFDPMTHRP+T+I  SL HII LA+LK+LM NP WEE          Q+ ++     LLQPP    SLTN   + NPTEIE
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHP--------QILQYLDQYHLLQPP----SLTNIINSINPTEIE

Query:  IMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPS------LGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTA
        IM+LINS    LF+N+M      T  N N PL +      L TDS GL  SHLPSLEI+PN TN YET PF  KEMC  QNNEV   NN  WQLPSSST 
Subjt:  IMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPS------LGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTA

Query:  ASP-------PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL
         SP       P  NI HDGPSSN +N GDACS+SSF GSSA IWPDHLLLE +  H+ P L
Subjt:  ASP-------PHRNINHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSFLHNTPLL

SwissProt top hitse value%identityAlignment
Q9FJP2 Transcription factor MYB536.6e-6554.13Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSP  D + LKKGPW PEED KLI +I  HG  +W +LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF+ EEEE ILNLH++LGNKWS IA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQT-NITWSLAHIIGLATLKQLMNNPLWEEHP----------------QILQYLDQYHLLQPPSLTNIINSIN
        DNEIKNFWNTH+K+KLI+MGFDPMTH+P+T +I  SL+ ++ L+ L+ L++  L ++ P                Q+ QY     LLQP      IN+IN
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQT-NITWSLAHIIGLATLKQLMNNPLWEEHP----------------QILQYLDQYHLLQPPSLTNIINSIN

Query:  PTEIEIMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGT
        P  + ++   NS++S +    +    ++  D NNN LPSL T
Subjt:  PTEIEIMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGT

Q9LE63 Transcription factor MYB1065.6e-5671.65Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD + LKKGPW+PEEDQKL+ +I+ HG G+WRSLP+ AG+ RCGKSCRLRW NYLRPDIKRGKFT +EE+ I+ LH++LGN+WSAIA  LP RT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHR
        DNEIKN+WNTH+K++LI+MG DP+TH+
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHR

Q9LXF1 Transcription factor MYB163.3e-5669.77Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD   LKKGPW+PEEDQKL+ +I+ HG G+WRSLP+ AG++RCGKSCRLRW NYLRPDIKRGKF  +EE+ I+ LH++LGN+WSAIA  LP RT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQ
        DNEIKN+WNTH+K++L++MG DP+TH+P+
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQ

Q9S9Z2 Transcription factor MYB935.4e-6743.9Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD + LKKGPW+PEEDQKLI +I  HG G+WR+LPKLA +NRCGKSCRLRW NYLRPDIKRGKF+ EEE+ IL+LHSILGNKWSAIA  L GRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMN-----NPLWEEHPQI--LQYLDQYHLLQPPSLTNIINSINPTEI------
        DNEIKNFWNTH+K+KLI+MG DP+TH+P+T++  SL  +I LA LK L+      + +  E  Q+  LQYL +          N  N+ +P+ I      
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMN-----NPLWEEHPQI--LQYLDQYHLLQPPSLTNIINSINPTEI------

Query:  EIMSLINSLSS------PLF----------QNRMIMTQNYTNDNNNNPLPS----LGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCV-------
          M+L+NS+ S      P F          QN+ +    +  D    PL      L      L     P L+ +P +      TP N ++  +       
Subjt:  EIMSLINSLSS------PLF----------QNRMIMTQNYTNDNNNNPLPS----LGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCV-------

Query:  VQNNEVCDTNNSPWQLPS------SSTAASPPHRNINHDGPSSNTNNCGDACSSSSFVGSSAC--IWPD
          +    D N S W LPS       +  +S PH            NN  DA SSSS+ G  A    WPD
Subjt:  VQNNEVCDTNNSPWQLPS------SSTAASPPHRNINHDGPSSNTNNCGDACSSSSFVGSSAC--IWPD

Q9SBF3 Transcription factor MYB922.6e-6945.71Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSP  D S LKKGPW+P+ED+KL+ ++Q HG  +WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRG+F+ +EE+ ILNLHS+LGNKWS IA QLPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLAT-----LKQLMNNPLWEEHP-----------QILQYLDQYHLLQPPSLTNIINSINP
        DNEIKNFWNTH+K+KLI+MGFDPMTHRP+T+I   L+ ++ L++     +      P+ +EH            Q+ QY     LLQP S++   N++NP
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLAT-----LKQLMNNPLWEEHP-----------QILQYLDQYHLLQPPSLTNIINSINP

Query:  TEIEIMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSP-WQLPSSSTAA
         + + +SL+NS++S  F+     T N T  NN + L  LG  S       LPSL+ + +N       P N+ +     + +  +   SP W    SST  
Subjt:  TEIEIMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSP-WQLPSSSTAA

Query:  SPPHRN----INHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSF
        +P H N     N  G     +N   +    S   +SA  WPDHLL +  F
Subjt:  SPPHRN----INHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSF

Arabidopsis top hitse value%identityAlignment
AT1G34670.1 myb domain protein 933.8e-6843.9Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD + LKKGPW+PEEDQKLI +I  HG G+WR+LPKLA +NRCGKSCRLRW NYLRPDIKRGKF+ EEE+ IL+LHSILGNKWSAIA  L GRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMN-----NPLWEEHPQI--LQYLDQYHLLQPPSLTNIINSINPTEI------
        DNEIKNFWNTH+K+KLI+MG DP+TH+P+T++  SL  +I LA LK L+      + +  E  Q+  LQYL +          N  N+ +P+ I      
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMN-----NPLWEEHPQI--LQYLDQYHLLQPPSLTNIINSINPTEI------

Query:  EIMSLINSLSS------PLF----------QNRMIMTQNYTNDNNNNPLPS----LGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCV-------
          M+L+NS+ S      P F          QN+ +    +  D    PL      L      L     P L+ +P +      TP N ++  +       
Subjt:  EIMSLINSLSS------PLF----------QNRMIMTQNYTNDNNNNPLPS----LGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCV-------

Query:  VQNNEVCDTNNSPWQLPS------SSTAASPPHRNINHDGPSSNTNNCGDACSSSSFVGSSAC--IWPD
          +    D N S W LPS       +  +S PH            NN  DA SSSS+ G  A    WPD
Subjt:  VQNNEVCDTNNSPWQLPS------SSTAASPPHRNINHDGPSSNTNNCGDACSSSSFVGSSAC--IWPD

AT3G02940.1 myb domain protein 1071.1e-6244.93Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD S LKKGPW+PEEDQKLI HI+ HG G+WR+LPK AG+NRCGKSCRLRW NYLRPDIKRG FT EEE+ I+NLHS+LGNKWS+IA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQT---NITWSLAHIIGLATLKQL--MNNPLWEEHPQILQYLDQYHLLQPPSLTNIINS--INPTEIEIMSLI
        DNEIKN+WNTH+++KLI+MG DP+THRP+T   N+  +L  ++  A    L  +N  +  +   + +    + ++Q  S  N  +S  I+ T   +    
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQT---NITWSLAHIIGLATLKQL--MNNPLWEEHPQILQYLDQYHLLQPPSLTNIINS--INPTEIEIMSLI

Query:  NSLSS-PLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTAASPPHR
        + L + P  +N    TQ  ++  ++ PL S  +    ++  H      IP    +  T+P   KE  ++  N+     N     PSS++  +  H+
Subjt:  NSLSS-PLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTAASPPHR

AT5G10280.1 myb domain protein 921.8e-7045.71Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSP  D S LKKGPW+P+ED+KL+ ++Q HG  +WR+LPKLAG+NRCGKSCRLRW NYLRPDIKRG+F+ +EE+ ILNLHS+LGNKWS IA QLPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLAT-----LKQLMNNPLWEEHP-----------QILQYLDQYHLLQPPSLTNIINSINP
        DNEIKNFWNTH+K+KLI+MGFDPMTHRP+T+I   L+ ++ L++     +      P+ +EH            Q+ QY     LLQP S++   N++NP
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLAT-----LKQLMNNPLWEEHP-----------QILQYLDQYHLLQPPSLTNIINSINP

Query:  TEIEIMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSP-WQLPSSSTAA
         + + +SL+NS++S  F+     T N T  NN + L  LG  S       LPSL+ + +N       P N+ +     + +  +   SP W    SST  
Subjt:  TEIEIMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSP-WQLPSSSTAA

Query:  SPPHRN----INHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSF
        +P H N     N  G     +N   +    S   +SA  WPDHLL +  F
Subjt:  SPPHRN----INHDGPSSNTNNCGDACSSSSFVGSSACIWPDHLLLEHSF

AT5G16770.1 myb domain protein 92.0e-6142.68Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSPCCD + LKKGPW+ EED KLI HIQ HG G+WR+LPK AG+NRCGKSCRLRW NYLRPDIKRG FT EEE+ I+NLHS+LGNKWS+IA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQT---NITWSLAHIIGLATLKQLMNNPLWEEHPQILQYLDQYHLLQPPSLTNIINSINPTEIEIMSLINSLS
        DNEIKN+WNTH+++KL++MG DP+THRP+T   N+  +L  +I  A    L+N             L+Q   L   +L      +  T I+++S  N+ +
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQT---NITWSLAHIIGLATLKQLMNNPLWEEHPQILQYLDQYHLLQPPSLTNIINSINPTEIEIMSLINSLS

Query:  SPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTAASPPHRNI-NHDGPS
        +P F +  +   N       + L +        + SH+   E +   T I +    +      +Q     D N+ P  +P+S   +    R I N D   
Subjt:  SPLFQNRMIMTQNYTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTAASPPHRNI-NHDGPS

Query:  SNTNNCGDACSSSS
         + ++  +  SS+S
Subjt:  SNTNNCGDACSSSS

AT5G65230.1 myb domain protein 534.7e-6654.13Show/hide
Query:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT
        MGRSP  D + LKKGPW PEED KLI +I  HG  +W +LPKLAG+NRCGKSCRLRW NYLRPDIKRGKF+ EEEE ILNLH++LGNKWS IA  LPGRT
Subjt:  MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRT

Query:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQT-NITWSLAHIIGLATLKQLMNNPLWEEHP----------------QILQYLDQYHLLQPPSLTNIINSIN
        DNEIKNFWNTH+K+KLI+MGFDPMTH+P+T +I  SL+ ++ L+ L+ L++  L ++ P                Q+ QY     LLQP      IN+IN
Subjt:  DNEIKNFWNTHMKRKLIEMGFDPMTHRPQT-NITWSLAHIIGLATLKQLMNNPLWEEHP----------------QILQYLDQYHLLQPPSLTNIINSIN

Query:  PTEIEIMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGT
        P  + ++   NS++S +    +    ++  D NNN LPSL T
Subjt:  PTEIEIMSLINSLSSPLFQNRMIMTQNYTNDNNNNPLPSLGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGGAGCCCTTGCTGTGACGGAAGTCCCCTCAAAAAGGGCCCTTGGTCCCCTGAAGAAGACCAAAAGCTCATCACACATATCCAAAACCATGGCCAAGGC
AACTGGAGATCCCTTCCCAAACTCGCCGGAATAAATAGATGTGGGAAGAGTTGCAGATTAAGATGGATAAACTACCTTAGGCCGGATATAAAGAGGGGCAAATTC
ACTCGAGAAGAAGAGGAAATCATTCTCAATCTCCATTCCATTCTTGGGAATAAGTGGTCGGCAATTGCAAGACAACTACCTGGACGAACGGACAATGAAATAAAG
AATTTTTGGAATACTCACATGAAGAGAAAACTAATTGAGATGGGTTTTGATCCAATGACCCACCGTCCTCAAACTAACATAACTTGGAGTTTAGCACACATCATA
GGCTTGGCTACTCTTAAACAACTCATGAATAACCCCTTATGGGAAGAACACCCACAAATCCTTCAATACCTTGATCAATATCATCTCCTTCAACCACCTTCCTTA
ACCAACATTATTAATAGTATTAACCCTACTGAAATTGAAATCATGAGTTTAATCAACTCTCTCTCTTCTCCTCTCTTTCAAAACCGAATGATCATGACTCAAAAT
TATACTAATGATAATAACAATAATCCTCTTCCATCACTCGGTACTGACTCAGTGGGTCTCTCTCTTTCCCATTTGCCAAGCTTGGAAATAATTCCAAATAATACA
AATATTTACGAAACGACGCCGTTCAATGTGAAGGAAATGTGTGTTGTCCAAAATAATGAAGTTTGTGATACTAATAATTCTCCATGGCAACTGCCTTCTTCTTCC
ACCGCCGCATCCCCGCCGCACCGGAATATTAATCATGACGGGCCTTCTTCAAACACTAATAATTGTGGAGATGCATGTAGTAGTTCAAGCTTTGTGGGATCATCT
GCTTGCATTTGGCCAGACCACCTCCTCCTTGAACACTCTTTCCTTCATAATACTCCACTACTTCCTCCTTGGCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGGAGCCCTTGCTGTGACGGAAGTCCCCTCAAAAAGGGCCCTTGGTCCCCTGAAGAAGACCAAAAGCTCATCACACATATCCAAAACCATGGCCAAGGC
AACTGGAGATCCCTTCCCAAACTCGCCGGAATAAATAGATGTGGGAAGAGTTGCAGATTAAGATGGATAAACTACCTTAGGCCGGATATAAAGAGGGGCAAATTC
ACTCGAGAAGAAGAGGAAATCATTCTCAATCTCCATTCCATTCTTGGGAATAAGTGGTCGGCAATTGCAAGACAACTACCTGGACGAACGGACAATGAAATAAAG
AATTTTTGGAATACTCACATGAAGAGAAAACTAATTGAGATGGGTTTTGATCCAATGACCCACCGTCCTCAAACTAACATAACTTGGAGTTTAGCACACATCATA
GGCTTGGCTACTCTTAAACAACTCATGAATAACCCCTTATGGGAAGAACACCCACAAATCCTTCAATACCTTGATCAATATCATCTCCTTCAACCACCTTCCTTA
ACCAACATTATTAATAGTATTAACCCTACTGAAATTGAAATCATGAGTTTAATCAACTCTCTCTCTTCTCCTCTCTTTCAAAACCGAATGATCATGACTCAAAAT
TATACTAATGATAATAACAATAATCCTCTTCCATCACTCGGTACTGACTCAGTGGGTCTCTCTCTTTCCCATTTGCCAAGCTTGGAAATAATTCCAAATAATACA
AATATTTACGAAACGACGCCGTTCAATGTGAAGGAAATGTGTGTTGTCCAAAATAATGAAGTTTGTGATACTAATAATTCTCCATGGCAACTGCCTTCTTCTTCC
ACCGCCGCATCCCCGCCGCACCGGAATATTAATCATGACGGGCCTTCTTCAAACACTAATAATTGTGGAGATGCATGTAGTAGTTCAAGCTTTGTGGGATCATCT
GCTTGCATTTGGCCAGACCACCTCCTCCTTGAACACTCTTTCCTTCATAATACTCCACTACTTCCTCCTTGGCTTTAG
Protein sequenceShow/hide protein sequence
MGRSPCCDGSPLKKGPWSPEEDQKLITHIQNHGQGNWRSLPKLAGINRCGKSCRLRWINYLRPDIKRGKFTREEEEIILNLHSILGNKWSAIARQLPGRTDNEIK
NFWNTHMKRKLIEMGFDPMTHRPQTNITWSLAHIIGLATLKQLMNNPLWEEHPQILQYLDQYHLLQPPSLTNIINSINPTEIEIMSLINSLSSPLFQNRMIMTQN
YTNDNNNNPLPSLGTDSVGLSLSHLPSLEIIPNNTNIYETTPFNVKEMCVVQNNEVCDTNNSPWQLPSSSTAASPPHRNINHDGPSSNTNNCGDACSSSSFVGSS
ACIWPDHLLLEHSFLHNTPLLPPWL