; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011706 (gene) of Snake gourd v1 genome

Gene IDTan0011706
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionhomeobox-leucine zipper protein ATHB-20-like
Genome locationLG11:4582562..4584986
RNA-Seq ExpressionTan0011706
SyntenyTan0011706
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000047 - Helix-turn-helix motif
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602484.1 Homeobox-leucine zipper protein HAT7, partial [Cucurbita argyrosperma subsp. sororia]8.5e-13682.3Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFEL
        MASPHHSHSF+FQSRPAD HH+Y P+ SFN IP CPPHLYFHDGVVPLMMKRSMSFS  ENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFEL
Subjt:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFEL

Query:  GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAA
        GNKLEPERK+QLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+VLKKQFEALKADND+LQAQNTKL+AQ             L ALK +KDS   GE A
Subjt:  GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAA

Query:  GAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGIE
         AGG  T NLKKENERCWSSDNSCDINL+ISRTQA IT+N   GGR CSQPGIKDLFP   FRSAAITQLLQHGSSRSTVDH   VI EESF+QMFNG+E
Subjt:  GAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGIE

Query:  EQQQTSAAGFWPWTSDQNSHFH
        EQ QT+AAGFWPW+SDQNSHFH
Subjt:  EQQQTSAAGFWPWTSDQNSHFH

XP_008459304.1 PREDICTED: homeobox-leucine zipper protein ATHB-20-like [Cucumis melo]2.2e-13683.38Show/hide
Query:  MASP-HHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFE
        MASP HHSHSF+FQSRPA  HHEYVPS SFNTIPSCPPHLYFHDGVVP+MMKRSMSFSGVENGCE+VNGDEGLSDDG ALGEKKKRLNLEQVKALEKSFE
Subjt:  MASP-HHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFE

Query:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEA
        +GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADND+LQAQNTKL+A+             L ALK +KDS   GE 
Subjt:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEA

Query:  AGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPG-IKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG
         G GGG T NLKKENERCWSSDNSCDINLDIS TQ  I    GGGGRACSQPG IKDLFP   FRSAAITQLLQHGSSRSTVD   QVI EESFSQMFNG
Subjt:  AGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPG-IKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG

Query:  IEEQQQT-SAAGFWPWTSDQNSHFH
        IEEQQQT +AAGFWPW+SDQNSHFH
Subjt:  IEEQQQT-SAAGFWPWTSDQNSHFH

XP_022991191.1 homeobox-leucine zipper protein ATHB-20-like [Cucurbita maxima]5.9e-13782.41Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVP--STSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSF
        MASPHHSHSF+FQSRPAD HH+Y P  + SFN IP CPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSF
Subjt:  MASPHHSHSFIFQSRPADHHHEYVP--STSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSF

Query:  ELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGE
        ELGNKLEPERK+QLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+VLKKQFEALKADND+LQAQNTKL+A+             L ALK +KDS   GE
Subjt:  ELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGE

Query:  AAGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG
         A AGG  T NLKKENERCWSSDNSCDINL+ISRTQA IT+N   GGR CSQPGIKDLFP   FRSAAITQLLQHGSSRSTV+H   VI EESF+QMFNG
Subjt:  AAGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG

Query:  IEEQQQTSAAGFWPWTSDQNSHFH
        IEEQQQT+AAGFWPW+SDQNSHFH
Subjt:  IEEQQQTSAAGFWPWTSDQNSHFH

XP_023545145.1 homeobox-leucine zipper protein ATHB-20-like [Cucurbita pepo subsp. pepo]1.6e-13481.79Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVP--STSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSF
        MASPHHSHSF+FQSRPAD HH+Y P  + SFN IP CPPHLYFHDGVVPLMMKRSMSFS  ENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSF
Subjt:  MASPHHSHSFIFQSRPADHHHEYVP--STSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSF

Query:  ELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGE
        ELGNKLEPERK+QLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+VLKKQFEALKADND+LQAQNTKL+AQ             L ALK +KDS   GE
Subjt:  ELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGE

Query:  AAGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG
         A AGG  T NLKKENERCWSSDNSCDINL++SRTQA+IT+N   GGR CSQPGIKDLFP   FRSAAITQLLQHGSSRSTVDH   VI EESF+QMFNG
Subjt:  AAGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG

Query:  IEEQQQTSAAGFWPWTSDQNSHFH
        +EEQQQT+AAGFWPW SDQNSHFH
Subjt:  IEEQQQTSAAGFWPWTSDQNSHFH

XP_038890842.1 LOW QUALITY PROTEIN: homeobox-leucine zipper protein ATHB-20-like [Benincasa hispida]2.0e-13782.93Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFEL
        MASPHHSHSF+FQSRPAD HHEYVPS SFN IPSCPPHLYFHDGVVP+MMKRSMSFSGVENGCEEVNGDEGLSDDG ALGEKKKRLNLEQVKALEKSFEL
Subjt:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFEL

Query:  GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAA
        GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADND+LQAQNTKL+A+             L ALK +KDS   GE  
Subjt:  GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAA

Query:  GAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPG-IKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGI
          GG  T NLKKENE CWSSDNSCDINLDIS+TQA+I   +GGGGR CSQPG IKDLFP   FRSAAITQLLQHGSSRSTVDHP QVI EESFSQMFNGI
Subjt:  GAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPG-IKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGI

Query:  EEQQQT-----SAAGFWPWTSDQNSHFH
        EEQQQT     +AAGFWPW SDQNSHFH
Subjt:  EEQQQT-----SAAGFWPWTSDQNSHFH

TrEMBL top hitse value%identityAlignment
A0A0A0KS90 Homeobox domain-containing protein5.0e-13482.52Show/hide
Query:  MASP-HHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFE
        MASP HHSHSF+FQSRPA  HHEY+PS SFNTIPSCPPHLYFHDGVVP+MMKRSMSFS VENGCE+VNGDEGLSDDG ALGEKKKRLNLEQVKALEKSFE
Subjt:  MASP-HHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFE

Query:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEA
        +GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADND+LQAQNTKL+A+             L ALK +KDS   GE 
Subjt:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEA

Query:  AGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPG-IKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG
        AG GGG T NLKKENERCWSSDNSCDINLDIS TQ  I    G GGR CSQPG IKDLFP   FRSAAITQLLQHGSSRSTVD   QVI EESFSQMFNG
Subjt:  AGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPG-IKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG

Query:  IEEQQQT-SAAGFWPW-TSDQNSHFH
        IEEQQQT +AAGFWPW TSDQNSHFH
Subjt:  IEEQQQT-SAAGFWPW-TSDQNSHFH

A0A1S3C9V4 homeobox-leucine zipper protein ATHB-20-like1.1e-13683.38Show/hide
Query:  MASP-HHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFE
        MASP HHSHSF+FQSRPA  HHEYVPS SFNTIPSCPPHLYFHDGVVP+MMKRSMSFSGVENGCE+VNGDEGLSDDG ALGEKKKRLNLEQVKALEKSFE
Subjt:  MASP-HHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFE

Query:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEA
        +GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADND+LQAQNTKL+A+             L ALK +KDS   GE 
Subjt:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEA

Query:  AGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPG-IKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG
         G GGG T NLKKENERCWSSDNSCDINLDIS TQ  I    GGGGRACSQPG IKDLFP   FRSAAITQLLQHGSSRSTVD   QVI EESFSQMFNG
Subjt:  AGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPG-IKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG

Query:  IEEQQQT-SAAGFWPWTSDQNSHFH
        IEEQQQT +AAGFWPW+SDQNSHFH
Subjt:  IEEQQQT-SAAGFWPWTSDQNSHFH

A0A6J1GNV6 homeobox-leucine zipper protein ATHB-20-like7.7e-13581.68Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFEL
        MASPHHSHSF+FQSRPAD HH+Y P+ SFN IP CPPHLYFHDGVVPLMMKRSMSFS  ENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFEL
Subjt:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFEL

Query:  GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAA
        GNKLEPERK+QLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+VLKKQFEALKADND+LQAQNTKL+AQ             L ALK +KDS   GE A
Subjt:  GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAA

Query:  GAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGIE
         AGG  T NLKKENERCWSSDNSCDINL+ISRTQA IT+N   GGR CSQPGIKDLFP   FRSAAITQLLQHGSSRSTV+H   VI EESF+QMFNG+E
Subjt:  GAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGIE

Query:  EQQQTSAAGFWPWTSDQNSHFH
         Q QT+AAGFWPW+SDQNSHFH
Subjt:  EQQQTSAAGFWPWTSDQNSHFH

A0A6J1JQ21 homeobox-leucine zipper protein ATHB-20-like2.8e-13782.41Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVP--STSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSF
        MASPHHSHSF+FQSRPAD HH+Y P  + SFN IP CPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSF
Subjt:  MASPHHSHSFIFQSRPADHHHEYVP--STSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSF

Query:  ELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGE
        ELGNKLEPERK+QLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+VLKKQFEALKADND+LQAQNTKL+A+             L ALK +KDS   GE
Subjt:  ELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGE

Query:  AAGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG
         A AGG  T NLKKENERCWSSDNSCDINL+ISRTQA IT+N   GGR CSQPGIKDLFP   FRSAAITQLLQHGSSRSTV+H   VI EESF+QMFNG
Subjt:  AAGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNG

Query:  IEEQQQTSAAGFWPWTSDQNSHFH
        IEEQQQT+AAGFWPW+SDQNSHFH
Subjt:  IEEQQQTSAAGFWPWTSDQNSHFH

A0A6J1JTG1 homeobox-leucine zipper protein ATHB-20-like8.5e-13480.86Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFEL
        MASPHHSHSF+FQSRPAD HHEY+PS SFN IPSCPPHLYFHDGV+P+MMKRSMSFSGVENGCEEVNGDEGLSDDG ALGEKKKRLNLEQVKALEKSFEL
Subjt:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFEL

Query:  GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAA
        GNKLEPERK+QLAKALGLQPRQ+AIWFQNRRARWKTKQLERDYEVLKK FE+LKADND+LQAQN+KL+AQ             L ALK +KD+   GE  
Subjt:  GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAA

Query:  GAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGIE
         AGGG T NLKKENERCWSSDNSCDINLDIS+TQA I  N G GGRAC +PGIKDLFP   FRS AITQL+Q GSSRSTVDHP QVI EESFSQMFNGIE
Subjt:  GAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP---FRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGIE

Query:  EQQQT--SAAGFWPWTSDQNSHFH
        EQQQT  +AAGFWPW SDQ+SHF+
Subjt:  EQQQT--SAAGFWPWTSDQNSHFH

SwissProt top hitse value%identityAlignment
A2XD08 Homeobox-leucine zipper protein HOX212.6e-4740.78Show/hide
Query:  HHSHSFIFQSRPADHHHEYVPSTSFNTIPSCP--------PHLYFHDGVVPLMMKRSMSFSGVENGCEEVN--GDEGLSDDGSALGEKKKRLNLEQVKAL
        HH H          HHH   P       P  P        P L    G+ P++ KR MS+     G +EVN  G++ LSDDGS  GEKK+RLN+EQV+ L
Subjt:  HHSHSFIFQSRPADHHHEYVPSTSFNTIPSCP--------PHLYFHDGVVPLMMKRSMSFSGVENGCEEVN--GDEGLSDDGSALGEKKKRLNLEQVKAL

Query:  EKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSA
        EK+FELGNKLEPERKMQLA+ALGLQPRQ+AIWFQNRRARWKTKQLE+DY+ LK+Q +A+KA+ND L   N KL A+             + ALK  +++A
Subjt:  EKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSA

Query:  GEGEAAGAGGGGTANLKKENERCWS--SDNSCDINLDISRT--------QATITT---NNGGGGRACSQPGIKDLFPFRSAA----------ITQLLQH-
         E            NL KE E   S  S+NS +INLDISRT         A  T    ++GGGG      G   + PF ++           I QLL   
Subjt:  GEGEAAGAGGGGTANLKKENERCWS--SDNSCDINLDISRT--------QATITT---NNGGGGRACSQPGIKDLFPFRSAA----------ITQLLQH-

Query:  --GSSRSTVDH-------PDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQNSHFH
          G+    ++H           +   SF  +  G++E        FWPW   Q  HFH
Subjt:  --GSSRSTVDH-------PDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQNSHFH

Q00466 Homeobox-leucine zipper protein HAT78.1e-5747.04Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVE----------------NGCEEVNGDEGLSDDGS--ALGEK
        MA P   H F+FQ    D+ H ++PS +  ++PSCPPHL F+ G    MM RSMSF+GV                 N  ++V  ++ LSDDGS   LGEK
Subjt:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVE----------------NGCEEVNGDEGLSDDGS--ALGEK

Query:  KKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAV
        KKRLNLEQV+ALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+ LKKQF+ LK+DND L A N KL+A+            
Subjt:  KKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAV

Query:  LLSALKISKDSAGEGEAAGAGGGGTANLKKE-NERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP--FRSAAITQLLQHGSSRSTVDH
         L ALK  K    E          +A +K+E  E  WS++ S + N            +N     A     IKDLFP   RSA  T    H      +DH
Subjt:  LLSALKISKDSAGEGEAAGAGGGGTANLKKE-NERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP--FRSAAITQLLQHGSSRSTVDH

Query:  PDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQNSH
              ++ F  MFNGI+E   T++A +W W   Q  H
Subjt:  PDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQNSH

Q8LAT0 Homeobox-leucine zipper protein ATHB-202.4e-5645.15Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDG--SALGEKKKRLNLEQVKALEKSF
        MA P   H F+FQ    D+        S + +PSCPPHL+  +G    MM RSMS   V+    +   +E LSDDG  + LGEKKKRL LEQVKALEKSF
Subjt:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDG--SALGEKKKRLNLEQVKALEKSF

Query:  ELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGE
        ELGNKLEPERK+QLAKALG+QPRQIAIWFQNRRARWKT+QLERDY+ LKKQFE+LK+DN  L A N KL               L   + +      EG 
Subjt:  ELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGE

Query:  AAGAGGGGTANLKKENERCW----SSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFPFRSAAITQLLQHGSSRSTVDHPD-----QVIPEESF
                   +K+E E  W    S++NS DINL++ R   T   N            IKDLFP             S RS+    D     +++ EES 
Subjt:  AAGAGGGGTANLKKENERCW----SSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFPFRSAAITQLLQHGSSRSTVDHPD-----QVIPEESF

Query:  SQMFNGIEEQQQTSAAGFWPWTSDQNSHFH
          MFNGI+E   T+ AG+W W+   ++H H
Subjt:  SQMFNGIEEQQQTSAAGFWPWTSDQNSHFH

Q8LC03 Homeobox-leucine zipper protein ATHB-131.5e-4743.53Show/hide
Query:  SFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRS--MSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFELGNKLEP
        +F+ Q+   D H    PS +   +PSC      H G    + KRS       +E G   +NG+E  SDDGS +GEKK+RLN+EQVK LEK+FELGNKLEP
Subjt:  SFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRS--MSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFELGNKLEP

Query:  ERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAAGAGGGG
        ERKMQLA+ALGLQPRQIAIWFQNRRARWKTKQLE+DY+ LK+QF+ LKA+NDLLQ  N KL A+             +  LK  + +             
Subjt:  ERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAAGAGGGG

Query:  TANLKKENERCWS--SDNSCD-INLDISRTQATITTNNGGGGRACSQPGIKDLFPFRSAAIT------QLLQHGSSRSTVDHPDQVIPEESFSQMFNGIE
        + NL KE E   S  SDNS D + LDIS    +  +   GG     Q   +  FP   A  T      Q  Q+ SS  ++     V  E S S MF  ++
Subjt:  TANLKKENERCWS--SDNSCD-INLDISRTQATITTNNGGGGRACSQPGIKDLFPFRSAAIT------QLLQHGSSRSTVDHPDQVIPEESFSQMFNGIE

Query:  EQQQTSAAGFWPWTSDQ
        +      +GFWPW   Q
Subjt:  EQQQTSAAGFWPWTSDQ

Q8S7W9 Homeobox-leucine zipper protein HOX212.0e-4740.66Show/hide
Query:  HHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPH--------------LYFHDGVVPLMMKRSMSFSGVENGCEEVN--GDEGLSDDGSALGEKKKRLNL
        HH H    Q +   HHH   P       P  PPH              L    G+ P++ KR MS+     G +EVN  G++ LSDDGS  GEKK+RLN+
Subjt:  HHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPH--------------LYFHDGVVPLMMKRSMSFSGVENGCEEVN--GDEGLSDDGSALGEKKKRLNL

Query:  EQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALK
        EQV+ LEK+FELGNKLEPERKMQLA+ALGLQPRQ+AIWFQNRRARWKTKQLE+DY+ LK+Q +A+KA+ND L   N KL A+             + ALK
Subjt:  EQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALK

Query:  ISKDSAGEGEAAGAGGGGTANLKKENERCWS--SDNSCDINLDISRT----QATITT-------NNGGGGRACSQPGIKDLFPFRSAA----------IT
          +++A E            NL KE E   S  S+NS +INLDISRT     A + T       ++GGGG      G   + PF ++           I 
Subjt:  ISKDSAGEGEAAGAGGGGTANLKKENERCWS--SDNSCDINLDISRT----QATITT-------NNGGGGRACSQPGIKDLFPFRSAA----------IT

Query:  QLLQH---GSSRSTVDH-------PDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQNSHFH
        QLL     G+    ++H           +   SF  +  G++E        FWPW   Q  HFH
Subjt:  QLLQH---GSSRSTVDH-------PDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQNSHFH

Arabidopsis top hitse value%identityAlignment
AT1G26960.1 homeobox protein 239.9e-4244.15Show/hide
Query:  KRSMSFSGVENGCE-EVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQ
        KRS   + V+  C  ++NGDE  SDDGS +GEKK+RLN+EQ+KALEK FELGNKLE +RK++LA+ALGLQPRQIAIWFQNRRAR KTKQLE+DY++LK+Q
Subjt:  KRSMSFSGVENGCE-EVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQ

Query:  FEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAAGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACS
        FE+L+ +N++LQ QN KL AQ             + ALK  +               + NL KE E    SD S +I+ DI                   
Subjt:  FEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAAGAGGGGTANLKKENERCWSSDNSCDINLDISRTQATITTNNGGGGRACS

Query:  QPGIKDLFPFRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQ
         P I   F       T  +Q   + S+      V  E S S MF GI++Q     +GFWPW   Q
Subjt:  QPGIKDLFPFRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQ

AT1G69780.1 Homeobox-leucine zipper protein family1.1e-4843.53Show/hide
Query:  SFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRS--MSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFELGNKLEP
        +F+ Q+   D H    PS +   +PSC      H G    + KRS       +E G   +NG+E  SDDGS +GEKK+RLN+EQVK LEK+FELGNKLEP
Subjt:  SFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRS--MSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFELGNKLEP

Query:  ERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAAGAGGGG
        ERKMQLA+ALGLQPRQIAIWFQNRRARWKTKQLE+DY+ LK+QF+ LKA+NDLLQ  N KL A+             +  LK  + +             
Subjt:  ERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAAGAGGGG

Query:  TANLKKENERCWS--SDNSCD-INLDISRTQATITTNNGGGGRACSQPGIKDLFPFRSAAIT------QLLQHGSSRSTVDHPDQVIPEESFSQMFNGIE
        + NL KE E   S  SDNS D + LDIS    +  +   GG     Q   +  FP   A  T      Q  Q+ SS  ++     V  E S S MF  ++
Subjt:  TANLKKENERCWS--SDNSCD-INLDISRTQATITTNNGGGGRACSQPGIKDLFPFRSAAIT------QLLQHGSSRSTVDHPDQVIPEESFSQMFNGIE

Query:  EQQQTSAAGFWPWTSDQ
        +      +GFWPW   Q
Subjt:  EQQQTSAAGFWPWTSDQ

AT3G01220.1 homeobox protein 201.7e-5745.15Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDG--SALGEKKKRLNLEQVKALEKSF
        MA P   H F+FQ    D+        S + +PSCPPHL+  +G    MM RSMS   V+    +   +E LSDDG  + LGEKKKRL LEQVKALEKSF
Subjt:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDG--SALGEKKKRLNLEQVKALEKSF

Query:  ELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGE
        ELGNKLEPERK+QLAKALG+QPRQIAIWFQNRRARWKT+QLERDY+ LKKQFE+LK+DN  L A N KL               L   + +      EG 
Subjt:  ELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGE

Query:  AAGAGGGGTANLKKENERCW----SSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFPFRSAAITQLLQHGSSRSTVDHPD-----QVIPEESF
                   +K+E E  W    S++NS DINL++ R   T   N            IKDLFP             S RS+    D     +++ EES 
Subjt:  AAGAGGGGTANLKKENERCW----SSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFPFRSAAITQLLQHGSSRSTVDHPD-----QVIPEESF

Query:  SQMFNGIEEQQQTSAAGFWPWTSDQNSHFH
          MFNGI+E   T+ AG+W W+   ++H H
Subjt:  SQMFNGIEEQQQTSAAGFWPWTSDQNSHFH

AT5G15150.1 homeobox 35.8e-5847.04Show/hide
Query:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVE----------------NGCEEVNGDEGLSDDGS--ALGEK
        MA P   H F+FQ    D+ H ++PS +  ++PSCPPHL F+ G    MM RSMSF+GV                 N  ++V  ++ LSDDGS   LGEK
Subjt:  MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVE----------------NGCEEVNGDEGLSDDGS--ALGEK

Query:  KKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAV
        KKRLNLEQV+ALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+ LKKQF+ LK+DND L A N KL+A+            
Subjt:  KKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAV

Query:  LLSALKISKDSAGEGEAAGAGGGGTANLKKE-NERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP--FRSAAITQLLQHGSSRSTVDH
         L ALK  K    E          +A +K+E  E  WS++ S + N            +N     A     IKDLFP   RSA  T    H      +DH
Subjt:  LLSALKISKDSAGEGEAAGAGGGGTANLKKE-NERCWSSDNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFP--FRSAAITQLLQHGSSRSTVDH

Query:  PDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQNSH
              ++ F  MFNGI+E   T++A +W W   Q  H
Subjt:  PDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQNSH

AT5G65310.1 homeobox protein 53.0e-3067Show/hide
Query:  GLSDDGSALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQ
        G+    S   EKK+RL +EQVKALEK+FE+ NKLEPERK++LA+ LGLQPRQ+AIWFQNRRARWKTKQLERDY VLK  F+ALK + D LQ  N  L  Q
Subjt:  GLSDDGSALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCCCTCATCATTCCCATAGCTTCATCTTCCAATCTCGCCCCGCCGATCATCACCATGAATACGTCCCTTCTACTTCCTTCAACACCATCCCTTCCTGTCCTCC
TCACCTCTACTTTCACGATGGGGTCGTTCCGCTGATGATGAAGAGATCGATGTCGTTTTCGGGAGTCGAAAACGGGTGCGAGGAAGTGAATGGAGACGAGGGATTGTCCG
ATGACGGATCGGCGTTGGGGGAGAAGAAGAAGCGTTTAAACTTAGAGCAAGTGAAGGCTTTGGAGAAGAGCTTTGAGCTGGGAAACAAGCTTGAGCCTGAGAGGAAAATG
CAGCTGGCCAAAGCTCTGGGGTTGCAGCCAAGACAGATTGCTATTTGGTTTCAGAACAGAAGGGCCAGGTGGAAGACGAAGCAATTGGAGAGAGATTACGAGGTCTTGAA
GAAACAGTTTGAAGCTCTAAAGGCCGACAATGATCTTCTTCAAGCTCAAAATACCAAACTCAATGCACAGAGTTGTTTTCACAACCCTTTCGATCTTTCCGCAGTTTTAT
TGTCTGCATTAAAAATTTCCAAAGACTCCGCAGGCGAGGGCGAGGCGGCAGGGGCAGGTGGCGGTGGCACGGCGAACCTAAAGAAAGAAAACGAACGCTGTTGGAGCAGC
GACAACAGTTGCGACATCAACCTGGACATCTCAAGAACACAAGCAACAATAACAACCAACAACGGCGGTGGAGGAAGAGCATGTTCCCAACCTGGAATCAAGGACCTGTT
CCCGTTTCGATCCGCCGCCATAACGCAGCTGCTTCAACACGGCTCGTCCAGATCAACGGTCGACCATCCTGATCAAGTGATTCCAGAAGAAAGCTTCTCTCAAATGTTCA
ATGGAATCGAAGAACAACAACAAACTTCAGCAGCTGGGTTTTGGCCATGGACTTCAGATCAAAATTCCCATTTTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCCCTCATCATTCCCATAGCTTCATCTTCCAATCTCGCCCCGCCGATCATCACCATGAATACGTCCCTTCTACTTCCTTCAACACCATCCCTTCCTGTCCTCC
TCACCTCTACTTTCACGATGGGGTCGTTCCGCTGATGATGAAGAGATCGATGTCGTTTTCGGGAGTCGAAAACGGGTGCGAGGAAGTGAATGGAGACGAGGGATTGTCCG
ATGACGGATCGGCGTTGGGGGAGAAGAAGAAGCGTTTAAACTTAGAGCAAGTGAAGGCTTTGGAGAAGAGCTTTGAGCTGGGAAACAAGCTTGAGCCTGAGAGGAAAATG
CAGCTGGCCAAAGCTCTGGGGTTGCAGCCAAGACAGATTGCTATTTGGTTTCAGAACAGAAGGGCCAGGTGGAAGACGAAGCAATTGGAGAGAGATTACGAGGTCTTGAA
GAAACAGTTTGAAGCTCTAAAGGCCGACAATGATCTTCTTCAAGCTCAAAATACCAAACTCAATGCACAGAGTTGTTTTCACAACCCTTTCGATCTTTCCGCAGTTTTAT
TGTCTGCATTAAAAATTTCCAAAGACTCCGCAGGCGAGGGCGAGGCGGCAGGGGCAGGTGGCGGTGGCACGGCGAACCTAAAGAAAGAAAACGAACGCTGTTGGAGCAGC
GACAACAGTTGCGACATCAACCTGGACATCTCAAGAACACAAGCAACAATAACAACCAACAACGGCGGTGGAGGAAGAGCATGTTCCCAACCTGGAATCAAGGACCTGTT
CCCGTTTCGATCCGCCGCCATAACGCAGCTGCTTCAACACGGCTCGTCCAGATCAACGGTCGACCATCCTGATCAAGTGATTCCAGAAGAAAGCTTCTCTCAAATGTTCA
ATGGAATCGAAGAACAACAACAAACTTCAGCAGCTGGGTTTTGGCCATGGACTTCAGATCAAAATTCCCATTTTCATTAA
Protein sequenceShow/hide protein sequence
MASPHHSHSFIFQSRPADHHHEYVPSTSFNTIPSCPPHLYFHDGVVPLMMKRSMSFSGVENGCEEVNGDEGLSDDGSALGEKKKRLNLEQVKALEKSFELGNKLEPERKM
QLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDLLQAQNTKLNAQSCFHNPFDLSAVLLSALKISKDSAGEGEAAGAGGGGTANLKKENERCWSS
DNSCDINLDISRTQATITTNNGGGGRACSQPGIKDLFPFRSAAITQLLQHGSSRSTVDHPDQVIPEESFSQMFNGIEEQQQTSAAGFWPWTSDQNSHFH