CuGenDBv2

Clc09G20040 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G20040
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: homeobox-leucine zipper protein ATHB-20-like
Genome location: ClcChr09:33646621..33649669
RNA-Seq Expression: Clc09G20040
Synteny: Clc09G20040
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000047 - Helix-turn-helix motif
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology
GenBank top hits (e value, % identity, alignment)
KAG6578593.1 Homeobox-leucine zipper protein HAT7, partial [Cucurbita argyrosperma subsp. sororia]; e value 1.7e-149; identity 86.92%
Query:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG
        MASPHHSHSF+FQSRPADHHEY+PSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSG+ENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG
Subjt:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG

Query:  NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGAT
        NKLEPERK+QLAKALGLQPRQ+AIWFQNRRARWKTKQLERDYEVLKK FE+LKADNDVLQAQNTKLHAE+              +LKTKDSGE AGGGAT
Subjt:  NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGAT

Query:  MNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT-
        MNLKKENERCWSSDNSCDINLDISKTQA IN   GG GGRAC +PG IKDLFPSAAFRS AITQL+Q GSSRSTVDHPQVIQEESFSQMFNGIEEQQQ+ 
Subjt:  MNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT-

Query:  ---AAAAGFWPWSSDQNSHFH
           AAAAGFWPW SDQNSHF+
Subjt:  ---AAAAGFWPWSSDQNSHFH

XP_004148689.1 homeobox-leucine zipper protein ATHB-20 [Cucumis sativus]; e value 3.9e-149; identity 88.82%
Query:  MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE
        MASP HHSHSFMFQSRPA DHHEY+PSASFN IPSCPPHLYFHDGVVPVMMKRSMSFS VENGCE+VNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE
Subjt:  MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE

Query:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE-AAGG
        +GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+              +LKTKDSGE A GG
Subjt:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE-AAGG

Query:  GATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ
        GATMNLKKENERCWSSDNSCDINLDIS TQ PI    GG GGR CSQPG+IKDLFPSAAFRSAAITQLLQHGSSRSTVD HPQVIQEESFSQMFNGIEEQ
Subjt:  GATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ

Query:  QQTAAAAGFWPWS-SDQNSHFH
        QQTAAAAGFWPWS SDQNSHFH
Subjt:  QQTAAAAGFWPWS-SDQNSHFH

XP_008459304.1 PREDICTED: homeobox-leucine zipper protein ATHB-20-like [Cucumis melo]; e value 2.9e-152; identity 90.03%
Query:  MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE
        MASP HHSHSFMFQSRPA DHHEYVPSASFN IPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCE+VNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE
Subjt:  MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE

Query:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE-AAGG
        +GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+              +LKTKDSGE   GG
Subjt:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE-AAGG

Query:  GATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ
        GATMNLKKENERCWSSDNSCDINLDIS TQ PI    GGGGGRACSQPG+IKDLFPSAAFRSAAITQLLQHGSSRSTVD HPQVIQEESFSQMFNGIEEQ
Subjt:  GATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ

Query:  QQTAAAAGFWPWSSDQNSHFH
        QQTAAAAGFWPWSSDQNSHFH
Subjt:  QQTAAAAGFWPWSSDQNSHFH

XP_022993652.1 homeobox-leucine zipper protein ATHB-20-like [Cucurbita maxima]; e value 1.5e-148; identity 86.79%
Query:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG
        MASPHHSHSF+FQSRPADHHEY+PSASFNAIPSCPPHLYFHDGV+PVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG
Subjt:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG

Query:  NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGAT
        NKLEPERK+QLAKALGLQPRQ+AIWFQNRRARWKTKQLERDYEVLKK FE+LKADNDVLQAQN+KLHA++              +LKTKD+GE AGGGAT
Subjt:  NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGAT

Query:  MNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT-
        MNLKKENERCWSSDNSCDINLDISKTQA IN   GG GGRAC +PG IKDLFPSAAFRS AITQL+Q GSSRSTVDHPQVIQEESFSQMFNGIEEQQQT 
Subjt:  MNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT-

Query:  AAAAGFWPWSSDQNSHFH
        AAAAGFWPW SDQ+SHF+
Subjt:  AAAAGFWPWSSDQNSHFH

XP_038890842.1 LOW QUALITY PROTEIN: homeobox-leucine zipper protein ATHB-20-like [Benincasa hispida]; e value 4.7e-155; identity 90.37%
Query:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG
        MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG
Subjt:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG

Query:  NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAA-GGGA
        NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+              +LKTKDSGE   GG A
Subjt:  NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAA-GGGA

Query:  TMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT
        TMNLKKENE CWSSDNSCDINLDISKTQA I G G GGGGR CSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT
Subjt:  TMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT

Query:  ----AAAAGFWPWSSDQNSHFH
            AAAAGFWPW+SDQNSHFH
Subjt:  ----AAAAGFWPWSSDQNSHFH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KS90 Homeobox domain-containing protein; e value 1.9e-149; identity 88.82%
Query:  MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE
        MASP HHSHSFMFQSRPA DHHEY+PSASFN IPSCPPHLYFHDGVVPVMMKRSMSFS VENGCE+VNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE
Subjt:  MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE

Query:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE-AAGG
        +GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+              +LKTKDSGE A GG
Subjt:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE-AAGG

Query:  GATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ
        GATMNLKKENERCWSSDNSCDINLDIS TQ PI    GG GGR CSQPG+IKDLFPSAAFRSAAITQLLQHGSSRSTVD HPQVIQEESFSQMFNGIEEQ
Subjt:  GATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ

Query:  QQTAAAAGFWPWS-SDQNSHFH
        QQTAAAAGFWPWS SDQNSHFH
Subjt:  QQTAAAAGFWPWS-SDQNSHFH

A0A1S3C9V4 homeobox-leucine zipper protein ATHB-20-like; e value 1.4e-152; identity 90.03%
Query:  MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE
        MASP HHSHSFMFQSRPA DHHEYVPSASFN IPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCE+VNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE
Subjt:  MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE

Query:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE-AAGG
        +GNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+              +LKTKDSGE   GG
Subjt:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE-AAGG

Query:  GATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ
        GATMNLKKENERCWSSDNSCDINLDIS TQ PI    GGGGGRACSQPG+IKDLFPSAAFRSAAITQLLQHGSSRSTVD HPQVIQEESFSQMFNGIEEQ
Subjt:  GATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ

Query:  QQTAAAAGFWPWSSDQNSHFH
        QQTAAAAGFWPWSSDQNSHFH
Subjt:  QQTAAAAGFWPWSSDQNSHFH

A0A5D3CVK8 Homeobox-leucine zipper protein ATHB-20-like; e value 3.0e-147; identity 89.68%
Query:  MFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKM
        MFQSRPA DHHEYVPSASFN IPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCE+VNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE+GNKLEPERKM
Subjt:  MFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKM

Query:  QLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE-AAGGGATMNLKKENE
        QLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+              +LKTKDSGE   GGGATMNLKKENE
Subjt:  QLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE-AAGGGATMNLKKENE

Query:  RCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQQQTAAAAGFWP
        RCWSSDNSCDINLDIS TQ PI    GG GGRACSQPG+IKDLFPSAAFRSAAITQLLQHGSSRSTVD HPQVIQEESFSQMFNGIEEQQQTAAAAGFWP
Subjt:  RCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQQQTAAAAGFWP

Query:  WSSDQNSHFH
        WSSDQNSHFH
Subjt:  WSSDQNSHFH

A0A6J1FFJ1 homeobox-leucine zipper protein ATHB-20-like; e value 1.2e-148; identity 86.29%
Query:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG
        MASPHHSHSF+FQSRPADHHEY+PSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSG+ENGCEE+NGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG
Subjt:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG

Query:  NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGAT
        NKLEPERK+QLAKALGLQPRQ+AIWFQNRRARWKTKQLERDYEVLKK FE+LKADNDVLQAQNTKLHAE+              +LKTKDSGE AGGGAT
Subjt:  NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGAT

Query:  MNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT-
        MNLKKENERCWSSDNSCDINLDISKTQA IN   GG GGRAC +PG IKDLFPSAAF S AITQL+Q GSSRSTVDHPQVIQEESFSQMFNGIEEQQQ+ 
Subjt:  MNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT-

Query:  ---AAAAGFWPWSSDQNSHFH
           AAAAGFWPW SDQNSHF+
Subjt:  ---AAAAGFWPWSSDQNSHFH

A0A6J1JTG1 homeobox-leucine zipper protein ATHB-20-like; e value 7.2e-149; identity 86.79%
Query:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG
        MASPHHSHSF+FQSRPADHHEY+PSASFNAIPSCPPHLYFHDGV+PVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG
Subjt:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELG

Query:  NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGAT
        NKLEPERK+QLAKALGLQPRQ+AIWFQNRRARWKTKQLERDYEVLKK FE+LKADNDVLQAQN+KLHA++              +LKTKD+GE AGGGAT
Subjt:  NKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGAT

Query:  MNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT-
        MNLKKENERCWSSDNSCDINLDISKTQA IN   GG GGRAC +PG IKDLFPSAAFRS AITQL+Q GSSRSTVDHPQVIQEESFSQMFNGIEEQQQT 
Subjt:  MNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT-

Query:  AAAAGFWPWSSDQNSHFH
        AAAAGFWPW SDQ+SHF+
Subjt:  AAAAGFWPWSSDQNSHFH

SwissProt top hits (e value, % identity, alignment)
A2XD08 Homeobox-leucine zipper protein HOX21; e value 2.5e-50; identity 41.27%
Query:  PADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVN--GDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAK
        P  H+ ++PS++      CP    F  G+ P++ KR MS+     G +EVN  G++ LSDDG   GEKK+RLN+EQV+ LEK+FELGNKLEPERKMQLA+
Subjt:  PADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVN--GDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAK

Query:  ALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKENERCWS-
        ALGLQPRQ+AIWFQNRRARWKTKQLE+DY+ LK+Q +A+KA+ND L   N KL AE+  +     ++ L                  +NL KE E   S 
Subjt:  ALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKENERCWS-

Query:  -SDNSCDINLDISKTQAP--------------INGSGGGGGG------------RACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEES
         S+NS +INLDIS+T  P               +G GGGGGG            R  S  G+  D    ++   A   ++  HG   +       +   S
Subjt:  -SDNSCDINLDISKTQAP--------------INGSGGGGGG------------RACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEES

Query:  FSQMFNGIEEQQQTAAAAGFWPWSSDQNSHFH
        F  +  G++E         FWPW   Q  HFH
Subjt:  FSQMFNGIEEQQQTAAAAGFWPWSSDQNSHFH

Q00466 Homeobox-leucine zipper protein HAT7; e value 1.6e-60; identity 47.32%
Query:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVE----------------NGCEEVNGDEGLSDDG--LALGEKK
        MA P   H FMFQ    D+  ++PS +  ++PSCPPHL F+ G    MM RSMSF+GV                 N  ++V  ++ LSDDG  + LGEKK
Subjt:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVE----------------NGCEEVNGDEGLSDDG--LALGEKK

Query:  KRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVL
        KRLNLEQV+ALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+ LKKQF+ LK+DND L A N KLHAE+            
Subjt:  KRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVL

Query:  DYSLKTKDSGEAAGGGATMNLKKE-NERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVI
          +LK  D  E+A       +K+E  E  WS++ S + N + + + A              +   +IKDLFPS + RSA  T    H      +DH Q++
Subjt:  DYSLKTKDSGEAAGGGATMNLKKE-NERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVI

Query:  --QEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSH
          Q++ F  MFNGI+E      +A +W W   Q  H
Subjt:  --QEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSH

Q8LAT0 Homeobox-leucine zipper protein ATHB-20; e value 2.1e-57; identity 47.37%
Query:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDG--LALGEKKKRLNLEQVKALEKSFE
        MA P   H FMFQ    D+       S + +PSCPPHL+  +G    MM RSMS   V+    +   +E LSDDG    LGEKKKRL LEQVKALEKSFE
Subjt:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDG--LALGEKKKRLNLEQVKALEKSFE

Query:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGG
        LGNKLEPERK+QLAKALG+QPRQIAIWFQNRRARWKT+QLERDY+ LKKQFE+LK+DN  L A N KL AEV              +LK K+  E     
Subjt:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGG

Query:  ATMNLKKENERCW----SSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIE
            +K+E E  W    S++NS DINL++ +                 +    IKDLFPS + RS+A      H        + +++QEES   MFNGI+
Subjt:  ATMNLKKENERCW----SSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIE

Query:  EQQQTAAAAGFWPWSSDQNSHFH
        E       AG+W WS   ++H H
Subjt:  EQQQTAAAAGFWPWSSDQNSHFH

Q8LC03 Homeobox-leucine zipper protein ATHB-13; e value 3.6e-49; identity 44.66%
Query:  SFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRS--MSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPE
        +FM Q+   D H +   +    +PSC      H G    + KRS       +E G   +NG+E  SDDG  +GEKK+RLN+EQVK LEK+FELGNKLEPE
Subjt:  SFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRS--MSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPE

Query:  RKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKE
        RKMQLA+ALGLQPRQIAIWFQNRRARWKTKQLE+DY+ LK+QF+ LKA+ND+LQ  N KL AE+               LK ++  E      ++NL KE
Subjt:  RKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKE

Query:  NERCWS--SDNSCD-INLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAA
         E   S  SDNS D + LDIS T  P N S   GG     Q  + +  FP +   +   T  +Q   + S+     V +E S S MF  +++       +
Subjt:  NERCWS--SDNSCD-INLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAA

Query:  GFWPWSSDQ
        GFWPW   Q
Subjt:  GFWPWSSDQ

Q8S7W9 Homeobox-leucine zipper protein HOX21; e value 2.5e-50; identity 40.17%
Query:  HHSHSFMFQSRPADHHEYVPSASFNAIPSCP--------PHLYFHDGVVPVMMKRSMSFSGVENGCEEVN--GDEGLSDDGLALGEKKKRLNLEQVKALE
        HH H    Q +   HH   P       P  P        P L    G+ P++ KR MS+     G +EVN  G++ LSDDG   GEKK+RLN+EQV+ LE
Subjt:  HHSHSFMFQSRPADHHEYVPSASFNAIPSCP--------PHLYFHDGVVPVMMKRSMSFSGVENGCEEVN--GDEGLSDDGLALGEKKKRLNLEQVKALE

Query:  KSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEA
        K+FELGNKLEPERKMQLA+ALGLQPRQ+AIWFQNRRARWKTKQLE+DY+ LK+Q +A+KA+ND L   N KL AE+  +     ++ L            
Subjt:  KSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEA

Query:  AGGGATMNLKKENERCWS--SDNSCDINLDISKTQAP--------------INGSGGGGGG------------RACSQPGIIKDLFPSAAFRSAAITQLL
              +NL KE E   S  S+NS +INLDIS+T  P               +G GGGGGG            R  S  G+  D    ++   A   ++ 
Subjt:  AGGGATMNLKKENERCWS--SDNSCDINLDISKTQAP--------------INGSGGGGGG------------RACSQPGIIKDLFPSAAFRSAAITQLL

Query:  QHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSHFH
         HG   +       +   SF  +  G++E         FWPW   Q  HFH
Subjt:  QHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSHFH

Arabidopsis top hits (e value, % identity, alignment)
AT1G26960.1 homeobox protein 23; e value 6.8e-43; identity 44.7%
Query:  KRSMSFSGVENGCE-EVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQ
        KRS   + V+  C  ++NGDE  SDDG  +GEKK+RLN+EQ+KALEK FELGNKLE +RK++LA+ALGLQPRQIAIWFQNRRAR KTKQLE+DY++LK+Q
Subjt:  KRSMSFSGVENGCE-EVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQ

Query:  FEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGII
        FE+L+ +N+VLQ QN KL A+V              +LK+++  E      ++NL KE E    SD S +I+ DI                    +P  I
Subjt:  FEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGII

Query:  KDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAAGFWPWSSDQ
           F      +    Q  Q+ SS    +   V +E S S MF GI++Q      +GFWPW   Q
Subjt:  KDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAAGFWPWSSDQ

AT1G69780.1 Homeobox-leucine zipper protein family; e value 2.6e-50; identity 44.66%
Query:  SFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRS--MSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPE
        +FM Q+   D H +   +    +PSC      H G    + KRS       +E G   +NG+E  SDDG  +GEKK+RLN+EQVK LEK+FELGNKLEPE
Subjt:  SFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRS--MSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPE

Query:  RKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKE
        RKMQLA+ALGLQPRQIAIWFQNRRARWKTKQLE+DY+ LK+QF+ LKA+ND+LQ  N KL AE+               LK ++  E      ++NL KE
Subjt:  RKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKE

Query:  NERCWS--SDNSCD-INLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAA
         E   S  SDNS D + LDIS T  P N S   GG     Q  + +  FP +   +   T  +Q   + S+     V +E S S MF  +++       +
Subjt:  NERCWS--SDNSCD-INLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAA

Query:  GFWPWSSDQ
        GFWPW   Q
Subjt:  GFWPWSSDQ

AT3G01220.1 homeobox protein 20; e value 1.5e-58; identity 47.37%
Query:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDG--LALGEKKKRLNLEQVKALEKSFE
        MA P   H FMFQ    D+       S + +PSCPPHL+  +G    MM RSMS   V+    +   +E LSDDG    LGEKKKRL LEQVKALEKSFE
Subjt:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDG--LALGEKKKRLNLEQVKALEKSFE

Query:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGG
        LGNKLEPERK+QLAKALG+QPRQIAIWFQNRRARWKT+QLERDY+ LKKQFE+LK+DN  L A N KL AEV              +LK K+  E     
Subjt:  LGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGG

Query:  ATMNLKKENERCW----SSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIE
            +K+E E  W    S++NS DINL++ +                 +    IKDLFPS + RS+A      H        + +++QEES   MFNGI+
Subjt:  ATMNLKKENERCW----SSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIE

Query:  EQQQTAAAAGFWPWSSDQNSHFH
        E       AG+W WS   ++H H
Subjt:  EQQQTAAAAGFWPWSSDQNSHFH

AT5G15150.1 homeobox 3; e value 1.1e-61; identity 47.32%
Query:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVE----------------NGCEEVNGDEGLSDDG--LALGEKK
        MA P   H FMFQ    D+  ++PS +  ++PSCPPHL F+ G    MM RSMSF+GV                 N  ++V  ++ LSDDG  + LGEKK
Subjt:  MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVE----------------NGCEEVNGDEGLSDDG--LALGEKK

Query:  KRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVL
        KRLNLEQV+ALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+ LKKQF+ LK+DND L A N KLHAE+            
Subjt:  KRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVL

Query:  DYSLKTKDSGEAAGGGATMNLKKE-NERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVI
          +LK  D  E+A       +K+E  E  WS++ S + N + + + A              +   +IKDLFPS + RSA  T    H      +DH Q++
Subjt:  DYSLKTKDSGEAAGGGATMNLKKE-NERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVI

Query:  --QEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSH
          Q++ F  MFNGI+E      +A +W W   Q  H
Subjt:  --QEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSH

AT5G65310.1 homeobox protein 5; e value 8.6e-30; identity 63.73%
Query:  GLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE
        G+        EKK+RL +EQVKALEK+FE+ NKLEPERK++LA+ LGLQPRQ+AIWFQNRRARWKTKQLERDY VLK  F+ALK + D LQ  N  L  +
Subjt:  GLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE

Query:  VE
        ++
Subjt:  VE


Sequences
CDS sequence
ATGGCTTCCCCTCACCATTCCCATAGCTTCATGTTCCAATCCCGCCCCGCCGATCACCACGAATACGTCCCCTCCGCTTCCTTCAACGCCATTCCCTCCTGCCCTCCTCA
CCTCTACTTTCACGATGGAGTGGTTCCAGTGATGATGAAGAGATCGATGTCGTTTTCGGGAGTCGAAAACGGGTGCGAGGAAGTGAATGGCGACGAGGGGTTATCAGACG
ATGGATTGGCATTGGGAGAGAAGAAGAAGCGATTAAATTTAGAGCAAGTGAAGGCTTTGGAGAAGAGCTTTGAGTTAGGGAATAAGCTTGAGCCAGAGAGGAAAATGCAG
CTAGCCAAAGCTTTGGGGTTGCAGCCAAGACAGATTGCTATTTGGTTTCAGAATAGAAGGGCCAGATGGAAAACCAAGCAGTTGGAGAGAGATTATGAAGTGTTGAAGAA
ACAGTTTGAAGCTCTTAAAGCTGACAATGATGTACTTCAAGCTCAAAATACCAAACTCCATGCAGAGGTTGAATTTATAAATAATTCCAACAGATCAACTGTTTTAGATT
ATTCATTAAAAACCAAAGACTCCGGCGAGGCAGCGGGGGGCGGCGCCACCATGAACCTAAAGAAAGAAAACGAACGCTGTTGGAGCAGCGACAACAGTTGCGACATTAAT
CTCGACATCTCAAAGACACAAGCACCAATAAACGGCAGCGGCGGTGGTGGCGGAGGTAGAGCATGTTCTCAACCAGGAATAATCAAAGATCTTTTCCCATCGGCGGCGTT
CCGATCCGCCGCCATAACGCAGCTGCTTCAACACGGGTCGTCCAGATCAACGGTGGACCATCCTCAAGTGATTCAAGAAGAAAGCTTCTCTCAAATGTTCAATGGCATTG
AAGAACAACAACAAACTGCAGCAGCAGCTGGGTTTTGGCCATGGAGTTCAGATCAAAATTCCCATTTTCATTAA
mRNA sequence
TCTCTCTTTACCTAAAATTCGCTGTCTTTCAGACTTCTCTCTCTCTTTGCAAAAATCCATCTCTTCTCACCATGAGAGTGAGAGCCACACCACTCTTTCAACCATTTCTC
TCTTCTTCTTCTTCTTCTTCTTTAATCAAATTCTTCTTCTATTTCTTCTCATCTTTCTCTCTTTGATCTCAATCCAAATCCAACACACAATATTGAGATCTAGCATGCAT
TGCCATTCCTATGGCTTCCCCTCACCATTCCCATAGCTTCATGTTCCAATCCCGCCCCGCCGATCACCACGAATACGTCCCCTCCGCTTCCTTCAACGCCATTCCCTCCT
GCCCTCCTCACCTCTACTTTCACGATGGAGTGGTTCCAGTGATGATGAAGAGATCGATGTCGTTTTCGGGAGTCGAAAACGGGTGCGAGGAAGTGAATGGCGACGAGGGG
TTATCAGACGATGGATTGGCATTGGGAGAGAAGAAGAAGCGATTAAATTTAGAGCAAGTGAAGGCTTTGGAGAAGAGCTTTGAGTTAGGGAATAAGCTTGAGCCAGAGAG
GAAAATGCAGCTAGCCAAAGCTTTGGGGTTGCAGCCAAGACAGATTGCTATTTGGTTTCAGAATAGAAGGGCCAGATGGAAAACCAAGCAGTTGGAGAGAGATTATGAAG
TGTTGAAGAAACAGTTTGAAGCTCTTAAAGCTGACAATGATGTACTTCAAGCTCAAAATACCAAACTCCATGCAGAGGTTGAATTTATAAATAATTCCAACAGATCAACT
GTTTTAGATTATTCATTAAAAACCAAAGACTCCGGCGAGGCAGCGGGGGGCGGCGCCACCATGAACCTAAAGAAAGAAAACGAACGCTGTTGGAGCAGCGACAACAGTTG
CGACATTAATCTCGACATCTCAAAGACACAAGCACCAATAAACGGCAGCGGCGGTGGTGGCGGAGGTAGAGCATGTTCTCAACCAGGAATAATCAAAGATCTTTTCCCAT
CGGCGGCGTTCCGATCCGCCGCCATAACGCAGCTGCTTCAACACGGGTCGTCCAGATCAACGGTGGACCATCCTCAAGTGATTCAAGAAGAAAGCTTCTCTCAAATGTTC
AATGGCATTGAAGAACAACAACAAACTGCAGCAGCAGCTGGGTTTTGGCCATGGAGTTCAGATCAAAATTCCCATTTTCATTAA
Protein sequence
MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQ
LAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKENERCWSSDNSCDIN
LDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSHFH