; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC10G190220 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC10G190220
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
Descriptionhomeobox-leucine zipper protein ATHB-6-like
Genome locationCmU531Chr10:17339243..17340891
RNA-Seq ExpressionCmUC10G190220
SyntenyCmUC10G190220
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000047 - Helix-turn-helix motif
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031308.1 Homeobox-leucine zipper protein ATHB-6 [Cucurbita argyrosperma subsp. argyrosperma]1.3e-12574.29Show/hide
Query:  TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE
        TD EQSPRN   N V GTEFQSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQVAVWFQNRRARWKTKQLE
Subjt:  TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE

Query:  RDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQ
        RDYGVLKTNY+NLKL+YETLQ+DNQALLK                                  +I+ELK KLQEDNSESNLSVEEE  V  DSENALIEQ
Subjt:  RDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQ

Query:  TKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK---
         KPEI DQFSVP A+ESQDFN+ S +NNGGEG     EEVSLF DFKDGSSDSDSSAILNEDY PT  ISSP VLQ H   HFMTGA SP+PS  VK   
Subjt:  TKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK---

Query:  ----LNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
            LNYLQ+QKGYQ QTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  ----LNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

XP_022136962.1 homeobox-leucine zipper protein ATHB-6-like [Momordica charantia]1.1e-12772.8Show/hide
Query:  DSTNSFPILC--TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ
        DS  +   +C  +D EQSPRN   NHVYG EFQSMLDGF+EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ
Subjt:  DSTNSFPILC--TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ

Query:  NRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMV
        NRRARWKTKQLERDYGVLKTNYE LKL+YETLQ DN ALLK                                  +IRELKSKLQEDNSESN+SVEEEMV
Subjt:  NRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMV

Query:  V-ATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGA
        + A DSENALIE+T+PE GD FSVPPA+E +D N+ESFNNNGGEGEE   EE SLF DFKDGSSDSDSSAILNEDYS TA ISSPGVLQN Q YHFM  +
Subjt:  V-ATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGA

Query:  VSPAPSAAVK-------LNYLQFQKGYQQQTQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWW
         SP+PSAAVK       LNY QFQK Y QQTQ++PKMEEHNFF+GEE  CNFFS+EQAP+LHWW
Subjt:  VSPAPSAAVK-------LNYLQFQKGYQQQTQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWW

XP_022999792.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita maxima]1.0e-12574.5Show/hide
Query:  DHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLER
        D EQSPRN   N V GTEFQSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQVAVWFQNRRARWKTKQLER
Subjt:  DHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLER

Query:  DYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQT
        DYGVLKTNY+NLKL+YETLQ+DNQALLK                                  +I+ELK KLQEDNS+SNLSVEEEM V  DSENALIEQ 
Subjt:  DYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQT

Query:  KPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK----
        KPEI DQFSVP A+ESQDFN+ES +NNGGEG     EEVSLF DFKDGSSDSDSSAILNEDY PT  ISS  VLQ H   HFMTGA SP+PS  VK    
Subjt:  KPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK----

Query:  ---LNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
           LNYLQ+QKGYQQQTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  ---LNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

XP_023547219.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita pepo subsp. pepo]1.2e-12674.57Show/hide
Query:  TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE
        TD EQSPRN   N V GTEFQSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQVAVWFQNRRARWKTKQLE
Subjt:  TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE

Query:  RDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQ
        RDYGVLKTNY+NLKL+YETLQ+DNQALLK                                  +I+ELK KLQEDNSESNLSVEEEM V  DSENALIEQ
Subjt:  RDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQ

Query:  TKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK---
         KPEI DQFSVP A+E+QDFN+ES ++NGGEG     EEVSLF DFKDGSSDSDSSAILNEDY PT  ISSP VLQ H   HFMTGA SP+PS  VK   
Subjt:  TKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK---

Query:  ----LNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
            LNYLQ+QKGYQQQTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  ----LNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

XP_038905018.1 homeobox-leucine zipper protein ATHB-6-like [Benincasa hispida]2.9e-15282.02Show/hide
Query:  DSTNSFPILC--TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ
        DS  +   +C  +DHEQSPRNKNSNHVY TEFQSMLDGF+EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ
Subjt:  DSTNSFPILC--TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ

Query:  NRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMV
        NRRARWKTKQLERDYGVLKTNYENLKLS+ETLQNDNQ LLK                                  QIRELK+KLQEDNSESNLSVEEEMV
Subjt:  NRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMV

Query:  VATDSENALIEQTKPEI-GDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGA
        VA +SENA IEQTKPEI GDQFSVPPASESQDFN+ESFN+NGGEGEEA +EE +LF DFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYH MTGA
Subjt:  VATDSENALIEQTKPEI-GDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGA

Query:  VSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
        +SP PSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  VSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

TrEMBL top hitse value%identityAlignment
A0A6J1C6W4 homeobox-leucine zipper protein ATHB-6-like5.3e-12872.8Show/hide
Query:  DSTNSFPILC--TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ
        DS  +   +C  +D EQSPRN   NHVYG EFQSMLDGF+EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ
Subjt:  DSTNSFPILC--TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ

Query:  NRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMV
        NRRARWKTKQLERDYGVLKTNYE LKL+YETLQ DN ALLK                                  +IRELKSKLQEDNSESN+SVEEEMV
Subjt:  NRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMV

Query:  V-ATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGA
        + A DSENALIE+T+PE GD FSVPPA+E +D N+ESFNNNGGEGEE   EE SLF DFKDGSSDSDSSAILNEDYS TA ISSPGVLQN Q YHFM  +
Subjt:  V-ATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGA

Query:  VSPAPSAAVK-------LNYLQFQKGYQQQTQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWW
         SP+PSAAVK       LNY QFQK Y QQTQ++PKMEEHNFF+GEE  CNFFS+EQAP+LHWW
Subjt:  VSPAPSAAVK-------LNYLQFQKGYQQQTQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWW

A0A6J1ENE6 homeobox-leucine zipper protein ATHB-6-like7.0e-11267.51Show/hide
Query:  TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE
        +D EQSPRNKNSNHVY  EFQ MLDGF+E    EE GHVSEKKRRL VEQVKALEKNFEVENKLEPERK+KLA+ELGLQPRQVAVWFQNRRARWKTKQLE
Subjt:  TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE

Query:  RDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQ
        RDYGVLKTNY+NLKLS+E LQNDNQALLK                                  +IRELK+K+QEDNS        EM+V  DSENALIEQ
Subjt:  RDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQ

Query:  TKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP-APS------
        TKPEI D FSVPPA         SFNNNGGEG+E            KDGSSDSDSSAILNEDYSPTAG+SSPGVLQN+   HFMTGA+ P +PS      
Subjt:  TKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP-APS------

Query:  ----AAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
            A   LNYLQFQKGYQQ   MFPKMEEHNFF GEEACNFFSDEQAPTLHWW
Subjt:  ----AAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

A0A6J1FNC9 homeobox-leucine zipper protein ATHB-6-like7.2e-12573.93Show/hide
Query:  DHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLER
        D EQSPRN   N V GTEFQSMLDGF E+G VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQVAVWFQNRRARWKTKQLER
Subjt:  DHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLER

Query:  DYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQT
        DYGVLKTNY+NLKL+YETLQ+DNQALLK                                  +I+ELK KLQEDNSESNLSVEEE  V  DSENALIEQ 
Subjt:  DYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQT

Query:  KPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK----
        KPEI DQFSVP A+ESQDFN+ S +NNGGEG     EEVSLF DFKDGSSDSDSSAILNEDY PT  ISSP VLQ H   HFMTGA SP+PS  VK    
Subjt:  KPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK----

Query:  ---LNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
           LNYLQ+QKGYQQQ+QMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  ---LNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

A0A6J1JAW1 homeobox-leucine zipper protein ATHB-6-like1.7e-11066.67Show/hide
Query:  TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE
        +D EQSPRNKNSNHVY  EFQ MLDGF+E    EE GHVSEKKRRL VEQVK+LEKNFEVENKLEPERK+KLA+ELGLQPRQVAVWFQNRRARWKTKQLE
Subjt:  TDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE

Query:  RDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQ
        RDYGVLKTNY+NLKLS+E LQNDNQALLK                                  +IRELK+K+QEDNS        EM+   DSENALIEQ
Subjt:  RDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQ

Query:  TKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP-APS------
        TKPEI D FSVPPA         SFNNNGGEG+E            KDGSSDSDSSAILNEDYSPTAG+SSPGVLQN+   HFMTG + P +PS      
Subjt:  TKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP-APS------

Query:  ----AAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
            A   LNYLQFQKGYQQ   MFPKMEEHNFF GEEACNFFSDEQAPTLHWW
Subjt:  ----AAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

A0A6J1KBS4 homeobox-leucine zipper protein ATHB-6-like5.0e-12674.5Show/hide
Query:  DHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLER
        D EQSPRN   N V GTEFQSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQVAVWFQNRRARWKTKQLER
Subjt:  DHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLER

Query:  DYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQT
        DYGVLKTNY+NLKL+YETLQ+DNQALLK                                  +I+ELK KLQEDNS+SNLSVEEEM V  DSENALIEQ 
Subjt:  DYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQT

Query:  KPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK----
        KPEI DQFSVP A+ESQDFN+ES +NNGGEG     EEVSLF DFKDGSSDSDSSAILNEDY PT  ISS  VLQ H   HFMTGA SP+PS  VK    
Subjt:  KPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK----

Query:  ---LNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
           LNYLQ+QKGYQQQTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  ---LNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

SwissProt top hitse value%identityAlignment
P46667 Homeobox-leucine zipper protein ATHB-56.6e-4338.92Show/hide
Query:  TDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRR
        TD + SPR   +  +Y    ++  M D  E++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKVKLA+ELGLQPRQVA+WFQNRR
Subjt:  TDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRR

Query:  ARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEE--MVV
        ARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL Q+K                                  ELK+KL   N E    +EE   +  
Subjt:  ARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEE--MVV

Query:  ATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMT
           +++ +      E+  +   PP     D              E A E  S+F    +F+D  +D SDSSA+LNE+YSP                    
Subjt:  ATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMT

Query:  GAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ
          V  A + A     +     + Q    F KMEEH + FSGEEAC  F+D +
Subjt:  GAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ

P46668 Homeobox-leucine zipper protein ATHB-61.0e-5141.96Show/hide
Query:  DSTNSFPILC---TDHEQSPRNKNSNHVYGTEFQSMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQV
        DS      LC   +  EQSPR        G EFQSML+G+  EEE  VEE GHV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+ELGLQPRQV
Subjt:  DSTNSFPILC---TDHEQSPRNKNSNHVYGTEFQSMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQV

Query:  AVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKL-----QEDNSE
        AVWFQNRRARWKTKQLE+DYGVLKT Y++L+ ++++L+ DN++LL+                                  +I +LK+KL     +E+  E
Subjt:  AVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKL-----QEDNSE

Query:  SNLSVEEEMVVATDSENALIEQTKPEIGDQFSVPP--ASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQ
        +N +V  E  ++   E   + +   +I +  S PP     S   N+ SF +        A    S FA     S  SDSSA+LNE+ S    +++P  + 
Subjt:  SNLSVEEEMVVATDSENALIEQTKPEIGDQFSVPP--ASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQ

Query:  NHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
                              N+ QF K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+
Subjt:  NHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

Q6K498 Homeobox-leucine zipper protein HOX46.2e-3367.57Show/hide
Query:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ
        G E EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +Y++L+L ++ L+
Subjt:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ

Query:  NDNQALLKQVK
         D  ALL ++K
Subjt:  NDNQALLKQVK

Q940J1 Homeobox-leucine zipper protein ATHB-163.2e-4540.66Show/hide
Query:  EQSPRNKNSNHVYGTEFQSMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKT
        EQSPR       YG+ +QSML+G++E+  +  E SG+     +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA+ELGLQPRQVAVWFQNRRARWKT
Subjt:  EQSPRNKNSNHVYGTEFQSMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKT

Query:  KQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKL--QEDNSESNL---SVEEEMVVAT
        KQLE+DYGVLK  Y++L+ ++++L+ DN +LL+                                  +I ++K+K+  +EDN+ +      V+EE V  T
Subjt:  KQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKL--QEDNSESNL---SVEEEMVVAT

Query:  DSENALIEQTKPEIGDQFSVPPASESQDFNHES-FNNNGGEGEEAAIEEVSLFADFKD----------GSSDS-DSSAILNEDYSPTAGISSPGVLQNHQ
        DS                   P+S  Q   H S FN                F D +D          GSSDS DSSA+LN++ S   G  +P V     
Subjt:  DSENALIEQTKPEIGDQFSVPPASESQDFNHES-FNNNGGEGEEAAIEEVSLFADFKD----------GSSDS-DSSAILNEDYSPTAGISSPGVLQNHQ

Query:  QYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
            +TG            ++LQF K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+
Subjt:  QYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

Q9XH37 Homeobox-leucine zipper protein HOX46.2e-3367.57Show/hide
Query:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ
        G E EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +Y++L+L ++ L+
Subjt:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ

Query:  NDNQALLKQVK
         D  ALL ++K
Subjt:  NDNQALLKQVK

Arabidopsis top hitse value%identityAlignment
AT1G69780.1 Homeobox-leucine zipper protein family3.6e-2839.27Show/hide
Query:  EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALL
        EE   ++   + EKKRRL++EQVK LEKNFE+ NKLEPERK++LAR LGLQPRQ+A+WFQNRRARWKTKQLE+DY  LK  ++ LK   + LQ  NQ L 
Subjt:  EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALL

Query:  KQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQTKP----EIGDQFSVP-PASESQDFNHE
         ++                + ++N    E++ L  +  E     + DNS  NL +  ++  A  S ++ +    P     +G  F  P PA+ +      
Subjt:  KQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQTKP----EIGDQFSVP-PASESQDFNHE

Query:  SFNNNGGEGEEAAIEEVSL
         F  N   G+    EE S+
Subjt:  SFNNNGGEGEEAAIEEVSL

AT2G22430.1 homeobox protein 67.2e-5341.96Show/hide
Query:  DSTNSFPILC---TDHEQSPRNKNSNHVYGTEFQSMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQV
        DS      LC   +  EQSPR        G EFQSML+G+  EEE  VEE GHV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+ELGLQPRQV
Subjt:  DSTNSFPILC---TDHEQSPRNKNSNHVYGTEFQSMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQV

Query:  AVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKL-----QEDNSE
        AVWFQNRRARWKTKQLE+DYGVLKT Y++L+ ++++L+ DN++LL+                                  +I +LK+KL     +E+  E
Subjt:  AVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKL-----QEDNSE

Query:  SNLSVEEEMVVATDSENALIEQTKPEIGDQFSVPP--ASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQ
        +N +V  E  ++   E   + +   +I +  S PP     S   N+ SF +        A    S FA     S  SDSSA+LNE+ S    +++P  + 
Subjt:  SNLSVEEEMVVATDSENALIEQTKPEIGDQFSVPP--ASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQ

Query:  NHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
                              N+ QF K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+
Subjt:  NHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

AT4G40060.1 homeobox protein 162.3e-4640.66Show/hide
Query:  EQSPRNKNSNHVYGTEFQSMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKT
        EQSPR       YG+ +QSML+G++E+  +  E SG+     +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA+ELGLQPRQVAVWFQNRRARWKT
Subjt:  EQSPRNKNSNHVYGTEFQSMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKT

Query:  KQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKL--QEDNSESNL---SVEEEMVVAT
        KQLE+DYGVLK  Y++L+ ++++L+ DN +LL+                                  +I ++K+K+  +EDN+ +      V+EE V  T
Subjt:  KQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKL--QEDNSESNL---SVEEEMVVAT

Query:  DSENALIEQTKPEIGDQFSVPPASESQDFNHES-FNNNGGEGEEAAIEEVSLFADFKD----------GSSDS-DSSAILNEDYSPTAGISSPGVLQNHQ
        DS                   P+S  Q   H S FN                F D +D          GSSDS DSSA+LN++ S   G  +P V     
Subjt:  DSENALIEQTKPEIGDQFSVPPASESQDFNHES-FNNNGGEGEEAAIEEVSLFADFKD----------GSSDS-DSSAILNEDYSPTAGISSPGVLQNHQ

Query:  QYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
            +TG            ++LQF K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+
Subjt:  QYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

AT5G65310.1 homeobox protein 54.7e-4438.92Show/hide
Query:  TDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRR
        TD + SPR   +  +Y    ++  M D  E++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKVKLA+ELGLQPRQVA+WFQNRR
Subjt:  TDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRR

Query:  ARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEE--MVV
        ARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL Q+K                                  ELK+KL   N E    +EE   +  
Subjt:  ARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEE--MVV

Query:  ATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMT
           +++ +      E+  +   PP     D              E A E  S+F    +F+D  +D SDSSA+LNE+YSP                    
Subjt:  ATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMT

Query:  GAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ
          V  A + A     +     + Q    F KMEEH + FSGEEAC  F+D +
Subjt:  GAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ

AT5G65310.2 homeobox protein 52.1e-4438.59Show/hide
Query:  ILCTDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ
        ++ TD + SPR   +  +Y    ++  M D  E++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKVKLA+ELGLQPRQVA+WFQ
Subjt:  ILCTDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ

Query:  NRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEE--
        NRRARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL Q+K                                  ELK+KL   N E    +EE   
Subjt:  NRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRNLVIMETVKLCLQIRELKSKLQEDNSESNLSVEEE--

Query:  MVVATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYH
        +     +++ +      E+  +   PP     D              E A E  S+F    +F+D  +D SDSSA+LNE+YSP                 
Subjt:  MVVATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYH

Query:  FMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ
             V  A + A     +     + Q    F KMEEH + FSGEEAC  F+D +
Subjt:  FMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACCTGCAGTCAGCTCAGATTCCTTGGGTGCACTCATCTCCATTTGCCCAACTTCAGACATACCCATGTGGGATGTGGGGTTTCTTTTTCTCTGTCTCCTCCTAC
TTTCAAACTTCAGTTTACAATGGTCAATCAATCAATCAGCCTTTTTCCTTTCTGGGTTTTGCAAGATTCCACTAATTCTTTTCCAATTTTGTGTACAGATCATGAACAGA
GTCCGAGAAACAAGAACAGTAACCATGTTTACGGCACAGAATTCCAGTCTATGCTGGATGGATTTGAGGAAGAAGGGTGCGTTGAAGAATCGGGGCATGTTTCAGAGAAG
AAAAGGCGACTTAGTGTGGAGCAAGTGAAGGCTCTAGAGAAGAATTTCGAAGTCGAAAACAAGCTCGAGCCAGAAAGGAAAGTGAAGCTTGCTCGAGAACTTGGATTACA
GCCTCGACAAGTGGCTGTTTGGTTTCAAAATCGTCGAGCCAGATGGAAAACCAAGCAATTAGAAAGAGACTATGGCGTTCTCAAAACAAATTATGAGAATCTCAAACTCA
GTTATGAAACTCTCCAAAATGACAATCAAGCTCTCCTCAAACAGGTAAAATTACAAACCCCAAACAAAATCTCCCCCTTTTTATTTTTTGTCTTTTTAAGTATCAGAAAT
TTGGTAATAATGGAGACTGTAAAATTGTGTTTGCAGATTCGGGAACTGAAATCAAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAAATGGTGGT
GGCGACCGATTCTGAAAATGCTCTGATCGAACAAACTAAGCCGGAAATTGGTGATCAGTTCTCTGTTCCGCCGGCGAGTGAATCCCAAGACTTCAATCACGAGAGCTTCA
ACAACAATGGCGGAGAAGGGGAAGAGGCAGCAATAGAAGAAGTGTCATTGTTCGCCGATTTCAAAGATGGGTCATCTGATAGCGATTCGAGCGCAATTTTAAACGAAGAT
TACAGCCCGACGGCGGGCATTTCTTCACCTGGGGTGCTGCAGAATCACCAGCAGTACCATTTCATGACGGGAGCGGTATCTCCGGCGCCCTCCGCCGCCGTGAAACTCAA
CTACTTGCAGTTTCAGAAGGGGTATCAACAACAAACCCAGATGTTTCCAAAAATGGAGGAGCATAATTTCTTCAGTGGAGAGGAGGCTTGTAACTTCTTCTCCGATGAGC
AAGCTCCGACTCTGCACTGGTGGGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGACCTGCAGTCAGCTCAGATTCCTTGGGTGCACTCATCTCCATTTGCCCAACTTCAGACATACCCATGTGGGATGTGGGGTTTCTTTTTCTCTGTCTCCTCCTAC
TTTCAAACTTCAGTTTACAATGGTCAATCAATCAATCAGCCTTTTTCCTTTCTGGGTTTTGCAAGATTCCACTAATTCTTTTCCAATTTTGTGTACAGATCATGAACAGA
GTCCGAGAAACAAGAACAGTAACCATGTTTACGGCACAGAATTCCAGTCTATGCTGGATGGATTTGAGGAAGAAGGGTGCGTTGAAGAATCGGGGCATGTTTCAGAGAAG
AAAAGGCGACTTAGTGTGGAGCAAGTGAAGGCTCTAGAGAAGAATTTCGAAGTCGAAAACAAGCTCGAGCCAGAAAGGAAAGTGAAGCTTGCTCGAGAACTTGGATTACA
GCCTCGACAAGTGGCTGTTTGGTTTCAAAATCGTCGAGCCAGATGGAAAACCAAGCAATTAGAAAGAGACTATGGCGTTCTCAAAACAAATTATGAGAATCTCAAACTCA
GTTATGAAACTCTCCAAAATGACAATCAAGCTCTCCTCAAACAGGTAAAATTACAAACCCCAAACAAAATCTCCCCCTTTTTATTTTTTGTCTTTTTAAGTATCAGAAAT
TTGGTAATAATGGAGACTGTAAAATTGTGTTTGCAGATTCGGGAACTGAAATCAAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAAATGGTGGT
GGCGACCGATTCTGAAAATGCTCTGATCGAACAAACTAAGCCGGAAATTGGTGATCAGTTCTCTGTTCCGCCGGCGAGTGAATCCCAAGACTTCAATCACGAGAGCTTCA
ACAACAATGGCGGAGAAGGGGAAGAGGCAGCAATAGAAGAAGTGTCATTGTTCGCCGATTTCAAAGATGGGTCATCTGATAGCGATTCGAGCGCAATTTTAAACGAAGAT
TACAGCCCGACGGCGGGCATTTCTTCACCTGGGGTGCTGCAGAATCACCAGCAGTACCATTTCATGACGGGAGCGGTATCTCCGGCGCCCTCCGCCGCCGTGAAACTCAA
CTACTTGCAGTTTCAGAAGGGGTATCAACAACAAACCCAGATGTTTCCAAAAATGGAGGAGCATAATTTCTTCAGTGGAGAGGAGGCTTGTAACTTCTTCTCCGATGAGC
AAGCTCCGACTCTGCACTGGTGGGGCTGA
Protein sequenceShow/hide protein sequence
METCSQLRFLGCTHLHLPNFRHTHVGCGVSFSLSPPTFKLQFTMVNQSISLFPFWVLQDSTNSFPILCTDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEK
KRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQVKLQTPNKISPFLFFVFLSIRN
LVIMETVKLCLQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNED
YSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWG