; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg05843 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg05843
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionhomeobox-leucine zipper protein ATHB-6-like
Genome locationCarg_Chr16:4792081..4793696
RNA-Seq ExpressionCarg05843
SyntenyCarg05843
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000047 - Helix-turn-helix motif
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577319.1 Homeobox-leucine zipper protein ATHB-16, partial [Cucurbita argyrosperma subsp. sororia]4.7e-16899.67Show/hide
Query:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
        MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
Subjt:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
        QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSEN LIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
Subjt:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT

Query:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
        KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
Subjt:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA

Query:  PTLHWWS
        PTLHWWS
Subjt:  PTLHWWS

KAG7015409.1 Homeobox-leucine zipper protein ATHB-16 [Cucurbita argyrosperma subsp. argyrosperma]5.6e-169100Show/hide
Query:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
        MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
Subjt:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
        QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
Subjt:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT

Query:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
        KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
Subjt:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA

Query:  PTLHWWS
        PTLHWWS
Subjt:  PTLHWWS

XP_022929541.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita moschata]1.2e-16699.35Show/hide
Query:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
        MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
Subjt:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
        QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSEN LIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
Subjt:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT

Query:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
        KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLN ATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
Subjt:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA

Query:  PTLHWWS
        PTLHWWS
Subjt:  PTLHWWS

XP_022984558.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita maxima]1.2e-16698.7Show/hide
Query:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
        MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVK+LEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
Subjt:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
        QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEML PADSEN LIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
Subjt:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT

Query:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
        KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTG IPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
Subjt:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA

Query:  PTLHWWS
        PTLHWWS
Subjt:  PTLHWWS

XP_023552669.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita pepo subsp. pepo]1.9e-16497.72Show/hide
Query:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
        MKRRSDSMAALIS SPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
Subjt:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
        QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVP DSEN LIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
Subjt:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT

Query:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
        KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFM G IP SSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
Subjt:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA

Query:  PTLHWWS
        P+LHWWS
Subjt:  PTLHWWS

TrEMBL top hitse value%identityAlignment
A0A6J1C6W4 homeobox-leucine zipper protein ATHB-6-like1.2e-11371.56Show/hide
Query:  SDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDE----EELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
        SDS+ AL+SI PTSD+EQSPRN   NHVY  EFQ MLDGFDE    EE GHVSEKKRRL VEQVKALEKNFEVENKLEPERK+KLA+ELGLQPRQVAVWF
Subjt:  SDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDE----EELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSE---------MLVPADSENDLIEQTKPEITDDFSVPPA------
        QNRRARWKTKQLERDYGVLKTNY+ LKL++E LQ DN ALLKEIRELK+K+QEDNSE         +L  ADSEN LIE+T+PE T DFSVPPA      
Subjt:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSE---------MLVPADSENDLIEQTKPEITDDFSVPPA------

Query:  --RSFNNNGGEGDEPPT---------KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQ
           SFNNNGGEG+E PT         KDGSSDSDSSAILNEDYS TA +SSPGVLQN ++    A P  SPS   A VK NC+T ALNY QFQK YQQTQ
Subjt:  --RSFNNNGGEGDEPPT---------KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQ

Query:  MMFPKMEEHNFFGGEE-ACNFFSDEQAPTLHWWS
         ++PKMEEHNFF GEE  CNFFS+EQAP+LHWWS
Subjt:  MMFPKMEEHNFFGGEE-ACNFFSDEQAPTLHWWS

A0A6J1ENE6 homeobox-leucine zipper protein ATHB-6-like5.6e-16799.35Show/hide
Query:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
        MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
Subjt:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
        QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSEN LIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
Subjt:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT

Query:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
        KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLN ATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
Subjt:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA

Query:  PTLHWWS
        PTLHWWS
Subjt:  PTLHWWS

A0A6J1FNC9 homeobox-leucine zipper protein ATHB-6-like2.2e-11874.1Show/hide
Query:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDE----EELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQV
        MKR +DSM AL+SISPT+DQEQSPRN   N V   EFQ MLDGF E    EELGHVSEKKRRL VEQVKALEKNFEVENKLEPERK+KLA+ELGLQ RQV
Subjt:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDE----EELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQV

Query:  AVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNS--------EMLVPADSENDLIEQTKPEITDDFSVPPA---
        AVWFQNRRARWKTKQLERDYGVLKTNY NLKL++E LQ+DNQALLKEI+ELK K+QEDNS        E  VPADSEN LIEQ KPEITD FSVP A   
Subjt:  AVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNS--------EMLVPADSENDLIEQTKPEITDDFSVPPA---

Query:  -----RSFNNNGGEGDE----PPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNN-HFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQM
              S +NNGGEG+E    P  KDGSSDSDSSAILNEDY PT  +SSP VLQ+N+ HFMTGA  PS     +  VKLNCATTALNYLQ+QKGYQQ   
Subjt:  -----RSFNNNGGEGDE----PPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNN-HFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQM

Query:  MFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS
        MFPKMEEHNFF GEE CNFFSDEQAPTLHWWS
Subjt:  MFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS

A0A6J1JAW1 homeobox-leucine zipper protein ATHB-6-like5.6e-16798.7Show/hide
Query:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
        MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVK+LEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF
Subjt:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
        QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEML PADSEN LIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT
Subjt:  QNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPT

Query:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
        KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTG IPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA
Subjt:  KDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQA

Query:  PTLHWWS
        PTLHWWS
Subjt:  PTLHWWS

A0A6J1KBS4 homeobox-leucine zipper protein ATHB-6-like2.8e-11874.1Show/hide
Query:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDE----EELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQV
        MKR +DSM AL+SISPT+DQEQSPRN   N V   EFQ MLDGF E    EELGHVSEKKRRL VEQVKALEKNFEVENKLEPERK+KLA+ELGLQ RQV
Subjt:  MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDE----EELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQV

Query:  AVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNS--------EMLVPADSENDLIEQTKPEITDDFSVPPA---
        AVWFQNRRARWKTKQLERDYGVLKTNY NLKL++E LQ+DNQALLKEI+ELK K+QEDNS        EM VPADSEN LIEQ KPEITD FSVP A   
Subjt:  AVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNS--------EMLVPADSENDLIEQTKPEITDDFSVPPA---

Query:  -----RSFNNNGGEGDE----PPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNN-HFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQM
              S +NNGGEG+E    P  KDGSSDSDSSAILNEDY PT  +SS  VLQ+N+ HFMTGA  PS     +  VKLNCATTALNYLQ+QKGYQQ   
Subjt:  -----RSFNNNGGEGDE----PPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNN-HFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQM

Query:  MFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS
        MFPKMEEHNFF GEE CNFFSDEQAPTLHWWS
Subjt:  MFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS

SwissProt top hitse value%identityAlignment
P46667 Homeobox-leucine zipper protein ATHB-59.0e-4541.46Show/hide
Query:  SDSMAALISI-SPTSDQEQSPRNKNSNHVYE--MEFQCMLDGFDE----EELGHV-------SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELG
        SDS++  + I   T+D++ SPR   +  +Y    ++  M D  ++    E+LG V       +EKKRRLGVEQVKALEKNFE++NKLEPERK+KLAQELG
Subjt:  SDSMAALISI-SPTSDQEQSPRNKNSNHVYE--MEFQCMLDGFDE----EELGHV-------SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELG

Query:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAK--------IQEDNSEMLVPAD----SENDLIEQTKPEITD
        LQPRQVA+WFQNRRARWKTKQLERDYGVLK+N+D LK + ++LQ DN +LL +I+ELKAK        I+E+ +   V A+    + N+++E +    + 
Subjt:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAK--------IQEDNSEMLVPAD----SENDLIEQTKPEITD

Query:  DFSVPPARSFNNNGGE------GDEPPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQ
           +P     +    E        E    D +  SDSSA+LNE+YSP            N     GA+  ++  ++T G    C +              
Subjt:  DFSVPPARSFNNNGGE------GDEPPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQ

Query:  TQMMFPKMEEH-NFFGGEEACNFFSDEQ
            F KMEEH + F GEEAC  F+D +
Subjt:  TQMMFPKMEEH-NFFGGEEACNFFSDEQ

P46668 Homeobox-leucine zipper protein ATHB-64.3e-5546.67Show/hide
Query:  MKR--RSDSMAALISISP-TSDQEQSPRNKNSNHVYEMEFQCMLDGFDE------EELGHV--SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQEL
        MKR   SDS+  LIS+ P TS  EQSPR          EFQ ML+G++E      EE GHV  SEKKRRL + QVKALEKNFE+ENKLEPERK+KLAQEL
Subjt:  MKR--RSDSMAALISISP-TSDQEQSPRNKNSNHVYEMEFQCMLDGFDE------EELGHV--SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQEL

Query:  GLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI-----QEDNSEMLVPADSENDLI----EQTKPE-ITDD
        GLQPRQVAVWFQNRRARWKTKQLE+DYGVLKT YD+L+ +F++L+ DN++LL+EI +LK K+     +E+  E      +E+D+     E + PE IT+ 
Subjt:  GLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI-----QEDNSEMLVPADSENDLI----EQTKPE-ITDD

Query:  FSVPPARSFNNNG-------GEGDEPPTK---------DGSSD-SDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALN
         S PP    +++G          D  P K          GSSD SDSSA+LNE+ S    V++P  +   N F                           
Subjt:  FSVPPARSFNNNG-------GEGDEPPTK---------DGSSD-SDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALN

Query:  YLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS
          QF K  +QT+      +  +F  GEEAC FFSDEQ P+LHW+S
Subjt:  YLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS

Q6K498 Homeobox-leucine zipper protein HOX41.8e-4043.91Show/hide
Query:  DGFDEEEL---GHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQ
        +G  EEE+   G   EKKRRL VEQV+ALE++FEVENKLEPERK +LA++LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +YD+L+L  +AL+ D  
Subjt:  DGFDEEEL---GHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQ

Query:  ALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTG
        ALL EI+ELKAK+ ++ +     +  E       +P  +D    PPA  F              GSSDSDSSA+LN+  +  A  ++   L        G
Subjt:  ALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTG

Query:  AIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEAC-NFFSDEQAPTL-HWWS
        A P        AG     A  A +   F  G      +  + +E  F   +E C  FF+D+Q P L  WW+
Subjt:  AIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEAC-NFFSDEQAPTL-HWWS

Q940J1 Homeobox-leucine zipper protein ATHB-169.9e-5244.51Show/hide
Query:  MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEE------------ELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLA
        MKR   SDSM  LIS   TS  EQSPR   SN      +Q ML+G+DE+             +G +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA
Subjt:  MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEE------------ELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLA

Query:  QELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI--QEDNSEMLVPADSENDLIEQTKPEITDDFSVPP
        QELGLQPRQVAVWFQNRRARWKTKQLE+DYGVLK  YD+L+ +F++L+ DN +LL+EI ++KAK+  +EDN+      +     +++ +   TD     P
Subjt:  QELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI--QEDNSEMLVPADSENDLIEQTKPEITDDFSVPP

Query:  ARSFNNNGGEG-------------DEPPTKDGSSDS-DSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGY
         +   ++ G               +    + GSSDS DSSA+LN++ S   G  +P                  P   T G          ++LQF K  
Subjt:  ARSFNNNGGEG-------------DEPPTKDGSSDS-DSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGY

Query:  QQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS
        +QT+      +  +F  GEEAC FFSDEQ P+LHW+S
Subjt:  QQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS

Q9XH37 Homeobox-leucine zipper protein HOX41.8e-4043.91Show/hide
Query:  DGFDEEEL---GHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQ
        +G  EEE+   G   EKKRRL VEQV+ALE++FEVENKLEPERK +LA++LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +YD+L+L  +AL+ D  
Subjt:  DGFDEEEL---GHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQ

Query:  ALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTG
        ALL EI+ELKAK+ ++ +     +  E       +P  +D    PPA  F              GSSDSDSSA+LN+  +  A  ++   L        G
Subjt:  ALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTG

Query:  AIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEAC-NFFSDEQAPTL-HWWS
        A P        AG     A  A +   F  G      +  + +E  F   +E C  FF+D+Q P L  WW+
Subjt:  AIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEAC-NFFSDEQAPTL-HWWS

Arabidopsis top hitse value%identityAlignment
AT2G22430.1 homeobox protein 63.1e-5646.67Show/hide
Query:  MKR--RSDSMAALISISP-TSDQEQSPRNKNSNHVYEMEFQCMLDGFDE------EELGHV--SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQEL
        MKR   SDS+  LIS+ P TS  EQSPR          EFQ ML+G++E      EE GHV  SEKKRRL + QVKALEKNFE+ENKLEPERK+KLAQEL
Subjt:  MKR--RSDSMAALISISP-TSDQEQSPRNKNSNHVYEMEFQCMLDGFDE------EELGHV--SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQEL

Query:  GLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI-----QEDNSEMLVPADSENDLI----EQTKPE-ITDD
        GLQPRQVAVWFQNRRARWKTKQLE+DYGVLKT YD+L+ +F++L+ DN++LL+EI +LK K+     +E+  E      +E+D+     E + PE IT+ 
Subjt:  GLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI-----QEDNSEMLVPADSENDLI----EQTKPE-ITDD

Query:  FSVPPARSFNNNG-------GEGDEPPTK---------DGSSD-SDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALN
         S PP    +++G          D  P K          GSSD SDSSA+LNE+ S    V++P  +   N F                           
Subjt:  FSVPPARSFNNNG-------GEGDEPPTK---------DGSSD-SDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALN

Query:  YLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS
          QF K  +QT+      +  +F  GEEAC FFSDEQ P+LHW+S
Subjt:  YLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS

AT3G01470.1 homeobox 11.1e-2961.26Show/hide
Query:  DGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALL
        D F +++L    EKKRRL  EQV  LEK+FE ENKLEPERK +LA++LGLQPRQVAVWFQNRRARWKTKQLERDY +LK+ YD L  +++++  DN  L 
Subjt:  DGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALL

Query:  KEIRELKAKIQ
         E+  L  K+Q
Subjt:  KEIRELKAKIQ

AT4G40060.1 homeobox protein 167.0e-5344.51Show/hide
Query:  MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEE------------ELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLA
        MKR   SDSM  LIS   TS  EQSPR   SN      +Q ML+G+DE+             +G +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA
Subjt:  MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEE------------ELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLA

Query:  QELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI--QEDNSEMLVPADSENDLIEQTKPEITDDFSVPP
        QELGLQPRQVAVWFQNRRARWKTKQLE+DYGVLK  YD+L+ +F++L+ DN +LL+EI ++KAK+  +EDN+      +     +++ +   TD     P
Subjt:  QELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI--QEDNSEMLVPADSENDLIEQTKPEITDDFSVPP

Query:  ARSFNNNGGEG-------------DEPPTKDGSSDS-DSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGY
         +   ++ G               +    + GSSDS DSSA+LN++ S   G  +P                  P   T G          ++LQF K  
Subjt:  ARSFNNNGGEG-------------DEPPTKDGSSDS-DSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGY

Query:  QQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS
        +QT+      +  +F  GEEAC FFSDEQ P+LHW+S
Subjt:  QQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS

AT5G65310.1 homeobox protein 56.4e-4641.46Show/hide
Query:  SDSMAALISI-SPTSDQEQSPRNKNSNHVYE--MEFQCMLDGFDE----EELGHV-------SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELG
        SDS++  + I   T+D++ SPR   +  +Y    ++  M D  ++    E+LG V       +EKKRRLGVEQVKALEKNFE++NKLEPERK+KLAQELG
Subjt:  SDSMAALISI-SPTSDQEQSPRNKNSNHVYE--MEFQCMLDGFDE----EELGHV-------SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELG

Query:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAK--------IQEDNSEMLVPAD----SENDLIEQTKPEITD
        LQPRQVA+WFQNRRARWKTKQLERDYGVLK+N+D LK + ++LQ DN +LL +I+ELKAK        I+E+ +   V A+    + N+++E +    + 
Subjt:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAK--------IQEDNSEMLVPAD----SENDLIEQTKPEITD

Query:  DFSVPPARSFNNNGGE------GDEPPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQ
           +P     +    E        E    D +  SDSSA+LNE+YSP            N     GA+  ++  ++T G    C +              
Subjt:  DFSVPPARSFNNNGGE------GDEPPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQ

Query:  TQMMFPKMEEH-NFFGGEEACNFFSDEQ
            F KMEEH + F GEEAC  F+D +
Subjt:  TQMMFPKMEEH-NFFGGEEACNFFSDEQ

AT5G65310.2 homeobox protein 51.6e-4441.72Show/hide
Query:  SDQEQSPRNKNSNHVYE--MEFQCMLDGFDE----EELGHV-------SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRR
        +D++ SPR   +  +Y    ++  M D  ++    E+LG V       +EKKRRLGVEQVKALEKNFE++NKLEPERK+KLAQELGLQPRQVA+WFQNRR
Subjt:  SDQEQSPRNKNSNHVYE--MEFQCMLDGFDE----EELGHV-------SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRR

Query:  ARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAK--------IQEDNSEMLVPAD----SENDLIEQTKPEITDDFSVPPARSFNNNG
        ARWKTKQLERDYGVLK+N+D LK + ++LQ DN +LL +I+ELKAK        I+E+ +   V A+    + N+++E +    +    +P     +   
Subjt:  ARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAK--------IQEDNSEMLVPAD----SENDLIEQTKPEITDDFSVPPARSFNNNG

Query:  GE------GDEPPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEH-NF
         E        E    D +  SDSSA+LNE+YSP            N     GA+  ++  ++T G    C +                  F KMEEH + 
Subjt:  GE------GDEPPTKDGSSDSDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEH-NF

Query:  FGGEEACNFFSDEQ
        F GEEAC  F+D +
Subjt:  FGGEEACNFFSDEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGACGATCAGATTCCATGGCTGCACTTATCTCCATTTCCCCAACATCAGATCAAGAACAGAGTCCGAGAAATAAGAACAGTAACCATGTTTATGAGATGGAATT
CCAGTGTATGCTTGATGGGTTTGATGAGGAAGAATTAGGGCATGTTTCTGAGAAGAAAAGGCGACTTGGTGTGGAGCAAGTTAAGGCGTTAGAGAAGAATTTCGAAGTTG
AAAATAAGCTCGAACCAGAGAGGAAATTGAAGCTTGCTCAAGAACTTGGATTACAGCCTCGACAAGTGGCTGTTTGGTTCCAAAATCGTCGAGCTAGATGGAAAACTAAG
CAATTAGAAAGAGATTATGGTGTTCTTAAAACCAATTACGACAATCTTAAACTCAGTTTTGAAGCTCTCCAAAATGATAATCAAGCTCTTCTCAAAGAGATTCGGGAATT
GAAAGCAAAGATTCAAGAAGATAACTCAGAGATGTTGGTGCCGGCCGATTCTGAAAATGATCTGATCGAACAAACTAAGCCGGAAATTACCGATGACTTCTCTGTTCCAC
CGGCGAGAAGCTTCAACAACAATGGCGGAGAAGGGGATGAGCCACCAACAAAAGATGGGTCATCCGACAGCGATTCGAGCGCGATTTTAAACGAAGATTACAGCCCGACG
GCCGGCGTTTCTTCACCGGGAGTGTTGCAGAACAACAACCATTTCATGACGGGAGCGATACCTCCATCATCTCCGTCCATCGCCACCGCTGGCGTGAAACTGAACTGCGC
GACGACGGCGCTGAATTACTTGCAGTTTCAAAAGGGATATCAACAAACCCAGATGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCGGCGGGGAGGAGGCTTGTAACT
TCTTTTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATTTAAAGAGAGAGAAAGCAGGAAGGCCACCTGGGTTCTGAACTGGCTTGCTTGCTTCACTCATCATCATCATCATCATCCTTCCCATTTCTCTTCTATTTCCATGTTTT
TTTTTTTTGCTTAATTTGATCCGTTTTTCTTCTGGGTTTTGCCTGTAATCTTCTAAAATGGGTTCTGAGGGAGAAGAAATCTTCAACAACAATGATGATGAAGCAGGAAG
AAGCAACAATTTGTACAGTGATTGTGAGCAGCTCTGTTCAGCAAGTAGTTCAGACACTTAAAAACAAACCAAGAACAATCATTAAAATCATCATCGTCTTCATCTTCTAA
GTTTTCTGTGTGTTTAATTTTGTGCTGATTCGCAATCATGAAGAGACGATCAGATTCCATGGCTGCACTTATCTCCATTTCCCCAACATCAGATCAAGAACAGAGTCCGA
GAAATAAGAACAGTAACCATGTTTATGAGATGGAATTCCAGTGTATGCTTGATGGGTTTGATGAGGAAGAATTAGGGCATGTTTCTGAGAAGAAAAGGCGACTTGGTGTG
GAGCAAGTTAAGGCGTTAGAGAAGAATTTCGAAGTTGAAAATAAGCTCGAACCAGAGAGGAAATTGAAGCTTGCTCAAGAACTTGGATTACAGCCTCGACAAGTGGCTGT
TTGGTTCCAAAATCGTCGAGCTAGATGGAAAACTAAGCAATTAGAAAGAGATTATGGTGTTCTTAAAACCAATTACGACAATCTTAAACTCAGTTTTGAAGCTCTCCAAA
ATGATAATCAAGCTCTTCTCAAAGAGATTCGGGAATTGAAAGCAAAGATTCAAGAAGATAACTCAGAGATGTTGGTGCCGGCCGATTCTGAAAATGATCTGATCGAACAA
ACTAAGCCGGAAATTACCGATGACTTCTCTGTTCCACCGGCGAGAAGCTTCAACAACAATGGCGGAGAAGGGGATGAGCCACCAACAAAAGATGGGTCATCCGACAGCGA
TTCGAGCGCGATTTTAAACGAAGATTACAGCCCGACGGCCGGCGTTTCTTCACCGGGAGTGTTGCAGAACAACAACCATTTCATGACGGGAGCGATACCTCCATCATCTC
CGTCCATCGCCACCGCTGGCGTGAAACTGAACTGCGCGACGACGGCGCTGAATTACTTGCAGTTTCAAAAGGGATATCAACAAACCCAGATGATGTTTCCGAAAATGGAG
GAGCATAATTTCTTCGGCGGGGAGGAGGCTTGTAACTTCTTTTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGAGCTGAATCCATGGCGGAATTTAAGAGAATAATAA
TAGAAAAAAGTAGATTT
Protein sequenceShow/hide protein sequence
MKRRSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTK
QLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENDLIEQTKPEITDDFSVPPARSFNNNGGEGDEPPTKDGSSDSDSSAILNEDYSPT
AGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNCATTALNYLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS