; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014950 (gene) of Snake gourd v1 genome

Gene IDTan0014950
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionhomeobox-leucine zipper protein ATHB-6-like
Genome locationLG03:73841442..73843881
RNA-Seq ExpressionTan0014950
SyntenyTan0014950
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000047 - Helix-turn-helix motif
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036322.1 homeobox-leucine zipper protein ATHB-6-like [Cucumis melo var. makuwa]1.1e-16593.52Show/hide
Query:  HGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQ
        HGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSML+GLDEEGSIEEHCHVGEKKRRL+VDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQ
Subjt:  HGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQ

Query:  NRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFN
        NRRARWKTKQLERDYG+LKANYE+LKRSFDTLQQDNDALLKEIKELKSKL+EEKTESNLSVKEEIFVSESDNLLIEQ+ NHLPVDHISLPVASDHSDDF+
Subjt:  NRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFN

Query:  YESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNFFSG
        YESF++      DDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSA AGMLQSHHQILSSPA+SLNCFPFQKA YNNAQQFVKIEEHNFFSG
Subjt:  YESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNFFSG

Query:  EETCNLFSDEQAPSLHWYCSDQWN
        EETCNLFSDEQAPS+HWYC DQWN
Subjt:  EETCNLFSDEQAPSLHWYCSDQWN

XP_004143453.1 homeobox-leucine zipper protein ATHB-6 [Cucumis sativus]2.5e-16793.58Show/hide
Query:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV
        MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSML+GLDEEGSIEEHCHVGEKKRRL+VDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV
Subjt:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV

Query:  WFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSD
        WFQNRRARWKTKQLERDYG+LKANYE+LKRSFDTLQQDNDALLKEIKELKSKL+EEKTESNLSVKEEIFVSESDNLLIEQ+ NHLPVDHISLPVASDHSD
Subjt:  WFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSD

Query:  DFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNF
        DFNYESF++      DDGDDQRVEVSLF DFKDGSSDSDSSAILNEDNSPNAVVSSA AGMLQSHHQILSSPA+SLNC+PFQKAAYNNAQQFVKIEEHNF
Subjt:  DFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNF

Query:  FSGEETCNLFSDEQAPSLHWYCSDQWN
        FSGEETCNLFSDEQAPS+HWYC DQWN
Subjt:  FSGEETCNLFSDEQAPSLHWYCSDQWN

XP_008440572.1 PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Cucumis melo]1.5e-16793.58Show/hide
Query:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV
        MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSML+GLDEEGSIEEHCHVGEKKRRL+VDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV
Subjt:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV

Query:  WFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSD
        WFQNRRARWKTKQLERDYG+LKANYE+LKRSFDTLQQDNDALLKEIKELKSKL+EEKTESNLSVKEEIFVSESDNLLIEQ+ NHLPVDHISLPVASDHSD
Subjt:  WFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSD

Query:  DFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNF
        DF+YESF++      DDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSA AGMLQSHHQILSSPA+SLNCFPFQKA YNNAQQFVKIEEHNF
Subjt:  DFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNF

Query:  FSGEETCNLFSDEQAPSLHWYCSDQWN
        FSGEETCNLFSDEQAPS+HWYC DQWN
Subjt:  FSGEETCNLFSDEQAPSLHWYCSDQWN

XP_022962943.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita moschata]2.1e-15890.85Show/hide
Query:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLD-EEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVA
        MKRHGSSDSLGALMSVCPTSEEQSPRN HVYG EFQSMLEGLD EEGS+EEHCH+G KKRRL VDQVK LEKTFEIENKLEPERKVKLAQELGLQPRQVA
Subjt:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLD-EEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVA

Query:  VWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHS
        VWFQNRRARWKTKQLERDYGVLKANY+TLKRSFDTLQQDNDALLK+IKELKSK+QEEKTESNLSVKEEIF  ESD  LIEQ+NN LPVD ISLP ASD S
Subjt:  VWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHS

Query:  DDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHN
        DDFNYESFKSA+VA D DG DQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAG+LQSHHQILSSPASSLNCFPFQKAAYNNAQ FVKIEEHN
Subjt:  DDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHN

Query:  FFSGEETCNLFSDEQAPSLHWYCSDQWN
        FFSGEETCNLFSDEQAPSL W   DQWN
Subjt:  FFSGEETCNLFSDEQAPSLHWYCSDQWN

XP_038882136.1 homeobox-leucine zipper protein ATHB-6-like isoform X2 [Benincasa hispida]3.0e-16893.88Show/hide
Query:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV
        MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSML+GLDEEGSIEEHCHVGEKKRRL+VDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV
Subjt:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV

Query:  WFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSD
        WFQNRRARWKTKQLERDYG+LKANYE+LKRSFDTLQQDNDALLKEIKELKSKL+EEKTESNLSVKEEIFVSESDNLLIEQ+ NHLPVDHISLPVASDHSD
Subjt:  WFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSD

Query:  DFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNF
        DFNYE F++      DDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSA AGMLQSHHQI+SSPASSLNCFPFQKAAYNNAQQFVKIEEHNF
Subjt:  DFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNF

Query:  FSGEETCNLFSDEQAPSLHWYCSDQWN
        FSGEETCNLFSDEQAPS+HWYC DQWN
Subjt:  FSGEETCNLFSDEQAPSLHWYCSDQWN

TrEMBL top hitse value%identityAlignment
A0A0A0KGQ3 Homeobox domain-containing protein1.2e-16793.58Show/hide
Query:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV
        MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSML+GLDEEGSIEEHCHVGEKKRRL+VDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV
Subjt:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV

Query:  WFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSD
        WFQNRRARWKTKQLERDYG+LKANYE+LKRSFDTLQQDNDALLKEIKELKSKL+EEKTESNLSVKEEIFVSESDNLLIEQ+ NHLPVDHISLPVASDHSD
Subjt:  WFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSD

Query:  DFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNF
        DFNYESF++      DDGDDQRVEVSLF DFKDGSSDSDSSAILNEDNSPNAVVSSA AGMLQSHHQILSSPA+SLNC+PFQKAAYNNAQQFVKIEEHNF
Subjt:  DFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNF

Query:  FSGEETCNLFSDEQAPSLHWYCSDQWN
        FSGEETCNLFSDEQAPS+HWYC DQWN
Subjt:  FSGEETCNLFSDEQAPSLHWYCSDQWN

A0A1S3B264 homeobox-leucine zipper protein ATHB-6-like7.1e-16893.58Show/hide
Query:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV
        MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSML+GLDEEGSIEEHCHVGEKKRRL+VDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV
Subjt:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAV

Query:  WFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSD
        WFQNRRARWKTKQLERDYG+LKANYE+LKRSFDTLQQDNDALLKEIKELKSKL+EEKTESNLSVKEEIFVSESDNLLIEQ+ NHLPVDHISLPVASDHSD
Subjt:  WFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSD

Query:  DFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNF
        DF+YESF++      DDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSA AGMLQSHHQILSSPA+SLNCFPFQKA YNNAQQFVKIEEHNF
Subjt:  DFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNF

Query:  FSGEETCNLFSDEQAPSLHWYCSDQWN
        FSGEETCNLFSDEQAPS+HWYC DQWN
Subjt:  FSGEETCNLFSDEQAPSLHWYCSDQWN

A0A5D3CQL1 Homeobox-leucine zipper protein ATHB-6-like5.1e-16693.52Show/hide
Query:  HGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQ
        HGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSML+GLDEEGSIEEHCHVGEKKRRL+VDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQ
Subjt:  HGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQ

Query:  NRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFN
        NRRARWKTKQLERDYG+LKANYE+LKRSFDTLQQDNDALLKEIKELKSKL+EEKTESNLSVKEEIFVSESDNLLIEQ+ NHLPVDHISLPVASDHSDDF+
Subjt:  NRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFN

Query:  YESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNFFSG
        YESF++      DDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSA AGMLQSHHQILSSPA+SLNCFPFQKA YNNAQQFVKIEEHNFFSG
Subjt:  YESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNFFSG

Query:  EETCNLFSDEQAPSLHWYCSDQWN
        EETCNLFSDEQAPS+HWYC DQWN
Subjt:  EETCNLFSDEQAPSLHWYCSDQWN

A0A6J1HG96 homeobox-leucine zipper protein ATHB-6-like1.0e-15890.85Show/hide
Query:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLD-EEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVA
        MKRHGSSDSLGALMSVCPTSEEQSPRN HVYG EFQSMLEGLD EEGS+EEHCH+G KKRRL VDQVK LEKTFEIENKLEPERKVKLAQELGLQPRQVA
Subjt:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLD-EEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVA

Query:  VWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHS
        VWFQNRRARWKTKQLERDYGVLKANY+TLKRSFDTLQQDNDALLK+IKELKSK+QEEKTESNLSVKEEIF  ESD  LIEQ+NN LPVD ISLP ASD S
Subjt:  VWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHS

Query:  DDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHN
        DDFNYESFKSA+VA D DG DQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAG+LQSHHQILSSPASSLNCFPFQKAAYNNAQ FVKIEEHN
Subjt:  DDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHN

Query:  FFSGEETCNLFSDEQAPSLHWYCSDQWN
        FFSGEETCNLFSDEQAPSL W   DQWN
Subjt:  FFSGEETCNLFSDEQAPSLHWYCSDQWN

A0A6J1KNZ1 homeobox-leucine zipper protein ATHB-6-like1.0e-15890.85Show/hide
Query:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLD-EEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVA
        MKRHGSSDSLGALMSVCP SEEQSPRN HVYG EFQSMLEGLD EEGSIEEHCH+G KKRRL VDQVK LEKTFEIENKLEPERKVKLAQELGLQPRQVA
Subjt:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLD-EEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVA

Query:  VWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHS
        VWFQNRRARWKTKQLERDYGVLKANY+TLKRSFDTLQQDNDALLK+IKELKSK+QEEKTESNLSVKEEIF  ESD  LIEQ+NN LPVD ISLP ASD S
Subjt:  VWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHS

Query:  DDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHN
        DDFNYESFKSA+VA D DG DQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSA AG+LQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHN
Subjt:  DDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHN

Query:  FFSGEETCNLFSDEQAPSLHWYCSDQWN
        FFSGEETCNLFSDEQAPSL W   DQWN
Subjt:  FFSGEETCNLFSDEQAPSLHWYCSDQWN

SwissProt top hitse value%identityAlignment
P46667 Homeobox-leucine zipper protein ATHB-56.4e-5745.58Show/hide
Query:  MKR-HGSSDSLGALMSV--CPTSEEQSPRNS-----HVYGREFQSMLEGLDEEGSIEEHCHVG-------EKKRRLNVDQVKALEKTFEIENKLEPERKV
        MKR  GSSDSL   + +    T ++ SPR +     +    ++  M + L+++GS+E+   VG       EKKRRL V+QVKALEK FEI+NKLEPERKV
Subjt:  MKR-HGSSDSLGALMSV--CPTSEEQSPRNS-----HVYGREFQSMLEGLDEEGSIEEHCHVG-------EKKRRLNVDQVKALEKTFEIENKLEPERKV

Query:  KLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKT---ESNLSVKEEIFVSESDNLLIEQSN
        KLAQELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++ LKR+ D+LQ+DND+LL +IKELK+KL  E     E N ++K     +   N  +  +N
Subjt:  KLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKT---ESNLSVKEEIFVSESDNLLIEQSN

Query:  NHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEV-SLFP---DFKDGSSD-SDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLN
          L + H S P    H             +  D    +   E+ S+FP   +F+D  +D SDSSA+LNE+ SPN V ++ A           +   S++ 
Subjt:  NHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEV-SLFP---DFKDGSSD-SDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLN

Query:  CFPFQKAAYNNAQQFVKIEEH-NFFSGEETCNLFSDEQAPSLHWYCSDQWN
        CF           QFVK+EEH + FSGEE C LF+D +     WYCSDQWN
Subjt:  CFPFQKAAYNNAQQFVKIEEH-NFFSGEETCNLFSDEQAPSLHWYCSDQWN

P46668 Homeobox-leucine zipper protein ATHB-63.1e-6749.13Show/hide
Query:  MKRHGSSDSLGALMSVCPT--SEEQSPRNSHVYGREFQSMLEGL--DEEGSIEEHCHVG--EKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQ
        MKR  SSDS+G L+S+CPT  ++EQSPR     GREFQSMLEG   +EE  +EE  HVG  EKKRRL+++QVKALEK FE+ENKLEPERKVKLAQELGLQ
Subjt:  MKRHGSSDSLGALMSVCPT--SEEQSPRNSHVYGREFQSMLEGL--DEEGSIEEHCHVG--EKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQ

Query:  PRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKL-----QEEKTESNLSVKEEIFVS-ESDNLLIEQSNNHLPVD
        PRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ +FD+L++DN++LL+EI +LK+KL     +EE+ E+N +V  E  +S + + + + +     P  
Subjt:  PRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKL-----QEEKTESNLSVKEEIFVS-ESDNLLIEQSNNHLPVD

Query:  HISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNN
          S P   +HSD  NY SF         D    +   S F      S  SDSSA+LNE++S N  V++                               N
Subjt:  HISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNN

Query:  AQQFVKIEE----HNFFSGEETCNLFSDEQAPSLHWYCS-DQWN
          QFVK+E+     +F SGEE C  FSDEQ PSLHWY + D WN
Subjt:  AQQFVKIEE----HNFFSGEETCNLFSDEQAPSLHWYCS-DQWN

Q6K498 Homeobox-leucine zipper protein HOX46.0e-3940.2Show/hide
Query:  GLDEEGSIEEHCHV----GEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQ
        G++ EG +EE        GEKKRRL+V+QV+ALE++FE+ENKLEPERK +LA++LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +Y++L+   D L+
Subjt:  GLDEEGSIEEHCHV----GEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQ

Query:  QDNDALLKEIKELKSKL-QEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGS
        +D DALL EIKELK+KL  EE   S  SVKEE                         P ASD                               P    GS
Subjt:  QDNDALLKEIKELKSKL-QEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGS

Query:  SDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNA---QQFVKIEEH--NFFSGEETC-NLFSDEQAPSL-HWYC--SDQW
        SDSDSSA+LN+ ++  A  ++  A   ++   + + PA+         A++        F+K+EE    F   +E C   F+D+Q P L  W+   ++ W
Subjt:  SDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNA---QQFVKIEEH--NFFSGEETC-NLFSDEQAPSL-HWYC--SDQW

Query:  N
        N
Subjt:  N

Q940J1 Homeobox-leucine zipper protein ATHB-162.9e-5746.76Show/hide
Query:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEE-----HCHVG--EKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGL
        MKR  SSDS+  L+S   +++EQSPR    YG  +QSMLEG DE+ ++ E     H H+G  EKKRRL VDQVKALEK FE+ENKLEPERK KLAQELGL
Subjt:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEE-----HCHVG--EKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGL

Query:  QPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQ-EEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISL
        QPRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ +FD+L++DND+LL+EI ++K+K+  EE   +N ++ E +   E          + +P   +  
Subjt:  QPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQ-EEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISL

Query:  PVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDS-DSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQ
            +HS  FNY   +S +   D   +   VE         GSSDS DSSA+LN++ S              S +  L+ P +             +  Q
Subjt:  PVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDS-DSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQ

Query:  FVKIEE----HNFFSGEETCNLFSDEQAPSLHWY-CSDQW
        FVK E+     +F SGEE C  FSDEQ PSLHWY  SD W
Subjt:  FVKIEE----HNFFSGEETCNLFSDEQAPSLHWY-CSDQW

Q9XH37 Homeobox-leucine zipper protein HOX46.0e-3940.2Show/hide
Query:  GLDEEGSIEEHCHV----GEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQ
        G++ EG +EE        GEKKRRL+V+QV+ALE++FE+ENKLEPERK +LA++LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +Y++L+   D L+
Subjt:  GLDEEGSIEEHCHV----GEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQ

Query:  QDNDALLKEIKELKSKL-QEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGS
        +D DALL EIKELK+KL  EE   S  SVKEE                         P ASD                               P    GS
Subjt:  QDNDALLKEIKELKSKL-QEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGS

Query:  SDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNA---QQFVKIEEH--NFFSGEETC-NLFSDEQAPSL-HWYC--SDQW
        SDSDSSA+LN+ ++  A  ++  A   ++   + + PA+         A++        F+K+EE    F   +E C   F+D+Q P L  W+   ++ W
Subjt:  SDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNA---QQFVKIEEH--NFFSGEETC-NLFSDEQAPSL-HWYC--SDQW

Query:  N
        N
Subjt:  N

Arabidopsis top hitse value%identityAlignment
AT1G69780.1 Homeobox-leucine zipper protein family2.1e-3151.3Show/hide
Query:  EEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALL
        EE   ++   +GEKKRRLN++QVK LEK FE+ NKLEPERK++LA+ LGLQPRQ+A+WFQNRRARWKTKQLE+DY  LK  ++TLK   D LQ  N  L 
Subjt:  EEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALL

Query:  KEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASD
         EI  LK++ Q E    N    E    + SDN     S+++L +D  + P ++D
Subjt:  KEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASD

AT2G22430.1 homeobox protein 62.2e-6849.13Show/hide
Query:  MKRHGSSDSLGALMSVCPT--SEEQSPRNSHVYGREFQSMLEGL--DEEGSIEEHCHVG--EKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQ
        MKR  SSDS+G L+S+CPT  ++EQSPR     GREFQSMLEG   +EE  +EE  HVG  EKKRRL+++QVKALEK FE+ENKLEPERKVKLAQELGLQ
Subjt:  MKRHGSSDSLGALMSVCPT--SEEQSPRNSHVYGREFQSMLEGL--DEEGSIEEHCHVG--EKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQ

Query:  PRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKL-----QEEKTESNLSVKEEIFVS-ESDNLLIEQSNNHLPVD
        PRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ +FD+L++DN++LL+EI +LK+KL     +EE+ E+N +V  E  +S + + + + +     P  
Subjt:  PRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKL-----QEEKTESNLSVKEEIFVS-ESDNLLIEQSNNHLPVD

Query:  HISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNN
          S P   +HSD  NY SF         D    +   S F      S  SDSSA+LNE++S N  V++                               N
Subjt:  HISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNN

Query:  AQQFVKIEE----HNFFSGEETCNLFSDEQAPSLHWYCS-DQWN
          QFVK+E+     +F SGEE C  FSDEQ PSLHWY + D WN
Subjt:  AQQFVKIEE----HNFFSGEETCNLFSDEQAPSLHWYCS-DQWN

AT4G40060.1 homeobox protein 162.0e-5846.76Show/hide
Query:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEE-----HCHVG--EKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGL
        MKR  SSDS+  L+S   +++EQSPR    YG  +QSMLEG DE+ ++ E     H H+G  EKKRRL VDQVKALEK FE+ENKLEPERK KLAQELGL
Subjt:  MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEE-----HCHVG--EKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGL

Query:  QPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQ-EEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISL
        QPRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ +FD+L++DND+LL+EI ++K+K+  EE   +N ++ E +   E          + +P   +  
Subjt:  QPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQ-EEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISL

Query:  PVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDS-DSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQ
            +HS  FNY   +S +   D   +   VE         GSSDS DSSA+LN++ S              S +  L+ P +             +  Q
Subjt:  PVASDHSDDFNYESFKSASVAADDDGDDQRVEVSLFPDFKDGSSDS-DSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQ

Query:  FVKIEE----HNFFSGEETCNLFSDEQAPSLHWY-CSDQW
        FVK E+     +F SGEE C  FSDEQ PSLHWY  SD W
Subjt:  FVKIEE----HNFFSGEETCNLFSDEQAPSLHWY-CSDQW

AT5G65310.1 homeobox protein 54.5e-5845.58Show/hide
Query:  MKR-HGSSDSLGALMSV--CPTSEEQSPRNS-----HVYGREFQSMLEGLDEEGSIEEHCHVG-------EKKRRLNVDQVKALEKTFEIENKLEPERKV
        MKR  GSSDSL   + +    T ++ SPR +     +    ++  M + L+++GS+E+   VG       EKKRRL V+QVKALEK FEI+NKLEPERKV
Subjt:  MKR-HGSSDSLGALMSV--CPTSEEQSPRNS-----HVYGREFQSMLEGLDEEGSIEEHCHVG-------EKKRRLNVDQVKALEKTFEIENKLEPERKV

Query:  KLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKT---ESNLSVKEEIFVSESDNLLIEQSN
        KLAQELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++ LKR+ D+LQ+DND+LL +IKELK+KL  E     E N ++K     +   N  +  +N
Subjt:  KLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKT---ESNLSVKEEIFVSESDNLLIEQSN

Query:  NHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEV-SLFP---DFKDGSSD-SDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLN
          L + H S P    H             +  D    +   E+ S+FP   +F+D  +D SDSSA+LNE+ SPN V ++ A           +   S++ 
Subjt:  NHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDDQRVEV-SLFP---DFKDGSSD-SDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLN

Query:  CFPFQKAAYNNAQQFVKIEEH-NFFSGEETCNLFSDEQAPSLHWYCSDQWN
        CF           QFVK+EEH + FSGEE C LF+D +     WYCSDQWN
Subjt:  CFPFQKAAYNNAQQFVKIEEH-NFFSGEETCNLFSDEQAPSLHWYCSDQWN

AT5G65310.2 homeobox protein 55.5e-5647.42Show/hide
Query:  EFQSMLEGLDEEGSIEEHCHVG-------EKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYE
        ++  M + L+++GS+E+   VG       EKKRRL V+QVKALEK FEI+NKLEPERKVKLAQELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++
Subjt:  EFQSMLEGLDEEGSIEEHCHVG-------EKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKANYE

Query:  TLKRSFDTLQQDNDALLKEIKELKSKLQEEKT---ESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDDQRV
         LKR+ D+LQ+DND+LL +IKELK+KL  E     E N ++K     +   N  +  +N  L + H S P    H             +  D    +   
Subjt:  TLKRSFDTLQQDNDALLKEIKELKSKLQEEKT---ESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDDQRV

Query:  EV-SLFP---DFKDGSSD-SDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEH-NFFSGEETCNLFSDEQAPS
        E+ S+FP   +F+D  +D SDSSA+LNE+ SPN V ++ A           +   S++ CF           QFVK+EEH + FSGEE C LF+D +   
Subjt:  EV-SLFP---DFKDGSSD-SDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEH-NFFSGEETCNLFSDEQAPS

Query:  LHWYCSDQWN
          WYCSDQWN
Subjt:  LHWYCSDQWN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGACATGGCAGCTCAGATTCCTTGGGCGCTCTGATGTCCGTCTGTCCTACATCAGAGGAACAGAGTCCGAGAAACAGCCATGTTTATGGAAGGGAGTTTCAGTC
GATGTTAGAGGGCCTAGACGAAGAAGGCTCCATTGAAGAACACTGTCATGTGGGGGAGAAGAAAAGGAGACTCAACGTAGATCAAGTTAAGGCCTTAGAGAAAACATTCG
AGATTGAAAACAAGCTCGAACCAGAGAGGAAAGTGAAGCTTGCCCAAGAACTTGGTCTGCAACCACGGCAAGTTGCTGTTTGGTTCCAAAATCGTAGAGCCCGATGGAAA
ACTAAGCAACTTGAAAGAGATTATGGCGTTCTCAAAGCCAATTATGAAACTCTCAAGCGTAGCTTCGATACCCTTCAGCAGGACAATGATGCTCTGCTCAAAGAGATTAA
GGAATTGAAATCAAAGCTTCAGGAAGAGAAGACGGAGAGCAATTTATCAGTGAAGGAAGAGATTTTCGTGTCGGAATCGGACAATTTACTAATTGAACAATCCAACAATC
ATCTTCCGGTGGATCATATTTCTCTTCCTGTTGCGTCGGATCATTCCGATGACTTCAACTACGAGAGCTTTAAATCCGCCTCCGTTGCCGCCGATGACGACGGCGACGAT
CAGAGAGTTGAAGTGTCGTTGTTCCCTGATTTCAAAGATGGATCATCGGACAGTGACTCGAGCGCCATATTGAACGAAGACAACAGCCCAAACGCCGTCGTTTCATCGGC
GGCTGCCGGAATGCTTCAAAGCCACCACCAAATTCTGTCGTCTCCGGCGTCGTCTTTGAACTGCTTCCCGTTTCAAAAGGCTGCTTATAATAATGCACAACAATTTGTGA
AAATTGAAGAGCACAATTTCTTCAGCGGAGAAGAGACCTGCAATTTGTTCTCCGATGAACAAGCCCCCTCTTTGCATTGGTACTGCTCTGATCAGTGGAACTAG
mRNA sequenceShow/hide mRNA sequence
CTACGCTCGTAATTTGCATTTGGCGTGAATTTGAGACGCATTTGGTTTCGGCAAACCGCAGTGAACCCAACCCCAGTTAGATTCTAACCCTACTTCTGACACGAGTTGGA
TTATTTAAAGAGAGAGAAAGTATTAGACTAATCGTAATCGGGCTTCACACTGCCCTCCTTCTTCTTCTTCTTCATCTTCTTCCTCTTCTTCTTCTTCATATCCCACTCCC
AAAGCCTCAACCCAACGCACTCTCATAATGCAGTAGAGTGAGAGACAACCAAAAACAAGGAGAGAGAAATAAATTTCCTCTAGGCCAATCAATCAATCAATCAATCCTCT
GTTTCTGTTTCTTCTTCTTCTTCTGCGGGTTGGGTTTTGGGCTTTGATTTTACCATTTGCGAGGCTTGTTTTGAATTGGGTTTTGTTTTGTGGATCTTTCTTTTGATGGG
TTCAGAGAGGGAAGGGGGATCGAAGAAGATCGAGAAAGATCCAGGGGAAACAGGGGAACCAACTGGGAACTTGTACAGAGTCGGCCGATTTTCATTTATGGAACGTCCTC
TGTTTCTTCCTTTAACTCTACAGCTTTCGGCTACTTTGAACCACTGTTTCAGCTCCTAAAAGTTCCTTCTTTACGCTTCAAACAAACAACGCCTCGAAAGTTTTACGGGG
TTTTTATTAAAAGGATCTGATTCAATTTTGGGACTTGTTTCTTTAATTTCTTCCGTCATGAAGAGACATGGCAGCTCAGATTCCTTGGGCGCTCTGATGTCCGTCTGTCC
TACATCAGAGGAACAGAGTCCGAGAAACAGCCATGTTTATGGAAGGGAGTTTCAGTCGATGTTAGAGGGCCTAGACGAAGAAGGCTCCATTGAAGAACACTGTCATGTGG
GGGAGAAGAAAAGGAGACTCAACGTAGATCAAGTTAAGGCCTTAGAGAAAACATTCGAGATTGAAAACAAGCTCGAACCAGAGAGGAAAGTGAAGCTTGCCCAAGAACTT
GGTCTGCAACCACGGCAAGTTGCTGTTTGGTTCCAAAATCGTAGAGCCCGATGGAAAACTAAGCAACTTGAAAGAGATTATGGCGTTCTCAAAGCCAATTATGAAACTCT
CAAGCGTAGCTTCGATACCCTTCAGCAGGACAATGATGCTCTGCTCAAAGAGATTAAGGAATTGAAATCAAAGCTTCAGGAAGAGAAGACGGAGAGCAATTTATCAGTGA
AGGAAGAGATTTTCGTGTCGGAATCGGACAATTTACTAATTGAACAATCCAACAATCATCTTCCGGTGGATCATATTTCTCTTCCTGTTGCGTCGGATCATTCCGATGAC
TTCAACTACGAGAGCTTTAAATCCGCCTCCGTTGCCGCCGATGACGACGGCGACGATCAGAGAGTTGAAGTGTCGTTGTTCCCTGATTTCAAAGATGGATCATCGGACAG
TGACTCGAGCGCCATATTGAACGAAGACAACAGCCCAAACGCCGTCGTTTCATCGGCGGCTGCCGGAATGCTTCAAAGCCACCACCAAATTCTGTCGTCTCCGGCGTCGT
CTTTGAACTGCTTCCCGTTTCAAAAGGCTGCTTATAATAATGCACAACAATTTGTGAAAATTGAAGAGCACAATTTCTTCAGCGGAGAAGAGACCTGCAATTTGTTCTCC
GATGAACAAGCCCCCTCTTTGCATTGGTACTGCTCTGATCAGTGGAACTAGAAAAAAAAACACGCCGGAAAAAAAAGAATTGCGATCACCGGAGTATAATCTAGTCTTGG
TAATTTGCATTATGCAGAATATTAATAAAAAAAAAAACGGTGCGATTGGTCGGAGAGGGAGCTGCTGCCGGCTCCGACGAGAGAAAAACGGTGGAGAGAGAGATTCAGAT
GGGTAAAAGAGAAGACTGACAAAGAGTCTTCATTGGGTTGAAGCTGTAAAAAGGAAACTTTAGAGTTCGATGTTGGGTGATTTAGAACAAATGTTGTAATGAACAAATAA
TCATTTCCTTTTGCTTTTCAATTTTTTTTCTTTTTTTCTTTTTTTCTTTTTCCCTTTTCGGGGTTCATTTTATTTCTAAACCATTTTCAGGTCTTATACTTTTTTTTTCT
TAATGAGGTGAAACTACTAGTTTATATAA
Protein sequenceShow/hide protein sequence
MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLEGLDEEGSIEEHCHVGEKKRRLNVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWK
TKQLERDYGVLKANYETLKRSFDTLQQDNDALLKEIKELKSKLQEEKTESNLSVKEEIFVSESDNLLIEQSNNHLPVDHISLPVASDHSDDFNYESFKSASVAADDDGDD
QRVEVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSAAAGMLQSHHQILSSPASSLNCFPFQKAAYNNAQQFVKIEEHNFFSGEETCNLFSDEQAPSLHWYCSDQWN