; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004549 (gene) of Snake gourd v1 genome

Gene IDTan0004549
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMYB transcription factor
Genome locationLG01:5769829..5783559
RNA-Seq ExpressionTan0004549
SyntenyTan0004549
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005730 - nucleolus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003691 - double-stranded telomeric DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0042803 - protein homodimerization activity (molecular function)
GO:1990841 - promoter-specific chromatin binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR005818 - Linker histone H1/H5, domain H15
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily
IPR044597 - Single myb histone 1-6


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600281.1 Telomere repeat-binding factor 1, partial [Cucurbita argyrosperma subsp. sororia]1.8e-14688.82Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQKWTNQEE ALKAGVVKHGAGKWRTILKDPEFS+VLYLRSNVDLKDKWRNMSVMANGWGSR+KARLA KR HAPKKDESIV+Q +AVQSEDE PE
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRKEEED-------EEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKL
         K+VSLSLD KQIAGPK SNVRKEEED       EEEEEKEVER ERDA YDCHRLDNLIIEAISTLRERSGSIK SIA YI+ QYWAPPDFKRLLSSKL
Subjt:  PKSVSLSLDIKQIAGPKRSNVRKEEED-------EEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKL

Query:  KFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAA
        KFLTASGKLVKVKRRYRLAPTVSS ERRGSMLLLDD HRAS+R +RDEMCTLAKAQ+DLELAKMRTMTSQEAAA AARAVAEAEAAIA+AEEAAREAEAA
Subjt:  KFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAA

Query:  EADAEAAQSFAEAAMKTLKGRNLPKMQMIPV
        EADA AAQSFAEAAMKTLKGRNLPKMQM+ V
Subjt:  EADAEAAQSFAEAAMKTLKGRNLPKMQMIPV

XP_022941688.1 telomere repeat-binding factor 1-like isoform X1 [Cucurbita moschata]8.3e-14788.86Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFS+VLYLRSNVDLKDKWRNMSVMANGWGSREKARLA KR HAPKKDESIV+Q +AVQSEDE PE
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRK--------EEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSK
         K+VSLSLDIKQIAGPK+SNVRK        EEE+EEEEEKEVER ERDA YDCHRLDNLIIEAISTLRERSGSIK SIA YI+ QYWAPPDFKRLLSSK
Subjt:  PKSVSLSLDIKQIAGPKRSNVRK--------EEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSK

Query:  LKFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEA
        LKFLTASGKLVKVKRRYRLA TVSS ERRGSMLLLDD HRAS+R +RDEMCTLAKAQ+DLELAKMRTMTSQEAAA AARAVAEAEAAIA+AEEAAREAEA
Subjt:  LKFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEA

Query:  AEADAEAAQSFAEAAMKTLKGRNLPKMQMIPV
        AEADA AAQSFAEAAMKTLKGRNLPKMQM+ V
Subjt:  AEADAEAAQSFAEAAMKTLKGRNLPKMQMIPV

XP_022941690.1 telomere repeat-binding factor 1-like isoform X2 [Cucurbita moschata]3.5e-14588.75Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFS+VLYLRSNVDLKDKWRNMSVMANGWGSREKARLA KR HAPKKDESIV+Q +AVQSEDE PE
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRK--------EEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSK
         K+VSLSLDIKQIAGPK+SNVRK        EEE+EEEEEKEVER ERDA YDCHRLDNLIIEAISTLRERSGSIK SIA YI+ QYWAPPDFKRLLSSK
Subjt:  PKSVSLSLDIKQIAGPKRSNVRK--------EEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSK

Query:  LKFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEA
        LKFLTASGKLVKVKRRYRLA TVSS ERRGSMLLLDD HRAS+R +RDEMCTLAKAQ+DLELAKMRTMTSQEAAA AARAVAEAEAAIA+AEEAAREAEA
Subjt:  LKFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEA

Query:  AEADAEAAQSFAEAAMKTLKGRNLPKMQM
        AEADA AAQSFAEAAMKTLKGRNLPKM +
Subjt:  AEADAEAAQSFAEAAMKTLKGRNLPKMQM

XP_023527383.1 telomere repeat-binding factor 1-like [Cucurbita pepo subsp. pepo]2.1e-15091.05Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFS+VLYLRSNVDLKDKWRNMSVMANGWGSR+KARLA KRLHAPKKDESIV+Q +AVQSEDE PE
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTASG
         K+VSLSLDIKQIAGPK+SNVR EEEDEEEEEKEVER ERDA YDCHRLDNLIIEAISTLRERSGSIK SIA YI+DQYWAPPDFKRLLSSKLKFLTASG
Subjt:  PKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTASG

Query:  KLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADAEAA
        KLVKVKRRYRLAPTVSS ERRGSMLLLDD HRAS+R +RDE+CTLAKAQ+DLELAKMRTMTS+EAAA AARAVAEAEAAIA+AEEAAREAEAAEADA AA
Subjt:  KLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADAEAA

Query:  QSFAEAAMKTLKGRNLPKMQMIPV
        QSFAEAAMKTLKGRNLPKMQM+ V
Subjt:  QSFAEAAMKTLKGRNLPKMQMIPV

XP_023553536.1 telomere repeat-binding factor 1-like isoform X3 [Cucurbita pepo subsp. pepo]4.3e-14386.97Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQKWTN+EE ALKAGVVKHGAGKWRTILKDPEFS+VLYLRSNVDLKDKWRNMSVMANGWGSREKARLA KRLHAPKKDE+I+++SVAVQSEDELPE
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRKEEE------DEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLK
         KSVSLS DIK IAGPKRSNVRKEEE      D+EEEE E ER ERDARYDCHRLDNLIIEAI+TLRE  GS KT I SYI+DQYWAPPDFKRLLSSKLK
Subjt:  PKSVSLSLDIKQIAGPKRSNVRKEEE------DEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLK

Query:  FLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE
        FLTASGKLVKVKR+YRL P + SSERR SML LDDH RAS+R+D+DEMC LAKAQIDLELAKMRTMTSQEAAAAAA AVAEAEAAIA+AEEAAREAEAAE
Subjt:  FLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE

Query:  ADAEAAQSFAEAAMKTLKGRNLPKMQMIPV
        ADAEAAQSFAEAAMKTLKGRNLPKMQMI V
Subjt:  ADAEAAQSFAEAAMKTLKGRNLPKMQMIPV

TrEMBL top hitse value%identityAlignment
A0A1S3BY21 MYB transcription factor4.3e-14187.42Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQKWT++EE ALKAGVVKHGAGKWRTILKDPEFS+VLYLRSNVDLKDKWRNMSVMANGWGSREKARLA KRLHAP+KDE+ V+ SVA QSEDEL E
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRK--EEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTA
         KSVSLSLDIKQI GPKRSNVRK  EEE+EEEEEKEVERIERDARYDCHRLDNLIIEAI+TLRE  GS KT I SYI+DQYWAPPDFKRLLSSKLKFLTA
Subjt:  PKSVSLSLDIKQIAGPKRSNVRK--EEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTA

Query:  SGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADAE
        S KLVKVKR+YRL P+V+ SERR SMLLL+D  RA++R+D+DEMC LAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIA+AEEAAREAEAAEADAE
Subjt:  SGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADAE

Query:  AAQSFAEAAMKTLKGRNLPKMQMIPV
        AAQSFAEAAMKTLKGRNLPKMQMI V
Subjt:  AAQSFAEAAMKTLKGRNLPKMQMIPV

A0A6J1E4A7 MYB transcription factor7.3e-14186.77Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQKWTN+EE ALKAGVVKHGAGKWRTILKDPEFS+VLYLRSNVDLKDKWRNMSVMANGWGSREKARLA KRLHAPKKD +I+++SVAVQSEDELPE
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRKEEE------DEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLK
         KSVSLS DIK IAGPKRSNVRKEEE      D+EEEE E ER ERDARYDCHRLDNLIIEAI+TLRE  GS KT I SYI+DQYWAPPDFKRLLSSKLK
Subjt:  PKSVSLSLDIKQIAGPKRSNVRKEEE------DEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLK

Query:  FLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE
        FLTASGKLVKVKR+YRL P + SSERR SML LDDH RAS+R+D+DEMC LAKAQIDLELAKMRTMTSQEAAAAAA AVAEAEAAIA+AEEAAREAEAAE
Subjt:  FLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE

Query:  ADAEAAQSFAEAAMKTLKGRNLPKM
        ADAEAAQSFAEAAMKTLKGRNLPKM
Subjt:  ADAEAAQSFAEAAMKTLKGRNLPKM

A0A6J1E9S6 MYB transcription factor7.3e-14185.63Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQKWTN+EE ALKAGVVKHGAGKWRTILKDPEFS+VLYLRSNVDLKDKWRNMSVMANGWGSREKARLA KRLHAPKKD +I+++SVAVQSEDELPE
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRKEEE------DEEEEEKEVERIERDARYDCH----RLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLS
         KSVSLS DIK IAGPKRSNVRKEEE      D+EEEE E ER ERDARYDCH    RLDNLIIEAI+TLRE  GS KT I SYI+DQYWAPPDFKRLLS
Subjt:  PKSVSLSLDIKQIAGPKRSNVRKEEE------DEEEEEKEVERIERDARYDCH----RLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLS

Query:  SKLKFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREA
        SKLKFLTASGKLVKVKR+YRL P + SSERR SML LDDH RAS+R+D+DEMC LAKAQIDLELAKMRTMTSQEAAAAAA AVAEAEAAIA+AEEAAREA
Subjt:  SKLKFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREA

Query:  EAAEADAEAAQSFAEAAMKTLKGRNLPKMQMIPV
        EAAEADAEAAQSFAEAAMKTLKGRNLPKMQMI V
Subjt:  EAAEADAEAAQSFAEAAMKTLKGRNLPKMQMIPV

A0A6J1FST7 MYB transcription factor4.0e-14788.86Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFS+VLYLRSNVDLKDKWRNMSVMANGWGSREKARLA KR HAPKKDESIV+Q +AVQSEDE PE
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRK--------EEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSK
         K+VSLSLDIKQIAGPK+SNVRK        EEE+EEEEEKEVER ERDA YDCHRLDNLIIEAISTLRERSGSIK SIA YI+ QYWAPPDFKRLLSSK
Subjt:  PKSVSLSLDIKQIAGPKRSNVRK--------EEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSK

Query:  LKFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEA
        LKFLTASGKLVKVKRRYRLA TVSS ERRGSMLLLDD HRAS+R +RDEMCTLAKAQ+DLELAKMRTMTSQEAAA AARAVAEAEAAIA+AEEAAREAEA
Subjt:  LKFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEA

Query:  AEADAEAAQSFAEAAMKTLKGRNLPKMQMIPV
        AEADA AAQSFAEAAMKTLKGRNLPKMQM+ V
Subjt:  AEADAEAAQSFAEAAMKTLKGRNLPKMQMIPV

A0A6J1FUF3 MYB transcription factor1.7e-14588.75Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFS+VLYLRSNVDLKDKWRNMSVMANGWGSREKARLA KR HAPKKDESIV+Q +AVQSEDE PE
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRK--------EEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSK
         K+VSLSLDIKQIAGPK+SNVRK        EEE+EEEEEKEVER ERDA YDCHRLDNLIIEAISTLRERSGSIK SIA YI+ QYWAPPDFKRLLSSK
Subjt:  PKSVSLSLDIKQIAGPKRSNVRK--------EEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSK

Query:  LKFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEA
        LKFLTASGKLVKVKRRYRLA TVSS ERRGSMLLLDD HRAS+R +RDEMCTLAKAQ+DLELAKMRTMTSQEAAA AARAVAEAEAAIA+AEEAAREAEA
Subjt:  LKFLTASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEA

Query:  AEADAEAAQSFAEAAMKTLKGRNLPKMQM
        AEADA AAQSFAEAAMKTLKGRNLPKM +
Subjt:  AEADAEAAQSFAEAAMKTLKGRNLPKMQM

SwissProt top hitse value%identityAlignment
B4FT40 Single myb histone 22.2e-5747.5Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSE-DELP
        MG PKQ+WT +EE ALKAGV KHG GKWRTIL+D +FS +L LRSNVDLKDKWRN+SV A G+GSREKAR+A       KK   +V +  A   + DE  
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSE-DELP

Query:  EPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTAS
           +    +D++ +A         E  D+                   RLD+LI+EAI  L E SGS K  I+ YI+DQYW P DF+ LLS+KLK L  S
Subjt:  EPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTAS

Query:  GKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADAEA
        GKL+KV ++YR+AP   SS   G    +       + ++ + +  L K Q+  EL KM+ MT +EAAA AA+AVAEAE A+A+AEEAAR AEAAE DAEA
Subjt:  GKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADAEA

Query:  AQSFAEAAMKTLKGRNLPKM
        A++F +A + +++ RN   M
Subjt:  AQSFAEAAMKTLKGRNLPKM

C0HIA3 Single myb histone 61.4e-7251.7Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHA-PKKDE-SIVSQSVAVQSEDEL
        MGAPKQ+WT++EE AL+AG+ +HG GKWRTILKDPEFS+ L  RSNVDLKDKWRNM+V+ +   SR+KA+ A KR+   PK +E ++    V    +DE+
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHA-PKKDE-SIVSQSVAVQSEDEL

Query:  PEPKS-VSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLT
         + K  VSL  + K  +  K+S                           HRLDN+I+EAI  L E +GS +T+IA+YI++QYW P DF  LLS+KLK L+
Subjt:  PEPKS-VSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLT

Query:  ASGKLVKVKRRYRLAPTVSSSERRG-SMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEAD
         SGKL+KV R+YR+AP+  +SERR   M LL+D  R  ++   D+  TL ++Q+D ELA+M TMT++EA+ AAARAVAEAEA +A+AE AA+EAEAAEA+
Subjt:  ASGKLVKVKRRYRLAPTVSSSERRG-SMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEAD

Query:  AEAAQSFAEAAMKTLKGRNLPKM
        A+AAQ+FAEAA  TLK RN  K+
Subjt:  AEAAQSFAEAAMKTLKGRNLPKM

Q6WLH3 Single myb histone 53.1e-6448.14Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLH-APKKDESIVSQS-VAVQSEDEL
        MGAPKQ+WT++EE AL+AGV +HG G WR IL DPE S+ L  RSNVDLKDKWRNM+V+     +R++ R + +R   APK ++ +++ S +  + +DE+
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLH-APKKDESIVSQS-VAVQSEDEL

Query:  PEPKS-VSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLT
         + K  VS+S++     G   SN +K                        RLDN+I+EAI  L E +GS +T+IA+YI++QYW P DF  LLS+KLK+L 
Subjt:  PEPKS-VSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLT

Query:  ASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADA
         SGKL+KV R+YR+AP+           LL+D  R  ++   D   TL ++Q+D EL +M TMT + AAAAAA AVAEAEA +A+AE AAREAEAAEA+A
Subjt:  ASGKLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADA

Query:  EAAQSFAEAAMKTLKGRNLPKM
         AAQ+FAEAA+ TLK RN  K+
Subjt:  EAAQSFAEAAMKTLKGRNLPKM

Q6WS85 Single myb histone 15.0e-6247.98Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE
        MGAPKQ+WT +EE ALKAGV KHG GKWRTIL+D +FS +L LRSNVDLKDKWRN+SV A G+GSREKAR+A K+        +     V V+  D+  +
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPE

Query:  PKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTASG
              ++D++ +A    S   +E  D+                   RLD+LI+EAI  L+E SG  K +IA+YI+DQYW P DF+RLLS+KLK L  SG
Subjt:  PKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTASG

Query:  KLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADAEAA
        KL+KV ++YR+AP+   S R G+ +         ++++ +    L K Q+  EL KM+ MT +EAAA AA+AVAEAE AIA+AEEAAR AEAAE DAEAA
Subjt:  KLVKVKRRYRLAPTVSSSERRGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADAEAA

Query:  QSFAEAAMKTLKGRNLPKMQM
        ++F +A   +++ RN   M +
Subjt:  QSFAEAAMKTLKGRNLPKMQM

Q8VWK4 Telomere repeat-binding factor 12.8e-8156.57Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHA-PKKDESIVSQSVAVQSEDELP
        MGAPKQKWT +EE+ALK+GV+KHG GKWRTILKDPEFS VLYLRSNVDLKDKWRNMSVMANGWGSREK+RLA KR  + PK++E+ ++ + ++QS++E  
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHA-PKKDESIVSQSVAVQSEDELP

Query:  EPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTAS
        +  S    L +     P+R NV                          RLD+LI+EAI+TL+E  G  KT+I +YI+DQY APPDFKRLLS+KLK+LT+ 
Subjt:  EPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTAS

Query:  GKLVKVKRRYRLA-PTVSSSERRGSMLLLDDHHRASI----RSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE
        GKLVKVKR+YR+   T  SS RR  + +     R S     ++D DE+    ++QID E+A+M++M   EAAA AA+AVAEAEAA+A+AEEAA+EAEAAE
Subjt:  GKLVKVKRRYRLA-PTVSSSERRGSMLLLDDHHRASI----RSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE

Query:  ADAEAAQSFAEAAMKTLKGRNLPKMQM
        A+AEAAQ+FAE A KTLKGRN+ KM +
Subjt:  ADAEAAQSFAEAAMKTLKGRNLPKMQM

Arabidopsis top hitse value%identityAlignment
AT1G49950.1 telomere repeat binding factor 12.0e-8256.57Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHA-PKKDESIVSQSVAVQSEDELP
        MGAPKQKWT +EE+ALK+GV+KHG GKWRTILKDPEFS VLYLRSNVDLKDKWRNMSVMANGWGSREK+RLA KR  + PK++E+ ++ + ++QS++E  
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHA-PKKDESIVSQSVAVQSEDELP

Query:  EPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTAS
        +  S    L +     P+R NV                          RLD+LI+EAI+TL+E  G  KT+I +YI+DQY APPDFKRLLS+KLK+LT+ 
Subjt:  EPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTAS

Query:  GKLVKVKRRYRLA-PTVSSSERRGSMLLLDDHHRASI----RSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE
        GKLVKVKR+YR+   T  SS RR  + +     R S     ++D DE+    ++QID E+A+M++M   EAAA AA+AVAEAEAA+A+AEEAA+EAEAAE
Subjt:  GKLVKVKRRYRLA-PTVSSSERRGSMLLLDDHHRASI----RSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE

Query:  ADAEAAQSFAEAAMKTLKGRNLPKMQM
        A+AEAAQ+FAE A KTLKGRN+ KM +
Subjt:  ADAEAAQSFAEAAMKTLKGRNLPKMQM

AT1G49950.2 telomere repeat binding factor 12.0e-8256.57Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHA-PKKDESIVSQSVAVQSEDELP
        MGAPKQKWT +EE+ALK+GV+KHG GKWRTILKDPEFS VLYLRSNVDLKDKWRNMSVMANGWGSREK+RLA KR  + PK++E+ ++ + ++QS++E  
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHA-PKKDESIVSQSVAVQSEDELP

Query:  EPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTAS
        +  S    L +     P+R NV                          RLD+LI+EAI+TL+E  G  KT+I +YI+DQY APPDFKRLLS+KLK+LT+ 
Subjt:  EPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTAS

Query:  GKLVKVKRRYRLA-PTVSSSERRGSMLLLDDHHRASI----RSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE
        GKLVKVKR+YR+   T  SS RR  + +     R S     ++D DE+    ++QID E+A+M++M   EAAA AA+AVAEAEAA+A+AEEAA+EAEAAE
Subjt:  GKLVKVKRRYRLA-PTVSSSERRGSMLLLDDHHRASI----RSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE

Query:  ADAEAAQSFAEAAMKTLKGRNLPKMQM
        A+AEAAQ+FAE A KTLKGRN+ KM +
Subjt:  ADAEAAQSFAEAAMKTLKGRNLPKMQM

AT1G49950.3 telomere repeat binding factor 12.0e-8256.57Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHA-PKKDESIVSQSVAVQSEDELP
        MGAPKQKWT +EE+ALK+GV+KHG GKWRTILKDPEFS VLYLRSNVDLKDKWRNMSVMANGWGSREK+RLA KR  + PK++E+ ++ + ++QS++E  
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHA-PKKDESIVSQSVAVQSEDELP

Query:  EPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTAS
        +  S    L +     P+R NV                          RLD+LI+EAI+TL+E  G  KT+I +YI+DQY APPDFKRLLS+KLK+LT+ 
Subjt:  EPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTAS

Query:  GKLVKVKRRYRLA-PTVSSSERRGSMLLLDDHHRASI----RSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE
        GKLVKVKR+YR+   T  SS RR  + +     R S     ++D DE+    ++QID E+A+M++M   EAAA AA+AVAEAEAA+A+AEEAA+EAEAAE
Subjt:  GKLVKVKRRYRLA-PTVSSSERRGSMLLLDDHHRASI----RSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAE

Query:  ADAEAAQSFAEAAMKTLKGRNLPKMQM
        A+AEAAQ+FAE A KTLKGRN+ KM +
Subjt:  ADAEAAQSFAEAAMKTLKGRNLPKMQM

AT5G67580.1 Homeodomain-like/winged-helix DNA-binding family protein7.9e-5546.39Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKK--DESIVSQSVAVQSEDEL
        MGAPKQKWT +EE ALKAGV+KHG GKWRTIL D EFS +L  RSNVDLKDKWRN+SV A  WGSR+KA+LA KR     K  D +     VA+ ++DE 
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKK--DESIVSQSVAVQSEDEL

Query:  PEPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTA
         +P S   S       G  R+   K                         LD +I EAI+ LRE  GS +TSI  YI++ +  PP+ KR ++ +LK L++
Subjt:  PEPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTA

Query:  SGKLVKVKRRYRLAPTV--SSSERRGSMLLLDDHHRAS-IRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEA
        +G LVK+K +YR +     + + ++   L L+ +++    + + +   +L K ++D EL  ++ MT+QEAA AAARAVAEAE AI +AE+AA+EAE AEA
Subjt:  SGKLVKVKRRYRLAPTV--SSSERRGSMLLLDDHHRAS-IRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEA

Query:  DAEAAQSFAEAAMKTLKGR
        +AEAAQ FA+AAMK LK R
Subjt:  DAEAAQSFAEAAMKTLKGR

AT5G67580.2 Homeodomain-like/winged-helix DNA-binding family protein7.9e-5546.39Show/hide
Query:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKK--DESIVSQSVAVQSEDEL
        MGAPKQKWT +EE ALKAGV+KHG GKWRTIL D EFS +L  RSNVDLKDKWRN+SV A  WGSR+KA+LA KR     K  D +     VA+ ++DE 
Subjt:  MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKK--DESIVSQSVAVQSEDEL

Query:  PEPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTA
         +P S   S       G  R+   K                         LD +I EAI+ LRE  GS +TSI  YI++ +  PP+ KR ++ +LK L++
Subjt:  PEPKSVSLSLDIKQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTA

Query:  SGKLVKVKRRYRLAPTV--SSSERRGSMLLLDDHHRAS-IRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEA
        +G LVK+K +YR +     + + ++   L L+ +++    + + +   +L K ++D EL  ++ MT+QEAA AAARAVAEAE AI +AE+AA+EAE AEA
Subjt:  SGKLVKVKRRYRLAPTV--SSSERRGSMLLLDDHHRAS-IRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEA

Query:  DAEAAQSFAEAAMKTLKGR
        +AEAAQ FA+AAMK LK R
Subjt:  DAEAAQSFAEAAMKTLKGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGCTCCGAAGCAGAAATGGACTAACCAAGAAGAAACAGCCCTAAAGGCTGGAGTTGTTAAGCATGGAGCAGGAAAGTGGCGAACAATACTTAAGGATCCTGAATT
CAGCAATGTACTGTATCTTCGTTCAAATGTTGATCTCAAGGACAAGTGGAGAAACATGAGTGTCATGGCTAATGGCTGGGGATCCCGAGAGAAGGCCAGGTTAGCACAAA
AGAGATTGCATGCTCCTAAAAAAGATGAGAGTATTGTGTCTCAAAGCGTTGCTGTCCAAAGTGAAGATGAATTGCCAGAACCTAAGTCTGTTTCCCTTTCCTTGGATATC
AAGCAGATAGCTGGTCCAAAGCGATCAAATGTAAGAAAAGAAGAGGAAGATGAAGAAGAAGAAGAAAAAGAAGTAGAAAGAATAGAAAGAGATGCAAGATATGATTGTCA
CAGGTTAGACAATCTTATAATTGAGGCAATATCTACCTTGAGAGAACGTAGTGGCTCTATTAAGACAAGCATTGCTTCATATATAAAGGATCAATACTGGGCTCCTCCAG
ACTTCAAGAGGCTGTTATCATCAAAATTGAAGTTCTTAACAGCTAGTGGTAAACTGGTCAAGGTTAAACGGAGATATAGGCTTGCTCCAACGGTGTCTTCATCAGAGAGA
AGGGGCTCTATGTTATTGTTGGACGACCATCATAGAGCTTCTATAAGATCTGACAGAGATGAAATGTGTACTCTTGCCAAAGCTCAAATTGATCTCGAATTAGCCAAGAT
GAGGACCATGACTTCCCAAGAGGCAGCTGCCGCTGCTGCTCGAGCAGTTGCAGAAGCCGAAGCAGCAATAGCAGATGCCGAAGAGGCAGCAAGGGAAGCCGAGGCAGCTG
AAGCTGATGCAGAAGCAGCACAATCATTTGCAGAAGCTGCAATGAAGACACTGAAGGGAAGAAATCTCCCAAAGATGCAGATGATCCCGGTTTGA
mRNA sequenceShow/hide mRNA sequence
GGCAGCTGAGTCTGTTCCTGATAAAAAGTGGTTTTTTCTCTGAAAAATTACAAGCAGGCTCAGAAAAATATATAAATATCTGAATCTCGACGAGCGATAGAGAGAAACAG
AGAAGGGCGAATTTTTATACCCTTTCTGCAAATTTCAACAGAATTTTTGAATGTTCGAGCATCGAAAAACCTAACAGATACCATTATTCCTTCTGCGTAACTGAGTCGGC
CCTGGGAGTGAGCCTTCTTTCTGCTGCCTTTGATGGGTGCTCCGAAGCAGAAATGGACTAACCAAGAAGAAACAGCCCTAAAGGCTGGAGTTGTTAAGCATGGAGCAGGA
AAGTGGCGAACAATACTTAAGGATCCTGAATTCAGCAATGTACTGTATCTTCGTTCAAATGTTGATCTCAAGGACAAGTGGAGAAACATGAGTGTCATGGCTAATGGCTG
GGGATCCCGAGAGAAGGCCAGGTTAGCACAAAAGAGATTGCATGCTCCTAAAAAAGATGAGAGTATTGTGTCTCAAAGCGTTGCTGTCCAAAGTGAAGATGAATTGCCAG
AACCTAAGTCTGTTTCCCTTTCCTTGGATATCAAGCAGATAGCTGGTCCAAAGCGATCAAATGTAAGAAAAGAAGAGGAAGATGAAGAAGAAGAAGAAAAAGAAGTAGAA
AGAATAGAAAGAGATGCAAGATATGATTGTCACAGGTTAGACAATCTTATAATTGAGGCAATATCTACCTTGAGAGAACGTAGTGGCTCTATTAAGACAAGCATTGCTTC
ATATATAAAGGATCAATACTGGGCTCCTCCAGACTTCAAGAGGCTGTTATCATCAAAATTGAAGTTCTTAACAGCTAGTGGTAAACTGGTCAAGGTTAAACGGAGATATA
GGCTTGCTCCAACGGTGTCTTCATCAGAGAGAAGGGGCTCTATGTTATTGTTGGACGACCATCATAGAGCTTCTATAAGATCTGACAGAGATGAAATGTGTACTCTTGCC
AAAGCTCAAATTGATCTCGAATTAGCCAAGATGAGGACCATGACTTCCCAAGAGGCAGCTGCCGCTGCTGCTCGAGCAGTTGCAGAAGCCGAAGCAGCAATAGCAGATGC
CGAAGAGGCAGCAAGGGAAGCCGAGGCAGCTGAAGCTGATGCAGAAGCAGCACAATCATTTGCAGAAGCTGCAATGAAGACACTGAAGGGAAGAAATCTCCCAAAGATGC
AGATGATCCCGGTTTGATGGGGAAGTAAAAACTGATCTAGAGGGAGTTATGGAGCAGTTCTTAAATGGACCGACGTGTTTCGTGCTAGCTATTGAGTAGGCATAATAAGC
AACCTCTCCCCTGCCATTTTCAATCGTCTCACATTGGGCAACTCGAGCAGCAGAATCATGTCTGACTAGAAGACTGGATATCCAAACCAGAAGGAAAATTCTGAGTTTAG
TTAGTTGCATGTAAAAATGTCTATTGCTTGTAGTGTAAGAAATCTGTGGGCTTTATCTATGCTCTCTAATGTTAATGAATTATTATCATTGTTTATTTCGATTAGTCTGA
CCTTGTAGTTGGTACTGAAACTCTGGAGTTATATTTACTGTAAACAAGGAAAAAGGATAGGGAAATAAATGAGTTTTTTAACCG
Protein sequenceShow/hide protein sequence
MGAPKQKWTNQEETALKAGVVKHGAGKWRTILKDPEFSNVLYLRSNVDLKDKWRNMSVMANGWGSREKARLAQKRLHAPKKDESIVSQSVAVQSEDELPEPKSVSLSLDI
KQIAGPKRSNVRKEEEDEEEEEKEVERIERDARYDCHRLDNLIIEAISTLRERSGSIKTSIASYIKDQYWAPPDFKRLLSSKLKFLTASGKLVKVKRRYRLAPTVSSSER
RGSMLLLDDHHRASIRSDRDEMCTLAKAQIDLELAKMRTMTSQEAAAAAARAVAEAEAAIADAEEAAREAEAAEADAEAAQSFAEAAMKTLKGRNLPKMQMIPV