; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10016648 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10016648
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionMYB transcription factor
Genome locationChr03:6767817..6774868
RNA-Seq ExpressionHG10016648
SyntenyHG10016648
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005730 - nucleolus (cellular component)
GO:0003691 - double-stranded telomeric DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR005818 - Linker histone H1/H5, domain H15
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily
IPR044597 - Single myb histone 1-6


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036409.1 telomere repeat-binding factor 2 isoform X2 [Cucumis melo var. makuwa]3.8e-15296.36Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNS+A+KHHD+P+PVSTVLPNEEIVDA
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL
        KPLAISNGT R NGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPL GRRNAPLLL
Subjt:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL

Query:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
        LEDKQ DSSKTEKSEVKIITKSQVD ELSKMKVMTAEEAAIAAARAVAEAEAAI EAERAAREAE+AEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
Subjt:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL

Query:  LQ
        LQ
Subjt:  LQ

XP_008440400.1 PREDICTED: telomere repeat-binding factor 2 isoform X1 [Cucumis melo]2.2e-15296.04Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNS+A+KHHD+P+PVSTVLPNEEIVDA
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL
        KPLAISNGT R NGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPL GRRNAPLLL
Subjt:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL

Query:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
        LEDKQ DSSKTEKSEVKIITKSQVD ELSKMKVMTAEEAAIAAARAVAEAEAAI EAERAAREAE+AEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
Subjt:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL

Query:  LQE
        LQ+
Subjt:  LQE

XP_008440403.1 PREDICTED: telomere repeat-binding factor 2 isoform X2 [Cucumis melo]3.8e-15296.36Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNS+A+KHHD+P+PVSTVLPNEEIVDA
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL
        KPLAISNGT R NGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPL GRRNAPLLL
Subjt:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL

Query:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
        LEDKQ DSSKTEKSEVKIITKSQVD ELSKMKVMTAEEAAIAAARAVAEAEAAI EAERAAREAE+AEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
Subjt:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL

Query:  LQ
        LQ
Subjt:  LQ

XP_038883921.1 telomere repeat-binding factor 2 isoform X1 [Benincasa hispida]1.1e-15195.71Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNS+A+K HDNPMPVSTVL NEEIVDA
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL
        KPLAISNGTIR NGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAP NLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRR+APLLL
Subjt:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL

Query:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
        LEDKQMDSSKTEK+EVKIITKSQVDSELSKM+VMTAEEAAIAAARAVAEAEAAI EAERAAREAEEAEAEAETAQVFAEAAMKALECRTFP+RSP+QKVL
Subjt:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL

Query:  LQE
        LQ+
Subjt:  LQE

XP_038883922.1 telomere repeat-binding factor 2 isoform X2 [Benincasa hispida]1.9e-15196.03Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNS+A+K HDNPMPVSTVL NEEIVDA
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL
        KPLAISNGTIR NGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAP NLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRR+APLLL
Subjt:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL

Query:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
        LEDKQMDSSKTEK+EVKIITKSQVDSELSKM+VMTAEEAAIAAARAVAEAEAAI EAERAAREAEEAEAEAETAQVFAEAAMKALECRTFP+RSP+QKVL
Subjt:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL

Query:  LQ
        LQ
Subjt:  LQ

TrEMBL top hitse value%identityAlignment
A0A0A0KJQ2 MYB transcription factor1.3e-15095.7Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNS+A+KHHDN +PVSTVLPNEEIVDA
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL
        KPLAISNGT R NGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYW P NLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPL GRRN PLLL
Subjt:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL

Query:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
        LEDKQ DSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAI EAERAAREAE+AEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
Subjt:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL

Query:  LQ
        LQ
Subjt:  LQ

A0A1S3B132 MYB transcription factor1.9e-15296.36Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNS+A+KHHD+P+PVSTVLPNEEIVDA
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL
        KPLAISNGT R NGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPL GRRNAPLLL
Subjt:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL

Query:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
        LEDKQ DSSKTEKSEVKIITKSQVD ELSKMKVMTAEEAAIAAARAVAEAEAAI EAERAAREAE+AEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
Subjt:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL

Query:  LQ
        LQ
Subjt:  LQ

A0A1S3B1N4 MYB transcription factor1.1e-15296.04Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNS+A+KHHD+P+PVSTVLPNEEIVDA
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL
        KPLAISNGT R NGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPL GRRNAPLLL
Subjt:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL

Query:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
        LEDKQ DSSKTEKSEVKIITKSQVD ELSKMKVMTAEEAAIAAARAVAEAEAAI EAERAAREAE+AEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
Subjt:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL

Query:  LQE
        LQ+
Subjt:  LQE

A0A5A7T4X2 MYB transcription factor1.9e-15296.36Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNS+A+KHHD+P+PVSTVLPNEEIVDA
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL
        KPLAISNGT R NGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPL GRRNAPLLL
Subjt:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL

Query:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
        LEDKQ DSSKTEKSEVKIITKSQVD ELSKMKVMTAEEAAIAAARAVAEAEAAI EAERAAREAE+AEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
Subjt:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL

Query:  LQ
        LQ
Subjt:  LQ

A0A6J1INP0 MYB transcription factor7.3e-14993.73Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNS+ALKHHDNPM +STVL NEEIVDA
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL
        KPLAISNGT R NGPKEPLARLD+LISEAINNLKEPRGSDRAAIA+YIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNS LSGRRN PLLL
Subjt:  KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLL

Query:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL
        LE+K +DSSK EKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAI EAERAAREAEEAEAEAE+AQVFAEAAMKALECRTFP+RSP+QKVL
Subjt:  LEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVL

Query:  LQE
        LQ+
Subjt:  LQE

SwissProt top hitse value%identityAlignment
C0HIA3 Single myb histone 65.7e-7456.01Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINV-TAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLP--NEEI
        MGAPKQ+WT+EEEAAL+AG+ +HG GKWRTIL DPEFSS L  RSNVDLKDKWRN+NV  +   SR KAK ALK+     K++++ M ++ V    ++EI
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINV-TAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLP--NEEI

Query:  VDAKPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAP
        VD KP+       +     +   RLD +I EAI NL EP GS R  IA YIEE YW P +   LLS KLK ++ +GKLIKV  KYRIAP+SP S RR+  
Subjt:  VDAKPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAP

Query:  LLLLEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR
        + LLED Q +  K    + K +T+SQVD+EL++M  MTAEEA++AAARAVAEAEA + EAE AA+EAE AEAEA+ AQ FAEAA   L+ R
Subjt:  LLLLEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR

Q6WS85 Single myb histone 14.8e-6553.4Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTA-IWGSRQKAKLALKK-NSVALKHHDNPMPVSTV---LPNE
        MGAPKQ+WT EEEAALKAGV KHG GKWRTIL D +FS++L  RSNVDLKDKWRN++VTA  +GSR+KA++ALKK   V  K    PM V        ++
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTA-IWGSRQKAKLALKK-NSVALKHHDNPMPVSTV---LPNE

Query:  EIVDAKPLAISNGTI-RPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRR
          +D +PLA++  ++     P + +ARLD LI EAI  LKEP G  +AAIA YIE+ YW P + ++LLSTKLK +  +GKLIKV  KYRIAP+ P SGR 
Subjt:  EIVDAKPLAISNGTI-RPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRR

Query:  NAPLLLLEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR
           +        +  K E +  K +TK QV +EL KMK MT EEAA  AA+AVAEAE AI EAE AAR AE AE +AE A+ F +A   ++  R
Subjt:  NAPLLLLEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR

Q8VWK4 Telomere repeat-binding factor 15.3e-7255.93Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAI-WGSRQKAKLALKKNSVALKHHDNPMPVSTVL-PNEEIV
        MGAPKQKWT EEE+ALK+GVIKHG GKWRTIL DPEFS +L+ RSNVDLKDKWRN++V A  WGSR+K++LA+K+     K  +N + ++  L  +EE V
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAI-WGSRQKAKLALKKNSVALKHHDNPMPVSTVL-PNEEIV

Query:  DA-KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAP
        DA   L +S+       P+ P  RLD LI EAI  LKEP G ++  I  YIE+ Y APP+ K+LLSTKLK++T+ GKL+KVK KYRI  ++PLS  R   
Subjt:  DA-KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAP

Query:  LLLLEDKQMDSS----KTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR
        L +   KQ  SS    KT+  EV   T+SQ+D+E+++MK M   EAA  AA+AVAEAEAA+ EAE AA+EAE AEAEAE AQ FAE A K L+ R
Subjt:  LLLLEDKQMDSS----KTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR

Q9FJW5 Telomere repeat-binding factor 26.5e-7857.68Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWT EEEAALKAGV+KHG GKWRTIL+D EFS IL  RSNVDLKDKWRNI+VTA+WGSR+KAKLALK+     K  DN   ++ V    +   A
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLA---ISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPN-SPLSGRRNA
        KP +      G+ R    K  +  LDK+I EAI NL+E RGSDR +I +YIEE++  PPN+K+ ++ +LKH+++NG L+K+KHKYR + N  P   R+ A
Subjt:  KPLA---ISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPN-SPLSGRRNA

Query:  PLLLLE-DKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR
        P L LE + + D +K E++    +TK +VD EL  +K MTA+EAA AAARAVAEAE AI EAE+AA+EAE AEAEAE AQ+FA+AAMKAL+ R
Subjt:  PLLLLE-DKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR

Q9M2X3 Telomere repeat-binding factor 32.3e-6753.1Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVA-LKHHDNPMPVSTVLPNEEIVD
        MGAPK KWT EEE ALKAGV+KHG GKWRTIL+DP +S+IL  RSNVDLKDKWRNI+VTA+WGSR+KAKLALK+  ++  +  DN   ++ V      V 
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVA-LKHHDNPMPVSTVLPNEEIVD

Query:  AKPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGR-RNAPL
         + +   +       P  P   +DK+I EAI +LK P G D  +I MYIEE++   P++K+L++++LK++T  G L+K KHKYRI+ N    G  + +P 
Subjt:  AKPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGR-RNAPL

Query:  LLLEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR
        LLLE  + ++ K E++ VK +TKSQV  E+  M  MT +EAA AAARAVAEAE A+ EAE AAREA++AEAEAE A +FA+AAMKA++ R
Subjt:  LLLEDKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR

Arabidopsis top hitse value%identityAlignment
AT1G49950.1 telomere repeat binding factor 13.8e-7355.93Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAI-WGSRQKAKLALKKNSVALKHHDNPMPVSTVL-PNEEIV
        MGAPKQKWT EEE+ALK+GVIKHG GKWRTIL DPEFS +L+ RSNVDLKDKWRN++V A  WGSR+K++LA+K+     K  +N + ++  L  +EE V
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAI-WGSRQKAKLALKKNSVALKHHDNPMPVSTVL-PNEEIV

Query:  DA-KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAP
        DA   L +S+       P+ P  RLD LI EAI  LKEP G ++  I  YIE+ Y APP+ K+LLSTKLK++T+ GKL+KVK KYRI  ++PLS  R   
Subjt:  DA-KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAP

Query:  LLLLEDKQMDSS----KTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR
        L +   KQ  SS    KT+  EV   T+SQ+D+E+++MK M   EAA  AA+AVAEAEAA+ EAE AA+EAE AEAEAE AQ FAE A K L+ R
Subjt:  LLLLEDKQMDSS----KTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR

AT1G49950.2 telomere repeat binding factor 13.8e-7355.93Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAI-WGSRQKAKLALKKNSVALKHHDNPMPVSTVL-PNEEIV
        MGAPKQKWT EEE+ALK+GVIKHG GKWRTIL DPEFS +L+ RSNVDLKDKWRN++V A  WGSR+K++LA+K+     K  +N + ++  L  +EE V
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAI-WGSRQKAKLALKKNSVALKHHDNPMPVSTVL-PNEEIV

Query:  DA-KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAP
        DA   L +S+       P+ P  RLD LI EAI  LKEP G ++  I  YIE+ Y APP+ K+LLSTKLK++T+ GKL+KVK KYRI  ++PLS  R   
Subjt:  DA-KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAP

Query:  LLLLEDKQMDSS----KTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR
        L +   KQ  SS    KT+  EV   T+SQ+D+E+++MK M   EAA  AA+AVAEAEAA+ EAE AA+EAE AEAEAE AQ FAE A K L+ R
Subjt:  LLLLEDKQMDSS----KTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR

AT1G49950.3 telomere repeat binding factor 13.8e-7355.93Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAI-WGSRQKAKLALKKNSVALKHHDNPMPVSTVL-PNEEIV
        MGAPKQKWT EEE+ALK+GVIKHG GKWRTIL DPEFS +L+ RSNVDLKDKWRN++V A  WGSR+K++LA+K+     K  +N + ++  L  +EE V
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAI-WGSRQKAKLALKKNSVALKHHDNPMPVSTVL-PNEEIV

Query:  DA-KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAP
        DA   L +S+       P+ P  RLD LI EAI  LKEP G ++  I  YIE+ Y APP+ K+LLSTKLK++T+ GKL+KVK KYRI  ++PLS  R   
Subjt:  DA-KPLAISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAP

Query:  LLLLEDKQMDSS----KTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR
        L +   KQ  SS    KT+  EV   T+SQ+D+E+++MK M   EAA  AA+AVAEAEAA+ EAE AA+EAE AEAEAE AQ FAE A K L+ R
Subjt:  LLLLEDKQMDSS----KTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR

AT5G67580.1 Homeodomain-like/winged-helix DNA-binding family protein4.6e-7957.68Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWT EEEAALKAGV+KHG GKWRTIL+D EFS IL  RSNVDLKDKWRNI+VTA+WGSR+KAKLALK+     K  DN   ++ V    +   A
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLA---ISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPN-SPLSGRRNA
        KP +      G+ R    K  +  LDK+I EAI NL+E RGSDR +I +YIEE++  PPN+K+ ++ +LKH+++NG L+K+KHKYR + N  P   R+ A
Subjt:  KPLA---ISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPN-SPLSGRRNA

Query:  PLLLLE-DKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR
        P L LE + + D +K E++    +TK +VD EL  +K MTA+EAA AAARAVAEAE AI EAE+AA+EAE AEAEAE AQ+FA+AAMKAL+ R
Subjt:  PLLLLE-DKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR

AT5G67580.2 Homeodomain-like/winged-helix DNA-binding family protein4.6e-7957.68Show/hide
Query:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA
        MGAPKQKWT EEEAALKAGV+KHG GKWRTIL+D EFS IL  RSNVDLKDKWRNI+VTA+WGSR+KAKLALK+     K  DN   ++ V    +   A
Subjt:  MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDA

Query:  KPLA---ISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPN-SPLSGRRNA
        KP +      G+ R    K  +  LDK+I EAI NL+E RGSDR +I +YIEE++  PPN+K+ ++ +LKH+++NG L+K+KHKYR + N  P   R+ A
Subjt:  KPLA---ISNGTIRPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPN-SPLSGRRNA

Query:  PLLLLE-DKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR
        P L LE + + D +K E++    +TK +VD EL  +K MTA+EAA AAARAVAEAE AI EAE+AA+EAE AEAEAE AQ+FA+AAMKAL+ R
Subjt:  PLLLLE-DKQMDSSKTEKSEVKIITKSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGCGCCGAAGCAGAAATGGACAGCTGAAGAGGAAGCTGCCCTTAAAGCAGGAGTGATCAAGCATGGAGCTGGAAAATGGCGCACAATACTCACAGATCCTGAGTT
CAGCTCAATTTTGCATCAACGCTCAAATGTGGATCTCAAGGATAAGTGGAGAAATATAAATGTTACTGCAATATGGGGGTCTAGGCAAAAGGCTAAGCTCGCACTTAAAA
AGAATTCTGTGGCCCTAAAACATCATGATAATCCTATGCCTGTAAGCACCGTGCTTCCAAATGAGGAGATTGTTGACGCTAAGCCACTTGCAATTTCAAATGGAACAATT
CGCCCTAATGGTCCAAAAGAGCCACTAGCAAGATTGGACAAACTTATATCAGAGGCAATTAACAACTTAAAGGAGCCAAGGGGTTCTGACCGGGCTGCAATTGCTATGTA
CATAGAGGAGCATTACTGGGCCCCACCAAACCTTAAAAAACTTCTTTCAACAAAATTAAAGCATATGACTGCAAATGGAAAGTTGATAAAGGTAAAGCATAAGTACAGAA
TTGCTCCAAATTCTCCCTTATCTGGAAGAAGAAATGCTCCATTGTTACTCCTTGAGGACAAGCAGATGGATTCTTCAAAGACCGAGAAGAGTGAAGTCAAAATTATCACT
AAATCCCAGGTTGACTCAGAACTTTCAAAGATGAAGGTAATGACTGCAGAGGAGGCAGCTATAGCTGCTGCACGTGCAGTAGCTGAGGCAGAGGCTGCCATTGTAGAGGC
TGAAAGGGCAGCAAGAGAAGCAGAAGAAGCAGAGGCTGAAGCTGAAACAGCACAAGTTTTTGCTGAAGCAGCAATGAAGGCATTGGAGTGCAGAACATTCCCCAATAGAA
GTCCTATTCAGAAAGTTCTTCTCCAAGAATATGATTACCTTATGAGCAAGACGGCTGCTGGGGATCATATCATCAGAAGTGTATGTCGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGCGCCGAAGCAGAAATGGACAGCTGAAGAGGAAGCTGCCCTTAAAGCAGGAGTGATCAAGCATGGAGCTGGAAAATGGCGCACAATACTCACAGATCCTGAGTT
CAGCTCAATTTTGCATCAACGCTCAAATGTGGATCTCAAGGATAAGTGGAGAAATATAAATGTTACTGCAATATGGGGGTCTAGGCAAAAGGCTAAGCTCGCACTTAAAA
AGAATTCTGTGGCCCTAAAACATCATGATAATCCTATGCCTGTAAGCACCGTGCTTCCAAATGAGGAGATTGTTGACGCTAAGCCACTTGCAATTTCAAATGGAACAATT
CGCCCTAATGGTCCAAAAGAGCCACTAGCAAGATTGGACAAACTTATATCAGAGGCAATTAACAACTTAAAGGAGCCAAGGGGTTCTGACCGGGCTGCAATTGCTATGTA
CATAGAGGAGCATTACTGGGCCCCACCAAACCTTAAAAAACTTCTTTCAACAAAATTAAAGCATATGACTGCAAATGGAAAGTTGATAAAGGTAAAGCATAAGTACAGAA
TTGCTCCAAATTCTCCCTTATCTGGAAGAAGAAATGCTCCATTGTTACTCCTTGAGGACAAGCAGATGGATTCTTCAAAGACCGAGAAGAGTGAAGTCAAAATTATCACT
AAATCCCAGGTTGACTCAGAACTTTCAAAGATGAAGGTAATGACTGCAGAGGAGGCAGCTATAGCTGCTGCACGTGCAGTAGCTGAGGCAGAGGCTGCCATTGTAGAGGC
TGAAAGGGCAGCAAGAGAAGCAGAAGAAGCAGAGGCTGAAGCTGAAACAGCACAAGTTTTTGCTGAAGCAGCAATGAAGGCATTGGAGTGCAGAACATTCCCCAATAGAA
GTCCTATTCAGAAAGTTCTTCTCCAAGAATATGATTACCTTATGAGCAAGACGGCTGCTGGGGATCATATCATCAGAAGTGTATGTCGGTAA
Protein sequenceShow/hide protein sequence
MGAPKQKWTAEEEAALKAGVIKHGAGKWRTILTDPEFSSILHQRSNVDLKDKWRNINVTAIWGSRQKAKLALKKNSVALKHHDNPMPVSTVLPNEEIVDAKPLAISNGTI
RPNGPKEPLARLDKLISEAINNLKEPRGSDRAAIAMYIEEHYWAPPNLKKLLSTKLKHMTANGKLIKVKHKYRIAPNSPLSGRRNAPLLLLEDKQMDSSKTEKSEVKIIT
KSQVDSELSKMKVMTAEEAAIAAARAVAEAEAAIVEAERAAREAEEAEAEAETAQVFAEAAMKALECRTFPNRSPIQKVLLQEYDYLMSKTAAGDHIIRSVCR