; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019185 (gene) of Snake gourd v1 genome

Gene IDTan0019185
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA-binding protein BIN4
Genome locationLG01:189945..231336
RNA-Seq ExpressionTan0019185
SyntenyTan0019185
Gene Ontology termsGO:0042023 - DNA endoreduplication (biological process)
GO:0009330 - DNA topoisomerase complex (ATP-hydrolyzing) (cellular component)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR033246 - DNA-binding protein BIN4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147326.1 DNA-binding protein BIN4 isoform X1 [Cucumis sativus]7.1e-14777.61Show/hide
Query:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD
        MSSSREQSPDWMRSFQAPTGVALSSNS SSKNGSSSMDNAIDQ+DPSSHKTTQ LDGDQ+QGD GN NLAKEVKL+ HTGHENS+HSVWMLS DSESC D
Subjt:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD

Query:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL
         +FIKEDYS+HEEL+EL TS+ QGR KDENAGR+F +GKSKSRKVSN+ SPKK+VKS  CTS KE ++NS TNK G  +EGSE  V+N GDVEI+EKDAL
Subjt:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL

Query:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQL
        DDC GPP                  ALVECEG SI+LSGDMGAVGR+VVSDSSSAKNELCLDLKGT+YRA +V SRTFCI         IESIMNDFIQL
Subjt:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQL

Query:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
        KALSKVDEAETMVEGTLDGFSFDSED+AEKITK A+SP  QNE VEGLN KSKNKA+KSSGRKRVKTGG+LQAPKK RKK QGSKTKN KSKK
Subjt:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

XP_008460815.1 PREDICTED: DNA-binding protein BIN4 isoform X2 [Cucumis melo]4.2e-14777.61Show/hide
Query:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD
        MSSSREQSPDWMRSFQAPTGVALSSNS SSKNGSSSMDNAIDQ+DPSSHKTTQ LDGDQ+QGD GN NLAKE KL+  TGHENS HSVWMLSSDSESC D
Subjt:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD

Query:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL
         +FIKED +HHEELSEL TS+ QGR KDENAGR+F +GKSKSRKVS + SPKK++KS  CTS KEK++NSDTNK GL LEGSE  V+N G+ EI+EKDAL
Subjt:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL

Query:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQL
        DDC  PP                  ALVECEG SI+LSGDMGAVGR+VVSDSSSAKNELCLDLKGT+YRA +V SRTFCI         IESIMNDFIQL
Subjt:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQL

Query:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
        KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSP  QNE VEGLN KSKNKA+KSSGRKRVK+GG+LQAPKK RKK QGSKTKN KSKK
Subjt:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

XP_022140827.1 DNA-binding protein BIN4 isoform X1 [Momordica charantia]1.3e-14576.59Show/hide
Query:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD
        MSSSREQSPDWMRSFQ PTGVALSSNSESS N SS MDNAIDQKD SSHKTTQ LDGDQ+QGD G+ NL KE+KLEEH GH +S+HSVWMLSSDSE C D
Subjt:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD

Query:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL
         S IKEDYSHHEEL E  TSQF GR KDEN  R+F DGKSKSRKVS+KKSPKK+VKS   T TKEK++N  TNK G +LEGSECCV+NGGDVEII KDAL
Subjt:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL

Query:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCII---------ESIMNDFIQL
        DDCNGPP                  ALVECEG SI+LSGD+GAVGR+VVSDSS AKNELCLDLKGTIYRAA+V SRTFCI+         E IMNDFIQL
Subjt:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCII---------ESIMNDFIQL

Query:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
        KA S +DEAETMVEGTLDGFSFDSEDEAEKITKV+SSPT QNE VEGL+KKSKNKA+KSSGRKRV+TGGKLQAPKKARKK QG KTKN KSKK
Subjt:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

XP_038894608.1 DNA-binding protein BIN4 isoform X1 [Benincasa hispida]7.6e-14975.79Show/hide
Query:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD
        M SSREQSPDWMRSFQAP GVALSSNSESSKN SSSMDNA+DQK PSS+KTTQ LDGDQ+QGD GN NLAKEVK EEHT HENS+HSVWMLSSDSESCPD
Subjt:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD

Query:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL
         +FIKE+YSHHEELSE  TSQFQGRG+DENAG +F +GKSKS KVSNKKSPKKQVKS  CTS KEK++NS+TNK GL+LEGSE  V+NG DV+IIEKDAL
Subjt:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL

Query:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI--------------------
        D CNGPP                  ALVECEG SI+LSGDMGAVGR+VVSDSSS KNELCLDLKGTIYRAA+V SRTFCI                    
Subjt:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI--------------------

Query:  -----IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGS
             IESIMNDFIQLKALSKVDEAETM+EGTLDGFSFDSEDEAEKI KV SSPT QNE VEGLN KSKNK +KSSGRKRVK GGKLQAPKK RKK QGS
Subjt:  -----IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGS

Query:  KTKNTKSKK
        KTK+TKSKK
Subjt:  KTKNTKSKK

XP_038894663.1 DNA-binding protein BIN4 isoform X2 [Benincasa hispida]1.1e-15078.88Show/hide
Query:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD
        M SSREQSPDWMRSFQAP GVALSSNSESSKN SSSMDNA+DQK PSS+KTTQ LDGDQ+QGD GN NLAKEVK EEHT HENS+HSVWMLSSDSESCPD
Subjt:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD

Query:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL
         +FIKE+YSHHEELSE  TSQFQGRG+DENAG +F +GKSKS KVSNKKSPKKQVKS  CTS KEK++NS+TNK GL+LEGSE  V+NG DV+IIEKDAL
Subjt:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL

Query:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQL
        D CNGPP                  ALVECEG SI+LSGDMGAVGR+VVSDSSS KNELCLDLKGTIYRAA+V SRTFCI         IESIMNDFIQL
Subjt:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQL

Query:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
        KALSKVDEAETM+EGTLDGFSFDSEDEAEKI KV SSPT QNE VEGLN KSKNK +KSSGRKRVK GGKLQAPKK RKK QGSKTK+TKSKK
Subjt:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

TrEMBL top hitse value%identityAlignment
A0A0A0LJZ5 Uncharacterized protein5.5e-13776.61Show/hide
Query:  MRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPDKSFIKEDYSHH
        MRSFQAPTGVALSSNS SSKNGSSSMDNAIDQ+DPSSHKTTQ LDGDQ+QGD GN NLAKEVKL+ HTGHENS+HSVWMLS DSESC D +FIKEDYS+H
Subjt:  MRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPDKSFIKEDYSHH

Query:  EELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDALDDCNGPP----
        EEL+EL TS+ QGR KDENAGR+F +GKSKSRKVSN+ SPKK+VKS  CTS KE ++NS TNK G  +EGSE  V+N GDVEI+EKDALDDC GPP    
Subjt:  EELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDALDDCNGPP----

Query:  --------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAET
                      ALVECEG SI+LSGDMGAVGR+VVSDSSSAKNELCLDLKGT+YRA +V SRTFCI         IESIMNDFIQLKALSKVDEAET
Subjt:  --------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAET

Query:  MVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQG
        MVEGTLDGFSFDSED+AEKITK A+SP  QNE VEGLN KSKNKA+KSSGRKRVKTGG+LQAPKK RKK QG
Subjt:  MVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQG

A0A1S3CDA8 DNA-binding protein BIN4 isoform X22.0e-14777.61Show/hide
Query:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD
        MSSSREQSPDWMRSFQAPTGVALSSNS SSKNGSSSMDNAIDQ+DPSSHKTTQ LDGDQ+QGD GN NLAKE KL+  TGHENS HSVWMLSSDSESC D
Subjt:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD

Query:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL
         +FIKED +HHEELSEL TS+ QGR KDENAGR+F +GKSKSRKVS + SPKK++KS  CTS KEK++NSDTNK GL LEGSE  V+N G+ EI+EKDAL
Subjt:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL

Query:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQL
        DDC  PP                  ALVECEG SI+LSGDMGAVGR+VVSDSSSAKNELCLDLKGT+YRA +V SRTFCI         IESIMNDFIQL
Subjt:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQL

Query:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
        KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSP  QNE VEGLN KSKNKA+KSSGRKRVK+GG+LQAPKK RKK QGSKTKN KSKK
Subjt:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

A0A1S3CDB0 DNA-binding protein BIN4 isoform X12.1e-14474.75Show/hide
Query:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD
        MSSSREQSPDWMRSFQAPTGVALSSNS SSKNGSSSMDNAIDQ+DPSSHKTTQ LDGDQ+QGD GN NLAKE KL+  TGHENS HSVWMLSSDSESC D
Subjt:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD

Query:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL
         +FIKED +HHEELSEL TS+ QGR KDENAGR+F +GKSKSRKVS + SPKK++KS  CTS KEK++NSDTNK GL LEGSE  V+N G+ EI+EKDAL
Subjt:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL

Query:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLK---------------GTIYRAAVVSSRTFCI-----
        DDC  PP                  ALVECEG SI+LSGDMGAVGR+VVSDSSSAKNELCLDLK               GT+YRA +V SRTFCI     
Subjt:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLK---------------GTIYRAAVVSSRTFCI-----

Query:  ----IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSK
            IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSP  QNE VEGLN KSKNKA+KSSGRKRVK+GG+LQAPKK RKK QGSK
Subjt:  ----IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSK

Query:  TKNTKSKK
        TKN KSKK
Subjt:  TKNTKSKK

A0A5D3BS94 DNA-binding protein BIN4 isoform X26.5e-13876.66Show/hide
Query:  APTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPDKSFIKEDYSHHEELSE
        APTGVALSSNS SSKNGSSSMDNAIDQ+DPSSHKTTQ LDGDQ+QGD GN NLAKE KL+  TGHENS HSVWMLSSDSESC D +FIKED +HHEELSE
Subjt:  APTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPDKSFIKEDYSHHEELSE

Query:  LTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDALDDCNGPP---------
        L TS+ QGR KDENAGR+F +GKSKSRKVS + SPKK++KS  CTS KEK++NSDTNK GL LEGSE  V+N G+ EI+EKDALDDC  PP         
Subjt:  LTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDALDDCNGPP---------

Query:  ---------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGT
                 ALVECEG SI+LSGDMGAVGR+VVSDSSSAKNELCLDLKGT+YRA +V SRTFCI         IESIMNDFIQLKALSKVDEAETMVEGT
Subjt:  ---------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGT

Query:  LDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
        LDGFSFDSEDEAEKITKVASSP  QNE VEGLN KSKNKA+KSSGRKRVK+GG+LQAPKK RKK QGSKTKN KSKK
Subjt:  LDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

A0A6J1CI66 DNA-binding protein BIN4 isoform X16.5e-14676.59Show/hide
Query:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD
        MSSSREQSPDWMRSFQ PTGVALSSNSESS N SS MDNAIDQKD SSHKTTQ LDGDQ+QGD G+ NL KE+KLEEH GH +S+HSVWMLSSDSE C D
Subjt:  MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPD

Query:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL
         S IKEDYSHHEEL E  TSQF GR KDEN  R+F DGKSKSRKVS+KKSPKK+VKS   T TKEK++N  TNK G +LEGSECCV+NGGDVEII KDAL
Subjt:  KSFIKEDYSHHEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKS--CTSTKEKMVNSDTNK-GLILEGSECCVKNGGDVEIIEKDAL

Query:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCII---------ESIMNDFIQL
        DDCNGPP                  ALVECEG SI+LSGD+GAVGR+VVSDSS AKNELCLDLKGTIYRAA+V SRTFCI+         E IMNDFIQL
Subjt:  DDCNGPP------------------ALVECEGNSIELSGDMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCII---------ESIMNDFIQL

Query:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
        KA S +DEAETMVEGTLDGFSFDSEDEAEKITKV+SSPT QNE VEGL+KKSKNKA+KSSGRKRV+TGGKLQAPKKARKK QG KTKN KSKK
Subjt:  KALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

SwissProt top hitse value%identityAlignment
Q9FLU1 DNA-binding protein BIN47.4e-3032.9Show/hide
Query:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQ----GDSGNPNLAKEVKLEE-----------------
        SSSRE SPDW+RS++AP   + ++LSS+ + S    S + +++   D        VL+ + V+     +S    + K+V +E+                 
Subjt:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQ----GDSGNPNLAKEVKLEE-----------------

Query:  HTGHEN----------SRH--------SVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGK
          G EN          S+H        SVW++SSDSE S P           D  F+ E            +E S  T S+   +  K+ N+ ++ +  +
Subjt:  HTGHEN----------SRH--------SVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGK

Query:  SK------SRKVSNKKSPKKQVKSC---------------TSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSG
         K      + +V+ +KSPK + KS                T  K+K  ++DT     +   +    + G    +     +  N    LVECEG+SI+LSG
Subjt:  SK------SRKVSNKKSPKKQVKSC---------------TSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSG

Query:  DMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPT
        DMGAVGR+VVSD++    ++ LDLKGTIY++ ++ SRTFC+         IE+IMNDFIQL   S V EAETMVEGTL+GF+F+S+DE+ K  K A  P 
Subjt:  DMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPT

Query:  HQN-----ETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
         Q+     ET      K+K K +   G+KR +   + Q P    KKA+ S  K  K+KK
Subjt:  HQN-----ETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

Arabidopsis top hitse value%identityAlignment
AT5G24630.1 double-stranded DNA binding3.1e-3133.11Show/hide
Query:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQ----GDSGNPNLAKEVKLEE-------------HTGH
        SSSRE SPDW+RS++AP   + ++LSS+ + S    S + +++   D        VL+ + V+     +S    + K+V +E+               G 
Subjt:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQ----GDSGNPNLAKEVKLEE-------------HTGH

Query:  EN----------SRH---------SVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGKSK-
        EN          S+H         SVW++SSDSE S P           D  F+ E            +E S  T S+   +  K+ N+ ++ +  + K 
Subjt:  EN----------SRH---------SVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGKSK-

Query:  -----SRKVSNKKSPKKQVKSC---------------TSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSGDMG
             + +V+ +KSPK + KS                T  K+K  ++DT     +   +    + G    +     +  N    LVECEG+SI+LSGDMG
Subjt:  -----SRKVSNKKSPKKQVKSC---------------TSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSGDMG

Query:  AVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQN
        AVGR+VVSD++    ++ LDLKGTIY++ ++ SRTFC+         IE+IMNDFIQL   S V EAETMVEGTL+GF+F+S+DE+ K  K A  P  Q+
Subjt:  AVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQN

Query:  -----ETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
             ET      K+K K +   G+KR +   + Q P    KKA+ S  K  K+KK
Subjt:  -----ETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

AT5G24630.3 double-stranded DNA binding1.1e-3132.16Show/hide
Query:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNA---------------------IDQKDPSSHKTTQVLDGDQV-------------QGDS
        SSSRE SPDW+RS++AP   + ++LSS+ + S    S + ++                     + +K+  +   T+ +  +QV             +G  
Subjt:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNA---------------------IDQKDPSSHKTTQVLDGDQV-------------QGDS

Query:  GNPNLAKEVKLEEHTGHENSRHSVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGKSK---
           N+  E    +H   +    SVW++SSDSE S P           D  F+ E            +E S  T S+   +  K+ N+ ++ +  + K   
Subjt:  GNPNLAKEVKLEEHTGHENSRHSVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGKSK---

Query:  ---SRKVSNKKSPKKQVKSC---------------TSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSGDMGAV
           + +V+ +KSPK + KS                T  K+K  ++DT     +   +    + G    +     +  N    LVECEG+SI+LSGDMGAV
Subjt:  ---SRKVSNKKSPKKQVKSC---------------TSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSGDMGAV

Query:  GRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQN--
        GR+VVSD++    ++ LDLKGTIY++ ++ SRTFC+         IE+IMNDFIQL   S V EAETMVEGTL+GF+F+S+DE+ K  K A  P  Q+  
Subjt:  GRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQN--

Query:  ---ETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
           ET      K+K K +   G+KR +   + Q P    KKA+ S  K  K+KK
Subjt:  ---ETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

AT5G24630.4 double-stranded DNA binding1.1e-3132.16Show/hide
Query:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNA---------------------IDQKDPSSHKTTQVLDGDQV-------------QGDS
        SSSRE SPDW+RS++AP   + ++LSS+ + S    S + ++                     + +K+  +   T+ +  +QV             +G  
Subjt:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNA---------------------IDQKDPSSHKTTQVLDGDQV-------------QGDS

Query:  GNPNLAKEVKLEEHTGHENSRHSVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGKSK---
           N+  E    +H   +    SVW++SSDSE S P           D  F+ E            +E S  T S+   +  K+ N+ ++ +  + K   
Subjt:  GNPNLAKEVKLEEHTGHENSRHSVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGKSK---

Query:  ---SRKVSNKKSPKKQVKSC---------------TSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSGDMGAV
           + +V+ +KSPK + KS                T  K+K  ++DT     +   +    + G    +     +  N    LVECEG+SI+LSGDMGAV
Subjt:  ---SRKVSNKKSPKKQVKSC---------------TSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSGDMGAV

Query:  GRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQN--
        GR+VVSD++    ++ LDLKGTIY++ ++ SRTFC+         IE+IMNDFIQL   S V EAETMVEGTL+GF+F+S+DE+ K  K A  P  Q+  
Subjt:  GRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQN--

Query:  ---ETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
           ET      K+K K +   G+KR +   + Q P    KKA+ S  K  K+KK
Subjt:  ---ETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

AT5G24630.5 double-stranded DNA binding1.1e-3132.44Show/hide
Query:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNA---------------------IDQKDPSSHKTTQVLDGDQV-------------QGDS
        SSSRE SPDW+RS++AP   + ++LSS+ + S    S + ++                     + +K+  +   T+ +  +QV             +G  
Subjt:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNA---------------------IDQKDPSSHKTTQVLDGDQV-------------QGDS

Query:  GNPNLAKEVKLEEHTGHENSRHSVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGKSK---
           N+  E    +H   +    SVW++SSDSE S P           D  F+ E            +E S  T S+   +  K+ N+ ++ +  + K   
Subjt:  GNPNLAKEVKLEEHTGHENSRHSVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGKSK---

Query:  ---SRKVSNKKSPKKQVKSCTST-------KEKMVNSDTNKGLIL----EGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSGDMGAVGRIV
           + +V+ +KSPK + KS   T       +E +   DT+   I+       +    + G    +     +  N    LVECEG+SI+LSGDMGAVGR+V
Subjt:  ---SRKVSNKKSPKKQVKSCTST-------KEKMVNSDTNKGLIL----EGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSGDMGAVGRIV

Query:  VSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQN-----E
        VSD++    ++ LDLKGTIY++ ++ SRTFC+         IE+IMNDFIQL   S V EAETMVEGTL+GF+F+S+DE+ K  K A  P  Q+     E
Subjt:  VSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQN-----E

Query:  TVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
        T      K+K K +   G+KR +   + Q P    KKA+ S  K  K+KK
Subjt:  TVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK

AT5G24630.6 double-stranded DNA binding5.2e-3132.9Show/hide
Query:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQ----GDSGNPNLAKEVKLEE-----------------
        SSSRE SPDW+RS++AP   + ++LSS+ + S    S + +++   D        VL+ + V+     +S    + K+V +E+                 
Subjt:  SSSREQSPDWMRSFQAP---TGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQ----GDSGNPNLAKEVKLEE-----------------

Query:  HTGHEN----------SRH--------SVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGK
          G EN          S+H        SVW++SSDSE S P           D  F+ E            +E S  T S+   +  K+ N+ ++ +  +
Subjt:  HTGHEN----------SRH--------SVWMLSSDSE-SCP-----------DKSFIKEDYSH-------HEELSELTTSQFQGR-GKDENAGRKFIDGK

Query:  SK------SRKVSNKKSPKKQVKSC---------------TSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSG
         K      + +V+ +KSPK + KS                T  K+K  ++DT     +   +    + G    +     +  N    LVECEG+SI+LSG
Subjt:  SK------SRKVSNKKSPKKQVKSC---------------TSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSG

Query:  DMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPT
        DMGAVGR+VVSD++    ++ LDLKGTIY++ ++ SRTFC+         IE+IMNDFIQL   S V EAETMVEGTL+GF+F+S+DE+ K  K A  P 
Subjt:  DMGAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCI---------IESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPT

Query:  HQN-----ETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK
         Q+     ET      K+K K +   G+KR +   + Q P    KKA+ S  K  K+KK
Subjt:  HQN-----ETVEGLNKKSKNKADKSSGRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGTTCGAGAGAGCAATCTCCAGATTGGATGCGATCTTTCCAAGCACCAACTGGTGTTGCCCTATCCTCTAATTCTGAATCTTCAAAGAATGGTAGCTCATCAAT
GGACAATGCAATTGATCAAAAGGATCCATCTTCACATAAAACCACACAAGTTTTAGATGGAGATCAGGTTCAAGGGGATAGTGGCAACCCTAATCTGGCAAAGGAAGTGA
AACTTGAGGAACATACAGGCCATGAAAATTCAAGGCACTCAGTTTGGATGTTATCATCGGATTCAGAGTCATGTCCTGATAAGAGTTTTATAAAGGAGGATTATAGTCAT
CATGAAGAATTATCTGAACTTACGACATCTCAATTCCAAGGTAGAGGGAAGGATGAAAATGCAGGTCGCAAATTCATCGATGGAAAATCTAAATCAAGGAAAGTATCAAA
TAAAAAGTCTCCAAAAAAACAAGTGAAATCATGCACTTCGACCAAAGAGAAGATGGTCAATTCAGATACAAATAAAGGCCTTATTTTGGAAGGTTCTGAATGCTGCGTAA
AAAATGGTGGCGATGTGGAGATTATAGAAAAAGATGCATTGGATGACTGCAATGGGCCTCCTGCACTTGTTGAGTGTGAAGGAAATTCAATAGAGCTGAGTGGCGACATG
GGTGCTGTAGGACGAATTGTTGTTTCTGATTCCTCATCTGCAAAAAATGAACTTTGCCTAGATTTGAAAGGTACAATTTATAGAGCGGCAGTAGTTTCTTCAAGGACATT
TTGCATCATAGAATCTATCATGAATGACTTCATACAGTTGAAGGCACTGTCGAAAGTTGATGAGGCTGAAACTATGGTTGAAGGAACATTGGATGGCTTCTCATTTGATT
CTGAAGATGAGGCTGAGAAAATAACTAAAGTCGCTTCTTCGCCAACTCACCAAAATGAGACTGTAGAAGGGCTCAACAAAAAATCTAAAAATAAAGCCGATAAATCATCA
GGGCGGAAGCGTGTTAAAACTGGAGGAAAGCTGCAAGCGCCAAAGAAAGCAAGGAAGAAAGCTCAAGGTTCTAAAACTAAAAATACAAAGAGCAAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAGTTCGAGAGAGCAATCTCCAGATTGGATGCGATCTTTCCAAGCACCAACTGGTGTTGCCCTATCCTCTAATTCTGAATCTTCAAAGAATGGTAGCTCATCAAT
GGACAATGCAATTGATCAAAAGGATCCATCTTCACATAAAACCACACAAGTTTTAGATGGAGATCAGGTTCAAGGGGATAGTGGCAACCCTAATCTGGCAAAGGAAGTGA
AACTTGAGGAACATACAGGCCATGAAAATTCAAGGCACTCAGTTTGGATGTTATCATCGGATTCAGAGTCATGTCCTGATAAGAGTTTTATAAAGGAGGATTATAGTCAT
CATGAAGAATTATCTGAACTTACGACATCTCAATTCCAAGGTAGAGGGAAGGATGAAAATGCAGGTCGCAAATTCATCGATGGAAAATCTAAATCAAGGAAAGTATCAAA
TAAAAAGTCTCCAAAAAAACAAGTGAAATCATGCACTTCGACCAAAGAGAAGATGGTCAATTCAGATACAAATAAAGGCCTTATTTTGGAAGGTTCTGAATGCTGCGTAA
AAAATGGTGGCGATGTGGAGATTATAGAAAAAGATGCATTGGATGACTGCAATGGGCCTCCTGCACTTGTTGAGTGTGAAGGAAATTCAATAGAGCTGAGTGGCGACATG
GGTGCTGTAGGACGAATTGTTGTTTCTGATTCCTCATCTGCAAAAAATGAACTTTGCCTAGATTTGAAAGGTACAATTTATAGAGCGGCAGTAGTTTCTTCAAGGACATT
TTGCATCATAGAATCTATCATGAATGACTTCATACAGTTGAAGGCACTGTCGAAAGTTGATGAGGCTGAAACTATGGTTGAAGGAACATTGGATGGCTTCTCATTTGATT
CTGAAGATGAGGCTGAGAAAATAACTAAAGTCGCTTCTTCGCCAACTCACCAAAATGAGACTGTAGAAGGGCTCAACAAAAAATCTAAAAATAAAGCCGATAAATCATCA
GGGCGGAAGCGTGTTAAAACTGGAGGAAAGCTGCAAGCGCCAAAGAAAGCAAGGAAGAAAGCTCAAGGTTCTAAAACTAAAAATACAAAGAGCAAGAAATGA
Protein sequenceShow/hide protein sequence
MSSSREQSPDWMRSFQAPTGVALSSNSESSKNGSSSMDNAIDQKDPSSHKTTQVLDGDQVQGDSGNPNLAKEVKLEEHTGHENSRHSVWMLSSDSESCPDKSFIKEDYSH
HEELSELTTSQFQGRGKDENAGRKFIDGKSKSRKVSNKKSPKKQVKSCTSTKEKMVNSDTNKGLILEGSECCVKNGGDVEIIEKDALDDCNGPPALVECEGNSIELSGDM
GAVGRIVVSDSSSAKNELCLDLKGTIYRAAVVSSRTFCIIESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTHQNETVEGLNKKSKNKADKSS
GRKRVKTGGKLQAPKKARKKAQGSKTKNTKSKK