; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0002 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0002
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDNA-binding protein BIN4
Genome locationMC10:21908..32906
RNA-Seq ExpressionMC10g0002
SyntenyMC10g0002
Gene Ontology termsGO:0042023 - DNA endoreduplication (biological process)
GO:0009330 - DNA topoisomerase complex (ATP-hydrolyzing) (cellular component)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR033246 - DNA-binding protein BIN4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147326.1 DNA-binding protein BIN4 isoform X1 [Cucumis sativus]1.89e-19478.37Show/hide
Query:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD
        MSSSREQSPDWMRSFQ PTGVALSSNS SS N SS MDNAIDQ+D SSHKTTQDLDGDQIQGD G+HNL KE+KL+ H GH +SKHSVWMLS DSE CSD
Subjt:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD

Query:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL
        N+ IKEDYS+HEEL E  TS+  GRRKDEN  R FT+GKSKSRKVS++ SPKK+VKS+V T  KE I+N  TNK G  +EGSE  VRNG DVEI+ KDAL
Subjt:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL

Query:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIM---------ECIMNDFIQL
        DDC GPPVSSSRLPLVLSDK HRLKALVECEGTSIDLSGD+GAVGRVVVSDSS AKNELCLDLKGT+YRA IVPSRTFCI+         E IMNDFIQL
Subjt:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIM---------ECIMNDFIQL

Query:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        KA S +DEAETMVEGTLDGFSFDSED+AEKITK +S P DQNE VEGL+ KSKNKAEKSSGRKRV+TGG+LQAPKK RKKVQG KTKN KSKK
Subjt:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

XP_008460815.1 PREDICTED: DNA-binding protein BIN4 isoform X2 [Cucumis melo]1.38e-19478.63Show/hide
Query:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD
        MSSSREQSPDWMRSFQ PTGVALSSNS SS N SS MDNAIDQ+D SSHKTTQDLDGDQIQGD G+HNL KE KL+   GH +S+HSVWMLSSDSE CSD
Subjt:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD

Query:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL
        N+ IKED +HHEEL E  TS+  GRRKDEN  R FT+GKSKSRKVS + SPKK++KSQV T  KEKIIN  TNK G  LEGSE  VRNG + EI+ KDAL
Subjt:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL

Query:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIM---------ECIMNDFIQL
        DDC  PPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGD+GAVGRVVVSDSS AKNELCLDLKGT+YRA IVPSRTFCI+         E IMNDFIQL
Subjt:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIM---------ECIMNDFIQL

Query:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        KA S +DEAETMVEGTLDGFSFDSEDEAEKITKV+SSP DQNE VEGL+ KSKNKAEKSSGRKRV++GG+LQAPKK RKKVQG KTKN KSKK
Subjt:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

XP_022140827.1 DNA-binding protein BIN4 isoform X1 [Momordica charantia]4.08e-26197.71Show/hide
Query:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD
        MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD
Subjt:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD

Query:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL
        NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL
Subjt:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL

Query:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQL
        DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI         MECIMNDFIQL
Subjt:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQL

Query:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
Subjt:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

XP_022140828.1 DNA-binding protein BIN4 isoform X2 [Momordica charantia]1.21e-23597.48Show/hide
Query:  MDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFT
        MDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFT
Subjt:  MDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFT

Query:  DGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSID
        DGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSID
Subjt:  DGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSID

Query:  LSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSS
        LSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI         MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSS
Subjt:  LSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSS

Query:  SPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        SPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
Subjt:  SPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

XP_038894663.1 DNA-binding protein BIN4 isoform X2 [Benincasa hispida]2.06e-19679.13Show/hide
Query:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD
        M SSREQSPDWMRSFQ P GVALSSNSESS N SS MDNA+DQK  SS+KTTQDLDGDQIQGD G+HNL KE+K EEH  H +SKHSVWMLSSDSE C D
Subjt:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD

Query:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL
        N+ IKE+YSHHEEL E  TSQF GR +DEN    FT+GKSKS KVS+KKSPKK+VKSQV T  KEKIIN  TNK G +LEGSE  VRNG DV+II KDAL
Subjt:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL

Query:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIM---------ECIMNDFIQL
        D CNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGD+GAVGRVVVSDSS +KNELCLDLKGTIYRAAIVPSRTFCI+         E IMNDFIQL
Subjt:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIM---------ECIMNDFIQL

Query:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        KA S +DEAETM+EGTLDGFSFDSEDEAEKI KV+SSPTDQNE VEGL+ KSKNK EKSSGRKRV+ GGKLQAPKK RKKVQG KTK+ KSKK
Subjt:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

TrEMBL top hitse value%identityAlignment
A0A1S3CDA8 DNA-binding protein BIN4 isoform X26.67e-19578.63Show/hide
Query:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD
        MSSSREQSPDWMRSFQ PTGVALSSNS SS N SS MDNAIDQ+D SSHKTTQDLDGDQIQGD G+HNL KE KL+   GH +S+HSVWMLSSDSE CSD
Subjt:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD

Query:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL
        N+ IKED +HHEEL E  TS+  GRRKDEN  R FT+GKSKSRKVS + SPKK++KSQV T  KEKIIN  TNK G  LEGSE  VRNG + EI+ KDAL
Subjt:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL

Query:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIM---------ECIMNDFIQL
        DDC  PPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGD+GAVGRVVVSDSS AKNELCLDLKGT+YRA IVPSRTFCI+         E IMNDFIQL
Subjt:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIM---------ECIMNDFIQL

Query:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        KA S +DEAETMVEGTLDGFSFDSEDEAEKITKV+SSP DQNE VEGL+ KSKNKAEKSSGRKRV++GG+LQAPKK RKKVQG KTKN KSKK
Subjt:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

A0A1S3CDB0 DNA-binding protein BIN4 isoform X11.03e-19075.74Show/hide
Query:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD
        MSSSREQSPDWMRSFQ PTGVALSSNS SS N SS MDNAIDQ+D SSHKTTQDLDGDQIQGD G+HNL KE KL+   GH +S+HSVWMLSSDSE CSD
Subjt:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD

Query:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL
        N+ IKED +HHEEL E  TS+  GRRKDEN  R FT+GKSKSRKVS + SPKK++KSQV T  KEKIIN  TNK G  LEGSE  VRNG + EI+ KDAL
Subjt:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL

Query:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKG---------------TIYRAAIVPSRTFCIM----
        DDC  PPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGD+GAVGRVVVSDSS AKNELCLDLKG               T+YRA IVPSRTFCI+    
Subjt:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKG---------------TIYRAAIVPSRTFCIM----

Query:  -----ECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFK
             E IMNDFIQLKA S +DEAETMVEGTLDGFSFDSEDEAEKITKV+SSP DQNE VEGL+ KSKNKAEKSSGRKRV++GG+LQAPKK RKKVQG K
Subjt:  -----ECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFK

Query:  TKNGKSKK
        TKN KSKK
Subjt:  TKNGKSKK

A0A5D3BS94 DNA-binding protein BIN4 isoform X21.64e-18277.72Show/hide
Query:  VPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFE
         PTGVALSSNS SS N SS MDNAIDQ+D SSHKTTQDLDGDQIQGD G+HNL KE KL+   GH +S+HSVWMLSSDSE CSDN+ IKED +HHEEL E
Subjt:  VPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFE

Query:  SKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLV
          TS+  GRRKDEN  R FT+GKSKSRKVS + SPKK++KSQV T  KEKIIN  TNK G  LEGSE  VRNG + EI+ KDALDDC  PPVSSSRLPLV
Subjt:  SKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLV

Query:  LSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIM---------ECIMNDFIQLKAESNIDEAETMVEGT
        LSDKVHRLKALVECEGTSIDLSGD+GAVGRVVVSDSS AKNELCLDLKGT+YRA IVPSRTFCI+         E IMNDFIQLKA S +DEAETMVEGT
Subjt:  LSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIM---------ECIMNDFIQLKAESNIDEAETMVEGT

Query:  LDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        LDGFSFDSEDEAEKITKV+SSP DQNE VEGL+ KSKNKAEKSSGRKRV++GG+LQAPKK RKKVQG KTKN KSKK
Subjt:  LDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

A0A6J1CGV6 DNA-binding protein BIN4 isoform X25.84e-23697.48Show/hide
Query:  MDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFT
        MDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFT
Subjt:  MDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFT

Query:  DGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSID
        DGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSID
Subjt:  DGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSID

Query:  LSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSS
        LSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI         MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSS
Subjt:  LSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSS

Query:  SPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        SPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
Subjt:  SPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

A0A6J1CI66 DNA-binding protein BIN4 isoform X11.98e-26197.71Show/hide
Query:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD
        MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD
Subjt:  MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSD

Query:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL
        NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL
Subjt:  NSLIKEDYSHHEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDAL

Query:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQL
        DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI         MECIMNDFIQL
Subjt:  DDCNGPPVSSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQL

Query:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
Subjt:  KAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

SwissProt top hitse value%identityAlignment
Q9FLU1 DNA-binding protein BIN49.2e-3933.26Show/hide
Query:  SSSREQSPDWMRSFQVPTGVALSSNSES--------------------------------------SNNVSSFMDNAIDQKDLSSHKTTQDLD---GDQI
        SSSRE SPDW+RS++ P   +L S S S                                       N+ +  +   +  + + S K   D      D  
Subjt:  SSSREQSPDWMRSFQVPTGVALSSNSES--------------------------------------SNNVSSFMDNAIDQKDLSSHKTTQDLD---GDQI

Query:  QGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCS--------------------------------DNSLIKEDYSHHEELFESKTSQFLGRRKD
         G    +N+  E    +H        SVW++SSDSE  S                                + S   +  S  +   E  ++Q + + +D
Subjt:  QGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCS--------------------------------DNSLIKEDYSHHEELFESKTSQFLGRRKD

Query:  ENTD----REFTDGKS-KSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKVHR
        ++TD     + T  KS K++  S +K+PK+E  +Q    T++K  +  T+ +  + E            E+     +   +G   SSSRLPLVLS+KV+R
Subjt:  ENTD----REFTDGKS-KSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKVHR

Query:  LKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFD
         K LVECEG SIDLSGD+GAVGRVVVSD++    ++ LDLKGTIY++ I+PSRTFC+         +E IMNDFIQL  +SN+ EAETMVEGTL+GF+F+
Subjt:  LKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFD

Query:  SEDEAEKITKVSSSPTDQNEAVE-----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        S+DE+ K  K +  P DQ+   E         K+K K E   G+KR R   + Q P    KK +    K  K+KK
Subjt:  SEDEAEKITKVSSSPTDQNEAVE-----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

Arabidopsis top hitse value%identityAlignment
AT5G24630.1 double-stranded DNA binding6.5e-4033.62Show/hide
Query:  SSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAI-------------------------DQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLE
        SSSRE SPDW+RS++ P   +L S S SS++ S + ++ +                          +K+  +   T+ +  +Q+   +   +    ++ +
Subjt:  SSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAI-------------------------DQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLE

Query:  EHAGHGD-----SKH---------SVWMLSSDSELCS--------------------------------DNSLIKEDYSHHEELFESKTSQFLGRRKDEN
        E+  + D     SKH         SVW++SSDSE  S                                + S   +  S  +   E  ++Q + + +D++
Subjt:  EHAGHGD-----SKH---------SVWMLSSDSELCS--------------------------------DNSLIKEDYSHHEELFESKTSQFLGRRKDEN

Query:  TD----REFTDGKS-KSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKVHRLK
        TD     + T  KS K++  S +K+PK+E  +Q    T++K  +  T+ +  + E            E+     +   +G   SSSRLPLVLS+KV+R K
Subjt:  TD----REFTDGKS-KSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKVHRLK

Query:  ALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSE
         LVECEG SIDLSGD+GAVGRVVVSD++    ++ LDLKGTIY++ I+PSRTFC+         +E IMNDFIQL  +SN+ EAETMVEGTL+GF+F+S+
Subjt:  ALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSE

Query:  DEAEKITKVSSSPTDQNEAVE-----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        DE+ K  K +  P DQ+   E         K+K K E   G+KR R   + Q P    KK +    K  K+KK
Subjt:  DEAEKITKVSSSPTDQNEAVE-----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

AT5G24630.2 double-stranded DNA binding1.1e-4235.7Show/hide
Query:  SSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQI-----------------------------------------
        SSSRE SPDW+RS++ P   +L S S S +      D+   + ++ S     D DGD I                                         
Subjt:  SSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQI-----------------------------------------

Query:  --QGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFESKTSQFLGRRKDE----NTDREFTDGKSKSRKVSDKKSPKKE
          +G    +N+  E    +H        SVW++SSDSE    +S IK++ +   E    K + F+    +E     T R+    K+KS+  S +K+PK+ 
Subjt:  --QGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFESKTSQFLGRRKDE----NTDREFTDGKSKSRKVSDKKSPKKE

Query:  VKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVE-----IIGKDALDDCNGPPV--SSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRV
          +Q    T++K  +    ++ F             D +     II ++   D    P   SSSRLPLVLS+KV+R K LVECEG SIDLSGD+GAVGRV
Subjt:  VKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVE-----IIGKDALDDCNGPPV--SSSRLPLVLSDKVHRLKALVECEGTSIDLSGDVGAVGRV

Query:  VVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVE-
        VVSD++    ++ LDLKGTIY++ I+PSRTFC+         +E IMNDFIQL  +SN+ EAETMVEGTL+GF+F+S+DE+ K  K +  P DQ+   E 
Subjt:  VVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPTDQNEAVE-

Query:  ----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
                K+K K E   G+KR R   + Q P    KK +    K  K+KK
Subjt:  ----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

AT5G24630.3 double-stranded DNA binding1.0e-4033.54Show/hide
Query:  SSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQI-----------------------------------------
        SSSRE SPDW+RS++ P   +L S S S +      D+   + ++ S     D DGD I                                         
Subjt:  SSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQI-----------------------------------------

Query:  --QGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCS--------------------------------DNSLIKEDYSHHEELFESKTSQFLGRR
          +G    +N+  E    +H        SVW++SSDSE  S                                + S   +  S  +   E  ++Q + + 
Subjt:  --QGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCS--------------------------------DNSLIKEDYSHHEELFESKTSQFLGRR

Query:  KDENTD----REFTDGKS-KSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKV
        +D++TD     + T  KS K++  S +K+PK+E  +Q    T++K  +  T+ +  + E            E+     +   +G   SSSRLPLVLS+KV
Subjt:  KDENTD----REFTDGKS-KSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKV

Query:  HRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFS
        +R K LVECEG SIDLSGD+GAVGRVVVSD++    ++ LDLKGTIY++ I+PSRTFC+         +E IMNDFIQL  +SN+ EAETMVEGTL+GF+
Subjt:  HRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFS

Query:  FDSEDEAEKITKVSSSPTDQNEAVE-----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        F+S+DE+ K  K +  P DQ+   E         K+K K E   G+KR R   + Q P    KK +    K  K+KK
Subjt:  FDSEDEAEKITKVSSSPTDQNEAVE-----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

AT5G24630.4 double-stranded DNA binding1.0e-4033.54Show/hide
Query:  SSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQI-----------------------------------------
        SSSRE SPDW+RS++ P   +L S S S +      D+   + ++ S     D DGD I                                         
Subjt:  SSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQI-----------------------------------------

Query:  --QGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCS--------------------------------DNSLIKEDYSHHEELFESKTSQFLGRR
          +G    +N+  E    +H        SVW++SSDSE  S                                + S   +  S  +   E  ++Q + + 
Subjt:  --QGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCS--------------------------------DNSLIKEDYSHHEELFESKTSQFLGRR

Query:  KDENTD----REFTDGKS-KSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKV
        +D++TD     + T  KS K++  S +K+PK+E  +Q    T++K  +  T+ +  + E            E+     +   +G   SSSRLPLVLS+KV
Subjt:  KDENTD----REFTDGKS-KSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDKV

Query:  HRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFS
        +R K LVECEG SIDLSGD+GAVGRVVVSD++    ++ LDLKGTIY++ I+PSRTFC+         +E IMNDFIQL  +SN+ EAETMVEGTL+GF+
Subjt:  HRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFS

Query:  FDSEDEAEKITKVSSSPTDQNEAVE-----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
        F+S+DE+ K  K +  P DQ+   E         K+K K E   G+KR R   + Q P    KK +    K  K+KK
Subjt:  FDSEDEAEKITKVSSSPTDQNEAVE-----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK

AT5G24630.5 double-stranded DNA binding5.9e-4135.05Show/hide
Query:  SSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQI-----------------------------------------
        SSSRE SPDW+RS++ P   +L S S S +      D+   + ++ S     D DGD I                                         
Subjt:  SSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQI-----------------------------------------

Query:  --QGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFESKTSQFLGRRKDE----NTDREFTDGKSKSRKVSDKKSPKKE
          +G    +N+  E    +H        SVW++SSDSE    +S IK++ +   E    K + F+    +E     T R+    K+KS+  S +K+PK+ 
Subjt:  --QGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSHHEELFESKTSQFLGRRKDE----NTDREFTDGKSKSRKVSDKKSPKKE

Query:  VKSQVRTLTKEKIINF-------------HTNKEGFVLEGSECCVRNGGDVE------IIGKDALDDCNGPPV--SSSRLPLVLSDKVHRLKALVECEGT
          +Q    T++K  +                +K        E C +     E      II ++   D    P   SSSRLPLVLS+KV+R K LVECEG 
Subjt:  VKSQVRTLTKEKIINF-------------HTNKEGFVLEGSECCVRNGGDVE------IIGKDALDDCNGPPV--SSSRLPLVLSDKVHRLKALVECEGT

Query:  SIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITK
        SIDLSGD+GAVGRVVVSD++    ++ LDLKGTIY++ I+PSRTFC+         +E IMNDFIQL  +SN+ EAETMVEGTL+GF+F+S+DE+ K  K
Subjt:  SIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCI---------MECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITK

Query:  VSSSPTDQNEAVE-----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK
         +  P DQ+   E         K+K K E   G+KR R   + Q P    KK +    K  K+KK
Subjt:  VSSSPTDQNEAVE-----GLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGTTCGAGAGAGCAATCTCCAGATTGGATGCGATCTTTCCAAGTACCAACTGGTGTTGCACTATCCTCTAATTCTGAATCATCAAATAATGTGAGCTCATTTAT
GGACAATGCAATTGATCAAAAGGATCTATCTTCACATAAAACCACACAAGACTTGGATGGAGATCAGATACAAGGGGACCGTGGCCACCATAATCTGGTGAAGGAAATGA
AACTTGAGGAACACGCGGGACATGGGGACTCAAAGCACTCAGTTTGGATGTTATCATCAGATTCGGAGTTGTGTTCTGATAATAGTCTTATAAAGGAGGATTATAGTCAT
CACGAAGAATTATTTGAATCTAAAACATCTCAATTCCTAGGGAGACGGAAGGATGAAAATACAGATCGAGAATTCACTGATGGAAAATCTAAATCAAGGAAAGTGTCAGA
TAAAAAGTCTCCAAAAAAAGAGGTCAAATCACAAGTTCGTACTTTGACAAAAGAGAAGATAATCAATTTTCACACCAATAAAGAAGGTTTTGTGTTAGAAGGATCTGAAT
GCTGTGTAAGAAATGGTGGAGACGTGGAGATTATAGGAAAAGATGCATTGGATGACTGCAACGGACCTCCTGTTTCCTCCTCAAGGTTGCCATTGGTCCTCTCTGACAAA
GTCCATCGGTTGAAGGCACTTGTTGAGTGTGAAGGAACTTCAATAGATTTGAGTGGTGACGTGGGTGCTGTAGGACGAGTTGTAGTTTCAGATTCCTCATTTGCAAAAAA
TGAACTTTGTCTAGATTTGAAAGGTACAATTTATAGAGCGGCAATAGTTCCTTCAAGGACATTTTGCATTATGGAGTGTATCATGAATGACTTTATACAGTTGAAGGCAG
AATCCAATATTGATGAGGCTGAAACTATGGTTGAAGGAACATTGGATGGCTTCTCATTTGATTCCGAAGATGAGGCTGAGAAAATAACTAAAGTTTCTTCCTCTCCAACC
GACCAAAATGAGGCAGTAGAAGGGCTCAGCAAAAAATCAAAAAATAAAGCTGAGAAATCATCAGGGCGGAAGCGCGTTAGAACTGGAGGAAAGCTGCAGGCACCAAAGAA
AGCAAGGAAGAAAGTTCAAGGTTTTAAAACTAAAAATGGCAAGAGCAAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAACTATCTAGGTAAATCAACACAAATCCTTTTAACATGATTTGTTCTCACTCACTTGCTTCCAAAGAATATCTGTTAAAGATCACGATCGAGTCATTGAAAATAA
AGGTGTACCACATTGATACAATTTTTTAAAATTTTTATTAACAGCGCTTCCATGATCTCAGGATCCCTCTCATTCAGACGTGGTACCGGTTTATCTATGTACCTATCCTC
TCTACTCGGACGTCACATGCCCACCAACTTCAGCTTGGTTCGTCCCCCAATTACACCGTATTAGGAGAGATTCCACTAATCTACTTCTGTTTCCCTCTTCCTCGACCAGC
TAGCCCTTCACGCCAGACCCCACCTCAGGCGCTCTCTGAACTCGCTAAAACCCTCCCCCTCATCCTCCATCCCCTGCCGGCGACACACTCACGGCGAACACCACGTCAGG
CGGTGGTAGTGCAGATCTTCGACGAACGGCGAAACTCCAACGGCCCGCGATTCCCCGACACGAACTTCTTCTCCGGCGTCCGCCGAATGTTGAGACAGGCTCTCGATTAG
CGGTACGTCGGGGATCGTTTGGAGCTGCGCTGTGATCGTTCAGGAAGGCTTAATCGGGATCGGTCTCGAGCTCTCTGTACTCAGGCGTGCCCGGTTCCGGCATGAATCGG
CGGCTCACTGTCGGGCGATTCGGTAGGTAGCCGGCGTAGGGGTATTGGCCGAAGTTCACAGCGATCCTTATAACAATGACAAAATACTTTATTTTCTGGGTTATATATCT
TTTGATTTGGTATTTTGTACGGGGGAAGAACTTTGTATGATTTCGACGGCCTTCCATTCACCAAGTAAGAGAATAGTGGATCTCGAATCCGTGGACGTTATTTGGCAGCG
GAATAGTGCAGTGGAAAGACCAAAAGACTTCTGAATTAATACGAAGGCCCAAATGAGCAGTTCGAGAGAGCAATCTCCAGATTGGATGCGATCTTTCCAAGTACCAACTG
GTGTTGCACTATCCTCTAATTCTGAATCATCAAATAATGTGAGCTCATTTATGGACAATGCAATTGATCAAAAGGATCTATCTTCACATAAAACCACACAAGACTTGGAT
GGAGATCAGATACAAGGGGACCGTGGCCACCATAATCTGGTGAAGGAAATGAAACTTGAGGAACACGCGGGACATGGGGACTCAAAGCACTCAGTTTGGATGTTATCATC
AGATTCGGAGTTGTGTTCTGATAATAGTCTTATAAAGGAGGATTATAGTCATCACGAAGAATTATTTGAATCTAAAACATCTCAATTCCTAGGGAGACGGAAGGATGAAA
ATACAGATCGAGAATTCACTGATGGAAAATCTAAATCAAGGAAAGTGTCAGATAAAAAGTCTCCAAAAAAAGAGGTCAAATCACAAGTTCGTACTTTGACAAAAGAGAAG
ATAATCAATTTTCACACCAATAAAGAAGGTTTTGTGTTAGAAGGATCTGAATGCTGTGTAAGAAATGGTGGAGACGTGGAGATTATAGGAAAAGATGCATTGGATGACTG
CAACGGACCTCCTGTTTCCTCCTCAAGGTTGCCATTGGTCCTCTCTGACAAAGTCCATCGGTTGAAGGCACTTGTTGAGTGTGAAGGAACTTCAATAGATTTGAGTGGTG
ACGTGGGTGCTGTAGGACGAGTTGTAGTTTCAGATTCCTCATTTGCAAAAAATGAACTTTGTCTAGATTTGAAAGGTACAATTTATAGAGCGGCAATAGTTCCTTCAAGG
ACATTTTGCATTATGGAGTGTATCATGAATGACTTTATACAGTTGAAGGCAGAATCCAATATTGATGAGGCTGAAACTATGGTTGAAGGAACATTGGATGGCTTCTCATT
TGATTCCGAAGATGAGGCTGAGAAAATAACTAAAGTTTCTTCCTCTCCAACCGACCAAAATGAGGCAGTAGAAGGGCTCAGCAAAAAATCAAAAAATAAAGCTGAGAAAT
CATCAGGGCGGAAGCGCGTTAGAACTGGAGGAAAGCTGCAGGCACCAAAGAAAGCAAGGAAGAAAGTTCAAGGTTTTAAAACTAAAAATGGCAAGAGCAAGAAATGAAGC
TTGAAATCTCTACTGTATAAGCTTTTAATCCAAGGTGGTTGAATAGTTGTTTCTCATGGCCCCTAGGAGGGCAGAGTCAGAATGCAAGAGGTGAGGCAAAACGAACGAGA
TGCTTCAACCAACAGGGCTTGAAACATGGGGATTTTTTTGTTTTTGTTTTTTGAGTACAACATGTGGGTAGGGGTGGGGGATTCTTGATTGGAGATATATGTCAATTACT
ATTGAGTTATGTTTGCTTTGGTGAAATATGAGATTTTTTGCGTGAAACGTAGGATCTTTTCTCCCCTTTTATCTCTGACACGAAGTTGAATAGTTTAATGAATGTCGTTT
TTGCTGTGAAGTTC
Protein sequenceShow/hide protein sequence
MSSSREQSPDWMRSFQVPTGVALSSNSESSNNVSSFMDNAIDQKDLSSHKTTQDLDGDQIQGDRGHHNLVKEMKLEEHAGHGDSKHSVWMLSSDSELCSDNSLIKEDYSH
HEELFESKTSQFLGRRKDENTDREFTDGKSKSRKVSDKKSPKKEVKSQVRTLTKEKIINFHTNKEGFVLEGSECCVRNGGDVEIIGKDALDDCNGPPVSSSRLPLVLSDK
VHRLKALVECEGTSIDLSGDVGAVGRVVVSDSSFAKNELCLDLKGTIYRAAIVPSRTFCIMECIMNDFIQLKAESNIDEAETMVEGTLDGFSFDSEDEAEKITKVSSSPT
DQNEAVEGLSKKSKNKAEKSSGRKRVRTGGKLQAPKKARKKVQGFKTKNGKSKK