; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015584 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015584
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDNA-binding protein BIN4
Genome locationtig00004835:370208..379148
RNA-Seq ExpressionSgr015584
SyntenySgr015584
Gene Ontology termsGO:0042023 - DNA endoreduplication (biological process)
GO:0009330 - DNA topoisomerase complex (ATP-hydrolyzing) (cellular component)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR033246 - DNA-binding protein BIN4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147326.1 DNA-binding protein BIN4 isoform X1 [Cucumis sativus]7.2e-14278.14Show/hide
Query:  APTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSE
        APTGV LS NS SSKNGSSSMDNAIDQ+DPSSHKTTQDLDGDQI+GD GNHNLAK+VKL+ H  HE+SKHS+WMLS DSESC  N+ IKEDYS+HEEL+E
Subjt:  APTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSE

Query:  LKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLL
        L TS+ QGR KDENA R+ T+GKSK  KVS++ SPK++VKS+V TS KE I+NS  +K     EGSE +VRN GDVEI+EKDA D C GPPV+SSRLPL+
Subjt:  LKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLL

Query:  LSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGT
        LSDK HRLKALVECEG SIDLSGDMGAVGRV+VSDSSS KNELCLDLKGT+YRA IVPSRTFCIVSFGQSEAKIE+IMNDFIQL+A S VDEAETMVEGT
Subjt:  LSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGT

Query:  LDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK
        LDGFSFDSED+AEKITK A+SP DQNE V+GLN KSKNK EKSSG  RKR KTGG+LQAPKK RKK
Subjt:  LDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK

XP_022140827.1 DNA-binding protein BIN4 isoform X1 [Momordica charantia]9.4e-15081.64Show/hide
Query:  PTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSEL
        PTGV LS NSESS N SS MDNAIDQKD SSHKTTQDLDGDQI+GD G+HNL K++KLEEH  H DSKHS+WMLSSDSE C  NSLIKEDYSHHEEL E 
Subjt:  PTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSEL

Query:  KTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLLL
        KTSQF GR KDEN +R+ TDGKSK  KVSDKKSPK++VKSQVRT  KEKIIN   +K     EGSEC VRN GDVEII KDA D CNGPPV+SSRLPL+L
Subjt:  KTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLLL

Query:  SDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTL
        SDKVHRLKALVECEG SIDLSGD+GAVGRV+VSDSS  KNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAK+E IMNDFIQL+A+SN+DEAETMVEGTL
Subjt:  SDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTL

Query:  DGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK
        DGFSFDSEDEAEKITKV+SSPTDQNEAV+GL+KKSKNK EKSSG  RKR +TGGKLQAPKKARKK
Subjt:  DGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK

XP_022140828.1 DNA-binding protein BIN4 isoform X2 [Momordica charantia]1.7e-14382.08Show/hide
Query:  MDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSELKTSQFQGRGKDENANRKLT
        MDNAIDQKD SSHKTTQDLDGDQI+GD G+HNL K++KLEEH  H DSKHS+WMLSSDSE C  NSLIKEDYSHHEEL E KTSQF GR KDEN +R+ T
Subjt:  MDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSELKTSQFQGRGKDENANRKLT

Query:  DGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLLLSDKVHRLKALVECEGNSID
        DGKSK  KVSDKKSPK++VKSQVRT  KEKIIN   +K     EGSEC VRN GDVEII KDA D CNGPPV+SSRLPL+LSDKVHRLKALVECEG SID
Subjt:  DGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLLLSDKVHRLKALVECEGNSID

Query:  LSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVAS
        LSGD+GAVGRV+VSDSS  KNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAK+E IMNDFIQL+A+SN+DEAETMVEGTLDGFSFDSEDEAEKITKV+S
Subjt:  LSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVAS

Query:  SPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK
        SPTDQNEAV+GL+KKSKNK EKSSG  RKR +TGGKLQAPKKARKK
Subjt:  SPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK

XP_038894608.1 DNA-binding protein BIN4 isoform X1 [Benincasa hispida]1.8e-14578.01Show/hide
Query:  APTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCP-NSLIKEDYSHHEELSE
        AP GV LS NSESSKN SSSMDNA+DQK PSS+KTTQDLDGDQI+GD GNHNLAK+VK EEH  HE+SKHS+WMLSSDSESCP N+ IKE+YSHHEELSE
Subjt:  APTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCP-NSLIKEDYSHHEELSE

Query:  LKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLL
          TSQFQGRG+DENA  + T+GKSK  KVS+KKSPK+QVKSQV TS KEKIINS  +K     EGSE YVRN  DV+IIEKDA DGCNGPPV+SSRLPL+
Subjt:  LKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLL

Query:  LSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAK----------------IEAIMNDFIQL
        LSDKVHRLKALVECEG SIDLSGDMGAVGRV+VSDSSS KNELCLDLKGTIYRAAIVPSRTFCIV+FGQSEAK                IE+IMNDFIQL
Subjt:  LSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAK----------------IEAIMNDFIQL

Query:  EAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK
        +A S VDEAETM+EGTLDGFSFDSEDEAEKI KV SSPTDQNE V+GLN KSKNK EKSSG  RKR K GGKLQAPKK RKK
Subjt:  EAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK

XP_038894663.1 DNA-binding protein BIN4 isoform X2 [Benincasa hispida]1.4e-14881.42Show/hide
Query:  APTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCP-NSLIKEDYSHHEELSE
        AP GV LS NSESSKN SSSMDNA+DQK PSS+KTTQDLDGDQI+GD GNHNLAK+VK EEH  HE+SKHS+WMLSSDSESCP N+ IKE+YSHHEELSE
Subjt:  APTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCP-NSLIKEDYSHHEELSE

Query:  LKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLL
          TSQFQGRG+DENA  + T+GKSK  KVS+KKSPK+QVKSQV TS KEKIINS  +K     EGSE YVRN  DV+IIEKDA DGCNGPPV+SSRLPL+
Subjt:  LKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLL

Query:  LSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGT
        LSDKVHRLKALVECEG SIDLSGDMGAVGRV+VSDSSS KNELCLDLKGTIYRAAIVPSRTFCIV+FGQSEAKIE+IMNDFIQL+A S VDEAETM+EGT
Subjt:  LSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGT

Query:  LDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK
        LDGFSFDSEDEAEKI KV SSPTDQNE V+GLN KSKNK EKSSG  RKR K GGKLQAPKK RKK
Subjt:  LDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK

TrEMBL top hitse value%identityAlignment
A0A0A0LJZ5 Uncharacterized protein3.5e-14278.14Show/hide
Query:  APTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSE
        APTGV LS NS SSKNGSSSMDNAIDQ+DPSSHKTTQDLDGDQI+GD GNHNLAK+VKL+ H  HE+SKHS+WMLS DSESC  N+ IKEDYS+HEEL+E
Subjt:  APTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSE

Query:  LKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLL
        L TS+ QGR KDENA R+ T+GKSK  KVS++ SPK++VKS+V TS KE I+NS  +K     EGSE +VRN GDVEI+EKDA D C GPPV+SSRLPL+
Subjt:  LKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLL

Query:  LSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGT
        LSDK HRLKALVECEG SIDLSGDMGAVGRV+VSDSSS KNELCLDLKGT+YRA IVPSRTFCIVSFGQSEAKIE+IMNDFIQL+A S VDEAETMVEGT
Subjt:  LSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGT

Query:  LDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK
        LDGFSFDSED+AEKITK A+SP DQNE V+GLN KSKNK EKSSG  RKR KTGG+LQAPKK RKK
Subjt:  LDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK

A0A1S3CDA8 DNA-binding protein BIN4 isoform X21.0e-14178.14Show/hide
Query:  APTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSE
        APTGV LS NS SSKNGSSSMDNAIDQ+DPSSHKTTQDLDGDQI+GD GNHNLAK+ KL+    HE+S+HS+WMLSSDSESC  N+ IKED +HHEELSE
Subjt:  APTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSE

Query:  LKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLL
        L TS+ QGR KDENA R+ T+GKSK  KVS + SPK+++KSQV TS KEKIINS  +K     EGSE +VRN G+ EI+EKDA D C  PPV+SSRLPL+
Subjt:  LKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLL

Query:  LSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGT
        LSDKVHRLKALVECEG SIDLSGDMGAVGRV+VSDSSS KNELCLDLKGT+YRA IVPSRTFCIVSFGQSEAKIE+IMNDFIQL+A S VDEAETMVEGT
Subjt:  LSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGT

Query:  LDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK
        LDGFSFDSEDEAEKITKVASSP DQNE V+GLN KSKNK EKSSG  RKR K+GG+LQAPKK RKK
Subjt:  LDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK

A0A5D3BS94 DNA-binding protein BIN4 isoform X26.0e-14277.93Show/hide
Query:  VAPTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELS
        +APTGV LS NS SSKNGSSSMDNAIDQ+DPSSHKTTQDLDGDQI+GD GNHNLAK+ KL+    HE+S+HS+WMLSSDSESC  N+ IKED +HHEELS
Subjt:  VAPTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELS

Query:  ELKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPL
        EL TS+ QGR KDENA R+ T+GKSK  KVS + SPK+++KSQV TS KEKIINS  +K     EGSE +VRN G+ EI+EKDA D C  PPV+SSRLPL
Subjt:  ELKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPL

Query:  LLSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEG
        +LSDKVHRLKALVECEG SIDLSGDMGAVGRV+VSDSSS KNELCLDLKGT+YRA IVPSRTFCIVSFGQSEAKIE+IMNDFIQL+A S VDEAETMVEG
Subjt:  LLSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEG

Query:  TLDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK
        TLDGFSFDSEDEAEKITKVASSP DQNE V+GLN KSKNK EKSSG  RKR K+GG+LQAPKK RKK
Subjt:  TLDGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK

A0A6J1CGV6 DNA-binding protein BIN4 isoform X28.3e-14482.08Show/hide
Query:  MDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSELKTSQFQGRGKDENANRKLT
        MDNAIDQKD SSHKTTQDLDGDQI+GD G+HNL K++KLEEH  H DSKHS+WMLSSDSE C  NSLIKEDYSHHEEL E KTSQF GR KDEN +R+ T
Subjt:  MDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSELKTSQFQGRGKDENANRKLT

Query:  DGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLLLSDKVHRLKALVECEGNSID
        DGKSK  KVSDKKSPK++VKSQVRT  KEKIIN   +K     EGSEC VRN GDVEII KDA D CNGPPV+SSRLPL+LSDKVHRLKALVECEG SID
Subjt:  DGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLLLSDKVHRLKALVECEGNSID

Query:  LSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVAS
        LSGD+GAVGRV+VSDSS  KNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAK+E IMNDFIQL+A+SN+DEAETMVEGTLDGFSFDSEDEAEKITKV+S
Subjt:  LSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVAS

Query:  SPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK
        SPTDQNEAV+GL+KKSKNK EKSSG  RKR +TGGKLQAPKKARKK
Subjt:  SPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK

A0A6J1CI66 DNA-binding protein BIN4 isoform X14.6e-15081.64Show/hide
Query:  PTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSEL
        PTGV LS NSESS N SS MDNAIDQKD SSHKTTQDLDGDQI+GD G+HNL K++KLEEH  H DSKHS+WMLSSDSE C  NSLIKEDYSHHEEL E 
Subjt:  PTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESC-PNSLIKEDYSHHEELSEL

Query:  KTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLLL
        KTSQF GR KDEN +R+ TDGKSK  KVSDKKSPK++VKSQVRT  KEKIIN   +K     EGSEC VRN GDVEII KDA D CNGPPV+SSRLPL+L
Subjt:  KTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHK-----EGSECYVRNSGDVEIIEKDASDGCNGPPVASSRLPLLL

Query:  SDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTL
        SDKVHRLKALVECEG SIDLSGD+GAVGRV+VSDSS  KNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAK+E IMNDFIQL+A+SN+DEAETMVEGTL
Subjt:  SDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTL

Query:  DGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK
        DGFSFDSEDEAEKITKV+SSPTDQNEAV+GL+KKSKNK EKSSG  RKR +TGGKLQAPKKARKK
Subjt:  DGFSFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKK

SwissProt top hitse value%identityAlignment
Q9FLU1 DNA-binding protein BIN44.8e-4841.53Show/hide
Query:  DQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCPNSLIKEDYS------------HHEELSELKTSQFQGRGKDENANRKLT--DGKS--KL
        D   G    +N+  +    +HK  +    S+W++SSDSE  P+S IK++ +              EE   +KT + +   K ++ + + T  +G S  ++
Subjt:  DQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCPNSLIKEDYS------------HHEELSELKTSQFQGRGKDENANRKLT--DGKS--KL

Query:  MKVSDK-------------KSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGN
        +K  DK             KSPK + KS  +T  +E     +   E  +       D  I E+  +D    P   +SSRLPL+LS+KV+R K LVECEG+
Subjt:  MKVSDK-------------KSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGN

Query:  SIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITK
        SIDLSGDMGAVGRV+VSD++    ++ LDLKGTIY++ I+PSRTFC+V+ GQ+EAKIEAIMNDFIQL  QSNV EAETMVEGTL+GF+F+S+DE+ K  K
Subjt:  SIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITK

Query:  VASSPTDQN---EAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA
         A  P DQ+   E       K K K +  + + +KR +   + Q P KKAR  A
Subjt:  VASSPTDQN---EAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA

Arabidopsis top hitse value%identityAlignment
AT5G24630.1 double-stranded DNA binding2.6e-4941.93Show/hide
Query:  IEGDSGNHNLAKDVKLEEHK-AHEDSKHSLWMLSSDSESCPNSLIKEDYS------------HHEELSELKTSQFQGRGKDENANRKLT--DGKS--KLM
        +EG    +N+  +    +HK A +    S+W++SSDSE  P+S IK++ +              EE   +KT + +   K ++ + + T  +G S  +++
Subjt:  IEGDSGNHNLAKDVKLEEHK-AHEDSKHSLWMLSSDSESCPNSLIKEDYS------------HHEELSELKTSQFQGRGKDENANRKLT--DGKS--KLM

Query:  KVSDK-------------KSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGNS
        K  DK             KSPK + KS  +T  +E     +   E  +       D  I E+  +D    P   +SSRLPL+LS+KV+R K LVECEG+S
Subjt:  KVSDK-------------KSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGNS

Query:  IDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKV
        IDLSGDMGAVGRV+VSD++    ++ LDLKGTIY++ I+PSRTFC+V+ GQ+EAKIEAIMNDFIQL  QSNV EAETMVEGTL+GF+F+S+DE+ K  K 
Subjt:  IDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKV

Query:  ASSPTDQN---EAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA
        A  P DQ+   E       K K K +  + + +KR +   + Q P KKAR  A
Subjt:  ASSPTDQN---EAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA

AT5G24630.2 double-stranded DNA binding2.0e-4942.56Show/hide
Query:  IEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCPNSLIKEDYSHHEELSELKTSQFQGRGKDENANRKLT--DGKSKLMKVSDKKSPKEQVKSQ
        +EG    +N+  +    +HK  +    S+W++SSDSE  P+S IK++ +    +S  K + F     +E    K    +   K    S +K+PKE   +Q
Subjt:  IEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCPNSLIKEDYSHHEELSELKTSQFQGRGKDENANRKLT--DGKSKLMKVSDKKSPKEQVKSQ

Query:  --VRTSPK--EKIINSVPHKEGSECYVRNSG-------DVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSD
          ++T  K  +  I      +G++ ++ ++        D  I E+  +D    P   +SSRLPL+LS+KV+R K LVECEG+SIDLSGDMGAVGRV+VSD
Subjt:  --VRTSPK--EKIINSVPHKEGSECYVRNSG-------DVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSD

Query:  SSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTDQN---EAVDGL
        ++    ++ LDLKGTIY++ I+PSRTFC+V+ GQ+EAKIEAIMNDFIQL  QSNV EAETMVEGTL+GF+F+S+DE+ K  K A  P DQ+   E     
Subjt:  SSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVASSPTDQN---EAVDGL

Query:  NKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA
          K K K +  + + +KR +   + Q P KKAR  A
Subjt:  NKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA

AT5G24630.3 double-stranded DNA binding9.1e-5041.76Show/hide
Query:  IEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCPNSLIKEDYS------------HHEELSELKTSQFQGRGKDENANRKLT--DGKS--KLMK
        +EG    +N+  +    +HK  +    S+W++SSDSE  P+S IK++ +              EE   +KT + +   K ++ + + T  +G S  +++K
Subjt:  IEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCPNSLIKEDYS------------HHEELSELKTSQFQGRGKDENANRKLT--DGKS--KLMK

Query:  VSDK-------------KSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGNSI
          DK             KSPK + KS  +T  +E     +   E  +       D  I E+  +D    P   +SSRLPL+LS+KV+R K LVECEG+SI
Subjt:  VSDK-------------KSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGNSI

Query:  DLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVA
        DLSGDMGAVGRV+VSD++    ++ LDLKGTIY++ I+PSRTFC+V+ GQ+EAKIEAIMNDFIQL  QSNV EAETMVEGTL+GF+F+S+DE+ K  K A
Subjt:  DLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVA

Query:  SSPTDQN---EAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA
          P DQ+   E       K K K +  + + +KR +   + Q P KKAR  A
Subjt:  SSPTDQN---EAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA

AT5G24630.4 double-stranded DNA binding9.1e-5041.76Show/hide
Query:  IEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCPNSLIKEDYS------------HHEELSELKTSQFQGRGKDENANRKLT--DGKS--KLMK
        +EG    +N+  +    +HK  +    S+W++SSDSE  P+S IK++ +              EE   +KT + +   K ++ + + T  +G S  +++K
Subjt:  IEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCPNSLIKEDYS------------HHEELSELKTSQFQGRGKDENANRKLT--DGKS--KLMK

Query:  VSDK-------------KSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGNSI
          DK             KSPK + KS  +T  +E     +   E  +       D  I E+  +D    P   +SSRLPL+LS+KV+R K LVECEG+SI
Subjt:  VSDK-------------KSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGNSI

Query:  DLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVA
        DLSGDMGAVGRV+VSD++    ++ LDLKGTIY++ I+PSRTFC+V+ GQ+EAKIEAIMNDFIQL  QSNV EAETMVEGTL+GF+F+S+DE+ K  K A
Subjt:  DLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVA

Query:  SSPTDQN---EAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA
          P DQ+   E       K K K +  + + +KR +   + Q P KKAR  A
Subjt:  SSPTDQN---EAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA

AT5G24630.5 double-stranded DNA binding5.3e-5041.76Show/hide
Query:  IEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCPNSLIKEDYS------------HHEELSELKTSQFQGRGKDENANRKLT--DGKS--KLMK
        +EG    +N+  +    +HK  +    S+W++SSDSE  P+S IK++ +              EE   +KT + +   K ++ + + T  +G S  +++K
Subjt:  IEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSDSESCPNSLIKEDYS------------HHEELSELKTSQFQGRGKDENANRKLT--DGKS--KLMK

Query:  VSDK-------------KSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGNSI
          DK             KSPK + KS  +T  +E     +   E ++       D  I E+  +D    P   +SSRLPL+LS+KV+R K LVECEG+SI
Subjt:  VSDK-------------KSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPV-ASSRLPLLLSDKVHRLKALVECEGNSI

Query:  DLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVA
        DLSGDMGAVGRV+VSD++    ++ LDLKGTIY++ I+PSRTFC+V+ GQ+EAKIEAIMNDFIQL  QSNV EAETMVEGTL+GF+F+S+DE+ K  K A
Subjt:  DLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGFSFDSEDEAEKITKVA

Query:  SSPTDQN---EAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA
          P DQ+   E       K K K +  + + +KR +   + Q P KKAR  A
Subjt:  SSPTDQN---EAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAP-KKARKKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAAAATGAAAAACGGTAGACGAACCAGGCTTTTCTGTTTCCGGACAACTTTCCTCCGCTTTCCCTTTATTTCATCAGGTGGCTGGCGTGTGGCACCAACTGGTGT
TGTATTATCCTTTAATTCTGAATCTTCAAAGAATGGTAGCTCATCAATGGACAATGCAATTGATCAAAAGGATCCATCTTCACATAAAACCACACAAGATTTAGATGGGG
ATCAGATTGAAGGGGATAGTGGCAACCATAATCTGGCTAAGGATGTGAAACTTGAGGAACATAAGGCGCATGAAGATTCAAAGCACTCATTGTGGATGTTATCATCGGAT
TCAGAGTCATGTCCTAATAGTCTTATAAAGGAGGATTATAGTCATCATGAGGAATTATCTGAACTTAAAACTTCTCAATTCCAAGGGAGAGGAAAGGATGAAAATGCAAA
TCGCAAACTCACTGATGGAAAATCTAAATTAATGAAAGTATCGGATAAAAAGTCTCCAAAAGAACAAGTCAAATCACAAGTTCGTACTTCGCCCAAAGAGAAAATAATCA
ATTCTGTCCCACATAAAGAAGGATCTGAATGCTATGTAAGAAATAGTGGAGATGTGGAGATTATAGAAAAAGATGCATCGGATGGCTGCAACGGACCTCCTGTTGCCTCC
TCAAGGTTGCCATTGTTGCTGTCTGACAAAGTCCATCGGTTGAAGGCACTTGTTGAGTGTGAAGGAAATTCAATAGATTTGAGTGGTGACATGGGTGCTGTAGGACGAGT
TATAGTTTCAGATTCCTCATCTGTAAAAAATGAGCTTTGCCTAGATTTGAAAGGTACAATTTATAGGGCGGCAATAGTTCCTTCAAGGACATTTTGCATTGTTAGCTTTG
GTCAGTCAGAGGCAAAGATAGAGGCTATCATGAATGACTTCATACAGTTGGAGGCACAGTCCAATGTTGATGAGGCTGAAACTATGGTTGAAGGAACATTGGACGGGTTC
TCATTTGATTCCGAAGATGAGGCTGAGAAAATAACTAAAGTTGCTTCTTCTCCAACTGACCAAAATGAGGCTGTAGACGGGCTCAACAAAAAATCCAAAAATAAAGTTGA
GAAATCATCAGGGCTTGCACGGAAGCGCTTTAAAACTGGAGGAAAGCTGCAGGCACCAAAGAAAGCAAGGAAGAAAGCGGCAGCAAACCATGTTATCAAGGGAATCGGCC
TTGTGAAACCTTTTGAGTCAGAGGCCCAAGTCCGCTCGATGAAATATCATGATTCCAGAGGGCTTTGCATGGAGATTAGTCGCGCAAATCAGGAGAGTAAGTTAATGACG
ACGAATATAATTGCCACAGGAAAGGGTAGGAGAATGTTCGATGAAGAAACCAAATGTATGACCTCCCCACTGCTCGGTTCTTCAATGTGTGGAGTGGTTATTGTTTCATT
TGATGCTTTCTTCGTTCTGCATCGCATGCATCACTTCCTGTGGCATTACACGGTCCATGACACCTCGAGGCACGACAACGAGCGGCAAGTTATAGCCATCTACAATGCTG
ACATCGTAGAACTGCATCCGGTCCTTGCCCATATTCGTCCTGACCATCCTGGAGTCGATGGAATTCTAACACTTTGGCCCGACTCCAACCGGAAACCAGTTGTCGGAAGT
GGAGGTGTGCCAGCGCCAGCTAGCGTGCCGGGCCATATGGGATGCAGGCAATTGTTTGTGATGGTGAATATGCAAGAGCAAGCACAAGAGAAGGAAGAAAGGACGGCAAG
AAAGAAGCCAAAACCCATCCAAGTTGTCATGTTGGAGCCAACAGTCTCCAAGGAAACAAAACCTCCCTGCATACCACCTCTGCCACCAAAAGCTGCAAAAAACCACAACA
ATGAGTGGCAAGTGCTCCCCATGAGTCCCCTGCTTTCAAGGCAAGGGCCTCGGGTGGAGGGGAAGAAGTACTACCACTGCCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGAAAATGAAAAACGGTAGACGAACCAGGCTTTTCTGTTTCCGGACAACTTTCCTCCGCTTTCCCTTTATTTCATCAGGTGGCTGGCGTGTGGCACCAACTGGTGT
TGTATTATCCTTTAATTCTGAATCTTCAAAGAATGGTAGCTCATCAATGGACAATGCAATTGATCAAAAGGATCCATCTTCACATAAAACCACACAAGATTTAGATGGGG
ATCAGATTGAAGGGGATAGTGGCAACCATAATCTGGCTAAGGATGTGAAACTTGAGGAACATAAGGCGCATGAAGATTCAAAGCACTCATTGTGGATGTTATCATCGGAT
TCAGAGTCATGTCCTAATAGTCTTATAAAGGAGGATTATAGTCATCATGAGGAATTATCTGAACTTAAAACTTCTCAATTCCAAGGGAGAGGAAAGGATGAAAATGCAAA
TCGCAAACTCACTGATGGAAAATCTAAATTAATGAAAGTATCGGATAAAAAGTCTCCAAAAGAACAAGTCAAATCACAAGTTCGTACTTCGCCCAAAGAGAAAATAATCA
ATTCTGTCCCACATAAAGAAGGATCTGAATGCTATGTAAGAAATAGTGGAGATGTGGAGATTATAGAAAAAGATGCATCGGATGGCTGCAACGGACCTCCTGTTGCCTCC
TCAAGGTTGCCATTGTTGCTGTCTGACAAAGTCCATCGGTTGAAGGCACTTGTTGAGTGTGAAGGAAATTCAATAGATTTGAGTGGTGACATGGGTGCTGTAGGACGAGT
TATAGTTTCAGATTCCTCATCTGTAAAAAATGAGCTTTGCCTAGATTTGAAAGGTACAATTTATAGGGCGGCAATAGTTCCTTCAAGGACATTTTGCATTGTTAGCTTTG
GTCAGTCAGAGGCAAAGATAGAGGCTATCATGAATGACTTCATACAGTTGGAGGCACAGTCCAATGTTGATGAGGCTGAAACTATGGTTGAAGGAACATTGGACGGGTTC
TCATTTGATTCCGAAGATGAGGCTGAGAAAATAACTAAAGTTGCTTCTTCTCCAACTGACCAAAATGAGGCTGTAGACGGGCTCAACAAAAAATCCAAAAATAAAGTTGA
GAAATCATCAGGGCTTGCACGGAAGCGCTTTAAAACTGGAGGAAAGCTGCAGGCACCAAAGAAAGCAAGGAAGAAAGCGGCAGCAAACCATGTTATCAAGGGAATCGGCC
TTGTGAAACCTTTTGAGTCAGAGGCCCAAGTCCGCTCGATGAAATATCATGATTCCAGAGGGCTTTGCATGGAGATTAGTCGCGCAAATCAGGAGAGTAAGTTAATGACG
ACGAATATAATTGCCACAGGAAAGGGTAGGAGAATGTTCGATGAAGAAACCAAATGTATGACCTCCCCACTGCTCGGTTCTTCAATGTGTGGAGTGGTTATTGTTTCATT
TGATGCTTTCTTCGTTCTGCATCGCATGCATCACTTCCTGTGGCATTACACGGTCCATGACACCTCGAGGCACGACAACGAGCGGCAAGTTATAGCCATCTACAATGCTG
ACATCGTAGAACTGCATCCGGTCCTTGCCCATATTCGTCCTGACCATCCTGGAGTCGATGGAATTCTAACACTTTGGCCCGACTCCAACCGGAAACCAGTTGTCGGAAGT
GGAGGTGTGCCAGCGCCAGCTAGCGTGCCGGGCCATATGGGATGCAGGCAATTGTTTGTGATGGTGAATATGCAAGAGCAAGCACAAGAGAAGGAAGAAAGGACGGCAAG
AAAGAAGCCAAAACCCATCCAAGTTGTCATGTTGGAGCCAACAGTCTCCAAGGAAACAAAACCTCCCTGCATACCACCTCTGCCACCAAAAGCTGCAAAAAACCACAACA
ATGAGTGGCAAGTGCTCCCCATGAGTCCCCTGCTTTCAAGGCAAGGGCCTCGGGTGGAGGGGAAGAAGTACTACCACTGCCCCTAG
Protein sequenceShow/hide protein sequence
MVKMKNGRRTRLFCFRTTFLRFPFISSGGWRVAPTGVVLSFNSESSKNGSSSMDNAIDQKDPSSHKTTQDLDGDQIEGDSGNHNLAKDVKLEEHKAHEDSKHSLWMLSSD
SESCPNSLIKEDYSHHEELSELKTSQFQGRGKDENANRKLTDGKSKLMKVSDKKSPKEQVKSQVRTSPKEKIINSVPHKEGSECYVRNSGDVEIIEKDASDGCNGPPVAS
SRLPLLLSDKVHRLKALVECEGNSIDLSGDMGAVGRVIVSDSSSVKNELCLDLKGTIYRAAIVPSRTFCIVSFGQSEAKIEAIMNDFIQLEAQSNVDEAETMVEGTLDGF
SFDSEDEAEKITKVASSPTDQNEAVDGLNKKSKNKVEKSSGLARKRFKTGGKLQAPKKARKKAAANHVIKGIGLVKPFESEAQVRSMKYHDSRGLCMEISRANQESKLMT
TNIIATGKGRRMFDEETKCMTSPLLGSSMCGVVIVSFDAFFVLHRMHHFLWHYTVHDTSRHDNERQVIAIYNADIVELHPVLAHIRPDHPGVDGILTLWPDSNRKPVVGS
GGVPAPASVPGHMGCRQLFVMVNMQEQAQEKEERTARKKPKPIQVVMLEPTVSKETKPPCIPPLPPKAAKNHNNEWQVLPMSPLLSRQGPRVEGKKYYHCP