; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0223 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0223
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionhomeobox protein knotted-1-like LET6
Genome locationMC04:1731981..1735467
RNA-Seq ExpressionMC04g0223
SyntenyMC04g0223
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR005539 - ELK domain
IPR005540 - KNOX1
IPR005541 - KNOX2
IPR008422 - Homeobox KN domain
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7022846.1 Homeobox protein SBH1 [Cucurbita argyrosperma subsp. argyrosperma]1.11e-16881.06Show/hide
Query:  ILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDD
        + D+NNNN NN           S +     F+P   H         +CRAK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+CR  ARGDD
Subjt:  ILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDD

Query:  PALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEV-DMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQ
        P LDQFMEAYCEMLSKYEQEL+KPFKEAM+FFSRIESQLKALA SSS+SDG ELVGQ+ECSKE+EV DMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQ
Subjt:  PALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEV-DMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQ

Query:  EFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTL
        EFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICN PFSMDCSSTL
Subjt:  EFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTL

Query:  F
        F
Subjt:  F

XP_004144513.1 homeobox protein knotted-1-like LET6 [Cucumis sativus]1.45e-17675.63Show/hide
Query:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHNL
        ME GGSSGSS   SFMA      NN++S+  S M+MM + ++NN Q  DN       K+F+PLSWS+++          T   +SAF+PQP  NNSN ++
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHNL

Query:  ---QNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA
            ++ +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCRA   G+DPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLKA
Subjt:  ---QNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA

Query:  LAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAE
         AVSS   DG ELVGQ+ECSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAE
Subjt:  LAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSST
        STGLDLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVICN PFSMDCSS+
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSST

XP_008455471.1 PREDICTED: homeobox protein knotted-1-like LET6 [Cucumis melo]3.41e-17474.65Show/hide
Query:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSN-HN
        ME GGSSGSS   SFMA      NN++S+  S M+MM + +NNN Q  DN       K+F+PLSWS+++          T   +S F+PQP  NNSN H+
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSN-HN

Query:  LQ--NSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA
            ++ +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCRA   G+DPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLKA
Subjt:  LQ--NSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA

Query:  LAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAE
         AVSS   DG +LV Q++CSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAE
Subjt:  LAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        STGLDLKQINNWFINQRKRHWKP+EDMQFVVMD  HPHYYLDNVICN PFSMDCSS+ F
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

XP_022136261.1 homeobox protein SBH1-like [Momordica charantia]7.55e-247100Show/hide
Query:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMA
        MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMA
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMA

Query:  HPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCEL
        HPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCEL
Subjt:  HPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCEL

Query:  VGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWF
        VGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWF
Subjt:  VGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWF

Query:  INQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        INQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
Subjt:  INQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

XP_038887406.1 homeobox protein knotted-1-like LET6 [Benincasa hispida]5.31e-18278.77Show/hide
Query:  MERGGSSGSSGGSSFMAGFGE-----NNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATST-------AQRVSAFVPQPPHNNSNH
        ME GGSSGSS   SFMA         NNNNS++     M+MM +S  NN Q LD+NNNN   K+F+PLSWS++++        QRVSAFVP P  NN++ 
Subjt:  MERGGSSGSSGGSSFMAGFGE-----NNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATST-------AQRVSAFVPQPPHNNSNH

Query:  NLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKAL
           ++ +AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVA GSCRA  RGDDPALDQFMEAYCEML+KYEQEL+KPF+EAMLFFSRIESQLKAL
Subjt:  NLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKAL

Query:  AVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAES
        AV S   DG ELV Q+ECSKE+EVDMN+NYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAES
Subjt:  AVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAES

Query:  TGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        TGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICN PFSMDCSSTLF
Subjt:  TGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

TrEMBL top hitse value%identityAlignment
A0A0A0K5G5 Uncharacterized protein7.02e-17775.63Show/hide
Query:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHNL
        ME GGSSGSS   SFMA      NN++S+  S M+MM + ++NN Q  DN       K+F+PLSWS+++          T   +SAF+PQP  NNSN ++
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHNL

Query:  ---QNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA
            ++ +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCRA   G+DPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLKA
Subjt:  ---QNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA

Query:  LAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAE
         AVSS   DG ELVGQ+ECSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAE
Subjt:  LAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSST
        STGLDLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVICN PFSMDCSS+
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSST

A0A1S3C0Z5 homeobox protein knotted-1-like LET61.65e-17474.65Show/hide
Query:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSN-HN
        ME GGSSGSS   SFMA      NN++S+  S M+MM + +NNN Q  DN       K+F+PLSWS+++          T   +S F+PQP  NNSN H+
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSN-HN

Query:  LQ--NSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA
            ++ +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCRA   G+DPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLKA
Subjt:  LQ--NSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA

Query:  LAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAE
         AVSS   DG +LV Q++CSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAE
Subjt:  LAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        STGLDLKQINNWFINQRKRHWKP+EDMQFVVMD  HPHYYLDNVICN PFSMDCSS+ F
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

A0A6J1C313 homeobox protein SBH1-like3.66e-247100Show/hide
Query:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMA
        MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMA
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMA

Query:  HPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCEL
        HPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCEL
Subjt:  HPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCEL

Query:  VGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWF
        VGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWF
Subjt:  VGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWF

Query:  INQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        INQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
Subjt:  INQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

A0A6J1ERR4 homeobox protein knotted-1-like LET61.17e-16785Show/hide
Query:  HNNSNHNLQNS---------------CRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQEL
        HNN N+N  +S               CR K+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+CR  ARGDDP LDQFMEAYCEMLSKYEQEL
Subjt:  HNNSNHNLQNS---------------CRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQEL

Query:  SKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEV-DMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLD
        +KPFKEAM+FFSRIESQLKALA SSS+SDG ELVGQ+ECSKE+EV DMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLD
Subjt:  SKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEV-DMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLD

Query:  WWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        WWSRHYKWPYPSE+QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICN PFSMDCSSTLF
Subjt:  WWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

A0A6J1JKT8 homeobox protein knotted-1-like LET62.68e-16885.61Show/hide
Query:  HNNSNHNLQNS-------------CRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSK
        HNN+N +  N+             CRAK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+CR  ARGDDP LDQFMEAYCEMLSKYEQEL+K
Subjt:  HNNSNHNLQNS-------------CRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSK

Query:  PFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEV-DMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWW
        PFKEAM+FFSRIESQLKALA SSS+SDG ELVGQ+ECSKE+EV DMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWW
Subjt:  PFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEV-DMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWW

Query:  SRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        SRHYKWPYPSE+QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICN PFSMDCSSTLF
Subjt:  SRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

SwissProt top hitse value%identityAlignment
O22299 Homeobox protein knotted-1-like LET62.5e-9956.99Show/hide
Query:  MERGGSSGSSGGSSFMAGFG--ENNNNSSSSGISS------------MVMM-----SSSSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQP
        ME G S  +S     M G+G  ENNNN++ +G  +            M+MM     S ++NNN+   + +NNN+   LF+P   +  +         PQ 
Subjt:  MERGGSSGSSGGSSFMAGFG--ENNNNSSSSGISS------------MVMM-----SSSSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQP

Query:  PHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATAR------GDDPALDQFMEAYCEMLSKYEQELSKPFKEAM
         +N+S+    +S ++KIMAHP + RLL AY+NCQK+GAPPEVVARLE+ CA +    R+++       G+DPALDQFMEAYCEML+KYEQELSKPFKEAM
Subjt:  PHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATAR------GDDPALDQFMEAYCEMLSKYEQELSKPFKEAM

Query:  LFFSRIESQLKALAVSSSTSDGCEL--VGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYK
        +F SRIE Q KAL ++ ++S    L        S + EVD+N ++IDPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQ+L+DWW RH K
Subjt:  LFFSRIESQLKALAVSSSTSDGCEL--VGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYK

Query:  WPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTL
        WPYPSE+QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYY+DNV+ N+ F MD + +L
Subjt:  WPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTL

O65034 Homeobox protein knotted-1-like 123.6e-7461.07Show/hide
Query:  SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALA--V
        S +AKIMAHP +  LLAAY++CQKVGAPPEV+ RL    A             DP LDQFMEAYC ML+KY +EL++P  EAM F  R+ESQL  +A   
Subjt:  SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALA--V

Query:  SSSTSDGCELV---GQSECSKEVEVDMN----EN---YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEA
            +    L+   G+SEC    E DM+    EN    IDP+AE+KELK QLL+KYSGYL SL+QEF KKKK GKLPKEARQ+LL WW  HYKWPYPSE 
Subjt:  SSSTSDGCELV---GQSECSKEVEVDMN----EN---YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEA

Query:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP
        +K+ALAESTGLD KQINNWFINQRKRHWKPSEDM FV+M+  HP
Subjt:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP

P46608 Homeobox protein SBH11.8e-10258.16Show/hide
Query:  GSSGSSGGSSFMAGFGENNNNSSSSGISSMVMM---SSSSNNNSQILDNNNNNVN-NKLFVPLSWSATSTAQ---------------------RVSAFVP
        G S SS G+S++  FGENN    S G+  M MM   +S    +  I  +NNNNVN N LF+P   ++T T                        +  +  
Subjt:  GSSGSSGGSSFMAGFGENNNNSSSSGISSMVMM---SSSSNNNSQILDNNNNNVN-NKLFVPLSWSATSTAQ---------------------RVSAFVP

Query:  QPPHNNSNHNLQN-----------SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA----VAGGSCRA--TARGDDPALDQFMEAYCEMLSK
        +  H++ +H   N           + +AKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+ACA    +AGG   A  +  G+DPALDQFMEAYCEML+K
Subjt:  QPPHNNSNHNLQN-----------SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA----VAGGSCRA--TARGDDPALDQFMEAYCEMLSK

Query:  YEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ
        YEQELSKP KEAMLF  RIE Q K L +SSS     E  G    S E +VD++ N IDPQAE+++LKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQ
Subjt:  YEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ

Query:  ELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCS
        +LL+WW+RHYKWPYPSE+QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMD +HPHYY+DNV+  NPF MD S
Subjt:  ELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCS

Q38874 Homeobox protein SHOOT MERISTEMLESS1.1e-9968.77Show/hide
Query:  SNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRI
        S+ +   S +AKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+AC+ A  +  +       G+DP LDQFMEAYCEML KYEQELSKPFKEAM+F  R+
Subjt:  SNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRI

Query:  ESQLKALAVSSSTS----DGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYP
        E Q K+L++SS +S        +   +  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQ+LLDWWSRHYKWPYP
Subjt:  ESQLKALAVSSSTS----DGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYP

Query:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMD-CSSTL
        SE QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HPH+Y  + +  NPF MD  SST+
Subjt:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMD-CSSTL

Q9M6D9 Homeobox protein SHOOT MERISTEMLESS1.5e-9670.08Show/hide
Query:  RAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAV
        +AKIMAHP + RLL AYVNCQKVGAPPEV ARLE+ C+ A  +  +     + G+DP LDQFMEAYCEML KYEQELSKPFKEAM+F   +E Q K+L++
Subjt:  RAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAV

Query:  SSSTSDG---CELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAE
        SS +S G     +   +  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQ+LLDWWSRHYKWPYPSE QK+ALAE
Subjt:  SSSTSDG---CELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNNPFSMD
        STGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++ NV+  NPF +D
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNNPFSMD

Arabidopsis top hitse value%identityAlignment
AT1G23380.1 KNOTTED1-like homeobox gene 62.2e-6345.85Show/hide
Query:  LSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQ--------ACAVAGGSCRATARGDDPALDQFME
        LS + ++ +   ++  P+   N+ N +L    +AKI  HP +PRLL AY++CQKVGAPPE+   LE+           V   SC     G DP LD+FME
Subjt:  LSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQ--------ACAVAGGSCRATARGDDPALDQFME

Query:  AYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKK
         YC++L KY+ +L++PF EA  F ++IE QL+ L     ++ G    G     +E+    +E   D   + E+++LK +LLRK+   + +LK EF KKKK
Subjt:  AYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKK

Query:  NGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
         GKLP+EARQ LLDWW+ HYKWPYP+E  K+ALA++TGLD KQINNWFINQRKRHWKPSE+M F +MD +   ++ +
Subjt:  NGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT1G23380.2 KNOTTED1-like homeobox gene 67.2e-6245.52Show/hide
Query:  LSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQK--VGAPPEVVARLEQ--------ACAVAGGSCRATARGDDPALDQF
        LS + ++ +   ++  P+   N+ N +L    +AKI  HP +PRLL AY++CQK  VGAPPE+   LE+           V   SC     G DP LD+F
Subjt:  LSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQK--VGAPPEVVARLEQ--------ACAVAGGSCRATARGDDPALDQF

Query:  MEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKK
        ME YC++L KY+ +L++PF EA  F ++IE QL+ L     ++ G    G     +E+    +E   D   + E+++LK +LLRK+   + +LK EF KK
Subjt:  MEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKK

Query:  KKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        KK GKLP+EARQ LLDWW+ HYKWPYP+E  K+ALA++TGLD KQINNWFINQRKRHWKPSE+M F +MD +   ++ +
Subjt:  KKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT1G62360.1 KNOX/ELK homeobox transcription factor7.8e-10168.77Show/hide
Query:  SNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRI
        S+ +   S +AKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+AC+ A  +  +       G+DP LDQFMEAYCEML KYEQELSKPFKEAM+F  R+
Subjt:  SNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRI

Query:  ESQLKALAVSSSTS----DGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYP
        E Q K+L++SS +S        +   +  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQ+LLDWWSRHYKWPYP
Subjt:  ESQLKALAVSSSTS----DGCELVGQSECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYP

Query:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMD-CSSTL
        SE QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HPH+Y  + +  NPF MD  SST+
Subjt:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMD-CSSTL

AT1G70510.1 KNOTTED-like from Arabidopsis thaliana 22.3e-6048.98Show/hide
Query:  RAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA--------VAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLK
        ++KI +HPL+PRLL  Y++CQKVGAP E+   LE+           VA  SC     G DP LD+FME YC++L KY+ +L++PF EA  F ++IE QL+
Subjt:  RAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA--------VAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLK

Query:  ALAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQ--AEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVA
         L    +++      G     +E+  D +    D Q  + +++LK QLLRK+  ++ SLK EF KKKK GKLP+EARQ LLDWW+ H KWPYP+E  K++
Subjt:  ALAVSSSTSDGCELVGQSECSKEVEVDMNENYIDPQ--AEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVA

Query:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        LAE TGLD KQINNWFINQRKRHWKPSE+M F +MD ++  ++ +
Subjt:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT4G08150.1 KNOTTED-like from Arabidopsis thaliana3.5e-6947.75Show/hide
Query:  ERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSS---SSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFV-----PQPPHNNSNHNLQN-
        +   ++ ++  S++  G+   NNN+          MSS    +  N    D++  N NN   V  S +++S     S  +      Q  +NN+N N+ + 
Subjt:  ERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSS---SSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFV-----PQPPHNNSNHNLQN-

Query:  -SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQA-----CAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLK
         + +AKI+AHP +  LL AY++CQK+GAPP+VV R+  A           +   +A   DP LDQFMEAYC+ML KY +EL++P +EAM F  RIESQL 
Subjt:  -SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQA-----CAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLK

Query:  ALAVSS----STSDG-CELVGQSECSKE----VEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYP
         L  S     +  DG  + +G S+  +E     E ++ E  IDP+AE++ELK  LL+KYSGYL SLKQE  KKKK GKLPKEARQ+LL WW  HYKWPYP
Subjt:  ALAVSS----STSDG-CELVGQSECSKE----VEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYP

Query:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDA-AHPHY---YLD
        SE++KVALAESTGLD KQINNWFINQRKRHWKPSEDMQF+VMD   HPH+   Y+D
Subjt:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDA-AHPHY---YLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCGTGGAGGATCCAGTGGAAGCAGTGGTGGCAGTTCTTTCATGGCGGGATTTGGGGAAAACAACAACAACAGCAGCAGCAGTGGGATTTCTTCTATGGTGATGAT
GAGTAGTAGTAGTAATAATAATTCCCAAATATTGGACAACAACAACAACAACGTTAATAATAAGCTGTTTGTGCCTTTATCTTGGTCTGCTACTTCCACTGCACAAAGGG
TTTCTGCCTTTGTCCCTCAGCCTCCTCATAATAATTCAAATCATAATCTTCAAAACAGCTGCAGAGCTAAGATCATGGCTCATCCTCTCTTCCCCCGCCTCTTGGCTGCC
TATGTCAACTGTCAAAAGGTGGGTGCGCCGCCGGAAGTGGTGGCAAGGCTGGAGCAGGCGTGCGCGGTGGCCGGAGGGAGCTGCAGGGCGACGGCGCGTGGGGATGATCC
AGCACTGGATCAGTTCATGGAGGCGTACTGTGAGATGTTGAGCAAGTACGAGCAAGAGTTGAGCAAGCCCTTCAAAGAAGCCATGCTTTTCTTCTCAAGAATCGAGTCTC
AGCTCAAAGCCTTGGCAGTTTCTTCTTCTACTTCTGATGGTTGCGAGCTGGTCGGGCAAAGCGAGTGTTCGAAGGAGGTCGAGGTCGATATGAATGAGAACTACATTGAC
CCGCAGGCTGAAGAGAAGGAATTGAAGGGCCAGCTTCTACGCAAATACAGCGGATACCTCGGCAGCCTGAAGCAGGAGTTTCTGAAGAAGAAGAAGAACGGAAAGTTGCC
GAAAGAAGCCCGGCAGGAGCTGCTCGACTGGTGGAGCAGGCACTACAAGTGGCCATACCCCTCGGAGGCACAAAAGGTGGCGTTGGCGGAGTCCACGGGGCTGGACCTGA
AGCAGATCAACAACTGGTTCATAAACCAGAGGAAGAGGCATTGGAAGCCATCAGAGGATATGCAGTTTGTGGTGATGGATGCAGCTCATCCACACTACTATTTGGACAAT
GTCATCTGCAATAATCCCTTCTCTATGGACTGCTCCTCTACTCTCTTCTGA
mRNA sequenceShow/hide mRNA sequence
GGAGTGTCTCCTCCAATAGCACCCAAAATACCTTTATTCCCATCAATCCCACCACATCTGATTATCCATTTTGATCCTCCCCACATCCCCATCTTCATCATCATTAATTA
AAACCCCCATTTCTCTCCTCTTTCCTTCTTACTTCTCTTCATTCTTCTTTCTTTTACTCACTTTTCCCCTCTCATGGGGTGCAACCAACGAGGTTCTTCCATATCTTCAC
TGTTTTTTCTTTCATCGGGAAGGAAACTCGCAGTCTCAGTCTCAGTCCCAGTCCCAGTCCCATAAAGAAAAAGATTTTAGGGTTCTTGTCCTCGTCGTCGTCGTTGTTAT
ATATTGTTTGTAAAAGAAAAAGAAAAAAGACAAGTACGAAAAAGGACTGTTTGAGTTAGCTTAAATAGCGGGAGAAAAGGGCGCAGTAGTAGCAGGTTTTATGGAGCGTG
GAGGATCCAGTGGAAGCAGTGGTGGCAGTTCTTTCATGGCGGGATTTGGGGAAAACAACAACAACAGCAGCAGCAGTGGGATTTCTTCTATGGTGATGATGAGTAGTAGT
AGTAATAATAATTCCCAAATATTGGACAACAACAACAACAACGTTAATAATAAGCTGTTTGTGCCTTTATCTTGGTCTGCTACTTCCACTGCACAAAGGGTTTCTGCCTT
TGTCCCTCAGCCTCCTCATAATAATTCAAATCATAATCTTCAAAACAGCTGCAGAGCTAAGATCATGGCTCATCCTCTCTTCCCCCGCCTCTTGGCTGCCTATGTCAACT
GTCAAAAGGTGGGTGCGCCGCCGGAAGTGGTGGCAAGGCTGGAGCAGGCGTGCGCGGTGGCCGGAGGGAGCTGCAGGGCGACGGCGCGTGGGGATGATCCAGCACTGGAT
CAGTTCATGGAGGCGTACTGTGAGATGTTGAGCAAGTACGAGCAAGAGTTGAGCAAGCCCTTCAAAGAAGCCATGCTTTTCTTCTCAAGAATCGAGTCTCAGCTCAAAGC
CTTGGCAGTTTCTTCTTCTACTTCTGATGGTTGCGAGCTGGTCGGGCAAAGCGAGTGTTCGAAGGAGGTCGAGGTCGATATGAATGAGAACTACATTGACCCGCAGGCTG
AAGAGAAGGAATTGAAGGGCCAGCTTCTACGCAAATACAGCGGATACCTCGGCAGCCTGAAGCAGGAGTTTCTGAAGAAGAAGAAGAACGGAAAGTTGCCGAAAGAAGCC
CGGCAGGAGCTGCTCGACTGGTGGAGCAGGCACTACAAGTGGCCATACCCCTCGGAGGCACAAAAGGTGGCGTTGGCGGAGTCCACGGGGCTGGACCTGAAGCAGATCAA
CAACTGGTTCATAAACCAGAGGAAGAGGCATTGGAAGCCATCAGAGGATATGCAGTTTGTGGTGATGGATGCAGCTCATCCACACTACTATTTGGACAATGTCATCTGCA
ATAATCCCTTCTCTATGGACTGCTCCTCTACTCTCTTCTGATCATCATATAATGCCATTAATTTTCTAAGTAATGCAAGCTTTAATGTGTTTATGTTCAAGTAATTTCCA
TTTTGTTTAAGTTTAGGCTATTTGTATTGGATTTTATGGTCTTTTTTTACCACTAATCTAGCAATGCCTCATAGCTACCCCTTTCTCCATTCCCTTTGTTTCCATAGCTT
GGTTTTCTGCTTTCCTTTTTAACAAATAGAATAGAATGATCTCACTTGTATGCTCAAACATGATTTTGCCCAATTTGGATCAGACCTTAATGTTACTACTCACTCAGAC
Protein sequenceShow/hide protein sequence
MERGGSSGSSGGSSFMAGFGENNNNSSSSGISSMVMMSSSSNNNSQILDNNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAA
YVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQSECSKEVEVDMNENYID
PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQELLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN
VICNNPFSMDCSSTLF