CuGenDBv2

MS022360 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS022360
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: homeobox protein knotted-1-like LET6
Genome location: scaffold47:2787059..2789826
RNA-Seq Expression: MS022360
Synteny: MS022360
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR005539 - ELK domain
IPR005540 - KNOX1
IPR005541 - KNOX2
IPR008422 - Homeobox KN domain
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6589145.1 Homeobox protein SBH1, partial [Cucurbita argyrosperma subsp. sororia]; e value: 4.7e-132; % identity: 86.33
Query:  HNNSNHNLQN-------------SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSK
        HNN+N +  N             +CRAK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+CR  ARGDDP LDQFMEAYCEMLSKYEQEL+K
Subjt:  HNNSNHNLQN-------------SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSK

Query:  PFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWW
        PFKEAM+FFSRIESQLKALA SSS+SDG ELVGQNECSKE+E VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWW
Subjt:  PFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWW

Query:  SRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        SRHYKWPYPSE+QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+IC NPFSMDCSSTLF
Subjt:  SRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

XP_004144513.1 homeobox protein knotted-1-like LET6 [Cucumis sativus]; e value: 1.2e-138; % identity: 76.47
Query:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHNL
        ME GGSSGSS   SFMA      NNN+S++  S M+MM + ++NN Q  D      NK+F+PLSWS+++          T   +SAF+PQP  NNSN ++
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHNL

Query:  ---QNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA
            ++ +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCRA   G+DPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLKA
Subjt:  ---QNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA

Query:  LAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAE
         AVS   SDG ELVGQNECSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSE+QKVALAE
Subjt:  LAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSST
        STGLDLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVIC NPFSMDCSS+
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSST

XP_008455471.1 PREDICTED: homeobox protein knotted-1-like LET6 [Cucumis melo]; e value: 6.4e-137; % identity: 75.21
Query:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHN-
        ME GGSSGSS   SFMA      NNN+S++  S M+MM + +NNN Q  D      NK+F+PLSWS+++          T   +S F+PQP  NNSN + 
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHN-

Query:  --LQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA
            ++ +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCRA   G+DPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLKA
Subjt:  --LQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA

Query:  LAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAE
         AVS   SDG +LV QN+CSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSE+QKVALAE
Subjt:  LAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        STGLDLKQINNWFINQRKRHWKP+EDMQFVVMD  HPHYYLDNVIC NPFSMDCSS+ F
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

XP_022136261.1 homeobox protein SBH1-like [Momordica charantia]; e value: 1.3e-187; % identity: 98.85
Query:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILD-NNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIM
        MERGGSSGSSGGSSFMAGFGE NNNNSSSSGISSMVMMSSSSNNNSQILD NNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIM
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILD-NNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIM

Query:  AHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCE
        AHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCE
Subjt:  AHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCE

Query:  LVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNW
        LVGQ+ECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNW
Subjt:  LVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNW

Query:  FINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        FINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
Subjt:  FINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

XP_038887406.1 homeobox protein knotted-1-like LET6 [Benincasa hispida]; e value: 2.0e-143; % identity: 79.04
Query:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATST-------AQRVSAFVPQPPHNNSNHNLQNS
        ME GGSSGSS   SFMA         ++++  + M+MM +S  NN Q LD+NN  NNK+F+PLSWS++++        QRVSAFVP P  NN++    ++
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATST-------AQRVSAFVPQPPHNNSNHNLQNS

Query:  CRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSS
         +AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVA GSCRA  RGDDPALDQFMEAYCEML+KYEQEL+KPF+EAMLFFSRIESQLKALAV   
Subjt:  CRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSS

Query:  TSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDL
         SDG ELV QNECSKE+EVDMN+NYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSE+QKVALAESTGLDL
Subjt:  TSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDL

Query:  KQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        KQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC NPFSMDCSSTLF
Subjt:  KQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0K5G5 Uncharacterized protein; e value: 5.6e-139; % identity: 76.47
Query:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHNL
        ME GGSSGSS   SFMA      NNN+S++  S M+MM + ++NN Q  D      NK+F+PLSWS+++          T   +SAF+PQP  NNSN ++
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHNL

Query:  ---QNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA
            ++ +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCRA   G+DPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLKA
Subjt:  ---QNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA

Query:  LAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAE
         AVS   SDG ELVGQNECSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSE+QKVALAE
Subjt:  LAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSST
        STGLDLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVIC NPFSMDCSS+
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSST

A0A1S3C0Z5 homeobox protein knotted-1-like LET6; e value: 3.1e-137; % identity: 75.21
Query:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHN-
        ME GGSSGSS   SFMA      NNN+S++  S M+MM + +NNN Q  D      NK+F+PLSWS+++          T   +S F+PQP  NNSN + 
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATS----------TAQRVSAFVPQPPHNNSNHN-

Query:  --LQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA
            ++ +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCRA   G+DPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLKA
Subjt:  --LQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA

Query:  LAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAE
         AVS   SDG +LV QN+CSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSE+QKVALAE
Subjt:  LAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        STGLDLKQINNWFINQRKRHWKP+EDMQFVVMD  HPHYYLDNVIC NPFSMDCSS+ F
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

A0A6J1C313 homeobox protein SBH1-like; e value: 6.5e-188; % identity: 98.85
Query:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILD-NNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIM
        MERGGSSGSSGGSSFMAGFGE NNNNSSSSGISSMVMMSSSSNNNSQILD NNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIM
Subjt:  MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILD-NNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIM

Query:  AHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCE
        AHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCE
Subjt:  AHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCE

Query:  LVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNW
        LVGQ+ECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNW
Subjt:  LVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNW

Query:  FINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        FINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
Subjt:  FINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

A0A6J1ERR4 homeobox protein knotted-1-like LET6; e value: 1.5e-131; % identity: 81.33
Query:  ILDNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDP
        + D+NN+ NN           S +     F+P   H         SCR K+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+CR  ARGDDP
Subjt:  ILDNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDP

Query:  ALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQE
         LDQFMEAYCEMLSKYEQEL+KPFKEAM+FFSRIESQLKALA SSS+SDG ELVGQNECSKE+E VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQE
Subjt:  ALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQE

Query:  FLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        FLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSE+QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+IC NPFSMDCSSTLF
Subjt:  FLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

A0A6J1JKT8 homeobox protein knotted-1-like LET6; e value: 2.3e-132; % identity: 86.33
Query:  HNNSNHNLQN-------------SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSK
        HNN+N +  N             +CRAK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+CR  ARGDDP LDQFMEAYCEMLSKYEQEL+K
Subjt:  HNNSNHNLQN-------------SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSK

Query:  PFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWW
        PFKEAM+FFSRIESQLKALA SSS+SDG ELVGQNECSKE+E VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWW
Subjt:  PFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWW

Query:  SRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF
        SRHYKWPYPSE+QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+IC NPFSMDCSSTLF
Subjt:  SRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTLF

SwissProt top hits (columns: hit, e value, % identity, alignment)
O22299 Homeobox protein knotted-1-like LET6; e value: 1.7e-100; % identity: 57.8
Query:  MERGGSSGSSGGSSFMAGFGE---NNNNNSSSSGISS----------MVMM-----SSSSNNNSQILDNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPP
        ME G S  +S     M G+G+   NNNNN + +G  +          M+MM     S ++NNN++   +NNN+   LF+P   +  +         PQ  
Subjt:  MERGGSSGSSGGSSFMAGFGE---NNNNNSSSSGISS----------MVMM-----SSSSNNNSQILDNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPP

Query:  HNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATAR------GDDPALDQFMEAYCEMLSKYEQELSKPFKEAML
        +N+S+    +S ++KIMAHP + RLL AY+NCQK+GAPPEVVARLE+ CA +    R+++       G+DPALDQFMEAYCEML+KYEQELSKPFKEAM+
Subjt:  HNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATAR------GDDPALDQFMEAYCEMLSKYEQELSKPFKEAML

Query:  FFSRIESQLKALAVSSSTSDGC---ELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYK
        F SRIE Q KAL ++ ++S      E + +N  S E EVD+N ++IDPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQL+DWW RH K
Subjt:  FFSRIESQLKALAVSSSTSDGC---ELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYK

Query:  WPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTL
        WPYPSE+QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYY+DNV+ N+ F MD + +L
Subjt:  WPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCSSTL

O65034 Homeobox protein knotted-1-like 12; e value: 6.1e-74; % identity: 60.66
Query:  SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALA--V
        S +AKIMAHP +  LLAAY++CQKVGAPPEV+ RL    A             DP LDQFMEAYC ML+KY +EL++P  EAM F  R+ESQL  +A   
Subjt:  SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALA--V

Query:  SSSTSDGCELV---GQNECSKEVEVDMN----EN---YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEA
            +    L+   G++EC    E DM+    EN    IDP+AE+KELK QLL+KYSGYL SL+QEF KKKK GKLPKEARQ+LL WW  HYKWPYPSE 
Subjt:  SSSTSDGCELV---GQNECSKEVEVDMN----EN---YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEA

Query:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP
        +K+ALAESTGLD KQINNWFINQRKRHWKPSEDM FV+M+  HP
Subjt:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP

P46608 Homeobox protein SBH1; e value: 1.8e-102; % identity: 58.01
Query:  GSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMM----SSSSNNNSQILDNNNNVN-NKLFVPLSWSATSTAQ---------------------RVSAFV
        G S SS G+S++  FGENN     S G+  M MM    S  + ++     NNNNVN N LF+P   ++T T                        +  + 
Subjt:  GSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMM----SSSSNNNSQILDNNNNVN-NKLFVPLSWSATSTAQ---------------------RVSAFV

Query:  PQPPHNNSNHNLQN-----------SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA----VAGGSCRA--TARGDDPALDQFMEAYCEMLS
         +  H++ +H   N           + +AKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+ACA    +AGG   A  +  G+DPALDQFMEAYCEML+
Subjt:  PQPPHNNSNHNLQN-----------SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA----VAGGSCRA--TARGDDPALDQFMEAYCEMLS

Query:  KYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEAR
        KYEQELSKP KEAMLF  RIE Q K L +SSS     E  G    S E +VD++ N IDPQAE+++LKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEAR
Subjt:  KYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEAR

Query:  QQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCS
        QQLL+WW+RHYKWPYPSE+QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMD +HPHYY+DNV+  NPF MD S
Subjt:  QQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMDCS

Q38874 Homeobox protein SHOOT MERISTEMLESS; e value: 1.3e-100; % identity: 69.52
Query:  SNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRI
        S+ +   S +AKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+AC+ A  +  +       G+DP LDQFMEAYCEML KYEQELSKPFKEAM+F  R+
Subjt:  SNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRI

Query:  ESQLKALAVSSSTS----DGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYP
        E Q K+L++SS +S        +   N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLDWWSRHYKWPYP
Subjt:  ESQLKALAVSSSTS----DGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYP

Query:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMD-CSSTL
        SE QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HPH+Y  + +  NPF MD  SST+
Subjt:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMD-CSSTL

Q9M6D9 Homeobox protein SHOOT MERISTEMLESS; e value: 1.8e-97; % identity: 70.87
Query:  RAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAV
        +AKIMAHP + RLL AYVNCQKVGAPPEV ARLE+ C+ A  +  +     + G+DP LDQFMEAYCEML KYEQELSKPFKEAM+F   +E Q K+L++
Subjt:  RAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAV

Query:  SSSTSDG---CELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAE
        SS +S G     +   N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLDWWSRHYKWPYPSE QK+ALAE
Subjt:  SSSTSDG---CELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNNPFSMD
        STGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++ NV+  NPF +D
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNNPFSMD

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G23380.1 KNOTTED1-like homeobox gene 6; e value: 2.2e-63; % identity: 45.85
Query:  LSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQ--------ACAVAGGSCRATARGDDPALDQFME
        LS + ++ +   ++  P+   N+ N +L    +AKI  HP +PRLL AY++CQKVGAPPE+   LE+           V   SC     G DP LD+FME
Subjt:  LSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQ--------ACAVAGGSCRATARGDDPALDQFME

Query:  AYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKK
         YC++L KY+ +L++PF EA  F ++IE QL+ L     ++ G    G     +E+    +E   D   + E+++LK +LLRK+   + +LK EF KKKK
Subjt:  AYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKK

Query:  NGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
         GKLP+EARQ LLDWW+ HYKWPYP+E  K+ALA++TGLD KQINNWFINQRKRHWKPSE+M F +MD +   ++ +
Subjt:  NGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT1G23380.2 KNOTTED1-like homeobox gene 6; e value: 7.2e-62; % identity: 45.52
Query:  LSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQK--VGAPPEVVARLEQ--------ACAVAGGSCRATARGDDPALDQF
        LS + ++ +   ++  P+   N+ N +L    +AKI  HP +PRLL AY++CQK  VGAPPE+   LE+           V   SC     G DP LD+F
Subjt:  LSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAAYVNCQK--VGAPPEVVARLEQ--------ACAVAGGSCRATARGDDPALDQF

Query:  MEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKK
        ME YC++L KY+ +L++PF EA  F ++IE QL+ L     ++ G    G     +E+    +E   D   + E+++LK +LLRK+   + +LK EF KK
Subjt:  MEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKK

Query:  KKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        KK GKLP+EARQ LLDWW+ HYKWPYP+E  K+ALA++TGLD KQINNWFINQRKRHWKPSE+M F +MD +   ++ +
Subjt:  KKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT1G62360.1 KNOX/ELK homeobox transcription factor; e value: 9.2e-102; % identity: 69.52
Query:  SNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRI
        S+ +   S +AKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+AC+ A  +  +       G+DP LDQFMEAYCEML KYEQELSKPFKEAM+F  R+
Subjt:  SNHNLQNSCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAGGSCRAT----ARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRI

Query:  ESQLKALAVSSSTS----DGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYP
        E Q K+L++SS +S        +   N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLDWWSRHYKWPYP
Subjt:  ESQLKALAVSSSTS----DGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYP

Query:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMD-CSSTL
        SE QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HPH+Y  + +  NPF MD  SST+
Subjt:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNNPFSMD-CSSTL

AT1G70510.1 KNOTTED-like from Arabidopsis thaliana 2; e value: 3.0e-60; % identity: 48.98
Query:  RAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA--------VAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLK
        ++KI +HPL+PRLL  Y++CQKVGAP E+   LE+           VA  SC     G DP LD+FME YC++L KY+ +L++PF EA  F ++IE QL+
Subjt:  RAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA--------VAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLK

Query:  ALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQ--AEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVA
         L    +++      G     +E+  D +    D Q  + +++LK QLLRK+  ++ SLK EF KKKK GKLP+EARQ LLDWW+ H KWPYP+E  K++
Subjt:  ALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQ--AEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVA

Query:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        LAE TGLD KQINNWFINQRKRHWKPSE+M F +MD ++  ++ +
Subjt:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT4G08150.1 KNOTTED-like from Arabidopsis thaliana; e value: 1.6e-69; % identity: 46.91
Query:  ERGGSSGSSGGSSFMAGFGENNNNNSSSS-----GISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATSTAQRVSAFV--PQPPHNNSNHNLQN--
        +   ++ ++  S++  G+   NNNN          +SS++  ++ +   S     NNN N  +    S S  +    +   +   Q  +NN+N N+ +  
Subjt:  ERGGSSGSSGGSSFMAGFGENNNNNSSSS-----GISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATSTAQRVSAFV--PQPPHNNSNHNLQN--

Query:  SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQA-----CAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA
        + +AKI+AHP +  LL AY++CQK+GAPP+VV R+  A           +   +A   DP LDQFMEAYC+ML KY +EL++P +EAM F  RIESQL  
Subjt:  SCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQA-----CAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKA

Query:  LAVSS----STSDGCELVGQNECSKEVEVDMNEN------YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYP
        L  S     +  DG      N  S + E + N         IDP+AE++ELK  LL+KYSGYL SLKQE  KKKK GKLPKEARQ+LL WW  HYKWPYP
Subjt:  LAVSS----STSDGCELVGQNECSKEVEVDMNEN------YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYP

Query:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDA-AHPHY---YLD
        SE++KVALAESTGLD KQINNWFINQRKRHWKPSEDMQF+VMD   HPH+   Y+D
Subjt:  SEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDA-AHPHY---YLD


Sequences
CDS sequence
ATGGAGCGTGGAGGATCCAGTGGAAGCAGTGGTGGCAGTTCTTTCATGGCGGGATTTGGGGAAAACAACAACAACAACAGCAGCAGCAGTGGGATTTCTTCTATGGTGAT
GATGAGTAGTAGTAGTAATAATAATTCCCAAATATTGGACAACAACAACAACGTTAATAATAAGCTGTTTGTGCCTTTATCTTGGTCTGCTACTTCCACTGCACAAAGGG
TTTCTGCCTTTGTCCCTCAGCCTCCTCATAATAATTCAAATCATAATCTTCAAAATAGCTGCAGAGCTAAGATCATGGCTCATCCTCTCTTCCCCCGCCTCTTGGCTGCC
TATGTCAACTGTCAAAAGGTGGGTGCGCCGCCGGAAGTGGTGGCAAGGCTGGAGCAGGCGTGCGCGGTGGCCGGAGGGAGCTGCAGGGCGACGGCGCGTGGGGATGATCC
AGCACTGGATCAGTTCATGGAGGCGTACTGTGAGATGTTGAGCAAGTACGAGCAAGAGTTGAGCAAGCCCTTCAAAGAAGCCATGCTTTTCTTCTCAAGAATCGAGTCTC
AGCTCAAAGCCTTGGCAGTTTCTTCTTCTACTTCTGATGGTTGCGAGCTGGTCGGGCAAAACGAGTGTTCGAAGGAGGTTGAGGTCGATATGAATGAGAACTACATTGAC
CCGCAGGCTGAAGAGAAGGAATTGAAGGGCCAGCTTCTACGCAAATACAGCGGATACCTCGGCAGCCTGAAGCAGGAGTTTCTGAAGAAGAAGAAGAACGGAAAGTTGCC
GAAAGAAGCCCGGCAGCAGCTGCTCGACTGGTGGAGCAGGCACTACAAGTGGCCATACCCCTCGGAGGCACAAAAGGTGGCGTTGGCGGAGTCCACGGGGCTGGACCTGA
AGCAGATCAACAACTGGTTCATAAACCAGAGGAAGAGGCATTGGAAGCCATCAGAGGATATGCAGTTTGTGGTGATGGATGCAGCTCATCCACACTACTATTTGGACAAT
GTCATCTGCAATAATCCCTTCTCTATGGACTGCTCCTCTACTCTCTTC
mRNA sequence
ATGGAGCGTGGAGGATCCAGTGGAAGCAGTGGTGGCAGTTCTTTCATGGCGGGATTTGGGGAAAACAACAACAACAACAGCAGCAGCAGTGGGATTTCTTCTATGGTGAT
GATGAGTAGTAGTAGTAATAATAATTCCCAAATATTGGACAACAACAACAACGTTAATAATAAGCTGTTTGTGCCTTTATCTTGGTCTGCTACTTCCACTGCACAAAGGG
TTTCTGCCTTTGTCCCTCAGCCTCCTCATAATAATTCAAATCATAATCTTCAAAATAGCTGCAGAGCTAAGATCATGGCTCATCCTCTCTTCCCCCGCCTCTTGGCTGCC
TATGTCAACTGTCAAAAGGTGGGTGCGCCGCCGGAAGTGGTGGCAAGGCTGGAGCAGGCGTGCGCGGTGGCCGGAGGGAGCTGCAGGGCGACGGCGCGTGGGGATGATCC
AGCACTGGATCAGTTCATGGAGGCGTACTGTGAGATGTTGAGCAAGTACGAGCAAGAGTTGAGCAAGCCCTTCAAAGAAGCCATGCTTTTCTTCTCAAGAATCGAGTCTC
AGCTCAAAGCCTTGGCAGTTTCTTCTTCTACTTCTGATGGTTGCGAGCTGGTCGGGCAAAACGAGTGTTCGAAGGAGGTTGAGGTCGATATGAATGAGAACTACATTGAC
CCGCAGGCTGAAGAGAAGGAATTGAAGGGCCAGCTTCTACGCAAATACAGCGGATACCTCGGCAGCCTGAAGCAGGAGTTTCTGAAGAAGAAGAAGAACGGAAAGTTGCC
GAAAGAAGCCCGGCAGCAGCTGCTCGACTGGTGGAGCAGGCACTACAAGTGGCCATACCCCTCGGAGGCACAAAAGGTGGCGTTGGCGGAGTCCACGGGGCTGGACCTGA
AGCAGATCAACAACTGGTTCATAAACCAGAGGAAGAGGCATTGGAAGCCATCAGAGGATATGCAGTTTGTGGTGATGGATGCAGCTCATCCACACTACTATTTGGACAAT
GTCATCTGCAATAATCCCTTCTCTATGGACTGCTCCTCTACTCTCTTC
Protein sequence
MERGGSSGSSGGSSFMAGFGENNNNNSSSSGISSMVMMSSSSNNNSQILDNNNNVNNKLFVPLSWSATSTAQRVSAFVPQPPHNNSNHNLQNSCRAKIMAHPLFPRLLAA
YVNCQKVGAPPEVVARLEQACAVAGGSCRATARGDDPALDQFMEAYCEMLSKYEQELSKPFKEAMLFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVEVDMNENYID
PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSEAQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN
VICNNPFSMDCSSTLF