CuGenDBv2

CmUC02G028010 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC02G028010
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: homeobox protein knotted-1-like LET6
Genome location: CmU531Chr02:2094866..2099239
RNA-Seq Expression: CmUC02G028010
Synteny: CmUC02G028010
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR005539 - ELK domain
IPR005540 - KNOX1
IPR005541 - KNOX2
IPR008422 - Homeobox KN domain
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site
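
The genome location field above uses a `sequence:start..end` notation. As a minimal sketch, it can be split into its parts and the span length computed as follows. The names `Location` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed location (hypothetical helper type, not a CuGenDB API)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("CmU531Chr02:2094866..2099239")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 4374 bp on CmU531Chr02.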


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589145.1 Homeobox protein SBH1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.3e-129 | % identity: 83.39
Query:  SSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKY
        ++NN++ +     F+P ++          T +AK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+CR   RGDDP LDQFMEAYCEML+KY
Subjt:  SSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKY

Query:  EQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ
        EQELTKPFKEAM+FFSRIESQLK LA    SSDG+ELVGQNECSKEIE VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ
Subjt:  EQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ

Query:  QLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        QLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICNPFSMDCSSTLF
Subjt:  QLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF

XP_004144513.1 homeobox protein knotted-1-like LET6 [Cucumis sativus] | e value: 5.8e-159 | % identity: 86.65
Query:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINNT-NNPQTLDNNNNNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNNND---L
        MEGGGSSGSSCSFMATCNS TNNNN+N  + MMMMIN T NNPQT D        NKMFLPLSWSS++S    NNN QTQ  +SAF+PQ +TNN++   +
Subjt:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINNT-NNPQTLDNNNNNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNNND---L

Query:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL
          TSTSKAKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVATGSCRAAG G+DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLK  
Subjt:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL

Query:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
        AVSSDGFELVGQNECSKEIEVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
Subjt:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL

Query:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST
        DLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVICNPFSMDCSS+
Subjt:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST

XP_008455471.1 PREDICTED: homeobox protein knotted-1-like LET6 [Cucumis melo] | e value: 2.5e-157 | % identity: 85.31
Query:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINNT-NNPQTLDNNNNNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNN---NDL
        MEGGGSSGSSCSFMATCNS TNNNN+N  + MMMMIN T NNPQT D        NKMFLPLSWSS++S    NNN QTQ  +S F+PQ +TNN   +D 
Subjt:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINNT-NNPQTLDNNNNNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNN---NDL

Query:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL
          TSTSKAKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVATGSCRAAG G+DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLK  
Subjt:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL

Query:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
        AVSSDGF+LV QN+CSKEIEVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
Subjt:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL

Query:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        DLKQINNWFINQRKRHWKP+EDMQFVVMD  HPHYYLDNVICNPFSMDCSS+ F
Subjt:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF

XP_022136261.1 homeobox protein SBH1-like [Momordica charantia] | e value: 2.5e-146 | % identity: 80.91
Query:  MEGGGSSGSS--CSFMATCNSTNNNNTN---TPMMMMINNTNNPQTLDNNNNNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSK
        ME GGSSGSS   SFMA     NNN+++   + M+MM +++NN   + +NNNNN NNK+F+PLSWS+TS+      QRVSAFVPQ   NN++    ++ +
Subjt:  MEGGGSSGSS--CSFMATCNSTNNNNTN---TPMMMMINNTNNPQTLDNNNNNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSK

Query:  AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVS---S
        AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVA GSCRA  RGDDPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLK LAVS   S
Subjt:  AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVS---S

Query:  DGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQ
        DG ELVGQ+ECSKE+EVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAESTGLDLKQ
Subjt:  DGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQ

Query:  INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF
        INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC NPFSMDCSSTLF
Subjt:  INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF

XP_038887406.1 homeobox protein knotted-1-like LET6 [Benincasa hispida] | e value: 1.1e-173 | % identity: 92.53
Query:  MEGGGSSGSSCSFMATCNS----TNNNNTNTPMMMMINNTNNPQTLDNNNNNNNNNKMFLPLSWSSTSS--NNNAQTQRVSAFVPQSNTNNNDLLTTSTS
        MEGGGSSGSSCSFMATCNS    TNNNN+NTPMMMM  N+NNPQTLD    +NNNNKMFLPLSWSS++S  NNNAQTQRVSAFVP  +TNNND+LTTSTS
Subjt:  MEGGGSSGSSCSFMATCNS----TNNNNTNTPMMMMINNTNNPQTLDNNNNNNNNNKMFLPLSWSSTSS--NNNAQTQRVSAFVPQSNTNNNDLLTTSTS

Query:  KAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSSDG
        KAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPF+EAMLFFSRIESQLK LAV SDG
Subjt:  KAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSSDG

Query:  FELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQIN
        FELV QNECSKEIEVDMN+NYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQIN
Subjt:  FELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQIN

Query:  NWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        NWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
Subjt:  NWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
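
In the BLAST-style pairwise blocks above, the unlabeled middle line marks each alignment column: a letter for an identical residue, `+` for a conservative (positively scoring) substitution, and a space for a mismatch or gap. A rough sketch of how per-block identity and positive percentages could be derived from that middle line is shown below. The function name `midline_stats` is hypothetical, and the result covers only the block passed in, so it will not in general match the whole-alignment % identity reported for a hit.

```python
def midline_stats(query: str, midline: str) -> tuple[float, float]:
    """Return (% identical, % positive) columns for one alignment block.

    `query` and `midline` must already have the 'Query:  ' label and the
    matching leading spaces removed, so that column 0 lines up in both.
    """
    width = len(query)
    # Trailing spaces in the midline are often lost in copy/paste; pad them back.
    midline = midline.ljust(width)[:width]
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return 100.0 * identical / width, 100.0 * positive / width


# One block copied from the Cucumis sativus hit above.
query = "DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST"
midline = "DLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVICNPFSMDCSS+"
identity, positives = midline_stats(query, midline)
print(f"identity {identity:.2f}%, positives {positives:.2f}%")
```

Gap columns in the query line count toward the block width here, which is one of several possible conventions.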

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K5G5 Uncharacterized protein | e value: 2.8e-159 | % identity: 86.65
Query:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINNT-NNPQTLDNNNNNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNNND---L
        MEGGGSSGSSCSFMATCNS TNNNN+N  + MMMMIN T NNPQT D        NKMFLPLSWSS++S    NNN QTQ  +SAF+PQ +TNN++   +
Subjt:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINNT-NNPQTLDNNNNNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNNND---L

Query:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL
          TSTSKAKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVATGSCRAAG G+DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLK  
Subjt:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL

Query:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
        AVSSDGFELVGQNECSKEIEVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
Subjt:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL

Query:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST
        DLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVICNPFSMDCSS+
Subjt:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST

A0A1S3C0Z5 homeobox protein knotted-1-like LET6 | e value: 1.2e-157 | % identity: 85.31
Query:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINNT-NNPQTLDNNNNNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNN---NDL
        MEGGGSSGSSCSFMATCNS TNNNN+N  + MMMMIN T NNPQT D        NKMFLPLSWSS++S    NNN QTQ  +S F+PQ +TNN   +D 
Subjt:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINNT-NNPQTLDNNNNNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNN---NDL

Query:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL
          TSTSKAKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVATGSCRAAG G+DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLK  
Subjt:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL

Query:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
        AVSSDGF+LV QN+CSKEIEVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
Subjt:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL

Query:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        DLKQINNWFINQRKRHWKP+EDMQFVVMD  HPHYYLDNVICNPFSMDCSS+ F
Subjt:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF

A0A6J1C313 homeobox protein SBH1-like | e value: 1.2e-146 | % identity: 80.91
Query:  MEGGGSSGSS--CSFMATCNSTNNNNTN---TPMMMMINNTNNPQTLDNNNNNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSK
        ME GGSSGSS   SFMA     NNN+++   + M+MM +++NN   + +NNNNN NNK+F+PLSWS+TS+      QRVSAFVPQ   NN++    ++ +
Subjt:  MEGGGSSGSS--CSFMATCNSTNNNNTN---TPMMMMINNTNNPQTLDNNNNNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSK

Query:  AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVS---S
        AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVA GSCRA  RGDDPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLK LAVS   S
Subjt:  AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVS---S

Query:  DGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQ
        DG ELVGQ+ECSKE+EVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAESTGLDLKQ
Subjt:  DGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQ

Query:  INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF
        INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC NPFSMDCSSTLF
Subjt:  INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF

A0A6J1ERR4 homeobox protein knotted-1-like LET6 | e value: 8.9e-129 | % identity: 82.11
Query:  STSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLT
        +  +NN++ +     F+P ++          + + K+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+CR   RGDDP LDQFMEAYCEML+
Subjt:  STSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLT

Query:  KYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEA
        KYEQELTKPFKEAM+FFSRIESQLK LA    SSDG+ELVGQNECSKEIE VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEA
Subjt:  KYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEA

Query:  RQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        RQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICNPFSMDCSSTLF
Subjt:  RQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF

A0A6J1JKT8 homeobox protein knotted-1-like LET6 | e value: 1.4e-129 | % identity: 80.46
Query:  DNNNNN---NNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCR
        D+NNNN   +NN KMFLP        N++ QT R                      AK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+CR
Subjt:  DNNNNN---NNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCR

Query:  AAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKELKGQLLRKYSG
           RGDDP LDQFMEAYCEML+KYEQELTKPFKEAM+FFSRIESQLK LA    SSDG+ELVGQNECSKEIE VDMNENYIDPQAEEKELKGQLLRKYSG
Subjt:  AAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKELKGQLLRKYSG

Query:  YLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSM
        YLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICNPFSM
Subjt:  YLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSM

Query:  DCSSTLF
        DCSSTLF
Subjt:  DCSSTLF

SwissProt top hits (e value, % identity, alignment)
O22299 Homeobox protein knotted-1-like LET6 | e value: 3.6e-103 | % identity: 59.14
Query:  MEGG--GSSGSSCSFMATCNSTNNNNTNT----------------PMMMMINNTNNPQTLDNNNN--NNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQ
        MEGG  G++ +SC  M       NNN N                 PMMMM+     P +L NNNN   +NNN +FLP  +   ++NNN          PQ
Subjt:  MEGG--GSSGSSCSFMATCNSTNNNNTNT----------------PMMMMINNTNNPQTLDNNNN--NNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQ

Query:  SNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGR------GDDPALDQFMEAYCEMLTKYEQELTKPFKEA
         + N+    ++S+ K+KIMAHP + RLL AY+NCQK+GAPPEVVARLE+ CA +    R++        G+DPALDQFMEAYCEMLTKYEQEL+KPFKEA
Subjt:  SNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGR------GDDPALDQFMEAYCEMLTKYEQELTKPFKEA

Query:  MLFFSRIESQLK--TLAVSSDGFELVGQ---NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHY
        M+F SRIE Q K  TLA +S     +G+      S + EVD+N ++IDPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQL+DWW RH 
Subjt:  MLFFSRIESQLK--TLAVSSDGFELVGQ---NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHY

Query:  KWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTL
        KWPYPSESQK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYY+DNV+ N F MD + +L
Subjt:  KWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTL

O65034 Homeobox protein knotted-1-like 12 | e value: 1.7e-76 | % identity: 60.62
Query:  KAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGD--DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSS
        KAKIMAHP +  LLAAY++CQKVGAPPEV+ RL      A    R  GR D  DP LDQFMEAYC ML KY +ELT+P  EAM F  R+ESQL T+A  +
Subjt:  KAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGD--DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSS

Query:  DG--------FELVGQNECSKEIEVDMN----EN---YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSES
         G            G++EC    E DM+    EN    IDP+AE+KELK QLL+KYSGYL SL+QEF KKKK GKLPKEARQ+LL WW  HYKWPYPSE+
Subjt:  DG--------FELVGQNECSKEIEVDMN----EN---YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSES

Query:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMD
        +K+ALAESTGLD KQINNWFINQRKRHWKPSEDM FV+M+  HP       +  PF  D
Subjt:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMD

P46608 Homeobox protein SBH1 | e value: 1.1e-104 | % identity: 63.64
Query:  SSCSFMATCNSTNNNNTNTPMMMMINNTNNPQTLDNNNNNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTS---KAKIMAHPLFP
        ++C F+  C    +N+T TP +M+ NN NN +T D++NNNN        L +    S+++             N NNN   ++S+S   KAKIMAHP + 
Subjt:  SSCSFMATCNSTNNNNTNTPMMMMINNTNNPQTLDNNNNNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTS---KAKIMAHPLFP

Query:  RLLAAYVNCQKVGAPPEVVARLEQACAVA---TGSCRAAGR---GDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSSDGF--EL
        RLLAAYVNCQKVGAPPEVVARLE+ACA A    G   AAG    G+DPALDQFMEAYCEMLTKYEQEL+KP KEAMLF  RIE Q K L +SS  F    
Subjt:  RLLAAYVNCQKVGAPPEVVARLEQACAVA---TGSCRAAGR---GDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSSDGF--EL

Query:  VGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWF
         G    S E +VD++ N IDPQAE+++LKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLL+WW+RHYKWPYPSESQK+ALAESTGLD KQINNWF
Subjt:  VGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWF

Query:  INQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCS
        INQRKRHWKPSEDMQFVVMD +HPHYY+DNV+ NPF MD S
Subjt:  INQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCS

Q38874 Homeobox protein SHOOT MERISTEMLESS | e value: 1.4e-102 | % identity: 71.7
Query:  TTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAG----RGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQL
        ++++ KAKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+AC+ A  +  + G     G+DP LDQFMEAYCEML KYEQEL+KPFKEAM+F  R+E Q 
Subjt:  TTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAG----RGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQL

Query:  KTLAVSS-DGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQ
        K+L++SS   F   G+      N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLDWWSRHYKWPYPSE Q
Subjt:  KTLAVSS-DGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQ

Query:  KVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL
        K+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++DNV+ NPF MD  SST+
Subjt:  KVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL

Q9M6D9 Homeobox protein SHOOT MERISTEMLESS | e value: 1.2e-98 | % identity: 69.62
Query:  LLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAG----RGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIES
        +L++   KAKIMAHP + RLL AYVNCQKVGAPPEV ARLE+ C+ A  +  + G     G+DP LDQFMEAYCEML KYEQEL+KPFKEAM+F   +E 
Subjt:  LLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAG----RGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIES

Query:  QLKTLAVSSDGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSES
        Q K+L++SS      G+      N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLDWWSRHYKWPYPSE 
Subjt:  QLKTLAVSSDGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSES

Query:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD
        QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++ NV+ NPF +D
Subjt:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD

Arabidopsis top hits (e value, % identity, alignment)
AT1G23380.1 KNOTTED1-like homeobox gene 6 | e value: 1.4e-62 | % identity: 46.69
Query:  NAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQ--------ACAVATGSCRAAGRGDDPALDQFMEAYCEM
        +A +   ++  P+    N+D ++ +  KAKI  HP +PRLL AY++CQKVGAPPE+   LE+           V   SC     G DP LD+FME YC++
Subjt:  NAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQ--------ACAVATGSCRAAGRGDDPALDQFMEAYCEM

Query:  LTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLP
        L KY+ +L +PF EA  F ++IE QL+ L     S+ G    G     +E+    +E   D   + E+++LK +LLRK+   + +LK EF KKKK GKLP
Subjt:  LTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLP

Query:  KEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        +EARQ LLDWW+ HYKWPYP+E  K+ALA++TGLD KQINNWFINQRKRHWKPSE+M F +MD +   ++ +
Subjt:  KEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT1G23380.2 KNOTTED1-like homeobox gene 6 | e value: 4.6e-61 | % identity: 46.35
Query:  NAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQK--VGAPPEVVARLEQ--------ACAVATGSCRAAGRGDDPALDQFMEAYC
        +A +   ++  P+    N+D ++ +  KAKI  HP +PRLL AY++CQK  VGAPPE+   LE+           V   SC     G DP LD+FME YC
Subjt:  NAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQK--VGAPPEVVARLEQ--------ACAVATGSCRAAGRGDDPALDQFMEAYC

Query:  EMLTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGK
        ++L KY+ +L +PF EA  F ++IE QL+ L     S+ G    G     +E+    +E   D   + E+++LK +LLRK+   + +LK EF KKKK GK
Subjt:  EMLTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGK

Query:  LPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        LP+EARQ LLDWW+ HYKWPYP+E  K+ALA++TGLD KQINNWFINQRKRHWKPSE+M F +MD +   ++ +
Subjt:  LPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT1G62360.1 KNOX/ELK homeobox transcription factor | e value: 9.8e-104 | % identity: 71.7
Query:  TTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAG----RGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQL
        ++++ KAKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+AC+ A  +  + G     G+DP LDQFMEAYCEML KYEQEL+KPFKEAM+F  R+E Q 
Subjt:  TTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRAAG----RGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQL

Query:  KTLAVSS-DGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQ
        K+L++SS   F   G+      N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLDWWSRHYKWPYPSE Q
Subjt:  KTLAVSS-DGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQ

Query:  KVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL
        K+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++DNV+ NPF MD  SST+
Subjt:  KVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL

AT1G70510.1 KNOTTED-like from Arabidopsis thaliana 2 | e value: 2.1e-61 | % identity: 49.22
Query:  DLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA--------VATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFF
        D  + S  K+KI +HPL+PRLL  Y++CQKVGAP E+   LE+           VA  SC     G DP LD+FME YC++L KY+ +L +PF EA  F 
Subjt:  DLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA--------VATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFF

Query:  SRIESQLKTL--------AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYK
        ++IE QL+ L        A+S DG   V  +E  +E + D+  +    ++ +++LK QLLRK+  ++ SLK EF KKKK GKLP+EARQ LLDWW+ H K
Subjt:  SRIESQLKTL--------AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYK

Query:  WPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        WPYP+E  K++LAE TGLD KQINNWFINQRKRHWKPSE+M F +MD ++  ++ +
Subjt:  WPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT4G08150.1 KNOTTED-like from Arabidopsis thaliana | e value: 8.1e-74 | % identity: 49.32
Query:  SSGSSCSFMATCNSTNNNNTNTPMMMMINNTNN-----------------PQTLDN-----NNNNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNT
        SS +     +  N+ NNNN ++      NNTNN                 PQT +N     ++  NNNN   +    SS+  N+ +   R      ++N 
Subjt:  SSGSSCSFMATCNSTNNNNTNTPMMMMINNTNN-----------------PQTLDN-----NNNNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNT

Query:  NNNDLLT-TSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARL-------EQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAM
        NNND ++     KAKI+AHP +  LL AY++CQK+GAPP+VV R+       E     +T S  A+ R  DP LDQFMEAYC+ML KY +ELT+P +EAM
Subjt:  NNNDLLT-TSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARL-------EQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAM

Query:  LFFSRIESQLKTLAVSS----DGFELVGQNECSKEIEVDMNEN------YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWW
         F  RIESQL  L  S     +  +    N  S + E + N         IDP+AE++ELK  LL+KYSGYL SLKQE  KKKK GKLPKEARQ+LL WW
Subjt:  LFFSRIESQLKTLAVSS----DGFELVGQNECSKEIEVDMNEN------YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWW

Query:  SRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDA-AHPHY---YLD
          HYKWPYPSES+KVALAESTGLD KQINNWFINQRKRHWKPSEDMQF+VMD   HPH+   Y+D
Subjt:  SRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDA-AHPHY---YLD


Sequences
CDS sequence
ATGGAAGGAGGAGGATCCAGTGGAAGCAGCTGTTCATTCATGGCTACTTGTAACAGCACCAACAACAATAATACTAACACACCCATGATGATGATGATCAATAATACTAA
TAATCCTCAAACGTTGGACAACAATAACAACAACAACAACAACAACAAGATGTTCTTGCCTTTGTCTTGGTCTTCTACTTCTTCCAACAATAATGCTCAAACTCAAAGGG
TTTCTGCTTTTGTTCCTCAATCTAATACCAATAATAATGATCTTCTTACCACTTCCACTTCCAAAGCTAAAATTATGGCTCATCCTCTCTTCCCTCGCCTCCTCGCTGCC
TACGTCAACTGCCAAAAGGTGGGTGCGCCGCCGGAAGTGGTGGCGAGGCTAGAGCAGGCGTGTGCCGTGGCGACGGGAAGCTGTAGGGCGGCGGGACGTGGGGATGATCC
AGCGTTGGATCAGTTCATGGAGGCTTATTGTGAGATGTTGACCAAATATGAACAAGAGTTGACCAAACCTTTTAAAGAAGCAATGCTCTTCTTCTCAAGAATTGAGTCTC
AGCTGAAAACCCTAGCAGTTTCTTCTGATGGTTTCGAGTTGGTTGGGCAAAACGAGTGCTCGAAGGAGATTGAGGTGGATATGAACGAAAACTACATAGACCCTCAAGCC
GAAGAGAAGGAACTCAAAGGCCAACTTCTACGCAAATACAGCGGATATCTTGGGAGCCTAAAACAAGAGTTTTTGAAGAAGAAAAAGAATGGGAAGTTGCCAAAAGAAGC
TAGACAACAATTGCTCGACTGGTGGAGTCGACACTACAAATGGCCATATCCCTCGGAGTCGCAAAAGGTGGCGTTGGCGGAGTCGACGGGGCTAGACTTGAAGCAGATCA
ATAATTGGTTTATTAACCAAAGAAAGCGCCATTGGAAGCCATCAGAGGATATGCAGTTTGTGGTGATGGATGCTGCTCATCCACATTACTATTTGGACAATGTCATATGC
AATCCTTTCTCTATGGATTGTTCTTCTACTCTTTTCTGA
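
The CDS above should translate to the protein sequence listed for this gene. The following is a minimal sketch of that check using the standard genetic code; `translate` is a hypothetical helper written here for illustration (Biopython's `Seq.translate` would be a common alternative), and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in pasted sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First nine and last five codons of the CDS shown above.
cds_start = "ATGGAAGGAGGAGGATCCAGTGGAAGC"
cds_end = "TCTACTCTTTTCTGA"
print(translate(cds_start))  # MEGGGSSGS
print(translate(cds_end))    # STLF*
```

Pasting the full CDS into `translate` and comparing the result, minus the trailing `*`, with the protein sequence gives a quick consistency check between the two records.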
mRNA sequence
ATTTTACACTTTCCCCCAAAACTCACTCTTTCTTCTCTAAAAAATCCCACTCAGAGAAAACCCCAAAACAAATACTCTACCCTCATGCTCCAATGAACGAGGTTGTTCCA
TTTCATCGGGAAGTACGAACAACACTGTTTTAGTTAAGCTTAAATAGCGGGAGAAAAAGAAAGACATGGAAGGAGGAGGATCCAGTGGAAGCAGCTGTTCATTCATGGCT
ACTTGTAACAGCACCAACAACAATAATACTAACACACCCATGATGATGATGATCAATAATACTAATAATCCTCAAACGTTGGACAACAATAACAACAACAACAACAACAA
CAAGATGTTCTTGCCTTTGTCTTGGTCTTCTACTTCTTCCAACAATAATGCTCAAACTCAAAGGGTTTCTGCTTTTGTTCCTCAATCTAATACCAATAATAATGATCTTC
TTACCACTTCCACTTCCAAAGCTAAAATTATGGCTCATCCTCTCTTCCCTCGCCTCCTCGCTGCCTACGTCAACTGCCAAAAGGTGGGTGCGCCGCCGGAAGTGGTGGCG
AGGCTAGAGCAGGCGTGTGCCGTGGCGACGGGAAGCTGTAGGGCGGCGGGACGTGGGGATGATCCAGCGTTGGATCAGTTCATGGAGGCTTATTGTGAGATGTTGACCAA
ATATGAACAAGAGTTGACCAAACCTTTTAAAGAAGCAATGCTCTTCTTCTCAAGAATTGAGTCTCAGCTGAAAACCCTAGCAGTTTCTTCTGATGGTTTCGAGTTGGTTG
GGCAAAACGAGTGCTCGAAGGAGATTGAGGTGGATATGAACGAAAACTACATAGACCCTCAAGCCGAAGAGAAGGAACTCAAAGGCCAACTTCTACGCAAATACAGCGGA
TATCTTGGGAGCCTAAAACAAGAGTTTTTGAAGAAGAAAAAGAATGGGAAGTTGCCAAAAGAAGCTAGACAACAATTGCTCGACTGGTGGAGTCGACACTACAAATGGCC
ATATCCCTCGGAGTCGCAAAAGGTGGCGTTGGCGGAGTCGACGGGGCTAGACTTGAAGCAGATCAATAATTGGTTTATTAACCAAAGAAAGCGCCATTGGAAGCCATCAG
AGGATATGCAGTTTGTGGTGATGGATGCTGCTCATCCACATTACTATTTGGACAATGTCATATGCAATCCTTTCTCTATGGATTGTTCTTCTACTCTTTTCTGATGAGAT
ATGCACCATTTTAATTTAATGTTGCTTTGTTTATGTTAAATGTTATTTCCCATTTTGTTTTTAAGATTATTCTACATTTTGAACTTATAAAACCCTTTCCTCTTTTCTTC
TTAATTCTTAGTTTAGGCTCCTTGTATTATTGTATGTGTTTTTTAGTACTAATTTACCAATGGCTGCTCGCTTGCTAAAGCTGAT
Protein sequence
MEGGGSSGSSCSFMATCNSTNNNNTNTPMMMMINNTNNPQTLDNNNNNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAA
YVNCQKVGAPPEVVARLEQACAVATGSCRAAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSSDGFELVGQNECSKEIEVDMNENYIDPQA
EEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC
NPFSMDCSSTLF